BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 007204
(613 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255573091|ref|XP_002527475.1| conserved hypothetical protein [Ricinus communis]
gi|223533115|gb|EEF34873.1| conserved hypothetical protein [Ricinus communis]
Length = 840
Score = 941 bits (2431), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 450/603 (74%), Positives = 513/603 (85%), Gaps = 15/603 (2%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S PLK+TFNGPAKH+TD+IPIGNGR+GAM+ GG+ SE ++LNEDTLWTGVPG+YTNP+
Sbjct: 20 SYNKPLKVTFNGPAKHWTDSIPIGNGRIGAMISGGMQSEIIQLNEDTLWTGVPGNYTNPN 79
Query: 68 APKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
A +ALS+VR LVD G YAEATAASVK FG+PADVYQLLGD++LEFDDSHL YA+ETY RE
Sbjct: 80 ALEALSEVRKLVDDGLYAEATAASVKFFGNPADVYQLLGDVKLEFDDSHLTYADETYYRE 139
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL+TATARV+YSVG+V+FT+E+F+SNPDQV V KISGS+SGSLSF VSLDS LD+H YV
Sbjct: 140 LDLDTATARVQYSVGDVKFTKEYFASNPDQVAVIKISGSKSGSLSFTVSLDSKLDHHCYV 199
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N NQIIMEG CP KRIPPK +AN++PKGI+FSA+L++ +SD G I L++KKLKVEGS
Sbjct: 200 NVENQIIMEGSCPEKRIPPKMSANENPKGIKFSAVLDLHVSDGVGVIHVLDNKKLKVEGS 259
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
DW VLLL ASSSF+ P PSDSKKDPTSES+ AL++I NLSYSDLY RHL DYQKLFHR
Sbjct: 260 DWGVLLLAASSSFESPLTKPSDSKKDPTSESLRALKAITNLSYSDLYARHLHDYQKLFHR 319
Query: 308 VSIQLSRSPKDIVTDTCSEENI---------------DTVPSAERVKSFQTDEDPSLVEL 352
VS QL +S IV D N D VP+ ER+KSFQ+DEDPSLVEL
Sbjct: 320 VSFQLWKSSNRIVGDESQLTNNLIPSANALYVKGIKDDAVPTVERIKSFQSDEDPSLVEL 379
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
LFQFGRYLLIS SRPGTQVANLQG+WN+DL PTWDSAPH+NINLEMNYW SLPCNL+ECQ
Sbjct: 380 LFQFGRYLLISCSRPGTQVANLQGVWNKDLEPTWDSAPHLNINLEMNYWLSLPCNLNECQ 439
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
EPLFDF+ LS+NGSKTAQVNY ASGWVIHHK+DIWAKSSADRG VWALWP+GGAWLCT
Sbjct: 440 EPLFDFIKSLSVNGSKTAQVNYGASGWVIHHKSDIWAKSSADRGDAVWALWPIGGAWLCT 499
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
HLWEHYNYTMD++FLE AY LLEGC SFLLDWL+EG +GYLETNPSTSPEH FI PDGK
Sbjct: 500 HLWEHYNYTMDKEFLENEAYFLLEGCVSFLLDWLVEGSEGYLETNPSTSPEHMFITPDGK 559
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
ACVSYSSTMDMAIIREVFS+ +SA+EVL +N+D LV+ V +LPRLRPTKIAEDGSIME
Sbjct: 560 PACVSYSSTMDMAIIREVFSSFVSASEVLGRNKDVLVQNVHTALPRLRPTKIAEDGSIME 619
Query: 593 WVQ 595
WV+
Sbjct: 620 WVR 622
>gi|224103693|ref|XP_002313157.1| predicted protein [Populus trichocarpa]
gi|222849565|gb|EEE87112.1| predicted protein [Populus trichocarpa]
Length = 836
Score = 918 bits (2373), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 442/610 (72%), Positives = 512/610 (83%), Gaps = 16/610 (2%)
Query: 1 MMNAEST--STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
M N ST + PLKIT GPAK++TDAIPIGNGRLGAMVWGGV SE ++LNEDTLWTG
Sbjct: 17 MWNPTSTYLEDSKPLKITSTGPAKYWTDAIPIGNGRLGAMVWGGVSSELIQLNEDTLWTG 76
Query: 59 VPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLK 118
P DYTNPDAP+AL++VR+LVDSG++AEA+ A+ KL G A+VYQLLGDI+LEFD +L
Sbjct: 77 TPIDYTNPDAPEALAEVRNLVDSGEFAEASDAAAKLSGTNANVYQLLGDIKLEFD-GYLM 135
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
AEETY RELDL+TATARVKYSVG+VEFTREHF+S PDQVIVTKI+GS+ GS+SF VSLD
Sbjct: 136 CAEETYYRELDLDTATARVKYSVGDVEFTREHFASYPDQVIVTKIAGSKEGSVSFTVSLD 195
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S LD+H Y+ +QI+MEGRCPGKRIPPK ANDDPKGI F+A+L ++ISD G +S L+
Sbjct: 196 SKLDHHCYITDESQIVMEGRCPGKRIPPKVKANDDPKGILFAAVLGLQISDGAGLMSVLD 255
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D +LKVEG++W VL +VASSSF+GPF PS+S+KDP S S+SAL+SI+N SYS+LY+RHL
Sbjct: 256 DGRLKVEGANWVVLHMVASSSFEGPFTKPSESEKDPASVSLSALKSIKNQSYSELYSRHL 315
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDT-------------CSEENIDTVPSAERVKSFQTDE 345
DDYQ LFHRVS+QL + + D C E N D VP+ +R++SFQ+DE
Sbjct: 316 DDYQNLFHRVSLQLCKGSDRNIGDRSLEIKNLMPSGKRCVEGNKDVVPTVDRIRSFQSDE 375
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN+DL P WDSAPH+NINLEMNYW SLP
Sbjct: 376 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNKDLEPKWDSAPHLNINLEMNYWPSLP 435
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
CNLSECQEPLF+F+ LSING KTAQVNY SGWV+HHK+DIWAK SAD+G+VVWA+WPM
Sbjct: 436 CNLSECQEPLFEFIKSLSINGCKTAQVNYKTSGWVVHHKSDIWAKPSADKGEVVWAIWPM 495
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
GGAWLCTHLWEHY+YTMD DFL +AYPLLEGCASFLLDWLIEGH GYLETNPSTSPEH
Sbjct: 496 GGAWLCTHLWEHYSYTMDEDFLRNKAYPLLEGCASFLLDWLIEGHGGYLETNPSTSPEHM 555
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
FIAPDGK A VSYSSTMDMA+I+EVFSAIISA+EVL +NEDA V+KV K+ PRL PTKI
Sbjct: 556 FIAPDGKSASVSYSSTMDMALIKEVFSAIISASEVLGRNEDAFVQKVHKAQPRLYPTKID 615
Query: 586 EDGSIMEWVQ 595
E+GSIMEW Q
Sbjct: 616 EEGSIMEWAQ 625
>gi|224103687|ref|XP_002313154.1| predicted protein [Populus trichocarpa]
gi|222849562|gb|EEE87109.1| predicted protein [Populus trichocarpa]
Length = 803
Score = 912 bits (2357), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 435/594 (73%), Positives = 510/594 (85%), Gaps = 7/594 (1%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M + ++ + LKITFNGPAKH+TDAIPIGNGRLGAM+WGGV ETL+LNEDTLWTG P
Sbjct: 1 MDDDDNGENSRSLKITFNGPAKHWTDAIPIGNGRLGAMIWGGVSLETLQLNEDTLWTGTP 60
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
G+YTNP AP+ALS VR LVD+GQYA+AT A+ KL P+DVYQLLGDI+LEFD+SHLKY
Sbjct: 61 GNYTNPHAPEALSVVRKLVDNGQYADATTAAEKLSHDPSDVYQLLGDIKLEFDNSHLKYV 120
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
E++Y RELDL+TATARVKYSVG+VE+TRE+F+SNP+QVI TKISGS+SGS+SF V LDS
Sbjct: 121 EKSYHRELDLDTATARVKYSVGDVEYTREYFASNPNQVIATKISGSKSGSVSFTVYLDSK 180
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ ++SYV G NQIIMEG CPGKRIPPK NA+D+PKGIQF+AIL ++IS+ RG + L+ +
Sbjct: 181 MHHYSYVKGENQIIMEGSCPGKRIPPKLNADDNPKGIQFTAILNLQISNSRGVVHVLDGR 240
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KLKVEGSDWA+LLLV+SSSFDGPF P DSKKDPTS+S+SAL+SI NLSY+DLY HLDD
Sbjct: 241 KLKVEGSDWAILLLVSSSSFDGPFTKPIDSKKDPTSDSLSALKSINNLSYTDLYAHHLDD 300
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQ LFHRVS+QLS+S K SE+N TV +AERVKSF+TDEDPSLVELLFQ+GRYL
Sbjct: 301 YQSLFHRVSLQLSKSSK-----RRSEDN--TVSTAERVKSFKTDEDPSLVELLFQYGRYL 353
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LIS SRPGTQVANLQGIWN+D+ P WD A H+NINL+MNYW +LPCNL ECQ+PLF++++
Sbjct: 354 LISCSRPGTQVANLQGIWNKDIEPPWDGAQHLNINLQMNYWPALPCNLKECQDPLFEYIS 413
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LSINGSKTA+VNY A GWV H +DIWAK+S DRG+ VWALWPMGGAWLCTHLWEHY Y
Sbjct: 414 SLSINGSKTAKVNYDAKGWVAHQVSDIWAKTSPDRGQAVWALWPMGGAWLCTHLWEHYTY 473
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
TMD+DFL+ +AYPLLEGC+ FLLDWLIEG GYLETNPSTSPEH FI PDGK A VSYSS
Sbjct: 474 TMDKDFLKNKAYPLLEGCSLFLLDWLIEGRGGYLETNPSTSPEHMFIDPDGKPASVSYSS 533
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
TMDM+II+EVFSAIISAAE+L KNED +V+KV ++ PRL PT+IA DGSIMEW
Sbjct: 534 TMDMSIIKEVFSAIISAAEILGKNEDEIVQKVREAQPRLLPTRIARDGSIMEWA 587
>gi|224056204|ref|XP_002298754.1| predicted protein [Populus trichocarpa]
gi|222846012|gb|EEE83559.1| predicted protein [Populus trichocarpa]
Length = 808
Score = 906 bits (2341), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 427/594 (71%), Positives = 505/594 (85%), Gaps = 1/594 (0%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M + ++ PL++TF+GPAKH+TDAIPIGNGRLGAM+WGGV ETL+LNEDTLWTG+PG
Sbjct: 1 MEDNNGESSKPLRVTFSGPAKHWTDAIPIGNGRLGAMIWGGVALETLQLNEDTLWTGIPG 60
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE 121
DYTNP+AP AL +VR LVD+GQYAEAT A+ KL G+ +DVYQLLGDI+LEFDDSHLKY E
Sbjct: 61 DYTNPNAPAALLEVRKLVDNGQYAEATTAAEKLSGNQSDVYQLLGDIKLEFDDSHLKYDE 120
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+TY+RELDL+TATARVKYSV ++E+TREHF+SNP+QVIVTKISGS+ GS+SF VSLDS +
Sbjct: 121 KTYKRELDLDTATARVKYSVADIEYTREHFASNPNQVIVTKISGSKPGSVSFTVSLDSKM 180
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+HSYV G NQII+EG CPG R K N ND P+GIQF+AIL++++S+ RG + ED K
Sbjct: 181 SHHSYVKGENQIIIEGSCPGNRYAQKLNENDSPQGIQFTAILDLQVSEARGLVRVSEDSK 240
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+VEGSDWAVLLLV+SSSFDGPF P DSKK+PTS+S+S L+SI NLSY DLY HLDDY
Sbjct: 241 LRVEGSDWAVLLLVSSSSFDGPFTKPIDSKKNPTSDSLSVLKSIGNLSYVDLYAHHLDDY 300
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q LFHRVS+QLS+S K+ E+ DTV +AERVK+FQTDEDPSLVELLFQ+GRYLL
Sbjct: 301 QSLFHRVSLQLSKSSKNSDISLNGSED-DTVSTAERVKAFQTDEDPSLVELLFQYGRYLL 359
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
IS SRPGTQVANLQGIWN+DL+P WD A H+NINL+MNYW SL CNL ECQEPLF++++
Sbjct: 360 ISCSRPGTQVANLQGIWNKDLTPPWDGAQHLNINLQMNYWPSLSCNLKECQEPLFEYISS 419
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
LSI+GS+TA+VNY A GWV H +D+WAK+S D G+ +WALWPMGGAWLCTHLWEHY Y
Sbjct: 420 LSISGSRTAKVNYEAKGWVAHQVSDLWAKTSPDAGQALWALWPMGGAWLCTHLWEHYTYA 479
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+DFL +AYPLLEGC SFLLDWLIEG GYLETNPSTSPEH FIAPDGK A VSYSST
Sbjct: 480 KDKDFLRDKAYPLLEGCTSFLLDWLIEGPGGYLETNPSTSPEHMFIAPDGKPASVSYSST 539
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MDM+II+EVFSAI+SAA++L +NED LV+KVL++LPRL PTKIA DGSIMEW Q
Sbjct: 540 MDMSIIKEVFSAIVSAAKILGRNEDELVQKVLEALPRLLPTKIARDGSIMEWAQ 593
>gi|359475494|ref|XP_002270199.2| PREDICTED: alpha-L-fucosidase 2-like [Vitis vinifera]
Length = 817
Score = 884 bits (2284), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 433/584 (74%), Positives = 498/584 (85%), Gaps = 9/584 (1%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F GPAKH+TDA+PIGNGRLGAMVWGGV SETL+LNE TLWTG PG+YTNPDAPKA
Sbjct: 34 PLKVRFFGPAKHWTDALPIGNGRLGAMVWGGVASETLQLNEGTLWTGTPGNYTNPDAPKA 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS+VR LVD+G Y AT A+VKL G+P+DVYQLLGDI LEF+DSHL YAEETY RELDL+
Sbjct: 94 LSEVRKLVDNGDYVAATEAAVKLSGNPSDVYQLLGDINLEFEDSHLAYAEETYSRELDLD 153
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT +KYSVG+VE+TREHF+S PDQVIVTKISGS+ GS+SF VSLDS +HS +G +
Sbjct: 154 TATVTIKYSVGDVEYTREHFASYPDQVIVTKISGSKPGSVSFTVSLDSKSHHHSNSSGKS 213
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
QIIMEG CPGKRIPPK ND+P+GI FSA+L+++ISD RG I+ L+DKKLKVEGSDWAV
Sbjct: 214 QIIMEGSCPGKRIPPKVYENDNPQGILFSAVLDLQISDGRGVINVLDDKKLKVEGSDWAV 273
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
L LVASSSFDGPF P DSK +PTSE++S L+SI N SYSDLY RHL+DYQ LFHRVS+Q
Sbjct: 274 LYLVASSSFDGPFTKPIDSKINPTSEALSTLKSIGNFSYSDLYARHLNDYQNLFHRVSLQ 333
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS+S K + ++ V +A RVKSF TDEDPSLVELLFQ+GRYLLIS SRPG+Q
Sbjct: 334 LSKSSKSV---------MNRVSTAARVKSFGTDEDPSLVELLFQYGRYLLISCSRPGSQP 384
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN+D+ P WD APH+NINL+MNYW SLPCNLSECQEPLFD+++ LSINGSKTA+
Sbjct: 385 ANLQGIWNKDIEPAWDGAPHLNINLQMNYWPSLPCNLSECQEPLFDYMSSLSINGSKTAK 444
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
VNY ASGWV H +DIWAK+S DRG+ VWALWPMGGAWLCTHLWEHY +TMD+DFL+ +A
Sbjct: 445 VNYEASGWVTHQVSDIWAKTSPDRGQAVWALWPMGGAWLCTHLWEHYTFTMDKDFLKNKA 504
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
YPLLEGCA FLLDWLIEG GYLETNPSTSPEH FIAPDGK A VSYS+TMD+AIIREVF
Sbjct: 505 YPLLEGCARFLLDWLIEGRGGYLETNPSTSPEHMFIAPDGKPASVSYSTTMDIAIIREVF 564
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
SA++SAAEVL KNED LV+KV ++ P+L PTKIA DGSIMEW Q
Sbjct: 565 SAVVSAAEVLGKNEDELVQKVRQAQPKLPPTKIARDGSIMEWAQ 608
>gi|356574288|ref|XP_003555281.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
Length = 876
Score = 867 bits (2241), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/598 (68%), Positives = 491/598 (82%), Gaps = 14/598 (2%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+TF PA H+TDAIPIGNGRLGAMVWG VPSE L+LNEDTLWTG+PGDYTN A +A
Sbjct: 65 PLKVTFAEPATHWTDAIPIGNGRLGAMVWGAVPSEALQLNEDTLWTGIPGDYTNKSAQQA 124
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L++VR LVD +++EATAA+VKL G P+DVYQLLGDI+LEF DSHL Y++E+Y RELDL+
Sbjct: 125 LAEVRKLVDDRKFSEATAAAVKLSGDPSDVYQLLGDIKLEFHDSHLNYSKESYYRELDLD 184
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TATA++KYSVG+VEFTREHF+SNPDQVIVT++S S+ GSLSF V DS + + S V+G N
Sbjct: 185 TATAKIKYSVGDVEFTREHFASNPDQVIVTRLSTSKPGSLSFTVYFDSKMHHDSRVSGQN 244
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
QII+EGRCPG RI P N+ D+P+GIQFSA+L+++IS D+G I L+DKKL+VEGSDWA+
Sbjct: 245 QIIIEGRCPGSRIRPIVNSIDNPQGIQFSAVLDMQISKDKGVIHVLDDKKLRVEGSDWAI 304
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL ASSSFDGPF P DSKKDP SES+S + S++ +SY DLY RHL DYQ LFHRVS+Q
Sbjct: 305 LLLTASSSFDGPFTKPEDSKKDPASESLSRMVSVKKISYGDLYARHLADYQNLFHRVSLQ 364
Query: 312 LSRSPKDI----VTD----TCSEENI------DTVPSAERVKSFQTDEDPSLVELLFQFG 357
LS+S K + V D S+ NI DT+P++ RVKSFQTDEDPS VELLFQ+G
Sbjct: 365 LSKSSKTVSGKSVLDRRKLVSSQTNISQMGGDDTIPTSARVKSFQTDEDPSFVELLFQYG 424
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLIS SRPGTQVANLQGIWN+D+ P WD APH+NINL+MNYW SL CNL ECQEPLFD
Sbjct: 425 RYLLISCSRPGTQVANLQGIWNKDVEPAWDGAPHLNINLQMNYWPSLACNLHECQEPLFD 484
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F++ LS+ G KTA+VNY A+GWV+H +DIW K+S DRG+ VWALWPMGGAWLCTHLWEH
Sbjct: 485 FISSLSVIGKKTAKVNYEANGWVVHQVSDIWGKTSPDRGEAVWALWPMGGAWLCTHLWEH 544
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y YTMD+ FL+ +AYPLLEGC SFLLDWLIEG G LETNPSTSPEH F APDGK A VS
Sbjct: 545 YTYTMDKVFLKNKAYPLLEGCTSFLLDWLIEGRGGLLETNPSTSPEHMFTAPDGKTASVS 604
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
YSSTMD++II+EVFS IISAAEVL ++ D ++++V + +L PTK+A DGSIMEW +
Sbjct: 605 YSSTMDISIIKEVFSMIISAAEVLGRHNDTIIKRVTEYQSKLPPTKVARDGSIMEWAE 662
>gi|356575686|ref|XP_003555969.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
Length = 874
Score = 863 bits (2229), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 408/609 (66%), Positives = 495/609 (81%), Gaps = 16/609 (2%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ N ES PLK+TF PA H+TDAIPIGNGRLGAMVWG VPSE L+LNEDTLWTG+P
Sbjct: 54 LTNGESPP--RPLKVTFAEPATHWTDAIPIGNGRLGAMVWGAVPSEALQLNEDTLWTGIP 111
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
DYTN AP+AL++VR LVD +++EATAA+VKL G P++VYQLLGDI+LEF DSHL Y+
Sbjct: 112 RDYTNSSAPQALAEVRKLVDDRKFSEATAAAVKLSGDPSEVYQLLGDIKLEFHDSHLNYS 171
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+E+Y RELDL+TATA +KYSVG+VEFTREHF+SNPDQVIVT++S S+ GSLSF V DS
Sbjct: 172 KESYYRELDLDTATANIKYSVGDVEFTREHFASNPDQVIVTRLSTSKPGSLSFTVYFDSK 231
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + S V+G NQIIMEGRCPG RIPP+ N+ D+P+GIQFSA+L+++IS D+G I L+DK
Sbjct: 232 MHHDSRVSGQNQIIMEGRCPGSRIPPRVNSIDNPQGIQFSAVLDMQISKDKGFIHVLDDK 291
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL+VEGSDWA+LLL ASSSFDGPF P DSKKDP SES+S + S++ +SY DLY RHL D
Sbjct: 292 KLRVEGSDWAILLLTASSSFDGPFTKPEDSKKDPASESLSRMVSVKKISYGDLYARHLAD 351
Query: 301 YQKLFHRVSIQLSRSPKDI----VTD----TCSEENI------DTVPSAERVKSFQTDED 346
YQ LFHRVS+QLS+S K + V D S+ NI DT+P++ RVKSFQTDED
Sbjct: 352 YQNLFHRVSLQLSKSSKTVSGKSVLDRRKLVSSQTNISQMGGDDTIPTSARVKSFQTDED 411
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
PS VELLFQ+GRYLLIS SRPGTQVANLQGIWN+D+ P W+ APH+NINL++NYW SL C
Sbjct: 412 PSFVELLFQYGRYLLISCSRPGTQVANLQGIWNKDVEPAWEGAPHLNINLQINYWPSLAC 471
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
NL ECQEPLFDF++ LS+ G KTA+V+Y A+GWV HH +DIW K+S +G+ VWA+WPMG
Sbjct: 472 NLHECQEPLFDFISSLSVIGKKTAKVSYEANGWVAHHVSDIWGKTSPGQGQAVWAVWPMG 531
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
GAWLCTHLWEHY YT+D+DFL+ +AYPLLEGC SFLLDWLIEG G LETNPSTSPEH F
Sbjct: 532 GAWLCTHLWEHYTYTLDKDFLKNKAYPLLEGCTSFLLDWLIEGRGGLLETNPSTSPEHMF 591
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
APDGK A VSYSSTMD++II+EVFS IISAAEVL ++ D ++++ + +L PTK+A
Sbjct: 592 TAPDGKTASVSYSSTMDISIIKEVFSMIISAAEVLGRHNDTIIKRATEYQSKLPPTKVAR 651
Query: 587 DGSIMEWVQ 595
DGSIMEW +
Sbjct: 652 DGSIMEWAE 660
>gi|356536151|ref|XP_003536603.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
Length = 877
Score = 862 bits (2228), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 405/598 (67%), Positives = 486/598 (81%), Gaps = 14/598 (2%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+TF PA H+TDAIPIGNGRLGAMVWG VPSE L+LNEDTLWTG+PGDYTN AP+A
Sbjct: 66 PLKVTFAEPATHWTDAIPIGNGRLGAMVWGAVPSEALQLNEDTLWTGIPGDYTNKSAPQA 125
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L++VR LV+ ++AEATAA+VKL G P+DV+QLLGDI+LEF DSHL Y++E+Y RELDL+
Sbjct: 126 LAEVRKLVNDRKFAEATAAAVKLSGEPSDVFQLLGDIKLEFHDSHLNYSKESYYRELDLD 185
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TATA++KYSVG+VEFTREHF+SNPDQVIVT++S S+ GSLSF V DS + + S V+G N
Sbjct: 186 TATAKIKYSVGDVEFTREHFASNPDQVIVTRLSASKPGSLSFTVYFDSKMHHDSRVSGQN 245
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
QI +EGRCPG RI P+ N+ D+P+GIQFSA+L+++IS D+G I L+DKKL+VEGSD A+
Sbjct: 246 QIKIEGRCPGSRIRPRVNSIDNPQGIQFSAVLDMQISKDKGVIHVLDDKKLRVEGSDSAI 305
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL ASSSFDGPF P DSKKDP SES+S + S++ SY DLY RHL DYQ LFHRVS+Q
Sbjct: 306 LLLTASSSFDGPFTKPEDSKKDPASESLSRMVSVKKFSYDDLYARHLADYQNLFHRVSLQ 365
Query: 312 LSRSPK--------------DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
LS+S K T+ + DT+P++ RVKSFQTDEDPS VELLFQ+G
Sbjct: 366 LSKSSKTGSGKSVLEGRKLVSSQTNISQKRGDDTIPTSARVKSFQTDEDPSFVELLFQYG 425
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLIS SRPGTQVANLQGIWN+D+ P WD APH+NINL+MNYW SL CNL ECQEPLFD
Sbjct: 426 RYLLISCSRPGTQVANLQGIWNKDVEPAWDGAPHLNINLQMNYWPSLACNLHECQEPLFD 485
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F++ LS+ G KTA+VNY A+GWV H +DIW K+S DRG+ VWALWPMGGAWLCTHLWEH
Sbjct: 486 FISSLSVIGKKTAKVNYEANGWVAHQVSDIWGKTSPDRGEAVWALWPMGGAWLCTHLWEH 545
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y YTMD+DFL+ +AYPLLEGC +FLLDWLIEG G LETNPSTSPEH F APDGK A VS
Sbjct: 546 YIYTMDKDFLKNKAYPLLEGCTTFLLDWLIEGRGGLLETNPSTSPEHMFTAPDGKTASVS 605
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
YSSTMD++II+EVFS IISAAEVL ++ D ++++V K +L PTK+A DGSIMEW +
Sbjct: 606 YSSTMDISIIKEVFSMIISAAEVLGRHNDTIIKRVTKYQSKLPPTKVARDGSIMEWAE 663
>gi|224056206|ref|XP_002298755.1| predicted protein [Populus trichocarpa]
gi|222846013|gb|EEE83560.1| predicted protein [Populus trichocarpa]
Length = 843
Score = 861 bits (2225), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 410/596 (68%), Positives = 496/596 (83%), Gaps = 14/596 (2%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+TF+GPAK++TD IPIGNGRLGAMVWGGV SE ++LNEDTLWTG P D+T+P P
Sbjct: 28 SRPLKVTFSGPAKYWTDGIPIGNGRLGAMVWGGVSSELIQLNEDTLWTGTPTDFTDPAIP 87
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+ALS+VR+LVDSG+++EAT A+ ++FG +VY+LLGDI+LEF+ S YAE TY RELD
Sbjct: 88 QALSEVRNLVDSGKFSEATKAAARMFGKYTNVYKLLGDIKLEFNGS--TYAEGTYYRELD 145
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TAT RVKY+V +VEFTREHF+SNPDQVIVTKISGS++ S+SF VSLDS+L++ Y+
Sbjct: 146 LDTATGRVKYTVDDVEFTREHFASNPDQVIVTKISGSKAQSVSFAVSLDSILEHQCYLTD 205
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
NQ++MEG CPGKR+ + ANDDPKG++F+A+L+++IS+ + L+D KLKV G+DW
Sbjct: 206 ENQLVMEGICPGKRMTTEVKANDDPKGMKFTAVLDLQISNGARLVRLLDDNKLKVVGADW 265
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
AVLLLVASSSF+GPF++PSDSKK+PTS+S+ A+ SI+ LSYS LY+RHLDD+Q LFHRVS
Sbjct: 266 AVLLLVASSSFEGPFVDPSDSKKNPTSDSLQAMNSIKKLSYSQLYSRHLDDFQNLFHRVS 325
Query: 310 IQLSRSP---------KDIVTDTCS--EENIDTV-PSAERVKSFQTDEDPSLVELLFQFG 357
+QL +S K+++ E N D V P+ ER+KSF++DEDPSLVELLFQFG
Sbjct: 326 LQLEKSSAIGDGVSEIKNLMPSVIEDFEGNKDVVVPTVERIKSFESDEDPSLVELLFQFG 385
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLIS SRPGTQVANLQGIWN+DL P WDSAP +NINLEMNYW SLPCNL ECQEPLFD
Sbjct: 386 RYLLISCSRPGTQVANLQGIWNKDLYPAWDSAPTLNINLEMNYWPSLPCNLRECQEPLFD 445
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F+ LSINGSK AQVNY+ SGWV HH++DIW K+SAD G WA+WPM GAW+CTHLWEH
Sbjct: 446 FIKSLSINGSKVAQVNYITSGWVAHHRSDIWEKASADMGNPKWAIWPMAGAWVCTHLWEH 505
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y YT+D+DFL AYPLLEGCASFL+DWLIEG+DGYLETNPSTSPEH FIAPDG A VS
Sbjct: 506 YTYTLDKDFLINTAYPLLEGCASFLMDWLIEGNDGYLETNPSTSPEHMFIAPDGNSASVS 565
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
YSSTMDMAII EVFSAI+SA+EVL ++EDALV+KVLK+ PRL P KIA DGSIMEW
Sbjct: 566 YSSTMDMAIINEVFSAIVSASEVLGRSEDALVQKVLKAQPRLYPPKIAPDGSIMEW 621
>gi|255573093|ref|XP_002527476.1| conserved hypothetical protein [Ricinus communis]
gi|223533116|gb|EEF34874.1| conserved hypothetical protein [Ricinus communis]
Length = 849
Score = 856 bits (2212), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 412/599 (68%), Positives = 496/599 (82%), Gaps = 15/599 (2%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLKI F+GPAKH+TDAIPIGNGRLGAMV+GGV SETL++NEDTLWTG PG+YTNP+AP+A
Sbjct: 36 PLKIVFSGPAKHWTDAIPIGNGRLGAMVFGGVASETLRINEDTLWTGTPGNYTNPNAPEA 95
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L+ VR LV +YAEAT +VKL G P+++YQ+LGDI+LEFDDSHL Y E+TY+RELDL+
Sbjct: 96 LTQVRKLVGDRKYAEATTEAVKLSGLPSEIYQVLGDIKLEFDDSHLSYDEKTYQRELDLD 155
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TATARVKYS+G+VE+TREHF+SNP+QV+VTKI+ S+ GS+SF V LDS L +HSY G N
Sbjct: 156 TATARVKYSLGDVEYTREHFASNPNQVVVTKIAASKPGSVSFTVLLDSELHHHSYTKGEN 215
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
QI +EG CPGKR PP+ A+D PKGI+F+AIL+++IS+ RG I L+D+KLKVEGSDWAV
Sbjct: 216 QIFIEGSCPGKRAPPQIYASDGPKGIEFAAILKLQISEGRGKIHVLDDRKLKVEGSDWAV 275
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
L LVASSSFDGPF PS SKKDPTS + AL ++NLSY+DLY RHLDDYQ LFHRVS++
Sbjct: 276 LSLVASSSFDGPFTMPSASKKDPTSACLHALDLVKNLSYTDLYARHLDDYQTLFHRVSLR 335
Query: 312 LSRSPKDIVTD---------------TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
LS+S K I+ + + +E DT+ +AERVKSF+TDEDPSLVELLFQ+
Sbjct: 336 LSKSSKSILGNGPLNMKKFLSFKNYLSLNESKDDTISTAERVKSFRTDEDPSLVELLFQY 395
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLIS SRPGTQVANLQGIW++D +P WD A H+NINL+MNYW +L CNL EC EPLF
Sbjct: 396 GRYLLISCSRPGTQVANLQGIWSKDNAPPWDGAQHLNINLQMNYWPALSCNLHECHEPLF 455
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
++++ LSINGS TA+VNY A+GWV H +D+WAK+S DRG+ VWALWPMGGAWLC HLWE
Sbjct: 456 EYMSSLSINGSMTAKVNYEANGWVAHQVSDLWAKTSPDRGEAVWALWPMGGAWLCIHLWE 515
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY YTMD+DFL+ +AYPLLEGCA+FLLDWLIEG GYLETNPSTSPEH FIAPDGK A V
Sbjct: 516 HYTYTMDKDFLKNKAYPLLEGCATFLLDWLIEGPGGYLETNPSTSPEHMFIAPDGKPASV 575
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S S+TMD+ II+EVFS I+SAAEVL + ED L++KV ++ PRLRP KIA DGSIMEW Q
Sbjct: 576 SNSTTMDVEIIQEVFSEIVSAAEVLGRKEDELIQKVREAQPRLRPIKIARDGSIMEWAQ 634
>gi|449446103|ref|XP_004140811.1| PREDICTED: alpha-L-fucosidase 2-like [Cucumis sativus]
Length = 803
Score = 852 bits (2202), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/588 (69%), Positives = 486/588 (82%), Gaps = 2/588 (0%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
+++PLK+TFN PAKH+TDAIPIGNGRLGAMVWGGV +E L+LNEDTLWTG P DYTNPDA
Sbjct: 4 SSDPLKLTFNAPAKHWTDAIPIGNGRLGAMVWGGVDTEILQLNEDTLWTGTPADYTNPDA 63
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
P+AL +VR LVD G+YAEAT A+VKL G P+DVYQLLGDI+LEF+ SH Y ETY REL
Sbjct: 64 PEALREVRKLVDDGKYAEATEAAVKLSGKPSDVYQLLGDIKLEFEVSHQSYTPETYHREL 123
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV- 187
DLNTATARVKYSVG+VEFTREHF+SNPDQ IVTKI+ S+ GSL+F VS+DS L + S+V
Sbjct: 124 DLNTATARVKYSVGDVEFTREHFASNPDQAIVTKIAASKPGSLTFIVSIDSKLHHSSHVV 183
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
+G + I++ G C G RIPPK + +D+PKGIQ+SA+L +++SD + L++KKLKV GS
Sbjct: 184 DGQSLIVLHGSCRGVRIPPKMDFDDNPKGIQYSAVLSLQVSDGSVVVHDLDEKKLKVNGS 243
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
DWAVL LVASSSF GPF PS S KDP+SES++ ++ I+ LSYS+LY RHL+DYQ LF R
Sbjct: 244 DWAVLRLVASSSFKGPFTQPSLSGKDPSSESLATMKKIKGLSYSNLYARHLNDYQSLFQR 303
Query: 308 VSIQLSRSPKDIVTDTC-SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
VS+ LS+S K+ + + + +AERVKSFQTDEDPSLVELLFQ+ RYLLIS SR
Sbjct: 304 VSLHLSKSSKNESSSPNSGGKEVRVASTAERVKSFQTDEDPSLVELLFQYSRYLLISCSR 363
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQVANLQGIWN+++ P WD APH+NINL+MNYW SL CNL ECQEPLFDF ++LS+NG
Sbjct: 364 PGTQVANLQGIWNKNVEPAWDGAPHLNINLQMNYWPSLSCNLKECQEPLFDFTSFLSVNG 423
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
KTA+ NY ASGWV H +DIWAKSS DRG+ VWALWPMGGAWLCTHLWEHY YTMD++F
Sbjct: 424 RKTAKANYEASGWVAHQVSDIWAKSSPDRGQAVWALWPMGGAWLCTHLWEHYTYTMDKNF 483
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L+ +AYPL+EGCASFLLDWLI+G DGYLETNPSTSPEH FIAPDGK A VSYS+TMDMAI
Sbjct: 484 LKNKAYPLMEGCASFLLDWLIDGKDGYLETNPSTSPEHMFIAPDGKPASVSYSTTMDMAI 543
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+EVFS+IISAAE+L K +D ++KV K+ RL P KIA+DGS+MEW
Sbjct: 544 TKEVFSSIISAAEILGKTKDTFIDKVRKAQARLLPYKIAKDGSLMEWA 591
>gi|158302693|dbj|BAF85832.1| alpha-1,2-fucosidase [Lilium longiflorum]
Length = 854
Score = 839 bits (2167), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 407/617 (65%), Positives = 482/617 (78%), Gaps = 34/617 (5%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PLK+ F PAKH+TDA PIGNGRLGAMVWGGVP+ETL+LN+DTLWTGVPG+YTNPDAP
Sbjct: 31 QPLKLRFLEPAKHWTDAAPIGNGRLGAMVWGGVPTETLQLNDDTLWTGVPGNYTNPDAPT 90
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
LS VR LVD G+YAEA+ A+ L GHP+DVYQ LG + LEF DSH+ Y+ Y+RELDL
Sbjct: 91 VLSKVRKLVDDGKYAEASLAAFDLSGHPSDVYQPLGTMNLEFGDSHVAYS--NYQRELDL 148
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TATA+V YS+G+VEFTREHFSSNP QV+VTKIS ++SGSLSF VSLDS L + S +G
Sbjct: 149 TTATAKVTYSLGDVEFTREHFSSNPHQVLVTKISANKSGSLSFIVSLDSKLHHQSSADGV 208
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N+IIMEG CPG+RI PK N ++ KGIQFSA+L++KI + + LED KLKVEGSDWA
Sbjct: 209 NRIIMEGSCPGRRIAPKGNLFENNKGIQFSAVLDLKIGGNDSNVQVLEDMKLKVEGSDWA 268
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
VLLL ASSSF+GPFINPSDS+KDP S S+ L +I+ +S+S L+T H++DYQ LFH V++
Sbjct: 269 VLLLAASSSFEGPFINPSDSEKDPKSASLDTLNAIQKISFSQLFTHHVEDYQSLFHCVTL 328
Query: 311 QLSRSPKD---------------IVTDTCSEENIDTV----PS-------------AERV 338
QLS+ I+ TCS N++ V PS AERV
Sbjct: 329 QLSKGSNSGGRTTVPLSQSYDSSILGTTCSLNNMEKVNTSNPSYSDQLTEEVLISTAERV 388
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
KSF+ DEDPSLVELLF +GRYLLIS SRPGTQ+ANLQGIW++D+ P WD+APH+NINL+M
Sbjct: 389 KSFKVDEDPSLVELLFHYGRYLLISCSRPGTQIANLQGIWSKDIEPAWDAAPHLNINLQM 448
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
NYW SL CNLSECQEPLFD++ L+ING+KTA+VNY ASGWV H +DIWAK+S DRG
Sbjct: 449 NYWPSLSCNLSECQEPLFDYIASLAINGAKTAKVNYEASGWVAHQVSDIWAKTSPDRGDP 508
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
VWALWPMGGAWLCTHLWEHY ++MD+ FLE AYPLLEGCASFLLDWLIEG GYLETNP
Sbjct: 509 VWALWPMGGAWLCTHLWEHYTFSMDKVFLENTAYPLLEGCASFLLDWLIEGRGGYLETNP 568
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
STSPEH FIAPD K A VSYSSTMDMAIIREVFS IS+AE+L + E LV+++ K++PR
Sbjct: 569 STSPEHSFIAPDSKTASVSYSSTMDMAIIREVFSEFISSAEILGRVESKLVKQIKKAIPR 628
Query: 579 LRPTKIAEDGSIMEWVQ 595
L PTKIA DG+IMEW Q
Sbjct: 629 LPPTKIARDGTIMEWAQ 645
>gi|357479527|ref|XP_003610049.1| Macrophage migration inhibitory factor-like protein [Medicago
truncatula]
gi|355511104|gb|AES92246.1| Macrophage migration inhibitory factor-like protein [Medicago
truncatula]
Length = 855
Score = 827 bits (2137), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 392/609 (64%), Positives = 486/609 (79%), Gaps = 29/609 (4%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ NA+ + PLK+TF+ AK++TDAIPIGNGRLGAM+WGG+ SE L+LNEDTLWTG+P
Sbjct: 22 LANADDDEPSMPLKVTFSRSAKYWTDAIPIGNGRLGAMIWGGIQSEVLQLNEDTLWTGIP 81
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
G+YT+ +AP+AL++VR LVD +Y+EAT A++KL G P +VYQLLGDIEL+FDDSHLKY+
Sbjct: 82 GNYTDKNAPEALAEVRKLVDDRKYSEATTAALKLLGPPGEVYQLLGDIELQFDDSHLKYS 141
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
EE+Y RELDL+ AT HF+SNPDQV+VTK S S SGSLSF VSLDS
Sbjct: 142 EESYHRELDLDNAT---------------HFASNPDQVLVTKFSTSNSGSLSFTVSLDSK 186
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
L +++ ++ NQIIMEG CPGKRIPP+ N++D+PKGIQFSA+L+++IS+++G I L+DK
Sbjct: 187 LHHNTRLSSKNQIIMEGSCPGKRIPPQVNSSDEPKGIQFSAVLDVQISNEKGVIHVLDDK 246
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL+VEGSDWA+LLL ASSSFDGPF NP +SKKD TSES+S ++ + +L Y D+Y RHLDD
Sbjct: 247 KLRVEGSDWAILLLTASSSFDGPFTNPENSKKDLTSESLSKMKFVTSLKYDDIYARHLDD 306
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEE--------NI------DTVPSAERVKSFQTDED 346
YQ LFHRVS+QLS+S K ++ +E NI D VP++ R+KSFQ DED
Sbjct: 307 YQNLFHRVSLQLSKSSKTVLGKPILDEGKMVSCQTNISQLRGGDIVPTSSRIKSFQNDED 366
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
PS VELLFQ+GRYLLI+ SRPGTQVANLQGIWN+D+ P WD APH+NINL+MNYW SL C
Sbjct: 367 PSFVELLFQYGRYLLIACSRPGTQVANLQGIWNKDVVPKWDGAPHLNINLQMNYWPSLSC 426
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
NL ECQEPLFD ++ LS+NGSKTA+VNY A+GWV HH +D+WAK+S RG VWALWPMG
Sbjct: 427 NLHECQEPLFDCISSLSVNGSKTAKVNYDANGWVAHHVSDLWAKTSTYRGPAVWALWPMG 486
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
GAWLCTHLWEHY YT D++FL+ +AYPLLEGC SFLLDWLIEG G LETNPSTSPEH F
Sbjct: 487 GAWLCTHLWEHYTYTTDKEFLKNKAYPLLEGCTSFLLDWLIEGPGGLLETNPSTSPEHMF 546
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
IA D K A VSYSSTMD++II+EVFS +ISAAE+L + +DA++++V +S +L P KIA
Sbjct: 547 IASDQKRASVSYSSTMDISIIKEVFSIVISAAEILGRQDDAIIKRVFESQSKLPPIKIAR 606
Query: 587 DGSIMEWVQ 595
DGSIMEW +
Sbjct: 607 DGSIMEWAE 615
>gi|356495827|ref|XP_003516773.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
Length = 802
Score = 809 bits (2089), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 395/593 (66%), Positives = 468/593 (78%), Gaps = 11/593 (1%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
AE + N LKI F KH+TDA+PIGNGRLGAMV G V SET+ LNEDTLWTG P DY
Sbjct: 2 AEGRGSRN-LKIRFREGGKHWTDAVPIGNGRLGAMVCGHVHSETIHLNEDTLWTGTPADY 60
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA-EE 122
TN AP ALS VR+LV Y +ATAAS L G+P++ Y LLGDI+L+FD SHL ++
Sbjct: 61 TNSKAPPALSHVRNLVHRQHYPQATAASSALTGNPSEAYLLLGDIQLDFDYSHLTPGLQQ 120
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y RELDL+TAT +V+YSVG+V+FTREHF+S PDQ+IVT+IS S+ LSF VSL S +
Sbjct: 121 PYERELDLDTATVKVRYSVGDVQFTREHFASYPDQLIVTQISSSKPAKLSFTVSLLSKII 180
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N +YVN NQIIM+G CPGKRI +P GIQFSAIL++KI G I L++ KL
Sbjct: 181 NQTYVNAPNQIIMKGSCPGKRI------QHNPHGIQFSAILDLKIGGTDGVIHILDNNKL 234
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
KVE SDWAVLLLVASSSF GPF PSDSKKDPTS+ + L SI N+SYS LY RHL+DYQ
Sbjct: 235 KVEASDWAVLLLVASSSFSGPFTAPSDSKKDPTSQCFTTLSSISNVSYSHLYARHLNDYQ 294
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
LFHRVS+QL RS + +++ + + +++RVKSFQTDEDPSLVELLFQ+GRYLLI
Sbjct: 295 GLFHRVSLQLMRSTRPNISE---DSTVTQASTSDRVKSFQTDEDPSLVELLFQYGRYLLI 351
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSSRPGTQVANLQGIWN+DL P WD APH+NINLEMNYW +LPCNLSECQEPLFD+++ L
Sbjct: 352 SSSRPGTQVANLQGIWNKDLEPVWDGAPHLNINLEMNYWPALPCNLSECQEPLFDYISLL 411
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S+NGSKTA VNY A+GWV H K+DIWA++SA +G VVWALWPMGGAWLCTHLWEHY YTM
Sbjct: 412 SVNGSKTAHVNYQANGWVAHSKSDIWARTSAGQGDVVWALWPMGGAWLCTHLWEHYAYTM 471
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D DFL+ +AYPL+EGC SFLL WLIE +GYLETNPSTSPEH FIAP+G+ ACVS SSTM
Sbjct: 472 DEDFLKYKAYPLMEGCVSFLLSWLIEDSEGYLETNPSTSPEHYFIAPNGEPACVSQSSTM 531
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D+AII EVFS +SAAEV+ + +D +V +V K+ PRLRP IA+DGSIMEWV+
Sbjct: 532 DVAIINEVFSTFLSAAEVIGRTKDNIVGEVRKAQPRLRPINIAQDGSIMEWVK 584
>gi|297802554|ref|XP_002869161.1| hypothetical protein ARALYDRAFT_912968 [Arabidopsis lyrata subsp.
lyrata]
gi|297314997|gb|EFH45420.1| hypothetical protein ARALYDRAFT_912968 [Arabidopsis lyrata subsp.
lyrata]
Length = 844
Score = 802 bits (2072), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 380/595 (63%), Positives = 469/595 (78%), Gaps = 17/595 (2%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+TF GP++++TDAIPIGNGRLGA +WGGV SETL +NEDT+WTGVP DYTNP+AP
Sbjct: 48 SRPLKLTFGGPSRNWTDAIPIGNGRLGATIWGGVSSETLNINEDTIWTGVPADYTNPNAP 107
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+AL++VR LVD YAEAT+ +VKL G P+DVYQL+GD+ LEF SH KY + +YRRELD
Sbjct: 108 EALAEVRRLVDEKNYAEATSEAVKLSGQPSDVYQLVGDLNLEFGSSHRKYTQTSYRRELD 167
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A+V YSVG V+F+RE F+SNPDQVIV KI S+ GSLSF VS DS L +HS N
Sbjct: 168 LETAVAKVSYSVGAVDFSREFFASNPDQVIVAKIYASKPGSLSFKVSFDSELHHHSETNP 227
Query: 190 N-NQIIMEGRCPGKRIP----PKANAN----DDPKGIQFSAILEIKISDDRGTISALEDK 240
NQI+M G C KR+P NA DD KG+QF++ILE+++S+ G++S+L K
Sbjct: 228 KANQILMRGSCRPKRLPVNLKKSINATNIPYDDHKGLQFASILEVRVSNG-GSVSSLGGK 286
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL VE +DWAVLLL ASS+FDGPF P+DSK+DP E + S++ SYSDLY RHL D
Sbjct: 287 KLSVEKADWAVLLLAASSNFDGPFTMPADSKRDPAKECAKRISSVQKYSYSDLYARHLGD 346
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQKLF+RVS+QLS S + + +AERV+SF+TDEDP+LVELLFQ+GRYL
Sbjct: 347 YQKLFNRVSLQLSGSSGNKTVQQAAS-------TAERVRSFKTDEDPALVELLFQYGRYL 399
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSRPGTQVANLQGIWN D+ P WD APH+NINL+MNYW SLP N+ ECQEPLFD+++
Sbjct: 400 LISSSRPGTQVANLQGIWNRDIQPPWDGAPHLNINLQMNYWHSLPGNIRECQEPLFDYMS 459
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+ING KTAQ+NY ASGWV H +DIWAK+S DRG+ VWALWPMGGAWLCTH WEHY Y
Sbjct: 460 ALAINGRKTAQMNYGASGWVAHQVSDIWAKTSPDRGEAVWALWPMGGAWLCTHAWEHYTY 519
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
TMD++FL+K+ YPLLEGC SFLLDWLI+G DG+L+TNPSTSPEH F AP+GK A VSYSS
Sbjct: 520 TMDKEFLKKKGYPLLEGCTSFLLDWLIKGKDGFLQTNPSTSPEHMFTAPNGKPASVSYSS 579
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD+AII+EVF+ I++A+E+L K D L+ KV+ + +L PT+I++DGSIMEW +
Sbjct: 580 TMDIAIIKEVFADIVTASEILGKTNDTLIGKVIAAQAKLPPTRISKDGSIMEWAE 634
>gi|30689979|ref|NP_195152.2| alpha-L-fucosidase 2 [Arabidopsis thaliana]
gi|75245768|sp|Q8L7W8.1|FUCO2_ARATH RecName: Full=Alpha-L-fucosidase 2; AltName:
Full=Alpha-1,2-fucosidase 2; AltName:
Full=Alpha-L-fucoside fucohydrolase 2; Flags: Precursor
gi|21928117|gb|AAM78086.1| AT4g34260/F10M10_30 [Arabidopsis thaliana]
gi|27363438|gb|AAO11638.1| At4g34260/F10M10_30 [Arabidopsis thaliana]
gi|332660949|gb|AEE86349.1| alpha-L-fucosidase 2 [Arabidopsis thaliana]
Length = 843
Score = 791 bits (2042), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 378/595 (63%), Positives = 465/595 (78%), Gaps = 17/595 (2%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+TF GP++++TDAIPIGNGRLGA +WGGV SE L +NEDT+WTGVP DYTN AP
Sbjct: 49 SRPLKLTFGGPSRNWTDAIPIGNGRLGATIWGGVSSEILNINEDTIWTGVPADYTNQKAP 108
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+AL++VR LVD YAEAT+ +VKL G P+DVYQ++GD+ LEFD SH KY + +YRRELD
Sbjct: 109 EALAEVRRLVDERNYAEATSEAVKLSGQPSDVYQIVGDLNLEFDSSHRKYTQASYRRELD 168
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A+V YSVG V+F+RE F+SNPDQVI+ KI S+ GSLSF VS DS L +HS N
Sbjct: 169 LETAVAKVSYSVGAVDFSREFFASNPDQVIIAKIYASKPGSLSFKVSFDSELHHHSETNP 228
Query: 190 N-NQIIMEGRCPGKRIP----PKANAN----DDPKGIQFSAILEIKISDDRGTISALEDK 240
NQI+M G C KR+P NA DD KG+QF++ILE+++S+ G++S+L K
Sbjct: 229 KANQILMRGSCRPKRLPVNLKKSINATNIPYDDHKGLQFASILEVRVSNG-GSVSSLGGK 287
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL VE +DWAVLLL ASS+FDGPF P DSK DP E ++ + S++ SYSDLY RHL D
Sbjct: 288 KLSVEKADWAVLLLAASSNFDGPFTMPVDSKIDPAKECVNRISSVQKYSYSDLYARHLGD 347
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQKLF+RVS+ LS S + +E +AERV+SF+TD+DPSLVELLFQ+GRYL
Sbjct: 348 YQKLFNRVSLHLSGS-------STNETVQQATSTAERVRSFKTDQDPSLVELLFQYGRYL 400
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSRPGTQVANLQGIWN D+ P WD APH+NINL+MNYW SLP N+ ECQEPLFD+++
Sbjct: 401 LISSSRPGTQVANLQGIWNRDIQPPWDGAPHLNINLQMNYWHSLPGNIRECQEPLFDYMS 460
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+ING KTAQVNY ASGWV H +DIWAK+S DRG+ VWALWPMGGAWLCTH WEHY Y
Sbjct: 461 ALAINGRKTAQVNYGASGWVAHQVSDIWAKTSPDRGEAVWALWPMGGAWLCTHAWEHYTY 520
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
TMD++FL+K+ YPLLEGC SFLLDWLI+G DG+L+TNPSTSPEH F AP GK A VSYSS
Sbjct: 521 TMDKEFLKKKGYPLLEGCTSFLLDWLIKGKDGFLQTNPSTSPEHMFTAPIGKPASVSYSS 580
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD+AII+EVF+ I+SA+E+L K D L+ KV+ + +L PT+I++DGSI EW +
Sbjct: 581 TMDIAIIKEVFADIVSASEILGKTNDTLIGKVIAAQAKLPPTRISKDGSIREWAE 635
>gi|449531868|ref|XP_004172907.1| PREDICTED: alpha-L-fucosidase 2-like, partial [Cucumis sativus]
Length = 764
Score = 773 bits (1997), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/546 (68%), Positives = 444/546 (81%), Gaps = 3/546 (0%)
Query: 52 EDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELE 111
EDTLWTG P DYTNPDAP+AL +VR LVD G+YAEAT A+VKL G P+DVYQLLGDI+LE
Sbjct: 7 EDTLWTGTPADYTNPDAPEALREVRKLVDDGKYAEATEAAVKLSGKPSDVYQLLGDIKLE 66
Query: 112 FDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
F+ SH Y ETY RELDLNTATARVKYSVG+VEFTREHF+SNPDQ IVTKI+ S+ GSL
Sbjct: 67 FEVSHQSYTPETYHRELDLNTATARVKYSVGDVEFTREHFASNPDQAIVTKIAASKPGSL 126
Query: 172 SFNVSLDSLLDNHSYV-NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
+F VS+DS L + S+V +G + I++ G C G RIPPK + +D+PKGIQ+SA+L +++SD
Sbjct: 127 TFIVSIDSKLHHSSHVVDGQSLIVLHGSCRGVRIPPKMDFDDNPKGIQYSAVLSLQVSDG 186
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
+ L++KKLKV GSDWAVL LVASSSF GPF PS S KDP+SES++ ++ I+ LSY
Sbjct: 187 SVVVHDLDEKKLKVNGSDWAVLRLVASSSFKGPFTQPSLSGKDPSSESLATMKKIKGLSY 246
Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC-SEENIDTVPSAERVKSFQTDEDPSL 349
S+LY RHL+DYQ LF RVS+ LS+S K+ + + + +AERVKSFQTDEDPSL
Sbjct: 247 SNLYARHLNDYQSLFQRVSLHLSKSSKNESSSPNSGGKEVRVASTAERVKSFQTDEDPSL 306
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
VELLFQ+ RYLLIS SRPGTQVANLQGIWN+++ P WD APH+NINL+MNYW SL CNL
Sbjct: 307 VELLFQYSRYLLISCSRPGTQVANLQGIWNKNVEPAWDGAPHLNINLQMNYWPSLSCNLK 366
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
ECQEPLFDF ++LS+NG KTA+ NY ASGWV H +DIWAKSS DRG+ VWALWPMGGAW
Sbjct: 367 ECQEPLFDFTSFLSVNGRKTAKANYEASGWVAHQVSDIWAKSSPDRGQAVWALWPMGGAW 426
Query: 470 LCTHLWEHYNYTMDR-DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
LCTHLWEHY YTMD+ F + +AYPL+EGCASFLLDWLI+G DGYLETNPSTSPEH FIA
Sbjct: 427 LCTHLWEHYTYTMDKVKFFKNKAYPLMEGCASFLLDWLIDGKDGYLETNPSTSPEHMFIA 486
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
PDGK A VSYS+TMDMAI +EVFS+IISAAE+L K +D ++KV K+ RL P KIA+DG
Sbjct: 487 PDGKPASVSYSTTMDMAITKEVFSSIISAAEILGKTKDTFIDKVRKAQARLLPYKIAKDG 546
Query: 589 SIMEWV 594
S+MEW
Sbjct: 547 SLMEWA 552
>gi|4455171|emb|CAB36703.1| hypothetical protein [Arabidopsis thaliana]
gi|7270376|emb|CAB80143.1| hypothetical protein [Arabidopsis thaliana]
Length = 847
Score = 760 bits (1963), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 372/600 (62%), Positives = 460/600 (76%), Gaps = 23/600 (3%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+TF GP++++TDAIPIGNGRLGA +WGGV SE L +NEDT+WTGVP DYTN AP
Sbjct: 49 SRPLKLTFGGPSRNWTDAIPIGNGRLGATIWGGVSSEILNINEDTIWTGVPADYTNQKAP 108
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+AL++VR LVD YAEAT+ +VKL G P+DVYQ++GD+ LEFD SH KY + +YRRELD
Sbjct: 109 EALAEVRRLVDERNYAEATSEAVKLSGQPSDVYQIVGDLNLEFDSSHRKYTQASYRRELD 168
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A+V YSVG V+F+RE F+SNPDQVI+ KI S+ GSLSF VS DS L +HS N
Sbjct: 169 LETAVAKVSYSVGAVDFSREFFASNPDQVIIAKIYASKPGSLSFKVSFDSELHHHSETNP 228
Query: 190 N-NQIIMEGRCPGKRIP----PKANAN----DDPKGIQFSAILEIKISDDRGTISALEDK 240
NQI+M G C KR+P NA DD KG+QF++ILE+++S+ G++S+L K
Sbjct: 229 KANQILMRGSCRPKRLPVNLKKSINATNIPYDDHKGLQFASILEVRVSNG-GSVSSLGGK 287
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL VE +DWAVLLL ASS+FDGPF P DSK DP E ++ + S++ SYSDLY RHL D
Sbjct: 288 KLSVEKADWAVLLLAASSNFDGPFTMPVDSKIDPAKECVNRISSVQKYSYSDLYARHLGD 347
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQKLF+RVS+ LS S + +E +AERV+SF+TD+DPSLVELLFQ+GRYL
Sbjct: 348 YQKLFNRVSLHLSGS-------STNETVQQATSTAERVRSFKTDQDPSLVELLFQYGRYL 400
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTW-----DSAPHVNINLEMNYWQSLPCNLSECQEPL 415
LISSSRPGTQVANLQ + L+P APH+NINL+MNYW SLP N+ ECQEPL
Sbjct: 401 LISSSRPGTQVANLQA-FVVSLTPLLLLRYCSGAPHLNINLQMNYWHSLPGNIRECQEPL 459
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
FD+++ L+ING KTAQVNY ASGWV H +DIWAK+S DRG+ VWALWPMGGAWLCTH W
Sbjct: 460 FDYMSALAINGRKTAQVNYGASGWVAHQVSDIWAKTSPDRGEAVWALWPMGGAWLCTHAW 519
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY YTMD++FL+K+ YPLLEGC SFLLDWLI+G DG+L+TNPSTSPEH F AP GK A
Sbjct: 520 EHYTYTMDKEFLKKKGYPLLEGCTSFLLDWLIKGKDGFLQTNPSTSPEHMFTAPIGKPAS 579
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
VSYSSTMD+AII+EVF+ I+SA+E+L K D L+ KV+ + +L PT+I++DGSI EW +
Sbjct: 580 VSYSSTMDIAIIKEVFADIVSASEILGKTNDTLIGKVIAAQAKLPPTRISKDGSIREWAE 639
>gi|78708252|gb|ABB47227.1| large secreted protein, putative, expressed [Oryza sativa Japonica
Group]
gi|222612646|gb|EEE50778.1| hypothetical protein OsJ_31136 [Oryza sativa Japonica Group]
Length = 851
Score = 749 bits (1934), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 366/612 (59%), Positives = 457/612 (74%), Gaps = 32/612 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL++ F P+++FTDA PIGNG LGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 34 PLEVVFASPSRYFTDAAPIGNGSLGALVWGGVASEKLQLNHDTLWTGGPGNYTNPKAPAV 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
LS VR LV+ GQYA+ATA + L G VYQ LGDI+L FD+ + E+T Y+R LDL
Sbjct: 94 LSKVRDLVNRGQYAKATAVAYGLSGDQTQVYQPLGDIDLAFDE----HVEDTNYKRNLDL 149
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TAT V Y++G V +REHFSSNP QVIVTKIS + G++SF VSL + L++ V
Sbjct: 150 RTATVNVSYTIGEVVHSREHFSSNPHQVIVTKISADKPGNVSFTVSLTTPLNHQIRVTNA 209
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N+IIMEG CPG+R NA+D P GI+FSAIL +++S GT+ L DK LK+ G+D A
Sbjct: 210 NEIIMEGYCPGERPTEYGNASDHPVGIKFSAILYLQMSGSNGTVEILNDKMLKLVGADSA 269
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
VLLL A++SF+GPF+NPS+SK DPT+ +++ L RN+SYS L H+DDYQ LF RVS+
Sbjct: 270 VLLLAAATSFEGPFVNPSESKLDPTASALTTLTVARNMSYSQLKAYHVDDYQNLFQRVSL 329
Query: 311 QLSRS-------------PKDIVTDT-----------CSEE---NIDTVPSAERVKSFQT 343
QLSR P++ + +T CS N P+ +R+ SF+
Sbjct: 330 QLSRDSNDALGGNGLVNLPENSLQETSVSDYAVQMVECSRFQGFNNSGKPTVDRILSFRD 389
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIWN++ SP WD+APH NINL+MNYW +
Sbjct: 390 DEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWNDETSPPWDAAPHPNINLQMNYWPA 449
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
LPCNLSECQEPLFDF+ LS+NG+KTA+VNY ASGWV H TD+WAK+S D G +WALW
Sbjct: 450 LPCNLSECQEPLFDFIGSLSVNGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWALW 509
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
PMGG WL THLWEHY+YTMD+ FLEK AYPLLEG ASFLLDWLIEG+ YLETNPSTSPE
Sbjct: 510 PMGGPWLATHLWEHYSYTMDKQFLEKTAYPLLEGSASFLLDWLIEGNGDYLETNPSTSPE 569
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
H FIAPDG+ ACVSYS+TMDM+IIREVFSA++ ++++L K++ +V+++ K++PRL P K
Sbjct: 570 HYFIAPDGRKACVSYSTTMDMSIIREVFSAVLMSSDILGKSDSDMVQRIKKAIPRLPPIK 629
Query: 584 IAEDGSIMEWVQ 595
+A DG+IMEW Q
Sbjct: 630 VARDGTIMEWAQ 641
>gi|218197301|gb|EEC79728.1| hypothetical protein OsI_21058 [Oryza sativa Indica Group]
Length = 815
Score = 748 bits (1930), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 350/586 (59%), Positives = 456/586 (77%), Gaps = 5/586 (0%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F+ PA+HFTDA PIGNG LGAMVWG V SE L+LN DTLWTGVPG+YT+P+AP A
Sbjct: 21 PLKVVFDSPAEHFTDAAPIGNGSLGAMVWGSVASEKLQLNHDTLWTGVPGNYTDPNAPYA 80
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L+ VR LVD ++ +AT A+ LFG P +VYQ LGDI LEFD S L Y +Y+RELDL
Sbjct: 81 LAVVRKLVDGEKFVDATEAASGLFGGPTEVYQPLGDINLEFDSSSLGYT--SYKRELDLR 138
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT + Y++G V+++REHF SNP QV TKIS ++SG +SF +SL+S L+++ + N
Sbjct: 139 TATVCISYNIGEVQYSREHFCSNPHQVFATKISANKSGHVSFTLSLNSQLNHNVRITNAN 198
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++IM+G CPG+R N +D GI+F+ + ++I ++ ++D+KL+++ +DW V
Sbjct: 199 EMIMQGTCPGRRPALHHNGANDAIGIKFATAVGLQIGGTSAKVTIIDDQKLRIDAADWVV 258
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LL+ A+SSFDGPF+NPS+SK +P +++ L RN ++S L HL+DYQ LFHRV++Q
Sbjct: 259 LLVAAASSFDGPFVNPSESKLNPEVAALNTLNISRNATFSQLKAAHLEDYQGLFHRVTLQ 318
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS++ + D E + D +AER+ SF++DEDPSLVELLFQ+GRYLLISSSRPGTQV
Sbjct: 319 LSQASM-LEKDILEEVDHDVKTTAERINSFRSDEDPSLVELLFQYGRYLLISSSRPGTQV 377
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
+NLQGIWN+D +P W+++PH+NINLEMNYW +LPCNLSECQEPLFD + L++NG+KTA+
Sbjct: 378 SNLQGIWNQDFAPAWEASPHLNINLEMNYWPTLPCNLSECQEPLFDLIGSLAVNGTKTAK 437
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
VNY ASGWV HH TDIWAKSSA ++ALWPMGGAWLCTHLWE+Y Y++D++FLEKRA
Sbjct: 438 VNYQASGWVTHHVTDIWAKSSAYYVDAMYALWPMGGAWLCTHLWENYQYSLDKEFLEKRA 497
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIRE 549
YPLLEGCA FL+DWLI+G YLETNPSTSPEH FIAP G LA VSYS+TMD++IIRE
Sbjct: 498 YPLLEGCAMFLIDWLIKGPGDYLETNPSTSPEHPFIAPGTGGHLASVSYSTTMDISIIRE 557
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
VF A+IS+AEVL K++ LVE++ K+LP L P KI++DG+IMEW Q
Sbjct: 558 VFLAVISSAEVLGKSDTNLVERIKKALPMLPPVKISKDGTIMEWAQ 603
>gi|110288916|gb|ABG66022.1| large secreted protein, putative, expressed [Oryza sativa Japonica
Group]
gi|222612642|gb|EEE50774.1| hypothetical protein OsJ_31132 [Oryza sativa Japonica Group]
Length = 815
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/586 (59%), Positives = 456/586 (77%), Gaps = 5/586 (0%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F+ PA+HFTDA PIGNG LGAMVWG V SE L+LN DTLWTGVPG+YT+P+AP A
Sbjct: 21 PLKVVFDSPAEHFTDAAPIGNGSLGAMVWGSVASEKLQLNHDTLWTGVPGNYTDPNAPYA 80
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L+ VR LVD ++ +AT A+ LFG P +VYQ LGDI LEFD S L Y +Y+RELDL
Sbjct: 81 LAVVRKLVDGEKFVDATEAASGLFGGPTEVYQPLGDINLEFDSSSLGYT--SYKRELDLR 138
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT + Y++G V+++REHF SNP QV TKIS ++SG +SF +SL+S L+++ + N
Sbjct: 139 TATVCISYNIGEVQYSREHFCSNPHQVFATKISANKSGHVSFTLSLNSQLNHNVRITNAN 198
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++IM+G CPG+R N +D GI+F+ + ++I ++ ++D+KL+++ +DW V
Sbjct: 199 EMIMQGTCPGRRPALHHNGANDAIGIKFATAVGLQIGGTSAKVTIIDDQKLRIDAADWVV 258
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LL+ A+SSFDGPF+NPS+SK +P +++ L RN ++S L HL+DYQ LFHRV++Q
Sbjct: 259 LLVAAASSFDGPFVNPSESKLNPEVAALNTLNISRNATFSQLKAAHLEDYQGLFHRVTLQ 318
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS++ + D E + D +AER+ SF++DEDPSLVELLFQ+GRYLLISSSRPGTQV
Sbjct: 319 LSQASM-LEKDILEEVDHDVKTTAERINSFRSDEDPSLVELLFQYGRYLLISSSRPGTQV 377
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
+NLQGIWN+D +P W+++PH+NINLEMNYW +LPCNL+ECQEPLFD + L++NG+KTA+
Sbjct: 378 SNLQGIWNQDFAPAWEASPHLNINLEMNYWPTLPCNLTECQEPLFDLIGSLAVNGTKTAK 437
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
VNY ASGWV HH TDIWAKSSA ++ALWPMGGAWLCTHLWE+Y Y++D++FLEKRA
Sbjct: 438 VNYQASGWVTHHVTDIWAKSSAYYVDAMYALWPMGGAWLCTHLWENYQYSLDKEFLEKRA 497
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIRE 549
YPLLEGCA FL+DWLI+G YLETNPSTSPEH FIAP G LA VSYS+TMD++IIRE
Sbjct: 498 YPLLEGCAMFLIDWLIKGPGDYLETNPSTSPEHPFIAPGTGGHLASVSYSTTMDISIIRE 557
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
VF A+IS+AEVL K++ LVE++ K+LP L P KI++DG+IMEW Q
Sbjct: 558 VFLAVISSAEVLGKSDTNLVERIKKALPMLPPVKISKDGTIMEWAQ 603
>gi|218184333|gb|EEC66760.1| hypothetical protein OsI_33136 [Oryza sativa Indica Group]
Length = 851
Score = 746 bits (1927), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 365/612 (59%), Positives = 456/612 (74%), Gaps = 32/612 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL++ F P+++FTDA PIGNG LGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 34 PLEVVFASPSRYFTDAAPIGNGSLGALVWGGVASEKLQLNHDTLWTGGPGNYTNPKAPAV 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
LS VR LV+ GQYA+ATA + L G VYQ LGDI+L FD+ + E+T Y+R LDL
Sbjct: 94 LSKVRDLVNRGQYAKATAVAYGLSGDQTQVYQPLGDIDLAFDE----HVEDTNYKRNLDL 149
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TAT V Y++G V +REHFSSNP QVIVTKIS + G++SF VSL + L++ V
Sbjct: 150 RTATVNVSYTIGGVVHSREHFSSNPHQVIVTKISADKPGNVSFTVSLTTPLNHQIRVTNA 209
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N+IIMEG CPG+R NA+D P GI+FSAIL +++S GT+ L DK LK+ G+D A
Sbjct: 210 NEIIMEGYCPGERPTEYGNASDHPVGIKFSAILYLQMSGSNGTVEILNDKMLKLVGADSA 269
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
VLLL AS+SF+GPF+NPS+SK DPT+ +++ L RN+ YS L H+DDYQ LF RVS+
Sbjct: 270 VLLLAASTSFEGPFVNPSESKLDPTASALTTLTVARNMPYSQLKAYHVDDYQNLFQRVSL 329
Query: 311 QLSRS-------------PKDIVTDT-----------CSEE---NIDTVPSAERVKSFQT 343
QLS+ P++ + +T CS N P+ +R+ SF+
Sbjct: 330 QLSQDSNDALGGNGLVNLPENSLQETSVSDYAVQMVECSRFQGFNNSGKPTVDRILSFRD 389
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIWN++ SP WD+APH NINL+MNYW +
Sbjct: 390 DEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWNDETSPPWDAAPHPNINLQMNYWPA 449
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
LPCNLSECQEPLFDF+ LS+NG+KTA+VNY ASGWV H TD+WAK+S D G +WALW
Sbjct: 450 LPCNLSECQEPLFDFIGSLSVNGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWALW 509
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
PMGG WL THLWEHY+YTMD+ FLEK AYPLLEG ASFLLDWLIEG+ YLETNPSTSPE
Sbjct: 510 PMGGPWLATHLWEHYSYTMDKQFLEKTAYPLLEGSASFLLDWLIEGNGDYLETNPSTSPE 569
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
H FIAPDG+ ACVSYS+TMDM+IIREVFSA++ ++++L K++ +V+++ K++PRL P K
Sbjct: 570 HYFIAPDGRKACVSYSTTMDMSIIREVFSAVLMSSDILGKSDSDMVQRIKKAIPRLPPIK 629
Query: 584 IAEDGSIMEWVQ 595
+A DG+IMEW Q
Sbjct: 630 VARDGTIMEWAQ 641
>gi|15451592|gb|AAK98716.1|AC090483_6 Hypothetical protein [Oryza sativa Japonica Group]
Length = 872
Score = 738 bits (1905), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 369/644 (57%), Positives = 461/644 (71%), Gaps = 59/644 (9%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL++ F P+++FTDA PIGNG LGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 34 PLEVVFASPSRYFTDAAPIGNGSLGALVWGGVASEKLQLNHDTLWTGGPGNYTNPKAPAV 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
LS VR LV+ GQYA+ATA + L G VYQ LGDI+L FD+ + E+T Y+R LDL
Sbjct: 94 LSKVRDLVNRGQYAKATAVAYGLSGDQTQVYQPLGDIDLAFDE----HVEDTNYKRNLDL 149
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TAT V Y++G V +REHFSSNP QVIVTKIS + G++SF VSL + L++ V
Sbjct: 150 RTATVNVSYTIGEVVHSREHFSSNPHQVIVTKISADKPGNVSFTVSLTTPLNHQIRVTNA 209
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N+IIMEG CPG+R NA+D P GI+FSAIL +++S GT+ L DK LK+ G+D A
Sbjct: 210 NEIIMEGYCPGERPTEYGNASDHPVGIKFSAILYLQMSGSNGTVEILNDKMLKLVGADSA 269
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
VLLL A++SF+GPF+NPS+SK DPT+ +++ L RN+SYS L H+DDYQ LF RVS+
Sbjct: 270 VLLLAAATSFEGPFVNPSESKLDPTASALTTLTVARNMSYSQLKAYHVDDYQNLFQRVSL 329
Query: 311 QLSRS-------------PKDIVTDT-----------CSEE---NIDTVPSAERVKSFQT 343
QLSR P++ + +T CS N P+ +R+ SF+
Sbjct: 330 QLSRDSNDALGGNGLVNLPENSLQETSVSDYAVQMVECSRFQGFNNSGKPTVDRILSFRD 389
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIWN++ SP WD+APH NINL+MNYW +
Sbjct: 390 DEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWNDETSPPWDAAPHPNINLQMNYWPA 449
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
LPCNLSECQEPLFDF+ LS+NG+KTA+VNY ASGWV H TD+WAK+S D G +WALW
Sbjct: 450 LPCNLSECQEPLFDFIGSLSVNGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWALW 509
Query: 464 PMGGAWLCTHLWEHYNYTMD--------------------RDFLEKRAYPLLEGCASFLL 503
PMGG WL THLWEHY+YTMD + FLEK AYPLLEG ASFLL
Sbjct: 510 PMGGPWLATHLWEHYSYTMDKKENVFRPNKVDMIVLKDAKKQFLEKTAYPLLEGSASFLL 569
Query: 504 DWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK 563
DWLIEG+ YLETNPSTSPEH FIAPDG+ ACVSYS+TMDM+IIREVFSA++ ++++L K
Sbjct: 570 DWLIEGNGDYLETNPSTSPEHYFIAPDGRKACVSYSTTMDMSIIREVFSAVLMSSDILGK 629
Query: 564 NEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLNTSFSTCKL 607
++ +V+++ K++PRL P K+A DG+IMEW+ FS C L
Sbjct: 630 SDSDMVQRIKKAIPRLPPIKVARDGTIMEWL-------FSECLL 666
>gi|296083105|emb|CBI22509.3| unnamed protein product [Vitis vinifera]
Length = 781
Score = 737 bits (1903), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 385/622 (61%), Positives = 439/622 (70%), Gaps = 121/622 (19%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F GPAKH+TDA+PIGNGRLGAMVWGGV SETL+LNE TLWTG PG+YTNPDAPKA
Sbjct: 34 PLKVRFFGPAKHWTDALPIGNGRLGAMVWGGVASETLQLNEGTLWTGTPGNYTNPDAPKA 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPAD------------------------------- 100
LS+VR LVD+G Y AT A+VKL G+P+D
Sbjct: 94 LSEVRKLVDNGDYVAATEAAVKLSGNPSDDELPSLLLDSFFDCDHVGLEVCVKYAPLLMG 153
Query: 101 -------VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSS 153
VYQLLGDI LEF+DSHL YAEETY RELDL+TAT +KYSVG+VE+TREHF+S
Sbjct: 154 YLKFNFGVYQLLGDINLEFEDSHLAYAEETYSRELDLDTATVTIKYSVGDVEYTREHFAS 213
Query: 154 NPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD 213
PDQVIVTKISGS+ GS+SF VSLDS +IPPK
Sbjct: 214 YPDQVIVTKISGSKPGSVSFTVSLDS-----------------------KIPPKV----- 245
Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
G I+ L+DKKLKVEGSDWAV
Sbjct: 246 ------------------GVINVLDDKKLKVEGSDWAVF--------------------- 266
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
L+SI N SYSDLY RHL+DYQ LFHRVS+QLS+S K + ++ V
Sbjct: 267 -------TLKSIGNFSYSDLYARHLNDYQNLFHRVSLQLSKSSKSV---------MNRVS 310
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
+A RVKSF TDEDPSLVELLFQ+GRYLLIS SRPG+Q ANLQGIWN+D+ P WD APH+N
Sbjct: 311 TAARVKSFGTDEDPSLVELLFQYGRYLLISCSRPGSQPANLQGIWNKDIEPAWDGAPHLN 370
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
INL+MNYW SLPCNLSECQEPLFD+++ LSINGSKTA+VNY ASGWV H +DIWAK+S
Sbjct: 371 INLQMNYWPSLPCNLSECQEPLFDYMSSLSINGSKTAKVNYEASGWVTHQVSDIWAKTSP 430
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
DRG+ VWALWPMGGAWLCTHLWEHY +TMD+DFL+ +AYPLLEGCA FLLDWLIEG GY
Sbjct: 431 DRGQAVWALWPMGGAWLCTHLWEHYTFTMDKDFLKNKAYPLLEGCARFLLDWLIEGRGGY 490
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
LETNPSTSPEH FIAPDGK A VSYS+TMD+AIIREVFSA++SAAEVL KNED LV+KV
Sbjct: 491 LETNPSTSPEHMFIAPDGKPASVSYSTTMDIAIIREVFSAVVSAAEVLGKNEDELVQKVR 550
Query: 574 KSLPRLRPTKIAEDGSIMEWVQ 595
++ P+L PTKIA DGSIMEW Q
Sbjct: 551 QAQPKLPPTKIARDGSIMEWAQ 572
>gi|357146134|ref|XP_003573887.1| PREDICTED: alpha-L-fucosidase 2-like [Brachypodium distachyon]
Length = 857
Score = 735 bits (1898), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 362/613 (59%), Positives = 445/613 (72%), Gaps = 30/613 (4%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+ F PAK+FTDA PIGNGRLGAMVWGGV SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 38 SRPLKVVFASPAKYFTDAAPIGNGRLGAMVWGGVASERLQLNHDTLWTGGPGNYTNPNAP 97
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
LS VRSLV G YAEATA + L G +YQ LGDI+L F H+KY Y+R LD
Sbjct: 98 TVLSKVRSLVGKGLYAEATAVAYDLSGDQTQIYQPLGDIDLAFGQ-HIKYTN--YKRYLD 154
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L +AT V Y+VG V ++REHFSSNP QVI TK+S ++ G++SF VSL + LD+ +V
Sbjct: 155 LESATVNVTYTVGEVVYSREHFSSNPHQVIATKVSANKPGAVSFTVSLATPLDHRIHVTD 214
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N+IIMEG C G+R +A+DDP GI+F AIL ++IS GT+ L D LK++G+D
Sbjct: 215 TNEIIMEGCCAGERPVGDDSASDDPTGIKFCAILYLQISGANGTLQVLNDNMLKLDGADS 274
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
AVLLL A++SF+GPF+ PS+S +P + + + L R +SYS L H+DDYQ LF RVS
Sbjct: 275 AVLLLAAATSFEGPFVKPSESTLNPKTSAFTTLNMARTMSYSQLKAYHMDDYQSLFQRVS 334
Query: 310 IQLSR-----------------SPKDIVTDTCSEE----------NIDTVPSAERVKSFQ 342
+QLSR S +DI C E+ N P+ +R+ SF
Sbjct: 335 LQLSRGSDNVLRGNSLPNSPENSCQDIAVSHCVEQISDRSWLKELNNSDKPTVDRIISFV 394
Query: 343 TDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D P WD+APH NINL+MNYW
Sbjct: 395 DDEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTRPPWDAAPHPNINLQMNYWP 454
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+LPCNLSECQEPLFDF+ LSING+KTA+VNY ASGWV H TD+WAK+S D G +WAL
Sbjct: 455 ALPCNLSECQEPLFDFIESLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWAL 514
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WPMGG+WL THLWEHY++T+D FLEK AYPLLEG ASFLL WLIEG G LETNPSTSP
Sbjct: 515 WPMGGSWLATHLWEHYSFTLDTQFLEKTAYPLLEGSASFLLSWLIEGQGGQLETNPSTSP 574
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
EH FIAPDGK ACVSYS+TMDM++IREVFSA++ +A++L K+ +V+++ K+LPRL P
Sbjct: 575 EHYFIAPDGKKACVSYSTTMDMSVIREVFSAVLLSADILGKSGTDVVQRIKKALPRLPPI 634
Query: 583 KIAEDGSIMEWVQ 595
KIA D +IMEW +
Sbjct: 635 KIARDITIMEWAR 647
>gi|326518094|dbj|BAK07299.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 832
Score = 731 bits (1887), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/588 (59%), Positives = 451/588 (76%), Gaps = 7/588 (1%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F+ PA++FTDA PIGNG LGAMVWGGV S+ L+LN DTLWTGVPG+YT+P AP
Sbjct: 34 PLKVAFSSPAEYFTDAAPIGNGSLGAMVWGGVSSDKLQLNHDTLWTGVPGNYTDPKAPGV 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L++VR LVD G++A+ATA++ LFG ++VYQ LG++ +EF S Y ++Y+RELDL+
Sbjct: 94 LAEVRGLVDQGRFADATASAKGLFGGLSEVYQPLGELNIEFSTSEQVY--DSYKRELDLH 151
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TATA V Y++G V++TREHF SNP Q IVT+ S S G +S +SL S L++ V N
Sbjct: 152 TATALVTYNIGGVQYTREHFCSNPHQAIVTRFSASTPGHVSCTLSLSSQLNHSVTVINEN 211
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++IMEG CPG+R + N D+ GI+F+A L +++ + L D+KL+++ +DW V
Sbjct: 212 EMIMEGICPGQRPGMRENGGDNVTGIRFTAALGLQMGGSAAKSTVLNDQKLRLDSADWVV 271
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
++ A+SSF GP +NP+DSK DPTS ++S L RN ++ L HLDDYQ LF+RV++Q
Sbjct: 272 FVVAAASSFYGPHVNPADSKLDPTSLALSMLNHSRNFTFDQLKAAHLDDYQSLFNRVTLQ 331
Query: 312 LSRSPKDI---VTDTCSEENI--DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
LS+ D VT T +E + D SA+RVKSF +DEDPSLVELLFQ+GRYLLIS SR
Sbjct: 332 LSQGSNDACTSVTRTDIQEQVAEDIRTSADRVKSFSSDEDPSLVELLFQYGRYLLISCSR 391
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQV+NLQGIW++D++P WD+APH+NINL+MNYW +LPCNLSECQEPLFDFL L++NG
Sbjct: 392 PGTQVSNLQGIWSQDIAPEWDAAPHLNINLQMNYWPALPCNLSECQEPLFDFLGSLAVNG 451
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+KTA+VNY A GWV HH +DIWAKSSA A+WPMGGAWLCTHLWEHY +++D+DF
Sbjct: 452 TKTAKVNYQAGGWVTHHVSDIWAKSSAFLKNPKHAVWPMGGAWLCTHLWEHYQFSLDKDF 511
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
LE AYPLLEGCA+FL+DWLIEG GYLETNPSTSPEH F+APDGK A VSYS+TMD++I
Sbjct: 512 LENTAYPLLEGCANFLVDWLIEGPGGYLETNPSTSPEHAFVAPDGKPASVSYSTTMDVSI 571
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
IREVF A++S+AE+L K + LVE++ K+LPRL P +IA D ++MEW
Sbjct: 572 IREVFLAVLSSAELLGKADIDLVERIKKALPRLPPIQIARDRTVMEWA 619
>gi|326508462|dbj|BAJ95753.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 857
Score = 728 bits (1878), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 357/611 (58%), Positives = 444/611 (72%), Gaps = 30/611 (4%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PA++FTDA PIGNGRLGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 40 PLKVVFASPARYFTDAAPIGNGRLGALVWGGVTSEKLQLNHDTLWTGGPGNYTNPKAPTV 99
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS+VRSLVD G Y EATA + L G YQ LGDI+L F + H+KY Y R LDL
Sbjct: 100 LSEVRSLVDKGLYPEATAVAYGLSGDETQSYQPLGDIDLAFGE-HIKYTN--YTRYLDLE 156
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
+AT V YSVG V ++REHFSSNP QVI TKIS ++ G++S VSL + LD+ V N
Sbjct: 157 SATVNVTYSVGEVVYSREHFSSNPHQVIATKISANKPGAVSCTVSLATPLDHRIRVTDAN 216
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG++ NA+D P G++F AIL + +S G + L DK LK++G+D AV
Sbjct: 217 EIIMEGSCPGEKPAGDGNASDHPPGMRFCAILYLLMSGANGQVQVLNDKMLKLDGADSAV 276
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF+GPF+ P++S DP + + + L R++SY+ L H+DDYQ LF RVS+Q
Sbjct: 277 LLLAAATSFEGPFVKPTESTLDPVASAFTTLNMARSMSYAQLKAYHMDDYQSLFQRVSLQ 336
Query: 312 LSRS-------------PKDIVTDT----CSEENIDTV----------PSAERVKSFQTD 344
LSRS P++I DT C+ + +D P+ +R+ SF+ D
Sbjct: 337 LSRSSNDVLGGSTLARLPENISQDTAVSDCTVQMVDCSRLNELNNSEKPTVDRIISFRHD 396
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
EDPSLVELLFQFGRYLLIS SRPGTQV+NLQGIWN + + W +APH NINL+MNYW SL
Sbjct: 397 EDPSLVELLFQFGRYLLISCSRPGTQVSNLQGIWNNETNAPWGAAPHPNINLQMNYWPSL 456
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
PCNLSECQ+PLFDF+ LS+NG+KTA+VNY SGWV H TD+WAK+S D G WALWP
Sbjct: 457 PCNLSECQDPLFDFIGSLSVNGAKTAKVNYGVSGWVSHQVTDLWAKTSPDAGDPSWALWP 516
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
MGG WL THLWEHY++TMDR+FLE+ AYPLLEG ASFLL WLIEG +GYLETNPSTSPEH
Sbjct: 517 MGGPWLATHLWEHYSFTMDREFLERTAYPLLEGSASFLLSWLIEGQEGYLETNPSTSPEH 576
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
FIAPDGK A VSYS+TMDM+IIREVFSA++ +A++L K+ +V+++ +LPRL P KI
Sbjct: 577 YFIAPDGKRASVSYSTTMDMSIIREVFSAVLLSADILGKSSTDVVQRIKAALPRLPPIKI 636
Query: 585 AEDGSIMEWVQ 595
DG+IMEW +
Sbjct: 637 GRDGTIMEWAR 647
>gi|357116946|ref|XP_003560237.1| PREDICTED: alpha-L-fucosidase 2-like [Brachypodium distachyon]
Length = 818
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/586 (58%), Positives = 436/586 (74%), Gaps = 5/586 (0%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PA+HFTDA PIGNG LGAMVWGGV SE L+LN DTLWTGVPG+YT+P P A
Sbjct: 20 PLKVVFASPAEHFTDAAPIGNGSLGAMVWGGVASEKLQLNLDTLWTGVPGNYTDPSVPSA 79
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
++ VR LV Q+ +AT A+ L+G P +VYQ LGD+ +EF S Y+ +Y+RELDL+
Sbjct: 80 VAVVRKLVHDRQFVDATNAASGLYGGPTEVYQPLGDVNIEFGTSSQDYS--SYKRELDLH 137
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT V Y++G V++TREHF SNP QVIVTK+S ++SG +S +SLDS L + V N
Sbjct: 138 TATVLVTYNIGEVQYTREHFCSNPHQVIVTKLSANKSGHISCTLSLDSKLTHSVRVTNAN 197
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++IM+G CPG+R + N +D GI+F+A+L +++ L D L+++ +DW +
Sbjct: 198 EMIMDGTCPGQRHVLQQNETNDATGIKFTAVLSLQMGGAMAKAEVLNDHNLRIDNADWVL 257
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LL+ A+SSF GPFINPS+SK DP S ++ L RN+++ L HL DYQ LFHRVS+
Sbjct: 258 LLVTAASSFSGPFINPSNSKIDPESVALRNLNMSRNVTFDQLKAAHLKDYQGLFHRVSLI 317
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS +P I +E +AERV SF+++EDPSLVELLFQ+GRYLLIS SRPGTQV
Sbjct: 318 LSHAPA-IEKTNLNETGEAIKITAERVNSFRSNEDPSLVELLFQYGRYLLISCSRPGTQV 376
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
+NLQGIWN+DLSP W SAPH+NINL+MNYW +LPCNL ECQEPL DF+ L++NG+KTA+
Sbjct: 377 SNLQGIWNQDLSPAWQSAPHLNINLQMNYWPTLPCNLGECQEPLIDFIAALAVNGTKTAK 436
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
+NY SGWV HH +DIWAKSSA +A+WPMGGAWLCTHLWEHY Y++D++FL+ A
Sbjct: 437 INYQTSGWVTHHVSDIWAKSSAFNEDAKYAVWPMGGAWLCTHLWEHYQYSLDKEFLKNTA 496
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD--GKLACVSYSSTMDMAIIRE 549
YPLLEGCA FL DWL EG +GYLETNPS SPEH FIAPD G+ A VSYS+TMD++IIRE
Sbjct: 497 YPLLEGCALFLADWLTEGRNGYLETNPSISPEHSFIAPDSGGQQASVSYSTTMDVSIIRE 556
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+F AIIS+AEVL K++ LV K+ K+L RL P IA+D +IMEW Q
Sbjct: 557 IFMAIISSAEVLGKSDSTLVPKIKKALSRLTPIMIAKDHTIMEWAQ 602
>gi|414868290|tpg|DAA46847.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
Length = 727
Score = 717 bits (1850), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 363/611 (59%), Positives = 444/611 (72%), Gaps = 30/611 (4%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 41 PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS VRSLV++G+Y EAT+A+ L G V+Q LGDI+L F + +KY YRRELDL+
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSGDQTQVFQPLGDIDLVFGED-IKYTN--YRRELDLH 157
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT V Y+VG++ +TREHFSSNP QVIVTKIS ++ G++SF VSL S LD+ V N
Sbjct: 158 TATVTVTYTVGDIVYTREHFSSNPHQVIVTKISANKPGNVSFTVSLTSPLDHKIRVTHAN 217
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG+R A D P GI+FSAIL ++I+ T+ L D LK++ +D V
Sbjct: 218 EIIMEGSCPGQRPEEIKTAADQPIGIKFSAILYLQINGANSTVEVLNDNMLKLDCADSVV 277
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF FI PS+SK DPT + + L R SYS L H+DDYQ LF RVS+Q
Sbjct: 278 LLLAATTSFQSAFIKPSESKLDPTVSAFTTLSIARRTSYSQLKAYHIDDYQTLFQRVSLQ 337
Query: 312 LS-------RSPKDIVTDTCSEENIDTV--------------------PSAERVKSFQTD 344
LS R + + + S + + P+ ER+ +F+ +
Sbjct: 338 LSQGSNYDLRRSRLVQSAETSSQGANVSDYGFQISGCTRLTSLNSFVKPTVERIVTFKDN 397
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
EDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+APH NINL+MNYW +L
Sbjct: 398 EDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDAAPHPNINLQMNYWPAL 457
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
PCNLSECQEPLFDF+ LSING+KTA+VNY ASGWV H TD+WAK+S D G VWALWP
Sbjct: 458 PCNLSECQEPLFDFIGSLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPVWALWP 517
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
MGG WL THLWEHY +T+D+ FLEK AYPLLEG A FLLDWLIEGH GYLETNPSTSPEH
Sbjct: 518 MGGPWLATHLWEHYCFTLDKHFLEKTAYPLLEGSARFLLDWLIEGHRGYLETNPSTSPEH 577
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
FIAPDGK ACVSYS+TMD++IIREVFSA+I +A++L K++ +V+++ K+LP L P K+
Sbjct: 578 YFIAPDGKEACVSYSTTMDISIIREVFSALILSADILGKSDTNVVQRIKKALPNLPPMKV 637
Query: 585 AEDGSIMEWVQ 595
A DG+IMEW Q
Sbjct: 638 ARDGTIMEWAQ 648
>gi|326493958|dbj|BAJ85441.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 636
Score = 711 bits (1834), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/597 (58%), Positives = 434/597 (72%), Gaps = 30/597 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PA++FTDA PIGNGRLGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 40 PLKVVFASPARYFTDAAPIGNGRLGALVWGGVTSEKLQLNHDTLWTGGPGNYTNPKAPTV 99
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS+VRSLVD G Y EATA + L G YQ LGDI+L F + H+KY Y R LDL
Sbjct: 100 LSEVRSLVDKGLYPEATAVAYGLSGDETQSYQPLGDIDLAFGE-HIKYTN--YTRYLDLE 156
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
+AT V YSVG V ++REHFSSNP QVI TKIS ++ G++S VSL + LD+ V N
Sbjct: 157 SATVNVTYSVGEVVYSREHFSSNPHQVIATKISANKPGAVSCTVSLATPLDHRIRVTDAN 216
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG++ NA+D P G++F AIL + +S G + L DK LK++G+D AV
Sbjct: 217 EIIMEGSCPGEKPAGDGNASDHPPGMRFCAILYLLMSGANGQVQVLNDKMLKLDGADSAV 276
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF+GPF+ P++S DP + + + L R++SY+ L H+DDYQ LF RVS+Q
Sbjct: 277 LLLAAATSFEGPFVKPTESTLDPVASAFTTLNMARSMSYAQLKAYHMDDYQSLFQRVSLQ 336
Query: 312 LSRS-------------PKDIVTDT----CSEENIDTV----------PSAERVKSFQTD 344
LSRS P++I DT C+ + +D P+ +R+ SF+ D
Sbjct: 337 LSRSSNDVLGGSTLARLPENISQDTAVSDCTVQMVDCSRLNELNNSEKPTVDRIISFRHD 396
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
EDPSLVELLFQFGRYLLIS SRPGTQV+NLQGIWN + + W +APH NINL+MNYW SL
Sbjct: 397 EDPSLVELLFQFGRYLLISCSRPGTQVSNLQGIWNNETNAPWGAAPHPNINLQMNYWPSL 456
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
PCNLSECQ+PLFDF+ LS+NG+KTA+VNY SGWV H TD+WAK+S D G WALWP
Sbjct: 457 PCNLSECQDPLFDFIGSLSVNGAKTAKVNYGVSGWVSHQVTDLWAKTSPDAGDPSWALWP 516
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
MGG WL THLWEHY++TMDR+FLE+ AYPLLEG ASFLL WLIEG +GYLETNPSTSPEH
Sbjct: 517 MGGPWLATHLWEHYSFTMDREFLERTAYPLLEGSASFLLSWLIEGQEGYLETNPSTSPEH 576
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
FIAPDGK A VSYS+TMDM+IIREVFSA++ +A++L K+ +V+++ +LPRL P
Sbjct: 577 YFIAPDGKRASVSYSTTMDMSIIREVFSAVLLSADILGKSSTDVVQRIKAALPRLPP 633
>gi|326513306|dbj|BAK06893.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 815
Score = 710 bits (1832), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/594 (57%), Positives = 440/594 (74%), Gaps = 7/594 (1%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+ PLK+ F PA+HFTDA PIGNG LGAMVWGGV S+ L+LN DTLWTGVPGDY
Sbjct: 12 ADEAEEERPLKVVFASPAEHFTDAAPIGNGSLGAMVWGGVASDKLQLNLDTLWTGVPGDY 71
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
T+P AP AL+ VR LVD G++ +AT+A+ LFG +VYQ LGD+ LEFD S+ +Y+ +
Sbjct: 72 TDPKAPAALAAVRKLVDDGRFVDATSAASGLFGGQTEVYQPLGDMNLEFDISNQEYS--S 129
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y+RELDL+TAT + Y++G V+ TREHF SNP QVIVTKIS ++S +S +SL+S L++
Sbjct: 130 YKRELDLHTATTVITYNIGEVQHTREHFCSNPHQVIVTKISANKSEHVSLTLSLNSKLNH 189
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
V N++IMEG CP R+ N D GI F+A+L +++S + L D+KL+
Sbjct: 190 RVRVMNANEMIMEGSCPVHRL--HENEASDASGIGFAAVLSLQMSGAAAKVVVLNDQKLR 247
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
++ +DW +L + A+SSF+GP +NPSDSK DP S ++ A+ RNL++ L HL DYQ
Sbjct: 248 IDNADWVLLRVTAASSFNGPSVNPSDSKLDPESAALRAMNMSRNLTFDQLKASHLKDYQG 307
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LFHRVS++LS+SP I E +AERV F++DED SLVELLFQ+GRYLLIS
Sbjct: 308 LFHRVSLRLSQSPA-IEKINMKEVGEAIKTTAERVNGFRSDEDSSLVELLFQYGRYLLIS 366
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SRPGTQ++NLQGIWN+DL P W+ APH+NINL+MNYW +LPCNL ECQEPL DF+ L+
Sbjct: 367 CSRPGTQISNLQGIWNQDLLPQWECAPHLNINLQMNYWPTLPCNLIECQEPLLDFIASLA 426
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+NG+KTA++NY ASGWV HH TDIWAKSSA +++WPMGGAWLCTHLWEHY Y +D
Sbjct: 427 VNGTKTAKINYQASGWVTHHVTDIWAKSSAFNEDAKYSVWPMGGAWLCTHLWEHYQYLLD 486
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG--KLACVSYSST 541
+DFL+ AYPLLEGCA FL DWLIEG G LETNPSTSPEH FIAP A VSYS+T
Sbjct: 487 KDFLKNTAYPLLEGCALFLTDWLIEGPRGLLETNPSTSPEHAFIAPGSGDHQASVSYSTT 546
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD+AIIRE+FSA+IS+AE+L K++ LV+K+ ++LPRL IA+D +++EW Q
Sbjct: 547 MDIAIIREIFSAVISSAEILGKSDTPLVQKIKEALPRLPQNTIAKDQTLVEWAQ 600
>gi|242047972|ref|XP_002461732.1| hypothetical protein SORBIDRAFT_02g007180 [Sorghum bicolor]
gi|241925109|gb|EER98253.1| hypothetical protein SORBIDRAFT_02g007180 [Sorghum bicolor]
Length = 864
Score = 686 bits (1769), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/618 (56%), Positives = 446/618 (72%), Gaps = 37/618 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL + F PA++FTDA PIGNG LG MVWGGV ++ L+LN DTLWTG PG YT+PDAP A
Sbjct: 47 PLTVVFASPAENFTDAAPIGNGSLGGMVWGGVATDKLQLNHDTLWTGAPGSYTDPDAPAA 106
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF--DDSHLKYAEETYRRELD 129
L+ VR LVD G++A+ATAA+ +LFG ++VYQ +GD+ LE S + A ++Y+RELD
Sbjct: 107 LAAVRELVDQGRFADATAAATRLFGGQSEVYQPMGDVNLELGGSGSDQQPAYDSYKRELD 166
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TAT V YSVG V++TREHF SNP QVI+T+I+ SE G +S +SL S L N V
Sbjct: 167 LHTATVLVTYSVGPVQYTREHFCSNPHQVIITRIAASEPGHVSCTLSLSSQLKNTVTVTN 226
Query: 190 NNQIIMEGRCPG-------------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
NQ++MEG CP + + GI+F+A+L +++ D+ +
Sbjct: 227 ANQVVMEGVCPRQRPPAPPRLMLLRNSSSGDDDDDLTTGGIKFAAVLGVQMGGDKAKAAV 286
Query: 237 LEDK-KLKVEGSDWAVLLLVASSSFDGPFINPSDSK-KDPTSESMSALQSIRNLSYSDLY 294
L D+ KL +E +DW VL++ ASSSFDGPF++PSDS+ DPTS +++ L +L+Y L
Sbjct: 287 LNDENKLSLESADWIVLIVAASSSFDGPFVSPSDSRLDDPTSAAVATLNRATSLTYEQLK 346
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTD-------------------TCSEENIDTVPSA 335
HLDDYQ+LFHRV+++LS ++ D +E I SA
Sbjct: 347 AAHLDDYQRLFHRVTLRLSPPGGGLLEDARGGGLMMTGGKETMLKRGVGGDEGIIRT-SA 405
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
+RVKSF TDEDPSLVELLFQ+GRYLLIS SRPGTQV+NLQGIWN++++P WD+APH+NIN
Sbjct: 406 DRVKSFATDEDPSLVELLFQYGRYLLISCSRPGTQVSNLQGIWNQEVAPAWDAAPHLNIN 465
Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADR 455
L+MNYW +LPCNLSECQEPLFDFL L++NG+KTA+VNY A GWV HH +DIWAKSSA
Sbjct: 466 LQMNYWPTLPCNLSECQEPLFDFLQSLAVNGTKTAKVNYQARGWVTHHVSDIWAKSSAFI 525
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
A+WPMGGAWLCTHLWEHY Y++D+DFLE AYPLLEGCA+FL+DWLIEG G+L+
Sbjct: 526 KNPKHAVWPMGGAWLCTHLWEHYQYSLDKDFLEYTAYPLLEGCATFLVDWLIEGPGGFLQ 585
Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
TNPSTSPEH F APDGK A VSYS+TMD++IIREV SA++ +AE+LEK++ LVEK+ K+
Sbjct: 586 TNPSTSPEHAFTAPDGKPASVSYSTTMDISIIREVSSAVLLSAEILEKSDTDLVEKIKKA 645
Query: 576 LPRLRPTKIAEDGSIMEW 593
LPRL P + A D +IMEW
Sbjct: 646 LPRLPPIQFARDNTIMEW 663
>gi|110288917|gb|ABG66023.1| large secreted protein, putative, expressed [Oryza sativa Japonica
Group]
Length = 708
Score = 627 bits (1617), Expect = e-177, Method: Compositional matrix adjust.
Identities = 292/497 (58%), Positives = 386/497 (77%), Gaps = 5/497 (1%)
Query: 101 VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIV 160
VYQ LGDI LEFD S L Y +Y+RELDL TAT + Y++G V+++REHF SNP QV
Sbjct: 3 VYQPLGDINLEFDSSSLGYT--SYKRELDLRTATVCISYNIGEVQYSREHFCSNPHQVFA 60
Query: 161 TKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFS 220
TKIS ++SG +SF +SL+S L+++ + N++IM+G CPG+R N +D GI+F+
Sbjct: 61 TKISANKSGHVSFTLSLNSQLNHNVRITNANEMIMQGTCPGRRPALHHNGANDAIGIKFA 120
Query: 221 AILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
+ ++I ++ ++D+KL+++ +DW VLL+ A+SSFDGPF+NPS+SK +P +++
Sbjct: 121 TAVGLQIGGTSAKVTIIDDQKLRIDAADWVVLLVAAASSFDGPFVNPSESKLNPEVAALN 180
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
L RN ++S L HL+DYQ LFHRV++QLS++ + D E + D +AER+ S
Sbjct: 181 TLNISRNATFSQLKAAHLEDYQGLFHRVTLQLSQASM-LEKDILEEVDHDVKTTAERINS 239
Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
F++DEDPSLVELLFQ+GRYLLISSSRPGTQV+NLQGIWN+D +P W+++PH+NINLEMNY
Sbjct: 240 FRSDEDPSLVELLFQYGRYLLISSSRPGTQVSNLQGIWNQDFAPAWEASPHLNINLEMNY 299
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W +LPCNL+ECQEPLFD + L++NG+KTA+VNY ASGWV HH TDIWAKSSA ++
Sbjct: 300 WPTLPCNLTECQEPLFDLIGSLAVNGTKTAKVNYQASGWVTHHVTDIWAKSSAYYVDAMY 359
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
ALWPMGGAWLCTHLWE+Y Y++D++FLEKRAYPLLEGCA FL+DWLI+G YLETNPST
Sbjct: 360 ALWPMGGAWLCTHLWENYQYSLDKEFLEKRAYPLLEGCAMFLIDWLIKGPGDYLETNPST 419
Query: 521 SPEHEFIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
SPEH FIAP G LA VSYS+TMD++IIREVF A+IS+AEVL K++ LVE++ K+LP
Sbjct: 420 SPEHPFIAPGTGGHLASVSYSTTMDISIIREVFLAVISSAEVLGKSDTNLVERIKKALPM 479
Query: 579 LRPTKIAEDGSIMEWVQ 595
L P KI++DG+IMEW Q
Sbjct: 480 LPPVKISKDGTIMEWAQ 496
>gi|302773137|ref|XP_002969986.1| hypothetical protein SELMODRAFT_171005 [Selaginella moellendorffii]
gi|300162497|gb|EFJ29110.1| hypothetical protein SELMODRAFT_171005 [Selaginella moellendorffii]
Length = 791
Score = 619 bits (1596), Expect = e-174, Method: Compositional matrix adjust.
Identities = 296/588 (50%), Positives = 411/588 (69%), Gaps = 10/588 (1%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + F PA+++ +A+P+GNGRLGAMV+GG S+ ++LNEDTLW+G P D+ NP+A + L
Sbjct: 5 LSVDFFDPARYWVEALPVGNGRLGAMVFGGSSSDLIQLNEDTLWSGGPRDWNNPNAVQVL 64
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR LV +YAEA+ S ++ G +VYQ LGDI+L+F SH Y ++Y R+LDLNT
Sbjct: 65 PKVRQLVWDEKYAEASDLSKEMLGPYTEVYQPLGDIKLDFGASHATYDAQSYHRQLDLNT 124
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A V Y+VG + +TRE F+S P QVIV +I+ S++G++SF+ +LDS L ++YV +N
Sbjct: 125 ALVSVSYAVGGINYTREVFASYPHQVIVIRITSSKAGAVSFSATLDSPLQTNAYVKDSNF 184
Query: 193 IIMEGRCPGKRIPPKANA----NDDPKGIQFSAILEIKISDDRGT-ISALEDKKLKVEGS 247
I+++G+CP P ++ +D G+ F+A++E++ S G+ I+ L ++++VE
Sbjct: 185 IVVQGQCPLHVEEPTLSSPRCESDQKTGMSFAAVMEVRTSSGAGSVITKLGIQQVRVENV 244
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
DWA+L+L ASSSFDGPF +P+ + KDP + S++ L+ + LSY LY HL DYQ LFHR
Sbjct: 245 DWAMLVLAASSSFDGPFKDPTSTGKDPVAASLATLKLVEALSYKKLYAAHLKDYQALFHR 304
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
VS+Q+++ ++ + + + ER+++F ++EDP++V LLFQFGRYLLISSSRP
Sbjct: 305 VSLQINKKSRENSVVSSTSMSTQ-----ERIQAFASNEDPAMVVLLFQFGRYLLISSSRP 359
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT VANLQGIWN+DL P W PH+NINLEMNYW + CNL+EC EPLFDF++ ++INGS
Sbjct: 360 GTFVANLQGIWNKDLKPAWRCVPHLNINLEMNYWPAEVCNLAECHEPLFDFVSSMAINGS 419
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+VNY GWV HH DIW +++ G V+AL+PMGGAWLC HLWEHY +++D +FL
Sbjct: 420 HTAKVNYNMRGWVTHHNADIWVQTAPIGGDPVYALFPMGGAWLCLHLWEHYRFSLDMEFL 479
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+AYPLL GCA FL DWL + G L TNPSTSPEH FIAPDGK A VSY+S MDMAII
Sbjct: 480 RSKAYPLLTGCAQFLFDWLTGDNHGMLVTNPSTSPEHVFIAPDGKEASVSYASAMDMAII 539
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
R VF A SAA +L++ + + L P +I+ G +MEW +
Sbjct: 540 RAVFDATSSAATILQEPNSQFTANLKHATENLFPPEISSSGLLMEWAK 587
>gi|302799394|ref|XP_002981456.1| hypothetical protein SELMODRAFT_178891 [Selaginella moellendorffii]
gi|300150996|gb|EFJ17644.1| hypothetical protein SELMODRAFT_178891 [Selaginella moellendorffii]
Length = 788
Score = 616 bits (1588), Expect = e-173, Method: Compositional matrix adjust.
Identities = 300/589 (50%), Positives = 413/589 (70%), Gaps = 15/589 (2%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + F PA+++ +A+P+GNGRLGAMV+GG S+ ++LN DTLW+G P D+ NP+A + L
Sbjct: 5 LSVDFFDPARYWVEALPVGNGRLGAMVFGGSSSDLIQLN-DTLWSGGPRDWNNPNAVQVL 63
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR LV +YAEA+ S ++ G +VYQ LGDI+L+F SH Y ++Y R+LDLN
Sbjct: 64 PKVRQLVWDEKYAEASDLSKQMLGPYTEVYQPLGDIKLDFGTSHATYDAQSYHRQLDLNA 123
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A V+Y++G V +TRE F+S P QVIV +IS S++G++SF+ +LDS L ++YV +N
Sbjct: 124 ALVSVRYAIGGVNYTREVFASYPHQVIVIRISSSKAGAVSFSATLDSPLQTNAYVKDSNF 183
Query: 193 IIMEGRCPGKRIPPKANA----NDDPKGIQFSAILEIKISDDRGT-ISALEDKKLKVEGS 247
I+++G+CP P ++ +D G+ F+A++E++ S G+ I+ L ++++VE
Sbjct: 184 IVVQGQCPLHVEEPTLSSPRCESDQKTGMSFAAVMEVRTSSGAGSVITKLGIQQVRVENV 243
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
DWA+L+L ASSSFDGPF NP+ KDP + S++ L+S+ LSY LY HL DYQ LFHR
Sbjct: 244 DWAMLVLAASSSFDGPFKNPTG--KDPVAASLATLKSVEALSYEKLYATHLKDYQALFHR 301
Query: 308 VSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
VS+++++ S ++ V T S + + ER+++F ++EDP++V LLFQFGRYLLISSSR
Sbjct: 302 VSLRINKKSGENSVASTTS------MSTQERIQAFASNEDPAMVSLLFQFGRYLLISSSR 355
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGT VANLQGIWN+DL P W PH+NINLEMNYW + CNL+EC EPLFDF++ ++ING
Sbjct: 356 PGTFVANLQGIWNKDLKPAWRCVPHLNINLEMNYWPAEVCNLAECHEPLFDFVSSMAING 415
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
S TA+VNY GWV HH DIW +++ G V+AL+PMGGAWLC HLWEHY +++D +F
Sbjct: 416 SHTAKVNYNMRGWVTHHNADIWVQTAPIGGDPVYALFPMGGAWLCLHLWEHYRFSLDMEF 475
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L +AYPLL GCA FL DWL + G L TNPSTSPEH FIAPDGK A VSY+S MDMAI
Sbjct: 476 LRSKAYPLLTGCAQFLFDWLTGDNHGMLVTNPSTSPEHVFIAPDGKQASVSYASAMDMAI 535
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
IR VF A SAA +L++ + + L P +I+ G +MEW +
Sbjct: 536 IRSVFDATSSAAAILQEPNSQFTANLKHATENLFPPEISSSGLLMEWAK 584
>gi|168043560|ref|XP_001774252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674379|gb|EDQ60888.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 818
Score = 584 bits (1506), Expect = e-164, Method: Compositional matrix adjust.
Identities = 287/590 (48%), Positives = 390/590 (66%), Gaps = 32/590 (5%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH 97
MV GGV SE ++LNEDTLW+G P D+ NP A + L VR LV G+YAEAT + K+ G
Sbjct: 1 MVHGGVKSELVQLNEDTLWSGGPTDWNNPKALETLPRVRELVKEGKYAEATTEAQKMLGP 60
Query: 98 PADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
+VYQ LGD++LEFDDSH Y +E+YRR+LDL+TA V Y +G+V + R+ F+S P Q
Sbjct: 61 DPEVYQPLGDLKLEFDDSHNTYDKESYRRQLDLDTAMTYVNYEIGDVSYLRQAFTSYPHQ 120
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP--GKRIPPKANANDDPK 215
V +I+GS+SGS+SF+V+LDS L V G+ I ++G+CP ++ A+ K
Sbjct: 121 VFAMRIAGSKSGSVSFSVTLDSQLMLGKEVVGSKYIALKGQCPIDSNKVTEVASPTRSSK 180
Query: 216 --GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
G++F A+L++++S + G + ++ + LKV +DWAVL L ASSSFDGPF +PS S +
Sbjct: 181 KQGMEFVAVLQVEVSGEAGRLQVVDKQTLKVHQADWAVLYLTASSSFDGPFKDPSISGIE 240
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD-----------IVTD 322
PTS + +AL ++ +LS+ D+ HL DYQ LFHRVS+ + KD IV
Sbjct: 241 PTSLAFAALANLVDLSFDDILAAHLADYQTLFHRVSLHVDNEEKDLGLWELIVPSEIVES 300
Query: 323 TCSEENI-----------------DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
E + + + +R+ +F DEDP LV LLFQFGRYLLI+SS
Sbjct: 301 KTVESGAQVSTGVDGEVYPQNAWKERISTRDRILNFDGDEDPDLVVLLFQFGRYLLIASS 360
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RP + V+NLQG+W+ L P W P +NINLEMNYW + C+L+EC PLFDFL +++
Sbjct: 361 RPNSFVSNLQGVWSNSLHPAWRCCPTLNINLEMNYWPAETCSLAECHLPLFDFLEQIAVT 420
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G+ TA+VNY GWV HH DIWA S+ G VWALWPM GAW+C HLWEHY ++ D +
Sbjct: 421 GATTAKVNYGLGGWVSHHNADIWAHSAPVSGDPVWALWPMSGAWICLHLWEHYTFSQDEE 480
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL RAYPL +GCA F ++WL+E G+L TNPSTSPEH FIAPDG+ ACVSY STMDMA
Sbjct: 481 FLRNRAYPLFKGCAEFFVNWLVEDGKGHLVTNPSTSPEHHFIAPDGQSACVSYGSTMDMA 540
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+ F+A++SAA+++ ++E LV +V ++ RL P KI DG ++EWV+
Sbjct: 541 ILHNFFNAVVSAAKIVGQDEAELVSEVKSAVGRLLPAKIGSDGRLLEWVE 590
>gi|414868291|tpg|DAA46848.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
Length = 567
Score = 553 bits (1424), Expect = e-154, Method: Compositional matrix adjust.
Identities = 285/500 (57%), Positives = 350/500 (70%), Gaps = 30/500 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 41 PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS VRSLV++G+Y EAT+A+ L G V+Q LGDI+L F + +KY YRRELDL+
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSGDQTQVFQPLGDIDLVFGED-IKYTN--YRRELDLH 157
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT V Y+VG++ +TREHFSSNP QVIVTKIS ++ G++SF VSL S LD+ V N
Sbjct: 158 TATVTVTYTVGDIVYTREHFSSNPHQVIVTKISANKPGNVSFTVSLTSPLDHKIRVTHAN 217
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG+R A D P GI+FSAIL ++I+ T+ L D LK++ +D V
Sbjct: 218 EIIMEGSCPGQRPEEIKTAADQPIGIKFSAILYLQINGANSTVEVLNDNMLKLDCADSVV 277
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF FI PS+SK DPT + + L R SYS L H+DDYQ LF RVS+Q
Sbjct: 278 LLLAATTSFQSAFIKPSESKLDPTVSAFTTLSIARRTSYSQLKAYHIDDYQTLFQRVSLQ 337
Query: 312 LS-------RSPKDIVTDTCSEENIDTV--------------------PSAERVKSFQTD 344
LS R + + + S + + P+ ER+ +F+ +
Sbjct: 338 LSQGSNYDLRRSRLVQSAETSSQGANVSDYGFQISGCTRLTSLNSFVKPTVERIVTFKDN 397
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
EDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+APH NINL+MNYW +L
Sbjct: 398 EDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDAAPHPNINLQMNYWPAL 457
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
PCNLSECQEPLFDF+ LSING+KTA+VNY ASGWV H TD+WAK+S D G VWALWP
Sbjct: 458 PCNLSECQEPLFDFIGSLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPVWALWP 517
Query: 465 MGGAWLCTHLWEHYNYTMDR 484
MGG WL THLWEHY +T+D+
Sbjct: 518 MGGPWLATHLWEHYCFTLDK 537
>gi|337750325|ref|YP_004644487.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|336301514|gb|AEI44617.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
Length = 831
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 247/596 (41%), Positives = 360/596 (60%), Gaps = 38/596 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+T++ PA+ +T+A+P GNGRLGAMV+GGV E L+LNEDTLW+G PGD+ NP A + L
Sbjct: 31 MKLTYDKPARVWTEALPAGNGRLGAMVFGGVEHELLQLNEDTLWSGAPGDHNNPRAREVL 90
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR L G+Y EA ++ G Y LGD+ L F H +A + Y R LD+
Sbjct: 91 PEVRRLALEGRYREADRLCKEMLGPYTQSYLPLGDLSLRF--HHGDHAGD-YERHLDVEG 147
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
+ R Y +G V +TRE F S+PDQV+V +++ G+LSF LDS L + + + +
Sbjct: 148 SILRTSYRIGAVTYTRELFVSHPDQVLVLRLTADRPGALSFTAKLDSALKHRTAADAGD- 206
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
++++GR P K + P D+P G++F A L ++ G ++ L
Sbjct: 207 LVLKGRAPAK-VDPNYYRTDEPVRYAEGDAGGGMRFEARLRVQAD---GAELQVDGGALH 262
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE + LLL A++SF+G P++ +D + + L++ L+Y +L RH DDY+
Sbjct: 263 VERATEVTLLLTAATSFNGYDKQPAEQGRDESRAAADDLRAASGLTYDELLQRHQDDYRA 322
Query: 304 LFHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF RV++ L SR+P+ + TD R+ + DP L ELLF +GRYLL
Sbjct: 323 LFGRVTLSLGASRAPEGMPTD-------------RRIAEYGAS-DPGLAELLFHYGRYLL 368
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR GTQ ANLQGIWN+++ W S +NIN +MNYW + CNLSEC EPL F+
Sbjct: 369 ISSSREGTQPANLQGIWNKEVRAPWSSNYTLNINAQMNYWPAETCNLSECHEPLLGFIGR 428
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L++NG+KT VNY GW HH +DIWA+S+ G VWA WPM GAWL HLWEH
Sbjct: 429 LAVNGAKTVSVNYGLRGWTAHHNSDIWAQSAPVGAYGHGDPVWAYWPMAGAWLSAHLWEH 488
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + + D+L ++AYP+++ A F LDWL+E DG+L ++PSTSPEH F+ +G+LA V+
Sbjct: 489 YAFCREEDYLREQAYPVMKEAALFCLDWLVEDADGFLVSSPSTSPEHRFVTAEGELAAVT 548
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++TMD+A++ ++F+ I AA L + + + +L RL+P +I + G + EW
Sbjct: 549 AAATMDLALVHDLFTNCIEAARTLGTDVE-FSAGLQDALDRLQPLQIGQYGQLQEW 603
>gi|386726157|ref|YP_006192483.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
gi|384093282|gb|AFH64718.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
Length = 801
Score = 470 bits (1210), Expect = e-130, Method: Compositional matrix adjust.
Identities = 246/596 (41%), Positives = 360/596 (60%), Gaps = 38/596 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+T++ PA+ +T+A+P GNGRLGAMV+GG+ E L+LNEDTLW+G PGD+ NP A + L
Sbjct: 1 MKLTYDKPARVWTEALPAGNGRLGAMVFGGMEHELLQLNEDTLWSGAPGDHNNPRAREVL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR L G+Y EA ++ G Y LGD+ L F H +A + Y R LD+
Sbjct: 61 PEVRRLALEGRYREADRLCKEMLGPYTQSYLPLGDLSLRF--HHGDHAGD-YERHLDVEG 117
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
+ R Y +G V +TRE F S+PDQV+V +++ G+LSF LDS L + + + +
Sbjct: 118 SILRTSYRIGAVTYTRELFVSHPDQVLVLRLTTDRPGALSFTAKLDSALKHRTAADAGD- 176
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
++++GR P K + P D+P G++F A L ++ G ++ L
Sbjct: 177 LVLKGRAPVK-VDPNYYRTDEPVRYAEGDAGGGMRFEARLRVQAD---GAELQVDGGALH 232
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE + LLL A++SF+G P++ +D + + + L++ L+Y +L RH DDY+
Sbjct: 233 VERATEVTLLLTAATSFNGYDKQPAEQGRDESRAAANDLRAASGLTYEELLQRHQDDYRA 292
Query: 304 LFHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF RV++ L SR+P+ + TD R+ + DP L ELLF +GRYLL
Sbjct: 293 LFGRVTLSLGASRAPEGMPTD-------------RRITEYGAS-DPGLAELLFHYGRYLL 338
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR GTQ ANLQGIWN+++ W S +NIN +MNYW + CNLSEC EPL F+
Sbjct: 339 ISSSREGTQPANLQGIWNKEVRAPWSSNYTLNINAQMNYWPAETCNLSECHEPLLGFIGR 398
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L++NG+KT VNY GW HH +DIWA+S+ G VWA WPM GAWL HLWEH
Sbjct: 399 LAVNGAKTVSVNYGLRGWTAHHNSDIWAQSAPVGAYGHGDPVWAYWPMAGAWLSAHLWEH 458
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + + D+L ++AYP+++ A F LDWL+E DG+L + PSTSPEH F+ +G+LA V+
Sbjct: 459 YAFCREEDYLREQAYPVMKEAALFCLDWLVEDADGFLVSAPSTSPEHRFVMAEGELAAVT 518
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++TMD+A++ ++F+ I AA L + + + +L RL+P +I + G + EW
Sbjct: 519 AAATMDLALVHDLFTNCIEAARTLGTDVE-FSAGLQDALDRLQPLQIGQYGQLQEW 573
>gi|379723425|ref|YP_005315556.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|378572097|gb|AFC32407.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
Length = 801
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 247/596 (41%), Positives = 360/596 (60%), Gaps = 38/596 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+T++ PA+ +T+A+P GNGRLGAMV+GGV E L+LNEDTLW+G PGD+ NP A + L
Sbjct: 1 MKLTYDKPARVWTEALPAGNGRLGAMVFGGVEHELLQLNEDTLWSGAPGDHNNPRAREVL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR L G+Y EA ++ G Y LGD+ L F H +A + Y R LD+
Sbjct: 61 PEVRRLALEGRYREADRLCKEMLGPYTQSYLPLGDLSLRF--HHGDHAGD-YERHLDVEG 117
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
+ R Y +G V +TRE F S+PDQV+V +++ G+LSF LDS L + + + +
Sbjct: 118 SILRTSYRIGAVTYTRELFVSHPDQVLVLRLTADRPGALSFTAKLDSALKHRTAADAGD- 176
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
++++GR P K + P D+P G++F A L ++ G ++ L
Sbjct: 177 LVLKGRAPAK-VDPNYYRTDEPVRYAEGDAGGGMRFEARLRVQAD---GAELQVDSGALH 232
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE + LLL A++SF+G P++ +D + + L++ L+Y +L RH DDY+
Sbjct: 233 VERATEVTLLLTAATSFNGYDKQPAEQGRDESRAAADDLRAASGLTYDELLQRHQDDYRA 292
Query: 304 LFHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF RV++ L SR+P+ + TD R+ + DP L ELLF +GRYLL
Sbjct: 293 LFGRVTLSLGASRAPEGMPTD-------------RRIAEYGAS-DPGLAELLFHYGRYLL 338
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR GTQ ANLQGIWN+++ W S +NIN +MNYW + CNLSEC EPL F+
Sbjct: 339 ISSSREGTQPANLQGIWNKEVRAPWSSNYTLNINAQMNYWPAETCNLSECHEPLLGFIGR 398
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L++NG+KT VNY GW HH +DIWA+S+ G VWA WPM GAWL HLWEH
Sbjct: 399 LAVNGTKTVSVNYGLRGWTAHHNSDIWAQSAPVGAYGHGDPVWAYWPMAGAWLSAHLWEH 458
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + + D+L ++AYP+++ A F LDWL+E DG+L ++PSTSPEH F+ +G+LA V+
Sbjct: 459 YAFCREEDYLREQAYPVMKEAALFCLDWLVEDADGFLVSSPSTSPEHRFVTAEGELAAVT 518
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++TMD+A++ ++F+ I AA L + + + +L RL+P +I + G + EW
Sbjct: 519 AAATMDLALVHDLFTNCIEAARTLGTDVE-FSAGLQDALDRLQPLQIGQYGQLQEW 573
>gi|379721553|ref|YP_005313684.1| hypothetical protein PM3016_3718 [Paenibacillus mucilaginosus 3016]
gi|378570225|gb|AFC30535.1| hypothetical protein PM3016_3718 [Paenibacillus mucilaginosus 3016]
Length = 806
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 246/595 (41%), Positives = 352/595 (59%), Gaps = 37/595 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ I F PA ++T+A+P+GNGRLGAM++GGV E + LNEDTLW+G P D+ NP+A + L
Sbjct: 14 MNIQFQSPAVYWTEALPVGNGRLGAMIFGGVEKERIALNEDTLWSGYPTDWNNPEAREVL 73
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR L+ +Y EA + G Y GD+ + + H + Y R+LDL+T
Sbjct: 74 PKVRQLIAEQRYEEADRFCKFMMGPFTQSYLPFGDLHILME--HGQVCGRGYERKLDLST 131
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V Y +G+V +TRE F+S+PDQVIV +++ S+ G LSF LDS L + S + ++
Sbjct: 132 GIVTVTYDIGDVSYTREVFASHPDQVIVVRLTASKEGLLSFRAKLDSPLRSSSKPDADH- 190
Query: 193 IIMEGRCPGKRIPPKANANDD--------PKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ G P P N + PK ++F L + G +E L +
Sbjct: 191 YTLSGIAPEYVAPNYYNVKNPVHYGDQQAPKSLKFYGRLS---AVHEGGNMKVEADGLSI 247
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L A++SFD P I S + + P + A+Q+I YSD+ H+DD+ +L
Sbjct: 248 VGATSATLYFSAATSFD-PLIGASSTNRVPEQVTEEAIQAILGKKYSDIRKHHVDDHSRL 306
Query: 305 FHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
FHRV + L S +P+D+ TD +R+ + + DP LVELLF +GRYL+I
Sbjct: 307 FHRVDLHLGESSAPQDLPTD-------------QRIAEYGS-RDPGLVELLFHYGRYLMI 352
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
+SSRPGTQ ANLQGIWNED W S +NIN EMNYW + CN++E EPL DF+ L
Sbjct: 353 ASSRPGTQPANLQGIWNEDTRAPWSSNYTLNINAEMNYWPAETCNMAELHEPLIDFIGRL 412
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
++NG KTA+VNY A GWV HH +D+WA+++ G VWA WP+GG WL HLWEHY
Sbjct: 413 AVNGRKTAEVNYGARGWVAHHNSDVWAQTAPVGDYGHGDPVWAFWPLGGVWLTQHLWEHY 472
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
++ + FL AYP+++ A F LDWL DGY T+PSTSPEH+F+ D + A V
Sbjct: 473 AFSGNEAFLRDTAYPIMKQAALFCLDWLTPNEDGYWITSPSTSPEHKFMIGDQRYA-VGA 531
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++TMD+A+I E+FS I++AE L+ +E+ +L++ +L P +I + G + EW
Sbjct: 532 AATMDLALIGELFSNCITSAETLQVDEE-FANTLLETKQKLLPMQIGKKGQLQEW 585
>gi|337748528|ref|YP_004642690.1| hypothetical protein KNP414_04288 [Paenibacillus mucilaginosus
KNP414]
gi|336299717|gb|AEI42820.1| hypothetical protein KNP414_04288 [Paenibacillus mucilaginosus
KNP414]
Length = 806
Score = 458 bits (1179), Expect = e-126, Method: Compositional matrix adjust.
Identities = 246/595 (41%), Positives = 351/595 (58%), Gaps = 37/595 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ I F PA ++T+A+P+GNGRLGAM++GGV E + LNEDTLW+G P D+ NP+A + L
Sbjct: 14 MNIQFQSPAVYWTEALPVGNGRLGAMIFGGVEKERIALNEDTLWSGYPTDWNNPEAREVL 73
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR L+ +Y EA + G Y GD+ + + H + Y R+LDL+T
Sbjct: 74 PKVRQLIAEQRYEEADRFCKFMMGPFTQSYLPFGDLHIVME--HGQVCGRGYERKLDLST 131
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V Y +G+V +TRE F+S+PDQVIV +++ S+ G LSF LDS L + S + ++
Sbjct: 132 GIVTVTYDIGDVSYTREVFASHPDQVIVVRLTASKEGLLSFRAKLDSPLRSSSKPDADH- 190
Query: 193 IIMEGRCPGKRIPPKANANDD--------PKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ G P P N + PK ++F L + G +E L +
Sbjct: 191 YTLSGIAPEYVAPNYYNVKNPVHYGDQQAPKSLKFYGRLS---AVHEGGNMKVEADGLSI 247
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L A++SFD P I S + + P + A+Q+I YSD+ H+DD+ +L
Sbjct: 248 VGATSATLYFSAATSFD-PLIGASSTNRMPEQVTEEAIQAILGKKYSDIRKHHVDDHSRL 306
Query: 305 FHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
FHRV + L S +P+D+ TD R+ + + DP LVELLF +GRYL+I
Sbjct: 307 FHRVDLHLGESSAPQDLPTD-------------RRIAEYGS-RDPGLVELLFHYGRYLMI 352
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
+SSRPGTQ ANLQGIWNED W S +NIN EMNYW + CN++E EPL DF+ L
Sbjct: 353 ASSRPGTQPANLQGIWNEDTRAPWSSNYTLNINAEMNYWPAETCNMAELHEPLIDFIGRL 412
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
++NG KTA+VNY A GWV HH +D+WA+++ G VWA WP+GG WL HLWEHY
Sbjct: 413 AVNGRKTAEVNYGARGWVAHHNSDVWAQTAPVGDYGHGDPVWAFWPLGGVWLTQHLWEHY 472
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
++ + FL AYP+++ A F LDWL DGY T+PSTSPEH+F+ D + A V
Sbjct: 473 AFSGNEAFLRDTAYPIMKQAALFCLDWLTPNEDGYWITSPSTSPEHKFMIGDQRYA-VGA 531
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++TMD+A+I E+FS I++AE L+ +E+ +L++ +L P +I + G + EW
Sbjct: 532 AATMDLALIGELFSNCITSAETLQVDEE-FANTLLETKQKLLPMQIGKKGQLQEW 585
>gi|261406875|ref|YP_003243116.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261283338|gb|ACX65309.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 802
Score = 453 bits (1166), Expect = e-124, Method: Compositional matrix adjust.
Identities = 239/588 (40%), Positives = 346/588 (58%), Gaps = 29/588 (4%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + DA+ +GNGRLG MV+GG+ E + LNEDTLW+G P D N +A L V+
Sbjct: 16 YRNPAAEWVDALAVGNGRLGGMVYGGIFRERISLNEDTLWSGHPYDPNNREAAAYLETVQ 75
Query: 77 SLVDSGQYAEATAA-SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
LV G+Y EA + G ++ YQ LGD+ LE +++ E YRRELDLN A
Sbjct: 76 KLVFEGKYPEAQRTIEEHMLGPWSESYQPLGDLYLELEETG---KAEHYRRELDLNDAVC 132
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
R ++++ V + RE F S DQV+V + + + G ++ + SLDS L + + +++ M
Sbjct: 133 RTRFTLNGVRYVRETFVSAVDQVMVVRFTADQPGRIAVSASLDSQLRHQALRVSADKLAM 192
Query: 196 EGRCPGKRIPPKANAND-----DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+GR P P A +ND + +GI+F A ++ + G + + ++++EG+D
Sbjct: 193 KGRSPSHVEPLHARSNDPVIYEEGRGIRFEA--QLLALPEGGATTEDGEGRIRIEGADAV 250
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
LL AS+SF+G NP ++P S L + LSY +L RH+ DY+ L+ RV +
Sbjct: 251 TFLLAASTSFNGFDKNPVLEGRNPAELCRSCLDAAAKLSYGELLDRHVQDYRALYGRVEL 310
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGT 369
+L +P + +P+ ER+++ + D+ D L L FQFGRYLL+SSSRPGT
Sbjct: 311 ELD-AP-----------GLQHLPTDERIRALREDKTDEQLAVLFFQFGRYLLLSSSRPGT 358
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ + P W VNIN +MNYW + CNL+EC EPLF L L I G +T
Sbjct: 359 QAANLQGIWNQSMRPPWSCNYTVNINTQMNYWPAEVCNLAECHEPLFRLLEDLRIAGRET 418
Query: 430 AQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
A +Y A GWV HH D+W ++ G WA WPMGGAWL H+WEHY + DR
Sbjct: 419 ASAHYKARGWVSHHAVDLWRITTPSGGPSGGPASWAYWPMGGAWLSQHVWEHYRFGGDRT 478
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL + YP+++ A F LD+L+E DGYL +NPSTSPE+ F PDG+ A VS +TMD+A
Sbjct: 479 FLSQVGYPIMKEAALFFLDYLVEDADGYLVSNPSTSPENTFALPDGRKAAVSMDATMDIA 538
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++RE+F + A++ L + + +E + + RLRP +I G + EW
Sbjct: 539 LLRELFGNCMEASDHLGIDRELRLE-LAAARARLRPFQIGRRGQLQEW 585
>gi|15613405|ref|NP_241708.1| hypothetical protein BH0842 [Bacillus halodurans C-125]
gi|10173457|dbj|BAB04561.1| BH0842 [Bacillus halodurans C-125]
Length = 795
Score = 444 bits (1141), Expect = e-122, Method: Compositional matrix adjust.
Identities = 248/596 (41%), Positives = 343/596 (57%), Gaps = 39/596 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+KI F+ PA +T+A+PIGNG LGAMV+G V E + LNEDTLW+G P D+ NP A + L
Sbjct: 1 MKIQFDFPASFWTEALPIGNGNLGAMVFGKVEKERIALNEDTLWSGYPKDWNNPKAKEVL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR L+ +Y EA S + G Y GD+ + D H + Y RELDL+T
Sbjct: 61 PKVRELIAQEKYEEADQLSRDMMGPYTQSYLPFGDLNIFMD--HGQVVAPHYHRELDLST 118
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V Y++G V++TRE F + PD+ IV +++ S+ G LSF LDSLL + S V G
Sbjct: 119 GIVTVTYTIGGVQYTRELFVTYPDRAIVVRLTASKEGFLSFRAKLDSLLRHVSSV-GAEH 177
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
+ G P + + P ++P +G+ F L + + G ++ L
Sbjct: 178 YTISGTAP-EHVSPSYYDEENPVRYGHPDMSQGMTFHGRL---AAVNEGGSLKVDADGLH 233
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+ A L AS+SFD P S ++DP+ ++ +++I Y ++ RHL+DY K
Sbjct: 234 VMGATCATLYFSASTSFD-PSTGASCLERDPSLRTIETIKAICKRGYKEIVNRHLEDYTK 292
Query: 304 LFHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF+RVS+ L S P D+ TD +R+K + + D LVELLFQ+GRYL+
Sbjct: 293 LFNRVSLHLGESIAPADMSTD-------------QRIKEYGS-RDLGLVELLFQYGRYLM 338
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I+SSRPGTQ ANLQGIWNE+ W S +NIN EMNYW + CNL+E +PL F+
Sbjct: 339 IASSRPGTQPANLQGIWNEETRAPWSSNYTLNINAEMNYWPAETCNLAELHKPLIHFIER 398
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L+ NG KTA++NY A GWV HH D+W +++ G VWA WPMGG WL HLWEH
Sbjct: 399 LAANGKKTAEINYGARGWVAHHNADLWGQTAPVGDFGHGDPVWAFWPMGGVWLTQHLWEH 458
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + D +L AYP+++ A F LDWLIE GYL T+PSTSPE F + K VS
Sbjct: 459 YTFGEDEAYLRDTAYPIMKEAALFCLDWLIENEAGYLVTSPSTSPEQRFRIGE-KGYAVS 517
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++TMD+++I E F I AA+ L +ED V+ + + RL P +I + G + EW
Sbjct: 518 SATTMDLSLIAECFDNCIQAAKRLSIDED-FVKALSDAKQRLLPLQIGKRGQLQEW 572
>gi|251797558|ref|YP_003012289.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
gi|247545184|gb|ACT02203.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
Length = 790
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 240/596 (40%), Positives = 347/596 (58%), Gaps = 39/596 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ +N + +TDA+P GNGRLGAM++GG E ++LNEDTLW+G P N +A K L
Sbjct: 1 MKLQYNRASVRWTDALPTGNGRLGAMMFGGSEMERIQLNEDTLWSGGPRYGDNDNAVKVL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR L++ GQYA A ++ G Y + D+ ++F + + YRR L L
Sbjct: 61 PEVRKLIEEGQYAAADRLCKQMMGTYTQSYLPMADLYIKFLHGNTM---KNYRRALHLGD 117
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
AT+ V+Y +GNV +TR F S PDQV+V ++ S+ G L+F L+S L + + +
Sbjct: 118 ATSTVEYQIGNVTYTRRLFVSYPDQVVVLRLEASQPGKLNFLARLESPLRYETAFD-QDA 176
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
+I+ G P +++ P D P ++F + ++ D G S D L+
Sbjct: 177 LILRGDAP-EQVDPSYYDTDMPVKYGEPGSANAMRFEGRMAARL--DEGQASYGHDG-LR 232
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+ L+ A++SF+G +P KD ++ + + L+ + LSY L RH++D++K
Sbjct: 233 VTGATAVTLIFSAATSFNGYDRSPGSEGKDESAAASAYLEQAKKLSYESLLQRHVEDHRK 292
Query: 304 LFHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF+RV + L S P D TD R++ + DP LVELL+ +GRYL+
Sbjct: 293 LFNRVELSLGESVAPPDYPTDA-------------RIRDYGAS-DPGLVELLYHYGRYLM 338
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSR GTQ ANLQGIWNE+ W +NIN EMNYW + CNL++C PL DF+
Sbjct: 339 IGSSRKGTQPANLQGIWNEETRAPWSGNYTLNINAEMNYWPAETCNLADCHTPLLDFIGN 398
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
LS NG KTA NY A+GW HH +DIW +S+ G WA WPMGG WLC HLWEH
Sbjct: 399 LSKNGRKTASTNYGAAGWTAHHNSDIWCQSAPAGDYGHGDPGWAFWPMGGVWLCQHLWEH 458
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + +D FL +AYP+++ A F LDWL E DG L T+PSTSPEH+F +G LA VS
Sbjct: 459 YAFGLDEAFLRDKAYPVMKEAALFCLDWLHEDKDGRLITSPSTSPEHKFRTAEG-LAAVS 517
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+STMD+++I ++F+ +I A+ +L +E E++ + RL P +I E+G + EW
Sbjct: 518 AASTMDLSLIWDLFTNLIEASTILGVDE-PFRERLADTRSRLHPLQIGENGRLQEW 572
>gi|298246864|ref|ZP_06970669.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
gi|297549523|gb|EFH83389.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
Length = 809
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 240/596 (40%), Positives = 346/596 (58%), Gaps = 33/596 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ + PA + +A+P+GNG LGAMV GG+ E L+LNEDTLW+G P D NPDA
Sbjct: 15 PLKLWYRQPATQWLEALPVGNGHLGAMVHGGISEEVLQLNEDTLWSGEPYDTDNPDAVTH 74
Query: 72 LSDVRSLV-DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L ++R L+ + Y A + ++ G + YQ LG + L+F+ + + Y+R LDL
Sbjct: 75 LPELRRLILEERDYVAAQELAHRMQGPYNESYQPLGYVRLKFEQ---RGEVQAYQRALDL 131
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
NTA A V+Y G++ F+RE FSS D ++V +++ +LS L+SL G+
Sbjct: 132 NTALATVQYKAGDILFSREVFSSAADDLLVIRLTSDTPHALSLTAHLESLHPFTCAPAGS 191
Query: 191 NQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKK 241
N+I M GRCP + + P + DP G++F L+ + + G ISA D
Sbjct: 192 NKIRMTGRCP-RHVDPDYLSTSDPVIYDHGEDGHGMRFETQLQAMV--EGGRISADVDGA 248
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+VE + L A++S+ G P S + + L + + Y L H++DY
Sbjct: 249 LRVENAHAVTFFLSAATSYRGFASRPDLSAHVLEQQCTTRLAAGMSKGYEVLRAAHINDY 308
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
Q+LF RV++ L S + +P+ ER+ + Q D +L+ L FQ+GRYL
Sbjct: 309 QQLFQRVTLDLGTS------------DGQELPTDERLAAVQKGASDDALLALYFQYGRYL 356
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI+SSRPGTQ ANLQGIWN+ + P W S +NIN +MNYW + CNL+EC PLFD L
Sbjct: 357 LIASSRPGTQSANLQGIWNDHVRPAWSSNYTININTQMNYWLAETCNLAECHSPLFDLLE 416
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEH 477
S++G +TAQV Y GWV HH D+W ++ G WA W MGGAWLC HLWEH
Sbjct: 417 EASVSGERTAQVYYGCRGWVAHHNMDLWRNTAPVGNGSGGPQWANWNMGGAWLCQHLWEH 476
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y ++ DR FL +RAYP+++ A FLLD+L+E G+L T PST+PE+ FI G+L+ VS
Sbjct: 477 YAFSGDRSFLSQRAYPIMKKAAQFLLDFLVEDKQGHLTTCPSTAPENLFITESGELSGVS 536
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
STMD+AI E+F+ I+A++VL+ ++ ++ ++L RL I G + EW
Sbjct: 537 AGSTMDIAITHELFTHCIAASQVLDIDQ-GFAHELAQALARLPQPGIGSYGQLQEW 591
>gi|182419971|ref|ZP_02951207.1| twin-arginine translocation pathway signal [Clostridium butyricum
5521]
gi|237666001|ref|ZP_04525989.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Clostridium butyricum E4 str.
BoNT E BL5262]
gi|182376222|gb|EDT73807.1| twin-arginine translocation pathway signal [Clostridium butyricum
5521]
gi|237658948|gb|EEP56500.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Clostridium butyricum E4 str.
BoNT E BL5262]
Length = 799
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 237/594 (39%), Positives = 348/594 (58%), Gaps = 27/594 (4%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDA 68
+ L++ + PA+ + +A+P+GNGR+GAMV+GGV E L+LNEDTLW+GVP + T+ +
Sbjct: 2 NDKLRLWYTKPAEKWVEALPLGNGRIGAMVFGGVYRERLQLNEDTLWSGVPITEETDENF 61
Query: 69 PKALSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
L R L+ G+Y ++ + KL G + Y LG++ +FD+ Y + Y R+
Sbjct: 62 IDDLEKARKLIFEGKYCKSENIINNKLLGPWNESYLPLGNLYFDFDNEG-DYVD--YERD 118
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L+L A++ VKY++ N+ + R F S D IV K S+ G +SF S DSLL
Sbjct: 119 LNLEDASSCVKYTMNNIRYKRTTFISKSDNAIVIKFESSKEGKISFKASFDSLLRYTVVT 178
Query: 188 NGNNQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N I + G+ P +P + DD +G+ F A+LE+ + G I + E+ L
Sbjct: 179 ENKNSISLLGKAPIHVLPSYEDGEKPVIYDDKRGMNFKAVLEV--NGINGDIKS-ENGIL 235
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
KV+ +D ++ +V +SF+G KD +++Q IR+ +Y +LY H +Y+
Sbjct: 236 KVKDADEVIIKIVVHTSFNGYKNEAGTQGKDVNDLCENSIQKIRDKTYVNLYNAHKIEYK 295
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLL 361
LF R+ L+ D ++ P+ +R+++F+ ++ D L+ L FQ+GRYLL
Sbjct: 296 SLFDRLQFTLNSDFTD-----------NSTPTDKRIENFKENKNDLGLISLYFQYGRYLL 344
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR GTQ ANLQGIWNEDL P W S NINLEMNYW + CNL EC EPLF F+
Sbjct: 345 ISSSRKGTQPANLQGIWNEDLRPAWSSNYTTNINLEMNYWLAEVCNLQECHEPLFKFIRE 404
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+S G +TA++ Y GW +H D+W ++S G WA WPM GAWLC+H+WEHY +T
Sbjct: 405 VSEVGKETAKIRYNCRGWTANHNIDLWRQTSPAGGSTEWAYWPMAGAWLCSHIWEHYEFT 464
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D FL K YP+++ CA FL+DWL+E +GYL T PS SPE+ FI +G+ +CVS +ST
Sbjct: 465 NDVKFL-KEMYPIMKSCAEFLVDWLMEDENGYLVTCPSISPENNFITEEGEKSCVSIAST 523
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MDM+I + +F I AA +LE ++ E + L P KI + G + EW +
Sbjct: 524 MDMSITKNLFKNCIDAANILEIDKKFRSE-LKNYYNNLYPYKIGKFGQLQEWFK 576
>gi|298246866|ref|ZP_06970671.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
gi|297549525|gb|EFH83391.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
Length = 809
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 241/610 (39%), Positives = 349/610 (57%), Gaps = 36/610 (5%)
Query: 1 MMNAESTSTTN---PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
M A++ + PLK+ + PA + +A+P+GNG LGAM+ GG+ E L+LNEDTLW+
Sbjct: 1 MYQAQAAGVSQDKPPLKLWYRQPATQWLEALPVGNGHLGAMIHGGIGEEVLQLNEDTLWS 60
Query: 58 GVPGDYTNPDAPKALSDVRSLV-DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSH 116
G P D NPDA L ++R L+ + Y A + ++ G + YQ LG + L+F+
Sbjct: 61 GEPYDTDNPDAVTLLPELRRLILEERDYVAAQELAHRMQGPYNESYQPLGYVRLKFEQ-- 118
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+ + Y+R LDLNTA A V+Y G++ F+RE FSS D ++V +++ +LS
Sbjct: 119 -RGEVQAYQRALDLNTALATVQYKAGDILFSREVFSSAADDLLVIRLTSDTPHALSLTAH 177
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKI 227
L+SL G+N+I M GRCP + + P DP G++F L+ +
Sbjct: 178 LESLHPFTCAPAGSNKIRMTGRCP-RHVDPDYLPTSDPVIYDHGEDGHGMRFETQLQAMV 236
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
+ G ISA D L+VE + L A++S+ G P S + + L +
Sbjct: 237 --EGGRISADVDGALRVENAHTVTFFLSAATSYRGFASRPDLSAHVLEQQCTTRLAVGMS 294
Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-ED 346
Y L H+ DYQ+LF RV++ L RS + + +P+ ER+ + Q D
Sbjct: 295 KGYEVLRAAHISDYQRLFQRVTLDLGRS------------DGENLPTDERLVAVQKGASD 342
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
+L+ L FQ+GRYLLISSSRPGTQ A+LQGIWN+ + P W S +N+N +MNYW + C
Sbjct: 343 DALLALFFQYGRYLLISSSRPGTQPAHLQGIWNDHVRPAWSSNWTINMNTQMNYWPAETC 402
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALW 463
NL+EC PLFD L S++G +TAQV Y GWV HH D+W ++ G WA W
Sbjct: 403 NLAECHSPLFDLLEEASVSGERTAQVYYGCRGWVAHHNMDLWRNTAPVGNGSGDPQWANW 462
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
MGGAWLC HLWEHY ++ DR FL +RAYP+++ A FLLD+L+E G+L T PS SPE
Sbjct: 463 NMGGAWLCQHLWEHYAFSGDRSFLSQRAYPIMKKAAQFLLDFLVEDRQGHLTTCPSMSPE 522
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
+ FI G+L+ VS STMD+AI E+F+ I+A++VL+ ++ ++ ++L RL
Sbjct: 523 NLFITESGELSGVSAGSTMDIAITHELFTHCIAASQVLDIDQ-GFAHELAQALARLPQPG 581
Query: 584 IAEDGSIMEW 593
I G + EW
Sbjct: 582 IGSYGQLQEW 591
>gi|253574360|ref|ZP_04851701.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
gi|251846065|gb|EES74072.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
Length = 817
Score = 435 bits (1118), Expect = e-119, Method: Compositional matrix adjust.
Identities = 246/598 (41%), Positives = 353/598 (59%), Gaps = 39/598 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA +T+A+P+GNGRLGAM++GGV ET+ LNEDTLW+G P D+ NP A + L
Sbjct: 6 KLQYDRPATVWTEALPVGNGRLGAMIYGGVERETISLNEDTLWSGYPRDWNNPSARQVLP 65
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+VR LV G+Y EA ++ G + Y GD++L F+ A +YRR LDL A
Sbjct: 66 EVRKLVREGRYEEADQLGRQMLGPYTESYLPFGDLQLTFEHGA---ACRSYRRTLDLADA 122
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+Y+VG V + RE F S+PD++I +++ S+ G+L+F+ LDS L + + V +
Sbjct: 123 IHVTEYTVGKVSYKREIFVSHPDRIIAMRLTCSQPGALAFHARLDSPLRHIAAVE-DGIF 181
Query: 194 IMEGRCPGKRIPPKANAN-----DDPK---GIQFSAILEIKISDDRGTISALEDKKLKVE 245
+M G P + P NA+ DP + F L + +D R ++ + ++V
Sbjct: 182 VMRGTAPERVEPNYVNADRPIRYGDPAVSPAMAFEGRLAVTETDGRVSV---DGDGIRVL 238
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS------YSDLYTRHLD 299
+ AVL A++SFD P + + ++A ++ +L+ Y ++ RH++
Sbjct: 239 DATEAVLYFSAATSFDRFDQIPGAGRPESVPADVAAARARADLTGALANRYLEIRARHIE 298
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF RVS++L +T + E +DT ER DP LVELLF +GRY
Sbjct: 299 DYQALFSRVSLRLG--------ETAAPEGLDT----ERRIVEYGAADPGLVELLFHYGRY 346
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLI+SSRPGTQ ANLQGIWN P W S +NIN EMNYW + CNL+EC PL + +
Sbjct: 347 LLIASSRPGTQAANLQGIWNAMTRPPWSSNWTLNINAEMNYWPAEVCNLAECHWPLLEMI 406
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLW 475
L+ NG+KTA VNY GWV HH +DIW +++ G VWALWP+GG WL HLW
Sbjct: 407 GNLAENGAKTAAVNYGTRGWVAHHNSDIWGQTAPVGDFGGGDPVWALWPLGGVWLTQHLW 466
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY + D +L AYP+L+ A F LDWLIE G+L T+PSTSPEH+F +G +A
Sbjct: 467 EHYVFGGDVAYLHDFAYPILKDAALFALDWLIEDESGHLVTSPSTSPEHKFRTANG-VAA 525
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+S STMD+++I E+F+ I AA VL +E A E++ ++ RL P ++ + G + EW
Sbjct: 526 ISEGSTMDLSLIWELFTNCIEAAGVLGIDE-AFREELRQARERLLPLQVGKYGQLQEW 582
>gi|261406536|ref|YP_003242777.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261282999|gb|ACX64970.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 806
Score = 434 bits (1117), Expect = e-119, Method: Compositional matrix adjust.
Identities = 241/600 (40%), Positives = 354/600 (59%), Gaps = 41/600 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + PA +T+A+PIGNGRLG MV+G V ET+ LNEDTLW+G P D+ NP A +AL
Sbjct: 1 MKLQYVKPATVWTEALPIGNGRLGGMVYGCVERETISLNEDTLWSGYPRDWNNPSALEAL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
++R L G+Y EA K+ G + Y LGD+ L FD + + +YRR LD+
Sbjct: 61 PEIRELASQGRYMEADQLGRKMMGPYTESYLPLGDLCLRFDHGGVFH---SYRRTLDIAN 117
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A R +Y +G V +TRE F+S+PDQ+I +++ S + +L+F+ L+S L ++ +
Sbjct: 118 AVQRTEYRIGEVTYTRECFASSPDQMIALRLTSSAACALNFHAYLESPL-RYTVKTEEDM 176
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
M G P +R+ P ++D P + F+ L + +D R T+ + +
Sbjct: 177 YAMSGFAP-ERVEPSYVSSDHPIRYGDPDHTAAMAFNGRLAVAETDGRVTV---DSAGIH 232
Query: 244 VEGSDWAVLLLVASSSFDGPFINPS--DSKKDPTSE----SMSALQSIRNLSYSDLYTRH 297
V + AV+ A++SF+G P D P + + +++ + S+++L RH
Sbjct: 233 VLDASEAVIYFTAATSFNGFDQIPGHRDGGDHPAAAAAALTAGTMKAACSQSWTELRDRH 292
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++DY+ LF RVS++L +T + E++DT ER++ F DP LVELLF +G
Sbjct: 293 INDYRSLFDRVSLRLG--------ETLAAEDMDT---GERIERFGA-RDPGLVELLFHYG 340
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSRPGTQ ANLQGIWN P W S +NIN +MNYW + CNL+EC +PL +
Sbjct: 341 RYLLISSSRPGTQAANLQGIWNASTRPPWSSNWTLNINAQMNYWPAEVCNLAECHQPLLE 400
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTH 473
+ LS+NG++TA V+Y GW +HH TDIWA ++ G WALW MGG WL H
Sbjct: 401 LIRSLSVNGAETAAVHYGTRGWTVHHNTDIWAHTAPVGNYGDGDPSWALWQMGGIWLTQH 460
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY Y+ D +L AYPL++ + F LDWLIE G+L T+PSTSPEH+F +G +
Sbjct: 461 LWEHYAYSGDEAYLRSFAYPLMKEASLFALDWLIENDAGHLVTSPSTSPEHKFRTSEG-M 519
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A +S +TMD+++I E+F+ + AA +L +E+ E+ RL P K+ G + EW
Sbjct: 520 AAISEGATMDISLIWELFTNCMEAAGILGVDEE-FREEWSSKRERLLPLKVGRYGQLQEW 578
>gi|329926959|ref|ZP_08281359.1| hypothetical protein HMPREF9412_5205 [Paenibacillus sp. HGF5]
gi|328938789|gb|EGG35165.1| hypothetical protein HMPREF9412_5205 [Paenibacillus sp. HGF5]
Length = 812
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 243/600 (40%), Positives = 352/600 (58%), Gaps = 41/600 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + PA +T+A+PIGNGRLG MV+GGV ET+ LNEDTLW+G P D+ NP A +AL
Sbjct: 5 MKLQYVKPATVWTEALPIGNGRLGGMVYGGVERETISLNEDTLWSGYPRDWNNPSAREAL 64
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
++R L G+Y EA K+ G Y LGD+ L FD + + +YRR LD+
Sbjct: 65 PEIRELASQGRYMEADQLGRKMMGPYTQSYLPLGDLCLRFDHGGVFH---SYRRTLDIAN 121
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A R +Y +G V +TRE F+S+PDQ+I +++ S + SL+F+ L+S L ++ +
Sbjct: 122 AVQRTEYRIGEVTYTRECFASSPDQMIALRLTSSAACSLNFHAYLESPL-RYTVKTEEDM 180
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
M G P +R+ P ++D P + F L + +D R T+ A +
Sbjct: 181 YAMSGFAP-ERVEPSYVSSDRPIRYGDPEHTAAMAFDGRLAVAETDGRVTMDA---AGIH 236
Query: 244 VEGSDWAVLLLVASSSFDGPFINPS--DSKKDPTSESM----SALQSIRNLSYSDLYTRH 297
V + AV+ A++SF+G P D P + + +++ + S+++L RH
Sbjct: 237 VLEASEAVIYFTAATSFNGFDQIPGHRDGGDHPAAAAAAIAAGTMKAACSQSWTELRDRH 296
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++DY+ LF RVS++L +T + ++DT ER++ F DP LVELLF +G
Sbjct: 297 VNDYRSLFDRVSLRLG--------ETLAVGDMDT---EERIERFGA-RDPGLVELLFHYG 344
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSRPGTQ ANLQGIWN P W S +NIN +MNYW + CNL+EC +PL +
Sbjct: 345 RYLLISSSRPGTQAANLQGIWNASTRPPWSSNWTLNINAQMNYWPAEVCNLAECHQPLLE 404
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTH 473
+ LS+NG++TA V+Y GW +HH TDIWA ++ G WALW MGG WL H
Sbjct: 405 LIRSLSVNGAETAAVHYGTRGWTVHHNTDIWAHTAPVGNYGDGDPSWALWQMGGIWLTQH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY Y+ D +L AYPL++ + F +DWLIE G+L T+PSTSPEH+F +G L
Sbjct: 465 LWEHYAYSGDEAYLRSFAYPLMKEASLFAMDWLIENDAGHLLTSPSTSPEHKFRTSEG-L 523
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A VS +TMD+++I E+F+ + AA +L +E+ E+ RL P ++ G + EW
Sbjct: 524 AAVSEGATMDISLIWELFTNCMEAAVILGVDEE-FREEWSSKRERLLPLQVGRYGQLQEW 582
>gi|410456476|ref|ZP_11310337.1| alpha-L-fucosidase [Bacillus bataviensis LMG 21833]
gi|409928145|gb|EKN65268.1| alpha-L-fucosidase [Bacillus bataviensis LMG 21833]
Length = 789
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 231/593 (38%), Positives = 336/593 (56%), Gaps = 33/593 (5%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+++ A H+T+A+P+GNGR+GAM +GGV +E +LNEDTLW+G P + +L
Sbjct: 4 LSYKKAASHWTEALPLGNGRIGAMHFGGVETERFQLNEDTLWSGPPQHKREYNDQASLKK 63
Query: 75 VRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
VR L+D +Y +A + + +FG + Y LG++ + + A + Y+R LD+NTA
Sbjct: 64 VRKLLDEEKYEDAISETKNMFGPYTESYMPLGNLFIHYLHGD---AAQKYQRTLDINTAI 120
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
+ VKY+VG + +TRE F S+P QV+ +++ S + L+ N+SLDSLL + N +
Sbjct: 121 STVKYTVGKINYTREAFISHPHQVLAIQLTSSAANQLNVNISLDSLL-KYQTANSKEALS 179
Query: 195 MEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++G CP K P N ++ P K I F L + + D S + +L ++
Sbjct: 180 LQGVCPEKCAPVYFNESETPIVYGEFGETKAIHFEGRLGLVLEDGTALTS---NGRLSIQ 236
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ VL ++SF G P ++ ++ + L ++ Y L H+ DYQ L+
Sbjct: 237 DATRVVLYFSVATSFKGYDQLPGTDFEELIQKNEAILAKAMSIPYEQLRETHIQDYQTLY 296
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RV L + SEE +DT ERV + D D +VELLF +GRYLLI+SS
Sbjct: 297 NRVGFSLG--------NKQSEEMLDT---DERVTKYSAD-DLEMVELLFHYGRYLLIASS 344
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R GTQ ANLQGIWN+ W S +NIN EMNYW + NL+EC PL + LS+
Sbjct: 345 REGTQPANLQGIWNDITRAPWSSNYTLNINTEMNYWPAEVTNLAECHRPLLQAIKELSVT 404
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G Y GW HH TD+W + G WA WPM G WLC HLWEHY Y+
Sbjct: 405 GENMVNQRYGLHGWTAHHNTDLWRHAHPVGDERHGDPNWAFWPMSGPWLCRHLWEHYQYS 464
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
DRDFLEK A+P+++G A F L+WL+E +GYL T+PSTSPEH F DG+L V+ ST
Sbjct: 465 QDRDFLEKEAFPVMKGAAQFCLEWLVEDENGYLITSPSTSPEHHFYTEDGQLGSVTKGST 524
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
MD+ II ++FS I AAE+ +E+ +++V ++ RL P +I + G + EW+
Sbjct: 525 MDLQIIWDLFSNCIEAAEICGVDEE-WIQQVREAKDRLHPNQIGKYGQLQEWL 576
>gi|253574718|ref|ZP_04852058.1| twin-arginine translocation pathway signal [Paenibacillus sp. oral
taxon 786 str. D14]
gi|251845764|gb|EES73772.1| twin-arginine translocation pathway signal [Paenibacillus sp. oral
taxon 786 str. D14]
Length = 799
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 233/594 (39%), Positives = 338/594 (56%), Gaps = 24/594 (4%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA + +A+P+GNGR+G MV+GG+ E + LNEDTLW+G P D N DA + L
Sbjct: 13 KLWYDRPASRWEEALPVGNGRIGGMVFGGIHRERIALNEDTLWSGFPRDPQNYDALRHLG 72
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAE---ETYRRELD 129
R L+ +G+Y EA K+ G + YQ LGD+ LE DS + + +RRELD
Sbjct: 73 PARELIFAGKYKEAEKLIDAKMLGRRTESYQPLGDLWLEQGDSATEADGNELQGFRRELD 132
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN- 188
L T A Y +G E+ RE F S DQV+V +I+ S ++ SLDSLL + ++
Sbjct: 133 LATGIATTTYRIGGAEYRREVFISAVDQVMVLRITALGSEPVNMAASLDSLLRHQAFGGP 192
Query: 189 -GNNQIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+I M G+ P + P++ +D G+ F A L + + + GT+ A +
Sbjct: 193 AETARICMRGQAPSHIADNYRGDHPQSVLYEDGLGLTFEAQL-LALPEGGGTVQADASGR 251
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L V G+ LLL A++ + G P DP +AL + L Y L RH D+
Sbjct: 252 LTVSGAKAVTLLLAAATDYAGYDQAPGSGGIDPAERCQAALDAAAALGYEQLRQRHEADH 311
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
++LF RV ++L P+ ER+++++ E D L L F +GRYL
Sbjct: 312 RRLFGRVELRLG--------RAEEAAERAARPTDERLEAYRRGESDLGLESLYFHYGRYL 363
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
L++SSR GT+ A+LQGIWN + P W+ NIN +MNYW + L++C EPLF+ +
Sbjct: 364 LMASSRTGTEAAHLQGIWNPHVQPPWNCGYTTNINTQMNYWHAEVAGLADCHEPLFELIR 423
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LS+ G++TA+++Y A GWV HH D+W +S+ G+ WA WPMGG WLC HLWEHY +
Sbjct: 424 DLSVTGARTARIHYGARGWVAHHNVDVWRQSTPSDGEASWAFWPMGGVWLCRHLWEHYEF 483
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC-VSYS 539
+D FL + AYPL++G A F DWL+ G DG L T PSTSPE++F+ PDG C VS
Sbjct: 484 GLDEQFLRETAYPLMKGAAEFCQDWLVPGPDGQLVTAPSTSPENKFLTPDGGEPCSVSAG 543
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
STMD+ +IRE+ I A+E+L +E A +++ L R+ +I DG + EW
Sbjct: 544 STMDLFLIRELLEHTIQASEILGVDE-AWRQELSHMLARMAEPQIGPDGRLQEW 596
>gi|261409383|ref|YP_003245624.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261285846|gb|ACX67817.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 799
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 234/603 (38%), Positives = 357/603 (59%), Gaps = 28/603 (4%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ +AE +TT + + PA + +A+P+GNGRLGAMV+GGV E ++ NEDTLW+G P
Sbjct: 3 LYSAEHRNTT----LWYRKPAAKWEEALPLGNGRLGAMVFGGVQEECMQWNEDTLWSGFP 58
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
D N +A + L+ R L+ SG+YAEA ++ G + + LGD+ + S +
Sbjct: 59 RDTNNYEALRYLAKARELIASGKYAEAEQLIEGRMVGRNTESFLPLGDLLIR--QSGIGD 116
Query: 120 AEETYRRELDLNTATARVKYSVG--NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ YRREL+L+ A ++ G N F+R+ F S DQV V + S SGS+ + L
Sbjct: 117 SCSEYRRELNLDMGIASTRFQGGGSNHIFSRDMFISAVDQVGVIRYECSGSGSIQLEIGL 176
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDR 231
S L + + + +++ G P + P + +D GI++ + + D
Sbjct: 177 RSPLQHRTRTEEDGTLVLHGHAPTHIADNYRGDHPGSVLYEDGLGIRYE--MRLLALTDS 234
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
G ++ ++D +++ + LL+ A+++F+G +P DP+ LQ +
Sbjct: 235 GQVT-VDDSGMRICAAGSVTLLIAAATNFEGFDRSPGSGGTDPSGICRERLQDAMRHGFE 293
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLV 350
L +RH+ D+Q LF RV +QL R P++ E +I + + ER+++++ ED +L
Sbjct: 294 QLRSRHVQDHQALFRRVELQLGR-PEN-------ERSIAALATDERMEAYREGREDSALE 345
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
L+FQFGRYLLI+SSRPGTQ A+LQGIWN + P W+S NIN EMNYW + L+E
Sbjct: 346 ALMFQFGRYLLIASSRPGTQPAHLQGIWNPHVQPPWNSDYTTNINTEMNYWPAETTRLNE 405
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
C EPL + LS++G++TA+++Y A GWV HH D+W +S G+ +WA WPMGGAWL
Sbjct: 406 CHEPLIQMIRELSVSGARTAKIHYGARGWVAHHNVDLWRMASPSDGRAMWAFWPMGGAWL 465
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
C HLWE Y + D ++L + AYPL+ G A F LD LIE +G+L T+PSTSPE++F+ +
Sbjct: 466 CRHLWERYQFQPDLEYLRETAYPLMRGAALFCLDLLIEDGEGHLVTSPSTSPENQFLTAE 525
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
G VS STMDMAIIR++F I A+++LE++ D L E+ ++ RL P I ++G +
Sbjct: 526 GLPCSVSAGSTMDMAIIRDLFHNCIEASQLLEQD-DELREEWKAAVARLLPYAIDDEGRL 584
Query: 591 MEW 593
MEW
Sbjct: 585 MEW 587
>gi|374374701|ref|ZP_09632359.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
gi|373231541|gb|EHP51336.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
Length = 855
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 234/609 (38%), Positives = 354/609 (58%), Gaps = 35/609 (5%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+ + ++ LK+ + PA + +A+P+GNG+ GAMV+GGV +E +LN++TLW+G P
Sbjct: 20 AQRSQSSQELKLWYTKPASIWEEALPLGNGKTGAMVFGGVGTERFQLNDNTLWSGAPNPG 79
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P P L+ VR LV +GQY A ++ G + Y + D+ L+ +
Sbjct: 80 NTPGGPAILAAVRKLVFAGQYDSAAVVWKQMHGPYSARYLPMADLWLKLKGADT--IASA 137
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R+LDL+TATA V Y++ V +TR+ F S PD+ +V +I+ + ++SF +L S L
Sbjct: 138 YYRDLDLHTATATVNYTLHGVRYTRQTFISYPDKAMVIRITADKKNAVSFTAALSSKLKY 197
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALE 238
+NG N ++++G+ P K + +A DD G + +++K+ GT++
Sbjct: 198 KVALNGKNGLLLKGKAP-KFVANRAYEKEQVVYDDWNGEGTNFEVQVKVIAQEGTVNG-A 255
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D++L V ++ + L ++SF+G +P KDP E+ + +Q ++ + + L H
Sbjct: 256 DEQLTVSNANAVTIYLTNATSFNGFDKSPGKEGKDPHVEATATMQRVQVMPFERLLQNHT 315
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFG 357
DY++LF+RVS + + +P+ ER+K F + +D L L +QFG
Sbjct: 316 TDYRRLFNRVSFAIENRSANA-----------KLPTNERLKVFTKAPDDFGLQTLYYQFG 364
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYL+I++SRPG+Q NLQGIWN+ + P W S VNIN EMNYW + NLSEC +PLFD
Sbjct: 365 RYLMIAASRPGSQPTNLQGIWNDQVQPPWGSNYTVNINTEMNYWPAENTNLSECHQPLFD 424
Query: 418 FLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRG--------KVVWALWPMGGA 468
F+ L++NG+ TA+VNY + GW +HH +DIWAK+S G K W+ WPM G
Sbjct: 425 FMKELAVNGAVTAKVNYGIKEGWTVHHNSDIWAKTSPPGGQGWVDPSAKTRWSCWPMAGG 484
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPSTSPEHEFI 527
W THLWEHY YT D FL AYPL++G A FL WL++ GY TNPSTSPE+ +
Sbjct: 485 WFSTHLWEHYLYTGDEAFLRNTAYPLMKGAAQFLQHWLVKDPVTGYWVTNPSTSPENT-M 543
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAE 586
+GK V+ +STMDM+IIRE+F+ +I AA VL+ DA L ++ +L P I +
Sbjct: 544 KVNGKEYEVAMASTMDMSIIRELFTDVIKAAAVLK--TDAAFAATLSTIKEKLYPFHIGQ 601
Query: 587 DGSIMEWVQ 595
G + EW +
Sbjct: 602 YGQLQEWFK 610
>gi|338213674|ref|YP_004657729.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
gi|336307495|gb|AEI50597.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
Length = 880
Score = 427 bits (1098), Expect = e-117, Method: Compositional matrix adjust.
Identities = 241/605 (39%), Positives = 356/605 (58%), Gaps = 41/605 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ F PA+ + +A+P+GNG+ GAMV+G V E +LN++TLW+G P + NP+ P L
Sbjct: 43 LKLWFTQPARIWEEALPLGNGKTGAMVFGRVNRERYQLNDNTLWSGYPIEGNNPNGPTVL 102
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDL 130
+VR + G+Y +A + K+ G Y +GD+ L+F DS Y RELDL
Sbjct: 103 PEVRKAIFEGKYDKADSLWKKMQGPYCARYLPMGDLHLDFGFRDS----TATDYYRELDL 158
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
NTA A VKY+VG V +TRE F S+P V+V +I+ ++ S++ + +L S L
Sbjct: 159 NTAVAIVKYTVGGVTYTRETFISHPASVMVVRITANKKNSINMSAALSSRLRFSVLPGET 218
Query: 191 NQIIMEGRCPGKRI------PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N+I+++G+ P K + P + +DDPKG + L +K + G I+ ++ KL +
Sbjct: 219 NEIVLKGKAP-KHVAHRAAEPQQIVYDDDPKGEGTNFELRVKAQTEGGKITN-QNGKLLI 276
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G++ + ++SF+G +P KDP+ E+ + L+ + SY+ L + H+ DYQ+L
Sbjct: 277 SGANAVTYYVAGATSFNGFDKSPGREGKDPSVETNAILKKAGSQSYAQLKSAHISDYQRL 336
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER-VKSFQTDEDPSLVELLFQFGRYLLIS 363
F RVS+ L P+ + +P+ ER ++ D L L +QFGRYLLI+
Sbjct: 337 FQRVSLDLGTDPEAL-----------KLPTDERLIRQQNGPADTHLQTLYYQFGRYLLIA 385
Query: 364 SSRPGTQ-----VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
SSR G ANLQGIWN+ + P W S NIN EMNYW + NLSEC P+ F
Sbjct: 386 SSRNGASGAAGTPANLQGIWNDHIQPPWGSNFTTNINFEMNYWLAENANLSECHLPMLQF 445
Query: 419 LTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGGAWL 470
+ +L++NG+KTA+VNY + GW+ HH TDIWAK+SA R + W+ W M GAWL
Sbjct: 446 IGHLAVNGAKTAKVNYGINEGWITHHGTDIWAKTSAGGGYEWDPRSRGSWSSWLMAGAWL 505
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
THLWEHY +T D+ FL + YPL++ A F+L WL+E G+L TNPS+SPE+ +
Sbjct: 506 STHLWEHYQFTGDQTFLRDQGYPLMKSAAQFMLHWLLEDGQGHLITNPSSSPENT-VKIS 564
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
GK ++ +STMDMAIIRE+FS I AA+ L K + A ++ ++ RL P +I + G +
Sbjct: 565 GKEYQITMASTMDMAIIRELFSDCIQAAKQL-KTDAAFQTQLEQAKARLYPYQIGQYGQL 623
Query: 591 MEWVQ 595
EW +
Sbjct: 624 QEWYR 628
>gi|158430814|pdb|2RDY|A Chain A, Crystal Structure Of A Putative Glycoside Hydrolase Family
Protein From Bacillus Halodurans
gi|158430815|pdb|2RDY|B Chain B, Crystal Structure Of A Putative Glycoside Hydrolase Family
Protein From Bacillus Halodurans
Length = 803
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 245/595 (41%), Positives = 332/595 (55%), Gaps = 37/595 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LKI F+ PA +T+A+PIGNG LGA V+G V E + LNEDTLW+G P D+ NP A + L
Sbjct: 3 LKIQFDFPASFWTEALPIGNGNLGAXVFGKVEKERIALNEDTLWSGYPKDWNNPKAKEVL 62
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR L+ +Y EA S G Y GD+ + D H + Y RELDL+T
Sbjct: 63 PKVRELIAQEKYEEADQLSRDXXGPYTQSYLPFGDLNIFXD--HGQVVAPHYHRELDLST 120
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V Y++G V++TRE F + PD+ IV +++ S+ G LSF LDSLL + S V G
Sbjct: 121 GIVTVTYTIGGVQYTRELFVTYPDRAIVVRLTASKEGFLSFRAKLDSLLRHVSSV-GAEH 179
Query: 193 IIMEGRCP--------GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ G P + P + D +G F L + + G ++ L V
Sbjct: 180 YTISGTAPEHVSPSYYDEENPVRYGHPDXSQGXTFHGRL---AAVNEGGSLKVDADGLHV 236
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L AS+SFD P S ++DP+ ++ +++I Y ++ RHL+DY KL
Sbjct: 237 XGATCATLYFSASTSFD-PSTGASCLERDPSLRTIETIKAICKRGYKEIVNRHLEDYTKL 295
Query: 305 FHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
F+RVS+ L S P D TD +R+K + + D LVELLFQ+GRYL I
Sbjct: 296 FNRVSLHLGESIAPADXSTD-------------QRIKEYGS-RDLGLVELLFQYGRYLXI 341
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
+SSRPGTQ ANLQGIWNE+ W S +NIN E NYW + CNL+E +PL F+ L
Sbjct: 342 ASSRPGTQPANLQGIWNEETRAPWSSNYTLNINAEXNYWPAETCNLAELHKPLIHFIERL 401
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
+ NG KTA++NY A GWV HH D+W +++ G VWA WP GG WL HLWEHY
Sbjct: 402 AANGKKTAEINYGARGWVAHHNADLWGQTAPVGDFGHGDPVWAFWPXGGVWLTQHLWEHY 461
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
+ D +L AYP+ + A F LDWLIE GYL T+PSTSPE F + K VS
Sbjct: 462 TFGEDEAYLRDTAYPIXKEAALFCLDWLIENEAGYLVTSPSTSPEQRFRIGE-KGYAVSS 520
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++T D+++I E F I AA+ L +ED V+ + + RL P +I + G + EW
Sbjct: 521 ATTXDLSLIAECFDNCIQAAKRLSIDED-FVKALSDAKQRLLPLQIGKRGQLQEW 574
>gi|375148572|ref|YP_005011013.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361062618|gb|AEW01610.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 850
Score = 424 bits (1090), Expect = e-116, Method: Compositional matrix adjust.
Identities = 235/597 (39%), Positives = 347/597 (58%), Gaps = 31/597 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA + +A+P+GNG+ GAMV+GGV +E L+LN++TLW+G P NP+ P L
Sbjct: 25 LKLWYNKPADAWEEALPLGNGKTGAMVFGGVATERLQLNDNTLWSGYPEAGNNPNGPTVL 84
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR V G Y +A A K+ G + Y LGD+ A TY RELDLN
Sbjct: 85 PQVRQAVFEGDYEKAAALWKKMQGPYSARYLPLGDLWWRVQSKDTLPA--TYYRELDLNK 142
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A + V+Y +G V + RE F S P +++V +I+ + G + + L S L +
Sbjct: 143 AVSTVRYKIGEVTYQRETFISYPSKLLVMRITADKKGVIDGVLDLTSKLHFKVTTTDADY 202
Query: 193 IIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+++ G+ P + P+ D G + + +KI + G + + LKV G++
Sbjct: 203 LVLRGKAPKFVANRDYEPQQVGYDSANGEGMNFEVHVKIKTEGGKVEQ-SNNALKVSGAN 261
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ L ++SF+G +P KDP++E+ + LQ L+Y L H+ DYQ LF RV
Sbjct: 262 TVTIYLSEATSFNGFNKSPGLEGKDPSTEAKANLQKALRLTYEQLKAAHMRDYQNLFKRV 321
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRP 367
+ L +P+ ER+K + ++ D L L +QFGRYLLI+SSRP
Sbjct: 322 ELNLGPG-----------NGAAKLPTDERLKQYASNPTDQQLQVLYYQFGRYLLIASSRP 370
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G++ ANLQGIWN+ + P W S NIN EMNYW + NLSEC +PLFDF+ L++NG+
Sbjct: 371 GSRPANLQGIWNDHIQPPWGSNYTTNINTEMNYWLAENTNLSECHQPLFDFMKELAVNGA 430
Query: 428 KTAQVNY-LASGWVIHHKTDIWAKSSA-------DRGKVVWALWPMGGAWLCTHLWEHYN 479
+TA+VNY ++ GWV+HH +D+WAK+S +G W+ WPM GAWL THLWEHY
Sbjct: 431 QTAKVNYNISEGWVVHHNSDLWAKTSPPGGWDWDPKGMPRWSAWPMAGAWLSTHLWEHYL 490
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
YT D+ FL K A+PL++G A F++ WLI + +G L TNPSTSPE+ + GK V
Sbjct: 491 YTGDKTFL-KNAWPLMKGAAQFMIHWLITDPANGLLVTNPSTSPENT-MKIKGKEYQVGM 548
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++TMDM+IIRE+F+A+I + VL + + ++V+K+ +L P I + G + EW +
Sbjct: 549 ATTMDMSIIRELFTAVIKTS-VLLQTDAVFRDQVIKAKEKLYPFHIGQYGQLQEWFK 604
>gi|326800263|ref|YP_004318082.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326551027|gb|ADZ79412.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 855
Score = 424 bits (1089), Expect = e-115, Method: Compositional matrix adjust.
Identities = 239/599 (39%), Positives = 362/599 (60%), Gaps = 34/599 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN + GAMV+GGV E +LN++TLW+G P NP+ PK L
Sbjct: 30 LKLWYTKPASVWEEALPLGNAKTGAMVFGGVQVERYQLNDNTLWSGFPNPGNNPNGPKIL 89
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDL 130
VR + G Y +A + ++ G + Y LGD+ L+F DS +Y+R+LDL
Sbjct: 90 PRVRRAIFDGDYEKAASLWKQMQGPYSARYLPLGDLLLDFHRPDS----LTTSYQRDLDL 145
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A + +KY+ V +TRE F S PD+ + +I+ ++ G+++F+V+L S L + + +
Sbjct: 146 DKALSTIKYTYRGVMYTRETFISRPDKTMAIRITANKPGAVAFDVALTSKLKHQTKAARH 205
Query: 191 NQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ +I++G+ P + P+ DD G + + +K+ G + +D +L V G
Sbjct: 206 DYLILQGKAPKFVANREYEPQQIVYDDRDGEGMNFEIHVKVQAIGGEVKT-DDNRLCVSG 264
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D +L L ++SF+G +P + KDP E+ + ++ SY ++ +RH+ D+ LF
Sbjct: 265 ADSVILWLTEATSFNGFDKSPGLNGKDPAVEAAACMERASKSSYQEVKSRHIADHAALFR 324
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSS 365
RVSI L + P+ + +P ER+ + + D +L L +Q+GRYLLI+SS
Sbjct: 325 RVSIDLGKDPEAV-----------RLPIDERMLRLAEGKSDNALQALYYQYGRYLLIASS 373
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG + ANLQGIWN+ + P W S NIN EMNYW + NLSEC +PLFDF+ L++N
Sbjct: 374 RPGGRPANLQGIWNDMVQPPWGSNYTTNINTEMNYWLAENTNLSECHQPLFDFMKELAVN 433
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGGAWLCTHLWEH 477
G+ TA+VNY + GWV HH +D+WAK+S +G W+ WPM GAW CTHLWEH
Sbjct: 434 GAVTAKVNYNIDDGWVTHHNSDLWAKTSPPGGYDWDPKGMPRWSAWPMAGAWFCTHLWEH 493
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACV 536
Y YT D+ FL++ AYPL++G ASF+L WLIE YL TNPSTSPE+ + GK +
Sbjct: 494 YLYTGDKKFLKEEAYPLMKGAASFMLHWLIEDPGSHYLITNPSTSPENT-VKIAGKEYQL 552
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +STMDMAIIRE+F+A I +A++L ++D EK++ + +L P I + G + EW Q
Sbjct: 553 SMASTMDMAIIRELFNACIRSADILGSDKD-FKEKLIMAKAKLYPYHIGQYGQLQEWYQ 610
>gi|315649545|ref|ZP_07902630.1| Alpha-L-fucosidase [Paenibacillus vortex V453]
gi|315275018|gb|EFU38393.1| Alpha-L-fucosidase [Paenibacillus vortex V453]
Length = 796
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 232/590 (39%), Positives = 345/590 (58%), Gaps = 24/590 (4%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA + +A+P+GNGRLGAMV+GGV E ++ NEDTLW+G P D N +A + L+
Sbjct: 10 KLWYREPAAKWEEALPLGNGRLGAMVFGGVEEERIQWNEDTLWSGFPRDTNNYEARRHLA 69
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L+ SG+Y EA K+ G + + LGD+ + H E YRRELDL+T
Sbjct: 70 AARKLITSGKYKEAEELIEDKMVGRGTESFLPLGDLLIRQSGIHGHRTE--YRRELDLDT 127
Query: 133 ATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-VNGN 190
A V++ S G+ + R+ F S DQV V + +G + ++ LDS L + + +
Sbjct: 128 GIASVRFQSGGSATYARDMFISAVDQVAVIRCAGPNYEDIRLDIRLDSPLRHGTRRCAED 187
Query: 191 NQIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+++ G P K P + ++ GI++ + + D G ++ ++D+ + +
Sbjct: 188 GSLVLYGHAPTHIADNYKGDHPGSVLYEEGLGIRYE--MRLLALPDSGQVT-VDDRGMHI 244
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
GS LL+ A+++F G +P DP+ LQ Y +L RH+ D+Q L
Sbjct: 245 NGSGPVTLLIAAATNFAGFDRSPGSGGIDPSVICRKRLQDAVQHGYEELRARHVKDHQAL 304
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLIS 363
F RV ++L + C E + ++ + ER+K++ + EDP+L L+FQFGRYLL++
Sbjct: 305 FRRVDLRLE-------SLDC-ERSTESAATDERMKAYREGQEDPALEALMFQFGRYLLMA 356
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPGTQ A+LQGIWN + P W+S NIN EMNYW + +LSEC EPL + LS
Sbjct: 357 SSRPGTQPAHLQGIWNPHVQPPWNSDYTTNINTEMNYWPAETTHLSECHEPLIQMIRELS 416
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
++G +TA+++Y A GWV HH D+W +S G+ +WA WPMGGAWLC HLWE Y + D
Sbjct: 417 VSGRRTAKIHYGARGWVAHHNVDLWRMASPSDGRAMWAFWPMGGAWLCRHLWERYQFQPD 476
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
++L AYPL+ A F LDWLIE G+L T+PSTSPE++F+ +G VS STMD
Sbjct: 477 LEYLRGTAYPLMREAALFCLDWLIEDGKGHLVTSPSTSPENQFLTAEGVPCSVSAGSTMD 536
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
MAIIR++F I A+++L ++ D L E+ + RL P + +G +MEW
Sbjct: 537 MAIIRDLFHNCIEASQLLGQDAD-LREEWESAAARLLPYGMDGEGKLMEW 585
>gi|325103197|ref|YP_004272851.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324972045|gb|ADY51029.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 868
Score = 420 bits (1080), Expect = e-114, Method: Compositional matrix adjust.
Identities = 235/606 (38%), Positives = 356/606 (58%), Gaps = 36/606 (5%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+S S N L + + P+K + +A+PIGNG GAMV+GGV E +LN TLW+G P
Sbjct: 20 AQSKSDPN-LVLWYKEPSKIWEEALPIGNGFQGAMVFGGVGKERFQLNNGTLWSGFPNPG 78
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEE 122
NP P AL VR +D G YA+A K P Y + D+ L+F+ H +
Sbjct: 79 NNPKGPAALPQVRKAIDDGDYAKAAEIWKKNNQGPYSARYLTMADLYLDFN--HKDSDVQ 136
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y+R LDLN+A V Y VG V + RE SNPD+V+ +++ + +LSF L S L
Sbjct: 137 AYKRSLDLNSAVHTVTYKVGGVTYKRETLMSNPDKVMAIRLTADKKNALSFTTDLISKLK 196
Query: 183 NHSYVNGNNQIIMEGRCPGKRI------PPKANANDDPKGIQFSAILEIKISDDRGTISA 236
+ G N +I++G+ P K + P + +++ +G+ F + +K+ ++ GT+
Sbjct: 197 YKTNAVGQNALILKGKAP-KHVAHRPTEPEQIIYDENGEGMTFE--VHLKVLNEGGTVKT 253
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+ +K + V+ ++ + L + +SF+G +P+ + K+P+ E+ + L + Y +
Sbjct: 254 VGNK-ITVQNANAVTIYLSSGTSFNGFDKSPTIAGKNPSIEASANLAAAVGKKYDVMKQA 312
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQ 355
H+ DY KLF+RV ++L P ++ +P+ R+ + Q D L L FQ
Sbjct: 313 HIADYSKLFNRVVLKLGNRP-----------DLANLPTNIRLSRQGQKGNDQELQVLYFQ 361
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYL+ISSSRPG+Q NLQG+WN+ + P W S VNIN EMNYW + NLSE PL
Sbjct: 362 FGRYLMISSSRPGSQATNLQGLWNDHVQPPWGSNYTVNINTEMNYWLAENTNLSELHYPL 421
Query: 416 FDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGG 467
FDFL L++NG +TA++NY + GWV+HH TDIWAK+S +G W+ WPMGG
Sbjct: 422 FDFLERLAVNGKETAKINYNINKGWVLHHNTDIWAKTSPTGGYDWDPKGSPRWSAWPMGG 481
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
AWL THL++HY +T D+ FL+++AYPL++G A FLL WL+ GYL TNPSTSPE+ F
Sbjct: 482 AWLSTHLYDHYLFTGDKRFLKEKAYPLMKGAAEFLLAWLVPDQSGYLITNPSTSPENTFT 541
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
+ K +S +TMD+ I+ E+F+A I +A+ L+ + + V+++ + +L P +I +
Sbjct: 542 I-NKKQYEISKGTTMDLGIMLELFNACIQSAKALDTDAN-FVKQLEAAKAKLYPYQIGKY 599
Query: 588 GSIMEW 593
G + EW
Sbjct: 600 GQLQEW 605
>gi|403743768|ref|ZP_10953247.1| aliphatic sulfonates family ABC transporter, periplasmic
ligand-binding protein [Alicyclobacillus hesperidum
URH17-3-68]
gi|403122358|gb|EJY56572.1| aliphatic sulfonates family ABC transporter, periplasmic
ligand-binding protein [Alicyclobacillus hesperidum
URH17-3-68]
Length = 804
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 239/596 (40%), Positives = 329/596 (55%), Gaps = 30/596 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP-DAPKA 71
+ + + PA +TDA+PIGNGRLG MV+GG+ E + LNEDTLW+G P P A +
Sbjct: 6 VALWYEKPAVAWTDALPIGNGRLGGMVFGGIEHERIHLNEDTLWSGYPRTLAVPRKAEET 65
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L VR LV +G+Y EA AS L G ++ Y LG +EL F+ L + YRR LDL
Sbjct: 66 LRQVRELVLAGRYQEAHEASRGLSGPYSESYLPLGWLELVFEHGDLAH---DYRRSLDLR 122
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA A V Y +G +FTRE F S+PD+ +V ++ L+F + + S L H+
Sbjct: 123 TAVATVSYRIGRTQFTREMFVSHPDEAMVIHLTADGPLPLAFTLCMGSKL-RHAIAEMAG 181
Query: 192 QIIMEGRCPGKRIPP--------KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + G+ P P + A DDP+ I+F+A + + D GT++ D L+
Sbjct: 182 DLALTGQAPIHVAPSYEVDDHPIQYAAPDDPRPIRFAARITVARCD--GTVAWCGDG-LR 238
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+EG+ LLL A ++F + P D D ++ L +R +++L +RH+ D+Q+
Sbjct: 239 IEGATRVTLLLGAGTNFRSFALRP-DEALDVSANLGRQLADLRTTPFAELKSRHVADHQR 297
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF RV L+ D E +P+ E + + LVELLF +GRYLLI+
Sbjct: 298 LFDRVEFVLADPRPD------ENEGYRDLPTDELIARYGVHAK-RLVELLFHYGRYLLIA 350
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPGTQ ANLQGIWN+ P W S +NIN EMN+W CN+ EC EPL + L+
Sbjct: 351 SSRPGTQPANLQGIWNDATRPPWSSNLTLNINAEMNFWPVEVCNIGECHEPLLRMIGELA 410
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYN 479
G + A+ Y GWV HH TDIW + A RG W++WPM G WLC HLWEHY
Sbjct: 411 QTGREVAK-RYGCRGWVAHHNTDIWRMAHAAGGDGRGDPSWSMWPMAGPWLCAHLWEHYL 469
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
++ D FL+ AYPL+ A F +DWL G PSTSPEH F+ DG+ A VS S
Sbjct: 470 FSRDHAFLQNVAYPLMRDAALFCIDWLASDPSGRGLAIPSTSPEHHFVTQDGQKAAVSAS 529
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
STMD+ ++RE+FS I AA L + + E RLRP +I DG + EW++
Sbjct: 530 STMDVMLMRELFSHCIEAASTLGVDAELSAEWAAWQ-ERLRPLRIGRDGRLQEWME 584
>gi|329930748|ref|ZP_08284172.1| Alpha-L-fucosidase 2 family protein [Paenibacillus sp. HGF5]
gi|328934680|gb|EGG31180.1| Alpha-L-fucosidase 2 family protein [Paenibacillus sp. HGF5]
Length = 673
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 232/585 (39%), Positives = 331/585 (56%), Gaps = 51/585 (8%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + +A+PIGNGRLGAM++GG+ E L+LNED++W G P D N DA L +R LV
Sbjct: 21 PATDWNEALPIGNGRLGAMIFGGIAEEKLQLNEDSVWYGGPRDRNNEDALPHLPVIRELV 80
Query: 80 DSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATAR 136
+G+ EA A A + + G P Y LGD+ + FD + + Y RELDL +R
Sbjct: 81 MNGRLHEAEALAGMAMAGLPESQRHYLPLGDLLISFDRHEMA---KDYERELDLEHGVSR 137
Query: 137 VKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ---- 192
Y +G + +TRE F+S PDQ I+ +IS + G++S + N Y+ ++
Sbjct: 138 SSYRIGEIRYTRELFASYPDQAIIMRISADKPGAVSLKARFNR--RNWRYMEKTDKWDQQ 195
Query: 193 -IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++M+G C GK G F AI++ + G + + L VE +D
Sbjct: 196 GLVMQGECGGK------------GGSSFCAIVK---ALSEGGVCKTIGEYLLVENADAVT 240
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A ++F P DP L+ + +SY++L RH+ DY +LF RV++
Sbjct: 241 LLLTAGTTFRHP---------DPELYGKRRLEELSQVSYTELLVRHIKDYTELFGRVTLS 291
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
LS SP +T+P+ +R+K + + +ED L+E FQFGRYLLISSSRPG+
Sbjct: 292 LSESPGK-----------NTLPTDDRLKRYREGEEDNGLIETYFQFGRYLLISSSRPGSL 340
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ +P WDS +NIN +MNYW + CNL+EC EPLF+ + + G TA
Sbjct: 341 PANLQGIWNDSYTPPWDSKFTININTQMNYWPAENCNLAECHEPLFELIERMREPGRVTA 400
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
V Y G+ HH TDIWA ++ + + WPMG AWLC HLWEHY + DR FL R
Sbjct: 401 GVMYGCRGFTAHHNTDIWADTAPQDTYLPASFWPMGAAWLCLHLWEHYRFGQDRYFL-AR 459
Query: 491 AYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
AY ++ A FLLD+LIE +G L T PS SPE+ + P+G+ + +TMD II +
Sbjct: 460 AYETMKEAALFLLDYLIEDGEGRLVTCPSVSPENRYKLPNGETGVLCAGATMDFQIIEAL 519
Query: 551 FSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
F A I + E++EK+E A E++ +L RL +I + G I EW++
Sbjct: 520 FEACIRSGEIIEKDE-AFREELAAALKRLPKPQIGKYGQIQEWME 563
>gi|374603684|ref|ZP_09676661.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
gi|374390787|gb|EHQ62132.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
Length = 818
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 237/624 (37%), Positives = 332/624 (53%), Gaps = 44/624 (7%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M T+ LK+ + PA +T+A+P+GNGR GAMV+GGV E ++LNEDTLW G P
Sbjct: 1 MATSKTARDEDLKLWYTRPADKWTEALPLGNGRFGAMVFGGVRRERIQLNEDTLWAGHPV 60
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEAT----AASVKLFGHPADVYQLLGDIELEFDDSHL 117
NP A + L + R L+ +G+YAEA V GH YQ LG++ LEFD
Sbjct: 61 SEYNPAAGELLPEARQLLHAGKYAEAMELIGTRMVGTEGHGIQPYQPLGNVYLEFDGPEA 120
Query: 118 KYAEET-------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS 170
Y+REL L A A G+ R F S DQV+V ++
Sbjct: 121 TGGAAGGKPAAPAYKRELQLKQALAVTSCQAGDSLEKRTVFVSAADQVMVVRLESDSPYG 180
Query: 171 LSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGK-------RIPP------KANANDDPKGI 217
+ VSLDS L++ + ++M GRCP + +PP A + + + +
Sbjct: 181 VRVTVSLDSRLEHSVAADDEGGLVMTGRCPQRVRNHNNSAVPPIAYDGDGAESEESGRAL 240
Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE 277
+F+ + + D + + D +LK+ G LL A++SF G P ++ P
Sbjct: 241 RFAVKMAVLEEDGETRVRCI-DNRLKIGGGRAVTLLFAAATSFRGYDRMPDEAAVPPAER 299
Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
+ L+ SY L H+ DY++LF RVS++L D D + +P+ ER
Sbjct: 300 CHAVLKEALRRSYGQLLDAHIQDYRRLFERVSLEL-----DDADDAGRK-----LPTDER 349
Query: 338 VKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
++ D + LLFQ+GRYLLISSSRPGTQ ANLQGIWN+++ P W+ H+NINL
Sbjct: 350 LRRIGAGGSDNGIYALLFQYGRYLLISSSRPGTQAANLQGIWNDEVQPPWNCDYHLNINL 409
Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSAD-R 455
+MNYW + C+L EC +PLF + L++ G+ ++V+Y GW+ H TD W +
Sbjct: 410 QMNYWLAEVCHLQECHDPLFRLMEELAVTGAAASRVHYGCGGWMAHAMTDQWRNHNVGPS 469
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYL 514
G WA WPMGGAWLC HLWEHY YT DR FL +RA+PLL G A+FLLDW++ E DG L
Sbjct: 470 GDPSWAYWPMGGAWLCRHLWEHYEYTRDRAFLAERAWPLLRGAAAFLLDWVVQEDEDGRL 529
Query: 515 ETNPSTSPEHEFIAPDG----KLAC-VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
T+PS SPE+ F+ P K C VS SS MDM I +++ + A +VL + D
Sbjct: 530 MTSPSVSPENAFLIPGAEEGEKQTCTVSQSSAMDMQIAYDLWMIVKQANDVLGLD-DTFA 588
Query: 570 EKVLKSLPRLRPTKIAEDGSIMEW 593
+ RL +I G +MEW
Sbjct: 589 RACEAAALRLPQPRIGARGQLMEW 612
>gi|379721956|ref|YP_005314087.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|378570628|gb|AFC30938.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
Length = 768
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 234/592 (39%), Positives = 334/592 (56%), Gaps = 33/592 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + +A+P GNGRLGAMV+GG E + LNEDTLW+G P D DA L R
Sbjct: 12 YEQPAGSWFEALPAGNGRLGAMVFGGTCRERIALNEDTLWSGEPRDTVREDAHLHLDPAR 71
Query: 77 SLVDSGQYAEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTAT 134
L+ G++AEA + P + Y LGD+EL+ D K E T YRREL L+ A
Sbjct: 72 KLIFEGRHAEAEEIIQQYMQGPDIESYLPLGDLELQSD----KEGEITDYRRELILDEAV 127
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
R +Y TRE F S DQV+ +I + L+ +SL S L G++ +
Sbjct: 128 VRTQYRTDGALQTRELFVSAADQVLALRIESEQP--LNLTISLGSPLQYAVRRTGSSGMA 185
Query: 195 MEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+ GRCP R+ P +D+P +GI F A L + + ++G I + +++V
Sbjct: 186 LSGRCP-VRVLPNTVRSDEPARYEEGRGIAFEAALHV--TAEKGRIES-SGGRIRVVSGR 241
Query: 249 WAVLLLVASSSFDGPFINPSDSK--KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
LLL A++S+DG +P+ + P + L+ L YS L RHL ++ + +
Sbjct: 242 GVTLLLAAATSYDGFDRDPAAASLAGRPRALCAERLREAAGLGYSRLKERHLREHAEKYG 301
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSS 365
RV ++L + S + D +P+ R+++ Q +DP L L FQ+GRYLL+SSS
Sbjct: 302 RVDLELG------GSAADSGADADALPTDARIRAAAQGADDPGLAALFFQYGRYLLLSSS 355
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGTQ ANLQGIWN+ L P W S+ NIN++MNYW + NL+EC EPL F+ L +
Sbjct: 356 RPGTQPANLQGIWNDKLQPPWCSSWTANINVQMNYWPAEAANLAECHEPLLRFVDDLRES 415
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G + A V+Y GW HH D+W ++ G WA WPM GAWLC HLWEHY ++ D
Sbjct: 416 GRRAASVHYRCRGWTAHHNIDLWRTATPVGGSPSWAFWPMAGAWLCEHLWEHYAFSRDEK 475
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
+L R YP+L+ A F LDWL+EG DG+L T PSTSPE+ F+ DG CV+Y+STMD+A
Sbjct: 476 YL-ARVYPVLKEAAQFGLDWLVEGPDGFLVTCPSTSPENHFLTADGSQGCVTYASTMDIA 534
Query: 546 IIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++R +F + A+ L+K+ L+E+ L+ +P P +I G + EW +
Sbjct: 535 LLRNLFGRCMEASRQLQKDTAFRVLLEQTLRRMP---PYRIGRHGQLQEWAE 583
>gi|375310399|ref|ZP_09775670.1| alpha-L-fucosidase, partial [Paenibacillus sp. Aloe-11]
gi|375077548|gb|EHS55785.1| alpha-L-fucosidase, partial [Paenibacillus sp. Aloe-11]
Length = 643
Score = 410 bits (1055), Expect = e-112, Method: Compositional matrix adjust.
Identities = 227/613 (37%), Positives = 341/613 (55%), Gaps = 48/613 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ F PA+ + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA + L
Sbjct: 10 LRLWFRQPAEVWEEALPVGNGRLGAMVFGGIRKERLQLNEDTLWSGFPRDGVQYDALRYL 69
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
VR L+ +G+Y +A + + G + YQ LGD+ + + + E T Y RELDL
Sbjct: 70 KPVRELIAAGKYKDAEHLINTHMLGCDTEAYQPLGDLWI----TQKGFGEITHYERELDL 125
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--------DSLLD 182
T TA V + + +TRE +S+PD +I+ ++ +G ++ +V + +S D
Sbjct: 126 PTGTAAVAFHSDGIRYTREVIASSPDGIIMVSLTADRAGQINASVRITTPHPCEDESGED 185
Query: 183 NHSYV---------------NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL---- 223
H V N I + GR P + D P+ + + L
Sbjct: 186 EHFAVLSQWDSDVAEGLSDEATRNCITLNGRAPSH--VESNDHGDHPQSVVYEHDLGMAF 243
Query: 224 --EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
++++ + G ++A +D + V G+D + L A++ F G + P +
Sbjct: 244 AVQVRMVSEGGIVTAKDDGTVIVSGADTLTVYLAAATGFRGFDVMPDSDPAESAEACQIT 303
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
L +L + RH D++ LF RV+++L +DT +EE I +P+ R++ +
Sbjct: 304 LDKAISLGSEQVRQRHEQDHRTLFERVALELG-------SDTRTEELI--LPTDLRLERY 354
Query: 342 -QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
Q + DP L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +MNY
Sbjct: 355 KQGEADPGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNY 414
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + CNL+EC EPL + +S G + A VNY A GW HH D+W + G W
Sbjct: 415 WPAEICNLAECHEPLLHMVGEISRTGRRVASVNYGAQGWAAHHNVDLWRYAGPSGGHASW 474
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
A WP+GG WL HLWE Y +T D +L ++AYPL++G A+F +DWLIEG DG+L T+PST
Sbjct: 475 AFWPLGGVWLTAHLWERYLFTQDTAYLAEQAYPLMKGAAAFCMDWLIEGPDGWLVTSPST 534
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPE++FI G+ +S STMDM +IRE+ I AA++LE +E+ + ++ RL
Sbjct: 535 SPENKFITSSGEECSISMGSTMDMTLIRELLGNCIQAADLLELDEE-FRNRCEETQQRLL 593
Query: 581 PTKIAEDGSIMEW 593
P ++ G + EW
Sbjct: 594 PYQMGRHGQLQEW 606
>gi|337748987|ref|YP_004643149.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|336300176|gb|AEI43279.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
Length = 827
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 233/590 (39%), Positives = 334/590 (56%), Gaps = 29/590 (4%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + +A+P GNGRLGAMV+GG E + LNEDTLW+G P D DA L R
Sbjct: 12 YEQPAGSWFEALPAGNGRLGAMVFGGTCRERIALNEDTLWSGEPRDTVREDAHLHLDPAR 71
Query: 77 SLVDSGQYAEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTAT 134
L+ G++AEA + P + Y LGD+EL+ D K E T YRREL L+ A
Sbjct: 72 KLIFEGRHAEAEEIIEQYMQGPDIESYLPLGDLELQSD----KEGEITDYRRELILDDAV 127
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
R +Y RE F S DQV+ +I + L+ +SL S L G++ +
Sbjct: 128 IRTQYRTDGALQIRELFVSAADQVLALRIESEQP--LNLTISLGSPLQYAVRRTGSSGMA 185
Query: 195 MEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+ GRCP R+ P +D+P +GI F A L + + ++G I + +++V
Sbjct: 186 LSGRCP-VRVLPNTVRSDEPARYEEGRGIAFEAALHV--TAEKGRIES-SGGRIRVVSGR 241
Query: 249 WAVLLLVASSSFDGPFINPSDSK--KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
LLL A++S+DG +P+ + P + L+ L YS L RHL ++ + +
Sbjct: 242 GVTLLLAAATSYDGFDRDPAAASLAGRPRALCAERLREAAGLGYSRLKERHLREHAEKYG 301
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSS 365
RV ++L + S + D +P+ R+++ Q +DP L L FQ+GRYLL+SSS
Sbjct: 302 RVDLELG------GSAADSGADADALPTDARIRAAAQGADDPGLAALFFQYGRYLLLSSS 355
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGTQ ANLQGIWN+ L P W S+ NIN++MNYW + NL+EC EPL F+ L +
Sbjct: 356 RPGTQPANLQGIWNDKLQPPWCSSWTANINVQMNYWPAEAANLAECHEPLLRFVDDLRES 415
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G + A V+Y GW HH D+W ++ G WA WPM GAWLC HLWEHY ++ D +
Sbjct: 416 GRRAASVHYRCRGWTAHHNIDLWRTATPVGGSPSWAFWPMAGAWLCEHLWEHYAFSRDEE 475
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
+L R YP+L+ A F LDWL+EG DG+L T PSTSPE+ F+ DG CV+Y+STMD+A
Sbjct: 476 YL-ARVYPVLKEAAQFGLDWLVEGPDGFLVTCPSTSPENHFLTADGSQGCVTYASTMDIA 534
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++R +F + A+ L+K+ A E + ++L R+ P +I G + EW +
Sbjct: 535 LLRNLFGRCMEASRQLQKD-TAFRELLEQTLRRMPPYRIGRHGQLQEWAE 583
>gi|255532589|ref|YP_003092961.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
gi|255345573|gb|ACU04899.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
Length = 868
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 232/598 (38%), Positives = 355/598 (59%), Gaps = 37/598 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK + +A+P+GNG+ GAMV+G V E +LN++TLW+G P NP P L
Sbjct: 29 LKLWYTQPAKVWEEALPLGNGKTGAMVFGRVNKERFQLNDNTLWSGSPEAGNNPKGPANL 88
Query: 73 SDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
VR V G YA A A K L G + Y + D+ L+F+ LK + T Y RELD+
Sbjct: 89 PLVRQAVFEGDYARAAALWKKNLQGPYSARYLTMADLFLDFN---LKDSIPTAYHRELDI 145
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A + V Y+VG + + RE S PD+ +V +I+ + +L+F+ S+ S L + G
Sbjct: 146 DNAISTVTYTVGGITYKRESLISYPDKAVVIRITTDQKNALNFSTSISSKLKYTARAVGA 205
Query: 191 NQIIMEGRCPGKRIPPKAN-----ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ ++++G+ P K + +A DD +G+ F ++++I + GT +A + ++ V
Sbjct: 206 DLLVLKGKAP-KHVAHRATEAAQVVYDDKEGMTFE--VDVRIKAEGGTTTA-KGTEILVS 261
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
++ + L ++SF+G +P K+P +E+ L+ + YS + T H+ DY+ LF
Sbjct: 262 KANAVTIYLSGATSFNGYNKSPGLEGKNPATEAAGILKKVYPKPYSTIKTAHVADYKALF 321
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISS 364
RVS L S ++ +P+ R+ + D L L +QFGRYL+I+S
Sbjct: 322 DRVSFSLG-----------SNAELEGLPTNVRLSRQGAMGNDQGLQVLYYQFGRYLMIAS 370
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+Q NLQGIWN+ + P W S VN N +MNYW + NLSE +PLFDF+ +++
Sbjct: 371 SRPGSQATNLQGIWNDHVQPPWGSNYTVNANTQMNYWLAEQTNLSELHQPLFDFIGRMAV 430
Query: 425 NGSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGGAWLCTHLWE 476
NG+KTA++NY + GWV+HH TDIWAKSS +G W+ WPMGGAWL THL++
Sbjct: 431 NGAKTAKINYDIRQGWVVHHNTDIWAKSSPTGGYDWDPKGAPRWSAWPMGGAWLTTHLYD 490
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLAC 535
HY +T D+ FL+++ YPL++G A F+L WL++ YL TNPSTSPE+ F +GK
Sbjct: 491 HYLFTGDKQFLKEKGYPLMKGAAEFMLKWLVKDDKTEYLVTNPSTSPENIFKI-EGKEYE 549
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
VS ++TMDM II+E+F+ I+A+++L+ + D VE + K+ +L P I G + EW
Sbjct: 550 VSKATTMDMGIIKELFTDCIAASKILDMDADFRVE-LEKAKAKLYPFNIGRYGQLQEW 606
>gi|294054095|ref|YP_003547753.1| alpha-L-fucosidase [Coraliomargarita akajimensis DSM 45221]
gi|293613428|gb|ADE53583.1| Alpha-L-fucosidase [Coraliomargarita akajimensis DSM 45221]
Length = 783
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 229/591 (38%), Positives = 340/591 (57%), Gaps = 41/591 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + ++ PA +TDA+P+GNG +GAMV+GG+ E ++ N+DTLW G P Y + DA L
Sbjct: 26 LTLRYDRPADAWTDALPVGNGSMGAMVFGGIEKERIQFNQDTLWAGEPRSYAHEDAVDVL 85
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R+L+ G+ AEAT A + P YQ GD+ ++F ++ + E Y R LD
Sbjct: 86 PEIRTLLFDGKQAEATKLAGERFMSEPLRQAAYQPFGDLWIQFP-AYGQAGE--YERSLD 142
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+ A A Y++G+VEFTR F+S PD VI +I S+ G ++F L + ++S V
Sbjct: 143 LDGALATTSYTIGDVEFTRTVFASYPDGVIAIRIEASKPGMVNFTAGLTTPHQSNSVVEP 202
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N+ + R K ++F A ++++ D G A ++V G+
Sbjct: 203 LNRNTLRLRGQVDAFTDKKETFTFEGAMRFEA--QLRVYTDGGMCQA-SGGVVEVGGATS 259
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L LVA++ F N +P S + L+++ + SY+D+ RH D++ LF R S
Sbjct: 260 ATLYLVAATDF----TNYKRLAGNPNSRCTTTLRALNSASYADVLQRHQADHRALFRRAS 315
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
I+L + + +T+P+ ER+ +Q DPSLV LLFQ+GRYLLI+SSRPG+
Sbjct: 316 IELGGT------------DANTMPTNERLNQYQAKPDPSLVALLFQYGRYLLIASSRPGS 363
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
+ ANLQG+WNE P W+S +NIN EMNYW + NLSEC EPLFD + LS+ G++
Sbjct: 364 EAANLQGLWNESQQPAWESKYTLNINAEMNYWPAELTNLSECHEPLFDLIEDLSVTGAEV 423
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+++Y A GWV HH TD+W + +A +WP GGAWLCTHLWEH+ YT DR FL+
Sbjct: 424 AELHYDARGWVAHHNTDLW-RGAAPINAANHGIWPTGGAWLCTHLWEHFLYTGDRQFLKS 482
Query: 490 RAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
RAYPL++G A F +D L+E +G+L + PS SPE + TMD I
Sbjct: 483 RAYPLMKGAAQFFVDTLVEDPVFDEGWLISGPSNSPER---------GGLVMGPTMDHQI 533
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWVQR 596
IR +F A AA+VL + DA L+ L ++ P+++ ++G + EW+ +
Sbjct: 534 IRSLFHATADAADVLGR--DAAFAAELRELAAKITPSQVGQEGQVKEWLYK 582
>gi|386724573|ref|YP_006190899.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
gi|384091698|gb|AFH63134.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
Length = 714
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 235/586 (40%), Positives = 329/586 (56%), Gaps = 42/586 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
F PAK + +A+P+GNGRLGAMV+G E ++LNEDT+W G P D NPDA + L ++R
Sbjct: 8 FKQPAKDWNEALPLGNGRLGAMVFGLPRKERIQLNEDTVWYGGPVDRHNPDALRYLPEIR 67
Query: 77 SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ SG+ AEA A++ L G P Y LGD+ + D H E YRRELDL+
Sbjct: 68 EKLLSGRLAEAHKLAAMALSGIPESQRHYMPLGDLWITMD--HPPGVAEEYRRELDLSKG 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGN 190
A + Y +G+ F RE F S+PDQ +V +I G++ F LD S + G
Sbjct: 126 VAGLHYRIGDTAFIRETFISHPDQALVLRIRADRPGAVGFTARLDRGKSRYLDEIEAAGP 185
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N ++M G C GK G F A L +D G + + L VEG+D
Sbjct: 186 NMLVMRGNCGGK------------GGSDFRAALR---ADAEGGSVRIIGEHLIVEGADAV 230
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L L A+++F ++DP + ++ L S Y+ L RH +DY+ L+ RV +
Sbjct: 231 TLYLSAATTF---------RQEDPEAYCLNTLSSAAARGYASLLERHTEDYRGLYDRVQL 281
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
L ++ TD + + +P+ ER++ + EDP L+ L FQ+GRYLLISSSRPG+
Sbjct: 282 SL-----ELQTDEAAAAAV--LPTDERLELVKKGGEDPGLIPLYFQYGRYLLISSSRPGS 334
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWNE + P WDS +NIN +MNYW + C+LSEC EPLFD + +S GS+T
Sbjct: 335 LPANLQGIWNEQMRPPWDSKYTININTQMNYWPAESCHLSECHEPLFDLIQRMSERGSRT 394
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+V Y GW HH TD+W ++ + WP+GGAWLC HLWEHY + L +
Sbjct: 395 AEVMYGCRGWTAHHNTDLWGDTAPQDIYLPATHWPLGGAWLCLHLWEHYRFGGGTARLAE 454
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YP+++G A FLLD++IE DG+L T PS SPE+ +I P+G+ + MD I RE
Sbjct: 455 -FYPVMKGAARFLLDYMIEAKDGHLITCPSVSPENTYILPNGESGTLCAGPAMDSQIARE 513
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+F A AA L +ED E L +L R+ ++AE G + EW++
Sbjct: 514 LFQACREAARELGTDEDFRSELEL-ALQRIPLPQVAEGGYLQEWLE 558
>gi|374320465|ref|YP_005073594.1| putative alpha-l-fucosidase; glycoside hydrolase family 95
[Paenibacillus terrae HPL-003]
gi|357199474|gb|AET57371.1| putative alpha-l-fucosidase; glycoside hydrolase family 95
[Paenibacillus terrae HPL-003]
Length = 829
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 228/611 (37%), Positives = 335/611 (54%), Gaps = 41/611 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PAK + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA + L
Sbjct: 10 LRLWYRQPAKVWEEALPVGNGRLGAMVFGGIGEERLQLNEDTLWSGFPRDGVQYDALRYL 69
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
VR L+ G+Y +A + + G + YQ LGD+ + + AE Y RELDL
Sbjct: 70 KPVRELIADGKYKDAEHLINANMLGRDTEAYQPLGDLWIT-QEGLGSIAE--YERELDLV 126
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL------------DS 179
T TA V + G + +TRE +S PD +I+ +++ G ++ V + D+
Sbjct: 127 TGTAAVTFQGGGIRYTREVIASAPDGIIMVRLTADTPGKINATVRITTPHSCEAEAGEDA 186
Query: 180 LLDNHSYVNGNNQ-----------IIMEGRCPGK------RIPPKANANDDPKGIQFSAI 222
+ S + + + I + GR P P++ +D G+ F+
Sbjct: 187 HFGDSSEWDNDKEDDSSGEPERDLITLTGRAPSHVESDYHGYHPQSVVYEDELGMAFA-- 244
Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL 282
++ +I + GT++ D ++V G+D + L A++ F G P + T L
Sbjct: 245 IQARIIAEGGTLTRGADGVIRVAGADKLTVYLAAATGFRGFDTQPDIDATESTGVCEVTL 304
Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
+L Y + RH D+ +LF RV ++L + TD ++ I T E+ + Q
Sbjct: 305 ARAVSLGYEQVRHRHEQDHWELFGRVELELGDEGR---TDPSTKRQIPTDLRLEQYREGQ 361
Query: 343 TDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
D D L LFQ+GRYLLI+SSR G+Q ANLQGIWN+ + P W+S NIN +MNYW
Sbjct: 362 ADLD--LEVTLFQYGRYLLIASSRSGSQPANLQGIWNDHVQPPWNSDYTTNINTQMNYWP 419
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+ CNL+EC EPL + +S G + A + Y A GW HH D+W + G WA
Sbjct: 420 AEICNLAECHEPLLHMVGEVSRTGRRVASIYYGAQGWTAHHNVDVWRYAGPSGGHASWAF 479
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WP+GG WL HLWE Y T D +L ++AYPL++G A+F +DWL+EG DG+L T+PSTSP
Sbjct: 480 WPLGGVWLTAHLWERYLLTQDTAYLAEQAYPLMKGAAAFCMDWLVEGPDGWLVTSPSTSP 539
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
E++FI PDG+ +S STMDM +IRE+ S I A E+LE + D + ++L RL P
Sbjct: 540 ENKFITPDGEHCSISMGSTMDMTLIRELLSNCIQATELLELD-DEFRNRCEETLQRLLPY 598
Query: 583 KIAEDGSIMEW 593
+I G + EW
Sbjct: 599 QIGRHGQLQEW 609
>gi|390452435|ref|ZP_10237963.1| candidate alpha-l-fucosidase; glycoside hydrolase family 95
[Paenibacillus peoriae KCTC 3763]
Length = 826
Score = 407 bits (1046), Expect = e-111, Method: Compositional matrix adjust.
Identities = 223/615 (36%), Positives = 342/615 (55%), Gaps = 48/615 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PL++ + PA+ + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA +
Sbjct: 8 QPLRLWYRQPAEVWEEALPVGNGRLGAMVFGGIREERLQLNEDTLWSGFPRDGVQYDALR 67
Query: 71 ALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRREL 128
L VR L+ +G+Y +A + + G + YQ LGD+ + + E T Y REL
Sbjct: 68 YLKPVRELIAAGKYKDAEHLINTHMLGCDTEAYQPLGDLWI----AQEGLGEITHYEREL 123
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--------DSL 180
DL T TA V + + +TRE +S+PD +I+ ++ + +G ++ +V + ++
Sbjct: 124 DLPTGTAAVAFHSDGIRYTREVIASSPDGIIMVSLTANRAGQINASVRITTPHPCEDEAG 183
Query: 181 LDNHSYV---------------NGNNQIIMEGRCPGKRIP------PKANANDDPKGIQF 219
D H V N I + GR P P++ + G+ F
Sbjct: 184 EDEHFAVLSQWDSDVAEGPSDEAARNCITLTGRAPSHVESNYHGDHPQSVVYEHDLGMAF 243
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM 279
+ ++ ++ + G ++ D + V G+D + L A++ F G P +
Sbjct: 244 A--VQARMVSEGGIVTTKADGTVIVSGADTLTIYLAAATGFRGFHTMPDSDPAESAEVCQ 301
Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
L + +L + RH D++ LF RV+++L DT +EE+I +P+ R++
Sbjct: 302 VTLDKVISLGSEQVRQRHEQDHRALFDRVALELG-------GDTRTEESI--LPTDLRLE 352
Query: 340 SF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
+ Q + DP L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +M
Sbjct: 353 RYKQGEADPRLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQM 412
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
NYW + CNL+EC EPL + +S G + A VNY A GW HH D+W + G
Sbjct: 413 NYWPAEVCNLAECHEPLLHMIGEISRTGRRVASVNYGAQGWAAHHNVDLWRYAGPSGGHA 472
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
WA WP+GG WL HLW+ Y +T D +L ++AYPL++G A+F +DWL+EG +G+L T+P
Sbjct: 473 SWAFWPLGGVWLTAHLWDRYLFTQDTAYLAEQAYPLMKGAAAFCMDWLVEGPNGWLVTSP 532
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
STSPE++FI P G+ +S STMDM +IRE+ I AA++LE +E+ + ++ R
Sbjct: 533 STSPENKFITPSGEECSISMGSTMDMTLIRELLGNCIQAADLLELDEE-FRNRCEETQQR 591
Query: 579 LRPTKIAEDGSIMEW 593
L P ++ G + EW
Sbjct: 592 LLPYQMGRHGQLQEW 606
>gi|251795949|ref|YP_003010680.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
gi|247543575|gb|ACT00594.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
Length = 787
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 234/595 (39%), Positives = 332/595 (55%), Gaps = 37/595 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA + +A+PIGNGR+G MV+ G + + LNEDTLW G P D N +A + L+
Sbjct: 8 KLWYEQPASVWEEALPIGNGRIGGMVFAGTEIDQILLNEDTLWAGFPRDPINYEAQRYLA 67
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L+ SG+YAEA + G + Y LG + + + + A Y+REL LN
Sbjct: 68 KARQLIFSGKYAEAERLIESTMQGRDVEPYLPLGGLSIVRREDR-ESAVSQYKRELHLNE 126
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A Y G+V ++F S PDQ +V + + G+L+ ++ +DSLL G Q
Sbjct: 127 GIAAACYQDGDVTVQSQYFVSVPDQALVVRYEAA-GGTLNRDIVMDSLLQYRLEEAGERQ 185
Query: 193 IIMEGRCPGK------RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ + G+ P + P ++ G+ F + +K+ D GT+ E K L+V
Sbjct: 186 LHLIGQAPSHVAGNYHKDHPMDVLYEEGLGLPFE--IRVKVETD-GTVKNGE-KGLEVRN 241
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR-----NLSYSDLYTRHLDDY 301
+ + + L A + F G + P E+ SA SIR L + L +RH +D+
Sbjct: 242 AAYLHIYLTAETGFAG-------YDQSPDQEACSARCSIRLEKAAALGFEGLLSRHTEDH 294
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYL 360
++LF RVS L+ E + P+ R+ +QT +D L L F FGRYL
Sbjct: 295 RQLFDRVSFSLA-----------DETDGSDKPTDRRLADYQTTKQDSHLEALYFHFGRYL 343
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
L+ SSRPGTQ ANLQGIWN +SP W S +NIN +MNYW + CNLSEC EPLF L
Sbjct: 344 LMGSSRPGTQPANLQGIWNHHVSPPWHSDYTININTQMNYWPAEVCNLSECHEPLFTMLR 403
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+S GS+TA+++Y + GW HH DIW ++ G WA WP+GGAWL +WE Y Y
Sbjct: 404 EMSEAGSRTARIHYGSRGWTAHHNVDIWRMTTPTGGSASWAFWPLGGAWLVRQVWESYLY 463
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
MD+DFL ++AYPLL+G A F LDWL+EG +G L TNPSTSPE++F+ +G+ VSY S
Sbjct: 464 NMDKDFLGEKAYPLLKGAALFCLDWLVEGPNGDLVTNPSTSPENKFLTSEGEPCSVSYGS 523
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD+AIIR++F + A + L E +++L SL RL KI G + EW +
Sbjct: 524 TMDIAIIRDLFQNCLEAIDALGVEEAEFRDELLASLDRLPAYKIGRHGQLQEWYE 578
>gi|337748853|ref|YP_004643015.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|336300042|gb|AEI43145.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
Length = 762
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 234/586 (39%), Positives = 330/586 (56%), Gaps = 42/586 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
F PAK + +A+P+GNGRLGAMV+G E ++LNEDT+W G P D NPDA + L ++R
Sbjct: 8 FRQPAKDWNEALPLGNGRLGAMVFGLPRKERIQLNEDTVWYGGPVDRHNPDALRYLPEIR 67
Query: 77 SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ SG+ AEA A++ L G P Y LGD+ + D H E YRRELDL+ +
Sbjct: 68 EKLLSGRLAEAHKLAAMALSGIPESQRHYMPLGDLWITMD--HPPGEAEEYRRELDLSKS 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGN 190
A + Y +G+ F RE F S+PDQ +V ++ G++ LD S + G
Sbjct: 126 VAGLHYRIGDTAFIRETFISHPDQALVLRLRADRPGAIGLTARLDRGKSRYLDEIEAAGP 185
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N ++M G C GK G F A L +D G + + L VEG+D
Sbjct: 186 NVLVMRGNCGGK------------GGSDFRAALR---ADAEGGSVRIIGEHLIVEGADAV 230
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L L A+++F ++DP + ++ L S Y+ L RH +DY+ L+ RV +
Sbjct: 231 TLYLSAATTF---------RQEDPEAYCLNTLSSAAARGYASLLERHTEDYRGLYDRVQL 281
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
L ++ TD + + +P+ ER++ + EDP L+ L FQ+GRYLLISSSRPG+
Sbjct: 282 SL-----ELQTDEAAAAAV--LPTDERLELVKKGGEDPGLIPLYFQYGRYLLISSSRPGS 334
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWNE + P WDS +NIN +MNYW + C+LSEC EPLFD + +S GS+T
Sbjct: 335 LPANLQGIWNEQMRPPWDSKYTININTQMNYWPAESCHLSECHEPLFDLIQRMSERGSRT 394
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+V Y GW HH TD+W ++ + WP+GGAWLC HLWEHY + D L +
Sbjct: 395 AEVMYGCRGWTAHHNTDLWGDTAPQDIYLPATHWPLGGAWLCLHLWEHYRFGGDTQRLAE 454
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YP+++G A FLLD++IE DG+L T PS SPE+ +I P+G+ + MD I RE
Sbjct: 455 -FYPVMKGAARFLLDYMIEAKDGHLITCPSVSPENTYILPNGESGTLCAGPAMDSQIARE 513
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+F A AA L +ED E L +L R+ ++AE G + EW++
Sbjct: 514 LFQACREAARELGTDEDFRSELEL-ALQRIPLPQLAEGGYLQEWLE 558
>gi|149199357|ref|ZP_01876394.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
gi|149137599|gb|EDM26015.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
Length = 840
Score = 405 bits (1040), Expect = e-110, Method: Compositional matrix adjust.
Identities = 228/600 (38%), Positives = 324/600 (54%), Gaps = 33/600 (5%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
++ E+ + N L + + PA H+ +A+P+GNGRLGAMV+GG+ E L+LNEDT+W+G P
Sbjct: 60 LSGEAVAPANDLSLWYRKPASHWVEALPVGNGRLGAMVYGGINKEWLQLNEDTMWSGEPV 119
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATAASVKL-----FGHPADVYQLLGDIELEFDDSH 116
+ P+ +++ R L+ +Y EA + G YQ++ D+EL F
Sbjct: 120 ERDKPNVQAGIAEARKLLFDEKYVEAQKVVEEKVMGTSLGRGTHNYQMMADLELIFPK-- 177
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+ YRR+L+L A + V+Y + RE FSS DQ I ++S E +SF+ S
Sbjct: 178 -RDEVSNYRRDLNLENAISSVQYEFAGTTYKRELFSSAVDQAIYLRLSSDEKAKISFSAS 236
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
L + + N ++++G+ + KG+ F +K+ ++ G I
Sbjct: 237 LTRPQSSQLKMMENGALVLKGQARTSKKKVIEQFPSAAKGVAFET--HLKVLNEGGKIFY 294
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
ED ++VE +D L+LVASS + G K T+ L SY T
Sbjct: 295 EEDS-IRVENADAVTLVLVASSDYYG--------DKKLTASCQKQLNHATQKSYHQARTD 345
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ DYQKLF RV + L SP + ID + + D L E FQ+
Sbjct: 346 HIQDYQKLFKRVDLDLGASPS--AHKPTDQRLIDLI---------KGQYDAQLFEQYFQY 394
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISSSRPGT ANLQG+W + L P W+S H+NIN +MNYW + NLSEC P F
Sbjct: 395 GRYLLISSSRPGTMPANLQGLWTDGLMPAWNSDFHININFQMNYWHAETTNLSECHMPAF 454
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
L L G + AQ N+ GW H TD W +S GK + +WP+GGAW HLWE
Sbjct: 455 YLLERLQERGREVAQKNFGCRGWTAGHTTDAWFFASLI-GKPQYGMWPVGGAWCSRHLWE 513
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLAC 535
HY + D+DFL RAYP+++G A F +DWL+E G L + PSTSPE+ F PDGK A
Sbjct: 514 HYEFNGDKDFLRNRAYPIMKGAALFCMDWLVENPATGLLVSGPSTSPENRFKTPDGKEAN 573
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++ TMD I+R++F+ I +AE+L +++ E L L +L PTKIA+DG IMEW +
Sbjct: 574 LTMGPTMDHQIMRDLFTNTIKSAEILNIDQEFRKELNL-ILQKLSPTKIAKDGRIMEWAE 632
>gi|320107748|ref|YP_004183338.1| alpha-L-fucosidase [Terriglobus saanensis SP1PR4]
gi|319926269|gb|ADV83344.1| Alpha-L-fucosidase [Terriglobus saanensis SP1PR4]
Length = 814
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 232/591 (39%), Positives = 333/591 (56%), Gaps = 31/591 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + PA + DA+P+GNGRLGAMV+G E + LNEDTLW G P D TNPDA L
Sbjct: 35 LTLWMETPAAQWADALPLGNGRLGAMVFGEPLKERIALNEDTLWAGQPRDTTNPDAKNHL 94
Query: 73 SDVRSLV-DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY-RRELDL 130
VR LV + Y A K+ G ++ LGD+ +E HL E T+ +R LDL
Sbjct: 95 PIVRKLVLEDKNYVAADKECQKMQGPENFAFEPLGDLHIE----HLGLTEATHLKRSLDL 150
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+TA A+ + V F+RE F S PDQV+ +I+ S+ SL+ +SL + + + +
Sbjct: 151 DTAVAKTSFQSSGVTFSREVFVSFPDQVVALRITASKPSSLNLRLSLTCEMPAKTSAHAD 210
Query: 191 NQIIMEGRCPGKRIPPKANA----NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+++ G+ P + P +++ D +G++F+A+L K + GT+ E L +
Sbjct: 211 GTLLLAGKVPTENNPQISDSIRYSEVDGEGMRFAAVLSAKA--EGGTVQP-EGDTLAISK 267
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ LLL A++ F G F P D+ E + ++ +Y+ L T+H+ D++ LF
Sbjct: 268 ATSVTLLLTAATGFRG-FAFPPDTPAAALEEKCRKGLAGKS-AYAVLKTKHVADHRALFR 325
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV L+ + D +P+ R+K+F T +DP+L+ L FQ+GRYLLI+SSR
Sbjct: 326 RVGANLNSTVPDGAN----------LPTDARLKNFPTTQDPALLALYFQYGRYLLIASSR 375
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+ + P W S NIN++MNYW NL+E PL D +++ G
Sbjct: 376 PGTQPANLQGIWNDLVRPPWSSNWTANINIQMNYWPVFTANLAELNGPLVDLTQDMTVTG 435
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+KTA VNY A GW HH D+W ++S G WA + M G WLC HL+EH+ +T D
Sbjct: 436 AKTASVNYGARGWCSHHNIDLWRQASPVGMGSGDPTWANFAMSGPWLCQHLYEHFQFTGD 495
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
D+L KR YP+L A F LDWL+ DG L T PS S E+ F P + A VS T+D
Sbjct: 496 VDYLRKRVYPILRSSALFCLDWLVPAGDGTLTTCPSFSTENNFFTPQHQKAVVSAGCTLD 555
Query: 544 MAIIREVFSAIISAAEVLEKNED-ALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+A+I E+F ISA++VL NED A +K+ +L +L P K+ G + EW
Sbjct: 556 LALIHELFGNCISASQVL--NEDQAFADKLKAALAKLPPYKVGSAGELQEW 604
>gi|379721830|ref|YP_005313961.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|378570502|gb|AFC30812.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
Length = 781
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 233/586 (39%), Positives = 329/586 (56%), Gaps = 42/586 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
F PAK + +A+P+GNGRLGAMV+G E ++LNEDT+W G P D NPDA + L ++R
Sbjct: 8 FRQPAKDWNEALPLGNGRLGAMVFGLPRKERIQLNEDTVWYGGPVDRHNPDALRYLPEIR 67
Query: 77 SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ SG+ AEA A++ L G P Y LGD+ + D H E YRRELDL+ +
Sbjct: 68 EKLLSGRLAEAHKLAAMALSGIPESQRHYMPLGDLWITMD--HPPGEAEEYRRELDLSKS 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGN 190
A + Y +G+ F RE F S+PDQ +V ++ G++ LD S + G
Sbjct: 126 VAGLHYRIGDTAFIRETFISHPDQALVLRLRADRPGAIGLTARLDRGKSRYLDEIEAAGP 185
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N ++M G C GK G F A L +D G + + L VEG+D
Sbjct: 186 NVLVMRGNCGGK------------GGSDFRAALR---ADAEGGSVRIIGEHLIVEGADAV 230
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L L A+++F ++DP + ++ L S Y+ L RH +DY+ L+ RV +
Sbjct: 231 TLYLSAATTF---------RQEDPEAYCLNTLSSAAARGYASLLERHTEDYRGLYDRVQL 281
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
L ++ TD + + +P+ ER++ + EDP L+ L FQ+GRYLLISSSRPG+
Sbjct: 282 SL-----ELQTDEAAAAAV--LPTDERLELVKKGGEDPGLIPLYFQYGRYLLISSSRPGS 334
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWNE + P WDS +NIN +MNYW + C+LSEC EPLFD + +S GS+T
Sbjct: 335 LPANLQGIWNEQMRPPWDSKYTININTQMNYWPAESCHLSECHEPLFDLIKRMSERGSRT 394
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+V Y GW HH TD+W ++ + WP+GGAWLC HLWEHY + L +
Sbjct: 395 AEVMYGCRGWTAHHNTDLWGDTAPQDIYLPATHWPLGGAWLCLHLWEHYRFGGGTARLAE 454
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YP+++G A FLLD++IE DG+L T PS SPE+ +I P+G+ + MD I RE
Sbjct: 455 -FYPVMKGAARFLLDYMIEAKDGHLITCPSVSPENTYILPNGESGTLCAGPAMDSQIARE 513
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+F A AA L +ED E L +L R+ ++AE G + EW++
Sbjct: 514 LFQACREAARELGTDEDFRSELEL-ALQRIPLPQVAEGGYLQEWLE 558
>gi|261407087|ref|YP_003243328.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261283550|gb|ACX65521.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 755
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 226/591 (38%), Positives = 332/591 (56%), Gaps = 50/591 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+PIGNGRLGAM++GG E L+LNED++W G P D N DA L
Sbjct: 12 RLWYRKPAAEWNEALPIGNGRLGAMIFGGTAEEKLQLNEDSVWYGGPRDRNNEDALPHLP 71
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ G+ EA A++ + G P Y LGD+ L F H + AE+ Y RELDL
Sbjct: 72 EIRKLIMEGRLQEAEELAAMTMAGLPEAQRHYVPLGDLLLSFG-QHGQLAED-YMRELDL 129
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+RV Y +G + +TRE F+S PDQ +V +I+ + +++F + N YV
Sbjct: 130 ERGVSRVSYRIGGIRYTRELFASYPDQAVVIRITADKQEAVTFKARFNR--RNWRYVEKT 187
Query: 191 NQ-----IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ ++M G C G+ G FSA+L+ + G + + L V+
Sbjct: 188 DKWEASGLVMRGDCGGE------------GGSSFSAVLK---AVPEGGVCRTLGEYLLVD 232
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ LLL A ++F P DP + L+ + + Y++L RH+ DY++L+
Sbjct: 233 GASSVTLLLAAGTTFRHP---------DPELDGKRRLEELSRVPYAELLARHVADYRELY 283
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGRYLLISS 364
RV ++L +P +P+ ER+K FQ +ED L+ FQFGRYLLI+S
Sbjct: 284 GRVELKLPENPDKAA-----------LPTDERLKRFQHGEEDHGLIATYFQFGRYLLIAS 332
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+ ANLQGIWN+ +P WDS +NIN +MNYW + CNL+EC EPLF+ + +
Sbjct: 333 SRPGSLPANLQGIWNDSFTPPWDSKFTININAQMNYWHAENCNLAECHEPLFELIERMRE 392
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA V Y G+ HH TDIWA ++ + + WPMG AWLC HLWEHY + DR
Sbjct: 393 PGRVTAGVMYGCRGFTAHHNTDIWADTAPQDTYLPASFWPMGAAWLCLHLWEHYRFGQDR 452
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL RAY ++ A FLLD+LIE +G L T PS SPE+ + P+G+ + +TMD
Sbjct: 453 YFL-ARAYETMKEAALFLLDYLIEDGEGRLVTCPSVSPENRYKLPNGETGVLCTGATMDF 511
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
II +F A + +AE+ ++E A E++ +L RL +I + G I EW++
Sbjct: 512 QIIEALFDACMQSAEIFGRDE-AFREELAAALKRLPKPQIGKYGQIQEWME 561
>gi|329926814|ref|ZP_08281220.1| hypothetical protein HMPREF9412_3407 [Paenibacillus sp. HGF5]
gi|328938931|gb|EGG35301.1| hypothetical protein HMPREF9412_3407 [Paenibacillus sp. HGF5]
Length = 764
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 219/566 (38%), Positives = 332/566 (58%), Gaps = 24/566 (4%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
MV+GGV E ++ NEDTLW+G P D N +A + L+ R L+ SG+YAEA ++ G
Sbjct: 1 MVFGGVQEECIQWNEDTLWSGFPRDTNNYEALRYLAKARELIASGKYAEAEQLIEGRMVG 60
Query: 97 HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE--FTREHFSSN 154
+ + LGD+ + S + + YRREL+L+T A ++ V + F+R+ F S
Sbjct: 61 RNTESFLPLGDLLIR--QSGIGDSCSEYRRELNLDTGIASTRFQVSGSDPIFSRDMFISA 118
Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG------KRIPPKA 208
DQV V + + S S+ + L S L + + + +++ G P + P +
Sbjct: 119 VDQVGVIRYESTGSSSVQLEIGLRSPLQHRTRTEEDGTLVLHGHAPTHIADNYRGDHPGS 178
Query: 209 NANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPS 268
+D GI++ + + D G ++ ++D +++ + LL+ A+++F+G P
Sbjct: 179 VLYEDGLGIRYE--MRLLALTDSGQVT-VDDSGMRISAAGSVTLLIAAATNFEGFDRFPG 235
Query: 269 DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
DP+ LQ + L +RH+ D+Q LF RV +QL R P++ E +
Sbjct: 236 SGGTDPSGICRERLQDAMRHGFEQLRSRHVQDHQALFRRVELQLGR-PEN-------ERS 287
Query: 329 IDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWD 387
I + + ER+++++ ED +L L+FQFGRYLLI+SSRPGTQ A+LQGIWN + P W+
Sbjct: 288 IAALATDERMEAYREGREDAALEALMFQFGRYLLIASSRPGTQPAHLQGIWNPHVQPPWN 347
Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
S NIN EMNYW + LSEC EPL + LS++G++TA+++Y A GWV HH D+
Sbjct: 348 SDYTTNINTEMNYWPAETTRLSECHEPLIQMIRELSVSGARTAKIHYGARGWVAHHNVDL 407
Query: 448 WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
W +S G+ +WA WPMGGAWLC HLWE Y + D ++L + AYPL+ G A F LDWLI
Sbjct: 408 WRMASPSDGRAMWAYWPMGGAWLCRHLWERYQFQPDIEYLRETAYPLMRGAALFCLDWLI 467
Query: 508 EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA 567
E +G+L T+PSTSPE++F+ +G VS STMDMAIIR++F I A+++LE++ D
Sbjct: 468 EDGEGHLVTSPSTSPENQFLTEEGLPCSVSAGSTMDMAIIRDLFHNCIEASQLLEQD-DE 526
Query: 568 LVEKVLKSLPRLRPTKIAEDGSIMEW 593
L E+ ++ RL P I +G +MEW
Sbjct: 527 LREEWKMAVERLLPYAIDNEGRLMEW 552
>gi|218263534|ref|ZP_03477615.1| hypothetical protein PRABACTJOHN_03302 [Parabacteroides johnsonii
DSM 18315]
gi|218222657|gb|EEC95307.1| hypothetical protein PRABACTJOHN_03302 [Parabacteroides johnsonii
DSM 18315]
Length = 811
Score = 403 bits (1035), Expect = e-109, Method: Compositional matrix adjust.
Identities = 232/598 (38%), Positives = 343/598 (57%), Gaps = 31/598 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPK 70
P + F PA + +A+PIGNG++GAM++GGV E ++LNE TLW+G P NP+A K
Sbjct: 22 PKTLWFEQPANQWVEALPIGNGQIGAMIFGGVEEELIQLNEGTLWSGSPLKKNVNPEAYK 81
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ VR + Y +AT K+ G + + LGD++++ D H K Y+R L L
Sbjct: 82 FLAPVREALAKEDYQQATKLCKKMQGFFTENFLPLGDLKIKQDFGH-KARVVDYKRILQL 140
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A A +++ V V +TR+ F+S PD V+V + + + L+ ++ L SLL +H NG
Sbjct: 141 DKAIASIEFVVDEVHYTRKMFTSAPDSVMVIQFTADKLRKLTLDIHLTSLLKHHVTANGK 200
Query: 191 NQIIMEGRCPG----------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ ++ G+ P R P D +G++F +L K D GTI + ++K
Sbjct: 201 DLFVLSGQAPACVDPIYYERPGREPIVQVDKDGLQGMRFQTVL--KAIPDGGTIVS-DEK 257
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ V+ ++ LLL A++SF+G +P KD S + I + ++ L RH+ D
Sbjct: 258 GIHVKDANSLTLLLSAATSFNGFNKHPDSEGKDEKVISCHRIDRIDKVDFAVLKKRHITD 317
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGRY 359
++ F RVS+ L TDT + +P+ R+K + + DP L EL FQ+GRY
Sbjct: 318 FKSYFDRVSLHL--------TDTLNSTINKKLPTDFRLKLYSYGNYDPQLEELYFQYGRY 369
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLIS+SRPG NLQG+W+ ++ P W S +NIN EMNYW + NLSE + L +F+
Sbjct: 370 LLISASRPGGSAINLQGLWSNEVRPPWASNYTININTEMNYWLAESTNLSEMHQSLLNFI 429
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLW 475
LSI G TA+ Y A GW+ HH +DIWA S++ G WA W MGG WL HLW
Sbjct: 430 KNLSITGEDTAKEYYHARGWMAHHNSDIWALSNSVGNCGDGNPSWASWYMGGNWLSLHLW 489
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY YT D++FL+ AYP+++G A F DWL+E +GYL T+PSTSPE+ F D +
Sbjct: 490 EHYCYTGDKEFLKNEAYPIMKGAALFCFDWLLE-KNGYLITSPSTSPENNFFV-DNNVYA 547
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
VS ++TMDMAII ++F+ +I A+E+L ++ E V+K RL P +I G + EW
Sbjct: 548 VSEAATMDMAIIHDLFTNVIEASEILGIDKKFRSE-VIKKKERLFPYQIGSFGQLQEW 604
>gi|376260116|ref|YP_005146836.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373944110|gb|AEY65031.1| hypothetical protein Clo1100_0760 [Clostridium sp. BNL1100]
Length = 775
Score = 402 bits (1034), Expect = e-109, Method: Compositional matrix adjust.
Identities = 225/589 (38%), Positives = 333/589 (56%), Gaps = 35/589 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA+ + +A+PIGNGRLG MV GG+ E + LN DTLW+G+PG + N + L V+
Sbjct: 7 YKSPARIWEEALPIGNGRLGGMVHGGISQECIDLNNDTLWSGLPGQHINKNILPVLPKVQ 66
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
LV+ G+ EA + + Y LG + L ++ L + Y R L LNTA
Sbjct: 67 RLVNQGKNYEAQKLIEENILTGYSQSYLPLGRLLLTYE---LSGDAKGYNRSLSLNTAVC 123
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+Y+ G V + RE S PD V+ I+ +SG+L+FN++LDS L + NN +IM
Sbjct: 124 ETRYTSGGVNYCREVICSYPDDVMAVHITADKSGALTFNITLDSQL-RYQIAKMNNTLIM 182
Query: 196 EGRCPGKRIPPKANAN--------DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
G CP IP A+ + + I+FS + + +G ++ ++ V +
Sbjct: 183 TGDCPSCMIPDYVEADKHLIYDHEEYSRSIRFSVGMRANV---KGGSLIVDADRISVTAA 239
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +L+L ++++F+G P S DP ++ M L + S+++L +RH D+ LF R
Sbjct: 240 DEVLLILSSTTNFEGFDKMPGSSGNDPLTKCMRILDNTVGYSWNELLSRHKADHAALFER 299
Query: 308 VSIQL-SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
V + L ++SP +P+ +R+ ++ DPSL LLF +GRYLLI+ S
Sbjct: 300 VCLDLGTQSP---------------MPTDKRLAAYAAGHHDPSLDSLLFAYGRYLLIACS 344
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGTQ ANLQGIWN++L+ W S NIN EMNYW + NL EC PLFD L +S
Sbjct: 345 RPGTQAANLQGIWNKELTAPWSSNYTTNINTEMNYWPAETANLPECHIPLFDLLKDVSKA 404
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
GS+ + V+Y G+V+HH TD+W +S+ G+ W WPMGGAWL H+ EHY ++ D D
Sbjct: 405 GSEISLVHYGCRGFVLHHNTDLWRMASSVSGQARWGFWPMGGAWLSIHIMEHYRFSCDTD 464
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL+ Y + E FLLD+L +GY TNPSTSPE+ FI DG++ ++ STMD+A
Sbjct: 465 FLKDYYYIMREAVL-FLLDYLKPDDNGYFLTNPSTSPENAFIDADGRICSITKGSTMDLA 523
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
IIRE+F + I A +L K + L + + L +L P +I G ++EW+
Sbjct: 524 IIRELFESCIEAQSIL-KIDSYLSGLLAQRLCKLPPFQIGSKGQLLEWL 571
>gi|310644025|ref|YP_003948783.1| candidate alpha-l-fucosidase; glycoside hydrolase family 95
[Paenibacillus polymyxa SC2]
gi|309248975|gb|ADO58542.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
[Paenibacillus polymyxa SC2]
Length = 824
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 219/612 (35%), Positives = 343/612 (56%), Gaps = 46/612 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA+ + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA + L
Sbjct: 10 LRLWYRQPAEVWEEALPVGNGRLGAMVFGGIREEHLQLNEDTLWSGFPRDGVQYDALRYL 69
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDD-SHLKYAEETYRRELDL 130
R L+ G+Y EA + + G + YQ LGD+ + ++ + + Y RELD+
Sbjct: 70 EPARKLIADGKYKEAEQLITSNMLGRDTEAYQPLGDLWITQENLGEIAH----YERELDM 125
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS----------- 179
T TA V + V +TR+ +S PD VI+ ++ ++ G + +V + +
Sbjct: 126 QTGTAAVTFQSDGVRYTRKVIASAPDGVIMVSLTANKVGKIHASVRMTTPHSCDDEAGED 185
Query: 180 --LLDNHSYVNGNNQ--------IIMEGRCPGKRIP------PKANANDDPKGIQFSAIL 223
D+ + + N+ I + GR P P++ ++ G+ F+ +
Sbjct: 186 VHFSDSSQWASDNDPSEEPTRDFITLTGRAPSHVESNYHGDHPQSVVYENDLGMAFA--V 243
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ ++ + GT++ +D L + +D + L A++ F G P+ + L
Sbjct: 244 QARVIPEGGTLTTRDDGALIISDADKITVYLAAATGFRGFQAMPNSDATESAEACKVILD 303
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+L + RH D++KLF RV+++L +DT ++E++ +P+ R++ +Q
Sbjct: 304 GAISLGSEQVRQRHEQDHRKLFDRVALELG-------SDTLTDESV--LPTDLRLERYQK 354
Query: 344 DE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
+ D L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +MNYW
Sbjct: 355 GQADRGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNYWP 414
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+ CNL+EC EPL + +S G + A ++Y A GW HH D+W + G WA
Sbjct: 415 AEVCNLAECHEPLLHMIGEVSRTGRRVASIHYGAQGWTAHHNIDVWRYAGPSAGHASWAF 474
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WP+GG WL HLWE Y +T+D +L ++AYPL++G A+F LDWL EG DG L T+PSTSP
Sbjct: 475 WPLGGVWLTAHLWERYLFTLDTTYLAEQAYPLMKGAAAFCLDWLAEGPDGRLATSPSTSP 534
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
E++FI P G+ +S STMDM +IRE+ S I AA++LE + D ++ ++ RL P
Sbjct: 535 ENKFITPGGEDCSISMGSTMDMTLIRELLSNCIQAADLLELD-DEFRKRCEETRERLVPY 593
Query: 583 KIAEDGSIMEWV 594
+I G + EW+
Sbjct: 594 QIGRHGQLQEWL 605
>gi|392304738|emb|CCI71101.1| Alpha-L-fucosidase 2 [Paenibacillus polymyxa M1]
Length = 867
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 219/612 (35%), Positives = 343/612 (56%), Gaps = 46/612 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA+ + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA + L
Sbjct: 53 LRLWYRQPAEVWEEALPVGNGRLGAMVFGGIREEHLQLNEDTLWSGFPRDGVQYDALRYL 112
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDD-SHLKYAEETYRRELDL 130
R L+ G+Y EA + + G + YQ LGD+ + ++ + + Y RELD+
Sbjct: 113 EPARKLIADGKYKEAEQLITSNMLGRDTEAYQPLGDLWITQENLGEIAH----YERELDM 168
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS----------- 179
T TA V + V +TR+ +S PD VI+ ++ ++ G + +V + +
Sbjct: 169 QTGTAAVTFQSDGVRYTRKVIASAPDGVIMVSLTANKVGKIHASVRMTTPHSCDDEAGED 228
Query: 180 --LLDNHSYVNGNNQ--------IIMEGRCPGKRIP------PKANANDDPKGIQFSAIL 223
D+ + + N+ I + GR P P++ ++ G+ F+ +
Sbjct: 229 VHFSDSSQWASDNDPSEEPTRDFITLTGRAPSHVESNYHGDHPQSVVYENDLGMAFA--V 286
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ ++ + GT++ +D L + +D + L A++ F G P+ + L
Sbjct: 287 QARVIPEGGTLTTRDDGALIISDADKITVYLAAATGFRGFQAMPNSDATESAEACKVILD 346
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+L + RH D++KLF RV+++L +DT ++E++ +P+ R++ +Q
Sbjct: 347 GAISLGSEQVRQRHEQDHRKLFDRVALELG-------SDTLTDESV--LPTDLRLERYQK 397
Query: 344 DE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
+ D L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +MNYW
Sbjct: 398 GQADRGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNYWP 457
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+ CNL+EC EPL + +S G + A ++Y A GW HH D+W + G WA
Sbjct: 458 AEVCNLAECHEPLLHMIGEVSRTGRRVASIHYGAQGWTAHHNIDVWRYAGPSAGHASWAF 517
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WP+GG WL HLWE Y +T+D +L ++AYPL++G A+F LDWL EG DG L T+PSTSP
Sbjct: 518 WPLGGVWLTAHLWERYLFTLDTTYLAEQAYPLMKGAAAFCLDWLAEGPDGRLATSPSTSP 577
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
E++FI P G+ +S STMDM +IRE+ S I AA++LE + D ++ ++ RL P
Sbjct: 578 ENKFITPGGEDCSISMGSTMDMTLIRELLSNCIQAADLLELD-DEFRKRCEETRERLVPY 636
Query: 583 KIAEDGSIMEWV 594
+I G + EW+
Sbjct: 637 QIGRHGQLQEWL 648
>gi|256424518|ref|YP_003125171.1| alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
gi|256039426|gb|ACU62970.1| Alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
Length = 841
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 236/603 (39%), Positives = 342/603 (56%), Gaps = 38/603 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAP 69
N LK+ + PA ++ A+P+GNGR+GAMV+GG E ++LNE TLW+G P NP A
Sbjct: 38 NNLKLWYKEPAIEWSQALPLGNGRVGAMVFGGTSEELIQLNEATLWSGGPVSKQVNPAAA 97
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL--EFDDSHLKYAEETYRRE 127
L VR+ + S +Y EA + K+ G + + LGDI + + D+ + Y R+
Sbjct: 98 SYLPAVRAALFSEKYHEADSLLRKMQGAFSQSFLPLGDIRIHQQLKDTLV----SQYSRD 153
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LD+ A + ++ G + +TRE F S PDQVIV ++ S+ G+L F S L + V
Sbjct: 154 LDIANAKSITRFVSGGITYTRELFISAPDQVIVIRLRSSKKGALQFKADPSSQLHYQNSV 213
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALE 238
G +I M G+ P + P N N +P KG+++ L ++ GT++ +
Sbjct: 214 TGAKEIAMRGKAPSQVDPSYINYNAEPIQYEAAGSCKGMRYE--LRMRAISPDGTVTT-D 270
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ V+ + A+LLL A++SF+G P D + + ++ LSY++L RH
Sbjct: 271 ATGITVKNATEAILLLTAATSFNGFDKCPDSEGLDEKAIAGGQMKKAAALSYANLLQRHE 330
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFG 357
DY K F+RVS+ LS ++ P+ ER++ + +D +L L FQFG
Sbjct: 331 QDYHKYFNRVSLNLS------------GDDQSAQPTDERLRRYTAGGKDQALESLYFQFG 378
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLIS SR + ANLQGIWN++L W S +NIN +MNYW + CNL E Q+PL+
Sbjct: 379 RYLLISCSRTPSAPANLQGIWNKELRAPWSSNYTININTQMNYWPAEVCNLMEMQQPLYQ 438
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGKV--VWALWPMGGAWLCTH 473
L LS+ G+ TA Y GWV HH TDIWA ++ D+GK WA W MGG WLC
Sbjct: 439 LLKELSVTGAATAGEFYNTRGWVAHHNTDIWAIANPVGDKGKGDPQWANWMMGGNWLCQF 498
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGK 532
LW+HY YT D FL AYP+++ A F LD+L++ GYL T P+TSPE++F+ +G
Sbjct: 499 LWQHYCYTGDEKFLRDTAYPIMKSAALFSLDFLVKDPASGYLVTAPATSPENKFLLANGT 558
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
VS +STMDM IIRE+F+ +I A EVL K ++ L + + + RL P KI +DGS+ E
Sbjct: 559 QESVSIASTMDMTIIRELFNNVIKAGEVL-KVDNGLRDSLQVAADRLYPFKIGKDGSLQE 617
Query: 593 WVQ 595
W +
Sbjct: 618 WYK 620
>gi|354582995|ref|ZP_09001895.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
gi|353198412|gb|EHB63882.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
Length = 758
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 228/591 (38%), Positives = 327/591 (55%), Gaps = 50/591 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+PIGNGRLGAM++GG E L+LNED++W G P D N DA L
Sbjct: 12 RLWYRKPAAEWNEALPIGNGRLGAMIFGGTAEEKLQLNEDSVWYGGPRDRNNEDALPHLP 71
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ G+ EA A++ + G P Y LGD+ L F SH Y RELDL
Sbjct: 72 EIRKLIMEGRLREAEELAAMTMAGLPEAQRHYMPLGDLLLSF--SHHDLPAVDYVRELDL 129
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+RV Y +G + +TRE F+S PDQ IV +IS + G++S + N Y+
Sbjct: 130 ENGISRVSYRIGEIRYTRELFASYPDQAIVIRISADKQGTVSLKARFNR--RNWRYLEKT 187
Query: 191 NQ-----IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ + M G C G+ G FSA+L K D G L + L V+
Sbjct: 188 DKWKESGLAMRGDCGGE------------GGSSFSAVL--KAVPDGGVCRTL-GEYLLVD 232
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ LL+ A ++F P DP + L+ + + Y++L RH+ DY++L+
Sbjct: 233 GASSVTLLITAGTTFRHP---------DPELDGKRRLEMLSRVPYAELLARHVADYRELY 283
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISS 364
RV ++L SP V +P+ ER+ FQ ED L+ FQFGRYLLI+S
Sbjct: 284 GRVDLKLPESPDKTV-----------LPTDERLMQFQQGGEDHGLIATYFQFGRYLLIAS 332
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+ ANLQGIWN++ +P WDS +NIN +MNYW + CNL+EC EPLF+ + +
Sbjct: 333 SRPGSLPANLQGIWNDNFTPPWDSKFTININAQMNYWHAENCNLAECHEPLFELIERMRE 392
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA V Y G+ HH TDIWA ++ + + WPMG AWLC HLWEHY + DR
Sbjct: 393 PGRVTAHVMYGCRGFTAHHNTDIWADTAPQDTYLPASFWPMGAAWLCLHLWEHYRFGQDR 452
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL R Y ++ A FLLD+LIE +G L T PS SPE+ + P+G+ + + MD
Sbjct: 453 YFL-ARVYETMKEAALFLLDYLIEDAEGRLVTCPSVSPENRYKLPNGETGVLCVGAAMDF 511
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
II +F A I A+E++ ++E A +++ +L RL +I + G I EW++
Sbjct: 512 QIIEALFDACIRASEIIGRDE-AFRDELTGTLKRLPQPQIGKYGQIQEWME 561
>gi|308070789|ref|YP_003872394.1| hypothetical protein PPE_04076 [Paenibacillus polymyxa E681]
gi|305860068|gb|ADM71856.1| Conserved hypothetical protein [Paenibacillus polymyxa E681]
Length = 822
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 220/612 (35%), Positives = 336/612 (54%), Gaps = 47/612 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PAK + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D + DA + L
Sbjct: 10 LRLWYRQPAKVWEEALPVGNGRLGAMVFGGIREEHLQLNEDTLWSGFPRDGVHYDALRYL 69
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
VR + G+Y EA + + G + YQ LGD+ + + E Y RELDL
Sbjct: 70 QPVRKRIADGKYKEAEQLINTNMLGRDTEAYQPLGDLWV----TQEGLGEIVHYERELDL 125
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
T TA V + V +TRE +S PD +++ ++ ++ G + +V + S V +
Sbjct: 126 LTGTAAVTFQSDGVRYTREVIASAPDGIMMVSLTANKLGRIHASVRITSPHPCEDEVGED 185
Query: 191 NQ----------------------IIMEGRCPGKRIP------PKANANDDPKGIQFSAI 222
I + GR P P++ ++ G+ F+
Sbjct: 186 AHFGDSSKWDSDNDDSSDESSGDFITLTGRAPSHVESNYHGDHPQSVVYENDLGMAFA-- 243
Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL 282
++ ++ + GT++ D L + G+D + L A++ F G P+ + L
Sbjct: 244 VQARVIPEGGTLTKGADGALIISGADKITVYLAAATGFQGFHAMPNSDATESVDACQVIL 303
Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
+L + RH D++KLF RV+++L DT + E++ +P+ +R++ +Q
Sbjct: 304 DGAISLGSEQVRQRHEQDHRKLFDRVALELG-------GDTLTNESV--LPTDQRLELYQ 354
Query: 343 TDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+ DP L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +MNYW
Sbjct: 355 KGQADPGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNYW 414
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+ CNL+EC EPL + ++ G + A ++Y A GW HH D+W + G WA
Sbjct: 415 PAEVCNLAECHEPLLHMIGEVARTGRRVASIHYGAQGWAAHHNVDVWRYAGPSGGHASWA 474
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
WP+GG WL HLWE Y +T+D +L ++AYPL++G A+F +DWL+EG G L T+PSTS
Sbjct: 475 FWPLGGVWLTAHLWERYLFTLDTAYLAEQAYPLMKGAAAFCMDWLVEGPKGRLVTSPSTS 534
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PE++F PDG+ +S STMDM +IRE+ S I AA++LE ++D + + RL P
Sbjct: 535 PENKFKTPDGEECSISMGSTMDMTLIRELLSNCIQAADLLELDDD-FRNRCEGTRARLMP 593
Query: 582 TKIAEDGSIMEW 593
+I G + EW
Sbjct: 594 YQIGRHGQLQEW 605
>gi|430742223|ref|YP_007201352.1| hypothetical protein Sinac_1268 [Singulisphaera acidiphila DSM
18658]
gi|430013943|gb|AGA25657.1| hypothetical protein Sinac_1268 [Singulisphaera acidiphila DSM
18658]
Length = 806
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 228/588 (38%), Positives = 342/588 (58%), Gaps = 35/588 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ I + PA+ +T+A+PIGNG+LGAMV+GG SE + LNEDT+W G D TNPDA K+L
Sbjct: 38 MVIHYRRPAEAWTEALPIGNGQLGAMVFGGTGSERIALNEDTVWAGERRDRTNPDALKSL 97
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
++R L+ G+ EA A A + P + YQ LGD+ + F + YRRELD
Sbjct: 98 PEIRRLLRVGKPDEAEALAERTMIAVPKRLPPYQPLGDLRILFPGHD---QADDYRRELD 154
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L++A RV Y VG+ F RE F+S DQV+V +++ G L+F+ +LD D +
Sbjct: 155 LDSAMVRVSYRVGDATFRREVFASAKDQVLVVRLTCDRPGRLAFSATLDRERDARAEAVA 214
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+++++ G I D+ K G++FSA L + R E +++V +D
Sbjct: 215 PDRVLLRGEA----IARDERHEDERKVGVKFSAFLRVVTEGGR---VFTEGDRVEVRDAD 267
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A L LVA++ F KDP + AL + + Y L + H DD++ F RV
Sbjct: 268 AATLRLVAATDF---------RSKDPDAACERALAAA-DRPYEPLRSEHEDDHRSFFRRV 317
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
S++ + +P D +++ +P+ R+ + E DP+L+ FQFGRYLLI+SSRP
Sbjct: 318 SLEFA-APGD-------KDDRAALPTDVRLARVRKGESDPALIAQYFQFGRYLLIASSRP 369
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQGIWNE L+P W+S +NIN +MNYW + NL+E +PLFD + + +G
Sbjct: 370 GTMPANLQGIWNESLTPPWESKYTININTQMNYWPAEVANLAELHQPLFDLIEAMRPSGR 429
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+TA+ Y A G++ HH TD+WA + KV LWPMG AWL HLW+HY++ DRDFL
Sbjct: 430 QTAKALYGARGFMAHHNTDLWAH-TVPVDKVGSGLWPMGAAWLSLHLWDHYDFGRDRDFL 488
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+RAYP+++ A FLLD+L++ G L PS SPE+ + DGK+A + TMD+ I
Sbjct: 489 AQRAYPVMKEAAEFLLDYLVDDGQGQLIPGPSISPENRYRTADGKVAKLCMGPTMDVEIA 548
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+F ++ A+E+L+ + D ++V ++ RL +I + G + EW++
Sbjct: 549 HALFGRVVEASELLDLDPD-FRKRVAEARRRLPSLRIGKHGQLQEWLE 595
>gi|326801540|ref|YP_004319359.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326552304|gb|ADZ80689.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 801
Score = 400 bits (1028), Expect = e-108, Method: Compositional matrix adjust.
Identities = 230/598 (38%), Positives = 330/598 (55%), Gaps = 33/598 (5%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDA 68
N LK+ ++ PA F +A+P+GNGRLGAMV+GGV E L LNE TLW+G P D NP A
Sbjct: 26 NNLKLWYSKPAGKFEEALPLGNGRLGAMVYGGVQEERLSLNEATLWSGKPVDENKVNPQA 85
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L V+ + + Y A + + G + Y+ LG++ + F + +RREL
Sbjct: 86 KDHLPAVQEALFNEDYQTADSLIRFMQGAYSQSYEPLGNLLIHFKH---QGTPTHFRREL 142
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D++ A ARV Y + + RE F+S+PDQ+IV +++ L F +SLL + S
Sbjct: 143 DISQAIARVSYQLNGTSYRREIFASHPDQLIVIRLTAEGKDRLDFTCRFNSLLRSKS-KK 201
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKL 242
+ + M G P P N +P ++F+++L++ +D + ++ +D L
Sbjct: 202 QSTSLWMHGWAPIHTEPNYRNKEKNPVVYDTLNSMRFASMLKVLKNDGQ---TSWQDSSL 258
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+ + VLLL ++S+ G NP + K+ ++S L+ S++ L +H+ DY+
Sbjct: 259 AISNAKEVVLLLSMATSYSGFDKNPGRAGKNELDLALSYLKEAEKQSFASLQAKHIQDYR 318
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLL 361
F RVSI L K +P+ ER++ F + D D +LV L +Q+ RYLL
Sbjct: 319 HYFDRVSINLGHGEKA------------NLPTDERLERFAKGDGDNNLVALFYQYSRYLL 366
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSRPG Q NLQ +WNE + P W S NIN EMNYW + NL E +PLFDF+
Sbjct: 367 ISSSRPGGQPTNLQALWNEIVRPPWSSNYTTNINTEMNYWGTEVANLPEMHQPLFDFIGR 426
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L+ G+ TA+ Y A GWV HH TDIWA + G WA W M G WL THLWEH
Sbjct: 427 LAQTGAITAKNYYNADGWVCHHNTDIWAMTHPVGHFGEGHPSWANWQMAGVWLSTHLWEH 486
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
+ +T D DFL K+AYPL++G F L +L DGYL T PSTSPE+ +I G V
Sbjct: 487 FAFTADADFLRKQAYPLMKGAVDFCLSFLTTNKDGYLVTAPSTSPENIYITDKGYKGAVL 546
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
Y ST D+A+IRE+F+ + AA +L+K++ E V +L +L P KI G++ EW
Sbjct: 547 YGSTADIAMIRELFADYLKAAVILKKDKKT-QEAVTNALAKLPPYKIGRKGNLREWYH 603
>gi|333380580|ref|ZP_08472271.1| hypothetical protein HMPREF9455_00437 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826575|gb|EGJ99404.1| hypothetical protein HMPREF9455_00437 [Dysgonomonas gadei ATCC
BAA-286]
Length = 823
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 237/602 (39%), Positives = 348/602 (57%), Gaps = 35/602 (5%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAP 69
N L++ + PA +T+A+P+GNG +G M++GGV +E ++LNE +LW+G P NP+A
Sbjct: 22 NKLQLWYEKPAGKWTEALPVGNGFIGGMIFGGVDNELIQLNEGSLWSGGPQKKNVNPEAY 81
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELE---FDDSHLKYAEETYRR 126
K L +R + Y AT K+ G+ + + LGD+ ++ D+ LK YRR
Sbjct: 82 KYLQPIREALAKEDYKLATELCKKMQGYYGESFLPLGDLHIKQTYADNRRLK----NYRR 137
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL A A ++ + V++ RE F+S PD V+V I+ S G ++ VSL+S L
Sbjct: 138 TLDLENAIATTEFEINGVKYIREIFTSAPDSVLVMHITASMPGMINLEVSLNSQLSGTLS 197
Query: 187 VNGNNQIIMEGRCPGK----------RIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
+G N+I++ G+ P + R P + + G++F +++ + S D IS
Sbjct: 198 ADGKNRIVLRGKAPARVDPNYYNKPGRNPIEQTDAEGCNGMRFQTVVQAR-SKDGAIIS- 255
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
++ + ++ + LLL A++SF+G P KD S S + +++ Y DL T
Sbjct: 256 -DNNGIYIKNATSVTLLLSAATSFNGFDKCPDSEGKDEKRISESYIAHVQDKGYYDLKTT 314
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQ 355
H++DYQK F+RVS L P +T + + +PS R+K + + DP L L F
Sbjct: 315 HINDYQKYFNRVSFSL---PNTTITRDVNRK----LPSDMRLKLYSYGNYDPELESLFFH 367
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLIS+SRPG ANLQG+WN++ P W S +NIN +MNYW + NLSE +PL
Sbjct: 368 YGRYLLISASRPGGSAANLQGLWNKEFRPPWSSNYTININTQMNYWPAEIANLSEMHQPL 427
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA--DR--GKVVWALWPMGGAWLC 471
F+ LS G+ TAQ Y A GWV HH TDIW S+A DR G WA W MGG WLC
Sbjct: 428 LQFIQNLSKTGTITAQEYYRAKGWVAHHNTDIWGLSNAVGDRGDGDPNWANWYMGGNWLC 487
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
HLWEHY +T D+ FL+ AYP+++ A F DWLIE DGYL T+PSTSPE F+ DG
Sbjct: 488 QHLWEHYQFTGDKGFLKDIAYPVMKEAALFCFDWLIE-KDGYLITSPSTSPEAAFVTADG 546
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K V+ ++TMD+AIIR++F+ +I A++ L ++ E+++K +L P KI G +
Sbjct: 547 KRYSVTEAATMDIAIIRDLFTNLIEASQELNFDK-KFREQLIKKRDKLLPYKIGSQGQLQ 605
Query: 592 EW 593
EW
Sbjct: 606 EW 607
>gi|410098957|ref|ZP_11293931.1| hypothetical protein HMPREF1076_03109 [Parabacteroides goldsteinii
CL02T12C30]
gi|409220088|gb|EKN13045.1| hypothetical protein HMPREF1076_03109 [Parabacteroides goldsteinii
CL02T12C30]
Length = 848
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 228/622 (36%), Positives = 336/622 (54%), Gaps = 52/622 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN- 65
T L + +N P++++ +A+PIGNGR GAMV+GGV E L+LNE+TL++G P
Sbjct: 21 TQKKESLVLWYNEPSENWNEALPIGNGRAGAMVFGGVDKEQLQLNENTLYSGEPSTVFKD 80
Query: 66 -PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P+ V L+ + +Y EA+ K G YQ GD+ +E + K E +
Sbjct: 81 IKITPEMFDKVVGLMKAQKYDEASDLVCKHWLGRLHQYYQPFGDLFIENN----KPGEVS 136
Query: 124 -YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y+REL+++ A R + V++ RE F+S+PD VI+ + S L +++ S
Sbjct: 137 GYKRELNISDAVTRTVFEQNGVQYEREIFASHPDDVIIVHLKSSTPDGLDLSLNFTSPHP 196
Query: 183 NHSYVNGNNQIIMEGRCPG----------------------------KRIPPKANAND-- 212
G +++++ G+ PG ++ + D
Sbjct: 197 TAKQSKGTDRLVLHGQAPGYVERRTFEQMEAWGDQYKHPELYDEKGNRKFDKRVLYGDEI 256
Query: 213 DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
D KG+ F A ++K +G + D + V ++ +L ++SF+G +PS
Sbjct: 257 DNKGMFFEA--QLKPVLPKGGDYEITDAGVHVYNTNEVYFVLSMATSFNGFDKSPSREGV 314
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP++++ L Y L RH+ DYQKLF RV +QL SP+ +
Sbjct: 315 DPSAKAAGILDKALAYDYKQLKQRHMADYQKLFDRVDLQLPSSPEQ-----------KAM 363
Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
P+ +R+ F+T DP L LLFQFGRYL+IS SRPG Q NLQGIWN+D+ P W+S +
Sbjct: 364 PTDQRIAQFETMGDPDLAALLFQFGRYLMISGSRPGGQPLNLQGIWNKDVVPAWNSGYTI 423
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
NIN EMNYW + NLSEC EPLF + L+++G++TA+ Y GWV HH T IW +S
Sbjct: 424 NINTEMNYWPAEVTNLSECHEPLFRLIDELAVSGAETARNMYNRRGWVGHHNTSIWRESV 483
Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
+ + WPM WLC+HLWEHY YT D+DFL+ RAYPL++G A F DWLI+ +G
Sbjct: 484 PNDNVPTASFWPMVQGWLCSHLWEHYLYTQDQDFLKNRAYPLMKGAAEFFADWLIDDGNG 543
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
L T SPE+ FI +GK ++ TMDMAI+RE F+ + AAE+L +E +L ++
Sbjct: 544 RLVTPVGVSPENRFIMDNGKQGAMTMGPTMDMAIVRETFTRTLQAAEMLGLDE-SLQAEL 602
Query: 573 LKSLPRLRPTKIAEDGSIMEWV 594
LPRL P +I G + EW+
Sbjct: 603 KDKLPRLLPYQIGARGQLQEWM 624
>gi|340617674|ref|YP_004736127.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339732471|emb|CAZ95739.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 807
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 226/593 (38%), Positives = 337/593 (56%), Gaps = 36/593 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
F+ PA+HF + + +GNG+ GA ++GGV ++++ LN+ TLW+G P D Y NP+A K L +
Sbjct: 37 FDRPAEHFEETLVLGNGKAGASIFGGVATDSIYLNDATLWSGEPVDPYMNPEAYKNLPAI 96
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A + KL G + Y LG + L F+ K ++Y R+L+L A +
Sbjct: 97 REALKNENYKLADSLQSKLQGSFSQSYMPLGTVYLNFEH---KNQPQSYHRQLELEKALS 153
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
V Y V V FTRE+F S+ DQ +V ++ S+ G+L+FN+ +SLL NG + +
Sbjct: 154 TVTYKVDGVTFTREYFISHADQAMVIRLKSSKKGALNFNIGFNSLLKYELATNGPT-LEV 212
Query: 196 EGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDR--GTISALEDKKLKVEGS 247
G P P P D +G +F+++ IK +D + GT D + ++ +
Sbjct: 213 NGYAPYHVEPSYRGKMPNPVQFDPNRGTRFTSLFRIKHTDGKLIGT-----DNTVALKDA 267
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
AV+ + ++SF+G NP+ D + + S L + + L+ HL D+QK F+R
Sbjct: 268 TEAVVYVSIATSFNGFDKNPATEGLDHKAMASSQLSKASSKPFDALFEAHLKDHQKYFNR 327
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSR 366
V + L +S + +P+ ER+K + + +ED +L L FQ+GRYLLISSSR
Sbjct: 328 VHLDLGKS------------TAEDLPTDERLKRYAKGEEDKNLEVLYFQYGRYLLISSSR 375
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
ANLQGIWN + P W S +NIN E NYW + NLSE +P+ F+ ++ G
Sbjct: 376 TPNVPANLQGIWNPYIRPPWSSNYTLNINAEENYWLAENANLSEMHQPMLGFIENIAQTG 435
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
TA+ Y A GW H +DIWA S+ +G + WA W MGG WL +HLWEHY ++
Sbjct: 436 KITAKTFYGAGGWAACHNSDIWAMSNPVGDFGQGGINWANWNMGGTWLSSHLWEHYTFSQ 495
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D DFL+ RAYPLL+G A F L+WL+E DG L T+P TSPE++FI PDG Y ST
Sbjct: 496 DLDFLKNRAYPLLKGAAEFCLEWLVEDKDGNLVTSPGTSPENKFITPDGYQGATLYGSTS 555
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D+A+IRE F I+A+E L K + A ++ K+L +L P ++ + G++ EW
Sbjct: 556 DLAMIRECFQQTIAASETL-KTDAAFRTQLEKALAKLYPYQVGKKGNLQEWYH 607
>gi|255035537|ref|YP_003086158.1| hypothetical protein Dfer_1752 [Dyadobacter fermentans DSM 18053]
gi|254948293|gb|ACT92993.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 833
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 236/620 (38%), Positives = 340/620 (54%), Gaps = 37/620 (5%)
Query: 1 MMNAESTSTT----NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLW 56
++NA ST LK+ ++ PA + +A+P+GNG +GAMV+GGV E ++LNE TLW
Sbjct: 12 LLNALSTDVIAQKGQDLKLWYSKPASRWVEALPVGNGHIGAMVFGGVEEELMQLNESTLW 71
Query: 57 TGVP-GDYTNPDAPKALSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD 114
+G P NP + L VR +L++ Y +A K+ G + Y + D+++ D
Sbjct: 72 SGGPVKTNVNPASASYLPQVRKALLEEQDYQKANELLKKMQGLYTESYMPMADLKIVHD- 130
Query: 115 SHLK-YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
LK Y R+LD+ + A ++S G V++ RE F+S PD ++V K+S S+ +L+F
Sbjct: 131 --LKGQPASAYYRDLDIAHSKATTRFSAGGVDYKREVFTSAPDNIMVIKVSASKPNALNF 188
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA-------NDDPKGIQFSAILEIK 226
VSL S L +GN ++++ G+ P P N DDP G +
Sbjct: 189 TVSLSSQLRYRLEASGNKELLVNGKAPSHVAPNYYNPPGQEPIIYDDPNGCNGTRFQIRT 248
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
+ RG + ++ + V+ + V+ L A++SF+G P KD + + + L
Sbjct: 249 KAVSRGGTTVVDTAGIHVKNATEVVIFLSAATSFNGFDKCPDKDGKDEKALAKNYLDKAL 308
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDE 345
Y+ L T H DY F+RVS VTDT + +PS ER+ ++ + D
Sbjct: 309 AKGYATLATSHQHDYHSYFNRVSFS--------VTDTLTRNPNTALPSDERLMAYAKGDY 360
Query: 346 DPSLVELLFQFGRYLLISSSR------PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
DP L L +QFGRYLLISSSR P ANLQGIWN+++ P W S +NIN +MN
Sbjct: 361 DPGLETLYYQFGRYLLISSSRAALPGVPAGPPANLQGIWNKEMRPPWSSNYTININTQMN 420
Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS----ADR 455
YW + NLSE PL ++ LS G+ TA+ Y A GWV HH DIW S+
Sbjct: 421 YWPAEVANLSEMHRPLLSWIKDLSQTGAVTAKEFYDAKGWVAHHNADIWGMSNPVGNVGD 480
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
G VWA W MG WLC HLWEHY ++ D+ FL + YPL++ A F LDWL+E DGYL
Sbjct: 481 GDPVWANWYMGANWLCQHLWEHYRFSGDKAFLRDKGYPLMKEAALFTLDWLVEDKDGYLV 540
Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
T PSTSPE++F P G A VS ++TMD++II ++FS +I AAEVL +ED + +++
Sbjct: 541 TAPSTSPENKFKDPKGGEAAVSVATTMDISIIHDLFSNLIDAAEVLGTDED-FRKLLIEK 599
Query: 576 LPRLRPTKIAEDGSIMEWVQ 595
+L P KI G + EW +
Sbjct: 600 RAKLYPLKIDGRGRLQEWYK 619
>gi|423342630|ref|ZP_17320344.1| hypothetical protein HMPREF1077_01774 [Parabacteroides johnsonii
CL02T12C29]
gi|409217547|gb|EKN10523.1| hypothetical protein HMPREF1077_01774 [Parabacteroides johnsonii
CL02T12C29]
Length = 844
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 223/626 (35%), Positives = 335/626 (53%), Gaps = 50/626 (7%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M E T PL + ++ PA+++ +A+PIGNGR GAM++G +E L+LNE+TL++G P
Sbjct: 14 MACEETPQKEPLVLWYDSPARNWDEALPIGNGRSGAMIFGRTDNEQLQLNENTLYSGEPS 73
Query: 62 DYTN--PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLK 118
P+ V L+ +G+Y EA+ K G YQ GD+ ++ ++ +
Sbjct: 74 VVFKDVKITPEMFDKVVGLMKAGKYTEASDLVCKNWLGRLHQYYQPFGDLHIQ---NNKQ 130
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+R L+++ A A Y G + RE F+S+PD VIV ++ + + +++
Sbjct: 131 GEANRYKRTLNISDAVATTVYEQGGTHYEREVFASHPDNVIVMRLKSNTPDGIDISLNFT 190
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGK----------------RIPPKANAND---------- 212
S ++++I+ G+ PG + P +AN
Sbjct: 191 SPHPTALQKGRDDRLILHGQAPGYVERRTFEQIEQWGDPYKHPELYDANGKRKFNKRMLY 250
Query: 213 ----DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPS 268
D KG+ F A L+ D + D + V +D +L ++SF+G +PS
Sbjct: 251 GEEIDGKGMFFEAQLKPVFPKD--GKCDITDSGIHVYDTDEVYFVLSMATSFNGFDKSPS 308
Query: 269 DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
DP++++ L + +Y L RH +DY+ LF+RV +L+ SP+
Sbjct: 309 REGIDPSAKAAGILDKALSYNYQTLKQRHTEDYRSLFNRVDFKLASSPEQ---------- 358
Query: 329 IDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
+P+ +R++ F DP L LLFQFGRYL+IS SRPG Q NLQG+WN+D P W+
Sbjct: 359 -KAMPTDKRIEQFAQTADPELAALLFQFGRYLMISGSRPGGQPLNLQGMWNKDTIPAWNC 417
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
+NIN EMNYW + NLSECQ+PLF + L+++G++TA+ Y GWV HH T IW
Sbjct: 418 GYTININTEMNYWPAELTNLSECQQPLFRMIRELAVSGAETARNMYNRRGWVAHHNTSIW 477
Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
+S + + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLIE
Sbjct: 478 RESLPNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIE 537
Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
+GYL T SPE+ FI DG+ A +S TMDMAIIRE F+ I A+E+ +E +L
Sbjct: 538 DENGYLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIEASEMFNLDE-SL 596
Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWV 594
++ L RL+P +I E G + EW+
Sbjct: 597 RNELKNKLARLQPYQIGERGQLQEWI 622
>gi|218258383|ref|ZP_03474775.1| hypothetical protein PRABACTJOHN_00430 [Parabacteroides johnsonii
DSM 18315]
gi|218225510|gb|EEC98160.1| hypothetical protein PRABACTJOHN_00430 [Parabacteroides johnsonii
DSM 18315]
Length = 844
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 223/626 (35%), Positives = 335/626 (53%), Gaps = 50/626 (7%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M E T PL + ++ PA+++ +A+PIGNGR GAM++G +E L+LNE+TL++G P
Sbjct: 14 MACEETPQKKPLVLWYDSPARNWDEALPIGNGRSGAMIFGRTDNEQLQLNENTLYSGEPS 73
Query: 62 DYTN--PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLK 118
P+ V L+ +G+Y EA+ K G YQ GD+ ++ ++ +
Sbjct: 74 VVFKDVKITPEMFDKVVGLMKAGKYTEASDLVCKNWLGRLHQYYQPFGDLHIQ---NNKQ 130
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+R L+++ A A Y G + RE F+S+PD VIV ++ + + +++
Sbjct: 131 GEANRYKRTLNISDAVATTVYEQGGTHYEREVFASHPDNVIVMRLKSNTPDGIDISLNFT 190
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGK----------------RIPPKANAND---------- 212
S ++++I+ G+ PG + P +AN
Sbjct: 191 SPHPTALQKGRDDRLILHGQAPGYVERRTFEQIEQWGDPYKHPELYDANGKRKFNKRMLY 250
Query: 213 ----DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPS 268
D KG+ F A L+ D + D + V +D +L ++SF+G +PS
Sbjct: 251 GEEIDGKGMFFEAQLKPVFPKD--GKCDITDSGIHVYDTDEVYFVLSMATSFNGFDKSPS 308
Query: 269 DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
DP++++ L + +Y L RH +DY+ LF+RV +L+ SP+
Sbjct: 309 REGIDPSAKAAGILDKALSYNYRTLKQRHTEDYRSLFNRVDFKLASSPEQ---------- 358
Query: 329 IDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
+P+ +R++ F DP L LLFQFGRYL+IS SRPG Q NLQG+WN+D P W+
Sbjct: 359 -KAMPTDKRIEQFAQTADPELAALLFQFGRYLMISGSRPGGQPLNLQGMWNKDTIPAWNC 417
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
+NIN EMNYW + NLSECQ+PLF + L+++G++TA+ Y GWV HH T IW
Sbjct: 418 GYTININTEMNYWPAELTNLSECQQPLFRMIRELAVSGAETARNMYNRRGWVAHHNTSIW 477
Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
+S + + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLIE
Sbjct: 478 RESLPNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIE 537
Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
+GYL T SPE+ FI DG+ A +S TMDMAIIRE F+ I A+E+ +E +L
Sbjct: 538 DENGYLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIEASEMFNLDE-SL 596
Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWV 594
++ L RL+P +I E G + EW+
Sbjct: 597 RNELKNKLARLQPYQIGERGQLQEWI 622
>gi|332668180|ref|YP_004450968.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332336994|gb|AEE54095.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 861
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 231/620 (37%), Positives = 342/620 (55%), Gaps = 52/620 (8%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNP 66
+ N L++ ++ PA +T+A+PIGNG +GAMV+G E L+LNE TL++G P G +T+
Sbjct: 17 AQNNHLQLWYDQPASVWTEALPIGNGYMGAMVFGDPLQEHLQLNEGTLYSGDPKGTFTSI 76
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
+ KA V +L+++ +Y EA K G +YQ +GD+ L D H K + + Y+
Sbjct: 77 NVRKAYPQVTALLEAKKYQEAQPLITKEWLGRNHQMYQPMGDLWL--DVEHDKSSIKAYK 134
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL TATA +Y G+ + R +F+S PD V+V K++ + G + N +L + +
Sbjct: 135 RGLDLQTATAFTEYQSGSTTYRRTYFTSYPDHVLVMKMTATGPGKI--NCTLRQSTPHTA 192
Query: 186 ---YVNGNNQIIMEGRCPG---------------------------KRIPPKANANDDPK 215
Y+ N + M+ R PG +R P AN D +
Sbjct: 193 PAKYLGQGNVLRMQSRAPGFALRRNFDLVEKLGDQHKYPELYEKTGERKPGAANFLYDQQ 252
Query: 216 --GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
G+ + +K+ GTIS + D K++V+ + V++L A++S++G +P+ KD
Sbjct: 253 IEGLGMAFESRLKVIHTGGTISNV-DGKIRVQNATELVIILSAATSYNGFDKSPAYEGKD 311
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P + ++I N +S LY RHL DYQ LF RV I L+ +E +P
Sbjct: 312 PAKLLDTYFRAIDNKPFSTLYQRHLLDYQNLFKRVEINLA-----------AETEQSKLP 360
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ RV+ F +DP+ L FQFGRYL+I+ SRPG Q NLQGIWN+ L+P W+ A +N
Sbjct: 361 TDRRVELFSNGQDPAFAALYFQFGRYLMIAGSRPGGQPLNLQGIWNDQLTPPWNGAYTIN 420
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
IN +MNYW + NL+ECQEP F + L+ING +TA+ Y +GWV HH DIW + +
Sbjct: 421 INAQMNYWPAEITNLAECQEPFFKAIKELAINGRETARNMYGNAGWVAHHNMDIW-RHAE 479
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
+ WPMGG WL +HLWEHY ++ D+ FL+ +PLL+G F WL++ GY
Sbjct: 480 PIDNCACSFWPMGGGWLVSHLWEHYLFSGDQQFLKNEVFPLLKGVVDFYQGWLVKNEAGY 539
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
L T SPE F+ K A S TMDMAI+RE F+ + AA+VL D V+ V
Sbjct: 540 LVTPVGHSPEQNFVYEGNKQATYSPGPTMDMAIVREAFARYLEAAQVLGV-ADKSVDSVR 598
Query: 574 KSLPRLRPTKIAEDGSIMEW 593
++L +L P +I + G + EW
Sbjct: 599 QNLAKLLPYQIGKYGQLQEW 618
>gi|224539148|ref|ZP_03679687.1| hypothetical protein BACCELL_04050 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519233|gb|EEF88338.1| hypothetical protein BACCELL_04050 [Bacteroides cellulosilyticus
DSM 14838]
Length = 822
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 236/599 (39%), Positives = 338/599 (56%), Gaps = 34/599 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-DYTNPDAPKA 71
L++ + PA + +A+P+GNG +GAMV+G V +E ++LNE TLWTGVP NPDA
Sbjct: 24 LRLWYEKPANTWVEALPLGNGYIGAMVYGKVENELIQLNEGTLWTGVPCVKSVNPDAYSY 83
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELE--FDDSHLKYAEETYRRELD 129
LS++R + +A A S K+ G+ + + LGD+E++ F D Y Y+RELD
Sbjct: 84 LSEMREALSRDDFAAAGTLSKKMQGYFSQSFLPLGDLEIKQSFGDRKAWYL--GYKRELD 141
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
LN A + G V++ RE F+S PD+V+V + + S+ G L+ + + S L + G
Sbjct: 142 LNEAILTTSFWEGGVQYVREMFTSAPDRVMVLRFTASQKGKLALDFTTKSRLSDAVEALG 201
Query: 190 NNQIIMEGRCPGKRIPPKANAN----------DDPKGIQFSAILEIKISDDRGTISALED 239
+N + M+G P + P N + G++F ++L K GT++ +
Sbjct: 202 DNCLAMDGAAPARLDPAYYNRKGREPMMRVDENGCSGMRFRSLL--KAIPVGGTVTT-DK 258
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
K + + G+D +++ A++SF+G P+ KD + L S+ +L H+
Sbjct: 259 KGIHINGADEILVIWTAATSFNGFDKCPACEGKDEKMLAGQYLAKASIKSFDELKDSHIR 318
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGR 358
D+ F RVS+QL TDT + +PS R+K + + DP L ELLFQ+GR
Sbjct: 319 DFASYFERVSLQL--------TDTVGSKVNAQLPSDFRLKLYSYGNYDPQLEELLFQYGR 370
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSR G ANLQGIWN+D P W S +NIN EMNYW + NLSE PL +
Sbjct: 371 YLLISSSRLGGTAANLQGIWNKDFRPPWSSNYTININTEMNYWLAETTNLSEMHTPLLSW 430
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHL 474
+ LS G TA+ Y A GWV HH +DIW S + G WA W MGG WLC HL
Sbjct: 431 IKDLSKAGRATAKEFYHAKGWVAHHNSDIWGLSNPVGNKGDGSPEWANWTMGGNWLCQHL 490
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WEHY +T D+ FL AYP+++ A F LDWL+E D YL T+PS SPE+ F+ DGK
Sbjct: 491 WEHYCFTGDKQFLADEAYPVMKEAALFCLDWLVERGD-YLITSPSVSPENLFVV-DGKKY 548
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
VS +STMDMAIIR++FS +I A+EVL + ++++ + +L P +I G + EW
Sbjct: 549 AVSEASTMDMAIIRDLFSNLIEASEVLNIDRK-FRKQLVTAKNKLFPYQIGAKGQLQEW 606
>gi|255530725|ref|YP_003091097.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
gi|255343709|gb|ACU03035.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
Length = 786
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 221/589 (37%), Positives = 343/589 (58%), Gaps = 32/589 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
+N PA+ F + + +GNG+LGA V+GG+ S+ + LN+ TLW+G P + Y NP+A K + +
Sbjct: 32 YNKPAQFFEETMVLGNGKLGAAVFGGIKSDKIFLNDATLWSGEPVNPYMNPEAYKQIPSI 91
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A + K+ G + Y LG + ++F+ + + YRRELD++ + +
Sbjct: 92 REALKNENYKLANELNRKVQGAFSQSYAPLGTMHIKFNHTD---SASMYRRELDISKSLS 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
++ Y+V V FTRE+F S P +V++ K++ S+ G+LSFNV +SLL N N + +
Sbjct: 149 KITYNVSGVTFTREYFISKPARVMMIKLTSSKKGALSFNVDFESLLK-FEITNQGNTLRV 207
Query: 196 EGRCPGKRIPP-KAN-AN----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+G P P + N AN D+ +G +FS++ IK +D + I + + ++
Sbjct: 208 KGYAPYHAEPVYRGNIANSVKFDENRGTRFSSLFRIKNTDGQVII---QHGSIGLKNGTE 264
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A+L + +SF+G NP+ K + S L+ + ++Y + H++DYQ F+RVS
Sbjct: 265 AILYIAIETSFNGFDKNPATEGKSDALLADSCLKKVVPVNYESVKHAHINDYQNYFNRVS 324
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
L ++ N +P+ ER+K + + ED +L L FQFGRYLLISSSR
Sbjct: 325 FNLGKT------------NAPELPTDERLKRYAEGKEDKNLEILYFQFGRYLLISSSRTA 372
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIWN + P W S NINL+ NYW + NLSE EPL F+ +++ G
Sbjct: 373 GVPANLQGIWNPYIRPPWSSNYTTNINLQENYWLAENTNLSELHEPLMKFIGHVAHTGKV 432
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA+ Y GW + H +DIWA S+ +G VWA W MGG WL THLWEHY +T+D+
Sbjct: 433 TAKTFYGVEGWALCHNSDIWAMSNPVGGFGQGDPVWANWNMGGTWLSTHLWEHYIFTLDK 492
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+FL+++AYPL++G A F L+WL++ G L T+PSTSPE FI DG Y T D+
Sbjct: 493 NFLKQKAYPLMKGAARFCLNWLVKDKKGNLITSPSTSPEASFITADGSKGSTLYGGTADL 552
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A+IRE F I A+++L + ++V +L +L+P ++ ++G++ EW
Sbjct: 553 AMIRECFLQTIRASQIL-GTDITFRKEVESALRQLQPYQVGKNGNLQEW 600
>gi|336429038|ref|ZP_08609009.1| hypothetical protein HMPREF0994_05015 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336003732|gb|EGN33810.1| hypothetical protein HMPREF0994_05015 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 779
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 227/583 (38%), Positives = 323/583 (55%), Gaps = 35/583 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+++ + PA ++ +A+P+GNGRLGAMVW G E + LNED+LW+G P + A +
Sbjct: 1 MELWYKEPASYWEEALPLGNGRLGAMVWSGTDQEKISLNEDSLWSGYPQSHDISGAAEYY 60
Query: 73 SDVRSLVDSGQYAEATAA-SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
R L +Y EA A + G Y LG EL D +H + Y+R L+L
Sbjct: 61 LQARRLSMEKKYEEAQALLEQNVLGEYTQSYLPLG--ELTLDMAHPEGEIRNYKRALELE 118
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A +R++YS G+ +TRE F S PDQV+V IS G +S L + N
Sbjct: 119 KALSRLEYSAGDTNYTREMFISAPDQVMVMHISADRPGMVSLKAGFSCQLRAEVSIE-EN 177
Query: 192 QIIMEGRCPGKRIPPKANAND--------DPKGIQFSAILEIKISDDRGTISALEDKKLK 243
++I++G P + P ++ D + KG+QF A+LEI + + G + L + L+
Sbjct: 178 RMILDGIAPSQVDPSYIDSPDPVIYEDAPEKKGMQFCAVLEIDV--EGGEMKRLPEG-LE 234
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V +D L L A +SF+GPF +P K + LQ+ R + Y L RH+++YQ+
Sbjct: 235 VIHADSVTLFLAARTSFNGPFRHPFLEGKPYKEPCFAELQAAREMGYDRLLERHIEEYQQ 294
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L +++ P ER+ + D DP+ LLFQ+GRYLLIS
Sbjct: 295 YFNRVSMDLGPGREEL-------------PVPERLADWDKDVDPARFTLLFQYGRYLLIS 341
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPGTQ ANLQGIWN+ L W S VNIN EMNYW + NL E EPLFD + L
Sbjct: 342 SSRPGTQPANLQGIWNQHLRAPWSSNYTVNINTEMNYWGAETVNLPEMHEPLFDLIRNLR 401
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTHLWEHYN 479
I+G TA+++Y A G+V HH +DIW S+ +RGK V+A WP+ WL H+++HY
Sbjct: 402 ISGGNTARIHYNAGGFVSHHNSDIWCLSTPVGNRGKGTAVYAFWPLSAGWLSAHVYDHYL 461
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
++ D DFL + YP++ A F LD L E DG L PSTSPE++FI GK+ VS +
Sbjct: 462 FSGDLDFLRQTGYPVIHDAARFFLDVLTENEDGELIFAPSTSPENQFIY-HGKVCAVSQT 520
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLR 580
+TM MAI+REV + +L +++ L E+ L LP R
Sbjct: 521 TTMTMAIVREVLENAAACCRLLGIDQEFLAEAEEALGRLPSYR 563
>gi|354584579|ref|ZP_09003473.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
gi|353194100|gb|EHB59603.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
Length = 761
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 210/565 (37%), Positives = 321/565 (56%), Gaps = 23/565 (4%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
MV+GG+ E ++ NEDTLW+G P D N +A + L R L+ S +YAEA ++ G
Sbjct: 1 MVFGGIQEERIQWNEDTLWSGFPRDTNNYEALRYLQAARELIASEKYAEAEKLIEERMVG 60
Query: 97 HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE-FTREHFSSNP 155
+ + LGD+ +E + + + YRRELDL A V + G E F RE F S
Sbjct: 61 RNTEAFLPLGDLLIE--QTGIDDWQSNYRRELDLGNGVASVVFRTGRGEHFQREMFISAA 118
Query: 156 DQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG------KRIPPKAN 209
DQ+ V + +GS GS+ + L S L + + + + G P + P++
Sbjct: 119 DQIAVIRYTGSAEGSIHLKLKLQSPLRYETEITPGGVMRLFGHAPTHIADNYRGDHPQSV 178
Query: 210 ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD 269
++ G+++ +++ + D G I + L V G+ L + A++ F+G + P
Sbjct: 179 LYEEGSGLRYE--MQVAVRADGGRI-GINGDVLTVTGASAVTLHVAAATDFEGFDVMPGA 235
Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
DP + L++ L RH +++ LF RV+++L D +
Sbjct: 236 KGSDPARLCSARLEAAAGYDDEALRLRHTEEHWALFGRVAVELG--------DAEHRARM 287
Query: 330 DTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
+ +P+ +R+ ++ EDPSL L+FQ+GRYLL++SSRPGTQ A+LQG+WN + P W+S
Sbjct: 288 EAIPTDQRLAAYAGGQEDPSLEALMFQYGRYLLMASSRPGTQPAHLQGLWNPHVQPPWNS 347
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
NIN EMNYW + NLSEC EPL + L+++G++TA+++Y A GW HH D+W
Sbjct: 348 NYTTNINTEMNYWAAETGNLSECHEPLIQMVRELAVSGARTAKIHYNARGWAAHHNVDLW 407
Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
++ G+ +WA WPM G WLC HLWEHY + D ++L AYPL+ A F LDWLIE
Sbjct: 408 RMANPSNGRAMWAFWPMAGPWLCRHLWEHYVFNPDPEYLRNTAYPLMREAALFCLDWLIE 467
Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
+G+L T+PSTSPE++F+ +G VS STMDMA+IRE+F + A+E+LE + + L
Sbjct: 468 NGEGHLVTSPSTSPENQFLTKEGVPCSVSAGSTMDMALIRELFRHCLEASELLEIDRE-L 526
Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEW 593
E++ +L RL P +I +DG +MEW
Sbjct: 527 QEELRSALERLLPYQIDDDGRLMEW 551
>gi|414868292|tpg|DAA46849.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
Length = 457
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 214/404 (52%), Positives = 269/404 (66%), Gaps = 30/404 (7%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 41 PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS VRSLV++G+Y EAT+A+ L G V+Q LGDI+L F + +KY YRRELDL+
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSGDQTQVFQPLGDIDLVFGED-IKYTN--YRRELDLH 157
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT V Y+VG++ +TREHFSSNP QVIVTKIS ++ G++SF VSL S LD+ V N
Sbjct: 158 TATVTVTYTVGDIVYTREHFSSNPHQVIVTKISANKPGNVSFTVSLTSPLDHKIRVTHAN 217
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG+R A D P GI+FSAIL ++I+ T+ L D LK++ +D V
Sbjct: 218 EIIMEGSCPGQRPEEIKTAADQPIGIKFSAILYLQINGANSTVEVLNDNMLKLDCADSVV 277
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF FI PS+SK DPT + + L R SYS L H+DDYQ LF RVS+Q
Sbjct: 278 LLLAATTSFQSAFIKPSESKLDPTVSAFTTLSIARRTSYSQLKAYHIDDYQTLFQRVSLQ 337
Query: 312 LS-------RSPKDIVTDTCSEENIDTV--------------------PSAERVKSFQTD 344
LS R + + + S + + P+ ER+ +F+ +
Sbjct: 338 LSQGSNYDLRRSRLVQSAETSSQGANVSDYGFQISGCTRLTSLNSFVKPTVERIVTFKDN 397
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
EDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+
Sbjct: 398 EDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDT 441
>gi|21218886|ref|NP_624665.1| large hypothetical protein [Streptomyces coelicolor A3(2)]
gi|5912520|emb|CAB56146.1| putative large secreted protein [Streptomyces coelicolor A3(2)]
Length = 809
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 222/592 (37%), Positives = 326/592 (55%), Gaps = 47/592 (7%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
P+++ + PA+ + +A+P+GNGRLGAMV+GG +E L+LNED+LW G PGDY PDA +
Sbjct: 50 PMRLWYRAPAQEWLEALPVGNGRLGAMVFGGTDTERLQLNEDSLWAGGPGDYARPDAVRH 109
Query: 72 LSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRREL 128
L+++R LV ++ A + G P++ YQ+LGD+EL + Y REL
Sbjct: 110 LAEIRRLVVEEKWNRAQRLIDAEFLGSPSEQAAYQVLGDLELTLAG---EGEAADYEREL 166
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL TA AR Y+ G V RE F+S PDQV+V ++S G++ F S +
Sbjct: 167 DLETAVARTTYTRGGVRHVREVFASAPDQVLVVRLSADTPGAVGFTARFTSPQRSGGSAV 226
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLK 243
+ I ++G + P ++F + ++S D GT L
Sbjct: 227 DAHTIALDG--------VGGDWYGRPGSVRFRGLARAESEGGRVSTDGGT--------LT 270
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VEG+D A L++ ++S+ N D DP S + + L Y+ L TRH+ D+++
Sbjct: 271 VEGADAATLVISLATSYR----NYLDVGADPASRARNHLAPAARKPYAHLRTRHVADHRR 326
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF RV++ L S + +P+ ER+ F +DP L L FQ+GRYLL S
Sbjct: 327 LFGRVALDLGPSERA------------ELPTDERIPLFADGKDPQLAALYFQYGRYLLAS 374
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SR Q ANLQG+WN+ L+P W+S VNIN EMNYW + P NL+EC +P + L+
Sbjct: 375 CSRSPGQPANLQGLWNDSLNPAWESKYTVNINFEMNYWPAGPGNLAECWDPAVRMVHELA 434
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+G++TA+ Y A GWV+HH TD W + +A + +WP GGAWLC LW+HY +T D
Sbjct: 435 ESGTRTAKALYDAPGWVLHHNTDGW-RGTAPVDAAQYGMWPTGGAWLCVMLWDHYRFTGD 493
Query: 484 RDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
L R YP+++G F LD L ++ G+L TNPS SPE +G+ + TM
Sbjct: 494 TGAL-SRNYPVMKGAVEFFLDTLQVDAETGWLVTNPSQSPEVTHHQDEGESVSICAGPTM 552
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
DM ++R++F A AAEVL+++ LV +V + RL PT++ G I EW+
Sbjct: 553 DMQLLRDLFDAYRQAAEVLDRDSR-LVGRVTEVRDRLAPTRVGHLGQIQEWL 603
>gi|389792551|ref|ZP_10195739.1| alpha/beta hydrolase domain-containing protein [Rhodanobacter
fulvus Jip2]
gi|388436250|gb|EIL93122.1| alpha/beta hydrolase domain-containing protein [Rhodanobacter
fulvus Jip2]
Length = 791
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 227/590 (38%), Positives = 331/590 (56%), Gaps = 34/590 (5%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ ++ PA + +A+P+GNG +GAMV+GGVP E ++LN TLW G P DY A L
Sbjct: 25 LVYDKPASQWNEALPLGNGLMGAMVFGGVPDERVQLNLGTLWGGAPNDYIAQGAASRLKP 84
Query: 75 VRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
++ L+ SG+ A+A A S G P + +Q GD+ L ++ K Y+REL L+
Sbjct: 85 IQKLIFSGKVAQAEALSAGFMGDPKLLMPFQPFGDLHLHVEN---KGKVSDYQRELRLDD 141
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-VNGNN 191
A + V Y+V V F RE F S PD+V+V +S + + +F V+L S + G +
Sbjct: 142 AISTVSYAVDGVHFRRETFMSYPDRVLVMHLSADQPAAQNFTVTLTSPQPGAKVALVGKD 201
Query: 192 QIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
I + G+ + P + K G+ ++ L IK G+I D L+V G+D
Sbjct: 202 TIALTGQIEPRTNPASSWTGSWSKPGMTYAGRLVIKTKG--GSIRQAGDH-LEVRGADAV 258
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L+ ++SF + D + + + + L SY L HL DY+ LF RV +
Sbjct: 259 TLVFSGATSFK----SYRDISGNAEAAARAPLDKAVQRSYEALKNAHLADYRALFDRVHL 314
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+L D S EN+ T +R++ F+T +DPSLV L +Q+GRYLLISSSR G Q
Sbjct: 315 RLG--------DDASRENVAT---DKRIRDFKTHDDPSLVALYYQYGRYLLISSSRAGGQ 363
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+DL P W S NINLEMNYW + L E Q PL+D + L + G+KTA
Sbjct: 364 PANLQGIWNQDLLPAWGSKWTTNINLEMNYWPAETGALWETQTPLWDLIDDLQVAGAKTA 423
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
Q Y A GWV+HH +D+W ++ G W LWPMGG WL +W+HY ++ D FL R
Sbjct: 424 QRYYGAHGWVLHHNSDLWRATTPVDGP--WGLWPMGGVWLSNQMWDHYTFSGDETFLRNR 481
Query: 491 AYPLLEGCASFLLDWLIEGHD-----GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
AYP ++G A F+LD+L+E G L TNPSTSPE+ ++ GK ++Y+ TMD+
Sbjct: 482 AYPAMKGAAEFVLDFLVEAPKGSPVAGKLVTNPSTSPENRYLL-GGKPVGLTYAPTMDIE 540
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+I ++F+ + +AA L + ALV ++ + PRL P +I G + EW++
Sbjct: 541 LINDLFNHVRAAARHLGVDA-ALVSRIDAAQPRLPPLQIGHKGQLQEWIE 589
>gi|409198450|ref|ZP_11227113.1| alpha-L-fucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 767
Score = 390 bits (1002), Expect = e-105, Method: Compositional matrix adjust.
Identities = 220/591 (37%), Positives = 330/591 (55%), Gaps = 50/591 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
+PL + ++ PA + +A+PIGNG +GAM++GG+ E ++LNE+T+WT PD K
Sbjct: 25 SPLTLWYDQPASQWEEALPIGNGHMGAMIFGGIDKERIQLNEETIWTKRDEFTDKPDGHK 84
Query: 71 ALSDVRSLVDSGQYAEATAASVK-----LFGHPADVYQLLGDIELEFDDSHLKYAE-ETY 124
++ +R+L+ QY EA + + + YQ LGD+ L+F+ K+ + Y
Sbjct: 85 YINKIRTLLFEEQYEEAEKLVRRHLLEDRMPNNTNTYQTLGDLHLDFE----KFEQISQY 140
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
RR+L+L ATA V + V ++RE FSSNP K+S + G +SF SL+ +
Sbjct: 141 RRQLNLENATASVSFISDGVHYSRESFSSNPANATFMKLSADKPGRISFTASLNRPGEGE 200
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ + IIM + D+ G+ + ++I+ GT+ A +DK +K+
Sbjct: 201 NISVDGHTIIMNQKV------------DNKDGVTYETRIQIRAKG--GTLEA-KDKSIKI 245
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ VL+ VA++ + G ++PT L+ I SY DL H+ DYQ L
Sbjct: 246 SGAAEVVLIQVAATDYRG---------ENPTQSCKKYLKDIAEKSYDDLRKEHISDYQSL 296
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLIS 363
F+RVS+ L S D + P ER+ + + EDP+L L +QFGRYLLIS
Sbjct: 297 FNRVSLDLGTS--DAIY----------FPVDERLTALRKGAEDPALFSLYYQFGRYLLIS 344
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG+ ANLQG+W L+P W++ H+NIN++MNYW ++ NL EC P +F+ L
Sbjct: 345 SSRPGSLPANLQGLWESTLTPPWNADYHININIQMNYWPAVVTNLPECHLPFLNFIGQLR 404
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
NG KTA Y A G+ HH TD W ++A +G+ WA+WPMG AW TH+WEH+ +T D
Sbjct: 405 ENGRKTANTLYGARGFTAHHTTDAWHFTTA-QGQPQWAMWPMGAAWASTHIWEHFLFTRD 463
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL + +++ A FL D+L++ + G L + PS SPE+ F P G A V +M
Sbjct: 464 TTFLRNYGFDVMKEAALFLSDFLVKDPETGRLVSGPSMSPENTFFTPRGNRASVVMGPSM 523
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
D II +FS++I AA+VL ED K+ + L +L P++I EDG I+EW
Sbjct: 524 DHQIIHHLFSSVIEAAKVLNA-EDHFTRKITRQLKQLTPSEIGEDGRILEW 573
>gi|295689298|ref|YP_003592991.1| alpha-L-fucosidase [Caulobacter segnis ATCC 21756]
gi|295431201|gb|ADG10373.1| Alpha-L-fucosidase [Caulobacter segnis ATCC 21756]
Length = 781
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 223/595 (37%), Positives = 333/595 (55%), Gaps = 48/595 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
T +PL++ + PAK + +A+P+G GRLGAMV+GGV E L+LNEDTLW G P + NP
Sbjct: 27 TPKASPLRLWYRQPAKTWVEALPVGTGRLGAMVFGGVDVERLQLNEDTLWAGGPYEPINP 86
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE-E 122
+A AL ++R L+D+G YA+A A K G P YQ +GD++L+F AE
Sbjct: 87 EAGAALPEIRRLIDTGDYAKAAQLAETKFVGVPKQQMSYQTIGDLKLDFPG----LAEPA 142
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
+Y REL+L+ A A ++ G V+ RE +S PD VI +++ S G++S ++ S L
Sbjct: 143 SYVRELNLDGAIATTRFKAGGVDHVREVIASAPDGVIAVRLTASRRGAISVDLGFASPLK 202
Query: 183 NH--SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALED 239
+ + V G + ++ A AND +GI E ++ +G + +
Sbjct: 203 SAPAARVEGRSLVL-------------AGANDSQQGIPAKLRFECRVDVRAKGGRVSGQG 249
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ L + +D +LL+ A++S+ +D DPT+ + + L + N ++ + H
Sbjct: 250 ETLSIRDADEVILLIAAATSYR----RYNDVSGDPTALNKATLARLSNKPWAKILAGHQA 305
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
D+ LF RV + R+ ++ P+ ER+K+ +DPSL L +Q+GRY
Sbjct: 306 DHHALFRRVEVDFGRTRAELS------------PTDERIKASPMTDDPSLAALYYQYGRY 353
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLI+ SRPGTQ ANLQG+WN+ S W +NIN EMNYW + P +L E EPL +
Sbjct: 354 LLIACSRPGTQPANLQGVWNDKPSAPWGGKYTININTEMNYWPAEPTSLPELVEPLIALV 413
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LS G++TA+ Y A GWV HH TD+W +++A W +WP GGAWLC HLW+HY+
Sbjct: 414 RDLSETGARTAKAMYGARGWVAHHNTDLW-RATAPVDGAPWGVWPTGGAWLCKHLWDHYD 472
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
Y DR +L R YPL++G A F LD L ++ G L TNPS SPE++ G A +
Sbjct: 473 YGRDRAYL-ARVYPLMKGSARFFLDTLVVDPKFGVLVTNPSLSPENDH----GHGASIVA 527
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
TMD AIIR++F + A VL ++ V ++ + +L P K+ +DG + EW
Sbjct: 528 GPTMDQAIIRDLFDNCLKAEAVLGADQ-TFVAELKTARDKLAPYKVGKDGQLQEW 581
>gi|436835055|ref|YP_007320271.1| Alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
gi|384066468|emb|CCG99678.1| Alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
Length = 874
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 230/616 (37%), Positives = 328/616 (53%), Gaps = 54/616 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
L + ++ PA +T+A+PIGNG +GAM++GGV E L+LNE TL++G P G +T D K
Sbjct: 32 LTLWYDKPAAAWTEALPIGNGYMGAMLFGGVEQEHLQLNEGTLYSGDPSGTFTAIDVRKK 91
Query: 72 LSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
V SLV G Y EA + G YQ LGD+ + F + YRR LDL
Sbjct: 92 FKAVDSLVKQGNYKEAQNLVAADWLGRNHQDYQPLGDLWMAFTHTG---PVTKYRRSLDL 148
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKI--SGSES--GSLSFNVSLDSLLDNHSY 186
+T ++++Y+V N + RE F+S PD+VIV ++ G E+ G + F+ L Y
Sbjct: 149 STGISQIQYTVANTTYRREIFASYPDRVIVIRLLAEGKETINGEIRFSTPHKPLA---RY 205
Query: 187 VNGNNQIIMEGRCPG---------------KRIPPKANAND--------------DPKGI 217
+Q+IM G+ PG + P+ A D D G
Sbjct: 206 SASADQLIMAGKAPGFVLRRTVKLVQKLGDQHKYPEVFAKDGSVLPNASDVLYGADATGW 265
Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE 277
++ + GT+ A D+ +K+ G+ +L+L ++SF+G +P +P +
Sbjct: 266 GMGFEARLRATQQGGTLQA-TDQTIKISGAREVLLVLTCATSFNGFDKSPVTQGLNPAAS 324
Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
+ L S+ SY DL HL DYQ LF R +Q+ T S+++ T + +R
Sbjct: 325 TQKYLASVAGRSYDDLAKTHLSDYQHLFSRSQLQIG---------TVSDQSART--TDQR 373
Query: 338 VKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLE 397
+ F +D SLV LL+QFGRYL+I+ SRPG Q NLQGIWN+ + P W+ A VNIN +
Sbjct: 374 IALFANGKDQSLVGLLYQFGRYLMIAGSRPGGQPLNLQGIWNDKVIPPWNGAYTVNINAQ 433
Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK 457
MNYW + NLSEC EP + L+ING+ TA+ Y +GWV+HH TDIW + +
Sbjct: 434 MNYWPAELTNLSECHEPFLTAVRELAINGAVTARAMYGNNGWVVHHNTDIW-RHTEPVDY 492
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
A WPM G WL +H WE Y + D FL YPLL+G F DWLI DGYL T
Sbjct: 493 CNCAFWPMAGGWLTSHFWERYLFRGDTTFLRTDVYPLLKGVVLFYKDWLIPNKDGYLVTP 552
Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
SPEH F+ +G+ + +S TMDMAIIRE F+ I A++ L +E L +++ L
Sbjct: 553 IGHSPEHAFVYGNGQTSTLSPGPTMDMAIIRESFTRFIEASDKLGTSEQPLYDEIKAKLA 612
Query: 578 RLRPTKIAEDGSIMEW 593
+L P +I + G + EW
Sbjct: 613 KLLPYQIGKYGQLQEW 628
>gi|442803588|ref|YP_007371737.1| alpha-1,2-L-fucosidase [Clostridium stercorarium subsp.
stercorarium DSM 8532]
gi|442739438|gb|AGC67127.1| alpha-1,2-L-fucosidase [Clostridium stercorarium subsp.
stercorarium DSM 8532]
Length = 761
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 220/560 (39%), Positives = 321/560 (57%), Gaps = 45/560 (8%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+P+GNGR+GAM++GGV +E ++LNED++W G P D NP+A + L +R L+ G+ E
Sbjct: 30 ALPLGNGRIGAMIYGGVENELIQLNEDSIWYGGPRDRNNPEAVRYLPTIRKLISEGRIRE 89
Query: 87 A-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A A++ L G P YQ LG++ L F++ YRRELD++ A ARV+Y + +
Sbjct: 90 AENLAAIALSGIPESQRHYQPLGELYLNFENHK---NPSYYRRELDIDNAVARVEYKIVD 146
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPG 201
+TRE F S P QV+ KI S S+SF L + +N +N + M G C G
Sbjct: 147 TLYTREMFVSAPQQVLAIKIKAEGSKSISFRTKLRRSRYFEKVDALN-HNTLKMAGSCGG 205
Query: 202 KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFD 261
+ I + A+L +I + G++ A+ + L V+ S V+ L +++F
Sbjct: 206 E------------GAINYCALL--RIIPENGSVEAI-GEHLVVKNSKSVVIFLSVATTF- 249
Query: 262 GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVT 321
++P ES+ L+ L Y +L H++DY+ LF RV + +T
Sbjct: 250 --------RHEEPEKESLRILEEAEKLRYDELLQNHIEDYRSLFDRVDL--------YIT 293
Query: 322 DTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNE 380
+ +++N+D++P+ ER++ + ++DP LV L FQFGRYLLISSSRPGT ANLQGIWN+
Sbjct: 294 NHSADKNVDSLPTDERLERVKAGNDDPGLVSLYFQFGRYLLISSSRPGTLPANLQGIWNK 353
Query: 381 DLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWV 440
D P WDS +NIN +MNYW + CNLSEC PLFD + + G KTA+V Y G+
Sbjct: 354 DYLPPWDSKYTININTQMNYWPAEVCNLSECHLPLFDLIERMREPGRKTARVMYGCRGFC 413
Query: 441 IHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCAS 500
HH TDIWA ++ WPMG AWLC HLWEHY +T D++FL + AY ++
Sbjct: 414 AHHNTDIWADTAPQDIYFGATYWPMGAAWLCLHLWEHYEFTRDKEFLAQ-AYLTMKEAVE 472
Query: 501 FLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
FLLD+L E G L T+PS SPE+ +I P+G+ + +MD II E+F I A +
Sbjct: 473 FLLDFLTEDDKGRLVTSPSVSPENTYILPNGESGRLCQGPSMDSQIIHELFGVCIKATSI 532
Query: 561 LEKNEDALVE--KVLKSLPR 578
L + + E KVL+ +P+
Sbjct: 533 LNIDGEFAAELGKVLERVPK 552
>gi|373958368|ref|ZP_09618328.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
gi|373894968|gb|EHQ30865.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
Length = 827
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 228/608 (37%), Positives = 345/608 (56%), Gaps = 37/608 (6%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-T 64
S+ N K+ ++ PAK +T+A+P+GNGRLGAM++G V E ++LNE TLW+G P +
Sbjct: 18 SSFAQNSSKLWYSHPAKVWTEALPLGNGRLGAMIFGRVDQELIQLNEGTLWSGGPVKHNV 77
Query: 65 NPDAPKALSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL--EFDDSHLKYAE 121
NPDA L R +L+ Y +A A + K+ G ++ ++ LGD+ + +F ++ +
Sbjct: 78 NPDAYSYLLQTREALLKEENYVKAAALARKMQGVYSESFEPLGDVMISQKFKEA----SP 133
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y R+LD++ A + ++++ +FTR+ F S PDQVIV ++ S+ G L+F VS S L
Sbjct: 134 SAYYRDLDISDAVSTTRFTIDGTQFTRQMFISAPDQVIVIRLKASKPGQLNFKVSTKSQL 193
Query: 182 D-NHSYVNGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDR 231
+S +NG+ QI M G P P N N P +G++++ +L+ +
Sbjct: 194 KFGNSVINGS-QIAMLGHAPLHADPSYVNYNKTPVIYQDSTGKQGMRYALLLK---AVGN 249
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
GTI+ + L V+ +L L A++SF+G +P +D + L + +
Sbjct: 250 GTITT-DTSGLSVKNGSDIILFLSAATSFNGFDKSPDKDGQDEVRIATQYLNTALKKDWQ 308
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLV 350
L+ HL DY + ++RV+ L+ +PKD +P+ ER+ + + +DP+L
Sbjct: 309 SLFDAHLADYHRYYNRVTFNLA-APKDNTNAL--------LPTDERLIGYTRGTKDPALE 359
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
L + +GRYLLIS SRPG ANLQGIWN + P W S NIN +MNYW S NLSE
Sbjct: 360 TLYYNYGRYLLISCSRPGGAAANLQGIWNNIVRPPWSSNFTTNINTQMNYWPSEMTNLSE 419
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGG 467
EPLF+ + +L++ G TA+ Y A GW +HH +DIWA S+ RG WA W MG
Sbjct: 420 LNEPLFEQIKHLAVTGKATAKEFYHAEGWAVHHNSDIWALSNPVGDKRGDPKWANWSMGS 479
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
WL HLW HY +T D+ FL+ AYPL++G A F L WL+E DG L T PS SPE++FI
Sbjct: 480 PWLSQHLWTHYQFTGDKLFLKDTAYPLMKGAAQFCLSWLVENKDGLLVTAPSVSPENDFI 539
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
G VS ++TMDM+II ++F+ +I A VL + D + ++ +L P I +
Sbjct: 540 DDRGHEGSVSIATTMDMSIIWDLFTNVIEACNVLNTDRD-FRDLIIAKRAKLFPLHIGKK 598
Query: 588 GSIMEWVQ 595
G++ EW +
Sbjct: 599 GNLQEWYK 606
>gi|408674119|ref|YP_006873867.1| hypothetical protein Emtol_2704 [Emticicia oligotrophica DSM 17448]
gi|387855743|gb|AFK03840.1| hypothetical protein Emtol_2704 [Emticicia oligotrophica DSM 17448]
Length = 785
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 230/603 (38%), Positives = 349/603 (57%), Gaps = 40/603 (6%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+ST+T + + PA++F + + +GNG+LGA V+GGV S+ + LN+ TLW+G P +
Sbjct: 8 AQSTNT-----LWYKQPAQYFEETLVLGNGKLGATVFGGVESDKIYLNDATLWSGEPVNA 62
Query: 64 T-NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
NP+A K L +R + + Y A + KL G ++ Y LG + L +D Y
Sbjct: 63 NMNPEAYKHLPAIREALRNENYKLADQLNKKLQGKFSESYAPLGTMYLT-NDKATNYT-- 119
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y RELD++ A ++V Y V V++TRE+F S PDQ++V K++ S+ G+LSF+V +SLL
Sbjct: 120 NYYRELDISKAISKVTYEVDGVKYTREYFVSYPDQIMVIKLTSSKKGALSFDVKFNSLLK 179
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISA 236
+ VN + + + G P P +D+P KGI+F+ + +IK +D G I +
Sbjct: 180 YKTIVN-DKTLKINGYAP-IHAEPNYRRSDNPVIFDENKGIRFTTLAKIKNTD--GAIVS 235
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
D L ++ + A++ + ++SF+G NP+ + + + ++L +Y +
Sbjct: 236 -TDTTLGIKNASEAIVYVSIATSFNGFDKNPATQGLNNQAIAATSLAKAYAKTYEQIRQS 294
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQ 355
HL DYQK F+RVS+ L ++ +P+ +R++ + + +ED +L L FQ
Sbjct: 295 HLLDYQKFFNRVSLDLGKT------------TAPNLPTDDRLRRYAKGEEDKNLEVLYFQ 342
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSR ANLQGIWN + P W S NIN E NYW + NLSE PL
Sbjct: 343 YGRYLLISSSRTMGVPANLQGIWNPYIRPPWSSNYTTNINAEENYWLAENTNLSEMHAPL 402
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLC 471
F+ ++ G+ TA+ Y A+GWV+ H +DIWA S+ G WA W MGG WL
Sbjct: 403 LGFIKNVAKTGAITAKTFYGANGWVVAHNSDIWAMSNPVGAFGEGDPGWANWNMGGTWLS 462
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
THLWEHY +T D++FL+ AYPL+ G A F L+W++E +G L T+PSTSPE+ +IAPDG
Sbjct: 463 THLWEHYIFTKDQNFLKNEAYPLMRGAAQFCLEWMVEDKNGKLITSPSTSPENIYIAPDG 522
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSI 590
Y + D+A+IRE F I A+++L N DA K+ +L +L P +I + G++
Sbjct: 523 YKGATMYGGSADLAMIRECFIQTIKASKIL--NTDANFRTKLETALAKLYPYQIGKKGNL 580
Query: 591 MEW 593
EW
Sbjct: 581 QEW 583
>gi|423346384|ref|ZP_17324072.1| hypothetical protein HMPREF1060_01744 [Parabacteroides merdae
CL03T12C32]
gi|409220202|gb|EKN13158.1| hypothetical protein HMPREF1060_01744 [Parabacteroides merdae
CL03T12C32]
Length = 844
Score = 387 bits (995), Expect = e-105, Method: Compositional matrix adjust.
Identities = 224/622 (36%), Positives = 330/622 (53%), Gaps = 50/622 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
T + PL + ++ PA+++ +A+PIGNGR GAMV+GGV E L+LNE+TL++G P
Sbjct: 18 GTPSKAPLTLWYDKPAQNWDEALPIGNGRAGAMVFGGVEKEQLQLNENTLYSGEPSVVFK 77
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEE 122
P+ V L+ +G+Y A+ K G YQ GD+ ++ +
Sbjct: 78 DVKITPEMFDKVVGLMKAGKYKTASDLVCKNWLGRLHQYYQPFGDLHIQNNKPG---DAA 134
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y+R L+++ A A Y V++ RE F+S+PD VIV + + ++ S
Sbjct: 135 GYKRALNISDAVATTVYKQNGVKYEREVFASHPDNVIVMHLKSDTPNGIDISLDFTSPHP 194
Query: 183 NHSYVNGNNQIIMEGRCPGK----------------RIPPKANAND-------------- 212
++++I+ G+ PG + P +AN
Sbjct: 195 TALQKGTDDRLILHGQAPGYVERRTFEQIEQWGDQYKHPELYDANGKRKFDKRMLYGDEI 254
Query: 213 DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
D KG+ F A L+ D + D + + +D +L ++SF+G +PS
Sbjct: 255 DGKGMFFEAQLKPVFPKD--GKCEITDAGIHIYNTDEVYFILSMATSFNGFDKSPSRDGI 312
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP++++ S L+ + Y L RH +DY LF RV +QL S SE+ +
Sbjct: 313 DPSAKAASILEKALSYDYQTLKQRHTEDYHSLFDRVDLQLVSS---------SEQK--AM 361
Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
P+ +R++ F DP+L LLFQFGRYL+IS SRPG Q NLQGIWN+D P W+ +
Sbjct: 362 PTDKRLEQFTQTADPALAALLFQFGRYLMISGSRPGGQPLNLQGIWNKDTIPAWNCGYTI 421
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
NIN EMNYW + NLSECQEPLF + LS++G++TA+ Y GWV HH T IW +S
Sbjct: 422 NINTEMNYWPAELTNLSECQEPLFRMIRELSVSGAETARNMYNRRGWVAHHNTSIWRESL 481
Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
+ + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLI+ +G
Sbjct: 482 PNDNVPTASFWPMVQGWLCSHLWEHYQFTQDEAFLKNEAYPLMKGAAEFFADWLIDDGNG 541
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
+L T SPE+ FI DG+ A +S TMDMAIIRE F+ I+A+E+ +E + ++
Sbjct: 542 HLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIAASEMFNLDE-SFRNEL 600
Query: 573 LKSLPRLRPTKIAEDGSIMEWV 594
L RL P +I + G + EW+
Sbjct: 601 KDKLARLLPYQIGKRGQLQEWI 622
>gi|374605049|ref|ZP_09677992.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
gi|374389319|gb|EHQ60698.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
Length = 779
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 216/588 (36%), Positives = 331/588 (56%), Gaps = 44/588 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+PIGNGRLGAM +GGV S+ L+LNED++W G P NPDA L
Sbjct: 12 RLWYRQPAGQWVEALPIGNGRLGAMQFGGVDSDRLQLNEDSVWYGGPAARENPDAAAYLP 71
Query: 74 DVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELD 129
+R + G+ EA AS+ L P YQ LG++++ F H + E + Y REL
Sbjct: 72 VIRQYLLEGKPEEAERIASLALASVPKHFGPYQTLGELKMFF---HGEEGEVSGYSRELS 128
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVN 188
L ARV+Y+ + ++RE SS PDQVI +++ S + LS ++ L+ ++ + V
Sbjct: 129 LPDGLARVEYTRNGIAYSRELLSSVPDQVIALRLTASAAKRLSLSLYLNRRSFEDGTTVI 188
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
++ I M+G+C G+++ L K D G ++A+ D L ++ +D
Sbjct: 189 ASDTIAMQGQC-------------GAGGVRYCVAL--KALADNGEVTAIGDC-LSIDAAD 232
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
L + A+++F + +P + +++ Y + + H+ D++ L+ RV
Sbjct: 233 AVTLYVAAATTF---------RESNPLQTCLRQVEAAAAKGYQQVRSDHVRDHRALYERV 283
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRP 367
+++L SE+++ +P+ ER+K Q DP L L FQ+GRYLL+ SSRP
Sbjct: 284 ALRLG---------ATSEDSLCRLPTDERLKRVRQGQADPGLFALFFQYGRYLLMGSSRP 334
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQGIWN ++P W+S H+NINL+MNYW + NL+EC EP+FD L L NG
Sbjct: 335 GTLPANLQGIWNPHMTPPWESDFHLNINLQMNYWPAEAANLAECHEPVFDLLDRLRTNGR 394
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA V Y A G+V HH T++WA ++ V WPMGGAWL H WEHY Y D FL
Sbjct: 395 HTAAVMYGADGFVAHHATNLWADTAPVSDVVSATFWPMGGAWLALHAWEHYQYGGDETFL 454
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+RAYP+++ A FLL++L+E G T+PS SPE+ + P+G+ + +MD I+
Sbjct: 455 RERAYPVMKDAALFLLNYLVENAQGEWVTSPSISPENRYRLPNGQQGTLCMGPSMDTQIM 514
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
R +F A + A+ EDA E++ ++ RL P +I DG ++EW +
Sbjct: 515 RALFQACLDAS-AGRTEEDAFRERLQAAMTRLPPHRIGRDGQLLEWAE 561
>gi|284036792|ref|YP_003386722.1| alpha-L-fucosidase [Spirosoma linguale DSM 74]
gi|283816085|gb|ADB37923.1| Alpha-L-fucosidase [Spirosoma linguale DSM 74]
Length = 825
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 222/598 (37%), Positives = 328/598 (54%), Gaps = 32/598 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
LK+ + PA +T+A+P+GNGR+GAM++G V E ++LNE TLW+G P NP++P
Sbjct: 23 LKLWYTKPAAVWTEALPVGNGRIGAMIFGKVEDELIQLNESTLWSGGPVSGNVNPESPSY 82
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
L VR ++ Y +A K+ G Y LGD+ L+ +L A T Y R+LD+
Sbjct: 83 LPQVREALNREDYKQAVTLVKKMQGLYTQSYMPLGDLSLK---QNLNGATPTGYYRDLDI 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +++ V + RE F+S PD V+V +++ S+ G LSF+ S S L + N
Sbjct: 140 QKALATTRFTANGVTYKREMFTSAPDGVMVIRLTASKPGQLSFDASTSSQLRAENMRGSN 199
Query: 191 NQIIMEGRCPGKRIPPKANANDDP----------KGIQFSAILEIKISDDRGTISALEDK 240
++M+G+ P + P N D KG++F L +K + GT+ + +
Sbjct: 200 GDLVMKGKAPTQVDPNYYNPKDREHVIYEDATGCKGMRFQ--LRLKALNKGGTVQT-DKE 256
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ V + +L + A++SF+G P KD + ++ SY L RH D
Sbjct: 257 GIHVRNASEVLLFVAATTSFNGYDKCPDKDGKDENKLAEELIRKATATSYQALLNRHTAD 316
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRY 359
YQ F+R S Q +TDT S +PS ER++ + DP + L Q+GRY
Sbjct: 317 YQSYFNRFSFQ--------ITDTTSVNKNAALPSDERLEMYSKGVYDPGIETLYCQYGRY 368
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSR ANLQGIWN++L W S +NIN +MNYW NLSE PL F+
Sbjct: 369 LLISSSRVTNVPANLQGIWNKELRAPWSSNYTININTQMNYWPVEVTNLSELHRPLLSFI 428
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTHLW 475
L+ G+ TA+ Y +GWV+HH TDIWA S+ D+G+ WA W G WL HLW
Sbjct: 429 GELAKTGAVTAKEFYNMNGWVVHHNTDIWAISNPVGDKGQGDPKWANWNQGAGWLSQHLW 488
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY +T D+ FL + AYP+++G A F LDWL+ DGYL +PS SPE++FI G+ A
Sbjct: 489 EHYRFTGDKKFLRESAYPIMKGAAEFYLDWLVADKDGYLVVSPSVSPENDFIDAKGQPAS 548
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+S ++TMDM+I+ ++F+ +I A+ VL D + +++ + P I G++ EW
Sbjct: 549 ISVATTMDMSIMWDLFTNLIDASTVLNIEPD-FRKMLIEKRSKFYPLHIGHKGNLQEW 605
>gi|399029093|ref|ZP_10730146.1| hypothetical protein PMI10_01974 [Flavobacterium sp. CF136]
gi|398073115|gb|EJL64299.1| hypothetical protein PMI10_01974 [Flavobacterium sp. CF136]
Length = 802
Score = 387 bits (993), Expect = e-104, Method: Compositional matrix adjust.
Identities = 218/589 (37%), Positives = 348/589 (59%), Gaps = 33/589 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALSDV 75
++ PA+ F +++ +GNG+LGA V+GGV S+ + LN+ TLW+G P + NP+A K + V
Sbjct: 32 YDKPAEFFEESLVLGNGKLGATVFGGVNSDKIYLNDATLWSGEPVNANMNPEAYKNIPAV 91
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A + K+ G ++ + LG +E+ ++ K Y RELD++ A +
Sbjct: 92 REALKNENYKLAEELNKKIQGKNSESFAPLGTLEI---NNSEKGKAVNYHRELDISNAVS 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+V Y + +++TRE+F S PDQ+++ K++ + G+L+F+++L SLL ++ V NN ++M
Sbjct: 149 KVSYEMAGIKYTREYFVSAPDQIMIIKLTSDQKGALNFDINLKSLLKSNVEVR-NNILVM 207
Query: 196 EGRCP-----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
G P G + PK + +G +F+ +++IK +D + T S + L ++ + A
Sbjct: 208 TGSAPIHENAGYAVLPKY-LDIKERGTRFTTLIQIKKTDGKITNSR---ESLTLKDATEA 263
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
++ + ++SF+G NP+ D + ++ + S+ L H+ DYQK ++RVS+
Sbjct: 264 IIYVSVATSFNGFDKNPATEGLDDVAIALQNMNKAFAKSFDKLKQSHITDYQKFYNRVSL 323
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGT 369
L ++ T S +P+ ER+ + +ED +L L FQ+GRYLLISSSR
Sbjct: 324 DLGKT-------TAS-----NLPTDERLLRYADGNEDKNLEILYFQYGRYLLISSSRTLG 371
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWN L+P W S +NINLE NYW + NLSE PL F+ LSI G T
Sbjct: 372 VPANLQGIWNPYLNPPWSSNYTMNINLEENYWLAENTNLSEMHLPLLSFIKNLSITGKIT 431
Query: 430 AQVNY-LASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
A+ Y + GW H +DIWA ++ + + +WA WPM GAWL TH+WEHY +T D+
Sbjct: 432 AKTFYGVDKGWAAGHNSDIWAMTNPVGQFGKEEPMWACWPMAGAWLSTHIWEHYVFTQDK 491
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
++L+K YPL++G A F L W++ +G L T+PSTSPE+++IAPDG + Y T D+
Sbjct: 492 EYLKKEGYPLMKGAAEFCLGWMVTDKNGNLITSPSTSPENQYIAPDGFVGATMYGGTADL 551
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A+IRE F I A++VL + D K+ +L +L P +I + G++ EW
Sbjct: 552 AMIRECFDKTIKASKVLNIDAD-FRAKLETALSKLHPYQIGKKGNLQEW 599
>gi|436838082|ref|YP_007323298.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
gi|384069495|emb|CCH02705.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
Length = 801
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 223/590 (37%), Positives = 328/590 (55%), Gaps = 32/590 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
+ PA +F + + +GNG GA V+GGV S+ + LN+ TLW+G P D NP+A K + +
Sbjct: 29 YKQPAHYFEETLVLGNGTQGASVFGGVRSDKIYLNDATLWSGGPVDPNMNPEAYKNIPAI 88
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A KL G ++ Y LG + F D+ + Y R+L+L AT+
Sbjct: 89 REALQNENYQLADQFQKKLQGKFSESYAPLGTL---FIDTDAPADPQNYYRQLNLADATS 145
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+V+Y+V V FTR++F S PDQ++V ++ S G+L F V +S L N GN +
Sbjct: 146 QVRYTVNGVTFTRDYFISKPDQLMVIRLKSSRKGALGFTVRFNSQLRNQVSATGN-VLKA 204
Query: 196 EGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
G P K P P A D KG +F+ ++ IK D G A D L ++G
Sbjct: 205 TGYAPQKAEPNYRGNIPNAVVFDPAKGTRFTTLMGIKTQD--GGTVATTDTSLTLKGGTE 262
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A+L + ++SF+G +P+ + + + L + SY+ L H+ DYQ+LF+RVS
Sbjct: 263 ALLFVSIATSFNGFDKDPATNGLPHETIAAERLSRAMSKSYAQLLAAHVSDYQRLFNRVS 322
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
++L+ S E I +P+ ER++ + + D L +L F FGRYLLISSSR
Sbjct: 323 LRLT-----------SAETIPNLPTDERLQRYAEGKPDTDLEQLYFNFGRYLLISSSRTP 371
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIWN + P W S NINL+ NYW + NL E EP+ F+ L+ G+
Sbjct: 372 GVPANLQGIWNPYMRPPWSSNYTTNINLQENYWPAETANLPEMHEPMLSFIGNLAKTGTI 431
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA+ Y A+GW + H +DIWA ++ +G VWA W MGGAW+ THLWEH+ + D+
Sbjct: 432 TARTFYGANGWTVAHNSDIWAMTNPVGDFGQGDPVWANWNMGGAWISTHLWEHFTFGQDK 491
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+L + AYPLL+G A F LDWL+ G L T+P TSPE++++ P G + T D+
Sbjct: 492 TYLRETAYPLLKGAAQFCLDWLVRDKAGKLVTSPGTSPENQYLTPSGYKGATLFGGTADL 551
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
A++RE S + AA+VL N DA + LK +L L P +I + G++ EW
Sbjct: 552 AMVRECLSQTLQAAQVL--NTDADFQATLKQTLADLHPYQIGKAGNLQEW 599
>gi|182416090|ref|YP_001821156.1| alpha/beta hydrolase domain-containing protein [Opitutus terrae
PB90-1]
gi|177843304|gb|ACB77556.1| Alpha/beta hydrolase fold-3 domain protein [Opitutus terrae PB90-1]
Length = 1094
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 234/606 (38%), Positives = 341/606 (56%), Gaps = 53/606 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
+A + T LK+ + PA + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW G P D
Sbjct: 337 SAPEEAATAALKLWYRQPAAQWVEALPVGNGRLGAMVFGGIQQERLQLNEDTLWAGGPYD 396
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKY 119
+P+A AL ++R L+ +G YA A + K G P YQ +GD+ + S
Sbjct: 397 PASPEARAALPEIRRLISAGNYAAAQQLTQGKFMGRPIVQMPYQTVGDLMITQAGSE--- 453
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-------GSLS 172
YRRELDL+TA AR +Y +G V F RE F+S DQVIV +++ S + G LS
Sbjct: 454 QVANYRRELDLDTAIARTEYVLGGVTFVREVFASPVDQVIVIRLTASRNPPRPEWGGPLS 513
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK--ISDD 230
F ++ S + +G ++++ G +N D GI+ E + + +
Sbjct: 514 FTLAFQSPQRATAAADGA-ELVLSG------------SNSDAAGIKGRLKFEARARLIVE 560
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
G + A + L+V+G+ A +LL A++S+ D DP + + + L ++ Y
Sbjct: 561 GGAVVA-DGTDLQVQGAHAATILLAAATSYR----RYDDVSGDPAALNRATLAAVATKPY 615
Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV 350
+ H+ ++Q+LF RVS+ D+ T ++ +P+ ERV+ T DP+L
Sbjct: 616 EAIRAAHVAEHQRLFRRVSL-------DLGTSYAAQ-----LPTDERVRLSTTSVDPALA 663
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
L FQ+ RYLLISSSRPG+Q ANLQG+WN+ ++P W S +NIN EMNYW + NL+E
Sbjct: 664 ALYFQYARYLLISSSRPGSQPANLQGLWNDHVTPPWGSKYTININTEMNYWPAEVANLAE 723
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
C EP+F + L+ G+K AQ Y A GWV+HH TD+W +++A W +WP GGAWL
Sbjct: 724 CTEPVFSMIRDLTETGTKMAQAQYGARGWVVHHNTDLW-RAAAPIDGAFWGMWPTGGAWL 782
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP 529
C WEHY Y+ DR+FL R YP L+G A F LD L+ E +L T+PS SPE+
Sbjct: 783 CRTAWEHYLYSGDREFL-ARIYPWLKGAAEFFLDTLVEEPRHRWLVTSPSISPENAH--- 838
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+S TMD IIR++FS +I+A+E L + D +KV + RL P +I G
Sbjct: 839 -HPGVTISAGPTMDEQIIRDLFSEVITASEQLGVDAD-FRQKVAAARARLAPNQIGAQGQ 896
Query: 590 IMEWVQ 595
+ EWV+
Sbjct: 897 LQEWVE 902
>gi|399073647|ref|ZP_10750601.1| hypothetical protein PMI01_01669 [Caulobacter sp. AP07]
gi|398041300|gb|EJL34368.1| hypothetical protein PMI01_01669 [Caulobacter sp. AP07]
Length = 783
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 220/586 (37%), Positives = 334/586 (56%), Gaps = 42/586 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PAK + +A+P+G GR+GAMV+GGV E L+LN+DTLW G P D NP A AL
Sbjct: 35 RLWYRQPAKEWVEALPVGTGRIGAMVFGGVAEERLQLNDDTLWAGGPYDPVNPQARAALP 94
Query: 74 DVRSLVDSGQYAEAT-AASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ +G AEAT A + P YQ +GD+ L F L + Y R+LDL
Sbjct: 95 EIRRLIAAGDIAEATKVADARFLATPRYQMSYQTIGDLRLAF--PGLPETADDYVRDLDL 152
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH--SYVN 188
+ A A ++S G FTRE +S PD+VI +++ ++ +LS ++S S L++ +
Sbjct: 153 DGAIATTRFSAGATRFTREVIASAPDRVIAVRLTADKAKALSLDLSFASPLNSRPTARAE 212
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G + +++ G + N ++F +++ + GT+ A + L V G+D
Sbjct: 213 GADTLVLAGTGEAQ--------NGVEAALKFEC--RVRVLNKGGTVVA-DGAGLAVRGAD 261
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
VLLL+AS++ F D DP + + +A+++ + DL RH D++KLF RV
Sbjct: 262 -EVLLLIASATSYRRF---DDVGGDPAAINRTAVEAASARPWRDLLARHQADHRKLFRRV 317
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
++ L + + P+ ER+K+ T +DP+L L +Q+GRYLLI+ SRPG
Sbjct: 318 AVDLGTTSAALK------------PTDERIKASPTTDDPALAALYYQYGRYLLIACSRPG 365
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQG+WN+ +P W S +NIN EMNYW + P L+EC PL + + LS+ G++
Sbjct: 366 GQPANLQGLWNDQAAPPWGSKYTININTEMNYWPAEPTGLAECVAPLVEMVRDLSVTGAR 425
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TAQ Y A GWV HH TD+W +++A + +WP GGAWLC HLW+HY+Y D+ +L
Sbjct: 426 TAQAMYGARGWVAHHNTDLW-RATAPIDGAKYGVWPTGGAWLCKHLWDHYDYGRDQAYLA 484
Query: 489 KRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
YPL+ G A F +D L+ + G + T+PS SPE++ G + TMD AII
Sbjct: 485 D-VYPLMRGAALFFVDTLVRDPRTGQVVTSPSISPENDH----GHGGSLVAGPTMDQAII 539
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
R++FS+ I+AA +L + L + + RL P KI +DG + EW
Sbjct: 540 RDLFSSCIAAAAIL-GTDAPLAAILAAARDRLAPYKIGKDGQLQEW 584
>gi|430749774|ref|YP_007212682.1| hypothetical protein Theco_1545 [Thermobacillus composti KWC4]
gi|430733739|gb|AGA57684.1| hypothetical protein Theco_1545 [Thermobacillus composti KWC4]
Length = 845
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 228/639 (35%), Positives = 348/639 (54%), Gaps = 67/639 (10%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+PIGNGRLGAM++GGV + + LNEDTLW G P + + +A + L+
Sbjct: 7 RLWYRRPAGVWEEALPIGNGRLGAMLFGGVRLDRILLNEDTLWAGYPRETVDCEARRHLA 66
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L+ +G+ EA ++ G Y LG++ +E+ D + Y R L +
Sbjct: 67 RARELIFAGRLTEAQRLIESRMTGRNVQPYLPLGELAIEWLDGEDDAPD--YVRSLRIFD 124
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-VNGNN 191
A V+++ G + R +++S PDQVIV + +E G ++ +L S + + ++
Sbjct: 125 GVADVRFASGGLRMRRAYWASAPDQVIVVRYE-AEGGMMNLAAALSSPVRSSVSVMDDGR 183
Query: 192 QIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+++ GR P + P+ ++ +G++F A +++ D G + A E ++L V
Sbjct: 184 TLVLAGRAPSHVADNWRGDHPEPVLYEEGRGMRFEA--RVRLETD-GVVEA-EGERLIVR 239
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ + A+++F + P D ++ + L+ Y L RHL D++
Sbjct: 240 GASRLTAYIAAATAFVD-WRTPPDESGAHSARCEAWLREAERSGYEALLERHLADHRAFM 298
Query: 306 HRVSIQLSR----------SP------KDIV-TDTCSEENIDT----------------- 331
RVS++L+ SP KD +DT + + +
Sbjct: 299 GRVSLRLAGGEAAGLPDADSPGSHAAGKDATGSDTAGSDAVGSAAATAESGQAGMDRSEA 358
Query: 332 ---------------VPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQ 375
+P+ ER+K++Q+ + DP+L L FQ+GRYLL++SSRPGTQ ANLQ
Sbjct: 359 GWTASFGLNRVSMNDLPTDERLKAYQSGNPDPALEALYFQYGRYLLLASSRPGTQPANLQ 418
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
GIWN + P W S +NIN EMNYW + CNLSEC EPLF L L+ +G++TA+++Y
Sbjct: 419 GIWNPHVQPPWFSDYTININTEMNYWPAEVCNLSECHEPLFAMLGELAESGTRTARIHYG 478
Query: 436 ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
GW HH D+W S+ G WA WPMGGAWL THLWE Y + D DFL AYPL+
Sbjct: 479 CRGWTAHHNVDLWRMSTPSDGSASWAFWPMGGAWLATHLWERYLFEPDLDFLRGTAYPLM 538
Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
G A F LDWL+ G DG L TNPSTSPE+ F+ P+G+ V++ STMDMAIIRE+F+A I
Sbjct: 539 RGAAQFCLDWLVPGPDGTLVTNPSTSPENVFLTPEGEPCSVTWGSTMDMAIIRELFAACI 598
Query: 556 SAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
A+ +L +E L ++ +L +L P +I G + EW
Sbjct: 599 EASRLLGTDE-PLRGELEAALAKLPPYRIGRHGQLQEWA 636
>gi|414868293|tpg|DAA46850.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
Length = 579
Score = 385 bits (989), Expect = e-104, Method: Compositional matrix adjust.
Identities = 175/241 (72%), Positives = 205/241 (85%)
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+APH NINL+MNYW +LPCNLSECQEP
Sbjct: 129 QFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDAAPHPNINLQMNYWPALPCNLSECQEP 188
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LFDF+ LSING+KTA+VNY ASGWV H TD+WAK+S D G VWALWPMGG WL THL
Sbjct: 189 LFDFIGSLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPVWALWPMGGPWLATHL 248
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WEHY +T+D+ FLEK AYPLLEG A FLLDWLIEGH GYLETNPSTSPEH FIAPDGK A
Sbjct: 249 WEHYCFTLDKHFLEKTAYPLLEGSARFLLDWLIEGHRGYLETNPSTSPEHYFIAPDGKEA 308
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
CVSYS+TMD++IIREVFSA+I +A++L K++ +V+++ K+LP L P K+A DG+IMEW
Sbjct: 309 CVSYSTTMDISIIREVFSALILSADILGKSDTNVVQRIKKALPNLPPMKVARDGTIMEWA 368
Query: 595 Q 595
Q
Sbjct: 369 Q 369
Score = 125 bits (315), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 59/85 (69%), Positives = 69/85 (81%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 41 PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100
Query: 72 LSDVRSLVDSGQYAEATAASVKLFG 96
LS VRSLV++G+Y EAT+A+ L G
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSG 125
>gi|357008575|ref|ZP_09073574.1| alpha-L-fucosidase [Paenibacillus elgii B69]
Length = 765
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 215/587 (36%), Positives = 328/587 (55%), Gaps = 40/587 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + ++ PA+ + +A+PIG GRLG MV+G V + ++LNED++W G P NPDA +
Sbjct: 8 LALWYSAPARRWEEALPIGGGRLGGMVFGTVGQDKIQLNEDSVWYGGPKKANNPDARANV 67
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
++R L+ G+ EA A + L P + YQ LGD+ L + H K + Y RELD
Sbjct: 68 PEIRRLLMEGKQQEAEHLARMALMSAPKYLHPYQPLGDLLL-YMLGHDK-PPQAYERELD 125
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDNHSYVN 188
L A RV+Y + V +TRE+FSS QV+ +++ + GSL+F+ + D S
Sbjct: 126 LERALVRVRYDMDGVRYTREYFSSAVHQVLAVRLTAARPGSLTFSTHMMRRPFDMGSQKY 185
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G + +IM G C +G++FS +L+ D ++ + D + VEG+D
Sbjct: 186 GEDTMIMYGEC-------------GTEGVRFSVVLKAVAEGD--SVKPIGDF-ISVEGAD 229
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
LLL A ++F DP + + + +L Y +L H +D+ + F RV
Sbjct: 230 AVTLLLAAGTTF---------RHDDPKAVCLEQIARAASLPYEELKRAHTEDHDRYFRRV 280
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
++L++ D ++E + ERVK + +DP LVE FQFGRYLL+S SRPG
Sbjct: 281 GLELAKPEPDAAASLPTDERL------ERVK--EGHDDPGLVETFFQFGRYLLLSCSRPG 332
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ A LQGIWN++ +P W+S +NIN +MNYW + C+L EC EPLFD + + NG
Sbjct: 333 SLAATLQGIWNDNYTPPWESKYTININTQMNYWPAEVCHLQECLEPLFDLIERMRENGRV 392
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G++ HH T++W + + V ++WPMG AWL HLWEHY + +DR FL
Sbjct: 393 TAREVYGCGGFMAHHNTNLWGDTHVEGIPVSASIWPMGAAWLSLHLWEHYRFGLDRSFLA 452
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
RAYP+++ A FLLD+L+E G L T PS SPE++F+ +G + + +MD I
Sbjct: 453 DRAYPVMKEAAQFLLDYLLEDEQGRLLTGPSISPENKFVLSNGVTGNLCMAPSMDSQIAF 512
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+F A AA VL +E A +++ +++ +L +I G IMEW++
Sbjct: 513 TLFDACREAAAVLGLDE-AFRQRLAEAMAKLPQPQIGRHGQIMEWLE 558
>gi|312792729|ref|YP_004025652.1| alpha-L-fucosidase [Caldicellulosiruptor kristjanssonii 177R1B]
gi|312179869|gb|ADQ40039.1| Alpha-L-fucosidase [Caldicellulosiruptor kristjanssonii 177R1B]
Length = 752
Score = 385 bits (988), Expect = e-104, Method: Compositional matrix adjust.
Identities = 226/590 (38%), Positives = 334/590 (56%), Gaps = 44/590 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ FN PA+ + +A+PIGNG LGAM++GGV ET++LNE+++W+ P NPDA K L
Sbjct: 6 LKVIFNKPARCWEEALPIGNGSLGAMIYGGVKYETIQLNEESIWSCGPRRRENPDAFKYL 65
Query: 73 SDVRSLVDSGQYAEATAASV-KLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R + G A SV L G H Y+ LG +++ F++ + Y R LD
Sbjct: 66 PEIRKTILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEEVESDKVK-NYTRYLD 124
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS----FNVSLDSLLDNHS 185
++ A +V++ V N+ + + +FSS PD+VIV KI S++G++S F +D
Sbjct: 125 ISNAICKVEFDVDNIRYKKIYFSSYPDKVIVVKICSSKTGAVSLRAKFRREYQEDIDKCG 184
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
V+ N++I E C + +G+ FSA+L+ +S D G + + D L V+
Sbjct: 185 KVD-NDKIFFE--CLA----------GEGRGVSFSAVLK-AVSKD-GDVYTIGDN-LFVK 228
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ +LL+ +++S+ +KD + + ++ + +LY RH +DY+ LF
Sbjct: 229 NATEVMLLITSTTSY---------KEKDYFNWCLKTVEQASKYVFENLYKRHTEDYKSLF 279
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + T+ + E I+ + + D L+ LLFQFGRYLLISSS
Sbjct: 280 SRVEFYIDTKDSSKCTELTTPERINLLREGYK--------DEELIVLLFQFGRYLLISSS 331
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC PLFD L + N
Sbjct: 332 RPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMPLFDLLEKMYEN 391
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TAQ Y G+ HH TDIW ++ + WPMG AWLC H+WEHY YT D +
Sbjct: 392 GKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYLPATYWPMGAAWLCLHIWEHYEYTGDIN 451
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL KR Y L++ A FLLD+LIE +GYL T PS SPE+ + +G++ ++Y TMD+
Sbjct: 452 FL-KRYYYLMKEAALFLLDYLIEDKNGYLVTCPSCSPENRY-KLNGEVYSLTYMPTMDIQ 509
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
II +F + A VL+ N D +VEK+ +L +L P KI + G I EW++
Sbjct: 510 IITALFEKVKKANNVLKLN-DEIVEKIEYALNKLPPIKIGKHGQIQEWIE 558
>gi|375145718|ref|YP_005008159.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361059764|gb|AEV98755.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 825
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 225/607 (37%), Positives = 343/607 (56%), Gaps = 35/607 (5%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NP 66
S + L + +N PA+ + +A+P+GNG +G M++G V E ++LNE TL++G P + NP
Sbjct: 23 SAQSGLSLWYNKPAEAWVEALPVGNGHIGGMIFGRVEEELIQLNESTLYSGGPVKQSINP 82
Query: 67 DAPKALSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
DA + L+ +R +L+ Y++A + K+ G+ + Y LGD+ L+ S Y+
Sbjct: 83 DAFQYLAPIREALLKEQDYSKANELAKKMQGYFTESYLPLGDLLLK--QSFNGRTPSAYQ 140
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL TA A +++V VE+TRE F S P V+V +I G++ +V+L+S L
Sbjct: 141 RRLDLQTAIATTRFTVDGVEYTREVFCSAPANVMVIRIRAGVPGAIDLSVALNSPLHYTI 200
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDP----------KGIQFSAILEIKISDDRGTIS 235
NN++IM G+ P P N D G++F +K GT++
Sbjct: 201 SAKANNEVIMSGKAPAHVDPSYYNPKDRQPVIYEDTAGCNGMRFQC--RVKAITKTGTVT 258
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A + L V+ + VL++ A++SF+G P K+ + + + + SY+ L
Sbjct: 259 A-DTLGLHVQHATELVLIVSAATSFNGFDKCPDKEGKNEQAIAAGLIDAAAKRSYTGLQQ 317
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENID-TVPSAERVKSFQTDE-DPSLVELL 353
H++D+Q+ F+RVS I+ DT + N + T+P +R++++ DP+L L
Sbjct: 318 DHVNDHQRYFNRVSF--------ILKDTGAASNTNSTLPVDKRLQAYSAGAYDPALETLY 369
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
+Q+GRYLLI++SRPG ANLQGIWN++L W S +NIN +MNYW + NLSE
Sbjct: 370 YQYGRYLLIAASRPGGPPANLQGIWNKELRAPWSSNYTININTQMNYWPAESTNLSEMHL 429
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW--AKSSADRGK--VVWALWPMGGAW 469
PL +L LS+ G++ A+ Y GWV HH +DIW A DRG VWA W MGG W
Sbjct: 430 PLLQWLKILSVTGARVAREFYHCDGWVAHHNSDIWGCANPVGDRGAGDPVWANWYMGGNW 489
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
LC HLWEHY +T D+ FL AYP+++ A F L+WL++ GY T PSTSPE++F
Sbjct: 490 LCQHLWEHYAFTQDKKFLAT-AYPIMKQAAVFTLNWLVKDSSGYWVTAPSTSPENKFRDE 548
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDG 588
G+ VS ++TMDM+IIR++F+ +I A+E L N D L L + + L P + G
Sbjct: 549 KGRAQAVSVATTMDMSIIRDLFTNVIEASEAL--NTDQLFRNRLTEVRKHLYPLRKGSKG 606
Query: 589 SIMEWVQ 595
++EW +
Sbjct: 607 ELLEWYK 613
>gi|332662390|ref|YP_004445178.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332331204|gb|AEE48305.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 801
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 225/586 (38%), Positives = 333/586 (56%), Gaps = 31/586 (5%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAPKALSDVRSL 78
PAKHF +++ +GNGR+GA+V GGV S+ + LN+ TLW G P D NP A L +R
Sbjct: 34 PAKHFEESLVLGNGRIGAVVHGGVKSDKIFLNDATLWAGSPVDPDMNPAAHTHLPAIREA 93
Query: 79 VDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
+ Y +A + + + L G ++ Y LG + + D +H + A YRR+LDL+TA +
Sbjct: 94 LRQEDYRKADSLNRRHLQGKFSESYAPLGTMYI--DMAHTETASN-YRRQLDLSTAISTT 150
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
Y V +TRE+F S+P QV++ +++ S+ G LSFN+ +SLL H N + G
Sbjct: 151 SYQQAGVTYTREYFISHPQQVLLIRMTASQLGKLSFNLRFNSLL-RHQVNTSTNVLNASG 209
Query: 198 RCPGKRIP-----PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
R P P P DD K ++F ++++I +D + D + V+G A++
Sbjct: 210 RAPAHAEPSYRRVPDPIQYDDQKSMRFLSLVKIIKTDGK---IVRTDSTIGVQGGKEAII 266
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
++ ++SF+G NP+ KD + + L+ + +SY+ + H+ D+Q+ F+RV QL
Sbjct: 267 MVSIATSFNGFDQNPALHGKDEVTLANEWLKKAQIISYATIKAAHIKDHQQFFNRVQFQL 326
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
+ + ++P+ ER+K F + +DP L L F FGRYLLI+SSR
Sbjct: 327 AGRSSNA-----------SLPTDERLKRFAEGAKDPDLELLYFNFGRYLLIASSRTPQVP 375
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN L P W S +NIN EMNYW + NLSE +PL FL L+ G+ TA+
Sbjct: 376 ANLQGIWNHHLQPPWSSNYTININTEMNYWPAESGNLSELHQPLLGFLGNLAKTGAVTAK 435
Query: 432 VNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
Y A GW H TDIWA S+ +G WA W MGGAWL THLWEH++YT D +L
Sbjct: 436 TFYNAGGWCAAHNTDIWAMSNPVGHFGQGSPSWANWNMGGAWLATHLWEHFDYTRDTIWL 495
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ Y L++G A F LD L++ G L T+PSTSPE+ FI P G Y +T D+ +I
Sbjct: 496 KTYGYGLMKGAAQFCLDILVDDGKGNLVTSPSTSPENIFITPSGYKGATLYGATADLGMI 555
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
RE+F I+AA+ L ++ D +++ SL +L P +I++ G + EW
Sbjct: 556 RELFLQTIAAAKTLVQDAD-FQQQLEASLSKLYPYQISKKGHLQEW 600
>gi|154489941|ref|ZP_02030202.1| hypothetical protein PARMER_00170 [Parabacteroides merdae ATCC
43184]
gi|154089383|gb|EDN88427.1| hypothetical protein PARMER_00170 [Parabacteroides merdae ATCC
43184]
Length = 846
Score = 384 bits (985), Expect = e-104, Method: Compositional matrix adjust.
Identities = 220/622 (35%), Positives = 327/622 (52%), Gaps = 50/622 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
T + PL + ++ PA+++ +A+PIGNGR GAMV+GGV E L+LNE+TL++G P
Sbjct: 20 GTPSKAPLTLWYDKPAQNWDEALPIGNGRAGAMVFGGVEKEQLQLNENTLYSGEPSVVFK 79
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEE 122
P+ V L+ +G+Y A+ K G YQ GD+ ++ +
Sbjct: 80 DVKITPEMFDKVVGLMKAGKYKTASDLVCKNWLGRLHQYYQPFGDLHIQNNKPG---DAA 136
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y+R L+++ A A Y V++ RE F+S+PD VIV + + ++ S
Sbjct: 137 GYKRALNISDAVATTVYKQNGVKYEREVFASHPDNVIVMHLKSDTPNGIDISLDFTSPHP 196
Query: 183 NHSYVNGNNQIIMEGRCPGK----------------RIPPKANANDD------------- 213
++++I+ G+ PG + P +AN
Sbjct: 197 TALQKGTDDRLILHGQAPGYVERRTFEQIEQWGDQYKHPELYDANGKRKFDKRMLYGDEI 256
Query: 214 -PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
KG+ F A L+ D + D + + +D +L ++SF+G +PS
Sbjct: 257 GGKGMFFEAQLKPVFPKD--GKCEITDAGIHIYNTDEVYFILSMATSFNGFDKSPSRDGI 314
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP++++ S L+ + Y L RH +DY+ LF RV +L SP+ +
Sbjct: 315 DPSAKAASILEKALSYDYQTLKQRHTEDYRSLFDRVDFELFSSPEQ-----------KAM 363
Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
P+ +R++ F + DP L LLFQFGRYL+IS SRP Q NLQGIWN+D P W+ +
Sbjct: 364 PTDKRLEQFAGNADPDLAALLFQFGRYLMISGSRPDGQPLNLQGIWNKDTIPAWNCGYTI 423
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
NIN EMNYW + NLSECQEPLF + LS++G++TA+ Y GWV HH T IW +S
Sbjct: 424 NINTEMNYWPAELTNLSECQEPLFRMIRELSVSGAETARNMYNRRGWVAHHNTSIWRESL 483
Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
+ + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLI+ +G
Sbjct: 484 PNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIDDGNG 543
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
+L T SPE+ FI DG+ A +S TMDMAIIRE F+ I+A+E+ +E + ++
Sbjct: 544 HLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIAASEMFNLDE-SFRNEL 602
Query: 573 LKSLPRLRPTKIAEDGSIMEWV 594
L RL P +I + G + EW+
Sbjct: 603 KDKLARLLPYQIGKRGQLQEWI 624
>gi|315500396|ref|YP_004089199.1| alpha-l-fucosidase [Asticcacaulis excentricus CB 48]
gi|315418408|gb|ADU15048.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
Length = 783
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 222/592 (37%), Positives = 327/592 (55%), Gaps = 45/592 (7%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
+ +N L + + PA +T+A+P+GNGRLGAMV+GG+ E L+LNEDTL+ G P NPD
Sbjct: 32 TASNDLTLWYREPANEWTEALPLGNGRLGAMVFGGIARERLQLNEDTLYAGAPYQPANPD 91
Query: 68 APKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
P AL ++R L+ G+Y EA A K G+P YQ +G++ L F S A Y
Sbjct: 92 GPAALPEIRKLIFEGKYLEAQALIQAKFMGNPMRQVSYQTIGEMTLTFGPSSNASA---Y 148
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
RRELDL A + V Y V +TRE F S DQV+V ++S + G +SF + ++
Sbjct: 149 RRELDLTKALSTVTYRQDGVTYTRETFISPVDQVLVMRLSADKPGKVSFQLGFETPQLGA 208
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ +I++ GR G N ++F + +++ G S D+ L V
Sbjct: 209 VTIESPQEIVLSGRNGGH--------NGKDGALRFES--RVRVVASGGQQSTGTDE-LVV 257
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+D A++ + A++++ + D D T+ + + + S+ LY+ HLD ++ +
Sbjct: 258 SGADSALVFMAAATNYK----SFRDVSGDATAITKDQITRAASRSFGALYSAHLDAHKAV 313
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F RVS+ R+ + +P+ ER+ T DP+L L FQ+GRYLLI+
Sbjct: 314 FDRVSVDFGRT------------EVADLPTNERIAKSLTLNDPALAALYFQYGRYLLIAC 361
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPGTQ ANLQG+WNE L+ W +NIN EMNYW + P L E EPL + +SI
Sbjct: 362 SRPGTQPANLQGLWNEKLNAPWGGKYTININTEMNYWPAEPTALPELTEPLIRMVREISI 421
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TA++ Y A GWV HH TD+W +++A + WP GGAWLC HLW+ Y+Y D
Sbjct: 422 TGAETAKIMYGARGWVAHHNTDLW-RATAPIDAAFYGTWPTGGAWLCLHLWDRYDYGRDP 480
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPE--HEFIAPDGKLACVSYSST 541
+L + YP+L+G + F LD L++ GY+ T PS SPE H+F G C T
Sbjct: 481 AYL-REIYPILKGASQFFLDTLVKDPASGYMVTAPSISPENQHKF----GTSICA--GPT 533
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
MDM IIR++F+ AAE+L K + + +VL +L P +I + G + EW
Sbjct: 534 MDMQIIRDLFANTARAAEIL-KTDKSFRAEVLAMRNKLVPNQIGKAGQLQEW 584
>gi|220928453|ref|YP_002505362.1| hypothetical protein Ccel_1020 [Clostridium cellulolyticum H10]
gi|219998781|gb|ACL75382.1| conserved hypothetical protein [Clostridium cellulolyticum H10]
Length = 759
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 220/585 (37%), Positives = 336/585 (57%), Gaps = 47/585 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PAK + +A+PIGNGRLGAMV+G V +E ++LNED++W G P D NPDA L+
Sbjct: 4 KLWYKSPAKEWNEALPIGNGRLGAMVYGCVKNENIQLNEDSIWYGDPIDRNNPDALANLA 63
Query: 74 DVRSLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEF--DDSHLKYAEETYRREL 128
++R+ + G+ EA +V L G P YQ LG+++L F D+S ++ Y REL
Sbjct: 64 EIRNFLSDGRIKEAEKLAVLSLSGVPESQRPYQTLGNLKLNFEIDESDIR----DYSREL 119
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF--NVSLDSLLDNHSY 186
D+ A A VK+ V +TRE+F+S DQVIV ++ G +SF N+ LDN
Sbjct: 120 DIENACASVKFVSKGVMYTREYFASAVDQVIVVRLFADAPGKISFTANMRRGRFLDNSGA 179
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
++G K I A+ D KG++F ++ ++ + G ++ + + L VE
Sbjct: 180 IDG------------KTIGMFASCGSD-KGVRFCSM--VRAVSEGGKVNTI-GENLIVEE 223
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D LL+ ++SF K+ ++ + L + +Y++L + H++DY +L+
Sbjct: 224 ADAVTLLISTATSF---------YHKEYETQCLKYLDGVEEKTYTELMSNHIEDYSQLYG 274
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
RV +++ + + + I ++ +AER++ ++ + D L L F FGRYLLIS S
Sbjct: 275 RVELEIGNAEE--------HDKIQSLDTAERLERLESGKPDHQLECLYFSFGRYLLISCS 326
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG+ ANLQGIWN+D+ P WDS +NIN EMNYW + CNLSEC PLFD + +
Sbjct: 327 RPGSLPANLQGIWNQDILPAWDSKYTININTEMNYWPAETCNLSECHFPLFDHIERMRAP 386
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+V Y SG+V HH TDIW ++ + WPMG AWL HLWEHY + +D++
Sbjct: 387 GRRTARVMYGCSGFVAHHNTDIWGDTAPQDIYIPATYWPMGAAWLSLHLWEHYEFGLDKE 446
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL K AYP+++ A F LD+LIE G L T+PS SPE+ +I +G+ C+ +MD
Sbjct: 447 FL-KDAYPVMKEAAQFFLDFLIEDSKGRLVTSPSVSPENTYILENGEKGCLCIGPSMDSQ 505
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
I+ +FS I A+ +L+ + + EK++K L +I G I
Sbjct: 506 ILYALFSGCIEASNILD-TDISFAEKLIKVRDSLPKPQIGRYGQI 549
>gi|423722949|ref|ZP_17697102.1| hypothetical protein HMPREF1078_01162 [Parabacteroides merdae
CL09T00C40]
gi|409241779|gb|EKN34546.1| hypothetical protein HMPREF1078_01162 [Parabacteroides merdae
CL09T00C40]
Length = 864
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 220/622 (35%), Positives = 327/622 (52%), Gaps = 50/622 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
T + PL + ++ PA+++ +A+PIGNGR GAMV+GGV E L+LNE+TL++G P
Sbjct: 38 GTPSKAPLTLWYDKPAQNWDEALPIGNGRAGAMVFGGVEKEQLQLNENTLYSGEPSVVFK 97
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEE 122
P+ V L+ +G+Y A+ K G YQ GD+ ++ +
Sbjct: 98 DVKITPEMFDKVVGLMKAGKYKTASDLVCKNWLGRLHQYYQPFGDLHIQNNKPG---DAA 154
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y+R L+++ A A Y V++ RE F+S+PD VIV + + ++ S
Sbjct: 155 GYKRALNISDAVATTVYKQNGVKYEREVFASHPDNVIVMHLKSDTPNGIDISLDFTSPHP 214
Query: 183 NHSYVNGNNQIIMEGRCPGK----------------RIPPKANANDD------------- 213
++++I+ G+ PG + P +AN
Sbjct: 215 TALQKGTDDRLILHGQAPGYVERRTFEQIEQWGDQYKHPELYDANGKRKFDKRMLYGDEI 274
Query: 214 -PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
KG+ F A L+ D + D + + +D +L ++SF+G +PS
Sbjct: 275 GGKGMFFEAQLKPVFPKD--GKCEITDAGIHIYNTDEVYFILSMATSFNGFDKSPSRDGI 332
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP++++ S L+ + Y L RH +DY+ LF RV +L SP+ +
Sbjct: 333 DPSAKAASILEKALSYDYQTLKQRHTEDYRSLFDRVDFELFSSPEQ-----------KAM 381
Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
P+ +R++ F + DP L LLFQFGRYL+IS SRP Q NLQGIWN+D P W+ +
Sbjct: 382 PTDKRLEQFAGNADPDLAALLFQFGRYLMISGSRPDGQPLNLQGIWNKDTIPAWNCGYTI 441
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
NIN EMNYW + NLSECQEPLF + LS++G++TA+ Y GWV HH T IW +S
Sbjct: 442 NINTEMNYWPAELTNLSECQEPLFRMIRELSVSGAETARNMYNRRGWVAHHNTSIWRESL 501
Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
+ + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLI+ +G
Sbjct: 502 PNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIDDGNG 561
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
+L T SPE+ FI DG+ A +S TMDMAIIRE F+ I+A+E+ +E + ++
Sbjct: 562 HLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIAASEMFNLDE-SFRNEL 620
Query: 573 LKSLPRLRPTKIAEDGSIMEWV 594
L RL P +I + G + EW+
Sbjct: 621 KDKLARLLPYQIGKRGQLQEWI 642
>gi|430751376|ref|YP_007214284.1| hypothetical protein Theco_3231 [Thermobacillus composti KWC4]
gi|430735341|gb|AGA59286.1| hypothetical protein Theco_3231 [Thermobacillus composti KWC4]
Length = 765
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 224/595 (37%), Positives = 331/595 (55%), Gaps = 39/595 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + PA + +A+PIGNGRLGAMV GG+ E L++NE+T W+G P DY P A + L
Sbjct: 1 MKLWYAKPASDWLEALPIGNGRLGAMVHGGMERERLQINEETFWSGGPHDYRRPGASRYL 60
Query: 73 SDVRSLVDSGQYAEATAA-SVKLFGHPADVYQLL--GDIELEFDDSHLKYAEETYRRELD 129
VR L+ + EA ++ G P ++ L D+ L F H Y RELD
Sbjct: 61 RQVRELIFQDKVEEAQQLFDERMKGDPELLHAFLPCCDMMLHFP-GHAD--GRDYYRELD 117
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVN 188
L+ A A +Y V V +TRE F S PDQ I+ +IS G + L + +
Sbjct: 118 LDRAVATTRYRVNGVTYTREVFCSYPDQAIIMRISSDCPGKIDMAGELAAANGEQRVRFA 177
Query: 189 GNNQIIMEGRCPGKR--IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
G++ +++ G+ GKR P + NA D G++F A ++ + G + E + L+V G
Sbjct: 178 GDDTLVLTGQA-GKREARPRRLNAGWDGPGVRFEA--RLRAFSEGGRVLRGE-QALEVRG 233
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D L+ A++SF +N DP +++ ++ ++ +Y +L RHL+DY L+
Sbjct: 234 ADAVTLIFSAATSF----VNYRSIDGDPGAKAAGVIERLQGKTYGELLGRHLEDYTALYR 289
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV ++L D P+ ERV+ + EDP L L +Q+GRYLLI+SSR
Sbjct: 290 RVELELGDGAGD------------GTPTDERVRMYAETEDPGLAALFYQYGRYLLIASSR 337
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+D P W S NIN++MNYW + NL EC PLFD + L I G
Sbjct: 338 PGGQPANLQGIWNDDPWPLWGSKWTTNINVQMNYWPAESGNLRECHLPLFDLIDDLRITG 397
Query: 427 SKTAQVNYLASGWVIHHKTDIW-AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
++TA+ +Y G+V+HH TD+W A + D A+WPMGG WL HLW+HY Y D+
Sbjct: 398 AETAETHYGCRGFVVHHNTDLWRAATPVDYDA---AVWPMGGVWLVQHLWDHYEYCPDQA 454
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHD-----GYLETNPSTSPEHEFIAPDGKLACVSYSS 540
FL R YP L A F+LD+L E + G L TNPS SPE+ +I G+ ++ ++
Sbjct: 455 FLRNRVYPALREAALFVLDYLTEAPEGTRLAGKLVTNPSYSPENHYIDDKGRRRYLTCAA 514
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD+ +IR++F + AAE+L +ED E + +++ RL +I + G + EW +
Sbjct: 515 TMDIQLIRDLFQRCMKAAEMLGVDEDFRGE-LEEAMARLPGMQIGKYGQLQEWAE 568
>gi|188991901|ref|YP_001903911.1| hypothetical protein xccb100_2506 [Xanthomonas campestris pv.
campestris str. B100]
gi|167733661|emb|CAP51866.1| conserved exported protein [Xanthomonas campestris pv. campestris]
Length = 790
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 230/596 (38%), Positives = 327/596 (54%), Gaps = 46/596 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + + L + + PA + A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D TN
Sbjct: 38 AVAPDDALHLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATN 97
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
P A AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 98 PQALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 154
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 155 EYRRQLDLDTAVATTTFRSGGAVHRREVFVSAQSQCIVVRLSCDRPGGISLRVGIDSPQS 214
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDK 240
V ++ GR N GI + L + G+++A+ D+
Sbjct: 215 GEVTVE-QGSLLFSGR------------NGSFAGIDGKLRFALRVLPQVKGGSVTAVRDR 261
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L+++G+D VLLL A++S+ + DP + + ++LQ LSY+ L HL D
Sbjct: 262 -LRIQGADEVVLLLTAATSYQ----RFDAVEGDPLALTAASLQKAGKLSYAALLRAHLAD 316
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
+Q+LF RV+I L S T+P+ ERV+ F DP+L L Q+GRYL
Sbjct: 317 HQRLFRRVAIDLGSS------------EAATLPTDERVQRFAEGNDPALAALYHQYGRYL 364
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 365 LICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLF 424
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 425 DLARTGAHTARALYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDY 483
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C
Sbjct: 484 GRDRAYLAK-IYPLFKGAAEFFVATLVRDPGTGAMVTNPSMSPENQH--PFGAAVCA--G 538
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD ++R++F+ I+ +++L+ + AL +++ +L P +I + G + EW Q
Sbjct: 539 PTMDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQ 593
>gi|374324082|ref|YP_005077211.1| alpha-L-fucosidase [Paenibacillus terrae HPL-003]
gi|357203091|gb|AET60988.1| alpha-L-fucosidase [Paenibacillus terrae HPL-003]
Length = 772
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 232/603 (38%), Positives = 338/603 (56%), Gaps = 49/603 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N I FN PA+ + +AIPIGNG LG M++G E ++LNED+LW G P D NP + +
Sbjct: 2 NEKMIWFNQPAEKWEEAIPIGNGTLGGMIFGKTSIERIQLNEDSLWYGGPMDRNNPHSFE 61
Query: 71 ALSDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRE 127
L ++RSL+ SGQ +A ASV L G P Y+ LGD+ L D + + YRR+
Sbjct: 62 YLDEIRSLLFSGQIKQAEELASVALVGVPDGQRHYESLGDLYLNIGDGEEEIKD--YRRQ 119
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV------------ 175
LDL+ V Y V V + RE+FSS PDQV+V +++ SE G+LSF+
Sbjct: 120 LDLDHGIVSVNYRVNQVNYCREYFSSFPDQVLVVRLNSSEYGALSFSALFGRGIVLEPTP 179
Query: 176 ---SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
L + H+Y++ +E R P I + ++ GI+F + I+I + G
Sbjct: 180 WSDVLKHPVGLHAYLDR-----IETRSPADLIIRGRSGGEE--GIRFCCV--IRIVTEEG 230
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
IS + +L ++ + A +L+ A + F P K+ +E + L SY
Sbjct: 231 QIS-YSNGQLSLKDVNAATILVSACTDFRIP-------KEQMEAECICRLDRAAGKSYDQ 282
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
L T H++DYQ LF RV + L + V T + + T ER+K+ ED L+ L
Sbjct: 283 LRTGHIEDYQALFGRVELSLQGN----VDSTSTSSFLTTDQRLERIKN--GAEDNELISL 336
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
FQFGRYLLISSSRPG+ ANLQGIWN+D+ P WDS +NIN +MNYW + CNL+EC
Sbjct: 337 YFQFGRYLLISSSRPGSLPANLQGIWNKDMLPIWDSKYTININTQMNYWPAEICNLAECH 396
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PL DF+ + G +TA++ Y G+V HH +DIWA ++ + W MG AWL
Sbjct: 397 IPLIDFIDRMQERGKETARIMYRCRGFVAHHNSDIWADTAPQDVCITSTFWTMGAAWLSL 456
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
HLW+HY + D FL K AY ++ A FLLD+LIE G L +PS+SPE+ ++ P+G+
Sbjct: 457 HLWDHYEFGQDASFL-KEAYDTMKEAAFFLLDYLIEDPYGNLVISPSSSPENRYVLPNGE 515
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSI 590
+ Y ++MD IIRE+F I + +L+++++ A++ K LK +P+L + + G I
Sbjct: 516 SGALCYGASMDSQIIRELFERCIKSTIILQEDQEFGAMLRKALKRIPKL---AVGKHGQI 572
Query: 591 MEW 593
EW
Sbjct: 573 QEW 575
>gi|294146663|ref|YP_003559329.1| hypothetical protein SJA_C2-02340 [Sphingobium japonicum UT26S]
gi|292677080|dbj|BAI98597.1| hypothetical protein SJA_C2-02340 [Sphingobium japonicum UT26S]
Length = 777
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 226/590 (38%), Positives = 327/590 (55%), Gaps = 43/590 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
+PL + + PA +T+A+PIGNGRLGAM++GGV E L+LNE TLW G P D NP+A
Sbjct: 33 HPLTLWYRQPAAAWTEALPIGNGRLGAMLFGGVARERLQLNEGTLWAGQPYDPVNPEAKA 92
Query: 71 ALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
L VR L+ +G+ AEA A + K L P YQ LGD+ L+F A Y RE
Sbjct: 93 NLPQVRELIFAGRIAEAEALADKTLMAKPLAQMPYQTLGDLILDFPGVGQATA---YHRE 149
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDNHSY 186
LDL++ATA +++ G V R+ +S D VI +S +G L ++SL S +
Sbjct: 150 LDLDSATATTRFTAGGVAHVRQAIASPADNVIAVHLS--STGRLDVDISLRSSQIGVQVA 207
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+G N +++ GR R + N ++F+A L ++ T SA D L + G
Sbjct: 208 ADGPNGLLLTGRNGASR---GIDGN-----LRFAARLAARVEGGHATHSA--DGSLSIRG 257
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ LLL ++ F D DP + + + L R+ S++ + T D +++LF
Sbjct: 258 AKSVTLLLAMATGFR----RFDDVGGDPVAGTAATLARARDRSFATIATDAADAHRRLFR 313
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L +P +P+ R+ QT +DP+L L F + RYLLI SSR
Sbjct: 314 RVTLDLGSTPAA------------QLPTDRRIADSQTSDDPALAALYFHYARYLLICSSR 361
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG+WN+ L P W S +NIN +MNYW + P L EC PL + + L++ G
Sbjct: 362 PGGQPANLQGLWNDSLDPPWGSKYTININTQMNYWPAEPAALGECVAPLVEMVRDLAVTG 421
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++TA+ Y A GWV HH TD+W +++A + LWP GGAWLC HLW+HY+Y DR +
Sbjct: 422 ARTARSMYGARGWVAHHNTDLW-RATAPIDGAQFGLWPTGGAWLCMHLWDHYDYHRDRAY 480
Query: 487 LEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L YPL+ G A F LD L + G+L TNPS SPE+ P G + TMDMA
Sbjct: 481 LAS-VYPLMAGAARFFLDTLQRDPASGFLVTNPSMSPEN----PHGHGGTICAGPTMDMA 535
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R++F+ + AA +L+++ +LV ++ + RL P +I G + EW Q
Sbjct: 536 ILRDLFTRTMEAAAILDRDA-SLVAEMRAARDRLAPYRIGRQGQLQEWQQ 584
>gi|384427644|ref|YP_005637003.1| hypothetical protein XCR_1996 [Xanthomonas campestris pv. raphani
756C]
gi|341936746|gb|AEL06885.1| expressed protein [Xanthomonas campestris pv. raphani 756C]
Length = 764
Score = 382 bits (982), Expect = e-103, Method: Compositional matrix adjust.
Identities = 231/594 (38%), Positives = 331/594 (55%), Gaps = 42/594 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + + L + + PA + A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D TN
Sbjct: 12 AVAPDDALHLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATN 71
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
P A AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 72 PQALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 128
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 129 EYRRQLDLDTAVATTTFRSGGAVQRREVFVSAQSQCIVVRLSCDRPGGISLRVGIDSPQS 188
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
V ++ GR + A D K ++F+ L + G+++A+ D+ L
Sbjct: 189 GEVTVE-QGSLLFSGRN-------GSFAGIDGK-LRFA--LRVLPQVKGGSVTAVRDR-L 236
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+++G+D VLLL A++S+ + DP + + ++LQ LSY+ L HL D+Q
Sbjct: 237 RIQGADEVVLLLTAATSYQ----RFDAVEGDPLALTAASLQKAGKLSYAALLRAHLADHQ 292
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+LF RV+I L S T+P+ ERV+ F DP+L L Q+GRYLLI
Sbjct: 293 RLFRRVAIDLGSS------------EAATLPTDERVQRFAEGNDPALAALYHQYGRYLLI 340
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L L
Sbjct: 341 CSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDL 400
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 401 ARTGAHTARAMYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGR 459
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C T
Sbjct: 460 DRAYLAK-IYPLFKGAAEFFVATLVRDPGTGAMVTNPSMSPENQH--PFGAAVCA--GPT 514
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L+ + AL +++ +L P +I + G + EW Q
Sbjct: 515 MDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQ 567
>gi|374991896|ref|YP_004967391.1| hypothetical protein SBI_09142 [Streptomyces bingchenggensis BCW-1]
gi|297162548|gb|ADI12260.1| hypothetical protein SBI_09142 [Streptomyces bingchenggensis BCW-1]
Length = 822
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 233/596 (39%), Positives = 333/596 (55%), Gaps = 45/596 (7%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A T+ L + + PA + +A+PIGNGRLGAMV+GG +E L+LNEDT+W G P D
Sbjct: 49 AGGTTLPGELTLWYPRPASEWLEALPIGNGRLGAMVFGGTDTERLQLNEDTVWAGGPYDP 108
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLF-GHPAD--VYQLLGDIELEFDDSHLKYA 120
NP L ++R V +G++ +A A F G+P YQ +GD+ L F +
Sbjct: 109 ANPQGLSNLPEIRRRVFAGEWGDAQALIDSTFMGNPLSELPYQTVGDLRLTFSS---QGE 165
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRRELD+++AT V+Y+ V + RE +S+PDQVI +++ GS+SF + DS
Sbjct: 166 VSDYRRELDIDSATTSVRYTQSGVTYRREIIASHPDQVIALRLTADTPGSISFTAAFDSP 225
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALED 239
I ++G G ++F A+ + + GT+ + ED
Sbjct: 226 QSVTGSSPDRITIAIDG---------TGQTRSGITGQVRFRAL--ARACAEGGTVGS-ED 273
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
KL V G+D A LL+ +S+ F NP+ D T+ + + L + ++ ++ L RH D
Sbjct: 274 GKLTVAGADSATLLVSIGTSYTD-FGNPT---GDHTARAAAPLNAASDVPFTTLRKRHTD 329
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DY++LF RV++ L + + +P+ ERVK+F + DP LV L +QFGRY
Sbjct: 330 DYRRLFRRVTLDLGST------------DAAKLPTDERVKNFASASDPQLVSLHYQFGRY 377
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLIS SRPGTQ ANLQGIWN+ LSP W +NIN EMNYW + NL EC EP+FD L
Sbjct: 378 LLISCSRPGTQPANLQGIWNDLLSPPWSCRYTININTEMNYWPAPVTNLLECWEPVFDML 437
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LS++G++TA+ Y A GWV HH D W + +A + + WP GGAWL T +W+HY
Sbjct: 438 ADLSVSGARTARTQYGARGWVAHHNVDGW-RGTAPCDQAFYGTWPTGGAWLATSIWDHYL 496
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
+T D++ L KR YP+L G F LD L+ + G+L T PS SPEH PD A V
Sbjct: 497 FTGDKEALRKR-YPVLRGAVLFFLDTLVTDPSSGHLVTCPSMSPEHAH-HPD---ASVCA 551
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEW 593
TMD I+R+VF + A+E+L ++ D E + ++ +L P KI G + EW
Sbjct: 552 GPTMDNQILRDVFDGFVIASELLGEDADMRAEARTVRG--KLPPMKIGAQGQLQEW 605
>gi|337748975|ref|YP_004643137.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|379721944|ref|YP_005314075.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|386724687|ref|YP_006191013.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
gi|336300164|gb|AEI43267.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|378570616|gb|AFC30926.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|384091812|gb|AFH63248.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
Length = 786
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 214/586 (36%), Positives = 319/586 (54%), Gaps = 40/586 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA+ +TDA+P+GNGRLGAMV+G V E L++NED++W G P + NPD K L
Sbjct: 11 KLWYEKPARAWTDALPVGNGRLGAMVFGKVNQERLQINEDSVWYGGPLNGDNPDGRKYLP 70
Query: 74 DVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+VR L+ G+ EA AA + L P + YQ LGD+ + D K Y R+LD+
Sbjct: 71 EVRRLLLKGKQLEAEEAAQMGLMSIPKSMRPYQPLGDLHIYHDGE--KKMISNYYRDLDI 128
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVNG 189
A V Y + V RE FSS D V+ +I+ L+ +++ D +
Sbjct: 129 EEGIAHVSYCLNEVPHVREVFSSAVDGVLAVRITCGPDAKLNLRMNVSRRPFDEGTQQLA 188
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ I M G + G+ + + +K + G ++A D L V ++
Sbjct: 189 HDTIAMCG-------------ENGKNGVTY--CMAVKAVPEGGWVNAFGDF-LAVRDANA 232
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ + ++F DP +E + L+ Y + H+ D++ L+ RV+
Sbjct: 233 VTIYIAGGTTF---------RSDDPLAECVRQLEQAERKGYEAVRRDHVADHRSLYRRVN 283
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
++L P S + T+P+ R++ F + EDP L L FQ+GRYL+++SSRPG
Sbjct: 284 LELDPEP-------VSGPDPSTLPTDARLQRFREGGEDPGLFRLYFQYGRYLMMASSRPG 336
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ ANLQGIWNE +P W+S +NIN EMNYW + CNL EC EPLFD + + NG K
Sbjct: 337 SNPANLQGIWNESFTPPWESKYTININTEMNYWPAESCNLPECHEPLFDLIDRMRPNGRK 396
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G+V HH TD+W + + + ++WPMG AWL HLWEHY Y ++ FL
Sbjct: 397 TAEQLYGCRGFVAHHNTDMWGSTQVEGNYMPGSIWPMGAAWLSLHLWEHYRYGLEETFLR 456
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
+RAYP+++ A F LD+L E +G L T PSTSPE++FI PDG + ++ +MD+ I+
Sbjct: 457 ERAYPVMKEAAEFFLDYLFEDKEGRLVTGPSTSPENKFIMPDGSVGTLTIGPSMDIQIVY 516
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ SA AAE+L + +D L EK + L RL P +I G + EW
Sbjct: 517 SLLSACTDAAEIL-RTDDLLREKWEEVLRRLPPPQIGRHGQLQEWT 561
>gi|344997079|ref|YP_004799422.1| alpha-L-fucosidase [Caldicellulosiruptor lactoaceticus 6A]
gi|343965298|gb|AEM74445.1| alpha-L-fucosidase [Caldicellulosiruptor lactoaceticus 6A]
Length = 752
Score = 382 bits (980), Expect = e-103, Method: Compositional matrix adjust.
Identities = 224/590 (37%), Positives = 332/590 (56%), Gaps = 44/590 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ FN PA+ + +A+PIGNG LGAM++GGV ET++LNE+++W+ P NPDA K L
Sbjct: 6 LKVIFNKPARCWEEALPIGNGSLGAMIYGGVKYETIQLNEESIWSCGPRRRENPDAFKYL 65
Query: 73 SDVRSLVDSGQYAEATAASV-KLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R + G A SV L G H Y+ LG +++ F++ + Y R LD
Sbjct: 66 PEIRKTILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEEVESDKVK-NYTRYLD 124
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS----FNVSLDSLLDNHS 185
++ A +V++ V N+ + + +FSS PD+VIV KI S++G++S F +D
Sbjct: 125 ISNAICKVEFDVDNIRYKKIYFSSYPDKVIVVKICSSKTGAVSLRAKFRREYQEDIDKCG 184
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
V+ N++I E C + +G+ FSA+L+ +S D G + + D L V+
Sbjct: 185 KVD-NDKIFFE--CLA----------GEGRGVSFSAVLK-AVSKD-GDVYTIGDN-LFVK 228
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ +LL+ +++S+ +KD + + ++ + +LY RH +DY+ LF
Sbjct: 229 NATEVMLLITSTTSY---------KEKDYFNWCLKTVEQASKYVFENLYKRHTEDYKSLF 279
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + T+ + E I+ + + D L+ LLFQFGRYLLISSS
Sbjct: 280 SRVEFYIDTKDSSKCTELTTPERINLLREGYK--------DEELIVLLFQFGRYLLISSS 331
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC PLFD L + N
Sbjct: 332 RPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMPLFDLLEKMYEN 391
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TAQ Y G+ HH TDIW ++ + WPMG AWLC H+W+HY YT D +
Sbjct: 392 GKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHIWDHYEYTGDLE 451
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL K Y L+ A FLLD+LIE +GYL T PS SPE+ + +G++ ++Y TMD+
Sbjct: 452 FL-KEYYYLMREAALFLLDYLIEDRNGYLVTCPSCSPENRY-KLNGEVYSLTYMPTMDIQ 509
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
II +F + A VL+ N D +VEK+ +L +L P KI + G I EW++
Sbjct: 510 IITALFEKVKKANNVLKLN-DEIVEKIEYALNKLPPIKIGKHGQIQEWIE 558
>gi|21231206|ref|NP_637123.1| hypothetical protein XCC1756 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66768787|ref|YP_243549.1| hypothetical protein XC_2479 [Xanthomonas campestris pv. campestris
str. 8004]
gi|21112850|gb|AAM41047.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66574119|gb|AAY49529.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 790
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 230/594 (38%), Positives = 331/594 (55%), Gaps = 42/594 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + + L + + PA + A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D TN
Sbjct: 38 AVAPDDALHLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATN 97
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
P A AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 98 PQALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 154
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 155 EYRRQLDLDTAVATTTFRSGGAVHRREVFVSAQSQCIVVRLSCDRPGGISLRVGIDSPQS 214
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
V ++ GR + A D K ++F+ L + G+++A+ D+ L
Sbjct: 215 GEVTVE-QGSLLFSGRN-------GSFAGIDGK-LRFA--LRVLPQVKGGSVTAVRDR-L 262
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+++G+D VLLL A++S+ + DP + ++++LQ LSY+ L HL D+Q
Sbjct: 263 RIQGADEVVLLLTAATSYQ----RFDAVEGDPLALTVASLQKAGKLSYAALLRAHLADHQ 318
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+LF RV+I L S +P+ ERV+ F DP+L L Q+GRYLLI
Sbjct: 319 RLFRRVAIDLGSS------------EAARLPTDERVQRFAEGNDPALAALYHQYGRYLLI 366
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L L
Sbjct: 367 CSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDL 426
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 427 ARTGAHTARALYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGR 485
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C T
Sbjct: 486 DRAYLAK-IYPLFKGAAEFFVATLVRDPGTGAMVTNPSMSPENQH--PFGAAVCA--GPT 540
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L+ + AL +++ +L P +I + G + EW Q
Sbjct: 541 MDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQ 593
>gi|240144516|ref|ZP_04743117.1| alpha-L-fucosidase 2 [Roseburia intestinalis L1-82]
gi|257203465|gb|EEV01750.1| alpha-L-fucosidase 2 [Roseburia intestinalis L1-82]
Length = 741
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 222/586 (37%), Positives = 320/586 (54%), Gaps = 54/586 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PAK + +A+P+GNGR+GAM++GGV E +++NE+++W G P D NPDA L ++R
Sbjct: 6 YKEPAKVWEEALPLGNGRIGAMIFGGVEQERIQVNEESIWYGGPVDRNNPDAKAHLEEIR 65
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIEL---EFDDSHLKYAEETYRRELDL 130
+ G+ EA ++ + G P + YQ LGDI + +D E Y+R L+L
Sbjct: 66 QHIFEGRLKEAQRLMNLTMSGCPDSMHPYQTLGDINIYSSGIEDV------ENYKRSLNL 119
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A V++ +V F RE F S P +V + + +S +SF +L Y +G
Sbjct: 120 EEAVCLVEFDSRSVHFKREMFLSYPKDCLVIRFTADKSSQISFQANLS----RGRYFDGI 175
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N++ G C N G F ++ IK G SA+ L V+G+D
Sbjct: 176 NKLGENGIC--------LYGNLGRGGSDF--VMGIKAWAKGGVASAV-GGNLCVQGADEV 224
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN----LSYSDLYTRHLDDYQKLFH 306
+L A+SSF K E + ++ N L+Y +L+ H +DY+ LF
Sbjct: 225 LLTFCAASSF---------RNKKKCDELLREIEEKMNNAAMLTYEELFEEHKEDYRTLFA 275
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSS 365
RV QL E D +P+ ER+ ++ + D L ++LF +GRYLLIS S
Sbjct: 276 RVEFQLD-----------GVEKFDVIPTNERIERAAKETPDIGLSKMLFDYGRYLLISCS 324
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG A LQGIWN+D +P W+S +NIN EMNYW + CNLSEC PLFD L + N
Sbjct: 325 RPGGLPATLQGIWNQDFTPPWESKYTININTEMNYWLAESCNLSECHMPLFDLLERMVEN 384
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+ Y G+V HH TDI ++ W MG AWLCTHLW HY YT+DR+
Sbjct: 385 GRRTAEKMYGCRGFVAHHNTDIHGDTAPQDTWYPATYWVMGAAWLCTHLWTHYEYTLDRE 444
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FLE R+YP++ A F +D+L+E DGYL T PS SPE+ + P+G++ VSY +TMD
Sbjct: 445 FLE-RSYPIMCEAALFFIDFLVE-KDGYLVTCPSLSPENTYCLPNGEMGAVSYGATMDNQ 502
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
I+R++FS ++A ++L+ A +EK L +L PT+I DG IM
Sbjct: 503 ILRDLFSQCLAAGKILQATNSAFLEKAEYVLQKLLPTRIGSDGRIM 548
>gi|345013386|ref|YP_004815740.1| large hypothetical protein [Streptomyces violaceusniger Tu 4113]
gi|344039735|gb|AEM85460.1| large secreted protein [Streptomyces violaceusniger Tu 4113]
Length = 805
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 226/590 (38%), Positives = 316/590 (53%), Gaps = 41/590 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PL + + PA + A+P+GNGRLGAMV+G +E L+LN DTLW G P Y N
Sbjct: 44 RPLALWYREPAADWLSALPLGNGRLGAMVFGATETERLQLNADTLWAGGPHSYDNHKGLA 103
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
AL +R LV G++ EA T + G P YQ +G + L A YRRE
Sbjct: 104 ALPRIRQLVFDGKWPEAETLINSDFLGVPGGQAQYQTVGSLLLSLPTGG---AVTGYRRE 160
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL++A A Y+ V FTRE F+S PD+VIV ++S S+ G+LSF + +S L
Sbjct: 161 LDLDSAVATTTYTRDGVTFTREAFASAPDRVIVVRLSASKKGALSFGATFESPLRTSLSS 220
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++G +A G + F A++ + + + V G
Sbjct: 221 PDPLTAALDG---------TGDATGGVDGAVGFRALVRVLAEG---GTTTSAGGTVTVRG 268
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D A +L+ +++ +N ++ D ++ + L N Y L +RH+DD++ LF
Sbjct: 269 ADAATVLVAIGTTY----VNWENANGDAAGQAAADLNPAANRPYGQLRSRHVDDHRALFR 324
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R S+ + + +P+ ERV F + DP LVEL FQ+GRYLLI++SR
Sbjct: 325 RTSLDVGSG------------DAAALPTDERVSRFASGGDPQLVELHFQYGRYLLIAASR 372
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ A LQGIWN+ SP W S +NIN EMNYW + P NL EC EP+F L L++ G
Sbjct: 373 PGTQPATLQGIWNDLTSPPWGSKYTININTEMNYWPAAPANLLECWEPVFALLDELAVAG 432
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TA+ Y A GWV HH TD+W + +A W +WPMGGAW+ +WEHY YT D +
Sbjct: 433 RSTARTQYGADGWVTHHNTDVW-RGTAPVDGAFWGMWPMGGAWMSMAIWEHYRYTRDTEK 491
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L R YP+L+G A F LD L+ + G L T PS SPE+ + G C TMDM
Sbjct: 492 LRAR-YPVLKGAAQFFLDALVTDPATGALVTCPSVSPENAHHSGGGGSLCA--GPTMDMQ 548
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++R++F A+ SAA+ L + AL ++VL + RL P KI G + EW Q
Sbjct: 549 LLRDLFGAVASAADTL-GTDAALRDQVLAARGRLAPMKIGAQGRLQEWQQ 597
>gi|237722004|ref|ZP_04552485.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|423291145|ref|ZP_17269993.1| hypothetical protein HMPREF1069_05036 [Bacteroides ovatus
CL02T12C04]
gi|229448873|gb|EEO54664.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|392664179|gb|EIY57721.1| hypothetical protein HMPREF1069_05036 [Bacteroides ovatus
CL02T12C04]
Length = 792
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 226/598 (37%), Positives = 327/598 (54%), Gaps = 50/598 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ +N PA + +A+PIGNGR+GAMV+G E +LNE+++W+G P D+ NP A AL
Sbjct: 27 KLWYNAPATVWEEALPIGNGRIGAMVYGNPLQEVYQLNEESIWSGYPQDWNNPKAANALP 86
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VR VD G YA+A+ K + L L D A YR EL+++ A
Sbjct: 87 QVREAVDRGDYAKASEL-WKANAQGPYTARYLPMANLMLDQLTRGEARNLYR-ELNISNA 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+ V Y V++ R F S PDQV+V KI+ ++S ++ L+SLL G +
Sbjct: 145 LSTVTYEADGVKYRRTSFISYPDQVMVIKIAADRPQAVSLHIRLNSLLRYTVQTKGEKTL 204
Query: 194 IMEGRCPG----KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
I+ G+ P + P DD +G QF +++++ D G A D L V ++
Sbjct: 205 ILNGKAPAYVANRDYDPHQVVYDDKRGTQFK--VQVELLPDGGHCEA-NDSALTVRNANE 261
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
VLLL A + F + K+ Y +L RH DD+Q+LF+R+
Sbjct: 262 VVLLLSAVTDFGNKKMTLKKCKR----------------PYQELLQRHTDDHQQLFNRLQ 305
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
+ L T+ +E +P+ ER+KSF+ D D L EL +Q+GRYLLI+SSRPG
Sbjct: 306 LSLG-------TENLQKE---ALPTNERLKSFEQDPTDNGLTELYYQYGRYLLIASSRPG 355
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIWN + P W S NIN EMNYW + NL EC PL DF+ L++NG++
Sbjct: 356 GLPANLQGIWNRHVQPPWGSNYTTNINTEMNYWPAEITNLPECFLPLSDFIGRLAVNGAQ 415
Query: 429 TAQVNY-LASGWVIHHKTDIWAKS-------SADRGKVVWALWPMGGAWLCTHLWEHYNY 480
TA+VNY + GW+ HH +D+WA++ S +G W+ WPM G WLC HLWEHY +
Sbjct: 416 TAKVNYGINRGWLAHHNSDVWAQTAPTGGYDSDPKGAPRWSCWPMAGVWLCQHLWEHYAF 475
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEF--IAPDGKL--AC 535
D+ +L K AYPL++G A FLL WL + + GY TNPSTSPE+ F I +GK
Sbjct: 476 GGDKKYLSKTAYPLMKGAAEFLLQWLQKDPETGYWITNPSTSPENRFRYIDKEGKKQNGE 535
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+S SS MD+ + ++ + I A+ VL+ ++ A ++ + L+P +I G ++EW
Sbjct: 536 ISRSSGMDLGLAWDLLTNCIEASTVLDTDK-AFRQQCMDVRANLQPFRIGSKGQLLEW 592
>gi|333379822|ref|ZP_08471540.1| hypothetical protein HMPREF9456_03135 [Dysgonomonas mossii DSM
22836]
gi|332884726|gb|EGK04982.1| hypothetical protein HMPREF9456_03135 [Dysgonomonas mossii DSM
22836]
Length = 813
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 227/586 (38%), Positives = 335/586 (57%), Gaps = 42/586 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PAK + +A+P+GN RLGAMV+G E L+LNE+T+W G P +P+ K L
Sbjct: 24 KLLYKRPAKEWVEALPLGNSRLGAMVFGNPAREQLQLNEETMWGGGPHRNDSPNMLKVLD 83
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+VRSL+ +G+ EA A K P + YQ +G + L+F H KY+ Y R+LDL
Sbjct: 84 EVRSLIFAGKEKEAEALLEKNMRTPHNGMPYQTIGSLYLDFA-GHNKYS--NYSRQLDLT 140
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA A KY+V + +TRE FSS D VI+ +I+ + S+SF DS + ++ +
Sbjct: 141 TAVATTKYTVDGINYTREVFSSFTDNVIIMRITADKPNSISFTAGYDSPVKDYKVQAKGD 200
Query: 192 QIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
++I++G ++ KG I+F +IK G +E KL V+ ++
Sbjct: 201 KLILKGM---------GAEHEGIKGVIRFENQTQIKT---EGGSVKVESNKLSVKAANSV 248
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + +++F +N D + ++ + L++ + Y H+ Y+K F RVS+
Sbjct: 249 VIYISIATNF----VNYQDVSANESTSATHFLKTAISKPYEKALADHIKYYKKQFDRVSL 304
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L +S D+ EE + RV++F+ +D SLV LLFQFGRYLLISSS+PG Q
Sbjct: 305 DLGKS------DSILEE------TDVRVRNFKEGKDQSLVTLLFQFGRYLLISSSQPGGQ 352
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ L P WDS +NIN EMNYW + NLSE +PLF L L++ G +TA
Sbjct: 353 PANLQGIWNDQLVPPWDSKYTININTEMNYWPAEVTNLSETHQPLFQMLKELAVTGQETA 412
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+V Y A+GWV HH TD+W + G +WP GGAWL H+W+HY YT D+ FL K
Sbjct: 413 KVMYNANGWVAHHNTDLWRTTGPVDG-AFHGMWPNGGAWLSQHMWQHYLYTGDKSFL-KE 470
Query: 491 AYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
AYP+L+G A F LD+L+E H Y + T+PSTSPE P GK ++ STMD I+
Sbjct: 471 AYPVLKGAADFFLDFLVE-HPTYKWMVTSPSTSPEQ---GPPGKNTSITAGSTMDNQIVF 526
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+V + + A++ L ++A +K+ + RL P +I + + EW+
Sbjct: 527 DVLNNALEASKTLGVGDEAYNQKLEDMISRLAPMQIGKYNQLQEWL 572
>gi|340619498|ref|YP_004737951.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339734295|emb|CAZ97672.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 792
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 228/600 (38%), Positives = 333/600 (55%), Gaps = 53/600 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N K+ + PA + +A+PIGNG+LGAMV+GGV SE L+LNE+++W G P A K
Sbjct: 34 NGNKLWYTQPAADWMEALPIGNGKLGAMVFGGVESERLQLNEESVWAGPPIPENRVGAFK 93
Query: 71 ALSDVRSLVDSGQYAEATAASV-KLFGH--PADVYQLLGDIELEFDDSHLKYAEETYRRE 127
++ R+L+ G Y EA + G YQ LG++ L F+ LK + YRRE
Sbjct: 94 SIEKARALIFQGDYLEANKVMQDNVMGERIAPRSYQPLGNLILNFN---LKGSPTDYRRE 150
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL A A+ ++V V +TRE+FSS + IV ++ ++ ++S + +D D
Sbjct: 151 LDLKRAIAKTDFTVNGVRYTREYFSSAIENTIVVVLTANQPKAISLELKMDRKADFEVAG 210
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTISALEDKKLKVEG 246
G N++ M G+ KG E ++ + +G + E+ +K+
Sbjct: 211 VGKNRLRMWGQA-------------SQKGKHLGVKYETQVMALPKGGKMSSENGNIKITA 257
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDP--------TSESMSALQSIRNLSYSDLYTRHL 298
++ VLL+ A + ++ KKDP ++ S L+ S L H+
Sbjct: 258 ANSVVLLVSAKTDYN---------KKDPFSPFTENLSTACASVLKKTARKSVKKLKEEHI 308
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DDYQ F+RV + L P + D + E ++ V + +DP L+EL FQ+GR
Sbjct: 309 DDYQHYFNRVVLDLGSFPGE---DKPTNERLEAVINGA--------DDPGLMELYFQYGR 357
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPG+ ANLQGIWN+ L+ W+S H NIN++MNYW + NLSEC EP F+F
Sbjct: 358 YLLISSSRPGSLPANLQGIWNDHLAAPWNSDYHTNINMQMNYWPAEVANLSECHEPFFEF 417
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L +G KTA+ Y + G+V+HH TD+W +S GKV + +WPMGGAW H EHY
Sbjct: 418 IESLVPSGKKTAKEVYDSEGFVVHHTTDVWHWTSP-IGKVQYGMWPMGGAWCTRHFMEHY 476
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG--KLAC 535
++T D FL ++AYP+++ A FLLDWL+ + G L + PSTSPE++F P K A
Sbjct: 477 SFTGDTTFLAEQAYPIMKESAKFLLDWLVTDPRSGKLVSGPSTSPENKFYTPKNGEKFAN 536
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
V + MD II + FS ++ AA++L K EDA V++V +L L KI DG +MEW Q
Sbjct: 537 VDMGNAMDQEIIWDNFSNVLEAAKIL-KIEDAFVDEVKAALSNLSLPKIGSDGRLMEWSQ 595
>gi|423214472|ref|ZP_17201000.1| hypothetical protein HMPREF1074_02532 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692887|gb|EIY86123.1| hypothetical protein HMPREF1074_02532 [Bacteroides xylanisolvens
CL03T12C04]
Length = 792
Score = 380 bits (977), Expect = e-103, Method: Compositional matrix adjust.
Identities = 226/598 (37%), Positives = 327/598 (54%), Gaps = 50/598 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ +N PA + +A+PIGNGR+GAMV+G E +LNE+++W+G P D+ NP A AL
Sbjct: 27 KLWYNAPATVWEEALPIGNGRIGAMVYGNPLQEVYQLNEESIWSGYPQDWNNPKAANALP 86
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VR VD G YA+A+ K + L L D A YR EL+++ A
Sbjct: 87 QVREAVDRGDYAKASEL-WKANAQGPYTARYLPMANLMLDQLTRGEARNLYR-ELNISNA 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+ V Y V++ R F S PDQV+V KI+ ++S ++ L+SLL G +
Sbjct: 145 LSTVTYEADGVKYRRTSFISYPDQVMVIKIAADRPQAVSLHIRLNSLLRYTVQTKGEKTL 204
Query: 194 IMEGRCPG----KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
I+ G+ P + P DD +G QF +++++ D G A D L V ++
Sbjct: 205 ILNGKAPAYVANRDYDPHQVVYDDKRGTQFK--VQVELLPDGGHCEA-NDSALTVRNANE 261
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
VLLL A + F + K+ Y +L RH DD+Q+LF+R+
Sbjct: 262 VVLLLSAVTDFGNKKMTLKKCKR----------------PYQELLQRHTDDHQQLFNRLQ 305
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
+ L T+ +E +P+ ER+KSF+ D D L EL +Q+GRYLLI+SSRPG
Sbjct: 306 LSLG-------TENLQKE---ALPTNERLKSFEQDPTDNGLTELYYQYGRYLLIASSRPG 355
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIWN + P W S NIN EMNYW + NL EC PL DF+ L++NG++
Sbjct: 356 GLPANLQGIWNRHVQPPWGSNYTTNINTEMNYWPAEITNLPECFLPLSDFIGRLAVNGAQ 415
Query: 429 TAQVNY-LASGWVIHHKTDIWAKS-------SADRGKVVWALWPMGGAWLCTHLWEHYNY 480
TA+VNY + GW+ HH +D+WA++ S +G W+ WPM G WLC HLWEHY +
Sbjct: 416 TAKVNYGINRGWLAHHNSDVWAQTAPTGGYDSDPKGAPRWSCWPMAGVWLCQHLWEHYAF 475
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEF--IAPDGKL--AC 535
D+ +L K AYPL++G A FLL WL + + GY TNPSTSPE+ F I +GK
Sbjct: 476 GGDKKYLSKTAYPLMKGAAEFLLQWLQKDPETGYWITNPSTSPENRFRYIDKEGKKQNGE 535
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+S SS MD+ + ++ + I A+ VL+ ++ A ++ + L+P +I G ++EW
Sbjct: 536 ISRSSGMDLGLAWDLLTNCIEASTVLDTDK-AFRQQCMDVRANLQPFRIGSKGQLLEW 592
>gi|390958737|ref|YP_006422494.1| hypothetical protein Terro_2924 [Terriglobus roseus DSM 18391]
gi|390413655|gb|AFL89159.1| hypothetical protein Terro_2924 [Terriglobus roseus DSM 18391]
Length = 824
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 225/593 (37%), Positives = 321/593 (54%), Gaps = 35/593 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
P ++ F PA + DA+PIGNGRLG MV+GG + + LNEDTLW+G P D NP A
Sbjct: 38 PYQLWFRTPAAEWIDALPIGNGRLGGMVFGGALEDHIALNEDTLWSGYPQDGNNPAAKSK 97
Query: 72 LSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L VR +++ + Y A ++ G + YQ LG + + H + YRR+L+L
Sbjct: 98 LPLVRQAVLKNKDYHLADTLCKEMQGPYSAAYQPLGGLHVTL---HQEGELADYRRDLNL 154
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+TA A+ Y +G+V +++ F S PD V+V I ++ ++ + LDS L + V G+
Sbjct: 155 DTAIAKTTYRLGDVSVSKKAFVSFPDDVLVMLIETTKP--VTMEIRLDSKLRHEVSVAGH 212
Query: 191 NQIIMEGRCPGKRIP-------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ ++G+ P P P ++ KG+ F+A I SD ++ +D L+
Sbjct: 213 -ALQLKGKAPVVSRPNYVKSQDPIQYSDTPGKGMFFAAGASIH-SDG---VTNAKDGALQ 267
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ + V+LL A + F G + P + L + + + L H+ ++
Sbjct: 268 IANAKSVVILLAAGTGFRGHGLLPDKPMAEIMGRVQQTLANASRKTAAQLERVHIAAHRA 327
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
+F R + L + +D+ T AER+ F DPSL+ L FQFGRYLLIS
Sbjct: 328 VFRRTLLDLGK--QDLTRST-----------AERLSDFAAHPDPSLLALYFQFGRYLLIS 374
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPGTQ ANLQGIWN+DL W NIN++MNYW + CNLS+ P FD L LS
Sbjct: 375 SSRPGTQPANLQGIWNDDLRAPWSCNWTSNINIQMNYWLAETCNLSDFHAPFFDLLQSLS 434
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNY 480
G++TA+ NY GWV HH DIW+ SS G WA + M WLC HLW+HY +
Sbjct: 435 ETGARTAKTNYGLPGWVSHHNIDIWSLSSPVGEGEGDPSWANFAMSAPWLCAHLWDHYCF 494
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
T D++FL RAYPL++G A F WLI G L T PS S E++F APDGK A VS
Sbjct: 495 TQDQNFLRTRAYPLMKGAAQFCSSWLIPDDQGNLTTCPSVSTENQFTAPDGKRASVSAGC 554
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
TMD+A+IRE+FS AA+VL + D ++ + +L P + + G + EW
Sbjct: 555 TMDIALIREIFSNCAEAAKVLNVDHD-WANQLQQQSAKLVPYAVGQYGQLQEW 606
>gi|399028921|ref|ZP_10730010.1| hypothetical protein PMI10_01838 [Flavobacterium sp. CF136]
gi|398073242|gb|EJL64421.1| hypothetical protein PMI10_01838 [Flavobacterium sp. CF136]
Length = 820
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 219/597 (36%), Positives = 340/597 (56%), Gaps = 32/597 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
+K+ ++ PA + +A+P+GNGR+GAMV+G V E ++LNE +LW+G P NP A +
Sbjct: 23 IKLWYDKPAAQWVEALPLGNGRIGAMVFGSVEDELIQLNEGSLWSGGPMKKNVNPKAYQY 82
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L +R + + + +A K+ G+ ++ + +GD+ + D K + Y R+L L+
Sbjct: 83 LQPLREALYAEDFQKADELCRKMQGYFSESFLPMGDLVIHHDFGSDK--SQNYYRDLKLD 140
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A + ++V V+++RE F S P +++ K+ S+ G+L+F+ L S+L N V ++
Sbjct: 141 QAVSTTNFTVKGVKYSREIFISAPANIMIVKMKASKKGALTFDAKLSSVLTNSVSVLADD 200
Query: 192 QIIMEGRCPGKRIPPKANA-NDDP---------KGIQFSAILEIKISDDRGTISALEDKK 241
+++++G+ P + P N N P G++F L+ + D G++ +
Sbjct: 201 RLVLDGKAPARVDPSYYNKKNRQPIILEDTTGCNGMRFRMDLKASLKD--GSVKT-DANG 257
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ V + +L A++SF+G P K+ + S +++ Y L H+ DY
Sbjct: 258 IHVTNATEVILYFAAATSFNGFDKCPDSEGKNEKVITDSIIKNSTAQKYESLKKDHIADY 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
QK F+RV++ L + + +N +P ER+K++ +DP L + +Q+GRYL
Sbjct: 318 QKYFNRVNLDLE--------EENTNKNTSVLPWDERLKAYTAGGKDPILEQTFYQYGRYL 369
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G Q ANLQGIWN++L W S +NIN +MNYW + NLSE +PL D++
Sbjct: 370 LISSSRLGGQPANLQGIWNKELRAPWSSNYTININTQMNYWPAEQTNLSEMHQPLLDWIG 429
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWE 476
LS G A Y A+GWV HH +DIWA S+A G WA W MGG WLC HLWE
Sbjct: 430 NLSQTGRTAASEYYHANGWVAHHNSDIWALSNAVGNKGDGSPTWANWYMGGNWLCQHLWE 489
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D++FL K AYP+++ A F DWL E DGYL T PS+SPE+E I +GK V
Sbjct: 490 HYIFTGDKEFLRKTAYPVMKEAALFSFDWLQE-KDGYLVTAPSSSPENE-IHINGKNYGV 547
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +STMDM+I R++F +I A+E+L +ED E +K +L P KI G ++EW
Sbjct: 548 TVASTMDMSICRDLFGNLIKASEILNIDEDFRKELEVKK-AKLFPLKIGSKGQLLEW 603
>gi|330467858|ref|YP_004405601.1| cellulose-binding family ii [Verrucosispora maris AB-18-032]
gi|328810829|gb|AEB45001.1| cellulose-binding family ii [Verrucosispora maris AB-18-032]
Length = 998
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 226/573 (39%), Positives = 310/573 (54%), Gaps = 43/573 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G +E L+LNEDT+W G P D +NP +L+++R LV + Q+ +
Sbjct: 61 ALPIGNGRLGAMVFGNSDTERLQLNEDTVWAGGPHDSSNPRGQGSLAEIRRLVFANQWTQ 120
Query: 87 A-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G+P YQ +G++ L F + Y R+LDL TAT V Y +
Sbjct: 121 AQNLINQTMLGNPVGQLAYQTVGNLRLAFASAS---GTSQYNRQLDLTTATTSVSYVMNG 177
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V F RE F+S PDQVI +++ S S++F + DS I ++G
Sbjct: 178 VRFQREVFASAPDQVIAMRLTADRSASITFTATFDSPQRTTVSSPDGATIALDG------ 231
Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
N ++F L + + G + L+V G+ LL+ SS+
Sbjct: 232 --VSGNQEGVTGAVRF---LALAHATVSGGTVSSSGGTLRVTGATSVTLLVSIGSSY--- 283
Query: 264 FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
+N + D + L + R SY L RH+ DYQ LF RVS+ L R+ +
Sbjct: 284 -VNFRNVGGDYQGIARRHLTAARASSYDQLRARHVADYQALFGRVSLDLGRT-------S 335
Query: 324 CSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
+++ P+ R+ + DP LLFQ+GRYLLISSSRPGTQ ANLQGIWN+ L+
Sbjct: 336 AADQ-----PTDVRIAQHNSVNDPQFSTLLFQYGRYLLISSSRPGTQPANLQGIWNDSLT 390
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
P WDS +N NL MNYW + NLSEC +P+F + L+++G++TAQV Y A GWV HH
Sbjct: 391 PAWDSKYTINANLPMNYWPADTTNLSECYQPVFSMIQDLTVSGARTAQVQYGAGGWVTHH 450
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
TD W SS G W +W GGAWL T +W+HY +T D DFL YP ++G A F L
Sbjct: 451 NTDAWRGSSVVDG-AFWGMWQTGGAWLATMIWDHYLFTGDLDFLRAN-YPAMKGAAQFFL 508
Query: 504 DWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
D L+ E GYL TNPS SPE A A V TMD I+R++F A+E+L
Sbjct: 509 DTLVTEPSLGYLVTNPSNSPEIGHHAD----ASVCAGPTMDNQILRDLFDGCARASEIL- 563
Query: 563 KNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWV 594
N DA +V + RL PT+I G+IMEW+
Sbjct: 564 -NTDATFRAQVRATRDRLAPTRIGSRGNIMEWL 595
>gi|404484444|ref|ZP_11019648.1| hypothetical protein HMPREF9448_00050 [Barnesiella intestinihominis
YIT 11860]
gi|404339449|gb|EJZ65880.1| hypothetical protein HMPREF9448_00050 [Barnesiella intestinihominis
YIT 11860]
Length = 802
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 223/594 (37%), Positives = 339/594 (57%), Gaps = 31/594 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
+K+ ++ PA++F +A+ IGNG +GA ++GGV + + N+ TLWTG P + ++PDA
Sbjct: 25 MKLHYDRPAEYFEEALVIGNGTMGATLYGGVKKDKISFNDITLWTGEPESENSSPDAFNV 84
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ ++R+L+D+ Y A A K+ GH ++ YQ LG + +E+ D ++ Y R LD+
Sbjct: 85 IPEIRALLDNEDYEGADKAQYKVQGHYSENYQPLGTLTIEYLDDTAGISD--YHRWLDIG 142
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
ATAR +Y FT ++F+S PD VIV ++ + +S DS L + S V +N
Sbjct: 143 NATARTQYLKDGKLFTSDYFASAPDSVIVIRLKSENKEGIHALLSFDSPLPHSSQV-ADN 201
Query: 192 QIIMEGRCPGKRIPPKANAND----DP-KGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+I +EG P A D DP +GI F ++ + +S D + D +++++G
Sbjct: 202 EISVEGYAAYHSFPVYYKAEDKHRYDPERGIHFKTLVRV-LSVDGSVKNRYSDSRIEIDG 260
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
S ++L+ +SF+G +P ++ S ++ +Y L H+ DY+ F
Sbjct: 261 STEVLILIANVTSFNGFDKDPVKEGRNYRSHVEKRMKCAIGKTYDALREAHIRDYKYYFD 320
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD---EDPSLVELLFQFGRYLLIS 363
RV + L + DI +P+ +++ F TD ++P L EL FQFGRYLLIS
Sbjct: 321 RVKLDLGNTDDDIAA----------LPTDKQL-LFYTDCKQQNPDLEELYFQFGRYLLIS 369
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR ANLQG+WNE + P W S VNINLE NYW S NL E Q PL +F+ LS
Sbjct: 370 SSRTPGVPANLQGLWNESVLPPWSSNYTVNINLEENYWASGTTNLIEMQYPLIEFIANLS 429
Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLCTHLWEHYN 479
G KTA+ Y + GW + H +D+WA + + G WA W MGG WL TH+WEHY
Sbjct: 430 KTGRKTAKDYYGVERGWCLGHNSDVWAMTCPVGLNEGDPSWACWTMGGTWLSTHIWEHYL 489
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+T+D+ FL K YP+L+G A F +DWL+E DG L T+P TSPE+++I PDG + SY
Sbjct: 490 FTLDKGFLCK-FYPVLKGAAEFCMDWLVE-KDGKLVTSPGTSPENKYITPDGYVGATSYG 547
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+T D+A+IRE A++VL ++ + +++ K+L RL P +I DG++ EW
Sbjct: 548 NTSDLAMIRECLIDAAEASKVLGVDK-SFRKRIKKTLSRLYPYQIGTDGNLQEW 600
>gi|302872475|ref|YP_003841111.1| alpha-L-fucosidase [Caldicellulosiruptor obsidiansis OB47]
gi|302575334|gb|ADL43125.1| Alpha-L-fucosidase [Caldicellulosiruptor obsidiansis OB47]
Length = 753
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 223/590 (37%), Positives = 326/590 (55%), Gaps = 44/590 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LKI FN PA + +A+PIGNG LGAM++GGV ET++LNE+++W+ P NPDA + L
Sbjct: 6 LKILFNHPANCWEEALPIGNGSLGAMIYGGVEYETIQLNEESIWSCGPRRRENPDALRYL 65
Query: 73 SDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R + G A SV H Y+ LG +++ F+ K E Y R LD
Sbjct: 66 QEIRKSILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEGIE-KDKIENYCRYLD 124
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS----FNVSLDSLLDNHS 185
++ A +V++SVG + + +FSS PD+VIV KIS SE ++ F +D
Sbjct: 125 ISNAICKVEFSVGKARYDKLYFSSFPDKVIVIKISCSEKCGVTLRAKFRREFQEDIDRCG 184
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ GN++I E R G+ FSA+L+ +S D G + + D L ++
Sbjct: 185 KI-GNDKIFFECTAGSGR------------GVSFSAMLK-AVSKD-GDVYTIGDN-LFIK 228
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ +LL+ +++S+ +KD + + L+ + + +LY RH +DY+ LF
Sbjct: 229 NATEVMLLITSTTSY---------KEKDYFNWCLKTLEQVSKHDFEELYKRHTEDYKSLF 279
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + + + + E I+ + R D L+ LLFQFGRYLLISSS
Sbjct: 280 DRVEFYIDTANTNDRIGLTTPERINLLKKGYR--------DEELIVLLFQFGRYLLISSS 331
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC PLF L + N
Sbjct: 332 RPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEICNLSECHLPLFTLLERMYEN 391
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TAQ Y G+ HH TDIW ++ + WPMG AWLC H+WEHY YT D D
Sbjct: 392 GKITAQKMYNCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHIWEHYEYTGDLD 451
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL K+ Y L+ A FLLD+LIE +GYL T PS SPE+ + +G + ++Y T+D+
Sbjct: 452 FL-KKYYYLMREAALFLLDYLIEDKNGYLVTCPSCSPENSY-KLNGNVYSLTYMPTIDIQ 509
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
II +F + A ++L+ N D ++EK+ +L +L P KI + G I EW++
Sbjct: 510 IISVLFEKVKKANDILKLN-DEIIEKIDYALEKLPPIKIGKYGQIQEWIE 558
>gi|430751368|ref|YP_007214276.1| hypothetical protein Theco_3223 [Thermobacillus composti KWC4]
gi|430735333|gb|AGA59278.1| hypothetical protein Theco_3223 [Thermobacillus composti KWC4]
Length = 768
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 220/597 (36%), Positives = 328/597 (54%), Gaps = 53/597 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + ++ PA + +A+PIGNGR+GAMV+G SE L+LNED+LW G P D NPDA K L
Sbjct: 1 MVMKYDRPAAEWNEALPIGNGRMGAMVFGHPVSERLQLNEDSLWYGGPRDRNNPDAAKVL 60
Query: 73 SDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
++R L+ G+ EA +V L G P Y+ LG + L F+ A E Y+R LD
Sbjct: 61 PEIRRLIFEGKPREAERLAVTGLSGIPETQRHYEPLGQLLLHFEGIDPD-AVEQYQRSLD 119
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN- 188
L A A V++ V RE+++S PDQ I+ + + G +S L+ YV+
Sbjct: 120 LERAVASVEFLHRGVRHRREYYASCPDQAIIVRATADRPGQISLTARLERA--RWRYVDA 177
Query: 189 ----GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
G + I M G A+ +G+ F+A + + G++ A+ + L V
Sbjct: 178 TGRSGTDAIYMTG------------ASGGAEGVSFAAAVTARTEG--GSLDAI-GEHLVV 222
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
E +D L++ A++SF +K+P + ++ +++ + Y RH+ DY++L
Sbjct: 223 EHADSVTLVISAATSF---------REKEPLAHCLAHARTVCAAPDDERYARHVRDYREL 273
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLIS 363
F RVS+ L +E +P ER++ + +EDP+L L FQ+GRYLLI+
Sbjct: 274 FGRVSLALG-----------GDEERSVLPVPERLERLRKGEEDPALAALYFQYGRYLLIA 322
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG+ ANLQGIWN+ P WDS +NIN +MNYW + C L EC EPLFD + L
Sbjct: 323 SSRPGSLPANLQGIWNDHFLPPWDSKYTININAQMNYWPAESCALPECHEPLFDLIERLR 382
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G +TA+V Y G+ HH TDIWA ++ + + WP+G AWLC HLWEHY +T D
Sbjct: 383 EPGRRTARVMYGCRGFAAHHNTDIWADTAPQDTYIPASYWPLGAAWLCLHLWEHYRFTQD 442
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
FLE R+ ++ A F++D+L+EG G L T PS SPE+ ++ P+G+ + TMD
Sbjct: 443 LPFLE-RSLETMKEAARFVMDYLVEGPSGELVTCPSVSPENSYVLPNGETGVLCAGPTMD 501
Query: 544 MAIIREVFSAIISAAEVL-----EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
IIR + SA + A VL + +++A + + L RL KI + G+I EW +
Sbjct: 502 TQIIRALLSACVEAERVLSDRTGKASDEAFIREAELVLKRLPKEKIGKLGTIQEWYE 558
>gi|312621675|ref|YP_004023288.1| alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
gi|312202142|gb|ADQ45469.1| Alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
Length = 752
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 233/601 (38%), Positives = 335/601 (55%), Gaps = 50/601 (8%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
MN++S LKI F+ PA + +A+PIGNG LGAM++GGV ET++LNE+++W+ P
Sbjct: 1 MNSQS------LKIIFDKPASCWEEALPIGNGSLGAMIYGGVEYETIQLNEESIWSCGPR 54
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATAASV-KLFG--HPADVYQLLGDIELEFDDSHLK 118
NPDA K L ++R + G A SV L G H Y+ LG +++ F+
Sbjct: 55 RRENPDAIKYLPEIRKSILEGNIKRAEELSVFALSGTPHSQGNYEPLGYLDIYFEGIEAD 114
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL----SFN 174
E Y R LD++ AT +V++ V ++ + + +FSS PD+VIV KI ++ G+L F
Sbjct: 115 KVER-YTRYLDISNATCKVEFDVDDIRYEKIYFSSYPDKVIVVKICCNKKGALFLRAKFR 173
Query: 175 VSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
+D V+ N++I +E R G+ FSA+L+ +S D G +
Sbjct: 174 REYQEDIDRCGRVD-NDKIFIECSAGSGR------------GVSFSAVLK-AVSKD-GDV 218
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ D L V+ + VLL+ +++S+ KD + + L+ + +LY
Sbjct: 219 YTIGDN-LFVKDATEVVLLITSTTSYKA---------KDYFNWCVKTLEQASKHDFEELY 268
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
RH +DY+ LF RV + + T+ + E I+ + ER K D L+ LLF
Sbjct: 269 KRHTEDYKSLFDRVEFYIDTENTNKRTELTTPERINLL--KERYK------DEELIVLLF 320
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISSSRPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC P
Sbjct: 321 QFGRYLLISSSRPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMP 380
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LFD L + NG TAQ Y G+ HH TDIW ++ + WPMG AWLC H+
Sbjct: 381 LFDLLEKMYENGKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHI 440
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
+HY YT D DFL K+ Y L+ A FLLD+LIE +GYL T PS SPE+ + +G +
Sbjct: 441 LDHYEYTGDLDFL-KKYYYLMREAALFLLDYLIEDKNGYLVTCPSCSPENSY-KLNGDVY 498
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
++Y TMD+ II +F I A +VL+ N D +VEK+ +L +L P KI + G I EW+
Sbjct: 499 SMTYMPTMDIQIITALFDKIKKANDVLKLN-DEIVEKIEYALNKLPPLKIGKYGQIQEWI 557
Query: 595 Q 595
+
Sbjct: 558 E 558
>gi|312135763|ref|YP_004003101.1| alpha-L-fucosidase [Caldicellulosiruptor owensensis OL]
gi|311775814|gb|ADQ05301.1| Alpha-L-fucosidase [Caldicellulosiruptor owensensis OL]
Length = 752
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 225/595 (37%), Positives = 333/595 (55%), Gaps = 46/595 (7%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
++ LKI F+ PA + +A+PIGNG LGAM++GGV ETL+LNE+++W+ P NPDA
Sbjct: 2 SSQNLKIIFDKPASCWEEALPIGNGSLGAMIYGGVEYETLQLNEESIWSCGPRRRENPDA 61
Query: 69 PKALSDVRSLVDSGQYAEATAASV-KLFG--HPADVYQLLGDIELEFDDSHLKYAEETYR 125
K L +R + G A SV L G H Y+ LG +++ F+ E+ Y
Sbjct: 62 LKYLQVIRKSILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEGVKTDKVEK-YT 120
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL----SFNVSLDSLL 181
R LD++ AT +V+++V ++ + + +FSS PD+VIV KI S+ G++ F +
Sbjct: 121 RYLDISNATCKVEFNVDDIRYEKTYFSSYPDKVIVVKICCSKKGAIFLRAKFRREYQEDI 180
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
D V+ N++I E R G+ FSA+L+ +S D G + + D
Sbjct: 181 DRCGRVD-NDKIFFECSAGSGR------------GVSFSAVLK-AVSKD-GDVYTIGDN- 224
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L V+ + +LL+ +++S+ +KD + + L+ + + +LY RH +DY
Sbjct: 225 LFVKNATEVMLLITSTTSY---------KEKDYFNWCLKTLEQVSKHDFEELYKRHTEDY 275
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
+ LF RV + DT + N + + ER+ + +D L+ LLFQFGRYL
Sbjct: 276 KSLFDRVEFYI---------DTANTNNRIELTTPERINLLKEGYKDEELIVLLFQFGRYL 326
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSRPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC LFD L
Sbjct: 327 LISSSRPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMSLFDLLE 386
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ NG TAQ Y G+ HH TDIW ++ + WPMG AWLC H+W+HY Y
Sbjct: 387 KMYENGKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHIWDHYEY 446
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
T D DFL K+ Y L+ A FLLD+LIE +GYL T PS SPE+ + +G + ++Y
Sbjct: 447 TGDLDFL-KKYYYLMREAALFLLDYLIEDENGYLVTCPSCSPENSY-KLNGDVYSLTYMP 504
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD+ +I +F + A ++L+ N D +VEK+ +L + P KI + G I EW++
Sbjct: 505 TMDIQVISALFEKVKKANDILKLN-DEIVEKIEYALNKFPPIKIGKYGQIQEWIE 558
>gi|392965675|ref|ZP_10331094.1| alpha-L-fucosidase [Fibrisoma limi BUZ 3]
gi|387844739|emb|CCH53140.1| alpha-L-fucosidase [Fibrisoma limi BUZ 3]
Length = 846
Score = 377 bits (969), Expect = e-102, Method: Compositional matrix adjust.
Identities = 222/600 (37%), Positives = 325/600 (54%), Gaps = 31/600 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PL I + PA+++ +A+P+GNGRLGAMV+G V E ++LNE +LW+G P + NP A
Sbjct: 22 PLTIWYRQPARNWNEALPVGNGRLGAMVFGRVNDELIQLNEASLWSGGPVNLNPNPGAAT 81
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L VR + Y EA + G + YQ LGD+ + L Y R L++
Sbjct: 82 YLPQVREALFREDYKEADKLVRNMQGLYTEAYQPLGDLTIR---QILTGEPADYYRNLNI 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A+A ++ G V +TRE F S PDQVIV ++ + G L+ + S V
Sbjct: 139 TEASATTRFKSGGVGYTREIFVSAPDQVIVIRLRADQKGKLNVTLGTRSPHPISKVVVSR 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKK 241
+++ M G+ P P N N P +G +F L++K +D + A +
Sbjct: 199 DELAMRGKSPAHADPNYVNYNKVPVRYTDSSGCRGTRFDLRLKVKSTDGQ---VATDTAG 255
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+++ + AV+ L A++SF+G P K+ + S L S + H+ DY
Sbjct: 256 IRITNATEAVVYLSAATSFNGFDKCPDKDGKNEIQLAQSYLNKALAKSPDAIRKAHVADY 315
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
Q+ +RVS L+ D + N ++P ER+ + E DP+L L FQFGRYL
Sbjct: 316 QRYLNRVSFTLN--------DAQTPGNPASLPMDERLMRYAGGEPDPALETLYFQFGRYL 367
Query: 361 LISSSRPGTQVA-NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LISSSRPGT +A NLQGIWN + P W S NIN +MNYW + NLSE PL D +
Sbjct: 368 LISSSRPGTGIAANLQGIWNPMVRPPWSSNYTTNINAQMNYWPAEMTNLSEFHRPLIDQI 427
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLW 475
+ ++ G TA+ Y A GW +HH +DIWA S+ +G +WA W MGGAWL HLW
Sbjct: 428 KHAAVTGKATAKNFYGAGGWTVHHNSDIWAASNPVGDLGKGGPMWANWSMGGAWLAQHLW 487
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY +T DR +L++ AYPL++ A F +DWL+E G+L T P+TSPE+ F+ G
Sbjct: 488 EHYAFTGDRTYLKQTAYPLMKDAAQFCVDWLVEDKQGHLVTAPATSPENVFVTEKGDKES 547
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
VS ++TMDM +I ++FS +I A+E L + D + + + +L P +I G++ EW +
Sbjct: 548 VSVATTMDMGLIWDLFSNVIEASEHLGIDVD-FRKMLTEKKSKLFPLQIGRKGNLQEWYK 606
>gi|295132871|ref|YP_003583547.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
gi|294980886|gb|ADF51351.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
Length = 819
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 215/597 (36%), Positives = 333/597 (55%), Gaps = 43/597 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKALSDV 75
++ PA+ + +A+P+GNG++GAMV+G V E ++LNE +L++G P NPDA L +
Sbjct: 28 YDAPAREWVEALPLGNGKIGAMVFGRVTDELIQLNESSLYSGGPVPQRINPDAASYLQPL 87
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDI----ELEFDDSHLKYAEETYRRELDLN 131
R + YA+AT + K+ G+ Y +GD+ +L+ D H Y+R L++
Sbjct: 88 REAIFDKDYAQATLLAKKMQGYYTQSYMPMGDLLLHQDLQNDSVH------AYKRSLNIE 141
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A + V +TRE F+S PD V+V K++ + +L+ N+S +S L V N
Sbjct: 142 NAITTTSFESDGVNYTREFFTSAPDNVLVMKLTADSAKALNLNLSAESQLRAEVSVTKNQ 201
Query: 192 QIIMEGRCPGKRIPPKANAN-------DDPKG---IQFSAILEIKISDDRGTISALEDKK 241
++++ G+ P P N DDP+G ++F +++ +D + T +D
Sbjct: 202 ELVVSGKAPANVNPNYYNPEGVEPITYDDPEGCDGMRFQYRIKVLKTDGKLTT---QDTS 258
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L + + V+LL A++SF+G P D + +Q+ SY+ L + H+ D+
Sbjct: 259 LAIADASEVVILLTAATSFNGFDKCPDKDGLDEAKLASEFMQAASAKSYAQLKSDHIADF 318
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYL 360
RV++ L ++PKD + P+ R+K++ + DP L L FQ+GRYL
Sbjct: 319 STYMQRVALDLGKTPKDQLDQ----------PTDSRLKAYSEGANDPELEALYFQYGRYL 368
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
L+S+SRPG ANLQGIWN+++ P W S NIN EMNYW + NLSE +P ++
Sbjct: 369 LVSASRPGGIAANLQGIWNKEMRPPWSSNYTTNINAEMNYWPAETTNLSEMHQPFLAYIQ 428
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSS--ADR--GKVVWALWPMGGAWLCTHLWE 476
++ G + A+ Y A GWV+HH +DIWA ++ DR G +WA W MGG WL HLWE
Sbjct: 429 NAAVTGGRVAKEFYDAPGWVVHHNSDIWATANPVGDRGDGDPLWANWYMGGNWLTLHLWE 488
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D +L + YP+++ A F LDWL+E HDG L T PSTSPE+ F+ +GK V
Sbjct: 489 HYAFTQDTSYL-AQVYPVMKEAAVFTLDWLVE-HDGKLITAPSTSPENLFLV-NGKGYAV 545
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +TMD+AIIRE+F+ I A+++L K D ++ + RL P +I G + EW
Sbjct: 546 TEGATMDIAIIRELFNNTIKASKILGKEAD-FRHELSAAQDRLIPYQIGAKGQLQEW 601
>gi|325915867|ref|ZP_08178165.1| hypothetical protein XVE_2098 [Xanthomonas vesicatoria ATCC 35937]
gi|325537923|gb|EGD09621.1| hypothetical protein XVE_2098 [Xanthomonas vesicatoria ATCC 35937]
Length = 776
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 223/596 (37%), Positives = 330/596 (55%), Gaps = 46/596 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + T+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+
Sbjct: 24 AVAPTDALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTS 83
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
P+ AL VR+L+ G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 84 PEGLAALPQVRALIFGGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 140
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+TA A + G R+ F Q IV ++S ++S V +DS
Sbjct: 141 EYRRQLDLDTAVATTSFRSGGALHQRDVFVCAQSQCIVVRLSCDRPRAISLRVGIDSPQS 200
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD--DRGTISALEDK 240
V ++ GR N GI+ +++ G ++AL D+
Sbjct: 201 GEVTVE-QGGLLFTGR------------NGSFAGIEGKLRFALRVVPRVKGGAVTALRDR 247
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L++EG+D VLLL A++S+ + D DP + + ++L+ + L Y+ L HL D
Sbjct: 248 -LRIEGADEVVLLLTAATSYR--RFDAVDG--DPLALAAASLRKAQALDYAALLRAHLAD 302
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
+Q+LF RV+I L S + +P+ +RV+ F DP+L L Q+GRYL
Sbjct: 303 HQRLFRRVAIDLGTS------------DAAALPTDQRVRQFAGGNDPALAALYHQYGRYL 350
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI SSRPGTQ ANLQGIWN+ + P W+S +N+N EMNYW S L EC EPL +
Sbjct: 351 LICSSRPGTQPANLQGIWNDLMQPPWESKYTINVNTEMNYWPSEANALHECVEPLESMVF 410
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+I G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 411 DLAITGAHTARALYGAPGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDY 469
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C
Sbjct: 470 GRDRAYLSK-IYPLFKGAAEFFVATLVRDPQTGAMVTNPSISPENQH--PFGAAICA--G 524
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD ++R++F+ I+ +++L+ + AL +++ +L P +I + G + EW Q
Sbjct: 525 PTMDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQ 579
>gi|334137826|ref|ZP_08511252.1| hypothetical protein HMPREF9413_0062 [Paenibacillus sp. HGF7]
gi|333604667|gb|EGL16055.1| hypothetical protein HMPREF9413_0062 [Paenibacillus sp. HGF7]
Length = 852
Score = 376 bits (966), Expect = e-101, Method: Compositional matrix adjust.
Identities = 214/559 (38%), Positives = 313/559 (55%), Gaps = 42/559 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA+ +T+A+P+GNGRLGAM++G V E + LNE++LW G P D TNP+A AL
Sbjct: 5 KLWYIKPAQAWTEALPVGNGRLGAMIFGRVEEELISLNEESLWYGGPKDRTNPEAAAALL 64
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ G+ EA A + L P A YQ LGD+ + F + TYRRELDL
Sbjct: 65 EIRRLLLEGRVTEAQELAHMGLTPIPKYAGPYQPLGDLRIWFAEHEPDAG--TYRRELDL 122
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVNG 189
T RV+Y+ TRE F+S P V+ +++ + L+F L D + +G
Sbjct: 123 ATGLCRVEYAWQGASCTRELFASAPAGVLACRLTTAHPEGLTFRFHLGRRPFDEGAAPDG 182
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ ++M+GRC P G++++A+ +S + GT+ + D + V G+
Sbjct: 183 PHAVLMQGRC-------------GPDGVRYAAL--ASVSPEGGTVRTIGDF-VHVAGAAE 226
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A + + A +SF +DP + ++ R Y + H DY LF R+S
Sbjct: 227 ATIYVAAQTSF---------RHEDPAAACRRQVEEARRKGYEAVKAEHGADYMPLFARMS 277
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
++L DI +P+ ER+ + + EDP L+ L FQ+GRYLL++SSRPG
Sbjct: 278 LELGTPGADI----------RLLPTDERLDRVREGGEDPELLALFFQYGRYLLLASSRPG 327
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T ANLQGIWN D P W+ +NINL+MNYW + CNL EC EPLFDF+ L NG +
Sbjct: 328 TLPANLQGIWNADYQPPWECNYTLNINLQMNYWPAEVCNLRECHEPLFDFIDRLVANGRE 387
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G+V HH +++WA+S + A+WPMGG WL HLWEHY + DR FL+
Sbjct: 388 TARKLYGCRGFVAHHNSNLWAESGINGMLPRAAVWPMGGVWLALHLWEHYRFGGDRHFLD 447
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
+RAYP+++ A FLLD++ E G L T PS SPE++++ P GK + + MD+ + R
Sbjct: 448 RRAYPVMKEAALFLLDYMTEDGKGGLLTGPSVSPENKYVLPGGKSGYLCMAPAMDIQLAR 507
Query: 549 EVFSAIISAAEVLEKNEDA 567
+F A+ AA VL A
Sbjct: 508 TLFGAVREAAAVLACERGA 526
>gi|261406666|ref|YP_003242907.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261283129|gb|ACX65100.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 775
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 222/590 (37%), Positives = 317/590 (53%), Gaps = 48/590 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
I F+ PA+++ +A+PIGNGRLG MV+G E ++ NED++W G P D NPDA + L
Sbjct: 9 IWFDQPAQNWNEALPIGNGRLGGMVFGCAQQEKIQFNEDSVWYGGPRDRNNPDALRHLPL 68
Query: 75 VRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R L+ G+ EA S F G P Y GD ++ D H + YRRELDL
Sbjct: 69 IRKLLFEGRLKEAHRLSETAFSGTPRSQRPYLTAGDFCIQVD--HPQGELSHYRRELDLE 126
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---YVN 188
A A Y G V FTRE F S PDQV+V ++ G L+ + H + +
Sbjct: 127 KAIAVTSYQYGGVTFTREVFCSYPDQVMVIRLEADRPGVLTLTARFERQKGKHMDAVHRH 186
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G + ++M C GK G+ +SA + + GT+ + + L V+ +D
Sbjct: 187 GTDTVVMTNDCGGK------------DGLTYSAAAKAITAG--GTVRVV-GEHLLVDQAD 231
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
V++L A+S+F DP L+ N Y+ L RH+ DYQ LF RV
Sbjct: 232 EVVIILAAASTF---------RVDDPKLRCAELLEHAANQGYAALKKRHIADYQPLFERV 282
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED-PSLVELLFQFGRYLLISSSRP 367
+ L R+P D + +P+ +R++ + ED L L F FGRYLLI+ SRP
Sbjct: 283 KLDL-RAPAD--------QERHLLPTPKRLERVRAGEDDAGLYTLYFHFGRYLLIACSRP 333
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G+ ANLQGIWN+ ++P WDS +NIN +MNYW + CNLSEC EPLF+ + + NG
Sbjct: 334 GSLPANLQGIWNDSMAPPWDSKFTININTQMNYWPAESCNLSECHEPLFELIERMRDNGR 393
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y G+V HH TDIWA ++ W MG AWL HLWEHY + + DFL
Sbjct: 394 VTARTMYGCRGFVAHHNTDIWADTAPQDIYPPATQWVMGAAWLTLHLWEHYKFNPNPDFL 453
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
KRAY ++ A F D+L+E +GYL TNPS SPE+ ++ +G+ + Y +MD II
Sbjct: 454 -KRAYETMKEAALFFTDFLVESPEGYLVTNPSVSPENRYLLRNGESGTLCYGPSMDTQII 512
Query: 548 REVFSAIISAAEVLEKNEDALVE--KVLKSLPRLRPTKIAEDGSIMEWVQ 595
E++SA I A+ L+ +E+A E ++ LP + K+ G + EW++
Sbjct: 513 SELYSACIQASLELDIDENARQEWAAIMDRLPEM---KVGRHGQLQEWLE 559
>gi|325923835|ref|ZP_08185445.1| hypothetical protein XGA_4498 [Xanthomonas gardneri ATCC 19865]
gi|325545691|gb|EGD16935.1| hypothetical protein XGA_4498 [Xanthomonas gardneri ATCC 19865]
Length = 795
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 229/594 (38%), Positives = 330/594 (55%), Gaps = 42/594 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + + L++ + PA + A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+
Sbjct: 43 AAAAGDALQLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATS 102
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
PDA AL VR+L+ +G+YAEA A A K+ P YQ LGD+ L+FD +
Sbjct: 103 PDALAALPQVRALIFAGRYAEAEALADAKMLSRPLKQMPYQPLGDLLLDFDRAD---GIS 159
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+T + G RE F S Q IV ++S ++S V +DS
Sbjct: 160 EYRRQLDLDTGVVTTTFRSGGAVHKREVFVSAQSQCIVVRLSCDRPRAISLRVGIDSPQT 219
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
V ++ GR + A D K ++F+ + +I GT+S L D+ L
Sbjct: 220 GEVTVE-QGGLLFSGRN-------GSFAGIDGK-LRFALRVLPQIKG--GTVSDLRDR-L 267
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
++EG+D VLLL A++S+ + D DP + + ++L+ L Y+ L HL D+Q
Sbjct: 268 RIEGADEVVLLLTAATSYQ--RFDAVDG--DPLALTAASLKKAGKLDYTALLRAHLADHQ 323
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+LF RV+I L S +P+ ERV++F DP+L L QFGRYLLI
Sbjct: 324 RLFRRVAIDLGTS------------EAAKLPTDERVQAFAKGNDPALAALYHQFGRYLLI 371
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSRPG+Q ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L L
Sbjct: 372 CSSRPGSQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLESMLFDL 431
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 432 AKTGAHTARAMYDAPGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDYGR 490
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
DR +L K YPL +G A F + L++ G + TNPS SPE++ P C T
Sbjct: 491 DRAYLGK-IYPLFKGAAEFFVATLVKDPQTGAMVTNPSISPENQH--PFNAALCA--GPT 545
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L K +DA + + +L P +I + G + EW Q
Sbjct: 546 MDAQLLRDLFAQCIAMSKLL-KVDDAFAQHLSTLREQLPPNRIGKAGQLQEWQQ 598
>gi|338214785|ref|YP_004658848.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
gi|336308614|gb|AEI51716.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
Length = 835
Score = 374 bits (960), Expect = e-101, Method: Compositional matrix adjust.
Identities = 223/597 (37%), Positives = 327/597 (54%), Gaps = 35/597 (5%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALS 73
I + PA+++ +A+P+GNGRLG M +G V E L+LNE+TLW+G P + NPDA K L
Sbjct: 24 IHYKQPARNWNEALPVGNGRLGVMTFGRVNEELLQLNEETLWSGGPVEKNPNPDALKHLP 83
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VR ++ Y A+ K+ G + YQ LGD+ ++ + Y R+LDL A
Sbjct: 84 AVREALNREDYEMASKELQKIQGLYTEAYQPLGDVLIK---QPFEAQPTAYFRDLDLQNA 140
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
TA ++++ V ++RE F S PDQVIV +++ S+ G L+F+ S S + G N++
Sbjct: 141 TAHTQFTIEGVTYSRELFVSAPDQVIVLRLTASQKGKLNFSASTRSPHPFLKQITGKNEL 200
Query: 194 IMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLKV 244
M G+ P P N N P KG++F ++++ +D G ++A + + +
Sbjct: 201 SMRGKAPAHADPNYVNYNAKPVYYEDPSGCKGMRFDWRVKVQTTD--GKVTA-DTSGISI 257
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+ A+LL+ A++SF+G P +D + + L+ S + H+ DY+K
Sbjct: 258 SNATEAILLVTAATSFNGFDKCPDSQGRDEKALVEAYLKRASAKSMDLIRKAHIADYRKY 317
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLIS 363
F RV + L +S + +P R+ + Q DP L L F FGRYLLIS
Sbjct: 318 FDRVKLTLGQSGEAA-----------HLPMDARLARYAQLGNDPELEALYFDFGRYLLIS 366
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG ANLQGIWN P W S NIN EMNYW + NLSE D++ +
Sbjct: 367 SSRPGGIPANLQGIWNPMTRPPWSSNYTTNINAEMNYWPAEVANLSELHTTFTDWIAGAA 426
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTHLWEHYN 479
G +TA+ Y GW +HH +DIW S+ D+GK WA W MGGAWL HLWEHY
Sbjct: 427 ATGRETAKNFYGMKGWTVHHNSDIWGASNPVGDKGKGSPSWANWAMGGAWLSQHLWEHYV 486
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
Y+ D +L+ AYPL+ A F LDWL++ G T+PSTSPE+ FI G VS +
Sbjct: 487 YSGDEKYLKNYAYPLMRDAAQFCLDWLVKDAGGNWITSPSTSPENVFITEKGITQAVSVA 546
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWVQ 595
+TMDMA++ +VF+ +I A+E L+ DA + K L+ + L P +I + G++ EW +
Sbjct: 547 TTMDMALVYDVFTNVIHASEHLKV--DAELRKTLEDRVQHLFPLQIGKKGNLQEWYK 601
>gi|268608709|ref|ZP_06142436.1| hypothetical protein RflaF_04322 [Ruminococcus flavefaciens FD-1]
Length = 772
Score = 374 bits (959), Expect = e-100, Method: Compositional matrix adjust.
Identities = 219/597 (36%), Positives = 333/597 (55%), Gaps = 55/597 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ +N PA +F +A+P+GNGR+GAM++G E + LNED++W+G NPDA + L +
Sbjct: 7 LRYNDPAANFNEALPLGNGRIGAMIYGDAAFEKIPLNEDSVWSGGLRHRVNPDAAEGLEE 66
Query: 75 VRSLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
VR L+ G EA + KL G ++ Y LGD+ ++ + L Y R LD+
Sbjct: 67 VRRLIKEGNIPEAERIAFDKLQGVTPNMRRYMPLGDLHIDLE---LSGRARNYNRRLDIG 123
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A A V ++V +V + +E+F S PD+V+ +IS +E G ++ + +Y++G
Sbjct: 124 NAVADVTFTVNDVLYRKEYFISAPDEVMAVRISCAERGMINLS----------AYIDGRE 173
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ R GK + + GI F+A+L K G+I L ++ VE +D +
Sbjct: 174 DYYDDNRPCGKNMILFTGGSGSRDGIFFAAVLGAKARG--GSIRTL-GGRIAVEKADEVI 230
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
L+ +SF G + +K ++ AL++ Y +L H++DY+ +F RV
Sbjct: 231 LIFSVRTSFYG-----DNYEKSALIDAEMALKT----EYDELRLHHVNDYKDMFDRVDFS 281
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-----------DPSLVELLFQFGRYL 360
L + +EEN+D + +AER+K + DE D L+EL F FGRYL
Sbjct: 282 LCDN---------TEENLDRLDTAERIKRLKGDELDNKDCERLIHDNKLIELYFNFGRYL 332
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
+IS+SRPGTQ NLQGIWNE++ W S VNIN EMNYW + CNLSEC PLFD L
Sbjct: 333 MISASRPGTQPMNLQGIWNEEMIAPWGSRYAVNINTEMNYWPAESCNLSECHLPLFDLLE 392
Query: 421 YLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ NG TA+ Y + G+V HH TDIW ++ V LWP GGAWL H++EHY
Sbjct: 393 RVCENGHITAREMYGVNKGFVCHHNTDIWGDTAPQDMWVPGTLWPTGGAWLALHIFEHYE 452
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
YT+D++FL ++ Y +L+ A F ++LIE G L T PS SPE+ + PDG C+
Sbjct: 453 YTLDKEFLAEK-YHILKQAAEFFTEFLIEDESGMLVTCPSVSPENTYKLPDGTKGCLCMG 511
Query: 540 STMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+MD II +F+ +I AAE+L+K++ A ++++LK +P+ ++ + G I EW+
Sbjct: 512 PSMDSQIITVLFTDVIRAAEILDKDKTFAAKLKRMLKKIPQ---PEVGKYGQIKEWL 565
>gi|374296937|ref|YP_005047128.1| hypothetical protein [Clostridium clariflavum DSM 19732]
gi|359826431|gb|AEV69204.1| hypothetical protein Clocl_2638 [Clostridium clariflavum DSM 19732]
Length = 742
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 223/588 (37%), Positives = 324/588 (55%), Gaps = 48/588 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA + +A+PIGNGR+GAM++G + +E ++LNED++W G D NPDA K L
Sbjct: 3 KLWYTKPAGCWEEALPIGNGRMGAMIFGSIETEHIQLNEDSVWYGAFVDRNNPDALKNLP 62
Query: 74 DVRSLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R L+ GQ EA V L G P YQ LGD+ + F ++ + Y R L L
Sbjct: 63 KIRELIIKGQIPEAEELMVYALSGIPQSQRPYQSLGDLTIRFKG--MEGDKSGYIRCLSL 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVN 188
+ A VK V + RE F S D V+V +I+ +SF+ L + D V
Sbjct: 121 DDAIHTVKVKVAENTYKRETFLSAADDVLVMRITSDGDKKISFSALLTRERFYDRVIKV- 179
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G + ++++G N G+ F ++ +K + G+ + + L V +D
Sbjct: 180 GQDAVMLDG-------------NLGKGGLDF--VMMLKAVAEGGSCDVV-GEHLIVNDAD 223
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
LL A ++F F N + K L N SY DL RH++DY L++RV
Sbjct: 224 AVTLLFTAGTTFR--FQNLKEQLK-------KILNDAANRSYDDLRKRHVEDYMSLYNRV 274
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
S +L+ + E + + + ER+K + E D L +L F FGRYLLIS SR
Sbjct: 275 SFELNGT-----------EKYEELTTEERLKKAKEGEVDKGLAKLYFDFGRYLLISCSRE 323
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G+ ANLQG+WN+D++P WDS +NIN +MNYW + CNLSEC +PLFD + + NG
Sbjct: 324 GSLPANLQGVWNKDMNPAWDSKYTININTQMNYWPAEVCNLSECHKPLFDLIKRMVPNGQ 383
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y G+V HH TDIW ++ + + W MG AWLCTHLW HY YT D+DFL
Sbjct: 384 KTARTMYNCRGFVAHHNTDIWGDTAVQDHWIPASYWVMGAAWLCTHLWMHYEYTQDKDFL 443
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
K A+P++ F LD+LIE GYL+T PS SPE+ +I P+G V+ +TMD I+
Sbjct: 444 -KEAFPIMREAVLFFLDFLIE-DKGYLKTCPSVSPENTYILPNGVQGSVTIGATMDNQIL 501
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
R++FS I AAE+L + D + + +++ +L PT+I G+IMEW +
Sbjct: 502 RDLFSQCIKAAEIL-RVCDQMNRDIEETVKKLEPTRIGSRGNIMEWTE 548
>gi|198277528|ref|ZP_03210059.1| hypothetical protein BACPLE_03750 [Bacteroides plebeius DSM 17135]
gi|198270026|gb|EDY94296.1| hypothetical protein BACPLE_03750 [Bacteroides plebeius DSM 17135]
Length = 809
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 214/596 (35%), Positives = 327/596 (54%), Gaps = 31/596 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-DYTNPDAPKA 71
L + +N PA+ F +A+ IGNG +GA+++GG + L LN+ TLWTG P T P+A KA
Sbjct: 32 LVLHYNRPAEFFEEALVIGNGTMGAILYGGTDKDVLSLNDITLWTGEPDRKVTTPNAYKA 91
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ ++R+L+D Y A A K+ GH ++ YQ LG + + + K + Y+R LD++
Sbjct: 92 IPEIRALLDKEDYRGADRAQRKVQGHYSENYQPLGQLSITYSAEPAKVSH--YQRTLDIS 149
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A AR Y +F ++F+S PD VIV ++ + L +S +SLL + + NGN
Sbjct: 150 RAMARTAYQRNGADFACDYFASAPDSVIVLRLQTESTEGLQATLSFNSLLPHATTANGN- 208
Query: 192 QIIMEGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+I EG P + D +G F + I++ + + + +LKV+
Sbjct: 209 EISAEGYAAYHSYPVYFDGVNNKHLYDPERGTHFRTL--IRVIAPQSEVKSFPSGELKVK 266
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G A++L+ +SF+G +P +D + ++ ++ +L H+ DY+ F
Sbjct: 267 GGKEALILIANVTSFNGFDKDPMKEGRDYRNLVTRRMERAAQKTFEELENAHVADYKSFF 326
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELLFQFGRYLLIS 363
RV + L ++ ++ I +P+ E++ + ++ +P L L FQ+GRYLLIS
Sbjct: 327 DRVELHLGKT----------DQAIAALPTDEQLLQYTDKSQRNPELEALYFQYGRYLLIS 376
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR ANLQG+WNE L P W NINLE NYW + NLSE PL DF+ L
Sbjct: 377 SSRTPGVPANLQGLWNERLLPPWSCNYTSNINLEENYWAAETANLSEMHRPLMDFIANLQ 436
Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLCTHLWEHYN 479
G ++A+ Y + GW + TDIWA + + G WA W MGGAWL TH+WE Y
Sbjct: 437 HTGEESAKAYYGVQKGWCLGQNTDIWAMTCPVGLNVGDPSWACWTMGGAWLSTHIWERYT 496
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+T D++FL+K YP+L+G A F L+WLIE DG L T+P TSPE++F+ PDG SY
Sbjct: 497 FTQDKEFLQKY-YPVLKGAAEFCLNWLIE-KDGKLITSPGTSPENKFLTPDGYAGATSYG 554
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
T D+A+ RE AAE L ++D +++ K+LPRL P ++ + G++ EW
Sbjct: 555 CTSDLAMTRECLIDAAKAAEALGTDKD-FRKQIEKTLPRLLPYQVGKKGNLQEWFH 609
>gi|372210566|ref|ZP_09498368.1| alpha-L-fucosidase [Flavobacteriaceae bacterium S85]
Length = 793
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 218/589 (37%), Positives = 325/589 (55%), Gaps = 39/589 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + +N P+ + DA+P+GNGRLGAMV+GG E ++ NE+TLW+G P DY N A K+L
Sbjct: 30 LTLWYNQPSNTWNDALPVGNGRLGAMVYGGKTKEVIQFNEETLWSGQPHDYVNRRAFKSL 89
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELD 129
+ +++ + G+ EA A+ K +P + YQ ++ ++F + H + Y+R LD
Sbjct: 90 AKIKNSLWDGKRKEAEEIANKKFMSNPINQSSYQSFANVLIDFKN-HSNVTD--YKRSLD 146
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L A A Y + RE F+S+PDQVIV ++ S G L+F+++LDS ++
Sbjct: 147 LERAIASTVYKLDKAVIKREVFASHPDQVIVVHLTSSVKGILNFDITLDSNHSDYKVSIE 206
Query: 190 NNQIIMEGRCPGKRIPPKANANDDP-KGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N+I+++G+ + N N P I+F A L++ +G ++ K+ ++ +
Sbjct: 207 ENEIVIKGKADNFKRDLDINKNKFPLSKIKFEARLKLV---QKGGELISKNNKVTIKNAT 263
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
LV +++F +N D +P + + N Y+ + H+ D+QK F+R+
Sbjct: 264 EVTCYLVGATNF----VNFKDISGNPHKRCKEYFKKLNNKPYNLVKENHIKDFQKYFNRL 319
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
I L E I P+ ER+ SF D DP+LV LL+Q+GRYLLISSSR G
Sbjct: 320 HIDLG------------ETKISRRPTNERLMSFSQDMDPNLVALLYQYGRYLLISSSRKG 367
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
TQ ANLQGIWN+ +SP W S +NINLEMNYW + NLSE EPL + LS G K
Sbjct: 368 TQPANLQGIWNDRISPPWGSKYTLNINLEMNYWITEVTNLSELSEPLIKLIDDLSNTGEK 427
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ +Y GWV HH TDIW + +A + +WP GGAWL HLW HY +T ++DFL+
Sbjct: 428 IAKEHYNMPGWVAHHNTDIW-RGAAPINRSNHGIWPTGGAWLSQHLWWHYEFTQNKDFLK 486
Query: 489 KRAYPLLEGCASFLLDWLIEGHDG--YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K AYP+L+ + F ++L+E D L + PS SPEH + TMD I
Sbjct: 487 KMAYPILKKASLFFSNYLLEFPDNKELLISGPSNSPEH---------GGLVMGPTMDHQI 537
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
IR +F I A+++L + K+ K + R+ P KI + G + EWV+
Sbjct: 538 IRNLFRVTIEASKILNVDR-GFRMKLEKKMNRIMPNKIGKHGQLQEWVK 585
>gi|313149824|ref|ZP_07812017.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
gi|313138591|gb|EFR55951.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
Length = 824
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 228/620 (36%), Positives = 330/620 (53%), Gaps = 58/620 (9%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAP 69
N L + + PA ++ +A+P+GNG LGAMV+G E L+LNE TL++G P P
Sbjct: 25 NNLSLWYRQPAANWNEALPLGNGYLGAMVFGDAGREHLQLNESTLYSGEPFSGVGVPSIG 84
Query: 70 KALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
++V +L++ G YA A + + G + YQ L D+ L FD ++ E Y REL
Sbjct: 85 SVYNEVLALLNHGDYAGAHRLITRNWQGRLSQSYQPLADLFLSFD---VQGKVENYVREL 141
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
+L A ++Y G + +TRE+F SNPD+V+V +IS S ++ VS S
Sbjct: 142 NLQDAVHTIRYQAGGIRYTREYFISNPDRVMVIRISASRRSPVNVAVSYTSEHPTAKVDG 201
Query: 189 GNNQIIMEGRCPG---------------------------KRIPPKANANDDP---KGIQ 218
++I+ G+ PG +R K D KG+
Sbjct: 202 TGEELILSGQAPGCVERRTLDFLEKNRLTDRHPELFDSHGRRKTDKQVLYADEVGGKGMF 261
Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
F + +K+ T L+D +LKV G +LL+ A++S++G +PS D ++
Sbjct: 262 FQS--RVKVLKGNAT---LQDNQLKVSGEGEIILLVAAATSYNGFDRSPSQDGSDYQAKL 316
Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
+ L L Y DL RHL DYQ+LF RV++ L SE++ +P+ R+
Sbjct: 317 DTILSVAGQLPYEDLKKRHLADYQRLFGRVALTLK-----------SEKDYSGLPTDRRI 365
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
F+ + D +L LLFQ+GRYLLI+SSR G Q ANLQGIWN+D+ P W S+ +NIN EM
Sbjct: 366 IGFRDNPDNALAALLFQYGRYLLIASSREGGQPANLQGIWNKDVVPAWSSSYTININTEM 425
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
NYW + L EC EPLF + L++NGS TA Y GW HH T IW +S G+
Sbjct: 426 NYWPAETTGLPECSEPLFRLIRELAVNGSATAAKMYNLPGWTSHHITSIWRESGPADGEP 485
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
W +W M WLC HLW+HY ++ D+ FL + AYPL+ A F WL+E DG +T
Sbjct: 486 TWFMWNMSAGWLCRHLWDHYLFSGDKKFLRETAYPLMRDAARFYNAWLVE-KDGMWQTPL 544
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE-----DALVEKVL 573
SPE++F+ P+ K + V+ + MDMAIIRE+FS AA +L + D L+ V+
Sbjct: 545 GVSPENQFLTPEKKTSAVAPAPAMDMAIIRELFSNTAEAAAILAADSILPPADTLLLHVM 604
Query: 574 KSLPRLRPTKIAEDGSIMEW 593
+ +L P +I + G IMEW
Sbjct: 605 GA-KQLVPYRIGKRGQIMEW 623
>gi|146300857|ref|YP_001195448.1| hypothetical protein Fjoh_3112 [Flavobacterium johnsoniae UW101]
gi|146155275|gb|ABQ06129.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
[Flavobacterium johnsoniae UW101]
Length = 822
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 220/598 (36%), Positives = 334/598 (55%), Gaps = 33/598 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
LK+ +N PA +T+A+PIGNG LGAMV+G V SE ++LNE TLW+G P NP+A +
Sbjct: 26 LKLQYNQPAVEWTEALPIGNGTLGAMVFGRVDSELIQLNEATLWSGGPVQKNVNPNAFQN 85
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L+ +R + + + +A + + G ++ + LGD+ L D K + Y R LD+
Sbjct: 86 LALIREALKAEDFDKAYNLTKNMQGAYSESFMPLGDLLLTQDLGSKK--TDFYNRSLDIQ 143
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
T A + V + RE F+S P + IV K+S + LS ++ SLL N + N
Sbjct: 144 TGLAVTNFKADGVNYKREIFASAPAKCIVMKLSADQLKKLSVSIDASSLLKNQKEIQ-NQ 202
Query: 192 QIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKL 242
++++G+ P P + N +P +G++F I++ + D GT+S E K+
Sbjct: 203 SLVLKGKAPSHADPNYIDYNKEPVIYDDPAGCRGMRFELIVKPIVKD--GTVS-YEGNKI 259
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
++ + VL + A++SF+G P KD + + + ++ Y L HL D+Q
Sbjct: 260 VIKNASEIVLFISAATSFNGFDKCPDSQGKDEHAFAENPIKKASVKKYDILVKEHLQDFQ 319
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLL 361
K F+RVS+QL+ E + +P+ R++ + E D L L FQ+GRYLL
Sbjct: 320 KFFNRVSLQLNEK----------ETHKSNLPTDIRLEQYAKGEKDAGLEALFFQYGRYLL 369
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR ANLQGIWN L W S NINL+MNYW +LSE PL DF+
Sbjct: 370 ISSSRTHNAPANLQGIWNNKLRAPWSSNYTTNINLQMNYWPVESASLSELFFPLDDFVKN 429
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
+S+ G++TA+ Y A+GWV+HH +DIWA ++ +G +WA W MG WL HLWEH
Sbjct: 430 VSVTGAETAKSYYHANGWVLHHNSDIWATTNPVGDFGKGDPMWANWYMGANWLSRHLWEH 489
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y YT D ++L K+ YP+++G A F LDWL + +GYL T PSTSPE+++ K V+
Sbjct: 490 YQYTGDTEYL-KKVYPIIKGAAEFSLDWLQQDKNGYLVTMPSTSPENKYFYDGKKGGVVT 548
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+STMD+ II+++F A+++L + D +KV K+ +L P +I G + EW +
Sbjct: 549 TASTMDIGIIKDLFENTSQASKILNIDAD-FRQKVDKAANQLLPFQIGAKGQLQEWYK 605
>gi|227538538|ref|ZP_03968587.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33300]
gi|227241457|gb|EEI91472.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33300]
Length = 826
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 228/595 (38%), Positives = 330/595 (55%), Gaps = 49/595 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N LK+ ++ PA ++ +A+PIGNGRLGAMV+G E ++LNE+T+W G PG+ + +A
Sbjct: 28 NSLKLEYDKPAGNWNEALPIGNGRLGAMVFGQPDLEQIQLNEETIWAGGPGNNVSKNAYD 87
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEET 123
+ +R L+ G+ EA S F PA YQ GD+ + F D H +Y+ +
Sbjct: 88 KIQQIRRLLFEGKAKEAQDLSNATFPRPAPTGIDYGMPYQTFGDLRISFPD-HKQYS--S 144
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y RELD+ A R +Y G V +TRE F+S D V++ K+S SLSF++ L S DN
Sbjct: 145 YSRELDIQDAITRTRYKAGAVNYTREVFASLKDDVVIIKLSADTKKSLSFSIGLTSPHDN 204
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
N Q+ + G + +++ G IQF+ I+ + +G +D +L
Sbjct: 205 THITVENKQLTLSG---------ISGSHEGKTGQIQFTGIVRPIL---KGGKLIQKDNQL 252
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+V +D +L + ++F N +D + T+++++ L Y H+ YQ
Sbjct: 253 EVTHADEVILYISIGTNFK----NYNDITGNATAKALNILNKASGNKYGKAKADHIQKYQ 308
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ F+RVS+ L SP+ S++ D R++ F +DP LV L FQFGRYLLI
Sbjct: 309 QYFNRVSLYLGESPQ-------SKKMTDI-----RIREFGGADDPELVTLYFQFGRYLLI 356
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSS+PG Q A LQGIWN+ LSP WDS VNIN EMNYW + NL E EPLF L L
Sbjct: 357 SSSQPGGQPATLQGIWNDKLSPPWDSKYTVNINTEMNYWPAEVTNLKELHEPLFAMLKDL 416
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
++ G ++A+ Y A GW IHH TD+W S G + +WPMGGAWL HLW+H+ Y+
Sbjct: 417 AVTGQESAKELYHARGWNIHHNTDLWRISGVVDGG-FYGMWPMGGAWLSQHLWQHFLYSG 475
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR FL K Y +L+G A F LD L E H +L PS SPE+ ++ G VS +
Sbjct: 476 DRSFL-KEYYHVLKGKALFYLDVLQEEPTHQ-WLVVAPSMSPENSYLPGVG----VSAGT 529
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD ++ +VF I A+ VL+++ D L + V +L RL P +I + + EW+Q
Sbjct: 530 TMDNQLVFDVFHNFIQASAVLKQDAD-LRDSVQVALDRLPPMQIGQHNQLQEWLQ 583
>gi|257053761|ref|YP_003131594.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
utahensis DSM 12940]
gi|256692524|gb|ACV12861.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
utahensis DSM 12940]
Length = 784
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 216/593 (36%), Positives = 326/593 (54%), Gaps = 45/593 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + +A+PIGNGRLG M++G E ++ N DTLW G D TNPDA + + +VR
Sbjct: 13 YDEPASAWLEALPIGNGRLGGMIFGRPGCERVQFNADTLWAGGHEDRTNPDAREHVEEVR 72
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ G+ A A A KL G P + YQ GD+ ++ A YRRELDL+
Sbjct: 73 RLLFDGEVQRAQALADEKLMGDPIRLRPYQTFGDLSIDVGHD----AVTDYRRELDLSAG 128
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
ARV+Y + RE+F+S PD IV +++ E G+++ V LD D V + +
Sbjct: 129 VARVRYDHEGTTYVREYFASAPDDAIVIRLTAEEPGAVTATVGLDREQDADDSVR-DGTL 187
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS-----D 248
+ GR + +G+ F A ++ D G + + E S +
Sbjct: 188 QLRGRVVDDPDDDRGAGG---EGMAFEA--RASVTADAGNVQRVTGADAPEESSVGFRAE 242
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A + + + F G +DP + S L ++ + SY DL H+ D+++LF RV
Sbjct: 243 AADAMTIVLTGFTG------HETEDPGAACESVLDAVADQSYDDLRDTHVADHRELFDRV 296
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L P D TD E +D V + E DP+L L QFGRYLLI+SSRPG
Sbjct: 297 ELDLG-EPLDRPTD----ERLDRVATGE--------ADPNLTALYAQFGRYLLIASSRPG 343
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T+ ANLQG+WN++ P W+S +NINLEMNYW +L NL+EC PL+DF+ L G +
Sbjct: 344 TEPANLQGVWNQEFDPPWNSGYTLNINLEMNYWPALQTNLAECAAPLYDFVDDLREPGRR 403
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ +Y +G+ +HH +D+W +++A W LWPMG AWL +++HY +T D D L
Sbjct: 404 VAETHYDCAGFAVHHNSDLW-RNAAPVDGAHWGLWPMGAAWLSRLVFDHYAFTRDEDHLR 462
Query: 489 KRAYPLLEGCASFLLDWLIE--GHDG----YLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+ A P+L A+F+ D+L+E +G +L T PS SPE+ ++ DG+ A V+Y+ TM
Sbjct: 463 ETAEPILREAAAFVADFLVEHPAEEGEAEDWLVTAPSNSPENAYVTDDGQEATVTYAPTM 522
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D+ + R++F I+AAE+LE ED + + +L RL P ++ E G + EW++
Sbjct: 523 DVQLTRDLFEHTIAAAEILEV-EDEFHDDLRAALDRLPPMQVGEHGQLQEWIE 574
>gi|393773725|ref|ZP_10362119.1| twin-arginine translocation pathway signal protein [Novosphingobium
sp. Rr 2-17]
gi|392720900|gb|EIZ78371.1| twin-arginine translocation pathway signal protein [Novosphingobium
sp. Rr 2-17]
Length = 852
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 219/591 (37%), Positives = 310/591 (52%), Gaps = 45/591 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
+I N PA + P+GNGRLGAM+ G V + + LN DTLWTG P + + D L+
Sbjct: 56 RIADNSPATEWLLGHPVGNGRLGAMMGGSVRRDVISLNHDTLWTGQPSPHPDHDGRATLA 115
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VR V +G YA A S L G + + + D+ LE D + A YRRELDL+ A
Sbjct: 116 AVRKAVFAGDYAAADLLSRPLQGTFSQSFAPMADMTLELDHTQ---AVTAYRRELDLDRA 172
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V Y G+V F RE F+S PD VIV ++S S + ++S + L + L + GN
Sbjct: 173 IASVAYHCGDVAFRRELFASYPDNVIVLRLSASRAAAISGRIGLATSLLGSTRAAGNTLR 232
Query: 194 IMEGRCPGKRIP-------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+M G+ P + P P A + +G+ F+ +L +++ G + A D L V G
Sbjct: 233 LM-GKAPTRCEPNYREVPDPVAYSEQPGQGMAFATVLGVEVQG--GEVVASGDA-LSVRG 288
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D V+ + A++ F + P + ++ + + L SY L RHL D+Q L+
Sbjct: 289 ADVVVIRIAAATGFRRFDLLPDIAAEEVAAVAERNLAIAHQNSYGSLLKRHLADHQALYR 348
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R SI+L + D VT P AER LF GRYLLI+SSR
Sbjct: 349 RASIELQGAGDDQVT-----------PKAER---------------LFNLGRYLLIASSR 382
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
P T ANLQG+WN + P W + NINL+MNYW + CNL+EC PL D + L++NG
Sbjct: 383 PDTMPANLQGLWNAQVRPPWSANYTTNINLQMNYWSAETCNLAECHLPLMDHIERLALNG 442
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+K A+ Y GW +HH +D+WA ++ A G WA WPM G WL H+WEHY ++ D
Sbjct: 443 AKVARDLYGMPGWSVHHNSDVWAMANPVGAGDGDPNWANWPMAGPWLAQHVWEHYRFSGD 502
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL KR + L+ CA F WL+ + L T PS SPE+ F+ P GK + +S TM
Sbjct: 503 IAFLAKRGFALMRDCAEFCAAWLVRDPSSHRLTTAPSISPENLFLGPHGKPSAISSGCTM 562
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
D+A+ RE+F I+AA ++ + L + L L P +I G + EW
Sbjct: 563 DLALTRELFENCIAAANLV-GDRSGLAVHLKGLLQELEPYRIGRYGQLQEW 612
>gi|395804709|ref|ZP_10483944.1| hypothetical protein FF52_22604 [Flavobacterium sp. F52]
gi|395433097|gb|EJF99055.1| hypothetical protein FF52_22604 [Flavobacterium sp. F52]
Length = 823
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 223/605 (36%), Positives = 334/605 (55%), Gaps = 33/605 (5%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYT 64
S S LK+ + PA +T+A+P+GNG LGAMV+G V +E ++LNE TLW+G P
Sbjct: 20 SASAQKDLKLQYKQPAVEWTEALPVGNGTLGAMVFGRVEAEFIQLNEATLWSGGPVHKNV 79
Query: 65 NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY 124
NPDA K L+ +R + + + +A + + G ++ + LGD+ L+ D K A +Y
Sbjct: 80 NPDAFKNLALIREALKNEDFEKANVLTKNMQGPYSESFMPLGDLILKQDFGGQKAA--SY 137
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R LD+ T A ++ G V + RE F+S P Q IV K+S + LS + SLL N
Sbjct: 138 DRSLDIQTGLAVTSFNAGGVNYKREIFASAPAQCIVIKLSADQLKKLSVTIDAASLLKNQ 197
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTIS 235
V N ++++G+ P P + N +P +G++F I++ + D G IS
Sbjct: 198 KAVQ-NQTLVLKGKAPSHADPNYIDYNKEPVIYEDVTGCRGMRFELIIKPVVKD--GQIS 254
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+ E KL ++ + +L + A++SF+G P KD + + ++ + Y L
Sbjct: 255 S-EGDKLVIKNASEILLFVSAATSFNGFDKCPDSQGKDEHKFAEAPIKKVAGKKYDSLLK 313
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
H+ D+QK F+RVS+ L+ E + +P+ R++ + E D L L F
Sbjct: 314 EHIADFQKFFNRVSLMLNEK----------ETSKSDLPTDIRLEQYAKGEKDAGLEALFF 363
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISSSR ANLQGIWN L W S NINL+MNYW +LSE
Sbjct: 364 QFGRYLLISSSRTHNAPANLQGIWNNKLRAPWSSNYTTNINLQMNYWPVESGSLSELFFS 423
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWL 470
L +F+ S G++TA+ Y A+GWV+HH +DIWA ++ +G +WA W MG WL
Sbjct: 424 LDEFIKNASATGAETAKSYYHANGWVLHHNSDIWAMTNPVGDFGKGDPMWANWYMGANWL 483
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
HLWEHY YT D+++L K+ YP+++G A F LDWL + +G+L T PSTSPE+ F
Sbjct: 484 SRHLWEHYQYTGDKNYL-KKVYPIIKGAAEFSLDWLQKDKNGHLVTMPSTSPENIFYYDG 542
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
K V+ +STMD+AII+++F I A++VL + + +KV + L P +I G +
Sbjct: 543 KKQGTVTTASTMDIAIIKDLFENTIEASKVLYADLE-FRQKVNSAREELLPFQIGSKGQL 601
Query: 591 MEWVQ 595
EW +
Sbjct: 602 QEWYK 606
>gi|395803591|ref|ZP_10482835.1| hypothetical protein FF52_16993 [Flavobacterium sp. F52]
gi|395434145|gb|EJG00095.1| hypothetical protein FF52_16993 [Flavobacterium sp. F52]
Length = 816
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 220/595 (36%), Positives = 332/595 (55%), Gaps = 44/595 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + N LK+ ++ PA + +A+P+GNGRLGAMV+G E L+LNE+T+W G P +
Sbjct: 18 TATAQNDLKLWYDKPASIWNEALPLGNGRLGAMVFGDPAVERLQLNEETIWAGSPNSNAH 77
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF------GHPADVYQLLGDIELEFDDSHLKY 119
+ +AL VR L+ G++ EA + K G P YQ G + + F+ H KY
Sbjct: 78 TKSIEALPKVRQLIFEGKFDEAQDLATKDIMSQTNDGMP---YQTFGSVYISFN-GHQKY 133
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y R+LD++ ATA+VKY V VEFTRE ++ DQVIV K+S S+ G ++ NV ++S
Sbjct: 134 TD--YYRDLDISNATAKVKYKVNGVEFTREILTAFSDQVIVMKLSASKPGQITCNVFMNS 191
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+D NQII+ G N + ++F L K + G I A +
Sbjct: 192 PIDKTVTSTEGNQIILSGTG--------TNFENVKGKVKFQGRLTAK--NKGGEIDA-SN 240
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
L + +D +L + +++F N D D ++S L + ++ H+D
Sbjct: 241 GVLSINKADEVILYISIATNFK----NYKDISGDEIAKSKDYLAKAEIKDFENIKKAHVD 296
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
YQK F+RV++ L S E + P+ ER++ F DP L L FQFGRY
Sbjct: 297 YYQKFFNRVALDLG-----------SNELVKK-PTNERIRDFSKQFDPQLASLYFQFGRY 344
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSS+PG Q ANLQGIWN+ ++P WDS NIN EMNYW + NL E EP
Sbjct: 345 LLISSSQPGGQPANLQGIWNDMVTPPWDSKYTTNINAEMNYWPAQVTNLQELHEPFVQMA 404
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
L+I G++TA++ Y A+GWV+HH TDIW + +A +WP GGAW+C LWE Y
Sbjct: 405 KELAITGAETARMMYNANGWVLHHNTDIW-RVTAPVDSAASGMWPTGGAWVCQDLWERYL 463
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
YT D+ +L + YP+++G A F LD++I + + GYL PS+SPE+ GK + ++
Sbjct: 464 YTGDKKYLAE-IYPIMKGAADFFLDFMIVDPNTGYLVVVPSSSPENTHAGGTGK-STIAS 521
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+TMD +I ++F+ ++ A+ ++ + A V+KV ++L ++ P KI + + EW
Sbjct: 522 GTTMDNQLIFDLFTHVMEASALISPDA-AYVKKVSEALAKMPPMKIGKHSQLQEW 575
>gi|302548581|ref|ZP_07300923.1| alpha-L-fucosidase 2 [Streptomyces hygroscopicus ATCC 53653]
gi|302466199|gb|EFL29292.1| alpha-L-fucosidase 2 [Streptomyces himastatinicus ATCC 53653]
Length = 809
Score = 370 bits (950), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 225/590 (38%), Positives = 328/590 (55%), Gaps = 49/590 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + A + +A+PIGNGRLGAMV+GG SE L+LNEDT+W G P + +P A +L
Sbjct: 49 LALWYPRAASTWLEALPIGNGRLGAMVFGGAESELLQLNEDTVWAGGPYEPASPKALASL 108
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEET--YRRE 127
++R V +G++ A + G P +YQ +G++ L FD A E YRR
Sbjct: 109 PEIRRRVFAGEWEAAQSLIDSDFLGTPKGELMYQPVGNLRLAFD-----AAGEVGDYRRT 163
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL++A A V+Y+ G V + RE F+S+PDQVIV +++ G++SF + DS
Sbjct: 164 LDLDSAVASVRYAQGGVTYDRECFASHPDQVIVMRLTADRPGAVSFTAAFDS-------- 215
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDKKLKVE 245
Q ++ P + ++ +G+ Q + D GT+S+ E+ L V
Sbjct: 216 ---PQTVIAS-SPDRITVAIDGTSETREGVTGQVRFRALARARADGGTVSS-ENGTLTVT 270
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+D LL+ +S+ + NP+ D + + + L + ++ Y+ L RH+ DY+ LF
Sbjct: 271 GADSVTLLVSVGTSYTD-YRNPT---GDHAARATAPLNAASDVPYARLRKRHVADYRGLF 326
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L TD + +P+ ERV +F + DP LV L FQ+GRYLLISSS
Sbjct: 327 RRVGLDLG------TTDAAA------LPTDERVANFASATDPQLVALHFQYGRYLLISSS 374
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGTQ ANLQGIWN+ LSP+WDS +NIN EMNYW + NL EC EP+FD L LS+
Sbjct: 375 RPGTQPANLQGIWNDSLSPSWDSKYTININTEMNYWPAPVTNLLECWEPVFDLLADLSVA 434
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G+ TA+ Y A GWV HH TD W + +A + +W GGAWL T +W+HY +T D+
Sbjct: 435 GATTAKRQYGAGGWVTHHNTDAW-RGTAPVDRAFPGMWQTGGAWLSTGIWDHYLFTGDKK 493
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L +R YP+L G F LD L+ + G+ T P+ SPE+ V TMD
Sbjct: 494 ALRRR-YPVLRGSVRFFLDTLVTDPATGHFVTCPANSPENAHHTN----VSVCAGPTMDN 548
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEW 593
I+R++F + A+E+L ++ DA + ++ + R L P KI G + EW
Sbjct: 549 QILRDLFDGFVKASELLGEDADAGMRAEVRRVRRKLPPMKIGAQGQLREW 598
>gi|334144837|ref|YP_004538046.1| alpha/beta hydrolase domain-containing protein [Novosphingobium sp.
PP1Y]
gi|333936720|emb|CCA90079.1| alpha/beta hydrolase domain-containing protein [Novosphingobium sp.
PP1Y]
Length = 806
Score = 370 bits (949), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 220/589 (37%), Positives = 322/589 (54%), Gaps = 46/589 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA+ +T+A+P+GNGR+GAMV+GG E L+LNEDTLWTG P + NP A +AL
Sbjct: 63 RLWYCQPAREWTEALPVGNGRIGAMVFGGTGLERLQLNEDTLWTGGPYNPVNPSAREALP 122
Query: 74 DVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEE-TYRRELD 129
+R L++ G + +A T A +L P YQ GD+ + HL E+ +Y RELD
Sbjct: 123 QIRRLIEQGHFTQAQTLADARLMARPLSQMAYQTFGDLTIAM--PHLGTIEQGSYLRELD 180
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+ A A + V ++R+ +S QVI +S G + V L + D ++G
Sbjct: 181 LDAALAATTFKADGVSWSRKVIASPDHQVIAVHLSADRPGRMHCLVGLGAPHDGVLSIDG 240
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK--ISDDRGTISALEDKKLKVEGS 247
+I GR N+ G++ + E + + G IS + D KL VEG+
Sbjct: 241 GT-LIFGGR------------NNAAHGVEGALRFEARARVLPQGGRIS-VSDNKLAVEGA 286
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +L+ ++S+ D DP+ + S +++ S++ + +++L+ R
Sbjct: 287 DAVTILIAMATSYR----QFDDVGGDPSQITRSQIEAASRHSFARIAADTAASHRRLYRR 342
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
VS+ L +P P+ ER+++ +T +D +L L FQ+GRYLLI SSRP
Sbjct: 343 VSLDLGETPAA------------HRPTDERIRTSETSQDSALAALYFQYGRYLLICSSRP 390
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G+Q ANLQGIWN+ P W S +NIN EMNYW + P L EC PL + L+ G+
Sbjct: 391 GSQPANLQGIWNDSDDPPWGSKYTININTEMNYWPAEPTALGECVAPLVALVRDLAQTGA 450
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y A GWV HH TD+W +++A W LWPMGGAWLCTHLW+HY+Y D FL
Sbjct: 451 STAREMYGARGWVAHHNTDLW-RATAPIDGAAWGLWPMGGAWLCTHLWDHYDYHRDTAFL 509
Query: 488 EKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+ YPLL G A F LD L + GYL TNPS SPE+E P G C S +D I
Sbjct: 510 -RSVYPLLRGAALFFLDTLQRDPASGYLVTNPSISPENEH--PGGASVCAGPS--VDRQI 564
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+R++F+ AA +L ++D L ++L + RL P +I G + EW++
Sbjct: 565 LRDLFAQTARAATILGLDDD-LSAQILDTSRRLAPDEIGAQGQLQEWLE 612
>gi|424794811|ref|ZP_18220740.1| exported protein [Xanthomonas translucens pv. graminis ART-Xtg29]
gi|422795776|gb|EKU24406.1| exported protein [Xanthomonas translucens pv. graminis ART-Xtg29]
Length = 775
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 221/587 (37%), Positives = 321/587 (54%), Gaps = 42/587 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P+A AL
Sbjct: 30 LTLWYPRPATQWVEALPLGNGRLGAMVWGGIAHERLQLNEDTLYAGQPYDATSPEALAAL 89
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +G+Y EA A A KL P YQ L D+ L++D + + YRRELD
Sbjct: 90 PQVRALIFAGRYVEAEALADAKLLSRPRKQMPYQPLADLLLDYDRAD---GIDGYRRELD 146
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA A ++ RE F S +Q I+ ++S G ++ + +DS +
Sbjct: 147 LDTALASTRFVSDGATHLREVFVSATEQCILVRLSCDHPGRIALRIGIDSP-QAGEVTHE 205
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ GR A G++F+ + + S G + +E +++++G+D
Sbjct: 206 QGALLFAGR--------NAGFAGIEGGLRFALRVLPRAS---GGSTRIERGRIRIDGADE 254
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
VLLL A++S+ D DP + S + L++ LSY+ L RHL ++++LF RV+
Sbjct: 255 VVLLLTAATSYR----RYDDVGGDPLALSAAQLRTAAALSYAQLRERHLAEHRRLFRRVA 310
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
I L S +P+ ERV+ + DP+L L Q+GRYLLISSSRPG+
Sbjct: 311 IDLGSSAAA------------QLPTDERVRRYADGNDPALAALYHQYGRYLLISSSRPGS 358
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQG+WNE + P W S VNIN EMNYW S L EC EPL L L+ G+ T
Sbjct: 359 QPANLQGVWNELMQPPWQSKYTVNINTEMNYWPSEANALHECVEPLEAMLFDLAETGAHT 418
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
AQ Y A GWV+H+ TD+W ++ G V W+LWPMGG WL LW+ ++Y DR +L +
Sbjct: 419 AQAMYAAPGWVVHNNTDLWRQAGPVDG-VKWSLWPMGGVWLLQQLWDRWDYGRDRAYL-R 476
Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
R YPL +G A F + L+ + G + TNPS SPE+ P G C MD ++R
Sbjct: 477 RIYPLFKGAAEFFVATLVRDPQSGAMVTNPSLSPENRH--PFGAALCA--GPAMDAQLLR 532
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++F+ I +L + A E++ +L P +I G + EW Q
Sbjct: 533 DLFAQCIKMGALLGVDA-AFGERLATLRTQLPPDRIGRAGQLQEWQQ 578
>gi|326800280|ref|YP_004318099.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326551044|gb|ADZ79429.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 826
Score = 370 bits (949), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 222/607 (36%), Positives = 340/607 (56%), Gaps = 53/607 (8%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ +A + + ++ K+ ++ PA H+ +A+PIGNGRLGAM++GGV + L+LNE+T+W+G P
Sbjct: 21 IYSAVNATGSDSYKLWYDKPAAHWNEALPIGNGRLGAMLFGGVKQDHLQLNEETIWSGGP 80
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFD 113
G+ ++ D + ++R L+ +G+Y EA S K + YQ GD+ ++F
Sbjct: 81 GNNSSKDLYSTMQEIRRLLFAGKYKEAQDLSNKEMPREPEANNNYGMSYQPAGDLWIDF- 139
Query: 114 DSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
L E YRRELD+ A + V Y VG V + RE+ ++ DQVI+ +++ +GS+S
Sbjct: 140 ---LHEGETVAYRRELDIADALSTVTYRVGEVTYKREYLATAHDQVIMMRVTADRAGSIS 196
Query: 173 FNVSLDS--LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISD 229
N+ L++ L+ ++ N+I + G K+ + KG ++FS +E K+
Sbjct: 197 CNLKLNTPHLIHQQPFIG--NRIYVNGTSGDKQ---------NKKGQVKFSIAVEPKV-- 243
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
+G E + L+V +D + + ++F+ N D D + L + S
Sbjct: 244 -KGGALQAEGEMLRVRQADELTVYIAIGTNFN----NYHDLGGDARERADDYLNTALKKS 298
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL 349
Y + ++H++DY++ F RVS+ L ++ + + +++ RV F DP L
Sbjct: 299 YRKIKSKHVEDYRRYFDRVSLDLGQT---VAMNKATDQ---------RVADFHLGNDPQL 346
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
V L FQFGRYLLISSSRPGTQ ANLQGIWN+ LSP W S VNIN EMNYW + NLS
Sbjct: 347 VSLYFQFGRYLLISSSRPGTQPANLQGIWNDKLSPPWSSKYTVNINTEMNYWPAEVTNLS 406
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
E EPLF L LS+ G ++A Y A GW +HH TDIW + G + +WPMGGAW
Sbjct: 407 EMHEPLFAMLEDLSVTGKESAWNYYRARGWNMHHNTDIWRVTGIIDGG-FYGMWPMGGAW 465
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIA 528
L H+W+HY + D FL K YP+L+G F +D L E +L PS SPE+ + +
Sbjct: 466 LSQHIWQHYLFNGDNAFLAKY-YPILKGVTQFYVDVLQEEPKHKWLVVAPSMSPENSYQS 524
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
G +S +TMD ++ +VFS + AA VL+ +ED ++ V L RL P +I + G
Sbjct: 525 GVG----ISAGTTMDNQLVFDVFSNFLEAAHVLQVDED-FMDTVASKLKRLPPMQIGKLG 579
Query: 589 SIMEWVQ 595
+ EW++
Sbjct: 580 QLQEWME 586
>gi|319786653|ref|YP_004146128.1| alpha-L-fucosidase [Pseudoxanthomonas suwonensis 11-1]
gi|317465165|gb|ADV26897.1| Alpha-L-fucosidase [Pseudoxanthomonas suwonensis 11-1]
Length = 805
Score = 369 bits (948), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 231/597 (38%), Positives = 326/597 (54%), Gaps = 43/597 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + PL++ + PA + +A+P+GNGRLGAMVWGG SE L+LNEDTL+ G P D
Sbjct: 47 TAAPGRPLRLWYPRPATRWVEALPLGNGRLGAMVWGGGRSERLQLNEDTLYAGRPYDPVP 106
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDD-SHLKYAE 121
A +AL +VR L+ +G++AEA A A + G P YQ LGD+ L+F + S L
Sbjct: 107 DGALEALPEVRRLLFAGRHAEAEALADATMMGAPRKQMPYQPLGDLCLDFVEVSDL---- 162
Query: 122 ETYRRELDLNTATARVKYSVG-NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+ YRRELDL+ A A + G +E TRE F S DQ + ++ S+ G + + LDS
Sbjct: 163 DDYRRELDLDRAVATTSFGSGWKLEHTREAFVSAEDQCLAVRLRTSQPGRVRVRIGLDSD 222
Query: 181 LDNHSYV-NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
V +G+ +++ GR +A G++F+A L +++ RG
Sbjct: 223 HAQAEVVPDGDAGLLLRGR--------NGDAFGIEGGLRFAARLGVQV---RGGTLRRRG 271
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+++VEG+D VLLL A++SF D DP + + + L++ S+ L H
Sbjct: 272 DRIEVEGADEVVLLLTAATSFR----RYDDIGGDPEATTRTQLEAAARRSWDALLAAHEA 327
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
+Q+LF RV+I L RS E + +P ERV F DP L L QFGRY
Sbjct: 328 AHQRLFRRVAIDLGRS----------AEEVAALPIDERVARFAEGHDPELAALYHQFGRY 377
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LL+ SSRPGTQ ANLQGIWN+ L+P W+S +NIN EMNYW + L EC EPL +
Sbjct: 378 LLVCSSRPGTQPANLQGIWNDLLAPPWESKYTININTEMNYWPAEANALPECVEPLERMV 437
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
L+ G+ A+ Y A GWV+HH TD+W +++ G W LWP+GGAWL HLW+ ++
Sbjct: 438 AELAQTGADVARRMYGAPGWVVHHNTDLWRQAAPIDG-AKWGLWPLGGAWLLQHLWDRWD 496
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSY 538
Y + +LEK +PL G A F L+E G + T PS SPE+E P G C
Sbjct: 497 YGREPGYLEK-VWPLFRGAAEFFAATLVEDPTTGAMVTAPSISPENEH--PHGAALCAGP 553
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S MD I+R++F I A +L + D L ++ + RL P +I G + EW Q
Sbjct: 554 S--MDAQILRDLFGQCIEIAGLLGVDAD-LAARLARLRERLPPHRIGRAGQLQEWQQ 607
>gi|424665666|ref|ZP_18102702.1| hypothetical protein HMPREF1205_01541 [Bacteroides fragilis HMW
616]
gi|404573919|gb|EKA78670.1| hypothetical protein HMPREF1205_01541 [Bacteroides fragilis HMW
616]
Length = 821
Score = 369 bits (947), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 227/620 (36%), Positives = 329/620 (53%), Gaps = 58/620 (9%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAP 69
N L + + PA ++ +A+P+GNG LGAMV+G E L+LNE TL++G P P
Sbjct: 22 NNLSLWYRQPAANWNEALPLGNGYLGAMVFGDAGREHLQLNESTLYSGEPFSGVGVPSIG 81
Query: 70 KALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
++V +L++ G YA A + + G + YQ L D+ L FD ++ E Y REL
Sbjct: 82 SVYNEVLALLNHGDYAGAHRLITRNWQGRLSQSYQPLADLFLSFD---VQGKVENYVREL 138
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
+L A ++Y + +TRE+F SNPD+V+V +IS S ++ VS S
Sbjct: 139 NLQDAVHTIRYQAEGIRYTREYFISNPDRVMVIRISASRRSPVNVAVSYTSEHPTAKVDG 198
Query: 189 GNNQIIMEGRCPG---------------------------KRIPPKANANDDP---KGIQ 218
++I+ G+ PG +R K D KG+
Sbjct: 199 TGEELILSGQAPGCVERRTLDFLEKNRLTDRHPELFDSHGRRKTDKQVLYADEVGGKGMF 258
Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
F + +K+ T L+D +LKV G +LL+ A++S++G +PS D ++
Sbjct: 259 FQS--RVKVLKGNAT---LQDNQLKVSGEGEIILLVAAATSYNGFDRSPSQDGSDYQAKL 313
Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
+ L L Y DL RHL DYQ+LF RV++ L SE++ +P+ R+
Sbjct: 314 DTILSVAGQLPYEDLKKRHLADYQRLFGRVALTLK-----------SEKDYSGLPTDRRI 362
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
F+ + D +L LLFQ+GRYLLI+SSR G Q ANLQGIWN+D+ P W S+ +NIN EM
Sbjct: 363 IGFRDNPDNALAALLFQYGRYLLIASSREGGQPANLQGIWNKDVVPAWSSSYTININTEM 422
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
NYW + L EC EPLF + L++NGS TA Y GW HH T IW +S G+
Sbjct: 423 NYWPAETTGLPECSEPLFRLIRELAVNGSVTAAKMYNLPGWTSHHITSIWRESGPADGEP 482
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
W +W M WLC HLW+HY ++ D+ FL + AYPL+ A F WL+E DG +T
Sbjct: 483 TWFMWNMSAGWLCRHLWDHYLFSEDKKFLRETAYPLMRDAARFYNAWLVE-KDGMWQTPL 541
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE-----DALVEKVL 573
SPE++F+ P+ K + V+ + MDMAIIRE+FS AA +L + D L+ V+
Sbjct: 542 GVSPENQFLTPEKKTSAVAPAPAMDMAIIRELFSNTAEAAAILAADSILPPADTLLLHVM 601
Query: 574 KSLPRLRPTKIAEDGSIMEW 593
+ +L P +I + G IMEW
Sbjct: 602 GA-KQLVPYRIGKRGQIMEW 620
>gi|392964290|ref|ZP_10329711.1| Alpha-L-fucosidase [Fibrisoma limi BUZ 3]
gi|387847185|emb|CCH51755.1| Alpha-L-fucosidase [Fibrisoma limi BUZ 3]
Length = 821
Score = 369 bits (946), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 222/593 (37%), Positives = 325/593 (54%), Gaps = 48/593 (8%)
Query: 13 LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
K+ +N PA + + +A+PIGNGRLGAMV+G V ET++LNE T+W+G P NPDA A
Sbjct: 25 FKLWYNQPAGQTWENALPIGNGRLGAMVYGNVARETIQLNEHTVWSGGPNRNDNPDALAA 84
Query: 72 LSDVRSLVDSGQYAEATAASVKLF----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
L ++R+L+ G+ EA + K H ++Q +G++ L F+ H Y Y R+
Sbjct: 85 LPEIRTLIFDGKQKEAEKLANKAIITKKAH-GQMFQPVGNLHLTFN-GHDNYTN--YYRD 140
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LD+ A A+ Y+V V +TRE F+S PDQVIV ++ S+ G + F S +
Sbjct: 141 LDIERAIAKTTYTVDGVAYTREVFTSFPDQVIVVHLTASKPGRIDFTASYST-------- 192
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDP--KG-IQFSAILEIKISDDRGTISALEDKKLKV 244
Q P K + +D KG ++F I IK ++GT+++ D L V
Sbjct: 193 ---QQKADRKTTPAKDLTIAGTTSDHEGVKGMVRFKGITRIKT--EKGTLAS-TDTTLTV 246
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+G++ A + + +++F+ + D D + + S L SY+ + T H+ YQ
Sbjct: 247 KGANAATIYISIATNFN----SYKDVSGDENARAESYLNKAYPKSYAAMLTPHVAAYQNY 302
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV + L +P + +P+ ER+K+F+T DP L +Q+GRYLLISS
Sbjct: 303 FNRVRLDLGSTPTEAAK----------LPTDERLKNFRTATDPEFATLYYQYGRYLLISS 352
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN + P WDS +NIN +MNYW + NL+E EP + LS
Sbjct: 353 SQPGGQPANLQGIWNHRMRPPWDSKYTININAQMNYWPAEKTNLAELHEPFLRMVNELSE 412
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G +TA+V Y A GW+ HH TDIW + A G W +W GG W HLWEHY Y D+
Sbjct: 413 AGQETARVMYGARGWMAHHNTDIWRTTGAIDG-ATWGMWIAGGGWTAQHLWEHYLYNGDK 471
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+L YP+L+G A F +D+LIE H Y L NP TSPE+ A G + + +TM
Sbjct: 472 AYLAS-VYPILKGAAQFYVDYLIE-HPKYHWLVVNPGTSPENAPKAHGG--SSLDAGTTM 527
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D I +VFS I AAE+L K + A V+ + + +L P + + G + EW++
Sbjct: 528 DNQIAFDVFSTAIRAAEIL-KTDVAFVDTLKQKRSQLPPMHVGQHGQLQEWLE 579
>gi|326204164|ref|ZP_08194024.1| Alpha-L-fucosidase [Clostridium papyrosolvens DSM 2782]
gi|325985675|gb|EGD46511.1| Alpha-L-fucosidase [Clostridium papyrosolvens DSM 2782]
Length = 775
Score = 368 bits (945), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 218/587 (37%), Positives = 317/587 (54%), Gaps = 33/587 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA+ + +A+P+GNG LG MV GG+ E + LN DTLW+G+PG N + L +V+
Sbjct: 7 YKSPARIWEEALPVGNGGLGGMVHGGISHECIDLNNDTLWSGLPGQLINKNILPLLPEVQ 66
Query: 77 SLVDSGQ-YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
LVD G Y + + Y LG + L + L Y R L LNTA
Sbjct: 67 CLVDEGNNYDAQKLIEENILTGYSQSYLPLGRLLLTCE---LSGEINNYSRSLSLNTAVC 123
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+Y+ G V RE S PD V+ ++ +S S + +LDS L G +IM
Sbjct: 124 ETRYTCGGVNHCREVICSYPDNVMAVHMTADKSESFTLTATLDSQLRYQVNKKGRT-LIM 182
Query: 196 EGRCPGKRIPPKANAN--------DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
G CP IP A + + I FS + I +G +E+ + + +
Sbjct: 183 TGDCPSCMIPDYVEAGKHIVYDSEEHSRSIGFSVGMRAYI---KGGSVIVEENGISINAA 239
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +L+L +S++F+G I P S DP S+ + L S+++L +RH DD+ LF R
Sbjct: 240 DEVLLVLSSSTNFEGFDIMPGSSGVDPLSKCIRTLDKAAGYSWNELLSRHKDDHSSLFKR 299
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
V + L + +P+ ER+ ++ + DPSL L+F +GRYLLI+ SR
Sbjct: 300 VCLDLGTQSQ--------------LPTDERLAAYAKGQYDPSLDSLMFAYGRYLLIACSR 345
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+DL+ W S NINLEMNYW + NLSEC +PLFD L +S G
Sbjct: 346 PGTQAANLQGIWNKDLAAPWSSNYTTNINLEMNYWPAETANLSECHKPLFDLLKDVSKAG 405
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
S+ ++ NY G+V+HH TD+W +SA G+ W WPMGGAWL H+ EHY ++ D F
Sbjct: 406 SEISRENYGCRGFVLHHNTDLWRMASAVSGQARWGFWPMGGAWLSLHIMEHYRFSCDVVF 465
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L+ Y + E F LD++ GY TNPSTSPE+ FI +G++ ++ STMD+ I
Sbjct: 466 LQNHYYIMREA-VLFFLDYMKPDKKGYYITNPSTSPENAFIDKEGRICSITKGSTMDLFI 524
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
IRE+F + + A +L K + L +++ L +L P +I + G ++EW
Sbjct: 525 IRELFESCVEAQSIL-KIDSELSGLLVQRLCKLPPFRIGKKGQLLEW 570
>gi|346225024|ref|ZP_08846166.1| alpha-L-fucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 828
Score = 368 bits (945), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 227/602 (37%), Positives = 330/602 (54%), Gaps = 53/602 (8%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+ + LK+ ++ PA + +A+PIGNGRLGAMV+G +E ++LNE+T W+G P
Sbjct: 20 AKEMAQKTDLKLWYDKPANVWNEALPIGNGRLGAMVFGDPANEKIQLNEETFWSGGPSHN 79
Query: 64 TNPDAPKALSDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHL 117
NP A KAL VR L+ G+Y EA + + +L G +YQ +G++ L FD H
Sbjct: 80 DNPKALKALPKVRQLIFEGKYYEAEKMVNESMVAEQLHG---SMYQTIGNLNLSFD-GHE 135
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
Y Y RELD+ A Y+V +V F RE F+S P+Q+I K+S + GSLSF SL
Sbjct: 136 NYT--NYYRELDIENALFSTTYTVNDVNFKREVFASFPNQIIAVKLSSDQHGSLSFTASL 193
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISA 236
+ L ++ V N + M G +++++ +G ++F+ KI +D G I
Sbjct: 194 NGPLAKNTQVLDTNILEMTG---------ISSSHEGVEGQVKFNT--RAKILNDGGKIKT 242
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+ K+ V +D V+L+ +++F ++ + + L S+++L
Sbjct: 243 -DGNKITVTKADEVVILISMATNF----VDYKTLSANENEQCQKFLSEASQKSFAELKNA 297
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ DY+K F R S+ L +P SE P+ R+K+F DP+LV L +QF
Sbjct: 298 HIKDYRKYFTRSSLNLGTTP-------ASE-----YPTDVRIKNFSQTNDPALVALYYQF 345
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISSSRPG Q ANLQGIWN P WDS +NIN EMNYW + CNL+E EPL
Sbjct: 346 GRYLLISSSRPGGQPANLQGIWNNSTHPAWDSKYTININTEMNYWPAEKCNLTELHEPLI 405
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ LS GS TAQ Y GWV HH TDIW G W +WPMGGAWL HLWE
Sbjct: 406 QMVRELSETGSHTAQTMYGCDGWVTHHNTDIWRICGVVDG-AFWGMWPMGGAWLSQHLWE 464
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPSTSPEHEFIAPDGKLAC 535
+ Y D +L Y +++ F ++LIE +G+L +PS SPE+ AP G+
Sbjct: 465 KFLYNGDMKYLAS-VYSIMKSACRFYQNFLIEEPVNGWLVVSPSVSPEN---APAGR-PS 519
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEW 593
++ +TMD I+ ++FS I AA +L ++E+ + +L SLP P +I + G + EW
Sbjct: 520 ITAGATMDNQILFDLFSKTIKAATLLNQDENLISDFRNILDSLP---PMQIGQYGQLQEW 576
Query: 594 VQ 595
++
Sbjct: 577 ME 578
>gi|189465240|ref|ZP_03014025.1| hypothetical protein BACINT_01585 [Bacteroides intestinalis DSM
17393]
gi|189437514|gb|EDV06499.1| hypothetical protein BACINT_01585 [Bacteroides intestinalis DSM
17393]
Length = 826
Score = 368 bits (944), Expect = 6e-99, Method: Compositional matrix adjust.
Identities = 219/601 (36%), Positives = 343/601 (57%), Gaps = 44/601 (7%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+++ ++ T ++ ++ PA+ + +A+PIGNGR+GAMV+GG+ E ++LNE+T+WTG P
Sbjct: 20 LLSCQNNPDTTIWRLWYDQPAEKWEEALPIGNGRIGAMVFGGITKEKIQLNEETVWTGEP 79
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATA---ASVKLFGHPADVYQLLGDIELEFDDSHL 117
+NPDA A+ D+R L+ G+Y EA V + +YQ +GD+ L F
Sbjct: 80 NSNSNPDALNAIPDIRKLIFQGKYKEAQKLVDEKVISKTNHGMIYQPVGDLNLTFPGHE- 138
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ Y RELD+ +A A+ +Y+V +VE+ RE F+S DQVIV ++ S G + F+ L
Sbjct: 139 --TAKNYYRELDIESAIAKTRYTVNDVEYQREIFTSFTDQVIVIHLTASRKGKIVFSAEL 196
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISA 236
+S + + + N + ++G G ++ +G I FS + +KI ++G +
Sbjct: 197 NSPQKSQT-ITLENGLSLQGSTEG---------HEGLEGKISFSTL--VKIVPEKGQMKT 244
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
E ++ V +D AV + V+ ++ F+N ++ +P + S LQ Y+ L T
Sbjct: 245 -EASRITVSNAD-AVTIYVSIAT---NFVNYANLSGNPDQKVKSYLQHATQKDYAKLKTD 299
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+D Y+ F+RV +L VT+ + + R+ F +DP+L L FQF
Sbjct: 300 HMDYYRDYFNRVKFKLD------VTEAIQKT------TDVRIAEFAQGKDPNLAALYFQF 347
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLIS S+PGTQ ANLQGIWNE + P WDS NINLEMNYW + NLSE EPL
Sbjct: 348 GRYLLISCSQPGTQPANLQGIWNERMKPAWDSKYTTNINLEMNYWPTEITNLSELHEPLI 407
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
+ L++ G TA++ Y A GW++HH TD+W + A DR +WP GAWL HLW
Sbjct: 408 QMIKELAVTGGHTAKIMYGARGWMLHHNTDLWRTTGAVDRSGP--GMWPTCGAWLSRHLW 465
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLA 534
EH+ Y+ D+ +LE+ YP+++G A FLLD+ +E + + L PS+SPE+ F + KL
Sbjct: 466 EHFLYSGDKTYLEE-VYPIMKGAALFLLDFAVEEPEHHWLVIAPSSSPENTFDKKN-KLT 523
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ TMD ++ E+FS +ISA E+LE+++ + + + R+ P +I + EW+
Sbjct: 524 NTA-GVTMDNQLMFELFSNLISATEILERDQH-FADTLRQIRTRIPPMQIGRYSQLQEWM 581
Query: 595 Q 595
Sbjct: 582 H 582
>gi|300770084|ref|ZP_07079963.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33861]
gi|300762560|gb|EFK59377.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33861]
Length = 826
Score = 367 bits (943), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 225/601 (37%), Positives = 326/601 (54%), Gaps = 47/601 (7%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A N LK+ ++ PA ++ +A+PIGNGRLGAMV+G E ++LNE+T+W G PG+
Sbjct: 21 ATCLQAQNSLKLQYDKPAGNWNEALPIGNGRLGAMVFGQPDQEQIQLNEETIWAGGPGNN 80
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSH 116
+ +A + +R L+ G+ EA S F PA YQ GD+ + F H
Sbjct: 81 VSKNAYDKIQQIRRLLFEGKAKEAQDLSNATFPRPAPSGIDYGMPYQTFGDLRISFP-GH 139
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+Y +Y RELD+ A R +Y G V +TRE F+S D V++ K+S SLSF++
Sbjct: 140 KQYT--SYSRELDIQDAITRTRYKAGAVTYTREVFASLKDDVVIIKLSADTKKSLSFSIG 197
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTIS 235
L S DN N Q+ + G + +++ G IQFS I+ + +G
Sbjct: 198 LTSPHDNTHITVENKQLTLSG---------ISGSHEGKTGRIQFSGIVRPVL---KGGTL 245
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+D +L++ +D +L + ++F +D + ++++ L Y
Sbjct: 246 IQKDNQLEITNADEVILYISIGTNFK----KYNDITSNAAAKALDILNKATARKYEKAKA 301
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ YQ+ F+RVS+ L SP+ S++ D R++ F +DP LV L FQ
Sbjct: 302 DHIQKYQQYFNRVSLYLGESPQ-------SKKMTDI-----RIREFGGADDPELVTLYFQ 349
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSS+PG+Q A LQGIWN+ LSP WDS VNIN EMNYW + NL E EPL
Sbjct: 350 FGRYLLISSSQPGSQPATLQGIWNDKLSPPWDSKYTVNINTEMNYWPAEVTNLKELHEPL 409
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L L++ G ++A+ Y A GW IHH TD+W S G + +WPMGGAWL HLW
Sbjct: 410 FAMLKDLAVTGQESAKELYHARGWNIHHNTDLWRISGVVDGG-FYGIWPMGGAWLSQHLW 468
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLA 534
+H+ Y+ DR FL K Y +L+G A F LD L E +L PS SPE+ + G
Sbjct: 469 QHFLYSGDRSFL-KEYYHVLKGKALFYLDVLQEEPTHKWLVVAPSMSPENSYQPGVG--- 524
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
VS +TMD ++ +VF I A+E+L+++ D L + V +L RL P +I + + EW+
Sbjct: 525 -VSAGTTMDNQLVFDVFHNFIQASEILKEDAD-LRDSVQVALHRLPPMQIGQHNQLQEWL 582
Query: 595 Q 595
Q
Sbjct: 583 Q 583
>gi|261406479|ref|YP_003242720.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261282942|gb|ACX64913.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 783
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 214/591 (36%), Positives = 321/591 (54%), Gaps = 38/591 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ PA+ +T+A P+GNGRLGAMV+GGV +E + LNED++W G P + NP+A + L
Sbjct: 7 KLVERRPAQVWTEAFPVGNGRLGAMVFGGVSTERIGLNEDSVWYGGPKQHDNPEAIEKLD 66
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRRELDL 130
D+RSL+ G+ EA ++ F + YQ LGD+ L+F + YRREL+L
Sbjct: 67 DIRSLLRCGELREAEQLALTHFTNAPPYFGPYQPLGDLLLQFKSGTSEVNH--YRRELNL 124
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVNG 189
T A V + + + RE F+S QV+V +IS SE ++ + L D +
Sbjct: 125 RTGVASVSWEENGILYEREVFASAVHQVLVIRISSSEPAAIHLSARLSRRPFDGNIKREN 184
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ MEG C P G+ ++ +L+ + G L ++ +D
Sbjct: 185 ERTLAMEGIC-------------GPDGVTYATVLQ---AHTIGGKCHTVGNYLDIQSADA 228
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
LLL A +SF DP E++ +S L Y+ L H+ D+ L RVS
Sbjct: 229 VTLLLAAQTSF---------RCDDPYREALRQAESAVLLPYASLLEEHITDHCALLERVS 279
Query: 310 IQL-----SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLIS 363
+++ S +P + + +E P++ER++ + Q DP L L +Q+GRYL+++
Sbjct: 280 LEIEAADTSIAPVSEESASEAEAVAVDRPTSERLQLYRQGGNDPGLEALFYQYGRYLMMA 339
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG+ ANLQGIWNE +P W+S H+NINL+MNYW + NL EC EPLFDF+ L
Sbjct: 340 SSRPGSLPANLQGIWNESFTPPWESDYHLNINLQMNYWIAETGNLPECHEPLFDFIDRLV 399
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
ING KTA Y A G+ H +++WA+S WPMGGAWL HLWEHY Y +
Sbjct: 400 INGRKTAASLYGARGFTAHASSNLWAESGLFGAWTPAIFWPMGGAWLALHLWEHYRYNLS 459
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
FL +RAYP+L+ + F LD+L+ +G L T+PS SPE+ +I G++ +S +MD
Sbjct: 460 ESFLSERAYPVLKEASLFFLDFLVFDENGSLVTSPSLSPENSYINEKGQIGSLSSGPSMD 519
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I + +A I AAE+L +++ + + + +L +I G +MEW
Sbjct: 520 SQMIYALLTACIEAAEILGLDKE-WSRQWMDTRAKLPQPQIGRYGQVMEWA 569
>gi|346724703|ref|YP_004851372.1| hypothetical protein XACM_1797 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346649450|gb|AEO42074.1| hypothetical protein XACM_1797 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 790
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 222/595 (37%), Positives = 321/595 (53%), Gaps = 46/595 (7%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISVRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
++ GR N GI+ +++ G +S + D +
Sbjct: 216 EVTAE-QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVRGGKLSQVRD-R 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLASTAACLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAIDLGSSAA------------TQLPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECAEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWP+GG WL LW+ ++Y
Sbjct: 426 LAQTGAHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPLGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L + + L +++ +L P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAE-LAQQLAALREQLPPNRIGKAGQLQEWQQ 593
>gi|304404820|ref|ZP_07386481.1| Alpha-L-fucosidase [Paenibacillus curdlanolyticus YK9]
gi|304346627|gb|EFM12460.1| Alpha-L-fucosidase [Paenibacillus curdlanolyticus YK9]
Length = 769
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 206/588 (35%), Positives = 332/588 (56%), Gaps = 39/588 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N + + PA+ + +A PIGNG+LGAMV+G E ++LNE+++W G P N +A
Sbjct: 2 NNTTLRYKKPAQEWVEAFPIGNGKLGAMVFGRPFEERIQLNEESVWHGGPLQRDNVEALP 61
Query: 71 ALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
L ++R L+ +GQ EA + + + P D+ YQ LG++ ++FD + Y RE
Sbjct: 62 NLPEIRRLLFAGQPDEAEKLAFQTMISTPEDLGPYQTLGELAIQFDRED-QGEPSDYVRE 120
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL T V Y G V F R+ F+S PD VIV ++S L F +L S +
Sbjct: 121 LDLATGVVSVHYEAGGVRFRRDSFASGPDGVIVYRLSADRQRRLFFTSTLSREEGTVSPL 180
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
G++ ++++G+C P+G+Q++A+L +I + G +SA E + + +
Sbjct: 181 -GSDTLVLQGQC-------------GPEGVQYAAVL--RIVCEGGRLSA-EGNTIMISDA 223
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D A + + A+++F + D + S L + + ++ H+ +++ LF R
Sbjct: 224 DTATIYIAAATTF---------READLLAVSEQKLNAAIAKGFEEVRRSHIAEHRGLFDR 274
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSR 366
V+++L ++ D +E +++P+ ER+ F+ D + L+EL F FGRYLL+SSSR
Sbjct: 275 VALELRKA-----GDHPAEH--ESLPTDERLARFRNGDRESGLIELFFHFGRYLLLSSSR 327
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
G+ ANLQGIWN+ ++P W+S H NIN++MNYW + NL+EC EPLFD++ L +NG
Sbjct: 328 RGSLPANLQGIWNDSMTPPWESDFHTNINIQMNYWPAEVTNLAECHEPLFDYIDQLRVNG 387
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+TAQ Y A G+ +HH +++WA +S + WPMGGAWL H+WEHY Y D F
Sbjct: 388 RRTAQAMYGARGFCVHHTSNLWADASITSRWLPAMFWPMGGAWLTLHMWEHYLYGGDIAF 447
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L RAYP + A F LD++++ G T PS SPE+ + P+G + +MD +
Sbjct: 448 LRDRAYPAMRESALFFLDFMVQDPQGRWVTAPSVSPENSYRLPNGNEGALCAGPSMDTQM 507
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
IR +F A ++A E+LE++ D + ++ + L + IA +G++MEW
Sbjct: 508 IRMLFEACLTALELLEES-DEIASELRERLAGMPEQGIASNGTLMEWA 554
>gi|291545123|emb|CBL18232.1| hypothetical protein RUM_22260 [Ruminococcus champanellensis 18P13]
Length = 776
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 213/589 (36%), Positives = 315/589 (53%), Gaps = 44/589 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA++F A+P+GNGR+GAMV+GGV +E LKLNED++W+G + NPDA + + +R
Sbjct: 9 YTKPAENFDQALPVGNGRMGAMVFGGVETEHLKLNEDSIWSGGLRNRNNPDAYQGMQQIR 68
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ + +EA + + + G P + Y LGD+++ F H + YRR LDL++
Sbjct: 69 MLLQQEKISEAEELAFQTMQGCPENSRHYMPLGDLDVVF---HKESHSTAYRRTLDLSSG 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A +Y++ V++ R F S PD V+V +S + G +SF S G +
Sbjct: 126 IALTEYTLDGVQYQRSVFVSEPDNVLVLHVSADQPGQVSFAASF----------GGRDDY 175
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
E R G+ +GIQF+ ++ + R +L VEG+D A LL
Sbjct: 176 YDENRPDGEASICVTGGQGGQQGIQFAVVMTAAVQGGRAFTRG---NQLCVEGADEATLL 232
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
L +SF K + E+ + + S+ +L RH+DDY+ LF RV ++L
Sbjct: 233 LAVQTSF---------YKGEGYLEAAQLDAEYAADCSFHELMVRHVDDYRALFDRVKLEL 283
Query: 313 -------SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
++ P D + D +A + D L EL F +GRYL+IS S
Sbjct: 284 EDNSGEGAQLPTDARLSRLRGNDFDGKDAAGLIL------DNKLTELYFNYGRYLMISGS 337
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG+Q NLQGIWN+D+ P W S VNIN EMNYW + CNLSEC PLFD + + N
Sbjct: 338 RPGSQPLNLQGIWNQDMWPAWGSRFTVNINTEMNYWCAESCNLSECHLPLFDLIRRMRPN 397
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+ Y G+V HH TD+W + + +WPMG AWLC H++EHY YT+DRD
Sbjct: 398 GEQTARDMYHCGGFVCHHNTDLWGDCAPQDRWMPATIWPMGAAWLCLHIFEHYQYTLDRD 457
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL ++ + L G A F +++ E G L T PS SPE+ ++ G + +MD
Sbjct: 458 FLAQQ-FDTLCGAAQFFTEYMFENSAGQLVTGPSVSPENTYLTASGAKGSLCIGPSMDSQ 516
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
II +F+ ++ AA +LE+ E L+EK+ + LPRL +I + G I EW
Sbjct: 517 IITLLFTDVLEAARILER-ESPLLEKIRQMLPRLPMPEIGKYGQIKEWA 564
>gi|294675358|ref|YP_003575974.1| hypothetical protein PRU_2729 [Prevotella ruminicola 23]
gi|294473191|gb|ADE82580.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 821
Score = 367 bits (941), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 217/595 (36%), Positives = 330/595 (55%), Gaps = 52/595 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA + +A+PIGN LGAMV+GG+ +E ++LNE+T W+G P + NPDA A+
Sbjct: 23 KLWYSKPAAQWLEALPIGNSHLGAMVYGGIGTEQIQLNEETFWSGSPHNNNNPDAKVAMK 82
Query: 74 DVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
DVR L+ G+ EA A K F G Y LGD+ L FD + AE + YRREL+L
Sbjct: 83 DVRRLIFEGKEKEAEALIDKTFFKGPHGQKYLPLGDLMLSFD--YQNGAEPSNYRRELNL 140
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A + V +V++ R F+S D I+ +++ S+ +L+F VS
Sbjct: 141 GDALCTTSFDVADVKYIRTAFASQADNAIIIQLTASKKKALNFGVSYQ-----------R 189
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDKKLKVEGSD 248
NQ +EG K N + +GI + A + +K+ D GT++ + ++V +
Sbjct: 190 NQQAVEGGAVAKNEHAYIINNVEHEGIAGKLQAEVRVKVVAD-GTVTDM-GSDMQVRNAT 247
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A + + A++++ +N DP +++ +Q ++ +Y L RHLD YQ + RV
Sbjct: 248 NATIFITAATNY----VNYQTINGDPVAKNNLTMQLLKGKNYKQLLKRHLDKYQDQYDRV 303
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGRYLLISSSRP 367
S+ L++S + +P+ ER+ +F TD D +V L+ Q+GRYLLISSS+P
Sbjct: 304 SLSLAKSAQS------------ELPTDERLAAFDGTDLD--MVSLMMQYGRYLLISSSQP 349
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQG+WN + P WDS +NIN EMNYW + NL+E QEPLF + LS+ G+
Sbjct: 350 GGQPANLQGVWNHKMDPAWDSKYTININAEMNYWPANVGNLAETQEPLFSMIRDLSVTGA 409
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y GWV HH TD+W + G W ++P GGAWL THLW++Y YT D+ FL
Sbjct: 410 KTARTMYNCPGWVAHHNTDLWRIAGPVDG-TSWGMFPTGGAWLTTHLWQYYLYTGDKRFL 468
Query: 488 EKRAYPLLEGCASFLLDWL--------IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+ YP+L+G + FLL ++ ++ G+L T P+ SPEH P GK V+
Sbjct: 469 DA-CYPILKGASDFLLSYMQEYPKNGEVKQAAGWLVTVPTVSPEH---GPVGKNTTVTAG 524
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
STMD I+ +V S+ + A ++L N + ++ +L P +I G + EW+
Sbjct: 525 STMDNQIVFDVLSSTLRAHQILGYNNVVYTTMLSNAIAKLPPMQIGRYGQLQEWL 579
>gi|284039852|ref|YP_003389782.1| alpha-L-fucosidase [Spirosoma linguale DSM 74]
gi|283819145|gb|ADB40983.1| Alpha-L-fucosidase [Spirosoma linguale DSM 74]
Length = 864
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 217/613 (35%), Positives = 318/613 (51%), Gaps = 48/613 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKA 71
L + +N PA +++A+P+GNG +GAMV+G E L+LNE TL++G P + + K
Sbjct: 25 LTLWYNKPATVWSEALPLGNGYMGAMVFGDPAKEHLQLNEGTLYSGDPASTFKAINVRKD 84
Query: 72 LSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
V +L+ + QY EA + K G +YQ +GD ++ D H A YRR+ D+
Sbjct: 85 FKQVSALLAAKQYQEAQSLIAKEWLGRNHQLYQPMGDFWIDVD--HKNEAITDYRRQFDI 142
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-YVNG 189
TATA +Y VGN +TR +F+S PD VIV K++ + G ++ L + ++ + Y
Sbjct: 143 ATATATTRYKVGNTTYTRTYFASYPDHVIVVKLTANGPGKINCTFHLSTPHESTARYAAQ 202
Query: 190 NNQIIMEGRCPG---------------------------KRIPPKANANDDPK--GIQFS 220
N + M G+ PG +R P N D + G+ +
Sbjct: 203 GNTLTMRGKVPGFGLRRTFEQIEKAGDQYKYPEVYEKNGQRKPGIDNMLYDRQINGLGMA 262
Query: 221 AILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
+K+ G I ++ L V+ + V +L A++S++G +P+ DP
Sbjct: 263 FETRVKVQHTGGRIRQ-DNNALTVQDASEVVFVLSAATSYNGFDKSPAYEGVDPKPILDQ 321
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
++I SY+ LY HL DY+KLF RV IQL+ +E P+ +RV+
Sbjct: 322 RFKAIEKKSYAALYQTHLADYKKLFDRVDIQLA-----------AETEQSQRPTDQRVEL 370
Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
F DPS L FQ+GRYL+I+ SRPG Q NLQG+WN+ + P W+ +NIN +MNY
Sbjct: 371 FSNGLDPSFAALYFQYGRYLMIAGSRPGGQPLNLQGMWNDLMVPPWNGGYTININAQMNY 430
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + NLSECQEP F + L+ING +TA+ Y GWV HH DIW + +
Sbjct: 431 WPAELTNLSECQEPFFKAVKELAINGHETARSMYGNDGWVAHHNMDIW-RHAEPVDLCNC 489
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
+ WPM WL +H WE Y ++ D FL+K +PLL+G F WL++ GYL T
Sbjct: 490 SFWPMAAGWLTSHFWERYLFSGDPIFLKKEVFPLLKGAVQFYQGWLVKNEQGYLVTPVGH 549
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPE F+ D K A S TMDMAI+RE FS + A + L +D V ++L +L
Sbjct: 550 SPEQNFLYDDKKQATFSPGPTMDMAIVRESFSRYLEACKTLGITDD-FTAGVKQNLSQLL 608
Query: 581 PTKIAEDGSIMEW 593
P +I + G + EW
Sbjct: 609 PYQIGKYGQLQEW 621
>gi|261416181|ref|YP_003249864.1| alpha-L-fucosidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|385791048|ref|YP_005822171.1| carbohydrate binding protein, CMB family 6 [Fibrobacter
succinogenes subsp. succinogenes S85]
gi|261372637|gb|ACX75382.1| Alpha-L-fucosidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|302326443|gb|ADL25644.1| carbohydrate binding protein, CMB family 6 [Fibrobacter
succinogenes subsp. succinogenes S85]
Length = 999
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 226/595 (37%), Positives = 328/595 (55%), Gaps = 53/595 (8%)
Query: 8 STTNPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+T NPL + +N A FT+A+PIGNG +G +++GGV + + LNE T+W+G PGD
Sbjct: 30 TTDNPLTLWYNSDAGTEFTNALPIGNGYMGGLIYGGVEKDYIGLNESTVWSGGPGDNNKQ 89
Query: 67 DAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
A L D R + G Y A + S + G +Q +GD L SH YR
Sbjct: 90 GAASHLKDARDALWRGDYRTAESIVSQYMIGPGPASFQPVGD--LVISTSH--KGSSNYR 145
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
RELDL TA A+ Y+VG V+ TRE+F+S PD VIV +S + GS+SF ++ + N+
Sbjct: 146 RELDLKTAIAKTTYTVGGVKHTREYFASYPDHVIVVHLSADKDGSVSFGATMTTPHRNNR 205
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ N +I + I+F + + D GT+S + + + V+
Sbjct: 206 MTSSGNTLIYDVTV---------------NSIKFQN--RLTVVADGGTVS-VSNGNINVQ 247
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G++ A L+L +++F + +D DP + + + + SY DL HL DYQ +F
Sbjct: 248 GANSATLILTTATNFK----SYNDVSGDPGAIASEIMSKVAKKSYEDLLAAHLKDYQTIF 303
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RV + L + K S +I ++ RVK+F + DPSLVEL +Q+GRYLLI+SS
Sbjct: 304 NRVKLDLGTADK-------SAGDI----TSTRVKNFNSTNDPSLVELHYQYGRYLLIASS 352
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R G Q ANLQGIWN+D +P W S NINLEMNYW + NL EC PL D + +
Sbjct: 353 RKGGQPANLQGIWNKDTNPIWGSKYTTNINLEMNYWPAESGNLEECVWPLIDKIKSMVPQ 412
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT-MD 483
G KTA+V++ + GWV HH TD+W +S+ G W LWP G WL THLWEH+ Y D
Sbjct: 413 GEKTAKVHWGVDEGWVEHHNTDLWNRSAPIDG--AWGLWPTGAGWLTTHLWEHFLYNPTD 470
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
+ +L+ Y ++G A F ++ L+E + YL T PS SPE++ G C +
Sbjct: 471 KAYLQD-VYSTMKGAALFFVNSLVEEPTTGNKYLVTAPSDSPENDH---GGYNVC--FGP 524
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD IIR+V + I A+++L +ED + K+ ++ RL PTK + G I EW+Q
Sbjct: 525 TMDNQIIRDVLNYTIEASKILGVDED-VRAKMEATVKRLPPTKTGKYGQITEWLQ 578
>gi|255035049|ref|YP_003085670.1| hypothetical protein Dfer_1256 [Dyadobacter fermentans DSM 18053]
gi|254947805|gb|ACT92505.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 768
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 214/601 (35%), Positives = 323/601 (53%), Gaps = 79/601 (13%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL + + PA+ + +A+PIGNG L AM++GGV +E ++ NE+TLWTG P Y + A
Sbjct: 25 PLTLWYEQPARQWEEALPIGNGALAAMIFGGVETEQIQFNEETLWTGEPRSYAHKGASAY 84
Query: 72 LSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRREL 128
L +R L++ G+ EA A A+ + P YQ GD+ L+F H+++ Y REL
Sbjct: 85 LEQIRRLLNEGKQKEAEALANEQFMSQPMRQMAYQAFGDVYLDFP-GHVQH--RAYHREL 141
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL AT + Y G V +TRE F+S P + I I+ S+ L F V + ++
Sbjct: 142 DLRAATVKSSYESGGVRYTREAFASYPAKAIYYHINSSQKSKLDFTVRMSTI-------- 193
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE---------- 238
PK NA + +E+++ + G + L
Sbjct: 194 --------------HAKPKVNAEKN--------TIELEVQVENGALHGLARLKLLTDGKL 231
Query: 239 ---DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
D K++V G+ A ++L A++++ IN + DP ++ +ALQ+ + Y +
Sbjct: 232 KTADGKIEVTGATSATIVLSAATNY----INYKNVNGDPRAKVTAALQNAPD-DYKKAAS 286
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLF 354
HL DYQKLF+R ++ L S +P+ +R+ F+ + +DP+L+ L
Sbjct: 287 GHLADYQKLFNRFALDLPASKGS------------ALPTDQRLSQFKHNPDDPALLALYV 334
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QF RYLLI+SSRPGT ANLQG WN L+P+WDS VNIN EMNYW + NLSEC +P
Sbjct: 335 QFARYLLITSSRPGTHPANLQGKWNHKLNPSWDSKYTVNINTEMNYWPAELTNLSECHQP 394
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LF + +S G++ A+ +Y A+GWV+HH TD+W + +A +W GGAWL HL
Sbjct: 395 LFQMVKEVSETGAEVAKEHYNANGWVLHHNTDVW-RGAAPINASNHGIWVTGGAWLSLHL 453
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKL 533
WEHY +T D+ FL+ AYPL++G A F LD+L++ G+L ++PS SPE +G L
Sbjct: 454 WEHYRFTEDKAFLQNTAYPLMKGAAQFFLDFLVKDPKTGHLVSSPSNSPE------NGGL 507
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
TMD IIR +F A A +L K + +K+ ++ ++ P +I G + EW
Sbjct: 508 VA---GPTMDHQIIRALFKACAETAGIL-KTDAVFAQKLTETAKQIAPNQIGRHGQLQEW 563
Query: 594 V 594
+
Sbjct: 564 M 564
>gi|325927089|ref|ZP_08188358.1| hypothetical protein XPE_2359 [Xanthomonas perforans 91-118]
gi|325542534|gb|EGD14007.1| hypothetical protein XPE_2359 [Xanthomonas perforans 91-118]
Length = 790
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 225/596 (37%), Positives = 322/596 (54%), Gaps = 48/596 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISVRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
++ GR N GI+ +++ G +S + D +
Sbjct: 216 EVTAE-QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVRGGKLSQVRD-R 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLASTAACLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAIDLGSSAA------------TQLPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G++TA+ Y A GWV+H+ TD+W ++ G W+LWP+GG WL LW+ ++Y
Sbjct: 426 LAQTGARTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPLGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L DA + L +L +L P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLL--GIDAEFAQQLAALREQLPPNRIGKAGQLQEWQQ 593
>gi|146298534|ref|YP_001193125.1| hypothetical protein Fjoh_0772 [Flavobacterium johnsoniae UW101]
gi|146152952|gb|ABQ03806.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
[Flavobacterium johnsoniae UW101]
Length = 802
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 213/590 (36%), Positives = 337/590 (57%), Gaps = 35/590 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALSDV 75
+ PA+ F +++ +GNG++G+ V+GGV S+ + LN+ TLW+G P + NP+A K + +
Sbjct: 32 YKQPAEFFEESLVLGNGKMGSTVFGGVNSDKIYLNDITLWSGEPVNANMNPEAYKNIPAI 91
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A + K+ G ++ Y LG +E+ ++ K YRRELD++ A +
Sbjct: 92 RETLQNENYKLAEELNKKVQGKNSESYAPLGTLEI---NNSEKGKAVNYRRELDISNAVS 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+V Y + +++TRE+F S DQ+++ K++ + G+L+F+++L SLL ++ V NN ++M
Sbjct: 149 KVSYEMAGIKYTREYFVSAQDQIMIIKLTADQKGALNFDINLKSLLKSNVEVR-NNILVM 207
Query: 196 EGRCP-----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
G P G + PK A D +G +F+ +++IK +D + T S + L ++ + A
Sbjct: 208 TGSAPIHENAGYNVLPKYLALKD-RGTRFTGLVQIKKTDGKITSSR---ETLTLKDATEA 263
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
++ + ++SF+G NP+ D + + L + + H+ DYQK ++RV +
Sbjct: 264 IIYVSVATSFNGFDKNPASEGLDDIAIAAQNLNKAFEKPFDKIKESHIADYQKFYNRVDL 323
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGT 369
L ++ +P+ ER+ + +ED +L L F +GRYLLISSSR
Sbjct: 324 NLGKT------------TAPDLPTDERLLRYADGNEDKNLEILYFNYGRYLLISSSRTLG 371
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQG+WN LSP W S +NINLE NYW + NLSE + L F+ LS+ G T
Sbjct: 372 VPANLQGLWNLHLSPPWSSNYTMNINLEENYWLAENTNLSEMHKSLLSFIKNLSVTGKVT 431
Query: 430 AQVNY-LASGWVIHHKTDIWAKSS--ADRGKV--VWALWPMGGAWLCTHLWEHYNYTMDR 484
A+ Y + GW H +DIWA ++ GK +WA WPM GAWL TH+WEHY +T D
Sbjct: 432 AKTFYGVDKGWAAAHNSDIWAMTNPVGQFGKEDPMWACWPMAGAWLSTHIWEHYIFTQDE 491
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+L+K YPL++G A F L WL+ G L T+PSTSPE+++ DG + Y T D+
Sbjct: 492 TYLKKEGYPLMKGAAEFCLGWLVTDKKGNLITSPSTSPENQYKLEDGFVGATFYGGTADL 551
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
A+IRE F I A++VL N DA L++ L +L P +I + G++ EW
Sbjct: 552 AMIRECFDKTIKASKVL--NTDASFRVKLETVLSKLHPYQIGKKGNLQEW 599
>gi|408671641|ref|YP_006870551.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
gi|387857648|gb|AFK05740.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
Length = 868
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 218/615 (35%), Positives = 319/615 (51%), Gaps = 60/615 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKALSDV 75
++ PA +T+A+PIGN +GAM++G E ++LNE TL++G P + N K V
Sbjct: 31 YDKPASVWTEALPIGNSYMGAMIFGDSRQEHIQLNESTLYSGEPDATFKNISVRKYYQQV 90
Query: 76 RSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
L+ +G+Y EA A K L G VYQ LGD F+ A Y+R LD+++AT
Sbjct: 91 TELLKAGKYQEADAIVAKELLGRNHQVYQPLGDFWANFEHGQ---AVSAYKRWLDISSAT 147
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSYVNGNNQI 193
A +Y VGN +F R++F+S PD +IV K S + ++ + + + Y N +
Sbjct: 148 AYTEYVVGNTKFKRQYFASYPDHIIVVKFSTEGTDKINCTLRFTTPHISTAKYEANGNML 207
Query: 194 IMEGRCP---------------------------GKRIPPKANAND-------DPKGIQF 219
M G+ P G R KANA + +GI F
Sbjct: 208 KMMGKAPYFVQRREFEQVESVGDQYKYPELYENDGTR---KANAKNILYDSTKGGRGISF 264
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM 279
+ + KI + G + D +KVE + V++L A++S++G +PS K+ +
Sbjct: 265 ES--QAKILNLGGKLIRTGD-SIKVENASEIVVVLTAATSYNGFDKSPSKQGKNSSFLVN 321
Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
S L+SI ++ LY+ HL DY+KLF RV +L+ E +P+ +RV
Sbjct: 322 SYLKSIEKKIFTQLYSTHLTDYKKLFDRVDFELAE-----------ETEQSKLPTDQRVS 370
Query: 340 SFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
F +DPS L FQ+ RYL+I+ SRP Q NLQGIWN+ + P W+ NIN EMN
Sbjct: 371 LFSNGKDPSFPSLYFQYSRYLMIAGSRPNGQPLNLQGIWNDQIVPPWNGGYTTNINTEMN 430
Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV 459
YW + NLSEC EPLF + L++NG TA+ Y GW HH DIW +++ + +
Sbjct: 431 YWIAESTNLSECHEPLFKAIKELAVNGKNTAKFMYGNEGWTSHHNMDIW-RNAEPIDRCL 489
Query: 460 WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNP 518
+ WPMG WL +H WE Y +T D+ FL+ YP+L+G F WL+ + GYL T
Sbjct: 490 CSFWPMGAGWLTSHFWERYLHTGDKVFLKNEVYPVLKGVVEFYQGWLVKDAKTGYLITPI 549
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
SPE F+ D K A +S TMDM I+RE F+ + + L N D LV+ + + LP+
Sbjct: 550 GHSPESYFLYEDNKRATISQGPTMDMGIVREAFARYVEMCQTLGIN-DELVKNIKQQLPQ 608
Query: 579 LRPTKIAEDGSIMEW 593
L P +I + G + EW
Sbjct: 609 LLPYQIGKYGQLQEW 623
>gi|336414990|ref|ZP_08595333.1| hypothetical protein HMPREF1017_02441 [Bacteroides ovatus
3_8_47FAA]
gi|335941851|gb|EGN03702.1| hypothetical protein HMPREF1017_02441 [Bacteroides ovatus
3_8_47FAA]
Length = 815
Score = 366 bits (939), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 229/591 (38%), Positives = 320/591 (54%), Gaps = 46/591 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA +T+A+P+GN RLG MV+GG SE L+LNE+T+W G P NP A AL
Sbjct: 25 LKLWYSRPATVWTEALPLGNSRLGVMVYGGAGSEELQLNEETVWGGGPHRNDNPKALAAL 84
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R LV G+Y EA + F P + YQ +G + L+F H K + Y R+LD+
Sbjct: 85 PQIRQLVFEGRYREAQEMVAQNFETPRNGMPYQTIGSLMLDFP-GHEKATD--YYRDLDI 141
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y VG V + RE F+S D VI+ +++ ++ G+LSF S S L +
Sbjct: 142 ERAIATTRYKVGEVTYNREVFTSFVDNVIIVRLTANKQGTLSFTASYKSPLQH------- 194
Query: 191 NQIIMEGRCPGKRIPPKANANDD---PKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
E R GKR+ + P I+ E+K + G + + ++V G+
Sbjct: 195 -----EVRKSGKRLVLIGKGTEHEGVPGAIRVETQTEVK---NEGGHVVVTGENIQVNGA 246
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D L + A+++F +N D D +S S L R Y H+ YQ F+R
Sbjct: 247 DAVTLYISAATNF----VNYKDVSGDAHRKSKSYLDIARKKKYEQAREAHIAYYQNQFNR 302
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V + L T E +T RVK F +D SL L+FQ+GRYLLISSS+P
Sbjct: 303 VKLDLG---------TSEEAKRET---HLRVKHFNKGKDVSLATLMFQYGRYLLISSSQP 350
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQGIWN++L WD VNINLEMNYW S NLSE PL L LS G
Sbjct: 351 GGQPANLQGIWNDNLLAPWDGKYTVNINLEMNYWPSEVTNLSETHLPLMQMLKELSETGR 410
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+TA+ Y GWV+HH TDIW + + K W +WP GGAWLC HLW+HY +T D+ FL
Sbjct: 411 ETARTMYGCDGWVLHHNTDIW-RCTGLVDKAFWGMWPNGGAWLCQHLWQHYLFTGDKAFL 469
Query: 488 EKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMA 545
K+AYP+++G + F L +L+E G++ T PS SPEH + K A + + TMD
Sbjct: 470 -KKAYPIMKGASDFFLHFLVEHPKYGWMVTCPSNSPEHGPEGDEKKNAPSTVAGCTMDNQ 528
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSIMEWVQ 595
I+ ++FS + A ++L EDA+ K L K + RL P +I + EW++
Sbjct: 529 IVFDLFSNTLQACKILM--EDAVYAKHLQKMIDRLPPMQIGRYNQLQEWLE 577
>gi|329925668|ref|ZP_08280486.1| hypothetical protein HMPREF9412_3835 [Paenibacillus sp. HGF5]
gi|328939695|gb|EGG36038.1| hypothetical protein HMPREF9412_3835 [Paenibacillus sp. HGF5]
Length = 767
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 222/590 (37%), Positives = 315/590 (53%), Gaps = 48/590 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
I F+ PA+++ +A+PIGNGRLG MV+G V E ++ NED++W G P D NPDA L
Sbjct: 9 IWFDQPAQNWNEALPIGNGRLGGMVFGSVMQEKIQFNEDSVWYGGPRDRNNPDALLHLPL 68
Query: 75 VRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R L+ G+ EA S F G P Y GD ++ D H + YRRELDL
Sbjct: 69 IRKLLFEGRLKEAHRLSETAFSGTPRSQRPYMTAGDFCIQVD--HPQGELSHYRRELDLE 126
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---YVN 188
A Y G V FTRE F S PDQV+V ++ G+L+ + H +
Sbjct: 127 KAITVTSYQYGGVTFTREVFCSYPDQVMVIRLEADRPGALTLTSRFERQKGKHMDAVHRA 186
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE-IKISDDRGTISALEDKKLKVEGS 247
G + ++M C GK G+ +SA + I + GT+ + + L V+ +
Sbjct: 187 GTDTVVMTNDCGGK------------DGLTYSAAAKAIAVG---GTVRVV-GEHLLVDQA 230
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D V++L A+S+F +D K +E L+ N Y+ L RH+ DYQ LF R
Sbjct: 231 DEVVIILAAASTFR------ADDSKLRCNE---LLEHAANQGYAALKKRHIADYQPLFDR 281
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSR 366
V + L ++ VP+ +R++ + D+D L L F FGRYLLI+ SR
Sbjct: 282 VKLDLG---------AAADREHHLVPTPKRLERVRAGDDDAGLYTLYFHFGRYLLIACSR 332
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG+ ANLQGIWN+ ++P WDS +NIN +MNYW + CNL EC EPLF+ + + NG
Sbjct: 333 PGSLPANLQGIWNDSMAPPWDSKFTININTQMNYWPAESCNLPECHEPLFELIERMKDNG 392
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TA+ Y G+V HH TDIWA ++ W MG AWL HLWEHY + + DF
Sbjct: 393 RVTARKMYGCRGFVAHHNTDIWADTAPQDIYPPATQWVMGAAWLTLHLWEHYKFNPNPDF 452
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L +RAY ++ A F D+L+E +GYL TNPS SPE+ ++ +G+ + Y +MD I
Sbjct: 453 L-RRAYETMKEAALFFTDFLVESPEGYLVTNPSVSPENRYMLRNGESGTLCYGPSMDTQI 511
Query: 547 IREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWVQ 595
I E+FSA I A+ L+ +E A E +K RL K+ G + EW++
Sbjct: 512 ISELFSACIEASLELDTDESARREWAAIKD--RLPEMKVGRHGQLQEWLE 559
>gi|335437953|ref|ZP_08560710.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
tiamatea SARL4B]
gi|334893557|gb|EGM31768.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
tiamatea SARL4B]
Length = 784
Score = 365 bits (938), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 213/596 (35%), Positives = 325/596 (54%), Gaps = 51/596 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + +A+PIGNGRLGAM++G +E ++ N DTLW G D TNPDA + + +VR
Sbjct: 13 YDAPASAWLEAVPIGNGRLGAMLFGRPGTERVQFNADTLWAGGHEDSTNPDAREHVEEVR 72
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ G+ A A A L G P + YQ GD+ ++ A YRRELDL+
Sbjct: 73 RLLFDGEVERAQALADEHLMGDPFRLRPYQSFGDLSIDVGHD----AVTDYRRELDLSAG 128
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
RV+Y + RE+F+S PD IV +++ GS++ V LD D + G+ +
Sbjct: 129 VTRVRYDHDGTTYVREYFASAPDDAIVIRLATDSPGSVTATVGLDRERDARADARGDT-L 187
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS--------ALEDKKLKVE 245
+ G P + +G+ F A +++ D G + A L+ E
Sbjct: 188 TLRGTVVDD---PDDDRGAGGEGMAFEA--RARVTADGGDVQRVTGADAPAGSSVGLRTE 242
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+D + L ++ + DP + L ++ + Y DL H+ D+++LF
Sbjct: 243 AADAVTIALTGFTTHE---------TDDPGEACEAVLDALADRPYHDLRETHVADHRELF 293
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L P D TD E +D V + E EDP L L QFGRYLLI+SS
Sbjct: 294 DRVELDLG-DPVDRPTD----ERLDRVAAGE--------EDPHLAALYAQFGRYLLIASS 340
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGT+ ANLQG+WN++ P W+S +N+NLEMNYW +L NL+EC PL+DF+ L
Sbjct: 341 RPGTEPANLQGVWNQEFDPPWNSGYTLNVNLEMNYWPALQTNLAECAAPLYDFVDDLREP 400
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G + A+ +Y G+ +HH +D+W +++A W LWPMG AWL +++HY +T D
Sbjct: 401 GRRVAEAHYDCDGFAVHHNSDLW-RNAAPVDGARWGLWPMGAAWLSRLVFDHYAFTKDET 459
Query: 486 FLEKRAYPLLEGCASFLLDWLIE--GHDG----YLETNPSTSPEHEFIAPDGKLACVSYS 539
FL + AYP+L A+F+LD+L+E +G +L T PS SPE+ ++ DG+ A V+Y+
Sbjct: 460 FLRETAYPILREAAAFVLDFLVEHPAEEGEAEDWLVTAPSISPENAYVTDDGEEATVTYA 519
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD+ + R++F I AAE+L+ E A +++ +L RL P ++ G + EW++
Sbjct: 520 PTMDVQLTRDLFEHTIDAAEILDV-ESAFHDELRAALDRLPPMQVGAHGQLQEWIE 574
>gi|340616355|ref|YP_004734808.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339731152|emb|CAZ94416.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 791
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 220/595 (36%), Positives = 337/595 (56%), Gaps = 44/595 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
+ T LK+ ++ PA+ + +A+P+GNG LGAMV+G E ++ NEDT W G P +
Sbjct: 29 KETGGKAELKLWYDRPAEIWEEALPVGNGSLGAMVFGRPVMERIQFNEDTFWAGGPITPS 88
Query: 65 NPDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELE---FDDSHLKYA 120
P+ L +VR LV G+Y EA A K + G Y +GD+ +E DD
Sbjct: 89 KPETKSYLPEVRKLVFDGKYKEADALINKHIIGPKMMPYLPMGDVVIEMKGLDDI----- 143
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+RRELDL TA ++V +S + + RE FS+ + IV ++ S+ SL+F+++LD+
Sbjct: 144 -TDFRRELDLRTAISKVGFSSKGIAYKREVFSAVEENAIVIRLEASKEKSLNFSIALDNQ 202
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ S V N + + G P + AN + ++F + L I +D I+ D
Sbjct: 203 IGATSQVLDANNLELSGTAPDR-----ANRKSE---LRFVSRLNIGENDGHTIIN---DS 251
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ V G+ LLL A+++F N D +P + + L + S+ + +H+ +
Sbjct: 252 TITVSGASKVTLLLFAATNFK----NYKDVSGNPDFKCKTLLDLVHLKSFEQIREQHITN 307
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
+Q+LF R+ D+ T++ S +P+ ER++ FQ + DPSLV L +QFGRYL
Sbjct: 308 HQRLFERLDF-------DMPTNSNS-----GLPTNERLEKFQEETDPSLVALYYQFGRYL 355
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
L+SSSR +Q ANLQGIWN++ +P WDS NINLEMNYW + NL+EC PLF +
Sbjct: 356 LMSSSRGNSQPANLQGIWNQNPTPPWDSKYTTNINLEMNYWPAEASNLAECAIPLFTSIR 415
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+ G+ TA+ NY A GWV+HH TDIW ++ G W +WP GGAWL THLWEHY +
Sbjct: 416 QLAEAGAVTAKNNYGADGWVLHHNTDIWKTTTPLDG-AAWGIWPTGGAWLTTHLWEHYLF 474
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYS 539
+ D FL + YP+++G A F ++ L+ + GYL TNPS SPE+ + +G ++ V
Sbjct: 475 SEDEAFL-RLHYPVIKGAAEFFVNTLVAHPEYGYLVTNPSISPENRHM--EGNIS-VCAG 530
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
MD +IR++F+ I A+E+L + D E ++++ +L P KI +G + EW+
Sbjct: 531 PAMDTQLIRDLFAQCIKASEILNVDSD-FRELLVETRSKLAPDKIGSEGQLQEWL 584
>gi|159127378|gb|EDP52493.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 745
Score = 365 bits (937), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 215/586 (36%), Positives = 324/586 (55%), Gaps = 47/586 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA ++ +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA + L +R
Sbjct: 7 YQQPAGNWEEALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQNRVPHDAFECLPRLR 66
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G +AEA + F HP Y+ LG + L+F HL + YRR LD+ A
Sbjct: 67 SLIREGNHAEAEKLVRLAFFSHPISQRHYEPLGTLFLDF--GHLPECTQNYRRSLDIERA 124
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNHSYVNGNN 191
T RV+Y V+ RE +SNPD VI ++ S+ + ++ S L + + Y++
Sbjct: 125 TTRVEYEHKGVKVRREVIASNPDSVIAIRVQASQKTDFTLRLTRMSELQYETNEYLD--- 181
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ E R I P + K + +++++ ++D+ +++ + +K L V D A+
Sbjct: 182 DVTTEDRTITMHITPGGH-----KSNRACCMVKVRTAEDQDSVTQIGNKLL-VNAQD-AL 234
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+L+ A +++ D K +S+ +AL S +++ RH++DY+ L+ R+ +
Sbjct: 235 ILISAQTTY-----RCDDIDKKASSDLETALLH----STDEIWERHVNDYRSLYGRMELH 285
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS S D+ TD K + DP L+ L + RYLLIS SR G +V
Sbjct: 286 LSPSNCDMPTD----------------KRIKNSRDPGLIALYHNYCRYLLISCSRNGDKV 329
Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
A LQGIWN P W +NINL+MNYW + CNLS+C+ PLF L ++ +G +T
Sbjct: 330 LPATLQGIWNPSFHPAWGCKYTININLQMNYWPANICNLSDCEMPLFSLLERVAKSGEET 389
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
AQ Y GWV HH TDIWA +S + LWP+GGAWLC H+W+H+ +T D++FLE
Sbjct: 390 AQKMYGCRGWVAHHCTDIWADTSPGDTWMPATLWPLGGAWLCVHIWDHFRFTRDKEFLE- 448
Query: 490 RAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
R +P+L+GC FLLD+L+E G YL TNPS SPE+ F +G+ + ST+D+ I+
Sbjct: 449 RMFPILQGCVQFLLDFLVEDASGEYLVTNPSLSPENTFYEKNGERGVLCEGSTIDIQIVN 508
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
V SA + + E LE D L L +L RL P +I G + EW
Sbjct: 509 AVLSAYLKSVEELEI-VDKLAPAALDALHRLPPLRIGSFGQLQEWA 553
>gi|386819251|ref|ZP_10106467.1| hypothetical protein JoomaDRAFT_1168 [Joostella marina DSM 19592]
gi|386424357|gb|EIJ38187.1| hypothetical protein JoomaDRAFT_1168 [Joostella marina DSM 19592]
Length = 818
Score = 365 bits (936), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 215/590 (36%), Positives = 333/590 (56%), Gaps = 51/590 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA ++ +A+PIGNGR+GAM++GG + ++LNE+T+W G PG+ D + + +R
Sbjct: 27 YDEPADNWNEALPIGNGRIGAMLYGGEKVDQIQLNEETVWAGSPGNNIAKDYYQDVESIR 86
Query: 77 SLVDSGQYAEATAASVKLF----------GHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
L+ +G+Y EA ++++F G P YQ +G+I+L F + H K + +RR
Sbjct: 87 ELLFNGKYTEAQQKALEVFPKNTPDNTNYGMP---YQTVGNIKLAFKN-HNKIS--NFRR 140
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
EL++ A A+V Y V++ R++F S PDQV+ + ++S L+F++ + S H
Sbjct: 141 ELNIENAVAKVSYLADGVQYNRQYFVSYPDQVMAIHLQANKSEKLNFDIEIQSA-QKHVA 199
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
NN + ++G + + P ++FS ++ KI + +S + KL VE
Sbjct: 200 SIENNILHLKGVSETRE--------NKPGKVKFSTLIYPKIIGEGKIVS--REGKLSVEK 249
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +L + ++F +D ++ L +++N S L H++DYQ LF
Sbjct: 250 AQEVLLFISIGTNFK----KYNDLSNAEDEVALKFLNNVKNKSIEALLESHIEDYQDLFK 305
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV ++L + EN+ + + ER+K+F + D SL+ L FQFGRYLLISSSR
Sbjct: 306 RVDLKLGK------------ENLSNLTTDERLKTFSKNHDLSLISLYFQFGRYLLISSSR 353
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
G Q ANLQGIWN LSP WDS VNIN EMNYW + NLSE PLF L LS G
Sbjct: 354 EGGQPANLQGIWNNKLSPPWDSKYTVNINTEMNYWPAEVTNLSELHAPLFSMLEDLSETG 413
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A Y A GW +HH TDIW S G + WPMGGAWL HLW+H+ +T D +F
Sbjct: 414 KESAHKMYHARGWNMHHNTDIWRISGIVDGG-FYGFWPMGGAWLSQHLWQHFLFTGDINF 472
Query: 487 LEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L K+ YP+L+ A F +D L E +G+L PS SPE+++I DG V+Y +TMD
Sbjct: 473 L-KKYYPILKETALFYVDVLQKEPKNGWLVVTPSISPENKYI--DG--VGVTYGTTMDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++ +VF+ +I+AA+ L + D ++ V + +L P +I + + EW++
Sbjct: 528 LVFDVFNNVITAAKTLNIDAD-FIKVVEEKKSKLPPMQIGKHAQLQEWIE 576
>gi|409196602|ref|ZP_11225265.1| alpha-L-fucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 823
Score = 365 bits (936), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 223/596 (37%), Positives = 328/596 (55%), Gaps = 59/596 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK + +A+PIGNGRLGAMV+G E ++LNE+T W+G P NP A +AL
Sbjct: 30 LKLWYDKPAKVWNEALPIGNGRLGAMVFGDPTLENIQLNEETFWSGSPSRNDNPKAIEAL 89
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+VR+L+ G+Y EA + +L G +YQ +G++ L F+ H Y+ Y R
Sbjct: 90 PEVRNLIFEGKYHEAEKIVNENMVAEQLHG---SMYQTIGNLNLTFE-GHENYS--NYSR 143
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
ELD+ A Y+V +V F RE F+S PDQVIV K+S + SLSF +L L ++
Sbjct: 144 ELDIEKALHTTSYTVDDVNFKREIFASFPDQVIVVKLSADQPESLSFTANLIGPLAKNTK 203
Query: 187 VNGNNQIIMEG------RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + M G R GK ++F+ + +I +D G SA DK
Sbjct: 204 AVDASTLEMTGISGNHERVEGK--------------VEFNTLAKILNTD--GATSADGDK 247
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ S+ +L+ +A++ F++ D + L + + YS++ H+ D
Sbjct: 248 ITVKDASEVVILISMATN-----FVDYKTLTADENEKCRKFLTAAQTKEYSEIKEAHIRD 302
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+K F R S+ L +P P+ R+K+F DP+LV L +QFGRYL
Sbjct: 303 YRKYFTRSSLDLGTTPAS------------QRPTDVRIKNFSHTNDPALVSLYYQFGRYL 350
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSRPG Q ANLQGIWN +P WDS +NIN EMNYW + NL E EPL + +
Sbjct: 351 LISSSRPGGQPANLQGIWNNSTNPAWDSKYTININTEMNYWPAEKTNLPELHEPLIEMVK 410
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LS GS+TA+ Y +GWV HH TDIW + G W +WPMGGAWL HLW+ Y Y
Sbjct: 411 DLSEAGSQTARNMYGCNGWVTHHNTDIWRITGVVDG-AFWGMWPMGGAWLTQHLWDKYLY 469
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+ +R++L YP+++ F D+L+E +G+L NPS SPE+ AP G+ V+
Sbjct: 470 SGNREYLAS-VYPIMKSACKFYQDFLVEEPSNGWLVVNPSNSPEN---APVGR-PSVTAG 524
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+TMD I+ ++F+ AA +L ++E L+ + + RL P +I + G + EW++
Sbjct: 525 ATMDNQILFDLFTKTKKAATLLNEDE-KLINDFQRIIDRLPPMQIGQHGQLQEWME 579
>gi|375145023|ref|YP_005007464.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361059069|gb|AEV98060.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 834
Score = 365 bits (936), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 216/592 (36%), Positives = 330/592 (55%), Gaps = 50/592 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA+ + +A+P+GNG+LG MV+GG E + ++EDTLWTG P AP+ L
Sbjct: 46 LELWYQKPAEKWLEALPVGNGKLGGMVFGGPVQERISISEDTLWTGGPYQPAVEVAPETL 105
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEET--YRREL 128
+ +R L G++AEA +L G P YQ +G+++L F D ET YRR L
Sbjct: 106 ASIRKLSFEGKFAEAQELVKQLQGKPHRQAAYQTVGEVQLNFSD-----ITETSDYRRSL 160
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL-DNHSYV 187
+L A V+++ + + F+S PD VIVT+I+ + + ++ SL D +
Sbjct: 161 NLQNGVAGVQFTANGTFYKHKTFASYPDHVIVTRITAGKP--IHLTITCTSLHPDKKLTI 218
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
GNN +IM+G+ + P + + + ++I RG + D ++V G+
Sbjct: 219 AGNNTLIMDGKNGDLVVEGDGTI---PAALTWQCRVLVQI---RGGVQTAVDNGIQVIGA 272
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D ++L A++S+ + +D P + ++ SY L+ HL DYQ LF++
Sbjct: 273 DEVLILTTAATSY----VRYNDVSGKPDQLCAAVIKKCIAKSYDILFEAHLKDYQPLFNK 328
Query: 308 VSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
V ++L+ +P ++ P+ ER+K+F T DPSL L FQ+GRYLL++SSR
Sbjct: 329 VKLKLTNLAPSNL-------------PTTERIKNFATGNDPSLAALYFQYGRYLLLTSSR 375
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG+Q ANLQG WN+ LS +W VNIN EMNYW + NL+ C+ PL + + L+I G
Sbjct: 376 PGSQPANLQGRWNDSLSASWGGKYTVNINTEMNYWPAQKTNLASCELPLLELVKDLAITG 435
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TAQ Y A GWV HH TD+W +S+A + WP GGAWLC HL++HY Y+ D +
Sbjct: 436 QITAQKTYHARGWVCHHNTDLW-RSTAPIDSAFFGQWPTGGAWLCNHLYQHYLYSGDTAY 494
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS--STMD 543
L++ YPL++G A F D L+ E G+ T+PS SPE +G+ VS S TMD
Sbjct: 495 LQE-LYPLMKGSARFFFDTLVQEPKHGWYVTSPSMSPE------NGRAKGVSNSPGPTMD 547
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWV 594
M I+RE+F+ +AA VL+K+ D +K + +L P +I + G + EW+
Sbjct: 548 MQILRELFTHCATAAAVLKKDAD--FQKACNDMVFKLAPDQIGKGGQLQEWL 597
>gi|289669688|ref|ZP_06490763.1| hypothetical protein XcampmN_14597 [Xanthomonas campestris pv.
musacearum NCPPB 4381]
Length = 790
Score = 364 bits (935), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 221/595 (37%), Positives = 324/595 (54%), Gaps = 46/595 (7%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
A AL VR+L+ +G+YAEA A L P YQ LGD+ L+FD +
Sbjct: 99 GALAALPQVRALIFAGRYAEAEQLADATLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS +
Sbjct: 156 YRRQLDLDTAVAITTFRSGEAVHRREVFVSAQAQCIVVRLSCDHPGGISLRVGIDSP-QS 214
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
++ GR N GI+ +++ G +S + D+
Sbjct: 215 GDVTAEQGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVTGGKLSQVRDR- 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L++E +D VLLL A++S+ + D DP + + ++L+ +L + L HL D+
Sbjct: 262 LRIEAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRKAASLDFPALLHAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S + +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAIDLGSS------------DAAQLPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S + EC EPL +
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANAMHECVEPLESMVFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y ASGWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 426 LAKTGAHTAKAIYDASGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C
Sbjct: 485 RDRAYLSK-IYPLFKGAAEFFVATLLRDPQTGAMVTNPSMSPENQH--PFGAAVCA--GP 539
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD ++R++F+ I+ +++L + + L +++ +L P +I + G + EW Q
Sbjct: 540 TMDAQLLRDLFAQCIAMSKLLGVDAE-LAQQLATLREQLPPNRIGKAGQLQEWQQ 593
>gi|294666331|ref|ZP_06731579.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292603880|gb|EFF47283.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 830
Score = 364 bits (934), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 226/591 (38%), Positives = 321/591 (54%), Gaps = 50/591 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+PDA AL
Sbjct: 85 LQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSPDALAAL 144
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +G+YAEA A KL P YQ LGD+ L+FD + YRR+LD
Sbjct: 145 PQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISDYRRQLD 201
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA A + G RE F S Q IV ++S + G +S V +DS N
Sbjct: 202 LDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCNRPGGISLRVGIDSP-QNGEVTAE 260
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKKLKVEGS 247
++ GR N GI+ +++ G +S + D+ L++E +
Sbjct: 261 QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVSGGKLSQVRDR-LRIEAA 307
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D VLLL A++S+ + D DP + + ++L+ L + L HL D+Q+LF R
Sbjct: 308 DEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRRAAKLDFPALSRAHLADHQRLFRR 363
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V+I L S D P+ ERV+ F DP+L L Q+GRYLLI SSRP
Sbjct: 364 VAIDLGSS------DALQR------PTDERVQRFAEGNDPALAALYHQYGRYLLICSSRP 411
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L L+ G+
Sbjct: 412 GTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDLAQTGA 471
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y DR +L
Sbjct: 472 HTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGRDRAYL 530
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YPL +G A F + L+ + G + TNPS SPE++ P G C S MD +
Sbjct: 531 SK-IYPLFKGAAEFFVATLMRDPQTGAMVTNPSISPENQH--PFGAAVCAGPS--MDAQL 585
Query: 547 IREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+R++F+ I+ +++L + + + + LP P +I + G + EW Q
Sbjct: 586 LRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 633
>gi|294626600|ref|ZP_06705197.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292599020|gb|EFF43160.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 830
Score = 364 bits (934), Expect = 8e-98, Method: Compositional matrix adjust.
Identities = 226/591 (38%), Positives = 320/591 (54%), Gaps = 50/591 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+PDA AL
Sbjct: 85 LQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSPDALAAL 144
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +G+YAEA A KL P YQ LGD+ L+FD + YRR+LD
Sbjct: 145 PQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISDYRRQLD 201
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA A + G RE F S Q IV ++S G +S V +DS N
Sbjct: 202 LDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISLRVGIDSP-QNGEVTAE 260
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKKLKVEGS 247
++ GR N GI+ +++ G +S + D+ L++E +
Sbjct: 261 QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVSGGKLSQVRDR-LRIEAA 307
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D VLLL A++S+ + D DP + + ++L+ L + L HL D+Q+LF R
Sbjct: 308 DEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRRAAKLDFPALSRAHLADHQRLFRR 363
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V+I L S D P+ ERV+ F DP+L L Q+GRYLLI SSRP
Sbjct: 364 VAIDLGSS------DALQR------PTDERVQRFAEGNDPALAALYHQYGRYLLICSSRP 411
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L L+ G+
Sbjct: 412 GTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDLAKTGA 471
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y DR +L
Sbjct: 472 HTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGRDRAYL 530
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YPL +G A F + L+ + G + TNPS SPE++ P G C S MD +
Sbjct: 531 SK-IYPLFKGAAEFFVATLVRDPQTGAMVTNPSISPENQH--PFGAAVCAGPS--MDAQL 585
Query: 547 IREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+R++F+ I+ +++L + + + + LP P +I + G + EW Q
Sbjct: 586 LRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 633
>gi|192359217|ref|YP_001984046.1| alpha-L-fucosidase [Cellvibrio japonicus Ueda107]
gi|190685382|gb|ACE83060.1| alpha-L-fucosidase, putative, afc95B [Cellvibrio japonicus Ueda107]
Length = 839
Score = 364 bits (934), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 220/599 (36%), Positives = 330/599 (55%), Gaps = 44/599 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
++ S++T L++ +N PA + A+PIGNGRLGAMV+G E L+LNEDT+W G P +
Sbjct: 37 SSHSSATKQDLRLWYNTPASDWNQALPIGNGRLGAMVFGQPAQEQLQLNEDTIWAGGPNN 96
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHL 117
NP A + + V L+ GQ+ +A + + G P YQ LG++ L+F H
Sbjct: 97 NVNPAAAQTIEQVTRLLLQGQHQQAQTLADQQIRSLNNGMP---YQTLGNLRLDFA-GHG 152
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ + Y R+LDL A ARV Y V FTRE FSS DQVIV ++S S+ G ++ +
Sbjct: 153 QV--DDYYRDLDLANAIARVSYVKAGVTFTRELFSSLSDQVIVVRLSASKPGQINTRIGF 210
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
DS + + V+ + ++GR ++ D K I+F+A++ ++ RG
Sbjct: 211 DSPMQHQLSVH-ERWLQVDGRG-------GSHEGLDGK-IRFTALIAPEL---RGGTLRR 258
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
+DK L++EG+D ++ + A+++F + +D D + + + L + ++ L H
Sbjct: 259 DDKALRIEGADEVLIRIAAATNF----VRYNDLGGDSLARAQAYLSAAEGKGFAQLQQAH 314
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
+ YQ F+RVS+ L S P+ +R+ F +DP L L FQ+G
Sbjct: 315 VAAYQAQFNRVSLDLGTSAAM------------ARPTDQRIAEFAHSQDPHLAMLYFQYG 362
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSS+PGTQ ANLQGIWN SP WDS VNIN EMNYW + L E +PLF
Sbjct: 363 RYLLISSSQPGTQPANLQGIWNPHTSPPWDSKYTVNINTEMNYWPAEVTQLPELHQPLFA 422
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
L L++ G +AQ Y A GW++HH TD+W + + K + W GGAWLC H+W H
Sbjct: 423 MLEDLALTGRASAQQLYGARGWMMHHNTDLW-RITGQVDKAFYGQWQTGGAWLCQHIWYH 481
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y ++ DRDFL+ R YP+L + F +D L +E + G L PS SPE+ + G +
Sbjct: 482 YLHSGDRDFLQ-RYYPVLREASRFFVDSLTLEPNSGALVVVPSNSPENTY-ERAGYPTSI 539
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +TMD ++ ++FS I AA +L + D L ++ + RL P +I G + EW++
Sbjct: 540 SAGTTMDNQLVFDLFSITIDAAHILGVDSD-LAAQLRQKRERLAPMRIGHFGQLQEWLE 597
>gi|427384395|ref|ZP_18880900.1| hypothetical protein HMPREF9447_01933 [Bacteroides oleiciplenus YIT
12058]
gi|425727656|gb|EKU90515.1| hypothetical protein HMPREF9447_01933 [Bacteroides oleiciplenus YIT
12058]
Length = 809
Score = 364 bits (934), Expect = 9e-98, Method: Compositional matrix adjust.
Identities = 210/585 (35%), Positives = 316/585 (54%), Gaps = 41/585 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK +T+A+P+GN RLGAM++GGV +E ++LNE+T+W G P +P A L
Sbjct: 23 LKLWYSQPAKVWTEALPLGNSRLGAMLYGGVVNEQIQLNEETVWGGGPHRNDSPKALGVL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G +Q +G + LEFD H Y++ YRRELDL
Sbjct: 83 PQVRELLFTGREKEAEKMIADNFFTGQHGMPFQTIGSLMLEFD-GHADYSD--YRRELDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V+Y +G V +TR F+S D ++ +I + G++SF + ++
Sbjct: 140 EKAIASVRYKIGEVNYTRTVFTSLADNALIVRIEADKPGAVSFTTRYSTPYKEYAVKKSG 199
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++ G P A I+F +IK ++G +S D ++V+G+D A
Sbjct: 200 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVSVTNDC-IEVKGADAA 248
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + A+++F +N D + T + L Y+ + H + YQKLF RVS+
Sbjct: 249 VIYVTAATNF----VNYKDVSANETRRATEFLAKAMKRPYAQALSAHEEAYQKLFGRVSL 304
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ S K+ ++ R+K F +DP LV L+FQFGRYLLISSS+PG Q
Sbjct: 305 NVGASAKE--------------ETSYRIKHFNEGKDPGLVALMFQFGRYLLISSSQPGGQ 350
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
A LQGIWN +L WD +NIN EMNYW + NL+E EPLF + LS + TA
Sbjct: 351 PAGLQGIWNHELFAPWDGKYTININTEMNYWPAEVTNLTEMHEPLFQMVKELSESAQGTA 410
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
Y GW +HH TD+W + G +WP+GGAWL HLW+HY YT D+ FL+
Sbjct: 411 HTLYDCRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFLQT- 467
Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
AYP L+G A F LD+L+E G++ PS SPE P G ++ TMD I+ +
Sbjct: 468 AYPALKGAADFFLDFLVEHPKYGWMVCAPSMSPEQ---GPPGTGTMLTAGCTMDTQIVLD 524
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
++++SA ++L + + + + + RL P +I + + EW+
Sbjct: 525 ALTSVLSATKLLYPDHTSYCDSLQSMIKRLPPMQIGKHNQLQEWL 569
>gi|322512626|gb|ADX05719.1| putative carbohydrate-active enzyme [uncultured organism]
Length = 999
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 224/596 (37%), Positives = 326/596 (54%), Gaps = 55/596 (9%)
Query: 8 STTNPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+T NPL + +N A FT+A+PIGNG +G +++GGV + + LNE T+W+G PGD
Sbjct: 30 TTDNPLTLWYNSDAGSEFTNALPIGNGYMGGLIYGGVTKDFIGLNESTVWSGGPGDNNKQ 89
Query: 67 DAPKALSDVRSLVDSGQY--AEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY 124
A L D R + G Y AE+ + PA +Q +GD+ + S Y
Sbjct: 90 GAASHLKDARDALFRGDYRAAESIVNQYMIGPGPAS-FQPVGDLIISTSHS----GASDY 144
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
RRELDL TA A+ Y+ V+ TRE+F+S PD VIV +S +SGS+SF ++ + ++
Sbjct: 145 RRELDLKTAIAKTTYTHSGVKHTREYFASYPDHVIVVYLSADKSGSVSFGATMTTPHNSK 204
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N N +I + I+F L + + ++S + + V
Sbjct: 205 RMSNDGNTLIYDVTV---------------NSIKFQNRLTVVTDGGKASVS---NGNINV 246
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
EG++ A L+L +++F +D DP + + + + SY DL HL DYQ +
Sbjct: 247 EGANSATLILTTATNFKAY----NDVSGDPGAIAAEIMSKVAKKSYEDLLAAHLKDYQTI 302
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV + L + K S +I ++ RVK+F + DPSLVEL +Q+GRYLLI+S
Sbjct: 303 FNRVKLDLGTADK-------SAGDI----TSTRVKNFNSTNDPSLVELHYQYGRYLLIAS 351
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SR G Q ANLQGIWN+D +P W S NINLEMNYW + NL EC PL D + +
Sbjct: 352 SRKGGQPANLQGIWNKDTNPIWGSKYTTNINLEMNYWPAESGNLEECVWPLIDKIKSMVP 411
Query: 425 NGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT-M 482
G KTA+V++ + GWV HH TD+W +S+ G W LWP G WL THLWEH+ Y
Sbjct: 412 QGEKTAKVHWGVDEGWVEHHNTDLWNRSAPIDG--AWGLWPSGAGWLSTHLWEHFLYNPT 469
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI---EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D+ +L+ YP ++G A F ++ L+ E + YL T PS SPE++ G C +
Sbjct: 470 DKAYLQD-VYPTMKGAALFFVNSLVEEPETGNKYLVTAPSDSPENDH---GGYNVC--FG 523
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD IIR+V + I A+++L +ED + K+ ++ RL PTK + G I EW+Q
Sbjct: 524 PTMDNQIIRDVLNYTIEASKILGVDED-VRAKMEATVKRLPPTKTGKYGQITEWLQ 578
>gi|78047362|ref|YP_363537.1| hypothetical protein XCV1806 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035792|emb|CAJ23483.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 856
Score = 363 bits (932), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 224/596 (37%), Positives = 320/596 (53%), Gaps = 48/596 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D +P
Sbjct: 105 VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSNSP 164
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 165 DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 221
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 222 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISVRVGIDSPQTG 281
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
++ GR N GI+ +++ G +S + D +
Sbjct: 282 EVTAE-QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVRGGKLSQVRD-R 327
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 328 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLASTAACLRKAAKLDFPALLRAHLADH 383
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 384 QRLFRRVAIDLGSSAA------------TQLPTDERVQRFAEGNDPALAALYHQYGRYLL 431
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 432 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 491
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWP+GG WL LW+ ++Y
Sbjct: 492 LAQAGAHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPLGGVWLLQQLWDRWDYG 550
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 551 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 606
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L DA + L +L +L P +I + G + EW Q
Sbjct: 607 -MDAQLLRDLFAQCIAMSKLL--GIDAEFAQQLAALREQLPPNRIGKAGQLQEWQQ 659
>gi|70999286|ref|XP_754362.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66851999|gb|EAL92324.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 745
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 214/586 (36%), Positives = 323/586 (55%), Gaps = 47/586 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA ++ +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA + L +R
Sbjct: 7 YQQPAGNWEEALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQNRVPHDAFECLPRLR 66
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G +AEA + F HP Y+ LG + L+F HL + YRR LD+ A
Sbjct: 67 SLIREGNHAEAEKLVRLAFFSHPISQRHYEPLGTLFLDF--GHLPECTQNYRRSLDIERA 124
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNHSYVNGNN 191
T RV+Y V+ RE +SNPD VI ++ S+ + ++ S L + + Y++
Sbjct: 125 TTRVEYEHKGVKVRREVIASNPDSVIAIRVQASQKTDFTLRLTRMSELQYETNEYLD--- 181
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ E R I P + K + +++++ ++D+ +++ + +K L V D A+
Sbjct: 182 DVTTEDRTITMHITPGGH-----KSNRACCMVKVRTAEDQDSVTQIGNKLL-VNAQD-AL 234
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+L+ A +++ D K +S+ +AL S +++ RH++DY+ L+ R+ +
Sbjct: 235 ILISAQTTY-----RCDDIDKKASSDLETALLH----STDEIWERHVNDYRSLYGRMELH 285
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS S D+ TD K + DP L+ L + RYLLIS SR G +
Sbjct: 286 LSPSNCDMPTD----------------KRIKNSRDPGLIALYHNYCRYLLISCSRNGDKA 329
Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
A LQGIWN P W +NINL+MNYW + CNLS+C+ PLF L ++ +G +T
Sbjct: 330 LPATLQGIWNPSFHPAWGCKYTININLQMNYWPANICNLSDCEMPLFSLLERVAKSGEET 389
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
AQ Y GWV HH TDIWA +S + LWP+GGAWLC H+W+H+ +T D++FLE
Sbjct: 390 AQKMYGCRGWVAHHCTDIWADTSPGDTWMPATLWPLGGAWLCVHIWDHFRFTRDKEFLE- 448
Query: 490 RAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
R +P+L+GC FLLD+L+E G YL TNPS SPE+ F +G+ + ST+D+ I+
Sbjct: 449 RMFPILQGCVQFLLDFLVEDASGEYLVTNPSLSPENTFYEKNGERGVLCEGSTIDIQIVN 508
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
V SA + + E LE D L L +L RL P +I G + EW
Sbjct: 509 AVLSAYLKSVEELEI-VDKLAPAALDALHRLPPLRIGSFGQLQEWA 553
>gi|384098831|ref|ZP_09999943.1| hypothetical protein W5A_09224 [Imtechella halotolerans K1]
gi|383834974|gb|EID74405.1| hypothetical protein W5A_09224 [Imtechella halotolerans K1]
Length = 786
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 218/605 (36%), Positives = 333/605 (55%), Gaps = 43/605 (7%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A + ++ + ++ + PA + +A+P+GNGRLGAM++G +E ++LNED++W G P
Sbjct: 17 ANAQNSQSKERLWYKEPATKWMEALPVGNGRLGAMIFGQPINERIQLNEDSMWPGGPDWG 76
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAE 121
+ P+ L +R L+ GQY +A V F + V +Q +GD+ ++F +
Sbjct: 77 DSKGTPEDLVYIRQLLKEGQYHKADEEIVTRFSNKGVVRSHQTMGDLYIDFSTKKVA--- 133
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y RELD+ TA A Y+ +T+E F+S P V++ + + + + + ++
Sbjct: 134 -NYYRELDIETAVATTSYNSEGYNYTQEVFASAPHNVLIIRYTTTNPKGMDATLRMNRPK 192
Query: 182 D---NHSYVN--GNNQIIMEGRCP--GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D N V+ NQI M+G G R+ +A D G++F L +K + G I
Sbjct: 193 DEGFNTVQVSSPAPNQIQMKGMVTQNGGRLNSEAKPLD--YGVKFDTRLVVK---NNGGI 247
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+D L+++ + AVLLLV S+SF + S + L ++ LSY+++
Sbjct: 248 VVSKDGILELKNVNEAVLLLVGSTSFY--------HGNNYESYNEQLLGQVQELSYNEML 299
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELL 353
+ H+ DYQ L+ RV++ L + + +P+ ER+K + D +L LL
Sbjct: 300 SAHVADYQSLYKRVTLDLGGN------------EFNKIPTDERLKKIKDGGTDKALSALL 347
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQ+GRYLLISSSRPGT ANLQGIWNE + W++ H+N+NL+MNYW + NLSEC
Sbjct: 348 FQYGRYLLISSSRPGTNPANLQGIWNEHIRAPWNADYHLNVNLQMNYWPAEVTNLSECHS 407
Query: 414 PLFDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PLFD+ L G TA+ Y + G VIHH +DIWA + + W W GG WL
Sbjct: 408 PLFDYTDRLINRGRITAKDQYGIHRGAVIHHTSDIWAPAWMHAERAYWGAWIHGGGWLAQ 467
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYL-ETNPSTSPEHEFIAPDG 531
H WEHY+YT D DFL+ RA+P ++ A F LDWLI D ++P TSPE+ ++APDG
Sbjct: 468 HYWEHYSYTNDIDFLKNRAWPAMKALAEFYLDWLIYDQDSKTWVSSPETSPENSYMAPDG 527
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSI 590
A VS+ + M II EVF+ + AA +L+ N+D V++V L ++ P + DG I
Sbjct: 528 TPAAVSHGAAMGHQIIGEVFNNTLKAASILKINDD-FVQEVKSKLKKIHPGVVLGPDGRI 586
Query: 591 MEWVQ 595
+EW +
Sbjct: 587 LEWTK 591
>gi|289664854|ref|ZP_06486435.1| hypothetical protein XcampvN_17740 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 792
Score = 363 bits (932), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 221/589 (37%), Positives = 323/589 (54%), Gaps = 46/589 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P A AL
Sbjct: 47 LQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSPGALAAL 106
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +G+YAEA A L P YQ LGD+ L+FD + YRR+LD
Sbjct: 107 PQVRALIFAGRYAEAEQLADATLLSRPLKQMPYQPLGDLLLDFDRAD---GISDYRRQLD 163
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA A + G RE F S Q IV ++S G +S V +DS +
Sbjct: 164 LDTAVAITTFRSGEAVHRREVFVSAQAQCIVVRLSCDHPGGISLRVGIDSP-QSGDVTAE 222
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKKLKVEGS 247
++ GR N GI+ +++ G +S + D+ L++E +
Sbjct: 223 QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVTGGKLSQVRDR-LRIEAA 269
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D VLLL A++S+ + D DP + + ++L+ +L + L HL D+Q+LF R
Sbjct: 270 DEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRKAASLDFPALLHAHLADHQRLFRR 325
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V+I L S + +P+ ERV+ F DP+L L Q+GRYLLI SSRP
Sbjct: 326 VAIDLGSS------------DAAQLPTDERVQRFAEGNDPALAALYHQYGRYLLICSSRP 373
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ ANLQGIWN+ + P W+S +NIN EMNYW S + EC EPL + L+ G+
Sbjct: 374 GTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANAMHECVEPLESMVFDLAKTGA 433
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y ASGWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y DR +L
Sbjct: 434 HTAKAIYDASGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDYGRDRAYL 492
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YPL +G A F + L+ + G + TNPS SPE++ P G C TMD +
Sbjct: 493 SK-IYPLFKGAAEFFVATLLRDPQTGAMVTNPSMSPENQH--PFGAAVCA--GPTMDAQL 547
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+R++F+ I+ +++L + + L +++ +L P +I + G + EW Q
Sbjct: 548 LRDLFAQCIAMSKLLGVDAE-LAQQLATLREQLPPNRIGKAGQLQEWQQ 595
>gi|380694480|ref|ZP_09859339.1| alpha-L-fucosidase [Bacteroides faecis MAJ27]
Length = 804
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 217/620 (35%), Positives = 327/620 (52%), Gaps = 54/620 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
T+ N L + + PAK + +A+P+GNGRLGAM++G E ++ NE+TL++G P N
Sbjct: 10 GTNAQNHLTLWYKSPAKAWEEALPVGNGRLGAMIFGDTQKERIQFNENTLYSGEPETPKN 69
Query: 66 PDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY 124
+ L+ +R L+ G+ AEA T K G + YQ GD+ ++FD K A Y
Sbjct: 70 INIVPDLAHIRQLLGEGKNAEAGTIMQEKWIGRLNEAYQPFGDLYIDFDS---KEAVTDY 126
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
LD+ A Y V+ +RE F+S P Q IV + S+ L+F L S +
Sbjct: 127 MHSLDMENAVVTTSYKQNGVDISREVFASYPAQAIVIHLKSSKP-VLNFTAYLAS--PHP 183
Query: 185 SYVNGNNQII-MEGRCPG---------------KRIPPK--------------ANAND-D 213
++Q++ ++G+ P +R+ P+ N+ D
Sbjct: 184 VTKESDSQVVYLKGQAPAHAQRRDTDHMKRFNTQRLHPEYFDASGHIIQKKQVIYGNEMD 243
Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
KG F A L + +G ++ D ++ L+L A++S++GP +PS K+
Sbjct: 244 GKGTFFEACL---LPTHKGGQLSISDNQITARNCSEVTLMLYAATSYNGPRKSPSKEGKN 300
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P M+ + +Y +L +H DYQ LF+RVS L + + +P
Sbjct: 301 PHQAIMNYRRISEGETYKELKRQHTTDYQALFNRVSFDLPANKQQ-----------KELP 349
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ ER+K F+ +ED +L+ LFQFGRYL+I+ SR Q NLQG+WN+ + P W+S +N
Sbjct: 350 TDERLKRFKDEEDQALIAQLFQFGRYLMIAGSRGEGQPLNLQGLWNDQILPPWNSGYTLN 409
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
INLEMNYW + NLSEC +PLF + ++ G A+ Y +GW IHH IW ++
Sbjct: 410 INLEMNYWPAEVTNLSECHQPLFKLIEEIADKGKDLARDMYGLNGWAIHHNISIWREAYP 469
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
G V W W M G WLC HLWEHY +T D +FL K+ YP+L+G A+F +WL++ G
Sbjct: 470 SDGFVYWFFWNMSGPWLCNHLWEHYLFTKDANFL-KKYYPILKGAATFCSEWLVKNSKGE 528
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
L T STSPE+ ++ D A V STMD+AIIR +FS I AAE+L+ + D E ++
Sbjct: 529 LVTPVSTSPENAYLMGDHTPASVCEGSTMDIAIIRSLFSNTIQAAEILQTDMDFRSE-LI 587
Query: 574 KSLPRLRPTKIAEDGSIMEW 593
K +L+ +I G ++EW
Sbjct: 588 KKRNKLKKYQIGSKGQLLEW 607
>gi|313203234|ref|YP_004041891.1| alpha-L-fucosidase [Paludibacter propionicigenes WB4]
gi|312442550|gb|ADQ78906.1| Alpha-L-fucosidase [Paludibacter propionicigenes WB4]
Length = 822
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 214/583 (36%), Positives = 317/583 (54%), Gaps = 48/583 (8%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
+A+PIGNG LGAMV+G V E ++LNE TLW+G P D NP A +ALS +R+ + G+Y
Sbjct: 55 NALPIGNGFLGAMVYGNVNQELIQLNEKTLWSGSPDDNNNPQAAEALSQIRNFLFEGKYK 114
Query: 86 EATAASVK-------------LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
EA + K P YQ LG++ +F + E Y RELDLN
Sbjct: 115 EANELTNKTQICKGVGSGTGSGTNVPYGSYQTLGNLFFDFGKTA---PFENYVRELDLNR 171
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V YS V + RE F+S PD+ ++ ++ + G+LSF L + V N+
Sbjct: 172 GVVTVSYSQNGVRYKREIFASYPDRALIIHLTADKKGALSFTTELTRPERFETRVE-NDH 230
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
++M G + G++++A L+ + RG ++ +++VEG+D ++
Sbjct: 231 LLMTGALTNGQ---------GGDGMKYAARLK---ATTRGGKLNYKNNEIRVEGADEVIM 278
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+L AS+++ + PS DP + + L + Y L H DY LF +VS+ L
Sbjct: 279 ILTASTNYKQEY--PSFVGDDPRLTTQNQLSKASSKPYPTLLKNHTVDYAALFGKVSLNL 336
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKS-FQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
S + + DT+P+ R+++ + +D L E+ FQFGRYLLISSSR G+
Sbjct: 337 S------------DNDPDTIPTDRRLRNQTKNPDDLHLQEVYFQFGRYLLISSSREGSLP 384
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIW + W+ H NIN++MNYW + NLSEC PL + L G +A
Sbjct: 385 ANLQGIWCNKIQAPWNCDYHSNINVQMNYWGADIVNLSECFSPLSRLIESLVKPGEISAA 444
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
V Y ASGW + T++W +S G + W L+ GG WLC HLW+HY +T+DR++L+ R
Sbjct: 445 VQYNASGWCVQPITNVWGYTSPGEG-INWGLYVAGGGWLCRHLWDHYTFTLDRNYLQ-RV 502
Query: 492 YPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
YP++ A F LDWL+ + G L + PSTSPE+ FIAPDG + + D II E+
Sbjct: 503 YPVMLNAARFYLDWLVTDPKTGKLVSGPSTSPENSFIAPDGSRGSICMGPSHDQEIIHEL 562
Query: 551 FSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
F+ +++A++VL KN D L+ K+ +L L KI DG +MEW
Sbjct: 563 FTNVLTASKVL-KNTDPLLAKIDIALRNLATPKIGSDGRLMEW 604
>gi|146301819|ref|YP_001196410.1| hypothetical protein Fjoh_4083 [Flavobacterium johnsoniae UW101]
gi|146156237|gb|ABQ07091.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
[Flavobacterium johnsoniae UW101]
Length = 816
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 220/585 (37%), Positives = 323/585 (55%), Gaps = 38/585 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA + +A+P+GNGRLGAMV+G E L+LNE+T+W G P + + KAL
Sbjct: 25 LKLWYDKPASIWNEALPLGNGRLGAMVFGDPAVERLQLNEETIWAGSPNGNAHNKSIKAL 84
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELD 129
VR L+ G++ EA A+ + D YQ G + + F H KYA+ Y R+LD
Sbjct: 85 PIVRQLIFDGKFDEAQDLATQDIMSQTNDGMPYQTFGSVYISFA-GHQKYAD--YYRDLD 141
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
++ ATA+VKY V VEFTRE ++ DQVIV K+S S+ G ++ NV ++S +D
Sbjct: 142 ISNATAKVKYKVNGVEFTREILTAFSDQVIVVKLSASQPGQITCNVFMNSPIDKTVASTE 201
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
NQII+ G N ++F L K + G I A + L + +D
Sbjct: 202 GNQIILSGVG--------TNFEGVKGKVKFQGRLTAK--NKGGEIDA-SNGVLSINKADE 250
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
L + +++F N D D ++S L + + H+D YQK F+RVS
Sbjct: 251 VTLYISIATNFK----NYQDISGDEIAKSKDYLAKAEVKDFETIKKAHVDYYQKFFNRVS 306
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + D+V P+ ER++ F DP L L FQFGRYLLISSS+PG
Sbjct: 307 LNLGSN--DLVKK----------PTNERIRDFSKQFDPQLASLYFQFGRYLLISSSQPGG 354
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ ++P WDS NIN EMNYW + NL E EP L++ G++T
Sbjct: 355 QPANLQGIWNDMVTPPWDSKYTTNINAEMNYWPAQVTNLQEMHEPFVQMAKELAVTGAET 414
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y ASGWV+HH TDIW + +A +WP GGAW+C LWE Y YT D+ +L +
Sbjct: 415 AKTMYNASGWVLHHNTDIW-RVTAPVDSAASGMWPTGGAWVCQDLWERYLYTGDKKYLVE 473
Query: 490 RAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
YP+++G A F LD++ I+ + YL PS+SPE+ GK A ++ +TMD ++
Sbjct: 474 -IYPIMKGAADFFLDFMVIDPNTKYLVVVPSSSPENTHAGGTGK-ATIASGTTMDNQLVF 531
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++F+ +I A+ ++ + A +KV +L ++ P KI + + EW
Sbjct: 532 DLFTHVIEASALVSPDV-AYAKKVSDALAKMPPMKIGKYNQLQEW 575
>gi|325105420|ref|YP_004275074.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324974268|gb|ADY53252.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 768
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 218/586 (37%), Positives = 318/586 (54%), Gaps = 45/586 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA + +A+P+GNG LGAM++G +E L+LNE ++W G D+ NP A +L
Sbjct: 28 LKLWYNKPALDWNEALPVGNGSLGAMIFGNTFNEVLQLNESSVWAGKDEDFVNPRAKASL 87
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +Y EA A L G YQ LG++ L+F S+ + Y REL+
Sbjct: 88 KKVRNLLFQEKYTEAQDLADSSLMGDKKIWSSYQELGNLRLDFKKSNRSVS--NYNRELN 145
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
+ A A ++V F RE FSS + K+S +++ +S + +D +
Sbjct: 146 IENAIATTTFNVDGTLFEREVFSSAVANTVFIKLSSNKTKQISLTIGMDRAGNLAKISAS 205
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++QI + ++ G+ +I I R ++S + K+ VE +D
Sbjct: 206 DHQIYLTEHV------------NNGVGVILHSIANIANKGGRLSVS---NNKIIVENADE 250
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
V+ L A+++F+ NP ++ K SES++ +Y H+ DYQ+ F+RV
Sbjct: 251 VVITLAAATNFN--HTNPLETVKSRISESLAK-------AYQQHKEEHIKDYQQYFNRVK 301
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPG 368
+ L + N P+ R+ + + DPSL+ L +Q+GRYLLISSSRPG
Sbjct: 302 LNLGNN------------NSSLFPTDARLSALKNGNFDPSLITLFYQYGRYLLISSSRPG 349
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIW E L W+ H+NIN +MNYW + NLSE P D+LT L +G K
Sbjct: 350 GLPANLQGIWAEGLQVPWNGDYHININAQMNYWLAENTNLSEMHMPFLDYLTNLGKDGKK 409
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y SG V H +DI+ + GK WA+WP G AW H WEHY YT D+ FLE
Sbjct: 410 TAKDMYGLSGEVAHFASDIFYYTEP-WGKPKWAMWPTGLAWCSQHAWEHYLYTQDKAFLE 468
Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
K+ Y +L+ + F LDWL++ G L + PS SPE+ F PDGK+A V MD II
Sbjct: 469 KQGYEILKQSSIFFLDWLVKNPKTGLLVSGPSISPENTFKTPDGKIATVIMGPAMDHMII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
RE+F ISAA++L K++ LV K+ K+L +L PT+I DG I+EW
Sbjct: 529 RELFGNTISAAQILGKDKK-LVTKLQKALKQLTPTQIGSDGRILEW 573
>gi|189462578|ref|ZP_03011363.1| hypothetical protein BACCOP_03268 [Bacteroides coprocola DSM 17136]
gi|189430739|gb|EDU99723.1| intein C-terminal splicing region [Bacteroides coprocola DSM 17136]
Length = 866
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 220/590 (37%), Positives = 324/590 (54%), Gaps = 42/590 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK + +A+P+GN +GAMV+GG E L+LNE+TLW G P NP A ++L
Sbjct: 68 LKLWYQQPAKTWVEALPVGNSSMGAMVYGGTSREELQLNEETLWGGGPYRNDNPKALESL 127
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++VR+L+ SG+ +A + F G YQ +G + +E H K + Y R+L+L
Sbjct: 128 AEVRNLIFSGKTMDAQNLIDQTFYTGRNGMPYQTIGSLIIE-APGHEK--AKNYYRDLNL 184
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+VI+ + + + G L+F VS DS L + G
Sbjct: 185 ERAVATTRYQVDGVNFQREVFASFPDRVIIVRFTTDKPGELNFKVSYDSPLQSTVRKQGK 244
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD---RGTISALEDKKLKVEGS 247
++++ G+ D +G++ ++E++ G +L DK + VE +
Sbjct: 245 -KLVLRGK------------GGDHEGVK--GVIEVETQSQVIAEGGKVSLTDKYISVEHA 289
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
A L + A+++F +N + K + + ++ + L YS+ H D YQ F+R
Sbjct: 290 TAATLYIAAATNF----VNYHNVKGNESKKASALLAGAMKKEYSEALKAHTDYYQSQFNR 345
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
VS+ L T T +E + +R+ F DP+L L+FQ+GRYLLISSS+P
Sbjct: 346 VSLSLGGEN----TKTARQETV------KRIAGFSQGNDPALAALMFQYGRYLLISSSQP 395
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQGIWN L+ WD +NIN EMNYW + NLSE EPLF + LS+ G
Sbjct: 396 GGQPANLQGIWNHQLNAPWDGKYTININTEMNYWPAEVTNLSETHEPLFGLVQDLSVTGR 455
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+TA+ Y +GWV HH TDIW + + K + WP+GGAWL THLW+HY YT D+DFL
Sbjct: 456 ETARTMYGCNGWVAHHNTDIW-RVTGPVDKAFYGTWPVGGAWLTTHLWQHYLYTGDKDFL 514
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMA 545
K +YP ++G A F L ++I G+ T PS SPEH D K A S TMD
Sbjct: 515 RK-SYPAMKGAADFFLGYMIPHPKYGWKVTAPSMSPEHGPKGEDTKKASTIVSGCTMDNQ 573
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
II +V S ++A+E+LE + A + + L + P +I + EW++
Sbjct: 574 IIFDVLSNTLAASEILELSA-AYRDSLRTLLSEMAPMQIGRYNQLQEWLE 622
>gi|300785873|ref|YP_003766164.1| large protein [Amycolatopsis mediterranei U32]
gi|384149183|ref|YP_005531999.1| large protein [Amycolatopsis mediterranei S699]
gi|399537756|ref|YP_006550418.1| large protein [Amycolatopsis mediterranei S699]
gi|299795387|gb|ADJ45762.1| large secreted protein [Amycolatopsis mediterranei U32]
gi|340527337|gb|AEK42542.1| large protein [Amycolatopsis mediterranei S699]
gi|398318526|gb|AFO77473.1| large protein [Amycolatopsis mediterranei S699]
Length = 949
Score = 362 bits (929), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 231/591 (39%), Positives = 319/591 (53%), Gaps = 46/591 (7%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N L + ++ PA + A+PIGNGRLGAMV G +E L+LNEDT+W G P DY+N
Sbjct: 39 NDLALWYDKPAGTEWLRALPIGNGRLGAMVSGNTDTERLQLNEDTVWAGGPHDYSNAQGA 98
Query: 70 KALSDVRSLVDSGQYAEATA-ASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETYRR 126
ALS +R LV + Q+ +A + K+ G PA YQ +G + L + +Y+R
Sbjct: 99 GALSQIRQLVFANQWTQAQSLIDQKMLGTPAAQQPYQPVGTLSLALPGNS---GVSSYQR 155
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TAT V Y NV + RE F+S DQVIV +++ GS+SF+ SL + +
Sbjct: 156 WLDLTTATTVVTYVANNVRYRREVFASAADQVIVLRLTAETPGSISFSASLGTPQRATTS 215
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA-ILEIKISDDRGTISALEDKKLKVE 245
I ++G + D +GI S L + + G ++ L+V
Sbjct: 216 SPNGTTIALDG------------ISGDSRGIAGSVRFLALAGATAEGGSTSSSGGTLRVS 263
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+D LL+ +S+ ++ D + S L + + L + L RHL DYQKLF
Sbjct: 264 GADAVTLLISIGTSY----VDYRTVNGDYQGIARSRLAAAQALPHDTLRGRHLADYQKLF 319
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R ++ L R T + + P+ R+ + DP LLFQFGRYLLISSS
Sbjct: 320 GRTTLDLGR--------TAAADQ----PTDVRIAQHNSVNDPQFAALLFQFGRYLLISSS 367
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGTQ ANLQGIWN+ L+P+W+S +N NL MNYW + NL+EC EP+F + L++
Sbjct: 368 RPGTQPANLQGIWNDQLNPSWESKYTLNANLPMNYWPADVTNLAECYEPVFAMIGDLAVT 427
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TAQV Y A GWV HH TD W SS D + +W GGAWL T +W+HY +T D
Sbjct: 428 GARTAQVEYGARGWVTHHNTDGWRGSSIVDFAQA--GMWQTGGAWLATMIWDHYRFTGDV 485
Query: 485 DFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
+FL R YPLL+G A F LD L+ E GYL TNP+ SPE A A V TMD
Sbjct: 486 EFLRAR-YPLLKGAAQFFLDTLVTEPSLGYLVTNPANSPELNHHAN----ASVCAGPTMD 540
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
M I+R++F A +VL + ++V + RL P K+ G+I EW+
Sbjct: 541 MQILRDLFDGCAGACQVLGVDA-TFADQVTAARQRLAPMKVGSRGNIQEWL 590
>gi|395776471|ref|ZP_10456986.1| large protein [Streptomyces acidiscabies 84-104]
Length = 802
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 226/577 (39%), Positives = 316/577 (54%), Gaps = 49/577 (8%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+P+GNGRLGAMV+G +E L+LNEDTLW G P +Y NP AL +R LV + Q+ +
Sbjct: 46 ALPVGNGRLGAMVFGNTDTERLQLNEDTLWAGGPHNYDNPRGAAALGRIRQLVFADQWGQ 105
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G PA YQ +GD+ L F A Y R LDL TAT V Y+ N
Sbjct: 106 AQDLINQTMLGDPAAQLAYQPVGDLRLTFPAGS---AVSAYERLLDLTTATTAVTYTANN 162
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+S PDQVIV +++ GS++F+ + S I ++G
Sbjct: 163 VSYRREVFASAPDQVIVMRLTARTPGSITFSATFASPQRTSLSSPDGTTIALDG------ 216
Query: 204 IPPKANANDDPKGI----QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSS 259
+ D +GI +F A+ K + G++++ L+V G+D LL+ +S
Sbjct: 217 ------VSGDMRGIAGTVRFLAL--AKAVAEGGSVTS-SGGTLRVTGADSVTLLVSIGTS 267
Query: 260 FDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
+ ++ D + + L + + ++Y L RH+ DYQ LF RVS+ + R+P
Sbjct: 268 Y----VDYRTVDGDYQGIARTHLDAAQGVAYDTLRARHVADYQALFGRVSLDVGRTP--- 320
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+++ P+ R+ + +DP LLFQ+GRYLLISSSRPGTQ ANLQGIWN
Sbjct: 321 ----AADQ-----PTDVRIAQHGSADDPQFSALLFQYGRYLLISSSRPGTQPANLQGIWN 371
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
+ L+P+WDS +N NL MNYW + NL+EC P+F + L+ G++TAQ Y A GW
Sbjct: 372 DQLTPSWDSKYTINANLPMNYWPADTTNLAECLAPVFAMIDDLTATGARTAQAQYGARGW 431
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
V HH TD W +S G VW +W GGAWL + +W+HY +T D +FL +R YP L+G A
Sbjct: 432 VTHHNTDAWRGTSVVDG-AVWGMWQTGGAWLASLIWDHYRFTGDVEFL-RRNYPALKGAA 489
Query: 500 SFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
F LD L+ G+L TNPS SPE PD V TMDM I+R +F SA+
Sbjct: 490 RFFLDTLVPHPGLGHLVTNPSNSPELTH-HPD---VSVCAGPTMDMQILRSLFDGCASAS 545
Query: 559 EVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
EVL + A +V + RL P KI G+I EW+
Sbjct: 546 EVLGVDA-AFRAQVRSARRRLAPMKIGSRGNIQEWLH 581
>gi|372220893|ref|ZP_09499314.1| alpha-L-fucosidase [Mesoflavibacter zeaxanthinifaciens S86]
Length = 805
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 203/592 (34%), Positives = 338/592 (57%), Gaps = 32/592 (5%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKAL 72
+I F+ PA +F + + +GNG++GA ++GG+ +E + LN+ TLW+G P ++ N P+A K L
Sbjct: 33 EIWFDKPATYFEETLVLGNGKMGASIFGGIQTEKIFLNDITLWSGEPMNHNNNPEAYKNL 92
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
++R+ + + Y A + + KL G + Y LG + L F + + Y+R LDL T
Sbjct: 93 PEIRAALKAENYKLADSLNKKLQGQFSQSYAPLGTLWLHFKN---ETNITNYKRSLDLTT 149
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A V Y V++ RE+F SNP +V+V +++ ++SF++ +S L +++
Sbjct: 150 AIADVSYESNGVKYKREYFISNPKKVMVVRLTSDRKKAISFDLKFESQL-RFKIKELDSK 208
Query: 193 IIMEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+I G P P + +P KG +F++ IK +D GT+ ++D L V+
Sbjct: 209 LIATGYAPVHVEPSYRGSIKNPIVFDADKGTRFTSAFSIKQTD--GTVK-IQDSVLSVQN 265
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ LL+ ++SF+G NP+ + + ++ ++S + +Y++L H+ DY +L++
Sbjct: 266 ATEVELLVAVATSFNGFDKNPATEGLNHENIALEQIKSSKKETYANLKKEHVADYSELYN 325
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYLLISSS 365
RV +LS + + VP+ +R+ ++T + +E+L F +GRYLLI+SS
Sbjct: 326 RVDFKLSH------------KELPNVPTDQRLLRYETGANDQNLEILYFNYGRYLLIASS 373
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R ANLQG+WN + P W S +NINL+ NYW + NLSE +PL F+ LS
Sbjct: 374 RTKEVPANLQGLWNPHIRPPWSSNYTININLQENYWLAETANLSELHQPLLSFIGNLSKT 433
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G+ TA+ Y +GW H +DIWA ++ +G WA W MGG WL +HLWEHY YT
Sbjct: 434 GAITAKTYYGTNGWAAGHNSDIWALTNPVGDFGQGNPNWANWNMGGVWLTSHLWEHYLYT 493
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D +L++ AYP+++G A+F +WLI+ G ++PSTSPE+ + P+G + Y +T
Sbjct: 494 KDTTYLKEYAYPIIKGAATFASEWLIKDQHGQFISSPSTSPENLYKTPEGYVGATLYGAT 553
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
DMA+I+E+F + ++A++ L +D K+ +L L P KI + G++ EW
Sbjct: 554 ADMAMIKELFYSYLNASKTLAIQDD-FTRKIKFNLENLSPYKIGQKGNLQEW 604
>gi|410096950|ref|ZP_11291934.1| hypothetical protein HMPREF1076_01112 [Parabacteroides goldsteinii
CL02T12C30]
gi|409224744|gb|EKN17668.1| hypothetical protein HMPREF1076_01112 [Parabacteroides goldsteinii
CL02T12C30]
Length = 804
Score = 362 bits (928), Expect = 4e-97, Method: Compositional matrix adjust.
Identities = 220/606 (36%), Positives = 327/606 (53%), Gaps = 51/606 (8%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A++T+T NP K ++ A+ + A+P+GNG LGAMV+G V E ++LNE+T+W+G D
Sbjct: 39 ADATATDNPNK-GYDDDAE-WLKALPLGNGSLGAMVFGDVHKERIQLNEETMWSGSIQDS 96
Query: 64 TNPDAPKALSDVRSLVDSGQYAEAT-------AASVKLFGH------PADVYQLLGDIEL 110
NP+A K + +++ L+ G+Y EAT + K GH P YQ +GD+ +
Sbjct: 97 DNPEAAKHIEEIKQLLFDGKYKEATDLTNRTQICTGKGSGHGQGSNAPFGCYQTMGDLWI 156
Query: 111 EFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS 170
+FD+ K YRREL+L+ ATAR+ Y G+V F RE F S+PDQ +V +IS +
Sbjct: 157 DFDN---KSPYTDYRRELNLDDATARISYKQGDVNFKREIFISHPDQSMVMRISADKKQQ 213
Query: 171 LSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
LSF ++ + +S N Q+IM G +D G + +K
Sbjct: 214 LSFTCRMNRP-ERYSTYTENEQLIMAGAL-----------SDGKGGDGLQYMTRLKAVPM 261
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
G+++ D L V+ +D +L L AS+ + + P +D +S + ++L N SY
Sbjct: 262 NGSVT-YSDSTLTVKDADEVLLFLTASTDYKLEY--PIYKGRDFSSITEASLNKAINKSY 318
Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSL 349
+ LY H+ +Y F R ++QL+ +P DT+P+ +V + + DP L
Sbjct: 319 NQLYETHVKEYTDYFQRANLQLTNTP-------------DTIPTDIKVMNARKGMIDPHL 365
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
E +FQ+GRYLLISSSRPGT ANLQGIW L W+ H ++N+EMNYW + NLS
Sbjct: 366 YEQMFQYGRYLLISSSRPGTMPANLQGIWANKLQTAWNGDYHTDVNIEMNYWPAEVTNLS 425
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
E P+FD + L GSKTAQ+ Y GWV+H T++W +S W + AW
Sbjct: 426 EMHLPMFDLIASLVEPGSKTAQIQYNKKGWVVHPITNVWGYTSPGEA-ASWGMHTGAPAW 484
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIA 528
+C H+ EHY +T D+DFL ++ YP+L+G F +DWL E L + P+ SPE+ F+A
Sbjct: 485 ICQHIGEHYRFTGDKDFL-RKTYPVLKGAIEFYMDWLTENPKTKELVSGPAVSPENTFVA 543
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
PDG + +S D I ++F + L ++D +V + RL TKI DG
Sbjct: 544 PDGSHSQISMGPAHDQQTIWQLFDDFAMISSELSIDDD-FTRQVADAKDRLADTKIGSDG 602
Query: 589 SIMEWV 594
IMEW
Sbjct: 603 RIMEWA 608
>gi|390989152|ref|ZP_10259452.1| hypothetical protein XAPC_114 [Xanthomonas axonopodis pv. punicae
str. LMG 859]
gi|372556186|emb|CCF66427.1| hypothetical protein XAPC_114 [Xanthomonas axonopodis pv. punicae
str. LMG 859]
Length = 790
Score = 361 bits (927), Expect = 5e-97, Method: Compositional matrix adjust.
Identities = 224/597 (37%), Positives = 321/597 (53%), Gaps = 50/597 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
++ GR N GI+ +++ G +S + D+
Sbjct: 216 EITAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ NL + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAANLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I D S E + +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTNERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 426 LAQTGTHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L + + + + LP P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 593
>gi|408671718|ref|YP_006875526.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
gi|387857567|gb|AFK05662.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
Length = 818
Score = 361 bits (927), Expect = 6e-97, Method: Compositional matrix adjust.
Identities = 218/593 (36%), Positives = 326/593 (54%), Gaps = 48/593 (8%)
Query: 12 PLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PLK+ + P+ + + +A+PIGNGRLGAM++G V E ++LNE T+W+G P NP A +
Sbjct: 22 PLKLWYKQPSGNTWENAMPIGNGRLGAMIYGNVEQEIIQLNEHTVWSGSPNRNDNPLALE 81
Query: 71 ALSDVRSLVDSGQYAEA----TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
L+++R L+ G + EA A + H ++ +G++ L F + Y R
Sbjct: 82 KLAEIRKLIFEGNHKEAEKLANQAIISKTSH-GQKFEPVGNLNLVFAGQE---NYKNYYR 137
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
ELD+ A ++ Y VG+V +TRE F+S D+VI+ KIS +++G++SFN ++ S +
Sbjct: 138 ELDIERAISKTTYQVGDVTYTREAFASLADRVIIMKISANKAGNVSFNANISSPQKRKTI 197
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDP--KG-IQFSAILEIKISDDRGTISALEDKKLK 243
P K + +D KG + F I IK+ + G++ + D L
Sbjct: 198 AT----------TPNKDLTLSGITSDHETVKGMVAFKGISRIKL--EGGSLQS-TDTSLV 244
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+G++ A++ + +++F+ N D D + L + +Y+ L + H+ YQK
Sbjct: 245 VKGANSAIIFISIATNFN----NYQDLSGDENKRANDYLNNAFAKTYTTLLSSHILAYQK 300
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF+RV I L E + +P+ ER+++F+ DP +V L +QFGRYLLIS
Sbjct: 301 LFNRVKIDLG------------ETDAAKLPTDERLRNFRNINDPQMVALYYQFGRYLLIS 348
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN ++P WDS +NIN EMNYW + NLSE EP + LS
Sbjct: 349 SSQPGGQPANLQGIWNNRINPPWDSKYTININAEMNYWPAEKTNLSELHEPFLKMVKELS 408
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
I G KTA+ Y A GW+ HH TDIW + A G W +W GG W+ HLWEHY YT D
Sbjct: 409 ITGQKTAKDMYGARGWMAHHNTDIWRATGAIDG-AFWGMWTAGGGWVSQHLWEHYLYTGD 467
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
+ FL AYP L G A F D+L+ + +L NP SPE+ A DG + + T
Sbjct: 468 KAFLAS-AYPALRGAAQFYADFLVPHPNKNNWLVVNPGNSPENAPAAHDG--SSLDAGVT 524
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
MD I+ +VF+ ISAAE+L+ + + V+ + K +L P I + + EW+
Sbjct: 525 MDNQIVFDVFNKAISAAEILKIDAN-FVDSLKKLRAKLPPMHIGQHNQLQEWL 576
>gi|333381508|ref|ZP_08473190.1| hypothetical protein HMPREF9455_01356 [Dysgonomonas gadei ATCC
BAA-286]
gi|332830478|gb|EGK03106.1| hypothetical protein HMPREF9455_01356 [Dysgonomonas gadei ATCC
BAA-286]
Length = 813
Score = 361 bits (926), Expect = 7e-97, Method: Compositional matrix adjust.
Identities = 217/588 (36%), Positives = 332/588 (56%), Gaps = 44/588 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + PAK + +A+P+GN RLGAMV+G E L+LNE+T+W G P +P +L
Sbjct: 23 IKLQYKRPAKEWVEALPLGNSRLGAMVFGSPVRERLQLNEETMWGGGPHRNDSPALLGSL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
++VRSL+ +G+ EA A K P + YQ +G++ L+F H Y++ Y R LDL
Sbjct: 83 NEVRSLIFAGKEKEAEALLDKTMRTPHNGMPYQTIGNLYLDFT-GHDNYSD--YSRNLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TA A +Y+V V +TRE F+S D VI+ +I+ ++ S++F+ S DS + +S
Sbjct: 140 KTAVATTRYAVDGVTYTREVFTSFTDNVIIMRITADKANSINFSASYDSQVKGYSVSVKG 199
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSD 248
N+++++G D +GI+ E +I + GT+ A +D + +
Sbjct: 200 NRLVLKG------------TGSDHEGIKGVVRFENQTEIKTEGGTVKAGKDNIVVKNANT 247
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ + +A++ D ++ ++++K T L+S Y T H+ YQK F+RV
Sbjct: 248 ATIYISIATNFIDYKNVSGNEARKAET-----ILKSALTKPYQTALTDHIKYYQKQFNRV 302
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L SE D S RV++F+ +D +LV LLFQFGRYLLISSS+PG
Sbjct: 303 ELDLG----------TSERMNDETDS--RVRNFKDGKDQNLVTLLFQFGRYLLISSSQPG 350
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q + LQGIWN+ L P WDS +NIN EMNYW + NLSE PLF+ + ++ G +
Sbjct: 351 GQPSTLQGIWNDQLVPPWDSKYTININTEMNYWPAEVTNLSETHFPLFEMVKEIAETGKE 410
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+V Y A+GWV HH TDIW + G + +WP GGAWL H+W+HY YT D+ FL
Sbjct: 411 TAKVMYNANGWVTHHNTDIWRTTGPVDG-AFYGMWPDGGAWLSRHMWQHYLYTGDKAFLS 469
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+ YP+L+G A F LD+L+E H Y + + PSTSPE P G ++ STMD I
Sbjct: 470 E-VYPVLKGAADFFLDFLVE-HPKYKWMVSAPSTSPEQ---GPPGTGTSITAGSTMDNQI 524
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ +V S ++A+ L+ ++A +++ + RL P +I + + EW+
Sbjct: 525 VFDVLSDALNASRALQLADNAYEKRLEDMISRLAPMQIGKYNQLQEWL 572
>gi|198275212|ref|ZP_03207743.1| hypothetical protein BACPLE_01371 [Bacteroides plebeius DSM 17135]
gi|198271795|gb|EDY96065.1| hypothetical protein BACPLE_01371 [Bacteroides plebeius DSM 17135]
Length = 800
Score = 361 bits (926), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 212/583 (36%), Positives = 308/583 (52%), Gaps = 47/583 (8%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
+P+GNG LGA+V+G V E ++LNE+T+W+G P + NPDAP+ L +R L+ G+Y E
Sbjct: 56 GLPLGNGSLGAVVFGDVAMERIQLNEETMWSGSPQECDNPDAPQYLDKIRQLLLEGKYKE 115
Query: 87 ATAASVK-------------LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
AT + + P +Q +GD+ ++F + K A YRREL+L A
Sbjct: 116 ATELTNRTQVCTGKGSGGGNGSTVPFGCFQTMGDLWIDFAN---KEAYSDYRRELNLEDA 172
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
TA V Y+ G+V F RE F S+PDQV+V ++S + +SF + ++ + Q+
Sbjct: 173 TATVTYTQGDVHFKREIFISHPDQVMVIRLSADKQQQMSFTCRMTRPEYFFTHTE-DGQL 231
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
IM G + G+Q+ A L+ + +G D L V G+D +LL
Sbjct: 232 IMSGALSDGK---------GGDGLQYMARLK---AVTKGGEVICTDSTLTVSGADEVMLL 279
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L AS+ + P +D S + ++ ++ LY H +Y F R S QL+
Sbjct: 280 LAASTDYQ--LTYPHYKGRDYLSLTRESIAKAEKKTFESLYQAHQKEYAAYFDRASFQLA 337
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
SP + TD E A ++ +P L EL+FQ+GRYLLISSSRPGT AN
Sbjct: 338 ESPDTLATDVLVAE-----AKAGKI-------NPHLYELMFQYGRYLLISSSRPGTMPAN 385
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIW L W+ H ++N+EMNYW + NLSE P+FD + L G+KTAQ
Sbjct: 386 LQGIWANKLQTPWNGDYHTDVNIEMNYWPAEVTNLSEMHLPMFDLIASLVAPGTKTAQTQ 445
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
Y GWV+H T++W +S W + AW+C H+ EHY +T D+DFL K+ YP
Sbjct: 446 YQKKGWVVHPITNVWGYTSPGE-SASWGMHTGAPAWICQHIGEHYRFTGDKDFL-KKMYP 503
Query: 494 LLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFS 552
+L+G F +DWL+ + G L + P+ SPE+ F+APDG +S T D I ++F
Sbjct: 504 VLKGAVEFYMDWLVTDPKTGKLVSGPAVSPENTFVAPDGSQCQISMGPTHDQQTIWQLFD 563
Query: 553 AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
A+E L+ N DA + V + +L T+I DG IMEW Q
Sbjct: 564 DFEMASEALQIN-DAFTQAVGDAKGKLLETRIGSDGRIMEWAQ 605
>gi|329849976|ref|ZP_08264822.1| alpha-fucosidase [Asticcacaulis biprosthecum C19]
gi|328841887|gb|EGF91457.1| alpha-fucosidase [Asticcacaulis biprosthecum C19]
Length = 806
Score = 360 bits (925), Expect = 9e-97, Method: Compositional matrix adjust.
Identities = 223/620 (35%), Positives = 324/620 (52%), Gaps = 55/620 (8%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A++ S + L + + PA + +A+P+GNGRLGAMV+G V E L+LNEDTLW G P D
Sbjct: 25 AQAKSRPSDLTLWYAQPAGPWVEALPVGNGRLGAMVFGRVAQERLQLNEDTLWAGSPYDP 84
Query: 64 TNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYA 120
NP + L+ R+L+D+ ++ +A+ + + P Y GD+ L+F H
Sbjct: 85 NNPGCLENLAKCRALIDAEKFKDASDLVNASMMAQPKTQMPYGAAGDLLLDF---HGLAQ 141
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---- 176
YRR LDL+TA A + +G +TRE FSS DQV+V +++ G L F++
Sbjct: 142 PSDYRRSLDLDTAVATTTFKIGATTYTREVFSSAVDQVLVVRLTAKGKGRLDFDLGYRHP 201
Query: 177 -------------LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKAN------ANDDPKGI 217
+ L + + + E R +N AN GI
Sbjct: 202 DQVDYGAPVYDGKVTDTLSQGAAWDKREGLSRERRPQSLAFAASSNELLVTGANIASAGI 261
Query: 218 QFSAILEIKI-SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
++I + G I+A D L V G+ LL+ A++SF + D+ DP +
Sbjct: 262 PAGLTYAVRIRAIGDGNITAAGDS-LTVRGATTVTLLIAAATSF----VRFDDTGGDPIA 316
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
+ +AL + Y+ L H+ ++ LF R++I L + + C+ +I
Sbjct: 317 RT-AALNTAAAKPYAALKADHIAAHRALFRRMTIDLGNT-----SAACAATDI------- 363
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
R+ +DP L L QF RYL+ISSSRPGTQ ANLQGIWNE ++P W S +NIN
Sbjct: 364 RIGKSLASDDPQLAALYVQFARYLMISSSRPGTQPANLQGIWNEGVNPPWGSKYTININT 423
Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG 456
EMNYW P N+ C EPL + LS+ G+KTA+V Y ASGW+ HH TD+W ++SA
Sbjct: 424 EMNYWLVEPANIGVCVEPLVRMVEDLSMTGAKTAKVMYGASGWMAHHNTDLW-RASAPID 482
Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LE 515
W +WP GGAWLC LW+HY+Y D +FL KR YPLL+G + F D L+E G L
Sbjct: 483 GAWWGMWPTGGAWLCKTLWDHYDYNRDPEFL-KRIYPLLKGASQFFADTLVEDPKGRGLV 541
Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
T+PS SPE+E + G C MD IIR++F++ I+A ++L +D K+
Sbjct: 542 TSPSISPENEHM--KGVATCA--GPAMDSQIIRDLFASTIAAQKLLANGDDGFTAKLAAM 597
Query: 576 LPRLRPTKIAEDGSIMEWVQ 595
RL +I G + EW++
Sbjct: 598 HARLPADRIGAQGQLQEWLE 617
>gi|332662485|ref|YP_004445273.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332331299|gb|AEE48400.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 819
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/589 (37%), Positives = 323/589 (54%), Gaps = 43/589 (7%)
Query: 13 LKITFN-GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
LK+ +N + +A+PIGNGRLGAMV+G V ET++LNE T+W+G P NP A +
Sbjct: 24 LKLWYNQSSGTKWENALPIGNGRLGAMVYGNVDKETIQLNEHTVWSGSPNRNDNPAALDS 83
Query: 72 LSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L+++R L+ G++ A + ++ ++Q +G + L F H Y+ Y REL
Sbjct: 84 LAEIRKLIFEGKHKAAERLANRVIITKKSHGQMFQPVGSLHLSFP-GHENYSN--YYREL 140
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D+ A A+ Y+V V +TRE +S PD+VIV +++ S++GSLSF+ + S +
Sbjct: 141 DIEKAVAKTSYTVDGVTYTREALASFPDRVIVVRLTASKAGSLSFSANYSSPQRKKVFAT 200
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGS 247
+ + I + ++ KG ++F I IK+ D G++S+ D L V+G+
Sbjct: 201 TATKDLT--------ISGTTSDHEGVKGMVEFKGITRIKL--DGGSLSS-NDTSLTVKGA 249
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
+ A L + +++F+ N D D + L +Y+ + T H+ YQK F R
Sbjct: 250 NSATLFISIATNFN----NYKDVSGDEEKRAADYLNKAYPKAYATILTGHIAAYQKYFKR 305
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V + L +P +P ER+K+F + DP LV L +QFGRYLLISSS+P
Sbjct: 306 VKLDLGTTPAA------------NLPIDERLKNFSSSNDPHLVSLYYQFGRYLLISSSQP 353
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQGIWN L+P WDS +NIN EMNYW + NL+E PL + + LSI G
Sbjct: 354 GGQPANLQGIWNNRLNPPWDSKYTININTEMNYWPAERTNLAELHRPLLEMVKELSITGQ 413
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+TA+ Y GW+ HH TDIW + A G W +W GGAWL HLWEHY Y D+ +L
Sbjct: 414 ETARTMYGTRGWMAHHNTDIWRMNGAIDG-AFWGMWTAGGAWLTQHLWEHYLYNGDKTYL 472
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
YP L+G A F +D+LIE H Y L +P SPE+ A G + + +TMD
Sbjct: 473 AS-VYPALKGAALFYVDFLIE-HPQYKWLVVSPGNSPENAPKAHGG--SSLDAGTTMDNQ 528
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
I+ +VFS+ I A++L K+ A V+ + + RL P I + + EW+
Sbjct: 529 IVYDVFSSTIRTAQLLGKDA-AFVDTLKQLRSRLAPMHIGQHNQLQEWL 576
>gi|311746497|ref|ZP_07720282.1| alpha-L-fucosidase 2 [Algoriphagus sp. PR1]
gi|126575394|gb|EAZ79726.1| alpha-L-fucosidase 2 [Algoriphagus sp. PR1]
Length = 826
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 212/590 (35%), Positives = 337/590 (57%), Gaps = 43/590 (7%)
Query: 12 PLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PL + + PA +T+A+PIGNG+LGAMV+G V +E ++LNE T+W+G P NPDA
Sbjct: 32 PLTLWYEQPAGEVWTNALPIGNGKLGAMVYGNVENELIQLNEHTVWSGGPNRNDNPDALA 91
Query: 71 ALSDVRSLVDSGQYAEA---TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+ EA + +++ +Q +GD+ + F+ H + YRRE
Sbjct: 92 ALPEIRRLIFEGKQKEAEELASKTIQTKKSNGQKFQPVGDLNIAFE-GHTTFT--NYRRE 148
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY- 186
LD+ A ++V Y V V +TRE +S + VI ++ S+ G +SF S+ + N S
Sbjct: 149 LDIERAVSKVTYEVDGVVYTREAIASFAENVIAVHLTASKPGMISFIASMTTPQPNASIA 208
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVE 245
+N +N++ + G ++ KG I+F ++ +IK + T + + V+
Sbjct: 209 LNSDNELAISGTT---------TDHEGVKGKIKFKSLTKIKNIGGKLTSTG---TSIAVK 256
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+D A + + +++F+ N D + D S + L + S++DL +L DYQ F
Sbjct: 257 NADEATIYIAIATNFN----NYLDLEGDENSRAKGFLVNATTQSFNDLLKTNLVDYQNYF 312
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RVS+ L E + +P+ ER+++F+T DPSLV L +Q+GRYLLISSS
Sbjct: 313 NRVSLSLG------------ETDASKLPTDERLRNFRTGNDPSLVSLYYQYGRYLLISSS 360
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PG Q ANLQGIWN+++SP WDS +NIN +MNYW + NL+E EP ++ ++
Sbjct: 361 QPGGQPANLQGIWNKEMSPPWDSKYTININAQMNYWPAEKTNLAELHEPFLKMVSEMAEA 420
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+V Y A GW+ HH TDIW + + + W +W GGAW HLW+H+ Y+ D +
Sbjct: 421 GEETARVMYGARGWMAHHNTDIW-RITGPVDAIFWGIWSGGGAWTSQHLWDHFQYSGDME 479
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+L K YP+L+G A F +D+L+E D +L NP TSPE+ A DG + + +TMD
Sbjct: 480 YL-KSIYPILKGAAMFYVDFLVEHPDKPWLVVNPGTSPENAPAAHDG--SSLDAGTTMDN 536
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
++ + FS +I A+E+L K + A + + +L P +I + G + EW+
Sbjct: 537 QLVFDAFSTVIQASELL-KIDQAFADTLQLMRDQLPPMQIGKHGQLQEWL 585
>gi|443622308|ref|ZP_21106841.1| putative Large secreted protein [Streptomyces viridochromogenes
Tue57]
gi|443344193|gb|ELS58302.1| putative Large secreted protein [Streptomyces viridochromogenes
Tue57]
Length = 973
Score = 360 bits (923), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 219/576 (38%), Positives = 309/576 (53%), Gaps = 49/576 (8%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N ++++R V + Q+
Sbjct: 60 ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANIAEIRRRVFADQWGP 119
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G PA YQ +G++ L F + Y R LDL TATA Y +
Sbjct: 120 AQDLINQTMLGSPAGQLAYQPVGNLRLSFGSA---TGASQYNRTLDLTTATAITTYVLNG 176
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+ PDQVIV +++ + S++F + DS I ++G
Sbjct: 177 VRYQREVFAGAPDQVIVVRLTADRANSIAFIATFDSPQRTTVSSPDGATIALDG------ 230
Query: 204 IPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
+ A + G ++F A+ ++ GT+S+ L+V G+ +L+ SS+
Sbjct: 231 ---ISGAMEGIAGRVRFLALANAAVTG--GTVSS-SGGTLRVSGATSVTMLVSIGSSY-- 282
Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
+N + D + S L + R++ L +RHL DYQ LF+RVS+ L R
Sbjct: 283 --VNFRKADGDYQGIARSHLNAARDVGIDVLRSRHLADYQALFNRVSVDLGR-------- 332
Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
T + + P+ R+ DP LLFQFGRYLLISSSRPGTQ ANLQGIWN+ +
Sbjct: 333 TAAADQ----PTDVRIAQHAQVNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDQM 388
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIH 442
+P+WDS +N NL MNYW + NLSEC P+FD + L++ G++ AQ Y A GWV H
Sbjct: 389 APSWDSKFTINANLPMNYWPADTTNLSECFRPVFDMINDLTVTGARVAQAQYGAGGWVTH 448
Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
H TD W +S G W +W GGAWL T +W+HY +T D DFL YP L+G A F
Sbjct: 449 HNTDAWRGASVVDG-AQWGMWQTGGAWLATLIWDHYLFTGDIDFLRSN-YPALKGAAQFF 506
Query: 503 LDWLIEGHD--GYLETNPSTSPE--HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
LD L+ H G+L TNPS SPE H A V TMD I+R++F+++ A
Sbjct: 507 LDTLVA-HPALGHLVTNPSNSPELAHH------TNATVCAGPTMDNQILRDLFNSVARAG 559
Query: 559 EVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
E+L + + L + RL PT++ G+I EW+
Sbjct: 560 EILGADA-TFRAQALAARDRLPPTRVGSRGNIQEWL 594
>gi|418518724|ref|ZP_13084861.1| hypothetical protein MOU_18224 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|418519757|ref|ZP_13085808.1| hypothetical protein WS7_01835 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410702673|gb|EKQ61175.1| hypothetical protein MOU_18224 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410704417|gb|EKQ62899.1| hypothetical protein WS7_01835 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 790
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 223/597 (37%), Positives = 320/597 (53%), Gaps = 50/597 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
++ GR N GI+ +++ G +S + D+
Sbjct: 216 EVTAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I D S E + +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 426 LAQTGAHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L + + + + LP P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 593
>gi|21242520|ref|NP_642102.1| hypothetical protein XAC1774 [Xanthomonas axonopodis pv. citri str.
306]
gi|21107972|gb|AAM36638.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 790
Score = 360 bits (923), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 223/597 (37%), Positives = 320/597 (53%), Gaps = 50/597 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
++ GR N GI+ +++ G +S + D+
Sbjct: 216 EVTAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I D S E + +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 426 LAQTGTHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L + + + + LP P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 593
>gi|381169519|ref|ZP_09878684.1| hypothetical protein XMIN_121 [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380690109|emb|CCG35171.1| hypothetical protein XMIN_121 [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 790
Score = 359 bits (922), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 223/597 (37%), Positives = 320/597 (53%), Gaps = 50/597 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEKLADAKLLSRPLKKMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
++ GR N GI+ +++ G +S + D+
Sbjct: 216 EVTAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I D S E + +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 426 LAQTGTHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD ++R++F+ I+ +++L + + + + LP P +I + G + EW Q
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQ 593
>gi|393719778|ref|ZP_10339705.1| alpha/beta hydrolase domain-containing protein [Sphingomonas
echinoides ATCC 14820]
Length = 811
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 224/631 (35%), Positives = 336/631 (53%), Gaps = 73/631 (11%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
++ A+++ ++ L++ + PA +T+A+P+GNGRLGAMV+G V E L+LNEDTLW G P
Sbjct: 28 LLAAKASDASSDLRLWYRQPAGAWTEALPVGNGRLGAMVFGRVAQERLQLNEDTLWAGAP 87
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHL 117
D NP+A AL +VR+L+ +G+Y +AT AS K+ G P Y LGD+ L F +H+
Sbjct: 88 YDPDNPEALAALPEVRALLAAGRYKDATDLASAKMMGKPPAQMPYGTLGDVLLTFASAHV 147
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
YRRELDL + A ++ + + RE +S PDQVIV ++ +E+G+L F+++
Sbjct: 148 P---TVYRRELDLASGIATTEFETADGRYRREVLASAPDQVIVMRLE-AEAGTLDFDLAY 203
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD------------------------ 213
+ ++ EG P P + +D
Sbjct: 204 RA----PRAISTPRAQFSEGATPQTTRPTEWMQREDAERPGPDVTIAADGAHALLVTGSN 259
Query: 214 ------PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP 267
P G++++ L ++ D G I A K + V G+ +L+ A++S+ +
Sbjct: 260 EAALGVPAGLRYA--LRVQAVGD-GVIIA-NQKGITVSGARSVTVLITAATSYR----SY 311
Query: 268 SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEE 327
SD+ DP +A ++ Y L H+ D+ LF V I L SP
Sbjct: 312 SDTGGDPVGAVRAAGRAAERKGYPALRRSHVADHAALFGGVKIDLGTSPAA--------- 362
Query: 328 NIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWD 387
+P+ R+ + T DP+L L Q+GRYLLI+SSRPG+Q + LQGIWNE +P W
Sbjct: 363 ---ALPTDARIAAGATAVDPALAALYLQYGRYLLIASSRPGSQPSTLQGIWNEGTTPPWG 419
Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
S +NIN EMNYW + P L C EPL + LS+ G++TA+ Y A GWV HH TD+
Sbjct: 420 SKYTININTEMNYWAADPGGLGLCVEPLVRMVEDLSVTGARTARTMYGARGWVAHHNTDL 479
Query: 448 WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
W +++A +W LWP GGAWLC L+ H+++ D L R YPLL+G A F +D LI
Sbjct: 480 W-RATAPIDGPLWGLWPCGGAWLCNTLFTHWDFARDPALL-ARLYPLLKGAAHFFVDTLI 537
Query: 508 EGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED 566
E G L T+PS SPE+E P G CV MD I+R++F+ + A L ++ +
Sbjct: 538 EDPKGRGLVTSPSLSPENEH--PFGSSLCV--GPAMDRQIVRDLFTNTVVAGRTLGRDGE 593
Query: 567 --ALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
A++E+V R+ P +I G + EW++
Sbjct: 594 WLAMLEQVGA---RIAPDRIGAGGQLQEWLE 621
>gi|116622997|ref|YP_825153.1| hypothetical protein Acid_3901 [Candidatus Solibacter usitatus
Ellin6076]
gi|116226159|gb|ABJ84868.1| conserved hypothetical protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 759
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 215/592 (36%), Positives = 312/592 (52%), Gaps = 73/592 (12%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
+ +PL + + PA +TDA+P+GNGR+GAMV+GG E ++ NE T+WTG P DY + A
Sbjct: 15 SQSPLTLWYTHPADIWTDALPVGNGRMGAMVFGGAAHERIQFNEQTVWTGEPHDYAHKGA 74
Query: 69 PKALSDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYR 125
K+L +R L+ +G+ EA A A + P YQ LGD+ +E + A Y+
Sbjct: 75 SKSLQQIRELLWAGKQKEAEALAMTEFMSEPLHQKAYQALGDLIIETPGAETPTA---YK 131
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL+T A +++ + + RE F+S+P IV ++ S+ S +L H+
Sbjct: 132 RSLDLDTGIAVTEFTANGITYRREVFASHPASAIVVHLTSSQPAEFS-----ATLKCAHA 186
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
G M G+ + I+F + LE I
Sbjct: 187 ACKGG--ATMSGQV-------------ENSAIRFDSRLEKHIDSPTS------------- 218
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
A LLL A+++F D DP +++ L +I N SY L H+ D+Q LF
Sbjct: 219 ----ATLLLTAATNFK----TYQDVTADPVQRNLATLVAIGNKSYDALRAEHIRDHQSLF 270
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV++ L + +P+ ER+ +F DP+L+ LLFQFGRYL+I SS
Sbjct: 271 RRVTLDLGATAAS------------QLPTDERIAAFAKGSDPALITLLFQFGRYLMIGSS 318
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG Q ANLQG+WNE +P WDS NIN EMNYW NLSEC PLFD L L+ +
Sbjct: 319 RPGGQPANLQGLWNESNTPAWDSKYTDNINTEMNYWPVEETNLSECHLPLFDALKDLAQS 378
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G+ TA+ Y A GWV+HH D+W + +A +W GGAWL THLWEHY +T DR+
Sbjct: 379 GAITAREQYNARGWVLHHNFDLW-RGTAPINASNHGIWQTGGAWLSTHLWEHYLFTGDRE 437
Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL AYPL++G ++F +D L++ G+L T PS SPE + TMD
Sbjct: 438 FLRAAAYPLMKGASTFFIDALVKDPKTGFLYTGPSNSPEQ---------GGLVMGPTMDR 488
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
I+R +F I+AA++L N D +++ L +L + + P +I + G + EW++
Sbjct: 489 EIVRSLFGETIAAAKIL--NLDPALQEQLATLRKQIAPLQIGKYGQLQEWME 538
>gi|254472686|ref|ZP_05086085.1| large secreted protein [Pseudovibrio sp. JE062]
gi|211958150|gb|EEA93351.1| large secreted protein [Pseudovibrio sp. JE062]
Length = 835
Score = 358 bits (920), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 219/623 (35%), Positives = 322/623 (51%), Gaps = 64/623 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
++ PA H+ +A+P+GNGRLGAMV+G S + LNEDTL++G P Y P+ + V
Sbjct: 17 YDTPAAHWNEALPLGNGRLGAMVYGNPLSARIHLNEDTLYSGEPTRIYPVPEIAHQIDHV 76
Query: 76 RSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEF-DDSHLKYAEETYRRELDLNTA 133
+L+ G+ EA K + G YQ +G++ + DDS + YRR LD+ +
Sbjct: 77 EALLRDGKLFEAQEFVRKNWTGRQGQAYQPVGNLFITMADDSPVS----NYRRALDIRHS 132
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVNGNN 191
Y +F R F+S PD VIV +++ + +LSFN+ DS ++ N
Sbjct: 133 LHHESYEQNGTKFERTSFASFPDNVIVVRLTADKPCALSFNLRYDSPHPTCRTTHEGENT 192
Query: 192 QIIMEGRCP---------------------------GKRIPPKANANDDPKG-------- 216
++ + G+ P GK P N D +G
Sbjct: 193 RLHLRGQAPAFTSSRVIERIEHDLEQHRNPEIFGADGKLRPFAENEQDGHRGGIVYSEDG 252
Query: 217 ----IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
F A L +++ R E +L +EG+ L + ++SF+GP +PS K
Sbjct: 253 LGEGTYFEAGLSVELEGGR---IRPERGELHIEGATAVTLRIAMATSFNGPDKSPSREGK 309
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP S L + ++SY+D+ +H DD +LF R+S++L D ++D +
Sbjct: 310 DPAPIVKSILNAAGSVSYADMLQKHSDDVLRLFDRISLKLG---NDAISD---------L 357
Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
P++ R++ FQ DP+L L FQ+GRYLLI+SSR G+Q NLQGIWN P W S +
Sbjct: 358 PTSTRLEQFQEKGDPALAALQFQYGRYLLIASSRAGSQPPNLQGIWNNLRRPQWSSNYTM 417
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
NINLEMNYW + LS+ EPLF + L+++G++TA+ + A GW H T IW S
Sbjct: 418 NINLEMNYWPAEITGLSDLHEPLFMLIEELAVSGARTAKKMFNAPGWCAFHNTTIWRDSV 477
Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
A WPM WL +H+WEH+ YT D++FL+ RAYPL++ A F WL E DG
Sbjct: 478 PSPCDPASAFWPMAAGWLLSHMWEHFLYTGDKEFLKNRAYPLMKSAAEFYEWWLCENKDG 537
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
YL STSPE+ ++ DG + V STMD AIIRE F+ +AA++L + + L +
Sbjct: 538 YLVPKVSTSPENRYLDEDGHVITVDQGSTMDCAIIRETFANTATAAKLLGLDAE-LANTL 596
Query: 573 LKSLPRLRPTKIAEDGSIMEWVQ 595
+ RL P +I G + EW Q
Sbjct: 597 EEKAARLLPYQIGAQGQVQEWSQ 619
>gi|436835731|ref|YP_007320947.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
gi|384067144|emb|CCH00354.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
Length = 821
Score = 358 bits (919), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 223/596 (37%), Positives = 327/596 (54%), Gaps = 52/596 (8%)
Query: 13 LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
LK+ +N P+ + + +A+PIGNGRLGAMV+G VP ET++LNE TLW+G P NP+A +
Sbjct: 24 LKLWYNTPSGQTWENALPIGNGRLGAMVYGNVPRETIQLNEHTLWSGGPNRNDNPEALAS 83
Query: 72 LSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L ++R L+ + + EA A + K ++Q +G + L FD H Y Y REL
Sbjct: 84 LPEIRQLIFTNKQKEAEALANKTIITKKSHGQMFQPVGSLHLTFD-GHENYTN--YYREL 140
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-V 187
D+ A A+ Y+V V +TRE +S PDQV+V +++ S+ G L+F S +
Sbjct: 141 DIERAVAKTTYTVDGVTYTREILASLPDQVLVMQLTASKPGRLAFRASYATPQAKPVIKT 200
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
N N++ + G A+ +D KG +++ I IK G++SA +D L V+G
Sbjct: 201 NSTNELTIAG---------TASDHDGVKGLVRYKGIARIKTQG--GSVSA-DDSTLTVKG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ A + L +++F I +D D + + + L + +Y+ + T H+ YQ+ F
Sbjct: 249 ATTATIYLSVATNF----IKYNDVSGDENARAATYLNNAFPKTYAAILTPHVAAYQRYFK 304
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS L + +P+ ER+K+F+T DP LV L +Q+GRYLLISSS+
Sbjct: 305 RVSFDLGST------------EAANLPTDERLKNFRTANDPQLVTLYYQYGRYLLISSSQ 352
Query: 367 PGT-----QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
PG Q ANLQGIWN + P WDS +NIN +MNYW + NL+E EP +
Sbjct: 353 PGRDGVMGQPANLQGIWNNKMRPPWDSKYTININAQMNYWPAEKTNLAELHEPFLQMVRD 412
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
LS G +TA+V Y A GW+ HH TDIW + A G W +W GG W HLWEHY Y+
Sbjct: 413 LSETGQETARVMYGARGWMAHHNTDIWRATGAIDG-AFWGMWIAGGGWTSQHLWEHYLYS 471
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYS 539
D+ +L YP+L+G A F D+L+E H Y L NP +SPE+ A G + +
Sbjct: 472 GDKTYLAS-VYPILKGAALFYADFLVE-HPTYHWLVANPGSSPENAPKAHGG--SSLDAG 527
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWV 594
+TMD I +VF+ I AA++L+ DA LK L +L P + + G + EW+
Sbjct: 528 TTMDNQIAFDVFTTTIRAADILKT--DAAFADTLKQLRSKLPPMHVGQYGQLQEWL 581
>gi|399031123|ref|ZP_10731262.1| hypothetical protein PMI10_03140 [Flavobacterium sp. CF136]
gi|398070592|gb|EJL61884.1| hypothetical protein PMI10_03140 [Flavobacterium sp. CF136]
Length = 821
Score = 357 bits (916), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 216/594 (36%), Positives = 331/594 (55%), Gaps = 38/594 (6%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A + + + LK+ +N PA + +A+P+GNGRLGAMV+G E L+LNE+T+W G P
Sbjct: 18 ASTAQSKSELKLWYNKPATIWNEALPLGNGRLGAMVFGDPAVERLQLNEETIWAGSPNSN 77
Query: 64 TNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYA 120
+ + +AL VR LV G++ EA A+ + D YQ G + F H KY
Sbjct: 78 AHTKSIEALPKVRKLVFEGKFDEAQDLATRDIMSQTNDGMPYQTFGSAYISFP-GHQKYT 136
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y R+LD+ A+A+VKY+V +EFTRE +S DQVIV K+S S+ G ++ NV ++S
Sbjct: 137 --NYYRDLDIENASAKVKYTVNGIEFTREILTSFSDQVIVVKLSASQPGQITANVFMNSP 194
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+D NQII+ G N ++F +E K + G +SA +
Sbjct: 195 IDKTVPSTEGNQIILSGVG--------TNFEGVKGKVKFQGRIEAK--NKGGEVSA-SNG 243
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L + +D L + +++F N D +D ++S L+ + + + H+
Sbjct: 244 ILIINKADEVTLYISIATNFK----NYQDITEDEVAKSKVYLEKAISKDFETIKKAHVAY 299
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQK F+RV++ L + D + P+ ER++ F+ + DP L L FQFGRYL
Sbjct: 300 YQKFFNRVALDLGSN------DAIKK------PTNERIRDFKKEFDPQLASLYFQFGRYL 347
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSS+PG Q ANLQGIWN+ ++P WDS NIN EMNYW + NL+E EP
Sbjct: 348 LISSSQPGGQPANLQGIWNDMVTPPWDSKYTTNINAEMNYWPAEVTNLTEMHEPFIQMAK 407
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LS+ G++TA+ Y A+GWV+HH TDIW + +A +W GGAW+ LWE Y Y
Sbjct: 408 ELSVAGAETAKTMYNANGWVLHHNTDIW-RVTAPVDSAASGMWMTGGAWVSQDLWERYLY 466
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
T D ++L K YP+++G A F LD++I + + GYL PS+SPE+ GK + ++
Sbjct: 467 TGDINYL-KEIYPVIKGAADFFLDFMITDPNTGYLVVVPSSSPENTHAGGTGK-STIASG 524
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+TMD ++ ++FS +I A++++ +E+ +K+ +L ++ P KI + + EW
Sbjct: 525 TTMDNQLVFDLFSNVIKASKLVAPDEN-YTKKLSDALAKMPPMKIGKHSQLQEW 577
>gi|87200424|ref|YP_497681.1| twin-arginine translocation pathway signal protein [Novosphingobium
aromaticivorans DSM 12444]
gi|87136105|gb|ABD26847.1| Twin-arginine translocation pathway signal [Novosphingobium
aromaticivorans DSM 12444]
Length = 824
Score = 357 bits (915), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 218/593 (36%), Positives = 315/593 (53%), Gaps = 34/593 (5%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ F+ PA+ + +A+P+GNGRLGAM+ G + E L LNEDTLW+G P A L
Sbjct: 45 RLVFDSPAREWIEALPVGNGRLGAMMHGLLDGERLSLNEDTLWSGQP-SVGGAAADGLLE 103
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+R L+ +G Y A + ++ GH ++ Y L D+ ++ D + A RR LDL A
Sbjct: 104 QMRDLIFAGDYPGADRLARRMQGHFSEAYLPLADLHVDLDQAGPARA---IRRTLDLREA 160
Query: 134 TARVKYSV-GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
TA V+ G +E R F S P Q++V +I + +V LD L + +
Sbjct: 161 TAGVEIDRDGGIE-RRTLFVSAPAQLVVFRIEREGAARFGASVRLDCQLRSSIRAVSPRR 219
Query: 193 IIMEGRCPGKRIPPKANANDDPK-------GIQFSAILEIKISDDRGTISALEDKKLKVE 245
+++ G+ P P N D + G+ F+AI EI D G++ E L+VE
Sbjct: 220 LVLAGKAPTVCEPDYRNVPDPVRYSDRAGYGMAFAAIAEI---DTDGSVRKGE-GALRVE 275
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ W + L A++ + GP + P + + + L+ R ++ L H D++ L+
Sbjct: 276 NAGWLEIRLAAATGYRGPHVLPDLDPGAVEALAAAPLRRARGKPHTRLLADHRRDHRALY 335
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R ++ L DT D +P+ R + DP+L LL+ +GRYLLI+SS
Sbjct: 336 ERSALALGGG------DTARRH--DGLPTDARRAA--DPGDPALAALLYNYGRYLLIASS 385
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGT+ ANLQGIWN L W NIN+ MNYW + NL++C PL DF L+ N
Sbjct: 386 RPGTRPANLQGIWNAQLRAPWSCNYTTNINVPMNYWMAETANLADCHRPLVDFAEALARN 445
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G TA+ Y GW +HH TD+WA S+ A G WA WPMG W+ HLWEHY ++
Sbjct: 446 GGDTARDYYRMPGWCLHHNTDLWAMSNPVGAGEGDPNWANWPMGAPWIAQHLWEHYRFSG 505
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D FL RA+P++ G A F + WL+ + G L T PS SPE+ F+ DG+ A +S T
Sbjct: 506 DLAFLRDRAWPVMRGAADFCVGWLVRDPASGQLTTAPSISPENLFVTADGRTAAISAGCT 565
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEW 593
MD+A+IRE+F I+AA VL EDA KVL++L L P +I G + EW
Sbjct: 566 MDIAMIRELFGNCIAAAAVL--GEDAAFAKVLRNLSEELPPYRIGRHGQLQEW 616
>gi|365118140|ref|ZP_09336940.1| autotransporter-associated beta strand [Tannerella sp.
6_1_58FAA_CT1]
gi|363651034|gb|EHL90117.1| autotransporter-associated beta strand [Tannerella sp.
6_1_58FAA_CT1]
Length = 1402
Score = 355 bits (912), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 218/609 (35%), Positives = 338/609 (55%), Gaps = 57/609 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA ++ +A+P+GNGRL AMV+G + +T+++NEDT W+G P + NP+A L
Sbjct: 26 LKLWYDRPADYWVEALPLGNGRLAAMVYGTILQDTIQINEDTYWSGSPYNNANPNAKTHL 85
Query: 73 SDVRSLVDSGQYAEATA-------ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
+ +R ++ G+YAEA A + GH +Y+ +G++ L+F +SH Y
Sbjct: 86 NQIREYINDGEYAEAQKIALANIIADRNITGHGM-IYESIGNLLLDFPESH--KTPTNYY 142
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH- 184
RELDL+ A A+V Y+V V++TRE F+S D +I+ KIS S+ G ++FN S L ++
Sbjct: 143 RELDLSNAIAKVTYTVDGVDYTREAFTSFTDDLIIIKISASKQGMVNFNTSFVGPLKSNR 202
Query: 185 -----SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA-LE 238
V+G N I PGK A ++ + I++ + GT SA
Sbjct: 203 VKASTEIVSGTNNTIRVKNTPGKT------AEENIPNL-LRPTTYIRVVAEGGTQSADSS 255
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+K LKV +D A + + ++++ FIN D D ++++S L + Y H+
Sbjct: 256 NKILKVSDADVAYIYISSATN----FINYKDISGDSDAKALSYLNKF-DKDYEQAKNDHI 310
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
YQ+ F RVS+ D+ ++ E+ P+ +R++ F DPSL L FQFGR
Sbjct: 311 TRYQEQFGRVSL-------DLGNNSVQEKK----PTDKRIEEFSNTNDPSLASLYFQFGR 359
Query: 359 YLLISSSRPGTQVANLQGIWNEDLS--PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSS+PG+Q ANLQGIWN + P WDS NIN+EMNYW + NLSEC +P
Sbjct: 360 YLLISSSQPGSQPANLQGIWNPNAGQYPAWDSKYTTNINVEMNYWPAEVTNLSECHQPFL 419
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ + +S+ G ++A+ Y GW +HH TD+W +S+ K +WP AW C+HLWE
Sbjct: 420 EMVKDVSVTGQESAETMYGCRGWTLHHNTDLW-RSTGAVDKSACGIWPTCNAWFCSHLWE 478
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE-----FIAPD 530
HY +T D++FL + YP+L+ F D+LI + GY +PS SPE+ ++
Sbjct: 479 HYLFTGDKEFLSE-VYPILKSACEFYQDFLITDPKTGYKVVSPSNSPENHPGLFSYVDDS 537
Query: 531 GKLACVSYSS--TMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAE 586
G V+ S TMD ++ ++ I AAE+L K+ D A ++K+ LP P + +
Sbjct: 538 GNKQNVALFSGVTMDNQMVFDLLKNTIDAAEILGKDADFAADLKKLKDQLP---PMHVGK 594
Query: 587 DGSIMEWVQ 595
G + EW++
Sbjct: 595 YGQLQEWLE 603
>gi|393788377|ref|ZP_10376507.1| hypothetical protein HMPREF1068_02787 [Bacteroides nordii
CL02T12C05]
gi|392656050|gb|EIY49691.1| hypothetical protein HMPREF1068_02787 [Bacteroides nordii
CL02T12C05]
Length = 809
Score = 355 bits (911), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 221/618 (35%), Positives = 314/618 (50%), Gaps = 54/618 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ N L + + PA+ + +A+P+GNGRLGAMV+G E ++ NE+TL++G P
Sbjct: 17 VNAQNDLTLWYTTPARVWEEALPLGNGRLGAMVFGDTQKERIQFNENTLYSGEPAALNRS 76
Query: 67 DA--PKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P+ VR L+ G+ AEA + G +VYQ GD+ +F +K
Sbjct: 77 TCILPQ-YEKVRDLLKQGKNAEAEKIMQYEWIGRLNEVYQPFGDVCFDFK---MKGEVTE 132
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y LD+ A +Y G E RE F+S P Q IV + +E L F + L SL
Sbjct: 133 YVHSLDMEQAVVTTRYKQGGTEILREVFASFPGQAIVIHLK-AEKPVLHFEMQLASLHPV 191
Query: 184 HSYVNGNNQIIMEGRCP---------------------------GKRIPPKANANDDPKG 216
H G ++ MEGR P GK I + + G
Sbjct: 192 HLSCEGE-RLQMEGRAPAHVQRRTIEGMRKYNTERLHPEYFDEKGKVIRTEQVIYAEDAG 250
Query: 217 IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
+ F A + + + D G I+ +D +L V+ + LL A++S++G +PS + K+
Sbjct: 251 MAFEAYV-VPLKKD-GVIT-FKDNRLVVKDASEITFLLYAATSYNGFDKSPSKAGKNIAK 307
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
E + + + Y + H+ DYQ LF RV + L SP N P+
Sbjct: 308 ELQAQRKKLAGKEYQQIRNEHVADYQSLFKRVDLALPSSP-----------NQKDKPTDI 356
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
R+K FQT D SL+ LFQ+GRYL+IS SRPG Q NLQG+WN+ + P W+S NINL
Sbjct: 357 RLKEFQTKTDLSLIAQLFQYGRYLMISGSRPGGQPLNLQGLWNDKIIPPWNSGYTTNINL 416
Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG 456
+MNYWQ+ NLSEC +PLF F+ ++ +G + A Y +GW+ HH IW ++ G
Sbjct: 417 QMNYWQAEVTNLSECHQPLFTFIEEIAQSGKEAAHNMYGRNGWIAHHNMSIWREAYPADG 476
Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLET 516
V W W M G WLC+H+WEHY YT D FL + Y +L+ A F +WL++ G T
Sbjct: 477 FVHWFFWNMSGPWLCSHIWEHYLYTKDVAFL-REYYSILKESARFCSEWLVQNTKGEWVT 535
Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
STSPE+ F PDG+ A V STMDMAIIR +F I AAE+L D K+L+
Sbjct: 536 PVSTSPENAFRMPDGREAAVCEGSTMDMAIIRNLFGNTIHAAELL--GVDVEFRKMLEQK 593
Query: 577 PR-LRPTKIAEDGSIMEW 593
+ L +I G ++EW
Sbjct: 594 SKYLAGYRIGSHGQLLEW 611
>gi|182413173|ref|YP_001818239.1| hypothetical protein Oter_1354 [Opitutus terrae PB90-1]
gi|177840387|gb|ACB74639.1| hypothetical protein Oter_1354 [Opitutus terrae PB90-1]
Length = 1139
Score = 355 bits (910), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 221/608 (36%), Positives = 314/608 (51%), Gaps = 47/608 (7%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ F+ PA+HFT A P+GNGRLG M +GGV E + LNE +W+G P D P+A AL +
Sbjct: 321 VRFDAPARHFTAATPLGNGRLGLMPFGGVDEERVVLNEAGMWSGSPQDADRPNAAAALPE 380
Query: 75 VRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKYAE 121
+R L+ +GQ AEA + F P YQ+LG++ L F S
Sbjct: 381 IRRLLLAGQNAEAEKVVAENFTCAGAGSGRGRGANVPYGSYQVLGELRLAFASSASGTEV 440
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y RELDL A +RV Y V F RE F S PD+V V +++ ++ G++SF ++L+
Sbjct: 441 TNYARELDLADAVSRVSYERDGVRFEREAFVSAPDEVAVIRLTANKRGAISFELALERPE 500
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+ V +++M GR R + + F+ I I +RG D
Sbjct: 501 RATTRVLEGGRLLMSGRLSDGR---------GGENVGFATIARIV---NRGGSVESGDGV 548
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNL--SYSDLYTRHLD 299
L+V +D ++L+ A++ I +K + + + R+ S+ L HL
Sbjct: 549 LRVRAADEVLVLVTAATD-----IKSFAGRKVEDAAATAMADMDRSAQKSFGALRAAHLA 603
Query: 300 DYQKLFHRVSIQLSR----------SPKDIVTD-TCSEENIDTVPSAERVKSFQTDEDPS 348
Y+ LF RV ++LS SP + TD +E N A V DP
Sbjct: 604 HYRGLFDRVLLRLSEDGTEGGRRVPSPPQMTTDDRGAERNPRPTTQARLVAQAAGANDPG 663
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L +L F FGRYLLISS+RP NLQGIW + + W+ H+NIN++MN+W + C L
Sbjct: 664 LAQLYFDFGRYLLISSTRPDGFPPNLQGIWADGVQTPWNGDWHLNINVQMNFWPAEICGL 723
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGA 468
E + LF F L+ G++TA+ Y A GWV H + W +S G W G A
Sbjct: 724 PELHDSLFSFTQSLTEPGARTARAYYGARGWVAHVLANPWGFTSPGEG-ASWGATTTGSA 782
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFI 527
WLC HLW+HY +T DR FLE RAYP+++G A F LD LI E G+L T P+ SPE+EF+
Sbjct: 783 WLCQHLWDHYLFTGDRAFLE-RAYPMMKGSAEFYLDMLIEEPTHGWLVTAPANSPENEFV 841
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
DG A V T D I+R +F+A AA VL+ + + L ++ RL PT+IA D
Sbjct: 842 LADGTKAHVCLGPTFDNQILRSLFTATAEAARVLDVDAE-LQRELGAKTARLPPTRIAPD 900
Query: 588 GSIMEWVQ 595
G +MEW++
Sbjct: 901 GRVMEWLE 908
>gi|332882277|ref|ZP_08449905.1| hypothetical protein HMPREF9074_05703 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357046479|ref|ZP_09108106.1| hypothetical protein HMPREF9441_02131 [Paraprevotella clara YIT
11840]
gi|332679661|gb|EGJ52630.1| hypothetical protein HMPREF9074_05703 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355530718|gb|EHH00124.1| hypothetical protein HMPREF9441_02131 [Paraprevotella clara YIT
11840]
Length = 807
Score = 355 bits (910), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 226/599 (37%), Positives = 320/599 (53%), Gaps = 59/599 (9%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S + LK+ ++ PAK +T+A+P+GN RLGAMV+GG E L+LNE+T W G P D N
Sbjct: 15 SVAWAGELKLWYSKPAKDWTEALPVGNSRLGAMVYGGTGREELQLNEETFWAGGPYDNNN 74
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEET 123
+A L VR+L+ G+ EA H + Y +G + L+F H + E
Sbjct: 75 TNALYVLPVVRNLIFQGKTREAQQLVDANFLAHKDGMSYLTMGSLFLDFP-GHEEATE-- 131
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
+ R+L++ ATA +Y V V +TR F+S D VIV ++ ++G+L+F VS D+ L +
Sbjct: 132 FYRDLNIEDATATTRYKVDGVTYTRRVFASFTDSVIVVRLQADKAGALAFTVSYDAPLKH 191
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKK 241
G+ I C GK D +G++ A +K+ D TI+ E K
Sbjct: 192 EVSAEGDLLTIT---CEGK----------DQEGVKAALRAECRVKVVSDGQTIT--EGKN 236
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
LKV G+ A L L A++++ +N D D + + LQ + Y H+ Y
Sbjct: 237 LKVTGATEATLYLSAATNY----VNYHDVSGDAAARADCCLQRAVQIPYKKALENHVAYY 292
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
+KLF RV + L VT S+E + R++ F DPSL LLFQ+GRYLL
Sbjct: 293 RKLFGRVQLDLG------VTAASSKE------TTLRIRDFSQGNDPSLATLLFQYGRYLL 340
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSS+PG Q ANLQGIWN + WDS +NIN EMNYW + NLSE +PLF L
Sbjct: 341 ISSSQPGGQPANLQGIWNRSTNAPWDSKYTININTEMNYWLAEVANLSEMHQPLFSMLED 400
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHY 478
LS+ G+KTA+ Y GWV HH TD+W G V +A +WP GGAWL HLW+HY
Sbjct: 401 LSVTGAKTAREMYGCGGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHLWQHY 456
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACV 536
+T D+DFL K YP+L+G A F LD+L+E H Y PS SPEH V
Sbjct: 457 LFTADKDFL-KTYYPVLKGTARFFLDFLVE-HPSYKWWVVAPSVSPEH---------GPV 505
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ TMD I+ + + A+E++ ++ A + + + L +L P ++ G + EW+Q
Sbjct: 506 TAGCTMDNQIVFDALRNTLLASEIV-GDDAAFRDSLAQMLDKLPPMQVGRHGQLQEWLQ 563
>gi|423223718|ref|ZP_17210187.1| hypothetical protein HMPREF1062_02373 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638093|gb|EIY31946.1| hypothetical protein HMPREF1062_02373 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 809
Score = 355 bits (910), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 208/585 (35%), Positives = 311/585 (53%), Gaps = 41/585 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK +T+A+P+GN RLGAMV+GGV +E ++LNE+T+W G P +P A L
Sbjct: 23 LKLWYQQPAKVWTEALPLGNSRLGAMVYGGVVNEQIQLNEETVWGGGPHRNDSPKAFGVL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G +Q +G + LEFD H Y+ YRR+LDL
Sbjct: 83 PKVRELIFAGREKEAEKVMADNFFTGQHGMPFQTIGSLMLEFD-GHADYS--NYRRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V+Y +G V +TR F+S D ++ +I + G+++F + +
Sbjct: 140 ERAVASVRYKIGEVNYTRTIFTSLVDNALIIRIETDKPGAVNFTTRYSTPYKEYEIKKNG 199
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++ G P A I+F +IK ++G ++ D ++V+G+D A
Sbjct: 200 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVNVTNDC-IEVKGADAA 248
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + A+++F +N D + T + L Y+ T H + YQKLF RVS+
Sbjct: 249 VIYVTAATNF----VNYKDVSANETRRATEFLAKAMKRPYAQALTAHEEAYQKLFGRVSL 304
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ S ++ ++ R+K F +D LV L+FQFGRYLLISSS+PG Q
Sbjct: 305 NIGPSSQE--------------ETSYRIKHFNERKDLGLVALMFQFGRYLLISSSQPGGQ 350
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
A LQGIWN +L WD +NIN EMNYW + NL E EPLF + LS + TA
Sbjct: 351 PAGLQGIWNHELLAPWDGKYTININTEMNYWPAEVTNLPEMHEPLFQMVKELSESAQGTA 410
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y GW +HH TD+W + G +WP+GGAWL HLW+HY YT D+ FL K
Sbjct: 411 RTLYECRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFL-KT 467
Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
AYP L+G A F LD+L+E G++ PS SPE P G ++ TMD I+ +
Sbjct: 468 AYPALKGAADFFLDFLVEHPKYGWMVCTPSMSPEQ---GPPGTGTMITAGCTMDTQIVLD 524
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
++++SA ++L + + + + RL P +I + + EW+
Sbjct: 525 ALTSVLSATQLLYPANTSYRDSLQSMIKRLPPMQIGKHNQLQEWL 569
>gi|115391619|ref|XP_001213314.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114194238|gb|EAU35938.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 749
Score = 354 bits (909), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 218/592 (36%), Positives = 317/592 (53%), Gaps = 51/592 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+P+GNGRLGAMV G +E L+LNED++W G PGD T A + L
Sbjct: 3 ELWYRSPAATWDEALPVGNGRLGAMVHGRTTTELLQLNEDSVWYGGPGDRTPVGASRYLQ 62
Query: 74 DVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R + G +AEA ++F HP Y+ LG + L+F HL+ YRR LDL
Sbjct: 63 QLRQYIRKGAHAEAEELVRRVFFAHPISQRHYEPLGTLFLDF--GHLESEVTEYRRSLDL 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
RV+Y V F RE +S+PD VI ++ SE + F V L + D N
Sbjct: 121 QRGITRVQYMHTGVHFEREVLASHPDAVIAIRVRASEP--VEFVVRLTRMSDLEYETNEY 178
Query: 191 -NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD-DRGTISALEDKKLKVEGSD 248
+ + ++ C + P ++ + + I+ D D TI+ + +KL V +
Sbjct: 179 LDDVAVDDNCVTMHVTPGGRNSN-----RACCKVAIRCDDPDGATIARVGGRKLMVRARE 233
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS--DLYTRHLDDYQKLFH 306
LLLVA+ + + + + +AL L +S ++++RH++DYQ+L+
Sbjct: 234 --TLLLVAAQT----------TYRYQDIDGRAALDVADALRWSTEEIWSRHIEDYQQLYA 281
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R+++ +S I TD ER+K DP LV L FGRYLLI+SSR
Sbjct: 282 RMTLAMSPDASHIPTD-------------ERIKH---SRDPGLVSLYHNFGRYLLIASSR 325
Query: 367 PGTQ----VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
G ANLQGIWN P W S +NINL+MNYW + CNL+EC+ PLFD L +
Sbjct: 326 EGNGNKVLPANLQGIWNPSFHPAWGSKYTLNINLQMNYWPANVCNLAECEMPLFDLLERI 385
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ G KTA Y GW +HH TDIWA ++ + LWP+GGAWLC H+WE + ++
Sbjct: 386 ASAGQKTAHEVYGCRGWAVHHCTDIWADTAPVDQWMPATLWPLGGAWLCFHVWERFLFSK 445
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSST 541
D FL +R +P+L GC FLLD+L+E G YL T+PS SPE+ F +G+ + ST
Sbjct: 446 DEMFL-RRMFPVLRGCVEFLLDFLVEDATGQYLVTSPSLSPENLFYDAEGRQGVLCEGST 504
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+DM ++ VF A I + +L N+D LV +V + RL P +I G + EW
Sbjct: 505 IDMQLVDAVFHAFIQSVNILNLNDD-LVSRVNHASERLPPARIGSFGQLQEW 555
>gi|383641029|ref|ZP_09953435.1| hypothetical protein SchaN1_11878 [Streptomyces chartreusis NRRL
12338]
Length = 953
Score = 354 bits (909), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 224/590 (37%), Positives = 314/590 (53%), Gaps = 44/590 (7%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N + ++ PA + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D NP
Sbjct: 23 NDFALWYDKPAGTEWLRALPIGNGRLGAMVFGNVDNERLQLNEDTVWAGGPYDSANPRGA 82
Query: 70 KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
++++R V + Q+ A + + G PA YQ +G++ L + Y R
Sbjct: 83 ANIAEIRRRVFADQWGPAQDLINQTMLGSPAGQLAYQPVGNLLLSLGSA---TGASQYNR 139
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TATA Y +G V + RE F+S PDQVIV +++ + S++FN + DS
Sbjct: 140 TLDLTTATAVTTYVLGGVRYQREVFASAPDQVIVVRLTADRANSIAFNATFDSPQRTTVS 199
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
I ++G ++F A+ ++ GT+S+ L+V G
Sbjct: 200 SPDGATIALDGVS--------GTMEGITGRVRFLALAHAAVTG--GTVSS-SGGTLRVSG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +V +LV S +++ D + L + R++ L RHL DYQ LF+
Sbjct: 249 AT-SVTVLV---SIGSGYVDFRRVDGDYQGIARRHLNAARDIGIDQLRKRHLADYQALFN 304
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L R T + + P+ R+ DP L LLFQFGRYLLISSSR
Sbjct: 305 RVSVDLGR--------TAAADQ----PTDVRIAQHAQANDPQLSALLFQFGRYLLISSSR 352
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+ ++P+WDS +N NL MNYW + NLSEC P+FD + L++ G
Sbjct: 353 PGTQPANLQGIWNDQMAPSWDSKFTINANLPMNYWPADTTNLSECFLPVFDMIDDLTVTG 412
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
++ AQ Y A GWV HH TD W +S D + W +W GGAWL T +W+HY +T D D
Sbjct: 413 ARVAQAQYGAGGWVTHHNTDAWRGASVVDEAR--WGMWQTGGAWLATLIWDHYLFTGDTD 470
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL YP L+G A F LD L+ GYL TNPS SPE A A V TMD
Sbjct: 471 FLRSN-YPALKGAAQFFLDTLVAHPSLGYLVTNPSNSPELAHHAN----ATVCAGPTMDN 525
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
I+R++F+++ A EVL + + L + RL PTK+ G++ EW+
Sbjct: 526 QILRDLFNSVARAGEVLGVDA-GFRAQALAARDRLAPTKVGSRGNVQEWL 574
>gi|431797172|ref|YP_007224076.1| hypothetical protein Echvi_1807 [Echinicola vietnamensis DSM 17526]
gi|430787937|gb|AGA78066.1| hypothetical protein Echvi_1807 [Echinicola vietnamensis DSM 17526]
Length = 792
Score = 354 bits (908), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 205/588 (34%), Positives = 323/588 (54%), Gaps = 39/588 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + +A+P+GNGRLGAMV+G +E ++LNED++W G + +P L+ +R
Sbjct: 37 YEQPAGSWEEALPVGNGRLGAMVFGQTSTERIQLNEDSMWPGAADWGDSKGSPADLASLR 96
Query: 77 SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
+LV SG+ EA + F + V +Q +GD+ ++F D + YRR+L L+ A
Sbjct: 97 ALVKSGRVHEADKEIIDKFSYRGIVRSHQTMGDLFIDFGDER---EIQHYRRQLSLDDAL 153
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN-HSYVNGN--- 190
V+Y G ++T E F+S D +V +++ ++ ++F + L D+ H VN N
Sbjct: 154 VSVRYQSGGEQYTEEVFASAVDDALVIRLTTTDEAGMNFKLRLGRPKDDGHPTVNVNAPA 213
Query: 191 -NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++++M+G + + G++F L++ S G S+ E+ +L++EG
Sbjct: 214 ADELVMDGEVTQYKAAKEGQPTPLDYGVKFQTKLKVVTS---GGASSAENGELRLEGVKE 270
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
AV+ LV ++S+ + D S++ LQ + + +L H +D+ + + RVS
Sbjct: 271 AVIYLVCNTSY---------YEDDYASKNEKTLQKLGTKGFDELLLAHQEDFDEYYSRVS 321
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
+ L +DT+P+ +R+K Q +D L LFQ+GRYLLISSSRPG
Sbjct: 322 LDLGGHA------------LDTLPTDKRLKRVQDGRKDEGLAAALFQYGRYLLISSSRPG 369
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T ANLQGIWN+D+ W++ H+NINL+MNYW + P +L E PLFD++ L G
Sbjct: 370 TNPANLQGIWNKDIEAPWNADYHLNINLQMNYWPAGPTHLPEMHLPLFDYVDQLIQRGKI 429
Query: 429 TAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y + G V+HH +D+WA + W W GG W+ H WE++ +T D FL
Sbjct: 430 TAKEQYGVERGSVVHHASDLWAAPWMRANRAYWGAWIHGGGWISRHYWEYFQFTGDTTFL 489
Query: 488 EKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
++R YP L+ A+F +DWL + G + P TSPE+ ++A DG+ A +SY + M I
Sbjct: 490 KERGYPALKEFAAFYMDWLQKDDQTGLYVSYPETSPENSYLAADGQPAAISYGAAMGHQI 549
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
I +VF +SAA+VL ED E+V L +L P I DG I+EW
Sbjct: 550 ISDVFQNTLSAAKVLSI-EDDFTEEVSGKLAKLYPGVGIGPDGRILEW 596
>gi|189464329|ref|ZP_03013114.1| hypothetical protein BACINT_00670 [Bacteroides intestinalis DSM
17393]
gi|189438119|gb|EDV07104.1| hypothetical protein BACINT_00670 [Bacteroides intestinalis DSM
17393]
Length = 794
Score = 354 bits (908), Expect = 9e-95, Method: Compositional matrix adjust.
Identities = 207/585 (35%), Positives = 312/585 (53%), Gaps = 41/585 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK +T+A+P+GN RLGAMV+GGV +E ++LNE+T+W G P +P A L
Sbjct: 8 LKLWYKQPAKVWTEALPLGNSRLGAMVYGGVVNEQIQLNEETVWGGGPHRNDSPKALGVL 67
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ SG+ EA F G +Q +G + LEF+ H Y++ YRRELDL
Sbjct: 68 PTVRELLFSGREKEAEKVIADNFFTGQHGMPFQTIGSLMLEFE-GHADYSD--YRRELDL 124
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V+Y +G V +TR F+S D ++ +I + G+++F + +
Sbjct: 125 EKAIASVRYKIGEVNYTRTVFTSLADNALIVRIEADKPGAVNFTTRYSTPYKEYEIKKNG 184
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++ G P A I+F +IK ++G ++ D ++V+G+D A
Sbjct: 185 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVNVTNDC-IEVKGADAA 233
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + A+++F +N D + T + L Y+ H + YQKLF RVS+
Sbjct: 234 VIYVTAATNF----VNYKDVSANETRRATEFLSQAMKRPYAQALAAHEEAYQKLFGRVSL 289
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ S K+ ++ R+K F +D LV L+FQFGRYLLISSS+PG Q
Sbjct: 290 NVGASSKE--------------ETSYRIKHFNEGKDLGLVALMFQFGRYLLISSSQPGGQ 335
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
A LQGIWN +L WD +NIN EMNYW + NL E +PLF + LS + TA
Sbjct: 336 PAGLQGIWNHELLAPWDGKYTININTEMNYWPAEVTNLPEMHQPLFQMVKELSESAQGTA 395
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y GW +HH TD+W + G +WP+GGAWL HLW+HY YT D+ FL+
Sbjct: 396 RTLYDCRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFLQT- 452
Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
AYP L+G A F LD+L+E G++ PS SPE P G ++ TMD I+ +
Sbjct: 453 AYPALKGAADFFLDFLVEHPKYGWMVCAPSMSPEQ---GPPGTGTMLTAGCTMDTQIVLD 509
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
++++SA ++L + + + + + RL P +I + + EW+
Sbjct: 510 ALTSVLSATKLLYPDHTSYCDSLQGMIKRLPPMQIGKHNQLQEWL 554
>gi|237710563|ref|ZP_04541044.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|265750338|ref|ZP_06086401.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|345516324|ref|ZP_08795817.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|423232070|ref|ZP_17218472.1| hypothetical protein HMPREF1063_04292 [Bacteroides dorei
CL02T00C15]
gi|423238857|ref|ZP_17219973.1| hypothetical protein HMPREF1065_00596 [Bacteroides dorei
CL03T12C01]
gi|423246621|ref|ZP_17227674.1| hypothetical protein HMPREF1064_03880 [Bacteroides dorei
CL02T12C06]
gi|229433914|gb|EEO43991.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|229455285|gb|EEO61006.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|263237234|gb|EEZ22684.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|392625607|gb|EIY19671.1| hypothetical protein HMPREF1063_04292 [Bacteroides dorei
CL02T00C15]
gi|392635319|gb|EIY29221.1| hypothetical protein HMPREF1064_03880 [Bacteroides dorei
CL02T12C06]
gi|392647735|gb|EIY41433.1| hypothetical protein HMPREF1065_00596 [Bacteroides dorei
CL03T12C01]
Length = 819
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 222/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A K+L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDRPEALKSL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQNLIQDNFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+VIV +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVIVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ G+ D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGK------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
AYP L+G A F LD+LIE + G++ T PS SPEH D K A + TMD II
Sbjct: 470 -AYPALKGAADFYLDYLIEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
+V S + A+ +L+ + A + L+S L RL P +I + + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575
>gi|212695001|ref|ZP_03303129.1| hypothetical protein BACDOR_04539 [Bacteroides dorei DSM 17855]
gi|212662454|gb|EEB23028.1| hypothetical protein BACDOR_04539 [Bacteroides dorei DSM 17855]
Length = 819
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 222/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A K+L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDRPEALKSL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQNLIQDNFYAGKHGMPYQTIGSLIIE-APGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+VIV +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVIVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ G+ D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGK------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
AYP L+G A F LD+LIE + G++ T PS SPEH D K A + TMD II
Sbjct: 470 -AYPALKGAADFYLDYLIEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
+V S + A+ +L+ + A + L+S L RL P +I + + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575
>gi|224536380|ref|ZP_03676919.1| hypothetical protein BACCELL_01254 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522018|gb|EEF91123.1| hypothetical protein BACCELL_01254 [Bacteroides cellulosilyticus
DSM 14838]
Length = 793
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 207/585 (35%), Positives = 312/585 (53%), Gaps = 41/585 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK +T+A+P+GN RLGAMV+GGV +E ++LNE+T+W G P +P A L
Sbjct: 7 LKLWYQQPAKVWTEALPLGNSRLGAMVYGGVVNEQIQLNEETVWGGGPHRNDSPKAFGVL 66
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G +Q +G + LEFD H Y+ YRR+LDL
Sbjct: 67 PKVRELIFAGREKEAEKVMADNFFTGQHGMPFQTIGSLMLEFD-GHADYS--NYRRDLDL 123
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V+Y +G V +TR F+S D ++ +I + G+++F + +
Sbjct: 124 ERAVASVRYKIGEVNYTRTIFTSLVDNALIIRIEADKPGAVNFTTRYSTPYKEYEIKKNG 183
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++ G P A I+F +IK ++G ++ + + ++V+G+D A
Sbjct: 184 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVN-VTNNCIEVKGADAA 232
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + A+++F +N D + T + L Y+ T H + YQKLF RVS+
Sbjct: 233 VIYVTAATNF----VNYKDVSANETRRATEFLVKAMKRPYAQALTAHEEAYQKLFGRVSL 288
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ S ++ ++ R+K F +D LV L+FQFGRYLLISSS+PG Q
Sbjct: 289 NIGPSSQE--------------ETSYRIKHFNERKDLGLVALMFQFGRYLLISSSQPGGQ 334
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
A LQGIWN +L WD +NIN EMNYW + NL E EPLF + LS + TA
Sbjct: 335 PAGLQGIWNHELLAPWDGKYTININTEMNYWPAEVTNLPEMHEPLFQMVKELSESAQGTA 394
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y GW +HH TD+W + G +WP+GGAWL HLW+HY YT D+ FL K
Sbjct: 395 RTLYECRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFL-KT 451
Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
AYP L+G A F LD+L+E G++ PS SPE P G ++ TMD I+ +
Sbjct: 452 AYPALKGAADFFLDFLVEHPKYGWMVCAPSMSPEQ---GPPGTGTMITAGCTMDTQIVLD 508
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
++++SA ++L + + + + RL P +I + + EW+
Sbjct: 509 ALTSVLSATQLLYPANTSYRDSLQSMIKRLPPMQIGKHNQLQEWL 553
>gi|260910947|ref|ZP_05917588.1| fibronectin type III domain protein [Prevotella sp. oral taxon 472
str. F0295]
gi|260634938|gb|EEX52987.1| fibronectin type III domain protein [Prevotella sp. oral taxon 472
str. F0295]
Length = 792
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 216/593 (36%), Positives = 329/593 (55%), Gaps = 36/593 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP-- 69
P K+ ++ PA F +A+PIGNG+LGAMV+G V ++ L LN+ TLW+G P D N DA
Sbjct: 24 PQKLWYDKPATFFEEALPIGNGKLGAMVYGDVWNDNLFLNDLTLWSGQPID-PNEDAGAH 82
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSH--LKYAEETYRRE 127
K + ++R + Y A + +++ GH + YQ L + ++ +S + + + YRRE
Sbjct: 83 KWIPEIRKALFEENYKLADSLQLRVQGHNSAWYQPLSIVSIQPINSQGSSQASIKNYRRE 142
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL++A A+V Y + V + RE+ +++PD+ I+ +++ S+ +L+ +SL S+L +
Sbjct: 143 LDLDSALAKVSYEIDGVTYRREYLATHPDRAILLRLTASKPRALNLRLSLTSILSH---- 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ R G I +A P + F +L+ K +D GTI+A +D L +
Sbjct: 199 --------QLRAEGDLIRLTGHAMGHPDSTVHFCNLLQAKATD--GTITA-QDTTLLINN 247
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ VL LV +S++G +P + + L+S+++ S+ L HLDDYQ LF
Sbjct: 248 ATQVVLYLVNETSYNGFDKHPVTQGAPYVQLAEADLKSLQDCSFEQLKQNHLDDYQALFG 307
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+QL + D T ++ +D E +P L L FQFGRYLLISSSR
Sbjct: 308 RVSLQLGGAQFD-TNRTTEQQLLDYTDKCE--------ANPYLEALYFQFGRYLLISSSR 358
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
ANLQG+WN L W S VNINLE NYW + NL+E PL + LS+NG
Sbjct: 359 TPGVPANLQGLWNPHLKAQWRSNYTVNINLEENYWPAQVANLAEMTMPLTGMVKALSVNG 418
Query: 427 SKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
A+ Y + GW H TD+WA ++ R WA W +GGAWL ++LWE Y++T
Sbjct: 419 RYAARNYYGINEGWCSSHNTDLWAMTNPVGEKRESPEWADWNLGGAWLLSNLWEQYDFTR 478
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR++L + +PL++G F+L WLI G L T PSTSPE+E++ P+G Y
Sbjct: 479 DRNYLRETLFPLMKGACDFMLQWLIGNPKKPGELITAPSTSPENEYVTPEGYHGTTMYGG 538
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
T D+AI+RE+F+ +A E L A +K+ +++ RL P I ++G + EW
Sbjct: 539 TADLAILRELFANTATADETLNGRPTAYSKKLRQTIARLHPYTIGKEGDLNEW 591
>gi|354584080|ref|ZP_09002977.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
gi|353197342|gb|EHB62835.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
Length = 844
Score = 353 bits (905), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 213/615 (34%), Positives = 329/615 (53%), Gaps = 55/615 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S + PL++ + PA + +A+PIGNGRLG MV+G E ++LNED+LW G PG N
Sbjct: 31 SGAVERPLRLWYTSPAAEWNEALPIGNGRLGGMVFGRTGLERVQLNEDSLWYGGPGRGGN 90
Query: 66 PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEE 122
P+A L D+R L+ G+ AEA A + + P YQ LGD+ L+F ++
Sbjct: 91 PNAIPYLGDIRQLLQDGRQAEAEHLARMAMTSSPKYEQPYQPLGDLLLKFLNAEAPATH- 149
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLL 181
Y RELDL + A V Y+ G + + R++F+S PD V+V +++ GSL+F +L
Sbjct: 150 -YERELDLQRSMAAVTYTSGGITYRRQYFASAPDGVLVIRLTADRPGSLTFAANLMRRPF 208
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
D + GN+ + M+G +A A+ G+ F A L + + + G I + D
Sbjct: 209 DCGTRSIGNDTLTMKG---------EAGAD----GVSFCASL--RGAAEGGNIRIIGDF- 252
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ VEG+D LLL A ++F + P + L ++ Y L++RH+++Y
Sbjct: 253 MSVEGADAVTLLLSAQTTF---------RCRKPEEMCLQQLDHASSIPYERLFSRHVEEY 303
Query: 302 QKLFHRVSIQL---------SRSPKDI----------VTDTCSEENIDTVPSAERVKSFQ 342
++ F R S++L + P D V+++ + ++ E
Sbjct: 304 REKFGRFSLKLEVDAGARDYASLPTDQRLNLLKERVRVSNSGANPEGNSGADPEGNSGAY 363
Query: 343 TDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
D+DP L+EL Q+GRYLL+SSSRPG+ ANLQGIWN+ +P W+S +N N++MNYW
Sbjct: 364 PDDDPGLIELYVQYGRYLLLSSSRPGSLAANLQGIWNDSFTPPWESKYTINANIQMNYWP 423
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+ L EC EPLFD + + NG KTA Y G+ HH T++W ++ + + +
Sbjct: 424 AELLGLPECHEPLFDLIHRMLPNGRKTAGEMYGCRGFAAHHNTNVWGETRPEGILMTCTV 483
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WPMG AWLC HLWEH + D DFL RAYP+++ A FLLD++ +G T PS SP
Sbjct: 484 WPMGAAWLCLHLWEHVRFGGDADFLRDRAYPVMKEAAIFLLDYMTIDGEGRRITGPSVSP 543
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLR 580
E+ F+ PDG + + +MD I + A + A +L ++ L +E ++++P
Sbjct: 544 ENRFVLPDGAVGSLCMGPSMDSQIAHALLQACLEAGRLLGEDTRFLDELEAAIRNIP--- 600
Query: 581 PTKIAEDGSIMEWVQ 595
+I G IMEW++
Sbjct: 601 APQIGRHGGIMEWLE 615
>gi|119491166|ref|XP_001263205.1| hypothetical protein NFIA_064720 [Neosartorya fischeri NRRL 181]
gi|119411365|gb|EAW21308.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 744
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 211/589 (35%), Positives = 322/589 (54%), Gaps = 47/589 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA ++ +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA + L
Sbjct: 3 ELWYQQPAGNWEEALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQNRVPRDAFECLP 62
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+RSL+ G +AEA + F HP Y+ LG + L+F H + YRR LD+
Sbjct: 63 RLRSLIREGNHAEAEKLVRLAFFSHPISQRHYEPLGTLFLDF--GHAPEYMQNYRRSLDI 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVN 188
AT+RV+Y V+ RE +SNPD VI +I S+ + ++ S L+ + Y++
Sbjct: 121 ERATSRVEYEHKGVKVRREVIASNPDGVIAIRIQASQKTEFALRLTRMSELEYETNEYLD 180
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+ E R I P + K + + +++ +DD+ +++ + +K L V D
Sbjct: 181 ---DVTAEDRTITMHITPGGH-----KSNRACCMAKVRTADDQDSVTQIGNKLL-VNAQD 231
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A++L+ A +++ D K+ +S+ +AL S +++ RH++DY+ L+ R+
Sbjct: 232 -ALVLISAQTTY-----RCDDIDKEASSDLETALLH----STDEIWERHVNDYRSLYGRM 281
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ LS + D+ TD K + DP L+ L + RYLLIS SR
Sbjct: 282 ELHLSPNNCDMPTD----------------KRIKNSRDPGLIALYHNYCRYLLISCSRNE 325
Query: 369 TQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
+ A LQGIWN P W +NINL+MNYW + CNLS+C+ PLF L ++ +G
Sbjct: 326 DKALPATLQGIWNPSFHPAWGCKYTININLQMNYWPANICNLSDCEMPLFSLLERVAKSG 385
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+ AQ Y GWV HH TDIWA +S + LWP+GGAWLC H+W+H+ +T D+ F
Sbjct: 386 EEAAQTMYGCRGWVAHHCTDIWADTSPVDTWMPATLWPLGGAWLCVHIWDHFRFTRDKGF 445
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L+ R +P+L+GC FLLD+L+E G YL TNPS SPE+ F +G+ + ST+D+
Sbjct: 446 LQ-RMFPILQGCVQFLLDFLVEDASGEYLVTNPSLSPENTFYDKNGERGVLCEGSTIDIQ 504
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
I+ V SA + + E LE E L L +L RL P +I G + EW
Sbjct: 505 IVNAVLSAYLKSVEELEI-EAKLAPAALDALHRLPPLRIGSYGQLQEWA 552
>gi|390943730|ref|YP_006407491.1| hypothetical protein Belba_2169 [Belliella baltica DSM 15883]
gi|390417158|gb|AFL84736.1| hypothetical protein Belba_2169 [Belliella baltica DSM 15883]
Length = 836
Score = 352 bits (904), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 209/587 (35%), Positives = 326/587 (55%), Gaps = 42/587 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ ++ PA+ + +A+PIGNGRLGAMV+G E ++LNE+T + G P NP+A KAL
Sbjct: 45 MKLWYDRPAQQWVEALPIGNGRLGAMVFGNPQEEVIQLNENTFYAGHPYRNDNPNALKAL 104
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+Y +A FG P + YQ +G+++L++ D E Y RELDL
Sbjct: 105 EGVRKLIFDGEYVQAQDTIDQNFFGGPHGMPYQTIGNLKLKYQDES---EVENYYRELDL 161
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A ++ V F+ + SS PDQVIV KI+ + S+SF+ ++D G
Sbjct: 162 EYAVVSNRFKKSGVNFSTKIISSFPDQVIVAKITADKPKSISFSATMDRPGPFEITTTGE 221
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSD 248
+Q+IM G + D +GI+ + + +K + G+I + E+K++ + +D
Sbjct: 222 DQLIMSG------------ISSDHEGIKGAVKFQANVKFVNKNGSIKS-ENKEIIISEAD 268
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ + +++F +N D D + +S S L+ + +Y +H+ DY+ LF RV
Sbjct: 269 EVTIYISIATNF----VNYKDISADASEKSTSLLEKAIENDFERIYKKHVTDYRNLFDRV 324
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L +S D V +P+ +R+ F D L L FQFGRYLLI++SRPG
Sbjct: 325 QLDLGKS--DAVN----------LPTDKRIAQFAEGNDAHLAALYFQFGRYLLIAASRPG 372
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQGIWN ++P WDS VNIN EMNYW + NLSE EP LS +G +
Sbjct: 373 GQPANLQGIWNHQMNPAWDSKYTVNINAEMNYWPAEITNLSELHEPFIQMAKDLSESGQQ 432
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y A GWV+HH TD+W + + +WP+GGAW+ HL+E Y+++ D +L
Sbjct: 433 TARNMYGARGWVLHHNTDLW-RVTGPIDFAAAGMWPLGGAWVSQHLFEKYDFSGDEKYL- 490
Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
K YP+ + A+F LD+L++ G+ +PS SPE+ I + V+ +TMD ++
Sbjct: 491 KSVYPVAKEAATFFLDFLVKDPQTGFWVVSPSVSPEN--IPYQFHNSAVAAGNTMDNQLV 548
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
++F+ I AAE+L +ED L+ ++ + L L P +I + G + EW+
Sbjct: 549 FDLFTKTIRAAEIL-GDEDDLINEMKEKLSMLPPMQIGKWGQLQEWM 594
>gi|383114822|ref|ZP_09935584.1| hypothetical protein BSGG_1004 [Bacteroides sp. D2]
gi|313693469|gb|EFS30304.1| hypothetical protein BSGG_1004 [Bacteroides sp. D2]
Length = 812
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 210/612 (34%), Positives = 322/612 (52%), Gaps = 52/612 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PAK + +A+P+GNGRLGAM++G E ++ NE+TL++G P + + L
Sbjct: 24 LTLWYKSPAKVWEEALPVGNGRLGAMIFGEPQKERIQFNENTLYSGEPETPKDINVASDL 83
Query: 73 SDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R L++ G+ EA K G + YQ GD+ +EF K A Y LD+N
Sbjct: 84 GHIRQLLNEGKNTEAGNIIQQKWIGRLNEAYQPFGDLYIEFAS---KGAITDYIHSLDMN 140
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
+ Y + RE F+S P Q I+ +S S+ L+F L+S H ++
Sbjct: 141 NSIVTTSYKQNGIAIRREVFASYPAQAIIIHLSASKP-VLNFTAHLES---PHPVTQDSD 196
Query: 192 Q--IIMEGRCPG---------------KRIPPKANANDDPKGIQFSAILEIKISDDRGT- 233
I ++G+ P +R+ P+ + IQ ++ +GT
Sbjct: 197 SQAIYLKGQAPAHAQRRDIEHMKRFNTQRLHPEY-FDQTGHVIQKKQVIYGNELGGKGTF 255
Query: 234 -----ISALEDKKLKVEGSDW-------AVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
+S+ +D KL +E + + L+L A++S++G +PS K+P E +
Sbjct: 256 FEACLLSSHKDGKLVIENNQFIAQDCSEVTLVLYAATSYNGLHKSPSKEGKNPHQEINNY 315
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
+ SY L H+ DYQ LF RVS L + + + P+ +R+K F
Sbjct: 316 RKISEKHSYKKLKEEHITDYQSLFKRVSFNLH-----------TNKQLKKTPTDQRLKLF 364
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+ ED +++ LFQFGRYL+I+ SR Q NLQG+WN ++ P W+S +NINLEMNYW
Sbjct: 365 KKKEDQTIITQLFQFGRYLMIAGSRGEGQPLNLQGLWNNEVLPPWNSGYTLNINLEMNYW 424
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+ NLSEC +PLF + ++ G A+ Y +GW IHH IW ++ G V W
Sbjct: 425 PAEVTNLSECHQPLFKLIEEIADKGKNLARDMYGLNGWAIHHNISIWREAYPSDGFVYWF 484
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
W M G WLC H+WEHY YT D DFL K+ YP+L+G A+F +WL+E +G L T STS
Sbjct: 485 FWNMSGPWLCNHIWEHYLYTKDIDFL-KKYYPILKGSATFCSEWLVENSEGELVTPVSTS 543
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PE+ ++ PDG A V STMD+AIIR +FS I+A++VL+ + ++ + + +L+
Sbjct: 544 PENAYLMPDGISASVCEGSTMDIAIIRSLFSNTINASKVLQ-TDSLFCAELTQKVNKLKK 602
Query: 582 TKIAEDGSIMEW 593
+I G ++EW
Sbjct: 603 YQIGSKGQLLEW 614
>gi|373952811|ref|ZP_09612771.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
gi|373889411|gb|EHQ25308.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
Length = 833
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 216/602 (35%), Positives = 325/602 (53%), Gaps = 50/602 (8%)
Query: 6 STSTTNP-----LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+ + TNP L++ +N P+ K + +A+PIGNGRLGAM++G V ET++LNE TLW+G
Sbjct: 26 AKAQTNPKDQTTLRLWYNKPSGKVWENALPIGNGRLGAMIYGNVGVETIQLNEHTLWSGG 85
Query: 60 PGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSH 116
P NP A +L+ +R L+ +G+ +A + K+ +++ G++ L F++
Sbjct: 86 PNRNDNPLALDSLAAIRKLIFNGKQKQAEQLANKVIISKKSQGQIFEPAGELYLAFNNQE 145
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
Y RELD+ A ++ Y VG+V FTRE F+S PD+VIV ++ S+ GS+SF
Sbjct: 146 ---NYTNYYRELDIEKAISKTSYQVGDVSFTREAFASIPDRVIVMHLTASKPGSISFTAF 202
Query: 177 LDSLLDNHSYVNGN-NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTI 234
S + + QI G ++ KG +++ I E K + GT
Sbjct: 203 YSSPQHDVAVATFQARQITFAGTTID---------HEGVKGMVRYKGIAEFKTNG--GTK 251
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
SA D + + G++ + + +++F+ N D + T + + L SY++L
Sbjct: 252 SA-TDTSVTIYGANDVTIYISIATNFN----NYHDLGGNETERAANYLNKASGKSYTELQ 306
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ YQK F+RV L + +I +P+ ER+K+F +DP L F
Sbjct: 307 KTHIAAYQKYFNRVRFSLGAA------------DISKLPTDERLKNFNQGQDPQFAALYF 354
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYLLISSS+PG Q ANLQGIWN L P WDS +NIN EMNYW + NL E EP
Sbjct: 355 QYGRYLLISSSQPGGQPANLQGIWNNKLYPAWDSKYTININAEMNYWPAEKTNLPEIHEP 414
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L++NG +TA+V Y A GW+ HH TDIW + A G W +W GG W HL
Sbjct: 415 FLQMVKELAVNGEQTAKVMYGARGWMAHHNTDIWRATGAVDG-AFWGIWNQGGGWTSEHL 473
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGK 532
WEHY Y D+D+L + Y +L G A F +D+L+E H +L NP SPE+ A G
Sbjct: 474 WEHYLYNGDKDYL-RSVYGVLRGAALFYVDFLVEQPVHH-WLVINPDMSPENAPAAHQG- 530
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
+ + +TM I+ +VFS+ I AAE+L ++ V+ + + +L P I + G + E
Sbjct: 531 -SSLDAGTTMSNQIVFDVFSSTIRAAEILNIDK-PFVDTLKQMRSKLSPMHIGQFGQLQE 588
Query: 593 WV 594
W+
Sbjct: 589 WL 590
>gi|315607320|ref|ZP_07882320.1| possible alpha-L-fucosidase [Prevotella buccae ATCC 33574]
gi|315251023|gb|EFU31012.1| possible alpha-L-fucosidase [Prevotella buccae ATCC 33574]
Length = 787
Score = 352 bits (903), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 210/604 (34%), Positives = 327/604 (54%), Gaps = 46/604 (7%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M A S +P K+ + PA+ +TDA+P+GNGRLGAMV+G +E ++LNE+T+W G P
Sbjct: 16 MLASLFSQAHPYKLWYREPAQVWTDALPLGNGRLGAMVYGIPATERIQLNEETIWAGQPN 75
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEAT------AASVKLFGHPADVYQLLGDIELEFDDS 115
N A KA+ ++ L+ G+Y +A S +G P YQ G++ +
Sbjct: 76 GNANAKALKAIPVIQDLIWKGEYKKAQDLATSDVMSATNWGMP---YQAFGNVYISMPGM 132
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
Y REL L++A A +++ V + RE +S D V+ + + + G ++FN
Sbjct: 133 G---NYTNYYRELSLDSARAITRWTANGVTYRREVITSLADNVVTVRFTADQRGCITFNA 189
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSA-ILEIKISDDRGT 233
+ D+ I+++ + + ++ KG ++F + + G
Sbjct: 190 YFTTPHDD---------IMIKSEGDEATLFGVTSKHEGLKGKVRFMGRMAAVAKGKGEGA 240
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++ +D + V+G+D AVL + +++F+ N D D S L++ Y+
Sbjct: 241 VTHSKDGIVSVKGADEAVLYISIATNFN----NYKDISGDEAVRSEQILRAAMARDYAQS 296
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+ +++L HRV++ L E+ +P+ ER+ F +D LV
Sbjct: 297 KAEHISRFRQLMHRVTLNLG------------EDQYKDLPTDERIIRFADRDDNYLVATY 344
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P WDS NINLEMNYW + P L+E E
Sbjct: 345 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPAWDSKYTTNINLEMNYWPAEPTQLTELTE 404
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCT 472
PLF + +S G+KTA+ Y SGWV+HH TDIW + D + +W GGAWLC
Sbjct: 405 PLFRLIREVSETGAKTARTMYGKSGWVLHHNTDIWCVTGGIDHAQS--GMWMTGGAWLCR 462
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLWEHY YTMD+DFL +R YP+++G A FL LI E G+L +PS SPE+ + DG
Sbjct: 463 HLWEHYLYTMDKDFL-RRYYPVMKGAAEFLDQMLIPEPQHGWLVISPSVSPENSHPSKDG 521
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K+A +S +TMD+ ++ E+F +++A++VL ++ AL + L + P ++ + G +
Sbjct: 522 KVA-ISAGTTMDVQLVNELFREVMAASKVLGEDA-ALAAHYAERLKLMPPMQVGKWGQLQ 579
Query: 592 EWVQ 595
EW++
Sbjct: 580 EWME 583
>gi|423311596|ref|ZP_17289533.1| hypothetical protein HMPREF1058_00145 [Bacteroides vulgatus
CL09T03C04]
gi|392690241|gb|EIY83511.1| hypothetical protein HMPREF1058_00145 [Bacteroides vulgatus
CL09T03C04]
Length = 819
Score = 352 bits (903), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 220/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A ++L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDKPEALESL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA + F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQNLIQENFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+V+V +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVVVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ GR D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGR------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
AYP L+G A F LD+L E + G++ T PS SPEH D K A + TMD II
Sbjct: 470 -AYPALKGAADFYLDYLTEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
+V S + A+ +L+ + A + L+S L RL P +I + + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575
>gi|319640719|ref|ZP_07995432.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
gi|345517731|ref|ZP_08797196.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|254836837|gb|EET17146.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|317387531|gb|EFV68397.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
Length = 819
Score = 352 bits (903), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 220/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A ++L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDKPEALESL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA + F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQDLIQENFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+V+V +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVVVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ GR D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGR------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
AYP L+G A F LD+L E + G++ T PS SPEH D K A + TMD II
Sbjct: 470 -AYPALKGAADFYLDYLTEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
+V S + A+ +L+ + A + L+S L RL P +I + + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575
>gi|251798253|ref|YP_003012984.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
gi|247545879|gb|ACT02898.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
Length = 767
Score = 351 bits (901), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 211/586 (36%), Positives = 319/586 (54%), Gaps = 44/586 (7%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ ++ PAK + +A+PIGNGRLGAM++G +E ++LNED+LW G P D NPDA L++
Sbjct: 12 LLYHSPAKQWEEALPIGNGRLGAMIFGDPRAERVQLNEDSLWYGGPRDRHNPDALPNLAE 71
Query: 75 VRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAE-ETYRRELDL 130
+R L+ G+ EA AS+ L P Y LGD+ L F+ + AE Y R LDL
Sbjct: 72 IRKLIFEGKLQEAERLASLALTAIPESQRHYVPLGDLFLRFEHA----AEIRNYERRLDL 127
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A V Y+ G +F RE F+S PD+ IV +++ G +SF + + YV+
Sbjct: 128 SEAIVHVSYTAGETKFAREIFASYPDRAIVLRLTADSPGQISFTARMGR--ERFRYVD-- 183
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
E R RI N+ G+++ +L + G++ + + L V +D
Sbjct: 184 -----EIRAEEGRIVMCGNSGG---GVRYCGVL--ACVPEGGSMRTI-GEHLVVSNADAV 232
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+L++ AS+ F + DP + ++ + +YS+L H+ DY+ L+ R +
Sbjct: 233 LLVVTASTDF---------READPEAAALGDAGRVAAAAYSELKASHISDYRSLYDRTRL 283
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
+ + + SE ++ER+ + + EDP L L F +GRYLLI+SSRPG+
Sbjct: 284 WIG--AESGLKPEISE-------TSERLVNVKAGREDPGLTALYFHYGRYLLIASSRPGS 334
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWN+D+ P WDS +NIN +MNYW + C L EC PLF+ + + NG T
Sbjct: 335 LPANLQGIWNKDMLPAWDSKFTININTQMNYWPAESCYLPECHLPLFELIERMIPNGRHT 394
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y G HH TDIWA ++ WP+G AWL HLWEHY Y D FLE
Sbjct: 395 ARSMYGCRGSAAHHNTDIWADTAPQDLWPSSTYWPLGLAWLSLHLWEHYRYGGDTAFLE- 453
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
R YP+++ A FLLD+L+E G T+PS SPE+ + P+G+ + Y +MD I RE
Sbjct: 454 RVYPMMKEAAVFLLDYLVELPSGEWVTSPSVSPENTYRLPNGETGVLCYGPSMDSQIARE 513
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+F A +A E + N D L+ ++ +++ +L P +I G ++EW +
Sbjct: 514 LFQACAAAGERIGSN-DELLGELRQAIDKLPPPRIGRYGQLLEWYE 558
>gi|408371030|ref|ZP_11168802.1| alpha-L-fucosidase [Galbibacter sp. ck-I2-15]
gi|407743587|gb|EKF55162.1| alpha-L-fucosidase [Galbibacter sp. ck-I2-15]
Length = 821
Score = 351 bits (901), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 221/601 (36%), Positives = 323/601 (53%), Gaps = 60/601 (9%)
Query: 10 TNPLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
T+PLK+ ++ P+ + +A+P+GNG +GAMV+G V E +LNE T+W+G P NP A
Sbjct: 21 TDPLKLWYDEPSGDVWENALPLGNGNIGAMVYGNVSKEIFQLNESTVWSGSPNRNDNPAA 80
Query: 69 PKALSDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETYR 125
+AL +R L+ QY A A+ K+ + ++Q +G++EL F+ H + Y
Sbjct: 81 LEALPKIRQLIFDKQYKAAEDLANEKIITKKSHGQMFQPVGNLELTFE-GHQDF--HNYS 137
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
REL++ A ++ Y+V V +TRE F+S D+V+V KIS + G +SF +
Sbjct: 138 RELNIGNAVSKTTYTVDGVTYTREAFTSLTDKVLVIKISADQPGKISFKADFTTPHKKQK 197
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGI----QFSAILEIK-----ISDDRGTISA 236
+N + + G D +G+ +F A+L IK I+ R TI
Sbjct: 198 IAIMDNNLSLWG------------VTSDHEGVLGKVEFQALLRIKTLNGDITQGRNTI-- 243
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+V +D A L + +S+F N D D T + + L +Y +L
Sbjct: 244 ------EVTNADSATLYISIASNFK----NYDDLSADETLRAKNDLDKAFIENYENLKDA 293
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ YQ F+RVS+QL T N P+ ER+++F+ ++DPS V L FQ+
Sbjct: 294 HIKAYQNYFNRVSLQLG---------TIEASN---QPTDERLENFRKNQDPSFVSLYFQY 341
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISSS+PG Q ANLQGIWN+ L+P WDS +NIN +MNYW + NLSE EP
Sbjct: 342 GRYLLISSSKPGGQAANLQGIWNKSLTPPWDSKYTININAQMNYWPAEKTNLSELHEPFL 401
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ + LS G KTA Y A GW+ HH TDIW + A G W +W GGAWL H+WE
Sbjct: 402 NMVQELSQTGKKTANDMYGARGWMAHHNTDIWRVTGAIDG-AFWGIWNGGGAWLSQHIWE 460
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLAC 535
HY YT D +FL + Y LL+G A F +D+L + D YL P SPE+ G
Sbjct: 461 HYLYTGDTEFL-RENYDLLKGAALFYVDFLAQHPDHPYLVVAPGNSPENAAQGRQG--TS 517
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWV 594
++ STMD ++ ++F+A+ISA+E L N D LK + +L P +I + + EW+
Sbjct: 518 ITAGSTMDNQLVEDIFNAVISASEAL--NTDTAFTDSLKVIKNKLPPMQIGKHNQLQEWL 575
Query: 595 Q 595
+
Sbjct: 576 E 576
>gi|150005495|ref|YP_001300239.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|294778696|ref|ZP_06744115.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|149933919|gb|ABR40617.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
gi|294447352|gb|EFG15933.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 819
Score = 351 bits (901), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 220/589 (37%), Positives = 324/589 (55%), Gaps = 42/589 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A ++L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDKPEALESL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA + F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQNLIQENFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+V+V +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVVVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ G+ D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGK------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMAII 547
AYP L+G A F LD+L E + G++ T PS SPEH D K A S TMD II
Sbjct: 470 -AYPALKGAADFYLDYLTEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVSGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
+V S + A+ +L+ + A + L+S L RL P +I + + EW++
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLE 575
>gi|237720803|ref|ZP_04551284.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229449638|gb|EEO55429.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 822
Score = 351 bits (900), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 220/603 (36%), Positives = 338/603 (56%), Gaps = 55/603 (9%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VEG+D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGVLSVEGADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDG 531
LWE Y YT D +FL + YP+L+ F + +++ H+ +L PS SPE+ +G
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKDPVHN-WLVVCPSNSPENVHSGSNG 522
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G +
Sbjct: 523 K-ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQHLKEMAPMQVGHWGQLQ 580
Query: 592 EWV 594
EW+
Sbjct: 581 EWM 583
>gi|336402504|ref|ZP_08583239.1| hypothetical protein HMPREF0127_00552 [Bacteroides sp. 1_1_30]
gi|335948353|gb|EGN10067.1| hypothetical protein HMPREF0127_00552 [Bacteroides sp. 1_1_30]
Length = 822
Score = 351 bits (900), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 220/603 (36%), Positives = 338/603 (56%), Gaps = 55/603 (9%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VEG+D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGVLSVEGADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDG 531
LWE Y YT D +FL + YP+L+ F + +++ H+ +L PS SPE+ +G
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKDPVHN-WLVVCPSNSPENVHSGSNG 522
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G +
Sbjct: 523 K-ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQ 580
Query: 592 EWV 594
EW+
Sbjct: 581 EWM 583
>gi|338213645|ref|YP_004657700.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
gi|336307466|gb|AEI50568.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
Length = 829
Score = 351 bits (900), Expect = 8e-94, Method: Compositional matrix adjust.
Identities = 215/615 (34%), Positives = 323/615 (52%), Gaps = 70/615 (11%)
Query: 11 NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
NP ++ +N PAK + DA+P+GNGRLGAMV+G E ++LNE+T W+G P
Sbjct: 47 NPSTVSWYNAPAKKWEDALPVGNGRLGAMVFGRSGEERIQLNEETYWSGGPYSTVVKGGY 106
Query: 70 KALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
K L +++ LV +Y A L G+P + YQ L ++ L F + + Y+R
Sbjct: 107 KVLPEIQKLVFEEKYLAAHNLFGRHLMGYPVEQQKYQSLANLHLFFQNQD---STTEYKR 163
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
L+L + V Y + + R+ F+S PDQVIV +++ +SGS+SF +L + N ++
Sbjct: 164 WLNLESGITSVSYKSNGITYQRDVFASAPDQVIVIRLTADKSGSISFKANLRGV-RNQAH 222
Query: 187 VN-----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTI 234
N G++ +I+ G+ D G+ E +I + G
Sbjct: 223 SNYATDYFRMDPYGSDGLILTGKSA------------DYMGVAGKLKYEARIKAIPEGGR 270
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ L +E ++ L A+++F +N D + +P I++ SY+ +
Sbjct: 271 MKTDGVDLIIENANTVTLYFAAATNF----VNYKDVRANPHQRVEDYFARIKSKSYTSIL 326
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
L DY+ F RVS+QL + + P ER++ Q+ DPSL L +
Sbjct: 327 EAALADYKHFFDRVSLQLPTTENSFL------------PLPERIQKIQSSPDPSLSALSY 374
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
FGRYL+I+SSRPGT+ ANLQGIWN++++P WDS NIN +MNYW NLSEC EP
Sbjct: 375 NFGRYLMIASSRPGTEPANLQGIWNDNMNPDWDSKYTTNINTQMNYWPVESSNLSECAEP 434
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
L F+ L+ G++ A+ +Y A GWV H TD+W + +A W + +GGAWLCTHL
Sbjct: 435 LVRFIKELTDQGTQVAREHYGAKGWVFHQNTDLW-RVAAPMDGPTWGTFTVGGAWLCTHL 493
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDG-- 531
WEHY YTMD FL K YPL++G F +D+L +G +L TNPSTSPE+ PDG
Sbjct: 494 WEHYQYTMDAAFL-KETYPLMKGSVQFFMDFLKPHPNGKWLVTNPSTSPEN---FPDGGG 549
Query: 532 -------------KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
+ + S++DM I+ ++F I A+ +L N A V++V + +
Sbjct: 550 NKPYFDEVTAGFREGTTICAGSSIDMQILFDLFGYFIEASAILGDN-SAFVQQVKVAREK 608
Query: 579 LRPTKIAEDGSIMEW 593
L P +I DGS+ EW
Sbjct: 609 LVPPQIGRDGSLQEW 623
>gi|288925248|ref|ZP_06419183.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
gi|288338013|gb|EFC76364.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
Length = 787
Score = 350 bits (899), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 209/604 (34%), Positives = 327/604 (54%), Gaps = 46/604 (7%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M A S +P K+ + PA+ +TDA+P+GNGRLGAMV+G +E ++LNE+T+W G P
Sbjct: 16 MLASLFSQAHPYKLWYREPAQVWTDALPLGNGRLGAMVYGIPATERIQLNEETIWAGQPN 75
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEAT------AASVKLFGHPADVYQLLGDIELEFDDS 115
N A KA+ ++ L+ G+Y +A S +G P YQ G++ +
Sbjct: 76 GNANAKALKAIPVIQDLIWKGEYKKAQDLATSDVMSATNWGMP---YQAFGNVYISMPGM 132
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
Y REL L++A A +++ V + RE +S D V+ + + + G ++FN
Sbjct: 133 G---NYTNYYRELSLDSARAITRWTADGVTYRREVITSLADNVVTVRFTADQRGCITFNA 189
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSA-ILEIKISDDRGT 233
+ D+ II++ + + ++ KG ++F + + G
Sbjct: 190 YFTTPHDD---------IIIKSEGDEATLFGVTSKHEGLKGKVRFMGRMAAVAKGKGEGA 240
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++ +D + V+G+D AVL + +++F+ N D D S L++ Y+
Sbjct: 241 VTHSKDGIVSVKGADEAVLYISIATNFN----NYKDISGDEAVRSEQILRAAMARDYAQS 296
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+ +++L HRV++ L E+ +P+ ER+ F +D LV
Sbjct: 297 KAEHISRFRQLMHRVTLNLG------------EDQYKDLPTDERIIRFADHDDNYLVATY 344
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P WDS NINLEMNYW + P L+E E
Sbjct: 345 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPAWDSKYTTNINLEMNYWPAEPTQLTELNE 404
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCT 472
PLF + +S G++TA+ Y SGWV+HH TDIW + D + +W GGAWLC
Sbjct: 405 PLFRLIREVSETGAETARTMYGKSGWVLHHNTDIWRVTGGIDHAQS--GMWMTGGAWLCR 462
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLWEHY YTMD+DFL +R YP+++G A FL LI E G+L +PS SPE+ + DG
Sbjct: 463 HLWEHYLYTMDKDFL-RRYYPVMKGAAEFLDQMLIPEPQHGWLVISPSVSPENSHPSKDG 521
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K+A ++ +TMD+ ++ E+F +++A++VL ++ AL + L + P ++ + G +
Sbjct: 522 KMA-IAAGTTMDVQLVNELFREVMAASKVLGEDA-ALAAHYAERLKLMPPMQVGKWGQLQ 579
Query: 592 EWVQ 595
EW++
Sbjct: 580 EWME 583
>gi|399078665|ref|ZP_10752953.1| hypothetical protein PMI01_04050 [Caulobacter sp. AP07]
gi|398033293|gb|EJL26598.1| hypothetical protein PMI01_04050 [Caulobacter sp. AP07]
Length = 786
Score = 350 bits (899), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 212/584 (36%), Positives = 316/584 (54%), Gaps = 44/584 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PAK + +A+PIGNGRLGAM++G V +E L+LNE+TLW+G P D NP A + L VR
Sbjct: 39 YDQPAKEWVEALPIGNGRLGAMIFGDVWAERLQLNENTLWSGGPYDPVNPRAREGLEPVR 98
Query: 77 SLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+L+ +G++AEA A+ L P YQ GD+ L + + + A YRR LD++ A
Sbjct: 99 ALIAAGRFAEAEQRANETLVATPPREMAYQPFGDLGLRW--AGARGAVSGYRRSLDIDNA 156
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A + + V + R +S DQVI +++ S G+L F+++L + +I
Sbjct: 157 VAETTFEIDGVRYRRRAVASPVDQVIALELTASRPGALDFDLTL-------APAQTVREI 209
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
++E R +I + N + + ++ G++ D ++ V G+ A +
Sbjct: 210 VVE-RPDTLKISGRNNDGEGGVSGALTYCGRARVVTQGGSVKG-ADGQIAVRGASRATIY 267
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L ++S+ D DP + + + S+ L ++ LF RVS+ L
Sbjct: 268 LAMATSYR----RYDDVGGDPDAITRGQIDKAAAKSFDQLARAATAAHRALFDRVSLDLG 323
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
+++I P+ R+ +T +DP LVEL FQ+ RYLLI+ SRPG Q AN
Sbjct: 324 -----------GKDDIG-APTDIRIARNETTDDPGLVELYFQYARYLLIACSRPGGQPAN 371
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQG+WN+ + P W S +NIN +MNYW + L+EC EPLFDF+ L+ G+ TA+
Sbjct: 372 LQGLWNDQVKPPWGSNYTININTQMNYWPAEAGGLAECAEPLFDFIAELAERGAVTAREM 431
Query: 434 YLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
Y A GWV HH +D+W ++ D K LWP GGAWLC HLW+HY+Y D+ FL RAY
Sbjct: 432 YGARGWVAHHNSDLWRGTAPFDHAKA--GLWPTGGAWLCVHLWDHYDYGRDKRFL-ARAY 488
Query: 493 PLLEGCASFLLDWL-IEGHDGYLETNPSTSPE--HEFIAPDGKLACVSYSSTMDMAIIRE 549
PL++G + F LD L + G+L T+PS SPE H F G C TMDM I+R+
Sbjct: 489 PLMKGASQFFLDTLQTDAATGWLVTSPSVSPENRHGF----GSTLCA--GPTMDMQILRD 542
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+F A +L + D E + ++ RL PT+I G +MEW
Sbjct: 543 LFDHTREAGRILGLDPD-FGEDLARARDRLAPTRIGAGGQLMEW 585
>gi|423213429|ref|ZP_17199958.1| hypothetical protein HMPREF1074_01490 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693889|gb|EIY87119.1| hypothetical protein HMPREF1074_01490 [Bacteroides xylanisolvens
CL03T12C04]
Length = 822
Score = 350 bits (897), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VEG+D A + + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGVLSVEGADEATVYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WV 594
W+
Sbjct: 582 WM 583
>gi|393781509|ref|ZP_10369704.1| hypothetical protein HMPREF1071_00572 [Bacteroides salyersiae
CL02T12C01]
gi|392676572|gb|EIY70004.1| hypothetical protein HMPREF1071_00572 [Bacteroides salyersiae
CL02T12C01]
Length = 827
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 220/596 (36%), Positives = 331/596 (55%), Gaps = 60/596 (10%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E L+LNE+TLW G P + NP+ K +
Sbjct: 38 KLWYDRPAQVWTEALPLGNGRLGAMVFGNPAVEQLQLNEETLWAGRPNNNANPEGLKYIP 97
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +Y + Y RE
Sbjct: 98 KVRELVFAGKYLEAQTLATEKVMSKTNSGMP---YQSFGDLRISFP-GHTRYRD--YYRE 151
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-----DSLLD 182
L+L++A +V Y V +V + RE F+S DQVI+ +++ G ++FN L D+L+D
Sbjct: 152 LNLDSACVKVGYRVDDVNYLREMFTSFTDQVIMVRLTADRPGKITFNAVLTTPHQDALVD 211
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKK 241
+G C + ++ ++ KG ++F L ++ +G + D
Sbjct: 212 T------------DGEC--VTLSGVSSWHEGLKGKVEFQGRLATRV---QGGAVSCRDGV 254
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L VEG+D AV+ + +++F IN D D + L+ +Y++ H+D +
Sbjct: 255 LTVEGADEAVVYVSLATNF----INYKDISADQVERARQYLEKAMQKNYTEAKQSHVDFF 310
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
+ RVS+ L T S E + P+ +RV+ F+T D LV FQFGRYLL
Sbjct: 311 KAYMDRVSLNLG---------TGSTEQL---PTDKRVEKFKTTHDAGLVATYFQFGRYLL 358
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SS+PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPLF
Sbjct: 359 ICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLFRMTRE 418
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+S G +TA++ Y A GWV+HH TDIW + + K +WP GGAWLC HLWE Y YT
Sbjct: 419 VSETGKETAEIMYGAKGWVLHHNTDIW-RITGPLDKAPSGMWPSGGAWLCRHLWERYLYT 477
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
D +FL + AYP+++ F + ++ E +L PS SPE+ GK A +
Sbjct: 478 GDVEFL-RSAYPIMKEAGRFFDETMVKEPLHNWLVVCPSNSPENTHAGSGGK-ATTAAGC 535
Query: 541 TMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
TMD ++ +++++II+ A +L + + + +E+ LK +P P +I G + EW+
Sbjct: 536 TMDNQLVFDLWTSIIATARLLGVDTEYASHLEERLKEMP---PMQIGRWGQLQEWM 588
>gi|374311601|ref|YP_005058031.1| alpha-L-fucosidase [Granulicella mallensis MP5ACTX8]
gi|358753611|gb|AEU37001.1| Alpha-L-fucosidase [Granulicella mallensis MP5ACTX8]
Length = 790
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 210/607 (34%), Positives = 334/607 (55%), Gaps = 47/607 (7%)
Query: 1 MMNAESTST-TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+M+AE S+ ++ ++ ++ PA + +A+PIGNGR+G M++GG E+ L E T W+G
Sbjct: 14 LMHAEGQSSPSHKTELWYSRPATRWMEAVPIGNGRIGGMIYGGTSIESFALTESTTWSGA 73
Query: 60 PGDY-TNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEF-DD 114
P D P A L +R L+ +G+YAE L G+P + + +EL F +D
Sbjct: 74 PNDKNVKPTALANLGKIRELMFAGKYAEGGELCKEHLLGNPGSFGTHLPMATLELAFPED 133
Query: 115 SHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
H + YRR L+L+ A V YS G + F RE F+SNPD ++ IS ++ S+S +
Sbjct: 134 EH----PQNYRRSLNLDEGIAYVDYSRGGLSFHREVFASNPDNALIAHISCNQPKSVSCS 189
Query: 175 VSLDSL-LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
+S L L GN+ ++++G + ++ +G+ F +++S G
Sbjct: 190 ISFPKLTLPGEVTTEGNDTLVLKGNAF------EHLHSNGKQGVAFET--RVRVSAKGGE 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++A E L ++G+D L +V +++F G + ++ ++ LQ +R +++ L
Sbjct: 242 VTAHEGA-LHLKGADAVTLHVVIATNFRG---------ANASTRNVQTLQVLRPKTFAQL 291
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVEL 352
H+ D+Q LF RV+I D+ T++ +E P+ ER K+ + +DP L L
Sbjct: 292 RAAHVADHQSLFRRVAI-------DLGTNSSAESK----PTDERRKAVEAGADDPGLASL 340
Query: 353 LFQFGRYLLISSSRPGTQVA-NLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLS 409
FQ+GRYL I+ SR + + LQGIWN+ L+ + W H++IN E NYW + CNLS
Sbjct: 341 FFQYGRYLTIAGSRVNSPLPLALQGIWNDGLASSMGWTDDFHLDINTEQNYWAAEVCNLS 400
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
ECQ PLFDF+ LSI G TA+ Y A GWV H T+ W ++A G + W ++ GG W
Sbjct: 401 ECQSPLFDFVEGLSIAGRSTARDMYGAPGWVAHVVTNPWGFTAAGWG-LGWGIFSTGGVW 459
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIA 528
L LWEHY +T D+ FL++R YP+ +G A F L ++++ G+L T PS SPE+ FIA
Sbjct: 460 LALQLWEHYRFTGDKQFLQQRLYPVYKGAAEFFLAYMVKHPQHGWLVTGPSVSPENWFIA 519
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
PDGK S T+D + + S I A+ L +E+ K ++L +L P +I + G
Sbjct: 520 PDGKQCSESMGPTVDRVFVHSLLSGCIEASTTLGIDEE-FRAKATEALKQLPPFQIGKHG 578
Query: 589 SIMEWVQ 595
+ EW++
Sbjct: 579 QLQEWLE 585
>gi|254445766|ref|ZP_05059242.1| hypothetical protein VDG1235_4013 [Verrucomicrobiae bacterium
DG1235]
gi|198260074|gb|EDY84382.1| hypothetical protein VDG1235_4013 [Verrucomicrobiae bacterium
DG1235]
Length = 784
Score = 349 bits (896), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 223/590 (37%), Positives = 312/590 (52%), Gaps = 39/590 (6%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S+ + LK G + ++ +PIGNG LGA+V G E + LN DTLW G P D + P+
Sbjct: 24 SSASILKYDEPGQFEPLSEGLPIGNGSLGALVMGRTAEERIVLNHDTLWAGGPYDPSYPE 83
Query: 68 APKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
A + L ++RSL+ ++ EA A P YQ + D+ L H + + Y
Sbjct: 84 AAEVLPEIRSLIFQDKHREAQALVQSSFMSKPMRQMSYQAMADLLL-LVPGHERV--DDY 140
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R LDL+ A A V Y V V +TREH +S D V+ +I + GS+ + LDSL
Sbjct: 141 ERSLDLDKAIATVSYEVDGVRYTREHIASAVDGVVAIRIRADKPGSVDLTLQLDSL---- 196
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ Q E G RI + A++ G +E+ + D G S D LKV
Sbjct: 197 -----HEQTRSEYWPEGMRISGRNGASEGIAG-ALDWSVEVAVQLD-GGWSMPGDGYLKV 249
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+D LL+ A +S+ +N +D +P ++ + + +S+L RHL+D+Q L
Sbjct: 250 READSVTLLVAADTSY----VNWNDVSGNPRQKNAKTIVAASEFDFSELNERHLEDFQSL 305
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RV ++L+ S ++ E N D R+ SF D+DP + EL F F RYL+IS
Sbjct: 306 YGRVDLELNTSRPEL-----GERNTDA-----RIASFSKDQDPKMAELYFNFARYLIISC 355
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+Q ANLQG+WN+ L W S +NIN EMNYW + L EC EPL L LSI
Sbjct: 356 SRPGSQSANLQGLWNDKLFAPWGSKYTININTEMNYWPTQVVQLGECMEPLAAMLQDLSI 415
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +TA+ Y ASGWV HH TD+W + G W +WPMGGAWL LWE Y +T D
Sbjct: 416 SGQRTAKNFYGASGWVTHHNTDLWRATGPIDG-AFWGMWPMGGAWLSLFLWERYEFTGDV 474
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
D LE Y +L+G A F LD L+E GYL T PS SPE+ A A TMD
Sbjct: 475 DQLETD-YAILKGSAQFFLDTLVEDPRTGYLVTAPSNSPENAHHAGVSNAA----GPTMD 529
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
AI+R++F+A A+ +L + A E VL++ +L P K+ + G + EW
Sbjct: 530 NAILRDLFAATAEASRIL-GVDSAFRESVLQTSNQLPPFKVGKAGQLQEW 578
>gi|294808085|ref|ZP_06766858.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
gi|294444726|gb|EFG13420.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
Length = 698
Score = 349 bits (895), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNCV--TLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WV 594
W+
Sbjct: 582 WM 583
>gi|443289925|ref|ZP_21029019.1| Extracellular cellulase [Micromonospora lupini str. Lupac 08]
gi|385886837|emb|CCH17093.1| Extracellular cellulase [Micromonospora lupini str. Lupac 08]
Length = 947
Score = 348 bits (894), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 220/574 (38%), Positives = 309/574 (53%), Gaps = 45/574 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N L+++R V + Q+ +
Sbjct: 61 ALPIGNGRLGAMVFGNVDTERLQLNEDTIWAGGPYDSANTRGAANLAEIRRRVFADQWTQ 120
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G+P YQ +G++ L F + Y R LDL TAT Y +
Sbjct: 121 AQDLINQTMMGNPGGQLAYQPVGNLRLAFGSAS---GASQYNRTLDLTTATVTTTYVLNG 177
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+S PDQVIV +++ +GS++FN + DS I ++G
Sbjct: 178 VRYQRESFASAPDQVIVIRLTADRAGSITFNATFDSPQRTTVSSPDAATIGVDG------ 231
Query: 204 IPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
+ A + G ++F A+ + GT+S+ L+V G+ +L+ SS+
Sbjct: 232 ---ISGAMEGVNGSVRFLALAHAVATG--GTVSS-SGGTLRVSGATSVTVLISIGSSY-- 283
Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
+N D + + L + R +++ L +RHL DYQ LF+RV+I L R
Sbjct: 284 --VNFRTVNGDYQGIARTRLNAARGVAFDQLRSRHLADYQALFNRVTIDLGR-------- 333
Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
T + + P+ R+ + DP LLFQFGRYLLISSSRPGTQ ANLQGIWN+ +
Sbjct: 334 TAAADQ----PTDVRIAQHASTNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDSM 389
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIH 442
+P WDS +N NL MNYW + NL EC P+FD + L++ G++ AQ Y A GWV H
Sbjct: 390 TPPWDSKYTINANLPMNYWPADTTNLPECFLPVFDMIKDLTVTGARVAQAQYGAGGWVTH 449
Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
H TD W +S G +W +W GGAWL T +WEHY +T D FL YP L+G A F
Sbjct: 450 HNTDGWRGASVVDG-ALWGMWQTGGAWLSTLIWEHYLFTGDVGFLSAN-YPALKGAAQFF 507
Query: 503 LDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
LD L+ H GYL TNPS SPE P A V TMD I+R++F A+ A EV
Sbjct: 508 LDTLVA-HPTLGYLVTNPSNSPE----LPHHSNASVCAGPTMDNQILRDLFDAVAQAGEV 562
Query: 561 LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
L + +V + RL P+++ G++ EW+
Sbjct: 563 LGVDA-TFRSQVRTARDRLAPSRVGSRGNVQEWL 595
>gi|294648173|ref|ZP_06725715.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|292636492|gb|EFF54968.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
Length = 822
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WV 594
W+
Sbjct: 582 WM 583
>gi|302873491|ref|YP_003842124.1| alpha-L-fucosidase [Clostridium cellulovorans 743B]
gi|307688330|ref|ZP_07630776.1| Alpha-L-fucosidase [Clostridium cellulovorans 743B]
gi|302576348|gb|ADL50360.1| Alpha-L-fucosidase [Clostridium cellulovorans 743B]
Length = 769
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 207/591 (35%), Positives = 317/591 (53%), Gaps = 45/591 (7%)
Query: 13 LKITFNGPAK--HFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
K+ ++ PA+ ++ A+P+GNG+LGAMV+G V E ++LNE++LW+G D NPDA
Sbjct: 13 FKLWYDEPAEVWNWDQALPVGNGKLGAMVFGHVHKEQIQLNEESLWSGGYLDRNNPDALA 72
Query: 71 ALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
L VR L+ G+ EA ++ + G P Y+ LGD+ ++F H + YRRE
Sbjct: 73 QLPKVRQLLFDGKLKEAERLCAIAMMGTPEHQRHYETLGDLFIDF--YHDSDEVKNYRRE 130
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN--VSLDSLLDNHS 185
LD+N A V+Y + V F RE SS D IV +I+ + ++SF V + +D +
Sbjct: 131 LDINKAMVTVQYEIDGVNFKREILSSAVDDAIVIRITADKKEAISFRGFVGRELFMDTRT 190
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+N ++ + + G C G P I +S IL K + + G + + + VE
Sbjct: 191 ALN-DSTVALRGGCGG------------PDSINYSIIL--KGTSEGGNLYTM-GGNIVVE 234
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+D L L + +S+ D + ++S +++ +Y + H+ +YQ F
Sbjct: 235 NADAVTLYLTSKTSY---------LSNDFDAVAISTAEAVSKRTYESILQDHIAEYQSYF 285
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISS 364
R+++QL + + + +P+ ER++ + + D L+ L F FGRYLLIS
Sbjct: 286 SRMTLQLGNKQEAL--------ELSKIPTDERLERVKEGKLDDGLISLYFHFGRYLLISC 337
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPGT ANLQGIWN+ + W +NIN EMNYW + CNLS+C PLFD + +
Sbjct: 338 SRPGTLPANLQGIWNKHHTSPWGCKFTININTEMNYWPAETCNLSDCHTPLFDLIEKMRE 397
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+V Y G+V HH D+W ++ + +WPMG AWLC HLWEHY +T D
Sbjct: 398 PGRHTAKVMYDCGGFVAHHNVDLWGDTAPQDHWMPATVWPMGAAWLCLHLWEHYEFTCDL 457
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL K+AY L+ A F +D+LIE +GYL T PS SPE+ + G+ + +MD
Sbjct: 458 KFL-KKAYETLKESAEFFVDYLIEDRNGYLVTCPSVSPENTYRLESGETGSLCIGPSMDS 516
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
II +FS+ I A+E+L +++ E ++ RL I + G IMEW +
Sbjct: 517 QIIYALFSSCIEASELLNTDKE-FAETLISLRERLPKPSIGKYGQIMEWAE 566
>gi|295084327|emb|CBK65850.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 822
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WV 594
W+
Sbjct: 582 WM 583
>gi|298481330|ref|ZP_06999523.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
gi|298272534|gb|EFI14102.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
Length = 822
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 219/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WV 594
W+
Sbjct: 582 WM 583
>gi|302549607|ref|ZP_07301949.1| large secreted protein [Streptomyces viridochromogenes DSM 40736]
gi|302467225|gb|EFL30318.1| large secreted protein [Streptomyces viridochromogenes DSM 40736]
Length = 953
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 220/590 (37%), Positives = 312/590 (52%), Gaps = 44/590 (7%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N L + ++ PA + A+PIGNGRLGAMV+G +E L+LNEDT+W G P D NP
Sbjct: 23 NDLALWYDKPAGADWLRALPIGNGRLGAMVFGNADTERLQLNEDTVWAGGPYDSANPRGA 82
Query: 70 KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
++++R V + Q+ A + + G PA YQ +G++ L F + Y R
Sbjct: 83 ANIAEIRRRVFADQWGPAQDLINQTMLGSPAGQLAYQPVGNLLLSFGSA---TGVSQYNR 139
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TATA Y + V + RE F+S PDQVIV +++ + S++FN + DS
Sbjct: 140 TLDLTTATAVTTYVLNGVRYQREVFASAPDQVIVVRLTADRANSIAFNATFDSPQRTTVS 199
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
I ++G ++F A+ ++ GT+S+ L+V G
Sbjct: 200 SPDGATIALDGVS--------GTMEGITGRVRFLALANAAVTG--GTVSS-SGGTLRVSG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +L+ SS+ ++ D + L + R++ L RHL DYQ LF+
Sbjct: 249 ATSVTVLVAIGSSY----VDFRRVDGDYQGIARRHLNAARDIGIDQLRRRHLADYQALFN 304
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L R+ T +++ P+ R+ DP LLFQFGRYLLISSSR
Sbjct: 305 RVSVDLGRT-------TAADQ-----PTDVRIAQHAQANDPQFSALLFQFGRYLLISSSR 352
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+ ++P+WDS VN NL MNYW + NLSEC P+FD + L++ G
Sbjct: 353 PGTQPANLQGIWNDQMAPSWDSKFTVNANLPMNYWPADTTNLSECFLPVFDMIDDLTVTG 412
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
++ AQ Y A GWV HH TD W +S D + W +W GGAWL T +W+HY +T D D
Sbjct: 413 ARVAQAQYGAGGWVTHHNTDAWRGASVVDEAR--WGMWQTGGAWLATLIWDHYLFTGDID 470
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL YP L+G A F LD L+ G+L TNPS SPE A A V TMD
Sbjct: 471 FLRSN-YPALKGAAQFFLDTLVAHPSLGHLVTNPSNSPELAHHAD----ATVCAGPTMDN 525
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
I+R++F ++ A E+L+ + + RL PTK+ G++ EW+
Sbjct: 526 QILRDLFHSVARAGEILDVDAAFRAQAKAAR-ERLAPTKVGSRGNVQEWL 574
>gi|238062935|ref|ZP_04607644.1| large secreted protein [Micromonospora sp. ATCC 39149]
gi|237884746|gb|EEP73574.1| large secreted protein [Micromonospora sp. ATCC 39149]
Length = 932
Score = 348 bits (894), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 215/573 (37%), Positives = 308/573 (53%), Gaps = 43/573 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N L+++R V + Q+ +
Sbjct: 42 ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANLAEIRRRVFADQWTQ 101
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G+PA YQ +G++ L F + Y R LDL TATA Y +
Sbjct: 102 AQDLINQTMVGNPAGQLAYQPVGNLRLAFGSAS---GASQYNRALDLTTATATTTYVLNG 158
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+S PDQVIV +++ + S++FN + DS + I ++G
Sbjct: 159 VRYQREVFASAPDQVIVIRLTADRANSITFNATFDSPQRTTVSSPDSATIGLDG------ 212
Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
AN + ++F A+ ++ GT+S+ L+V G+ +L+ +S+
Sbjct: 213 --ISANMDGVTGQVRFLALANASVTG--GTVSS-SGGTLRVSGATSVTVLVSIGTSY--- 264
Query: 264 FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
+N D + + L + R + L RHL DYQ LF+RV+I L R+
Sbjct: 265 -VNYRTVNGDYQGIARTRLNAARTAGFDQLRARHLADYQALFNRVTIDLGRT-------A 316
Query: 324 CSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
+++ D R+ DP LLFQFGRYLLISSSRPGTQ ANLQGIWN+ ++
Sbjct: 317 AADQTTDV-----RIAQHANTNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDQMA 371
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
P+WDS +N NL MNYW + NLSEC P+FD + L++ G++ AQ Y A GWV HH
Sbjct: 372 PSWDSKYTINANLPMNYWPADTTNLSECFLPVFDMIKDLTVTGARVAQAQYGAGGWVTHH 431
Query: 444 KTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
TD W +S D + +W GGAWL T +W+HY +T D +FL YP ++G A F
Sbjct: 432 NTDAWRGASVVDYAQS--GMWQTGGAWLATMIWDHYLFTGDLEFLRAN-YPAMKGAAQFF 488
Query: 503 LDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
LD L+ YL TNPS SPE + A V TMD I+R++F+ + A+EVL
Sbjct: 489 LDTLVAHPTLSYLVTNPSNSPELSHHSN----AFVCAGPTMDNQILRDLFNGVALASEVL 544
Query: 562 EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ +V + RL PTK+ G++ EW+
Sbjct: 545 GVDA-TFRTQVRTAKDRLPPTKVGSRGNVQEWL 576
>gi|325103216|ref|YP_004272870.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324972064|gb|ADY51048.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 822
Score = 348 bits (893), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 205/594 (34%), Positives = 320/594 (53%), Gaps = 48/594 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
NP+++ +N PA ++ +A+PIGNG L MV+GGV + ++LNE+T+W G PG+ P+
Sbjct: 27 NPMELWYNQPAANWNEALPIGNGFLAGMVFGGVQKDRIQLNEETIWAGEPGNNIIPNVYP 86
Query: 71 ALSDVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLKYAEET 123
A++++R L+ G+Y EA S K F G+ YQ G++ L+F
Sbjct: 87 AIAEIRKLLVEGKYKEAQDLSNKAFPRQAPKGGNYGMQYQTAGNLFLDFGHGGFI----N 142
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR LD+ ATA + Y +++ RE+ + P +VI +++ S++ S+SF + +D+
Sbjct: 143 YRRNLDIEKATASISYQANGIDYKREYIALIPKKVIAIRLTASKTKSISFTIDMDAPFKE 202
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+ ++++++ +++ D KG ++F + K+ + GT+ ++D KL
Sbjct: 203 FQKIALTDRLLLKAV---------SSSVDGKKGRVKFETQVVPKL--EGGTLE-IKDNKL 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
V+ ++ L + ++F+ N D + L + SY L H+ YQ
Sbjct: 251 VVKEANAVTLFISIGTNFN----NYQDISANENIRVKQRLAEVTGQSYKKLKANHIKSYQ 306
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ F+RV + L VT + P+ +RV F+ DP+LV L FQFGRYLLI
Sbjct: 307 QYFNRVKLDLG------VTSVMDK------PTNQRVIDFKEGNDPALVSLYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS PG+Q ANLQG WNE LSP WDS VNIN EMNYW + NL E +PLF L L
Sbjct: 355 CSSFPGSQPANLQGKWNEKLSPPWDSKYTVNINTEMNYWPAEVTNLPEMHQPLFKMLKEL 414
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G ++A Y A GW +HH TD+W + G + +WPMGGAWL H+W+HY Y
Sbjct: 415 SETGKESAGQMYKARGWNLHHNTDLWRITGPVDGG-FYGMWPMGGAWLSQHIWQHYLYNG 473
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D DFL + Y +L+G A F +D L E +L PS SPE+ ++ G V +T
Sbjct: 474 DNDFL-REYYDVLKGAAMFYVDVLQEEPKHKWLVVAPSMSPENTYLPSVG----VGAGTT 528
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD ++ +VF+ I +E+L K + + + V + RL P ++ + + EW+Q
Sbjct: 529 MDNQLVFDVFANFIRTSEIL-KQDQSFADTVRNMINRLPPMQVGQHAQLQEWLQ 581
>gi|262408009|ref|ZP_06084557.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|345511517|ref|ZP_08791057.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|229444055|gb|EEO49846.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|262354817|gb|EEZ03909.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
Length = 822
Score = 348 bits (892), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 219/603 (36%), Positives = 337/603 (55%), Gaps = 55/603 (9%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDG 531
LWE Y YT D +FL + YP+L+ F + +++ H+ +L PS SPE+ +G
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKDPVHN-WLVVCPSNSPENVHSGSNG 522
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G +
Sbjct: 523 K-ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQ 580
Query: 592 EWV 594
EW+
Sbjct: 581 EWM 583
>gi|443292342|ref|ZP_21031436.1| Alpha-L-fucosidase [Micromonospora lupini str. Lupac 08]
gi|385884621|emb|CCH19587.1| Alpha-L-fucosidase [Micromonospora lupini str. Lupac 08]
Length = 1000
Score = 348 bits (892), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 219/582 (37%), Positives = 308/582 (52%), Gaps = 44/582 (7%)
Query: 19 GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSL 78
G + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D +N AL+++R L
Sbjct: 53 GAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPHDPSNTRGAAALAEIRRL 112
Query: 79 VDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
V++ Q+ +A + + G+P YQ +G++ L F + + R LDL TAT
Sbjct: 113 VNANQWTQAQDLINQTMMGNPGGQLAYQTVGNLRLAFGSAS---GASQHNRTLDLTTATT 169
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
Y + + + RE F+S PDQVI +++ S S+SF + DS I +
Sbjct: 170 TTSYVLNGIRYQREVFASAPDQVIAMRLTADRSNSISFTATFDSPQRTTVSSPDGATIGL 229
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
+G N ++F L + + G + L+V + +L+
Sbjct: 230 DG--------VSGNMEGVTGQVRF---LALANATVSGGTVSSSGGTLRVTNATSVTVLVS 278
Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR- 314
SS+ +N + D + L + R SY L +RH+ DYQ LF RV++ L R
Sbjct: 279 IGSSY----VNYRNVGGDYGGIARQRLSAARASSYDQLRSRHVADYQALFGRVTLDLGRT 334
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANL 374
S D TD R+ + DP LLFQFGRYLLISSSRPGTQ ANL
Sbjct: 335 SAADQTTDV-------------RIAQHNSVNDPQFSALLFQFGRYLLISSSRPGTQPANL 381
Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
QGIWN+ L+P+WDS +N NL MNYW + NL+EC P+FD + L++ G++TAQV Y
Sbjct: 382 QGIWNDSLAPSWDSKYTINANLPMNYWPANTTNLAECHNPVFDLVRDLAVTGTRTAQVQY 441
Query: 435 -LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
ASGWV HH TD W +++A W +W GGAWL T +W+HY + D +FL YP
Sbjct: 442 GAASGWVTHHNTDAW-RATAVVDGAFWGMWQTGGAWLSTLIWDHYLFNGDIEFLRTN-YP 499
Query: 494 LLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFS 552
++G A F L+ L+ E GYL TNPS SPE A A V TMD I+R++F
Sbjct: 500 AMKGAAQFFLNTLVTEPTLGYLVTNPSNSPELSHHAN----ASVCAGPTMDNQILRDLFD 555
Query: 553 AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
A A+E+L+ + +V + RL P K+ G+IMEW+
Sbjct: 556 ACARASEILDV-DSTFRAQVRATRDRLPPMKVGSRGNIMEWL 596
>gi|402306106|ref|ZP_10825157.1| hypothetical protein HMPREF1146_1457 [Prevotella sp. MSX73]
gi|400379873|gb|EJP32702.1| hypothetical protein HMPREF1146_1457 [Prevotella sp. MSX73]
Length = 785
Score = 348 bits (892), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 209/604 (34%), Positives = 326/604 (53%), Gaps = 46/604 (7%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M A S +P K+ + PA+ +TDA+P+GNGRLGAMV+G +E ++LNE+T+W G P
Sbjct: 14 MLASLFSQAHPYKLWYREPAQVWTDALPLGNGRLGAMVYGIPATERIQLNEETIWAGQPN 73
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEAT------AASVKLFGHPADVYQLLGDIELEFDDS 115
N A KA+ ++ L+ G+Y +A S +G P YQ G++ +
Sbjct: 74 GNANAKALKAIPVIQDLIWKGEYKKAQDLATSDVMSATNWGMP---YQAFGNVYISMPGM 130
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
Y REL L++A A +++ V + RE +S D V+ + + + G ++FN
Sbjct: 131 G---NYTNYYRELSLDSARAITRWTADGVTYRREVITSLADNVVTVRFTADQRGCITFNA 187
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSA-ILEIKISDDRGT 233
+ D+ II++ + + ++ KG ++F + + G
Sbjct: 188 YFTTPHDD---------IIIKSEGDEATLFGVTSKHEGLKGKVRFMGRMAAVAKGKGEGA 238
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++ +D + V+G+D AVL + +++F+ N D D S L++ Y+
Sbjct: 239 VTHSKDGIVSVKGADEAVLYISIATNFN----NYKDISGDEAVRSEQILRAAMARDYAQS 294
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+ +++L HRV++ L E+ +P+ ER+ F +D LV
Sbjct: 295 KAEHISRFRQLMHRVTLNLG------------EDQYKDLPTDERIIRFAAHDDNYLVATY 342
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P WDS NINLEMNYW + L+E E
Sbjct: 343 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPAWDSKYTTNINLEMNYWPAELTQLTELNE 402
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCT 472
PLF + +S G++TA+ Y SGWV+HH TDIW + D + +W GGAWLC
Sbjct: 403 PLFRLIREVSETGAETARTMYGKSGWVLHHNTDIWRVTGGIDHAQS--GMWMTGGAWLCR 460
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLWEHY YTMD+DFL +R YP+++G A FL LI E G+L +PS SPE+ + DG
Sbjct: 461 HLWEHYLYTMDKDFL-RRYYPVMKGAAEFLDQMLIPEPQHGWLVISPSVSPENSHPSKDG 519
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K+A +S +TMD+ ++ E+F +++A++VL ++ AL + L + P ++ + G +
Sbjct: 520 KVA-ISAGTTMDVQLVNELFREVMAASKVLGEDA-ALAAHYAERLKLMPPMQVGKWGQLQ 577
Query: 592 EWVQ 595
EW++
Sbjct: 578 EWME 581
>gi|383113013|ref|ZP_09933793.1| hypothetical protein BSGG_0144 [Bacteroides sp. D2]
gi|382948895|gb|EFS29444.2| hypothetical protein BSGG_0144 [Bacteroides sp. D2]
Length = 822
Score = 347 bits (891), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 216/602 (35%), Positives = 335/602 (55%), Gaps = 53/602 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E+ + K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 23 ETNVSAQEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F SH +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-SHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARVIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D A++ + +++F+ N D + + + L+ + +
Sbjct: 242 EIACADGILSVEKADEAIVYVSIATNFN----NYQDITGNQIERAKNYLEKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L + + VP+ +RV++F+ D LV
Sbjct: 298 KKNHIDFYRQYLTRVSLDLGK------------DQYSNVPTDKRVENFKNTNDAHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA+V Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKVMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDIEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD ++ ++++ IISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLVFDLWTTIISASQILDTDQE-FATHLAQRLKEMAPMQVGHWGQLQE 581
Query: 593 WV 594
W+
Sbjct: 582 WM 583
>gi|315506426|ref|YP_004085313.1| cellulose-binding family II [Micromonospora sp. L5]
gi|315413045|gb|ADU11162.1| cellulose-binding family II [Micromonospora sp. L5]
Length = 936
Score = 347 bits (891), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 220/590 (37%), Positives = 313/590 (53%), Gaps = 44/590 (7%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N L + ++ PA + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N
Sbjct: 44 NDLALWYDEPAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGA 103
Query: 70 KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
L+++R V + Q+ A + + G P YQ +GD+ L F + Y R
Sbjct: 104 ANLAEIRRRVFADQWTSAQDLINQTMMGSPGGQLAYQTVGDLRLAFGSAS---GATQYNR 160
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TAT Y G V + RE F+S PDQV+V +++ + +++F+ + DS
Sbjct: 161 TLDLTTATITTTYVQGGVRYQREMFASAPDQVMVLRLTADRANAITFSAAFDSPQRTTVS 220
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
I ++G + ++F A+ ++ GT+S+ L+V G
Sbjct: 221 SPDGATIALDG--------VSGSMEGVTGSVRFLALANAAVTG--GTVSS-SGGTLRVSG 269
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +L+ +S+ +N D + + L + ++++ L TRH DYQ LF+
Sbjct: 270 ATSVTVLVSIGTSY----VNYRTVNGDYQGIARNRLNAAKSVAVDQLRTRHRADYQALFN 325
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV+I L R T + + P+ R+ + DP LLFQFGRYLLISSSR
Sbjct: 326 RVTIDLGR--------TAAADQ----PTDVRIAQHASTNDPQFAALLFQFGRYLLISSSR 373
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+ L+P+WDS VN NL MNYW + NLSEC P+FD + L++ G
Sbjct: 374 PGTQPANLQGIWNDSLTPSWDSKYTVNANLPMNYWPADTTNLSECFLPVFDMVKDLTVTG 433
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++ AQ Y A GWV HH TD W +S G W +W GGAWL T +W+HY +T D F
Sbjct: 434 ARVAQAQYGAGGWVTHHNTDAWRGASVVDG-AFWGMWQTGGAWLSTLIWDHYLFTGDSGF 492
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L+ YP L+G A F LD L+ H GYL TNPS SPE A A V TMD
Sbjct: 493 LQAN-YPALKGAAQFFLDTLVA-HPTLGYLVTNPSNSPELAHHAN----ASVCAGPTMDN 546
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
I+R++F A A+EVL + +V + RL P+++ G++ EW+
Sbjct: 547 QILRDLFDAAARASEVLGV-DTTFRSQVRTARDRLPPSRVGSRGNVQEWL 595
>gi|423223594|ref|ZP_17210063.1| hypothetical protein HMPREF1062_02249 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638219|gb|EIY32066.1| hypothetical protein HMPREF1062_02249 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 823
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 209/588 (35%), Positives = 316/588 (53%), Gaps = 49/588 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+ LS++R
Sbjct: 31 YDSPAEYWEEALPLGNGRLGAMVYGNPVNEEIQLNEETISAGAPYKNYNPETKNYLSEIR 90
Query: 77 SLVDSGQYAEA-TAASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ G+Y EA T A +L FG P YQ G + L F D +RRELDL
Sbjct: 91 QLIFEGKYPEAQTLAGERLLSKNGFGMP---YQTAGSLRLRFQDQE---GYTNFRRELDL 144
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V V++ RE F+S DQ+++ +++ S+ G L+F +L D +G
Sbjct: 145 EKAVASTTYTVDGVDYKREVFTSFADQLVIIRLTASQPGKLTFTTALTCPQDVDVTTSGK 204
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ + MEG G A ++F L++ + +G ++ D L V ++ A
Sbjct: 205 DAMTMEGVTKGNEFVEGA--------VRFRTDLKLNV---QGGKTSANDSTLIVTRANSA 253
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ L S++F IN D DP + L++ +Y+ H+ +YQK ++RVS+
Sbjct: 254 TIYLAISTNF----INYKDISGDPVKRNKVYLKNAGK-NYTKALQAHISEYQKYYNRVSL 308
Query: 311 QLSRSPK-DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
L R+ + D TD RVK F T DP LV L FQFGRYLLISSS+PG
Sbjct: 309 NLGRTAQADKPTDI-------------RVKEFATANDPHLVALYFQFGRYLLISSSQPGG 355
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG +
Sbjct: 356 QPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLPEMHEPFLQMIKELYENGQEA 415
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y GW++HH TD+W + A K WP AWLC HLW+ Y Y+ D+DFL +
Sbjct: 416 AREMYGCRGWMLHHNTDLWRMNGA-VDKAYCGPWPTCNAWLCHHLWDRYLYSGDKDFLAQ 474
Query: 490 RAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMAII 547
AYP+++ + F +D+L++ + GY+ PS SPE+ P + ++ TMD ++
Sbjct: 475 -AYPIMKSASEFFVDFLVKDPNTGYMVVTPSNSPENS--PPQWRTKANLFAGITMDNQLV 531
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++F+ AA +LEK+E + +L +L P ++ + G + EW +
Sbjct: 532 FDLFTNTERAARLLEKDE-LFCDTILSLRKQLPPMQVGQYGQLQEWFE 578
>gi|56962910|ref|YP_174637.1| hypothetical protein ABC1138 [Bacillus clausii KSM-K16]
gi|56909149|dbj|BAD63676.1| conserved hypothetical protein [Bacillus clausii KSM-K16]
Length = 782
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 200/596 (33%), Positives = 320/596 (53%), Gaps = 42/596 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+++T PA+ +T+A PIGNGR+GAMV+GGV E + LN D+LW+G P +
Sbjct: 1 MQLTEQQPAQTWTEAYPIGNGRIGAMVYGGVEHEKIALNVDSLWSGPPAKRKQAPVKGTV 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+D+R+ + + + A+ + + G Y LGD+ + F ++ Y R L L T
Sbjct: 61 ADMRAAIAARDFQAASRYAKDMQGPYTQSYLPLGDLHILF--PLCTHSSTRYERTLQLET 118
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
AT V+ + + R F+S PD+ I+ ++ LSF+ L S L + + +
Sbjct: 119 ATVTVEDGL----YKRSVFASKPDEAIILRLEAVAELPLSFSAWLTSPLRTIGWPD-QDH 173
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + G CP + + P + +P I+F++ +++ +D +A+++ KL
Sbjct: 174 VGLAGWCP-EYVAPNYVPSSEPIRYTSYETSSAIRFASAVQLLETDGN---AAVKNNKLV 229
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE + +A +L+ +SF + K+P + L +Y L +RHL DYQ
Sbjct: 230 VEDARYATVLVHMETSFASA---QAPQGKEPITLIRKRLSETVTSTYETLQSRHLQDYQS 286
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF R++ L+ + ++ ++ ++ER+ + + D LVELLFQ GRYLLI+
Sbjct: 287 LFQRMTFTLNETEREKLS------------TSERLAKYGAN-DGKLVELLFQMGRYLLIA 333
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR GT+ ANLQGIWNE + P W S +NIN +MNYW + L EC +P F+ LS
Sbjct: 334 SSREGTEAANLQGIWNEHIRPPWSSNYTLNINAQMNYWPAETAALPECHQPFLTFIEELS 393
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYN 479
G AQ Y GW HH +DIW ++ G VWA WPM WL HLWEHY
Sbjct: 394 EQGKAVAQNYYQCRGWTAHHNSDIWRQAEPVGGFGGGDPVWAFWPMAAPWLTRHLWEHYL 453
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
++ DR +L +RAYP+++G F LDWL++ G + T+PSTSPEH F+ G+ VS
Sbjct: 454 FSADRAYLTERAYPVMKGAILFCLDWLVQDESGAVYTSPSTSPEHRFLY-KGQPYPVSEG 512
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ MD+A++ +VF ++A E++ ++ L V +L +L+ ++ +G++ EW
Sbjct: 513 AVMDLALLEDVFHLFLAANELVGGDQQ-LATDVKDALNQLKKPPLSAEGALQEWTH 567
>gi|224536536|ref|ZP_03677075.1| hypothetical protein BACCELL_01411 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521792|gb|EEF90897.1| hypothetical protein BACCELL_01411 [Bacteroides cellulosilyticus
DSM 14838]
Length = 811
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 209/588 (35%), Positives = 316/588 (53%), Gaps = 49/588 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+ LS++R
Sbjct: 19 YDSPAEYWEEALPLGNGRLGAMVYGNPVNEEIQLNEETISAGAPYKNYNPETKNYLSEIR 78
Query: 77 SLVDSGQYAEA-TAASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ G+Y EA T A +L FG P YQ G + L F D +RRELDL
Sbjct: 79 QLIFEGKYPEAQTLAGERLLSKNGFGMP---YQTAGSLRLRFQDQE---GYTNFRRELDL 132
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V V++ RE F+S DQ+++ +++ S+ G L+F +L D +G
Sbjct: 133 EKAVASTTYTVDGVDYKREVFTSFADQLVIIRLTASQPGKLTFTTALTCPQDVDVTTSGK 192
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ + MEG G A ++F L++ + +G ++ D L V ++ A
Sbjct: 193 DAMTMEGVTKGNEFVEGA--------VRFRTDLKLNV---QGGKTSANDSTLVVTRANSA 241
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ L S++F IN D DP + L++ +Y+ H+ +YQK ++RVS+
Sbjct: 242 TIYLAISTNF----INYKDISGDPVKRNKVYLKNAGK-NYTKALQAHISEYQKYYNRVSL 296
Query: 311 QLSRSPK-DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
L R+ + D TD RVK F T DP LV L FQFGRYLLISSS+PG
Sbjct: 297 DLGRTAQADKPTDI-------------RVKEFATANDPHLVALYFQFGRYLLISSSQPGG 343
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG +
Sbjct: 344 QPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLPEMHEPFLQMIKELYENGQEA 403
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y GW++HH TD+W + A K WP AWLC HLW+ Y Y+ D+DFL +
Sbjct: 404 AREMYGCRGWMLHHNTDLWRMNGA-VDKAYCGPWPTCNAWLCHHLWDRYLYSGDKDFLAQ 462
Query: 490 RAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMAII 547
AYP+++ + F +D+L++ + GY+ PS SPE+ P + ++ TMD ++
Sbjct: 463 -AYPIMKSASEFFVDFLVKDPNTGYMVVTPSNSPENS--PPQWRTKANLFAGITMDNQLV 519
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++F+ AA +LEK+E + +L +L P ++ + G + EW +
Sbjct: 520 FDLFTNTERAARLLEKDE-LFCDTILSLRKQLPPMQVGQYGQLQEWFE 566
>gi|345513950|ref|ZP_08793465.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|423230895|ref|ZP_17217299.1| hypothetical protein HMPREF1063_03119 [Bacteroides dorei
CL02T00C15]
gi|423244606|ref|ZP_17225681.1| hypothetical protein HMPREF1064_01887 [Bacteroides dorei
CL02T12C06]
gi|229435764|gb|EEO45841.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|392630015|gb|EIY24017.1| hypothetical protein HMPREF1063_03119 [Bacteroides dorei
CL02T00C15]
gi|392641455|gb|EIY35231.1| hypothetical protein HMPREF1064_01887 [Bacteroides dorei
CL02T12C06]
Length = 824
Score = 347 bits (889), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 215/589 (36%), Positives = 325/589 (55%), Gaps = 51/589 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+A AL+ +R
Sbjct: 31 YDKPARYWEEALPLGNGRLGAMVYGNPVTEEIQLNEETVSAGSPYKNYNPEAKGALATIR 90
Query: 77 SLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ +G+Y EA A A K+ FG P YQ +G + L+F SH Y +RRELDL
Sbjct: 91 QLIFAGRYPEAQALAGEKILSKNGFGMP---YQTVGSLRLDFP-SHENYT--NFRRELDL 144
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V +++ RE F+S DQ+++ +++ S+ G L+F+ SL V+G
Sbjct: 145 EKAVATTAYTVNGIDYKREVFTSFVDQLVIVRLTASQPGKLTFSASLTCPQKVDVTVSGK 204
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N +I+EG G +D KG I F A L++ D +G S D L V ++
Sbjct: 205 NALILEGTTKG---------DDFTKGSICFRADLKL---DLQGGKSVAGDTLLSVTNANS 252
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A + + +++F +N D +P+ + ++++ +Y+ H+ YQK ++RVS
Sbjct: 253 ATIYIAMATNF----VNYKDISGNPSGRNKVSMKNAGK-NYARALQAHISAYQKYYNRVS 307
Query: 310 IQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L R S D TD R+K F +DP LV L FQFGRYLLISSS+PG
Sbjct: 308 LNLGRTSQADKPTDV-------------RIKEFAISDDPHLVALYFQFGRYLLISSSQPG 354
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG +
Sbjct: 355 GQPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLREMHEPFLQMVKELYENGQE 414
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
A+ Y GWV+HH TD+W + A DR WP AWLC HLW+ Y Y+ D+++L
Sbjct: 415 AAREMYGCRGWVLHHNTDLWRMNGAVDRAYC--GPWPTCNAWLCQHLWDRYLYSGDKEYL 472
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
YP+L+ + F +D+L+ + + GYL PS SPE+ GK A + TMD +
Sbjct: 473 AS-VYPILKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPSIWKGK-ANLFAGITMDNQL 530
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ ++FS SAA++L ++ + +L +L P ++ + G + EW +
Sbjct: 531 VSDLFSNTRSAAQILNLDKQ-FCDTILSLKRQLPPMQVGQYGQLQEWFE 578
>gi|293369104|ref|ZP_06615699.1| putative lipoprotein [Bacteroides ovatus SD CMC 3f]
gi|292635816|gb|EFF54313.1| putative lipoprotein [Bacteroides ovatus SD CMC 3f]
Length = 822
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 219/602 (36%), Positives = 334/602 (55%), Gaps = 53/602 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ + + + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDLE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WV 594
W+
Sbjct: 582 WM 583
>gi|284036403|ref|YP_003386333.1| alpha-L-fucosidase [Spirosoma linguale DSM 74]
gi|283815696|gb|ADB37534.1| Alpha-L-fucosidase [Spirosoma linguale DSM 74]
Length = 842
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 207/594 (34%), Positives = 322/594 (54%), Gaps = 49/594 (8%)
Query: 13 LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
LK+ +N PA K +T A+P+GNGRLGAMV+G E +KLNE T+W+G P NPDA A
Sbjct: 37 LKLWYNQPAGKVWTSALPVGNGRLGAMVYGNPEQELIKLNEATVWSGGPNRNDNPDALAA 96
Query: 72 LSDVRSLVDSGQYAEA---TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L ++R L+ +G+ AEA AA+++ + YQ +G+++L F + Y REL
Sbjct: 97 LPEIRRLIFAGKQAEAQKLAAANIETKKNNGMKYQPVGNLQLSFTGHQ---SVTNYYREL 153
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D+ A A Y+V V + R+ +S PDQVI +++ + G LSF L+S V
Sbjct: 154 DIEKAIATTMYTVDGVRYMRQVIASVPDQVIAVRLTADKPGKLSFTAFLNSPQKVQRSVE 213
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGS 247
+++M G + ++ KG + F+A + + + T + D + + G+
Sbjct: 214 ETTKLVMTGTT---------SDHEGVKGQVNFNAHVRVVAEGGQTTKT---DTSVVISGA 261
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
+ L + +++ ++ DP + + S L S++ + H+ YQ+ F R
Sbjct: 262 NATTLYVSMATNV----VDYKTLTADPKTRADSYLTPAAKRSFNAVLAAHVAAYQRYFKR 317
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V++ L S + +P+ ER++ F + DP LV L FQFGRYLLIS+S+P
Sbjct: 318 VNLDLGTS------------DAAKLPTDERIRQFASGNDPQLVSLYFQFGRYLLISASQP 365
Query: 368 GT-----QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
QVA LQG+WN+ + P WDS +NIN EMNYW + NL+E EPL + L
Sbjct: 366 SRNGVVGQVATLQGLWNDRMDPPWDSKYTININTEMNYWPAEVTNLTELHEPLVQMVKEL 425
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA+V Y ASGW+ HH TD+W + + + +++WPMGGAWL HLWE Y Y+
Sbjct: 426 SQTGQETARVMYGASGWLAHHNTDLW-RITGPVDPIYYSMWPMGGAWLSQHLWEKYQYSG 484
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLAC-VSYSS 540
D+ +L K YP ++G A F +D+L+E + YL P SPE+ AP + +
Sbjct: 485 DKAYL-KSVYPAMKGAAQFFVDYLVEDPNHHYLVVCPGMSPEN---APSTRPGVSIDAGV 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
TMD ++ ++F+ I AA+ L + D V+ V L +L P ++ + G + EW+
Sbjct: 541 TMDNQLVFDIFTNTIRAAQALGTDAD-FVKIVASKLAQLPPMQVGKHGQLQEWI 593
>gi|443288639|ref|ZP_21027733.1| Secreted Ricin B-related lectin [Micromonospora lupini str. Lupac
08]
gi|385888040|emb|CCH15807.1| Secreted Ricin B-related lectin [Micromonospora lupini str. Lupac
08]
Length = 952
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 220/585 (37%), Positives = 307/585 (52%), Gaps = 67/585 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N L+++R V + Q+ +
Sbjct: 61 ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANLAEIRRRVFADQWTQ 120
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G P YQ +GD+ L F + Y+R LDL TAT Y +
Sbjct: 121 AQDLINQTMLGSPVGQLAYQPVGDLRLAFGSAS---GASQYQRTLDLTTATTTTSYVLNG 177
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-----------DSLLDNHSYVNGNNQ 192
V F RE F+S PDQVIV +++ + +++F + D+ V+G+
Sbjct: 178 VRFQREMFASAPDQVIVIRLTADRANAITFTATFSSPQRTTVSSPDAATIGLDGVSGS-- 235
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
MEG R ANA+ + L+V G+ L
Sbjct: 236 --MEGITGQVRFLALANASVSGG------------------TVSSSGGTLRVSGATSVTL 275
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
L+ SS+ +N D + L + R + + L RH+ DYQ LF+RVSI L
Sbjct: 276 LVSIGSSY----VNYRTVNGDYQGIARRHLDAARAIGFDQLRGRHVADYQALFNRVSIDL 331
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVA 372
R+ T +++ D R+ + DP LLFQ+GRYLLISSSRPG+Q A
Sbjct: 332 GRT-------TAADQTTDV-----RIAQHASVNDPQFSALLFQYGRYLLISSSRPGSQPA 379
Query: 373 NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV 432
NLQGIWN+ ++P+WDS +N NL MNYW + NL+EC P+FD + L++ G++TAQV
Sbjct: 380 NLQGIWNDQMAPSWDSKFTINANLPMNYWPADTTNLAECYLPVFDMIKDLTVTGARTAQV 439
Query: 433 NYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
Y A GWV HH TD W SS + +W +W GGAWL T +W+HY +T D +FL Y
Sbjct: 440 QYGAGGWVTHHNTDAWRGSSV-VDEALWGMWQTGGAWLATMIWDHYQFTGDIEFLRAN-Y 497
Query: 493 PLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
P ++G A F LD L+ H GYL TNPS SPE A V TMD I+R++
Sbjct: 498 PAMKGAAQFFLDTLVS-HPTLGYLVTNPSNSPELRHHTN----ASVCAGPTMDNQILRDL 552
Query: 551 FSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWV 594
F+ + A+EVL N DA +VL + RL PT++ G++ EW+
Sbjct: 553 FNGVARASEVL--NVDATYRAQVLTARDRLPPTRVGSRGNVQEWL 595
>gi|300725824|ref|ZP_07059290.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
gi|299776871|gb|EFI73415.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
Length = 802
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 215/603 (35%), Positives = 323/603 (53%), Gaps = 46/603 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAPKA 71
+++ ++ PA +F +++PIGNG+LG +V+G +T+ LN+ TLWTG P D A
Sbjct: 23 MQLLYHEPAHYFEESLPIGNGKLGGLVYGNPKHDTIYLNDITLWTGKPVDLDEGKGASLW 82
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL-EFDDSHLKYAEETYRRELDL 130
L ++R + + Y +A + + L G + YQ LG ++L D +Y++ Y+R+LDL
Sbjct: 83 LPEIRKALFAENYRKADSLQLHLQGKNSAFYQPLGTLQLTSLTDE--RYSD--YQRQLDL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN-- 188
+++ ++ Y G V + RE+F+ NPD ++ +ISG + GS+S ++S+ SLL +
Sbjct: 139 DSSLVKISYRQGGVLYQREYFADNPDNMLAIRISGDKKGSVSMDISIGSLLPVQVKASLT 198
Query: 189 -------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
Q+ M G G + F +L+ + GT+ + K
Sbjct: 199 RSLQANTAQGQLTMLGHAQGV----------SSESTHFCTMLQARAQG--GTVQVIHGK- 245
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+VE +D ++ +V +SF G +P ++ L ++N SY +L +RH+ DY
Sbjct: 246 LRVEHADTLIIYIVNETSFAGADKHPVQDGAPYLAQVTDDLWHLQNYSYDELRSRHVADY 305
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYL 360
QK ++RV ++L T + + +DT + K+ Q D L L FQ+GRYL
Sbjct: 306 QKFYNRVKLRLG-------TVDHAPQTVDTWSLLKNYGKNHQAYLDRYLETLYFQYGRYL 358
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LIS SR ANLQG+WN L W VNINLE NYW + NLSE +EP+ DF+
Sbjct: 359 LISCSRTSGVPANLQGLWNHYLEAPWRGNYTVNINLEENYWPAEVANLSEMEEPIHDFMA 418
Query: 421 YLSINGSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWE 476
L+ NG TA Y + GW H +DIWAK++ R W+ W MGGAWL + LWE
Sbjct: 419 SLAQNGHFTAHHFYGIDRGWCSSHNSDIWAKTAPVGEGRESPEWSNWNMGGAWLSSTLWE 478
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLA 534
HY YT D DFL + AYP+L G + F+L WL++ G L T PSTSPE+E++ G
Sbjct: 479 HYLYTQDLDFLRRTAYPILNGASQFVLRWLVDNPQKSGELITAPSTSPENEYVTDKGYHG 538
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL---EKNEDAL-VEKVLKSLPRLRPTKIAEDGSI 590
Y T D+AIIRE+ + A +VL EK ED V ++L RL P + +DG +
Sbjct: 539 TTCYGGTADLAIIRELLLNTLHARQVLGLKEKKEDQKGYPTVSEALARLHPYTVGKDGDL 598
Query: 591 MEW 593
EW
Sbjct: 599 NEW 601
>gi|408378982|ref|ZP_11176577.1| alpha-L-fucosidase [Agrobacterium albertimagni AOL15]
gi|407747109|gb|EKF58630.1| alpha-L-fucosidase [Agrobacterium albertimagni AOL15]
Length = 805
Score = 346 bits (887), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 215/604 (35%), Positives = 316/604 (52%), Gaps = 60/604 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + P ++F +A+P+GNG LGAM+ GG + + LN+D W G P L
Sbjct: 27 RLWYTAPGRNFNEALPLGNGSLGAMIRGGTAEDLVCLNDDRFWAGRDAPAPVATGPLVLE 86
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR + +G A A A KL Y D+ +++D A E Y R+LDLNT
Sbjct: 87 EVRRRLFAGDVAGAEALVEQKLLTDFNQPYLTAADLVIQWDHD----AVERYTRQLDLNT 142
Query: 133 ATARVKY---SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
A A V Y VG V R FSS PDQV V ++ +SL S + S ++
Sbjct: 143 AVAEVNYVASRVGGVR--RRAFSSFPDQVFVLDAGFADPSQARTVLSLSSKTRHVSRMSA 200
Query: 190 NNQIIM-------EGRCPGKRIPPKANA--NDDP--KGIQFSAILEIKISDDRGTISALE 238
+ I++ + R RI N DP + + + +L +S +
Sbjct: 201 RDLIVVADAPSMVDWRGIDDRIRDGENIFYEVDPPRRCLTVACVLAASVS--------VH 252
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ L V G D+ VL+ + S G + + ++ L++ + +S L RH+
Sbjct: 253 GEGLVV-GGDFTVLVATSVGSDVGLLLE----------DCLARLEAAESRGFSALLERHV 301
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFG 357
++ L+ R ++ L RSP + +P+ ER+ + DP+L LLF +G
Sbjct: 302 AAHRALYDRAALTL-RSPV----------GLSALPTDERLHRQASKMRDPALEALLFNYG 350
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYL+I+SSRPG++ NLQGIWN+ + P W S +NINL+MNYW + PCNL+EC EPLFD
Sbjct: 351 RYLMIASSRPGSRAINLQGIWNDKVQPPWWSNYTININLQMNYWPAEPCNLAECHEPLFD 410
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG--------KVVWALWPMGGAW 469
F+ LS+ G++TA V Y GWV HH+ D +++A + + LW MGGAW
Sbjct: 411 FVKNLSLAGARTASVQYGMRGWVAHHQVDGRFQTTAIGALNGRAYDFPIRYGLWTMGGAW 470
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
LC H W+HY + D FL + A+P+L A F LDW++E DG L T PSTSPE+ ++ P
Sbjct: 471 LCQHFWQHYLFNGDTKFLRETAWPILRNAAEFYLDWVVELPDGSLTTAPSTSPENSYLLP 530
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
DG +S +TMD+AI+RE FS I+ AA VL +D + +LPRL IA DG
Sbjct: 531 DGTRHALSIGATMDIAILREFFSTIVDAASVLGIPDDPIAISASAALPRLPGYGIAADGQ 590
Query: 590 IMEW 593
++EW
Sbjct: 591 LLEW 594
>gi|330996330|ref|ZP_08320214.1| hypothetical protein HMPREF9442_01297 [Paraprevotella xylaniphila
YIT 11841]
gi|329573380|gb|EGG54991.1| hypothetical protein HMPREF9442_01297 [Paraprevotella xylaniphila
YIT 11841]
Length = 809
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 216/590 (36%), Positives = 314/590 (53%), Gaps = 55/590 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK +T+A+P+GN +LGAMV+GG E L+LNE+T W G P D NP+A L
Sbjct: 22 LKLWYGKPAKDWTEALPVGNSKLGAMVYGGTGREELQLNEETFWAGGPYDNNNPNALYVL 81
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR+L+ G+ EA F D Y +G + L+F H K + + R+LD+
Sbjct: 82 PVVRNLIFQGKTREAQRLVDANFFTRKDGMSYLTMGSLFLDFP-GHDKATD--FYRDLDI 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
ATA +Y V V + R F+S D VIV ++ ++G+L+F V D+ L + +G+
Sbjct: 139 GNATATTRYKVDGVAYARTVFASFTDSVIVVRLQADKAGALAFTVGYDAPLKHEVSADGD 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
++ C GK D +G++ + E ++ + + KKL+V G+ A
Sbjct: 199 ---MLSIACEGK----------DQEGVKAALCAECRVKVVSDGKTTADGKKLEVVGATKA 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L L A++++ ++ D D + + LQ + Y +H+ Y+ LF RV +
Sbjct: 246 TLYLSAATNY----VDYHDVSGDAAARADRCLQRAVQIPYKKALEKHVAYYRNLFGRVEL 301
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L T+ + E + R++ F DPSL LLFQ+GRYLLISSS+PG Q
Sbjct: 302 DLGE------TEAAARE------TPLRIRDFSQGGDPSLAALLFQYGRYLLISSSQPGGQ 349
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN + WDS +NIN EMNYW + NLSE +PLF L LS+ G+KTA
Sbjct: 350 PANLQGIWNRSTNAPWDSKYTININTEMNYWLAEVANLSEMHQPLFSMLEDLSVTGAKTA 409
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ Y GWV HH TD+W S G V +A +WP GGAWL HLW+HY +T D+ FL
Sbjct: 410 RDMYNCGGWVAHHNTDLWRIS----GVVDFAAAGMWPSGGAWLAQHLWQHYLFTADKKFL 465
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
K YP+L+G A F LD+L E H Y PS SPEH V+ TMD
Sbjct: 466 -KAYYPVLKGTARFFLDFLTE-HPSYKWWVVAPSVSPEH---------GPVTAGCTMDNQ 514
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+ + + A+E++ ++ A + + + L RL P ++ G + EW+Q
Sbjct: 515 IVFDALYNTLQASEIV-GDDAAFRDSLAQMLDRLPPMQVGRHGQLQEWLQ 563
>gi|456392980|gb|EMF58323.1| hypothetical protein SBD_0995 [Streptomyces bottropensis ATCC
25435]
Length = 974
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 219/573 (38%), Positives = 307/573 (53%), Gaps = 43/573 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N ++++R V + Q+
Sbjct: 61 ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANIAEIRRRVFADQWGP 120
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + G PA YQ +G++ L F + Y R LDL TATA Y +
Sbjct: 121 AQDLIDQTMLGSPAGQLAYQPVGNLLLSFGGA---TGVSQYNRTLDLTTATALTTYVLNG 177
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+S PD+VIV +++ + SL+FN + DS I ++G
Sbjct: 178 VRYQREVFASAPDRVIVVRLTADRANSLTFNATFDSPQRTTVSSPDGATIALDGTS---- 233
Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
A ++F A+ ++ GT+S+ L+V G+ +L+ SS+
Sbjct: 234 ----ATMEGIAGRVRFLALANAAVTG--GTVSS-SGGTLRVSGATSVTVLVSIGSSY--- 283
Query: 264 FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
+N + D + S L + R++ L +RHL DYQ LF+RVS+ L R+ T
Sbjct: 284 -VNFRNVAGDYQGTARSRLNAARDVGIDALRSRHLADYQALFNRVSVDLGRT-------T 335
Query: 324 CSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
+++ P+ R+ DP LLFQFGRYLLISSSRPGTQ ANLQGIWN+ ++
Sbjct: 336 AADQ-----PTDVRIAQHAQVNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDQMA 390
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
P+WDS +N NL MNYW + NLSEC P+FD + L++ G++ AQ Y A GWV HH
Sbjct: 391 PSWDSKFTINANLPMNYWPADTTNLSECFLPVFDMINDLTVTGARVAQAQYGAGGWVTHH 450
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
TD W +S G W +W GGAWL T +W+HY +T D DFL YP L+G A F L
Sbjct: 451 NTDAWRGASVVDG-AQWGMWQTGGAWLATLIWDHYLFTGDIDFLRSN-YPALKGAAQFFL 508
Query: 504 DWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
D L+ H GYL TNPS SPE P A V TMD I+R++F+++ A E+L
Sbjct: 509 DTLVA-HPTLGYLVTNPSNSPE----LPHHANATVCAGPTMDNQILRDLFNSVARAGELL 563
Query: 562 EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ + V RL P ++ G++ EW+
Sbjct: 564 GVDAAFRAQAVAAR-DRLAPMRVGSRGNVQEWL 595
>gi|388259826|ref|ZP_10136995.1| hypothetical protein O59_004218 [Cellvibrio sp. BR]
gi|387936552|gb|EIK43114.1| hypothetical protein O59_004218 [Cellvibrio sp. BR]
Length = 836
Score = 345 bits (886), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 210/603 (34%), Positives = 330/603 (54%), Gaps = 55/603 (9%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S+ + +P + + A+H+ +A+P+GNGRLGAMV+GGV + +++NE+T W G P + N
Sbjct: 29 SSPSVSPHTLWYEQAAQHWEEALPLGNGRLGAMVYGGVTRDNIQINENTFWAGGPHNNVN 88
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEE 122
P A ++L ++R L+ +G+Y A A + K G YQ G++ LEF +H +++
Sbjct: 89 PKALESLPEIRRLITAGEYLAAEALAEKTITSQGSNGMPYQTAGNLHLEFP-AHKQFSH- 146
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y R+LD+ A A +Y VG+V +TRE FSS DQV+V K+S S+ G LSF L
Sbjct: 147 -YYRDLDIGKAIATTRYQVGDVVYTREVFSSFVDQVVVVKLSASKPGQLSFTAHLSHPAT 205
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDK 240
N+ ++M+G + D +GI+ L + ++ G++S +
Sbjct: 206 MQFAQENNHTLLMQG------------MSKDHEGIKGQVKLATLVDVNTSGGSLSQ-NNN 252
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR---- 296
++ V +D A++L+ +++F +N D D + + + L S +N + YT
Sbjct: 253 RIAVSNADSALILISMATNF----VNYKDISGDALARARNYLASAKNQFTHNQYTARKHV 308
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H + Y++ F RV++QL +S ++E P+ +R++ F + DP L L FQF
Sbjct: 309 HSNFYKQYFDRVALQLGKS-------EFAQE-----PTDQRIRLFASRHDPELASLYFQF 356
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLIS S+PG Q NLQGIWN + P WDS +NIN EMNYW S L+E EP
Sbjct: 357 GRYLLISGSQPGGQPTNLQGIWNHRMDPPWDSKYTLNINAEMNYWPSEVTQLNELNEPFI 416
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
+ L+ G +TA+ Y A GW+ HH TDIW + D+ W WP AWL HLW
Sbjct: 417 QMVKELAQTGQQTAKEMYGARGWMAHHNTDIWRITGGIDK---TWGSWPTSNAWLSQHLW 473
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA 534
E Y Y+ D+ +L YP+++ +F D+LIE D +L +PS SPE+ AP
Sbjct: 474 EKYLYSGDKTYLAD-VYPVMKSAVTFFEDFLIESPDKKWLIVSPSMSPEN---APTATGV 529
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
++ TMD ++ ++ S I+AAE+L +K + + +K+L LP P +I + + E
Sbjct: 530 KIAAGVTMDNQLLFDLLSNTIAAAEILGQDKTQIPVWKKILSRLP---PMQIGKHHQLQE 586
Query: 593 WVQ 595
W++
Sbjct: 587 WLE 589
>gi|423241477|ref|ZP_17222590.1| hypothetical protein HMPREF1065_03213 [Bacteroides dorei
CL03T12C01]
gi|392641370|gb|EIY35147.1| hypothetical protein HMPREF1065_03213 [Bacteroides dorei
CL03T12C01]
Length = 824
Score = 345 bits (885), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 215/589 (36%), Positives = 325/589 (55%), Gaps = 51/589 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+A AL+ +R
Sbjct: 31 YDKPARYWEEALPLGNGRLGAMVYGNPVTEEIQLNEETVSAGSPYKNYNPEAKGALATIR 90
Query: 77 SLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ + +Y EA A A K+ FG P YQ +G + L+F SH Y +RRELDL
Sbjct: 91 QLIFADRYPEAQALAGEKILSKNGFGMP---YQTVGSLRLDFP-SHENYT--NFRRELDL 144
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V V++ RE F+S DQ+++ +++ S+ G L+F+ SL V+G
Sbjct: 145 EKAVATTAYTVNGVDYKREVFTSFVDQLVIVRLTASQPGKLTFSASLTCPQKVDVTVSGK 204
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N +I+EG G +D KG I+F A L++ D +G S D L V ++
Sbjct: 205 NALILEGTTKG---------DDFTKGSIRFRADLKL---DLQGGKSVAGDTLLSVTNANS 252
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A + + +++F +N D +P+ + ++++ +Y+ H+ YQK ++RVS
Sbjct: 253 ATIYIAMATNF----VNYKDISGNPSGRNKVSMKNAGK-NYARALQAHISAYQKYYNRVS 307
Query: 310 IQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L R S D TD R+K F +DP LV L FQFGRYLLISSS+PG
Sbjct: 308 LNLRRTSQADKPTDV-------------RIKEFAISDDPHLVALYFQFGRYLLISSSQPG 354
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG +
Sbjct: 355 GQPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLREMHEPFLQMVKELYENGQE 414
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
A+ Y GWV+HH TD+W + A DR WP AWLC HLW+ Y Y+ D+++L
Sbjct: 415 AAREMYGCRGWVLHHNTDLWRMNGAVDRAYC--GPWPTCNAWLCQHLWDRYLYSGDKEYL 472
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
YP+L+ + F +D+L+ + + GYL PS SPE+ GK A + TMD +
Sbjct: 473 AS-VYPILKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPSIWKGK-ANLFAGITMDNQL 530
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ ++FS SAA++L ++ + +L +L P ++ + G + EW +
Sbjct: 531 VSDLFSNTRSAAQILNLDKQ-FCDTILSLKRQLPPMQVGQYGQLQEWFE 578
>gi|423289667|ref|ZP_17268517.1| hypothetical protein HMPREF1069_03560 [Bacteroides ovatus
CL02T12C04]
gi|423298161|ref|ZP_17276220.1| hypothetical protein HMPREF1070_04885 [Bacteroides ovatus
CL03T12C18]
gi|392663702|gb|EIY57249.1| hypothetical protein HMPREF1070_04885 [Bacteroides ovatus
CL03T12C18]
gi|392667378|gb|EIY60888.1| hypothetical protein HMPREF1069_03560 [Bacteroides ovatus
CL02T12C04]
Length = 810
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 222/600 (37%), Positives = 325/600 (54%), Gaps = 55/600 (9%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
T LK+ ++ PA+ + +A+P+GN RLGAM++G E ++LNE+T+W G P NP
Sbjct: 16 TVRAEELKLWYSHPAEEWVEALPLGNSRLGAMIYGNPFEEEIQLNEETVWGGSPYRNDNP 75
Query: 67 DAPKALSDVRSLVDSGQYAEATA-------ASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
+A LS+VR L+ +G+ E TA A K G P YQ +G ++L F H KY
Sbjct: 76 EAYGVLSEVRKLIFAGR--EITAEKLWKEHAFTKQNGMP---YQTVGSLKLHFP-GHEKY 129
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y R+L++ A A V Y VG+V +TR F+S D ++ + S++F S +
Sbjct: 130 TD--YYRDLNIENAVATVSYKVGDVTYTRTLFTSLADNALIIHLEADRPHSIAFEASYST 187
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD-PKGIQFSAILEIKISDDRGTISALE 238
+ + + N++ + KA+A+++ P I+ + IK S G + + +
Sbjct: 188 PFEESAVIASKNRLTLSA---------KASAHEEVPAAIRLESQARIKTSG--GKVES-D 235
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ KL V +D + + A+++F +N D + + L + SY L H+
Sbjct: 236 NGKLIVTEADVVTIYVSAATNF----VNYQDVSANESKRVDVILNQVGKKSYRQLLDSHI 291
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
YQ+ F RV + L S S++ R+K F+ +DP+LV L+FQFGR
Sbjct: 292 GKYQQQFGRVKLDLGHS-------LASQKETPV-----RLKEFREGKDPALVTLMFQFGR 339
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSS+PG Q ANLQGIWN+ L WD +NIN EMNYW + NL E EPLF
Sbjct: 340 YLLISSSQPGGQPANLQGIWNQHLLAPWDGKYTININTEMNYWPAEITNLPETHEPLFRL 399
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ G KTAQ Y +GWV HH TDIW + G + WP GGAWL HLW+HY
Sbjct: 400 VNELAETGKKTAQTMYHCNGWVAHHNTDIWRATGPVDGP-FYGTWPNGGAWLSQHLWQHY 458
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACV 536
YT D+DFL K YP+L+G A F +D+L+E H Y L T PS SPE AP GK +
Sbjct: 459 LYTGDKDFLIKN-YPVLKGAADFYMDFLVE-HPQYHWLVTIPSISPEQG--AP-GKETSL 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ TMD I+ +V S + AA+++ ED + + +V K L RL P +I + + EW++
Sbjct: 514 TAGCTMDNQIVFDVLSNTLQAAKIV--GEDIVYQDRVKKVLDRLPPMQIGKYNQLQEWLE 571
>gi|424878767|ref|ZP_18302405.1| hypothetical protein Rleg8DRAFT_5620 [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392520277|gb|EIW45007.1| hypothetical protein Rleg8DRAFT_5620 [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 747
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 215/586 (36%), Positives = 306/586 (52%), Gaps = 46/586 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G E L++NE T W G P NPDA L VR
Sbjct: 8 YDAPAQLWTDALPLGNGRLGAMVFGDPLREHLQINESTFWAGGPYQPVNPDAFGHLGTVR 67
Query: 77 SLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEET--YRRELDLN 131
L+ G YA+A A A +L P YQ +GD+ LEF K+AE YRR LDL+
Sbjct: 68 QLIFDGHYADAEALAEKRLMARPIKQMSYQPIGDLRLEF-----KFAESVSGYRRALDLD 122
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA A Y+ + + RE F S D V+V ++S ++S +S+DS + +
Sbjct: 123 TAIATSSYTANGIAYLREAFVSPVDGVLVLRLSADRKRAISCRISIDSPQQGEMSIGEGS 182
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
Q+ G+ GK A A ++F+ +++ + GT+ A L VEG+D +
Sbjct: 183 QLSFSGK--GKAESGIAAA------LRFA--FGVRLINSGGTVKA-SGGGLSVEGADEVL 231
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ L A++SF D P + + L+ + + L H+ ++++LF +I
Sbjct: 232 VFLDAATSFR----RYDDVLGHPERDIVDRLERAASRDFVSLRDDHIAEHRRLFSAFAID 287
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
L +P ++P+ +R+ F +DP+L L QFGRYL+I+SSRPGTQ
Sbjct: 288 LGSTPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQP 335
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN P W S NINL+MNYW P NL EC EPL + L+ G A
Sbjct: 336 ANLQGIWNAQTDPPWGSKYTANINLQMNYWLPAPANLRECLEPLVEMAEELAETGKAMAH 395
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
V+Y ASGWV+HH TD+W + G W LWPMGG WL L + +Y D + + +R
Sbjct: 396 VHYRASGWVMHHNTDLWRATGPIDG-AKWGLWPMGGIWLMAQLLDACDYLDDAEAMRRRL 454
Query: 492 YPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
+P+ A FL D L+ G D YL TNPS SPE+ P G C MD +IR+
Sbjct: 455 FPIAREAAHFLFDVLVPFPGTD-YLVTNPSLSPENAH--PYGASICA--GPAMDSQLIRD 509
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
F ++ V E LV + + L RL P +I +G + EW++
Sbjct: 510 -FLGLLRPLAVSIGGEPELVADIDRVLSRLAPDRIGANGQLQEWLE 554
>gi|408371866|ref|ZP_11169623.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
gi|407742715|gb|EKF54305.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
Length = 803
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 213/601 (35%), Positives = 330/601 (54%), Gaps = 60/601 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA+ + +IP+GNGRLGAM GGV E + LN+ TLW+G P D +P+A K L
Sbjct: 26 LKLWYKQPAELWEGSIPLGNGRLGAMPDGGVSQENIVLNDITLWSGGPQDADDPNAIKYL 85
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
++R L+ G+ ++A A K F G+ ADV YQ+LG++ + HL
Sbjct: 86 PEIRRLLFEGKNSQAEALMYKTFVSKGPGSGKGNGADVPYGSYQILGNLHFNY---HLPN 142
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+RELD+ ATA +SV VE+TRE+F+S D VIV K++ S++ +SF++ +D
Sbjct: 143 KAQDYKRELDITNATATTTFSVDGVEYTREYFTSFSDDVIVFKLTASKAAQISFDLGVDR 202
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+ + +++M+G+ N D G++++ L +++ + GT+ A +D
Sbjct: 203 P-ERFTTTTQGEELLMQGQL---------NNGTDGNGMKYA--LRVRVIPEGGTLKA-KD 249
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSIRNLSYSDLYTRH 297
L+V G++ AV+L+ A++ + F+ P E + L Y+ L H
Sbjct: 250 GTLQVNGANSAVILISAATDY---FV--------PNVEQWVETQLDKAEKKPYNTLKETH 298
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQF 356
+D Y+ +F R SI+L SE + +P+ ER+K F+ T +DP L EL FQ+
Sbjct: 299 IDFYKNMFDRASIELG-----------SETQAEALPTDERLKRFEITKDDPGLAELYFQY 347
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYL ISS+RPG NLQG+W + W+ H+NINL+MN+W NL +P +
Sbjct: 348 GRYLAISSTRPGLLPPNLQGLWANTVQTPWNGDYHLNINLQMNHWPIDVVNLPMLNQPYY 407
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G KTA+ Y GWV H T+IW +S W G W+C LW
Sbjct: 408 KLIKGLVEPGEKTAKTYYGGDGWVAHVITNIWGYTSPGE-HPSWGSTNSGSGWMCQMLWR 466
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLAC 535
HY + D D+L K+ YP+L+G A F L+E D +L T PS SPE+ F +G+ A
Sbjct: 467 HYAFNQDMDYL-KKIYPILKGSAQFYNSTLVEHPDRDWLVTAPSNSPENAFFLTNGEKAN 525
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWV 594
V+ + T+D IIR +F +I A+++L+ D K LK + +L P +IA++G +MEW+
Sbjct: 526 VAIAPTIDNQIIRSLFQNVIEASQLLDV--DKQFRKQLKHRITKLPPNQIAKNGRLMEWI 583
Query: 595 Q 595
+
Sbjct: 584 K 584
>gi|298482732|ref|ZP_07000916.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
gi|298271195|gb|EFI12772.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
Length = 823
Score = 343 bits (880), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 218/590 (36%), Positives = 324/590 (54%), Gaps = 53/590 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+A AL+ +R
Sbjct: 32 YDKPARYWEEALPLGNGRLGAMVYGNPVAEEIQLNEETVSAGSPYKNYNPEAKGALATIR 91
Query: 77 SLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ +G+Y EA A K+ FG P YQ +G + L+F SH Y +RRELDL
Sbjct: 92 QLIFAGRYPEAQELAGEKILSKNGFGMP---YQTVGSLCLDFP-SHENYT--NFRRELDL 145
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V V++ RE F+S DQ+++ +++ S+ G L+F+ SL V+G
Sbjct: 146 EKAVATTAYTVNGVDYKREVFTSFVDQLVIVRLTASQPGKLTFSASLTCPQKVDVTVSGK 205
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N + +EG G +D KG I+F A L++ D +G S D L V ++
Sbjct: 206 NALTLEGTTKG---------DDFTKGSIRFRADLKL---DLQGGKSVAGDTLLSVTNANS 253
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A + + +++F +N D +P+ + ++++ +Y H+ YQK ++RVS
Sbjct: 254 ATIYIAMATNF----VNYKDISGNPSGRNKVSMKNAGK-NYVRALQAHISAYQKYYNRVS 308
Query: 310 IQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L R S D TD R+K F +DP LV L FQFGRYLLISSS+PG
Sbjct: 309 LNLGRTSQADKPTDV-------------RIKEFAISDDPHLVALYFQFGRYLLISSSQPG 355
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG +
Sbjct: 356 GQPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLREMHEPFLQMVKELYENGQE 415
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
A+ Y GWV+HH TD+W + A DR WP AWLC HLW+ Y Y+ D+++L
Sbjct: 416 AAREMYGCRGWVLHHNTDLWRMNGAVDRAYC--GPWPTCNAWLCQHLWDRYLYSGDKEYL 473
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
YP+L+ + F +D+L+ + + GYL PS SPE+ GK A + TMD +
Sbjct: 474 AS-VYPILKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPSIWKGK-ANLFAGITMDNQL 531
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
+ ++FS SAA++L N+D + SL R L P ++ + G + EW +
Sbjct: 532 VSDLFSNTRSAAQIL--NQDKQFCDTILSLKRQLPPMQVGQYGQLQEWFE 579
>gi|241518404|ref|YP_002979032.1| Alpha-L-fucosidase [Rhizobium leguminosarum bv. trifolii WSM1325]
gi|240862817|gb|ACS60481.1| Alpha-L-fucosidase [Rhizobium leguminosarum bv. trifolii WSM1325]
Length = 747
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 215/586 (36%), Positives = 307/586 (52%), Gaps = 46/586 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G E L++NE T W G P NPDA L VR
Sbjct: 8 YDAPAQLWTDALPLGNGRLGAMVFGDPLREHLQINESTFWAGGPYQPVNPDAFGHLGTVR 67
Query: 77 SLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEET--YRRELDLN 131
L+ G YA+A A A +L P YQ +GD+ LEF K+AE YRR LDL+
Sbjct: 68 QLIFDGHYADAEALAEKRLMARPIKQMSYQPIGDLRLEF-----KFAESVSGYRRALDLD 122
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA A Y+ + + RE F S D V+V ++S ++S +S+DS + +
Sbjct: 123 TAIATSSYTANGIAYLREAFVSPVDGVLVLRLSADRKRAISCRISIDSPQQGEMSIGERS 182
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ G+ GK A A ++F+ +++ + GT++A L VEG+D +
Sbjct: 183 LLSFSGK--GKAESGIAAA------LRFA--FGVRLINSGGTVNA-SGGGLSVEGADEVL 231
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ L A++SF D P + + L+ + + L H++++++LF +I
Sbjct: 232 VFLDAATSFR----RYDDILGHPERDIIDRLERAASRDFVSLRDDHIEEHRRLFSAFAID 287
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
L +P ++P+ +R+ F +DP+L L QFGRYL+I+SSRPGTQ
Sbjct: 288 LGSTPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQP 335
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN P W S NINL+MNYW P NL EC EPL + L+ G A
Sbjct: 336 ANLQGIWNAQTDPPWGSKYTANINLQMNYWLPAPANLRECLEPLVEMAEELAETGKVMAH 395
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
V+Y A GWV+HH TD+W + G W LWPMGG WL L E +Y D + + +R
Sbjct: 396 VHYRARGWVMHHNTDLWRATGPIDG-AKWGLWPMGGIWLMAQLLEACDYLDDAEAMRRRL 454
Query: 492 YPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
+P+ A FL D L+ G D YL TNPS SPE+ P G C MD +IR+
Sbjct: 455 FPIALEAAHFLFDVLVPFPGTD-YLVTNPSLSPENAH--PYGASICA--GPAMDSQLIRD 509
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
F ++ V E LV + + LPRL P +I +G + EW++
Sbjct: 510 -FLGLLRPLAVSIGGEPELVADIDRVLPRLAPDRIGANGQLQEWLE 554
>gi|302867165|ref|YP_003835802.1| cellulose-binding family II protein [Micromonospora aurantiaca ATCC
27029]
gi|302570024|gb|ADL46226.1| cellulose-binding family II [Micromonospora aurantiaca ATCC 27029]
Length = 936
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 219/590 (37%), Positives = 312/590 (52%), Gaps = 44/590 (7%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N L + ++ PA + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N
Sbjct: 44 NDLALWYDEPAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGA 103
Query: 70 KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
L+++R V + Q+ A + + G P YQ +GD+ L F + Y R
Sbjct: 104 ANLAEIRRRVFADQWTLAQDLINQTMMGSPGGQLAYQTVGDLRLAFGSAS---GATQYNR 160
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TAT Y G V + RE F+S PDQV+V +++ + +++F+ + DS
Sbjct: 161 TLDLTTATVTTTYVQGGVRYQREVFASAPDQVMVLRLTADRANAITFSAAFDSPQRTTVS 220
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ ++G + ++F A+ ++ GT+S+ L+V G
Sbjct: 221 SPDGATVALDG--------VSGSMEGVTGSVRFLALANAAVTG--GTVSS-SGGTLRVSG 269
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +L+ SS+ +N D + + L + ++++ L TRH DYQ LF
Sbjct: 270 ATSVTVLVSIGSSY----VNYRTVNGDYQGIARNRLNAAKSVAVDQLRTRHRADYQALFD 325
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV+I L R T + + P+ R+ + DP LLFQFGRYLLISSSR
Sbjct: 326 RVTIDLGR--------TAAADQ----PTDVRIAQHASTNDPQFAALLFQFGRYLLISSSR 373
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIW++ L+P+WDS VN NL MNYW + NLSEC P+FD + L++ G
Sbjct: 374 PGTQPANLQGIWSDSLTPSWDSKYTVNANLPMNYWPADTTNLSECFLPVFDMVKDLTVTG 433
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++ AQ Y A GWV HH TD W +S G W +W GGAWL T +W+HY +T D F
Sbjct: 434 ARVAQAQYGAGGWVTHHNTDAWRGASVVDG-AFWGMWQTGGAWLSTLIWDHYLFTGDSGF 492
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L+ YP L+G A F LD L+ H GYL TNPS SPE A A V TMD
Sbjct: 493 LQAN-YPALKGAAQFFLDTLVA-HPTLGYLVTNPSNSPELAHHAN----ASVCAGPTMDN 546
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
I+R++F A A+EVL + +V + RL P+++ G++ EW+
Sbjct: 547 QILRDLFDAAARASEVLGV-DTTFRSQVRTARDRLPPSRVGSRGNVQEWL 595
>gi|383642312|ref|ZP_09954718.1| alpha-l-fucosidase [Sphingomonas elodea ATCC 31461]
Length = 788
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 209/597 (35%), Positives = 321/597 (53%), Gaps = 42/597 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
S +PL + + PA+ + +A+P+GNGRLGAMV+GG +E +LNEDT + G P D
Sbjct: 33 GGAGASPRDPLTLWYRQPAQEWVEALPLGNGRLGAMVFGGTTTERFQLNEDTFFAGSPYD 92
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKY 119
TNP A A+ +R LV G+ EA A + K + G PA YQ +GD+ L F
Sbjct: 93 ATNPAAGPAIRRIRQLVFEGKGKEAQALADKDVIGRPAGQMPYQPIGDLLLLFPGLE--- 149
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS-GSESGSLSFNVSLD 178
Y R LDL+ A A ++ G+ RE +S DQVI +++ G G ++ ++L
Sbjct: 150 GIRGYERSLDLDGAIATTRFRTGSTTHVREAIASAVDQVIAIRLTAGQGRGGVTTTLALT 209
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + S+V G + +++ G PG R P GI+F + + +D G ++A +
Sbjct: 210 SPQQSESFVEGGDTLVLRGIGPGAR--------GVPGGIRFETRVRMIATD--GIVTAGK 259
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
L VE + VLLLVA+++ + D DP++ + + + ++ L H
Sbjct: 260 -SDLSVEQAS-EVLLLVATAT---SYRRWDDIGGDPSAIVRAQIDAAAGKGWARLLADHQ 314
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
D+++LF R+++ L R+P +P+ ER++ +DP+L L QFGR
Sbjct: 315 ADHRRLFRRMTLDLGRTPAA------------ALPTDERIRRSTELDDPALATLYHQFGR 362
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI++SRPGTQ ANLQGIWNE + P+WDS +NIN EMNYW + L E EPL
Sbjct: 363 YLLIAASRPGTQPANLQGIWNERVHPSWDSKWTLNINAEMNYWPADMTGLGELTEPLLRL 422
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ LS+ G +TA+ ++ A GW+ +H D++ ++ G VW LWPM GAWL + LW+H+
Sbjct: 423 VKELSVAGQRTARNDWGARGWMSYHNVDLFRNTALIDG-AVWGLWPMAGAWLLSSLWDHW 481
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
+Y+ DR FL + YPL+ G F LD L+ G L NPS SPE++ A V+
Sbjct: 482 DYSRDRTFLAE-LYPLMAGACDFYLDALVPHPTTGELVMNPSNSPENQHHAG----ISVT 536
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ MD ++R++F AA +L ++E + + +I + G + EW+
Sbjct: 537 AGAAMDSQLLRDLFGRTAEAARLLGRDESRARAVLAARARLPK-DRIGKAGQLQEWL 592
>gi|365122610|ref|ZP_09339511.1| hypothetical protein HMPREF1033_02857 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642358|gb|EHL81716.1| hypothetical protein HMPREF1033_02857 [Tannerella sp.
6_1_58FAA_CT1]
Length = 852
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 214/587 (36%), Positives = 312/587 (53%), Gaps = 43/587 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN RLGAMV+G +E ++LNE+T+W G P NP+A L
Sbjct: 64 LKLWYKQPATQWVEALPLGNSRLGAMVYGIPDNEEIQLNEETVWGGGPHRNDNPEAKDIL 123
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+VR L+ G+ EA K F P + YQ +G ++L FD H Y + Y R+LDL
Sbjct: 124 PEVRRLIFEGKSKEAKPIMEKKFRTPRNGMPYQTIGSLKLHFD-GHENYTD--YYRDLDL 180
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V +TRE F+S D V++ +I+ + G+L+F S L H+
Sbjct: 181 TRAVATTRYKVNGVTYTRELFTSFADNVVIMQITSDKQGALNFTADYVSPL-KHTVSTKK 239
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
++I+ G+ A+ P I+ IK +D + S D K+ V + A
Sbjct: 240 GKLILSGKG--------ADHEGVPGVIRLENQTFIKTTDGKVKTS---DNKISVSDATTA 288
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ + A+++F +N +D + + + +++ Y H+ Y+KLF RV++
Sbjct: 289 TIYISAATNF----VNYNDVSANEHKRADAYMKAALKKPYEKALADHIAYYKKLFDRVTL 344
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L S + EE + RVK+F+ D SL L+FQFGRYLLISSS+PG Q
Sbjct: 345 DLGTSKE------AQEE------THLRVKNFKNGNDVSLAVLMFQFGRYLLISSSQPGGQ 392
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWNE L WD +NIN EMNYW + NLSE EPL + LS++G +TA
Sbjct: 393 PANLQGIWNEKLQAPWDGKYTININTEMNYWPAEVTNLSETHEPLIQMVKELSVSGQETA 452
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y +GWV HH TD+W G +WP GGAWL H+W+HY YT D+++L+
Sbjct: 453 KEMYGCNGWVTHHNTDLWRSCGPVDGADY--VWPNGGAWLSQHVWQHYLYTGDKEYLQD- 509
Query: 491 AYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
YP L+G A F LD+L E H Y + T PS+SPEH P G + TMD I
Sbjct: 510 VYPALKGVADFFLDFLTE-HPTYKWMVTVPSSSPEH---GPRGNGNSIVAGCTMDNQIAF 565
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ S + A ++L + D K+ + RL P +I + + EW+Q
Sbjct: 566 DALSNALQATKILNGDAD-YCNKLQNMIDRLAPMQIGQYNQLQEWLQ 611
>gi|281421059|ref|ZP_06252058.1| putative large secreted protein [Prevotella copri DSM 18205]
gi|281404977|gb|EFB35657.1| putative large secreted protein [Prevotella copri DSM 18205]
Length = 790
Score = 342 bits (878), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 208/606 (34%), Positives = 326/606 (53%), Gaps = 48/606 (7%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
+ A S PLK+ +N PA F +++PIGNG+LGA+++GG ++++ LN+ TLWTG P
Sbjct: 17 LQAVPKSNIPPLKLWYNKPATAFEESLPIGNGKLGALIYGGANNDSIYLNDITLWTGKPV 76
Query: 62 DYT-NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
+ DA K + +R + Y A + + + GH ++ YQ L I ++ D + +++
Sbjct: 77 NREEGGDAYKWIPKIREALFKEDYKAADSLQLHVQGHNSEYYQPLAIINIK-DANKGQFS 135
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y+REL L+ ATA + Y+ G +++ RE+F+S+PD++I ++ ++ +++ ++SL SL
Sbjct: 136 --NYKRELSLDNATAALSYTRGGIQYQREYFASHPDKMIAIHLTATQKKAINCDISLTSL 193
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ H N Q+ + G GK I F +IL IK D GTI+A D
Sbjct: 194 IP-HQVKASNKQLTITGHAMGK----------PENSIHFCSILSIKNQD--GTITA-SDS 239
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR-------NLSYSDL 293
L ++G AV+ LV +S++G K P E ++ + N +Y +L
Sbjct: 240 ILHLQGVSEAVIYLVNETSYNG-------FDKHPVKEGAPYIEKVNDNAWHLVNYTYPEL 292
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
RH+ DYQ +F+R L + D T ++ D E ++P L L
Sbjct: 293 KQRHITDYQNIFNRAKFALKGAKFD-NKRTTDQQLFDYTEKEE--------QNPYLEMLY 343
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQ+GRYLLIS SR ANLQG+W W +NINLE NYW + N+SE
Sbjct: 344 FQYGRYLLISCSRTPGIPANLQGLWAPARKSPWRGNYTININLEENYWPAEVTNMSELVM 403
Query: 414 PLFDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAW 469
P+ + +S+ G TA+ Y + +GW H TD WA ++ + W+ W MGGAW
Sbjct: 404 PVDGLVKAMSVTGKYTAKHYYGIENGWCGGHNTDAWAMTNPVGTKKESPKWSNWNMGGAW 463
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFI 527
L LW+HY+YT D+++L + AYPL++G A F+LDW+IE G L T P TSPE E+I
Sbjct: 464 LVQTLWDHYDYTRDKEYLRQTAYPLMKGAADFMLDWIIENPKKPGELLTAPCTSPEAEYI 523
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
G C Y T D+ I+RE+F + A++L+ ++ A K+ ++ RL P +I +
Sbjct: 524 TDKGYQGCSFYGGTADLTILRELFKNTLKGAQILDIDQ-AYQAKLQDAINRLHPYQIGKR 582
Query: 588 GSIMEW 593
G++ EW
Sbjct: 583 GNLQEW 588
>gi|329928902|ref|ZP_08282716.1| hypothetical protein HMPREF9412_5464 [Paenibacillus sp. HGF5]
gi|328937273|gb|EGG33698.1| hypothetical protein HMPREF9412_5464 [Paenibacillus sp. HGF5]
Length = 874
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 208/626 (33%), Positives = 318/626 (50%), Gaps = 67/626 (10%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S L++ ++ PA + +A+PIGNGRLG MV+G E ++LNED+LW G PG NP+
Sbjct: 52 SANRRLRLWYDSPAAEWNEALPIGNGRLGGMVFGKPSLERVQLNEDSLWYGGPGRGGNPN 111
Query: 68 APKALSDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETY 124
A + LS++R ++ G+ AEA A + + P YQ LGD+ L+F D + E Y
Sbjct: 112 ASRYLSEIRQMLFDGRQAEAEHLARMAMTSSPKYEQPYQPLGDLLLKFLDG--EETVEHY 169
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDN 183
RELDL + V YS + F R++F++ PD V+V ++S G+L+F +L D
Sbjct: 170 ERELDLERSMVTVSYSSRGIRFRRQYFATAPDGVLVIRLSADRPGALTFAANLMRRPFDG 229
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ ++ ++MEG C GI F + ++ + G + + D L
Sbjct: 230 GTASLRHDTLLMEGEC-------------GADGISFG--MALRAAAVGGIVQTIGDF-LS 273
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VEG+D LLL A +SF + P + L +SY L RH +Y++
Sbjct: 274 VEGADSVTLLLSAQTSF---------RCRQPVQVCLEQLDRAAGMSYEQLVNRHQAEYRE 324
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENI------DTVPSAERVK----------SFQTDE-- 345
F R S+ L C + + + +++RV+ S TD
Sbjct: 325 KFERFSLTLGTGKNGAGRTECVDSGTSFSNGTEVIRASDRVEYPNGIEDDQPSLPTDRRL 384
Query: 346 -----------------DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
DP L+ L Q+GRYLLIS SRP + ANLQGIWN+ +P W+S
Sbjct: 385 NLLKDRVKTEGASAENSDPELIALYVQYGRYLLISCSRPESLAANLQGIWNDSFTPPWES 444
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
+N+N++MNYW + L+EC EPLFD + + NG TA+ Y G+ HH T++W
Sbjct: 445 KYTINVNIQMNYWPAELLGLAECHEPLFDLIDRMLPNGRDTAREMYGCRGFAAHHNTNLW 504
Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
++ + + +WPMG AWLC HLWEHY + D DFL +RAYP+++ A FLLD++
Sbjct: 505 GETRPEGILMTCTVWPMGAAWLCLHLWEHYRFGGDADFLRERAYPVMKEAAEFLLDYMTV 564
Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
+G T PS SPE+ F+ +G + + MD I +F A + A ++ +E A
Sbjct: 565 DEEGRRMTGPSVSPENRFVLSNGAVGSLCMGPAMDGQIATALFRACLEAGHLV-GDEPAF 623
Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ ++ +L + +I G IMEW+
Sbjct: 624 LGELQTALEEIPAPQIGRHGGIMEWL 649
>gi|260066219|gb|ACX30659.1| Fuc19 [Sphingobacterium sp. TN19]
Length = 821
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 216/593 (36%), Positives = 324/593 (54%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPA--KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
LK+ ++ P + + A+PIGNGRLGAMV+G E L+LNE+T++ G P NP+A
Sbjct: 33 LKLWYDQPVVDQIWEQALPIGNGRLGAMVYGIPEREELQLNEETIYAGGPYRNDNPNALN 92
Query: 71 ALSDVRSLVDSGQYAEATAAS-----VKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
AL ++ L+ +G+ EA + K G P YQ G + L F D H Y + Y
Sbjct: 93 ALPQIQQLIFAGKTEEADRLTNQSFFTKTHGMP---YQTAGSVILNFPD-HKHY--QHYY 146
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
RELDL A R +Y+V V +TR+ FSS D VIV +I+ S+ G+L+F++ + +
Sbjct: 147 RELDLEKAVVRSRYTVEGVTYTRQVFSSFADDVIVMEITASKKGALNFDLEYANPSECKV 206
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKV 244
Y +G + +I+EG +++ +G I++ +K D R T L D KL V
Sbjct: 207 YKSGQS-LILEG---------SGTSHEGIEGKIRYQKHTAVKNKDGRVT---LTDNKLTV 253
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ V+ + +++F +N ++ ++ S L + ++ +H+ Y K
Sbjct: 254 SGATSVVIYMAVATNF----VNYKTVDQNAGVKAASTLALAQKKAFQTALKQHIAMYSKQ 309
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F R + L + T +EN+ T +R++SF+T +DP+LV LL QFGRYLLI S
Sbjct: 310 FARFKLDLGQ--------TAGQENLTTT---KRIESFKTTQDPALVALLVQFGRYLLICS 358
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN ++P WDS VNIN EMNYW + NLSE EPLF + LS
Sbjct: 359 SQPGGQPANLQGIWNRSMNPPWDSKYTVNINTEMNYWPAEVTNLSETHEPLFQLIKELSE 418
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +TA+V Y A GWV HH TD+W +S +WP GG WL HLWEHY YT D+
Sbjct: 419 SGRETARVLYGADGWVTHHNTDLWRVTSPIDFAAA-GMWPTGGTWLTQHLWEHYLYTGDQ 477
Query: 485 DFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL + YP+++G A F+L LI H +L PS SPEH +S TM
Sbjct: 478 KFLTE-VYPVMKGAADFILSILIAHPKHKDWLVIAPSISPEH---------GPISTGITM 527
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D + ++ + A+E+++++ A K++K+ +L P ++ + EW++
Sbjct: 528 DNQLAFDILTRTALASEIVDQDA-AYKAKLIKTARKLPPMQVGRYAQLQEWLE 579
>gi|295132887|ref|YP_003583563.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
gi|294980902|gb|ADF51367.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
Length = 820
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 207/590 (35%), Positives = 330/590 (55%), Gaps = 46/590 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + ++ PAK + +A+P+GNGRLGAMV+G ET++LNE+T+W G PG+ + + L
Sbjct: 27 MTLNYDEPAKVWEEALPVGNGRLGAMVFGRTGMETIQLNEETVWAGEPGNNVVTLSEEQL 86
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD----VYQLLGDIELEFDDSHLKYAEETYRREL 128
++R + +Y +A + K + YQ +G++ L F +S+ A Y+REL
Sbjct: 87 EEIRKAIFQEEYQKAQQLADKYLSKKDNNSGMSYQTVGNLILNFPNSN---AVRDYKREL 143
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D++ A + V Y G V + R SS PD VI+ +++ ++ GS+SF + L S +H
Sbjct: 144 DISKAVSTVTYKTGGVAYKRRIISSFPDDVIMVELTANKPGSISFEMGLKSPHKSHDIQI 203
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N+++ + G ++ ++ KG ++F I + KI + G I E++ LK+ G+
Sbjct: 204 KNDEVWLSGT---------SSDQENKKGKVKFLVIAKPKI--EGGRIETTENR-LKITGA 251
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
+ AV+ + +S+F N D +D S++++ L ++ + H+ +YQ+ F+R
Sbjct: 252 NRAVIYISIASNFK----NYKDLSEDAESKAIALLNAVYIKEFGKCLDAHIAEYQQYFNR 307
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V + D+ T + D R++ F +DP L+ L FQFGRYLLISSS P
Sbjct: 308 VQL-------DLGTSNAINKTTDI-----RLEEFNDSDDPQLIALYFQFGRYLLISSSMP 355
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ ANLQGIWN++++ WDS VNIN EMNYW + NLSE +PLF + +S G
Sbjct: 356 GTQPANLQGIWNKEINAPWDSKYTVNINTEMNYWPAEVANLSEMHKPLFGLIKDISETGK 415
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
++A+ Y A GW +HH TDIW + S + LWP GG WL HLW+HY +T D FL
Sbjct: 416 ESAEKMYHARGWNMHHNTDIW-RISGVVDPPFYGLWPHGGGWLSQHLWQHYLFTGDTKFL 474
Query: 488 EKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YP+L+G A F D L E + ++ NPS SPE+ + ++ +TM I
Sbjct: 475 -KEVYPILKGTALFYKDILQQEPENKWMVVNPSNSPENGHTGG----SSLAAGTTMGNQI 529
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWVQ 595
+++VFS + A+++L NED +K++ P L P +I + G + EW++
Sbjct: 530 VQDVFSNFLEASQIL--NEDKKFSDSIKNVTPNLAPMQIGKWGQLQEWMK 577
>gi|345562260|gb|EGX45329.1| hypothetical protein AOL_s00170g36 [Arthrobotrys oligospora ATCC
24927]
Length = 826
Score = 342 bits (876), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 210/598 (35%), Positives = 322/598 (53%), Gaps = 49/598 (8%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S ++PL+I +F D+ IGNGR+GA + GG SE +++NED+LW+G NPD
Sbjct: 30 SASHPLRIWTTSAGSYFNDSYLIGNGRIGAALPGGAASEVIRVNEDSLWSGGKLSRVNPD 89
Query: 68 APKALSDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETY 124
A + D++SL+ + EA A G P A Y+ LGD++L + S + Y
Sbjct: 90 ANGKMRDIQSLLTQQRNPEAARLAGFAYAGTPVSARHYEPLGDLQLVMNHSS---STTGY 146
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-----S 179
R LDL ++ V Y+VG V + RE+ +SNPD +I I+ S+ S+SFN+ L +
Sbjct: 147 ERWLDLFDSSVGVYYTVGGVSYRREYIASNPDNIIAIHITASKPASVSFNIHLRKGQSLN 206
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
++++Y G++ +M G GK G++FSA K+ G + L D
Sbjct: 207 RWEDYTYKVGSDTTVMGGESQGK------------DGVKFSA--GTKVVASGGKVYTLGD 252
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ + +D A + A +++ ++DP ++ +S L SI SYSD+ H+
Sbjct: 253 YVI-CDNADEATIFFTAWTAY---------RQQDPINKVLSDLSSISVKSYSDIRATHVA 302
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQK F RVS+ L S + + + +R+ + + DP LV L FQFGRY
Sbjct: 303 DYQKYFGRVSLSLG----------SSSDTQKALSTPKRLAAIASTFDPELVALYFQFGRY 352
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
L ISSSR T NLQGIWN+++ P W S VNINL+MNYW SL N+ E PL+D +
Sbjct: 353 LFISSSRVNTLPPNLQGIWNQEMDPQWGSKYTVNINLQMNYWPSLVTNMIELTTPLYDLI 412
Query: 420 TYLSINGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L +G KTAQ Y S GWV HH TDIWA ++ WP G AWL H+ E Y
Sbjct: 413 ARLHSSGKKTAQSMYGNSQGWVCHHNTDIWADTAPQDNYASSTWWPAGSAWLVHHIIEEY 472
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK-LACVS 537
+T D++FL+K Y ++ A F ++L + G+ TNP+ SPE+ F K ++
Sbjct: 473 RFTRDKEFLQKY-YNTIKDAALFFTEFLTN-YKGWKVTNPTLSPENTFYLLGTKTTTAIT 530
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
ST+D ++I E+F +++ ++L K+++++ + +L P +I + G IMEW++
Sbjct: 531 LGSTLDNSLIWELFGSLLEIMDILGKHDNSMKSTLHDLRAKLPPLRINKWGGIMEWIE 588
>gi|340619504|ref|YP_004737957.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339734301|emb|CAZ97678.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 817
Score = 342 bits (876), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 209/600 (34%), Positives = 324/600 (54%), Gaps = 51/600 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + +A+P+GNGR+GAMV+G E ++ NE+T W+G P K L +++
Sbjct: 42 YDKPASMWEEALPVGNGRIGAMVYGKSGEEKIQFNEETYWSGGPYSQVVKGGYKKLPEIQ 101
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ +G+ +A + L G+P + YQ L ++ L F + + YRR LDL T
Sbjct: 102 KYIFNGEPIKAHKLFGRALMGYPVEQQKYQSLANLHLFFGQDSV----DNYRRSLDLKTG 157
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
V+Y+ G V +T+E F+S DQ I +I+ + GS++F+ L + ++ +
Sbjct: 158 VVTVEYTYGGVNYTKEVFASAVDQTIAIRITADKPGSINFDAELRGVRNSAHSNYATDYF 217
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSDWAV 251
M+G GK + D G++ E IK + GT+S ++ L ++ +D A
Sbjct: 218 RMDGL--GKDQLKLTGKSADYMGVEGKLRYEARIKAVPEGGTMS-IDGTMLSIKNADAAT 274
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
L VA+++F +N D D L ++ S+ + L DY++ F RVS+
Sbjct: 275 LYFVAATNF----VNYKDVSADENKRVEDMLAKVQQSSFDAIKKSALADYKEYFDRVSLT 330
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
L + + P+ +R+ Q+ DP L L + FGRYLLISSSRPGTQ
Sbjct: 331 LPTTDNSFL------------PTDKRMVEIQSSPDPQLSTLCYNFGRYLLISSSRPGTQP 378
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN D++P WDS NIN EMNYW NLSE EPL + L+ G+K A+
Sbjct: 379 ANLQGIWNNDMNPAWDSKYTTNINTEMNYWAVESANLSELSEPLTTMVKELTDQGAKVAK 438
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
+Y A GWV H TD+W + +A W + +GGAWL THLWEHY +T D+++L K
Sbjct: 439 EHYGADGWVFHQNTDLW-RVAAPMDGPTWGTFTVGGAWLTTHLWEHYLFTQDKEYL-KDI 496
Query: 492 YPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGK--------------LAC 535
YP+++G F +D+L+E G D +L TNPS SPE+ P+GK
Sbjct: 497 YPVMKGSVEFFMDFLVEYPGTD-WLVTNPSNSPEN---PPEGKGYKYFYDEITGMYYFTT 552
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ ST+DM I++++FS SA+E+L+ + + L ++V + RL P++I +DG++ EW +
Sbjct: 553 IVAGSTIDMQILKDLFSYYDSASEILDVDPE-LRKQVSIARSRLVPSQIGKDGTLQEWTE 611
>gi|260642325|ref|ZP_05415419.2| alpha-L-fucosidase 2 [Bacteroides finegoldii DSM 17565]
gi|260622630|gb|EEX45501.1| hypothetical protein BACFIN_06792 [Bacteroides finegoldii DSM
17565]
Length = 824
Score = 342 bits (876), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 227/600 (37%), Positives = 342/600 (57%), Gaps = 49/600 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E+ ++T K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 25 ETNASTQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGIPGTEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ S G ++FN L
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTASRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V +++ EG C + ++ ++ KG ++F L + +RG A
Sbjct: 199 S---PHQDVMISSE---EGNC--VTLSGVSSWHEGLKGKVEFQGRLTAR---NRGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A++ + +++F+ N D + + L + + H
Sbjct: 248 ADGILSVEGADEAIIYVSIATNFN----NYLDITGNQIERTKDYLSKAMKHPFPEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
D Y++ RVS+ L ++ ENI T +RV++F+ D LV FQFG
Sbjct: 304 TDFYRRYLTRVSLNLGKN---------RYENITT---DKRVENFKDTNDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 412 LIKEVSETGKETAKIMYGANGWVLHHNTDIWRVTGAI-DKAPSGMWPSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D DFL + YP+L+ F + ++ E +L PS SPE+ +GK A
Sbjct: 471 YLYTGDTDFL-RSIYPILKESGRFFDEIMVKEPIHNWLVVCPSNSPENVHSGNNGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ TMD +I ++++AIISA+E+L+ ++D +++ LK +P P +I G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASEILDTDKDFATHLKQRLKEMP---PMQIGHWGQLQEWM 585
>gi|222106243|ref|YP_002547034.1| hypothetical protein Avi_5141 [Agrobacterium vitis S4]
gi|221737422|gb|ACM38318.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 741
Score = 342 bits (876), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 214/585 (36%), Positives = 312/585 (53%), Gaps = 41/585 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ ++ A +T+A+P+GNGRLGAMV+G +E L++NE T W+G P NPDA AL
Sbjct: 5 ELWYDRAASVWTEALPVGNGRLGAMVFGDAWNERLQINESTFWSGGPYQPINPDARAALP 64
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+VR+L+ + +Y EA + + D YQ +GD+ L D H YRR LDL
Sbjct: 65 EVRNLILAERYQEADRKAYEGAMAKPDRQTSYQPIGDVWL---DLHHDMTVTNYRRSLDL 121
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TA A +Y V F R+ F+S VIV KIS + G+LS V L S + +
Sbjct: 122 ETAVAVTQYDCHGVHFRRDVFASAIQDVIVCKISVDQPGALSMTVMLSSPQNGDPIDIAD 181
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ +GR N ++F+ +++ + G + + ++ ++V +
Sbjct: 182 ATLGYDGR--------NRRQNGIDSALRFA--FRVRVLAEGGFVD-IGEETIRVREASSV 230
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+LL+ A +SF N DP ++ + L + LSY L H+ ++++LF+R+ I
Sbjct: 231 MLLIDAGTSFQ----NYRTVDGDPQAQIKARLDAAAMLSYEALLEAHVTEHRRLFNRMQI 286
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L P + T+P+ +RV ++ +DPSL L Q+GRYL IS SRPGTQ
Sbjct: 287 ALGDKP------------VPTLPTDKRVAAYAEGDDPSLAALYLQYGRYLAISCSRPGTQ 334
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWNED+ P W S VNINLEMNYW + NLSE PL + + ++ G + A
Sbjct: 335 AANLQGIWNEDILPAWGSKYTVNINLEMNYWLADVANLSETFLPLVELVEDVAETGREMA 394
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ +Y A GWV+HH TDIW + G W LWPMGGAWLC L++HY + DR LE R
Sbjct: 395 KAHYGARGWVLHHNTDIWRATGPIDGP-HWGLWPMGGAWLCAQLYDHYRFNPDRAVLE-R 452
Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YPL++G F LD L+ D YL T PS SPE+ P G C + MD I+R+
Sbjct: 453 IYPLIKGAVEFALDTLVALPDSNYLGTCPSLSPENSH--PFGSSLCA--APAMDNQILRD 508
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+F A A+ L ++ + E + RL +I + G + EW+
Sbjct: 509 LFEAFADASATLGRDGELRTEAA-ATRARLPEDRIGKGGQLQEWM 552
>gi|336251922|ref|YP_004585890.1| alpha-L-fucosidase [Halopiger xanaduensis SH-6]
gi|335339846|gb|AEH39084.1| Alpha-L-fucosidase [Halopiger xanaduensis SH-6]
Length = 786
Score = 341 bits (874), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 210/595 (35%), Positives = 320/595 (53%), Gaps = 47/595 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + DA+P+GNGRLGAM +GG+ E ++ NE+TLW G + A + ++R
Sbjct: 11 YDEPADEWIDALPLGNGRLGAMAYGGLERERIQCNEETLWAGGHEEKVVEGASEHGEEIR 70
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
L G+Y EA + L G P + L +L + A YRRELDL
Sbjct: 71 QLCFEGEYEEAQRRCNEHLQGEPPGIRPYLPFCDLLIEQPGHDEAT-AYRRELDLADGCY 129
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
RV+Y + +TRE+F S PD V+V ++ S+ ++ LD + V+ N++++
Sbjct: 130 RVEYDLEGTTYTREYFVSAPDDVLVVRLECDGPRSIDASIRLDRDRCARAGVDEENRLLL 189
Query: 196 EGRCPGKRIPPKANANDDPKG--IQF---------SAILEIKISDDRGTISALEDKKLKV 244
G+ +P A+ G ++F A +E + DD G + + V
Sbjct: 190 RGQV--IDVPNTADMYQGSGGWGLRFEGRAAVRTSGASVEPNVDDDWGQSPS----AVTV 243
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+D ++ A++ FDG DP+ + + L++ + Y +L RH+DD++ L
Sbjct: 244 TGADAVTVVFAAATDFDG---------DDPSDATTATLEAAADRRYEELKRRHVDDHRAL 294
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F RVS++L P D D E + V + R DP LV+L FQ+GRYLL++S
Sbjct: 295 FDRVSLELG-DPVDAPID----ERLAAVRNGSR--------DPHLVQLYFQYGRYLLLAS 341
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPGT ANLQGIWNE+ P W S +++NLEMNYW + NL+EC EPL F+ +
Sbjct: 342 SRPGTLPANLQGIWNEEYDPPWHSCYTLDVNLEMNYWHAEVANLAECAEPLVAFVDSMRE 401
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +TA+ Y G+ H TD+W +++ W WPM AWLC +LW+HY ++ DR
Sbjct: 402 SGRRTAREYYDCDGFAAHVDTDLW-RTTVQTVDARWGHWPMAPAWLCRNLWDHYAFSGDR 460
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
LE YP+L+ A FLLD+L+E D G+L T PS SPE++F PDG+ A V TMD
Sbjct: 461 TDLET-IYPILKDAARFLLDFLVEHPDRGWLVTAPSASPENQFRTPDGQEATVCEGPTMD 519
Query: 544 MAIIREVFSAIISAAE---VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ + ++F+ I AA V + +++ V + +L RL P +I E G + EW++
Sbjct: 520 VQLATDLFTHCIEAATELGVADGADESFVADLSDALERLPPMQIGEHGQLQEWLE 574
>gi|238060476|ref|ZP_04605185.1| large secreted protein [Micromonospora sp. ATCC 39149]
gi|237882287|gb|EEP71115.1| large secreted protein [Micromonospora sp. ATCC 39149]
Length = 826
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 217/580 (37%), Positives = 305/580 (52%), Gaps = 41/580 (7%)
Query: 19 GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSL 78
G + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N L+++R
Sbjct: 53 GAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANLAEIRRR 112
Query: 79 VDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
V + Q++ A + + G P YQ +G++ L F + Y R LDL TAT
Sbjct: 113 VFADQWSSAQDLINQTMMGTPGGQLAYQTVGNLRLAFGSAS---GASQYNRTLDLTTATV 169
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
Y + V + RE F+S PDQVIV +++ + S++F+ + DS N I
Sbjct: 170 TTTYVLNGVRYQREVFASAPDQVIVLRLTADRASSITFSATFDSPQRTTMSSPDANTIAA 229
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
+G + ++F A+ + GT+S+ L+V G+ +L+
Sbjct: 230 DG--------ISGSMEGINGSVRFLALAHAVATG--GTVSS-SGGTLRVSGATSVTVLIS 278
Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
+SS+ +N D + + L + R +S L +RH+ DYQ LF+RV+I L R
Sbjct: 279 IASSY----VNYRTVNGDYQGIARTRLNAARTVSIDQLRSRHIADYQALFNRVTINLGR- 333
Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQ 375
T + + P+ R+ + DP LLFQFGRYLLISSSRPGTQ ANLQ
Sbjct: 334 -------TAAADQ----PTDVRIAQHASSNDPQFSALLFQFGRYLLISSSRPGTQPANLQ 382
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
GIWN+ L+P+WDS +N NL MNYW + NLSEC P+FD + L++ G++ AQ Y
Sbjct: 383 GIWNDSLAPSWDSKYTINANLPMNYWPADTTNLSECFLPVFDMIKDLTVTGARVAQAQYG 442
Query: 436 ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
A GWV HH TD W +S G +W +W GGAWL T +WEHY +T D FL+ YP L
Sbjct: 443 AGGWVTHHNTDAWRGASVVDG-ALWGMWQTGGAWLATLIWEHYLFTGDVGFLQAN-YPAL 500
Query: 496 EGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAI 554
+G A F LD L + YL TNPS SPE P V TMD I+R++F A
Sbjct: 501 KGAAQFFLDTLVVHPTLNYLVTNPSNSPE----LPHHSNVSVCAGPTMDNQILRDLFDAA 556
Query: 555 ISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
A+E L + +V + RL P+++ G+I EW+
Sbjct: 557 ARASETLGV-DTTFRSQVRTAKDRLPPSRVGSRGNIQEWL 595
>gi|149197929|ref|ZP_01874977.1| hypothetical protein LNTAR_21070 [Lentisphaera araneosa HTCC2155]
gi|149138841|gb|EDM27246.1| hypothetical protein LNTAR_21070 [Lentisphaera araneosa HTCC2155]
Length = 765
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 210/600 (35%), Positives = 312/600 (52%), Gaps = 75/600 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A P GNGRLGAMV+G + E + LN+DTL+ G D NPD L +R L+ G+ +E
Sbjct: 19 AFPAGNGRLGAMVFGDIDEERIALNDDTLYNGGQRDRFNPDCLPNLDCIRQLIFDGKLSE 78
Query: 87 ATAASVK-LFGHPADV--YQLLGDIEL---------------EFDDSHLKYAE------E 122
A A + + + G P + Y+ L D+ + FD L Y +
Sbjct: 79 AEALTQEAVTGLPPIMRNYEPLADLLISQKYSKEAYKQVDPNNFDPMDLAYGKIYQAAFS 138
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---- 178
YR+ LDL + ++ V +++ RE SS PD +I ++S SE S++ + ++
Sbjct: 139 DYRKSLDLENSIITTEFEVAGIKYKRELISSFPDDLIYLRLSASEKKSINVKLRIERGDA 198
Query: 179 SLLDNHSYVNGN----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
++ Y + N + +EGR +GI F A L ++ +G
Sbjct: 199 AMYSTRHYDKVSSPVENSLFIEGRTGSN------------EGIDFVAGLRTQV---QGGS 243
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ L ++ +D V+ + +S + P + +L+ +N + ++Y
Sbjct: 244 CEKIGESLIIKDADEVVIAICGHTSV---------RQNSPMTSLKKSLE--KNFDWQEVY 292
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELL 353
RH +DYQKL+ RV ++++ +EN+ P+ ER++ Q ++ D L +L
Sbjct: 293 LRHREDYQKLYKRVKLEIAHQ---------DDENL---PTDERLRKAQNNQSDVVLDQLY 340
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
F FGRYLLIS SRPG+ ANLQGIWN+ SP+W S +NIN++MNYW + CNLSEC E
Sbjct: 341 FNFGRYLLISCSRPGSMTANLQGIWNDSFSPSWGSKYTININIQMNYWPAEVCNLSECHE 400
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLFD L L ING +TA+ Y G+V HH TD + V + WPMGGAWL H
Sbjct: 401 PLFDMLEKLHINGQETAKKMYNCRGFVCHHNTDNTYDTYPTDRNVTASYWPMGGAWLALH 460
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T DRDFL K Y ++ A F +D+L E G L T+PS SPE+ ++ P+G+
Sbjct: 461 LWEHYKFTQDRDFLSK-YYQIIHDAALFFVDFLCENPKGQLVTSPSVSPENTYLLPNGEY 519
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ TMD +IIRE+ A A+ +L K D + +L LP P +I + G IMEW
Sbjct: 520 GTICAGPTMDNSIIREIILATQEASRLLNKTLDQDYDGILAKLP---PLEIGKHGQIMEW 576
>gi|224538524|ref|ZP_03679063.1| hypothetical protein BACCELL_03418 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519862|gb|EEF88967.1| hypothetical protein BACCELL_03418 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1061
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 219/595 (36%), Positives = 318/595 (53%), Gaps = 50/595 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
TS N +K+ +N PA+ + +A+P+GN RLGAMV+GG E L+LNE+T W G P + NP
Sbjct: 265 TSAQN-MKLWYNRPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 323
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
+ L ++R L+ G+ EA + + P Y LG + L F H +E Y
Sbjct: 324 KGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTLGSLFLNFP-GHENPSE--Y 380
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+L+L ATA +Y V V+F R F+S D VI+ +I ++ +L+F VS S L +
Sbjct: 381 YRDLNLENATATTRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAVSYSSPLKSD 440
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
V G II C G A P ++ A ++++ D G +S E+ L V
Sbjct: 441 VQVKGGKLII---SCQG------AEHEGIPAAMR--AECQVQVRTD-GKVSK-EESTLAV 487
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L + A+++F +N D + + + + LQ + Y H+ Y+K
Sbjct: 488 NGATEATLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 543
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RVS+ L + + + + RV+ F D ++ L+FQ+GRYLLISS
Sbjct: 544 YDRVSLTLEST------------GVSALETPVRVQRFMEGNDMAMAALMFQYGRYLLISS 591
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN WDS VNIN EMNYW + NLSE EPLFD +T L++
Sbjct: 592 SQPGGQPANLQGIWNHSPYAPWDSKYTVNINAEMNYWPAEVTNLSETHEPLFDMVTDLAV 651
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
GS+TA+V Y A GWV HH TDIW ++ + +WP GGAWL HLW+HY +T D+
Sbjct: 652 TGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAAYFGMWPNGGAWLAQHLWQHYLFTGDK 710
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL K YPLL+G A F L L+E H Y + T PS SPEH + G ++ TM
Sbjct: 711 EFLRKY-YPLLKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 765
Query: 543 DMAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
D I + + A+ +L ++ ED+L + +L LP P +I + + EW+
Sbjct: 766 DNQIAFDALYNTLQASRILGGDKQYEDSL-QVMLSKLP---PMQIGKHNQLQEWL 816
>gi|224538426|ref|ZP_03678965.1| hypothetical protein BACCELL_03320 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519961|gb|EEF89066.1| hypothetical protein BACCELL_03320 [Bacteroides cellulosilyticus
DSM 14838]
Length = 828
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 209/603 (34%), Positives = 321/603 (53%), Gaps = 48/603 (7%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M N + +P+ + ++ PA+++ +A+P+GNGRLGAMV+G E ++LNE+T+ G P
Sbjct: 22 MGNVNVYAQKHPI-LWYDKPAQYWEEALPLGNGRLGAMVYGNPVHEEIQLNEETVSAGSP 80
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDD 114
+ NP+A ALS +R L+ G+Y EA A A K+ FG P YQ +G + L+F
Sbjct: 81 YNNYNPEAKNALSTIRQLIFDGKYPEAQALAETKILSKNGFGMP---YQTVGSLRLDFQG 137
Query: 115 SHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
Y+ +RRELDL A YSV V++ RE F+S DQ+I+ +++ S++G L+F+
Sbjct: 138 QE-NYS--NFRRELDLERAVTTTTYSVDGVKYKREVFASLTDQLIIIRLTASQAGKLTFS 194
Query: 175 VSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
+L G N++IMEG G P A + F A +E+ D +G
Sbjct: 195 AALTCPQKVDVSTLGKNRLIMEGTTKGDGFTPGA--------VCFRADVEL---DLQGGK 243
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
S D L + + A + + +++F IN D +P + L++ R Y+
Sbjct: 244 SVANDTLLSITNATSATIYIAMATNF----INYKDISGNPVERNKVYLKNARK-PYTKAL 298
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H++ YQK + RV++ L +P+ P+ RVK F T DP LV L F
Sbjct: 299 QAHVNMYQKYYRRVALDLGYTPQA------------DKPTDIRVKEFATSNDPHLVALYF 346
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYLLIS S+PG Q ANLQGIWN +P W NIN EMNYW + NL E EP
Sbjct: 347 QYGRYLLISCSQPGGQPANLQGIWNHKTNPAWRCRYTTNINAEMNYWPAEVTNLREMHEP 406
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTH 473
+ L NG + A+ Y GW++HH TD+W + A DR WP AWLC H
Sbjct: 407 FLQMIRELYENGQEAAREMYGCRGWMLHHNTDLWRMNGAVDRPYC--GPWPTCNAWLCQH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGK 532
LW+ Y Y+ D+++L YP+++ + F +D+L++ + GY+ PS SPE+ GK
Sbjct: 465 LWDRYLYSGDKEYLNS-IYPIMKSASEFFVDFLVKDPNTGYMVVTPSNSPENSPKLWKGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
+ TMD ++ ++FS +AA++L +++ + +L RL P ++ + G + E
Sbjct: 524 SNLFA-GVTMDNQLVFDLFSNTNAAAQILNRDKQ-FCDTILSLKKRLPPMQVGQYGQLQE 581
Query: 593 WVQ 595
W +
Sbjct: 582 WFE 584
>gi|406858935|gb|EKD12015.1| alpha-l-fucosidase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 835
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 218/600 (36%), Positives = 320/600 (53%), Gaps = 52/600 (8%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S + PL++ + PA F+D+ IGNGR+GA + G E L LNED+LW+G P D NPD
Sbjct: 33 SASVPLRLWDSAPAGGFSDSYLIGNGRIGAALSGSAQKEYLGLNEDSLWSGGPIDRVNPD 92
Query: 68 APKALSDVRSLVDSGQYAEA-TAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETY 124
A + +++S V G++ E T AS G+P A Y LG+++L + Y
Sbjct: 93 ASAYMGNIQSSVSKGRFQEGQTTASFAYVGNPVSARHYDYLGELQLVMNHGT---KVTGY 149
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV------SLD 178
R LDL +TA ++YSV V F RE+ +SNP V+ KIS ++G++ FN+ +L+
Sbjct: 150 ERWLDLQDSTAGLQYSVDGVTFQREYLASNPAGVMAIKISADKAGAVDFNILLRRGGTLN 209
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+D +S GN+ I+M G G K + F+A + S R + +
Sbjct: 210 RWVD-YSVKVGNDTIVMGGGSGGV------------KPVVFAAGASVVASGGR--VYTIG 254
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D +KVEG+D A + A + F K+DP + S L+S+++ SY + H+
Sbjct: 255 DY-VKVEGADEAWIYFSAWTDF---------RKEDPRAAVESDLKSVKSQSYKSIREAHV 304
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ L RVSI L S D S RV DP +V L FQFGR
Sbjct: 305 EDYQSLASRVSIDLGTSSAKQKKDATSA----------RVAGLGAAFDPEIVALAFQFGR 354
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
Y+LISS+R GT LQGIWN+D +P W S +NIN +MN+W +L NL+E EPLF
Sbjct: 355 YMLISSARQGTLAPTLQGIWNKDPNPQWGSRYTININTQMNHWLALVTNLAELNEPLFSL 414
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ + G +TAQ Y A+G V HH TDIW S+ + WP G WL TH+ + Y
Sbjct: 415 IENVRQTGLQTAQKMYGAAGAVCHHNTDIWGDSAPVDNWALSTWWPTGLVWLVTHIHDTY 474
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD--GKLACV 536
+T + LEK+ Y L A+F LD I + G++ TNPS SPE+ + P+ G A +
Sbjct: 475 LFTGNATLLEKK-YDTLVDAAAFFLD-FITPYKGWMVTNPSVSPENVYRIPNGGGGTAAM 532
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED-GSIMEWVQ 595
+ TMD +++R +FS ++ A VL K + AL +++ + L P +++ G I EW++
Sbjct: 533 TAGPTMDNSLLRALFSIVLEAQSVLGKKDTALADRLEAARASLPPLMVSKRYGGIQEWIE 592
>gi|424876717|ref|ZP_18300376.1| hypothetical protein Rleg5DRAFT_1090 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393164320|gb|EJC64373.1| hypothetical protein Rleg5DRAFT_1090 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 747
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 210/584 (35%), Positives = 308/584 (52%), Gaps = 42/584 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G E L++NE T W G P NPDA L VR
Sbjct: 8 YDTPAQLWTDALPLGNGRLGAMVFGDPLREHLQINEATFWAGGPYQPVNPDAFGHLETVR 67
Query: 77 SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ +YA+A A + K L P YQ +GD+ LEFD + + YRR LDL+TA
Sbjct: 68 QLIFDCRYADAEALAEKHLMARPIKQMSYQPIGDLHLEFDH---RESVSGYRRALDLDTA 124
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y+ + + RE F S D V+V ++S ++S +S+DS + +Q+
Sbjct: 125 IATSSYTADGIAYLREAFVSPVDGVLVLRLSADRKRAISCRISIDSPQQGEMRIGQGSQL 184
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
G+ GK A A ++F+ +++ + GT++A L VEG+D ++
Sbjct: 185 SFSGK--GKAESGIAAA------LRFT--FGVRMVNSGGTVNA-SRGALSVEGADEVLVF 233
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A++SF D P + + L+ + ++ L H++++++LF +I L
Sbjct: 234 LDAATSFR----RYDDVLGHPERDIVDRLERAASRDFASLRDDHIEEHRRLFSAFAIDLG 289
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
+P ++P+ +R+ F +DP+L L QFGRYL+I+SSRPGTQ AN
Sbjct: 290 STPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQPAN 337
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIWN + P W S NINL+MNYW P NL EC EPL + L+ G A ++
Sbjct: 338 LQGIWNAETDPPWGSKYTANINLQMNYWLPAPANLPECLEPLVEMAEELAETGKAMAHIH 397
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
Y A GWV+HH TD+W + G W LWP GG WL L + +Y D + + +R +P
Sbjct: 398 YRARGWVMHHNTDLWRATGPIDG-AKWGLWPTGGIWLMAQLLDACDYLDDAEAMRRRLFP 456
Query: 494 LLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
+ A FL D L+ G D YL TNPS SPE+ P G C MD +IR+ F
Sbjct: 457 VAREAAHFLFDVLVPFPGTD-YLVTNPSLSPENAH--PHGASICA--GPAMDSQLIRD-F 510
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++ V E LV + + LPRL P +I +G + EW++
Sbjct: 511 LGLLRPLAVSIGGEPDLVADIDRVLPRLAPDRIGANGQLQEWLE 554
>gi|423221840|ref|ZP_17208310.1| hypothetical protein HMPREF1062_00496 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645258|gb|EIY38987.1| hypothetical protein HMPREF1062_00496 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1074
Score = 339 bits (870), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 217/595 (36%), Positives = 319/595 (53%), Gaps = 50/595 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
TS N +K+ +N PA+ + +A+P+GN RLGAMV+GG E L+LNE+T W G P + NP
Sbjct: 278 TSAQN-MKLWYNRPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 336
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
+ L ++R L+ G+ EA + + P Y LG + L F H +E Y
Sbjct: 337 KGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTLGSLFLNFP-GHENPSE--Y 393
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+L+L ATA +Y V V+F R F+S D VI+ +I ++ +L+F VS S L +
Sbjct: 394 YRDLNLENATATTRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAVSYSSPLKSD 453
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
V G II C G A P ++ A ++++ D G +S E+ L V
Sbjct: 454 VQVKGGKLII---SCQG------AEHEGIPAAMR--AECQVQVRTD-GKVSK-EESTLAV 500
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L + A+++F +N D + + + + LQ + Y H+ Y+K
Sbjct: 501 NGATEATLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 556
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RV++ L + + + + RV+ F D ++ L+FQ+GRYLLISS
Sbjct: 557 YDRVALTLEST------------GVSALETPVRVQRFMEGNDMAMAALMFQYGRYLLISS 604
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN WDS +NIN EMNYW + NLSE EPLFD +T L++
Sbjct: 605 SQPGGQPANLQGIWNHSPYAPWDSKYTININAEMNYWPAEVTNLSETHEPLFDMVTDLAV 664
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
GS+TA+V Y A GWV HH TDIW ++ + +WP GGAWL HLW+HY +T D+
Sbjct: 665 TGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAAYFGMWPNGGAWLAQHLWQHYLFTGDK 723
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL K+ YPLL+G A F L L+E H Y + T PS SPEH + G ++ TM
Sbjct: 724 EFL-KKYYPLLKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 778
Query: 543 DMAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
D I + + A+ +L ++ ED+L + +L LP P +I + + EW+
Sbjct: 779 DNQIAFDALYNTLQASRILGGDKQYEDSL-QVMLSKLP---PMQIGKHNQLQEWL 829
>gi|255532590|ref|YP_003092962.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
gi|255345574|gb|ACU04900.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
Length = 825
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 206/598 (34%), Positives = 328/598 (54%), Gaps = 44/598 (7%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
++ T LK+ ++ PA ++ +A+PIGNGRLGAMV+G E L+LNE+T+W+G P
Sbjct: 21 GQAKKTDGTLKLWYDRPAANWNEALPIGNGRLGAMVFGNPAKEQLQLNEETVWSGGPNSN 80
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATA-ASVKLF--GHPADVYQLLGDIELEFDDSHLKYA 120
+ A+ +R L+ G++ EA A A V++F + +YQ +G++ LEF+ +
Sbjct: 81 VTAASGAAIPALRKLIFEGKFEEAQALADVEMFPKKNSGMIYQPVGNLFLEFEGTE---K 137
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y R+L++ A A V Y G + + RE FSS DQV++ +++ + G ++F +D+
Sbjct: 138 ARNYYRDLNIEKALATVTYEAGGIRYKREIFSSFTDQVLIVRLTADKPGKITFRALMDTE 197
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ +++++ G A+ + I+F++ ++K+ + G S L++
Sbjct: 198 QKGGLRME-KDRLLLSGLT--------ADHEGEQGKIRFAS--QVKVVAEGGKAS-LQNN 245
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
V+ ++ A + + +++F N D D ++ S L +Y++ H+
Sbjct: 246 AWIVKAANSATVYVSIATNFK----NYHDVSADAGLKAASFLDRAVKKNYAEALAAHIKF 301
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQ+ F+RV + +TD ++ P+ ER+ +F DP L L FQFGRYL
Sbjct: 302 YQQYFNRVKFDIG------ITDAVNK------PTDERIAAFARSNDPHLTALYFQFGRYL 349
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSS+PG Q LQGIWN+ + WDS +NIN EMNYW + NLSE +PLF L
Sbjct: 350 LISSSQPGNQPPTLQGIWNDKMLAPWDSKYTININTEMNYWPAEVTNLSELHDPLFKMLK 409
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYN 479
LS+ G +TA++ Y A GWV HH TD+W + DR LWPMGG WL HLW+HY
Sbjct: 410 DLSVTGRETAKLMYGAKGWVTHHNTDLWRITGPVDRPYA--GLWPMGGNWLSQHLWDHYM 467
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSY 538
+T D+ FL K YP+L+G + F LD L E +L +PS SPE+ ++ GK ++
Sbjct: 468 FTGDKQFL-KEYYPVLKGASEFYLDVLQEEPTHKWLVVSPSNSPENTYVP--GKRVSIAA 524
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
+TMD ++ ++F+ AAE+L DA +LK+ L RL P +I + + EW+
Sbjct: 525 GTTMDNQLLFDLFTRTGKAAELL--GMDAEFRGLLKTALGRLAPMQIGKYSQLQEWMH 580
>gi|431796298|ref|YP_007223202.1| hypothetical protein Echvi_0919 [Echinicola vietnamensis DSM 17526]
gi|430787063|gb|AGA77192.1| hypothetical protein Echvi_0919 [Echinicola vietnamensis DSM 17526]
Length = 813
Score = 339 bits (869), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 207/590 (35%), Positives = 331/590 (56%), Gaps = 49/590 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA ++ +A+P+GNGRLGAMV+G E L+LNE+T+W G P + A +A+
Sbjct: 26 KLWYDQPASNWNEALPLGNGRLGAMVFGVPAMERLQLNEETIWAGSPNSNAHTSAKEAIP 85
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G Y A A+ K+ D Y+ G++ + F H Y + Y R+L+L
Sbjct: 86 YVRRLIFDGDYQAAQELANEKIMSQTNDGMPYETFGNVYISFP-GHQDY--QDYYRDLNL 142
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT+ V+YSV V++TRE S+ D VI+ K++ GS++ NV + S DN
Sbjct: 143 EDATSTVRYSVDGVQYTREVLSAFEDDVIMVKLTADRPGSITCNVHMTSPHDNAEARVRG 202
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+Q+ + G + +D +G ++F IK ++ G + A++D + V+G+D
Sbjct: 203 DQLTLSG---------VSQTHDHQRGGVKFQG--RIKATNKGGQL-AVKDGLISVDGADE 250
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
L + +++F N +D + ++ + L + ++ + H++ YQ+ + RV+
Sbjct: 251 VTLYISIATNFK----NYNDLSVEYERKAEALLDAALQKDFAAIKREHIEHYQQFYDRVA 306
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
I D+ + +E+ P+ +R++ F DP L L FQF RYLLIS S+PG
Sbjct: 307 I-------DLGSTEAAEK-----PTDQRIQQFSEVHDPQLAALYFQFARYLLISCSQPGG 354
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ L P W+S VNIN EMNYW + NLSE EP + +S G +T
Sbjct: 355 QPANLQGIWNDMLFPPWESKYTVNINAEMNYWPAELTNLSEMHEPFLQMVREVSETGQQT 414
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A++ Y A GWV+HH TDIW + G + +A +WP GGAWL HLWE Y Y+ D DF
Sbjct: 415 AKMMYGARGWVLHHNTDIWRIT----GPIDYAASGMWPSGGAWLSQHLWERYLYSGDEDF 470
Query: 487 LEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L K AYP+++G A F LD LIE +G+L +PS+SPE+ + A ++ TMD
Sbjct: 471 L-KEAYPIMKGAAQFFLDVLIEEPVNGWLVVSPSSSPENSHVHG----ATIAAGVTMDNQ 525
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++ ++FS +I ++E+L +++ A + + + +L P ++ + G + EW+
Sbjct: 526 LLFDLFSNLIRSSEILGEDQ-AFADTLKATRSKLAPMQVGQYGQLQEWMH 574
>gi|384419108|ref|YP_005628468.1| hypothetical protein XOC_2159 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353462021|gb|AEQ96300.1| hypothetical protein XOC_2159 [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 776
Score = 338 bits (867), Expect = 5e-90, Method: Compositional matrix adjust.
Identities = 220/595 (36%), Positives = 312/595 (52%), Gaps = 47/595 (7%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG L+LNEDTL+ G P D T+P
Sbjct: 41 VAAAEALQLWYPQPANEWVEALPVGNGRLGAMVWGGSAHAHLQLNEDTLYAGGPYDATSP 100
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G YAE A KL P YQ LGD+ L+FD +
Sbjct: 101 DALAALPQVRALIFAGGYAEVEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GMSD 157
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q +V ++S G +S V +DS N
Sbjct: 158 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAHAQCVVVRLSCDHPGGISLRVGIDSP-QN 216
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKK 241
++ GR N GI+ L + G S + D+
Sbjct: 217 GEVTAEQGGLLFSGR------------NGSCAGIEGKLRFALPVLPQVTGGKRSQVRDR- 263
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S ++ D DP + + ++L+ L ++ L HL D+
Sbjct: 264 LRIDAADEVVLLLSAATSDQ--RVDTVDG--DPLALTAASLRKAAKLEFAALLRAHLADH 319
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S D V + + ERV+ F +DP+L L Q+GRYLL
Sbjct: 320 QRLFRRVAINLGSS--DAVQ----------LSTNERVQRFAEGDDPALAALYHQYGRYLL 367
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRP TQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL
Sbjct: 368 ICSSRPCTQPANLQGIWNDLMQPPWESKYTININAEMNYWPSEANALHECVEPLEAMWFD 427
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A WV+H+ TD+W ++ G W LWPMGG W LW ++Y
Sbjct: 428 LAKTGAHTAKAMYDAPAWVVHNNTDLWRQAGPIDG-AKWRLWPMGGVWQ-QQLWHRWDYG 485
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR L YPL +G A F + L+ + G + TNPS SPE+++ P G C
Sbjct: 486 RDRADLST-IYPLFKGAAEFFVATLLRDPQTGAMVTNPSMSPENQY--PFGAALCA--VP 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD ++R++F+ I+ ++L + D L +++ RL P +I + G + EW Q
Sbjct: 541 TMDAQLLRDLFAQCIAMRKLLCIDAD-LAQQLAALRERLPPNRIGKAGQLQEWQQ 594
>gi|423299820|ref|ZP_17277845.1| hypothetical protein HMPREF1057_00986 [Bacteroides finegoldii
CL09T03C10]
gi|408473629|gb|EKJ92151.1| hypothetical protein HMPREF1057_00986 [Bacteroides finegoldii
CL09T03C10]
Length = 824
Score = 338 bits (866), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 226/600 (37%), Positives = 340/600 (56%), Gaps = 49/600 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E+ ++T K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 25 ETNASTQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ S G ++FN L
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTASRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V +++ EG C + ++ ++ KG ++F L + +RG A
Sbjct: 199 S---PHQDVMISSE---EGNC--VTLSGVSSWHEGLKGKVEFQGRLTAR---NRGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D AV+ + +++F+ N D + + L + + H
Sbjct: 248 ADGILSVEGADEAVIYVSIATNFN----NYLDITGNQIERAKDYLSKAMKHPFPEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
Y++ RVS+ L ++ ENI T +RV++F+ D LV FQFG
Sbjct: 304 TGFYRRYLTRVSLNLGKN---------RYENITT---DKRVENFKDTNDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVSNLSELNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S G +TA++ Y A+GWV+HH TDIW + A K +W GGAWLC HLWE
Sbjct: 412 LIKEVSETGKETARIMYGANGWVLHHNTDIWRVTGAI-DKAPSGMWSSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D DFL + YP+L+ F + ++ E +L PS SPE+ +GK A
Sbjct: 471 YLYTGDTDFL-RSIYPILKESGRFFDEIMVKEPIHNWLVVCPSNSPENVHSGSNGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ TMD +I ++++AIISA+E+L+ ++D +++ LK +P P +I G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASEILDTDKDFATHLKQRLKEMP---PMQIGHWGQLQEWM 585
>gi|256426140|ref|YP_003126793.1| alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
gi|256041048|gb|ACU64592.1| Alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
Length = 811
Score = 337 bits (865), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 212/592 (35%), Positives = 314/592 (53%), Gaps = 50/592 (8%)
Query: 13 LKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
LK+ + PA + +T A+P+GNGR+ MV+G E L+LNE T+WTG P NP+A A
Sbjct: 22 LKLWYKQPAGNVWTAALPVGNGRIAGMVFGNPAEELLQLNEATVWTGSPNRNENPEALAA 81
Query: 72 LSDVRSLVDSGQYAEAT-----AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
L +R L+ G+ EA KL G +YQ +G + L F H Y + Y R
Sbjct: 82 LPQIRQLIFDGKQKEAQDLAGEKIQTKLSG--GQMYQPVGTLHLAFP-GHEHY--DNYYR 136
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
ELD+ A A Y V V++TRE F+S P Q I+ ++S S+ G+L F+ L + N
Sbjct: 137 ELDIEKAVATTTYMVDGVKYTREVFASVPAQTIIVRLSSSKPGTLGFSAYLTTPQKNAVV 196
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVE 245
+ + G +++ +G ++F+ I + S G A D + ++
Sbjct: 197 KASGKDLTVNGIT---------GSHEGVEGKVKFNGITRVIAS---GGSVATSDTAVTIK 244
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
++ A+L + ++++ +N D D ++ + L + Y+ L H+ YQ+ F
Sbjct: 245 NANSALLFISMATNY----VNYQDLSADEVKKASAYLNAAVKQPYATLLKEHIAAYQRYF 300
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RV I L S D+ D P+ R+ +F DP + L FQFGRYLLIS S
Sbjct: 301 NRVKIDLGTS--DVAKD----------PTDVRLVNFSKTYDPQFISLYFQFGRYLLISCS 348
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PG Q A LQG+WN ++SP WDS +NIN EMNYW + NL E EPL + LS+
Sbjct: 349 QPGGQPATLQGLWNSEMSPPWDSKYTININTEMNYWPAEKDNLPEMHEPLVQMVKELSVT 408
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TA++ Y A GWV HH TD+W + + ++ + +W MGGAWL HLW+ Y Y DR
Sbjct: 409 GQGTARILYGARGWVAHHNTDLW-RITGPVDRIFYGIWSMGGAWLAQHLWDRYLYNGDRR 467
Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS--TM 542
+L YP ++G A F +D L+E YL NP TSPE+ AP + VS+ + TM
Sbjct: 468 YLAD-VYPAIKGAALFFVDDLVEDPKRKYLVVNPGTSPEN---APSTR-PNVSFDAGCTM 522
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
D I+ + SA I+AAE+L K+ ALV+ RL P ++ + G + EW+
Sbjct: 523 DNQIVFDALSAAINAAEILGKDA-ALVDTFKTVRRRLPPMQVGQYGQLQEWI 573
>gi|300726579|ref|ZP_07060021.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
gi|299776147|gb|EFI72715.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
Length = 803
Score = 337 bits (865), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 209/593 (35%), Positives = 325/593 (54%), Gaps = 48/593 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ +N PA+ +TDA+P+GNGRLGAMV+G +E ++LNE+T+WTG P N A A+
Sbjct: 6 KLWYNEPAQVWTDALPLGNGRLGAMVYGIPSTEHIQLNEETIWTGQPNHNANKKALNAIP 65
Query: 74 DVRSLVDSGQY--AEATAASVKLFG-HPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++ L+ G+Y A+ A + G + YQ GD+ + ++ L+Y YRREL L
Sbjct: 66 KIQQLLFEGRYHTADKMANDNVMSGTNWGMAYQTFGDVYITTPNA-LRYT--NYRRELSL 122
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
++A A Y+V V + RE +S VI ++ S+ G L+F + + +
Sbjct: 123 DSAIAVTTYTVDGVTYRREVITSFDSNVITIHLTASKPGKLTFGAHYSTPQEEILIRSEK 182
Query: 191 NQIIMEG------RCPGK-RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
N+ I+EG C GK R + G++ A + D ++
Sbjct: 183 NEAILEGVSGKLEGCKGKVRFMGRMLCETMKNGVRQEA--------------SSRDGEIT 228
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE +D A + + +++F +N D D ++S L+ +Y H+ +Q
Sbjct: 229 VENADEATIYISIATNF----VNYKDISGDEVAKSEQILRQAIAKNYEQSKKTHIAKFQS 284
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
+RVS+ L KD+ + P+ +R+ +F +D L+ F FGRYLLI
Sbjct: 285 FMNRVSLSLG---KDLYQNE---------PTDQRIINFAHRDDNGLIATYFNFGRYLLIC 332
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN + P+WDS NINLEMNYW S NLS+ EPLF + +S
Sbjct: 333 SSQPGGQAANLQGIWNHRVWPSWDSKYTTNINLEMNYWPSEIANLSDLNEPLFRLIREVS 392
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+GS +A++ Y GWV+HH TDIW + + +W +GGAWLC HLW+HY YT D
Sbjct: 393 ESGSISAKMMYGKDGWVLHHNTDIW-RVTGGIDHASSGMWMLGGAWLCAHLWQHYLYTGD 451
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
++FL K+AYPL++G A FL + LI E G+L +PS SPE+ + DGK+A ++Y +TM
Sbjct: 452 KEFL-KKAYPLMKGAAIFLDEMLIPEPEHGWLVISPSVSPENYHPSKDGKIA-ITYGTTM 509
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D ++ E+F+++ A+++L +D L + L ++ P +I + G + EW++
Sbjct: 510 DNTLLHELFNSVSVASQILGV-DDTLKSYYAERLKKMAPMQIGKWGQLQEWLK 561
>gi|116248791|ref|YP_764632.1| hypothetical protein pRL120117 [Rhizobium leguminosarum bv. viciae
3841]
gi|115253441|emb|CAK11831.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 747
Score = 337 bits (865), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 210/584 (35%), Positives = 308/584 (52%), Gaps = 42/584 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G E L++NE T W G P NPDA L VR
Sbjct: 8 YDTPAQLWTDALPLGNGRLGAMVFGDPLREHLQINEATFWAGGPYQPVNPDAFGHLETVR 67
Query: 77 SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ +YA+A A + K L P YQ +GD+ LEFD + + YRR LDL+TA
Sbjct: 68 QLIFDCRYADAEALAEKHLMARPIKQMSYQPIGDLHLEFDH---RESVSGYRRALDLDTA 124
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y+ + + RE F S D V+V ++S +++ +S+DS + +Q+
Sbjct: 125 IATSSYTADGIAYLREAFVSPVDGVLVLRLSADRKRAINCRISIDSPQQGEMRIGQGSQL 184
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
G+ GK A A ++F+ +++ + GT++A L VEG+D ++
Sbjct: 185 SFSGK--GKAESGIAAA------LRFA--FGVRLINSGGTVNA-SGGALSVEGADEVLVF 233
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A++SF D P + + L+S + + L H++++++LF +I L
Sbjct: 234 LDAATSFR----RYDDVLGHPERDIVDRLESAVSRDFVSLRDDHIEEHRRLFSAFAIDLR 289
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
+P ++P+ +R+ F +DP+L L QFGRYL+I+SSRPGTQ AN
Sbjct: 290 STPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQPAN 337
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIWN + P W S NINL+MNYW P NL EC EPL + L+ G A V+
Sbjct: 338 LQGIWNAETDPPWGSKYTANINLQMNYWLPAPANLPECLEPLVEMAEELAETGKAMAHVH 397
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
Y A GWV+HH TD+W + G W LWP GG WL L + +Y D + + +R +P
Sbjct: 398 YRARGWVMHHNTDLWRATGPIDG-AKWGLWPTGGIWLMAQLLDACDYLDDAEAMRRRLFP 456
Query: 494 LLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
+ A FL D L+ G D +L TNPS SPE+ P G C MD +IR+ F
Sbjct: 457 IAREAAHFLFDVLVPFPGTD-HLVTNPSLSPENAH--PHGASICA--GPAMDSQLIRD-F 510
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++ V E LV + + LPRL P +I +G + EW++
Sbjct: 511 LGLLRPLAVSIGGEPDLVADIDRVLPRLAPDRIGANGQLQEWLE 554
>gi|427383711|ref|ZP_18880431.1| hypothetical protein HMPREF9447_01464 [Bacteroides oleiciplenus YIT
12058]
gi|425728416|gb|EKU91274.1| hypothetical protein HMPREF9447_01464 [Bacteroides oleiciplenus YIT
12058]
Length = 1074
Score = 337 bits (864), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 211/595 (35%), Positives = 319/595 (53%), Gaps = 50/595 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
TS N +K+ + PA+ + +A+P+GN RLGAMV+GG E L+LNE+T W G P + NP
Sbjct: 278 TSAQN-MKLWYGRPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 336
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
+ L ++R L+ G+ EA + + P Y +G + L F H +E Y
Sbjct: 337 RGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTMGSLFLNFP-GHENPSE--Y 393
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+L+L ATA ++Y V V+F R F+S D VI+ +I ++ +L+F +S +S L ++
Sbjct: 394 YRDLNLENATATIRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAISYNSPLKSN 453
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
V G II C G A P ++ +++K G +S E+ L V
Sbjct: 454 VQVKGGKLII---SCQG------AEHEGVPAAMRAECQVQVKTD---GKVSK-EESSLAV 500
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L + A+++F +N D + + + + LQ + Y H+ Y+K
Sbjct: 501 NGATEATLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 556
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RV++ L + + + + RV+ F D ++ L+FQ+GRYLLISS
Sbjct: 557 YDRVALTLEST------------KVSALETPVRVQRFMEGNDMAMAALMFQYGRYLLISS 604
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN WDS +NIN EMNYW + NLSE EPLFD + L++
Sbjct: 605 SQPGGQPANLQGIWNHSPYAPWDSKYTININAEMNYWPAEVTNLSETHEPLFDMVADLAV 664
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
GS+TA+V Y A GWV HH TDIW ++ + +WP GGAWL HLW+HY +T D+
Sbjct: 665 AGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAAYFGMWPNGGAWLAQHLWQHYLFTGDK 723
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL K+ YP+L+G A F L L+E H Y + T PS SPEH + G ++ TM
Sbjct: 724 EFL-KKYYPVLKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 778
Query: 543 DMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
D I + + + A+ +L+ + ED+L + +L LP P +I + + EW+
Sbjct: 779 DNQIAFDALYSTLQASRILDGDKQYEDSL-QTMLDKLP---PMQIGKHNQLQEWL 829
>gi|406867099|gb|EKD20138.1| hypothetical protein MBM_02090 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 743
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 209/593 (35%), Positives = 311/593 (52%), Gaps = 58/593 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA ++ ++PIGNGRLGAMV+G +E L+LNED++W G P D DA K L
Sbjct: 4 RLHYTTPATEWSQSLPIGNGRLGAMVYGRTTTELLQLNEDSVWYGGPQDRIPRDALKNLP 63
Query: 74 DVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R L+ + Q++EA K F H Y+ LG LEF H Y+RELDL
Sbjct: 64 RLRELIRAEQHSEAEDLVRKAFFATPHSKRHYEPLGTFTLEF--GHEDSEVTDYKRELDL 121
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES--GSLSFNVSLDSLLDNHSYVN 188
TA A V+Y V++ R+ F+S PD VIV ++ SE +L + + + Y++
Sbjct: 122 ETAIASVQYRYRGVDYKRKVFASGPDNVIVLQLKSSERVRATLRLTRVSEREYETNEYLD 181
Query: 189 G----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N+ I+ PG R +P ++++K +D GT+ A+ L +
Sbjct: 182 SVTASNDGSIVMRATPGGR-------GSNP----LCCVVKVKC-EDGGTLEAV-GGCLVI 228
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
E S ++++ A + F P DP S ++ + R L+ L RH+++Y+ L
Sbjct: 229 E-SKATMIVISAQTKFRSP---------DPESAALE--DATRALTRGGLRGRHVENYRSL 276
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ R+ +QL ++ TD K DP LV L +GRYLL++S
Sbjct: 277 YARMKLQLGSPASELSTD----------------KRLLRSVDPGLVALYHNYGRYLLVAS 320
Query: 365 SRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SRPG + A LQGIWN P W S +NIN +MNYW + CNL+EC+ PLFD L +
Sbjct: 321 SRPGPRALPATLQGIWNPSFQPAWGSRYTININTQMNYWPANLCNLAECEMPLFDLLERM 380
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+I G +TAQ Y GW HH TDIWA + V +WP+ GAWLC H+WE+Y +
Sbjct: 381 AIRGKQTAQEMYGCRGWCAHHNTDIWADTDPQDRWVPATVWPLAGAWLCFHIWENYLFNG 440
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG--YLETNPSTSPEHEFIAPDGKLACVSYSS 540
LE R +P+L+G F+LD+L+E YL TNPS SPE+ F++ + + + S
Sbjct: 441 STTLLE-RMFPILKGSVQFILDFLVEDATSGQYLVTNPSLSPENTFLSANNREGVLCEGS 499
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
T+D+ II +F A I A L++ +D L+ V+ + RL P + G + EW
Sbjct: 500 TIDIQIINALFGAFIDALGELDRTDD-LLPAVIHARDRLPPMAVGSLGQLQEW 551
>gi|380695292|ref|ZP_09860151.1| hypothetical protein BfaeM_15197 [Bacteroides faecis MAJ27]
Length = 824
Score = 337 bits (863), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 221/600 (36%), Positives = 341/600 (56%), Gaps = 49/600 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E + K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P +
Sbjct: 25 EKKVSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGIEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V N++ +G C + ++ ++ KG ++F L ++ ++G A
Sbjct: 199 S---PHQDVMINSE---KGNC--VILSGVSSLHEGLKGKVEFQGRLTVR---NQGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A + + +++F+ N D + T + S L +++ H
Sbjct: 248 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVHPFAEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++ Y++ RVS+ L E+ V + +RV++F+ D LV FQFG
Sbjct: 304 VEFYRRYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLS+ EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S +G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 412 LIKEVSESGKETARIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L+G F + ++ E +L PS SPE+ DGK A
Sbjct: 471 YLYTGDTEFL-RSVYPILKGSGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGSDGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ TMD +I ++++AIISA+ +L+ +++ A +E+ LK + P ++ G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASRILDTDKEFAAHLEQRLKEMA---PMQVGHWGQLQEWM 585
>gi|338209373|ref|YP_004646344.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
gi|336308836|gb|AEI51937.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
Length = 849
Score = 336 bits (862), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 218/598 (36%), Positives = 335/598 (56%), Gaps = 47/598 (7%)
Query: 6 STSTTNPLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
S+ LK+ + P+ + + +A+PIGNG+LGAMV+G V ET++LNE T+W+G P
Sbjct: 47 SSQEVKSLKLWYTKPSGNTWENALPIGNGQLGAMVYGNVEKETIQLNEHTVWSGSPNRND 106
Query: 65 NPDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAE 121
NP+A AL ++R L+ G+ +A + K+ ++Q +G++ L FD H Y +
Sbjct: 107 NPEALAALPEIRQLIFDGKQKDAERLANKVIITKKSHGQMFQPVGNLHLTFD-GHGNYTD 165
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y RELDL A A+ Y+V V++TRE +S PD+VIV ++ + SLSF S +
Sbjct: 166 --YYRELDLERAVAKTAYTVNGVKYTREILASFPDRVIVMHLTADKPNSLSFVASYATQH 223
Query: 182 DNHSYVN--GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALE 238
+ +N +N++ + G + ++ KG + F + IK + GT++A
Sbjct: 224 KKRA-INPTASNELSLSGTT---------SDHEGVKGMVNFKGVTRIKT--EGGTVAA-N 270
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D + V+G+ A L + +++F+ + D D + + + L SY+ + T H+
Sbjct: 271 DSSIAVKGATTATLYVSIATNFN----SYKDISGDENARATAYLNKAYPKSYAAILTPHM 326
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
YQK F+RV D+ T ++ +P+ ER+K+F+T DP +V L +QFGR
Sbjct: 327 AAYQKYFNRVQF-------DLGTTEAAK-----LPTDERLKNFRTVNDPHMVTLYYQFGR 374
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSS+PG+Q ANLQGIWN ++P WDS +NIN +MNYW + NLSE P
Sbjct: 375 YLLISSSQPGSQPANLQGIWNHRMNPPWDSKYTININAQMNYWPAEKTNLSELHAPFLKM 434
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ LS G +TA+V Y A GW+ HH TDIW + A G +W GG W HLWEHY
Sbjct: 435 VKELSETGQETARVMYGAKGWMAHHNTDIWRATGAIDGAFW-GMWTGGGGWTAQHLWEHY 493
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACV 536
Y+ D+ FL + YP+L+G A+F D+L+E H Y L NP +SPE+ A G + +
Sbjct: 494 LYSGDKAFLTE-IYPILKGAAAFYADFLVE-HPKYHWLVINPGSSPENAPKAHAG--SSL 549
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+TMD I+ + FS I AAE+L+K + A V+ + + +L P + + G + EW+
Sbjct: 550 DAGTTMDNQIVFDAFSTAIRAAELLKK-DAAFVDTLRQLRNKLAPMHVGQHGQLQEWL 606
>gi|298385755|ref|ZP_06995313.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
gi|298261896|gb|EFI04762.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
Length = 824
Score = 336 bits (861), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 221/600 (36%), Positives = 340/600 (56%), Gaps = 49/600 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E + K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 25 EKKVSVQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V N++ EG C + ++ ++ KG ++F L + ++G A
Sbjct: 199 S---PHQDVMINSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTAR---NQGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A + + +++F+ N D + T + S L +++ H
Sbjct: 248 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVRPFAEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++ Y++ RVS+ L E+ V + +RV++F+ D LV FQFG
Sbjct: 304 VEFYRRYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLS+ EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S +G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 412 LIKEVSESGKETAKIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ DGK A
Sbjct: 471 YLYTGDTEFL-RSVYPILKESGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGSDGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ TMD +I ++++AIISA+ +L+ +++ A +E+ LK + P ++ G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASRILDTDKEFAAHLEQRLKEMA---PMQVGHWGQLQEWM 585
>gi|399025527|ref|ZP_10727523.1| hypothetical protein PMI13_03496 [Chryseobacterium sp. CF314]
gi|398077904|gb|EJL68851.1| hypothetical protein PMI13_03496 [Chryseobacterium sp. CF314]
Length = 820
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 209/595 (35%), Positives = 316/595 (53%), Gaps = 57/595 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ + +A+PIGNGRL AMV+G E L+LNE T W+G P NPD PK L
Sbjct: 27 KLWYDKPARQWVEALPIGNGRLAAMVFGDPFKEKLQLNESTFWSGGPSRNDNPDGPKVLD 86
Query: 74 DVRSLVDSGQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R + + Y +A + K +Q +GD+ LEF++ E Y RELD+
Sbjct: 87 SIRYYLFNENYKKAEILANKGLTAKTLHGSAFQNIGDLNLEFNNPG---DIENYYRELDI 143
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A +S + + RE F+S PD VI+ K+S + +L+FN +S L +
Sbjct: 144 EKALITTTFSSNGIHYKREAFASVPDNVIIIKLSSDKKNALNFNAGFNSELKKNVKTIDA 203
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N + M+G ++ D +G ++F+ + + +G +++ D ++ V +D
Sbjct: 204 NTLQMDGI---------SSTLDGVQGQVKFNVLAKFIT---KGGTNSVSDNRISVANADE 251
Query: 250 AVLLLVASSSF-DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
++L+ +++F D +N D S+S + +++ L+ HL+ YQK F R+
Sbjct: 252 VLILISIATNFTDYKTLN-----TDEVSKSKKYISQSETKNFNTLFKNHLNAYQKYFKRI 306
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
L SP P+ RVK+F + DP L+ L +QFGRYLLISSS+PG
Sbjct: 307 DFSLGTSPAA------------QFPTDLRVKNFASGYDPELISLYYQFGRYLLISSSQPG 354
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQGIWN P WDS +NIN EMNYW + NL+E EPL + LS+ G +
Sbjct: 355 GQPANLQGIWNNSNKPAWDSKYTININTEMNYWPAEKTNLAEMHEPLVQLVKDLSVTGVE 414
Query: 429 TAQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA++ Y + GWV HH TDIW + A+ G+ WPMGGAWL HLWE Y Y D+
Sbjct: 415 TARIMYKSRGWVAHHNTDIWRITGVVDFANAGQ-----WPMGGAWLSQHLWEKYLYGGDK 469
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
++L K Y +L+ A F D+LIE H +L +PS SPE+ I + + +S +TM
Sbjct: 470 NYL-KSIYTVLKSAALFYEDFLIEEPVHQ-WLVVSPSISPEN--IPKRNRGSALSAGNTM 525
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALV--EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D +I ++FS AA++L + D + ++ LP P KI G + EW++
Sbjct: 526 DNQLIFDLFSKTKKAAQILNVDSDKIPVWNTIISKLP---PMKIGRYGQLQEWME 577
>gi|189467819|ref|ZP_03016604.1| hypothetical protein BACINT_04211 [Bacteroides intestinalis DSM
17393]
gi|189436083|gb|EDV05068.1| hypothetical protein BACINT_04211 [Bacteroides intestinalis DSM
17393]
Length = 1061
Score = 335 bits (859), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 212/593 (35%), Positives = 314/593 (52%), Gaps = 46/593 (7%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
TS N +K+ + PA+ + +A+P+GN RLGAMV+GG E L+LNE+T W G P + NP
Sbjct: 265 TSAQN-MKLWYARPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 323
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
+ L ++R L+ G+ EA + + P Y +G + L F H +E Y
Sbjct: 324 KGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTMGSLFLNFP-GHENPSE--Y 380
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+L+L ATA +Y V V+F R F+S D VI+ +I ++ +L+F VS S L +
Sbjct: 381 YRDLNLENATATTRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAVSYSSPLKSD 440
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
V G II C G A P ++ +++K G +S E L V
Sbjct: 441 VQVKGGKLII---SCQG------AEHEGIPAAMRAECQVQVKTD---GKVSKAESA-LAV 487
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ L + A+++F +N D + + + + LQ + Y H+ Y+K
Sbjct: 488 NGATEVTLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 543
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RV++ L + + + + RV+ F D ++ L+FQ+GRYLLISS
Sbjct: 544 YDRVALTLEST------------GVSALETPVRVQRFIEGNDMAMAALMFQYGRYLLISS 591
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN L WDS +NIN EMNYW + NLSE EPLFD +T L++
Sbjct: 592 SQPGGQPANLQGIWNHSLYAPWDSKYTININAEMNYWPAEVTNLSETHEPLFDMVTDLAV 651
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
GS+TA+V Y A GWV HH TDIW ++ + +WP GGAW+ HLW+HY +T D+
Sbjct: 652 TGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAASFGMWPNGGAWVAQHLWQHYLFTGDK 710
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL K+ YP+L+G A F L L+E H Y + T PS SPEH + G ++ TM
Sbjct: 711 EFL-KKYYPILKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 765
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
D I + + + A+ +L D L E L++ L +L P +I + + EW+
Sbjct: 766 DNQIAFDALYSTLLASRIL--GGDKLYEDSLQAMLDKLPPMQIGKHNQLQEWL 816
>gi|224535714|ref|ZP_03676253.1| hypothetical protein BACCELL_00578 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522669|gb|EEF91774.1| hypothetical protein BACCELL_00578 [Bacteroides cellulosilyticus
DSM 14838]
Length = 822
Score = 335 bits (858), Expect = 6e-89, Method: Compositional matrix adjust.
Identities = 217/598 (36%), Positives = 333/598 (55%), Gaps = 45/598 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
ES + K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 23 ESRLSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPATEQIQLNEETIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPNALEYIPRVRDLVFAGKYLEAQTLATEKVMAKSNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y Y REL L++A V+Y V V++ RE +S DQVI+ +++ + G ++FN L
Sbjct: 139 YT--NYYRELSLDSARTLVRYEVDGVQYRRETITSFTDQVIMVRLTANRPGRITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V ++ EG C + ++ ++ KG ++F L + + R T +
Sbjct: 197 S---PHQDVVITSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTARNTGGRMTCA-- 246
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A++ + +++F+ N D +P + L S+++ H
Sbjct: 247 -DGVLSVEGADEAIVYVSIATNFN----NYQDITGNPAERAKDYLVRAMTHSFTEARKNH 301
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
D Y++ RVS+ L + + V + +RV++F+ D LV FQFG
Sbjct: 302 TDFYRRYLTRVSLDLG------------DNRYEHVTTDKRVENFKQTNDAHLVATYFQFG 349
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF
Sbjct: 350 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFR 409
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S G +TA++ Y A+GWV+HH TDIW + A K LWP GGAWLC HLWE
Sbjct: 410 LIREVSETGKETARIMYGANGWVLHHNTDIWRITGA-VDKAPSGLWPSGGAWLCRHLWER 468
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L F + ++ E +L PS SPE+ +GK +
Sbjct: 469 YLYTGDTEFL-RSVYPILRESGRFFDEIMVKEPAHNWLVVCPSNSPENVHSGSNGK-STT 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ T+D +I ++++AII+A+++L+ + A ++ + L + P ++ G + EW+
Sbjct: 527 AAGCTLDNQLIFDLWTAIIAASDILDTDR-AFAARLSQRLREMAPMQVGRWGQLQEWM 583
>gi|383122650|ref|ZP_09943342.1| hypothetical protein BSIG_0605 [Bacteroides sp. 1_1_6]
gi|382984352|gb|EES70332.2| hypothetical protein BSIG_0605 [Bacteroides sp. 1_1_6]
Length = 822
Score = 334 bits (857), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 217/598 (36%), Positives = 336/598 (56%), Gaps = 45/598 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E + K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P +
Sbjct: 23 EKKVSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGIEQIQLNEETIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR L+ +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPNALEYIPKVRELIFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H N++ EG C + ++ ++ KG ++F L + ++G A
Sbjct: 197 S---PHQDAMINSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTAR---NQGGKIAC 245
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A + + +++F+ N D + T + S L +++ H
Sbjct: 246 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVHPFAEAKKNH 301
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++ Y++ RVS+ L E+ V + +RV++F+ D LV FQFG
Sbjct: 302 VEFYRQYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 349
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLS+ EPLF
Sbjct: 350 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 409
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S +G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 410 LIKEVSESGKETARIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 468
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L+G F + ++ E +L PS SPE+ DGK A
Sbjct: 469 YLYTGDTEFL-RSVYPILKGSGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGNDGK-ATT 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ TMD +I ++++AIISA+ +L+ +++ + + L + P ++ G + EW+
Sbjct: 527 AAGCTMDNQLIFDLWTAIISASRILDTDKE-FAAHLEQRLKEMAPMQVGHWGQLQEWM 583
>gi|408370425|ref|ZP_11168202.1| hypothetical protein I215_05947 [Galbibacter sp. ck-I2-15]
gi|407744183|gb|EKF55753.1| hypothetical protein I215_05947 [Galbibacter sp. ck-I2-15]
Length = 792
Score = 334 bits (856), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 205/588 (34%), Positives = 311/588 (52%), Gaps = 39/588 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ A+ + A+P+GNGRLGAM++G E L+LNED++W G P + + L +R
Sbjct: 35 YEQAAEDWMQALPVGNGRLGAMIFGNPDIEHLQLNEDSMWPGGPTLGDSKGTVEDLVALR 94
Query: 77 SLVDSGQYAEATAASVKLFGH--PADVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
+L+D G+ +A V F H +Q GD+ L+F + E T Y R LDL+ A
Sbjct: 95 ALIDQGKVHQADKFIVDKFSHLEVTRSHQTAGDLFLDFK----RKGEVTDYYRGLDLDKA 150
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-----YVN 188
A V Y V +FT + +SN D ++ + + L F++ L +D + +
Sbjct: 151 VATVSYKVDGDQFTEKIIASNVDDALIISLETTAEKGLDFDIQLSRPMDKSAPTVLVTTH 210
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
++++IM+G + + +G++F ++ + + GTI D L++ G
Sbjct: 211 NSDELIMDGMVTQRGGVVENKPYPMQEGVEFQT--RLRATTEGGTIEP-SDGILELRGVR 267
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
AV+ LV +SF +D +++ L + + S+ +L RH D+ + + RV
Sbjct: 268 KAVIYLVTKTSF---------YHQDFKAKAQENLNEVASKSFDELLRRHSQDFGEFYDRV 318
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
+ L S ++D++P+ +R++ ++ + D L LF +GRYLLISSSR
Sbjct: 319 NFSLGSS------------DLDSLPTDKRLQRYKDGQVDLDLQTKLFDYGRYLLISSSRE 366
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQGIWN +S W++ H+NINL+MNYW S+ NLSE Q+PLFDF L G
Sbjct: 367 GTNPANLQGIWNNHISAPWNADYHLNINLQMNYWPSMVANLSELQQPLFDFSDRLLQRGK 426
Query: 428 KTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
KTA+ Y + G V+HH TD+WA + + W W GG WL H W+HY +T D DF
Sbjct: 427 KTAKEQYGIQRGAVMHHTTDLWAPAFMFSSQPYWGSWIHGGGWLAQHYWDHYRFTQDADF 486
Query: 487 LEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
LE RAYP ++ A F +DWL + G + P TSPE+ ++A DGK A VS + M
Sbjct: 487 LENRAYPFMKEIALFYMDWLQKDATTGKWVSYPETSPENSYLAADGKPAAVSKGAAMGHQ 546
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
II EVF +SAA+VL N++ E K + EDG I+EW
Sbjct: 547 IIAEVFDNALSAAKVLNINDEFTQELKAKRADLTPGIVLGEDGRILEW 594
>gi|325105288|ref|YP_004274942.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324974136|gb|ADY53120.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 826
Score = 334 bits (856), Expect = 9e-89, Method: Compositional matrix adjust.
Identities = 217/595 (36%), Positives = 314/595 (52%), Gaps = 60/595 (10%)
Query: 13 LKITFNGPA--KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
LK+ +N P + A+PIGNGRLGAMV+G E L+LNE+T+W G P N A +
Sbjct: 39 LKLWYNKPVIDNVWEQALPIGNGRLGAMVYGIPQREQLQLNEETIWGGGPYRNDNNKALE 98
Query: 71 ALSDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYR 125
L V+ +V GQ EA + F G P +Q G + L F H +Y E Y
Sbjct: 99 VLPLVQKMVFDGQTQEADKLINQSFFTQTHGMP---FQTAGSLILNFP-GHNQY--ENYY 152
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
RELDLN A + Y+V V++TRE FSS D VI+ +++ SE G L+F++ + H+
Sbjct: 153 RELDLNKAVVKTTYTVNGVKYTREVFSSFTDDVIIMQLTSSEKGGLNFDIGYVNP-SQHT 211
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK--ISDDRGTISALEDKKLK 243
+N +++EGR D +GI+ +I +S G + A+ D K+
Sbjct: 212 VSKKDNSLVLEGR------------GSDHEGIEGKIRYQIHTLVSHADGHV-AVSDHKIN 258
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ + A + + ++F N +P + S L + ++ +H Y K
Sbjct: 259 ITEASSATIYISIGTNF----TNYKSVDANPAERAASKLAVAKKKNFKSALQQHSATYYK 314
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F R + L D EE P+ R+++F+ +DP+LV LL QFGRYLLIS
Sbjct: 315 QFGRFKLNLGSQ------DISKEE-----PTDVRIRNFKETQDPALVTLLTQFGRYLLIS 363
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q +NLQGIW + P WDS +NIN EMNYW + NLS+ EPLF L LS
Sbjct: 364 SSQPGGQPSNLQGIWCNSMHPAWDSKYTININTEMNYWPAEVTNLSDTHEPLFQMLKDLS 423
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+G +TA+ Y A GWV HH TDIW +S +WP GGAWL HLWEHY +T D
Sbjct: 424 ESGRETAKTLYGADGWVAHHNTDIWRVTSPIDFAAA-GMWPTGGAWLSQHLWEHYLFTGD 482
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
R FL + AYP+L+G A F L +LIE + G++ +PS SPEH ++ T
Sbjct: 483 RKFLAE-AYPILKGSADFFLSFLIEHPKYKGWMVVSPSISPEH---------GPITAGVT 532
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWVQ 595
MD ++ +V + + A E+L K+ + + LKS+ R+ P +I + + EW++
Sbjct: 533 MDNQLVFDVLTRTVVAGEMLGKDTNYIAR--LKSMAKRIPPMQIGKYTQLQEWLE 585
>gi|160886122|ref|ZP_02067125.1| hypothetical protein BACOVA_04129 [Bacteroides ovatus ATCC 8483]
gi|423286896|ref|ZP_17265747.1| hypothetical protein HMPREF1069_00790 [Bacteroides ovatus
CL02T12C04]
gi|156108935|gb|EDO10680.1| hypothetical protein BACOVA_04129 [Bacteroides ovatus ATCC 8483]
gi|392674434|gb|EIY67882.1| hypothetical protein HMPREF1069_00790 [Bacteroides ovatus
CL02T12C04]
Length = 822
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 218/602 (36%), Positives = 335/602 (55%), Gaps = 53/602 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WV 594
W+
Sbjct: 582 WM 583
>gi|325103196|ref|YP_004272850.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324972044|gb|ADY51028.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 821
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 203/600 (33%), Positives = 326/600 (54%), Gaps = 44/600 (7%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ NA + LK+ ++ P++++ +A+PIGNGRLGAMV+G E ++LNE+T+W+G P
Sbjct: 15 VANANAQQHDKTLKLWYDAPSRNWNEALPIGNGRLGAMVFGNPDREKIQLNEETVWSGGP 74
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLF--GHPADVYQLLGDIELEFDDSHL 117
++ A+ +R L+ ++ EA A A V +F + +YQ +GD+ + F H
Sbjct: 75 NTNITAESGAAIPKLRQLIFEEKFLEAQALADVDMFPKKNSGMIYQPVGDLLINFP-GHA 133
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ E Y R+L++ A V Y + V + RE F+S PDQVI+ +++ + ++FN SL
Sbjct: 134 QV--EKYYRDLNIEKAVTTVSYRLNGVNYKRETFASFPDQVIIVRLTADKPNKITFNASL 191
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
S ++ + N ++I+ G A+ + I+F ++ K+ +G + L
Sbjct: 192 TSPQNSAQKIE-NGKLILTGLT--------ADHEGEKGQIKFETQVKTKV---KGGKAEL 239
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
KV ++ A++ + +++F + +D + ++ + L +Y D +H
Sbjct: 240 TGSLWKVTNANEAIIYISMATNF----VKYNDISGNQHVKASNYLDKAFVKNYDDALKQH 295
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
+ YQ+ F+RV D+ + + P+ R+ F DP L L FQFG
Sbjct: 296 IAFYQQYFNRVKF-------DVGVNASVNK-----PTDRRIYEFAKSFDPHLAALYFQFG 343
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q LQGIWN+ + WDS +NIN EMNYW + NLSE +PLF+
Sbjct: 344 RYLLICSSQPGNQPPTLQGIWNDRMDAPWDSKYTININTEMNYWPAEVTNLSELHQPLFN 403
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWE 476
L L++ G TAQ Y A GWV HH TD+W + DR LWPMGG WL HLW+
Sbjct: 404 MLEDLAVTGQATAQSMYGAKGWVTHHNTDLWRITGPVDRPYA--GLWPMGGNWLSQHLWD 461
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLAC 535
HY +T ++DFL K+ YP+L+G + F LD L E +L +PS SPE+ ++ +GK
Sbjct: 462 HYQFTGNKDFL-KKYYPVLKGASDFYLDILQEEPKHKWLVVSPSNSPENTYV--EGKRVS 518
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
++ +TMD ++ ++FS AAE+L ++D +LK + RL P +I + + EW+
Sbjct: 519 IAAGTTMDNQLLFDLFSKTAKAAEILGIDKD--YSTLLKQKINRLAPMQIGKYSQLQEWM 576
>gi|332663343|ref|YP_004446131.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332332157|gb|AEE49258.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 818
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 202/597 (33%), Positives = 316/597 (52%), Gaps = 43/597 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA+ + +A+P+GNGRLGAMV+G E ++LNE+T WTG P + L +++
Sbjct: 37 YKEPAQKWEEALPVGNGRLGAMVFGKSGEERIQLNEETYWTGGPYSTVVKGGHEVLPEIQ 96
Query: 77 SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
V G+ +A + G+P + YQ L ++ L F ++ Y+R LDL T
Sbjct: 97 KYVFEGKMLKAHNLFGRRTMGYPVEQQKYQSLANLHLFFAEAE---PATVYKRWLDLETG 153
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
V+Y V V + R+ F S PDQV+V +++ SE+ +SF +L + + G +
Sbjct: 154 ITSVEYRVQEVRYRRDVFVSAPDQVVVLRLTASEAQKISFKANLRGVRNPAHSNYGTDYF 213
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSDWAV 251
M+ G+ + D G++ E +K+ + GT+ +D L VE +D
Sbjct: 214 TMDPY--GQDGLMLKGKSSDYLGVEGKLRFEGQVKVVAEGGTVRT-DDVDLWVEKADAVT 270
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ A+++F +N D DP + + +++ SY + + D+QK F R ++Q
Sbjct: 271 VYFTAATNF----VNYHDVSADPHARVEAVWKNMAGKSYPQIRDAAVKDHQKYFQRTTLQ 326
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
L + + P+ ER+ + Q DPSL L + FGRYLLI SSRPGTQ
Sbjct: 327 LEIAASSYL------------PTNERMLNIQKTADPSLAALCYNFGRYLLIGSSRPGTQP 374
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN D++P WDS NIN EMNYW + NL EC EPL + L GS+ A+
Sbjct: 375 ANLQGIWNNDMNPAWDSKYTTNINTEMNYWPAETGNLPECVEPLIQMVKELMDQGSQVAK 434
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
+Y GWV H TD+W + +A W + GGAWLCT LWEHY ++MD+++L K
Sbjct: 435 EHYGCRGWVFHQNTDLW-RVAAPMDGPSWGTFTTGGAWLCTQLWEHYLFSMDKEYL-KEI 492
Query: 492 YPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKL------------ACVSY 538
YP+++G F +D+L+E D +L TNPSTSPE+ +P + + Y
Sbjct: 493 YPVMQGSVQFFMDFLVETPDKKWLVTNPSTSPENFPASPGNQPYFDEVTGMNLPGTTICY 552
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S++DM I+ ++F + A+ +L+ +++ KV + R P +I +DG++ EW +
Sbjct: 553 GSSIDMQILSDLFGYYVQASALLQVDQE-FAAKVAAARKRFPPPQIGKDGALQEWAE 608
>gi|336428235|ref|ZP_08608219.1| hypothetical protein HMPREF0994_04225 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336006471|gb|EGN36505.1| hypothetical protein HMPREF0994_04225 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 721
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 211/597 (35%), Positives = 312/597 (52%), Gaps = 61/597 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + + A+ + +++PIGNG LGAM+ GG E L LNE+++W+G D N A L
Sbjct: 4 MMLWYEKSAERWEESLPIGNGSLGAMILGGAEEEILGLNEESVWSGYYKDKNNAKAADCL 63
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAE-ETYRRELDL 130
+VRSLV SG+ EA + G + Y LG+++L+F K + E YRR+LDL
Sbjct: 64 EEVRSLVFSGKNKEAERLIQNNMLGEYNESYLPLGNLKLKFAYGIGKEGKAEGYRRQLDL 123
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSYVNG 189
A A+V Y+ V + RE+F+S P + I ++ ++ + F VS S L S +G
Sbjct: 124 ENAVAQVSYTCNEVHYQREYFASYPAKAIFVLLT-ADKPVMDFTVSFISQLCLAVSAEDG 182
Query: 190 NNQIIMEGRCPGKRIPP-----KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
Q+ GRCP P + + KG+Q +A E ++ G + E++ L V
Sbjct: 183 ALQVT--GRCPEHVDPSYLPEREGSVVQGTKGMQVNA--EFRVVSCDGQVRE-EEEMLHV 237
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ +L+L A P + P N+ Y L H+ DY+ +
Sbjct: 238 SGASRCLLMLSAMR----PPVLPD------------------NMDYEALKAAHIQDYRSI 275
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ +V + L KD+ T EE ++ + E ED L L FQ+GRYLLI+S
Sbjct: 276 YDKVELYLGEQ-KDLPT----EERLELLKKGE--------EDNGLYGLFFQYGRYLLIAS 322
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SR G+ ANLQGIW+ +L W S +NIN +MNYW +L CNL EC EP F+ +S
Sbjct: 323 SREGSLPANLQGIWSWELRAPWSSNWTININTQMNYWHALSCNLEECLEPYIRFVERVSE 382
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSS----------ADRGKVVWALWPMGGAWLCTHL 474
G KTA VNY G V HH D W +S + G V WA WPMGGAWL +
Sbjct: 383 EGKKTAAVNYRCRGSVAHHNVDYWGNTSPVGVPQGEKAGEDGCVNWAFWPMGGAWLTQEI 442
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
+ Y Y+ D ++L+ A P++ A FL DWL+E + G T PSTSPE++F PDG++
Sbjct: 443 FRAYEYSGDEEYLKNTAAPIIREAALFLNDWLVE-YQGEWVTCPSTSPENQFRLPDGQIT 501
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
++Y+S MDMAI++EVF+ E+L +D L ++ + +P L P + G ++
Sbjct: 502 GLTYASAMDMAIVKEVFTHYCRICEIL-GAQDELYREICEKMPCLAPFRTGSFGQLL 557
>gi|425767412|gb|EKV05986.1| hypothetical protein PDIG_81830 [Penicillium digitatum PHI26]
gi|425779681|gb|EKV17720.1| hypothetical protein PDIP_30190 [Penicillium digitatum Pd1]
Length = 740
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 213/585 (36%), Positives = 300/585 (51%), Gaps = 45/585 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA+ + A+P+GNGRLGAMV+G +E L+LNED++W G P D DA + L +R
Sbjct: 6 YQQPAEDWNSALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQDRNPQDALEYLPRLR 65
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ + +AEA A + F +P Y+ LG++ L D H YRR LDL A
Sbjct: 66 EAIRAENHAEAEKIAKLAFFANPISQRNYEPLGNLFL--DLGHNPSQVTGYRRSLDLARA 123
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG-NNQ 192
TA V+Y + F RE +SNPD V+ ++ S F V L + D N +
Sbjct: 124 TAHVRYEYQGICFEREVLASNPDDVLAIRLHSSSKAE--FVVRLTRMSDVEFETNEWLDD 181
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
I G + P + + ++ ++ GTI+ + K L V +D +L
Sbjct: 182 ISASGNSITMHVTPGGKNSS-----RVCCVVSVRCDGADGTITKI-GKNLVVNSTD-TLL 234
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
++ A ++F +D + + LS DL TRH DYQ L+ R+ +QL
Sbjct: 235 VIAAQTTF---------RHEDIDQRTKQDAEIALGLSLKDLRTRHTADYQSLYDRMELQL 285
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV- 371
+I TD +R+KS DP L+ L + RYLLIS SR G +
Sbjct: 286 GPGSPEIPTD-------------QRLKS---SRDPGLIALYHNYSRYLLISCSRDGHKSL 329
Query: 372 -ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN P W S NINL+MNYW + CNLSEC+ PLFD L + G TA
Sbjct: 330 PANLQGIWNPSFHPAWGSRFTTNINLQMNYWSANVCNLSECEFPLFDLLERMVEPGKTTA 389
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
Q+ Y GW H TDIWA ++ + ++WP+GGAWLC H+W+H+ YT D FL +R
Sbjct: 390 QIMYGCRGWTAHSNTDIWADTAPVDRWMPASIWPLGGAWLCYHIWDHFQYTCDEVFL-RR 448
Query: 491 AYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
+P L GC FLLD+LI +G YL T+PS SPE+ F G+ + ST+D+ II
Sbjct: 449 MFPTLRGCVEFLLDFLIVDANGAYLITSPSASPENSFYDHKGQKGVLCEGSTIDIQIIDA 508
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ A S + L+ +DAL+ V + RL P KI+ G + EW
Sbjct: 509 ILGAFQSCTKKLDL-QDALLPAVYATKSRLPPLKISPAGYLQEWA 552
>gi|305665057|ref|YP_003861344.1| hypothetical protein FB2170_02120 [Maribacter sp. HTCC2170]
gi|88709809|gb|EAR02041.1| hypothetical protein FB2170_02120 [Maribacter sp. HTCC2170]
Length = 787
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 198/600 (33%), Positives = 326/600 (54%), Gaps = 56/600 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA---PK 70
K+ + PAK + A+P+GNGRLGAMV+G E ++LNED++W PG+ PD
Sbjct: 30 KLWYGKPAKEWMQALPVGNGRLGAMVFGDPNHERIQLNEDSMW---PGEADWPDYRGNSD 86
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRREL 128
L ++R+L++ G+ E + V+ F + V +Q +GD+ ++F++ + E Y R L
Sbjct: 87 DLEEIRNLLNEGKTGEVDSLIVEKFSYKTIVRSHQTMGDLYIDFENER---SVENYTRSL 143
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
+LN A Y G ++++ FSS PD V+V ++S + + F + ++ D+
Sbjct: 144 NLNDALITAAYQSGGNSYSQKVFSSKPDDVMVIELSTDATDGMDFTLRMNRPTDD----- 198
Query: 189 GNNQIIM----EGRCPGKRIPPKANANDDPK------GIQFSAILEIKISDDRGTISALE 238
GN + E K + + + D K G++F L ++ ++ GT++A +
Sbjct: 199 GNATVTTRNPSESEISMKGVVTQYSGKRDSKSFPLDYGVKFETRL--RVHNEGGTVTA-D 255
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+L ++G ++ LV ++SF ++ T +++ L+ + N S+ L H
Sbjct: 256 KGQLTLKGVKTVLIHLVGNTSFY--------HGENYTKKNLETLEKVNNSSFKTLLKNHT 307
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFG 357
DY++L++RV + L +D++P R++ + ++DP L LF++G
Sbjct: 308 KDYEELYNRVGLDLGG------------RELDSLPIDARLQRIKEGNDDPDLAAKLFKYG 355
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSR GT ANLQGIWNE ++ W++ H+NINL+MNYW + NLSE +P F+
Sbjct: 356 RYLLIASSRQGTNPANLQGIWNEHITAPWNADYHLNINLQMNYWPAEVANLSELHQPFFE 415
Query: 418 FLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+L + G TA+ Y + G + HH +D+WA + W W GG W H WE
Sbjct: 416 YLDRVLERGKNTAKKQYGINRGTMAHHASDLWATPFMRAERAYWGSWVHGGGWCAQHYWE 475
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLA 534
HY YT D++FL+ RAYP+L+G + F LDWL+ E ++ ++P TSPE+ + DG A
Sbjct: 476 HYRYTEDKEFLKNRAYPVLKGISEFYLDWLVWDETSKAWV-SSPETSPENSYFNADGNSA 534
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
VS+ S M II EVF ++ AA+VL +D ++V +L P + +DG ++EW
Sbjct: 535 AVSFGSAMGHQIIAEVFDNVLEAAKVL-GIQDEFTKEVKAKREKLFPGIVVGDDGRLLEW 593
>gi|149196081|ref|ZP_01873137.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
gi|149140928|gb|EDM29325.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
Length = 790
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 210/605 (34%), Positives = 318/605 (52%), Gaps = 72/605 (11%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + A+ F ++PIGNGRLGAMV+G V E + +NE+++W+G + P K L+
Sbjct: 28 KLWYKQAAQGFEQSLPIGNGRLGAMVFGDVDEERIVINEESVWSGSKVENNIPVGYKHLA 87
Query: 74 DVRSLVDSGQYAEAT---------------AASVKLFGHPADVYQLLGDIELEFDDSHLK 118
+R L+ ++ EA A + FG YQ+LG+I L+F + K
Sbjct: 88 KIRQLLGEEKFTEANKLMKQAFKVKNAPKYAKGISAFGR----YQVLGNIHLKFLGNKAK 143
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ Y+RELDLN+A A V Y G +FTREHF S PD+V V++ SG +SF++S+D
Sbjct: 144 VSQ--YKRELDLNSALATVNYQAGKQQFTREHFVSAPDEVFVSRFSGP----ISFSISMD 197
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ V ++++M G ND + + + +++ I A +
Sbjct: 198 RPERFKTSVVNKHELLMTGAL-----------NDGFEKDGLTYVARLRVIAPNAKIKA-D 245
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
KL VE + +LLL A++ + G DP + L S+++L
Sbjct: 246 GNKLIVESQEEVMLLLAAATDYRGI---AGRQLSDPFKATSEDLDKAEKKSFTELRQAQK 302
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
D++K + RV + L+ E + +P+ +R+ +++ + DP+L L F G
Sbjct: 303 ADHEKYYRRVKLNLA------------ESHNSALPTDQRLAAYRKGKADPALAALFFNVG 350
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RY LISSSRPG ANLQGIW E++ W+ H NIN +MNYW +L CN+ E QEP+ +
Sbjct: 351 RYFLISSSRPGGLPANLQGIWAEEVHTMWNGDYHFNINTQMNYWPALSCNMVEMQEPMNN 410
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIW---AKSSADRGKVVWALWPMGGAWLCTHL 474
F+ L GSKTA+ Y + GW+ H T+IW A + D G G AWLC HL
Sbjct: 411 FIASLVEPGSKTAKAYYDSPGWIAHRLTNIWGYTAPAGMDIG---------GPAWLCEHL 461
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKL 533
WE Y YT+DR+FL K YP+++ F L L E + +L T PS SPE+ F P K
Sbjct: 462 WEQYAYTLDREFL-KSVYPIMKSSIDFYLHNLWEEPENKWLVTGPSASPENGFKLPGNKR 520
Query: 534 --ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSI 590
+ + T+DM +RE+F + AA++L DA ++K L + PRL P +IA DG +
Sbjct: 521 GGSGICAGPTIDMQQLRELFGNTLRAAKIL--GIDAELQKELAEKRPRLAPNQIAPDGVL 578
Query: 591 MEWVQ 595
EW++
Sbjct: 579 QEWLK 583
>gi|340347371|ref|ZP_08670480.1| alpha-L-fucosidase [Prevotella dentalis DSM 3688]
gi|433651138|ref|YP_007277517.1| hypothetical protein Prede_0088 [Prevotella dentalis DSM 3688]
gi|339609463|gb|EGQ14335.1| alpha-L-fucosidase [Prevotella dentalis DSM 3688]
gi|433301671|gb|AGB27487.1| hypothetical protein Prede_0088 [Prevotella dentalis DSM 3688]
Length = 784
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 212/591 (35%), Positives = 303/591 (51%), Gaps = 38/591 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PL++ ++ PA F +++PIGNG+LGA+++GG + LN+ T W+G P D T + DA
Sbjct: 26 PLRLWYDRPATCFEESLPIGNGKLGAIIYGGPDDNVIHLNDITFWSGKPVDLTIDSDAHV 85
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELD 129
+ +R + Y A + + G + YQ LG + + L+ E + Y R+L
Sbjct: 86 WIPKIREALFREDYRLADSLQHHVQGANSQYYQPLGTLRIR----DLQPGEASGYHRQLS 141
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L++A +Y G V +TRE+F+S PD+VI ++ S G LS ++ L S +D H
Sbjct: 142 LDSAVCHDRYVRGGVTYTREYFASAPDKVIAVRLRASRPGMLSCSIGLGSQVD-HGTKTS 200
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+ QIIM G NA DP+ I F +L ++S+D G++ D L V G++
Sbjct: 201 DRQIIMTG-----------NAAGDPQETIHFCTVL--RVSNDGGSVER-TDSSLVVTGAN 246
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A + LV +SF+G +P +M + N S L RHLDDYQ +FHRV
Sbjct: 247 GATIYLVNETSFNGYDKHPVTQGTPYIENAMDDAWHLANYSCDSLLRRHLDDYQPIFHRV 306
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
S L S + T S R Q D L L FQFGRYLLISSSR
Sbjct: 307 SFTLDGSRYNATQPT---------DSMLRAYGSQPAYDRYLEALYFQFGRYLLISSSRTP 357
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQG+WNE W +NINLE NYW N+ E PL F L+ G++
Sbjct: 358 GVPANLQGLWNEKKKAPWRGNYTININLEENYWPCDVANMPEMFAPLATFCQNLAQTGAQ 417
Query: 429 TAQVNY-LASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
A+ Y + GW H +DIWA ++ R W+ W MGGAWL ++++HY YT DR
Sbjct: 418 NARNYYGIGRGWSCGHNSDIWAMTNPVGEKRESPTWSNWNMGGAWLMQNVYDHYLYTQDR 477
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D+L AYPL+ G + F+LDWL+ + L T PSTSPE ++ G Y T
Sbjct: 478 DYLSGTAYPLMRGASDFILDWLVPNPRNPEELITAPSTSPEAYYVTDKGYKGATLYGGTA 537
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
D+AIIRE+ + + AA L ++ A + + +L RL P + G + EW
Sbjct: 538 DLAIIRELLTNTLEAARTLNRDR-AYQDTLRHTLARLHPYTVGRQGDLNEW 587
>gi|448410558|ref|ZP_21575263.1| alpha-L-fucosidase [Halosimplex carlsbadense 2-9-1]
gi|445671594|gb|ELZ24181.1| alpha-L-fucosidase [Halosimplex carlsbadense 2-9-1]
Length = 822
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 213/610 (34%), Positives = 312/610 (51%), Gaps = 56/610 (9%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
T+ ++ ++ PA + +A+P+GNGRLG MV G E + LN+D LW G D T P
Sbjct: 20 THDDRLWYDAPATEWVEALPVGNGRLGGMVHGRPARERVALNDDRLWVGDHADRTADGGP 79
Query: 70 KALSDVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
L VR + G++ A +LF G V YQ LGD+ + D + YRR
Sbjct: 80 DDLDAVRECLWDGEFERAQRLCNELFVGDLTGVAPYQPLGDLLI---DCPAHDDPDEYRR 136
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL +RV+Y+VG F RE F+S PD V+ +I ESG++ V LD +
Sbjct: 137 SLDLRAGVSRVEYTVGGTRFERECFASEPDGVLAMRIEADESGAVDARVRLDRDRSARTT 196
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG--IQFSAILEIK----------------IS 228
V ++ +++ G+ P + + DP G +F A ++ I
Sbjct: 197 VV-DDTVVLRGQVIDL---PGDDESVDPGGWGQRFEARARVRAEGGIVAAAADEAAPSIG 252
Query: 229 DDRGTI--SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
D G +A + V G+D ++L A + PSD DP E AL +
Sbjct: 253 DGDGEREGAAYGTDGIVVAGADAVTVVLTAG-------VAPSDG--DPRDECREALAGVA 303
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED 346
+ Y+ + RH+ D+++ RV + L P D D E +D V ER D
Sbjct: 304 DDDYAAIRERHVADHREHMDRVDLDLG-EPVDAPVD----ERLDRVRDGER--------D 350
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
P L +L Q+GRYLL+ SSRPGT ANLQGIWNE+ P WDS ++NLEMNYW +
Sbjct: 351 PHLAQLYVQYGRYLLLGSSRPGTLPANLQGIWNEEFHPPWDSDYTQDVNLEMNYWHAEVA 410
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
NL EC +PL +F+ G +TA+ Y G+ H +D W ++A W WPMG
Sbjct: 411 NLRECADPLVEFVDESREPGRETARERYGCEGFTTHLHSDRW-HTTAQTADAHWGHWPMG 469
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHE 525
AWLC +LWE Y ++ DR+ LE R YP+L A FLLD+L+E + +L T PS SPE++
Sbjct: 470 AAWLCQNLWERYAFSGDREDLE-RIYPILREAAEFLLDYLVEHPEEEWLVTAPSASPENQ 528
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
F DG+ A MD+ + R++F + AAE L+++ D E + ++L RL P +
Sbjct: 529 FRTADGQEATTCVMPAMDIQLTRDLFGHCVEAAETLDRDADFAAE-LAEALERLPPMGVD 587
Query: 586 EDGSIMEWVQ 595
+ G++ EW++
Sbjct: 588 DRGALREWLR 597
>gi|395213355|ref|ZP_10400162.1| alpha-L-fucosidase [Pontibacter sp. BAB1700]
gi|394456724|gb|EJF10981.1| alpha-L-fucosidase [Pontibacter sp. BAB1700]
Length = 827
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 214/591 (36%), Positives = 321/591 (54%), Gaps = 47/591 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA ++ +A+P+GNGRLGAMV+ E L+LNE+T+W G PG+ P AL
Sbjct: 32 KLWYKQPAANWNEALPLGNGRLGAMVFSQPAREQLQLNEETVWAGEPGNNVLPALNSALP 91
Query: 74 DVRSLVDSGQYAEAT-AASVKLFGHPAD------VYQLLGDIELEFDDSHLKYAEETYRR 126
++R L+ +G++ EA A KL PA YQ +G++ + F H + + Y R
Sbjct: 92 EIRQLIAAGKHKEAQDLAMEKLPRQPAADNNYGMPYQPVGNLFISFP-GHEQATD--YYR 148
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
+LD+ A + V Y V V F RE FSS D V++ ++S + S++F +S DS N++
Sbjct: 149 DLDIQQAISTVYYKVNGVTFKREMFSSFTDDVVIVRLSADKPKSINFTLSADSPHKNYTV 208
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVE 245
NQ+I+ G + D+ KG ++F ++E + + G I++ + ++V
Sbjct: 209 RTRGNQLILSG---------VSGDVDNKKGKVKFQTLVEPET--EGGKITSTPEG-VQVS 256
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G++ A L + ++F + D D +++ L S Y H Y+ +
Sbjct: 257 GANAATLYISIGTNFK----SYRDLSGDGEAKAAKLLSSAVKKKYKKAKAEHTAFYRNYY 312
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R S+ L + D+ P+ ER+ +F DP L L FQFGRYLLISSS
Sbjct: 313 DRASLNLGTT-ADLQK-----------PTDERLAAFARSNDPHLAALYFQFGRYLLISSS 360
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PGTQ ANLQGIWN+ ++P WDS VNIN EMNYW + NLSE PLF L LS +
Sbjct: 361 QPGTQPANLQGIWNDKIAPPWDSKYTVNINTEMNYWPAEVTNLSEMHGPLFSMLKDLSES 420
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G ++A Y A GW++HH TDIW + G + +WPMGGAWL HLW+HY YT D+
Sbjct: 421 GRESASKMYGARGWMMHHNTDIWRITGPIDG-AFYGMWPMGGAWLTQHLWQHYLYTGDQK 479
Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL K YP+L+G A F D L E + +L +PS SPE++ + +S +TMD
Sbjct: 480 FL-KVVYPVLKGSAMFYADVLQEEPTNKWLVVSPSMSPENKHQSG----VSISAGTTMDN 534
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+I ++FS +I AEVL ++ A + + RL P +I + + EW++
Sbjct: 535 QLIFDLFSNVIRTAEVLNTDQ-AFADSLRTMRDRLPPMQIGQHNQLQEWLR 584
>gi|336412946|ref|ZP_08593299.1| hypothetical protein HMPREF1017_00407 [Bacteroides ovatus
3_8_47FAA]
gi|335942992|gb|EGN04834.1| hypothetical protein HMPREF1017_00407 [Bacteroides ovatus
3_8_47FAA]
Length = 799
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 201/594 (33%), Positives = 328/594 (55%), Gaps = 51/594 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKA 71
+++ + PAK + ++PIGNGR+GAMV+GG+ ET+ LNE ++W+G + P +
Sbjct: 29 VELWYEQPAKEWMSSVPIGNGRIGAMVFGGIEEETIALNESSMWSGQYDENQEIPFGKER 88
Query: 72 LSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEET---YR 125
++++R L G+ E + + GH + +GD++L F Y E T YR
Sbjct: 89 MNELRKLFFEGKIQEGNQIAGEFLHGNGHSFGTHLPIGDLKLTFS-----YPENTVSNYR 143
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL TA + Y++G+V + RE F++NPD V+V ++S S+ +++ +SL L ++
Sbjct: 144 RSLDLGTAISTTNYTIGDVNYVRECFATNPDDVLVLRMSASKKKAINAKLSLSMLRESEI 203
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+GN Q+I EG P + P G+ F I IS GT+ A ED + V
Sbjct: 204 STDGN-QLIFEGTV---NFPKQG-----PGGVSFQG--RIAISAPNGTLQA-EDSSISVN 251
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+D +++ +++ +D+ K E++ + +Y L HL+DY LF
Sbjct: 252 DADMLTIVIDVRTNYK------NDAYKSLCKETVVKAEK---KTYEKLKKTHLNDYTPLF 302
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RVS+QL T + T E+VK + DP L LLFQ+GRYLL++SS
Sbjct: 303 DRVSLQLG---------TGEYAGLPTDKRWEQVK--KGGYDPGLDVLLFQYGRYLLLASS 351
Query: 366 RPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
R + + A LQG +N++L+ W + H++IN + NYW + NL+EC PLF ++ L
Sbjct: 352 RENSPLPAALQGFFNDNLACNMGWTNDYHLDINTQQNYWIANVGNLAECHLPLFKYIEDL 411
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S++G+KTAQ Y GW H +IW +A G ++W L+P +W+ +HLW Y YT
Sbjct: 412 SVHGAKTAQKIYGCKGWTAHTTANIWG-YTAPSGSILWGLFPTASSWIASHLWTQYEYTR 470
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+D+L K AYPLL+G A FLLD+++E + GY+ T PS SPE+ F+ L C S T
Sbjct: 471 DKDYLTKTAYPLLKGNAEFLLDYMVEDPNTGYMVTGPSISPENSFLYQGNNL-CASMMPT 529
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D + E+F+A I +A++L +++ + + +++ + P ++ +G + EW++
Sbjct: 530 CDRVLAYEIFNACIQSAQILNIDKE-FSDSLQQAIKKFPPIRLRANGGVREWLE 582
>gi|313204128|ref|YP_004042785.1| alpha-L-fucosidase [Paludibacter propionicigenes WB4]
gi|312443444|gb|ADQ79800.1| Alpha-L-fucosidase [Paludibacter propionicigenes WB4]
Length = 826
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 203/592 (34%), Positives = 317/592 (53%), Gaps = 50/592 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA ++ +A+PI NGR+ AMV G E L+LNE + W+G P NPD K L
Sbjct: 29 LKLWYDKPAANWNEALPIANGRIAAMVHGNPSKELLQLNESSFWSGGPSRNDNPDGLKGL 88
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+R+ + G Y A S + +Q +G++ + F ++ K+ + Y R+LD
Sbjct: 89 DSIRTYIFQGNYTRANTLSNQFLTAKQLHGSKFQSIGNLNISFPNAE-KFTD--YYRDLD 145
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
+ A + V Y V +V + RE +S PDQVIV +++ S+ G L+F + DS L S
Sbjct: 146 IENALSSVSYKVDDVIYKREILASIPDQVIVVRLTASKPGKLTFTTNFDSQLKKTSVALD 205
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N+ + M G + ++ G ++F A K+ ++ GT+S + D LKV+ ++
Sbjct: 206 NHTLEMTGL---------SGTHEGVIGQVKFDA--RAKVINNGGTVSFVSDS-LKVKNAN 253
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
++++ +++F ++ + + T + + L ++ + H+ YQK F RV
Sbjct: 254 EVIIMVSIATNF----VDYQNLTANETQKCIQYLSVAEKKPFNTILKNHISTYQKYFKRV 309
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L S T + +R+K+F DP LV L +QFGRYLLI SS+P
Sbjct: 310 NFDLGTSEAAKAT------------TKDRIKNFSKSYDPELVSLYYQFGRYLLICSSQPN 357
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q +NLQGIWN +P WDS +NIN EMNYW + NL+E EPL + LS +G +
Sbjct: 358 GQPSNLQGIWNGSNNPMWDSKYTININTEMNYWPAEKTNLTEMHEPLIKMIKELSQSGKE 417
Query: 429 TAQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA+V Y ++GWV HH TDIW + AD G+ WPMGGAWL HLWE Y Y +
Sbjct: 418 TAKVMYGSNGWVAHHNTDIWRITGVVDFADAGQ-----WPMGGAWLSQHLWEKYLYNGNL 472
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
+LE YP+L+ F D+LIE +L +PS SPE+ P G + + T+D
Sbjct: 473 KYLE-SVYPVLKSACEFYKDFLIEEPTHKWLVVSPSVSPEN---TPQGHKSALVAGCTID 528
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++ ++F+ I AA++L+K+ +V+ K L RL P +I G + EW++
Sbjct: 529 NQLLFDLFTKTIKAAKLLKKDASLMVD-FQKILDRLPPMQIGRLGQLQEWLE 579
>gi|29346420|ref|NP_809923.1| hypothetical protein BT_1010 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29338316|gb|AAO76117.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 824
Score = 332 bits (851), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 216/598 (36%), Positives = 337/598 (56%), Gaps = 45/598 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E + K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 25 EKKVSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y R+L L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 141 YSD--YYRDLSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V +++ EG C + ++ ++ KG ++F L + ++G A
Sbjct: 199 S---PHQDVMIHSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTAR---NQGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A + + +++F+ N D + T + S L +++ H
Sbjct: 248 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVRPFAEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++ Y++ RVS+ L E+ V + +RV++F+ D LV FQFG
Sbjct: 304 VEFYRRYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLS+ EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S +G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 412 LIKEVSESGKETAKIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ DGK A
Sbjct: 471 YLYTGDTEFL-RSVYPILKESGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGSDGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ TMD +I ++++AIISA+ +L+ +++ + + L + P ++ G + EW+
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASRILDTDKE-FAAHLEQRLKEMAPMQVGHWGQLQEWM 585
>gi|326790118|ref|YP_004307939.1| alpha-L-fucosidase [Clostridium lentocellum DSM 5427]
gi|326540882|gb|ADZ82741.1| Alpha-L-fucosidase [Clostridium lentocellum DSM 5427]
Length = 756
Score = 332 bits (851), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 199/587 (33%), Positives = 313/587 (53%), Gaps = 54/587 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ A+++ +A+PIGNG LG M++GG+ E +++NE++LW G D N DA K L +R
Sbjct: 8 YKQAARNWNEALPIGNGALGGMIFGGIKKELIQMNEESLWYGTFRDRNNKDARKYLPVIR 67
Query: 77 SLVDSGQYAEATAA-SVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEET----YRRELD 129
L+ G+ EA S+ +FG P Y +LGD+ ++ + +E YRR LD
Sbjct: 68 DLLWQGKIGEAEKLLSMSMFGTPDGQRQYSVLGDLVIQC------FGQEEPVSHYRRTLD 121
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V Y +F RE+F S PD ++ ++ + + +D N
Sbjct: 122 LETACATVGYVSPKGKFEREYFCSKPDNLLAVRLRCDQEEQIELMAYIDRWKYNDEIEMS 181
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ + + G ++ +GI + ++ K+ + GT + ++L +G +
Sbjct: 182 KDGMSLYG----------SSGPCSSEGIGYHFMM--KLIPNGGTAQNI-GQRLYAKGCNE 228
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
++L+ A++ + DS +P S L+ Y +L RH+ DY+ L+ R+S
Sbjct: 229 VIILVTATTDY-------KDS--NPRSICEERLKKATQKGYEELKARHVADYKSLYKRLS 279
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
+ L E+++ +P+ ER++ + ED L+ + FQ+GRYLLIS SR G
Sbjct: 280 LDLKG------------ESLNHLPTDERLERIKKGGEDLDLIAMYFQYGRYLLISCSREG 327
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
A LQGIWN + P WDS +NIN EMNYW + C+LSEC PL + L + I+G K
Sbjct: 328 GLPATLQGIWNGEWLPPWDSKYTININTEMNYWLAEKCHLSECHLPLVEHLEKVRIHGEK 387
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G++ HH TDIW ++ + +WPMG AWL H+WEHY YT+D+ FL
Sbjct: 388 TAEQMYGCRGFMAHHNTDIWGDAAPQDMWMPATIWPMGAAWLVLHIWEHYEYTLDQAFL- 446
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
K Y LL+G F D+L+ +GYL T PSTSPE+ + G+ V +MD I+
Sbjct: 447 KEKYHLLKGAGDFFKDYLMMDENGYLVTGPSTSPENTYRLSSGEQGTVCIGPSMDSQILF 506
Query: 549 EVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEW 593
E+F+AII A +++ + E+ + +++ K LP P +I + G IMEW
Sbjct: 507 ELFTAIIEAGQLVGEAEEEIQCFKEMRKKLP---PIQIGKYGQIMEW 550
>gi|325298118|ref|YP_004258035.1| alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
gi|324317671|gb|ADY35562.1| Alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
Length = 820
Score = 332 bits (851), Expect = 4e-88, Method: Compositional matrix adjust.
Identities = 216/595 (36%), Positives = 312/595 (52%), Gaps = 41/595 (6%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+S LK+ + PA + +A+P+GN +G MV+GG E L+LNE+T+W G P NP
Sbjct: 18 SSWAESLKLWYRQPAHVWVEALPLGNSNMGVMVYGGTGVEQLQLNEETMWGGGPHRNDNP 77
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETY 124
A +AL +VR L+ + EA K F G YQ +G + +E H ++A + Y
Sbjct: 78 KALQALPEVRKLIFDNRNMEAQQLIDKTFYSGRNGMPYQTIGSLMIE-QPGH-EHATDYY 135
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R +LDL A A V+Y V V + RE F+S D+VI ++ G L+F + S L H
Sbjct: 136 R-DLDLERAVATVRYQVDGVTYRREVFASLVDKVIRVHLTADRPGMLTFTLGYQSPLTRH 194
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKL 242
C GK + N +D +G++ +E ++ G + A DK L
Sbjct: 195 QVT-----------CKGKTLVLTGNG-EDHEGVKGVIRMETGTQVMAKGGKVKAQGDK-L 241
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VEG+D V L VAS++ F + +D +P L+ SY+ H Y+
Sbjct: 242 CVEGAD-EVTLYVASAT---NFRSYNDVSGNPHRSVQELLKKAVKTSYTQALADHEAYYR 297
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
K F RV + L E D + ER++ F +D SL L+FQ+GRYLLI
Sbjct: 298 KQFDRVRLDLG------------EGQGDQWETTERIRRFNEGKDVSLAALMFQYGRYLLI 345
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSS+PG Q ANLQGIWN+ L WD +NIN EMNYW + NL E +PLF+ + L
Sbjct: 346 SSSQPGGQAANLQGIWNDKLLAPWDGKYTININTEMNYWPAEVTNLPETHQPLFELVKEL 405
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA+V Y A+GWV HH TDIW + + K + WP GGAWL THLW+HY YT
Sbjct: 406 SQTGQETARVMYGANGWVAHHNTDIW-RCTGPVDKAFYGTWPNGGAWLTTHLWQHYLYTG 464
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPD-GKLACVSYSS 540
D++FLE+ YP L+G A F L +LI G++ PS SPEH + GK + +
Sbjct: 465 DKEFLEE-VYPALKGAADFYLSYLIPHPKYGWMVEAPSMSPEHGPQGENTGKASTIVAGC 523
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD I+ +V + + A +L+ + A + + + +L P +I + + EW++
Sbjct: 524 TMDNQIVFDVLNNALHATRILDGSV-AYQDSLRWMIEQLPPMQIGQYNQLQEWLE 577
>gi|393782187|ref|ZP_10370376.1| hypothetical protein HMPREF1071_01244 [Bacteroides salyersiae
CL02T12C01]
gi|392674221|gb|EIY67670.1| hypothetical protein HMPREF1071_01244 [Bacteroides salyersiae
CL02T12C01]
Length = 1400
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 209/607 (34%), Positives = 327/607 (53%), Gaps = 55/607 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA ++ +A+P+GNGRLGAMV+G +T+++NEDT W+G P + NP+A L
Sbjct: 27 LKLWYDRPADYWVEALPLGNGRLGAMVYGIASQDTIQINEDTYWSGSPYNNANPNALTHL 86
Query: 73 SDVRSLVDSGQYAEATA-------ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
D+R+ +++G+YAEA A + GH +Y+ +G++ L+F ++H Y
Sbjct: 87 EDIRNYINNGEYAEAQKLALANIIADRNITGHGM-IYESIGNLLLDFPENH--KTPSNYY 143
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH- 184
RELDL+ A A++ Y+V V +TRE F+S DQ+I+ KIS + G ++F S L +
Sbjct: 144 RELDLSNAVAKITYTVDGVNYTREVFTSLADQLIIIKISADQPGKVTFKTSFVGPLKTNR 203
Query: 185 -----SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
V G + ++ GK+ P + ++ IK+ D G+ +A +
Sbjct: 204 TKVTVKLVEGADNMLSVYTEGGKKTEENI-----PNLLHAHSL--IKVVADGGSQTA-AN 255
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
L V ++ A + + +++ F++ D D + + L + Y H+
Sbjct: 256 SSLNVTNANSACIYISTATN----FVSYKDISADSEARAKEYLDKF-DKDYEQAKADHIA 310
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
YQ+ F RV++ L + SE+ + P+ R++ F T DPSL L FQFGRY
Sbjct: 311 KYQEQFGRVTLNLGNN---------SEQ--EKKPTDVRIEEFSTVNDPSLAALYFQFGRY 359
Query: 360 LLISSSRPGTQVANLQGIWNEDLS--PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSS+PGTQ ANLQGIWN + P WDS NIN+EMNYW + NLSEC P
Sbjct: 360 LLISSSQPGTQPANLQGIWNPNAGQYPAWDSKYTANINVEMNYWPAEVTNLSECHNPFLQ 419
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S+ G ++A Y GW +HH TDIW +S+ K +WP AW C HLWEH
Sbjct: 420 MVKDVSVTGEESAGKMYGCRGWTLHHNTDIW-RSTGAVDKSACGVWPTCNAWFCFHLWEH 478
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE---FIAPD--- 530
Y +T D++FL + YP+L+ + F D+LI + + GY +PS SPE+ F D
Sbjct: 479 YLFTGDKEFLAE-IYPVLKSASEFYQDFLITDPNTGYKVVSPSNSPENHPGLFSYTDDSG 537
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDG 588
+ A + TMD ++ ++ I AAE+L ++ + + LK L +L P + + G
Sbjct: 538 SKQNAAIFSGVTMDNQMVYDLLRNTIEAAEILNTDKGFVAD--LKELKEQLPPMHVGKYG 595
Query: 589 SIMEWVQ 595
+ EW++
Sbjct: 596 QLQEWLE 602
>gi|423221590|ref|ZP_17208060.1| hypothetical protein HMPREF1062_00246 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645917|gb|EIY39637.1| hypothetical protein HMPREF1062_00246 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 826
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 201/593 (33%), Positives = 318/593 (53%), Gaps = 48/593 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++GPA + +A+P+GNGR+GAMV+G E +LNE+T+W G P + TNP A AL
Sbjct: 27 LKLWYDGPATQWVEALPLGNGRIGAMVFGDPVHEQFQLNEETVWGGSPYNNTNPKAKDAL 86
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA T S G P YQ +G + L+FD Y + Y R
Sbjct: 87 PRIRQLIFEGKNKEAQELCGPTICSPSANGMP---YQTVGSLHLDFDGIS-NYND--YYR 140
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
+LD+ A A +++ V +TRE ++S PDQV+V +++ S+ S+SF + ++
Sbjct: 141 DLDIAKAIATTRFTTNGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAKYTTPYKSNVV 200
Query: 187 --VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
++ ++ + G KAN ++ KG ++F+A+ +I + G++ A D L+
Sbjct: 201 RSISSRKELQLSG---------KANDHEGIKGKVEFTAL--TRIENSGGSLEATSDSTLQ 249
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+ ++ +V L V S F+N D + S + L+ + N +Y+ H++ YQK
Sbjct: 250 VKNAN-SVTLYV---SIGTNFVNYKDVSGNALSTAQKYLKQV-NKNYAKSKAAHINAYQK 304
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L R+ + P+ RVK F T DP + L FQFGRYLLI
Sbjct: 305 YFNRVSLDLGRNAQA------------DKPTDVRVKEFSTSFDPQMAALYFQFGRYLLIC 352
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN L WD +IN+EMNYW + +L E EP + +
Sbjct: 353 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHEPFLQLVKEAA 412
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
I G ++A + Y GW +HH TDIW + A G + +WP AW C HLW+ Y ++ D
Sbjct: 413 IQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-SYGVWPTCNAWFCQHLWDRYLFSGD 470
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+++L + YPL+ G F LD+L+ E + +L PS SPE+ + + V +TM
Sbjct: 471 KNYLAE-VYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENSPVVNGKRTFVVVAGTTM 529
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D ++ ++F I+AA ++ +N A + + + L P ++ G + EW+
Sbjct: 530 DNQMVYDLFYNTIAAAGLMNENT-AFTDSLQTVVNNLAPMQVGRWGQLQEWMH 581
>gi|212540772|ref|XP_002150541.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210067840|gb|EEA21932.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 755
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 204/596 (34%), Positives = 316/596 (53%), Gaps = 50/596 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA+ + +A+P+GNGRLG MV+G +E L LNED++W G P T + L+
Sbjct: 4 KLWYQQPAQCWNEALPVGNGRLGVMVYGRTSTELLALNEDSVWYGGPQSRTPQPSIGELA 63
Query: 74 DVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFD-DSHLKYAEETYRRELD 129
+R L+ ++ +A + K F PA Y+ LG + ++F+ D+ K + Y+R LD
Sbjct: 64 LLRDLIRKEKHTDAEKLARKSFFASPASQRHYEPLGTVFIDFNHDNEQKLLD--YQRSLD 121
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---- 185
+ + V+Y + R+ +S PD V+ I S + ++ + LD +
Sbjct: 122 IEKSLCHVEYEYDGICIARDLIASYPDSVLAMHIQSSAPIEFTVRLTRVNELDYETNEFL 181
Query: 186 --YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
N ++M GKR + +L + DD G ++A + L
Sbjct: 182 DDVAAKGNSLVMSVTPGGKR------------SNRACCVLSARCIDDEGIVTARPNNSLH 229
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ G + +LL++A+ + +D K ++ +ALQ S+ +L TRH+ DY
Sbjct: 230 IRGQN--ILLVIAAQTE----YRCNDIDKVTVTDCNNALQK----SWDELLTRHIQDYSA 279
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L+ R+S+++ D+ + + +P+ R++ D L+ L + RYLLIS
Sbjct: 280 LYTRMSLRIG--------DSANLHELQKIPTDVRLRE---SRDLGLISLYHNYSRYLLIS 328
Query: 364 SSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SSR G + A LQGIWN +P W S +NINL+MNYW CNLSEC +PLF L
Sbjct: 329 SSRNGYKALPATLQGIWNPSFTPAWGSKYTININLQMNYWPVNVCNLSECSQPLFALLRR 388
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
++ NG KTA+ Y GW HH TDIWA + + LWP+GGAWLC H+WEH++YT
Sbjct: 389 MAENGVKTAKSMYNCGGWAAHHNTDIWADTDPQDRWMPATLWPLGGAWLCFHIWEHFDYT 448
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACV-SYS 539
D++FL + +P+L+GC FLLD+LIE DG YL TNPS SPE+ F + + V
Sbjct: 449 QDKEFLSE-MFPVLQGCVEFLLDFLIESVDGKYLVTNPSLSPENTFYTHNRENQGVFCEG 507
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
ST+D+ II VF+A +S+ +VL ++ L +V + RL P +I G + EW+
Sbjct: 508 STIDIQIIEAVFTAFLSSVDVLNLTDNELGGRVQDAKKRLPPMQIGSFGQLQEWMH 563
>gi|224538245|ref|ZP_03678784.1| hypothetical protein BACCELL_03136 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520142|gb|EEF89247.1| hypothetical protein BACCELL_03136 [Bacteroides cellulosilyticus
DSM 14838]
Length = 827
Score = 331 bits (849), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 201/593 (33%), Positives = 318/593 (53%), Gaps = 48/593 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++GPA + +A+P+GNGR+GAMV+G E +LNE+T+W G P + TNP A AL
Sbjct: 28 LKLWYDGPATQWVEALPLGNGRIGAMVFGDPVHEQFQLNEETVWGGSPYNNTNPKAKDAL 87
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA T S G P YQ +G + L+FD Y + Y R
Sbjct: 88 PRIRQLIFEGKNKEAQELCGPTICSPSANGMP---YQTVGSLHLDFDGIS-NYND--YYR 141
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
+LD+ A A +++ V +TRE ++S PDQV+V +++ S+ S+SF + ++
Sbjct: 142 DLDIAKAIATTRFTTNGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAKYTTPYKSNVV 201
Query: 187 --VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
++ ++ + G KAN ++ KG ++F+A+ +I + G++ A D L+
Sbjct: 202 RSISSRKELQLSG---------KANDHEGIKGKVEFTAL--TRIENSGGSLEATSDSTLQ 250
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+ ++ +V L V S F+N D + S + L+ + N +Y+ H++ YQK
Sbjct: 251 VKNAN-SVTLYV---SIGTNFVNYKDVSGNALSTAQKYLKQV-NKNYAKSKAAHINAYQK 305
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L R+ + P+ RVK F T DP + L FQFGRYLLI
Sbjct: 306 YFNRVSLDLGRNAQA------------DKPTDVRVKEFSTSFDPQMAALYFQFGRYLLIC 353
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN L WD +IN+EMNYW + +L E EP + +
Sbjct: 354 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHEPFLQLVKEAA 413
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
I G ++A + Y GW +HH TDIW + A G + +WP AW C HLW+ Y ++ D
Sbjct: 414 IQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-SYGVWPTCNAWFCQHLWDRYLFSGD 471
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+++L + YPL+ G F LD+L+ E + +L PS SPE+ + + V +TM
Sbjct: 472 KNYLAE-VYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENSPVVNGKRTFVVVAGTTM 530
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D ++ ++F I+AA ++ +N A + + + L P ++ G + EW+
Sbjct: 531 DNQMVYDLFYNTIAAAGLMNENT-AFTDSLQTVVNNLAPMQVGRWGQLQEWMH 582
>gi|237718536|ref|ZP_04549017.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229452243|gb|EEO58034.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 1100
Score = 331 bits (849), Expect = 7e-88, Method: Compositional matrix adjust.
Identities = 210/592 (35%), Positives = 302/592 (51%), Gaps = 49/592 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA+H+ +A+PIGN RLGAMV+GG E L++NE+T W G P +P A L
Sbjct: 288 LKLWYNRPAQHWEEALPIGNSRLGAMVYGGAGCEELQINEETFWAGGPHHNNSPKAKTVL 347
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ R L+ + EA + F P + L L H K Y RELD+
Sbjct: 348 DEARRLIFEDKTMEAQKLINPNFFSGPHGMSYLNMGSLLILQPGHEK--ATNYYRELDIE 405
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS--------LLDN 183
ATA Y V V +TR FSS DQVI+ ++ + G+L F++ D+ LL
Sbjct: 406 DATATTCYEVDGVTYTRTAFSSMTDQVIIVRLEANRKGALQFSLGYDTPKEADGSALLHP 465
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
V GN + +C G A+A ++++ D ++ + +L
Sbjct: 466 VVKVRGNKLTM---QCIGMEQEGVASA--------IKGEWQVQVVHDGKQVN--QPDRLG 512
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+G+ A + L A+++F +N D + + + + L++ Y H YQ
Sbjct: 513 VQGATTATVYLSAATNF----VNYKDVSGNASRRAAAYLKTALKQPYPKALEAHSKAYQT 568
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RV + L P I + P+ +RV F +D +L+ LL+Q+GRYLLI
Sbjct: 569 QFNRVKLDL---PATIAS---------LAPTNQRVADFNRVDDRNLMALLYQYGRYLLIC 616
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIW L WDS +NIN EMNYW + NLSEC EPLF L LS
Sbjct: 617 SSQPGGQPANLQGIWCRSLHAPWDSKYTININTEMNYWPAEVTNLSECHEPLFSMLEDLS 676
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ G +TA+ Y A GWV HH TD+W + G W +WP GGAWLC HLW+HY YT D
Sbjct: 677 VTGHETARTLYGAKGWVAHHNTDLWRIAGPVDG-ATWGMWPNGGAWLCQHLWQHYLYTGD 735
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+ FL K YP+++G A F++ L++ G+L T PS SPEH + A C TM
Sbjct: 736 QAFLRKY-YPVMKGAADFMMSHLVKHPKYGWLVTVPSVSPEHGYTASTLTAGC-----TM 789
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
D I ++ + AA +L + A + + + +L P +I + I EW+
Sbjct: 790 DNQIAFDILNNTRLAATILGE-PTAYQDSLQATCTQLPPMQIGKYNQIQEWM 840
>gi|160882310|ref|ZP_02063313.1| hypothetical protein BACOVA_00258 [Bacteroides ovatus ATCC 8483]
gi|156112318|gb|EDO14063.1| hypothetical protein BACOVA_00258 [Bacteroides ovatus ATCC 8483]
Length = 1100
Score = 330 bits (846), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 210/592 (35%), Positives = 302/592 (51%), Gaps = 49/592 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA+H+ +A+PIGN RLGAMV+GG E L++NE+T W G P +P A L
Sbjct: 288 LKLWYNRPAQHWEEALPIGNSRLGAMVYGGAGREELQINEETFWAGGPHHNNSPKAKTVL 347
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ R L+ + EA + F P + L L H K Y RELD+
Sbjct: 348 DEARRLIFEDKTMEAQKLINPNFFSGPHGMSYLNMGSLLILQPGHEK--ATNYYRELDIE 405
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS--------LLDN 183
ATA Y V V +TR FSS DQVI+ ++ + G+L F++ D+ LL
Sbjct: 406 DATATTCYEVDGVTYTRTAFSSMTDQVIIVRLEANRKGALLFSLGYDTPKEADGSALLHP 465
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
V GN + +C G A+A ++++ D ++ + +L
Sbjct: 466 VVKVRGNKLTM---QCIGMEQEGVASA--------IKGEWQVQVVHDGKQVN--QPDRLG 512
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+G+ A + L A+++F +N D + + + + L++ Y H YQ
Sbjct: 513 VQGATTATVYLSAATNF----VNYKDVSGNASRRAAAYLKTALKQPYPKALEAHSKAYQT 568
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RV + L P I + P+ +RV F +D +L+ LL+Q+GRYLLI
Sbjct: 569 QFNRVKLDL---PATIAS---------LAPTNQRVADFNRVDDRNLMALLYQYGRYLLIC 616
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIW L WDS +NIN EMNYW + NLSEC EPLF L LS
Sbjct: 617 SSQPGGQPANLQGIWCRSLHAPWDSKYTININTEMNYWPAEVTNLSECHEPLFSMLEDLS 676
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ G +TA+ Y A GWV HH TD+W + G W +WP GGAWLC HLW+HY YT D
Sbjct: 677 VTGHETARTLYGAKGWVAHHNTDLWRIAGPVDG-ATWGMWPNGGAWLCQHLWQHYLYTGD 735
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+ FL K YP+++G A F++ L++ G+L T PS SPEH + A C TM
Sbjct: 736 QAFLRKY-YPVMKGAADFMMSHLVKHPKYGWLVTVPSVSPEHGYTASTLTAGC-----TM 789
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
D I ++ + AA +L + A + + + +L P +I + I EW+
Sbjct: 790 DNQIAFDILNNTRLAATILGE-PTAYQDSLQATCTQLPPMQIGKYNQIQEWM 840
>gi|255936621|ref|XP_002559337.1| Pc13g09120 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583957|emb|CAP91981.1| Pc13g09120 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 740
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 210/593 (35%), Positives = 313/593 (52%), Gaps = 55/593 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA+ + A+P+GNGRLGAMV+G +E L+LNED++W G P D DA + L
Sbjct: 3 ELWYQQPAEDWNSALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQDRNPQDALEYLP 62
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R + +G +AEA A + F +P+ Y+ LG++ L D H YRR LDL
Sbjct: 63 RLREAIRAGNHAEAEKIAKLAFFANPSSQRNYEPLGNLFL--DLGHDPSQVTGYRRSLDL 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVN 188
+ATA V Y V + R+ +S PD VI K+ S ++ S L+ H +++
Sbjct: 121 TSATAHVSYEYQGVRYERQVLASYPDDVIAIKMYSSSRAEFVVRLTRMSELEFETHEWLD 180
Query: 189 G----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N I M GK N+N + ++ I+ TI+ + + L V
Sbjct: 181 DVSATGNSITMHVTPGGK------NSN------RACCMVSIRCDGAESTITRVGNN-LVV 227
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
SD A+L++ A ++F +D +M ++ D+ RH+ DYQ L
Sbjct: 228 NSSD-ALLVVAAQTTF---------RHEDNDQRTMQDAENALGFPLEDIRARHVADYQSL 277
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
++R+ +QL +I TD +R+KS + DP L+ L + RYLLIS
Sbjct: 278 YNRMELQLGPDSPEIPTD-------------QRLKSLR---DPGLIALYHNYNRYLLISC 321
Query: 365 SRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SR + ANLQGIWN P W S +N+NL+MNYW + NLSEC+ PLFD L +
Sbjct: 322 SRDRHKSLPANLQGIWNPSFHPAWGSRFTINVNLQMNYWSANMGNLSECELPLFDLLERM 381
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G TA++ Y GW H TDIWA ++ + ++WP+GGAWLC H+W+H+ YT
Sbjct: 382 VEPGKVTARIMYGCRGWTAHPNTDIWADTAPFDRWMPASIWPLGGAWLCYHIWDHFRYTG 441
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSST 541
D++FL +R +P L GC FLLD+LIE +G YL T+PSTSPE+ F G+ + ST
Sbjct: 442 DQNFL-RRMFPTLRGCVEFLLDFLIEDANGEYLVTSPSTSPENSFYDGKGQKGVLCEGST 500
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+D+ II + A S A+ L EDA++ V + R+ P +++ G + EW
Sbjct: 501 IDIQIIDAILDAFQSCAKSLGL-EDAILPAVQATRSRIPPMRVSPAGYLQEWA 552
>gi|254446849|ref|ZP_05060324.1| hypothetical protein VDG1235_197 [Verrucomicrobiae bacterium
DG1235]
gi|198256274|gb|EDY80583.1| hypothetical protein VDG1235_197 [Verrucomicrobiae bacterium
DG1235]
Length = 800
Score = 330 bits (845), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 209/613 (34%), Positives = 312/613 (50%), Gaps = 54/613 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T+ T + F+ T++IP+GNGRLGA +G V ET+ LNE +W+G P +
Sbjct: 21 ATAQTPERSVWFDSAGASLTESIPLGNGRLGASFFGMVEEETVILNESGMWSGSPQEADR 80
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIEL-- 110
DA KAL +++ L+ G+ AEA A F P YQ+L + +
Sbjct: 81 MDAHKALPEIKRLLLEGRNAEAEALVNANFTCAGRGSGYGGGANDPYGSYQILAKLHIVD 140
Query: 111 --EFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES 168
E D+ +K YRRELDL TAT R + G V + RE F+S PD+ +V + + SE+
Sbjct: 141 RSESSDTVVK----NYRRELDLATATYRHSFERGGVGYIRESFASRPDEALVVRFTASEA 196
Query: 169 GSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
G L + SL G + ++M G+ + G++++ +L+ +
Sbjct: 197 GGLDLDFSLSREERMQVEPLGADALLMTGQL--------NDGYGGEDGVRYAGVLK---A 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-----VASSSFDGPFINPSDSKKDPTSESMSALQ 283
RG E+ +L+V G+D ++ +A SF G + +DP + + L
Sbjct: 246 SARGGEVRSEEGRLEVRGADEVIVYFTTANDIAKRSFAGRMV------EDPIATAKLDLA 299
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ + S+ +L RH+ +++ + RVS+QL ++ + V ++
Sbjct: 300 GVESYSFEELKRRHVAAFREYYGRVSLQLG-------SEELAASRAKVATPQRLVDHWEG 352
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
+DP L L F FGRYLLISSSRPG Q ANLQGIW++ + W+ H NIN++MNYW +
Sbjct: 353 VDDPDLAALYFDFGRYLLISSSRPGGQPANLQGIWSDTIQTPWNGDWHANINVQMNYWPA 412
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
CNLSE EP+F + L G KTA+ Y A GWV + W +S W
Sbjct: 413 ELCNLSELHEPMFKLIESLVEPGRKTAKAYYDAEGWVSFLLANPWGFTSPGE-SASWGST 471
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSP 522
AWLC HLW+HY +T D FL + AYP+L+ A F L+E G+L T PS SP
Sbjct: 472 VSCSAWLCQHLWDHYLFTKDEAFL-RWAYPILKDSAVFYSQMLMEDTRTGWLVTCPSNSP 530
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
E F +G+ VS T+D ++R +F A I AAE+L ++ + E KS RL PT
Sbjct: 531 ESAFKLANGETVHVSMGPTIDQQLLRYLFGACIEAAEILGQDPEFAAELAEKS-ARLAPT 589
Query: 583 KIAEDGSIMEWVQ 595
+I DG +MEW++
Sbjct: 590 QIGSDGRVMEWLE 602
>gi|255035637|ref|YP_003086258.1| glycoside hydrolase [Dyadobacter fermentans DSM 18053]
gi|254948393|gb|ACT93093.1| glycoside hydrolase family protein [Dyadobacter fermentans DSM
18053]
Length = 781
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 207/616 (33%), Positives = 319/616 (51%), Gaps = 59/616 (9%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+N + + PL++ + PA + + IP+GNGRLG M GGV ET+ LN+ TLW+G P
Sbjct: 13 FLNLAALAQQAPLRLWYTKPASQWEETIPLGNGRLGMMGDGGVTKETVVLNDITLWSGAP 72
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------GH------PADVYQLLGD 107
D DA ++L ++R L+ +G+ EA A K F GH P YQ+LG+
Sbjct: 73 QDANRYDAHESLPEIRRLILAGKNDEAQALVNKNFVAKGAGSGHGDGANVPFGCYQVLGN 132
Query: 108 IELEFDDSHLKYAE---ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS 164
+ LEF + A Y+REL L+ A + V Y V V +TRE+F+S D + + KI+
Sbjct: 133 LHLEFGYKGVDTARVQVRDYKRELSLDEAVSSVIYQVNGVTYTREYFTSFGDDLGIIKIT 192
Query: 165 GSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
+ G L+ ++LD + V NN + M G+ N D KG+++ ++
Sbjct: 193 ADKPGQLNLRIALDRP-ERFQTVIKNNTLEMSGQL---------NNGTDGKGMRYLTKIK 242
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
+ + ++S K++ + +D ++ A + F K+ +E+ + +
Sbjct: 243 PLVKGGKTSVSG---KQIVISDADEIIVYFSAGTDF---------KNKNFETETQRLIDA 290
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT- 343
SYS H +YQKLF+R I L S D VP+ +R+ +FQ
Sbjct: 291 AVKKSYSVQKNLHTTNYQKLFNRTKIHLGGSKGD------------GVPTDQRLSAFQKN 338
Query: 344 -DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
++D L L FQFGRYL ISS+R G NLQG+W + W+ H+++N++MN+W
Sbjct: 339 PEKDNELAVLYFQFGRYLSISSTRVGLLPPNLQGLWANQIRTPWNGDYHLDVNVQMNHWP 398
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
NLSE PL D + + G KTA+ Y A+GWV H T++W + + W
Sbjct: 399 VEVANLSELNLPLADLVKGMVKQGEKTAKAYYNANGWVAHVITNVWGYTEPGE-EASWGA 457
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTS 521
G W+C +LWEHY +T D+++L K YP+L+G A F + LI+ G+L T PS S
Sbjct: 458 SNAGSGWICNNLWEHYAFTHDKNYL-KDIYPVLKGSAEFYISALIKDPKTGWLVTAPSVS 516
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRL 579
PE+ F P+GK A + T+D I RE+F+ +I+A EVL + D ++ LK LP
Sbjct: 517 PENSFYLPNGKTAAICMGPTIDNQITRELFTNVITACEVLGVDADFAKSLQNKLKELP-- 574
Query: 580 RPTKIAEDGSIMEWVQ 595
P + DG +MEW++
Sbjct: 575 PPGVVGSDGRLMEWLE 590
>gi|423298609|ref|ZP_17276665.1| hypothetical protein HMPREF1070_05330 [Bacteroides ovatus
CL03T12C18]
gi|392662352|gb|EIY55913.1| hypothetical protein HMPREF1070_05330 [Bacteroides ovatus
CL03T12C18]
Length = 822
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 214/593 (36%), Positives = 330/593 (55%), Gaps = 53/593 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P + NP+A + +
Sbjct: 32 KLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNANPNALEYIP 91
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +Y+ Y RE
Sbjct: 92 KVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTRYS--NYYRE 145
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L S
Sbjct: 146 LSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLTS-------- 197
Query: 188 NGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+Q +M EG C + ++ ++ KG ++F L K ++G A D L
Sbjct: 198 --PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGGEIACADGIL 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VE +D A++ + +++F+ N D + + + L + + H+D Y+
Sbjct: 251 SVEKADEAIIYVSIATNFN----NYQDITGNQIERAKNYLAKAMVHPFIESKRNHVDFYR 306
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ RVS+ L E+ V + +RV++F+ D LV FQFGRYLLI
Sbjct: 307 QYLTRVSLDLG------------EDQYANVTTDKRVENFKNTNDTHLVATYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF + +
Sbjct: 355 CSSQPGGQPANLQGIWNDKLVPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFRLIKEV 414
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE Y YT
Sbjct: 415 SDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRHLWERYLYTG 473
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D +FL + YP+L+ F + ++ E +L PS SPE+ +GK A + T
Sbjct: 474 DVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK-ATTAAGCT 531
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
MD ++ ++++AIISA+++L+ + + + + L + P ++ G + EW+
Sbjct: 532 MDNQLVFDLWTAIISASQILDTDRE-FASHLTQRLKEMAPMQVGHWGQLQEWM 583
>gi|336416256|ref|ZP_08596592.1| hypothetical protein HMPREF1017_03700 [Bacteroides ovatus
3_8_47FAA]
gi|335938987|gb|EGN00866.1| hypothetical protein HMPREF1017_03700 [Bacteroides ovatus
3_8_47FAA]
Length = 822
Score = 329 bits (844), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 214/593 (36%), Positives = 330/593 (55%), Gaps = 53/593 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P + NP+A + +
Sbjct: 32 KLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNANPNALEYIP 91
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +Y+ Y RE
Sbjct: 92 KVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTRYS--NYYRE 145
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L S
Sbjct: 146 LSLDSARAIVRYEVDGVQYQREMITSFTDQVVMVRLTANRPGQITFNAQLTS-------- 197
Query: 188 NGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+Q +M EG C + ++ ++ KG ++F L K ++G A D L
Sbjct: 198 --PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGGEIACADGIL 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VE +D A++ + +++F+ N D + + + L + + H+D Y+
Sbjct: 251 SVEKADEAIIYVSIATNFN----NYQDITGNQIERAKNYLAKAMVHPFIESKRNHIDFYR 306
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ RVS+ L E+ V + +RV++F+ D LV FQFGRYLLI
Sbjct: 307 QYLTRVSLDLG------------EDQYANVTTDKRVENFKNTNDAHLVATYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF + +
Sbjct: 355 CSSQPGGQPANLQGIWNDKLVPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFRLIKEV 414
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE Y YT
Sbjct: 415 SDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRHLWERYLYTG 473
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D +FL + YP+L+ F + ++ E +L PS SPE+ +GK A + T
Sbjct: 474 DVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK-ATTAAGCT 531
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
MD ++ ++++AIISA+++L+ + + + + L + P ++ G + EW+
Sbjct: 532 MDNQLVFDLWTAIISASQILDTDRE-FASHLTQRLKEMAPMQVGHWGQLQEWM 583
>gi|423241186|ref|ZP_17222300.1| hypothetical protein HMPREF1065_02923 [Bacteroides dorei
CL03T12C01]
gi|392642334|gb|EIY36101.1| hypothetical protein HMPREF1065_02923 [Bacteroides dorei
CL03T12C01]
Length = 825
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 203/590 (34%), Positives = 318/590 (53%), Gaps = 45/590 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
KI ++ PA ++ +AIPIGNGR+ AMV+G E L+LNE+T+ G P N + AL
Sbjct: 27 KIWYDTPAHYWEEAIPIGNGRIAAMVFGNPQLEQLQLNEETISAGSPYQNYNKEGKGALK 86
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ G Y EA + K P YQ +G++ + + + + Y RELDL
Sbjct: 87 EIRRLIFDGHYEEAQNMAEKKILSPVGREMPYQTVGNLNIRYKNHK---QIKKYYRELDL 143
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN-HSYVNG 189
A A +Y + +VE T E F+S DQ+I+ I S+ GS++ + + +D G
Sbjct: 144 TRAIATTRYQIKDVEITEETFASFTDQLIIKHIKSSKKGSINCELFFQTPMDAPKRSACG 203
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ +EG G N P + + A L +K SD G + AL D +KVE +
Sbjct: 204 KKKLRLEGITSGN--------NHIPGKVHYCADLSVKNSD--GKVFALNDTLIKVEKATE 253
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ-SIRNLSYSDLYTRHLDDYQKLFHRV 308
L + +++F +N D +P + L+ S+++ + + H+ Y+K+F+RV
Sbjct: 254 ICLYVSMATNF----VNYKDISANPYERNEKYLKNSMKDFEKAKI--EHVAAYKKMFNRV 307
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+++L SP+ P+ R+K F++ DP LV L FQFGRYLLISSS+PG
Sbjct: 308 TLELGHSPQI------------NKPTNIRLKEFESSYDPHLVSLYFQFGRYLLISSSQPG 355
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQG WN + P W S NIN EMNYW + NLSE EPL + S +G +
Sbjct: 356 CQPANLQGKWNAKVRPPWSSNYTTNINTEMNYWPAEVTNLSELHEPLIQIIQDWSQSGRE 415
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA Y GWV+HH +D+W + A DR +WP GAW+C HLW+ Y ++ ++++L
Sbjct: 416 TADQMYGCRGWVLHHNSDLWRVTGAVDRAYC--GVWPTAGAWMCQHLWDRYLFSGNKEYL 473
Query: 488 EKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K+ YP++ + F +D+L++ + GY PS SPE+ K + S +TMD +
Sbjct: 474 -KKIYPIMRSASKFFIDFLVQNPNTGYWVVGPSPSPENSPKKIKQKASLFS-GNTMDNQL 531
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWVQ 595
I ++FS AA++L ++D+ + LK++ +L P ++ E G + EW +
Sbjct: 532 IFDLFSNTCEAAKIL--SQDSTLCDTLKTMRNQLPPMQVGEYGQLQEWFE 579
>gi|153812246|ref|ZP_01964914.1| hypothetical protein RUMOBE_02645 [Ruminococcus obeum ATCC 29174]
gi|149831653|gb|EDM86740.1| hypothetical protein RUMOBE_02645 [Ruminococcus obeum ATCC 29174]
Length = 754
Score = 329 bits (843), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 198/593 (33%), Positives = 301/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + F+ PA+ + +A+P+GNG +GAM +G + E ++LN DTLW+G N +
Sbjct: 9 LTLAFDRPAEAWNEALPLGNGSMGAMSYGRLREEKIELNLDTLWSGTGRSKENKNTDVDW 68
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R + G+Y EA A + G + Y G++ ++ + LK +Y+R+L +
Sbjct: 69 DFLRQKIFDGEYEEAEAYCKENILGDWTESYLPAGNLHIDANIPELK-EHGSYQRQLSIK 127
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A +V Y + RE F S + V+ SL +SLDS + + G +
Sbjct: 128 DALEQVVYRQDGQGYLREFFVSMSEPVMALHYRADAGSSLELRISLDSQIRHVCSGYGTS 187
Query: 192 QIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
++++EG+ P P + ++ KG +F+ + I + +G I +D L V
Sbjct: 188 ELVLEGQAPVYAAPLYYSCEQPIVYEEGKGTRFA--IGISVQAPKGCIRQ-KDNTLLVTA 244
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + F ++ S L+ I +LSY L H Y F
Sbjct: 245 DGDVYIYLSGITDFQ--------AQDSYLSRKKQMLEQICDLSYPQLKEAHKKAYAAYFD 296
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R+ + L Q D L+ +F + RYL+ISSS+
Sbjct: 297 RMDLTLD-------------------------PGIQND----LITKMFHYARYLMISSSK 327
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN +L W S VNIN EMNYW + NLS+C E LFD + + +G
Sbjct: 328 PGTQCANLQGIWNHNLRAPWSSNYTVNINTEMNYWMAEKANLSDCHESLFDLIERTASHG 387
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS------ADRGKVVWALWPMGGAWLCTHLWEHYNY 480
KTA+ Y +GWV HH DIW SS D +++WPM WLC+HLWEHY Y
Sbjct: 388 KKTAKEVYHLNGWVSHHNVDIWGHSSPVGYFGQDENPCTYSMWPMSSGWLCSHLWEHYRY 447
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
T+DR+FL K+A+PL+ G F L +L+ +DGYL T PSTSPE+ F A D + V++ S
Sbjct: 448 TLDREFLRKKAFPLIRGAVEFYLGYLVP-YDGYLVTAPSTSPENTFTASDHSVHSVTFGS 506
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
TMD +I++E+F + A E+L+ + L+++V +L +L P KI ++G + EW
Sbjct: 507 TMDCSILKELFGNYLKACEILDITD--LMDEVKAALKKLLPFKIGKEGQLQEW 557
>gi|349572636|gb|AEP84398.1| glycoside hydrolase family protein [bacterium enrichment culture
clone g13]
Length = 824
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 215/600 (35%), Positives = 315/600 (52%), Gaps = 53/600 (8%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
ST K+ + PAK + +++P+GNGRLGAMV+G V S+ ++LNE+T W G P + NP
Sbjct: 21 STAVEQKLWYEQPAKQWEESLPLGNGRLGAMVYGDVLSDNIQLNENTFWAGGPHNNLNPA 80
Query: 68 APKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETY 124
A AL ++R L+ G Y A + K G YQ G++ LEF + H Y Y
Sbjct: 81 ALNALPEIRRLITVGDYLAAEKLAAKTIASQGSNGMPYQTAGNLRLEFSE-HKNYNH--Y 137
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+LD+ +A A +Y V +V +TRE FSS DQVIV K++ S+ G LSF+ +
Sbjct: 138 YRDLDIGSAVATTRYRVNDVVYTREVFSSFVDQVIVVKLTASKRGQLSFDAYMSHPSAMV 197
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKL 242
N ++M+G+ D +GI+ L + IS G+I+ D ++
Sbjct: 198 FSREDANTLLMQGQSM------------DHEGIKGQVRLASLVNISTIGGSINQ-RDNRI 244
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY----TRHL 298
V+ +D A++L+ +++F +N D + + + + +N +D Y H
Sbjct: 245 TVKNADSALILVSMATNF----VNYKDVSANALARARHYMAQAKNNFANDHYELRKQAHS 300
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+ Y+ F RV + L +S S+E+ D +R+ F DP L L FQFGR
Sbjct: 301 NFYKNYFDRVILNLGKS-------EFSKESTD-----QRIALFSGRHDPELASLYFQFGR 348
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSS+PG Q ANLQG+WN P WDS +NIN EMNYW + NLSE EPL
Sbjct: 349 YLLISSSQPGGQPANLQGLWNHRQDPPWDSKYTLNINAEMNYWPAEITNLSELHEPLITM 408
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
LSI G ++A+ Y A GW+ HH TDIW + W WP AWL HLWE Y
Sbjct: 409 TKELSITGQESAKTMYGARGWMAHHNTDIWRITGGV--DYTWGSWPTSSAWLSQHLWERY 466
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+ D+ +L + YP+++ F D+LI + +L +PS SPE+ A K+A
Sbjct: 467 LYSGDKQYLAE-IYPVMKSAVVFFDDFLISSPNKKWLIVSPSMSPENVPKATGTKIAA-- 523
Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD ++ ++FS I+AA++L +K L EK L LP P +I + + EW++
Sbjct: 524 -GVTMDNQLLFDLFSNTIAAAKILGEDKQHIPLWEKTLSRLP---PMQIGKYHQLQEWLE 579
>gi|299145505|ref|ZP_07038573.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298515996|gb|EFI39877.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 822
Score = 328 bits (841), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 214/593 (36%), Positives = 330/593 (55%), Gaps = 53/593 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P + NP+A + +
Sbjct: 32 KLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGHPNNNANPNALEYIP 91
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +Y+ Y RE
Sbjct: 92 KVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTRYS--NYYRE 145
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L S
Sbjct: 146 LSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLTS-------- 197
Query: 188 NGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+Q +M EG C + ++ ++ KG ++F L K ++G A D L
Sbjct: 198 --PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGGEIACADGIL 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VE +D A++ + +++F+ N D + + + L + + H+D Y+
Sbjct: 251 SVEKADEAIIYVSIATNFN----NYQDITGNQIERAKNYLAKAMVHPFIESKRNHVDFYR 306
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ RVS+ L E+ V + +RV++F+ D LV FQFGRYLLI
Sbjct: 307 QYLTRVSLDLG------------EDQYANVTTDKRVENFKNTNDAHLVATYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF + +
Sbjct: 355 CSSQPGGQPANLQGIWNDKLVPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFRLIKEV 414
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE Y YT
Sbjct: 415 SDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRHLWERYLYTG 473
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D +FL + YP+L+ F + ++ E +L PS SPE+ +GK A + T
Sbjct: 474 DVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK-ATTAAGCT 531
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
MD ++ ++++AIISA+++L+ + + + + L + P ++ G + EW+
Sbjct: 532 MDNQLVFDLWTAIISASQILDTDRE-FASHLTQRLKEMAPMQVGHWGQLQEWM 583
>gi|224536491|ref|ZP_03677030.1| hypothetical protein BACCELL_01366 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521893|gb|EEF90998.1| hypothetical protein BACCELL_01366 [Bacteroides cellulosilyticus
DSM 14838]
Length = 815
Score = 328 bits (841), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 211/601 (35%), Positives = 316/601 (52%), Gaps = 60/601 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALS 73
I ++ PA+ + +A+PIGNGRLGAM +GG+ E L+LN+ T+W+G P ++ DA K L
Sbjct: 35 IWYDKPAEIWEEALPIGNGRLGAMCFGGIHEEKLQLNDVTIWSGEPQPNSDRTDAYKKLP 94
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPA----DVY--------QLLGDIELEFDDSHLKYAE 121
++R + + Y A + + + D+Y Q LGD+ L+F+ +
Sbjct: 95 EIREALRNRDYKLAEVLTHQYMTCNSVSNEDIYNTIYSSSYQTLGDLSLKFELPEGEMG- 153
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+YRR LD+ A + V + +G F+RE FSS PD VIV K+ G LSF++ LD
Sbjct: 154 -SYRRWLDITRAISGVDFKIGEYSFSREIFSSAPDSVIVMKLGTDMKGGLSFSMLLDRKF 212
Query: 182 ------DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D+H V N ME R N + + + +K+ D G +S
Sbjct: 213 SAVTTSDSHGLVMKGNTDYMEHR---------GNCDYEAR---------VKVVADGGRVS 254
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
K+ V+G+D A + + +S+ + D + +++ L + Y D+ +
Sbjct: 255 N-SKGKISVQGADSAYVYITCQTSYILDYKKNYRRAID-SKDAVRKLNIVSRKKYDDVKS 312
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLF 354
H+ DYQ +F+R+S+ L + ++ID +P+ +R+ F + +D V+L +
Sbjct: 313 IHVADYQGIFNRLSLNLG-----------NNKSID-IPTDQRLTRFNEKSDDLGFVDLFY 360
Query: 355 QFGRYLLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
QFGRYL+ISSSR + N QGIW + W S NIN +MNYW NLSEC
Sbjct: 361 QFGRYLMISSSRENNPLPGNCQGIWGDGYKLPWHSDYKANINYQMNYWMVEASNLSECHI 420
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
P+ L G KTAQ + ASGW+ T+ W +S + +W + G W C
Sbjct: 421 PMLRLTASLVEPGRKTAQSYFNASGWMYAMMTNAWGWTSPGQ-YTIWGSFFGGSGWACQD 479
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT D+++L K YP+L+ F L LIE DGYL T+PSTSPE+ +IAPDG
Sbjct: 480 FWEHYAYTQDKEYLRK-VYPILKEACEFYLSVLIENKDGYLVTSPSTSPENRYIAPDGSR 538
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSIME 592
V+ ST++++IIR +FS I A +L NED +++L KSL RLRP +I G +ME
Sbjct: 539 VAVTEGSTIELSIIRNLFSNTIYATGIL--NEDNSFKEILEKSLARLRPLQIGRAGQLME 596
Query: 593 W 593
W
Sbjct: 597 W 597
>gi|288929797|ref|ZP_06423640.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella sp. oral taxon 317
str. F0108]
gi|288328898|gb|EFC67486.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella sp. oral taxon 317
str. F0108]
Length = 792
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 213/597 (35%), Positives = 323/597 (54%), Gaps = 44/597 (7%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP-- 69
PL+I N P F +++PIGNG+LGAMV G + LKLN+ TLW+G P D N DA
Sbjct: 24 PLRIWDNRPGSFFENSMPIGNGKLGAMVDGNPHCDYLKLNDITLWSGKPID-PNEDAGAH 82
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLG-----DIELEFD-DSHLKYAEET 123
K + +R + YA A + +++ GH + YQ L D++ + D+ LK
Sbjct: 83 KWIPQIRKALFEENYALADSLQLRVQGHNSAWYQPLSTLCICDVKAAANADAPLK----N 138
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRRELDL+++ +V Y V + RE+F+S+P + I+ +++ ++ ++S +SL SLL++
Sbjct: 139 YRRELDLDSSLVKVSYESEGVSYRREYFASHPGRAIMVRLTANKPHAISLQLSLTSLLNH 198
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+ V GN +M +A P + F +L+ K + GTI+A +D L
Sbjct: 199 QTRVEGNTIRLM------------GHAEGHPDSTVHFCNLLQAKATG--GTITA-QDSTL 243
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+ + VL +V +S++G +P + + L++++N ++ L H DDYQ
Sbjct: 244 LISNATQVVLYIVNETSYNGFDKHPVTQGAPYVQLAETDLKNLQNCTFEQLKQNHTDDYQ 303
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
LF R+++ L + D+ T ++ D E +P L L FQFGRYLLI
Sbjct: 304 ALFGRLALHLDGTKLDM-HRTTEQQLQDYTKRGE--------TNPYLETLYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSSR ANLQG+WN + W S VNINLE NYW + NL+E PL + L
Sbjct: 355 SSSRTPGVPANLQGLWNPHVRAPWRSNYTVNINLEENYWPAQVANLAELTTPLVGMVKAL 414
Query: 423 SINGSKTAQVNY-LASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHY 478
S+NG A+ Y + GW H TD+WA ++ R WA W +GGAWL ++LWE Y
Sbjct: 415 SVNGRYAARNYYGINEGWCSSHNTDLWAMTNPVGEKRESPEWANWNLGGAWLLSNLWEQY 474
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACV 536
++T DR +L YPL++G F+L WL+E G L T PSTSPE+E++ PDG
Sbjct: 475 DFTRDRHYLRHTLYPLMKGACDFMLQWLVENPKQPGELITAPSTSPENEYVTPDGYHGTT 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
Y T D+AI+RE+F+ +A E+L A + + +++ RL P I ++G + EW
Sbjct: 535 VYGGTADLAILRELFANTATADEILNGRPTAYSKILRQTIGRLHPYTIGKEGDLNEW 591
>gi|300777551|ref|ZP_07087409.1| alpha-L-fucosidase [Chryseobacterium gleum ATCC 35910]
gi|300503061|gb|EFK34201.1| alpha-L-fucosidase [Chryseobacterium gleum ATCC 35910]
Length = 836
Score = 328 bits (840), Expect = 6e-87, Method: Compositional matrix adjust.
Identities = 208/598 (34%), Positives = 314/598 (52%), Gaps = 63/598 (10%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PAK + +A+P+GNG + AMV+G E L+LNE T W+G P NPDAPK L
Sbjct: 26 KLWYDKPAKQWVEALPVGNGNMAAMVYGDPYQEKLQLNEGTFWSGGPSRNDNPDAPKVLD 85
Query: 74 DVRSLVDSGQYAEA--------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
+R + G Y A TA +V +Q +GD L+ ++ LK Y
Sbjct: 86 SIRYYLFHGNYKRAQILADKGLTAKTVH-----GSAFQNIGDFTLDLNN--LKEIR-NYY 137
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
RELD+ A A ++ G + F RE F+S PD VIV K+S +L+F +S L +
Sbjct: 138 RELDIEKAIATTTFTSGGIYFKREVFASIPDHVIVIKLSSDHKNALNFTAKFNSELKKNV 197
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
N + M+G + + P ++F+A+ + +G + ++ + V
Sbjct: 198 KAIDANTLQMDGIS--------STLDGIPGQVKFNALAKFIT---KGGKTQTSEEGISVS 246
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ ++L+ +++F + + D +++ +++ N S+ L HL+ YQ F
Sbjct: 247 NAHEVMILISIATNF----TDYKNLNTDEVAKARKYIEAAANKSFKTLVQNHLNAYQNYF 302
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L S + +N P+ R+K+F T DP L+ L +QFGRYLLISSS
Sbjct: 303 KRVDLNLGTSE--------AAKN----PTDVRIKNFATGYDPELISLYYQFGRYLLISSS 350
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PG Q ANLQGIWN P WDS +NIN EMNYW + NLSE EPL + LS
Sbjct: 351 QPGGQPANLQGIWNNSNKPAWDSKYTININTEMNYWPAEKTNLSEMHEPLIQMIKDLSET 410
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTM 482
G +TA+ Y + GWV HH TDIW + G V +A +WPMGGAWL HLWE Y Y+
Sbjct: 411 GKETAKTMYNSRGWVAHHNTDIWRIT----GVVDFANAGMWPMGGAWLSQHLWEKYLYSG 466
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDG-KLACVSYS 539
D +L + YP+L+ A F D+LIE H +L +PS SPE+ P G + + ++
Sbjct: 467 DEHYL-RTIYPVLKSAAQFYEDFLIEEPAHH-WLVASPSMSPEN---IPQGHQGSALAAG 521
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALV--EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+TMD ++ ++F+ AA++L + D + ++ LP P KI G + EW++
Sbjct: 522 NTMDNQLMFDLFTKTKKAAQILNTDSDKIQVWNTIISKLP---PMKIGSYGQLQEWME 576
>gi|423303028|ref|ZP_17281049.1| hypothetical protein HMPREF1057_04190 [Bacteroides finegoldii
CL09T03C10]
gi|408470357|gb|EKJ88892.1| hypothetical protein HMPREF1057_04190 [Bacteroides finegoldii
CL09T03C10]
Length = 1100
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 207/592 (34%), Positives = 301/592 (50%), Gaps = 49/592 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA+ + +A+PIGN RLGAMV+GG E L++NE+T W G P +P A L
Sbjct: 288 LKLWYNRPAQRWEEALPIGNSRLGAMVYGGAGHEELQINEETFWAGGPHHNNSPKAKAVL 347
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ R L+ + EA + F P + L L H K Y RELD+
Sbjct: 348 DEARRLIFEDKTMEAQKLINPNFFSGPHGMSYLNMGSLLILQPGHEK--ATNYYRELDIE 405
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY----- 186
ATA Y V V +TR FSS DQVI+ ++ + G+L F++ D+ + +
Sbjct: 406 DATATTCYEVDGVTYTRTAFSSMTDQVIIVRLEANRKGALQFSLGYDTPKEADGFAPLHP 465
Query: 187 ---VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
V GN + +C G A+A ++++ D ++ + +L
Sbjct: 466 IVKVRGNRLTM---QCTGMEQEGVASA--------IKGEWQVQVVHDGKQVN--QPDRLG 512
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+G+ A + L A+++F +N D + + + + L++ Y H YQ
Sbjct: 513 VQGATTATVYLSAATNF----VNYKDVSGNASRRAAAYLKTALKQPYPKALEAHSKAYQT 568
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RV + L P I + P+ +RV F +D +L+ LL+Q+GRYLLI
Sbjct: 569 QFNRVKLDL---PATIAS---------LAPTNQRVADFNRVDDRNLMALLYQYGRYLLIC 616
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIW L WDS +NIN EMNYW + NLSEC EPLF L LS
Sbjct: 617 SSQPGGQPANLQGIWCRSLHAPWDSKYTININTEMNYWPAEVTNLSECHEPLFSMLEDLS 676
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ G +TA+ Y A GWV HH TD+W + G W +WP GGAWLC HLW+HY YT D
Sbjct: 677 VTGHETARTLYGAKGWVAHHNTDLWRIAGPVDG-ATWGMWPNGGAWLCQHLWQHYLYTGD 735
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+ FL K YP+++G A F++ L++ G+L T PS SPEH + A C TM
Sbjct: 736 QAFLRKY-YPVMKGAADFMMSHLVKHPKYGWLVTVPSVSPEHGYTASTLTAGC-----TM 789
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
D I ++ + AA +L + A + + + +L P +I + I EW+
Sbjct: 790 DNQIAFDILNNTRLAATILGE-PTAYQDSLQATCTQLPPMQIGKYNQIQEWM 840
>gi|423223626|ref|ZP_17210095.1| hypothetical protein HMPREF1062_02281 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638251|gb|EIY32098.1| hypothetical protein HMPREF1062_02281 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 814
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 211/601 (35%), Positives = 315/601 (52%), Gaps = 60/601 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALS 73
I ++ PA+ + +A+PIGNGRLGAM +GG+ E L+LN+ T+W+G P ++ DA K L
Sbjct: 34 IWYDKPAEIWEEALPIGNGRLGAMCFGGIHEEKLQLNDVTIWSGEPQPNSDRTDAYKKLP 93
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPA----DVY--------QLLGDIELEFDDSHLKYAE 121
++R + + Y A + + + D+Y Q LGD+ L+F +
Sbjct: 94 EIREALRNRDYKLAEVLTHQYMTCNSVSNEDIYNTIYSSSYQTLGDLSLKFKLPEGEMG- 152
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+YRR LD+ A + V + +G F+RE FSS PD VIV K+ G LSF++ LD
Sbjct: 153 -SYRRWLDITRAISGVDFKIGEYSFSREIFSSAPDSVIVMKLGTDMKGGLSFSMLLDRKF 211
Query: 182 ------DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D+H V N ME R N + + + +K+ D G +S
Sbjct: 212 SAVTTSDSHGLVMKGNTDYMEHR---------GNCDYEAR---------VKVVADGGRVS 253
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
K+ V+G+D A + + +S+ + D + +++ L + Y D+ +
Sbjct: 254 N-SKGKISVQGADSAYVYITCQTSYILDYKKNYRRAID-SKDAVRKLNIVSRKKYDDVKS 311
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLF 354
H+ DYQ +F+R+S+ L + ++ID +P+ +R+ F + +D V+L +
Sbjct: 312 IHVADYQGIFNRLSLNLG-----------NNKSID-IPTDQRLTRFNEKSDDLGFVDLFY 359
Query: 355 QFGRYLLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
QFGRYL+ISSSR + N QGIW + W S NIN +MNYW NLSEC
Sbjct: 360 QFGRYLMISSSRENNPLPGNCQGIWGDGYKLPWHSDYKANINYQMNYWMVEASNLSECHI 419
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
P+ L G KTAQ + ASGW+ T+ W +S + +W + G W C
Sbjct: 420 PMLRLTASLVEPGRKTAQSYFNASGWMYAMMTNAWGWTSPGQ-YTIWGSFFGGSGWACQD 478
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT D+++L K YP+L+ F L LIE DGYL T+PSTSPE+ +IAPDG
Sbjct: 479 FWEHYAYTQDKEYLRK-VYPILKEACEFYLSVLIENKDGYLVTSPSTSPENRYIAPDGSR 537
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSIME 592
V+ ST++++IIR +FS I A +L NED +++L KSL RLRP +I G +ME
Sbjct: 538 VAVTEGSTIELSIIRNLFSNTIYATGIL--NEDNSFKEILEKSLARLRPLQIGRAGQLME 595
Query: 593 W 593
W
Sbjct: 596 W 596
>gi|389793150|ref|ZP_10196324.1| hypothetical protein UU9_03133 [Rhodanobacter fulvus Jip2]
gi|388434883|gb|EIL91810.1| hypothetical protein UU9_03133 [Rhodanobacter fulvus Jip2]
Length = 802
Score = 327 bits (838), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 205/605 (33%), Positives = 311/605 (51%), Gaps = 42/605 (6%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ + S + N L++ ++ PA F +A+P+GNGR+G MV+GGV L+E ++++G
Sbjct: 28 LFSGASLAAQN-LQLHYDAPANTFNEALPLGNGRMGVMVYGGVQQARYSLSEISMFSGSR 86
Query: 61 GDYTN-PDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADV----YQLLGDIELEF 112
D + +A L +R L+ G+ EA + + F G A+ YQ LG + L+F
Sbjct: 87 YDGADRKEAVNYLPKIRQLLLQGRNVEAEQLTNQHFTWSGEGANAHYGTYQGLGTLTLDF 146
Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
+ ++ YRR LD+ +AT+ V+Y+ V + RE F S PDQV+V +S +G+L+
Sbjct: 147 AANAAPVSD--YRRRLDIPSATSDVRYAQDGVRYRREMFVSAPDQVMVLHLSADRAGALN 204
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
F LD +G N ++M G ++ KG+ F+A + + G
Sbjct: 205 FVARLDRAERASVEGDGANGLLMRGEL---------DSGGSGKGLAFAARVRVIAP---G 252
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
+ ++VE +L+ ++ +DG DP + S + LQ + + S +
Sbjct: 253 ASMHADAHGIRVEHGTDVTVLISEATDYDG---FAGRHTTDPVAASATDLQRVASRSVAQ 309
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
L+ H+ D+ F R S+QL + +T+ R+ ++ DP L
Sbjct: 310 LHAAHVADFSSWFDRFSLQLG----------SVDNTRETMSMRARLDTYGASGDPGFAAL 359
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
FQ+ RYLLISSSRPG ANLQG+W E S W+ H N+N+EMNYW + P L E
Sbjct: 360 YFQYARYLLISSSRPGGLPANLQGLWAEGTSTPWNGDYHTNVNIEMNYWPAEPTGLGELV 419
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
+PLF L G+KTAQ Y A GWV+H T++W +A + W +W AWL
Sbjct: 420 QPLFALTASLQQPGAKTAQRYYGARGWVVHTLTNLWG-FTAPGAEASWGVWQGAPAWLSF 478
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPD 530
H+W+HY YT DRDFL +R YP+L G A F D LIE H +L T PS+SPE+ +
Sbjct: 479 HIWDHYRYTGDRDFL-RRYYPVLRGAAQFYADVLIEEPSHH-WLVTAPSSSPENTVYMEN 536
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
G A + TMD +IR +F A+I A++ L + D E K RL P +I DG I
Sbjct: 537 GGKAAIVMGPTMDEELIRFLFGAVIEASQTLHVDADFRRELEAKR-ARLAPIQIGPDGRI 595
Query: 591 MEWVQ 595
E+++
Sbjct: 596 QEYLK 600
>gi|259485946|tpe|CBF83399.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 757
Score = 327 bits (837), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 200/594 (33%), Positives = 309/594 (52%), Gaps = 46/594 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA K L
Sbjct: 3 ELWYQRPAAGWDEALPVGNGRLGAMVYGRTDTELLQLNEDSVWYGGPQNRLPEDALKCLP 62
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R L+ G + EA A F P Y+ LG + LEF H YRR LDL
Sbjct: 63 RLRELIREGAHKEAERLARRAFFASPNSQRHYEPLGTLFLEF--GHPCEEVTGYRRSLDL 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
N V Y V++ R+ +S PD V+ ++ S +S S L+ +
Sbjct: 121 NEGITHVHYEHNGVQYHRQVIASYPDNVLAMRVQASRCSEFLVRLSRLSELE-YETNEFL 179
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTISA-LEDKKLKVEGSD 248
+ ++++G+ + P ++ + ++ I+ SDD+ I K L + D
Sbjct: 180 DDLVVDGQSIKMHVTPGGKDSN-----RACCMVAIRCGSDDQEPIKVDCVGKNLIINARD 234
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A++++VA S++ D +++ L+++ S D++ RH+ DYQ L+ R+
Sbjct: 235 -ALIVIVAQSTY-------RCDDADLDRATVADLEAVLASSVEDIWARHITDYQSLYGRL 286
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L DI TD +R+ + P LV + ++ RYLLIS SRPG
Sbjct: 287 ELNLGPDATDIPTD-------------QRILHVR---GPELVAIYLRYSRYLLISCSRPG 330
Query: 369 TQ-------VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
+ A LQGIWN P W +NINL+MNYW + NL EC+EPLF L
Sbjct: 331 RKGSSDRVLPATLQGIWNASFHPPWGCRYTININLQMNYWPANVGNLLECEEPLFALLER 390
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L++ G++TA+ Y GW +HH TD+WA ++ + LWP+GGAWLCTH+WE + +
Sbjct: 391 LAVTGTETARKMYGCRGWTVHHNTDLWADTAPVDRWMPATLWPLGGAWLCTHVWERFLFN 450
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSS 540
++ FL KR +P+L GC FL D+L++ G Y TNPS SPE+ F G+ + S
Sbjct: 451 GNKAFL-KRMFPVLRGCVEFLQDFLVDDVSGQYKVTNPSLSPENTFRDEKGQEGVLCEGS 509
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
T+D+ ++R V A + + EVL ++D L+ V +L RL P +I G + EW+
Sbjct: 510 TIDIQLVRAVLKAFVESLEVLGYSQDELLPSVHDTLRRLPPARIGSKGQLQEWM 563
>gi|402307321|ref|ZP_10826347.1| putative lipoprotein [Prevotella sp. MSX73]
gi|400378835|gb|EJP31686.1| putative lipoprotein [Prevotella sp. MSX73]
Length = 796
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 206/595 (34%), Positives = 313/595 (52%), Gaps = 38/595 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PLK+ +N PA F +A+PIGNGRLGA+V+GG ++++ +N+ TLWTG P + DA +
Sbjct: 26 PLKLWYNRPATAFEEALPIGNGRLGALVYGGADTDSIYINDLTLWTGKPVELNEGGDAHR 85
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEE--TYR 125
+ +R + +G Y A + GH ++ YQ LL +L + + E+ +
Sbjct: 86 WIPVIRKELIAGNYKNADILQHLVQGHNSEYYQSLALLTLTDLNASPAEGRSGEDFGGLK 145
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LD+++A R Y G V + RE+F+S PD +I +I + SG+++ ++L S++ +
Sbjct: 146 RSLDIDSAVCRDTYHRGGVTYIREYFASAPDSLIAIRIRANRSGAINCRLALTSVVPHQV 205
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
G Q+ M G G D + I F AIL++K D G ++A D L V
Sbjct: 206 KATGR-QLTMTGHAIG----------DPLQSIHFCAILKVKTDD--GQVAA-SDSSLTVN 251
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ + V +SF+G +P + +++ + N++Y++ RH+ DY++LF
Sbjct: 252 GASEVTVYFVNRTSFNGFDKHPVTAGAPYLAKATDDIWHTENITYNECRQRHVTDYKRLF 311
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R LS + D + T E+ + + ER +P L L Q+GRYLLIS S
Sbjct: 312 DRFRFTLSGAKPD-YSRTTEEQLMAYSDNGER--------NPYLEMLYMQYGRYLLISCS 362
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R ANLQG+W W +NINLE NYW + +L E P+ + ++
Sbjct: 363 RTPGVPANLQGLWAPQKYSPWRGNYTININLEENYWPAEVTDLGELVMPVDGLVRAMAAT 422
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G TA Y + GW H +DIWA ++ + W+ W MGGAWL LW+HY++T
Sbjct: 423 GRHTAAHYYGIDEGWCAGHNSDIWAMTNPVGTGKESPKWSNWNMGGAWLVQTLWDHYDFT 482
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D +L AYPL++G A F+L WL+E G L T P TSPE E+I G C Y
Sbjct: 483 RDTHYLRNTAYPLMKGAADFMLRWLVESPVKRGELITAPCTSPEAEYINDKGYQGCTFYG 542
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
T D+AI+RE+F+ + AAE+L N DA + L+ SL L P KI + G++ EW
Sbjct: 543 GTSDLAIVRELFTNTLRAAEIL--NIDAGYRQQLRSSLDHLAPYKIGKRGNLQEW 595
>gi|67525297|ref|XP_660710.1| hypothetical protein AN3106.2 [Aspergillus nidulans FGSC A4]
gi|40744501|gb|EAA63677.1| hypothetical protein AN3106.2 [Aspergillus nidulans FGSC A4]
Length = 1679
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 200/594 (33%), Positives = 309/594 (52%), Gaps = 46/594 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA K L
Sbjct: 3 ELWYQRPAAGWDEALPVGNGRLGAMVYGRTDTELLQLNEDSVWYGGPQNRLPEDALKCLP 62
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R L+ G + EA A F P Y+ LG + LEF H YRR LDL
Sbjct: 63 RLRELIREGAHKEAERLARRAFFASPNSQRHYEPLGTLFLEF--GHPCEEVTGYRRSLDL 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
N V Y V++ R+ +S PD V+ ++ S +S S L+ +
Sbjct: 121 NEGITHVHYEHNGVQYHRQVIASYPDNVLAMRVQASRCSEFLVRLSRLSELE-YETNEFL 179
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTISA-LEDKKLKVEGSD 248
+ ++++G+ + P ++ + ++ I+ SDD+ I K L + D
Sbjct: 180 DDLVVDGQSIKMHVTPGGKDSN-----RACCMVAIRCGSDDQEPIKVDCVGKNLIINARD 234
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A++++VA S++ D +++ L+++ S D++ RH+ DYQ L+ R+
Sbjct: 235 -ALIVIVAQSTY-------RCDDADLDRATVADLEAVLASSVEDIWARHITDYQSLYGRL 286
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L DI TD +R+ + P LV + ++ RYLLIS SRPG
Sbjct: 287 ELNLGPDATDIPTD-------------QRILHVR---GPELVAIYLRYSRYLLISCSRPG 330
Query: 369 TQ-------VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
+ A LQGIWN P W +NINL+MNYW + NL EC+EPLF L
Sbjct: 331 RKGSSDRVLPATLQGIWNASFHPPWGCRYTININLQMNYWPANVGNLLECEEPLFALLER 390
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L++ G++TA+ Y GW +HH TD+WA ++ + LWP+GGAWLCTH+WE + +
Sbjct: 391 LAVTGTETARKMYGCRGWTVHHNTDLWADTAPVDRWMPATLWPLGGAWLCTHVWERFLFN 450
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSS 540
++ FL KR +P+L GC FL D+L++ G Y TNPS SPE+ F G+ + S
Sbjct: 451 GNKAFL-KRMFPVLRGCVEFLQDFLVDDVSGQYKVTNPSLSPENTFRDEKGQEGVLCEGS 509
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
T+D+ ++R V A + + EVL ++D L+ V +L RL P +I G + EW+
Sbjct: 510 TIDIQLVRAVLKAFVESLEVLGYSQDELLPSVHDTLRRLPPARIGSKGQLQEWM 563
>gi|427387089|ref|ZP_18883145.1| hypothetical protein HMPREF9447_04178 [Bacteroides oleiciplenus YIT
12058]
gi|425725694|gb|EKU88563.1| hypothetical protein HMPREF9447_04178 [Bacteroides oleiciplenus YIT
12058]
Length = 826
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 198/603 (32%), Positives = 312/603 (51%), Gaps = 48/603 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N ++ LK+ ++ PA + +A+P+GNGR+GAMV+G E +LNE+T+W G P +
Sbjct: 17 NVQAQQADETLKLWYDTPATQWVEALPLGNGRIGAMVFGDPVHEQFQLNEETVWGGSPHN 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATA------ASVKLFGHPADVYQLLGDIELEFDDSH 116
TNP A +AL +R L+ G+ AEA A S G P YQ +G + L+FD
Sbjct: 77 NTNPKAKEALPRIRQLIFEGKNAEAQALCGPAICSQSANGMP---YQTVGTLHLDFDGIS 133
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
Y + Y R+LD+ A + +++ V +TRE ++S PDQV+V +++ S+ S+SF
Sbjct: 134 -NYTD--YYRDLDIEKAISTTRFTANGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAK 190
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK---GIQFSAILEIKISDDRGT 233
Y + I+ P K + AND ++F+ + +I + G
Sbjct: 191 ---------YTTPYKENIVRCISPRKELQLNGKANDHEGIEGKVEFTTL--TRIENSGGN 239
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ L D L+V+ ++ +V L V S F+N D + + + L ++ N +Y+
Sbjct: 240 LEVLSDSTLQVKNAN-SVTLYV---SIGTNFVNYKDVSGNAQTTAQKYLANV-NKNYTKS 294
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H YQK F+RVS+ L R+ + P+ RVK F + DP + L
Sbjct: 295 KATHTSTYQKFFNRVSLDLGRNAQA------------DKPTDVRVKEFSSSFDPQMAALY 342
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+P Q ANLQGIWN L WD +IN+EMNYW + +L E E
Sbjct: 343 FQFGRYLLICSSQPDGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHE 402
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
P + ++I G K+A + Y GW +HH TDIW + A G + +WP AW C H
Sbjct: 403 PFLQLVKEVAIQGRKSAAM-YGCRGWTLHHNTDIWRSTGAVDGP-GYGIWPTCNAWFCQH 460
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LW+ Y ++ D+++L + YPL+ G F LD+L+ E + +L PS SPE+ + +
Sbjct: 461 LWDRYLFSGDKNYLAE-VYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENRPVVNGKR 519
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
V +TMD ++ ++F I+AA+++ +N + + + L P ++ G + E
Sbjct: 520 DFVVVAGATMDNQMVYDLFYNTIAAAQLMNENT-TFTDSLQTVVNHLAPMQVGRWGQLQE 578
Query: 593 WVQ 595
W+
Sbjct: 579 WMH 581
>gi|406660853|ref|ZP_11068981.1| hypothetical protein B879_00989 [Cecembia lonarensis LW9]
gi|405555406|gb|EKB50440.1| hypothetical protein B879_00989 [Cecembia lonarensis LW9]
Length = 778
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 201/589 (34%), Positives = 302/589 (51%), Gaps = 38/589 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKALSDV 75
+ PA + +A+P+GNGRLGAMV+G +E ++LNED+LW G P D+ + P+ L +
Sbjct: 28 YEKPASIWEEALPLGNGRLGAMVFGQTSTERIQLNEDSLWPGGPDDWGPAEGKPEDLEFI 87
Query: 76 RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
R L+ G+ +A + V F + +Q LGD+ L+ + YRRELDL+ A
Sbjct: 88 RQLLLHGENKKADSLLVAKFSRKSITRSHQTLGDLWLDLGHEEVS----NYRRELDLDRA 143
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-----YVN 188
+ Y+V F ++ FSS PDQ IV ++ ++ + L D+
Sbjct: 144 LVTISYTVEGYVFLQKVFSSAPDQAIVIRLESKHPKGINGKIRLSRPEDDGYPTVTVQAT 203
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N + MEG +R + + G++F I + I ++ G D +++EG +
Sbjct: 204 SNQTLQMEGEITQRRGQIDSKPSPILHGVKFQTI--VFIENESGKTFQKGDH-IELEGVE 260
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ LV ++S+ +D ++ LQ+I+ ++ +L RH+ DYQ LF RV
Sbjct: 261 ALNIKLVTNTSY---------YHQDFQRKNQEQLQNIKAKTFEELEQRHITDYQSLFQRV 311
Query: 309 SIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
L +P DI TD ERVK + + D L LLF FGRYLLISSSRP
Sbjct: 312 KFSLEEPNPLDIPTDQ----------RIERVK--EGNSDLYLESLLFDFGRYLLISSSRP 359
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQG+WN + W++ H+NINL+MNYW + NLSE EP FD++ L ++G
Sbjct: 360 GTLPANLQGLWNRHIEAPWNADYHLNINLQMNYWPAEVTNLSELHEPFFDYMDQLILSGK 419
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y G + H +D+W + + W W G W+ H WE Y +T D++FL
Sbjct: 420 KTARETYGMRGSALAHGSDLWHMTFLQAAQAYWGAWLGAGGWMMQHFWERYLFTQDKNFL 479
Query: 488 EKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+R P +E A+F LDWL+ DG ++PSTSPE+ FI G+ + + MD I
Sbjct: 480 RQRFLPAMEEIAAFYLDWLVPYPEDGTWVSSPSTSPENSFINAKGESVASTMGAAMDQQI 539
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I EVF + A+++L L E K + DG ++EW Q
Sbjct: 540 IAEVFDHFMQASKILGYQSPVLDEVKSKRQNLRSGLRTGNDGRLLEWDQ 588
>gi|404448807|ref|ZP_11013799.1| hypothetical protein A33Q_05728 [Indibacter alkaliphilus LW1]
gi|403765531|gb|EJZ26409.1| hypothetical protein A33Q_05728 [Indibacter alkaliphilus LW1]
Length = 778
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 202/599 (33%), Positives = 313/599 (52%), Gaps = 58/599 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKALSDV 75
+ PA + +A+P+GNGRLGAMV+G E ++LNED+LW G P D+ P L+ +
Sbjct: 29 YEQPADKWEEALPLGNGRLGAMVFGRTDVERIQLNEDSLWPGGPNDWGLAQGKPDDLACI 88
Query: 76 RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
R L+ G+ +A + V LF + +Q +GD+ LE + Y+R LDL+ A
Sbjct: 89 RELLVKGENKKADSLMVALFSRKSITRSHQTMGDLWLELGHQDIS----NYQRSLDLDKA 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN-----HSYVN 188
A V Y EF ++ +S DQ I+ +I+ + L+ + LD D+
Sbjct: 145 LATVTYQYEGYEFEQKAIASAKDQGIIIQITTTHPKGLNGKIRLDRPEDDGYPTVKISTP 204
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
NN + M+G ++ + G++F TI+ LE++ K+EG
Sbjct: 205 ANNSLQMDGEVTQRKGQIDSKPAPILHGVRFQ------------TIALLENEGGKLEGKG 252
Query: 249 WAVLL---------LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
A+ + LVA++SF D ++ + L +++ L++++L RH
Sbjct: 253 DAIWIENVKTLSIKLVANTSF---------YHTDFRGKNQADLMALKELNFAELQKRHQK 303
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
D+Q LF RV+ QL E++IDT+P+ R+++ + D L +LLF +GR
Sbjct: 304 DHQGLFRRVNFQLG------------EKSIDTIPTDRRIENIKAGATDLHLEKLLFDYGR 351
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI SSRPGT ANLQGIWN+ ++ W++ H+NIN++MNYW + NLSE +P F+F
Sbjct: 352 YLLIGSSRPGTLPANLQGIWNQHIAAPWNADYHMNINMQMNYWPAEVTNLSELHDPFFEF 411
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L +G KTA+ Y G H TD+W + + W W G W+ H WE Y
Sbjct: 412 TDALIPSGQKTAKETYGMRGAAFAHGTDLWKMTFLQAAQAYWGSWLGAGGWMMQHYWERY 471
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D +FL++R P+ E +F DW++ DG L ++PSTSPE+ FI +G A +
Sbjct: 472 LFTQDVEFLKERFIPVAEEIVAFYADWIVPHPLDGKLASSPSTSPENSFINSNGDHAAST 531
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWVQ 595
+ MD II EVF I+A E+L D L++++ + RLR ++ DG +MEW Q
Sbjct: 532 IGAAMDQQIIAEVFDNYINAVELLGIQSD-LLQEIKEKRSRLRSGLQVGSDGRLMEWDQ 589
>gi|290962265|ref|YP_003493447.1| hypothetical protein SCAB_79571 [Streptomyces scabiei 87.22]
gi|260651791|emb|CBG74917.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 945
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 212/588 (36%), Positives = 310/588 (52%), Gaps = 44/588 (7%)
Query: 13 LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
L + ++ PA + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N
Sbjct: 42 LALWYDKPAGADWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAAN 101
Query: 72 LSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRREL 128
++++R V + Q+ A + + G PA YQ +G++ L F + Y+R L
Sbjct: 102 IAEIRRRVFADQWGPAQDLINQTMLGSPAGQLAYQPVGNLLLSFGSA---TGASQYKRTL 158
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL TATA Y++ V + RE F DQVIV +++ + +++ + + DS
Sbjct: 159 DLTTATALTTYALNGVRYQREVFVGARDQVIVVRLTADRANAITCSATFDSPQRTTLSSP 218
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
I ++G ++F A+ + GT+S+ L+V G+
Sbjct: 219 DGATIALDG--------TSGTMEGITGRVRFLALAHAAATG--GTVSS-SGGTLRVSGAT 267
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+L+ SS+ ++ ++ D + L + R++ L +RH D+Q LF RV
Sbjct: 268 SVTVLVSIGSSY----VDFRNTDGDHRGIARRHLDAARDIDIDALRSRHRTDHQALFDRV 323
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
SI L R+ T +++ P+ R+ DP LLFQFGRYLLISSSRPG
Sbjct: 324 SIDLGRT-------TAADQ-----PTDVRIAQHAQVSDPQFAALLFQFGRYLLISSSRPG 371
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
TQ ANLQGIWN+ ++P+WDS +N NL MNYW + NLSEC P+FD + L++ G++
Sbjct: 372 TQPANLQGIWNDQMAPSWDSKFTINANLPMNYWPADTTNLSECLLPVFDMIDDLTVTGAR 431
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ Y A GWV HH TD W +S G W +W GGAWL T +W+HY +T D DFL
Sbjct: 432 VARAQYGAGGWVTHHNTDAWRGASVVDG-AQWGMWQTGGAWLATLIWDHYLFTGDTDFLR 490
Query: 489 KRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
YP L+G A F LD L+ H G+L TNPS SPE P A V TMD I
Sbjct: 491 SN-YPALKGAAQFFLDTLVA-HPTLGHLVTNPSNSPE----LPHHTNATVCAGPTMDNQI 544
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+R++F+++ A E L + + L + RL PT++ G++ EW+
Sbjct: 545 LRDLFTSVARAGETLGVDA-GFRAQALAARDRLAPTRVGSRGNVQEWL 591
>gi|402814854|ref|ZP_10864447.1| hypothetical protein PAV_3c01950 [Paenibacillus alvei DSM 29]
gi|402507225|gb|EJW17747.1| hypothetical protein PAV_3c01950 [Paenibacillus alvei DSM 29]
Length = 810
Score = 326 bits (835), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 203/611 (33%), Positives = 320/611 (52%), Gaps = 63/611 (10%)
Query: 21 AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVD 80
A +T+A PIGNGRLG +V+GG+ E ++LNED++W G D N A AL D+++L+
Sbjct: 15 ASKWTEAFPIGNGRLGGVVYGGIQREQIQLNEDSIWYGGARDNDNRAAQAALPDIKNLLL 74
Query: 81 SGQYAEATAASVKLFGHPADV------YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
G +A +K H +V YQ LG++ L+F+ + +A Y R+LDL+ A
Sbjct: 75 QGNVRKAEKLVLK---HMTNVPQYFNPYQTLGNLFLDFEPNIEVHAINQYCRKLDLDHAL 131
Query: 135 ARVKYSVGN-------------------VEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
+V Y VG ++++RE FSS DQV+V +++ ++ L+F
Sbjct: 132 VQVNYEVGRQDKEGRTATQATGEAQKEAIQYSREIFSSAADQVLVIRMTTTDEAGLTFAA 191
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D V ++ G+ I + D G++++ +L+ + G
Sbjct: 192 KFDRRPFTGEMVQTDD---------GQGIAMQGQLGAD--GVRYAVVLQAVVE---GGQC 237
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
L + + L++ A +SF +D+ +++ A + + Y L
Sbjct: 238 QTAGNYLDIRQARAVTLIVAAQTSF-----RCADAYAVACQQAIQAAK----VPYEKLKQ 288
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLF 354
RHLDDY+ LF+RV++ L + + +++R++ + Q D L L +
Sbjct: 289 RHLDDYKPLFNRVTLDLEAEEGERTEPQQQVPGQQCLSTSQRLERYRQGATDNGLEALFY 348
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYLL++SSRPGT ANLQGIWN+ +P W+S H+NINL+MNYW + NL+EC P
Sbjct: 349 QYGRYLLLASSRPGTLPANLQGIWNDSFTPPWESDYHLNINLQMNYWLAETGNLAECHMP 408
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LFDF+ L ING +TA+ Y A G+V H +++WA + V +WPMGGAW+ H+
Sbjct: 409 LFDFIERLVINGRQTARNIYGARGFVAHTSSNLWADTGIYGEYVSANMWPMGGAWIALHM 468
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WEHY Y FL +RAYP+L+ A F LD+L+E G L T PS SPE+ + + G++
Sbjct: 469 WEHYCYNGSLSFLRERAYPVLKEAALFFLDFLLELPSGQLVTVPSLSPENSYRSEQGEVG 528
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA-----------LVEKVLKSLPRLRPTK 583
+ Y +MD I+ +F+A I A E+L+ +E+ L+ + + +L +
Sbjct: 529 ALCYGPSMDSQILYALFTACIRAGELLQLDEEGHLKQGFHEDKDLLAQWQQVRSKLPQPQ 588
Query: 584 IAEDGSIMEWV 594
I G IMEW
Sbjct: 589 IGRHGQIMEWA 599
>gi|386820649|ref|ZP_10107865.1| hypothetical protein JoomaDRAFT_2613 [Joostella marina DSM 19592]
gi|386425755|gb|EIJ39585.1| hypothetical protein JoomaDRAFT_2613 [Joostella marina DSM 19592]
Length = 780
Score = 325 bits (834), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 201/595 (33%), Positives = 308/595 (51%), Gaps = 48/595 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ ++ PA + +A+P+GNGR+GAM++GG+ +E +LNED++W G P + L+
Sbjct: 25 VWYSQPADTWMEALPVGNGRMGAMIYGGIETEHFQLNEDSMWPGSPNLSNAKGTAEDLAL 84
Query: 75 VRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+R L+D G+ EA + + F V +Q GD+ L F + + Y+R LD
Sbjct: 85 IRKLIDEGKVHEADSLIIDKFSRQDIVRSHQTAGDLFLHFKN---RGEVTNYKRSLDFEK 141
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH------SY 186
AT+ V YSV F FSS PD V+V K+ S + F++ + D +
Sbjct: 142 ATSYVSYSVDGNTFKETAFSSQPDNVLVIKLETSNRNGMDFDIEMSRPKDEGVETVKVAT 201
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
++M G ++ G++F L++K G I++ +L V
Sbjct: 202 FPEKQLMLMNGEVTQMGGVVESVPTPIKNGVKFQTRLKVK--SKSGIITS-NGNRLTVRN 258
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +LL+ +S+ P D ++ +++ + Y L H+ D++ L++
Sbjct: 259 AKEVLLLIATETSYYHP---------DYIEKAELVIENAESKGYKALVNNHIQDFKNLYN 309
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
RVS+ I TD ++E P+ +R++ ++ D L E LF +GRYLLISSS
Sbjct: 310 RVSLH-------IETDNSNKE----FPTDKRLERYKAGVVDVGLQETLFNYGRYLLISSS 358
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R GT ANLQGIWN ++ W++ H+NINL+MNYW + NL+EC+ PLFDF L I
Sbjct: 359 RKGTNPANLQGIWNNHITAPWNADYHLNINLQMNYWLAPITNLAECELPLFDFGNRLIIR 418
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+ + G + HH TD+W + W W G WL H W +Y +T D
Sbjct: 419 GKETAKQYGINRGSMSHHATDLWGPAFMRARTPYWGAWIHGAGWLAQHYWGYYLFTEDEV 478
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETN------PSTSPEHEFIAPDGKLACVSYS 539
FL+++ YP L+ A+F LDWL Y E+ P TSPE+ +IA DGK A VS
Sbjct: 479 FLKEQGYPYLKEVATFYLDWL-----QYDESTKEWFSYPETSPENSYIANDGKPAAVSRG 533
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
+ M II EVF IISA+E+L +D L+++V K LRP +I DG ++EW
Sbjct: 534 TAMGQQIIGEVFRNIISASEILAI-DDELIKEVKKKAENLRPGVQIGADGRVLEW 587
>gi|376260262|ref|YP_005146982.1| trehalose/maltose hydrolase or phosphorylase [Clostridium sp.
BNL1100]
gi|373944256|gb|AEY65177.1| trehalose/maltose hydrolase or phosphorylase [Clostridium sp.
BNL1100]
Length = 1159
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 204/586 (34%), Positives = 302/586 (51%), Gaps = 57/586 (9%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
F A+P+GNGR+GAMV+G P E + LNE T W+ PG+ A +L + + +GQ
Sbjct: 76 FYKALPLGNGRIGAMVYGNYPDERIDLNEATFWSSGPGNNNRAGAANSLKTAQDQLFAGQ 135
Query: 84 Y-AEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
Y +T + + G YQ +GD++L F S + Y R+LD+NT Y+
Sbjct: 136 YKTGSTTIANSMIGGGEAKYQSIGDLKLLFGHSSV----SNYSRQLDMNTGVVSSDYTYN 191
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN--GNNQIIMEGRCP 200
++ RE F S PDQ++VTKI+ S GS+S +S L V+ GN+ ++M G
Sbjct: 192 GKQYHRESFVSYPDQIMVTKITCSSPGSISLTAGYESSLTGQYTVSTSGNDTLVMNGH-- 249
Query: 201 GKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
D GI ++ KI + G++SA + ++ V +D V+L +
Sbjct: 250 ----------GDSDNGISYAVWFSTRSKIINSNGSVSA-NNNQISVSNADSVVIL----T 294
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
S F+N D ++ + + + SY LY H+ DYQ LF RV + L S
Sbjct: 295 SIRTNFVNYKTCNGDEKGKATTDITNASAKSYDTLYNNHVADYQNLFKRVDVDLGGSG-- 352
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
SE N P +R+ F T DP L ++LFQ+GRYL+IS+SR +Q NLQGIW
Sbjct: 353 ------SENN---KPMGQRISEFGTTNDPKLAKVLFQYGRYLMISASRD-SQSMNLQGIW 402
Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LAS 437
N+ +P W NIN EMNYW + NL+EC EP L G++TA+ +Y +++
Sbjct: 403 NKFRNPAWGCKMTTNINYEMNYWPAFTTNLAECFEPFVKKAKELQAPGNETARAHYNISN 462
Query: 438 GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
GWV+HH TD+W +++ G+ W LWP G W+ L++ YN+ D +L + YP+++G
Sbjct: 463 GWVLHHNTDLWNRTAPIDGE--WGLWPTGAGWVSNMLYDAYNFNQDTAYLNE-IYPVIKG 519
Query: 498 CASFLLDWL----IEGHDGYLETNPSTSPEHEFIAP----DGKLACVSYSSTMDMAIIRE 549
A FL + I G + Y PSTSPE + P G+ A SY TMD I RE
Sbjct: 520 AADFLQTLMQSKSINGQN-YQVICPSTSPE---LTPPGTSGGQGAYNSYGVTMDNGISRE 575
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
+F +I AA +L N D L+S + +++P I G + EW
Sbjct: 576 LFKDVIQAAGIL--NVDPAFRSTLQSKVSQIKPNTIGSWGQLQEWA 619
>gi|189468049|ref|ZP_03016834.1| hypothetical protein BACINT_04443 [Bacteroides intestinalis DSM
17393]
gi|189436313|gb|EDV05298.1| hypothetical protein BACINT_04443 [Bacteroides intestinalis DSM
17393]
Length = 830
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 201/603 (33%), Positives = 315/603 (52%), Gaps = 48/603 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N ++ LK+ ++ PA + +A+P+GNGR+G MV+G E +LNE+T+W G P +
Sbjct: 18 NLQAQQEDQTLKLWYDKPATQWVEALPLGNGRIGTMVFGDPVHEQFQLNEETVWGGSPHN 77
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSH 116
TNP A AL +R L+ G+ EA T S G P YQ +G + L+FD +
Sbjct: 78 NTNPKAKDALPRIRQLIFEGKNKEAQELCGPTICSQSANGMP---YQTVGSLHLDFDGIN 134
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+Y + Y R+LD+ A A +++ V +TRE ++S PDQV+V +++ S+ S+SF
Sbjct: 135 -EYND--YYRDLDIEKAIATTRFTANGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAK 191
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK---GIQFSAILEIKISDDRGT 233
Y ++ P K + AND ++F+A+ +I ++ G
Sbjct: 192 ---------YSTPYKSSVIRCISPRKELQLNGKANDHEGIEGKVEFTAL--TRIENNGGK 240
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ L D L+V+ ++ +V+L V S F+N D D + + L+ + N +Y
Sbjct: 241 LEILSDSTLQVKDAN-SVILYV---SIGTNFVNYKDVSGDALNSAQQYLKLV-NKNYPKS 295
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H++ YQK F+RVS+ L S I+ P+ RVK F + DP + L
Sbjct: 296 KASHINAYQKYFNRVSLNLG-----------SNAQINK-PTDVRVKEFSSSFDPQMAVLY 343
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN L WD +IN+EMNYW + +L E E
Sbjct: 344 FQFGRYLLICSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHE 403
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
P + ++I G ++A + Y GW +HH TDIW + A G + +WP AW C H
Sbjct: 404 PFLQLVKEVAIQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGS-SYGVWPTCNAWFCQH 461
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LW+ Y ++ D+++L + AYPL+ G F LD+L+ E + +L PS SPE+ +
Sbjct: 462 LWDRYLFSGDKNYLSE-AYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENSPAVNGQR 520
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
V +TMD ++ ++F ISAA+++ + A + + + L P ++ G + E
Sbjct: 521 TFVVVAGTTMDNQMVYDLFYNTISAAKLMNETT-AFTDSLQTVVNNLAPMQVGRWGQLQE 579
Query: 593 WVQ 595
W+
Sbjct: 580 WMH 582
>gi|261878761|ref|ZP_06005188.1| fibronectin type III domain protein [Prevotella bergensis DSM
17361]
gi|270334768|gb|EFA45554.1| fibronectin type III domain protein [Prevotella bergensis DSM
17361]
Length = 814
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 209/585 (35%), Positives = 306/585 (52%), Gaps = 45/585 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ ++ PA + +A+PIGNG LGAMV+GG ETL LNE T W+G P D + ++ L
Sbjct: 23 RLWYHQPASKWVEALPIGNGFLGAMVYGGTRQETLALNETTFWSGGPHDNNSTESLSYLP 82
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQL-LGDIELEFDDSHLKYAEETYRRELDLN 131
++R + G+ EA + P + L LGD+ + F++ H + + Y R L+L
Sbjct: 83 EIRQKIFEGKENEAQKLIDQHVVKGPHGMRFLPLGDVRIRFEE-HGEVGQ--YSRSLNLE 139
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A V Y++G V+ R F+S PD+VI +I S SF +S+ SL + + +GN
Sbjct: 140 KALHEVSYTIGGVKIQRVSFASLPDRVIGMRIKSSRR--TSFTISVHSLFQSEAQTHGN- 196
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+EG G D +G+ + A I + + G + D L+VE +
Sbjct: 197 --ALEGTVYG----------DSQEGVAGRLRAHYRIVVKGN-GKVVPTGDS-LRVERASN 242
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ + A+++F +N D D + + + S+ L RH+ Y+ + RVS
Sbjct: 243 TEIYMAAATNF----VNFKDVSGDEKAVVNRLMAGVSGQSFDRLLKRHVRAYRCQYDRVS 298
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + S +P+ ER++ F +D +V L+F +GRYLLISSS+PG
Sbjct: 299 LTL---------NGASPSPHAQLPTDERLRQFAGSQDMGMVALIFNYGRYLLISSSQPGG 349
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN + + WDS +NIN EMNYW + CNL E +PLF + LS+ G KT
Sbjct: 350 QPANLQGIWNGERNAPWDSKYTININTEMNYWPAETCNLREAVKPLFSLIGDLSLTGEKT 409
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y GWV HH TD+W + G W ++P GG WL THLW+HY YT DR FL +
Sbjct: 410 ARQMYGCRGWVAHHNTDLWRIAGPVDG-AYWGMFPNGGGWLSTHLWQHYLYTGDRVFL-R 467
Query: 490 RAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
Y +L+G A F LD++ + GYL PS SPEH P GK + V TMD I
Sbjct: 468 LWYSVLKGAADFYLDYMQTDPRTGYLVVVPSVSPEH---GPHGK-SPVGAGCTMDNQIAF 523
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+V S + A E+L N A + + K++ L P KI G + EW
Sbjct: 524 DVLSNCLQATEILNGNR-AYADSLRKAIAALPPMKIGRHGQLQEW 567
>gi|374333663|ref|YP_005086791.1| hypothetical protein PSE_p0242 [Pseudovibrio sp. FO-BEG1]
gi|359346451|gb|AEV39824.1| hypothetical protein PSE_p0242 [Pseudovibrio sp. FO-BEG1]
Length = 798
Score = 325 bits (833), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 207/602 (34%), Positives = 300/602 (49%), Gaps = 64/602 (10%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF- 95
MV+G S + LNEDTL++G P Y P+ + V +L+ G+ EA K +
Sbjct: 1 MVYGNPLSARIHLNEDTLYSGEPTRIYPVPEIAHQIDHVEALLRDGKLFEAQEFVRKNWT 60
Query: 96 GHPADVYQLLGDIELEF-DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
G YQ +G++ + DDS + YRR LD+ + Y F R F+S
Sbjct: 61 GRQGQAYQPVGNLFITMADDSPVS----NYRRALDIRHSLHHESYEQNRTTFERTSFASF 116
Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVNGNNQIIMEGRCP------------ 200
PD VIV +++ + G+LSF++ DS ++ N ++ + G+ P
Sbjct: 117 PDNVIVVRLTADKPGTLSFSLRYDSPHPTCRTTHEAENTRLHLRGQAPAFTSSRVIERIE 176
Query: 201 ---------------GKRIPPKANANDDPKG------------IQFSAILEIKISDDRGT 233
GK P N D +G F A L +++ R
Sbjct: 177 HDQEQSRNPEIFGADGKLRPFAENEQDGHRGGIVYSEDGLGEGTYFEAGLSVELEGGR-- 234
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
E +L +EG+ L + ++SF+GP +PS KDP SAL + ++SY D
Sbjct: 235 -IRPERGELHIEGATAVTLRIAIATSFNGPDKSPSREGKDPAPIVKSALDTAGSVSYEDT 293
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
+H DD +LF RVS++L + I +P++ R++ FQ DP+L L
Sbjct: 294 LQKHSDDVLRLFDRVSLKLGNNA------------IPDLPTSTRLEQFQEKGDPALAALQ 341
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQ+GRYLLI+SSR G+Q NLQGIW+ P W S +NINLEMNYW + LS+ E
Sbjct: 342 FQYGRYLLIASSRGGSQPPNLQGIWSNLRRPQWSSNYTMNINLEMNYWPAEITGLSDLHE 401
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + L+++G++TA+ + A GW H T IW S A WPM WL +H
Sbjct: 402 PLFMLIEELAVSGARTAKKMFNAPGWCAFHNTTIWRDSVPSPCDPASAFWPMAAGWLLSH 461
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
+WEH+ YT D++FL+ RAYPL++ A F WL E DGYL STSPE+ ++ DG +
Sbjct: 462 MWEHFLYTGDKEFLKNRAYPLMKSAAEFYEWWLCENKDGYLVPKVSTSPENRYLDEDGHV 521
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
V STMD AIIRE F+ +AA++L + + L + RL P +I G + EW
Sbjct: 522 ITVDQGSTMDCAIIRETFTNTAAAAKLLGLDAE-LANTLEAKAARLLPYQIGAQGQVQEW 580
Query: 594 VQ 595
Q
Sbjct: 581 SQ 582
>gi|410029118|ref|ZP_11278954.1| hypothetical protein MaAK2_07959 [Marinilabilia sp. AK2]
Length = 754
Score = 325 bits (832), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 203/592 (34%), Positives = 309/592 (52%), Gaps = 44/592 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKALSDV 75
+ PA + +A+P+GNGRLGAMV+G +E ++LNED+LW G P D+ + P+ L +
Sbjct: 4 YEKPASIWEEALPLGNGRLGAMVFGQTSTERIQLNEDSLWPGGPDDWGPAEGKPEDLEFI 63
Query: 76 RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
R L+ G+ +A + V F + +Q LGD+ L+ + YRRELDL+ A
Sbjct: 64 RQLLLHGENKKADSLLVAKFSRKSITRSHQTLGDLWLDLGHEEVS----NYRRELDLDRA 119
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-----YVN 188
+ Y+V F ++ FSS PDQ IV ++ ++ + L D+
Sbjct: 120 LVTISYTVEGYVFLQKVFSSAPDQAIVIRLESKHPKGINGKIKLSRPEDDGYPTVTVQAT 179
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N + MEG +R + + G++F I + I ++ G D +++EG +
Sbjct: 180 SNQTLHMEGEITQRRGQIDSKPSPILHGVKFQTI--VFIENESGKTFQKGDH-IELEGVE 236
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ LV ++S+ +D ++ LQ+I+ ++ +L RH+ DYQ LFHRV
Sbjct: 237 ALNIKLVTNTSY---------YHQDFQRKNQEQLQNIKAKTFEELEQRHITDYQSLFHRV 287
Query: 309 SIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
L +P D TD ERVK +TD L LLF FGRYLLISSSRP
Sbjct: 288 KFSLDDPNPLDSPTDQ----------RIERVKGGKTD--LYLESLLFDFGRYLLISSSRP 335
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQG+WN + W++ H+NINL+MNYW + NLSE EP FD++ L ++G
Sbjct: 336 GTLPANLQGLWNRHIEAPWNADYHLNINLQMNYWPAEVTNLSELHEPFFDYMDQLILSGK 395
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y G + H +D+W + + W W G W+ H WE Y +T D++FL
Sbjct: 396 KTARETYGMRGAALAHGSDLWNMTFLQAAEAYWGAWLGAGGWMMQHFWERYLFTQDKNFL 455
Query: 488 EKRAYPLLEGCASFLLDWLI---EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+R P +E A+F LDWL+ EG G ++PSTSPE+ FI G+ + + MD
Sbjct: 456 RQRFLPAMEEIAAFYLDWLVPYPEG--GKWVSSPSTSPENSFINAKGESVASTMGAAMDQ 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWVQ 595
+I EVF + A+++L + ++++V LR +I DG ++EW Q
Sbjct: 514 QVIAEVFDNFMQASKIL-GYQSPILDEVKSKRQNLRSGLRIGSDGRLLEWDQ 564
>gi|319642679|ref|ZP_07997325.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
gi|345520274|ref|ZP_08799672.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|254836101|gb|EET16410.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|317385767|gb|EFV66700.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
Length = 814
Score = 325 bits (832), Expect = 6e-86, Method: Compositional matrix adjust.
Identities = 202/589 (34%), Positives = 321/589 (54%), Gaps = 46/589 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NPDA + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G + D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ D+ D + D RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I ++++ II+ A +L + + ++ + L + P +I G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWM 575
>gi|294777781|ref|ZP_06743227.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|294448369|gb|EFG16923.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 814
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 202/589 (34%), Positives = 320/589 (54%), Gaps = 46/589 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NPDA + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G + D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L VT + RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSLNLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I ++++ II+ A +L + + ++ + L + P +I G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWM 575
>gi|150005172|ref|YP_001299916.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149933596|gb|ABR40294.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
Length = 814
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 202/589 (34%), Positives = 321/589 (54%), Gaps = 46/589 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NPDA + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G + D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ D+ D + D RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I ++++ II+ A +L + + ++ + L + P +I G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWM 575
>gi|423311885|ref|ZP_17289822.1| hypothetical protein HMPREF1058_00434 [Bacteroides vulgatus
CL09T03C04]
gi|392689264|gb|EIY82542.1| hypothetical protein HMPREF1058_00434 [Bacteroides vulgatus
CL09T03C04]
Length = 814
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 202/589 (34%), Positives = 321/589 (54%), Gaps = 46/589 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NPDA + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G + D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ D+ D + D RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I ++++ II+ A +L + + ++ + L + P +I G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWM 575
>gi|345569032|gb|EGX51901.1| hypothetical protein AOL_s00043g635 [Arthrobotrys oligospora ATCC
24927]
Length = 723
Score = 324 bits (831), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 200/570 (35%), Positives = 295/570 (51%), Gaps = 60/570 (10%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
MV+G +E L+LNED++W G P D A + L ++R L+ G+ EA A F
Sbjct: 1 MVYGQTTTEVLQLNEDSVWYGGPQDRLPKAALQNLPELRRLIREGRQKEAEALVRAAFFA 60
Query: 97 HPADVY--QLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
+P+ + LG + L+FD + YRRELD++ A +RV+YS +++ RE +S
Sbjct: 61 YPSSQRHSEPLGTLHLDFDYGYQGIDIRDYRRELDISQAISRVQYSCNGIQYQREAIASY 120
Query: 155 PDQVIVTKISGSESGSLSFNVS--------LDSLLDNHSYVNGNNQIIMEGRCPGKRIPP 206
PDQVI +S S+S + ++ + LD + +G +IIM
Sbjct: 121 PDQVIGINLSSSQSSKYTIRLNRVSEREYETNEFLDTLTTRDG--KIIM----------- 167
Query: 207 KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN 266
+A G + ++ + +D G + L + L V G + +LL + ++F
Sbjct: 168 --HATPGGGGSRLCCVVSARSNDPDGRVQVLGNT-LVVTGKS-STILLASQTTF------ 217
Query: 267 PSDSKKDPTSESMSALQSIRNL-SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCS 325
+DP ++AL I S++ + RHL DY+ L+ RV ++LS I TD
Sbjct: 218 ---RVEDP---ELAALGDIEKCGSWTQILDRHLKDYKNLYGRVCLKLSSDDSHIPTDL-- 269
Query: 326 EENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLS 383
Q DP LV L +GRYLLIS SRPG + A LQGIWN
Sbjct: 270 --------------RLQRKPDPGLVGLYHNYGRYLLISCSRPGDKALPATLQGIWNPSFQ 315
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
P W S +NIN +MNYW + NL EC+ PLF+ L + +NG++TA+ Y GW HH
Sbjct: 316 PPWGSKYTININTQMNYWPANISNLPECETPLFELLERVQVNGARTAKEMYGCRGWCAHH 375
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
TDIWA ++ + LWP+GGAWLCTH+WE Y + D+ FL+ R +P+LEGC FLL
Sbjct: 376 NTDIWADTNPQDKWMPATLWPLGGAWLCTHIWERYLFFEDKSFLQ-RLFPVLEGCVRFLL 434
Query: 504 DWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK 563
D+LI+ G+ TNPS SPE+ F G+ +STMD+ I+ VF A I++ +LE
Sbjct: 435 DFLIKDDHGFYVTNPSLSPENTFKNQRGEEGVFCEASTMDIQILTAVFKAYITSCHILEG 494
Query: 564 NEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +V K+L L P ++ G + EW
Sbjct: 495 LGTVDMAEVNKALAGLPPVIVSSTGLLQEW 524
>gi|408787527|ref|ZP_11199255.1| hypothetical protein C241_15883 [Rhizobium lupini HPC(L)]
gi|408486464|gb|EKJ94790.1| hypothetical protein C241_15883 [Rhizobium lupini HPC(L)]
Length = 739
Score = 324 bits (831), Expect = 8e-86, Method: Compositional matrix adjust.
Identities = 207/589 (35%), Positives = 313/589 (53%), Gaps = 48/589 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ ++ A +T+A+PIGNGRLGAMV+GG E +++NE T + G P NPDA L
Sbjct: 5 RLWYDTAASAWTEALPIGNGRLGAMVFGGAWDERIQINESTFYNGGPYQPINPDAKDHLP 64
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR + G+Y EA + D+ YQ +GD+++ F YRRELDL
Sbjct: 65 AVRQRILDGKYMEAERLAYDHVMARPDLQTSYQPIGDLKIAFQHDMTTI---NYRRELDL 121
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
T A +Y V + R+ F+S VIV K++ + GSLS ++ L S + + +
Sbjct: 122 ETGIAVTRYDCDGVHYHRQIFASAIADVIVCKVTVDKPGSLSLSLLLSSPQNGEAEDRRD 181
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD---DRGTISALEDKKLKVEGS 247
+ + GR N P ++F+ ++ + DRG S ++V +
Sbjct: 182 HVLGYLGR--------NRKQNGIPGALRFAFRTQVVATGGFVDRGPES------IRVREA 227
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D ++ + A +SF D DP + L ++ DL H++D+++LF R
Sbjct: 228 DSVIIFIDAGTSFR----RYDDVSGDPEKTTEMRLARASTRAFEDLLEEHVEDHRRLFGR 283
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
++I + ++ VP+ +RV+ DP L L Q+GRYL I+SSRP
Sbjct: 284 MAIDIG-------------PDLSHVPTDKRVRDNVAKPDPQLAALYTQYGRYLAIASSRP 330
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ +NLQGIWNE++ P W+S +NIN +MNYW + P NL+E PL + + L+ G
Sbjct: 331 GTQPSNLQGIWNEEILPPWNSKFTLNINTQMNYWLADPANLAETFIPLIEMVEDLAETGQ 390
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ A+ +Y A GWV+HH TDIW S G W LWP GGAWLC L++HY+++ D L
Sbjct: 391 EMARAHYGARGWVVHHNTDIWRASGPIDGP-KWGLWPTGGAWLCAQLYDHYSFSGDEAIL 449
Query: 488 EKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+R YPL++G A F+LD L++ Y T PS SPE+ P G C MD I
Sbjct: 450 -RRIYPLMKGSAEFILDILVDLPGTSYRVTCPSLSPENRH--PGGTSLCA--GPAMDNQI 504
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
IR+VF+A+ISA+E L +E AL +++ + RL K+ + G + EW++
Sbjct: 505 IRDVFAAVISASEALAIDE-ALRAELVAARARLPEDKVGKVGQLQEWIE 552
>gi|333382100|ref|ZP_08473777.1| hypothetical protein HMPREF9455_01943 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829131|gb|EGK01795.1| hypothetical protein HMPREF9455_01943 [Dysgonomonas gadei ATCC
BAA-286]
Length = 820
Score = 324 bits (830), Expect = 9e-86, Method: Compositional matrix adjust.
Identities = 200/588 (34%), Positives = 320/588 (54%), Gaps = 47/588 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALSDV 75
+ PA + ++P+GNGR+GAMV+GG+ E + LNE T+W+G P + P L+D+
Sbjct: 47 YENPADEWMKSLPLGNGRIGAMVFGGIEKEVIALNEVTMWSGQPDKFQERPLGKTMLNDI 106
Query: 76 RSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L G+YA+ + H + GD++L+F + A Y+REL+L
Sbjct: 107 RQLFFEGKYAKGNRVVSEFMSGTPHSFGSHVPAGDLKLDF--KYPAGAVSGYKRELNLEN 164
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG-NN 191
A V + VGN+ +TRE+F SNPD + +++ +++ SL+ +VSLD L + S + +N
Sbjct: 165 AINTVSFKVGNILYTREYFCSNPDNAFIVRLTANKAKSLTLDVSLDMLRE--SVIKAVDN 222
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ G+ P + P G+ F + + D G +SA + K+ + +
Sbjct: 223 SLEFSGKVS---FPKQG-----PGGVDFMGKVGVTAKD--GNVSA-SNNKISIADATSVT 271
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
++L + + N K+D + AL Y+ L +H+ DY LF RV +
Sbjct: 272 IILDLRTDY-----NNKHYKEDCFATVNKALSQ----DYNRLKNKHVSDYSNLFKRVDLF 322
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
L +S D + T ERVK+ + ED L L FQ+ RYLLI++SR + +
Sbjct: 323 LGKSEAD---------KLPTDKRWERVKAGK--EDVGLDALFFQYARYLLIAASREDSPL 371
Query: 372 -ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIWN++L+ W + H++IN + NYW S NL EC PLFD++ LS+ G K
Sbjct: 372 PANLQGIWNDNLACNMGWTNDYHLDINTQQNYWLSNIGNLHECNTPLFDYIKDLSVYGQK 431
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y A GWV + ++W +++ +G V W L+P+ G W+ +HLW HY YTMD ++L
Sbjct: 432 TAKNVYGARGWVANTVANVWGYTASGQG-VNWGLFPLAGTWIASHLWTHYIYTMDENYLR 490
Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+AYP+L+ A FLLD++++ +GYL T PSTSPE+ F +L+ VS D +
Sbjct: 491 NKAYPILKSNAEFLLDYMVQDPKNGYLMTGPSTSPENSFRYKGNELS-VSLMPACDRQLA 549
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
E F++ I A+++L +D + + +L +L P I ++G+I EW +
Sbjct: 550 YEAFASCIQASKILNV-DDKFRDSLSIALKKLPPIIIGKNGAIQEWFE 596
>gi|290769720|gb|ADD61497.1| putative multimodular carbohydrate-active enzyme [uncultured
organism]
Length = 1083
Score = 324 bits (830), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 199/595 (33%), Positives = 312/595 (52%), Gaps = 42/595 (7%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M+N + + +K+ ++ PA+ + +A+P+GN RLGAMV+GG E ++LNE+T W G P
Sbjct: 282 MINKQEATR---MKLWYSAPARRWVEALPVGNSRLGAMVYGGTDKEEIQLNEETFWAGGP 338
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
NP + L+ R LV + + +EA + F + L L + K
Sbjct: 339 YRNDNPKGKEVLAKTRELVFANRLSEAQKLIDENFFTGQHGMRFLTMGSLLINQPEHKNV 398
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
E Y RELD+ A A +Y V V +TR FSS D VIV ++ + +L+F++S +S
Sbjct: 399 E-NYYRELDIENAVAVTRYMVDGVTYTRTVFSSFADNVIVVRMEADKPKALNFDLSYNSP 457
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
L + GN ++ +C G + +GI + E ++ S +K
Sbjct: 458 LKHVVMAKGNELVV---KCEGM----------EQEGIPAALNAECRVLVRHNGKSGKSNK 504
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ V+ + A L + A+++F +N D + + + S L+ + Y H+
Sbjct: 505 SVVVDQATVATLYISAATNF----VNYHDVGGNASKLASSILKRAVKVPYEQALANHIAA 560
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y++ F RV+ + T+T T+ + +RV +F +D +L+ L+FQ+GRYL
Sbjct: 561 YKEQFDRVTFSIPS------TET------STLETDKRVVAFGEGKDLNLIALMFQYGRYL 608
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSS+PG Q ANLQG+W + WDS +NIN EMNYW + NLSE +PLFD ++
Sbjct: 609 LISSSQPGGQPANLQGLWCNSVYAPWDSKYTININTEMNYWPAEVTNLSENHQPLFDMVS 668
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LS+NG KTA+ Y A GWV HH TD+W ++ + +WP GGAWL HLW+HY +
Sbjct: 669 DLSVNGKKTAETVYGARGWVAHHNTDLW-RACGPIDAAYFGMWPNGGAWLTQHLWQHYLF 727
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
T D++FL +R YP+++G A F L L++ +G+L T PS SPEH + C
Sbjct: 728 TGDKEFL-RRYYPVMKGAADFYLSHLVKHPQNGWLVTAPSVSPEHGYAGSSITAGC---- 782
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
TMD I + + AA +L +++ A + + + +L P +I I EW+
Sbjct: 783 -TMDNQIAFDALYNTMLAARILGESQ-AYQDSLAVAFKQLPPMQIGRHNQIQEWL 835
>gi|383778158|ref|YP_005462724.1| hypothetical protein AMIS_29880 [Actinoplanes missouriensis 431]
gi|381371390|dbj|BAL88208.1| hypothetical protein AMIS_29880 [Actinoplanes missouriensis 431]
Length = 746
Score = 323 bits (829), Expect = 1e-85, Method: Compositional matrix adjust.
Identities = 202/588 (34%), Positives = 308/588 (52%), Gaps = 53/588 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAPKAL 72
++ ++ PA + +A+PIGNGRLG MV GGV +E ++L+E T W+G P D+ NP A +++
Sbjct: 3 RLLYDRPASRWFEALPIGNGRLGGMVHGGVGTEIIRLSESTAWSGAPSDHDVNPAAAQSI 62
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R L+ G++AEA A+ L G P L L D + L A+ YRRELDL+
Sbjct: 63 PVIRRLLFEGEHAEAQRLAAEHLTGRPTSFGTNLPLPRLRLDFA-LDQAD-GYRRELDLD 120
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
T A V++ F RE F+S+P VI ++S S + ++SF +LD + ++ G +
Sbjct: 121 TGLASVEFDQNQTHFVRETFASHPHGVIAMRLSASRAAAISFTAALDDTVLPGTFTGGAD 180
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ GR + +D +G+ + ++ D GT+ A +D + V G+D
Sbjct: 181 GLAFRGRAV------ETLHSDGEQGVDVE--IRVRFVIDGGTLLAADDT-VTVTGADVVD 231
Query: 252 LLLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ + S+SF P + P+ Y + H++D+Q+L RVS+
Sbjct: 232 VFVTVSTSFCAPSLVEPA--------------------PYEVMRAAHVEDHQRLMRRVSL 271
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L +P D+ TD ER+ + D+D L+ L FQ+GRYL I+ SR +
Sbjct: 272 DLG-TPIDLPTDV----------RRERLARGERDDD--LIALYFQYGRYLTIAGSRADSP 318
Query: 371 VA-NLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
+ LQG+WN+ + + W + H++IN + NYW + NL+EC PLF FLT L+ +G
Sbjct: 319 LPLALQGVWNDGFASSMGWSNDFHLDINTQQNYWAAESTNLAECHTPLFRFLTGLASSGR 378
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TAQ Y A GWV H T+ W S+ RG + W L GGAWL LWEHY Y D FL
Sbjct: 379 STAQQMYGADGWVAHTVTNAWGYSAPGRG-IGWGLNVTGGAWLALQLWEHYEYRPDVRFL 437
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+AYP+L CA FLLD+L E G+L PS SPE+ ++A DG ++ +T D
Sbjct: 438 RDQAYPVLRSCALFLLDYLTPEPSHGWLVAGPSESPENSYLAADGTPCSIAMGTTADRVF 497
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ AA +L+ + + L +V + RL P +I G + EW+
Sbjct: 498 AEAILRICGQAAAILDVDPE-LRSRVAAARDRLSPFRIGRHGQLQEWL 544
>gi|347840685|emb|CCD55257.1| glycoside hydrolase family 95 protein [Botryotinia fuckeliana]
Length = 747
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 201/596 (33%), Positives = 313/596 (52%), Gaps = 60/596 (10%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ + PAK +++++PIGNGRLGAMV+GG+ ETL+LNE+++W G P D T DA + L
Sbjct: 10 LHYTSPAKEWSESLPIGNGRLGAMVYGGISRETLQLNENSIWYGGPQDRTPKDAFRNLDR 69
Query: 75 VRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R + G + EA + + F H Y+ LG + L+ K ++ Y R L+L+
Sbjct: 70 LRHFIRIGDHTEAEKLAEQAFFATPHSQRHYEPLGTLTLDLGHDPAKVSK--YWRGLELS 127
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLLDN 183
TA +Y V R F+S PD V+V ++ SE + +S D +D+
Sbjct: 128 TANVTTEYEHLGVRHKRTVFASYPDDVLVVQLESSEKAQFTIRLSRYSDREFATDEFVDS 187
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+G I+M G PG R N+N+ F ++ ++ G + + +
Sbjct: 188 IEAQDGT--IVMHG-TPGGR-----NSNN------FCCVVSVQELAGDGNVETVGN--CV 231
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ S A++++ A ++F +D + ++ +AL S ++DL RH+ DY
Sbjct: 232 IVNSSKAIIIISAQTTF-----RYTDVEAKTLIQARNALHS-----HADLSKRHVQDYSS 281
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L+ R ++L I P+ ER+ T DP LV L +GRYLLIS
Sbjct: 282 LYGRFKLRLFPDAAHI-------------PTNERL---LTSPDPGLVALYANYGRYLLIS 325
Query: 364 SSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SRPG + A LQG+WN P W S +NIN +MNYW + CNL EC++PLFD L
Sbjct: 326 CSRPGDKALPATLQGLWNPSFQPAWGSKYTININTQMNYWPANVCNLEECEDPLFDMLER 385
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
++ G KTA+V Y GW H TDIWA + + LWPM GAWLCTH+W+ + +
Sbjct: 386 MANRGEKTARVMYGCRGWASHSCTDIWADTDPQDRWMPGTLWPMSGAWLCTHIWQRHLFG 445
Query: 482 MDRDF-LEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYS 539
D++ +R +P+L G F+LD+L++ G YL TNPS SPE+ +I G+ +
Sbjct: 446 GDQNLKFLQRMFPVLRGSVQFILDFLVKDSSGDYLITNPSLSPENSYIDLKGQKGVLCEG 505
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +D+ II+ +F A + + + L+ +D L E + + +L P++I E G + EW+Q
Sbjct: 506 SAIDIQIIKSLFKAFLLSVDSLQM-KDELTEPLKLARDKLPPSEIGEFGQLQEWLQ 560
>gi|218129730|ref|ZP_03458534.1| hypothetical protein BACEGG_01309 [Bacteroides eggerthii DSM 20697]
gi|217988142|gb|EEC54466.1| hypothetical protein BACEGG_01309 [Bacteroides eggerthii DSM 20697]
Length = 1063
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 197/585 (33%), Positives = 307/585 (52%), Gaps = 43/585 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ ++ PA + +A+P+GN RLGAMV+GG E ++LNE+T W G P NP AL
Sbjct: 271 MKLWYSAPAHRWVEALPVGNSRLGAMVYGGTDKEEIQLNEETFWAGGPYSNDNPKGKGAL 330
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ VR LV + + +EA + F G + +G + F + E Y RELD+
Sbjct: 331 AKVRELVFANRLSEAQKMIDENFFTGQHGMRFLTMGSL---FINQPEHKNVENYYRELDI 387
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V +TR FSS D VIV ++ + +L+F++S +S L + GN
Sbjct: 388 ENAVAVTRYMVDGVTYTRTVFSSFADDVIVVRMEADKPKALNFDLSYNSPLKHAVTAKGN 447
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
I+ +C G + +GI + E ++ S ++ + V + A
Sbjct: 448 ELIV---KCEGA----------EQEGIPAALNAECRVLVKHNGKSGKSNESVVVNQATVA 494
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L + A+++F +N D + + ++L+ + Y H+ Y+K F RV
Sbjct: 495 TLYISAATNF----VNYHDVSGNASKLVSTSLKRAVKIPYEQALANHIAAYKKQFDRVKF 550
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ T+T T+ + +RV +F +D +L+ L+FQ+GRYLLISSS+PG Q
Sbjct: 551 SIPS------TET------STLETDKRVAAFGEGKDQNLMALMFQYGRYLLISSSQPGGQ 598
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQG+W + WDS +NIN EMNYW + NLSE +PLFD ++ LS++G KTA
Sbjct: 599 PANLQGLWCNSVYAPWDSKYTININTEMNYWPAEVTNLSENHQPLFDMVSDLSVSGKKTA 658
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y A GWV HH TD+W ++ + +WP GGAWL HLW+HY +T D++FL +R
Sbjct: 659 ETVYGARGWVAHHNTDLW-RACGPIDAAYFGMWPNGGAWLTQHLWQHYLFTGDKEFL-RR 716
Query: 491 AYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YP+++G A F L L++ +G+L T PS SPEH + C TMD I +
Sbjct: 717 YYPVMKGAADFYLSHLVKHPQNGWLVTAPSVSPEHGYAGSSITAGC-----TMDNQIAFD 771
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ AA +L +++ A + + + +L P +I + EW+
Sbjct: 772 ALYNTMLAARILGESQ-AYQDSLAVAFKQLPPMQIGRHNQLQEWL 815
>gi|312131915|ref|YP_003999255.1| alpha-l-fucosidase [Leadbetterella byssophila DSM 17132]
gi|311908461|gb|ADQ18902.1| Alpha-L-fucosidase [Leadbetterella byssophila DSM 17132]
Length = 793
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 204/604 (33%), Positives = 317/604 (52%), Gaps = 69/604 (11%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ F PA+HFT+++P+GNGRLGAMV+G E + LNE +LW+G P D +A K+L
Sbjct: 23 LLFYAPARHFTESLPLGNGRLGAMVFGQTAKERIALNEISLWSGGPQDADREEAYKSLKP 82
Query: 75 VRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKYAE 121
++ L+ G+ EA K F P YQ LGD+ LE+ D +
Sbjct: 83 IQQLLLEGKNKEAQTLLEKEFIAKGRGSGFGRGAKDPYGSYQTLGDLFLEWKDGEVS--- 139
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y+R LDL+ A A +++ ++ T E F+ + +I ++ S++ L V L S
Sbjct: 140 -NYKRWLDLDNALATTQFTRNGIQITEEVFTDFKNDLIWVRLRSSKAKGLYLKVGL-SRE 197
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+N + +I + G+ P A +P G++F+AIL+ A D K
Sbjct: 198 ENAQVQADSKEIKLWGQLP---------AGSEP-GMKFAAILQ----------EAHVDGK 237
Query: 242 LKVEGSDW-------AVLLLVASSSF-DGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++VEG+ W +L + A++++ +G I ++D T ++ Q + L+YS
Sbjct: 238 VEVEGNTWNIVGASEVILQISAATNYHEGKLI-----EEDVTQKARKYFQ--KGLTYSAA 290
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVEL 352
+ L+ +Q FHR +QL ++ + + + +R+K + D L L
Sbjct: 291 FKSSLEKFQSYFHRSELQLK-----------GQDKLAHLSTPDRLKRLAEGKSDLDLYAL 339
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+ +GRYLLI SSRPG ANLQG+W + W+ H+NIN++MNYW + L E
Sbjct: 340 YYHYGRYLLICSSRPGLLPANLQGLWAVEYQAPWNGDYHLNINVQMNYWPAELTGLGELA 399
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
EPL F L NG KTA+ Y A GWV H ++ W +S G W GGAWLC
Sbjct: 400 EPLHRFTANLVKNGEKTAKAYYQAEGWVAHVISNPWFFTSPGEG-ADWGSTLTGGAWLCE 458
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDG 531
H+WEHY +T D +FL K YP+L+G A FL LIE +G+L T PS SPEH ++ PDG
Sbjct: 459 HIWEHYRFTKDIEFLRKY-YPVLKGSAQFLSSILIEEPKNGWLVTAPSNSPEHAYVLPDG 517
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ TMDM I RE+F+A+I +AE+L +++ +++ + L P ++ ++G +
Sbjct: 518 TKVNTAMGPTMDMQICRELFNAVIQSAEILGVDKE-FRDELSAKVRNLAPNRVGKNGDLN 576
Query: 592 EWVQ 595
EW++
Sbjct: 577 EWLE 580
>gi|431798012|ref|YP_007224916.1| hypothetical protein Echvi_2666 [Echinicola vietnamensis DSM 17526]
gi|430788777|gb|AGA78906.1| hypothetical protein Echvi_2666 [Echinicola vietnamensis DSM 17526]
Length = 819
Score = 322 bits (826), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 201/586 (34%), Positives = 305/586 (52%), Gaps = 41/586 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA + +A+PIGNGRLGAMV+G E ++LNE+TL+ G P NPDA +AL
Sbjct: 30 LKLWYDDPAASWVEALPIGNGRLGAMVFGDPYEEVIQLNENTLYAGRPHRNDNPDAKEAL 89
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++V+S++ GQY A + F G YQ +G ++L FDD + YRRELDL
Sbjct: 90 AEVQSMIFDGQYGAAQHRINETFFSGINGMPYQTMGQLKLYFDDER---EVKEYRRELDL 146
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A Y G+ FT + +S+PDQV+V ++ + G++ F +D N
Sbjct: 147 KKALVTTHYKKGDTHFTTQVLASHPDQVMVIHLTADKPGAIHFTALVDRPGPFQLQHAAN 206
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++M G + G++F+ + +K S + + + V ++ A
Sbjct: 207 GELLMTGTS--------GDHEGIKGGVEFATRVRVKHSKGEMVKTG---EGIAVNNANSA 255
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ + +++F D + S L+ S+ + H +D+++ F RVS+
Sbjct: 256 TIYISMATNFK----QYDDISGNAVELSKQHLEKALGKSFDQIRKSHEEDHRRYFDRVSL 311
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L E + P+ +RV++F +DP L L FQFGRYLLI++SR G Q
Sbjct: 312 DLG------------ESEAEKDPTDKRVENFSKRDDPGLAALYFQFGRYLLIAASRAGGQ 359
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ L+P WDS VNIN EMNYW S +LSE EPL + + LS G KTA
Sbjct: 360 PANLQGIWNDQLNPAWDSKYTVNINTEMNYWPSEITHLSEMNEPLVEMVRELSQTGRKTA 419
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y A GW +HH TD+W + G W +WPMGGAWL HL + ++++ D +L K
Sbjct: 420 KDMYGARGWAMHHNTDLWRITGPVDG-AFWGMWPMGGAWLTQHLLDKFDFSGDTTYL-KS 477
Query: 491 AYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHE-FIAPDGKLACVSYSSTMDMAIIR 548
YP+L+ F LD L + G+ PS SPE+ ++ D A V TMD ++
Sbjct: 478 IYPILKEACLFYLDILKVAPETGWKVVVPSISPENAPYLDHD---ASVGAGHTMDNQLLS 534
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
++F AA +L+ + A E++ S L P +I G + EW+
Sbjct: 535 DLFQRTSRAASILD--DKAFAEQLKDSWALLAPMQIGRWGQLQEWM 578
>gi|325263746|ref|ZP_08130479.1| fibronectin type III domain protein [Clostridium sp. D5]
gi|324030784|gb|EGB92066.1| fibronectin type III domain protein [Clostridium sp. D5]
Length = 769
Score = 322 bits (824), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 211/605 (34%), Positives = 309/605 (51%), Gaps = 57/605 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
+I F A+ +T+A+PIGNG LGAMV+G E +++NED++W+G + NPDA L
Sbjct: 3 EIWFRKEAEEWTEALPIGNGFLGAMVFGRTSVERIQVNEDSVWSGGYMERLNPDAKGHLD 62
Query: 74 DVRSLVDSGQYAEATA-ASVKLFG-HP-ADVYQLLGDIELEFDD--------------SH 116
+VR L+ G+ EA AS ++ +P YQ LGD+ ++F + S
Sbjct: 63 EVRQLLMQGRVQEAELLASRSMYAVYPHMRHYQTLGDVWIDFFNTRGRQTVKKKENGTSF 122
Query: 117 LKYAE---ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
++Y E YRR L+L A + Y+ RE F+S+P V+V ++ E +L F
Sbjct: 123 VEYESPVFEEYRRSLNLEDAVGNIVYTAEKGAVKREFFASSPAGVLVYRMCAEEDEALDF 182
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGK----RIPPKANANDDPKGIQFSAILEIKISD 229
VSL + DN S G +G R+ K ND GI F + ++I+
Sbjct: 183 EVSL-TRKDNRS---GRGSSFCDGTMAVGDDTIRLYGKNGGND---GIAFE--MAVRIAS 233
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
G + + VEG+ AVL + +++ KDP + M L+ L
Sbjct: 234 VGGRQYRM-GSHIIVEGAKEAVLYITGRTTY---------RSKDPAAWCMETLEKAAGLP 283
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPS 348
Y +L +HL+DY L++ V + EE ++ + + ER+ +T ED
Sbjct: 284 YEELKMQHLEDYHSLYN-----------SCVLELDEEEELEQLSTPERLARMRTGKEDVG 332
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
LV L + FGRYLLISSSR + ANLQGIWNED P W S +NIN++MNYW + L
Sbjct: 333 LVNLHYNFGRYLLISSSRENSLPANLQGIWNEDFEPAWGSKYTININIQMNYWMAEKTGL 392
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGA 468
S PL + L + +G +TA+ Y A G+ HH TDIW + V +WPMGGA
Sbjct: 393 SRLHMPLLEHLKTMRPHGQETAEKMYGARGFCCHHNTDIWGDCAPQDSHVSATIWPMGGA 452
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
WLC H+ EHY YT DR F+E+ Y +L F D++++ G+ T PS+SPE+ ++
Sbjct: 453 WLCLHIIEHYLYTKDRVFMEE-FYGILRDSVQFFADYMVQDEQGHWITGPSSSPENIYMN 511
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
G+ C+ MD I+RE+FS + E L++ D L +V L L P KI + G
Sbjct: 512 EQGECGCLCMGPAMDSEILRELFSGYLRITEELDRG-DGLEAEVKMRLEGLPPVKIGKYG 570
Query: 589 SIMEW 593
I EW
Sbjct: 571 QIQEW 575
>gi|357043574|ref|ZP_09105265.1| hypothetical protein HMPREF9138_01737 [Prevotella histicola F0411]
gi|355368238|gb|EHG15659.1| hypothetical protein HMPREF9138_01737 [Prevotella histicola F0411]
Length = 808
Score = 322 bits (824), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 206/608 (33%), Positives = 308/608 (50%), Gaps = 55/608 (9%)
Query: 7 TSTTNP----LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
TST N + + ++ PA+ F +++P+GNG+LGA+++GG ++T+ LN+ T WTG P
Sbjct: 14 TSTINAQQQSMLLWYDHPAQFFEESLPMGNGKLGALIYGGTKNDTIYLNDITYWTGKP-- 71
Query: 63 YTNPDAPKALS----DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLK 118
NP+ S +R + + Y A + + G + YQ LG L +
Sbjct: 72 -VNPNEGIGKSVWIPRIREALFAENYRLADSLQHYVQGEQSASYQPLGTFNL---INLTP 127
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
A + YRREL++++A A V Y V + +E+F S D +I +I+ ++ G ++F +SL
Sbjct: 128 GAIQNYRRELNIDSAMAHVSYQQDGVTYKKEYFVSQSDSLIAIRITANKPGKVNFKISLT 187
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ + H + Q+ M G GK + A ++++ G S
Sbjct: 188 AQVP-HKTKASDEQLTMIGHATGK------------ENETIHACTIVRLTHKEGQDSH-T 233
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D L VE +D A L +V ++SF+G +P D D + ++ A +N +Y++ RH+
Sbjct: 234 DSTLTVENADEATLYIVNATSFNGFNKHPVDDGADYMNNAIDAAWHTKNFTYNEFKQRHI 293
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-------SLVE 351
+ YQ+L+ R+++QL D + +P+ E +K + T P L
Sbjct: 294 NAYQRLYQRLNLQLGHDKYD-----------NNIPTDELLKKYSTPHTPLSVAAQRYLET 342
Query: 352 LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
L FQFGRYLL+S SR ANLQG+W L W +NINLE NYW + N+SE
Sbjct: 343 LYFQFGRYLLLSCSRTPGVPANLQGLWTPYLFSPWRGNYTMNINLEENYWPANSTNISET 402
Query: 412 QEPLFDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVV--WALWPMGG 467
+PLF FL L+ NG TA Y + GW H +DIW K++ GK WA W +GG
Sbjct: 403 IQPLFSFLKGLAANGKYTAHNFYGVNEGWCASHNSDIWCKTAPVGEGKESPEWANWNLGG 462
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHE 525
AWL LW++Y YT D L+ YPL+EG + F WLIE H G L T PST+PE+E
Sbjct: 463 AWLVNTLWDYYLYTQDFQMLKSTIYPLMEGASRFCKQWLIENPKHPGELITAPSTTPENE 522
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
++ G Y T D+AIIRE+F A +L D + LK RL P I
Sbjct: 523 YLTDKGYHGTTCYGGTADLAIIRELFENTQQARRILNIKPDKQLNNTLK---RLHPYTIG 579
Query: 586 EDGSIMEW 593
+G + EW
Sbjct: 580 AEGDLNEW 587
>gi|294674990|ref|YP_003575606.1| hypothetical protein PRU_2351 [Prevotella ruminicola 23]
gi|294471732|gb|ADE81121.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 769
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 204/595 (34%), Positives = 308/595 (51%), Gaps = 48/595 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKA 71
+ + +N PA F +++PIGNG++GA+++GG + LN+ TLWTG P D + DA K
Sbjct: 1 MVLEYNKPATFFEESLPIGNGKMGALIYGGTDDNVIYLNDITLWTGKPVDRNLDADAHKW 60
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL-EFDDSHLKYAEETYRRELDL 130
+ ++R + + YA A + + + G + YQ LG + + + +KY YRR LD+
Sbjct: 61 IPEIRKALFNENYALADSLQLHVQGPNSQHYQPLGTLHIKDLGLGEIKY----YRRTLDI 116
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
++A R Y TRE+F+SNPD++I ++ G + ++ + H +G
Sbjct: 117 DSAIVRDSYERDGRHITREYFASNPDKLIAIRLRGDINCQIALTAQVP-----HQVKSGL 171
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
Q+ M G G D + F IL +K + A D L + + A
Sbjct: 172 GQLTMTGHATG----------DAQESTHFCTILSVKTDGEM----AASDSSLTITKAKEA 217
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
++ +V +SF+G +P + + L +N+++ + Y RHL DY+ ++ RV I
Sbjct: 218 IIYIVNETSFNGFDKHPVREGANYLEAVTNDLWHTQNMTFDEFYARHLADYKTIYDRVKI 277
Query: 311 QLS---RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSS 365
L+ R+PKD+ D + E + + D+ P L EL FQFGRYLLIS+S
Sbjct: 278 CLNKGGRNPKDLPGAK------DRRMTDEMLLDYTNGNDQTPYLEELYFQFGRYLLISAS 331
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R ANLQG+W L W VNINLE NYW + N++E EPL F+ L+ N
Sbjct: 332 RTKNVPANLQGLWAPQLWSPWRGNYTVNINLEENYWPAFVANMAEMAEPLDGFIAGLAAN 391
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSSADRGK---VVWALWPMGGAWLCTHLWEHYNYT 481
G TA+ Y + GW H +DIWA ++ K W+ W +GGAWL LWE Y +T
Sbjct: 392 GKFTAKNYYNIHEGWCSSHNSDIWAMTNPVGEKNESPEWSNWNLGGAWLVNTLWERYQFT 451
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D+ +L+ AYPL++G A F L WLI+ G L T PSTSPE+E+ G Y
Sbjct: 452 QDKTYLKNIAYPLMKGAAQFCLRWLIDNPKQPGELITAPSTSPENEYKTDKGYHGTTCYG 511
Query: 540 STMDMAIIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
T D+AIIRE+F I+A +VL KN++ + ++L +L P I G + EW
Sbjct: 512 GTADLAIIRELFINTIAAGKVLGLKNKE-----MEQALAKLHPYTIGHMGDLNEW 561
>gi|288925542|ref|ZP_06419475.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
gi|288337758|gb|EFC76111.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
Length = 796
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 205/595 (34%), Positives = 309/595 (51%), Gaps = 38/595 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PLK+ +N PA F +A+PIGNGRLGA+V+GG ++++ +N+ TLWTG P + DA +
Sbjct: 26 PLKLWYNRPATAFEEALPIGNGRLGALVYGGADTDSIYINDLTLWTGKPVELNEGGDAHR 85
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEE--TYR 125
+ +R + +G Y A + GH ++ YQ LL +L + + E+ +
Sbjct: 86 WIPVIRKELFAGNYKNADILQHLVQGHNSEYYQSLALLTLTDLNASPAEGRSGEDFGELK 145
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LD+++A R Y G V + RE+F+S PD +I I G+++ ++L S++ +
Sbjct: 146 RSLDIDSAVCRDTYHRGGVTYIREYFASAPDSLIAMHIRADRPGAINCCLALTSIVPHQV 205
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
G Q+ M G G D + I F AIL++K SD G ++A D L V
Sbjct: 206 KATGR-QLTMTGHAIG----------DPLQSIHFCAILKVKTSD--GQVAA-SDSSLTVS 251
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ + V +SF+G +P + +++ + N++Y++ RH+ DY++LF
Sbjct: 252 GASEVTVYFVNRTSFNGFDKHPVTAGAPYLAKATDDIWHTENITYNECRQRHVADYKRLF 311
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R L + + T EE + S Q + +P L L Q+GRYLLIS S
Sbjct: 312 DRFKFTLGGAKPNYSRTT--EEQL-------MAYSDQGERNPYLEMLYMQYGRYLLISCS 362
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R ANLQG+W W +NINLE NYW + +L E P+ + ++
Sbjct: 363 RTPGVPANLQGLWAPQKYSPWRGNYTININLEENYWPAEVTDLGELVMPVDGLVRAMAAT 422
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G TA Y + GW H +DIWA ++ + W+ W MGGAWL LW+HY++T
Sbjct: 423 GRHTAAHYYGIDEGWCAGHNSDIWAMTNPVGTGKESPKWSNWNMGGAWLVQTLWDHYDFT 482
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D +L AYPL++G A F+L WL+E G L T P TSPE E+I G C Y
Sbjct: 483 RDTHYLRNTAYPLMKGAADFMLRWLVESPVKRGELITAPCTSPEAEYINDKGYQGCTFYG 542
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
T D+AI+RE+F+ + AAE+L N DA + L+ SL L P KI + G++ EW
Sbjct: 543 GTSDLAIVRELFTNTLRAAEIL--NIDAGYRQQLRSSLDHLAPYKIGKRGNLQEW 595
>gi|402087340|gb|EJT82238.1| hypothetical protein GGTG_02212 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 833
Score = 322 bits (824), Expect = 5e-85, Method: Compositional matrix adjust.
Identities = 205/596 (34%), Positives = 313/596 (52%), Gaps = 46/596 (7%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S + PL++ P +F D+ IGNGRLG + GG SE++ LNED+ W+G D NPD
Sbjct: 27 SASKPLRMWQTTPGVNFNDSFLIGNGRLGFSLPGGALSESIVLNEDSFWSGGEMDRVNPD 86
Query: 68 APKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETY 124
A + ++++L+ G+ EA+ AS+ G P V + +G + + S + + Y
Sbjct: 87 AAAHMPEIQALIARGEIREASRLASMSYVGTPVSVRHFDWVGKLGISMRGSAGQVRD--Y 144
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-----S 179
R LD+ A V Y+VG V + RE+ +S PD VI +IS ++SG++SF++ +
Sbjct: 145 ERWLDVGEGVAGVYYTVGGVAYKREYTASFPDDVIAVQISANKSGAVSFDLHQSRGIGLN 204
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
L + + +G + I+M G G K I F+A ++ I D G++ + D
Sbjct: 205 LFQDSAGGSGKDTILMGGGSFGA------------KAIVFAAGAKVTI--DGGSMKRIGD 250
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ V+G+D A + A +++ S + S M+ L Y L + H+
Sbjct: 251 T-IVVDGADSATIYWSAWTTY-------RKSAGELQSAVMADLSQASRKGYGALRSDHVK 302
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ L RV + L +S SE+ T +A+R++ +T DP + L F F RY
Sbjct: 303 DYQSLAGRVELSLGKS--------TSEQKAKT--TADRLRGLRTAFDPEIATLYFYFARY 352
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLI+S RPGT ANLQG+WN DL+P W S +NINLEMNYW SL N+ E E +F+ +
Sbjct: 353 LLIASGRPGTLPANLQGLWNNDLNPMWGSKYTININLEMNYWPSLLTNMPELHESMFEHI 412
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ G A+ Y ASG V HH TDIW + WP G AW+ TH++EHY
Sbjct: 413 MKMHEKGRDVAKRMYNASGSVCHHNTDIWGDCAPQDNYAASTFWPSGLAWMATHIYEHYQ 472
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-GKLACVSY 538
+T D D L K YP L A F LD++ E HDG+L TNPS SPE + P+ + ++
Sbjct: 473 FTGDVDVLRKY-YPALRDAAVFFLDFMTE-HDGHLVTNPSVSPEISYRLPNTTQSVALTL 530
Query: 539 SSTMDMAIIREVFSAIISAAEVL-EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
T D +II E+ ++ + ++L + + D + +++ RL P + + G I E+
Sbjct: 531 GPTADNSIIWELVGMVLESQKILGDSDPDNIGQRLTGLRARLPPLRKDQYGGIAEF 586
>gi|326201460|ref|ZP_08191331.1| coagulation factor 5/8 type domain protein [Clostridium
papyrosolvens DSM 2782]
gi|325988060|gb|EGD48885.1| coagulation factor 5/8 type domain protein [Clostridium
papyrosolvens DSM 2782]
Length = 1026
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 197/586 (33%), Positives = 302/586 (51%), Gaps = 57/586 (9%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
F A+P+GNGR+GAMV+G P E + LNE T W+ PG+ A +L + + +GQ
Sbjct: 76 FYKALPLGNGRIGAMVYGNYPDERIDLNEATFWSSGPGNNNRAGAANSLKTAQDQLFAGQ 135
Query: 84 YAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
Y + K + G YQ +GD++L F S + Y R+LD+NT Y+
Sbjct: 136 YTNGSTTIAKSMIGGGEAKYQSIGDLKLSFGHSSV----SNYSRQLDMNTGVVSSDYTYN 191
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN--GNNQIIMEGRCP 200
++ RE F S PDQ++VTKI+ S GS+S +S L V+ GN+ ++M G
Sbjct: 192 GKKYHRESFVSYPDQIMVTKITCSSPGSISLTAGYESSLSGQYTVSTSGNDTLVMNGH-- 249
Query: 201 GKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
D GI ++ K+ + G++SA + ++ V +D V+L +
Sbjct: 250 ----------GDSDNGISYAVWFSTRSKLINTNGSVSA-NNNQISVSNADSVVIL----T 294
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
S +IN D ++ + + + SY L H+ DYQ LF RV + L S +
Sbjct: 295 SIRTNYINYKTCNGDEKGKATTDITNASAKSYDTLLNNHVADYQSLFKRVDVDLGGSGSE 354
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
++ P ++R+ F + DP L ++LFQ+GRYL+IS+SR +Q NLQGIW
Sbjct: 355 -----------NSKPMSQRISEFGSTNDPKLAKVLFQYGRYLMISASRD-SQPMNLQGIW 402
Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LAS 437
N+ +P W NIN EMNYW + NL+EC EP + L G++TA+ +Y +++
Sbjct: 403 NKFRNPAWGCKMTTNINYEMNYWPAFTTNLAECFEPFVEKAKALQAPGNETARAHYNISN 462
Query: 438 GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
GWV+HH TD+W +++ G+ W WP G W+ L++ YN+ D +L + YP+++G
Sbjct: 463 GWVLHHNTDLWNRTAPIDGE--WGFWPTGAGWVSNMLYDAYNFNQDTAYLNE-IYPVIKG 519
Query: 498 CASFLLDWL----IEGHDGYLETNPSTSPEHEFIAP----DGKLACVSYSSTMDMAIIRE 549
A FL + I G + Y P TSPE + P G+ A SY TMD I RE
Sbjct: 520 AADFLQTLMQSKSINGQN-YQVICPGTSPE---LTPPGNSGGQGAYNSYGVTMDNGISRE 575
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
+F A+I AA +L N D+ L+S + +++P I G + EW
Sbjct: 576 LFKAVIQAAGIL--NIDSSFRSTLQSKVSQIKPNTIGSWGQLQEWA 619
>gi|315606675|ref|ZP_07881686.1| alpha-L-fucosidase [Prevotella buccae ATCC 33574]
gi|315251685|gb|EFU31663.1| alpha-L-fucosidase [Prevotella buccae ATCC 33574]
Length = 807
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 204/595 (34%), Positives = 310/595 (52%), Gaps = 38/595 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PLK+ +N PA F +A+PIGNGRLGA+V+GG ++++ +N+ TLWTG P + DA +
Sbjct: 37 PLKLWYNRPATAFEEALPIGNGRLGALVYGGADTDSIYINDLTLWTGKPVELNEGGDAHR 96
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEE--TYR 125
+ +R + +G Y A + GH ++ YQ LL +L + + E+ +
Sbjct: 97 WIPVIRKELFAGNYKNADILQHLVQGHNSEYYQSLALLTLTDLNASPAEGRSGEDFGELK 156
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LD+++A Y G V + RE+F+S PD +I + + SG+++ ++L S++ +
Sbjct: 157 RSLDIDSAVCSDTYHRGGVTYIREYFASAPDSLIAIRFRANRSGAINCRLALTSVVPHQV 216
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
G Q+ M G G D + I F AIL++K D G ++A D L V
Sbjct: 217 KATGR-QLTMTGHAIG----------DPLQSIHFCAILKVKTDD--GQVAA-SDSSLTVN 262
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ + V +SF+G +P + +++ + N++Y++ RH+ DY++LF
Sbjct: 263 GASEVTVYFVNRTSFNGFDKHPVTAGAPYLAKATDDIWHTENITYNECRQRHVADYKRLF 322
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R LS + + T EE + S Q + +P L L Q+GRYLLIS S
Sbjct: 323 DRFKFTLSGAKPNYSRTT--EEQL-------MAYSDQGERNPYLEMLYMQYGRYLLISCS 373
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R ANLQG+W W +NINLE NYW + +L E P+ + ++
Sbjct: 374 RTPGVPANLQGLWAPQKYSPWRGNYTININLEENYWPAEMTDLGELVMPVDGLVRAMAAT 433
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G TA Y + GW H +DIWA ++ + W+ W MGGAWL LW+HY++T
Sbjct: 434 GRHTAAHYYGIDEGWCAGHNSDIWAMTNPVGTGKESPKWSNWNMGGAWLVQTLWDHYDFT 493
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D +L AYPL++G A F+L WL+E G L T P TSPE E+I G C Y
Sbjct: 494 RDTHYLRNTAYPLMKGAADFMLRWLVESPVKRGELITAPCTSPEAEYINDKGYQGCTFYG 553
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
T D+AI+RE+F+ + AAE+L N DA + L+ SL L P KI + G++ EW
Sbjct: 554 GTSDLAIVRELFTNTLRAAEIL--NIDAGYRQQLRSSLDHLAPYKIGKRGNLQEW 606
>gi|325680593|ref|ZP_08160136.1| hypothetical protein CUS_5001 [Ruminococcus albus 8]
gi|324107730|gb|EGC02003.1| hypothetical protein CUS_5001 [Ruminococcus albus 8]
Length = 759
Score = 321 bits (823), Expect = 6e-85, Method: Compositional matrix adjust.
Identities = 206/587 (35%), Positives = 307/587 (52%), Gaps = 49/587 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + A+P+GNGR+GAMV+ E ++LNED++W+G + N A L VR
Sbjct: 9 YKTPADDWNKALPLGNGRIGAMVFSQPLEERIQLNEDSVWSGGFRERNNKSALPNLEKVR 68
Query: 77 SLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYR-RELDLNT 132
L+ + EA F G P + Y LGD+ + H K +E ++ R LDLNT
Sbjct: 69 KLLFEEKINEAEKIIYDAFCGTPVNQRHYMPLGDMNV----IHYKESECDFKSRSLDLNT 124
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---LLDNHSYVNG 189
A +Y++ V++TRE F S PDQV+V I+ SE ++S V +D D++S V+
Sbjct: 125 AVCTTEYAINGVDYTREVFISQPDQVLVMHITASEKKAISVRVRIDGRDDYFDDNSPVHD 184
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N+ + G + ++D GI F+A IK+ G + + E D
Sbjct: 185 NDILFYGG-----------SGSED--GINFAAY--IKVLHKGGKVYPY-GSFITCEDCDE 228
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+LL A +S+ +D +++ ++ +Y+ L H+ DY+ + R +
Sbjct: 229 VTILLGAQTSY---------RCEDYKGQAVFDVERAEEKTYAQLKADHIADYKSYYDRAN 279
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
I L D S + T+P+ +R+ + + D L+E+ FGRYLLI+ SR
Sbjct: 280 ISLC--------DNSSGNS--TLPTDKRLALVKEGNPDNKLIEMYHNFGRYLLIAGSREK 329
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T NLQGIWN+D+ P W +NIN EMNYW + CNLSE PL D + L NG K
Sbjct: 330 TLPTNLQGIWNKDMWPAWGCKFTININTEMNYWCAENCNLSELHMPLIDHIEKLRPNGRK 389
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G+V HH TDIW ++ + WPMG AWLC H+WEHY Y DR+FL
Sbjct: 390 TARNMYGCRGFVCHHNTDIWGDTAPQDLWIPGTQWPMGAAWLCLHIWEHYLYVQDREFLS 449
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
++ Y L+ A F LD+LIE G L T PS SPE+ ++ G + +MD II
Sbjct: 450 EK-YDTLKEAAEFFLDFLIEDKKGRLVTCPSVSPENTYLTASGSKGSICIGPSMDSQIIY 508
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
E+F+A+ A+++LE + +KVL++ RL +I + G IMEW +
Sbjct: 509 ELFTAVAEASKILE-TDGGFRKKVLEARDRLPAPEIGKYGQIMEWAE 554
>gi|333381846|ref|ZP_08473525.1| hypothetical protein HMPREF9455_01691 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829775|gb|EGK02421.1| hypothetical protein HMPREF9455_01691 [Dysgonomonas gadei ATCC
BAA-286]
Length = 808
Score = 320 bits (820), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 209/594 (35%), Positives = 298/594 (50%), Gaps = 57/594 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PAK + +A+P+GN RLG MV+G E L+LNE+T+W G P NP A AL
Sbjct: 24 LKLWYNTPAKIWEEALPLGNSRLGVMVYGIPEKEELQLNEETIWGGGPYRNDNPKALGAL 83
Query: 73 SDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
+ R L+ G+ EA + F G P +Q G + L F H Y + Y RE
Sbjct: 84 PEARELIFKGKSREADQLINRTFFTKTHGMP---FQTAGSVILNFP-GHQNY--QDYSRE 137
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL+ A A +Y+V V++TRE FSS D VI+ +I+ G+L+F + H+
Sbjct: 138 LDLDKALAITRYTVNGVKYTREVFSSFADDVIIMRITAGRKGTLNFETEYTNN-SQHTIS 196
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
+N +I+EG+ D +GI E KI T+ D K++V GS
Sbjct: 197 KKDNILILEGK------------GSDHEGI------EGKIRYQIHTLIRNHDGKIEVTGS 238
Query: 248 DWAVLLLVASS---SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
++ ++ S F+N + DP ++ AL Y H D Y K
Sbjct: 239 KISISGATVATIYISIGTNFLNYKSVEGDPAKKASDALAKALKTDYRSALKNHSDIYGKQ 298
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F R + L P+ + T +R+ FQ + DP+LV LL QFGRYLLI S
Sbjct: 299 FKRFKLDLGNVPEAMKLTTT-----------QRIIDFQKNHDPALVTLLTQFGRYLLICS 347
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+ G Q ANLQGIW + P WDS +NIN EMNYW + NLSE P+ + LS
Sbjct: 348 SQLGGQPANLQGIWCNSMHPAWDSKYTININAEMNYWPAEVTNLSETHLPMIQMVKDLSE 407
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +TA+ Y A GWV HH TDIW +S +WP GGAWL HLWEHY +T D+
Sbjct: 408 SGQQTAKTMYGARGWVAHHNTDIWRVTSPVDFAAA-GMWPTGGAWLVQHLWEHYLFTGDK 466
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
+L YP ++G A + L L+E G++ PS SPEH +S TMD
Sbjct: 467 KYLAD-VYPAMKGAADYFLSSLVEHPQYGWMVVCPSVSPEH---------GPMSAGCTMD 516
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
++ +V + A +L +NE+ ++L + +L P I + + EW++ +
Sbjct: 517 NQLVFDVLTRTAQANNILGENEE-YRNQLLAMVSKLPPMHIGKYSQLQEWLEDK 569
>gi|329962425|ref|ZP_08300425.1| hypothetical protein HMPREF9446_02012 [Bacteroides fluxus YIT
12057]
gi|328529981|gb|EGF56869.1| hypothetical protein HMPREF9446_02012 [Bacteroides fluxus YIT
12057]
Length = 827
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 201/604 (33%), Positives = 314/604 (51%), Gaps = 54/604 (8%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
++ N LK+ ++ PA + +A+P+GNGRLGAMV+G +E +LNE+T+W G P + T
Sbjct: 20 QAQQQENNLKLWYDKPATQWVEALPLGNGRLGAMVFGDPANEQFQLNEETVWGGSPYNNT 79
Query: 65 NPDAPKALSDVRSLVDSGQYAEATA------ASVKLFGHPADVYQLLGDIELEFDDSHLK 118
NP A AL +R L+ G+ AEA A S G P YQ +G + L+F+ +
Sbjct: 80 NPKAKDALPRIRQLIFEGRNAEAQALCGPGICSQSANGMP---YQTVGSLHLDFEGTS-- 134
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y RELDL A +++ G + +TRE ++S P+Q++V +++ S+ S+SF
Sbjct: 135 -GYTNYYRELDLEKAVTTTRFTAGGITYTREAYTSFPEQLLVIRLTASQKKSISFTAR-- 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ----FSAILEIKISDDRGTI 234
Y + + P K + AND +GI+ F+A+ +I + G++
Sbjct: 192 -------YTTPYKKNVERSISPDKELQLDGKANDH-EGIEGKVRFTAL--TRIENSGGSL 241
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL-QSIRNLSYSDL 293
L D L+V+ ++ +V L V S F+N D D + + + Q+ +N + L
Sbjct: 242 EVLSDSTLQVKNAN-SVTLYV---SIGTNFVNYKDVSGDALATARKYMKQAGKNYTKGKL 297
Query: 294 YTRHLDDYQKLFHRVSIQL-SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
H++ Y+K F RVS+ L S + D TD RVK F DP + L
Sbjct: 298 --AHINAYRKYFDRVSLNLGSNAQADKPTDV-------------RVKEFSGSFDPQMAAL 342
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
FQFGRYLLI SS+PG Q ANLQGIWN L WD +IN+EMNYW + +L E
Sbjct: 343 YFQFGRYLLICSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMH 402
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
EP + +++ G ++A + Y GW +HH TDIW + A G + +WP AW C
Sbjct: 403 EPFLQLVKEVALTGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-GYGIWPTCNAWFCQ 460
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLW+ Y ++ D+ +L + YPL+ G F LD+L+ E + +L PS SPE+ +
Sbjct: 461 HLWDRYLFSGDKAYLAE-IYPLMRGACEFYLDFLVREPKNNWLVVAPSYSPENRPVVNGK 519
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ V +TMD ++ ++F I AA+++ +N A + + L P ++ G +
Sbjct: 520 RDFVVVAGTTMDNQMVYDLFYNTIQAAKLMNEN-IAFTDSLQAVSDHLAPMQVGRWGQLQ 578
Query: 592 EWVQ 595
EW++
Sbjct: 579 EWME 582
>gi|220928668|ref|YP_002505577.1| family 6 carbohydrate binding protein [Clostridium cellulolyticum
H10]
gi|219998996|gb|ACL75597.1| Carbohydrate binding family 6 [Clostridium cellulolyticum H10]
Length = 1164
Score = 320 bits (819), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 199/586 (33%), Positives = 300/586 (51%), Gaps = 57/586 (9%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
F A+P+GNGR+GAMV+G P E + LNE T W+ PG+ A L + + +GQ
Sbjct: 76 FYKALPLGNGRIGAMVYGNYPDERIDLNEATFWSSGPGNNNRAGAANFLKTAQDQLFAGQ 135
Query: 84 YAEATAA-SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
Y +A + + G YQ +GD++L F S + Y R+LD+NT Y+
Sbjct: 136 YKTGSATIANNMIGGGEAKYQSIGDLKLSFGHSSV----SNYSRQLDMNTGVVSSDYTYN 191
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN--GNNQIIMEGRCP 200
++ RE F S PDQV+VTKI+ S GS+S +S L V+ GN+ ++M G
Sbjct: 192 GKKYHRESFVSYPDQVMVTKITCSSPGSISLTAGYESSLTGQYTVSTSGNDTLVMNGH-- 249
Query: 201 GKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
D GI ++ KI + G++SA + ++ V +D V+L +
Sbjct: 250 ----------GDSDNGISYAVWFSTRSKIINSNGSVSA-NNNQISVSNADSVVIL----T 294
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
S F+N D ++ + + + SY LY H+ DYQ LF RV + L S +
Sbjct: 295 SIRTNFVNYKTCNGDEKGKATTDIANASAKSYDTLYNNHVTDYQNLFKRVDVDLGGSGSE 354
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
+ P +R+ F T DP L ++LFQ+GRYL+IS+SR +Q NLQGIW
Sbjct: 355 -----------NGKPMGQRISEFGTTNDPKLAKVLFQYGRYLMISASRD-SQPMNLQGIW 402
Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LAS 437
N+ +P W NIN EMNYW + NL+EC EP L G++TA+V+Y +++
Sbjct: 403 NKFRNPAWGCKMTTNINYEMNYWPAFTTNLAECFEPFVKKAKELQAPGNETARVHYNISN 462
Query: 438 GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
GWV+HH TD+W +++ G W WP G W+ L++ Y++ D +L + YP+++G
Sbjct: 463 GWVLHHNTDLWNRTAPIDGD--WGFWPTGAGWVSNMLFDAYSFNQDTVYLNE-IYPVIKG 519
Query: 498 CASFLLDWL----IEGHDGYLETNPSTSPEHEFIAP----DGKLACVSYSSTMDMAIIRE 549
A FL + I G + Y PSTSPE + P G+ A SY TMD I RE
Sbjct: 520 AADFLQTLMQSKSINGQN-YQVICPSTSPE---LTPPGTSGGQGAYNSYGVTMDNGISRE 575
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWV 594
+F +I A+++L N D+ L S + +++P + G + EW
Sbjct: 576 LFKDVIQASKIL--NIDSSFRSTLASKVSQIKPNTVGSWGQLQEWA 619
>gi|262405238|ref|ZP_06081788.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294646990|ref|ZP_06724607.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|345508052|ref|ZP_08787692.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|229444703|gb|EEO50494.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|262356113|gb|EEZ05203.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292637661|gb|EFF56062.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
Length = 811
Score = 319 bits (817), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 204/591 (34%), Positives = 307/591 (51%), Gaps = 57/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK++++A+PIGN RLGAMV+GG E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAKNWSEALPIGNSRLGAMVYGGTEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATA---ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
VR L+ G+ EA A+ H Y LG++ LEF K A++ YR +L+
Sbjct: 82 PIVRKLIFEGRNKEAQRLIDANFLTRQHGMS-YLTLGNLYLEFPGH--KDADDFYR-DLN 137
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L AT +Y V + +TR F+S D VI+ I S+ +L+FNVS + L N V
Sbjct: 138 LENATTTTRYQVNGINYTRTTFASFTDNVIIMHIKASQPNALNFNVSYNCPLKNEVNVQN 197
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ II C GK + +G++ + E ++ I L++ G
Sbjct: 198 DKLIIT---CQGK----------EQEGMKAALRAECQVQVKTDGIIHPAGNILQINGGTE 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N + D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQNVSADESRRTTDYLEEAILIPYEKALKEHIAFYKKQFDRVQ 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L S + + R+++F D ++ LLFQ+GRYLLISSS+PG
Sbjct: 301 LHLPSS------------EASQIETPRRIENFGQGNDMAMAALLFQYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCWGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPTYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|315499511|ref|YP_004088314.1| alpha-l-fucosidase [Asticcacaulis excentricus CB 48]
gi|315417523|gb|ADU14163.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
Length = 789
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 210/603 (34%), Positives = 315/603 (52%), Gaps = 52/603 (8%)
Query: 2 MNAESTSTTNP---LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
+ A+S P L + + PA + A+P+GNGRLG MV+GGV E ++LNEDT + G
Sbjct: 24 VKAQSAPPEQPSPDLSLWYERPADEWVKALPVGNGRLGGMVFGGVAFERIQLNEDTFFAG 83
Query: 59 VPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFD-- 113
P TNP + L V+SL+ G+YAEA A+ L PA YQ +GD+ L F
Sbjct: 84 SPYTPTNPRSRDGLPQVQSLIFEGKYAEAERLANETLISQPAKQMAYQPVGDLILLFPGL 143
Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
D+ KY R LDL+ A +++ G+ RE F S DQV+V ++S + +++
Sbjct: 144 DNTSKYV-----RRLDLSEGVAVTEFNAGSNRHRREVFVSAVDQVMVVRLSSEKGKAITV 198
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI--KISDDR 231
++SL + + +I++G P + +GI+ E+ K+
Sbjct: 199 DLSLSTPQKAEIDTIDGDTLIIKGVSPTQ------------QGIEGKLPFELRAKVIAPT 246
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
GT+++ E + + G+ AV+L+ A++ + + D DP+ + + Y+
Sbjct: 247 GTLTSREGG-VYISGAQDAVVLISAATGY----VRYDDISGDPSVLNAGRIAIAAAKGYA 301
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE 351
L HL DY+ LF RVS+ L P +P+ +R+ + +DP L
Sbjct: 302 ALKADHLKDYKALFDRVSLSLGEGPNA------------RLPTDQRIARYGEGKDPGLAA 349
Query: 352 LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
L Q+GRYLL+SSSR Q ANLQGIWN+ L+P+W S +NIN +MNYW + CNL+E
Sbjct: 350 LYLQYGRYLLVSSSRGSRQPANLQGIWNDKLNPSWQSKWTLNINTQMNYWPAEMCNLTET 409
Query: 412 QEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
+PL + L+ G+K A+ Y A GWV + TD+W +S G VWALWPMGGAWL
Sbjct: 410 IDPLVCLVEDLAETGAKLAKDMYGAPGWVAFNNTDVWRVASPPDG-AVWALWPMGGAWLL 468
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPD 530
+LWE + Y D +L +R YPL++G + F L++ Y+ TNPS SPE+ P
Sbjct: 469 QNLWEPWLYNGDEAYL-RRIYPLMKGASEFYQATLLKDPRSDYMVTNPSNSPENRH--PF 525
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
G C MD ++R++F+ AA+VL K + A L +L P KI + G +
Sbjct: 526 GSSVCA--GPAMDNQLLRDLFAHTAEAAKVL-KTDAAFARACLAMRSKLPPEKIGKAGQL 582
Query: 591 MEW 593
EW
Sbjct: 583 QEW 585
>gi|325299782|ref|YP_004259699.1| alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
gi|324319335|gb|ADY37226.1| Alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
Length = 826
Score = 318 bits (816), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 203/607 (33%), Positives = 318/607 (52%), Gaps = 53/607 (8%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ N KI ++ PA ++ +A+P+GNGR+ AMV+G E L+LNE+T+ G P
Sbjct: 15 VCNVTGLCAQESYKIWYDKPAAYWEEALPVGNGRIAAMVFGNARMERLQLNEETVSAGSP 74
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEAT----AASVKLFGHPADVYQLLGDIELEFDDSH 116
NP+A AL ++R L+ G+ EA A + G+ YQ +G++ + + + H
Sbjct: 75 YQNYNPEAKAALPEIRRLIFEGKNEEAQLLAGKAIISQVGNEMP-YQTVGNLNIRYKN-H 132
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
++ Y R+LD++ A A +Y VG+ E+T E F+S DQ+IV I S++G++ +V
Sbjct: 133 ENVSD--YYRDLDISRAVATTRYRVGSTEYTEETFASFTDQLIVKHIKASKAGAIDCDVF 190
Query: 177 LDSLLDN-HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D+ + G + +EG G + P + + A L++K+ + S
Sbjct: 191 FDTPMKRPQRSAIGKKGLRLEGMADGTKFFPGK--------VHYCADLQVKLKGGKAETS 242
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
D L V+G+ L + +++F +N D DP + L++ Y +
Sbjct: 243 --NDTLLSVKGATELTLYISMATNF----VNYKDVSADPYVRNRVYLKNAGK-EYEKAKS 295
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ Y++ F RV++ + +P+ +++ +D R+K F + DP L+ L FQ
Sbjct: 296 AHIAAYREQFDRVTLDMGTTPQ-------ADKPMDV-----RIKEFASSYDPHLIALYFQ 343
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSS+PG Q ANLQG WN P W+ NIN EMNYW + NL E EPL
Sbjct: 344 YGRYLLISSSQPGCQPANLQGKWNAKTKPAWNCNYTTNINTEMNYWPAEVTNLPELHEPL 403
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL---WPMGGAWLCT 472
+ LS NG + A Y GWV+HH TD+W + G V +A WP+ AWLC
Sbjct: 404 IRMIRELSENGKEAASKMYGCRGWVLHHNTDLWRMT----GAVDYAYCGTWPVCNAWLCQ 459
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPD- 530
HLW+ Y Y+ D+ +L K YP+++ + F +D+L+ + + GYL PS SPE+ AP
Sbjct: 460 HLWDRYLYSGDKQYL-KEVYPIMKSASQFFVDFLVRDPNTGYLVVTPSNSPEN---APRW 515
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDG 588
K A + TMD ++ ++FS AA VL NED L L+S+ R L P ++ + G
Sbjct: 516 IKKKANLFAGITMDNQLVFDLFSNTCRAASVL--NEDTLFCDTLRSMRRQLPPMQVGQYG 573
Query: 589 SIMEWVQ 595
+ EW +
Sbjct: 574 QLQEWFE 580
>gi|281422553|ref|ZP_06253552.1| putative large secreted protein [Prevotella copri DSM 18205]
gi|281403377|gb|EFB34057.1| putative large secreted protein [Prevotella copri DSM 18205]
Length = 807
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 196/600 (32%), Positives = 316/600 (52%), Gaps = 63/600 (10%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M A+++ T N + +N PA+ + +A+PIGN LG MV+GG E ++LNE+T W+G P
Sbjct: 21 MMAKTSCTDNSTLLWYNAPAQQWLEALPIGNSHLGGMVYGGTTDENIQLNEETFWSGGPH 80
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE 121
+ + + + L VR L+ +G+ EA A + F + L L + AE
Sbjct: 81 NNNSKKSLENLPKVRELIFNGREEEAAALINQTFIPGPHGMRFLPMANLHITMKNQGKAE 140
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+ + R LDL A A + + V +TR F+S D VIV I S G+L+ +V+LDS
Sbjct: 141 Q-FVRNLDLKRAIATTSFVMDGVRYTRTTFASLADGVIVCHIKASRKGALNIDVTLDS-- 197
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK- 240
P + P G+ +L++K D G +AL +
Sbjct: 198 -----------------------PFEHQTQKMPSGV----MLKVKGQDQEGIKAALTAEC 230
Query: 241 --KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
++ +G++ +++ A++ F+N D + + + ++ +SY+ L RH+
Sbjct: 231 VADVRKDGTEATIIVSAATN-----FVNYHDVSGNAAQRNADYINKVKLMSYAQLEKRHV 285
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+ YQK F S+ L P DI ++P+ +R++ F +D ++V L++ +GR
Sbjct: 286 EAYQKQFATSSLIL---PTDINA---------SLPTNQRLEKFAGSKDMAMVALMYNYGR 333
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSS+PG Q ANLQG+WN+ + WDS +NIN EMNYW + NL EPL+
Sbjct: 334 YLLISSSQPGGQAANLQGVWNDSKNAPWDSKYTININTEMNYWPAEVTNLGNTTEPLYSL 393
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ LS+ G++TA+ Y GW+ HH TDIW + G W ++P GGAWL THLW+HY
Sbjct: 394 IKDLSVTGAQTAREMYGCRGWMAHHNTDIWRIAGPVDG-AQWGMFPNGGAWLTTHLWQHY 452
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
YT D+ FL K+ YP+++G A F LD++ + G + + PS SPE P GK V
Sbjct: 453 LYTGDKAFL-KQWYPVIKGAAEFYLDYMQKLPGTEWKVSV-PSVSPEQ---GPKGKRTAV 507
Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ TMD I + ++ + A+E+L ++ E +++++ +P P +I + G + EW+
Sbjct: 508 TAGCTMDNQIAFDALTSAVKASEILGVDEAERKDMQQLVSQIP---PMQIGKYGQLQEWL 564
>gi|225011898|ref|ZP_03702336.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
gi|225004401|gb|EEG42373.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
Length = 792
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 206/590 (34%), Positives = 309/590 (52%), Gaps = 44/590 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDAP-KALS 73
+ A + +A+P+GNGRLG MV+G E ++LN+D+LW P D + NP+ + L
Sbjct: 40 YEQAASEWEEALPLGNGRLGVMVFGNPTKEHIQLNDDSLW---PKDIEWGNPEGTFEDLK 96
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R+L+ G + ++ F V +Q LGD+ + D + Y+R L+LN
Sbjct: 97 QIRNLLIDGDIEKTDHLLIEKFSRKTVVRSHQTLGDLHIRLDHDSIS----DYKRSLNLN 152
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSE----SGSLSFNVSLDSLLDNHSYV 187
ATA V Y F S+P Q IV I +GS+ + +D S +
Sbjct: 153 KATAYVNYKTEGYPVKESVFVSHPHQAIVVIIESEHPKGINGSIQLSRPMDEGFPTVSVL 212
Query: 188 NGNN-QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ NN +IIM G + + +G+ F IL K S + G+I++ E+K L+++G
Sbjct: 213 SRNNSEIIMTGEVTQRGGKFDSKTLPILEGVSFETIL--KTSHEGGSIASNENK-LELKG 269
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
AVL +V++SSF ++ TS++ I S SD+ +H+ D+Q +
Sbjct: 270 VRKAVLYIVSNSSF---------YHENYTSQNQKNFAVIEKTSLSDIEEQHIRDHQNYYE 320
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSS 365
R+ +I T S+ +P+ +R+++ + + D L ELLF FGRYLLI+SS
Sbjct: 321 RIDF-------NIETKNISQ----LIPTDKRIEAVKKGNVDLELQELLFHFGRYLLIASS 369
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R GT ANLQG+WN+ +S W++ H+NINL+MNYW + L E PLFD++ L IN
Sbjct: 370 REGTLPANLQGLWNQHISAPWNADYHLNINLQMNYWLANVTQLDELNNPLFDYVDRLLIN 429
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G KTAQ N+ A G + H TDIWA + W G W+ H W H+ YT D +
Sbjct: 430 GKKTAQENFGARGSFLPHATDIWAPTWLRAPTAYWGASFGAGGWMVQHYWNHFEYTQDYN 489
Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL RA+P +E A F DWLIE DG L + PSTSPE+ +I G S MD
Sbjct: 490 FLRNRAFPAIEEVAKFYSDWLIEDPRDGSLISAPSTSPENRYINDQGVAVSSCLGSAMDQ 549
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI-AEDGSIMEW 593
+I+EVF+ + A +L + + ++K+ K L +LRP + DG I+EW
Sbjct: 550 QVIKEVFTNYLKAVRLLNIDNE-WIQKIEKQLKQLRPGFVLGSDGRILEW 598
>gi|333377780|ref|ZP_08469513.1| hypothetical protein HMPREF9456_01108 [Dysgonomonas mossii DSM
22836]
gi|332883800|gb|EGK04080.1| hypothetical protein HMPREF9456_01108 [Dysgonomonas mossii DSM
22836]
Length = 788
Score = 318 bits (815), Expect = 5e-84, Method: Compositional matrix adjust.
Identities = 194/595 (32%), Positives = 323/595 (54%), Gaps = 49/595 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N + F+ P+ + ++IP+GNGR+G M WGGV E + LNE +LW+G D NP+A K
Sbjct: 25 NEWQYYFDKPSSIWEESIPLGNGRIGMMPWGGVERERVVLNEISLWSGNKQDADNPEAYK 84
Query: 71 ALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEFDDSHLKYAEE 122
L ++R L+ + EA K F G +Q+ ++ ++F A +
Sbjct: 85 YLGEIRRLLFEKKNKEAQELMYKTFTCKGKGSAGLEYGKFQIFANLYVDFLYPDKSEATQ 144
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y+R LD+N A + V +S +VE+ RE+F+S + + + K + S+S +LS +SL +
Sbjct: 145 -YKRVLDMNNALSTVSFSKNDVEYKREYFTSFSNDIGLVKYTASKSEALSLKISLQRDEN 203
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+Y +GN I + A ++ G+++ + +K+ + G +SA DK +
Sbjct: 204 FKTYASGNTLYIF----------GQLEAGENHSGMKYLGM--VKVINKGGKLSA-TDKVI 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
++ ++ L + +++++G + S L + ++Y L +H+ YQ
Sbjct: 251 DIKNANEVTLYVSLATNYNGT----------NHEKVASDLLNNAGVNYEKLKKKHIAKYQ 300
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLL 361
LF+RV + L ++ + ID +R+++F TD+ D +L L Q+GRYLL
Sbjct: 301 ALFNRVDLTLEKNKNSSLA-------ID-----KRLEAFATDKTDYNLAALYMQYGRYLL 348
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISS+R G NLQG+W ++ W++ H+NINL+MN W + NLSE +P +F+
Sbjct: 349 ISSTREGGLPPNLQGLWAPQINTPWNADYHLNINLQMNLWGAEMFNLSELHKPTIEFVKS 408
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L G KTA++ Y + GWV+H +++W +S W GAW+C HLWEHY YT
Sbjct: 409 LVEPGEKTAKIYYNSRGWVVHILSNVWGFTSPGE-HPSWGATNTAGAWMCQHLWEHYLYT 467
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
D+++L K YP ++ A F D LIE ++GYL T P+TSPE+ +I P G + + S
Sbjct: 468 QDKEYL-KSVYPTMKSAALFFEDMLIEDPNNGYLVTAPTTSPENAYITPSGDVVSICAGS 526
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD IIRE+F+ + +AA++LE + + ++ + RL PT I + G +MEW++
Sbjct: 527 AMDNQIIRELFTNVENAAKILEVDNE-WIKDISAKKERLAPTSIGKYGQVMEWLE 580
>gi|312621676|ref|YP_004023289.1| alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
gi|312202143|gb|ADQ45470.1| Alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
Length = 786
Score = 318 bits (814), Expect = 6e-84, Method: Compositional matrix adjust.
Identities = 209/616 (33%), Positives = 309/616 (50%), Gaps = 71/616 (11%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
+P ++F PA + +A+P+GNGRLGAMV+G E ++LN+D+LW+G D NP +
Sbjct: 3 HPYHLSFYKPASTWYEALPLGNGRLGAMVYGHTAVERIQLNDDSLWSGTFIDRNNPSLKE 62
Query: 71 ALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAE------ 121
L ++R LV G A ++ + G PA + Y LG++++ + HL +A
Sbjct: 63 KLPEIRRLVLVGDLYHAEELIMQYMVGTPASMRHYTTLGELDIALN-QHLPFATGWIPNS 121
Query: 122 ---ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
E Y +LDL + + V + RE F S P QV+ + + G+++ ++ LD
Sbjct: 122 NGCEDYYCDLDLMNGILSITHRQAGVRYCREMFVSYPAQVMCIRFVSEKPGTINMDIMLD 181
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIP---PKANANDDPKGIQFSAILEIKISDDRGTIS 235
+ + ++ + + R PG+R+ P N + F ++ + RG S
Sbjct: 182 RTVIS-------DETVPDERRPGQRVRRGWPTVN-------VDFIRTMDERTILMRGNES 227
Query: 236 ALE---------DKKLKVEGSDW------AVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
+E D KL+ S V+L +ASS+ ++ +DP SE
Sbjct: 228 GVEFATAVRVVCDGKLQNPVSQLLARNCGEVILYLASST--------TNRSEDPVSEVFR 279
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
L + Y L H++D+ L R + L SP P+ ER+ +
Sbjct: 280 LLDAAEKKGYVALREEHINDFSNLMWRCVLDLGPSPDK--------------PTDERIAA 325
Query: 341 FQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
+ D DP+L L FQ GRYL++S SR G+ NLQGIWN D P WDS +NINL+MN
Sbjct: 326 LRAGDNDPALAALYFQLGRYLIVSGSREGSAPLNLQGIWNADFMPIWDSKYTLNINLQMN 385
Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV 459
YW CNLSE PL + L + G +TA+V Y G V HH TD + + +
Sbjct: 386 YWPVEICNLSELHMPLMELLGKMHEKGRETARVMYGMRGMVCHHNTDFYGDCAPQDRYMA 445
Query: 460 WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPS 519
W +GGAWL H+WEHY +T D +FL + YP+L A F D+LIE DG L T PS
Sbjct: 446 ATPWVIGGAWLGLHVWEHYLFTKDLNFL-REMYPILRDIAMFYEDFLIE-VDGKLVTCPS 503
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPE+ +I PDG + S MD I+RE+F+A I AA +L +++ L EK L+ RL
Sbjct: 504 VSPENRYILPDGYDTPMCVSPAMDNQILRELFAACIEAANLLGVDQE-LTEKWLEISQRL 562
Query: 580 RPTKIAEDGSIMEWVQ 595
KI G ++EW Q
Sbjct: 563 PKDKIGSKGQLLEWDQ 578
>gi|29348564|ref|NP_812067.1| hypothetical protein BT_3155 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340469|gb|AAO78261.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 808
Score = 318 bits (814), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 201/598 (33%), Positives = 313/598 (52%), Gaps = 43/598 (7%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-P 66
+T N +K+ ++ PA + ++P+GNGRLG M++GG+ +ETL LNE T+W+G ++ P
Sbjct: 24 ATENKMKLWYDKPADEWMKSLPLGNGRLGVMIYGGIETETLALNESTMWSGEYDEHQQRP 83
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEET 123
+ L+ VR L +E + + H + +GD+++ F S+ +
Sbjct: 84 FGREKLNQVRKLFFENNLSEGNHVAGNMMAGSPHSVGTHLPIGDLKINF--SYPQGEISD 141
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YR ELDL+TA V Y VGN E+ R+ +SNPD V+ I S +++ + L LL
Sbjct: 142 YRHELDLHTAINTVSYKVGNTEYIRQCIASNPDDVVAMHIKASRPKAITMELEL-KLLRQ 200
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ V NQ+I G ++ G+ F + ++I GTI A E KKL
Sbjct: 201 ANVVASGNQLIYTGNAEFEK--------HGKGGVHFEGRIAVQIKG--GTIKA-EGKKLY 249
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+E + LL S F N + S + + ++ + L +H++DY
Sbjct: 250 IEKATEVTLL----SDVRTNFKNNTFSGYNYKIKCEKTIELASKKDFKTLKKKHIEDYSP 305
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLI 362
LF RV + K D +P+ ER + E DP L L FQ+ RYLLI
Sbjct: 306 LFSRVGLSFEHHAK-----------FDHLPNDERWARVKKGESDPGLDALFFQYARYLLI 354
Query: 363 SSSRPGTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
+SSRP + + LQG +N++L+ W + H++IN E NYW + NL+EC PLFD++
Sbjct: 355 ASSRPNSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLAECHLPLFDYI 414
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LSI+G+KTA+ Y GW H + W ++ G ++W L+P +WL +HLW Y+
Sbjct: 415 KDLSIHGAKTAKDLYGCKGWTAHTTANPWGYTAVS-GSILWGLFPTASSWLASHLWTQYD 473
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
YT D+DFL+ AYPLL+ A FLLD++ I+ + YL T PS SPE+ F G+ C S
Sbjct: 474 YTQDKDFLKNTAYPLLKSNAEFLLDYMVIDPRNNYLVTGPSISPENSF-RHQGQEFCASM 532
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
T D + E+FSA + + E+L N DA + + ++ +L P +I+ +G + EW +
Sbjct: 533 MPTCDRVLAYEIFSACLQSTEIL--NVDASFADSLRTAISKLPPFRISTNGGVQEWFE 588
>gi|380693852|ref|ZP_09858711.1| alpha-L-fucosidase [Bacteroides faecis MAJ27]
Length = 772
Score = 318 bits (814), Expect = 7e-84, Method: Compositional matrix adjust.
Identities = 199/566 (35%), Positives = 293/566 (51%), Gaps = 46/566 (8%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEAT-AASVKL-- 94
MV+G +E ++LNE+T+ G P N +A +AL +R L+ G YAEA A K+
Sbjct: 1 MVYGDPVNEEIQLNEETVSAGSPYKNYNSEAKEALPAIRKLIFDGNYAEAQLMAGEKILS 60
Query: 95 ---FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHF 151
FG P YQ +G + L F YRRELD++ A A Y V VE+ RE F
Sbjct: 61 KNGFGMP---YQTVGSLRLHFQGQE---NHTDYRRELDIDKALAITTYRVNGVEYKRETF 114
Query: 152 SSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
+S DQ+++ +++ S+ G L+F +L V+G N I M G G + A
Sbjct: 115 TSFTDQLVIVRLTASKPGMLTFTAALTCPQAVEVSVSGKNTIKMSGITKGDQFTEGA--- 171
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSK 271
I+F+A L++++ +G S +D L V +D AVL + +++F +N D
Sbjct: 172 -----IRFAADLKLEL---QGGKSIAQDSVLSVSNADSAVLYIAMATNF----VNYKDIS 219
Query: 272 KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKDIVTDTCSEENID 330
D + L++ +YS H+ YQK +HRVS+ L S D TD
Sbjct: 220 ADAVKRNQVYLRNAGK-NYSKALQEHIAAYQKYYHRVSLDLGYTSQADKPTDV------- 271
Query: 331 TVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAP 390
RVK F +DP L+ L FQ+GRYLLISSS+PG Q ANLQGIWN+ L+P W
Sbjct: 272 ------RVKEFAVSDDPQLISLYFQYGRYLLISSSQPGRQPANLQGIWNDKLNPVWKCRY 325
Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAK 450
N+N EMNYW + NLSE EP + L NG + A+ Y GWV+HH TD+W
Sbjct: 326 TTNVNAEMNYWPAEVTNLSEMHEPFLQMIRELYENGQEAAREMYGCRGWVLHHNTDLWRM 385
Query: 451 SSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EG 509
+ A K WP AWLC HLWE Y Y+ D+DFL YP+++ + F +D+L+ +
Sbjct: 386 NGA-VDKAYCGTWPTCNAWLCHHLWERYLYSGDKDFLAS-VYPIMKSASEFFVDFLVRDP 443
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
+ GY+ PS SPE+ GK A + TMD ++ ++F+ +AA +L ++
Sbjct: 444 NTGYMVVTPSNSPENAPRQWKGK-ANLFAGITMDNQLVFDLFTNTEAAAHILNGKDEQFC 502
Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ + +L P ++ + G + EW +
Sbjct: 503 DTIRSLKKQLPPMQVGQYGQLQEWFE 528
>gi|298386944|ref|ZP_06996498.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
gi|298260094|gb|EFI02964.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
Length = 809
Score = 317 bits (813), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 201/600 (33%), Positives = 322/600 (53%), Gaps = 43/600 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S +TT+ +K+ ++ PA + ++P+GNGRLGAMV+GGV +ET+ LNE T+W+G ++
Sbjct: 24 SEATTDNMKLWYDKPADEWMKSLPLGNGRLGAMVYGGVETETIGLNESTMWSGEYDEHQQ 83
Query: 66 -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFG--HPADVYQLLGDIELEFDDSHLKYAE 121
P + L ++R L G AE A + G H A + +GD++L F + ++
Sbjct: 84 RPLGREKLDEIRKLFFEGNLAEGNHIAGNTMAGSPHSAGTHLPIGDLKLNFTYPEGELSD 143
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y ELDL+TA V Y +G+ E+TR+ +SNPD VI I+ S +++ + L+ LL
Sbjct: 144 --YHHELDLSTAVNTVTYKIGDTEYTRQSIASNPDDVIAMYITASRPEAITMELELN-LL 200
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
N + NQ+I G ++ G+ F + ++I GTI A + KK
Sbjct: 201 RNAEVIASGNQLIYTGNAEFEK--------HGRGGVLFEGRIAVEIKG--GTIKA-DGKK 249
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L ++ + LL S + N + + D + +++ S+ L H++DY
Sbjct: 250 LLIDKATEVTLL----SDVRTNYKNTTFAGYDYKQKCKETIEAASKKSFKTLRNIHVEDY 305
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
LF RV++ + K + +P+ +R + E DP L L FQ+ RYL
Sbjct: 306 APLFSRVALSFGDNGK-----------LSHLPNDQRWARVKAGESDPGLDALFFQYARYL 354
Query: 361 LISSSRPGTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LI+SSRP + + LQG +N++L+ W + H++IN E NYW + NL EC PLFD
Sbjct: 355 LIASSRPNSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLPECHLPLFD 414
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
++ LS++GSK AQ Y GW H ++ W ++ G ++W L+P +WL +H+W
Sbjct: 415 YIKDLSVHGSKIAQDLYGCKGWTAHTTSNPWGYTAVS-GSILWGLFPTASSWLTSHVWTQ 473
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D+ FL++ AYPLL+ A FLLD++ I+ + YL T PS SPE+ F G+ C
Sbjct: 474 YEYTQDKKFLQETAYPLLKSNAEFLLDYMVIDPRNNYLVTGPSISPENSF-HYQGQEFCA 532
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S T D + E+FSA + + E+L N DA + + ++ +L P +I+ +G + EW +
Sbjct: 533 SMMPTCDRVLAYEIFSACLQSTEIL--NVDASFADSLRTAISQLPPFRISANGGVQEWFE 590
>gi|429860996|gb|ELA35710.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 776
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 209/599 (34%), Positives = 304/599 (50%), Gaps = 47/599 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
+ +S PL + + PA +++A+PIGNGRLGAMV G +E L+LNED++W G P D
Sbjct: 12 SGQSQQQPRPLLLHYESPASEWSEALPIGNGRLGAMVHGRTQTELLQLNEDSVWYGGPQD 71
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKY 119
T DA + L +R L+ ++AEA + F PA + Y+ LG +EF H+
Sbjct: 72 RTPKDALRHLPKLRQLIRDEEHAEAESLVREAFFATPASMRHYEPLGTCTIEF--GHVVE 129
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
YRR L L TA V+Y V + R+ +S PD V+ ++ SE+ F V L+
Sbjct: 130 DVTDYRRYLCLETAQTTVEYRCRGVSYRRDAIASFPDNVLAFRVVASEA--TRFVVRLNR 187
Query: 180 LLDNHSYVNGNNQII--MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
L + N I GR K P N+N + + L + D G++ A+
Sbjct: 188 LSEIEYETNEFLDSIDATNGRIVLKATPGGHNSN------RLAIALGVSCDDAEGSVEAI 241
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
+ + S +++ A ++F +DP + ++ + + +SDL RH
Sbjct: 242 GNAL--IVNSTSCTIVIGAQTTF---------RTEDPEAAAVDDVLKALSHQWSDLVERH 290
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
DY LF+R S+++S D C +P+ ER+K+ DP LV L +G
Sbjct: 291 QQDYAGLFNRTSLRMS-------PDACH------LPTDERIKN---SRDPGLVALYHNYG 334
Query: 358 RYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
RYLLIS SR + A LQGIWN +P W S +NINL+MNYW + PC+L EC P+
Sbjct: 335 RYLLISCSRNSKKALPATLQGIWNPSFAPPWGSKYTININLQMNYWPAGPCSLIECAIPV 394
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
L ++ G KTA+V Y GW H TDIWA + + +WP+GG W+C ++
Sbjct: 395 LGLLEKMAERGKKTARVMYGCEGWCARHNTDIWADTDPHDRWMPSTIWPLGGVWVCIDIF 454
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLA 534
E Y D + L KRA +LEG FLL++LI G YL TNPS SPE+ F++ G+
Sbjct: 455 EMLQYQYDEN-LHKRAAVVLEGAIMFLLEYLIPSACGRYLVTNPSLSPENTFLSVSGEPG 513
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ S +DM II F + + +L E+ L KV ++L RL P I DG I EW
Sbjct: 514 ILCEGSVIDMTIIHIAFEKFLWSTNIL-GGENPLRAKVEEALERLPPLVINSDGLIQEW 571
>gi|302413419|ref|XP_003004542.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
gi|261357118|gb|EEY19546.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
Length = 765
Score = 317 bits (812), Expect = 1e-83, Method: Compositional matrix adjust.
Identities = 207/597 (34%), Positives = 300/597 (50%), Gaps = 61/597 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + ++ PA +++A+P+GNGRLG MV+G +E L+LNED++W G P D T DA + L
Sbjct: 8 LTLHYDAPAASWSEALPVGNGRLGGMVYGRTSTELLQLNEDSVWYGGPQDRTPRDARRHL 67
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVY--QLLGDIELEFDDSHLKYAEETYRRELD 129
+R L+ ++A A A F PA + + LG+ LEF H YRR LD
Sbjct: 68 DTLRQLIRDEEHAAAEALVREAFFATPASMRHSEPLGNCTLEF--GHEAQDVTGYRRSLD 125
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLL 181
L TA A V+Y V + RE +S PD V+ + S SE ++ + L
Sbjct: 126 LATAQATVEYQCRGVSYRRETIASFPDNVVALRFSASEPTRFVVRLNRVSEIEWETNEFL 185
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALED 239
D+ NG +I++ GK N +P S +L I SDD G+I A+ +
Sbjct: 186 DSIQAANG--RIVLNATPGGK--------NSNP----LSLVLGISCDASDDGGSIEAIGN 231
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ S L++ A ++F DP + + + + S+ +L R
Sbjct: 232 ALVVKAFS--CTLVIAAHTAF---------RNADPEAAARQDVDNALKRSWHELVLRQRT 280
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DY LF R S+++ + D+ P+ ER+ + + DP LV L + +GRY
Sbjct: 281 DYASLFQRSSLRMWPAAHDL-------------PTNERI---EKNRDPGLVALYYNYGRY 324
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR + A LQGIWN +P W +NINL+MNYW + P NL EC P+
Sbjct: 325 LLISSSRDSDKALPATLQGIWNPSFAPPWGCKYTININLQMNYWLAAPGNLVECALPMLG 384
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +++ G+KTA++ Y GW HH TDIWA + + +WP+GG WLC + E
Sbjct: 385 LVERMAVRGAKTARIMYDCGGWCAHHNTDIWADTDPQDRWMPSTIWPLGGVWLCIDVLEM 444
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y DR L +RA LLEGC FLLD+LI +L TNPS SPE+ F++ G +
Sbjct: 445 LLYHYDRK-LHERAAVLLEGCIVFLLDFLIPSACRTFLVTNPSLSPENTFVSKSGDTGIL 503
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S +D I+R F + + +LEK + LV KV ++ RL I DG I EW
Sbjct: 504 CEGSAIDTTIVRIAFEKFLWSTAILEKG-NPLVPKVRDAMARLPDLTINNDGLIQEW 559
>gi|329957629|ref|ZP_08298104.1| hypothetical protein HMPREF9445_02986 [Bacteroides clarus YIT
12056]
gi|328522506|gb|EGF49615.1| hypothetical protein HMPREF9445_02986 [Bacteroides clarus YIT
12056]
Length = 827
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 194/604 (32%), Positives = 320/604 (52%), Gaps = 48/604 (7%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
++ + T+ N LK+ ++ PAK + +A+P+GNGR+GAMV+G E +LNE+T+W G P
Sbjct: 15 ISGKITAHDNSLKLWYDKPAKQWVEALPLGNGRIGAMVFGDPAHERFQLNEETVWGGSPH 74
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDS 115
+ TNP+A +AL +R L+ G+ EA S G P YQ +G + L+F+
Sbjct: 75 NNTNPNAKEALPRIRRLIFEGKNKEAQELCGPAICSQSANGMP---YQTVGTLHLDFEGI 131
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
+ + + R+LD+ A A +++ + + RE F+S PD++++ K++ S+ S+SF
Sbjct: 132 N---QYDDFYRDLDIEKAIATTRFTANGITYIREAFTSFPDRLLIIKLTASKKKSISFTA 188
Query: 176 SLDS-LLDNHSY-VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRG 232
+ +N + ++ ++ + G KAN ++ +G I+F+A+ +I ++ G
Sbjct: 189 HYTTPYTENTEFCISPRKELQLNG---------KANDHEGIEGKIRFTAL--TRIDNNGG 237
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
T+ D L+V+ +D +V L V S FIN D D + ++ +Y+
Sbjct: 238 TLKVTSDSTLQVKNAD-SVTLYV---SIGTNFINYKDVSGDALKAARQYMKQAGK-NYTK 292
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
H+ YQ+ F+RVS+ L S + I P+ RV+ F + DP + L
Sbjct: 293 RKEAHIAAYQQYFNRVSLDLG-----------SNDQIKK-PTDRRVREFSSVTDPQMAAL 340
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
FQFGRYLLI SS+PG Q ANLQGIWN L WD +IN+EMNYW + LSE
Sbjct: 341 YFQFGRYLLICSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAETTALSEMH 400
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
EP + ++I G ++A + Y GW +HH TDIW + A G + +WP AW C
Sbjct: 401 EPFLQLVKEVAIQGRESASM-YSCRGWTLHHNTDIWRTTGAVDG-AKYGVWPTCNAWFCQ 458
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLW+ Y ++ D+++L + YP++ G F LD+L+ E + +L PS SPE+
Sbjct: 459 HLWDRYLFSGDKNYLAE-VYPIMRGACEFYLDFLVREPKNNWLVVAPSYSPENSPSVNGK 517
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ + +TMD ++ ++F I AA ++ +N A + + L P ++ G +
Sbjct: 518 RGFVIVAGTTMDNQMVYDLFYNTIQAANLMNENT-AFTDSLQTVANHLAPMQVGRWGQLQ 576
Query: 592 EWVQ 595
EW++
Sbjct: 577 EWME 580
>gi|154503234|ref|ZP_02040294.1| hypothetical protein RUMGNA_01058 [Ruminococcus gnavus ATCC 29149]
gi|153796228|gb|EDN78648.1| hypothetical protein RUMGNA_01058 [Ruminococcus gnavus ATCC 29149]
Length = 784
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 198/614 (32%), Positives = 302/614 (49%), Gaps = 73/614 (11%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
I F A+ + A PIGNG LGAMV+G V E +++NED++W+G + NPDA + L
Sbjct: 20 IYFRKEAEEWNQAFPIGNGFLGAMVFGNVAKERIQVNEDSVWSGGFSNRVNPDAGRYLEK 79
Query: 75 VRSLVDSG--QYAEATAASVKLFGHP-ADVYQLLGDIELEF-----------DDSHLKYA 120
+R + G Q AE A P VYQ LGDI + F D+S L Y
Sbjct: 80 IRECLFEGNVQEAEKLAEQSMYATSPNMRVYQPLGDIWIRFMDQEAERKLARDESGLPYL 139
Query: 121 EET------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
+E+ Y+R L+L A +++Y VG ++ RE F+SNP +V + I ++
Sbjct: 140 KESAAEVEAYQRILNLEQAVGKIEYCVGRTKWNREFFASNPAKVAMYSICAESGEDINLE 199
Query: 175 VSLDSLLDNHS----------YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
+S + DN S N I +EG G+ +GI F+ +
Sbjct: 200 ISA-TRKDNRSGRGVSFCDRILAEENQYIWLEGSSGGR------------EGIGFA--MG 244
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
+++ G + ++ VE + ++ ++F +P L S
Sbjct: 245 VRVCSCGGRQYQM-GSRIIVEKARKVLICFTGRTTF---------RSAEPKQWCREHLAS 294
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QT 343
+ +Y++ H+ DYQ F+ + + E N+D + + ER+K +
Sbjct: 295 LSLDTYAERKREHIQDYQTYFNASRLTFRQ-----------EMNLDNLTTPERLKRIREG 343
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
D LV L + F RYLLISSSR G+ ANLQGIWNE+ P W S +NIN++MNYW +
Sbjct: 344 HHDIGLVNLYYDFARYLLISSSREGSLPANLQGIWNEEFEPMWGSKYTININIQMNYWMA 403
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
L PL + L + G + A Y G+ HH TDIW + +W
Sbjct: 404 EKTGLQALHLPLLEHLKRMHPRGKEVAASMYHVEGFCCHHNTDIWGDCAPQDYHTSSTIW 463
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
PMGGAWLC H++EHY YT D+ FLE+ +P+L+ F ++++++ DG T PS+SPE
Sbjct: 464 PMGGAWLCLHIYEHYQYTKDKGFLEE-YFPILKDSVQFFMNYMVQNSDGKWVTGPSSSPE 522
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRP 581
+ +I + C+ TMD+ I+RE+FS + E+LEK E LV+ +++LP+L
Sbjct: 523 NIYITAKNQYGCLCMGPTMDIEIVRELFSNYLKTVEILEKEEPLTGLVKDRIENLPKL-- 580
Query: 582 TKIAEDGSIMEWVQ 595
K+ + G I EW Q
Sbjct: 581 -KVGKYGQIQEWDQ 593
>gi|336432957|ref|ZP_08612787.1| hypothetical protein HMPREF0991_01906 [Lachnospiraceae bacterium
2_1_58FAA]
gi|336017627|gb|EGN47385.1| hypothetical protein HMPREF0991_01906 [Lachnospiraceae bacterium
2_1_58FAA]
Length = 768
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 198/614 (32%), Positives = 302/614 (49%), Gaps = 73/614 (11%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
I F A+ + A PIGNG LGAMV+G V E +++NED++W+G + NPDA + L
Sbjct: 4 IYFRKEAEEWNQAFPIGNGFLGAMVFGNVAKERIQVNEDSVWSGGFSNRVNPDAGRYLEK 63
Query: 75 VRSLVDSG--QYAEATAASVKLFGHP-ADVYQLLGDIELEF-----------DDSHLKYA 120
+R + G Q AE A P VYQ LGDI + F D+S L Y
Sbjct: 64 IRECLFEGNVQEAEKLAEQSMYATSPNMRVYQPLGDIWIRFMDQEAERKLARDESGLPYL 123
Query: 121 EET------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
+E+ Y+R L+L A +++Y VG ++ RE F+SNP +V + I ++
Sbjct: 124 KESAAEVEAYQRILNLEQAVGKIEYCVGRTKWNREFFASNPAKVAMYSICAESGEDINLE 183
Query: 175 VSLDSLLDNHS----------YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
+S + DN S N I +EG G+ +GI F+ +
Sbjct: 184 ISA-TRKDNRSGRGVSFCDRILAEENQYIWLEGSSGGR------------EGIGFA--MG 228
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
+++ G + ++ VE + ++ ++F +P L S
Sbjct: 229 VRVCSCGGRQYQM-GSRIIVEKARKVLICFTGRTTF---------RSAEPKQWCREHLAS 278
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QT 343
+ +Y++ H+ DYQ F+ + + E N+D + + ER+K +
Sbjct: 279 LSLDTYAERKREHIQDYQTYFNASRLTFRQ-----------EMNLDNLTTPERLKRIREG 327
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
D LV L + F RYLLISSSR G+ ANLQGIWNE+ P W S +NIN++MNYW +
Sbjct: 328 HHDIGLVNLYYDFARYLLISSSREGSLPANLQGIWNEEFEPMWGSKYTININIQMNYWMA 387
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
L PL + L + G + A Y G+ HH TDIW + +W
Sbjct: 388 EKTGLQALHLPLLEHLKRMHPRGKEVAASMYHVEGFCCHHNTDIWGDCAPQDYHTSSTIW 447
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
PMGGAWLC H++EHY YT D+ FLE+ +P+L+ F ++++++ DG T PS+SPE
Sbjct: 448 PMGGAWLCLHIYEHYQYTKDKGFLEE-YFPILKDSVQFFMNYMVQNSDGKWVTGPSSSPE 506
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRP 581
+ +I + C+ TMD+ I+RE+FS + E+LEK E LV+ +++LP+L
Sbjct: 507 NIYITAKNQYGCLCMGPTMDIEIVRELFSNYLKTVEILEKEEPLTGLVKDRIENLPKL-- 564
Query: 582 TKIAEDGSIMEWVQ 595
K+ + G I EW Q
Sbjct: 565 -KVGKYGQIQEWDQ 577
>gi|383124735|ref|ZP_09945397.1| hypothetical protein BSIG_1516 [Bacteroides sp. 1_1_6]
gi|251841110|gb|EES69191.1| hypothetical protein BSIG_1516 [Bacteroides sp. 1_1_6]
Length = 808
Score = 316 bits (810), Expect = 2e-83, Method: Compositional matrix adjust.
Identities = 200/598 (33%), Positives = 313/598 (52%), Gaps = 43/598 (7%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-P 66
+T N +K+ ++ PA + ++P+GNGRLG +++GG+ +ETL LNE T+W+G ++ P
Sbjct: 24 ATENKMKLWYDKPADEWMKSLPLGNGRLGVIIYGGIETETLALNESTMWSGEYDEHQQRP 83
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEET 123
+ L+ VR L +E + + H + +GD+++ F S+ +
Sbjct: 84 FGREKLNQVRKLFFENNLSEGNHVAGNMMAGSPHSVGTHLPIGDLKINF--SYPQGEISD 141
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YR ELDL+TA V Y VGN E+ R+ +SNPD V+ I S +++ + L LL
Sbjct: 142 YRHELDLHTAINTVSYKVGNTEYIRQCIASNPDDVVAMHIKASRPKAITMELEL-KLLRQ 200
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ V NQ+I G ++ G+ F + ++I GTI A E KKL
Sbjct: 201 ANVVASGNQLIYTGNAEFEK--------HGKGGVHFEGRIAVQIKG--GTIKA-EGKKLY 249
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+E + LL S F N + S + + ++ + L +H++DY
Sbjct: 250 IEKATEVTLL----SDVRTNFKNNTFSGYNYKIKCEKTIELASKKDFKTLKKKHIEDYSP 305
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLI 362
LF RV + K D +P+ ER + E DP L L FQ+ RYLLI
Sbjct: 306 LFSRVGLSFEHHAK-----------FDHLPNDERWARVKKGESDPGLDALFFQYARYLLI 354
Query: 363 SSSRPGTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
+SSRP + + LQG +N++L+ W + H++IN E NYW + NL+EC PLFD++
Sbjct: 355 ASSRPNSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLAECHLPLFDYI 414
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LSI+G+KTA+ Y GW H + W ++ G ++W L+P +WL +HLW Y+
Sbjct: 415 KDLSIHGAKTAKDLYGCKGWTAHTTANPWGYTAVS-GSILWGLFPTASSWLASHLWTQYD 473
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
YT D+DFL+ AYPLL+ A FLLD++ I+ + YL T PS SPE+ F G+ C S
Sbjct: 474 YTQDKDFLKNTAYPLLKSNAEFLLDYMVIDPRNNYLVTGPSISPENSF-RHQGQEFCASM 532
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
T D + E+FSA + + E+L N DA + + ++ +L P +I+ +G + EW +
Sbjct: 533 MPTCDRVLAYEIFSACLQSTEIL--NVDASFADSLRTAISQLPPFRISTNGGVQEWFE 588
>gi|393789783|ref|ZP_10377902.1| hypothetical protein HMPREF1068_04182 [Bacteroides nordii
CL02T12C05]
gi|392650186|gb|EIY43857.1| hypothetical protein HMPREF1068_04182 [Bacteroides nordii
CL02T12C05]
Length = 800
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 191/587 (32%), Positives = 317/587 (54%), Gaps = 50/587 (8%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP---KALSDVR 76
PAK + +++PIGNGRLGAM +GG+ ETL LNE ++W+G + N D P L ++R
Sbjct: 35 PAKEWMESLPIGNGRLGAMTYGGIEEETLALNESSMWSGQFNE--NQDKPFGRAKLDNLR 92
Query: 77 SLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L G+ E A L G + +GD++++F ++ K YRR L+LN A
Sbjct: 93 KLFFEGKLWEGNQTAGDNLNGMQTSFGTHLPIGDLKMKF--TYPKGDITGYRRSLNLNEA 150
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+ V ++ G V + RE+F++NPD V+V ++S + S++ +++LD L+ ++ NNQ+
Sbjct: 151 ISSVSFNAGGVNYKREYFATNPDNVLVLRLSADKPKSVTMDMALD-LMRQSAFTVENNQL 209
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
I G+ P P G+ F I + D G + +++ + V +D ++
Sbjct: 210 IFTGKV---DFPLHG-----PGGVNFEG--RIAVLADNGEVK-MDEAGISVSNADAVTMI 258
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
+ + + P D + + ++ Y L H+ DY LF+RV + L
Sbjct: 259 VDVRTDYKSP---------DYKALCATTVEEAGMKPYEALKLMHIKDYSNLFNRVELSLG 309
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQV- 371
+ D T+P+ R K ++ + D S L FQ+GRYL I+SSR + +
Sbjct: 310 KDSND------------TIPTDIRWKQIRSGKTDTSFDALYFQYGRYLTIASSRENSPLP 357
Query: 372 ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
LQG +N++ + W + H++IN + NYW S NL+EC PLF+++ LS++G+KT
Sbjct: 358 IALQGFFNDNQACNMGWTNDYHLDINTQQNYWVSNVGNLAECNTPLFNYIKDLSVHGAKT 417
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+V Y GW + +IW + A G ++W L+P+ G+W+ THLW Y YT D+ +L +
Sbjct: 418 AEVVYGCKGWTANTTANIWGYTPAS-GSIIWGLFPLAGSWIATHLWTQYEYTQDKKYLAE 476
Query: 490 RAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
AYPLL+G A F+LD++ E +GYL T PS SPE+ F +G+ S T D ++
Sbjct: 477 VAYPLLKGNAEFILDYMTENPANGYLMTGPSISPENWFKTANGQEMVASMMPTCDRELVY 536
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
E+F++ I AA++L ++ A + +L +L P ++ +G+I EW +
Sbjct: 537 EIFTSCIQAADILGIDK-AFSNNLQTALAKLPPIQLRANGAIREWFE 582
>gi|423346901|ref|ZP_17324588.1| hypothetical protein HMPREF1060_02260 [Parabacteroides merdae
CL03T12C32]
gi|409218562|gb|EKN11530.1| hypothetical protein HMPREF1060_02260 [Parabacteroides merdae
CL03T12C32]
Length = 809
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 205/609 (33%), Positives = 316/609 (51%), Gaps = 50/609 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T PL F+ PA + + P+GNGRLG M GGV +E + LNE ++W+G D
Sbjct: 17 NMPEVKTGKPLSYHFDAPAGIWEASFPLGNGRLGLMPDGGVDTENIVLNEISMWSGSKQD 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIE 109
NP A +L +R L+ G+ EA F G A+V YQLLG++
Sbjct: 77 TDNPQAYHSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSGQGQGANVPYGSYQLLGNLV 136
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L +D + YRREL+L+ A A + G V + RE F+S D + V ++
Sbjct: 137 LNYDYQGTSDSIFGYRRELNLDNAIATASFRRGKVTYNREVFTSFADDLGVIHLTADADR 196
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F+ ++ +++ N ++M+G+ P + KG+++++ +++
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYAS--RVRVIL 247
Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
+G D + V + A+LL+ +A+ FD KD + S L +
Sbjct: 248 PKGGNVTPGDSTVSVRNASEAILLVSMATDYFD----------KDLAGKVSSLLANAEKK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
++ L H+ Y+ LF RV + L S S EN+ P ER+ +F + +DP
Sbjct: 298 DFASLKKGHIAAYRSLFGRVELDLGHS---------SRENL---PMDERLAAFHENPDDP 345
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
SL L FQFGRYLLISS+R G NLQG+W ++ W+ H+NINL+MN+W + N
Sbjct: 346 SLAALYFQFGRYLLISSTRVGLLPPNLQGLWCNTINTPWNGDYHLNINLQMNHWPAEVAN 405
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 LSELHLPLVEWTKQQVASGERTAKAYYNARGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWLC HL+ HY YT+D+++L K YP+L+G + F +D L+E + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASLFFVDMLVEDPRNKYLVTAPTTSPENGY 523
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
P+GK A + STMD I+RE+F+ I AA++L + A ++ RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAADILGL-DSAFAGELAAKRARLMPTTIGK 582
Query: 587 DGSIMEWVQ 595
DG IMEW++
Sbjct: 583 DGCIMEWLE 591
>gi|282878225|ref|ZP_06287021.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
gi|281299643|gb|EFA92016.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
Length = 793
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 204/597 (34%), Positives = 313/597 (52%), Gaps = 46/597 (7%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
P+++ ++ PA++F +++PIGNGR+GA+V+GG + LN+ TLWTG P D + +A +
Sbjct: 23 PMQLWYDKPAQYFEESMPIGNGRMGALVYGGTRDNLIYLNDITLWTGQPVDPNLDQNAHQ 82
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ +R + Y +A + +++ G + YQ L + L D + Y R LD+
Sbjct: 83 WIPAIREALFKEDYRKADSLQLRVQGPNSQYYQPLATLHL-LDPRGGQ--ATNYTRTLDI 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A YS+ V+ RE+F+S+PD VI I+ ++ S+S V+L + + HS
Sbjct: 140 DKALLTDSYSLNGVKIKREYFASHPDSVICIHITANKPRSISLEVNLSAQIP-HSVKAAG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N I M+G G + I F ++L + +G I A + L ++ ++ A
Sbjct: 199 NLITMKGHAMG----------NPENSIHFCSVL--RAVTKQGKIQATDSTLLIIDATE-A 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L V +SF+G +P K +++ +++ Y + +H+ DY + R+ +
Sbjct: 246 TLFFVNETSFNGFDKHPVRQGKPCEQLALAHQKALEKKDYQTIKKQHVADYTHYYDRMKL 305
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELLFQFGRYLLISSSRPG 368
L S VTD CS + +++K + Q +P L L Q+GRYLLI+SSR
Sbjct: 306 FLGGS----VTD-CSRT------TEQQLKDYTDQGGHNPYLETLYMQYGRYLLIASSRTK 354
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQG+W+ L W S VNINLE NYW + NL E +PLF F+ L+ NG
Sbjct: 355 GIPANLQGLWSHYLRAPWRSNYTVNINLEENYWLAEVANLGEMAKPLFTFMQALAANGRH 414
Query: 429 TAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA+ Y + GW H +D+WA ++ R W+ W MGGAWL +LWEHY + D
Sbjct: 415 TAKNYYGINRGWCSSHNSDVWAMTNPVGEKRESPEWSNWNMGGAWLTQNLWEHYRFNPDA 474
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL A PLLEG ++F+LDWL+E + L T PSTSPE+E+ P+G Y T
Sbjct: 475 QFLNDTALPLLEGASAFMLDWLVENPKNPSELITAPSTSPENEYKTPEGYHGTTCYGGTA 534
Query: 543 DMAIIREVFSAIISAAEVLEKN------EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
D+AIIRE+F I+ AE + K + L++ + SL RL P I G + EW
Sbjct: 535 DLAIIRELF---INTAEAINKKGADYARQSQLLKDIEASLKRLHPYTIGHLGDLNEW 588
>gi|410096023|ref|ZP_11291014.1| hypothetical protein HMPREF1076_00192 [Parabacteroides goldsteinii
CL02T12C30]
gi|409227429|gb|EKN20327.1| hypothetical protein HMPREF1076_00192 [Parabacteroides goldsteinii
CL02T12C30]
Length = 821
Score = 315 bits (808), Expect = 3e-83, Method: Compositional matrix adjust.
Identities = 199/602 (33%), Positives = 317/602 (52%), Gaps = 48/602 (7%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
T N + F+ PA+ + + +P+GNGRLG M GG+ E + LNE ++W+G D NP A
Sbjct: 35 TANKIAYHFDEPARIWEETLPLGNGRLGMMPDGGINKENILLNEISMWSGSKQDTDNPQA 94
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDS 115
+L+++R L+ G+ EA + F P YQLLG++ L++
Sbjct: 95 VWSLANIRRLLFEGKNDEAQDLMYRTFVCKGAGSGQGQGANVPYGSYQLLGNLVLDYVYV 154
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
+ YRREL+LN A A + G V ++RE F+S + V + +L+F V
Sbjct: 155 DGSDSVAAYRRELNLNDAIASTSFRKGKVNYSRESFTSFSGDLGVVHLMADADKALNFTV 214
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
++ V+G + ++M+G+ P + KGI++ A + + + IS
Sbjct: 215 GMNRPEHYALSVDGKD-LLMKGQLP------DGVDTLEMKGIKYGARVRVLLPKGGSLIS 267
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
D L V+ + A+LL+ ++++ ++ +D + S L YS L
Sbjct: 268 G--DSSLTVQNASEAILLVSMATNYK------NEGFED---QLFSLLAESERKDYSTLRK 316
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
H++ Y+ LF RV + L RS +D +P ER+ +FQ D+ DPSL L F
Sbjct: 317 EHVNAYRSLFDRVDLDLGRSARD------------EMPINERLHAFQEDQNDPSLGALYF 364
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISS+R G+ NLQG+W ++ W+ H+NIN +MN+W + NLSE P
Sbjct: 365 QFGRYLLISSTRTGSLPPNLQGLWCNTINTPWNGDYHLNINFQMNHWPAEVTNLSELHLP 424
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ ++ +G +TA+V Y A G V H ++W + +A W AWLC HL
Sbjct: 425 MIEWTKQQVESGERTAKVFYNARGLVTHILGNVW-EFTAPGEHPSWGATNTSAAWLCEHL 483
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
+ HY YT+D+++L K YP+++G A F D L+ + + YL T P+TSPE+ + P+GK+
Sbjct: 484 FTHYQYTLDKEYL-KEVYPVMKGAALFFTDMLVRDPRNNYLVTAPTTSPENAYRMPNGKV 542
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ STMD I+RE+F+ I+AA +L + A +++ RL PT I +DG I+EW
Sbjct: 543 VHICAGSTMDNQIVRELFTNTIAAANILGI-DSAFCQELADKRSRLMPTTIGKDGRILEW 601
Query: 594 VQ 595
++
Sbjct: 602 LE 603
>gi|291550959|emb|CBL27221.1| hypothetical protein RTO_27700 [Ruminococcus torques L2-14]
Length = 775
Score = 315 bits (807), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 200/605 (33%), Positives = 308/605 (50%), Gaps = 57/605 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
KI F AK + +A+PIGNG LGAMV+G +E L++NED++WTG + NPDA +
Sbjct: 3 KICFREEAKDWNEALPIGNGFLGAMVFGKTGTERLQINEDSVWTGSFMERVNPDARENYP 62
Query: 74 DVRSLVDSGQY--AEATAASVKLFGHP-ADVYQLLGDIELEFDDS--------------- 115
VR L+ +G+ AE A +P YQ LGD+ ++F
Sbjct: 63 KVRELLLNGEIEQAELLAERSMYATYPHMRHYQTLGDVWIDFYKQRGKTIFKKDQGGLLS 122
Query: 116 --HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
H +TY RELD++ A +++Y ++ RE F+SNPD +IV ++ + L+F
Sbjct: 123 VQHESVEVQTYNRELDISRAVGKIQYESEKGKYEREFFASNPDHIIVYQMKSIDGELLNF 182
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGR--CPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
++SL + DN S G +G G +I D GI F +++++ +
Sbjct: 183 DLSL-TRKDNRS---GRGSSFCDGTEVLDGNKIRLYGKQGGD-HGIAFELLVQVRTKN-- 235
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
G IS + L VE + A L + A +SF + P M L + SY
Sbjct: 236 GKISRM-GSHLLVEDAKEATLFITARTSF---------RSEQPLQWCMDVLSNAEKESYG 285
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLV 350
L RH+ DY + + +++L+ +++ + + + ER++ + ED L+
Sbjct: 286 TLQERHIKDYLSYYEKSNLKLN-----------YKDSYEHLTTPERLEQMRNGIEDIELI 334
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
+ F RYLLISSSR G+ +NLQGIWNE+ P W S +NIN+EMNYW + LS+
Sbjct: 335 NTYYNFARYLLISSSREGSLPSNLQGIWNEEFEPMWGSKYTININIEMNYWIAEKTGLSK 394
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
PL + L + +G A+ Y G+ HH TDIW + V LWPMGGAW
Sbjct: 395 LHMPLLEHLQRMYPHGKDVAEKMYGIDGFCCHHNTDIWGDCAPQDNHVSSTLWPMGGAWF 454
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
C HL EHY YT DR+FL K Y +L+ F L ++++ G + PS+SPE+ ++
Sbjct: 455 CLHLIEHYKYTKDREFL-KEYYGILKDAVKFFLQYMVKDAHGKWISGPSSSPENIYLNQK 513
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRPTKIAEDG 588
G+ C+ ++MD IIRE+F+ + E+ E+N+ + L E + + L + +I + G
Sbjct: 514 GEAGCLCMGASMDTEIIRELFNGYL---EITEENQLPNDLNEAINERLNHMPELQIGKYG 570
Query: 589 SIMEW 593
I EW
Sbjct: 571 QIQEW 575
>gi|346972979|gb|EGY16431.1| alpha-L-fucosidase [Verticillium dahliae VdLs.17]
Length = 765
Score = 315 bits (806), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 205/597 (34%), Positives = 299/597 (50%), Gaps = 61/597 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + ++ PA +++A+P+GNGRLG MV+G +E L+LNED++W G P D T DA + L
Sbjct: 8 LTLHYDAPAASWSEALPVGNGRLGGMVYGRTSTELLQLNEDSVWYGGPQDRTPRDARRHL 67
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVY--QLLGDIELEFDDSHLKYAEETYRRELD 129
+R L+ ++A A A F PA + + LG+ LEF H YRR LD
Sbjct: 68 DTLRQLIRDEKHAAAEALVREAFFATPASMRHSEPLGNCTLEF--GHEAQDVTGYRRSLD 125
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLL 181
L TA A V+Y V + RE +S PD V+ + S SE ++ + L
Sbjct: 126 LATAQATVEYQCTGVSYRRETIASFPDNVVALRFSASEPTRFVVRLNRVSEIEWETNEFL 185
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALED 239
D+ NG +I++ GK N +P S +L I +D+ G+I A+ +
Sbjct: 186 DSIQAANG--RIVLNATPGGK--------NSNP----LSLVLGISCDANDEGGSIEAVGN 231
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
L++ A S + + K DP + + + S+ +L R
Sbjct: 232 -----------ALVVKAFSCTIAIAAHTTYRKADPEAAARQDVDKALKRSWHELVLRQRT 280
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DY LF R S+++ + D+ P+ ER+ + + DP LV L + +GRY
Sbjct: 281 DYASLFQRSSLRMWPAAHDL-------------PTNERI---EKNRDPGLVALYYNYGRY 324
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR + A LQGIWN +P W +NINL+MNYW + PCNL +C P+
Sbjct: 325 LLISSSRDSDKALPATLQGIWNPSFAPPWGCKYTININLQMNYWLAAPCNLVDCALPMLG 384
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +++ G+KTA+ Y GW HH TDIWA + + +WP+GG WLC + E
Sbjct: 385 LVERMAVRGAKTARTMYDCGGWCAHHNTDIWADTDPQDRWMPSTIWPLGGVWLCIDVLEM 444
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACV 536
Y DR L +RA LLEGC FLLD+LI G +L TNPS SPE+ F++ G +
Sbjct: 445 LLYQYDRK-LHERAAVLLEGCIVFLLDFLIPSACGKFLVTNPSLSPENTFVSKSGDTGIL 503
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S +D IIR F + + +L+K + LV +V ++ RL I DG I EW
Sbjct: 504 CEGSAIDTTIIRIAFEKFLWSTAILDKG-NPLVPEVRDAMARLPNLTINNDGLIQEW 559
>gi|253580291|ref|ZP_04857557.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251848384|gb|EES76348.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
Length = 751
Score = 315 bits (806), Expect = 6e-83, Method: Compositional matrix adjust.
Identities = 190/595 (31%), Positives = 303/595 (50%), Gaps = 60/595 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + F+ PA+ + +A+P+GNG +GAM +G +E ++LN D+LW+G + NP+
Sbjct: 4 LALIFDKPAEAWNEALPLGNGTMGAMSYGRFQNERIELNLDSLWSGNGRNKENPNKNVDW 63
Query: 73 SDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
R + +G Y A + G + Y G + + + ++ YRREL L
Sbjct: 64 DLFRKHIFAGDYQGAENYCKENVLGDWTESYLPAGTLSINVKEP-IQNGNSFYRRELCLT 122
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
AT ++++ ++ + RE F S + V+ S + +L +++L+S + + S N
Sbjct: 123 NATEKIEFCQDDLIYQREFFVSMSEPVMAIHYHTSPNCNLEMSITLESEIKHKSAFFAEN 182
Query: 192 QIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
II+EG+ P PP + ++ +GI+F+ + + + + G + DK
Sbjct: 183 GIILEGQAPIYVAPPYYSCEVPVVYEEGQGIRFA--IGLYVQTNGGNVYQQADKLFINTP 240
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D V + V+ + K+ S+ +++I+++ Y H+D Y F
Sbjct: 241 ND--VYIYVSG-------VTDFKQKELFFSKRNCMMENIQHIQYEKQKKAHMDVYANYFD 291
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R+ + ++ +P D L +F + RYL+I SS
Sbjct: 292 RMHLDINYTP-----------------------------DNELALKMFHYARYLMICSSV 322
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG+Q NLQGIWN + W S VNIN EMNYW + NLS+C PL + + S G
Sbjct: 323 PGSQCTNLQGIWNHHMRAPWSSNYTVNINTEMNYWMAEKANLSDCHMPLLELIERTSKKG 382
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS------ADRGKVVWALWPMGGAWLCTHLWEHYNY 480
KTAQ Y +GWV HH DIW SS D +++WPM WLC HLWEHY Y
Sbjct: 383 EKTAQDVYHLAGWVSHHNLDIWGHSSPVGQFGQDENPCTYSMWPMSSGWLCCHLWEHYCY 442
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
T+D FL+K+A+P+++G F L +L+ + GY T PSTSPE+ F+APD V+++S
Sbjct: 443 TLDEAFLKKKAFPIIQGAVEFYLGYLVP-YKGYYVTAPSTSPENTFLAPDMTTHGVTFAS 501
Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
TMD++I+RE+F + A E+L E +A V+ VL+ LP P KI ++G + EW
Sbjct: 502 TMDISILRELFGLYLKACEILGVEDFTNA-VKNVLQKLP---PYKIGKEGQLQEW 552
>gi|315500597|ref|YP_004089399.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
gi|315418609|gb|ADU15248.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
Length = 788
Score = 314 bits (805), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 199/591 (33%), Positives = 290/591 (49%), Gaps = 62/591 (10%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK---- 70
++FN PA + +A+P+GNGRLGAMV+GGV SE L+LN LW+G T D PK
Sbjct: 38 LSFNAPAARWMEALPVGNGRLGAMVYGGVRSERLQLNHIELWSG----RTVEDNPKTTRA 93
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPAD-----VYQLLGDIELEFDDSHLKYAEETYR 125
AL VR L+ + + AEA + P + YQ+LGD+ LE A Y
Sbjct: 94 ALPKVRELLFADKRAEANRLAQDDMMAPMNEVDYGSYQMLGDLRLEMGHEE---AVSDYS 150
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
RELD+ T V+Y +G ++R +S PDQ + +I S LS +L D
Sbjct: 151 RELDMATGQVTVRYRIGKATYSRTVLASAPDQCLAVRIETSAPEGLSLKATLKR--DRDV 208
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ Q++ K + P G+ + A L + G A + +V
Sbjct: 209 AFDWQGQVL------------KMSGQPQPFGVHYCAYLACR---SEGGSVAPDGHGFRVS 253
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ VL L ++ P +P + +A + S+ L D++ LF
Sbjct: 254 GARAVVLNLTGATDLLAP---------EPEKVAQAAQAKLVARSWQALARDQERDHRALF 304
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVP--SAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
RV + L+ + VP ++ER+ + + +L+E F FGRYLLI
Sbjct: 305 ERVELTLASA---------------GVPRLASERLAAASDAAEMALIETYFNFGRYLLIG 349
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
S+RPG+ NLQG+W + +P W + H+NIN++MNYW + C LSE E LFD++ L
Sbjct: 350 SNRPGSLPPNLQGLWADGFAPPWSADYHININIQMNYWPAEVCGLSELHESLFDYVDRLM 409
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+TAQ+ Y G V H+ T+ W ++ D GKV W LWP G AWL H WEHY YT D
Sbjct: 410 PYARQTAQIAYGCRGAVAHYTTNPWGHTALD-GKVQWGLWPEGLAWLTLHYWEHYLYTGD 468
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL+ RA P+ CA F LD+L+E G L + P++SPE+ ++ +G++ V M
Sbjct: 469 LEFLKTRALPVFRACAEFTLDYLVEDPRTGKLVSGPASSPENSYVMDNGEVGYVDMGCAM 528
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++ V + A E L E L E +L RL KI DG + EW
Sbjct: 529 SQSMAFTVLTLTQKATEALSV-EPELREACAAALARLDRLKIGPDGRVQEW 578
>gi|154494326|ref|ZP_02033646.1| hypothetical protein PARMER_03681 [Parabacteroides merdae ATCC
43184]
gi|423725485|ref|ZP_17699622.1| hypothetical protein HMPREF1078_03511 [Parabacteroides merdae
CL09T00C40]
gi|154085770|gb|EDN84815.1| hypothetical protein PARMER_03681 [Parabacteroides merdae ATCC
43184]
gi|409234609|gb|EKN27437.1| hypothetical protein HMPREF1078_03511 [Parabacteroides merdae
CL09T00C40]
Length = 809
Score = 314 bits (804), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 202/609 (33%), Positives = 315/609 (51%), Gaps = 50/609 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T PL F+ PA + + P+GNGRLG M GGV +E + LNE ++W+G D
Sbjct: 17 NMPEVKTGKPLSYHFDAPAGIWEASFPLGNGRLGLMPDGGVDTENIVLNEISMWSGSKQD 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIE 109
NP A +L +R L+ G+ EA F G A+V YQLLG++
Sbjct: 77 TDNPQAYHSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSGQGQGANVPYGSYQLLGNLV 136
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L +D + YRREL+L+ A A + G V + RE F+S D + V ++
Sbjct: 137 LNYDYQGTSDSIFGYRRELNLDNAIATASFRRGKVTYNREVFTSFADDLGVIHLTADADR 196
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F+ ++ +++ N ++M+G+ P + KG+++++ +++
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYAS--RVRVIL 247
Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
+G D + V + A+LL+ +A+ FD KD + S L +
Sbjct: 248 PKGGNVTPGDSTVSVRNASEAILLVSMATDYFD----------KDLEGKVSSLLANAEKK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
++ L H+ Y+ LF RV + L S ++ +P ER+ +F + +DP
Sbjct: 298 DFASLKKGHIAAYRSLFGRVELDLGHSSRE------------DLPMDERLAAFHENPDDP 345
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
SL L FQFGRYLLISS+R G NLQG+W ++ W+ H+NINL+MN+W + N
Sbjct: 346 SLAALYFQFGRYLLISSTRVGLLPPNLQGLWCNTINTPWNGDYHLNINLQMNHWPAEVAN 405
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 LSELHLPLVEWTKQQVASGERTAKAYYNARGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWLC HL+ HY YT+D+++L K YP+L+G + F +D L+E + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASLFFVDMLVEDPRNKYLVTAPTTSPENGY 523
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
P+GK A + STMD I+RE+F+ I AA++L + A ++ RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAADILGL-DSAFAGELAAKRARLMPTTIGK 582
Query: 587 DGSIMEWVQ 595
DG IMEW++
Sbjct: 583 DGRIMEWLE 591
>gi|160885575|ref|ZP_02066578.1| hypothetical protein BACOVA_03577 [Bacteroides ovatus ATCC 8483]
gi|156109197|gb|EDO10942.1| hypothetical protein BACOVA_03577 [Bacteroides ovatus ATCC 8483]
Length = 826
Score = 314 bits (804), Expect = 9e-83, Method: Compositional matrix adjust.
Identities = 200/594 (33%), Positives = 312/594 (52%), Gaps = 47/594 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 25 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 84
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 85 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 141
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 142 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 201
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R P + + A L++K G + D L V+G
Sbjct: 202 IYGKKGLRLEGITHGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 251
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 252 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 306
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + S+ N P R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 307 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 354
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 355 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 414
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 415 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 472
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 473 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 528
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +
Sbjct: 529 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 580
>gi|393781489|ref|ZP_10369684.1| hypothetical protein HMPREF1071_00552 [Bacteroides salyersiae
CL02T12C01]
gi|392676552|gb|EIY69984.1| hypothetical protein HMPREF1071_00552 [Bacteroides salyersiae
CL02T12C01]
Length = 821
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 192/593 (32%), Positives = 310/593 (52%), Gaps = 48/593 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ ++ PA + +++P+GNGRLGAMV+G E +LNE+T+W G P + TNP A +AL
Sbjct: 24 MKLWYDRPATQWVESLPLGNGRLGAMVYGDPIHEEFQLNEETIWGGSPYNNTNPKAKEAL 83
Query: 73 SDVRSLVDSGQYAEATA------ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA A S G P YQ +G + L+F+ + Y R
Sbjct: 84 PQIRQLIFEGRNKEAQALCGPNICSQTANGMP---YQTVGSLHLDFEGIS---SYSNYYR 137
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-- 184
ELD+ A +++ G V +TRE F+S PDQ+++ +++ SE G LSF + +
Sbjct: 138 ELDIEKAVTTTRFTAGGVTYTREAFTSFPDQLLIIRLTASEKGKLSFTARYSTPYQENIT 197
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
++ ++ M+G KAN ++ +G +QF+A+ +I + G + ++ D L+
Sbjct: 198 KSISSRKELQMDG---------KANDHEGIEGKVQFTAL--TRIERNGGHMESVSDTLLR 246
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V ++ +V + V S FIN D + + + L++ +Y H Y K
Sbjct: 247 VRNAN-SVTIYV---SIGTNFINYKDISGNARKTAQTYLKNAGK-NYLKAKEAHCATYGK 301
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L + + P+ RV F + DP L L FQFGRYLLI
Sbjct: 302 WFNRVSLDLGSNAQA------------AKPTDVRVHEFASAFDPQLAALYFQFGRYLLIC 349
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN L WD +IN+EMNYW + P NL+E EP + ++
Sbjct: 350 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAEPTNLTEMHEPFLQLVKEVA 409
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G ++A + Y GW +HH TDIW + + G + +WP AW C HLW+ Y ++ +
Sbjct: 410 EQGRQSAAM-YGCRGWTLHHNTDIWRSTGSVDGP-GYGIWPTCNAWFCQHLWDRYLFSGN 467
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
RD+L + YPL+ F LD+LI E + +L +PS SPE+ + V +TM
Sbjct: 468 RDYLAE-VYPLMRSACEFYLDFLIREPQNNWLVVSPSYSPENRPSVNGKRDFVVVAGATM 526
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D ++ ++F + AA ++ ++ ++ + + L P ++ G + EW++
Sbjct: 527 DNQMVSDLFHNTLEAASLMGES-STFMDSLQTVVQNLAPMQVGRWGQLQEWME 578
>gi|333380444|ref|ZP_08472135.1| hypothetical protein HMPREF9455_00301 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826439|gb|EGJ99268.1| hypothetical protein HMPREF9455_00301 [Dysgonomonas gadei ATCC
BAA-286]
Length = 786
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 194/599 (32%), Positives = 316/599 (52%), Gaps = 52/599 (8%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
+ N + FN PA + ++IP+GNGR+G M WGGV E + LNE +LW G D NPDA
Sbjct: 20 SQNKWQYYFNEPASAWEESIPLGNGRIGMMPWGGVDKERIVLNEISLWAGNKQDADNPDA 79
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLF------GHPADV--YQLLGDIELEFDDSHLKYA 120
K L ++R L+ + EA K F G AD ++ G++ ++ A
Sbjct: 80 YKHLGEIRKLLFEKKNREAQELMYKTFTCKGEGGSGADYGKFENFGNLYIDITYPDASAA 139
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRR LD+N A + V Y+ G +++TRE+F+S D + + + + +S +L+ +SLD
Sbjct: 140 VSDYRRTLDMNNALSDVTYTKGGIKYTREYFTSFTDDIGIARYTADKSKALNMCISLDRD 199
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ +Y +G I G+ P A + +G+++ +++ ++ +G +
Sbjct: 200 ENYETYASGPVLYIF-GQLP---------AGEGKEGMKYLGMVK---AEHKGGQLFTNAR 246
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA--LQSIRNLSYSDLYTRHL 298
++++ +D L + +++++G E ++ L ++ Y +H+
Sbjct: 247 DIEIKNADEVTLFISLATNYNGV-----------EHEKLAGYLLNKLKG-DYKTRKQKHI 294
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
+ YQ LF+RV + L ++ +N D +P +R+++F D D L L Q+G
Sbjct: 295 EKYQNLFNRVDLTLGKN-----------KNSD-LPINKRLEAFVNDRSDYDLAALYMQYG 342
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISS+R G NLQG+W + W+ H+NINL+MN W + CNLSE P +
Sbjct: 343 RYLLISSTREGGLPPNLQGLWAPQIHTPWNGDYHLNINLQMNLWPAEVCNLSELHLPTIE 402
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
++ L+ G KTA+V Y + GWV H ++W +S W GAW+C HLWEH
Sbjct: 403 YVKSLTEPGHKTAKVYYNSDGWVTHILGNVWGFTSPGESP-SWGATNTSGAWMCQHLWEH 461
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACV 536
Y Y+ D ++L K YP ++G A F + L+E ++GYL T P+TSPE+ +I G + V
Sbjct: 462 YLYSQDVEYL-KSVYPTMKGAALFFENMLVEDPNNGYLVTAPTTSPENTYITESGDVLSV 520
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
STMD I+RE+F+ + AA++L +E + + RL PT I + G IMEW++
Sbjct: 521 CAGSTMDNQIVRELFTNVSEAAKILNTDEQ-WIRTIETKKQRLAPTTIGKYGQIMEWLE 578
>gi|255692382|ref|ZP_05416057.1| fibronectin type III domain protein [Bacteroides finegoldii DSM
17565]
gi|260621848|gb|EEX44719.1| hypothetical protein BACFIN_07502 [Bacteroides finegoldii DSM
17565]
Length = 826
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 200/594 (33%), Positives = 312/594 (52%), Gaps = 47/594 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 25 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKA 84
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 85 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 141
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 142 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 201
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R P + + A L++K G + D L V+G
Sbjct: 202 IYGKKGLRLEGITYGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 251
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 252 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 306
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + S+ N P R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 307 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 354
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 355 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 414
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 415 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 472
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 473 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 528
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +
Sbjct: 529 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 580
>gi|423290259|ref|ZP_17269108.1| hypothetical protein HMPREF1069_04151 [Bacteroides ovatus
CL02T12C04]
gi|423294445|ref|ZP_17272572.1| hypothetical protein HMPREF1070_01237 [Bacteroides ovatus
CL03T12C18]
gi|392665646|gb|EIY59169.1| hypothetical protein HMPREF1069_04151 [Bacteroides ovatus
CL02T12C04]
gi|392675636|gb|EIY69077.1| hypothetical protein HMPREF1070_01237 [Bacteroides ovatus
CL03T12C18]
Length = 816
Score = 313 bits (803), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 200/594 (33%), Positives = 312/594 (52%), Gaps = 47/594 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 15 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 74
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 75 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 131
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 132 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 191
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R P + + A L++K G + D L V+G
Sbjct: 192 IYGKKGLRLEGITHGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 241
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 242 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 296
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + S+ N P R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 297 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 344
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 345 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 404
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 405 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 462
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 463 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 518
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +
Sbjct: 519 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 570
>gi|336415223|ref|ZP_08595564.1| hypothetical protein HMPREF1017_02672 [Bacteroides ovatus
3_8_47FAA]
gi|335941256|gb|EGN03114.1| hypothetical protein HMPREF1017_02672 [Bacteroides ovatus
3_8_47FAA]
Length = 816
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 200/594 (33%), Positives = 312/594 (52%), Gaps = 47/594 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 15 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 74
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 75 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 131
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 132 LDISNAVAVARYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 191
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R P + + A L++K G + D L V+G
Sbjct: 192 IYGKKGLRLEGITHGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 241
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 242 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 296
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + S+ N P R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 297 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 344
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 345 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 404
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 405 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 462
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 463 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 518
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +
Sbjct: 519 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 570
>gi|380692991|ref|ZP_09857850.1| hypothetical protein BfaeM_03308 [Bacteroides faecis MAJ27]
Length = 779
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 201/593 (33%), Positives = 314/593 (52%), Gaps = 43/593 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKA 71
+K+ ++ PA + ++P+GNGRLGAMV+GGV +ET+ LNE T+W+G ++ P +
Sbjct: 1 MKLWYDKPADKWMKSLPLGNGRLGAMVYGGVETETIGLNESTMWSGEYDEHQQRPLGREK 60
Query: 72 LSDVRSLVDSGQYAEAT-AASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L +R L AE A + G H A + +GD++L F + ++ Y EL
Sbjct: 61 LDQIRKLFFEDNLAEGNHIAGNTMAGSPHSAGTHLPIGDLKLNFTYPEGELSD--YHHEL 118
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL TAT V Y VG+ E+TR+ +SNPD VI I S S++ + L LL N V
Sbjct: 119 DLTTATNTVTYKVGDTEYTRQCIASNPDDVIAMHIKASRPESITVELELQ-LLRNAEVVA 177
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
NQ+I G ++ G+ F + +I GTI A + KKL ++ +
Sbjct: 178 SGNQLIYTGNAEFEK--------HGRGGVLFEGRIAAEIKG--GTIKA-DGKKLLIDKAT 226
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+LL S + N + + D + +++ S+ L H++DY LF RV
Sbjct: 227 EVLLL----SDVRTNYKNTTFAGYDYQQKCKETIEAASKKSFKTLRNTHVEDYTPLFSRV 282
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
++ + K +P+ +R + E DP L L FQ+ RYLLISSSRP
Sbjct: 283 ALSFGENGK-----------FSHLPNDQRWARVKAGESDPGLDALFFQYARYLLISSSRP 331
Query: 368 GTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
+ + LQG +N++L+ W + H++IN E NYW + NL EC PLFD++ LS+
Sbjct: 332 NSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLPECHLPLFDYIKDLSV 391
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+GSK AQ Y GW H ++ W ++ G ++W L+P +W+ +H+W Y YT D+
Sbjct: 392 HGSKIAQDLYGCKGWTAHTTSNPWGYAAVS-GSILWGLFPTASSWITSHVWTQYEYTQDK 450
Query: 485 DFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
+FL++ AYPLL+ A FLLD+++ + + YL T PS SPE+ F G+ C S T D
Sbjct: 451 NFLKETAYPLLKSNAEFLLDYMVTDPRNNYLVTGPSISPENSF-RYQGQEFCASMMPTCD 509
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWVQ 595
++ E+FSA + + E+L N DA L++ + +L P +I+ +G + EW +
Sbjct: 510 RVLVYEIFSACLKSTEIL--NVDAAFADSLRTAISKLPPFRISANGGVQEWFE 560
>gi|218262384|ref|ZP_03476870.1| hypothetical protein PRABACTJOHN_02544 [Parabacteroides johnsonii
DSM 18315]
gi|218223418|gb|EEC96068.1| hypothetical protein PRABACTJOHN_02544 [Parabacteroides johnsonii
DSM 18315]
Length = 809
Score = 313 bits (801), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 199/609 (32%), Positives = 313/609 (51%), Gaps = 50/609 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T L F+ PA+ + + +P+GNGRLG M GGV +E + LNE ++W+G D
Sbjct: 17 NMPEVKTGKSLSYHFDAPAEIWEETLPLGNGRLGLMPDGGVDTEKIVLNEISMWSGSKQD 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L +R L+ G+ EA F P YQLLG++
Sbjct: 77 TDNPQAYYSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSALGQGANAPYGSYQLLGNLV 136
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L +D + YRREL+L+ A A + G V++ RE F+S D + V ++
Sbjct: 137 LNYDYQGSSDSISGYRRELNLDNAIATASFRRGKVKYDREVFTSFADDLGVIHLTADADK 196
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F+ ++ +++ N ++M+G+ P + KG+++++ + + +
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYASRVRVVLPK 249
Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
I D + + + A+LL+ +A+ FD KD + S L +
Sbjct: 250 GGNVIPG--DSTVTIRNASEAILLVSMATDYFD----------KDLDEKVASLLANAEKK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
++ L H+ Y+ LF RV + L S ++ +P ER+ +F D +DP
Sbjct: 298 DFASLKKGHIAAYRSLFGRVDLDLGHSSRE------------DLPIDERLATFNADPDDP 345
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
SL L FQFGRYLLISS+R G NLQG+W ++ W+ H+NINL+MN+W + N
Sbjct: 346 SLGALYFQFGRYLLISSTRVGLLPPNLQGLWCNTVNTPWNGDYHLNINLQMNHWPAEVAN 405
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 LSELHLPLVEWTKQQVASGEQTAKAYYNAGGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWLC HL+ HY YT+D+++L K YP+L+G + F +D L+E + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASRFFVDMLVEDPRNKYLVTAPTTSPENGY 523
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
P+GK A + STMD I+RE+F+ I AA +L + A +++ RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAANILGI-DSAFAGELVAKRARLMPTTIGK 582
Query: 587 DGSIMEWVQ 595
DG IMEW++
Sbjct: 583 DGRIMEWLE 591
>gi|294806382|ref|ZP_06765225.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|294446397|gb|EFG15021.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 562
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 202/582 (34%), Positives = 302/582 (51%), Gaps = 57/582 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK++++A+PIGN RLGAMV+GG E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAKNWSEALPIGNSRLGAMVYGGTEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATA---ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
VR L+ G+ EA A+ H Y LG++ LEF K A++ YR +L+
Sbjct: 82 PIVRKLIFEGRNKEAQRLIDANFLTRQHGMS-YLTLGNLYLEFPGH--KDADDFYR-DLN 137
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L AT +Y V + +TR F+S D VI+ I S+ +L+FNVS + L N V
Sbjct: 138 LENATTTTRYQVNGINYTRTTFASFTDNVIIMHIKASQPNALNFNVSYNCPLKNEVNVQN 197
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ II C GK + +G++ + E ++ I L++ G
Sbjct: 198 DKLIIT---CQGK----------EQEGMKAALRAECQVQVKTDGIIHPAGNILQINGGTE 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N + D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQNVSADESRRTTDYLEEAILIPYEKALKEHIAFYKKQFDRVQ 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L S + + R+++F D ++ LLFQ+GRYLLISSS+PG
Sbjct: 301 LHLPSS------------EASQIETPRRIENFGQGNDMAMAALLFQYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCWGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPTYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
I + + A+ + + + + + ++L +L P +I +
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGK 554
>gi|373956599|ref|ZP_09616559.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
gi|373893199|gb|EHQ29096.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
Length = 783
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 204/611 (33%), Positives = 317/611 (51%), Gaps = 62/611 (10%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S ++PL++ +N PA+ + + +P+GNGRLG M GGV ET+ LN+ TLW+G P D N
Sbjct: 20 SFGQSHPLRLWYNKPAQMWEETLPLGNGRLGMMPDGGVSQETIVLNDITLWSGAPQDANN 79
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEFD---- 113
A K+L +R L+ G+ EA A + F G YQ+LG++ L F
Sbjct: 80 YQAYKSLPQIRKLLMEGKNDEAQALVDQAFICTGKGSGGVNYGCYQVLGNLSLNFQYPDH 139
Query: 114 ---DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS 170
+S + Y + Y REL L+ A A+ Y V V + RE+ +S D V + K++ + G
Sbjct: 140 NTANSPVNY--QNYERELTLDNAIAKCTYQVNGVTYKREYITSFGDDVDIIKLTADKPGQ 197
Query: 171 LSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
L+ ++ + + + V N + MEG+ + D KG+Q+ AI++ ++
Sbjct: 198 LNLSIGISRPERSATSV-ANGALQMEGQL---------DNGIDGKGMQYQAIVK---AEQ 244
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
+G ++ ++ + ++ + A + F P K+ S A+Q Y
Sbjct: 245 QGGSVNYSSSQINIKDATSVIIYISAGTDFRNPHF-----KQSIQSVLTKAIQK----PY 295
Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSP-KDIVTDTCSEENIDTVPSAERVKSFQTDE--DP 347
S +H+ YQKLF+RV + L P K++ TD +R+ +F D D
Sbjct: 296 SLQKQQHIARYQKLFNRVHVNLGAEPAKELTTD-------------QRLIAFHADRKADN 342
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L L FQFGRYL I S+R G NLQG+W +S W H+++N++MN+W N
Sbjct: 343 GLPALFFQFGRYLSICSTRVGLLPPNLQGLWANQISTPWTGDYHLDVNVQMNHWPLEVAN 402
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL D + + +G KTA+ Y A GWV H T++W + W G
Sbjct: 403 LSELNLPLADLVKRMVPHGEKTAKAYYNAKGWVAHVITNVWQFTEPGE-SASWGATKAGS 461
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
WLC +LWEHY +T D ++L + YP+L+G A F D LI+ G+L T+PS+SPE+ F
Sbjct: 462 GWLCDNLWEHYAFTNDVNYL-RDIYPVLKGAAQFYNDMLIKDPKSGWLVTSPSSSPENSF 520
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRPTKI 584
P+GK A + T+D IIRE+F+ +I+A+ L + A +++ + LP P +I
Sbjct: 521 YLPNGKHASICLGPTIDNQIIRELFNNVITASGKLGVDAALSAELQQRVTQLP--PPGRI 578
Query: 585 AEDGSIMEWVQ 595
A DG IMEW++
Sbjct: 579 ASDGRIMEWME 589
>gi|383110853|ref|ZP_09931671.1| hypothetical protein BSGG_1961 [Bacteroides sp. D2]
gi|382949363|gb|EFS31261.2| hypothetical protein BSGG_1961 [Bacteroides sp. D2]
Length = 810
Score = 312 bits (800), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 207/592 (34%), Positives = 309/592 (52%), Gaps = 59/592 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG E L+LNE+T W G P N +A L
Sbjct: 22 LKLWYSQPARNWSEALPIGNSRLGAMVYGGTEREELQLNEETFWAGGPYSNNNSNAKYVL 81
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR+L+ G+ EA + F Y LG++ ++F K A YR +L+L
Sbjct: 82 PVVRNLIFDGKNREAQSLVDANFLTKQHGMSYLTLGNLYIDFPGH--KDASGFYR-DLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V V +TR F+S D VI+ I ++ +L+FN++ + L+ + +
Sbjct: 139 ENATTTTRYEVNGVTYTRTTFASFTDNVIIVHIQADKTQALNFNMTYNCPLEYNVNAQDD 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
II C GK IQ ++++K + G IS K L+VE + A
Sbjct: 199 KLIIT---CQGKE------QEGIKAAIQAECVVQVKTN---GAISP-AGKVLQVEKATEA 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L + A++++ +N + + + + L+ Y+ H+ Y+K F RV +
Sbjct: 246 TLYIAAATNY----VNYQNVSANASERANKFLEKAIQTPYNKALKDHIAFYKKQFDRVRL 301
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L SE + P R+++F ED ++ LLFQFGRYLLISSS+PG Q
Sbjct: 302 NLP----------SSEASKAETP--RRIENFNKGEDMAMAALLFQFGRYLLISSSQPGGQ 349
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++TA
Sbjct: 350 PANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVANLSETHSPLFSMLKDLSVTGAETA 409
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
Q Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T D++FL
Sbjct: 410 QSMYNCRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGDKEFL 465
Query: 488 EKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YP+L+G A F +D+L+E D +L PS SPEH ++ TMD I
Sbjct: 466 -KEYYPILKGTAQFYMDFLVEHPDYKWLVVAPSVSPEH---------GPITAGCTMDNQI 515
Query: 547 IREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ + A+ + + +D+L +++L LP P +I + + EW++
Sbjct: 516 AFDALHNTLLASRITGETSSFQDSL-QQILDKLP---PMQIGKHHQLQEWLE 563
>gi|167764888|ref|ZP_02437009.1| hypothetical protein BACSTE_03280 [Bacteroides stercoris ATCC
43183]
gi|167697557|gb|EDS14136.1| hypothetical protein BACSTE_03280 [Bacteroides stercoris ATCC
43183]
Length = 825
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 195/595 (32%), Positives = 315/595 (52%), Gaps = 52/595 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+ + +A+P+GNG LGAMV+G E +LNE+T+W G P + TNP A +AL
Sbjct: 27 LKLWYDSPARQWVEALPLGNGSLGAMVFGDPIHERFQLNEETVWGGSPHNNTNPKAKEAL 86
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA S G P YQ +G + L+F+ KY + Y R
Sbjct: 87 PRIRQLIFEGKNKEAQELCGPAICSQSANGMP---YQTVGTLHLDFEGIS-KY--DDYYR 140
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNH 184
+LD+ A A +++ + + RE F+S PD+++V +++ S+ S+SF + +
Sbjct: 141 DLDIEKAIATTRFTANGITYVRETFTSFPDRLLVIRLTASKKRSISFTAHYTTPYTENTE 200
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
++ N++ + G KAN ++ +G ++F+A+ +I ++ GT+ A D L+
Sbjct: 201 RRISSLNELQLNG---------KANDHEGIEGKVRFTAL--TRIENNGGTLKATSDSTLQ 249
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL-QSIRNLSYSDLYTRHLDDYQ 302
V+ ++ VL + ++F IN D D + + Q+ +N Y+ H+ YQ
Sbjct: 250 VKNANSVVLYVSIGTNF----INYKDISGDALKTAQQYMKQAGKN--YTKRKEAHIAAYQ 303
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
K F+RVS+ L S I P+ RVK F + DP + L FQFGRYLLI
Sbjct: 304 KYFNRVSLDLG-----------SNSQIKK-PTDRRVKEFSSTADPQMAALYFQFGRYLLI 351
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS+PG Q ANLQGIWN L WD +IN+EMNYW + L E EP + +
Sbjct: 352 CSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAETTALPEMHEPFLQLVKEV 411
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+I G ++A + Y GW +HH TDIW + A G + +WP AW C HLW+ Y ++
Sbjct: 412 AIQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-KYGIWPTCNAWFCQHLWDRYLFSG 469
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+++L + YP++ G F LD+L+ E + +L PS SPE+ + + +T
Sbjct: 470 DKNYLAE-VYPIMRGACEFYLDFLVREPQNNWLVVAPSYSPENSPSVNGKRDFVIVAGAT 528
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
MD ++ ++F I AA ++ NE L+++ + L P ++ G + EW++
Sbjct: 529 MDNQMVYDLFHNTIQAATLM--NEHKSFTDSLQTVAKHLAPMQVGRWGQLQEWME 581
>gi|160879031|ref|YP_001557999.1| hypothetical protein Cphy_0874 [Clostridium phytofermentans ISDg]
gi|160427697|gb|ABX41260.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
Length = 760
Score = 312 bits (799), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 205/593 (34%), Positives = 300/593 (50%), Gaps = 60/593 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PAK + +A+P+GNGRLGAM++G E +++NED++W+G D NPDA K L +R
Sbjct: 8 YQDPAKDWDEALPLGNGRLGAMIYGKPEHEIIQVNEDSIWSGYAMDRNNPDAKKNLPIIR 67
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G EA A++ L G P ++ YQ G+I + S + Y+R+L+L+ A
Sbjct: 68 SLIADGNLEEAQNATLHSLSGTPDNMRCYQTAGEIHITTGHSEVT----NYKRQLNLSEA 123
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKIS--GSESGSLSFNVSLDSLLDNHSYVNGNN 191
T V Y F REH S P V V + + G +LS +S +D Y +
Sbjct: 124 TVTVSYDFEGTTFIREHLISTPADVFVMRFTSKGPRKLNLSILLSRPHFMDR-LYCENGD 182
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
I++ R GI F L +A D K+K G+ V
Sbjct: 183 SIVLTYR----------------GGIPFCNRL----------TAASCDGKIKTIGAHLVV 216
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ F I + ++ T++ S L +++L + +L H DYQ F R +
Sbjct: 217 SEATTVTLFFD--IRTAYRSENYTNDVKSHLMDVKSLQFDELKRSHKKDYQSFFKRNDLI 274
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
L+ S ++ E ++ T+ +A+R++ + D L+E F FGRYLLIS SRPGT
Sbjct: 275 LTPSAEE-------EADVATLDTAKRLERMRMGHSDLKLLEDYFHFGRYLLISCSRPGTL 327
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN ++P W +NIN EMNYW + NL E PLFD L + NG TA
Sbjct: 328 PANLQGIWNNSMTPPWGGKFTININTEMNYWFAEKLNLPELHLPLFDLLKRMHQNGKVTA 387
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y G+V HH TD+W + + W +GGAWLC H+WEHY YT D +FL
Sbjct: 388 EKMYGCHGFVAHHNTDLWGDCAPQDYWLPGTYWVLGGAWLCLHIWEHYEYTKDINFL-IN 446
Query: 491 AYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
+P+L FL ++L E +G L +P+ SPE+++ P+G++ + TMD I+RE+
Sbjct: 447 MFPVLSDACLFLTEFLTEDENGKLILSPTASPENKYRHPNGRIGYLCAGCTMDHQIMREL 506
Query: 551 FSAIISAAEVL--EKNED-------ALVEKVLKS----LPRLRPTKIAEDGSI 590
F I A L KN AL EK+ KS L RL T++ +G+I
Sbjct: 507 FHHYIDAYHTLLDAKNSTENKEVPIALNEKLTKSVKDCLSRLPETRVHSNGTI 559
>gi|289773991|ref|ZP_06533369.1| large secreted protein [Streptomyces lividans TK24]
gi|289704190|gb|EFD71619.1| large secreted protein [Streptomyces lividans TK24]
Length = 693
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 183/511 (35%), Positives = 269/511 (52%), Gaps = 46/511 (9%)
Query: 92 VKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTRE 149
+ G P++ YQ+LGD+EL + Y RELDL TA AR Y+ G V RE
Sbjct: 15 AEFLGSPSEQAAYQVLGDLELTLAG---EGEAADYERELDLETAVARTTYTRGGVRHVRE 71
Query: 150 HFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKAN 209
F+S PDQV+V ++S G++ F S + + I ++G +
Sbjct: 72 VFASAPDQVLVVRLSADTPGAVGFTARFTSPQRSGGSAVDAHTIALDG--------VGGD 123
Query: 210 ANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
P ++F + ++S D GT L VEG+D A L++ ++S+
Sbjct: 124 WYGRPGSVRFRGLARAESEGGRVSTDGGT--------LTVEGADAATLVISLATSYR--- 172
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
N D DP S + + L Y+ L RH+ D+++LF RV++ L S +
Sbjct: 173 -NYLDVGADPASRARNHLAPAARKPYAHLRARHVADHRRLFGRVALDLGPSERA------ 225
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSP 384
+P+ +R+ F +DP L L FQ+GRYLL S SR Q ANLQG+WN+ L+P
Sbjct: 226 ------ELPTDQRIPLFADGKDPQLAALYFQYGRYLLASCSRSPGQPANLQGLWNDSLNP 279
Query: 385 TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHK 444
W+S VNIN EMNYW + P NL+EC +P + L+ +G++TA+ Y A GWV+HH
Sbjct: 280 AWESKYTVNINFEMNYWPAGPGNLAECWDPAVRMVHELAESGTRTAKALYDAPGWVLHHN 339
Query: 445 TDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD 504
TD W + +A + +WP GGAWLC LW+HY +T D L R YP+++G F LD
Sbjct: 340 TDGW-RGTAPVDAAQYGMWPTGGAWLCVMLWDHYRFTGDTGAL-SRNYPVMKGAVEFFLD 397
Query: 505 WL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK 563
L ++ G+L TNPS SPE +G+ + TMDM ++R++F A AAEVL++
Sbjct: 398 TLQVDAETGWLVTNPSQSPEVTHHQDEGESVSICAGPTMDMQLLRDLFDAYRQAAEVLDR 457
Query: 564 NEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ LV +V + RL PT++ G I EW+
Sbjct: 458 DSR-LVGRVTEVRDRLAPTRVGHLGQIQEWL 487
>gi|375144807|ref|YP_005007248.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361058853|gb|AEV97844.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 780
Score = 311 bits (798), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 204/603 (33%), Positives = 318/603 (52%), Gaps = 55/603 (9%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
T PL++ ++ PA + + +P+GNGRLG M GGV E + LN+ TLW+G P D N A
Sbjct: 27 TNKPLRLWYDKPAAQWEETLPLGNGRLGMMPDGGVLQENIVLNDITLWSGAPQDANNYKA 86
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEFD-DSHLKY 119
+ L +++ L+ G+ EA A K F P +Q LG + + F+ D
Sbjct: 87 NQKLPEIQKLLLEGKNDEAQALINKDFICTGKGSGAEPFGCFQTLGRLGIAFNYDGPANA 146
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
A Y R+L LN A A Y VG+V + RE+F+S + V + K++ S +G L+F VSL S
Sbjct: 147 AFTNYSRQLSLNDAAAACTYKVGDVTYNREYFTSFGNDVGIIKLTASAAGKLNFEVSL-S 205
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+ + N++ M G+ D KG+Q+ A++ K++ G++SA +
Sbjct: 206 RPEKATVTVAGNKLEMAGQLEN---------GTDGKGMQYVALVSAKLTG--GSLSAAGN 254
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
K L V+ + A+L A +S+ D + L ++Y +HL+
Sbjct: 255 K-LVVKNATKAILFFSAKTSY---------KDADYRQHAQQLLDKAMLVAYDAEKKKHLN 304
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELLFQFG 357
+Y KLF+R+ + L S D +P+ +R+ F T D L L +Q+
Sbjct: 305 NYGKLFNRLQVDLGSS------------GADELPTDQRLDKFYNATTPDNRLTVLFYQYS 352
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYL ISS+R G NLQG+W ++ W+ H+++N++MN+W P NLSE PL D
Sbjct: 353 RYLSISSTRVGLLPPNLQGLWAHEVHTPWNGDYHLDVNVQMNHWGVEPANLSELNLPLAD 412
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ + +G KTA+ Y A GWV H T+ W + W + G WLC +LW+H
Sbjct: 413 LVKEMGPHGEKTAKAYYNARGWVAHVITNPWLFTEPGE-SASWGVTKAGSGWLCNNLWDH 471
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDG-KLAC 535
Y ++ D ++L K+ YP+L+G A F D LI+ + G+L T PS+SPE+ F PDG K +
Sbjct: 472 YTFSNDLNYL-KKIYPVLKGSALFYSDILIKDPETGWLVTAPSSSPENWFYMPDGSKQSS 530
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPTKIAEDGSIME 592
+ +T+D IIRE+F+ +I+A+E L +E L EK LK +P +I+ DG +ME
Sbjct: 531 ICMGATIDNQIIRELFNNVITASEQLHIDEPFRKELKEK-LKQIP--PAAQISADGRVME 587
Query: 593 WVQ 595
W++
Sbjct: 588 WLK 590
>gi|375256587|ref|YP_005015754.1| putative lipoprotein [Tannerella forsythia ATCC 43037]
gi|363407344|gb|AEW21030.1| putative lipoprotein [Tannerella forsythia ATCC 43037]
Length = 850
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 201/626 (32%), Positives = 319/626 (50%), Gaps = 76/626 (12%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
F+ PA + ++ P+GNGR+G M GG+ E + LNE ++W+G NP A K+L +R
Sbjct: 32 FDEPATLWEESFPLGNGRIGLMPDGGIEKENIVLNEISMWSGSKQQTDNPAAQKSLGRIR 91
Query: 77 SLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEF-----DDSHLK 118
L+ +G+ EA F P YQLLG++ L+F DD+ +
Sbjct: 92 ELLFAGRNDEAQELMYDTFVCYGDGSGRGSGANKPYGSYQLLGNLMLDFTYDAADDAQVS 151
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
YRRELDL A + + G E++RE F+S D V V ++ + L + ++
Sbjct: 152 ----DYRRELDLEQALTTLSFRKGKTEYSREVFTSFADDVAVIRLKVNNGRKLQCQIGMN 207
Query: 179 SLLDNHSYVNGNNQIIMEGRC-----------------------PGKRIPPKANAN---- 211
+ ++ N+++ M GR IP
Sbjct: 208 RP-ERYAVRAENSELEMRGRLYEGDAYKTKEQLEREEAMRNRTNNSDSIPAAEQKTMPGA 266
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSK 271
+D +G+++++ +++ + + G + A D L VE + +LL+ ++ + G + D++
Sbjct: 267 EDGQGVRYASRVQVVLPNG-GEVKAFNDTTLIVEEASEIILLVGMATDYFGKAV---DAQ 322
Query: 272 KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT 331
D S L + + SY L H+ YQ+L+HRV++ R+ + +
Sbjct: 323 ID------SLLTAAASKSYETLKEEHIRAYQELYHRVAVHFGRNAQK-----------EA 365
Query: 332 VPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAP 390
+P +R+++FQ D+ DPSL+ L +QFGRYLLISS+RPG NLQG+W + W+
Sbjct: 366 LPMNKRLEAFQNDKNDPSLLALYYQFGRYLLISSTRPGLLPPNLQGLWCNTIHTPWNGDY 425
Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAK 450
H+NINL+MN W + NLSE PL ++ +G +TA+ Y A GWV H ++W +
Sbjct: 426 HLNINLQMNLWPAETGNLSELHLPLIEWTKQQVESGRQTAKAFYNARGWVTHILGNVW-E 484
Query: 451 SSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG- 509
+A W AWLC HL+ HY +T+D +L + YP++ A F +D L+E
Sbjct: 485 FTAPGEHPSWGATNTSAAWLCEHLYTHYLFTLDTAYL-RDVYPVMRESALFFVDMLVEDP 543
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
YL T P+TSPE+ ++ P+GK V STMD I+RE+FS I AA +L+ +E+ LV
Sbjct: 544 RSHYLVTAPTTSPENAYVMPNGKKVSVCAGSTMDNQILRELFSNTIQAARLLKTDEE-LV 602
Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ + RL PT I DG IMEW++
Sbjct: 603 QTLAAYQARLMPTTIGPDGRIMEWLE 628
>gi|423343039|ref|ZP_17320753.1| hypothetical protein HMPREF1077_02183 [Parabacteroides johnsonii
CL02T12C29]
gi|409216715|gb|EKN09698.1| hypothetical protein HMPREF1077_02183 [Parabacteroides johnsonii
CL02T12C29]
Length = 809
Score = 311 bits (797), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 198/609 (32%), Positives = 312/609 (51%), Gaps = 50/609 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T L F+ PA+ + + +P+GNGR G M GGV +E + LNE ++W+G D
Sbjct: 17 NMPEVKTGKSLSYHFDAPAEIWEETLPLGNGRFGLMPDGGVDTEKIVLNEISMWSGSKQD 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L +R L+ G+ EA F P YQLLG++
Sbjct: 77 TDNPQAYYSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSALGQGANAPYGSYQLLGNLV 136
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L +D + YRREL+L+ A A + G V++ RE F+S D + V ++
Sbjct: 137 LNYDYQGSSDSISGYRRELNLDNAIATASFRRGKVKYDREVFTSFADDLGVIHLTADADK 196
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F+ ++ +++ N ++M+G+ P + KG+++++ + + +
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYASRVRVVLPK 249
Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
I D + + + A+LL+ +A+ FD KD + S L +
Sbjct: 250 GGNVIPG--DSTVTIRNASEAILLVSMATDYFD----------KDLDEKVASLLANAEKK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
++ L H+ Y+ LF RV + L S ++ +P ER+ +F D +DP
Sbjct: 298 DFASLKKGHIVAYRSLFGRVDLDLGHSSRE------------DLPIDERLAAFNADPDDP 345
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
SL L FQFGRYLLISS+R G NLQG+W ++ W+ H+NINL+MN+W + N
Sbjct: 346 SLGALYFQFGRYLLISSTRVGLLPPNLQGLWCNTVNTPWNGDYHLNINLQMNHWPAEVAN 405
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 LSELHLPLVEWTKQQVASGEQTAKAYYNAGGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWLC HL+ HY YT+D+++L K YP+L+G + F +D L+E + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASRFFVDMLVEDPRNKYLVTAPTTSPENGY 523
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
P+GK A + STMD I+RE+F+ I AA +L + A +++ RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAANILGI-DSAFAGELVAKRARLMPTTIGK 582
Query: 587 DGSIMEWVQ 595
DG IMEW++
Sbjct: 583 DGRIMEWLE 591
>gi|46118818|ref|XP_384910.1| hypothetical protein FG04734.1 [Gibberella zeae PH-1]
Length = 768
Score = 311 bits (797), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 202/602 (33%), Positives = 304/602 (50%), Gaps = 55/602 (9%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
+E ST L + + PA +++A+PIGNGRLGAMV+G +E L+LNED++W G P D
Sbjct: 5 SEKASTDKSLLLHYAAPASSWSEALPIGNGRLGAMVYGRTSTELLQLNEDSVWYGGPQDR 64
Query: 64 TNPDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYA 120
T DA L+ +R L+ ++ +A T A F PA + Y+ LG +EF H +
Sbjct: 65 TPRDACSNLATLRQLIRDEKHKDAETLAREAFFATPASMRHYEPLGQCTIEF--GHDEKN 122
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y+R LDL T+ + KY V + R+ +S P+ V+ + S ++ S
Sbjct: 123 VSDYKRHLDLATSQSTTKYDYEGVSYRRDVIASFPNNVLAFRFQASAPTRFVVRLNRQSE 182
Query: 181 LDNHS--YVNG----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
++ + Y++ +N II++ GK N+N + + L + GT+
Sbjct: 183 VEGETNEYLDSIRAQDNHIILQATPGGK------NSN------RLALALGVSCKSINGTV 230
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
KV G+ L++ A + + +P + ++ + S + L
Sbjct: 231 --------KVVGN---CLIVNAEECIIAIGAHTTYRSYNPDASALRDVNSALREPWETLV 279
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH DY +LF + ++++ + VP+ ER+ Q++ DP +V L
Sbjct: 280 SRHRRDYGRLFGKTALRM-------------WPDASHVPTEERI---QSNRDPGVVALYH 323
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+GRYLLISSSR + A LQGIWN +P W S +NINL+MNYW + PCNL EC
Sbjct: 324 NYGRYLLISSSRKSAKALPATLQGIWNPSFAPPWGSKFTININLQMNYWPAAPCNLIECA 383
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PL D + ++ G +TA++ Y GW HH TDIWA + + LWP+GG WLC
Sbjct: 384 IPLIDHIERMAEKGKRTAKMMYNCRGWCAHHNTDIWADTDPQDRWMPATLWPLGGVWLCI 443
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDG 531
+ + Y D L R PLLEGC FLLD+LI G YL T+PS SPE+ FI+ G
Sbjct: 444 DVVKMLIYQYDH-MLHIRIAPLLEGCIQFLLDFLIPSACGKYLVTSPSLSPENSFISESG 502
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ S MDM I+R + I + +L K E L + V+ +L +L P +I + G I
Sbjct: 503 ETGTFCEGSVMDMTIVRIALESFIWSTSILNK-EHPLQKDVMATLGKLPPFRINKSGLIQ 561
Query: 592 EW 593
EW
Sbjct: 562 EW 563
>gi|390958734|ref|YP_006422491.1| hypothetical protein Terro_2921 [Terriglobus roseus DSM 18391]
gi|390413652|gb|AFL89156.1| hypothetical protein Terro_2921 [Terriglobus roseus DSM 18391]
Length = 837
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 202/624 (32%), Positives = 300/624 (48%), Gaps = 74/624 (11%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPS-------------------------- 45
P ++ + PA +T+A+PIGNGR+GAMV+GG +
Sbjct: 37 PARLWYRAPAPVWTEALPIGNGRIGAMVFGGANTGPNNGDLEDAAKNADILSGDKTRGQD 96
Query: 46 ETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV------DSGQYAEATA-ASVKLFGHP 98
E L+LNE T+W G D NP A + VR+L+ D + AEA A + +P
Sbjct: 97 EHLQLNESTVWAGSRADRLNPRAAEGFRRVRALLLESKGTDGKKIAEAEKLAQETMIANP 156
Query: 99 ADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPD 156
+ Y +GD+ L S A Y R+LDL T R+ Y G V FTRE F+S PD
Sbjct: 157 KAMPPYSTVGDLYLRSSSSE---AIADYHRQLDLKTGVVRITYRQGPVHFTREIFASAPD 213
Query: 157 QVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG 216
VIV ++ ++S S+D D +G +++ K
Sbjct: 214 HVIVMHLTADRPNAISLTASMDRPGDFAIRASGQRDLVLTQSATTK------------NA 261
Query: 217 IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
F A + + + G + A D+ + + + VL+ AS GP + DP +
Sbjct: 262 THFQA--QARFATHGGAVHADGDRIVVEKAQELTVLIAAASDFKGGPILG-----GDPAT 314
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
L S + +++ L D + R+S+ L P D + +P+ E
Sbjct: 315 LCGDILASAQKKNFAALSAAATKDQFRYIDRMSLSLG--PVDAA--------LAAMPTDE 364
Query: 337 RVKSFQTDEDP-SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
R+K +D L L FQ+ RYLL+ SSRPG ANLQG+W LS W S +N+N
Sbjct: 365 RLKRVAAGQDDFGLQALYFQYARYLLLGSSRPGGLAANLQGLWASGLSNPWGSKWTINVN 424
Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYL----SINGSKTAQVNYLASGWVIHHKTDIWAKS 451
EMNYW + NLSE +PLFD + + S G K A+ Y A G+VIHH TDIW +
Sbjct: 425 TEMNYWLAEAANLSEMHQPLFDLVGMVRDPASGTGVKVAKEYYGAKGFVIHHNTDIWGDA 484
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
G + +WP GGAWL H W+HY +T ++ FL +A+PLL + F LD+L +
Sbjct: 485 EPIDG-YQYGIWPDGGAWLTLHAWDHYAFTGNKQFLRSQAWPLLHDASLFFLDYLTDDGS 543
Query: 512 GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEK 571
G+L T PS SPE+++ DG ++ TMD+ I+RE+F + A +L ++ A +++
Sbjct: 544 GHLVTGPSLSPENKYKLADGTSHSLTMGPTMDIEIVRELFQRTMQAGTILGEDA-AFLQQ 602
Query: 572 VLKSLPRLRPTKIAEDGSIMEWVQ 595
V ++ RL P + G + EW Q
Sbjct: 603 VRQASDRLPPFHVGSLGQLQEWQQ 626
>gi|302917285|ref|XP_003052415.1| hypothetical protein NECHADRAFT_105964 [Nectria haematococca mpVI
77-13-4]
gi|256733355|gb|EEU46702.1| hypothetical protein NECHADRAFT_105964 [Nectria haematococca mpVI
77-13-4]
Length = 765
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 191/587 (32%), Positives = 300/587 (51%), Gaps = 43/587 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PA +++A+P+GNGRLGAM++G +E L+LNED++W G P D T DA + L
Sbjct: 8 LALHYTSPASSWSEALPVGNGRLGAMIYGRTTTELLQLNEDSVWYGGPQDRTPRDAKRNL 67
Query: 73 SDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+ +R L+ + ++ EA T F P + Y+ LG+ +EF+ H +RR LD
Sbjct: 68 AKLRELIRAERHQEAETLVREAFFATPTSMRHYEPLGNCTIEFN--HGVEDVTDFRRRLD 125
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+T+ +Y+ V + R+ +S PD V+ + SE ++ S ++ +
Sbjct: 126 LSTSQNTTEYTCRGVSYRRDVIASFPDNVLAIRFEASEKTRFVVRLTRRSDVEWETNEFL 185
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ +GR P N+N Q + +L + + G + A+ + + +
Sbjct: 186 DSIRAEDGRIILHATPGGRNSN------QLALVLGVSCDANDGEVEAIGN--CLIVNTTR 237
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
V+ + A +++ DP + ++ + +S+L H DY LF R+S
Sbjct: 238 CVIAIGAQTTY---------RVADPEASALHDVDEALKRPWSELAEHHRQDYTNLFGRMS 288
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+++ N +P+ ER+K+ + DP LV L +GRYLLISSSR
Sbjct: 289 LRMG-------------PNAGHIPTDERIKN---NRDPGLVALYHNYGRYLLISSSRNSH 332
Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
+ A LQGIWN +P W S +NINL+MNYW + CNL EC P+ D L ++ G
Sbjct: 333 KALPATLQGIWNPFFAPPWGSKYTININLQMNYWPAAQCNLLECALPVMDLLEKMAERGR 392
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y GW HH TDIW + + +LWP+GG W+C ++ Y D L
Sbjct: 393 KTAETMYGCRGWCAHHNTDIWGDTDPQDTWMPASLWPLGGVWVCIDVFNMLKYEYD-SAL 451
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
R P+LEGC FLLD+LI G YL TNPS SPE+ F++ GK + S +DM I
Sbjct: 452 HSRVAPVLEGCIEFLLDFLIPSACGKYLVTNPSLSPENTFLSESGKPGILCEGSVIDMTI 511
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+R F + + + ++L ++ L +V ++L +L P I DG I EW
Sbjct: 512 VRIAFESFLLSVDILNQDH-PLRSQVQEALEKLPPLTINNDGLIQEW 557
>gi|293370624|ref|ZP_06617176.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292634358|gb|EFF52895.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 811
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 200/591 (33%), Positives = 309/591 (52%), Gaps = 57/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIKREELQLNEETFWAGSPYNNNNPNAIHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PIVRKLIFEGRNKEAQRLIDTNFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQND 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+ C GK + +G++ + E +I GT+ + EG++
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D + + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSANESRRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TGKASQ-----LETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G+KT
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGTKT 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y + GWV HH TD+W G V +A +WP GGAWL H+W+HY +T D++F
Sbjct: 409 ARNMYNSRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGDQEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L PS SPEH V+ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPTYKWLVVAPSVSPEH---------GPVTAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|423290387|ref|ZP_17269236.1| hypothetical protein HMPREF1069_04279 [Bacteroides ovatus
CL02T12C04]
gi|392665774|gb|EIY59297.1| hypothetical protein HMPREF1069_04279 [Bacteroides ovatus
CL02T12C04]
Length = 811
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 199/591 (33%), Positives = 309/591 (52%), Gaps = 57/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + REL+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NGSGFYRELNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKRAMQIPYEKALKNHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLPAG------------KASQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|299147445|ref|ZP_07040510.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298514723|gb|EFI38607.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 826
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 197/594 (33%), Positives = 313/594 (52%), Gaps = 47/594 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 25 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGNPQLEQIQLNEETVSAGSPYQNYNEEAKT 84
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 85 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYPD-HKKV--NNYYRD 141
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 142 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 201
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R + + A L++K G + D L V+G
Sbjct: 202 IYGKKGLRLEGITHGSRYFSGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 251
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 252 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 306
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + + + +++D R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 307 RVTLDLGETSQ-------ANKSMDV-----RIKEFSSSYDPALIALYFQYGRYLLISSSQ 354
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 355 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 414
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 415 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 472
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 473 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 528
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +
Sbjct: 529 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 580
>gi|423215145|ref|ZP_17201673.1| hypothetical protein HMPREF1074_03205 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692408|gb|EIY85646.1| hypothetical protein HMPREF1074_03205 [Bacteroides xylanisolvens
CL03T12C04]
Length = 816
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 197/594 (33%), Positives = 313/594 (52%), Gaps = 47/594 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 15 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 74
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 75 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 131
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 132 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 191
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R + + A L++K G + D L V+G
Sbjct: 192 IYGKKGLRLEGITHGSRYFSGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 241
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 242 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 296
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + + + +++D R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 297 RVTLDLGETSQ-------ANKSMDV-----RIKEFSSSYDPALIALYFQYGRYLLISSSQ 344
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 345 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 404
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 405 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 462
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 463 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 518
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWVQ 595
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +
Sbjct: 519 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFE 570
>gi|317474862|ref|ZP_07934132.1| glycoside hydrolase [Bacteroides eggerthii 1_2_48FAA]
gi|316909000|gb|EFV30684.1| glycoside hydrolase [Bacteroides eggerthii 1_2_48FAA]
Length = 801
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 210/590 (35%), Positives = 312/590 (52%), Gaps = 48/590 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+TLW G P + NP+A + +
Sbjct: 12 KLWYDEPAQVWTEALPLGNGRLGAMVYGTPAMENIQLNEETLWAGRPNNNANPNALEYIP 71
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ G + + F H +Y + Y RE
Sbjct: 72 KVRQLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGHLRIAFP-GHTRYTD--YYRE 125
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V Y+V V + RE +S DQV++ ++S S G ++ N L S +
Sbjct: 126 LSLDSARTVVCYTVDGVRYRRETITSLADQVVMVRLSASRPGMITCNAHLTSPHQDVMIA 185
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ ++I + G ++ ++ KG + F + ++ +G S+ D L VE
Sbjct: 186 SEGDEITLSG---------VSSWHEGLKGKVLFQGRMAVRT---QGGHSSCADGVLAVEK 233
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D A L +++F +N D + S + L + SY HL Y+
Sbjct: 234 ADEATFYLSIATNF----VNYKDITGNEVERSKNYLHAALKHSYRQSLLEHLAIYKSYMD 289
Query: 307 RVSIQLSRSP-KDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L D+ TD RV++F+ +D LV F+FGRYLLI SS
Sbjct: 290 RVDLDLGHDRYADVTTDM-------------RVQNFRETQDDFLVATYFRFGRYLLICSS 336
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PG Q ANLQGIWN+ L P+WDS NINLEMNYW + NLSE +PL ++ +S
Sbjct: 337 QPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPAEVTNLSELHQPLMQLISEVSET 396
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D
Sbjct: 397 GRETAKTMYGAEGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDVG 455
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL + AYP+++ A F ++ E +L PS SPE+ GK + + TMD
Sbjct: 456 FL-RTAYPIMKEAAKFFDQIMVKEPVHNWLVVCPSNSPENVHAGSKGK-STTAPGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I ++++ +I+ A +L +E L + L + P ++ G + EW+
Sbjct: 514 QLIFDLWNQVITTARLLNTDE-TLAVHYEQRLREMAPMQVGRWGQLQEWM 562
>gi|405378422|ref|ZP_11032344.1| hypothetical protein PMI11_02312 [Rhizobium sp. CF142]
gi|397325094|gb|EJJ29437.1| hypothetical protein PMI11_02312 [Rhizobium sp. CF142]
Length = 750
Score = 310 bits (794), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 212/584 (36%), Positives = 300/584 (51%), Gaps = 42/584 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G SE L++N+ T W G P NPD+ L +R
Sbjct: 10 YDAPARLWTDALPLGNGRLGAMVFGDPVSERLQINDSTFWAGGPYRPVNPDSYGHLEKIR 69
Query: 77 SLVDSGQYAEATAASV-KLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ +G YAEA A + L P YQ +GD+ ++F S +YRR LDL+TA
Sbjct: 70 ELIFAGHYAEAEAMAEEHLMARPIKQMSYQPIGDLHIDFLHSQ---TIGSYRRTLDLDTA 126
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y + F RE F S D V+V ++S G++ +SLDS + +
Sbjct: 127 IATTSYVADGITFFREAFISTVDGVLVLRLSADRPGAIRCRISLDSPQQGQLFDQDAAGL 186
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
G GK A A ++F+ + + + G + + V+ +D V+L
Sbjct: 187 TFSGT--GKAEWGIAAA------LRFAFGIRVI---NTGGSLSSSSGIISVDSTDELVIL 235
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A++SF D DP + L S + H+ ++Q+LF +I L
Sbjct: 236 LDAATSFR----RFDDVSGDPDGAITARLSKATGHSIEAMRRDHIIEHQRLFRAFAIDLG 291
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
T S P+ R+ F EDP+L L QFGRYL+I+SSRPGTQ AN
Sbjct: 292 ------TTQAASH------PTDRRIAGFADGEDPALAALYVQFGRYLMIASSRPGTQPAN 339
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIWNE++ P W S NINL+MNYW P NL +C PL + L+ G +TAQV+
Sbjct: 340 LQGIWNEEVDPPWGSKYTANINLQMNYWLPAPANLPQCIVPLVEMAEELAEAGRETAQVH 399
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
Y A GWV+HH TD+W + G W LWP GGAWL T L + +Y D D L +R +P
Sbjct: 400 YRARGWVMHHNTDLWRATGPIDG-AKWGLWPTGGAWLMTQLLDLSDYLDDADRLRRRLFP 458
Query: 494 LLEGCASFLLDWL--IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
+ + A F+ D L + G + YL T PS SPE+ + P G C MD IIR+
Sbjct: 459 VAKAAAEFVFDALASLPGTN-YLVTTPSLSPEN--VHPHGASICA--GPAMDNQIIRDFL 513
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ + A + ED V ++ + LPRL P +I G + EW++
Sbjct: 514 NLLRPIATSI-GGEDEFVSEIDRVLPRLPPDRIGSAGQLQEWLE 556
>gi|408387708|gb|EKJ67420.1| hypothetical protein FPSE_12405 [Fusarium pseudograminearum CS3096]
Length = 768
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 196/602 (32%), Positives = 305/602 (50%), Gaps = 55/602 (9%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
+E +T L + + PA +++A+PIGNGRLGAMV+G +E L+LNED++W G P D
Sbjct: 5 SEKANTDKSLLLHYAAPASSWSEALPIGNGRLGAMVYGRASTELLQLNEDSVWYGGPQDR 64
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYA 120
T DA L+ +R L+ ++ +A A A F PA + Y+ LG +EF H +
Sbjct: 65 TPRDAYSNLATLRQLIRDEKHKDAEALAREAFFATPASMRHYEPLGQCTIEF--GHDERI 122
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y+R LDL T+ + KY V + R+ +S P+ V+ + S ++ S
Sbjct: 123 VSDYKRHLDLATSQSTTKYDYEGVTYRRDVIASFPNNVLAIRFQASAPTRFVVRLNRQSE 182
Query: 181 LDNHS--YVNG----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
++ + Y++ +N II++ GK N+N + + L + + G +
Sbjct: 183 VEGETNEYLDSIRAQDNHIILQATPGGK------NSN------RLALALGVSCKSNNGNV 230
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ + + ++ ++ + A +++ +P + ++ + S + +L
Sbjct: 231 KVVGN--CLIVNTEECIIAIGAHTTY---------RSYNPDASALRDVNSALREPWENLV 279
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH DY +LF + ++++ + VP+ ER+ Q++ DP L+ L
Sbjct: 280 SRHRQDYGRLFSKTALRM-------------WPDASHVPTDERI---QSNRDPGLIALYH 323
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+ RYLLISSSR + A LQGIWN +P W S +NINL+MNYW + CNL EC
Sbjct: 324 NYSRYLLISSSRKSAKALPATLQGIWNPSFAPPWGSKFTININLQMNYWPAASCNLIECA 383
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PL D + ++ G +TA+V Y GW HH TDIWA + + LWP+GG WLC
Sbjct: 384 VPLIDHIERMAQKGKRTAKVMYNCRGWCAHHNTDIWADTDPQDRWMPATLWPLGGVWLCI 443
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDG 531
+ + Y D L R PLLEGC FLLD+LI G YL TNPS SPE+ FI+ G
Sbjct: 444 DVVKMLIYQYDH-MLHIRIAPLLEGCIQFLLDFLIPSACGKYLVTNPSLSPENSFISESG 502
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ S MDM I+R + I + +L K E L + V+ +L +L P +I + G I
Sbjct: 503 ETGTFCEGSVMDMTIVRIALESFIWSTSILNK-EHPLQKDVMATLGKLPPFRINKSGLIQ 561
Query: 592 EW 593
EW
Sbjct: 562 EW 563
>gi|298481665|ref|ZP_06999856.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
gi|298272206|gb|EFI13776.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
Length = 812
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 198/591 (33%), Positives = 311/591 (52%), Gaps = 56/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAM++GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMIYGGIEREELQLNEETFWAGSPYNNNNPNAIHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKKAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + K +T +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLPAAGKASQLET-----------PKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 349
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 350 QSANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 409
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 410 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 465
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 466 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 514
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 515 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 564
>gi|423575217|ref|ZP_17551336.1| hypothetical protein II9_02438 [Bacillus cereus MSX-D12]
gi|401209825|gb|EJR16582.1| hypothetical protein II9_02438 [Bacillus cereus MSX-D12]
Length = 940
Score = 309 bits (792), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 84 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 199
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 200 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 255
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 256 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 299
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 300 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 356
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 357 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 403
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 404 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 463
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 464 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 522
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 523 LWEHYKFTNDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 573
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 574 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 630
Query: 591 MEW 593
EW
Sbjct: 631 QEW 633
>gi|345515268|ref|ZP_08794774.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|229434306|gb|EEO44383.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
Length = 814
Score = 309 bits (792), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 202/589 (34%), Positives = 320/589 (54%), Gaps = 46/589 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NP+A + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPNALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRQLVFEGKYLEAQTLATEKIMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVS 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G A D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTAR---SQGGTQACRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYVTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ D+ D + D RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I ++++ II+ A +L + + + + L + P +I G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATHLEQRLKEMAPMQIGRWGQLQEWM 575
>gi|212694638|ref|ZP_03302766.1| hypothetical protein BACDOR_04169 [Bacteroides dorei DSM 17855]
gi|237711097|ref|ZP_04541578.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|265750683|ref|ZP_06086746.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|423239195|ref|ZP_17220311.1| hypothetical protein HMPREF1065_00934 [Bacteroides dorei
CL03T12C01]
gi|212663139|gb|EEB23713.1| hypothetical protein BACDOR_04169 [Bacteroides dorei DSM 17855]
gi|229454941|gb|EEO60662.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|263237579|gb|EEZ23029.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|392646982|gb|EIY40688.1| hypothetical protein HMPREF1065_00934 [Bacteroides dorei
CL03T12C01]
Length = 814
Score = 309 bits (792), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 202/589 (34%), Positives = 319/589 (54%), Gaps = 46/589 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NP+A + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPNALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRQLVFEGKYLEAQTLATEKIMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVS 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G A D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTAR---SQGGTQACRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYVTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L VT + RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSLDLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I ++++ II+ A +L + + + + L + P +I G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATHLEQRLKEMAPMQIGRWGQLQEWM 575
>gi|342884136|gb|EGU84463.1| hypothetical protein FOXB_05018 [Fusarium oxysporum Fo5176]
Length = 767
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 198/601 (32%), Positives = 304/601 (50%), Gaps = 47/601 (7%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M ES+ T + + + PA +++A+PIGNGRLGAMV+G +E L+LNED++W G P
Sbjct: 1 MDEGESSDTDKGMLLHYAAPASSWSEALPIGNGRLGAMVYGRTSTELLQLNEDSVWYGGP 60
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHL 117
D T DA L+ +R L+ ++ +A F P+ + Y+ LG ++EFD H
Sbjct: 61 QDRTPRDAHSHLATLRQLIRDEKHKDAEDLVKEAFFATPSSMRHYEPLGQCKIEFD--HD 118
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ Y R LDLNT+ +Y + R+ +S PD V+ ++ SE F V L
Sbjct: 119 ESEVTDYTRYLDLNTSQVTTRYKCDGRSYRRDIIASFPDSVLAVQVQASEKSR--FVVRL 176
Query: 178 DSLLDNHSYVNG--NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
+ +N N ++ + R IP AN+N + S +L + GT+
Sbjct: 177 NRQSENEGETNEYLDSIFAQDSRIILNAIPGGANSN------RLSLVLGVSCGPGDGTVK 230
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A+ + + + V+ + A ++F K+DP ++ + + L
Sbjct: 231 AVGN--CLIVNATKCVIAIGAHTTF---------RKEDPERSALLNVDDALRRPWDVLVR 279
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
RH DY LF R+S++L + + +P+ +R+ S + DP LV L
Sbjct: 280 RHRSDYTNLFGRMSLRLF-------------PDANHLPTNKRIVS---NRDPGLVALYHN 323
Query: 356 FGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
+GRYLLISSSR + A LQGIWN SP W S +NINL+MNYW ++PC+L +C
Sbjct: 324 YGRYLLISSSRNSDKALPATLQGIWNPSFSPPWGSKFTININLQMNYWPAIPCSLIQCAI 383
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PL + L ++ G +TA++ Y GW HH TDIWA + + +WP+GGAWLCT
Sbjct: 384 PLINLLERMAERGKRTAKMMYNCKGWCAHHNTDIWADTDPQDRWMPATIWPLGGAWLCTD 443
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGK 532
+ Y + L R P+LEGC FLLD+LI G YL TNPS SPE+ F++ G+
Sbjct: 444 VVRMLIYQYE-PTLHCRIAPILEGCVQFLLDFLIPSACGRYLVTNPSLSPENSFVSQSGE 502
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
S +DM I+R + + + +L+ + + + +L +L P + +DG I E
Sbjct: 503 TGIFCEGSVIDMTIVRIALESFLWSISILDPDHPRRNDAI-AALDKLPPMSLNKDGLIQE 561
Query: 593 W 593
W
Sbjct: 562 W 562
>gi|423228044|ref|ZP_17214450.1| hypothetical protein HMPREF1063_00270 [Bacteroides dorei
CL02T00C15]
gi|423243307|ref|ZP_17224383.1| hypothetical protein HMPREF1064_00589 [Bacteroides dorei
CL02T12C06]
gi|392637080|gb|EIY30955.1| hypothetical protein HMPREF1063_00270 [Bacteroides dorei
CL02T00C15]
gi|392645314|gb|EIY39042.1| hypothetical protein HMPREF1064_00589 [Bacteroides dorei
CL02T12C06]
Length = 814
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 202/589 (34%), Positives = 319/589 (54%), Gaps = 46/589 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NP+A + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPNALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRQLVFEGKYLEAQTLATEKIMTKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVS 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G A D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTAR---SQGGTQACRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYVTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L VT + RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSLDLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I ++++ II+ A +L + + + + L + P +I G + EW+
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATHLEQRLKEMAPMQIGRWGQLQEWM 575
>gi|423482848|ref|ZP_17459538.1| hypothetical protein IEQ_02626 [Bacillus cereus BAG6X1-2]
gi|401143214|gb|EJQ50752.1| hypothetical protein IEQ_02626 [Bacillus cereus BAG6X1-2]
Length = 1156
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 204/600 (34%), Positives = 318/600 (53%), Gaps = 69/600 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG---DYT--NP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P DYT N
Sbjct: 47 LTLWYNQPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPNSTSDYTYGNR 106
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
D A L +R V G + A S + FG YQ GDI L+F+ +
Sbjct: 107 DGAASHLDSIREKVSKGDKSGAEEESSQFLTGLQNGFGS----YQNFGDIYLDFNMPD-Q 161
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ YRREL+LN A V Y+ +V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 162 ASFSNYRRELNLNEGIATVSYNYKDVQYNREYFTSYPDRVMVMRLTASESKQLSLDVRPT 221
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++A E
Sbjct: 222 SA-QGGEITSIDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLTA-E 264
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I N SY L H+
Sbjct: 265 NGKIKVANADSLTIIMTAATDYENKY--PTYKGEDPHQKVEKIMAAISNKSYEVLKYTHI 322
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ L EL FQ+GR
Sbjct: 323 KDYHSLFNRVSLDLGGEKP-------------SVPTNELLASYNKQNSKYLEELFFQYGR 369
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL D+
Sbjct: 370 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 429
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ +LWE
Sbjct: 430 VDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNLWE 488
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE + +
Sbjct: 489 HYKFTDDKQYLQEKIYPILKEAAEFHSKFLVEDQNKKLVVSPCWSPE---------IGGI 539
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ ++ D L K + P P +I G + EW
Sbjct: 540 SNGCAFDQQLVYELFSNVIEASEVLQTDKVFRDELKAKRDRLFP---PIQIGRYGQVQEW 596
>gi|218129080|ref|ZP_03457884.1| hypothetical protein BACEGG_00654 [Bacteroides eggerthii DSM 20697]
gi|217988715|gb|EEC55034.1| hypothetical protein BACEGG_00654 [Bacteroides eggerthii DSM 20697]
Length = 828
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 210/590 (35%), Positives = 312/590 (52%), Gaps = 48/590 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+TLW G P + NP+A + +
Sbjct: 39 KLWYDEPAQVWTEALPLGNGRLGAMVYGTPAMENIQLNEETLWAGRPNNNANPNALEYIP 98
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ G + + F H +Y + Y RE
Sbjct: 99 KVRQLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGHLRIAFP-GHTRYTD--YYRE 152
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V Y+V V + RE +S DQV++ ++S S G ++ N L S +
Sbjct: 153 LSLDSARTVVCYTVDGVRYRRETITSLADQVVMVRLSASRPGMITCNAHLTSPHQDVMIA 212
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ ++I + G ++ ++ KG + F + ++ +G S+ D L VE
Sbjct: 213 SEGDEITLSG---------VSSWHEGLKGKVLFQGRMAVRT---QGGHSSCADGVLAVEK 260
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D A L +++F +N D + S + L + SY HL Y+
Sbjct: 261 ADEATFYLSIATNF----VNYKDITGNEVERSKNYLHAALKHSYRQSLLEHLAIYKSYMD 316
Query: 307 RVSIQLSRSP-KDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L D+ TD RV++F+ +D LV F+FGRYLLI SS
Sbjct: 317 RVDLDLGPDRYADVTTDM-------------RVQNFRETQDDFLVATYFRFGRYLLICSS 363
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PG Q ANLQGIWN+ L P+WDS NINLEMNYW + NLSE +PL ++ +S
Sbjct: 364 QPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPAEVTNLSELHQPLMQLISEVSET 423
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D
Sbjct: 424 GRETAKTMYGAEGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDVG 482
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL + AYP+++ A F ++ E +L PS SPE+ GK + + TMD
Sbjct: 483 FL-RTAYPIMKEAAKFFDQIMVKEPVHNWLVVCPSNSPENVHAGSKGK-STTAPGCTMDN 540
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+I ++++ +I+ A +L +E L + L + P ++ G + EW+
Sbjct: 541 QLIFDLWNQVITTARLLNTDE-TLAVHYEQRLREMAPMQVGRWGQLQEWM 589
>gi|225157647|ref|ZP_03725037.1| alpha/beta hydrolase fold-3 domain protein [Diplosphaera
colitermitum TAV2]
gi|224802714|gb|EEG20967.1| alpha/beta hydrolase fold-3 domain protein [Diplosphaera
colitermitum TAV2]
Length = 852
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 197/618 (31%), Positives = 309/618 (50%), Gaps = 80/618 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+P+GNGRLGAM++G + SE L+LNED+LW G P D NPD + L +R L+ G+ A
Sbjct: 25 ALPVGNGRLGAMIFGDIVSERLQLNEDSLWNGGPRDRRNPDTREHLPVLRQLLADGRLAA 84
Query: 87 ATAASVKLFGHPAD---VYQLLGDIELEF-----------DDSHLKYAEET--------- 123
A + D Y+ L D+ L F D+ L T
Sbjct: 85 AHELVHDVMAGIPDSQRCYEPLADLFLNFEHPGAPVSVSADEMALAAGYTTPRFDPSLLS 144
Query: 124 -YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS--- 179
YRR LDL TA A V Y++ ++ ++R +S DQVI ++ GSL+ V ++
Sbjct: 145 HYRRALDLRTAVASVDYTLNSIAYSRRMVASAVDQVIAIQLRAGRPGSLTLRVRMERGPR 204
Query: 180 ------LLDNHSYVN----GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
D +V+ + +++ GR G+ +G++F+ L +IS
Sbjct: 205 NSYSTRYADTVGFVSDACSSSPTLLLRGRAGGE------------EGVRFATGLRAQISG 252
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
G + + + L ++G+D L+L A++SF + DP + + ++
Sbjct: 253 --GALRHI-GETLYIDGADSVTLVLAAATSF---------READPAASVIERTRAALARG 300
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRS-PKDIVTDTCSEENIDTVPSAERVK-SFQTDEDP 347
+ + H +Y+ F R S+ L + T T T+P+ ER++ + +T DP
Sbjct: 301 WEKILADHEREYRSFFDRASLTLGAGFASEAPTATA------TLPTDERLRHAHETSGDP 354
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
+L L F + RYLLISSSRPG+ +NLQG+WN D P+W S +NIN EMNYW + P N
Sbjct: 355 ALASLYFNYARYLLISSSRPGSLPSNLQGLWNGDFWPSWGSKYTININTEMNYWIAEPAN 414
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
L++C +PLFD L + +G +TA+V Y G+V+HH TDIWA + + W +GG
Sbjct: 415 LADCHKPLFDHLERMVESGRETARVMYGCRGFVVHHNTDIWADTCPTDRNAGASYWLLGG 474
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
AW H W+ +++ D L AY L+ A F LD+L+E G L +PS SPE+ +
Sbjct: 475 AWFVLHAWDRFDFDRDPASLAA-AYERLKEAALFFLDFLVEDARGRLVISPSCSPENTYR 533
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK----------NEDALVEKVLKSLP 577
P+G+ + STMD ++ +F + AA +LE+ +E + +V +
Sbjct: 534 LPNGEAGVLCVGSTMDSQMLAILFRRTLQAARLLEQRNATAGGGGGDEREFLAQVAAAAE 593
Query: 578 RLRPTKIAEDGSIMEWVQ 595
RL I G ++EW++
Sbjct: 594 RLPKMTIGRHGQLLEWLE 611
>gi|217960596|ref|YP_002339160.1| alpha-fucosidase [Bacillus cereus AH187]
gi|375285103|ref|YP_005105542.1| hypothetical protein BCN_3009 [Bacillus cereus NC7401]
gi|423352889|ref|ZP_17330516.1| hypothetical protein IAU_00965 [Bacillus cereus IS075]
gi|423567917|ref|ZP_17544164.1| hypothetical protein II7_01140 [Bacillus cereus MSX-A12]
gi|217068135|gb|ACJ82385.1| alpha-fucosidase [Bacillus cereus AH187]
gi|358353630|dbj|BAL18802.1| conserved hypothetical protein [Bacillus cereus NC7401]
gi|401090895|gb|EJP99046.1| hypothetical protein IAU_00965 [Bacillus cereus IS075]
gi|401211256|gb|EJR18004.1| hypothetical protein II7_01140 [Bacillus cereus MSX-A12]
Length = 1193
Score = 309 bits (791), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 84 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 199
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 200 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 255
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 256 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 299
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 300 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 356
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 357 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 403
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 404 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 463
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 464 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 522
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 523 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 573
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 574 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 630
Query: 591 MEW 593
EW
Sbjct: 631 QEW 633
>gi|160885438|ref|ZP_02066441.1| hypothetical protein BACOVA_03438 [Bacteroides ovatus ATCC 8483]
gi|423294310|ref|ZP_17272437.1| hypothetical protein HMPREF1070_01102 [Bacteroides ovatus
CL03T12C18]
gi|156109060|gb|EDO10805.1| hypothetical protein BACOVA_03438 [Bacteroides ovatus ATCC 8483]
gi|392675501|gb|EIY68942.1| hypothetical protein HMPREF1070_01102 [Bacteroides ovatus
CL03T12C18]
Length = 811
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 198/591 (33%), Positives = 309/591 (52%), Gaps = 57/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NGSGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKRAMQIPYEKALKNHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLPAG------------KASQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|423373036|ref|ZP_17350376.1| hypothetical protein IC5_02092 [Bacillus cereus AND1407]
gi|401097368|gb|EJQ05391.1| hypothetical protein IC5_02092 [Bacillus cereus AND1407]
Length = 1193
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 84 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 199
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 200 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 255
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 256 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 299
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 300 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 356
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 357 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 403
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 404 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 463
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 464 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 522
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 523 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 573
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 574 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 630
Query: 591 MEW 593
EW
Sbjct: 631 QEW 633
>gi|229139796|ref|ZP_04268363.1| hypothetical protein bcere0013_29050 [Bacillus cereus BDRD-ST26]
gi|228643676|gb|EEK99940.1| hypothetical protein bcere0013_29050 [Bacillus cereus BDRD-ST26]
Length = 1172
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 63 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 122
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 123 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 178
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 179 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 234
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 235 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 278
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 279 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 335
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 336 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 382
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 383 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 442
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 443 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 501
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 502 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 552
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 553 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 609
Query: 591 MEW 593
EW
Sbjct: 610 QEW 612
>gi|300726087|ref|ZP_07059544.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
gi|299776557|gb|EFI73110.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
Length = 824
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 199/594 (33%), Positives = 309/594 (52%), Gaps = 48/594 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA + +A+PIGNGR+ M++GGV SE ++LNE+T+W G P L
Sbjct: 22 LKLWYNHPASIWQEALPIGNGRIAGMIYGGVQSEEIQLNEETVWGGGPHSNVRAIPVDTL 81
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ GQ A A + F G Y+ +G ++++F+ + YRRELDL
Sbjct: 82 RQVRQLIFDGQEKAAHAMINRNFMTGQHGMPYESVGSLKIDFN--YRAGDTRNYRRELDL 139
Query: 131 NTATARVKYSVGNVEFTREHFS--SNPDQ---VIVTKISGSESGSLSFNVSLDSLLDNHS 185
N A + + VG V + RE F+ S+P+ V+V +++ S+ GS+SF + S L +
Sbjct: 140 NRAVSTTTFQVGKVTYKREVFTTFSSPEHHANVMVIRLTASKRGSISFKLHYTSPLRHAI 199
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKKLK 243
+N + M G D +GI+ A ++ + G I + ++
Sbjct: 200 TLNQQGDLCMLGYGA------------DHEGIKGVIQASTVTRVLNIGGKIKR-NGESIE 246
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V ++ + L ++F + ++ D +++ LQ+ +Y L +H YQ
Sbjct: 247 VTNANQVEIRLAMGTNFK----SYNEVSLDAKAQTFGELQTASPYTYEALLQQHEQVYQN 302
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F RVS+ L + N ++P+ ER++ FQ DP+L L+FQ+GRYLLIS
Sbjct: 303 QFGRVSLDLGEN-----------TNETSLPTDERLRRFQQSNDPALATLVFQYGRYLLIS 351
Query: 364 SSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS+ ++ ANLQGIWN+D++ WD +NIN EMNYW + NLS+ + PL+ + L
Sbjct: 352 SSQIDSRTPANLQGIWNKDMNAPWDGKYTININTEMNYWPAQTTNLSDNEWPLYRLVQNL 411
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G + A Y A G++ HH TDIWA + G W +WP G WL THLW+ Y +T
Sbjct: 412 SKTGVEAASKMYGAKGYMAHHNTDIWATTGMVDG-ATWGIWPNGAGWLSTHLWQRYLFTG 470
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+ FL + YP L+G A F L ++ GY+ T PS SPEH P GK V+ T
Sbjct: 471 DQQFL-RTFYPQLKGAADFYLTAMVRHPKYGYMVTVPSISPEH---GPHGK-PSVTAGCT 525
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
MD I +V + A EVL ++E A + + + + +L P ++ + EW++
Sbjct: 526 MDNQIAFDVLQDALQATEVLGESE-AYADSLRQHIRQLAPMQVGRYCQLQEWLE 578
>gi|222096655|ref|YP_002530712.1| alpha-fucosidase [Bacillus cereus Q1]
gi|221240713|gb|ACM13423.1| alpha-fucosidase [Bacillus cereus Q1]
Length = 1172
Score = 308 bits (790), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 205/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 63 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 122
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 123 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 178
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 179 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 234
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 235 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 278
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 279 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 335
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 336 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 382
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 383 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 442
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 443 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 501
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 502 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 552
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 553 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 609
Query: 591 MEW 593
EW
Sbjct: 610 QEW 612
>gi|325678667|ref|ZP_08158277.1| hypothetical protein CUS_6446 [Ruminococcus albus 8]
gi|324109717|gb|EGC03923.1| hypothetical protein CUS_6446 [Ruminococcus albus 8]
Length = 761
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 203/589 (34%), Positives = 302/589 (51%), Gaps = 49/589 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
KI F PA+ + A+P+GNGR+G M +G E ++LNED++++G NP A + L
Sbjct: 10 KIWFKAPAEDWNVALPVGNGRIGGMCFGQPLYEKIQLNEDSIFSGGQRKRNNPSARENLE 69
Query: 74 DVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ + AEA ++ F G P + Y LGD+ ++ HL+ E R LDL
Sbjct: 70 KVRQLLKEEKIAEAEKIVLEAFCGTPVNQRHYMPLGDLVIQ---HHLESECEYKCRSLDL 126
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---LLDNHSYV 187
A +YS+ V + R S P QV+ I+ +S S+S ++LD D++S +
Sbjct: 127 ENAVCTAEYSIKGVNYVRRVICSEPAQVMAINITADKSASISLKLTLDGRDDYFDDNSPM 186
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N + I+ G C G+ GI F+A L ++ G++ + E
Sbjct: 187 N-DTDILYYGGCGGE------------DGINFAAYL--RVIGVGGSVHRW-GSSIVTEDC 230
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +L+ +S+ SD KK + ++A + + +L H++DY+ F R
Sbjct: 231 DSVTILIGVQTSY-----RVSDYKKSAELDVITAAEK----DFEELLKEHIEDYRSYFDR 281
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
+IV D E D++P+ ER+K + D LV L F FGRYL+IS SR
Sbjct: 282 T---------EIVFD---EGGNDSLPTDERLKLVKEGGVDNGLVSLYFDFGRYLMISGSR 329
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
GT NLQGIWN+D+ P W VNIN EMNYW + ++ + PLFD + + NG
Sbjct: 330 EGTLPLNLQGIWNKDMWPAWGCRFTVNINTEMNYWLAEVADMGDLHMPLFDHIERMRPNG 389
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TA+ Y G+V HH TDIW ++ + W G AWLCTH+WEH+ Y+ DR+F
Sbjct: 390 RATAREMYGCGGFVCHHNTDIWGDTAPQDLWMPGTQWVTGAAWLCTHIWEHWLYSRDREF 449
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L ++ Y L+ + F +D+LI+ G L T PS SPE+ +I G V +MD I
Sbjct: 450 LAEK-YDTLKEASLFFVDFLIDNGKGQLVTCPSVSPENTYITASGAKGSVCMGPSMDSQI 508
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I E+F+A+I A EVL + D EK+ +L +I + G IMEW +
Sbjct: 509 IYELFTAVIEAGEVLGIDAD-YREKLKGMREKLPKPQIGKYGQIMEWAE 556
>gi|423215045|ref|ZP_17201573.1| hypothetical protein HMPREF1074_03105 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692308|gb|EIY85546.1| hypothetical protein HMPREF1074_03105 [Bacteroides xylanisolvens
CL03T12C04]
Length = 811
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 199/591 (33%), Positives = 312/591 (52%), Gaps = 57/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PIVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESRRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TGKTSQ-----LETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|319936285|ref|ZP_08010703.1| hypothetical protein HMPREF9488_01536 [Coprobacillus sp. 29_1]
gi|319808661|gb|EFW05205.1| hypothetical protein HMPREF9488_01536 [Coprobacillus sp. 29_1]
Length = 749
Score = 308 bits (789), Expect = 5e-81, Method: Compositional matrix adjust.
Identities = 192/587 (32%), Positives = 304/587 (51%), Gaps = 43/587 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ FN PA + +A+P+GNG LGAMV+G E + +NED+L++G P + NP+ L
Sbjct: 6 KLIFNKPALQWEEAMPLGNGYLGAMVFGQTQKELICMNEDSLYSGGPIERGNPNTLDHLD 65
Query: 74 DVRSLVDSGQYAEATAASVKLF----GHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R+L+ G+ EA + F HP YQ LG + +EF ++ + Y++ LD
Sbjct: 66 EMRTLLLDGKVEEAQKKAPNYFYATTPHPRH-YQPLGQVWMEFHHQNV----QDYQKVLD 120
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L + ++Y NVE+ RE F S P+QV V KI S++ L+F D L G
Sbjct: 121 LKNSIGSIQYRYNNVEYQRECFISYPNQVFVYKIKASQNQQLNF----DLYLTRRDIRPG 176
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
++ ++ K + N + K GI ++ +++ D G + +L +E +
Sbjct: 177 RSESYVDDIHIEKDYLYLSGYNGNQKNGISYTMATTVQLKD--GCLKKY-GSRLVIENAT 233
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A++ +V +S+ +P L SY +L H+ DYQ F ++
Sbjct: 234 EAIVYVVGRTSY---------RSHNPFQWCQKQLDKTLLKSYRNLKQDHIRDYQNYFDQL 284
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSA-ERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
+ L EN+ ++P +++K Q D D L+E F FGRYLLISSSR
Sbjct: 285 ELTLGDH---------KNENMMSIPERLQKMKEGQIDLD--LIETYFHFGRYLLISSSRE 333
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G+ ANLQGIWN + P W S +NIN++MNYW + LS PL + G
Sbjct: 334 GSLAANLQGIWNGEFEPPWGSRYTININIQMNYWLAEKTGLSRLHLPLMQLQKIMLPRGQ 393
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
K A+ Y G HH TDIW + V LWPMG WL H++EHY YT +++F+
Sbjct: 394 KIAKEMYGCRGTCAHHNTDIWGDCAPADYYVPSTLWPMGSLWLSLHIFEHYQYTHNQEFI 453
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ +P+L+ A F LD++ + +G+ T PS SPE+ ++ DG+ A V S +MD+ ++
Sbjct: 454 LE-YFPILKENALFFLDYMFKDANGFYATGPSVSPENAYMTQDGQAATVCLSPSMDIQLL 512
Query: 548 REVFSAIISAAEVLEKNE-DALVEKVLKSLPRLRPTKIAEDGSIMEW 593
RE F++ + + L +++ +A + + L+ LP P +I + G IMEW
Sbjct: 513 REFFTSYLQLLKELNRHDLEAEINEYLEKLP---PIQIGKYGQIMEW 556
>gi|336415344|ref|ZP_08595684.1| hypothetical protein HMPREF1017_02792 [Bacteroides ovatus
3_8_47FAA]
gi|335940940|gb|EGN02802.1| hypothetical protein HMPREF1017_02792 [Bacteroides ovatus
3_8_47FAA]
Length = 648
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 198/591 (33%), Positives = 309/591 (52%), Gaps = 57/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNFPLVHKVNVQND 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+ C GK + +G++ + E +I GT+ + EG++
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESRRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L TD S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TDKTSQ-----LETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSATGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|302669281|ref|YP_003832431.1| glycoside hydrolase family 95 Gh95A [Butyrivibrio proteoclasticus
B316]
gi|302396945|gb|ADL35849.1| glycoside hydrolase family 95 Gh95A [Butyrivibrio proteoclasticus
B316]
Length = 714
Score = 307 bits (787), Expect = 9e-81, Method: Compositional matrix adjust.
Identities = 195/610 (31%), Positives = 301/610 (49%), Gaps = 86/610 (14%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + AK + A+P+GNG +GAM +GG + +LN D++W P D NPDA +++
Sbjct: 3 RLWYKEAAKDWNSALPLGNGFMGAMCFGGTLIDRFQLNNDSIWWSGPRDRINPDAKESIP 62
Query: 74 DVRSLVDSGQYAEAT-AASVKLFGHP--ADVYQLLGDIEL--------------EFDDSH 116
+R L+ G+ ++A A+ + G P Y+ LGD+ + E
Sbjct: 63 VIRRLIREGRISDAEDLANEAMAGIPEYQSHYEPLGDLFIIPEGKERIQILGIREHWSGQ 122
Query: 117 LKYAEET--YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS------ES 168
L EE Y+RELD+ V Y+ V+F RE F SN D+V+ K GS E
Sbjct: 123 LNRIEEIPDYKRELDIEKGIHTVSYTKDGVKFCRESFISNVDKVMAIKCLGSRLRIFAER 182
Query: 169 GSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
G V Y N + MEGR G++F ++ +
Sbjct: 183 GDQCEKV----------YKLSENTLCMEGRTGAD-------------GVRFCMVIRVVNG 219
Query: 229 DD--RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
+ RG + + D A +L+ + + F +DP ++++ L + +
Sbjct: 220 NPYIRGRM---------LHADDDAEILIASQTDF---------YNEDPVADAVRTLDAAQ 261
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDE 345
L Y +L RH+ D Q+L R ++++ +N D +P+ +R+++ +
Sbjct: 262 KLGYDELKKRHVCDVQELMDRCTLEID------------SDNRDNIPTDKRLQAVAEGGT 309
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
D L+ LLF +GRYLLISSSRPG+ ANLQGIWN+ SP WDS +NIN +MNYW +
Sbjct: 310 DNGLINLLFAYGRYLLISSSRPGSLPANLQGIWNDSFSPAWDSKFTININAQMNYWPAEV 369
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
LSE EPLFD + + NG + A Y A GW+ HH TDIW + + W M
Sbjct: 370 TGLSELHEPLFDLMKRMLPNGRRAAAEMYCARGWMAHHNTDIWGDCAPQDTWQAASYWQM 429
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
G AWLC H+ EHY YT D +F+ + P+++ A F D LIE G L +PS SPE+
Sbjct: 430 GAAWLCLHILEHYRYTQDENFM-REYLPMVKEAALFFEDSLIENEAGQLVVSPSVSPENT 488
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
++ P G+ + ++MD I+ E+FS +I ++L E +L LP+ +I+
Sbjct: 489 YVLPSGERGMMCEGASMDAQILYELFSGLI-GTDMLSSEEKERYTTILCKLPK---PQIS 544
Query: 586 EDGSIMEWVQ 595
E G++ EW +
Sbjct: 545 EIGTVQEWAE 554
>gi|325281855|ref|YP_004254397.1| Alpha-L-fucosidase [Odoribacter splanchnicus DSM 20712]
gi|324313664|gb|ADY34217.1| Alpha-L-fucosidase [Odoribacter splanchnicus DSM 20712]
Length = 807
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 202/602 (33%), Positives = 307/602 (50%), Gaps = 59/602 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA+ + + +P+GNGRLG M GGV ET+ LN+ T+W+G D NP+A K L
Sbjct: 28 LKLWYTRPAERWEETLPLGNGRLGMMPDGGVVQETIVLNDITMWSGSFQDTRNPEALKYL 87
Query: 73 SDVRSLVDSGQYAEATAASVKLFG-------------HPADVYQLLGDIELEF---DDSH 116
++R L+ G+ EA K F P +QLLG++ L++ D S
Sbjct: 88 PEIRRLLLEGKNDEAQELMYKHFACGGQGSAFGQGANAPYGAFQLLGNLHLQYHFPDSSD 147
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+ Y+ Y R L L+ A A + G V++ RE+F S + V++ K++ G L F+V+
Sbjct: 148 VGYS--AYERGLSLDKALAWTCFRKGKVKYRREYFVSQTEDVMIMKLTADRKGMLDFDVA 205
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
+D + Y N + + MEG+ + G ++ L++ +D R
Sbjct: 206 IDRPENYTCYAN-DGVVYMEGQL---------DNGKGKAGTKYMVQLKVWTADGR---QV 252
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+ + V+ + A +L+ A +S D +Q N+ Y L R
Sbjct: 253 ADSACIHVKEATTAYVLVSAGTSL---------WAADYPERVEKLMQIAGNMDYGYLLER 303
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H ++ ++RV + L +P+DI+ P+ +R+ FQ EDP LV L FQ+
Sbjct: 304 HDSAWRYKYNRVELDLG-TPQDIL------------PTDQRLARFQEQEDPGLVALYFQY 350
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLIS +R + NLQG+W + W+ H+NINL+MNYW NLSE PL
Sbjct: 351 GRYLLISGTRENSFPLNLQGLWANSVQTPWNGDYHLNINLQMNYWPVEIVNLSELHTPLK 410
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ + L +G TA Y A GWV H T+ W + +A W GGAWLC HLWE
Sbjct: 411 NLVKDLVTSGEVTAHSFYGAQGWVAHMMTNPW-RFTAPGEHASWGATNTGGAWLCEHLWE 469
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDG-KLA 534
HY +T+D+++L + YP+L G + F L +IE G+L T PS+SPE+ F P K
Sbjct: 470 HYAFTLDQEYL-REVYPVLSGASRFFLSSMIEEPTQGWLVTAPSSSPENAFYMPGTRKEV 528
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA-EDGSIMEW 593
V MD IIRE+FS I AA +LE + A + + K+L +L P +I+ + G + EW
Sbjct: 529 SVCMGPAMDTQIIRELFSNTIQAARLLEIDA-AFADSLEKALDKLPPMQISPKGGYLQEW 587
Query: 594 VQ 595
++
Sbjct: 588 LE 589
>gi|260591756|ref|ZP_05857214.1| putative alpha-L-fucosidase 2 [Prevotella veroralis F0319]
gi|260536040|gb|EEX18657.1| putative alpha-L-fucosidase 2 [Prevotella veroralis F0319]
Length = 804
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 190/596 (31%), Positives = 303/596 (50%), Gaps = 47/596 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+++ +N PA F ++IP+GNG+LGA+V+GG +T+ LN+ T WTG P D N KA
Sbjct: 24 MRLWYNQPAHFFEESIPLGNGKLGALVYGGTQKDTIYLNDITYWTGKPVD-PNEGLGKAK 82
Query: 72 -LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ ++R + + Y A + + G + YQ LG + + ++ A Y REL+L
Sbjct: 83 WIPEIRKALFAENYRTADSLQHFVQGEQSASYQPLGTLNIINLNTG---AVSNYYRELNL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
++A A + Y ++FTRE+F+++ D +I I +++G+++ ++ L + H N
Sbjct: 140 DSALAHISYQQNGIQFTREYFATHRDSLIAIHIKANQAGAINLHIQLTAQTP-HKVKATN 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
NQ+ M G G A +++ G + A D L + +D A
Sbjct: 199 NQLTMTGHTTGSETE------------SVHACTIVRLLPQGGKVIA-SDSTLTLTNADNA 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ +V ++SF+G +P +++A +N +YS+ RH+ +YQ++++R+ +
Sbjct: 246 TIYIVNATSFNGFDKHPVKDGASYIDNAVNAAWHTQNFTYSEFKDRHIKEYQQIYNRIKL 305
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-------SLVELLFQFGRYLLIS 363
QL ++E + +P+ + ++ + + P L L FQFGRYLL+S
Sbjct: 306 QLG-----------NKEYTNNLPTDQLLRRYSSSTAPLPEAAQRYLETLYFQFGRYLLLS 354
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SR ANLQG+W L W +NINLE NYW + P N+SE +PL F+ LS
Sbjct: 355 CSRTPNIPANLQGLWTPHLFSPWRGNYTMNINLEENYWPADPANMSETIQPLIGFVKGLS 414
Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVV--WALWPMGGAWLCTHLWEHYN 479
G TA+ Y + GW H +D W K+S GK WA W +GGAWL LW+HY
Sbjct: 415 ATGKHTARNFYGINEGWCAAHNSDPWCKTSPVGEGKESPEWANWNLGGAWLVNALWDHYL 474
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVS 537
Y+ D+ L+ YPL+EG + F WL+ + L T PSTSPE+E++ G
Sbjct: 475 YSQDKQLLQNTIYPLMEGSSKFFQQWLVTNPNKPNELITAPSTSPENEYVTDKGYHGTTC 534
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
Y T D+AIIRE+F + A + L D ++ L RL P + G + EW
Sbjct: 535 YGGTADLAIIRELFMNMQQARKSLGLKPDKEMD---DKLHRLHPYTVGSQGDLNEW 587
>gi|198275795|ref|ZP_03208326.1| hypothetical protein BACPLE_01970 [Bacteroides plebeius DSM 17135]
gi|198271424|gb|EDY95694.1| hypothetical protein BACPLE_01970 [Bacteroides plebeius DSM 17135]
Length = 816
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 201/590 (34%), Positives = 309/590 (52%), Gaps = 50/590 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++ +A+P+GN LG MV+GG+ E ++LNE+T W G P A L
Sbjct: 26 LKLWYSAPARNWWEALPVGNSHLGGMVFGGINHEEIQLNEETFWAGGPYSNNRTGASGYL 85
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+VR L+ + EA + F H Y LG + ++F+ + ++Y R+L+L
Sbjct: 86 DEVRRLIFENKNLEARTLLDEKFMTSHHGMRYLTLGSLLMDFN---CEGKVDSYYRDLNL 142
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
ATA V++ VE+TR F+S D V+V +++ ++ G+ +V L S V
Sbjct: 143 EDATASVRFRCDGVEYTRRVFTSFSDNVMVVEMA-TDKGNKKLDVDLRYTCPLTSEVKSE 201
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
++ +C G A P + A++ +++ D G I +D +L V G+ A
Sbjct: 202 GDYLIM-KCNG------AEHEGIPAALH--AVVMMRVKSD-GKIEC-KDGRLSVRGASSA 250
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ L A+++F +N D D +++ A++ + LY H Y F RV++
Sbjct: 251 TVFLSAATNF----VNYQDVSGDAYAKARCAIEGAWDKQNKKLYDEHKAIYSAQFGRVAL 306
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L S + E N+ R+ F +D SL L+FQ+GRYLLISSS+PG+Q
Sbjct: 307 HLPSS-----EFSKKETNV-------RINEFNKVKDCSLAALMFQYGRYLLISSSQPGSQ 354
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+DL WDS +NIN EMNYW + NLSE P F LS+ G + A
Sbjct: 355 PANLQGIWNKDLYAPWDSKYTININAEMNYWPAEVTNLSETHVPFFQMAHELSVTGKEAA 414
Query: 431 QVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+V Y A GWV HH TDIW + AD G +WP GGAW+ HLW+HY Y+ D++F
Sbjct: 415 RVLYGAKGWVAHHNTDIWRAAGPVDFADAG-----MWPNGGAWVAQHLWQHYLYSGDKNF 469
Query: 487 LEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + YP+L+G A FLL ++ + G+ T PS SPEH P+G + TMD
Sbjct: 470 L-REYYPVLKGTADFLLSFMTKHPRYGWRVTAPSVSPEH---GPNG--VSIVAGCTMDNQ 523
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I +V S + AA ++ + A + + + +L P +I + + EW++
Sbjct: 524 IAFDVLSNTLRAARII-GDSKAYCDSLQSLISQLPPMQIGQYNQLQEWLE 572
>gi|453085568|gb|EMF13611.1| glycoside hydrolase family 95 protein [Mycosphaerella populorum
SO2202]
Length = 811
Score = 306 bits (785), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 215/608 (35%), Positives = 309/608 (50%), Gaps = 70/608 (11%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ + PA + D +PIGNGRLGAM+ G E L LNED++W G P + NP A K L
Sbjct: 7 LFYESPANLWEDGLPIGNGRLGAMIRGTTNVERLWLNEDSVWYGGPQNRVNPAAHKNLEL 66
Query: 75 VRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLK----YAEETYRRE 127
VR L+D + AEA + F G P + Y+ LGD+ + F A ++YRR
Sbjct: 67 VRELIDQNKIAEAENIMSRTFTGMPESMRHYEPLGDVFMHFGHGRFSGRGGAAVQSYRRA 126
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL T A V Y+ F RE FSS +VI +IS + LSF ++L+ DN ++
Sbjct: 127 LDLQTGLATVSYACQGGNFQREVFSSTVAEVICMRISSDQC--LSFLLTLNRGDDNDAH- 183
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE----------IKISDDRGTISAL 237
R + N +D G+ +A++ +KI D G
Sbjct: 184 ----------RQFDRAFDTLTNTDD---GLVLTAVMGGRNAVELAIGVKIVCDDGVKVDS 230
Query: 238 EDKKLKVEGSDWAVLLLVAS-SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
++V +VL+L+A ++F N D+ + E+ + ++ L +
Sbjct: 231 CGIDVEVSMQKGSVLILIAGETTFRN--TNAVDAVQQRLEEAAKS-------TWDQLLSA 281
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLF 354
H+ + +L++RV + L + E N+D V + +R++ + +D L LLF
Sbjct: 282 HVAHFGRLYNRVELHLDQ-----------ELNVDHVSTDQRLEQARQHPGQDNELTALLF 330
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
+GRYLLISSS ANLQGIWN D P W S NINLEMNYW + NL EC +
Sbjct: 331 HYGRYLLISSSLS-GLPANLQGIWNCDAKPVWGSKYTANINLEMNYWPAEVTNLPECHQV 389
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LF+FL L+ G++TAQ Y GW HH TDIWA ++ + W + GAWL TH+
Sbjct: 390 LFNFLERLAERGTQTAQQMYGCRGWTCHHNTDIWADTAPQDRSICATYWNLTGAWLSTHI 449
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK-- 532
WEHY +T+D DFL+ R +P++ G A F D+LIE DG+L T+PS S E+ + P+
Sbjct: 450 WEHYLFTLDLDFLQ-RYFPIMRGSAQFFQDFLIE-RDGHLVTSPSISAENSYFLPNSNSN 507
Query: 533 -----LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
+ + T D I+RE+F A I A +L + A E VL LP PT+I +
Sbjct: 508 NNKPVVGSICAGPTWDSQILRELFHACIQAGNLLHE-PVAEYEHVLNKLP---PTQIGKH 563
Query: 588 GSIMEWVQ 595
G IMEW+
Sbjct: 564 GQIMEWLH 571
>gi|395804734|ref|ZP_10483969.1| glycoside hydrolase [Flavobacterium sp. F52]
gi|395433122|gb|EJF99080.1| glycoside hydrolase [Flavobacterium sp. F52]
Length = 778
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 203/608 (33%), Positives = 320/608 (52%), Gaps = 57/608 (9%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S S N L++ + PA + + +P+GNGRLG M GG+ +E L LN+ TLW+G P D N
Sbjct: 18 SFSQNNQLELWYTKPASQWEETLPLGNGRLGIMPDGGIETEKLVLNDITLWSGSPQDANN 77
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEF 112
A L +R L+ + + +EA + F G A+V YQ+LGD+ L+F
Sbjct: 78 YKAYTFLPQIRELLLANKNSEAEQLINQNFVCTGPGSGSGDGANVQFGCYQVLGDMTLKF 137
Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
D K Y R L++ TA A ++++ V + RE+F+ D V+ K++ S+ G L+
Sbjct: 138 D-YKTKSKAINYSRNLNIQTALASTQFTIDGVIYKREYFAGFGDDVLFVKLTSSKKGKLN 196
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
F V LD ++ VN +N ++M G+ N D KG+++ A ++ K +D G
Sbjct: 197 FTVKLDRS-EHFKTVNSDNSLVMTGQL---------NNGIDGKGMKYKAKVKAKTAD--G 244
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
++ + ++V+ + VL + A + F ++ D T E ALQ Y +
Sbjct: 245 SV-LYTNNTIEVKNATEVVLYVSAGTDFKNQNF---ETAVDKTLEI--ALQK----KYDE 294
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLV 350
H+ +YQKLF+RV++ ++ ++ T+P+ ER+ +F D D L
Sbjct: 295 QKKTHIQNYQKLFNRVALNFGKTARN------------TLPTNERLDAFMKNPDSDTGLP 342
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
L +Q+GRYL ISS+R G NLQG+W + W+ H+++N++MN+W NLSE
Sbjct: 343 VLFYQYGRYLSISSTRVGLLPPNLQGLWAHQIQTPWNGDYHLDVNVQMNHWALETGNLSE 402
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
PL D + + G KTA+ Y A GWV H T+IW + W + G WL
Sbjct: 403 LNLPLKDLVKEMVPYGEKTAKAYYNADGWVAHVITNIWGFTEPGE-SASWGIAKAGSGWL 461
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAP 529
C +LW HY YT D+ +L YP+++G A F L++ + G+L T+PS SPE+ F P
Sbjct: 462 CNNLWNHYLYTNDQAYLAD-IYPIIKGAAQFYNSMLVKDPETGWLVTSPSVSPENSFFLP 520
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEV--LEKNEDALVEKVLKSLPRLRPTKIAED 587
+G+ A V T+D I+RE+F+ +I+A+ L+ A +EK LK LP P ++ D
Sbjct: 521 NGQDAHVCMGPTIDNQIVRELFNNVIAASSKLGLDNTLKAELEKRLKLLP--PPGVVSPD 578
Query: 588 GSIMEWVQ 595
G I EW++
Sbjct: 579 GRIQEWLK 586
>gi|340619499|ref|YP_004737952.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339734296|emb|CAZ97673.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 809
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 194/603 (32%), Positives = 319/603 (52%), Gaps = 42/603 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PAK +TDA P+GNGRL AM +GGV E +LNE++LW GVP + D L
Sbjct: 36 LTLWYTSPAKKWTDAFPLGNGRLAAMTFGGVAQERFQLNEESLWAGVPSNPFAEDYRAKL 95
Query: 73 SDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDS-HLKYAEETYRREL 128
+ ++ L+ G+ EA A ++ + PA Y+ LGDI L+F D+ H+ Y+R L
Sbjct: 96 TKLQKLILEGKTLEANAFGLENMTAAPASFRSYEPLGDIVLDFKDTTHIS----NYKRAL 151
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL T ++V Y + E RE F S D + ++S S ++ +SL D
Sbjct: 152 DLETGISKVTYRTEDSEMVRESFISAEDDALFIRLSAKGSKKINCTISLARPKDVRITAT 211
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-----IQFSAILEIKISDDRGTISALEDKKLK 243
++ M G+ P + N G + F+A L+ K+S G + L
Sbjct: 212 PEGKLYMLGQIVDIEAPEAHDENAGGSGEGGEHMSFAAGLQTKVS---GGKLCHTEHNLV 268
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+E +D ++ A++++D +N D+ DP+ + L+ + S+ +L H ++++
Sbjct: 269 IENADEVLIAYTAATNYDLSKLN-FDASVDPSLKVRGILEKLDQKSWKELEYTHREEHRN 327
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLI 362
+F RV L SP D ++P+ ER+ +F+ +D L LFQFGRYLL+
Sbjct: 328 MFDRVQFDLGTSPND------------SLPTDERLLAFKNGAKDTGLPVQLFQFGRYLLM 375
Query: 363 SSSR-PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SSR P ANLQG W+E + W++ H+N+NL+MNYW + N+SE +PL ++
Sbjct: 376 GSSRGPAVLPANLQGKWSERMWAPWEADYHLNVNLQMNYWPADVTNISETIDPLVNWFEL 435
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-----WALWPMGGAWLCTHLWE 476
+ A+ Y + GW HH ++ + + + + L P+ GAW+ +LW+
Sbjct: 436 IVETSKPLAKEMYGSDGWFSHHASNPFGRVTPSASTLPSQFNNAVLDPLPGAWMAMNLWD 495
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP-DGKLAC 535
HY +T D+ FL++R YPLL+G + F+LD L+E +G L PSTSPE+++ P G++
Sbjct: 496 HYEFTQDKVFLKERLYPLLKGASEFILDVLVEDSEGVLHFVPSTSPENQYKDPATGQMMR 555
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL---KSLPRLRPTKIAEDGSIME 592
++ +ST ++IIR +F A + AA +L + + ++++ K+LP K +G +ME
Sbjct: 556 ITSTSTYHLSIIRAMFKATLEAATILGEGNNERCKRIVEAGKALPDFPIDKT--NGRMME 613
Query: 593 WVQ 595
W Q
Sbjct: 614 WRQ 616
>gi|423605155|ref|ZP_17581048.1| hypothetical protein IIK_01736 [Bacillus cereus VD102]
gi|401244303|gb|EJR50667.1| hypothetical protein IIK_01736 [Bacillus cereus VD102]
Length = 1193
Score = 306 bits (784), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 202/600 (33%), Positives = 313/600 (52%), Gaps = 69/600 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 84 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
D A L +R + G + A S + FG YQ GDI L+F+
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDAS 199
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ YRREL+LN + V YS V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 200 -SFSNYRRELNLNEGISTVSYSYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDVRPT 258
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + + +I ++G+ AN+ G+++ + E K+ ++ GT++A E
Sbjct: 259 SAQGGQ-VTSKDKKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLTA-E 301
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I N SY L H+
Sbjct: 302 NGKIKVANADSLTIIMTAATDYENKY--PNYKGEDPHQKVEKIMSAISNKSYEVLKYTHI 359
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ + L EL FQ+GR
Sbjct: 360 KDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 406
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL D+
Sbjct: 407 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 466
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ +LWE
Sbjct: 467 VDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNLWE 525
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L +
Sbjct: 526 HYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------LGGI 576
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ + D L K K P P +I G + EW
Sbjct: 577 SNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQVQEW 633
>gi|229197298|ref|ZP_04324028.1| hypothetical protein bcere0001_28460 [Bacillus cereus m1293]
gi|228586175|gb|EEK44263.1| hypothetical protein bcere0001_28460 [Bacillus cereus m1293]
Length = 1172
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 203/603 (33%), Positives = 314/603 (52%), Gaps = 75/603 (12%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 63 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 122
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 123 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 178
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 179 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 234
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 235 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 278
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 279 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 335
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 336 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 382
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 383 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 442
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 443 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 501
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 502 LWEHYKFTDDKQYLQEKIYPILKEAAEFHSKFLVEDQNKKLVVSPCWSPE---------L 552
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+ +L+ ++ D L K K P P +I G +
Sbjct: 553 GGISNGCAFDQQLVYELFSNVIEASNLLQIDKGFRDELKAKRDKLFP---PIQIGRYGQV 609
Query: 591 MEW 593
EW
Sbjct: 610 QEW 612
>gi|393786769|ref|ZP_10374901.1| hypothetical protein HMPREF1068_01181 [Bacteroides nordii
CL02T12C05]
gi|392658004|gb|EIY51634.1| hypothetical protein HMPREF1068_01181 [Bacteroides nordii
CL02T12C05]
Length = 821
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 187/593 (31%), Positives = 312/593 (52%), Gaps = 48/593 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA + +A+P+GNGR+GAMV+G V E +LNE+++W G P + NP A +AL
Sbjct: 24 LKLWYDRPATQWVEALPLGNGRIGAMVYGDVLHEEFQLNEESIWGGSPYNNVNPKAKEAL 83
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA S G P YQ +G + L+F+ + Y++ Y R
Sbjct: 84 PRIRQLIFEGRNKEAQEMCGHAICSQTANGMP---YQTVGSLHLDFEGVN-NYSD--YYR 137
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNH 184
ELD+ A K++ V +TRE F+S PDQ+++ +++ S+ +SF ++ D
Sbjct: 138 ELDIEKAIVTTKFTSEGVTYTREAFTSFPDQLLIIRLTASQKRKISFTARYNTPYGKDII 197
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
V+ ++ + G KAN ++ +G ++FS + ++ + G A+ D L+
Sbjct: 198 RNVSSRKELQLHG---------KANDHEGIEGKVRFSTL--TRVEHNGGYTEAIADTLLR 246
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ ++ +V L V S FIN +D + + + L++ +Y H Y+K
Sbjct: 247 ISNAN-SVTLYV---SIGTNFINYNDVSGNALKTAQNYLKNAGK-NYQKAKETHCSTYRK 301
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L + + P+ RV+ F + DP L L FQFGRYLLI
Sbjct: 302 WFNRVSLDLGSNAQSFK------------PTDVRVREFTSTFDPQLAALYFQFGRYLLIC 349
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN L WD +IN+EMNYW + NL E EP + ++
Sbjct: 350 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTNLPEMHEPFLQLIKEVA 409
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G ++A + Y GW +HH TDIW + + G + +WP +W C HLW+HY ++ +
Sbjct: 410 EKGKQSAAM-YGCRGWTLHHNTDIWRSTGSVDGP-GYGIWPTCNSWFCQHLWDHYLFSGN 467
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
RD+L + YPL+ F LD+LI + + +L +PS SPE+ + + + +TM
Sbjct: 468 RDYLTE-IYPLMRSACEFYLDFLIRDPKNNWLVVSPSYSPENRPVVNGKRDFTIVAGATM 526
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D ++ ++F + AA ++ ++ A ++ + + L P ++ G + EW++
Sbjct: 527 DNQMVNDLFRNTLEAASLIGES-SAFIDSLQTVIQNLAPMQVGRWGQLQEWME 578
>gi|299147305|ref|ZP_07040370.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298514583|gb|EFI38467.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 811
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 192/590 (32%), Positives = 303/590 (51%), Gaps = 55/590 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NGSGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANTLNFTIAYNFPLVHKVNVQND 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ C GK + +G++ + E +I + L++ A
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNSTLRPGGNTLQINEGTEA 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L + A++++ +N + D + + L+ + Y H+ Y+K F RV +
Sbjct: 246 TLYISAATNY----VNYQNVSADESHRTSEYLKRATQIPYEKALKSHIAYYKKQFDRVRL 301
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L I + + +R+++F ED ++ LLF +GRYLLISSS+PG Q
Sbjct: 302 TLPTG------------KISQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGGQ 349
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++TA
Sbjct: 350 PANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAETA 409
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++FL
Sbjct: 410 RTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEFL 465
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 466 -KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDNQ 514
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 515 IAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|336404644|ref|ZP_08585337.1| hypothetical protein HMPREF0127_02650 [Bacteroides sp. 1_1_30]
gi|335941548|gb|EGN03401.1| hypothetical protein HMPREF0127_02650 [Bacteroides sp. 1_1_30]
Length = 811
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 194/591 (32%), Positives = 306/591 (51%), Gaps = 57/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAM++GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMIYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDTNFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQND 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+ C GK + +G++ + E +I GT+ + EG++
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLPAG------------KASQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|295086436|emb|CBK67959.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 811
Score = 306 bits (783), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 197/591 (33%), Positives = 312/591 (52%), Gaps = 57/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAIHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PAVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y + +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQMDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D + + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSANESHRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TGKASQ-----LETPKRIENFGYGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|160883519|ref|ZP_02064522.1| hypothetical protein BACOVA_01491 [Bacteroides ovatus ATCC 8483]
gi|156110932|gb|EDO12677.1| hypothetical protein BACOVA_01491 [Bacteroides ovatus ATCC 8483]
Length = 793
Score = 305 bits (782), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 199/593 (33%), Positives = 303/593 (51%), Gaps = 58/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK + +A+P+GN RLGAMV+G E L+LNE+T+W G P NP A +AL
Sbjct: 10 LKLWYDRPAKVWEEALPLGNSRLGAMVYGIPQREELQLNEETIWGGSPYRNDNPKAVQAL 69
Query: 73 SDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
+ R L+ +G+ EA + F G P +Q G I L F H Y + + RE
Sbjct: 70 PEARKLIFAGKNTEADKLINETFFTRAHGMP---FQTAGSIILNFP-GHENY--QNFYRE 123
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL A + +Y+V VE+ RE ++S D VIV +I+ S +++F + ++ + V
Sbjct: 124 LDLGRAVSTTRYTVDGVEYAREAYASFADDVIVMRITASRKRAINFVLEYSRPVNFNVSV 183
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
G+ I + IP + N + ++ + G L ++ + V+ +
Sbjct: 184 KGSTLIFHSKGTDHEGIPGEINYQ-----------IHTRVVTNDGEAEVLNNR-IVVKNA 231
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
A L + S+F D ++ + +I+N +Y +H++ + + F+R
Sbjct: 232 TVATLYISIGSNFIDYKTLGGDEYVAKVTQKLDC--AIKN-NYKAALKKHIEIFSQQFNR 288
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
+ L + +T +R+ FQ D+DPSLV LL QFGRYLLI SS+P
Sbjct: 289 FKLNLGNRSDGVKKNTL-----------QRIADFQIDQDPSLVTLLTQFGRYLLICSSQP 337
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQGIW ++P+WDS +NIN EMNYW + NLSE P + LS NG
Sbjct: 338 GGQPANLQGIWCHQMNPSWDSKYTLNINAEMNYWPAEVTNLSETHLPFLQMVKDLSENGR 397
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDR 484
+TA + Y A GW +HH TDIW + G + +A +WP GGAW+C HLWEHY YT D+
Sbjct: 398 RTAAMMYNAEGWTVHHNTDIWRVT----GPIDFARSGMWPTGGAWVCQHLWEHYLYTGDK 453
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL YP ++G A + L +++ H Y + PS SPE V TM
Sbjct: 454 KFLAD-VYPAMKGAADYFLSSMVK-HPKYDWMVVCPSVSPEQ---------GGVVAGCTM 502
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D +I E+ + A E+L ++ +K+ + L +L P I + + EW++
Sbjct: 503 DNQLIIELLTKTAKANEILGESP-VYRQKLYELLEKLPPMHIGKHTQLQEWLE 554
>gi|160879541|ref|YP_001558509.1| hypothetical protein Cphy_1395 [Clostridium phytofermentans ISDg]
gi|160428207|gb|ABX41770.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
Length = 758
Score = 305 bits (782), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 196/588 (33%), Positives = 302/588 (51%), Gaps = 60/588 (10%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
+ +A+P+GNG GAM++G V E +KLN++++W G + NPD+ K L VR L+ GQ
Sbjct: 18 WEEALPLGNGSFGAMLYGNVEEEVIKLNQESVWYGGFRNRINPDSRKVLPKVRELIFDGQ 77
Query: 84 YAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE---------TYRRELDLN 131
A +FG P Y+ L D+ + F+ L ++E+ Y+R LDL
Sbjct: 78 LKAAEELVYTSMFGTPISQGHYEPLADLRIAFNKRILHHSEQWQERQINHSNYKRFLDLQ 137
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN- 190
TA Y+ ++ RE S PDQV+ +++ + + LD +N+ V N
Sbjct: 138 TACYNSSYTWRETDYKREALISYPDQVMAIRLTAD--NPMGVRIELDRG-ENYEKVEANE 194
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N I + G C G G +F A +++ ISD GTI L+VE +
Sbjct: 195 NTITLSGSCGGN-------------GSKFIAKVQV-ISD--GTI-VRAGAFLEVENASEI 237
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
VL + + F ++DP L Y ++ H+ DY L+ RV +
Sbjct: 238 VLYVAGRTDF---------YEEDPMDWCNEKLALAAQKGYEEIKKDHIADYASLYQRVDL 288
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGT 369
L+ ++N +P+ ER++ F+ ++ D L+EL + +GRYLLISSSR G
Sbjct: 289 DLN-----------GDKNYLNLPTDERLRLFKENKLDDGLLELYYNYGRYLLISSSREGA 337
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWN+D+ P W S +NIN +MNYW + NLSEC PLF+ + + +G +
Sbjct: 338 LPANLQGIWNKDMMPAWGSKYTININTQMNYWPAEVTNLSECHTPLFEHIKRMVPHGREV 397
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y G V HH TDI+ + +WPMG AWL TH+ EHY YT D F+ K
Sbjct: 398 AEKMYGCRGIVAHHNTDIYGDCVPQGKWMPATMWPMGFAWLATHVIEHYRYTKDVSFV-K 456
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
Y +L+ + F +D+L+ + L T PSTSPE+ +I +G+ + + Y +MD II+E
Sbjct: 457 DFYSILKDASLFYVDYLVRDKENQLVTCPSTSPENTYILENGEKSTLCYGPSMDSQIIKE 516
Query: 550 VFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+++ I + LE + D + VE +LK LP+ K+ G ++EW +
Sbjct: 517 LWTGFIEVSSDLEVSNDVVSAVENMLKELPK---AKVGSRGQLLEWTK 561
>gi|237719758|ref|ZP_04550239.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229451027|gb|EEO56818.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 811
Score = 305 bits (781), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 196/591 (33%), Positives = 312/591 (52%), Gaps = 57/591 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAM++GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMIYGGIEREELQLNEETFWAGSPYNNNNPNAIHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PAVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y + +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQMDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D + + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSANESHRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TGKASQ-----LETPKRIENFGYGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I + + A+ + + + + + ++L +L P +I + + EW++
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLE 563
>gi|384181040|ref|YP_005566802.1| alpha-fucosidase [Bacillus thuringiensis serovar finitimus YBT-020]
gi|324327124|gb|ADY22384.1| alpha-fucosidase [Bacillus thuringiensis serovar finitimus YBT-020]
Length = 1172
Score = 305 bits (781), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 200/600 (33%), Positives = 310/600 (51%), Gaps = 69/600 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 63 LTLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPNSTSEYTYGNR 122
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
D A L +R + G + A S + FG YQ GDI L+F+
Sbjct: 123 DGAASHLGSIREKLAKGDKSGAERESSQFLTGLQKGFGS----YQNFGDIYLDFNMPDAS 178
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
A YRREL+LN A V Y+ +V++ RE+F+S PD+V+V +++ SE+ +S +V
Sbjct: 179 -AFSNYRRELNLNEGIATVSYNYKDVQYNREYFTSYPDRVMVMRLTASEAKKISLDVRPT 237
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + +N+I M+G+ G+++ A K+ ++ GT++A E
Sbjct: 238 SAQGGQ-VTSVDNKITMKGQITNN-------------GMKYEAAF--KVLNEGGTLTA-E 280
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I SY L H+
Sbjct: 281 NGKIKVANADSLTIIMTAATDYENKY--PTYKGQDPHEKVEKVMSAISKKSYEVLKYTHI 338
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ + L EL FQ+GR
Sbjct: 339 KDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 385
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL D+
Sbjct: 386 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 445
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ +LWE
Sbjct: 446 VDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNLWE 504
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L +
Sbjct: 505 HYKFTDDKQYLQEKIYPILKEAAEFHSKFLVEDQNKKLVVSPCWSPE---------LGGI 555
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ + D L K K P P +I G + EW
Sbjct: 556 SNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQVQEW 612
>gi|269793879|ref|YP_003313334.1| hypothetical protein Sked_05400 [Sanguibacter keddieii DSM 10542]
gi|269096064|gb|ACZ20500.1| hypothetical protein Sked_05400 [Sanguibacter keddieii DSM 10542]
Length = 856
Score = 305 bits (781), Expect = 5e-80, Method: Compositional matrix adjust.
Identities = 205/592 (34%), Positives = 297/592 (50%), Gaps = 57/592 (9%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG--- 61
++ + PL + ++ PA +T+A+P+GNGRLGAM +GG + +++N+DT W+G P
Sbjct: 16 DNEAAARPLVLAYDAPAGRWTEALPVGNGRLGAMCFGGTTDDRVQVNDDTCWSGSPATTA 75
Query: 62 ---DYTNPDAPKALSDVRSLVDSGQYAEATAASVKL-FGHPADVYQLLGDIEL-EFDDSH 116
+ + P + D R+ + +G A A +L GH + YQ L D+ L E D +
Sbjct: 76 GRRHFETGEGPGIVDDARAALAAGDVRAAERAVQRLQHGH-SQAYQPLVDLLLVEVDPAG 134
Query: 117 LKYAEET---YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
E Y R LDL TA AR ++ +E +SS P V+V ++ +
Sbjct: 135 GAVDPEPRTGYARSLDLRTAVARHTWTGAGGTVVQETWSSAPRGVLVVDRRATDGTLPAL 194
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKIS 228
VSL S + + R P +P A+ D G +A + + +
Sbjct: 195 RVSLTSPHPTLDVQGTPTGLAVTVRMPSDVVPDHEPADVHVRYDPAPGAAVTAAVHVAVH 254
Query: 229 DDR----GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE--SMSAL 282
D G SA D ++V G+ + L+L + F D++ P + S+ A
Sbjct: 255 TDGIVGDGGPSATADA-VEVVGATYVTLVLGTETDF-------VDAETAPHGDVDSLRAA 306
Query: 283 QSIRNLSYSD---------LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
++R D L H+ D+ LF RV I L +P +T VP
Sbjct: 307 VALRTSGVVDAITASGLPALRAEHVADHDALFGRVEIDLGPAPDSGLT----------VP 356
Query: 334 SAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
ER+ DP+L L Q+GRYL+I+ SRPGT+ NLQGIWNE + P W S
Sbjct: 357 --ERLARHAAGAPDPALAALQAQYGRYLMIAGSRPGTRPMNLQGIWNESVVPPWSSNYTT 414
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS- 451
NIN EMNYW + P NL EC EPL +L L+ G TA+ Y GW HH +D+W S
Sbjct: 415 NINTEMNYWPAGPANLDECHEPLTSWLADLARTGGDTAREVYGLPGWAAHHNSDVWGFSL 474
Query: 452 SADRGKV--VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
A G W WP+GG WL THLW+ Y+++ D FL A+PLL G A F L WL+E
Sbjct: 475 PAGDGDSDPSWTAWPLGGVWLATHLWDRYDWSRDLGFLAD-AWPLLRGAADFALAWLVEQ 533
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
DG L T+P+TSPE+ ++APDG A V+ S+T D+A++RE+ + AA+VL
Sbjct: 534 PDGTLGTSPATSPENRYVAPDGLPAAVTVSTTSDLAMVRELLGRCLDAAQVL 585
>gi|317057786|ref|YP_004106253.1| alpha-L-fucosidase [Ruminococcus albus 7]
gi|315450055|gb|ADU23619.1| Alpha-L-fucosidase [Ruminococcus albus 7]
Length = 756
Score = 305 bits (780), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 198/588 (33%), Positives = 298/588 (50%), Gaps = 49/588 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
+I F PA+ + A+P+GNGR+G M +G +E ++LNED++W+G P N A L
Sbjct: 5 RIWFRRPAEDWNVALPVGNGRIGGMCFGQALNEKIQLNEDSVWSGGPRKRNNASARANLE 64
Query: 74 DVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ + AEA ++ F G P + Y LGD+ ++ H + E R LDL
Sbjct: 65 KVRQLLREEKIAEAEKIVMEAFCGTPVNERHYMPLGDLSIQ---HHKEDTFEYTERSLDL 121
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---LLDNHSYV 187
A +YS+ V +TR S P QV+ I + S+S VS+D D++S V
Sbjct: 122 ENAVCETRYSINGVNYTRRVICSEPAQVMAVCIDADKPASVSVKVSIDGRDDYFDDNSPV 181
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N + I+ G C + GI F+A I++ GT+ + +
Sbjct: 182 N-DTDILYYGGCGSE------------DGICFAAY--IRVLGYGGTVGRW-GSSIVTDCC 225
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +++L A + F +D KK + ++A ++ +L H +DY+ F R
Sbjct: 226 DRVMIILGAQTDF-----RVTDYKKGAELDVITAAGK----TFEELLAEHTEDYRSYFDR 276
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
I + D S ++P+ ER+K + D LV L F FGRYL+I+ SR
Sbjct: 277 AEI--------VFEDGGSY----SLPTDERLKLVKDGGVDNGLVSLYFDFGRYLMIAGSR 324
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
GT NLQGIWN+D+ P W VNIN EMNYW + PC L + PLFD + + +G
Sbjct: 325 EGTLPLNLQGIWNKDMWPAWGCRFTVNINTEMNYWCAEPCGLGDLHIPLFDHIERMRPHG 384
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TA+ Y SG+V HH TDIW ++ + W G AWLCTH+WEH+ +T D++F
Sbjct: 385 RDTAREMYGCSGFVCHHNTDIWGDTAPQDLWIPGTQWVTGAAWLCTHIWEHWLFTQDKEF 444
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L ++ Y ++ A F +D+LI+ G L T PS SPE+ +I G V +MD I
Sbjct: 445 LAQK-YDTMKEAAKFFVDFLIDDGSGRLVTAPSVSPENTYITESGARGSVCIGPSMDSQI 503
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
I ++F+A+I A ++L ++ + EK+ RL +I + G I EW
Sbjct: 504 IYQLFTAVIEAGKILGIDK-SFGEKLSAMRERLPKPEIGKYGQIKEWA 550
>gi|452000004|gb|EMD92466.1| glycoside hydrolase family 95 protein [Cochliobolus heterostrophus
C5]
Length = 806
Score = 304 bits (779), Expect = 9e-80, Method: Compositional matrix adjust.
Identities = 200/604 (33%), Positives = 308/604 (50%), Gaps = 57/604 (9%)
Query: 11 NPLKITFNGP--AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
N ++ + P + +TDA+PIGNGRLGAM++G E ++LNE+T+W+G D N +
Sbjct: 21 NSTRLWYTAPVASSTWTDALPIGNGRLGAMIYGIPVQELIQLNEETIWSGGRRDRVNQNG 80
Query: 69 PKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYR 125
+ +S+VR L+ G A A++ + G P YQ LGD+E+ FD + +Y TY
Sbjct: 81 AQTVSEVRDLLARGDAGGAQKLANLGMMGTPQTCRNYQTLGDMEISFDGTS-EYDNTTYE 139
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL+TA A V++ V + + RE F S PD V V + + +G LSF + + D +
Sbjct: 140 RWLDLDTALAGVRFKVNDTLYEREMFVSVPDDVFVHHLKATGNGKLSFQIRVHRPKDGLN 199
Query: 186 YV-----NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
N N M G G DP + F+ L ++ T+
Sbjct: 200 EASDQNWNENGWTYMTGGTGGI----------DP--VVFTTALAVESDGHVRTLGEF--- 244
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE + A L A++S+ D + S +Q R +Y +L RH++D
Sbjct: 245 -IVVENATEATAFLAAATSY---------RHNDTRAAVDSTIQKARQHTYEELRRRHIED 294
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRY 359
Y L++ + L+ D+ T + +P+ R+ + + DP LV L + +GRY
Sbjct: 295 YSPLYNASVLNLN--GPDLGTSS--------LPTNARINATRRGANDPGLVALAYNYGRY 344
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSR G +NLQGIWN++ P W S VNINL+MNYW + +LS EP FD L
Sbjct: 345 LLISSSRAGNLPSNLQGIWNKEFDPLWGSKYTVNINLQMNYWPAEVTSLSSLHEPFFDLL 404
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ +G+ TA+ Y ASGW+ HH TD+W ++ + W + WL TH+ EHY
Sbjct: 405 ELMRKDGTHTAKAMYNASGWMSHHNTDLWGDTAPVDTYLPATYWTLSSGWLVTHILEHYW 464
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
YT D+ FL + + E F LD L G + YL TNPS SPE+ ++ PDGK
Sbjct: 465 YTGDKSFLASNLHIVSEAI-EFYLDTLQPYKTNGTE-YLVTNPSVSPENTYVGPDGKSYN 522
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GSIM 591
+ T D+ I+ E+F+ ++A L + + A + ++ + +L P + + G++
Sbjct: 523 FDIAPTCDVEILNELFTNYLNAVATLSNSTVDSAFLTRIRDTQAKLPPYRYSTRYPGTLQ 582
Query: 592 EWVQ 595
EW+Q
Sbjct: 583 EWMQ 586
>gi|343083763|ref|YP_004773058.1| glycoside hydrolase [Cyclobacterium marinum DSM 745]
gi|342352297|gb|AEL24827.1| glycoside hydrolase family 65 central catalytic [Cyclobacterium
marinum DSM 745]
Length = 806
Score = 303 bits (777), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 192/606 (31%), Positives = 325/606 (53%), Gaps = 35/606 (5%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A T+ + +++ + PA + +A+PIGNGRLGAM++GGV E ++LNE++LW G+P D
Sbjct: 32 ARKTNNSKKMQLWYTSPANEWLEALPIGNGRLGAMIFGGVKEEQIQLNEESLWAGMPEDP 91
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYA 120
D K + + L G+Y EA ++ L P + Y+ LG++ + FD H K +
Sbjct: 92 YPEDVQKHYAAFQQLNMEGKYEEALKYGMEHLAVSPTSIRSYEPLGELHITFD--HQK-S 148
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
E YRR LDL T Y++ + RE FSS+ VI + + ++ + D
Sbjct: 149 PENYRRTLDLETGVVISTYTIDGKRYLREAFSSDKYDVIFYRFQSLDGEPVNSTIRFDRE 208
Query: 181 LDNHSYVNGNNQIIMEGRC---PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
D + +I++G+ P + + + ++F++ +I + D G++S
Sbjct: 209 KDIVQSIGEGELLIVDGQVFDDPDGYEDNPGGSGETGRHMKFAS--QITATLDEGSMSGN 266
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
E+ L +E S +++ A++ ++ +N D D +++ +L+ +Y H
Sbjct: 267 ENT-LNIENSTGYTVIVSAATDYNLAKLN-FDRNIDAKDKALKSLKGALETAYQTAKDAH 324
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQF 356
+ K+F+RV++ L SP DT+P+ +R+ + D + EL FQ+
Sbjct: 325 TAAHSKMFNRVALSLG-SPLQ-----------DTIPTDKRLDQVREGTNDNHITELFFQY 372
Query: 357 GRYLLISSS-RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
GRYLL+ SS ANLQGIWN+++ W+S H+NINL+MNYW + NLSE PL
Sbjct: 373 GRYLLMGSSVNRAILPANLQGIWNKEMWAPWESDFHLNINLQMNYWPADQTNLSESFVPL 432
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAK-----SSADRGKVVWALWPMGGAWL 470
+F+ L+ NG TA+ +SGW+ HH ++ + + S+ D P+ GAW+
Sbjct: 433 SNFMEKLAKNGEITAEKFIGSSGWMAHHVSNPFGRTTPSGSTKDSQMTNGYSNPLAGAWM 492
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
LW HY +T D+++L++ AYP+L G A F+LD+L E G L T+PS SPE+ +I P
Sbjct: 493 SLSLWRHYEFTQDQEYLKETAYPVLAGTAQFILDFLKENEKGELVTSPSYSPENAYIDPK 552
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
GK + +++MD+ II ++F+A + A E++ + L + K+ +L P KI ++G+
Sbjct: 553 TGKATRNTTAASMDIQIINDIFNACLKAEEII--GDKQLTAAIKKASSKLPPIKIGKNGT 610
Query: 590 IMEWVQ 595
+ EW +
Sbjct: 611 LQEWYE 616
>gi|336425540|ref|ZP_08605561.1| hypothetical protein HMPREF0994_01567 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336012115|gb|EGN42041.1| hypothetical protein HMPREF0994_01567 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 835
Score = 303 bits (776), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 203/631 (32%), Positives = 308/631 (48%), Gaps = 75/631 (11%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA+ F DA +GNG LG V G E + +NEDTLW+G G Y NP + R L
Sbjct: 11 PAEQFWDAHYLGNGSLGMSVMGDPVLEEVYINEDTLWSGSEGFYLNPQHYDRFMEARRLA 70
Query: 80 DSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSH------LKYAEET-------YR 125
G+ EA T + + G + Y L + + + LK E YR
Sbjct: 71 LEGKGKEANTIINNDMEGRWLETYLPLASLHITMGQADNRRNMPLKMVIEPQPGDIEDYR 130
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT------KISGSESGSLSFNVSLDS 179
R L L+ A V + + + RE+F S PD+ K L F +DS
Sbjct: 131 RCLSLSDAAETVSWIRDGIRYRREYFVSYPDRTAYVYCTAEPKEGEGRDKVLDFAFGVDS 190
Query: 180 LLDNHSYVNG--NNQIIMEGRCPGKRIP------PKANAND--DPKGIQFSAILEIKISD 229
L Y+NG + + + G P P P+ D + ++F+ + +D
Sbjct: 191 SL---HYINGAEDGEAFLTGIAPDHAEPSYTAVAPRFIYKDPENSDALRFACCARVISTD 247
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM-SALQSIRNL 288
GT+++ + ++ V G+ +A+L + A +S+ G F P D E + L ++
Sbjct: 248 --GTVAS-DGARVYVNGASYALLAVRAGTSYAG-FRVPRDRDAGKVLEELRKGLDGLQKA 303
Query: 289 S--YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK-SFQTDE 345
Y H+ DYQ L++RV + L E +P+ +R+ + +
Sbjct: 304 GRDYEGARKDHVTDYQALYNRVDLDLG------------TELSGNLPTTQRLHFCGEGVD 351
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DPSL L+ Q+ RYL I+ SRPG+Q NLQGIWN+ +P W S NIN+EMNYW
Sbjct: 352 DPSLAALMLQYSRYLTIAGSRPGSQALNLQGIWNDTPNPPWSSNYTNNINVEMNYWPCEV 411
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
L EC P+ D LT L+ G +TA+ Y +GWV HH D+W + W+ WP
Sbjct: 412 LGLPECHLPMMDLLTELADAGKQTAKEYYHMNGWVAHHNADLWRSTEPSCEDASWSWWPF 471
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
GGAW+C H+W HY YT DR+FL K YP+L A+F+LD+L+E +GYL T PS SPE++
Sbjct: 472 GGAWMCEHIWTHYEYTQDREFLRK-MYPVLREAAAFMLDFLVENKEGYLVTAPSLSPENK 530
Query: 526 F--------------IAPDGK-------LACVSYSSTMDMAIIREVFSAIISAAEVLEKN 564
F +A + + ++ V+ STMDM+I+RE+FS + AA++L+ +
Sbjct: 531 FLTSGEETVIELIDEVAKESRCSPNHPCISAVTIGSTMDMSILRELFSNVARAAQILDIS 590
Query: 565 EDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+D + + L+S+ + P + G + EW +
Sbjct: 591 DDPVPVQALESMKKFPPYRTGRFGQLQEWYE 621
>gi|451854086|gb|EMD67379.1| glycoside hydrolase family 95 protein [Cochliobolus sativus ND90Pr]
Length = 805
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 197/589 (33%), Positives = 300/589 (50%), Gaps = 55/589 (9%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
+TDA+PIGNGRLGAM++G E ++LNE+T+W+G D N + + +S+VR L+ G
Sbjct: 36 WTDALPIGNGRLGAMIYGIPVQERIQLNEETIWSGGRRDRVNQNGAQTVSEVRDLLARGD 95
Query: 84 YAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
A A A++ + G P YQ LGD+E+ FD + KY + TY R LDL+TA A V++
Sbjct: 96 AAGAQKLANLGMMGTPQTCRNYQTLGDMEISFDGTS-KYDKTTYERWLDLDTALAGVRFR 154
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV-----NGNNQIIM 195
V + + RE F S PD V V ++ + + LSF + + D + N N M
Sbjct: 155 VNDTLYEREMFVSVPDDVFVHRLKATGNEKLSFQIRVHRPKDGLNEASDQNWNENGWTYM 214
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
G G DP + F+ L I+ T+ + VE + A L
Sbjct: 215 TGGTGGI----------DP--VVFTTALAIESDGHVRTLGEF----IVVENATEATAFLA 258
Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
A++S+ D + S +Q R +Y +L RH++DY ++ + L+
Sbjct: 259 AATSY---------RHNDTRAAVESTIQKARQHTYEELRRRHIEDYAPFYNASVLNLN-G 308
Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANL 374
P +D +P+ R+ + + DP LV L + +GRYLLI+SSR G +NL
Sbjct: 309 PDLKTSD---------LPTNARINATRKGANDPGLVALAYNYGRYLLIASSRAGNLPSNL 359
Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
QGIWN++ P W S VNINL+MNYW + +LS P FD L + +G TA+ Y
Sbjct: 360 QGIWNKEFDPLWGSKYTVNINLQMNYWPAEVTSLSSLHAPFFDLLELMRKDGMHTAKAMY 419
Query: 435 LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
ASGW+ HH TD+W ++ + W + WL TH+ EHY YT D+ FL P+
Sbjct: 420 NASGWMSHHNTDLWGDTAPVDTYLPATYWTLSSGWLVTHILEHYWYTGDKGFLASN-LPI 478
Query: 495 LEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
+ F LD L G + YL TNPS SPE+ ++ PDGK + T D+ I+ E+
Sbjct: 479 VSEAIEFYLDTLQPYKANGTE-YLVTNPSVSPENTYVGPDGKSYNFDTAPTCDVQILNEL 537
Query: 551 FSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GSIMEWVQ 595
F+ ++A L + + A + ++ + +L P + + G++ EW+Q
Sbjct: 538 FTNYLNAVATLSNSTVDSAFLTRIRDTQAKLPPYRYSTRYPGTLQEWMQ 586
>gi|383812006|ref|ZP_09967453.1| hypothetical protein HMPREF9969_0999 [Prevotella sp. oral taxon 306
str. F0472]
gi|383355392|gb|EID32929.1| hypothetical protein HMPREF9969_0999 [Prevotella sp. oral taxon 306
str. F0472]
Length = 781
Score = 302 bits (773), Expect = 4e-79, Method: Compositional matrix adjust.
Identities = 187/596 (31%), Positives = 301/596 (50%), Gaps = 47/596 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+++ +N PA F +++P+GNG+LGA+V+GG +T+ LN+ T WTG P D N KA
Sbjct: 1 MRLWYNQPAHFFEESLPLGNGKLGALVYGGTQKDTIYLNDITYWTGNPVD-PNEGLGKAK 59
Query: 72 -LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ ++R + + Y A + + G + YQ LG + + ++ A Y REL+L
Sbjct: 60 WIPEIRKALFAENYRTADSLQHFVQGEQSASYQPLGTLNIINLNTG---AVSNYYRELNL 116
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
++A + Y ++FTRE+F+++ D +I I +++G+++ + L + H N
Sbjct: 117 DSALVHISYQQNGIQFTREYFATHRDSLIAIHIKANQAGAINLRIQLTAQTP-HKVKATN 175
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
NQ+ M G G A +++ G + A D L + +D A
Sbjct: 176 NQLTMTGHTTGSETE------------SVHACTIVRLLPQGGKVIA-SDSTLTLTNADNA 222
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ +V ++SF+G +P +++A +N +Y++ RH+ +YQ++++RV +
Sbjct: 223 TIYIVNATSFNGFDKHPVKDGASYIDNAVNAAWHTQNFTYNEFKDRHIKEYQQIYNRVKL 282
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-------SLVELLFQFGRYLLIS 363
+L ++E + +P+ + ++ + + P L L FQFGRYLL+S
Sbjct: 283 KLG-----------NKEYTNNLPTDQLLRRYSSSTAPLPEAAQRYLETLYFQFGRYLLLS 331
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SR ANLQG+W L W +NINLE NYW + P N+SE +PL F+ LS
Sbjct: 332 CSRTPNIPANLQGLWTPHLFSPWRGNYTMNINLEENYWPADPANMSETIQPLIGFVKGLS 391
Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVV--WALWPMGGAWLCTHLWEHYN 479
G TA+ Y + GW H +D W K+S GK WA W +GGAWL LW+HY
Sbjct: 392 ATGKHTARNFYGINEGWCAAHNSDPWCKTSPVGEGKESPEWANWNLGGAWLVNALWDHYL 451
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVS 537
Y+ D+ L+ YPL+EG + F WL+ + L T PSTSPE+E++ G
Sbjct: 452 YSQDKQLLQNTIYPLMEGSSKFFQQWLVTNPNKPNELITAPSTSPENEYVTDKGYHGTTC 511
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
Y T D+AIIRE+F + A + L D ++ L RL P + G + EW
Sbjct: 512 YGGTADLAIIRELFMNMQQARKSLGLKPDKEID---DKLHRLHPYTVGSQGDLNEW 564
>gi|325298040|ref|YP_004257957.1| alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
gi|324317593|gb|ADY35484.1| Alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
Length = 1004
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 186/592 (31%), Positives = 310/592 (52%), Gaps = 45/592 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PAK + + +P+GNGRLG M GG+ E + LNE ++W+G DY NP+A ++L +R
Sbjct: 232 YDKPAKQWEETLPLGNGRLGMMPDGGITKEHIVLNEISMWSGSEADYRNPEAAESLPRIR 291
Query: 77 SLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
L+ G+ EA F G +Q+L D+ + + + Y R L+
Sbjct: 292 QLLFEGKNKEAQELMYTSFVPKKPEKGGTFGCFQMLADMYINYTFPDTISQAKDYLRWLN 351
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+ A ++ + RE+F S V++ + +L F+++L H
Sbjct: 352 LDEGVAYTTFTKNATRYIREYFVSRNKDVMLIHLQADRPDALGFHLTLSRPERGHVRKLS 411
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ + G + N+ +GI+++AI +K+S + + D ++V +D
Sbjct: 412 EGKLEITGTL--------DSGNERQEGIRYAAIAGVKLSGKKSRMHTHADG-IEVSDADE 462
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A +++ A++S+ I +++++ S L + + +YQ+LFHR
Sbjct: 463 AWIIVSANTSYMKGEIYQTETQRLLDQALASDLTQAKQEA--------TGEYQQLFHRAG 514
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
I+L + T S+ + D +R+++FQT +DPSL L + +GRYLLISS+RPG+
Sbjct: 515 IELPEN------KTVSQLSTD-----KRLEAFQTQDDPSLAALYYNYGRYLLISSTRPGS 563
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
NLQG+W + W+ H NIN++MN+W PCNLSE +PL D + L +G +T
Sbjct: 564 LPPNLQGLWANGVMTPWNGDYHTNINVQMNHWPVEPCNLSELYQPLVDLIKRLVPSGEET 623
Query: 430 AQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
A+ Y A GWV+H T++W +S W GGAWLC HLWEHY YT ++ +L
Sbjct: 624 AKAFYGSEAKGWVLHMMTNVWNYTSPGE-HPSWGATNTGGAWLCAHLWEHYLYTGNKQYL 682
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIA--PDGKLACVSYSSTMDM 544
YPLL+G + F ++ E G+L T P++SPE+EF D V TMD+
Sbjct: 683 AD-IYPLLKGASEFFYSTMVREPEHGWLVTAPTSSPENEFYVSKKDRTPISVCMGPTMDI 741
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWVQ 595
++RE+++ +I AA +L + D+L LK + +L P +I++ G +MEW++
Sbjct: 742 QLVRELYTHVIEAASIL--HTDSLYANQLKEASAQLPPHQISKKGYLMEWLK 791
>gi|197302771|ref|ZP_03167824.1| hypothetical protein RUMLAC_01500 [Ruminococcus lactaris ATCC
29176]
gi|197298169|gb|EDY32716.1| hypothetical protein RUMLAC_01500 [Ruminococcus lactaris ATCC
29176]
Length = 773
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 202/596 (33%), Positives = 301/596 (50%), Gaps = 38/596 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+K+ ++ PA+++ +++P+GNGR+GAMV+GG E L LNEDTLW+G P + T P+
Sbjct: 1 MKLYYDHPAENWHESLPLGNGRIGAMVYGGTKKEILALNEDTLWSGYP-EKTQKKLPEGY 59
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
L VR L + +Y +A + F DV Y G++ +E D + ++ Y REL
Sbjct: 60 LEKVRELTEKREYQKAMEYLEECFSSSEDVQMYVPFGNVYMEMLDGTEEISD--YHRELC 117
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA R+ Y + S P QV+V KI ++ SL V ++
Sbjct: 118 LDTAEVRITYKNQGALVEKSCIVSQPAQVLVYKIRSEKAFSLKLYVEGGYARES---CCT 174
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQ-FSAILEIKISDDRGTISALEDKKLK----- 243
+ + +G+CPG R+P K + F E + G + D K+
Sbjct: 175 DGILKTKGQCPG-RVPFTVGEGGSEKAVPVFPEEPEKQGMCYEGWGKIVTDGKVNEAGNA 233
Query: 244 --VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
VE ++ L SSF G +P + P E + A SY L T HL +Y
Sbjct: 234 VIVENAEEVTLYYGIRSSFAGFDRHPVIEGRCP-EELLKADFDCTGKSYEALRTEHLKEY 292
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
QK + RVS L D +E+++ +R+ FQ ED L LLFQ+GRYL
Sbjct: 293 QKYYKRVSFSLGEK------DEYAEKDLR-----QRLTDFQDHPEDVGLNALLFQYGRYL 341
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI++SRPGTQ ANLQGIWN +L P W S +NIN EMNYWQ+ PCNL E EPL
Sbjct: 342 LIAASRPGTQAANLQGIWNAELVPPWFSDYTININTEMNYWQTGPCNLEEMGEPLVRLCE 401
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
++ +G +TA + G H TD+W K++ G+ W WPMG AWLC +L++ Y +
Sbjct: 402 EMAADGKETAMHYFGKEGVCSFHNTDLWRKTTPADGRAEWNFWPMGYAWLCRNLYDQYLF 461
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---GKLACVS 537
T DR +LE R YP+L+ F ++ ++ GY +P+TSPE++F+ + KL
Sbjct: 462 TEDRAYLE-RIYPVLKENVRFCVESVVGTAQGYA-MSPATSPENDFLFGEEKKEKLTVAQ 519
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
Y+ + AI+R + + A +L D L + K + + +G I+EW
Sbjct: 520 YTEN-ENAIVRNLLRDYLEAGRIL-GIRDELTGQAEKIFEEMAAPAVGSNGQILEW 573
>gi|389642921|ref|XP_003719093.1| hypothetical protein MGG_00050 [Magnaporthe oryzae 70-15]
gi|351641646|gb|EHA49509.1| hypothetical protein MGG_00050 [Magnaporthe oryzae 70-15]
gi|440473491|gb|ELQ42283.1| alpha-L-fucosidase 2 [Magnaporthe oryzae Y34]
gi|440483559|gb|ELQ63936.1| alpha-L-fucosidase 2 [Magnaporthe oryzae P131]
Length = 827
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 199/603 (33%), Positives = 300/603 (49%), Gaps = 50/603 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ S NPL++ + D+ IGNGRLG + G +E + LNED+ W+G D N
Sbjct: 26 ANSAANPLRLWQTTAGVTYNDSFLIGNGRLGFSLPGSALTEAITLNEDSFWSGGKMDRVN 85
Query: 66 PDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
PDA + ++ L+ G+ EA T A + G P V Y LG + L +
Sbjct: 86 PDAAANMPQIQQLITQGRIEEAATLAGMAYKGLPDSVRHYDWLGRLHLAMKGPAGQAGN- 144
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS------ 176
Y R LD+ A V Y++ F+RE+ +S PDQ+I ++ ++SGS+SF +S
Sbjct: 145 -YERWLDVGEGLAGVDYTLNGTAFSREYLASFPDQIIAVRMKSNQSGSISFTLSQSRGSG 203
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
L+ D + ++G+ I+M G G I FS+ ++ +S G+I
Sbjct: 204 LNRFQDYTTSLDGDT-ILMGGGSMGS------------DAIVFSSGAKVTVSG--GSIKT 248
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+ + + V +D AV+ A +++ P K+ + L++ Y + +
Sbjct: 249 I-GETIVVSDADSAVIYWTAWTTYRKP-------KEQLRESVLVDLRTAAAKGYDAIRSE 300
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ DYQKL RV + L S SE+ + +A+R++ DP + L F F
Sbjct: 301 HVKDYQKLAGRVDLNLGMS--------SSEQK--SKSTAQRLRGMSQAFDPEMATLYFYF 350
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
RYLLI+S RPGT ANLQGIWN D+SP W S VNINL+MNYW +L N+ E L
Sbjct: 351 ARYLLIASGRPGTLPANLQGIWNTDISPQWGSKYTVNINLQMNYWPALLTNMPELHHSLL 410
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
D L + NG A+ Y ASG V HH TD+W + WP G WL TH++E
Sbjct: 411 DHLKIMHENGKDVARRMYNASGSVCHHNTDLWGDCAPQDNYAASTFWPTGLGWLVTHVYE 470
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG---KL 533
HY +T D L + YP+L A F LD+L E + G+L TNPS SPE ++ P+ +
Sbjct: 471 HYLFTGDEQVL-RDYYPVLRDSALFFLDFLTE-YQGHLVTNPSVSPEIQYYLPNSTTRQG 528
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIME 592
++ T D +II EVF + A E+L E ++++ + RL P + + G + E
Sbjct: 529 VALTLGPTCDNSIIWEVFGLVFHATEILGNVEGKEFQDRLMSARARLPPLRRDQYGGLAE 588
Query: 593 WVQ 595
++
Sbjct: 589 FIH 591
>gi|423330223|ref|ZP_17308007.1| hypothetical protein HMPREF1075_00020 [Parabacteroides distasonis
CL03T12C09]
gi|409231839|gb|EKN24687.1| hypothetical protein HMPREF1075_00020 [Parabacteroides distasonis
CL03T12C09]
Length = 809
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 197/611 (32%), Positives = 310/611 (50%), Gaps = 53/611 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L++ + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADTDR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G A D L V + A++L+ + + FD KD +S+ L
Sbjct: 246 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGAGQSLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L R +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGRGERD------------HLPINERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 404 TNLSELHLPLIEWTKLQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + STMD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 522 AYKMPNGSVVSICAGSTMDNQIVRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 580
Query: 585 AEDGSIMEWVQ 595
+DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591
>gi|380472541|emb|CCF46724.1| alpha-L-fucosidase [Colletotrichum higginsianum]
Length = 780
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 198/589 (33%), Positives = 292/589 (49%), Gaps = 51/589 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ + PA + +A+PIGNGRLGAMV+G +E ++LNED++W G P D T DA + L
Sbjct: 21 LHYQSPASEWAEALPIGNGRLGAMVYGRTGTELVQLNEDSVWYGGPQDRTPKDALRHLPK 80
Query: 75 VRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R L+ ++AEA + F PA + Y+ LG +E H YRR L L+
Sbjct: 81 LRQLIRDEKHAEAESLVREAFFATPASMRHYEPLGTCTIEL--GHAVEDVTGYRRHLCLD 138
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA V+Y V + R+ +S P+ V+ +++ SE ++ S ++ + ++
Sbjct: 139 TAQTTVEYLSRGVSYRRDAIASFPNNVLAFRVTASEPTRFVVRLNRVSEIEWETNEFLDS 198
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+GR P N+N + S +L + D +G++ A+ + L V+ S
Sbjct: 199 IEADDGRIVLNATPGGRNSN------RLSIVLGVSCHDAQGSVEAIGNS-LVVKSSS-CT 250
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ + A +++ P + + ++ +L + DL H DYQ LF R +++
Sbjct: 251 IAIGAQTTY---------RTLHPETVATEDVRKALDLPWDDLIRHHRSDYQTLFGRTALR 301
Query: 312 L----SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
+ S +P D+ + D LV L +GRYLLISSSR
Sbjct: 302 MWPDASHNPTDM--------------------RIEKGRDAGLVALYHNYGRYLLISSSRH 341
Query: 368 GTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+ A LQGIWN +P W S +NINL+MNYW + PCNL EC P+ D L ++
Sbjct: 342 AEKALPATLQGIWNPSFAPPWGSKYTININLQMNYWPAGPCNLVECAIPVLDLLERMAER 401
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G KTAQ Y GW HH TDIWA + + +WP+GG WLC ++E Y D D
Sbjct: 402 GRKTAQAMYGCRGWCAHHNTDIWADTDPQDRWMPSTIWPLGGVWLCIDVFEMLQYHHD-D 460
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L +RA +LEGC FLLD+LI G YL TNPS SPE+ FI+ GK + S +D
Sbjct: 461 GLHRRAAAVLEGCILFLLDFLIPSSCGKYLVTNPSLSPENTFISNSGKAGILCEGSAIDT 520
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
IIR F + + +L NE L KV ++L +L G I EW
Sbjct: 521 TIIRIAFEKFLWSNSMLGTNE-PLCSKVREALGKLPELMTNAHGLIQEW 568
>gi|256840971|ref|ZP_05546478.1| glycoside hydrolase, family 95 [Parabacteroides sp. D13]
gi|298375740|ref|ZP_06985696.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_19]
gi|256736814|gb|EEU50141.1| glycoside hydrolase, family 95 [Parabacteroides sp. D13]
gi|298266777|gb|EFI08434.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_19]
Length = 811
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 197/611 (32%), Positives = 310/611 (50%), Gaps = 53/611 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 18 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 77
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 78 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 137
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L++ + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 138 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADTDR 197
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 198 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 247
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G A D L V + A++L+ + + FD KD +S+ L
Sbjct: 248 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGAGQSLEKYLSQAE 297
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L R +D +P ER+ +F D+
Sbjct: 298 SKDFSTLRREHTLAYRSLFDRVSLDLGRGERD------------HLPINERLAAFAQDKN 345
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 346 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 405
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 TNLSELHLPLIEWTKLQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 464
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 465 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 523
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + STMD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 524 AYKMPNGSVVSICAGSTMDNQIVRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 582
Query: 585 AEDGSIMEWVQ 595
+DG IMEW++
Sbjct: 583 GKDGRIMEWLE 593
>gi|296130834|ref|YP_003638084.1| alpha-L-fucosidase [Cellulomonas flavigena DSM 20109]
gi|296022649|gb|ADG75885.1| Alpha-L-fucosidase [Cellulomonas flavigena DSM 20109]
Length = 809
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 195/611 (31%), Positives = 300/611 (49%), Gaps = 41/611 (6%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M++ + +T + L + + PA+ +TDA P+GNGRLGAMV GG +E L++N+DT W+G P
Sbjct: 1 MIDDGAVTTASGLVLRLDEPARWWTDAFPVGNGRLGAMVHGGTGAERLQVNDDTCWSGAP 60
Query: 61 GDYT-------NPD-APKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF 112
D T PD AP + R L+ G A KL YQ L D+ +E
Sbjct: 61 HDGTVEPVGPLGPDGAPGVVRRARHLLAEGDPLAAQDELAKLQSGWVQAYQPLVDVLVEQ 120
Query: 113 DDSHLKYAEETYRRELDLNTATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
+ + YRR LDL + S + +E S+PD ++ + +G+ G
Sbjct: 121 PGA---AGRDDYRRVLDLARGVVTTTWRSAAGEPWRQEVLVSHPDGALLLERAGA-PGET 176
Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA----ILEIKI 227
++ + G+ ++ P +P + D P +Q+
Sbjct: 177 RVRLASPHPWASTPAAAGDGILVATLDMPSHVLP---DWVDGPDPVQYGGRSVHAAVALA 233
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
A+ D +++V G+ ++L +++ D + D + AL +R
Sbjct: 234 VLADDAPVAVVDGEVRVTGARRVRVVLTSATDHD---VATGTLHGDRERVAADALAGLRG 290
Query: 288 L--SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE 345
+ RH+ D+ L RVS+ L +P D+ D A + +
Sbjct: 291 ALADVDGIPARHVADHAALLGRVSLDLVAAPPDLPLD------------ARLARHAAGEP 338
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
D L L FQ GRYL ++ SRPGT NLQGIWNE + P W S +NIN EMNYW +L
Sbjct: 339 DAHLAVLAFQLGRYLTVAGSRPGTLPLNLQGIWNERVRPPWSSNYTININTEMNYWPALV 398
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA-KSSADRG--KVVWAL 462
+L+EC EPL +L L+ G +TA+ Y A GWV HH +D W RG W+
Sbjct: 399 GDLAECHEPLLSWLDRLAAAGRQTARTLYGARGWVAHHNSDPWCFTGPTGRGHDSASWSA 458
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WP+GGAWL H+ +H+++T D D L +R +P++ A +LD L+E DG L T+P TSP
Sbjct: 459 WPLGGAWLARHVVDHHDWTGDDDAL-RRHWPVVRDAARAVLDLLVELPDGTLGTSPGTSP 517
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
E+ ++ PDG+ A V+ S+T D+AI+R++ + A V+ ++ L V +L RL
Sbjct: 518 ENHYLLPDGRPAAVAVSTTADLAIVRDLLEQVRRLAPVVRDRDEDLRAAVDGALERLPTE 577
Query: 583 KIAEDGSIMEW 593
++A DG + EW
Sbjct: 578 RVAPDGRLAEW 588
>gi|149199940|ref|ZP_01876968.1| hypothetical protein LNTAR_09881 [Lentisphaera araneosa HTCC2155]
gi|149137009|gb|EDM25434.1| hypothetical protein LNTAR_09881 [Lentisphaera araneosa HTCC2155]
Length = 793
Score = 300 bits (767), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 193/591 (32%), Positives = 300/591 (50%), Gaps = 65/591 (10%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDVRSLVDSGQYAE 86
+PIGNG++GAMV+GGV E + D+LW+G V G + K + +R ++ +Y
Sbjct: 55 LPIGNGKIGAMVYGGVEQEKINFTIDSLWSGKVDGTQNLAGSYKGMEQLRGMLMKDEYDA 114
Query: 87 ATAASVKLFGHP--AD----VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKY 139
A + L G AD +Q GD+ D+ +K+ + Y+R+LD+N A + V++
Sbjct: 115 AHKLAKDLIGSSPSADGNFGTFQTFGDLVF---DTGIKFESVSDYQRKLDINNALSVVEF 171
Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV---NGNNQIIME 196
++G ++TR F S+PDQ +V + S GS N+ L N +V NGN+ I++
Sbjct: 172 TMGKHKYTRTAFVSHPDQCLVLRFEVSAGGSQ--NIKLGFETPNKDWVPRINGND-IVIS 228
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G+ +P A +G +FSA +GT+S VEG+ L A
Sbjct: 229 GKAAQNHMPVNARIRVKHEGGKFSA--------SKGTLS--------VEGARVVEFYLSA 272
Query: 257 SSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSP 316
++FD + P+ + P E + L SY++L RHL+DY+ LF R++I + S
Sbjct: 273 DTAFD--YKAPNRIGEAPDQEVLKTLNQASEKSYAELLERHLEDYKDLFDRLTIDIGDSS 330
Query: 317 KDIVTDTCSEENIDTVPSAERVKSF------QTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
++ +P R+K++ + DP L+E ++Q+GRYLLI+SSRPGT
Sbjct: 331 LEL----------RNMPMEARLKNYGDSLASNANPDPDLIETIYQYGRYLLIASSRPGTL 380
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQG+WN L+P W + H+NINL+MNYW + P NL EC+EPL F+ L G TA
Sbjct: 381 PANLQGVWNNSLTPPWAADYHININLQMNYWLAGPTNLIECEEPLLKFIESLVEPGRITA 440
Query: 431 QVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+ + + GW+ +H T+IW ++ +GK+ W WL HL+EH+ Y D+
Sbjct: 441 KEYFNSEGWMSYHATNIWGHTAPRVGRGKGKLTWKALTTCSLWLSHHLYEHFAYRQDKSQ 500
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L+ +P+L A F +L + DG + PS S EH +S + D+A
Sbjct: 501 LKNEIWPVLAEAADFAAGYLTQLPDGAYTSMPSWSSEH---------GLISKGAITDIAT 551
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
REV + AE+L N + K L KI + G + EW++ R
Sbjct: 552 TREVLQCALECAEILGINNER-TAKWKNRKDNLLAYKIGQHGQLQEWLEDR 601
>gi|429848646|gb|ELA24104.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 791
Score = 299 bits (766), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 207/621 (33%), Positives = 307/621 (49%), Gaps = 91/621 (14%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+N PA + DA PIGNGRLGAMV G E L +NED++W G P + NP A AL VR
Sbjct: 8 YNKPANLWDDATPIGNGRLGAMVRGTTDVERLWINEDSVWYGGPQNRLNPAARDALPKVR 67
Query: 77 SLVDSGQYAEA--------TAASVKL------------FGH----PADVYQLLGDIELEF 112
L+D + EA TA L FGH P D ++ G + E
Sbjct: 68 ELIDQNRIREAEQLIKKTQTARPRSLRHYEPLGDVFLTFGHGQDPPGDEVRVSGIVNFEN 127
Query: 113 DDSH-LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
S L + + YRRELDL T + V Y G + R+ FSS D+VI IS G
Sbjct: 128 SFSRDLNRSPQNYRRELDLRTGISSVSYDFGGAHYERQVFSSTVDEVIA--ISVRSEGEY 185
Query: 172 SFNV------------SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
SF + L+ D+ ++G + I G ++F
Sbjct: 186 SFQIDLNRGDHPEWDRRLNQRYDSLEEIDGGHMITGSMGLKG--------------AVEF 231
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLL-----LVASSSFDGPFINPSDSKKDP 274
+ + +++ D G D +++V+ + + V++ ++ S + F NP+ +
Sbjct: 232 A--MGVRVIADPG------DGEVQVDNTGYNVVVNAKDRVIVLVSGETTFRNPNAGEAVQ 283
Query: 275 TSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPS 334
+ ++++S ++DL + H++ + L+ RV +QL S VP
Sbjct: 284 NRLATASMKS-----WNDLKSAHVERFSALYDRVELQLPGSGDKT-----------AVPI 327
Query: 335 AERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
+R+++ Q D L +LLF FGRYLLIS S G ANLQGIWN D P W S +N
Sbjct: 328 DQRIQAVKQGAVDNGLAQLLFHFGRYLLISCSLSGLP-ANLQGIWNRDHMPVWGSKYTIN 386
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
IN++MNYW + NL+E + LF FL + G++TA+ Y GWV+HH TDIWA ++
Sbjct: 387 INIQMNYWPAEVANLAETHDVLFRFLERTAERGAETAKAMYGCRGWVMHHNTDIWADTAP 446
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
V W + GAW HLWEHY + D+DFL +R YPL+ G A F D+L+E DG
Sbjct: 447 QDDGVQCTYWTLSGAWFMIHLWEHYRFGRDKDFL-RRVYPLMAGSALFFQDFLVE-RDGK 504
Query: 514 LETNPSTSPEHE-FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
L T+PS+S E+ +I +A ++ D I+ E+F A++ A ++L ++ EKV
Sbjct: 505 LITSPSSSAENSYYILGTKTVASIAAGPAWDGQILTELFRAVVEAGKLLGEDTSEF-EKV 563
Query: 573 LKSLPRLRPTKIAEDGSIMEW 593
L LP ++ + G +MEW
Sbjct: 564 LAKLP---TPQMGKHGQVMEW 581
>gi|423304137|ref|ZP_17282136.1| hypothetical protein HMPREF1072_01076 [Bacteroides uniformis
CL03T00C23]
gi|423310748|ref|ZP_17288732.1| hypothetical protein HMPREF1073_03482 [Bacteroides uniformis
CL03T12C37]
gi|392681018|gb|EIY74381.1| hypothetical protein HMPREF1073_03482 [Bacteroides uniformis
CL03T12C37]
gi|392685663|gb|EIY78977.1| hypothetical protein HMPREF1072_01076 [Bacteroides uniformis
CL03T00C23]
Length = 820
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 194/604 (32%), Positives = 311/604 (51%), Gaps = 50/604 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEFDD----SHLKYAEE 122
+R L+ G+ EA F YQ+LGD++++F S L
Sbjct: 89 AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR L+L A A + + +V++ RE+F S V++ + G+L+F+ L
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGREGALNFSARLSRAEH 208
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V GN ++M+G + P + G+++ +++ + ++S +L
Sbjct: 209 SSVTVQGNT-LLMDGMLESGK--PGLD------GMKYRVAMQLVQNGGESSVSPENGIRL 259
Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K W +L A + F G + DS P + ++ SI + S+S
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTAPANSPCSILHSSFSS---- 315
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ ++ L+ RVS+ L +P D T+P+ ER+ F E P+L L + +
Sbjct: 316 HVTAHRFLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISS+RPG+ NLQG+W +S W+ H NIN++MN+W LSE +PL
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWANGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423
Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L +G +A+ Y A GWV+H T++W +A W GGAWLC HL
Sbjct: 424 TLMERLIPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT D+D+L +R YP+L+G A F + E G+L T P++SPE+ F P +
Sbjct: 483 WEHYLYTQDKDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541
Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
VS TMD+ ++ E++ +I+AA +L+ + D V K+ L R P +I+++G +
Sbjct: 542 TPVSICMGPTMDVQLLTELYINVIAAARLLDCDAD-YVAKLEADLKRFPPMQISKEGYLQ 600
Query: 592 EWVQ 595
EW++
Sbjct: 601 EWLE 604
>gi|229173820|ref|ZP_04301360.1| hypothetical protein bcere0006_29180 [Bacillus cereus MM3]
gi|228609670|gb|EEK66952.1| hypothetical protein bcere0006_29180 [Bacillus cereus MM3]
Length = 1156
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 199/600 (33%), Positives = 314/600 (52%), Gaps = 69/600 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 47 LSLWYNQPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPNSTSEYTYGNR 106
Query: 67 D-APKALSDVRSLV--DSGQYAEATAASV-----KLFGHPADVYQLLGDIELEFDDSHLK 118
D A L +R + D AE ++ K FG YQ GDI L+F+
Sbjct: 107 DGAASHLGSIREKLAKDDKSGAERESSQFLTGLQKGFGS----YQNFGDIYLDFNMPDAS 162
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ YRREL++N A V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 163 -SFSNYRRELNVNEGIATVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDVRPT 221
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S +N+I ++G+ AN+ G+++ + E K+ ++ GT++A E
Sbjct: 222 SAQGGQVSAT-DNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLTA-E 264
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I SY L H+
Sbjct: 265 NGKIKVANADSLTIIMTAATDYENKY--PTYKGEDPHQKIEKIMSAISKKSYEVLKYTHM 322
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ + L EL FQ+GR
Sbjct: 323 KDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 369
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL D+
Sbjct: 370 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 429
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ ++WE
Sbjct: 430 VDSLREPGRVSAEKHFGVKGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNVWE 488
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP+++ A F ++L+E + L +P SPE L +
Sbjct: 489 HYKFTDDKQYLQEKIYPIIKEAAEFHSNFLVEDQNKKLVVSPCWSPE---------LGGI 539
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ + D L K + P P +I G + EW
Sbjct: 540 SNGCAFDQQLVYELFSNVIEASEVLQIDNVFRDELKAKRERLFP---PIQIGRYGQVQEW 596
>gi|254475685|ref|ZP_05089071.1| conserved hypothetical protein [Ruegeria sp. R11]
gi|214029928|gb|EEB70763.1| conserved hypothetical protein [Ruegeria sp. R11]
Length = 792
Score = 298 bits (764), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 186/601 (30%), Positives = 291/601 (48%), Gaps = 62/601 (10%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA--LSDVRSLVDSGQYAEATAASVKLF 95
MV+GG + LNEDTL++G P + P P A + V L++ G+Y EA + F
Sbjct: 1 MVYGGADIFKMHLNEDTLYSGEPSEVFKP-TPVADQVPKVSKLLEQGEYEEAQELVRRSF 59
Query: 96 -GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
G YQ +G +E + + + Y R LD+ V + + R+ + S+
Sbjct: 60 LGKQGASYQPVGYFLVEPRN---RVSASAYERRLDIGNGVHSETIEVDDAKILRDCYISH 116
Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG------------K 202
Q IV + S L+ + + + N + + + G+ P +
Sbjct: 117 EHQAIVITMETSADEGLNLDARIVTQHPNGKATHRGRRYVFSGQAPSFTQHAKSLLQMHQ 176
Query: 203 RI---------------------PPKANA------NDDPKGIQFSAILEIKISDDRGTIS 235
R+ P + ++ N D +G+ + + D GT+
Sbjct: 177 RLGDTWKQPALYDRNGDIHPYLTPAEMSSEHTVLYNQDGRGLGMFFEAAVDVRHDGGTVE 236
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+ D + + L+ ++S++G +PS DP + + L ++ ++ + +
Sbjct: 237 -VSDAGISLTNVQSVTFLISLATSYNGFDKSPSREGADPVRRNNNVLDALVGVAEPKIRS 295
Query: 296 RHLDDYQKLFHRVSIQL-SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H DD Q L RVS+ L SP ++ TD +R+K Q DP L L F
Sbjct: 296 SHTDDIQALMSRVSLHLDGESPANLTTD-------------QRLKQAQDRPDPELAALAF 342
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYLLISSSRPG+Q NLQGIWN W S +NINL+MNYW + P L+E EP
Sbjct: 343 QYGRYLLISSSRPGSQPPNLQGIWNNSTCAMWSSNYTMNINLQMNYWPAEPTGLAELTEP 402
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LF+ + LS+ G++ A+ + A GW+ H T +W + + A WP+G WL HL
Sbjct: 403 LFNLIDELSVTGARQAKHMFDAPGWMAFHNTTLWREVTPSHATPQSAFWPVGAGWLVAHL 462
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WE Y Y+ D +FL RA+P +EG FLLDW++EG DG+L T STSPE++F+ +G
Sbjct: 463 WERYEYSGDLEFLRDRAWPRMEGALEFLLDWMVEGSDGFLTTPISTSPENKFLDENGVEC 522
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
V STMD+AIIR + ++ AAE L+K + + + +L +L P + G ++EW
Sbjct: 523 TVHQGSTMDIAIIRGLLEQMLQAAEALDKPAE-ISARYQTALDKLPPYRTGAKGELLEWA 581
Query: 595 Q 595
+
Sbjct: 582 E 582
>gi|330933451|ref|XP_003304180.1| hypothetical protein PTT_16648 [Pyrenophora teres f. teres 0-1]
gi|311319408|gb|EFQ87743.1| hypothetical protein PTT_16648 [Pyrenophora teres f. teres 0-1]
Length = 792
Score = 298 bits (764), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 203/606 (33%), Positives = 311/606 (51%), Gaps = 47/606 (7%)
Query: 4 AESTSTTNPLKITFNGPAKH--FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
A S N ++ + PA+ +TDA+PIGNGRLGAM +G E + LNE+T+W+G
Sbjct: 14 ASLASAGNNTRLWYTTPAQSSAWTDALPIGNGRLGAMAFGIPVQERIALNEETIWSGGQQ 73
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLK 118
D ++P+ +S+VR L+ G +A A++ + G P YQ LGD+++ FD +
Sbjct: 74 DRIGQNSPQTVSEVRDLLAQGHAGDAEKLANMGMMGTPQSCRNYQPLGDMDIFFDGT-TG 132
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y TY+R LD++TA A V++ V + RE F S PD V+V + + SG LSF + +
Sbjct: 133 YDNATYKRWLDVDTALAGVQFQVNGTLYEREMFVSAPDDVLVHHLKATGSGKLSFQIRV- 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ GN E G DP + F+ L ++ SD G + L
Sbjct: 192 ----HRPEKGGNEASDHEWNADGLAYMTGGAGGIDP--VVFTTALAVQ-SD--GHVKNL- 241
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ +E + A + AS+S+ D + S +Q R +Y +L RH+
Sbjct: 242 GPFIVIENATEATAIFAASTSY---------RHNDTRAAVESTIQQARQHTYEELRQRHI 292
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFG 357
DY L++ + LS S DI ++P+ R+ + + DP+L L + +G
Sbjct: 293 ADYAPLYNASVLDLSGS--DI--------EASSLPTDARINATREGASDPALAALSYNYG 342
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSR G +NLQGIWN++ +P W S VNINL+MNYW + +LS EPLFD
Sbjct: 343 RYLLIASSRAGNLPSNLQGIWNKEFAPQWGSKYTVNINLQMNYWPAEVTSLSSLHEPLFD 402
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
L + +G+KTA+ Y ASGWV HH TD+W ++ + W + WL TH+ EH
Sbjct: 403 LLDLMRKDGTKTARQMYNASGWVTHHNTDLWGDTAPVDRWLPATYWTLSSGWLVTHILEH 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKL 533
Y YT D+ FL + + E A F LD L I G YL TNPS SPE+ ++ D
Sbjct: 463 YWYTGDKKFLASKLDVVSEAIA-FYLDILQPYSINGTQ-YLVTNPSVSPENSYLDADNNT 520
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GS 589
+ T D+ I+ E+F+ ++A L + + + + + +L P + ++ G+
Sbjct: 521 YHFDIAPTCDIEILNELFTNYLNAVATLPNSTVDSTFLTHIRDTQAKLPPYRYSKRYPGT 580
Query: 590 IMEWVQ 595
+ EW+Q
Sbjct: 581 LQEWMQ 586
>gi|317477822|ref|ZP_07937009.1| glycoside hydrolase [Bacteroides sp. 4_1_36]
gi|316906021|gb|EFV27788.1| glycoside hydrolase [Bacteroides sp. 4_1_36]
Length = 820
Score = 298 bits (762), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 194/604 (32%), Positives = 311/604 (51%), Gaps = 50/604 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEFDD----SHLKYAEE 122
+R L+ G+ EA F YQ+LGD++++F S L
Sbjct: 89 AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR L+L A A + + +V++ RE+F S V++ + G+L+F+ L
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGREGTLNFSARLSRAEH 208
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V GN ++M+G + P + G+++ +++ + ++S L
Sbjct: 209 SSVTVQGNT-LLMDGMLESGK--PGLD------GMKYRVAMQLVQNGGESSVSPGNGICL 259
Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K W +L A + F G + DS P + ++ SI + S S+
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTTPANSPCSILHSSLSN---- 315
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ ++ L+ RVS+ L +P D T+P+ ER+ F E P+L L + +
Sbjct: 316 HVTAHRFLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISS+RPG+ NLQG+W +S W+ H NIN++MN+W LSE +PL
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWANGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423
Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L +G +A+ Y A GWV+H T++W +A W GGAWLC HL
Sbjct: 424 TLMERLVPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT DRD+L +R YP+L+G A F + E G+L T P++SPE+ F P +
Sbjct: 483 WEHYLYTQDRDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541
Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
VS TMD+ ++ E+++ +I+AA +L+ + D V K+ L + P +I+++G +
Sbjct: 542 TPVSICMGPTMDVQLLTELYTNVIAAARLLDCDAD-YVAKLEADLKKFPPMQISKEGYLQ 600
Query: 592 EWVQ 595
EW++
Sbjct: 601 EWLE 604
>gi|115443166|ref|XP_001218390.1| predicted protein [Aspergillus terreus NIH2624]
gi|114188259|gb|EAU29959.1| predicted protein [Aspergillus terreus NIH2624]
Length = 796
Score = 298 bits (762), Expect = 8e-78, Method: Compositional matrix adjust.
Identities = 195/596 (32%), Positives = 308/596 (51%), Gaps = 55/596 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + ++PIGNGRLGA VWG E + LNE+++W+G D NP+A + R
Sbjct: 31 YESPASDYAGSLPIGNGRLGATVWG-TAVEKITLNENSIWSGPFQDRVNPNAYDGFTQAR 89
Query: 77 SLVDSGQYAEATAASVKLFGH----PADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
SL++ G A +++ P + Y LG + L+F+ H YRR LDL +
Sbjct: 90 SLLEKGDMTGAGEVTLRDMASIPTSPRE-YHPLGVLHLDFN--HDVNLMTNYRRSLDLYS 146
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGN 190
A V+Y V ++RE+ +S P VI +++ SE G+L+ SL D + ++S + N
Sbjct: 147 GNAVVEYDYNGVRYSREYIASAPAGVIAIRVTASEPGNLTVACSLARDRYVIDNSASSPN 206
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
I+ R+ AN D IQF I E +I G + + + + +
Sbjct: 207 ETGIL-------RL--MANTGDMEDPIQF--ISEARIIGHGGRVVSNSTTVVVRDATSVE 255
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ +S + P + K++ +E L + Y+ + T + D+ L RV+I
Sbjct: 256 IFFDAETS-----YRYPDEDKRE--AEMDRKLSTAMGRGYNAVKTAAVADHLSLARRVNI 308
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ--TDEDPSLVELLFQFGRYLLISSSR-- 366
+L S + +P+ R+K+++ D DP L L+F FGR+ LI+SSR
Sbjct: 309 KLG-----------SSGSAGQLPTDTRLKNYKDNPDSDPELATLMFNFGRHSLIASSRQS 357
Query: 367 --PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
PG ANLQGIWN+D SP W V++NLEMNYW + NL++ +P D + +
Sbjct: 358 GSPGLP-ANLQGIWNQDYSPAWGGKYTVDVNLEMNYWPAEVTNLADTFDPFMDLMDTVVP 416
Query: 425 NGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+G A+ Y G+V+HH TD+W ++ W +WPMG AWL +L +HY +T
Sbjct: 417 HGIDVAKRMYQCDNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGSAWLSENLMQHYRFTQ 476
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVS 537
+++ L +R +PLL+ A F +L E DGY + PS SPE+ FI P GK +
Sbjct: 477 NKEVLRERIWPLLKSAAQFYYCYLFE-FDGYFSSGPSISPENAFIVPSDMSVAGKSEGID 535
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S TMD A++ E+F+++I A++LE + V+K + L +++P +I DG I+EW
Sbjct: 536 ISPTMDNALLYELFNSVIETADILEITGEE-VDKAKEYLAKIKPPQIGSDGQILEW 590
>gi|270294825|ref|ZP_06201026.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274072|gb|EFA19933.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 820
Score = 297 bits (761), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 193/604 (31%), Positives = 309/604 (51%), Gaps = 50/604 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEFDD----SHLKYAEE 122
+R L+ G+ EA F YQ+LGD++++F S L
Sbjct: 89 AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR L+L A A + + +V++ RE+F S V++ + G+L+F+ L
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGREGTLNFSARLSRAEH 208
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V GN ++M+G + G+++ +++ + ++S L
Sbjct: 209 SLVTVQGNT-LLMDGML--------ESGKPGLDGMKYRVAMQLVQNGGESSVSPENGICL 259
Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K W +L A + F G + DS P + ++ SI + S S+
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTTPANSPCSILHSSLSN---- 315
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ ++ L+ RVS+ L +P D T+P+ ER+ F E P+L L + +
Sbjct: 316 HVTAHRFLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISS+RPG+ NLQG+W +S W+ H NIN++MN+W LSE +PL
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWTNGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423
Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L +G +A+ Y A GWV+H T++W +A W GGAWLC HL
Sbjct: 424 TLMERLVPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT DRD+L +R YP+L+G A F + E G+L T P++SPE+ F P +
Sbjct: 483 WEHYLYTQDRDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541
Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
VS TMD+ ++ E+++ +I+AA +L+ + D V K+ L + P +I+++G +
Sbjct: 542 TPVSICMGPTMDVQLLTELYTNVIAAARLLDCDAD-YVAKLEADLKKFPPMQISKEGYLQ 600
Query: 592 EWVQ 595
EW++
Sbjct: 601 EWLE 604
>gi|302884741|ref|XP_003041265.1| hypothetical protein NECHADRAFT_88794 [Nectria haematococca mpVI
77-13-4]
gi|256722164|gb|EEU35552.1| hypothetical protein NECHADRAFT_88794 [Nectria haematococca mpVI
77-13-4]
Length = 765
Score = 297 bits (760), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 194/589 (32%), Positives = 304/589 (51%), Gaps = 44/589 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ +N P+ +++++P+GNGRLGA+V G +E L+LNE+++W+G P + T PDA + L
Sbjct: 8 LRLQYNSPSSQWSESLPVGNGRLGAVVHGQPGAEVLQLNENSVWSGGPQERTPPDARRML 67
Query: 73 SDVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+RSL+ + ++AEA A + F +P Y+ +G EF + Y R LD
Sbjct: 68 PKLRSLIRADKHAEAEALAKLAFYANPKSQRHYEPMGTASFEFGHEQVS----NYHRHLD 123
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V+Y G + R+ +S PD V++ + + S+ F V LD + D+ N
Sbjct: 124 LATAQAVVEYEHGGASYRRDMIASFPDNVLLWRFTASQ--KTRFIVRLDRINDDPIETNT 181
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
I + G RI A G + ++L D+ G I A+ V S
Sbjct: 182 YADTI---KSEGSRIVLHATPR-GAGGNRLCSVLRAVCDDEEGAIEAV--GSCLVINSAS 235
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ + A ++F P DP + + + ++S+L RH DY+ LF R+S
Sbjct: 236 CTIAIGAQTTFRHP---------DPELVATTDVDCALMRTWSELVVRHRRDYEGLFGRMS 286
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+++ + TD R+++ Q+ DP LV L +GRYLLISSSR G
Sbjct: 287 LRMWPDASEKPTDA-------------RLETRQS-RDPGLVALYHNYGRYLLISSSRDGH 332
Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL-SECQEPLFDFLTYLSING 426
+ A LQGIWN +P W S +NINL+MNYW + PC+L EC P+ D L +SI G
Sbjct: 333 RALPATLQGIWNPSFTPPWGSKYTININLQMNYWLTAPCSLVDECTLPVIDLLERMSIRG 392
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+TA+ Y GW HH TDIWA +S + +WP+GG W+ + + Y +
Sbjct: 393 QETAKAMYGCRGWCAHHNTDIWADTSPQDHWISATVWPLGGLWVSVTVMDMLRYQYSEE- 451
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L +R + EG F++D+L+ DG YL NPS SPE+ F + G++ STMDM
Sbjct: 452 LHRRIFACHEGAVQFVIDFLVPSSDGLYLIANPSISPENTFYSTTGEVGVFCEGSTMDMT 511
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEW 593
+IR + + + + LE ++ ++ V++ +L R+ P + + G I EW
Sbjct: 512 LIRVALTQFLWSLDRLEGLQEHTLKTVVQDTLDRIPPILVNDAGRIQEW 560
>gi|150009027|ref|YP_001303770.1| glycoside hydrolase [Parabacteroides distasonis ATCC 8503]
gi|149937451|gb|ABR44148.1| glycoside hydrolase family 95 [Parabacteroides distasonis ATCC
8503]
Length = 809
Score = 296 bits (759), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 197/611 (32%), Positives = 308/611 (50%), Gaps = 53/611 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L++ + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADTDR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G A D L V + A++L+ + + FD KD +S+ L
Sbjct: 246 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGAGQSLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L R +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGRGERD------------HLPINERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 404 TNLSELHLPLIEWTKLQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+ + + STMD I+RE+F+ I AA +L + E K RL PT I
Sbjct: 522 AYKMPNESVVSICAGSTMDNQILRELFTNTIEAAGILGVDSTFAAELAAKR-DRLMPTTI 580
Query: 585 AEDGSIMEWVQ 595
+DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591
>gi|160887922|ref|ZP_02068925.1| hypothetical protein BACUNI_00326 [Bacteroides uniformis ATCC 8492]
gi|156862608|gb|EDO56039.1| hypothetical protein BACUNI_00326 [Bacteroides uniformis ATCC 8492]
Length = 820
Score = 296 bits (758), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 193/604 (31%), Positives = 311/604 (51%), Gaps = 50/604 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEFDD----SHLKYAEE 122
+R L+ G+ EA F YQ+LGD++++F S L
Sbjct: 89 AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR L+L A A + + +V++ RE+F S V++ + G+L+F+ L
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGHEGTLNFSARLSRAEH 208
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V GN ++M+G + P + G+++ +++ + ++S L
Sbjct: 209 SLVTVQGNT-LLMDGMLESGK--PGLD------GMKYRVAMQLVQNGGESSVSPENGICL 259
Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K W +L A + F G + DS P + ++ +I + S S+
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTAPANSPCAILHSSLSN---- 315
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ ++ L+ RVS+ L +P D T+P+ ER+ F E P+L L + +
Sbjct: 316 HVTAHRSLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISS+RPG+ NLQG+W +S W+ H NIN++MN+W LSE +PL
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWANGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423
Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L +G +A+ Y A GWV+H T++W +A W GGAWLC HL
Sbjct: 424 TLMERLIPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT D+D+L +R YP+L+G A F + E G+L T P++SPE+ F P +
Sbjct: 483 WEHYLYTQDKDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541
Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
VS TMD+ ++ E+++ +I+AA +L+ + D V K+ L R P +I+++G +
Sbjct: 542 TPVSICMGPTMDVQLLTELYTNVIAAARLLDCDAD-YVAKLEVDLKRFPPMQISKEGYLQ 600
Query: 592 EWVQ 595
EW++
Sbjct: 601 EWLE 604
>gi|301312083|ref|ZP_07218005.1| alpha-L-fucosidase 2 [Bacteroides sp. 20_3]
gi|423339363|ref|ZP_17317104.1| hypothetical protein HMPREF1059_03029 [Parabacteroides distasonis
CL09T03C24]
gi|300830185|gb|EFK60833.1| alpha-L-fucosidase 2 [Bacteroides sp. 20_3]
gi|409230744|gb|EKN23605.1| hypothetical protein HMPREF1059_03029 [Parabacteroides distasonis
CL09T03C24]
Length = 809
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 194/611 (31%), Positives = 307/611 (50%), Gaps = 53/611 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L + + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LRYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G D L V + A++L+ + + FD KD +S+ L
Sbjct: 246 LPKGGDLTATDSSLSVRSASEAIILVSLGTDYFD----------KDGVGQSLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L + +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTFAYRSLFDRVSLDLGKGERD------------HLPIHERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GW H ++W + +A W
Sbjct: 404 TNLSELHLPLIEWTKQQVPSGERTAKAFYNARGWGTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAAQFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + STMD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 522 AYKMPNGSVVSICAGSTMDNQILRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 580
Query: 585 AEDGSIMEWVQ 595
+DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591
>gi|423668781|ref|ZP_17643810.1| hypothetical protein IKO_02478 [Bacillus cereus VDM034]
gi|423675093|ref|ZP_17650032.1| hypothetical protein IKS_02636 [Bacillus cereus VDM062]
gi|401300760|gb|EJS06350.1| hypothetical protein IKO_02478 [Bacillus cereus VDM034]
gi|401309028|gb|EJS14402.1| hypothetical protein IKS_02636 [Bacillus cereus VDM062]
Length = 1156
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 194/600 (32%), Positives = 308/600 (51%), Gaps = 69/600 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 47 LTLWYNQPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 106
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
D A L +R + G + A S + FG YQ GDI L+F+
Sbjct: 107 DGAASHLGSIREKLAKGDKSGAEKESSQFLTGLEKGFGS----YQNFGDIYLDFNMPDAS 162
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ YRREL++N A V Y+ +V++ RE+F+S PD+V+V +++ SE+ +S +V
Sbjct: 163 -SFSNYRRELNINEGIATVSYNYKDVQYNREYFTSYPDRVMVMRLTASEAKKISLDVRPT 221
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + +N+I M+G+ G+++ A K+ ++ GT++A E
Sbjct: 222 SAQGGQ-VTSVDNKITMKGQITNN-------------GMKYEAAF--KVLNEGGTLTA-E 264
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I SY L H+
Sbjct: 265 NGKIKVANADSLTIIMTAATDYENKY--PAYKGEDPHEKVEKTMAAISKKSYEVLKYTHI 322
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ + L EL FQ+GR
Sbjct: 323 KDYHSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 369
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE PL D+
Sbjct: 370 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETALPLMDY 429
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ ++WE
Sbjct: 430 VDSLREPGRVSAEKHFGVKGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNVWE 488
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP++ A F +L+E + L +P SPE L +
Sbjct: 489 HYKFTDDKQYLKEKIYPIINEAAEFHSKFLVEDQNKKLVVSPCWSPE---------LGGI 539
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ + D L K + P P +I G + EW
Sbjct: 540 SNGCAFDQQLVYELFSNVIEASEVLQIDNVFRDELKAKRDRLFP---PIQIGRYGQVQEW 596
>gi|265753143|ref|ZP_06088712.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|263236329|gb|EEZ21824.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
Length = 803
Score = 295 bits (756), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 182/598 (30%), Positives = 311/598 (52%), Gaps = 44/598 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 24 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 83
Query: 66 -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 84 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 143
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 144 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 200
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 201 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 249
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 250 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 300
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 301 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 349
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 350 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 409
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW Y
Sbjct: 410 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 468
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 469 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 528
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +
Sbjct: 529 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 585
>gi|212695253|ref|ZP_03303381.1| hypothetical protein BACDOR_04793 [Bacteroides dorei DSM 17855]
gi|237711725|ref|ZP_04542206.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|212662163|gb|EEB22737.1| hypothetical protein BACDOR_04793 [Bacteroides dorei DSM 17855]
gi|229454420|gb|EEO60141.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
Length = 818
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 181/599 (30%), Positives = 309/599 (51%), Gaps = 46/599 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 39 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 98
Query: 66 -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 99 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 158
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 159 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 215
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 216 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 264
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 265 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 315
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
L++RVSI + +P+ R K + + D L L FQ+GRYL
Sbjct: 316 NTLYNRVSIHFGQDANR------------AMPTDVRWKQVKEGKTDTGLDALFFQYGRYL 363
Query: 361 LISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF
Sbjct: 364 TIASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFT 423
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
++ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW
Sbjct: 424 YIKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQ 482
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACV 536
Y +T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+
Sbjct: 483 YEFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVA 542
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +
Sbjct: 543 SMMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 600
>gi|312131012|ref|YP_003998352.1| alpha-l-fucosidase [Leadbetterella byssophila DSM 17132]
gi|311907558|gb|ADQ17999.1| Alpha-L-fucosidase [Leadbetterella byssophila DSM 17132]
Length = 805
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 210/603 (34%), Positives = 315/603 (52%), Gaps = 61/603 (10%)
Query: 6 STSTTNPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
S S K+ + PA K + A+P+GNG +G MV+G E + LNE + W+G P +
Sbjct: 14 SLSFAQEYKMWYQNPAGKVWEKALPVGNGFIGGMVYGNTEEERIDLNETSFWSGGPYATS 73
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAE 121
+L +RSLV S +Y EA A+ LF H + ++ +G + L+F +
Sbjct: 74 PTLNRDSLEKLRSLVFSEKYKEAENMANRVLFSHGSHGQMFLPIGSLILKFPG---QKEA 130
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+Y RELDL+ A A ++SVG + RE F+ ++V+V K+S +E+ ++
Sbjct: 131 TSYYRELDLSKAVASTRFSVGKQNYEREVFTPLQEKVLVMKLSSTEAMNVEVLYRTPLPE 190
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDK 240
V GN E + G+ I A++ +G ++F I+ +K S G S+ D
Sbjct: 191 GRVVQVQGN-----ELQIGGRNI-----AHEGSEGALRFHGIIHVKQS---GGNSSRTDS 237
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L + + VL + ++++ D K + SAL+S Y++L +H++
Sbjct: 238 SLIISNAKELVLYVSLATNYQSYQDVSGDEKALARARLTSALKS----PYTELKRKHIEK 293
Query: 301 YQKLFHRVSIQLS---RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
YQ L++RV + L R P DI R++ F+ DP L FQFG
Sbjct: 294 YQSLYNRVELTLGSDRREPTDI-----------------RLEKFREGNDPGFAALYFQFG 336
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSS+PG Q ANLQGIWN + P WDS +NIN EMNYW + NLSE +PLF+
Sbjct: 337 RYLLISSSQPGGQPANLQGIWNASIRPPWDSKYTININTEMNYWPAERTNLSEMHKPLFE 396
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ L+ G+ TA+ Y A GWV HH TD+W + + + LWP GGAWL H+WEH
Sbjct: 397 MVKDLTKTGAVTAKRLYGAGGWVAHHNTDLW-RLTWPVDAAFYGLWPSGGAWLSQHIWEH 455
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG--YLETNPSTSPEHEFIAPDG-KLA 534
Y YT + FL K +L G A F +D +++ H YL NPSTSPE+ AP+ + +
Sbjct: 456 YQYTGNLHFL-KENQEVLFGAARFYVD-ILQKHPKYPYLVINPSTSPEN---APEAHQRS 510
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+S TMD + +VF I A+++L + D+L +++LK LP P I + G +
Sbjct: 511 SLSAGVTMDNQLAFDVFQNAIWASKILGVKTQFSDSL-KQLLKQLP---PMHIGKHGQLQ 566
Query: 592 EWV 594
EW+
Sbjct: 567 EWL 569
>gi|423241353|ref|ZP_17222466.1| hypothetical protein HMPREF1065_03089 [Bacteroides dorei
CL03T12C01]
gi|392641729|gb|EIY35503.1| hypothetical protein HMPREF1065_03089 [Bacteroides dorei
CL03T12C01]
Length = 800
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 182/598 (30%), Positives = 311/598 (52%), Gaps = 44/598 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 21 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 80
Query: 66 -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 81 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 140
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 141 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 197
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 198 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 246
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 247 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 297
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 298 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 346
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 347 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 406
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW Y
Sbjct: 407 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 465
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 466 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 525
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +
Sbjct: 526 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 582
>gi|423231014|ref|ZP_17217418.1| hypothetical protein HMPREF1063_03238 [Bacteroides dorei
CL02T00C15]
gi|423244725|ref|ZP_17225800.1| hypothetical protein HMPREF1064_02006 [Bacteroides dorei
CL02T12C06]
gi|392630134|gb|EIY24136.1| hypothetical protein HMPREF1063_03238 [Bacteroides dorei
CL02T00C15]
gi|392641574|gb|EIY35350.1| hypothetical protein HMPREF1064_02006 [Bacteroides dorei
CL02T12C06]
Length = 800
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 182/598 (30%), Positives = 311/598 (52%), Gaps = 44/598 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 21 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 80
Query: 66 -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 81 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 140
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 141 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 197
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 198 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 246
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 247 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 297
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 298 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 346
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 347 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 406
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW Y
Sbjct: 407 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 465
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 466 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 525
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +
Sbjct: 526 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 582
>gi|311746349|ref|ZP_07720134.1| fibronectin type III domain protein [Algoriphagus sp. PR1]
gi|126575233|gb|EAZ79565.1| fibronectin type III domain protein [Algoriphagus sp. PR1]
Length = 778
Score = 295 bits (756), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 192/591 (32%), Positives = 310/591 (52%), Gaps = 42/591 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA-LSDV 75
+ PA+ + +A+P+GNGRLGAMV+G E ++LNED+LW G GD+ ++ L +
Sbjct: 27 YTSPAEIWEEALPVGNGRLGAMVFGKPSMERIQLNEDSLWPGEQGDWGIAKGRRSDLDQI 86
Query: 76 RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
R+ + +G+ ++ + V F A +Q LGD+ L+FD + Y+R LDL TA
Sbjct: 87 RAYLRAGENEKSDSLLVAAFSRKAITRSHQTLGDLWLDFDFQEIS----DYKRSLDLTTA 142
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN--- 190
A + T+E SS PD IV ++ + + L S ++ +
Sbjct: 143 VASSTFKSQGYTVTQEVLSSAPDDAIVIRLKTNHPDGFVGKIRL-SRPEDEGFATAETKS 201
Query: 191 ---NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N + M G ++ +N G++F ++ ++ D G ++ D L++ GS
Sbjct: 202 LSENTLSMAGMITQRKGQLDSNPYPLLTGVKFKTLVYVETED--GNLNNGVDY-LELSGS 258
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
++ LV +SF +D + L++++ ++ + H+ DY + F R
Sbjct: 259 KEVLIKLVTETSF---------YNQDFDHAAELELENVKTKNWEGILEPHIQDYSQWFER 309
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
+ ++L ++ + VP+ R+++ Q D L +LLF +GRYLLISSSR
Sbjct: 310 MELKLGKAA------------MSEVPTDVRIENVQAGGVDLHLEKLLFDYGRYLLISSSR 357
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG ANLQGIWN+D++ W++ H+NINL+MNYW + NLS+ +PLFDF+ + G
Sbjct: 358 PGNNPANLQGIWNKDINAPWNADYHLNINLQMNYWPADVTNLSKLNQPLFDFVDGVIHRG 417
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+ AQ N+ +G + H TD+W W W G W+ H W+HY +T D F
Sbjct: 418 QEVAQTNFGMAGTFLPHATDLWQVPFMRAATAYWGGWVGAGGWMARHYWDHYLFTKDERF 477
Query: 487 LEKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L +RA+P + +F DWL+E + L + PSTSPE+ F G+ + + MD
Sbjct: 478 LRERAFPAISQVTAFYSDWLVEYPGENTLVSAPSTSPENRFFNEAGRPVATTMGAAMDQQ 537
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWVQ 595
II +VFS+ ++A+E+L +E L ++V + L RLRP +IAEDG I+EW Q
Sbjct: 538 IIADVFSSFLAASEIL-NSESRLRDRVKEQLARLRPGVQIAEDGRILEWDQ 587
>gi|332881351|ref|ZP_08449001.1| hypothetical protein HMPREF9074_04789 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357045233|ref|ZP_09106870.1| hypothetical protein HMPREF9441_00875 [Paraprevotella clara YIT
11840]
gi|332680727|gb|EGJ53674.1| hypothetical protein HMPREF9074_04789 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355531816|gb|EHH01212.1| hypothetical protein HMPREF9441_00875 [Paraprevotella clara YIT
11840]
Length = 798
Score = 295 bits (755), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 184/590 (31%), Positives = 307/590 (52%), Gaps = 49/590 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDV 75
++ PA + ++P+GNGR+GAMV+GGV ET+ LNE ++W G + P + L ++
Sbjct: 29 YDAPADEWMKSLPVGNGRVGAMVFGGVNEETVALNESSMWAGEYDPNQEKPFGREKLDEL 88
Query: 76 RSLVDSGQYAEATA-ASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L G+ E A +L G H + +GD++++FD + + E YRRELDL
Sbjct: 89 RKLFFEGKLIEGNGIAGRELVGTPHSFGTHLPIGDLKIKFDYTGKEGGVEDYRRELDLTN 148
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A V + G ++ RE SSNP +V + + S+SF++ + + GN
Sbjct: 149 AVVTVSFKKGGTKYKREFISSNPQDAVVMHFTADKKQSVSFDMRMKMITAAQVRTEGNLL 208
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
+ G+ + PK G+ F + +K+ DRG + A + ++V+ +D +
Sbjct: 209 VF-----DGQALFPKLGTG----GVHFQGRVVVKV--DRGEVEA-TGETVRVKHAD--AV 254
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS--YSDLYTRHLDDYQKLFHRVSI 310
+VA D K+ ES+ + ++ + + H+ DY LF RVS+
Sbjct: 255 TIVADVRTD---------YKNGQYESLCEKTVEKAIARPFETMKEEHVADYAPLFARVSL 305
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGT 369
+L+ K ++P R K+ + ++D L L FQ+GRYL I+SSR +
Sbjct: 306 KLADDSKK------------SIPVDRRWKALCEGNKDAGLQALFFQYGRYLTIASSRENS 353
Query: 370 QV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
+ LQG +N++L+ W S H++IN E NYW + NL+EC PLF ++ L+ +G
Sbjct: 354 PLPIALQGFFNDNLACNMCWTSDYHLDINTEQNYWLTNVGNLAECNAPLFTYIADLAHHG 413
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+KT + Y GW H ++W ++ G + W L+P+ G+W+ THLW Y YT+D+D+
Sbjct: 414 AKTVRTVYGCKGWTAHTVANVWGFTAPSEG-MGWGLFPLAGSWMATHLWTQYEYTLDKDY 472
Query: 487 LEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYPLL+G A FLLD+++E + GY+ T P SPE+ F +L S +T D
Sbjct: 473 LRRTAYPLLKGNAEFLLDYMVEDPNTGYMVTGPCVSPENSFRYQGWELG-ASMMTTCDKV 531
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ E+ SA + A+++L ++ A + + +L + P +I G + EW +
Sbjct: 532 LAHEIMSACVQASDILGVDK-AFADSLRLALAKFPPFRINSFGGLCEWYE 580
>gi|262383921|ref|ZP_06077057.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_33B]
gi|262294819|gb|EEY82751.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_33B]
Length = 809
Score = 295 bits (754), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 196/611 (32%), Positives = 308/611 (50%), Gaps = 53/611 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L++ + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G A D L V + A++L+ + + FD KD +S+ L
Sbjct: 246 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGVGQSLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L + +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGKGERD------------HLPINERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDFHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 404 TNLSELHLPLIEWTKQQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+ + + STMD I+RE+F+ I AA +L + E K RL PT I
Sbjct: 522 AYKMPNESVVSICAGSTMDNQILRELFTNTIEAAGILGVDSTFAAELAAKR-DRLMPTTI 580
Query: 585 AEDGSIMEWVQ 595
+DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591
>gi|256425749|ref|YP_003126402.1| alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
gi|256040657|gb|ACU64201.1| Alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
Length = 778
Score = 295 bits (754), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 191/607 (31%), Positives = 310/607 (51%), Gaps = 48/607 (7%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M A + NPL + ++ PA + + +P+GNGRLG M GG+ +E + LN+ TLW+G P
Sbjct: 16 MPAALCKAQQNPLTLKYDKPAAVWEETLPLGNGRLGMMPDGGIQTEKVVLNDITLWSGAP 75
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEF 112
+ N +A K L ++ L+ G+ EA + K F P YQ LG+++++F
Sbjct: 76 QNANNYEAYKQLPKIQELLKEGRNDEAQSLMDKDFICTGKGSGDVPFGCYQTLGELQIQF 135
Query: 113 D-DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
D K Y R+L L A A Y V NV + RE+F+S D + +++ S++G L
Sbjct: 136 AYDKADKVEPTAYERKLSLQQAIASCSYKVNNVTYNREYFTSFGDDLSFIRLTASQAGKL 195
Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
+ +++ S + + N ++++ G+ ++ +D KG+Q+ A +K
Sbjct: 196 NLRITM-SRPEKAATRTENGELLLYGQL---------DSGNDTKGMQYQA--NVKAQLKG 243
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
GTI+ E+ L ++ + +L + A + F + +D KK ++ +A++ Y
Sbjct: 244 GTITT-EEHALVIKNATEVILYVAAGTDF-----HKNDFKKQISTVLATAVKK----PYE 293
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSL 349
H+ +Y KLF+RV + L + T+ + +R+ +F + D L
Sbjct: 294 AQKQAHMRNYTKLFNRVQVDLGKG------------TAGTLTTDKRLAAFYNNAAADNEL 341
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
L +QFGRYL I S+R G NLQG+W + W+ H+++N++MN+W NLS
Sbjct: 342 PVLFYQFGRYLTICSTRKGLLPPNLQGLWANQVHTPWNGDYHLDVNVQMNHWPVEVSNLS 401
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
E PL D + L G +TA+ Y A GWV H T++W + W G W
Sbjct: 402 ELNLPLADLVKGLVAPGQRTAKAYYNAPGWVAHVITNVWGFTEPGE-SASWGATKSGSGW 460
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIA 528
LC +LWEHY +T D+ +L YP+L+G A F LI+ G+L +PS+SPE+ F
Sbjct: 461 LCNNLWEHYAFTNDKKYLAD-IYPVLKGSAEFYNSLLIKDEKTGWLVMSPSSSPENAFYL 519
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
P+GK A + +T+D I+R++F+ II+A+ L + D E K P IA DG
Sbjct: 520 PNGKHASICIGATIDNQIVRDLFNNIITASTELGIDADFKKELQQKVALLPPPGVIAPDG 579
Query: 589 SIMEWVQ 595
IMEW++
Sbjct: 580 RIMEWLE 586
>gi|345513833|ref|ZP_08793348.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|345456122|gb|EEO45721.2| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
Length = 800
Score = 295 bits (754), Expect = 6e-77, Method: Compositional matrix adjust.
Identities = 182/598 (30%), Positives = 311/598 (52%), Gaps = 44/598 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 21 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 80
Query: 66 -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 81 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 140
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 141 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 197
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 198 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 246
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 247 VSIKEADTVTLIVDVRTDYKSP---------DYKTLCADGVEKAAVKSYDELKQAHIKDY 297
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 298 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 346
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 347 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 406
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW Y
Sbjct: 407 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 465
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 466 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 525
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +
Sbjct: 526 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 582
>gi|281419724|ref|ZP_06250723.1| putative alpha-L-fucosidase 2 [Prevotella copri DSM 18205]
gi|281406253|gb|EFB36933.1| putative alpha-L-fucosidase 2 [Prevotella copri DSM 18205]
Length = 1246
Score = 295 bits (754), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 202/634 (31%), Positives = 313/634 (49%), Gaps = 66/634 (10%)
Query: 7 TSTTNPL---------KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
T+ TNP+ + +N PA ++ +A+P+GNGRLG M G V +TL+LNEDT W
Sbjct: 333 TADTNPIPAPTIESKNHLWYNKPAGYWEEALPLGNGRLGVMHSGSVACDTLQLNEDTFWD 392
Query: 58 GVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDD 114
P N +A L +V+ + + YA +V + G Y+ G + L F
Sbjct: 393 QGPNTNYNANAFGVLREVQQGIFNKDYASVQNLAVTNWMSQGSHGASYRAAGVVLLGFPG 452
Query: 115 SHLKYAE----------ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS 164
E + Y R LD+NTAT+ V+Y V V + R F+S D V V ++
Sbjct: 453 QRFDDMESAQTSDAVDAQGYVRYLDMNTATSNVEYHVKGVGYKRTVFTSFKDNVTVVRLE 512
Query: 165 GSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
+ G L FNV+ ++ +N + E P + + + L
Sbjct: 513 ADQKGKLDFNVAYAGCNKSNIEKLTSNVLYDEHTVKATMGPARDKCENVENKLNLCTYLR 572
Query: 225 I-----KISDD------RGTISALEDK-KLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
I I++D +GT+ A + +L V G+ +A +++ +++F D
Sbjct: 573 IVDTDGTITNDNVNIYAQGTVGAATNAPRLNVTGATYATIIISQATNFK----KYDDVSG 628
Query: 273 DPTSESMSALQSIRNLS--YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENID 330
D ++ +++ L++ N Y + H Y+ F RV + L+ + ++E+ +
Sbjct: 629 DASASALAYLEAYENSKKDYVTTLSDHESVYRAQFDRVDLTLAGN--------ATQESKN 680
Query: 331 TVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS--PTWDS 388
T +R+K F DP L FQFGRYLLISSS+PGTQ ANLQGIWN D P WDS
Sbjct: 681 T---EQRIKEFHKTSDPQLAANYFQFGRYLLISSSQPGTQPANLQGIWNPDARQYPAWDS 737
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
NIN+EMNYW + NL+EC EP + + +S+ G++TA+ Y A GW +HH TDIW
Sbjct: 738 KYTSNINVEMNYWPAEVTNLAECHEPFVEMVKDVSVTGAETAKKMYGARGWALHHNTDIW 797
Query: 449 AKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
+ A D G V +WP AW C+HLWE Y ++ D+ +L + YP+++G A F D+L+
Sbjct: 798 RTTGAVDNGTV--GVWPTCNAWFCSHLWERYLFSGDKTYLAE-VYPIMKGAAEFFQDFLV 854
Query: 508 EG-HDGYLETNPSTSPEHE-----FIAPDGKLACVSY--SSTMDMAIIREVFSAIISAAE 559
+ + GY+ PS SPE+ + PDGK A ++ MD ++ ++ AA
Sbjct: 855 KDPNTGYMVVCPSNSPENHPGIGSYTKPDGKTANIALFGGVAMDNEMVYDLLKNTALAAR 914
Query: 560 VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
L+K+ D ++ P KI + G + EW
Sbjct: 915 ALDKDADFADALDALK-AQITPWKIGQYGQVQEW 947
>gi|329962213|ref|ZP_08300219.1| hypothetical protein HMPREF9446_01800 [Bacteroides fluxus YIT
12057]
gi|328530321|gb|EGF57198.1| hypothetical protein HMPREF9446_01800 [Bacteroides fluxus YIT
12057]
Length = 834
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 195/619 (31%), Positives = 315/619 (50%), Gaps = 68/619 (10%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GGV E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 RLYYTKPASVWEETLPLGNGRLGMMPDGGVLREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLF---GHPAD----VYQLLGDIELEF-----------DDS 115
+R L+ G+ EA F AD YQ LG ++++F +
Sbjct: 89 AIRKLLFEGKNREAQELMYSSFVPKKQEADGRYGTYQTLGTLDIDFAYQSQTSVSKSESL 148
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
L YRR LDL A A +++ V++ RE+F S V++ ++ G+L+F+
Sbjct: 149 ALDGGTSRYRRCLDLRDAVAYTAFALEGVDYRREYFVSRDRDVMLVHLTAGSKGALNFSA 208
Query: 176 SLDSLLDNHSYVNGNNQII---MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
L V GN ++ +E PG+ +G+++ + +++ D G
Sbjct: 209 RLGRAEHGTVTVKGNALLMDGTLESGSPGR------------EGMKYR--VAMQLVSDGG 254
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSIRN--- 287
++A + + ++ A L+L A++S+ + S+ +S+ +A I+N
Sbjct: 255 EVAADPENGISLKHGQEAWLVLSATTSYAAEGTDFPGSRYAEVCDSLLKNAGVQIKNEMR 314
Query: 288 ----LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ + H ++ L+ RVS+ L +P D T+P+ ER+ F
Sbjct: 315 MRGMAAEATALKSHAAAHRSLYDRVSLTLPSTPDD------------TLPTDERILRFTR 362
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
E P+L L + +GRYLLISS+RPG+ NLQG+W L W+ H NIN++MN+W
Sbjct: 363 QESPALAALYYNYGRYLLISSTRPGSLPPNLQGLWANSLLTPWNGDYHTNINVQMNHWPL 422
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWA 461
LSE +PL + L +G TA+ Y A GWV+H T++W +A W
Sbjct: 423 EQAGLSELYQPLTTLMERLVPSGEATARTFYGKEAEGWVLHMMTNVW-NYTAPGEHPSWG 481
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPST 520
GGAWLC HLWEHY YT D+D+L +R YP+L+G A F + E G+L T P++
Sbjct: 482 ATNTGGAWLCAHLWEHYLYTQDKDYL-RRIYPVLKGAARFFSSTTVEEPSHGWLVTAPTS 540
Query: 521 SPEHEFIAPDGKLACVS--YSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSL 576
SPE+ F P + VS TMD+ ++ E+++ +I+AA +L + + A +E LK
Sbjct: 541 SPENSFYVPGDSVTPVSICMGPTMDVQLLTELYTNVITAARLLGCDAEYAAKLEADLKKF 600
Query: 577 PRLRPTKIAEDGSIMEWVQ 595
P P +I+++G + EW++
Sbjct: 601 P---PMQISKEGYLQEWLE 616
>gi|330996466|ref|ZP_08320348.1| hypothetical protein HMPREF9442_01433 [Paraprevotella xylaniphila
YIT 11841]
gi|329573022|gb|EGG54641.1| hypothetical protein HMPREF9442_01433 [Paraprevotella xylaniphila
YIT 11841]
Length = 798
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 182/588 (30%), Positives = 306/588 (52%), Gaps = 45/588 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDV 75
++ PA + ++P+GNGR+GAMV+GGV ET+ LNE ++W G + P L +
Sbjct: 29 YDAPADEWMKSLPVGNGRVGAMVFGGVDEETVALNESSMWAGEYDPNQEKPFGRARLDSL 88
Query: 76 RSLVDSGQYAEATA-ASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L +G+ E A +L G H + +GD++++FD + + E YRRELDL
Sbjct: 89 RELFFAGKLIEGNGIAGRELVGTPHSFGTHLPIGDLKIKFDYAGKEGGVEDYRRELDLTN 148
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A V + G ++ RE+ SSNP +V + + S+SF++ + + GN
Sbjct: 149 AVATVSFKKGGTKYKREYISSNPQDAVVMHFTADKKQSVSFDMRMKMITAAQVRTEGNLL 208
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
+ G+ + PK G++F + +K+ D G + A + ++V+ +D +
Sbjct: 209 VF-----DGQALFPKLGTG----GVKFQGRVVVKV--DNGEVEA-AGETVRVKHAD--AV 254
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+VA D + + E+++ + + H+ DY LF RVS++L
Sbjct: 255 TIVADVRTDYKNGQYASLCEKTVGEAIAR-------PFETMKEEHVADYAPLFARVSLKL 307
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
+ K +VP R K+ + ++D L L FQ+GRYL I+SSR + +
Sbjct: 308 ADDSKK------------SVPVDRRWKALCEGNKDAGLQALFFQYGRYLTIASSRENSPL 355
Query: 372 -ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
LQG +N++L+ W S H++IN E NYW + NL+EC PLF ++ L+ +G+K
Sbjct: 356 PIALQGFFNDNLACNMCWTSDYHLDINTEQNYWLANVGNLAECNAPLFTYIADLARHGAK 415
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
T + Y GW H ++W ++ G + W L+P+ G+W+ THLW Y YT+D+D+L
Sbjct: 416 TVRTVYGCKGWTAHTVANVWGFTAPSEG-MGWGLFPLAGSWMATHLWTQYEYTLDKDYLR 474
Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ AYPLL+G A FLLD+++E + GY+ T P SPE+ F +L S +T D +
Sbjct: 475 RTAYPLLKGNAEFLLDYMVEDPNTGYMVTGPCVSPENSFRYQGWELG-ASMMTTCDRVLA 533
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
E+ SA + A+++L ++D + + +L + P ++ G + EW +
Sbjct: 534 HEIMSACVQASDILGVDKD-FADSLRLALAKFPPFRVNSYGGLCEWYE 580
>gi|427388255|ref|ZP_18884138.1| hypothetical protein HMPREF9447_05171 [Bacteroides oleiciplenus YIT
12058]
gi|425724838|gb|EKU87712.1| hypothetical protein HMPREF9447_05171 [Bacteroides oleiciplenus YIT
12058]
Length = 829
Score = 293 bits (751), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 192/611 (31%), Positives = 319/611 (52%), Gaps = 59/611 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY NPDA ++L
Sbjct: 33 QLYYTTPATIWEETLPLGNGRLGMMPDGGIDKEHIVLNEISLWSGMEADYGNPDASRSLP 92
Query: 74 DVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLK--YAEET- 123
++ L+ G+ EA F G YQ+L D+ L F K ++ +T
Sbjct: 93 AIQQLLFEGKNREAQELMYNSFVPKKPESGGTYGNYQMLADLTLNFSIPVKKEFFSGDTV 152
Query: 124 ----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
YRR LDL A A ++ G +++ RE+++S V++ ++ S SL F SL
Sbjct: 153 PVTGYRRWLDLRDAVAYTTFTKGGIDYQREYYTSRDKDVMIIHLTASRRRSLFFTASLSR 212
Query: 180 LLDNH-SYVNGNNQ----IIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKISDD 230
S+V GN + +++EG PG+ G+++ + + D
Sbjct: 213 PQQGTVSFVPGNGKESGTLLLEGVLDSGKPGQ------------DGMKYRVAMRVVSKDG 260
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSIRNL 288
+ ISA E+ + +G++ A L++ A++S+ + S S+ +S+ +A QS L
Sbjct: 261 KQHISA-ENGVMLTQGTE-AWLVISATTSYAAAGTDFSGSRYKEVCDSLLNAATQSHSQL 318
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
S + ++ +++L+ RVS+ L + D +P+ ER+ F E P+
Sbjct: 319 SILNSQLKNAS-HRELYDRVSLTLPATEDD------------ALPTNERIVRFTERESPA 365
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L L + +GRYLLISS+RPG+ NLQG+W + W+ H NIN++MN+W L
Sbjct: 366 LATLYYNYGRYLLISSTRPGSLPPNLQGLWANGIQTPWNGDYHTNINIQMNHWPLEQAGL 425
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
SE +PL + L +G +TA Y A GWV+H T++W +A W G
Sbjct: 426 SELYQPLTTLIERLVPSGKETACTFYGNRAQGWVLHMMTNVW-NYTAPGEHPSWGATNTG 484
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE 525
GAWLCTHLWEHY YT D ++L K+ YP+L+G + F ++ E G+L T P++SPE+
Sbjct: 485 GAWLCTHLWEHYQYTQDLEYL-KKIYPILKGASEFFYSTMVQEPKHGWLVTAPTSSPENA 543
Query: 526 -FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
F+ D + TMD+ ++ E+++ ++ AA +L K +D K+ +L + P +I
Sbjct: 544 FFVGDDPTPVSICMGPTMDVQLLTELYTNVVQAASIL-KCDDGYAAKLRAALEKFPPMQI 602
Query: 585 AEDGSIMEWVQ 595
+++G + EW++
Sbjct: 603 SKEGYLQEWLE 613
>gi|255014859|ref|ZP_05286985.1| glycoside hydrolase family protein [Bacteroides sp. 2_1_7]
Length = 850
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 193/611 (31%), Positives = 305/611 (49%), Gaps = 53/611 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 57 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 116
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 117 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 176
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L + + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 177 LRYMYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 236
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 237 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 286
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G D L V + A++L+ + + FD KD + + L
Sbjct: 287 LPKGGDLTATDSSLSVRSASEAIILVSLGTDYFD----------KDGVGQFLEKYLSQAE 336
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L + +D +P ER+ +F D+
Sbjct: 337 SKDFSTLRREHTLAYRSLFDRVSLDLGKGERD------------HLPIHERLAAFAQDKN 384
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 385 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 444
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL + +G +TA+ Y A GWV H ++W + +A W
Sbjct: 445 TNLSELHLPLIELTKQQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 503
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 504 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 562
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + S MD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 563 AYKMPNGSVVSICAGSMMDNQILRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 621
Query: 585 AEDGSIMEWVQ 595
+DG IMEW++
Sbjct: 622 GKDGRIMEWLE 632
>gi|410102732|ref|ZP_11297657.1| hypothetical protein HMPREF0999_01429 [Parabacteroides sp. D25]
gi|409237859|gb|EKN30654.1| hypothetical protein HMPREF0999_01429 [Parabacteroides sp. D25]
Length = 809
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 193/611 (31%), Positives = 305/611 (49%), Gaps = 53/611 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L + + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LRYMYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G D L V + A++L+ + + FD KD + + L
Sbjct: 246 LPKGGDLTATDSSLSVRSASEAIILVSLGTDYFD----------KDGVGQFLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L + +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGKGERD------------HLPIHERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL + +G +TA+ Y A GWV H ++W + +A W
Sbjct: 404 TNLSELHLPLIELTKQQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + S MD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 522 AYKMPNGSVVSICAGSMMDNQILRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 580
Query: 585 AEDGSIMEWVQ 595
+DG IMEW++
Sbjct: 581 GKDGRIMEWLE 591
>gi|409099481|ref|ZP_11219505.1| alpha-L-fucosidase [Pedobacter agri PB92]
Length = 937
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 175/503 (34%), Positives = 265/503 (52%), Gaps = 50/503 (9%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
YQ GD+ L F L Y+R LDL TA AR Y++ V +TRE+F+S P+Q IV
Sbjct: 293 YQPFGDLNLAFQHKGLI---TKYKRSLDLTTAIARTNYTIAGVNYTREYFASQPNQSIVI 349
Query: 162 KISGSESGSLSFNVSLDSLLDNHSY-VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFS 220
+S + S+S +L SL G N I + + + ++ + +
Sbjct: 350 HLSADKKASISLTAALSSLHQQSGIKALGKNTISLSVQVKDGALKGES---------RLT 400
Query: 221 AILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
A+++ G + L +K + + +D L L A ++F IN D DP + ++
Sbjct: 401 AVIK------NGAVKVLNNK-ISISKADEVTLYLTAGTNF----INAQDVSGDPAAANIK 449
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
AL ++ + + +++ RH+ +YQ +++ + +S K+ +P+ ER+
Sbjct: 450 ALNTVTDKTSAEIKNRHIKEYQSYYNKFHVDFGQSGKE------------NLPTNERLNK 497
Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
F T DP L Q+GRYLLISSSRPGTQ ANLQGIWN+ L+P W S NIN+EMNY
Sbjct: 498 FATSNDPGFAALYMQYGRYLLISSSRPGTQPANLQGIWNDLLTPPWGSKYTTNINMEMNY 557
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + NLS EPLF+ + L+ G++TA+ Y GWV+HH TD+W +A
Sbjct: 558 WPAEVLNLSALNEPLFNKINGLAKTGTETAKEYYNTPGWVLHHNTDLW-NGTAPINASNH 616
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPS 519
+W G AWL HLWEHY +T D+ FL AYPL++ A F +LI+ G+L + PS
Sbjct: 617 GIWVTGAAWLSQHLWEHYAFTGDQTFLRNEAYPLMKQAALFFDAFLIKDPKTGWLISTPS 676
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPR 578
SPE +G L TMD IIR +F I+A E+L N DA +L++ + +
Sbjct: 677 NSPE------NGGLVA---GPTMDHQIIRSLFKNCIAATEIL--NVDADFRTILQAKMKQ 725
Query: 579 LRPTKIAEDGSIMEWVQRRLNTS 601
+ P +I + G + EW + + +T+
Sbjct: 726 IAPNQIGKYGQLQEWREDKDDTT 748
Score = 82.0 bits (201), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 57/82 (69%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ +N PA+ +TDA+PIGNGRLGAMV+ GV ++ ++ NE+TLWTG P +Y A K L+
Sbjct: 29 QLWYNQPAEKWTDALPIGNGRLGAMVFAGVENDHIQFNEETLWTGKPRNYNRKGAYKYLA 88
Query: 74 DVRSLVDSGQYAEATAASVKLF 95
++R L+ G+ EA + K F
Sbjct: 89 EIRKLLFEGKQKEAEVLAQKEF 110
>gi|189460419|ref|ZP_03009204.1| hypothetical protein BACCOP_01058 [Bacteroides coprocola DSM 17136]
gi|189432851|gb|EDV01836.1| GDSL-like protein [Bacteroides coprocola DSM 17136]
Length = 1006
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 182/592 (30%), Positives = 308/592 (52%), Gaps = 45/592 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + + +P+GNGRLG M GG+ E + LNE ++W+G +Y NPDA K+L ++R
Sbjct: 233 YDEPAAQWEETLPLGNGRLGMMPDGGIVKEHIVLNEISMWSGSEANYLNPDASKSLPEIR 292
Query: 77 SLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFD-DSHLKYAEETYRREL 128
L+ G+ EA F G +Q+LG++ LE H K Y R L
Sbjct: 293 RLLFEGKNKEAQELMYTSFVPKKPEKGGTYGTFQMLGNLFLEHQYGVHEKDVPADYHRWL 352
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL+ A +S GNV + RE+ S V++ + + GS++F ++L
Sbjct: 353 DLSKGIAYTTFSRGNVNYVREYVVSRDKDVMLIHLKANVPGSINFKMNLSRP------ER 406
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G+ + + EG+ + ++ G++++AI I R T + +++ + V+ +D
Sbjct: 407 GSVRKLAEGKL---ELYGSLDSGSSQTGVRYAAIAGI-TCKGRQTNQSTDEQSITVQNAD 462
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A +++ A +SF I +++ + L + + + + YQ LF+R
Sbjct: 463 EAWIVVSAKTSFLAGEIYETEADR--------ILNDALKSNLCETVSEAILSYQALFNRA 514
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
I+L + E + + + +R++ FQ +DPSL L + +GRYLLISS+RPG
Sbjct: 515 GIRLPEN-----------EAVSHLTTDQRIERFQQQDDPSLAALYYNYGRYLLISSTRPG 563
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ NLQG+W + W+ H NIN++MN+W NLSE PL D + L +G +
Sbjct: 564 SLPPNLQGLWANEPGTPWNGDYHTNINVQMNHWPVEQANLSELYLPLVDLVKRLVPSGEE 623
Query: 429 TAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+A+ Y A GWV+H T++W +A W GGAWLC HLWEHY ++ DR++
Sbjct: 624 SAKAFYGPQAKGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHLWEHYLFSGDRNY 682
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMD 543
L YP+++G + F ++ E G+L T P++SPE+ F P D V TMD
Sbjct: 683 LAD-IYPIMKGASEFFYSTMVREPKHGWLVTAPTSSPENAFYLPGKDRTPISVCMGPTMD 741
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ ++RE+++ +I A+ +L + A E + +++ L P +I++ G +MEW++
Sbjct: 742 IQLVRELYTNVIEASHILH-TDTAYAEALQEAIGLLPPHQISKKGYLMEWLE 792
>gi|340621763|ref|YP_004740215.1| alpha-1,2-fucosidase 2 [Capnocytophaga canimorsus Cc5]
gi|339902029|gb|AEK23108.1| Alpha-1,2-fucosidase 2 [Capnocytophaga canimorsus Cc5]
Length = 806
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 201/605 (33%), Positives = 312/605 (51%), Gaps = 60/605 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA FT+++P+GNGRLGAMV+G ET+ LNE +LW+G + + +A K L
Sbjct: 23 VSVVFDQPATFFTESLPLGNGRLGAMVFGKTDVETIVLNEISLWSGGKQEADDENAHKYL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFD-DSHLK 118
++++L+ G+ EA + +K F G+ A+ YQ LG +++++ D+ +
Sbjct: 83 KEIQNLLLQGKNLEAQSLLMKHFVAKGKGTCHGNGANCHYGCYQTLGQLKIDWKSDASVT 142
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ Y+R LDL A A +Y + + F+ + VI KI ++ L ++
Sbjct: 143 H----YKRVLDLEKAVATTQYVRNGNQIEQIVFTDFNNDVIWVKIKSAQKTDLGLSLFRK 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+N + N++IM+G P N++ KG++F+ I E+ + T A
Sbjct: 199 ---ENAHFSYDKNKLIMQGTLP----------NENQKGMEFATIAEVTTDGELTTSLA-- 243
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
L+V + ++ + AS+++ + N D ++++ L++I +LS+ + +
Sbjct: 244 --GLEVRSASEVIVKISASTNYS--YENGELENTDVVKQTLAYLKAINSLSFQNALLENQ 299
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
Y K+F+R ++ S D EN+ T +R ++ TD L L + FGR
Sbjct: 300 VTYGKIFNRNRWEMPTSLTD--------ENLTTWQRLQRYQAGNTD--AQLPVLYYNFGR 349
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + NLS+ EPL F
Sbjct: 350 YLLISSSRKGLLPANLQGLWAEEYQTPWNGDYHLNINVQMNYWLAEVTNLSDLAEPLLRF 409
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L NG KTA+ Y A GWV H ++ W +S G W GGAWLC H+WEHY
Sbjct: 410 TKNLVPNGKKTAKAYYNAEGWVAHVVSNPWFFTSPGEG-ASWGSTLTGGAWLCQHIWEHY 468
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP---DGK-- 532
+T + DFL K Y +L+ A F D LI E GY T PS SPE+ + P DGK
Sbjct: 469 QFTQNIDFL-KEYYFVLKEAAHFFEDMLIKEPKSGYWVTAPSNSPENAYYLPELKDGKKQ 527
Query: 533 --LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
C+ TMDM I+RE+FS ++ A+E+L K+ D K + P I E G +
Sbjct: 528 HGFTCM--GPTMDMQIVRELFSNVLKASEILNKDTDKH-PKWKDIIKNTVPNTIGEQGDL 584
Query: 591 MEWVQ 595
EW
Sbjct: 585 NEWFH 589
>gi|326798066|ref|YP_004315885.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326548830|gb|ADZ77215.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 794
Score = 292 bits (748), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 185/608 (30%), Positives = 304/608 (50%), Gaps = 69/608 (11%)
Query: 8 STTNPLKITFNGPAKHFT-DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN- 65
S L++ ++ PA + +A+PIGNG +GAM +GG+ E ++ +E +LW+G PG N
Sbjct: 25 SQQKALQLWYDRPATDWMREALPIGNGYIGAMFFGGIGEEQIQFSEGSLWSGGPGANPNY 84
Query: 66 -----PDAPKALSDVRSLVDSGQYAEAT---------AASVKLFGHPAD-----VYQLLG 106
P+A K L +VR+L+ G+ EA A VKL G D Q +G
Sbjct: 85 NFGNRPNAWKYLGEVRALIKQGKLKEANELVEKQMTGMAPVKLAGDSTDWGDYGAQQTMG 144
Query: 107 DIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS 166
D+ ++ H + YRR LD+ A +V YSV ++ R F S P V+V K +
Sbjct: 145 DLFIKV--GHGSIPVQDYRRTLDIQRAIGKVSYSVAGNKYQRSFFGSYPQGVMVYKFTSD 202
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
+S S + + S + S+ + G P ++ + + + + +
Sbjct: 203 KSESYTLHFSTPQYKEKESFEGLRYSCV--GYVPNNKLAFET---------AYQLVTDGR 251
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
+ GT+S + K L +++ A++++ + P + D S L + +
Sbjct: 252 VKYTNGTVSVEKAKSL--------LIIHTAATAYTMQY--PHYNGNDFRSIIKKRLDAAK 301
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS-FQTDE 345
SY L+ H +DYQ LF RVS QL ++ D +P+ +R ++ F+ E
Sbjct: 302 GKSYKQLFQIHQEDYQPLFDRVSFQLQ------------GKSADHLPTDKRQQALFEGAE 349
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
D L +L FQ+GRYL+I++SRPGT +LQG WN ++P W + H NIN +M YW +
Sbjct: 350 DVGLEQLYFQYGRYLMIAASRPGTMPMHLQGKWNNSVNPPWAADYHTNINEQMLYWPAEV 409
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSEC EPL D++ L G K+A + GW+++ + + ++ + G + W +P
Sbjct: 410 TNLSECHEPLIDYIESLVEPGKKSAHDFFHTRGWIVNTMNNAFGYTAVNWG-LPWGFYPA 468
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
G AWL H+WEHY YT D+ +L RAYP+++ A F +D+L +G+L ++PS SPEH
Sbjct: 469 GAAWLTQHVWEHYAYTQDKAYLRNRAYPIMKEAARFWIDYLTLDENGHLVSSPSYSPEH- 527
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S ++MD I ++ + + AA VL+ + A + R+ P ++
Sbjct: 528 --------GGISGGASMDHQIAWDILNNSLEAAMVLD--DKAFADTAQHVRDRILPPQVG 577
Query: 586 EDGSIMEW 593
G + EW
Sbjct: 578 RWGQLQEW 585
>gi|350633298|gb|EHA21663.1| hypothetical protein ASPNIDRAFT_53702 [Aspergillus niger ATCC 1015]
Length = 833
Score = 291 bits (746), Expect = 5e-76, Method: Compositional matrix adjust.
Identities = 198/597 (33%), Positives = 297/597 (49%), Gaps = 58/597 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA +FT +PIGNGRLGA +WG +E + LNE+++W+G + NP + AL VR
Sbjct: 70 YTTPANNFTSTLPIGNGRLGAAIWG-TATENVTLNENSIWSGPFINRVNPRSYDALWPVR 128
Query: 77 SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G E A++ + G P Y LG + L+F H + Y R LDL +
Sbjct: 129 SLLAEGNMTEGNDATLANMVGIPDSPQSYSALGSLVLDF--GHDEAGISNYTRYLDLRSG 186
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V+Y+ V + RE+ +S+PD V+ ++S SE G L NV+ S L YV NN
Sbjct: 187 MAVVEYTYRAVRYRREYLASHPDNVVAVRLSSSEPGGL--NVA--SSLVRDRYVVSNNAT 242
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+ G + +A +N+ IQF+A + +SD R T S+ L+
Sbjct: 243 LSHD---GGLLTLRAYSNNVSNPIQFTAEARV-VSDGRAT-------------SNGTSLV 285
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ +S+ D FI+ S + E+ A L + + + + + DY L RV
Sbjct: 286 VRNASTID-IFIDTETSYRYSAQENWEAEIKSKLDTACSSGFVAVKKNAIADYSALAQRV 344
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR 366
+ L S + +P+ R+ +++ D DP LV L+F FGR+ LI+SSR
Sbjct: 345 DLNLG-----------SSGSAGNLPTDSRLVNYRIDPDSDPELVVLMFHFGRHSLIASSR 393
Query: 367 PGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
A NLQG+WN+D P W ++INLEMNYW + NL++ P D L +
Sbjct: 394 ATESPALPANLQGLWNQDFDPAWGGRFTIDINLEMNYWPAEVTNLADTFSPFIDLLDVVH 453
Query: 424 INGSKTAQVNYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G A+ Y S G+V+HH TD+W ++ W +WPMGGAWL +L EHY ++
Sbjct: 454 DRGLDVAESMYHCSNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGGAWLSANLIEHYRFS 513
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACV 536
D L R +PLL+ A F +L +GY T PS SPE +I P+ GK +
Sbjct: 514 RDESILRNRIWPLLQSAARFYYCYLFP-FEGYYSTGPSLSPEASYIVPNDMTTAGKEEGI 572
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ TMD +++ E+F A+I +VL N L +++P +I G I+EW
Sbjct: 573 DIAPTMDNSLLHELFQAVIETCDVLAINNTDCTTAA-SYLAKIKPPQIGSSGRILEW 628
>gi|159125849|gb|EDP50965.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 792
Score = 291 bits (746), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 187/598 (31%), Positives = 307/598 (51%), Gaps = 47/598 (7%)
Query: 11 NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
NP T + PA F +PIGNGRL +WGG + + LNE+++W+G D NP+A
Sbjct: 22 NPSTYTWYTSPAADFASTLPIGNGRLATAIWGGA-VDNITLNENSIWSGPFQDRVNPNAY 80
Query: 70 KALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
+ +D R+++++G + A ++ + P+ Y LG ++L+F H + Y R
Sbjct: 81 EGFTDSRAMLEAGNLSSANDVVLREMVSIPSSPREYHPLGSLKLDF--GHEASSLHNYTR 138
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL T A V+Y VG+V ++RE+ +S+PD V+ ++ S+ +L+ VSL+ + Y
Sbjct: 139 FLDLGTGVAGVRYHVGDVVYSREYVASHPDGVLAVRLRASKDSALNVVVSLE----RNRY 194
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
V + +G + KAN+ + I+F++ + + R T + + V G
Sbjct: 195 VESLTAVSSKGMG---TLTLKANSGQNTDPIRFTSQARVVSREGRITTNG---TSVVVTG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ + +S+ P ++++D S L + L+Y + DYQ L
Sbjct: 249 ASTVDIFFDTQTSYR----YPDETERD--SAVKKQLDAAVKLNYPAVKQAATSDYQSLSG 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLISS 364
RV + D S + P+ R+ +++T+ DP LV L+F FGR+ LI+S
Sbjct: 303 RVKL-----------DLGSSGSAGNQPTDIRLTNYKTNPNGDPELVTLMFNFGRHSLIAS 351
Query: 365 SRPGTQV---ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SR G+ ANLQGIWN+D SP W V++NLEMNYW + NL++ EP+ D +
Sbjct: 352 SREGSSSGLPANLQGIWNQDYSPAWGGKYTVDVNLEMNYWHAQVTNLADTFEPVIDLMDK 411
Query: 422 LSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ +G A+ Y +G+++HH TD+W ++ W +WPMG AWL +L + Y +
Sbjct: 412 VLPHGQDVARKMYHCDTGYILHHNTDLWGDAAPVDNGTKWTMWPMGSAWLSMNLMDQYRF 471
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D+ L +R +PLL+ A F +L E +GY + PS SPE+ F P+ GK
Sbjct: 472 TQDKTLLRERIWPLLKSAADFYYCYLFE-FEGYYTSGPSISPENAFRIPEDMTIAGKSTG 530
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ + TMD ++ E+F A+I + L+ + L K + R+R +I G I+EW
Sbjct: 531 IDLAPTMDNLLLHELFLAVIETCKALDITGEDLA-NAQKYISRIRQPQIGSYGQILEW 587
>gi|375101342|ref|ZP_09747605.1| alpha-galactosidase family protein [Saccharomonospora cyanea
NA-134]
gi|374662074|gb|EHR61952.1| alpha-galactosidase family protein [Saccharomonospora cyanea
NA-134]
Length = 1130
Score = 291 bits (745), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 201/595 (33%), Positives = 308/595 (51%), Gaps = 55/595 (9%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG----DYTNPD 67
L + ++ PA + ++ +PIG+G LGA V+GGV +E L+ NE TLWTG PG D+ N
Sbjct: 52 LTLWYDEPASDWESEILPIGSGALGAGVFGGVATERLQFNEKTLWTGGPGSAGYDFGNWK 111
Query: 68 APK--ALSDVRSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEE 122
P+ A+ +V+ +D+ Q + + KL G P YQ G++ + S + E
Sbjct: 112 EPRPGAIEEVQERIDAEQRVDPEWVASKL-GQPKQGYGAYQTFGEVRV----SGAEPQEV 166
Query: 123 T-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
T YRR LD+ A A V Y V TRE+F++ D VIV + SG E+G++ V + +
Sbjct: 167 TDYRRYLDIADAVAGVSYEADGVRHTREYFATAADDVIVARFSGDETGAVDVTVGV-TAP 225
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
DN S N +GR A A DD G+++ A L++ + G+ + D
Sbjct: 226 DNRS----KNVTAKDGRIT------FAGALDD-NGLRYEAQLQVLT--EGGSRTDNPDGS 272
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ V +D L+L A + + + P+ DP + + + Y L H+ D+
Sbjct: 273 VTVADADTMTLVLAAGTDYSDEY--PAYRGDDPHAAVTERVDAAVAEGYDALRAAHVADH 330
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
++LF RVS+ L + D+ TD D +AE ++ + L FQ+GRYLL
Sbjct: 331 RELFDRVSLDLGQRMPDLPTDELLARYRDGGLAAEERRALEA--------LYFQYGRYLL 382
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I+SSRPG+ ANLQG+WN+ SP W + HVNINL+MNYW + NLSE +PLFD++
Sbjct: 383 IASSRPGSLPANLQGVWNDSTSPPWSADYHVNINLQMNYWPAEVTNLSETTDPLFDYVDS 442
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNY 480
L G TA+ + GWV+H++T + + D W +P GAWL WEHY +
Sbjct: 443 LVAPGEVTAREMFDNRGWVVHNETTPFGYTGVHDWATAFW--FPEAGAWLAQSYWEHYLF 500
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
T D FL +RAYP+L+ + F +D L+ + DG L NPS SPE S
Sbjct: 501 TRDETFLRERAYPMLKSLSQFWIDELVTDPRDGKLVVNPSYSPEQ---------GDFSAG 551
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
++M I+ ++ ++ AAE++ E+A ++ +L L P ++ G + EW
Sbjct: 552 ASMSQQIVWDLLTSTAEAAELV-GGEEAFRSELAGTLAELDPGLRVGSWGQLQEW 605
>gi|70985434|ref|XP_748223.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66845851|gb|EAL86185.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 792
Score = 290 bits (743), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 187/598 (31%), Positives = 307/598 (51%), Gaps = 47/598 (7%)
Query: 11 NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
NP T + PA F +PIGNGRL A +WGG + + +NE+++W+G D NP+A
Sbjct: 22 NPSTYTWYTSPAADFASTLPIGNGRLAAAIWGGA-VDNITVNENSIWSGPFQDRVNPNAY 80
Query: 70 KALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
+ +D R+++++G + A ++ + P+ Y LG ++L+F H + Y R
Sbjct: 81 EGFTDSRAMLEAGNLSSANDVVLREMVSIPSSPREYHPLGPLKLDF--GHEASSLHNYTR 138
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL T A V+Y VG+V ++RE+ +S+PD V+ ++ S+ +L+ VSL+ + Y
Sbjct: 139 FLDLGTGVAGVRYHVGDVVYSREYVASHPDGVLAVRLRASKDSALNVVVSLE----RNRY 194
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
V + +G + KAN+ + I+F++ + + R T + + V G
Sbjct: 195 VESLTAVSSKGMG---TLTLKANSGQNTDPIRFTSQARVVSREGRITTNG---TSVVVTG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ + +S+ P ++++D S L + L Y + DYQ L
Sbjct: 249 ASTVDIFFDTQTSYR----YPDETERD--SAVKKQLDAAVKLIYPAVKQAATSDYQSLSG 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLISS 364
RV + D S + P+ R+ +++T+ DP LV L+F FGR+ LI+S
Sbjct: 303 RVKL-----------DLGSSGSAGNQPTDIRLTNYKTNPNGDPELVTLMFNFGRHSLIAS 351
Query: 365 SRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SR G+ A NLQGIWN+D SP W V++NLEMNYW + NL++ EP+ D +
Sbjct: 352 SREGSSSALPANLQGIWNQDYSPAWGGKYTVDVNLEMNYWHAQVTNLADTFEPVIDLMDK 411
Query: 422 LSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ +G A+ Y +G+++HH TD+W ++ W +WPMG AWL +L + Y +
Sbjct: 412 VLPHGQAVARKMYHCDTGYILHHNTDLWGDAAPVDNGTKWTMWPMGSAWLSMNLMDQYRF 471
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D+ L +R +PLL+ A F +L E +GY + PS SPE+ F P+ GK
Sbjct: 472 TQDKTLLRERIWPLLKSAADFYYCYLFE-FEGYYTSGPSISPENAFRIPEDMTIAGKSTG 530
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ + TMD ++ E+F A+I + L+ + L K + R+R +I G I+EW
Sbjct: 531 IDLAPTMDNLLLHELFLAVIETCKALDITGEDLA-NAQKYISRIRQPQIGSYGQILEW 587
>gi|238504526|ref|XP_002383494.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
NRRL3357]
gi|220690965|gb|EED47314.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
NRRL3357]
Length = 792
Score = 290 bits (743), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 205/598 (34%), Positives = 293/598 (48%), Gaps = 60/598 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA +FT +P+GNGRLGA VWG E + LNE+++W+G D NPD+ AL VR
Sbjct: 28 YTSPASNFTSTLPLGNGRLGAAVWGST-VENITLNENSIWSGQFMDRVNPDSYSALDPVR 86
Query: 77 SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
S++ G A +++ + G P + Y LG + L+F H E Y R LDL
Sbjct: 87 SMLKEGNMTAAGQTTLEHMVGSPDEPRAYHPLGSLVLDF--GHEDSQVENYTRSLDLLKG 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V Y VEF RE+ +S+P VI +++ SE+G L+ SL YV N
Sbjct: 145 RAVVHYGYHGVEFRREYIASHPAGVIAARLTASEAGRLNVAASLS----RGRYVTENT-- 198
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG---SDWA 250
A A +D ++ A SDD IS ++ G S A
Sbjct: 199 --------------ATAGNDTGSLKLRA--STAESDD---ISFSAAARIVTHGGWVSRSA 239
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLF 305
+++ +++ FI+ S + T E+ A L + + + D++ L
Sbjct: 240 SSVVIQNATTVDIFIDAETSYRFETQEAWEAEIERKLDAAMRAGFPAIEQAATADHEALA 299
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L+ S + N+ T ER K+ D DP LV L+FQFGRY LI+SS
Sbjct: 300 GRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRYSLIASS 350
Query: 366 R-PGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
R GT NLQG+WNED P W VNINLEMNYW + NL+E PL L +
Sbjct: 351 RETGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLIFLLETV 410
Query: 423 SINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
G A+ Y G+V+HH TDIW + W +WPMGGAWL +L E+Y +
Sbjct: 411 KPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANLMEYYRF 470
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D + L++R +PLL A F ++ +GYL T PS+SPE+ F+ P+ G
Sbjct: 471 TQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSESGNEEG 529
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ + TMD ++ E+F +II +VL N + K SLP ++ +I G I+EW
Sbjct: 530 IDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQILEW 586
>gi|429751943|ref|ZP_19284832.1| hypothetical protein HMPREF9073_00790 [Capnocytophaga sp. oral
taxon 326 str. F0382]
gi|429178378|gb|EKY19657.1| hypothetical protein HMPREF9073_00790 [Capnocytophaga sp. oral
taxon 326 str. F0382]
Length = 806
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 203/603 (33%), Positives = 315/603 (52%), Gaps = 59/603 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PAKHFT+++PIGNGRLGAM++G + + LNE +LW+G D +PDA L
Sbjct: 23 VSVVFHEPAKHFTESLPIGNGRLGAMLFGKTDIDRIVLNEISLWSGGTQDADDPDAHIHL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKY 119
++ L+ G+ EA + K F YQ+LG+++L++ +
Sbjct: 83 KTIQQLLLDGKNLEAQSLLQKHFIAKGKGSCNGNGANGNYGCYQILGELQLDWKTN---L 139
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + G+ + F+ + +I KI+ S+ L ++SL+
Sbjct: 140 PIKNYQRILRLDQATAFTSFKRGDNHIQQIAFADFKNDLIWIKITASQP--LDMDISLNR 197
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALE 238
+N + +N+II+ G P N+D +G+QF+++++I+ + + T SA
Sbjct: 198 K-ENATTSYKSNKIILSGALP----------NNDIQGMQFASVIDIQTDGNLQNTASATS 246
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+K K VL + A++++D F ++ D ++ + LQ + + +
Sbjct: 247 VQKAKE-----IVLKISAATNYD--FTKGRLTQDDVLQKANNYLQKT-TIPFDNAIIESQ 298
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-QFG 357
YQ LF+R +R D TDT S + ER++ F + +L+ +L+ FG
Sbjct: 299 KAYQVLFNR-----NRWYSDANTDTSS------FSTFERLQRFYKGKKDALLPILYYNFG 347
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSR G ANLQG+W E+ W+ H+NINL+MNYW + NLSE PL
Sbjct: 348 RYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINLQMNYWLAESTNLSELTTPLHQ 407
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F L NG KTA+ Y A GWV H ++ W +S W GGAWLC H+W+H
Sbjct: 408 FTKNLVANGRKTAKAYYNAKGWVAHVISNPWFYTSPGES-AEWGSTLTGGAWLCEHIWQH 466
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DGK- 532
Y YT++ DFL K YP+L+ A F LI+ GY T PS SPE+ +I P DGK
Sbjct: 467 YLYTLNTDFL-KEYYPVLKEAADFFQSLLIKDPKTGYWVTAPSNSPENAYIMPQLKDGKK 525
Query: 533 -LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ + TMDM I+RE+FS + AA++L + D L + + + P +I G +
Sbjct: 526 QIGNTCIAPTMDMQIVRELFSNTLQAAKILGVDSD-LYSQWQEIITHTVPNRIGRKGDLN 584
Query: 592 EWV 594
EW+
Sbjct: 585 EWL 587
>gi|18310857|ref|NP_562791.1| hypothetical protein CPE1875 [Clostridium perfringens str. 13]
gi|18145539|dbj|BAB81581.1| conserved hypothetical protein [Clostridium perfringens str. 13]
Length = 1479
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 189/604 (31%), Positives = 313/604 (51%), Gaps = 70/604 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ ++ G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINNGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV + L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVDLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEW 593
+ EW
Sbjct: 601 VQEW 604
>gi|168211677|ref|ZP_02637302.1| fibronectin type III domain protein [Clostridium perfringens B str.
ATCC 3626]
gi|170710364|gb|EDT22546.1| fibronectin type III domain protein [Clostridium perfringens B str.
ATCC 3626]
Length = 1479
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 193/605 (31%), Positives = 315/605 (52%), Gaps = 72/605 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF----GHPAD--VYQLLGDIELEFDDSHLKY 119
A +A+ ++R ++ AE S L+ G D YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKIL-----AEGGTPSNDLYQRVCGDQRDYGAYQNFGDIFLDFK-SHEES 161
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 162 KVTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLNVDVRNEG 221
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+ + NN +I+ G + G+++ + +IK+ + G+I ED
Sbjct: 222 AHNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKED 266
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ + VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++
Sbjct: 267 R-ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIE 323
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DY+ LF RV++ L D TD E + ++T++ SL L FQ+GRY
Sbjct: 324 DYKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRY 370
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 371 LLISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYI 430
Query: 420 TYLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
L G KTA+++ +GW ++ + + +A + W P AW+
Sbjct: 431 ESLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQ 489
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIA 528
+LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 490 NLWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH---- 545
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 -----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHG 599
Query: 589 SIMEW 593
+ EW
Sbjct: 600 QVQEW 604
>gi|422346543|ref|ZP_16427457.1| hypothetical protein HMPREF9476_01530 [Clostridium perfringens
WAL-14572]
gi|373226088|gb|EHP48415.1| hypothetical protein HMPREF9476_01530 [Clostridium perfringens
WAL-14572]
Length = 1479
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 189/604 (31%), Positives = 313/604 (51%), Gaps = 70/604 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 ITNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVLVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEIHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELEDKRERLLKP-QIGKHGQ 600
Query: 590 IMEW 593
+ EW
Sbjct: 601 VQEW 604
>gi|182626122|ref|ZP_02953882.1| fibronectin type III domain protein [Clostridium perfringens D str.
JGS1721]
gi|177908559|gb|EDT71084.1| fibronectin type III domain protein [Clostridium perfringens D str.
JGS1721]
Length = 1479
Score = 289 bits (740), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 190/604 (31%), Positives = 314/604 (51%), Gaps = 70/604 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG---DYTNPD- 67
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG DY +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEDYNGGNK 107
Query: 68 --APKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 YNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHY +T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYKFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEW 593
+ EW
Sbjct: 601 VQEW 604
>gi|110798918|ref|YP_696557.1| fibronectin type III [Clostridium perfringens ATCC 13124]
gi|110673565|gb|ABG82552.1| fibronectin type III domain protein [Clostridium perfringens ATCC
13124]
Length = 1479
Score = 289 bits (739), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 189/604 (31%), Positives = 313/604 (51%), Gaps = 70/604 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLNVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYIE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEW 593
+ EW
Sbjct: 601 VQEW 604
>gi|336417082|ref|ZP_08597411.1| hypothetical protein HMPREF1017_04519 [Bacteroides ovatus
3_8_47FAA]
gi|335936707|gb|EGM98625.1| hypothetical protein HMPREF1017_04519 [Bacteroides ovatus
3_8_47FAA]
Length = 859
Score = 288 bits (738), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 206/633 (32%), Positives = 318/633 (50%), Gaps = 74/633 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--------Y 63
LK T+N PAK + ++A+PIGNG +GAM++GGV + ++ NE TLW+G PG+
Sbjct: 32 LKATYNKPAKVWESEALPIGNGYMGAMIFGGVEVDVIQTNEHTLWSGGPGEDPSYNGGHL 91
Query: 64 TNPDAPKA-LSDVRSL---------VDSGQYAEATAASV------------------KLF 95
P+ K+ L R L V+ Y +A + KL
Sbjct: 92 GTPETNKSYLHKTRVLLQQKMNDFTVNHSAYIDADGKLITHNYEGDGNGTELRNLIDKLA 151
Query: 96 GHPADV--YQLLGDIELEFDDSHLKY-AEETYRRELDLNTATARVKYSVGNVEFTREHFS 152
G +Q L +I +E +S A Y R LD++ A RV Y G + F RE+F
Sbjct: 152 GTKEHFGSFQTLSNIIVEVVNSATSEPAYSDYTRTLDIDNAIHRVTYKEGGITFKREYFM 211
Query: 153 SNPDQVIVTKI-SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
S PD ++V ++ S S+ G +S +SL+SL + +N I + G P K +
Sbjct: 212 SYPDNIMVMRLTSDSKKGKISRMISLESLHTDKVIRASDNTITLTGY-PTPTSGDKRVGD 270
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD-- 269
G++++ L +K + G I+ ++ KKLK+E + ++L+ A++++ +
Sbjct: 271 HWKNGLKYAQQLLVKHTG--GKITVVDGKKLKIEEAKEIIVLMSAATNYVQCMDDSYHYF 328
Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
S ++P + + L+ N Y+ L H DY L+ R+ + L P+ V T
Sbjct: 329 SGEEPLDKVKATLKKAANKKYTALLATHEKDYHSLYDRMKLNLGNLPEMPVATT------ 382
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
D++ + E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W+S
Sbjct: 383 DSLLKGMDAHANSESENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNSD 442
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHH 443
H NIN++MNYW + P NLS C P+ +++ L G TAQ Y GWV HH
Sbjct: 443 YHTNINVQMNYWPTQPTNLSRCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHH 502
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
+ +IW ++ + K +P G W+C +WE+Y + +D+DFLE Y ++ A F +
Sbjct: 503 ENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNLDKDFLEAY-YDVMLQAALFWV 560
Query: 504 D--WLIEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
D W E DG L NPS SPEH EF L C + A+I E+F +I A++V
Sbjct: 561 DNLWTDE-RDGTLVANPSHSPEHGEF-----SLGC-----STSQAMIAEMFDMMIKASKV 609
Query: 561 LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
L K+++ + ++ ++ +L KI G +MEW
Sbjct: 610 LGKDKEPEIAEIKTAMNKLSGPKIGLGGQLMEW 642
>gi|168214908|ref|ZP_02640533.1| fibronectin type III domain protein [Clostridium perfringens CPE
str. F4969]
gi|170713641|gb|EDT25823.1| fibronectin type III domain protein [Clostridium perfringens CPE
str. F4969]
Length = 1479
Score = 288 bits (738), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 188/604 (31%), Positives = 312/604 (51%), Gaps = 70/604 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA + +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATSWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGEI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSRAGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P ++ + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGVDEEFRAELEDKRERLLKP-QVGKHGQ 600
Query: 590 IMEW 593
+ EW
Sbjct: 601 VQEW 604
>gi|317138010|ref|XP_001816599.2| alpha-fucosidase [Aspergillus oryzae RIB40]
gi|195972741|dbj|BAG68493.1| probable secreted protein [Aspergillus oryzae]
Length = 792
Score = 288 bits (738), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 204/598 (34%), Positives = 292/598 (48%), Gaps = 60/598 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA +FT +P+GNGRLGA VWG E + LNE+++W+G D NPD+ AL VR
Sbjct: 28 YTSPASNFTSTLPLGNGRLGAAVWGST-VENITLNENSIWSGQFMDRVNPDSYSALDPVR 86
Query: 77 SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
++ G A +++ + G P + Y LG + L+F H E Y R LDL
Sbjct: 87 YMLKEGNMTAAGQTTLEHMVGSPDEPRAYHPLGSLVLDF--GHEDSQVENYTRSLDLLKG 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V Y VEF RE+ +S+P VI +++ SE+G L+ SL YV N
Sbjct: 145 RAVVHYGYHGVEFRREYIASHPAGVIAARLTASEAGRLNVAASLS----RGRYVTENT-- 198
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG---SDWA 250
A A +D ++ A SDD IS ++ G S A
Sbjct: 199 --------------ATAGNDTGSLKLRA--STAESDD---ISFSAAARIVTHGGWVSRSA 239
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLF 305
+++ +++ FI+ S + T E+ A L + + + D++ L
Sbjct: 240 SSVVIQNATTVDIFIDAETSYRFETQEAWEAEIERKLDAAMRAGFPAIEQAATADHEALA 299
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L+ S + N+ T ER K+ D DP LV L+FQFGRY LI+SS
Sbjct: 300 GRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRYSLIASS 350
Query: 366 RP-GTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
R GT NLQG+WNED P W VNINLEMNYW + NL+E PL L +
Sbjct: 351 RKTGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLIFLLETV 410
Query: 423 SINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
G A+ Y G+V+HH TDIW + W +WPMGGAWL +L E+Y +
Sbjct: 411 KPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANLMEYYRF 470
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D + L++R +PLL A F ++ +GYL T PS+SPE+ F+ P+ G
Sbjct: 471 TQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSESGNEEG 529
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ + TMD ++ E+F +II +VL N + K SLP ++ +I G I+EW
Sbjct: 530 IDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQILEW 586
>gi|300771448|ref|ZP_07081323.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33861]
gi|300761437|gb|EFK58258.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33861]
Length = 778
Score = 288 bits (737), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 192/613 (31%), Positives = 309/613 (50%), Gaps = 68/613 (11%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ +N LK+ ++ AK + + +P+GNG +G M GGV E + LNE ++W+G D N
Sbjct: 22 VAQSNSLKLWYDKAAKRWEETLPLGNGLIGMMPDGGVQKEKIVLNEISMWSGSEEDPNNY 81
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLF-------GH------PADVYQLLGDIELEFD 113
A K++ +++ L+ G+ EA K F GH P YQ LG + L+F
Sbjct: 82 TAYKSVGEIQKLLFEGKNDEAERLVNKNFVTSGKGSGHGNGANVPFGCYQNLGFLNLQFT 141
Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
++ Y R LDL A AR +++ V++TRE+F+S V V +++ S+ G+L+F
Sbjct: 142 GTN---QPTGYERSLDLKDAVARTHFTINGVKYTREYFTSYDQNVGVVRLTSSKKGALNF 198
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
+ SL S + Y + N+ M G + P D GI FS+ + I RG
Sbjct: 199 SASL-SREERARYTSKGNEFSMSG------VLPDGKGGD---GISFSSKIRIF---HRGG 245
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L V + ++ A++S+ P DP L+ + Y L
Sbjct: 246 KVAASDTALTVSKASEVLIFFAAATSYFHP---------DPQQYVNEQLKLAYDTPYPQL 296
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT--VPSAERVKSFQTD--EDPSL 349
+ +HL Y+ +F+RV +QL E++ID + + +R+++F + +D L
Sbjct: 297 FKQHLSRYESVFNRVDLQL-------------EDDIDKSDITTDKRLRAFYDNPAQDNGL 343
Query: 350 VELLFQFGRYLLISSSRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
L +QFGRYL ISS+ P + A NLQG+W + W+ H+NIN +MN+W
Sbjct: 344 AALYYQFGRYLNISSTAPDVKGALPPNLQGLWAHQIQTPWNGDYHLNINAQMNHWGVEVN 403
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
NLSE P + + ++ G KTA+ Y A GWV++ T++W S+ + W
Sbjct: 404 NLSEYHTPFIELIKKIAKTGEKTARAYYNAPGWVVYMMTNVWGYSAPGE-QASWGASTAS 462
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE 525
G WLC HLWEHY +T D +L K YP+++G A F ++ + G+L T+PS SPE+
Sbjct: 463 G-WLCNHLWEHYQFTKDSVYL-KEVYPVMQGAARFYAHTMVTDPKTGWLVTSPSVSPENA 520
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPT 582
F +GK A V +D I+RE++ +I A +L ++ D L ++ + P P
Sbjct: 521 FRMKNGKTAAVVMGPAIDNQIVRELYKNLIDADSILGQHNTFTDTLRTQIQQLAP---PV 577
Query: 583 KIAEDGSIMEWVQ 595
I++ G + EW++
Sbjct: 578 LISKSGRVQEWLE 590
>gi|169343800|ref|ZP_02864799.1| fibronectin type III domain protein [Clostridium perfringens C str.
JGS1495]
gi|169298360|gb|EDS80450.1| fibronectin type III domain protein [Clostridium perfringens C str.
JGS1495]
Length = 1479
Score = 288 bits (737), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 188/604 (31%), Positives = 313/604 (51%), Gaps = 70/604 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD ++V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNIMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EILNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELEDKRERLLKP-QIGKHGQ 600
Query: 590 IMEW 593
+ EW
Sbjct: 601 VQEW 604
>gi|150003335|ref|YP_001298079.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149931759|gb|ABR38457.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
Length = 803
Score = 288 bits (737), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 179/598 (29%), Positives = 309/598 (51%), Gaps = 44/598 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PA+ + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 24 ATDSCETTELWYAQPAEVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 83
Query: 66 -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K
Sbjct: 84 IPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVT- 142
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 143 -GYRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 200
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+NQ++ G+ P P G+ F I + D G + +E +
Sbjct: 201 RQADLSVEDNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSE 249
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 250 VGIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVKKAAAKSYDELKQAHIKDY 300
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 301 NTLYNRVSIHFGQD---------ANRALPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 349
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 350 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 409
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM +W+ +HLW Y
Sbjct: 410 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMASSWIASHLWTQY 468
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 469 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 528
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
D + E+ S + A+E+L + + + + ++ +L P ++ +G+I EW +
Sbjct: 529 MMPACDRELAYEILSNCVQASEILNTDRE-FADSLRTAIAQLPPIQLRANGAIREWFE 585
>gi|271969414|ref|YP_003343610.1| alpha-L-fucosidase [Streptosporangium roseum DSM 43021]
gi|270512589|gb|ACZ90867.1| Alpha-L-fucosidase [Streptosporangium roseum DSM 43021]
Length = 991
Score = 288 bits (736), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 200/617 (32%), Positives = 317/617 (51%), Gaps = 72/617 (11%)
Query: 1 MMNAE------STSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNED 53
M NAE + T + L + ++ PA ++ T A+PIGNG LGAMV+GGV SE ++ NE
Sbjct: 1 MANAEPEKSAAAVQTPDDLTLWYDKPATNWETQALPIGNGALGAMVFGGVASEQIQFNEK 60
Query: 54 TLWTGVPG-------DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD---VYQ 103
TLWTG PG ++T+P P A+++V++ +D +A + KL G P YQ
Sbjct: 61 TLWTGGPGSGGYNAGNWTSPR-PNAIAEVQAQIDRDGRMSPSAVTAKL-GQPKSGFGAYQ 118
Query: 104 LLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI 163
GD+ L+ D+ + YRREL L A ARV Y+ G V ++RE+F+S+P VIV +I
Sbjct: 119 TFGDLWLDVPDA--PASPTGYRRELSLREAVARVGYTAGGVTYSREYFASHPGGVIVGRI 176
Query: 164 SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
S S++G +SF + S + N ++ + G G++F +
Sbjct: 177 SASQAGKVSFTLRTSSPRSDKQVSVANGRLTVRGTLA-------------DNGMRFES-- 221
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+I++ G+ + D+ + V G+D A+ +L A + + G +P+ DP ++ +A+
Sbjct: 222 QIQVVTQGGSRTDGTDR-VTVTGADSAMFVLSAGTDYAG--THPAYRGPDPHAKVTAAVD 278
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ ++ L T H +DY+KLF RV + L + I TD R+++ T
Sbjct: 279 AAAARTFDQLRTAHQNDYRKLFDRVRLDLGQRVPAIPTD--------------RLRAAYT 324
Query: 344 D----EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
+D +L + F +GRYLLISSSR ANLQG+WN SP W + HVNINL+MN
Sbjct: 325 GRASADDRALEAMFFAYGRYLLISSSRDEALPANLQGVWNNSTSPPWSADYHVNINLQMN 384
Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKV 458
YW + NL+E ++ + G KTAQ + + GWV+H++T+ + + D
Sbjct: 385 YWLAEQTNLAETTVAYDRYIKAMVAPGRKTAQEMFGSRGWVVHNETNPFGFTGVHDWATA 444
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETN 517
W +P AW+ +++HY + D +L AYP+++G A F LD L + DG L +
Sbjct: 445 FW--FPEAAAWVTQQMYDHYRFNGDTAYLRDTAYPVMKGAAEFWLDNLHADPRDGKLVVS 502
Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
PS SPE S ++M I+ +V + + AA L + A +V +L
Sbjct: 503 PSYSPEQ---------GDFSAGASMSQQIVFDVLTNSLEAARKLNVDP-AFQAEVTAALA 552
Query: 578 RL-RPTKIAEDGSIMEW 593
+L R ++ G + EW
Sbjct: 553 KLDRGIRVGSWGQLQEW 569
>gi|388259769|ref|ZP_10136938.1| alpha-L-fucosidase, putative, afc95A [Cellvibrio sp. BR]
gi|387936495|gb|EIK43057.1| alpha-L-fucosidase, putative, afc95A [Cellvibrio sp. BR]
Length = 806
Score = 288 bits (736), Expect = 7e-75, Method: Compositional matrix adjust.
Identities = 204/610 (33%), Positives = 298/610 (48%), Gaps = 64/610 (10%)
Query: 4 AESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG- 61
A S I F+ PA + + +PIGNG LGA++ G V + ++ NE TLWTG PG
Sbjct: 28 ASSVQAAGGESIWFDAPAADWEREGLPIGNGALGAVIAGDVTRDRIQFNEKTLWTGGPGA 87
Query: 62 ---DYTNPDAPK--ALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGD--IELE 111
D+ P + A++ VR+ ++ Q + + KL GH Y Q GD I+
Sbjct: 88 QGYDFGWPQQAQGDAVAQVRTTINE-QGSITPEDAAKLLGHKITAYGDYQTFGDLIIDSN 146
Query: 112 FDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
+DS +K YRREL L+ A V Y G V + RE+ +S PD VI K S + S+
Sbjct: 147 KNDSDVKSVFTNYRRELSLSDAQINVSYEQGGVRYRREYLASYPDGVIAIKYSADQPASI 206
Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
SF S+ + DN S I +GR A+ G+QF +I++ +
Sbjct: 207 SFTASVQ-VPDNRSLAVA----IDQGRI-------TASGKLHSNGLQFET--QIQLLNQG 252
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
G ++ ++ KL+V +D V+LL A + + + P P L S+
Sbjct: 253 GELAVIDGNKLQVTAADSVVILLAAGTDYAQSY--PKYRGAHPHKRLHKQLNKASKKSFE 310
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT--CSEENIDTVPSAERVKSFQTDEDPSL 349
L H DYQ LF+RV++ + + P+ + T + D V D +L
Sbjct: 311 QLQATHRADYQTLFNRVALDIGQKPQSLTTPKLLAGYKKGDAV------------LDRTL 358
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
FQFGRYLLISSSRPG+ ANLQG+WN ++P W++ HVNINL+MNYW + NL
Sbjct: 359 EATYFQFGRYLLISSSRPGSLPANLQGVWNNSITPPWNADYHVNINLQMNYWLAETTNLP 418
Query: 410 ECQEPLFDFLTYLSINGSKTAQ-VNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-PM 465
E PLFDF+ L + G+ AQ V + GW + T+IW + G + W A W P
Sbjct: 419 ELTAPLFDFVDSLVVPGTIAAQKVAGVDKGWTLFLNTNIWGFT----GVIDWPTAFWQPE 474
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWL H +EHY ++ D+ FL RAYPL++ + F L++L++ DG +PS SPEH
Sbjct: 475 AAAWLAQHYYEHYLFSGDKKFLRNRAYPLMKSASEFWLEFLVKDPRDGQWIVSPSFSPEH 534
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTK 583
P + A +S D+ +R A L + + V + L L R +
Sbjct: 535 ---GPFTRAAAMSQQIVFDL--LRNTHEA------ALLTGDKKFAQAVQEKLANLDRGMR 583
Query: 584 IAEDGSIMEW 593
I + G + EW
Sbjct: 584 IGKWGQLQEW 593
>gi|373850041|ref|ZP_09592842.1| Alpha-L-fucosidase [Opitutaceae bacterium TAV5]
gi|372476206|gb|EHP36215.1| Alpha-L-fucosidase [Opitutaceae bacterium TAV5]
Length = 839
Score = 288 bits (736), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 196/624 (31%), Positives = 301/624 (48%), Gaps = 68/624 (10%)
Query: 17 FNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
F+ PA+ + A+PIGNGR GAM++G + +E L+LNED+LW G P D NPDA + L +
Sbjct: 14 FDQPAQQDWNRALPIGNGRFGAMIYGNIVAERLQLNEDSLWNGGPRDRRNPDAREHLPVL 73
Query: 76 RSLVDSGQYAEA-TAASVKLFGHP--ADVYQLLGDIELEF-----------DDSHLKYAE 121
R L+ G+ A A L G P Y+ L D+ L F D+ L
Sbjct: 74 RQLLADGRLAAAHDLVHDALAGIPDSQRCYEPLADLFLHFEHPGAPVAVSADEMALASGY 133
Query: 122 ET----------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
T YRR LDL TA V Y++ N + R H +S DQVI + G L
Sbjct: 134 ATPRFDPALLSHYRRALDLRTAVMSVDYTLNNTSYARRHLASTVDQVIALHLRAGRPGGL 193
Query: 172 SFNVSLDS---------LLDNHSYVNGNNQIIMEGRC-PGKRIPPKANANDDPKGIQFSA 221
+ + L+ D +V + + R P + +A D G++F+
Sbjct: 194 TLRLRLERGPRESYSTRYADTVGFVADAAREPADARTSPALLLRGRAGGED---GVRFAV 250
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
L +I+ G + + + L ++ +D L+L A+++F + DP + +
Sbjct: 251 GLRARIAG--GALRRI-GETLCIDAADSVTLVLAAATTF---------REDDPAAFVIGR 298
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK-S 340
+ + + H +Y+ F R S+ L +E ++P R+K +
Sbjct: 299 TGAALARGWDKIRADHEREYRSRFDRASLTLG-------APAAAEAGAGSIPVDLRLKRA 351
Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
++ DP L L F + RYLLISSSRPG+ ANLQG+WN D P+W S +NIN EMNY
Sbjct: 352 RESGGDPVLASLYFNYARYLLISSSRPGSLPANLQGLWNADFWPSWGSKYTININTEMNY 411
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + P NL++C +PLFD L + +G +TA+V Y G+V HH TD+WA +
Sbjct: 412 WIAEPANLADCHQPLFDHLGRVVESGRETARVMYGCRGFVAHHNTDLWADTCPTDRNAGA 471
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
+ W +GGAWL H W+ ++Y D L AY LL + F LD+LIE G L +P+
Sbjct: 472 SYWTIGGAWLVLHAWDRFDYDRDPGSLAA-AYALLREASLFFLDFLIEDARGRLVLSPTC 530
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA---------LVEK 571
SPE+ + P+G+ + TMD ++ +F AA++L + A + +
Sbjct: 531 SPENTYRLPNGEAGVLCAGCTMDSQLLSILFRRTAQAAQLLGRRPLAAAAIAGDHDFLAR 590
Query: 572 VLKSLPRLRPTKIAEDGSIMEWVQ 595
V + RL + G ++EW++
Sbjct: 591 VAAAAARLPQPAVGRHGQLLEWLE 614
>gi|168206072|ref|ZP_02632077.1| fibronectin type III domain protein [Clostridium perfringens E str.
JGS1987]
gi|170662403|gb|EDT15086.1| fibronectin type III domain protein [Clostridium perfringens E str.
JGS1987]
Length = 1479
Score = 288 bits (736), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 189/604 (31%), Positives = 311/604 (51%), Gaps = 70/604 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA + +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATSWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + K +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQKAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 ITNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIKDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHY +T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYQFTEDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGVDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEW 593
+ EW
Sbjct: 601 VQEW 604
>gi|346311070|ref|ZP_08853080.1| hypothetical protein HMPREF9452_00949 [Collinsella tanakaei YIT
12063]
gi|345901764|gb|EGX71561.1| hypothetical protein HMPREF9452_00949 [Collinsella tanakaei YIT
12063]
Length = 770
Score = 288 bits (736), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 191/598 (31%), Positives = 290/598 (48%), Gaps = 62/598 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+++ + PA + +A+PIGNG + MV+GGV +E LN++T+W P D NP + L
Sbjct: 1 MRLWYTSPASVWNEALPIGNGHIAGMVFGGVENEKFSLNDETIWYRGPADRNNPSSADNL 60
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R L+ G A ++ +F P D Y++LG++ LE L+ A E+Y RELD
Sbjct: 61 GKIRELLAVGDVEAAEDLVALTMFATPRDQSHYEVLGEMFLEQRGVALE-ACESYERELD 119
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L A RV +S G V++ RE+FSS VI+ +++ S+ GS+S +L
Sbjct: 120 LENALCRVSFSCGGVDYRREYFSSFARNVILARLTASKEGSISLRATL------------ 167
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQ-----------FSAILEIKISDDRGTISALE 238
GRC KR D I F L + D G++ L
Sbjct: 168 -------GRC--KRFNDSVRQYRDRGVIMAAHAGGAAGVGFEVGLRVVSCD--GSVRVLG 216
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ + E ++ VL LV+S+ + S +P + S+ + L + H+
Sbjct: 217 ETIVVDEATE-VVLALVSSTDY------WSAGAVEPDASSL--MDGFDGLDFDCALDDHV 267
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED-PSLVELLFQFG 357
Y++ + RV++ D ++E ++P+ + + P L+ L F +G
Sbjct: 268 AAYREQYGRVAL-----------DIAADEEAPSIPTDGLIACAREGRHVPYLLNLAFDYG 316
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLL+SSS+PG ANLQGIW ED+ P W S +NIN EMNYW P +L E Q PLFD
Sbjct: 317 RYLLLSSSQPGGLPANLQGIWCEDIDPIWGSKYTININTEMNYWMCGPADLPEAQLPLFD 376
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
L + G +TA+ Y A G+ HH TD +A ++ + A+WP+ WL TH+WE
Sbjct: 377 LLERMREPGRRTARAMYGARGFTCHHNTDGFADTAPQSHAIGAAVWPLTVPWLLTHVWEQ 436
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + D L + + + F D+L E + GYL T PS SPE+ + P+G V
Sbjct: 437 YRFFGDASVLAEH-LDMFKEALLFFEDYLFE-YQGYLVTGPSASPENRYRLPNGVEGNVC 494
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +D I+R F + A VL D ++ RL PT+I G I EW++
Sbjct: 495 LSPAIDNQILRFFFDCCVDVARVLGDQSD-FADRAKALAERLPPTRIGSHGQIQEWLE 551
>gi|299149390|ref|ZP_07042447.1| fibronectin type III domain protein [Bacteroides sp. 3_1_23]
gi|298512577|gb|EFI36469.1| fibronectin type III domain protein [Bacteroides sp. 3_1_23]
Length = 859
Score = 288 bits (736), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 205/633 (32%), Positives = 319/633 (50%), Gaps = 74/633 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--------Y 63
LK T+N PAK + ++A+PIGNG +GAM++GGV + ++ NE TLW+G PG+
Sbjct: 32 LKATYNKPAKVWESEALPIGNGYMGAMIFGGVEVDVIQTNEHTLWSGGPGEDPSYNGGHL 91
Query: 64 TNPDAPKA-LSDVRSL---------VDSGQYAEATAASV------------------KLF 95
P+ K+ L R L V+ Y +A + KL
Sbjct: 92 GTPETNKSYLHKTRVLLQQKMNDFTVNHSAYIDADGKLITHNYEGDGNGTELRNLIDKLA 151
Query: 96 GHPADV--YQLLGDIELEF-DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFS 152
G +Q L +I +E + + + A Y R LD++ A RV Y G + F RE+F
Sbjct: 152 GTKEHFGSFQTLSNIIVEVVNPATSEPAYSDYTRTLDIDNAIHRVTYKEGGITFKREYFM 211
Query: 153 SNPDQVIVTKI-SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
S PD ++V ++ S S+ G +S +SL+SL + +N I + G P K +
Sbjct: 212 SYPDNIMVMRLTSDSKKGKISRMISLESLHTDKVIRASDNTITLTGY-PTPTSGDKRVGD 270
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD-- 269
G++++ L +K + G I+ ++ KKLK+E + ++L+ A++++ +
Sbjct: 271 HWKNGLKYAQQLLVKHTG--GKITVVDGKKLKIEEAKEIIVLMSAATNYVQCMDDSYHYF 328
Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
S ++P + + L+ N Y+ L H DY L+ R+ + L P+ V T
Sbjct: 329 SGEEPLDKVKATLKKAANKKYTALLATHEKDYHSLYDRMKLNLGNLPEMPVVTT------ 382
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
D++ + E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W+S
Sbjct: 383 DSLLKGMDAHTNSESENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNSD 442
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHH 443
H NIN++MNYW + P NLS C P+ +++ L G TAQ Y GWV HH
Sbjct: 443 YHTNINVQMNYWPTQPTNLSRCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHH 502
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
+ +IW ++ + K +P G W+C +WE+Y + +D+DFLE Y ++ A F +
Sbjct: 503 ENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNLDKDFLEAY-YDVMLQAALFWV 560
Query: 504 D--WLIEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
D W E DG L NPS SPEH EF L C + A+I E+F +I A++V
Sbjct: 561 DNLWTDE-RDGTLVANPSHSPEHGEF-----SLGC-----STSQAMIAEMFDMMIKASKV 609
Query: 561 LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
L K+++ + ++ ++ +L KI G +MEW
Sbjct: 610 LGKDKEPEIAEIETAMNKLSGPKIGLGGQLMEW 642
>gi|119499317|ref|XP_001266416.1| hypothetical protein NFIA_040960 [Neosartorya fischeri NRRL 181]
gi|119414580|gb|EAW24519.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 792
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 187/598 (31%), Positives = 309/598 (51%), Gaps = 47/598 (7%)
Query: 11 NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
NP T + PA F +PIGNGRL A +WGG + + LNE+++W+G D NP+A
Sbjct: 22 NPSTYTWYTTPAADFASTLPIGNGRLAAAIWGGA-VDNITLNENSIWSGPFQDRVNPNAY 80
Query: 70 KALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
+ +D R+++++G + A ++ + P+ Y LG + L+F H + ++Y R
Sbjct: 81 EGFTDSRAMLEAGNLSSANDVVLQDMVSIPSSPREYHPLGSLRLDF--GHDATSLQSYTR 138
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL T A V+Y VG+V ++RE+ +S+PD V+ ++ S++G+L+ SL+ Y
Sbjct: 139 FLDLGTGVAGVRYQVGDVVYSREYVTSHPDGVLAVRLRASKNGALNVVTSLE----RSRY 194
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
V + G + KAN+ I+F+A + +RG + V G
Sbjct: 195 VESLTAVSSRGMG---TLTLKANSGQSTDPIRFTAQARVV---NRGGRITTNGTAVVVAG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ + +S+ P ++++D + L + SY + DY+ L
Sbjct: 249 ASTVDIFFDTQTSYR----YPDETERDAVVKKQ--LDAAVKASYPAVKQAATSDYKSLSG 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISS 364
RV + L S + P+ R+K+++TD DP L+ L+F FGR+ LI+S
Sbjct: 303 RVKLDLG-----------SSGSAGNQPTDIRLKNYKTDPDRDPELMTLMFNFGRHSLIAS 351
Query: 365 SRPGTQV---ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SR G+ ANLQGIWN+D SP W V++NL+MNYW + NL++ EP+ D +
Sbjct: 352 SRAGSSSGLPANLQGIWNQDYSPAWGGKYTVDVNLQMNYWHAQVTNLADTFEPVIDLMDK 411
Query: 422 LSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ +G A+ Y +G+++HH TD+W ++ W +WPMG AWL +L + + +
Sbjct: 412 VVPHGQDVAKKMYHCDTGYILHHNTDLWGDAAPVDNGTKWTMWPMGSAWLSMNLMDQFRF 471
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D+ L++R +PLL+ A F +L + +GY + PS SPE+ FI P+ GK
Sbjct: 472 TQDKTLLQERIWPLLKSAADFYYCYLFD-FEGYYTSGPSISPENAFIIPEDMTIAGKSTG 530
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ S TMD ++ E+F+A+I + L+ + L K + R+R +I G I+EW
Sbjct: 531 IDLSPTMDNLLLHELFTAVIETCKALDITGEDLT-NAHKYISRIRHPQIGSYGQILEW 587
>gi|224537148|ref|ZP_03677687.1| hypothetical protein BACCELL_02025 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521203|gb|EEF90308.1| hypothetical protein BACCELL_02025 [Bacteroides cellulosilyticus
DSM 14838]
Length = 776
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 188/583 (32%), Positives = 288/583 (49%), Gaps = 43/583 (7%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + A+PIGNGR+G M++G +E + +NE+T+W G P NP P+ ++ +R+L+
Sbjct: 32 PASDWKSALPIGNGRIGGMIFGDPSTEQIVINENTIWCGPPLPPNNPKGPELINKMRNLI 91
Query: 80 DSGQYAEATAASVKLFG----HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
+G+Y EA K F A YQ G + ++F D K A Y+R LD A
Sbjct: 92 FNGKYEEAVIVCEKEFADGVHENARSYQPFGFLNIDFKD---KGAISNYKRWLDYTKAIT 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
V Y+ V +TRE F S P++V+V +I+ + G +SF + N +
Sbjct: 149 YVSYTQNGVTYTREAFVSKPNEVMVVRITADKPGQVSFKSKYTRPFGATTKAENNRSQYV 208
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
+G+ + N + G++F I I ++ G I A E +++ ++ +++
Sbjct: 209 QGQAYAE--------NGEFVGVKFEGI--INYYNEGGKIKANETD-IEINNANSVTIMIA 257
Query: 256 ASSSFDGPFINPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
S+ + N D+K T L + L Y L H+D+Y L++R S
Sbjct: 258 ISTDY-----NIHDTKNVLTHNRKKICEKQLSQAQKLGYKKLKQTHIDEYSALYNRSSF- 311
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
DI +T N P +R++ + + D L+ + + RYL ISSSR G
Sbjct: 312 ------DITFNTPVNNN----PIDKRIQLAASGQIDSELLFEYYNYCRYLFISSSRKGGL 361
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
NLQGIWN + W S H+N+N++ YW + NLSEC EP+F L NG +TA
Sbjct: 362 PMNLQGIWNPLMLAPWRSNFHINVNIQEAYWFAEQANLSECHEPIFTLTENLIKNGKETA 421
Query: 431 QVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
QV + G V H+TD W + K W + AWLC H EHY YT+D++FL+
Sbjct: 422 QVMFGTKRGSVAGHRTDAWFYAPPTFLKAHWGMSITNAAWLCLHHMEHYRYTLDKEFLKT 481
Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
RA P+L A F +DWL+ + G L + P+ SPE+ F +GK+A ++ T D II
Sbjct: 482 RALPILRETALFFVDWLVPDPRSGKLVSGPTASPENRFKV-NGKVASLTMGCTYDQEIIW 540
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
F + A ++L N + VE V S+ +L IA DG +M
Sbjct: 541 NTFRDFLEACKILGINNEETVE-VEASMKKLSMPTIANDGRLM 582
>gi|168215503|ref|ZP_02641128.1| fibronectin type III domain protein [Clostridium perfringens NCTC
8239]
gi|182382428|gb|EDT79907.1| fibronectin type III domain protein [Clostridium perfringens NCTC
8239]
Length = 1479
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 188/604 (31%), Positives = 311/604 (51%), Gaps = 70/604 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA + +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATKWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHY +T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYQFTEDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGVDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEW 593
+ EW
Sbjct: 601 VQEW 604
>gi|255532706|ref|YP_003093078.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
gi|255345690|gb|ACU05016.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
Length = 940
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 176/502 (35%), Positives = 263/502 (52%), Gaps = 47/502 (9%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
YQ GD+ L F + A Y+R+LDLNTA A Y++ + + RE+ +S PDQ IV
Sbjct: 295 YQPFGDLYLNFKTEN--EAVTNYKRKLDLNTAVASTTYTLKGINYLREYLASQPDQAIVI 352
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+++ + GS+SF D+LL + +G +I ++ ++ +
Sbjct: 353 RLTADKKGSISF----DALLGSPHKYSGVKKINANTIALSLKVRDGV--------LKGES 400
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
L+ I+ + ++A K+ + +D L L A +SF +N D +P S ++ A
Sbjct: 401 RLQAIITKGKLLVTA---NKISIVAADAVTLYLTAGTSF----VNDKDVSGNPASAAVKA 453
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
L + SY+ + H+ +YQK + S+ K ++P+ ER++ F
Sbjct: 454 LTGLNGKSYAQVKAAHIKEYQKYYTAFSVSFGPDSKA------------SLPTDERIEQF 501
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
DP+ L Q+GRYLLISSSRPGTQ ANLQGIWNE L+P W S NINLEMNYW
Sbjct: 502 SDGNDPAFAALFMQYGRYLLISSSRPGTQPANLQGIWNELLTPPWGSKYTTNINLEMNYW 561
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+ NLS EPL + L+ NG TA+V+Y A GWV+HH TD+W +A
Sbjct: 562 PTGVLNLSAMAEPLIRKINALAKNGEVTAKVHYNAKGWVLHHNTDLW-NGTAPINASNHG 620
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPST 520
+W G WL HLWEHY +T D +FL+ AYP+++ A F D+LI+ G+L + PS
Sbjct: 621 IWVSGAGWLSQHLWEHYLFTQDLNFLKNEAYPVMKQAAVFFNDFLIKDPKTGWLISTPSN 680
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRL 579
SPE +G L TMD IIR +F I+A +L DA +K L + + +
Sbjct: 681 SPE------NGGLVA---GPTMDHQIIRTLFRNCIAATALL--GVDADFKKTLEQKITLI 729
Query: 580 RPTKIAEDGSIMEWVQRRLNTS 601
P +I + G + EW++ + +T+
Sbjct: 730 APNQIGKYGQLQEWLEDKDDTT 751
Score = 75.9 bits (185), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 52/82 (63%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA+ +TDA+PIGNGRLGAM++ GV + ++ NE+TLWTG P DY + A L
Sbjct: 32 QLWYTKPAEKWTDALPIGNGRLGAMIFAGVEKDHIQFNEETLWTGGPRDYNHKGAAAYLP 91
Query: 74 DVRSLVDSGQYAEATAASVKLF 95
+R L+ G EA + + F
Sbjct: 92 QIRQLLFEGNQQEAEKLAAEKF 113
>gi|374376430|ref|ZP_09634088.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
gi|373233270|gb|EHP53065.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
Length = 946
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 173/495 (34%), Positives = 263/495 (53%), Gaps = 40/495 (8%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
YQ GD+ + K + YRR LDL TA Y+ V+F R + +S P QV+
Sbjct: 289 YQPFGDVVFHVNADETKVKD--YRRVLDLETAVLTTAYNYNGVDFKRTYIASQPQQVLAV 346
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+ S GS+SF L S H V +Q + + K D ++ +
Sbjct: 347 NFTASRPGSVSFETELTSP-HQHFIVEAVDQ---------QTLVLKIQVKDG--ALRGES 394
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
++++++ +G++ A++D KL V +D A + + A+++F N D DP++ +A
Sbjct: 395 YVQVRVT--KGSV-AVKDNKLIVSKADEATVFIAAATNFK----NFKDVSADPSARCRAA 447
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
++ I+ S++ + H+ +YQ+ F+ +S+ + +++P+ R++ F
Sbjct: 448 IKGIQQQSFASVLKAHVKEYQQYFNTLSVNFYGQKNQPSAN-------ESLPTDLRLEKF 500
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
DP V L Q+GRYLLISSSRPGT ANLQGIWNE LSP W S NIN EMNYW
Sbjct: 501 ARSGDPEFVALYMQYGRYLLISSSRPGTYPANLQGIWNELLSPPWGSKYTTNINAEMNYW 560
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+ LS + LF + L+++G +TA+ Y A GWV+HH TD+W ++A
Sbjct: 561 PAELLGLSPLHDALFKMVEELAVSGKETAKEYYNAPGWVLHHNTDLWRGTAAINASNH-G 619
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPST 520
+W GGAWLC+HLWE Y +T D FL+ AYP++ A F +LI+ GYL + PS
Sbjct: 620 IWVTGGAWLCSHLWERYLFTKDERFLKDTAYPIMREAALFFNHFLIKDPVTGYLISTPSN 679
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPEH G L TMD IIR +F + I A+++L K + AL +++ + PR+
Sbjct: 680 SPEH------GGLVA---GPTMDHQIIRALFKSTIEASQIL-KTDAALRKELEEKYPRIA 729
Query: 581 PTKIAEDGSIMEWVQ 595
P KI G + EW+Q
Sbjct: 730 PNKIGRFGQLQEWMQ 744
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 53/75 (70%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK + +A+PIGNGRLGAMV+GGV ++ ++ NE+TLW+G P DY A + L
Sbjct: 24 LKLWYQHPAKEWVEALPIGNGRLGAMVFGGVQTDRVQFNEETLWSGYPRDYNKKGAYRYL 83
Query: 73 SDVRSLVDSGQYAEA 87
+R L+ +G+ EA
Sbjct: 84 DSIRGLLFAGKQKEA 98
>gi|391227681|ref|ZP_10263888.1| hypothetical protein OpiT1DRAFT_00166 [Opitutaceae bacterium TAV1]
gi|391223174|gb|EIQ01594.1| hypothetical protein OpiT1DRAFT_00166 [Opitutaceae bacterium TAV1]
Length = 839
Score = 286 bits (731), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 196/633 (30%), Positives = 302/633 (47%), Gaps = 86/633 (13%)
Query: 17 FNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
F+ PA+ + A+PIGNGR GAM++G + +E L+LNED+LW G P D NPDA + L +
Sbjct: 14 FDQPAQQDWNRALPIGNGRFGAMIYGNIVAERLQLNEDSLWNGGPRDRRNPDAREHLPVL 73
Query: 76 RSLVDSGQYAEA-TAASVKLFGHP--ADVYQLLGDIELEF-----------DDSHLKYAE 121
R L+ G+ A A L G P Y+ L D+ L F D+ L
Sbjct: 74 RKLLADGRLAAAHDLVHDALAGIPDSQRCYEPLADLFLHFEHPGAPVAVSADEMALASGY 133
Query: 122 ET----------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
T YRR LDL TA V Y++ N + R H +S DQVI + G L
Sbjct: 134 ATPRFDPALLSHYRRALDLRTAVMSVDYTLNNTSYARRHLASAADQVIALHLRAGRPGGL 193
Query: 172 SFNVSLDS---------LLDNHSYVN----------GNNQIIMEGRCPGKRIPPKANAND 212
+ + L+ D +V + +++ GR G+
Sbjct: 194 TLRLRLERGPRKSYSTRYADTVGFVADAAREPSDACASPALLLRGRAGGE---------- 243
Query: 213 DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
G++F+ L +I+ G + + + L ++ +D L+L A+++F +
Sbjct: 244 --DGVRFAVGLRARIAG--GALRRI-GETLCIDAADSVTLVLAAATTF---------RED 289
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP + + + + + H +Y+ F R S+ L +E ++V
Sbjct: 290 DPAAFVIGRTGAALARGWDKIRADHEREYRSRFDRASLTLG-------APAAAEAGAESV 342
Query: 333 PSAERVK-SFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPH 391
P R+K + ++ DP L L F + RYLLISSSRPG+ ANLQG+WN D P+W S
Sbjct: 343 PVDLRLKRARESGGDPVLASLYFNYARYLLISSSRPGSLPANLQGLWNADFWPSWGSKYT 402
Query: 392 VNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS 451
+NIN EMNYW + P NL++C +PLFD L + +G +TA+V Y G+V HH TD+WA +
Sbjct: 403 ININTEMNYWIAEPANLADCHQPLFDHLGRVVESGRETARVMYGCRGFVAHHNTDLWADT 462
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
+ W +GGAWL H W+ ++Y D L AY LL + F LD+LIE
Sbjct: 463 CPTDRNAGASYWTIGGAWLVLHAWDRFDYDRDPGSLAA-AYALLREASLFFLDFLIEDAR 521
Query: 512 GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA---- 567
G L +P+ SPE+ + P+G+ + TMD ++ +F AA++L + A
Sbjct: 522 GRLVLSPTCSPENTYRLPNGEAGVLCAGCTMDSQLLSILFRRTAQAAQLLGRRPLAAAAI 581
Query: 568 -----LVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ +V + RL + G ++EW++
Sbjct: 582 AGDHDFLARVAAAAARLPQPAVGRHGQLLEWLE 614
>gi|383113365|ref|ZP_09934137.1| hypothetical protein BSGG_3069 [Bacteroides sp. D2]
gi|313695534|gb|EFS32369.1| hypothetical protein BSGG_3069 [Bacteroides sp. D2]
Length = 859
Score = 286 bits (731), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 202/633 (31%), Positives = 319/633 (50%), Gaps = 74/633 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--------Y 63
LK T+N PAK + ++A+PIGNG +GAM++GGV + ++ NE TLW+G PG+
Sbjct: 32 LKATYNKPAKVWESEALPIGNGYMGAMIFGGVEVDVIQTNEHTLWSGGPGEDPSYNGGHL 91
Query: 64 TNPDAPKA-LSDVRSLVD------SGQYAEATAASVKLFGHPAD---------------- 100
P+ K+ L R L+ + ++ A KL H +
Sbjct: 92 GTPETNKSYLHKTRVLLQQKMNDFTANHSAYIDADGKLITHNYEGDGNGTELRNLIDKLA 151
Query: 101 -------VYQLLGDIELEF-DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFS 152
+Q L +I +E + + + A Y R LD++ A RV Y G + F RE+F
Sbjct: 152 GTKEHFGSFQTLSNIIVEVVNPATSEPAYSDYTRTLDIDNAIHRVTYKEGGITFKREYFM 211
Query: 153 SNPDQVIVTKI-SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
S PD ++V ++ S S+ G +S +SL+SL + +N I + G P K +
Sbjct: 212 SYPDNIMVMRLTSDSKKGKISRMISLESLHTDKVIRASDNTITLTGY-PTPTSGDKRVGD 270
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD-- 269
G++++ L +K + G I+ ++ KKLK+E + ++L+ A++++ +
Sbjct: 271 HWKNGLKYAQQLLVKHTG--GKITVVDGKKLKIEEAKEIIVLMSAATNYVQCMDDSYHYF 328
Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
S ++P + + L+ N Y+ L H DY L+ R+ + L + V T
Sbjct: 329 SGEEPLDKVKATLKKAANKKYTALLAAHEKDYHSLYDRMKLNLGNLTEMPVVTT------ 382
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
D++ ++ E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W+S
Sbjct: 383 DSLLKGMDARTNSESENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNSD 442
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHH 443
H NIN++MNYW + P NLS C P+ +++ L G TAQ Y GWV HH
Sbjct: 443 YHTNINVQMNYWPTQPTNLSRCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHH 502
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
+ +IW ++ + K +P G W+C +WE+Y + +D+DFLE Y ++ A F +
Sbjct: 503 ENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNLDKDFLEAY-YDVMLQAALFWV 560
Query: 504 D--WLIEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
D W E DG L NPS SPEH EF L C + A+I E+F +I A++V
Sbjct: 561 DNLWTDE-RDGTLVANPSHSPEHGEF-----SLGC-----STSQAMIAEMFDMMIKASKV 609
Query: 561 LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
L K+++ + ++ ++ +L KI G +MEW
Sbjct: 610 LGKDKEPEIAEIETAMNKLSGPKIGLGGQLMEW 642
>gi|423223092|ref|ZP_17209561.1| hypothetical protein HMPREF1062_01747 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392639998|gb|EIY33805.1| hypothetical protein HMPREF1062_01747 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 776
Score = 285 bits (730), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 187/583 (32%), Positives = 288/583 (49%), Gaps = 43/583 (7%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + A+PIGNGR+G M++G +E + +NE+T+W G P NP P+ ++ +R+L+
Sbjct: 32 PASDWKSALPIGNGRIGGMIFGDPSTEQIVINENTIWCGPPLPPNNPKGPELINKMRNLI 91
Query: 80 DSGQYAEATAASVKLFG----HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
+G+Y EA K F A YQ G + ++F D K A Y+R LD A
Sbjct: 92 FNGKYEEAVIVCEKEFADGVHENARSYQPFGFLNIDFKD---KGAISNYKRWLDYTKAIT 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
V Y+ V +TRE F S P++V+V +I+ + G +SF + N +
Sbjct: 149 YVSYTQNGVTYTREAFVSKPNEVMVVRITADKPGQVSFKSKYTRPFGATTKAENNRSQYV 208
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
+G+ + N + G++F I I ++ G I A +++ ++ +++
Sbjct: 209 QGQAYAE--------NGEFVGVKFEGI--INYYNEGGKIKA-NGTDIEINNANSVTIMIA 257
Query: 256 ASSSFDGPFINPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
S+ + N D+K T L + L Y L H+D+Y L++R S
Sbjct: 258 ISTDY-----NIHDTKNVLTHNRKKICEKQLSQAQKLGYKKLKQTHIDEYSALYNRSSF- 311
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
DI +T N P +R++ + + D L+ + + RYL ISSSR G
Sbjct: 312 ------DIAFNTPVNNN----PIDKRIQLAASGQIDSELLFEYYNYCRYLFISSSRKGGL 361
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
NLQGIWN + W S H+N+N++ YW + NLSEC EP+F L NG +TA
Sbjct: 362 PMNLQGIWNPLMLAPWRSNFHINVNIQEAYWFAEQANLSECHEPMFTLTENLIKNGKETA 421
Query: 431 QVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
QV + G V H+TD W + K W + AWLC H EHY YT+D++FL+
Sbjct: 422 QVMFGTKRGSVAGHRTDAWFYAPPTFLKAHWGMSITNAAWLCLHHMEHYRYTLDKEFLKT 481
Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
RA P+L A F +DWL+ + G L + P+ SPE+ F +GK+A ++ S T D II
Sbjct: 482 RALPVLRETALFFVDWLVPDPRSGKLVSGPTASPENRFKV-NGKVASLTMSCTYDQEIIW 540
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
F + A ++L + + VE V S+ +L IA DG +M
Sbjct: 541 NTFRDFLEACKILGISNEETVE-VEASMKKLSMPTIANDGRLM 582
>gi|149277534|ref|ZP_01883675.1| hypothetical protein PBAL39_05083 [Pedobacter sp. BAL39]
gi|149231767|gb|EDM37145.1| hypothetical protein PBAL39_05083 [Pedobacter sp. BAL39]
Length = 780
Score = 285 bits (729), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 185/611 (30%), Positives = 315/611 (51%), Gaps = 54/611 (8%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M+++ + LK+ + PA+ + + + +GNGRLG M GG+ ET+ LN+ TLW+G P
Sbjct: 15 MLSSNGVFSQAKLKLWYEHPAQKWEETLALGNGRLGMMPDGGITRETVVLNDITLWSGAP 74
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEF 112
D N +A K+L +R L+ G+ EA + F G +Q+LG +++ F
Sbjct: 75 QDANNYEASKSLPQIRKLLAEGKNDEAQELVNRDFICTGKGSGGVNYGCFQVLGTLQMNF 134
Query: 113 D---DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
+ + + Y REL + A A Y + V++ +E+ +S D + + +I+ + G
Sbjct: 135 SYPGATADQLKDNGYHRELSIGEAIASSSYQINGVKYKKEYLTSFDDDICLIRITADKPG 194
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F VS+ + + G ++ ++G+ + D KG+Q+ + + +
Sbjct: 195 ALNFKVSISRPERGEASIAGQ-ELQLQGQL---------DNGIDGKGMQYLSRVRAVLKG 244
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
+ T +K+ V V+L VAS G SD + T + M+A R
Sbjct: 245 GKLTT----EKEALVISKATEVILFVAS----GTDFRASDFRMK-TEQVMAAAMKKR--- 292
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDP 347
Y+ + H+ ++Q LF+RVS+ + + +D+VP+ R++ F + D
Sbjct: 293 YALQRSNHIRNFQHLFNRVSV------------SIGHQLMDSVPTDLRLERFHKNPAADL 340
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L +QFGRYL ISS+R G NLQG+W + W H+++N++MN+W N
Sbjct: 341 GFPALFYQFGRYLSISSTRVGLLPPNLQGLWANQIQTPWTGDYHLDVNVQMNHWPVEVSN 400
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL + + L G +TA+ Y A GW+ H T++W + W G
Sbjct: 401 LSELNLPLAELVRGLVKPGQRTAKAYYNADGWIAHVITNVWGFTEPGE-SASWGSSNAGS 459
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEF 526
WLC +LW+HY ++ D+++L + YP+L+G A F L+ + G+L T PS SPE+ F
Sbjct: 460 GWLCNNLWDHYAFSNDKEYL-RSIYPILKGSAEFYNSVLVRDEETGWLVTAPSVSPENSF 518
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV--LEKNEDALVEKVLKSLPRLRPTKI 584
P+GK A +S T+D I+RE+F +I+A+E+ L+ A++++ LKS+P I
Sbjct: 519 YLPNGKTASISMGPTIDNQIVRELFGNVIAASEMLGLDAGFRAILQEKLKSIP--PAGNI 576
Query: 585 AEDGSIMEWVQ 595
++DG IMEW++
Sbjct: 577 SKDGRIMEWLR 587
>gi|422874794|ref|ZP_16921279.1| fibronectin type III domain-containing protein [Clostridium
perfringens F262]
gi|380304435|gb|EIA16724.1| fibronectin type III domain-containing protein [Clostridium
perfringens F262]
Length = 1479
Score = 285 bits (729), Expect = 6e-74, Method: Compositional matrix adjust.
Identities = 187/604 (30%), Positives = 312/604 (51%), Gaps = 70/604 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA + +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATKWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL+++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIDESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE ++ +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENANEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKSDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPE
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEQ----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELEDKRERLLKP-QIGKHGQ 600
Query: 590 IMEW 593
+ EW
Sbjct: 601 VQEW 604
>gi|325261844|ref|ZP_08128582.1| fibronectin type III domain protein [Clostridium sp. D5]
gi|324033298|gb|EGB94575.1| fibronectin type III domain protein [Clostridium sp. D5]
Length = 805
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 196/597 (32%), Positives = 297/597 (49%), Gaps = 46/597 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKA-LSD 74
+ PAK FT A+P+GNG LGAMV+GG P E + LN DTLW+G PG + P+ +
Sbjct: 10 YTHPAKDFTQALPLGNGHLGAMVYGGFPRERISLNLDTLWSGHPGHWHGKQKIPQGTMER 69
Query: 75 VRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VRSL+D+G Y EA K + G + Y G +EL+FD + Y E R L L A
Sbjct: 70 VRSLIDAGAYWEAQKQIQKHMLGCNNESYLSAGSLELQFD-TEADY--EGCERRLSLEEA 126
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
R + + + + F S + +I +E +S +SL + L + +
Sbjct: 127 ITRTDWELKGQKVREDVFVSAVQNGMYIRIF-TEGAPVSVAISLQTQLRVLQSAAEADGL 185
Query: 194 IMEGRCPG----KRIPPKA--NANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
++ + P +P + +++ G+ + L I D G I E+ + VE
Sbjct: 186 LLVAQAPSHVEPNYVPSREPIQYDEEKPGMIYGLFLGINECD--GGIKRTEEG-ICVENF 242
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNL-SYSDLYTRHLDDYQKLFH 306
+ L + ++G + P + + + + L S+ + + HL ++Q+L+
Sbjct: 243 TCLTMFLSGETEYEG-YGKPLNGQAESIIRYLRERGHRAKLKSWEENFRAHLREHQRLYL 301
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSS 365
R V + E + P+ ER++ ++ EDP L LLF +GRYL+++SS
Sbjct: 302 RT-----------VLELEGGEEEEQRPTDERLEMVRSGKEDPGLSALLFHYGRYLILASS 350
Query: 366 RPG---TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
RP Q A LQGIW ED+ W S VNIN +MNYW P NL EC+ PL + L
Sbjct: 351 RPLDGLVQPATLQGIWCEDVRSVWSSNWTVNINTQMNYWICGPGNLPECEIPLIRMVKEL 410
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S + + A N G+V+HH D+W + G+V WA WPMGG WL THL+ HY YT
Sbjct: 411 S-DAGREAAANLNCRGFVVHHNVDLWRQCIPALGEVKWAYWPMGGLWLTTHLYRHYLYTG 469
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+++LEK YP+ + C +F+LD+L HDG +T PSTSPE+ F + S T
Sbjct: 470 DKEYLEK-IYPVFQECTAFILDYLY--HDGSAYQTCPSTSPENTFYDEQERECAACVSPT 526
Query: 542 MDMAIIREVFSAIISAAEVL-----EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
MD+A+IREV ++ E++ E + +VL LP + G ++EW
Sbjct: 527 MDIALIREVLCNLLEIDEIIRGTRPESGQCREARRVLNELPAF---QTGSRGQLLEW 580
>gi|256394373|ref|YP_003115937.1| alpha-L-fucosidase [Catenulispora acidiphila DSM 44928]
gi|256360599|gb|ACU74096.1| alpha-L-fucosidase, putative, afc95A [Catenulispora acidiphila DSM
44928]
Length = 742
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 199/598 (33%), Positives = 300/598 (50%), Gaps = 78/598 (13%)
Query: 17 FNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNPDA 68
++ PA + +A+PIGNGR+GAMV+GGV +E ++ E+TLWTG PG D+ P
Sbjct: 7 YDAPASDWEREALPIGNGRIGAMVFGGVAAERVQFTEETLWTGGPGHPGYDHGDWREP-R 65
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEETYR 125
P AL +VR +D + T +L G P +Q GD+ +EF L + YR
Sbjct: 66 PGALEEVRRRIDE-HGSLPTQTVTELLGQPKTGFGAFQNYGDLIIEF--PGLSEEAQDYR 122
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLD 182
R LD++ A A V + V TRE+F S+P V++ +++ + G+L + + D
Sbjct: 123 RTLDISDALAGVAFEADGVHHTREYFVSHPAGVLLGRLTADQPGALHCVLRYEPGTDATD 182
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ +++ G P G++ +A IK+ + G + ED+ L
Sbjct: 183 ATRVTTEDATLVIIGALPDN-------------GLRHAA--RIKVIPEGGRLIEGEDR-L 226
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+EG+D V++L A++ + + + DP A+ +Y DL H+ D+
Sbjct: 227 TIEGADRVVIILAAATDYADTYPAYRNGI-DPAGPVAEAVAKAAASTYDDLRAAHIADHS 285
Query: 303 KLFHRVSIQLSRS-PKDIVTDTC-SEENID-TVPSAERVKSFQTDEDPSLVELLFQFGRY 359
LF RV + L S P D+ TD + D + P+A+R +L +L F GRY
Sbjct: 286 ALFDRVVLDLGGSLPGDVPTDRLLTAYGTDASTPAADR----------ALEQLFFDHGRY 335
Query: 360 LLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
LLI+SSRP +Q+ ANLQG+WN +P W HVNINL+MNYW + PC L EC EPLF +
Sbjct: 336 LLIASSRPASQLPANLQGVWNASPTPPWAGDYHVNINLQMNYWLAEPCALGECAEPLFAY 395
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEH 477
+ L G +A+ + GWV+H++T + + D W +P AWLC HLWEH
Sbjct: 396 IEALRAPGRVSARTLFGTEGWVVHNETTPFGFTGVHDWPDAFW--FPEAAAWLCRHLWEH 453
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLAC 535
Y +T+D +FL++RAYP+++ A F L L + DG L NPS SPE E+ A
Sbjct: 454 YAFTLDEEFLKERAYPVMKEAAQFWLANLRRDPRDGKLVANPSFSPEQGEYTA------- 506
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S M IIR++F + A +E + L +I G + EW
Sbjct: 507 ---GSAMAQQIIRDLFKNTVGLAAEVEDLDTGL--------------RIGSWGQLQEW 547
>gi|224537768|ref|ZP_03678307.1| hypothetical protein BACCELL_02651 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520588|gb|EEF89693.1| hypothetical protein BACCELL_02651 [Bacteroides cellulosilyticus
DSM 14838]
Length = 833
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 185/614 (30%), Positives = 308/614 (50%), Gaps = 69/614 (11%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GGV E + LNE +LW+G+ DY NPDA ++L
Sbjct: 41 QLYYTAPATIWEETLPLGNGRLGMMPDGGVDREHIVLNEISLWSGMEADYGNPDASRSLP 100
Query: 74 DVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLKYAEET--- 123
++ L+ G+ EA F G YQ+L D+ ++F H +
Sbjct: 101 AIQQLLFEGKNKEAQELMYSSFVPKKPESGGTYGNYQMLADLNIDFSFPHRRKTISENDA 160
Query: 124 -----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
YRR LDL A A ++ +++ RE+F+S V++ ++ S +LSF+ L
Sbjct: 161 APVTDYRRWLDLRDAVAYTSFTKEGIDYQREYFTSRDKDVMIIHLTTSRRRALSFSAQLS 220
Query: 179 -------SLLDNHSYVNGNNQIIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKI 227
S+L G +++EG PG+ +G+++ + +
Sbjct: 221 RPKQGAVSMLPGIGKEEGT--LLLEGTLDSGKPGR------------EGMKYRVAMRLIS 266
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSI 285
+ ISA ++ + + A L+L A++S+ + S ++ +S+ +A Q +
Sbjct: 267 KGGKQNISA--ERGITLTQGREAWLVLSATTSYAASGTDFSGNRYKEVCDSLLNAATQHV 324
Query: 286 RNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE 345
+ + H+ ++ + RVS+ L + D++ P+ ER+ F E
Sbjct: 325 Q------IKESHIASHRTFYDRVSLTLPFTEDDVL------------PTNERITRFTERE 366
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
P+L L + +GRYL ISS+RPG+ NLQG+W + W+ H NIN++MN+W
Sbjct: 367 SPALAALYYNYGRYLFISSTRPGSLPPNLQGLWANGVETPWNGDYHTNINIQMNHWPLEQ 426
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALW 463
LSE +PL + L +G +TA+ Y A GWV+H T+IW +A W
Sbjct: 427 AGLSELYQPLTALVERLIPSGEETARTFYGTHAQGWVLHMMTNIW-NYTAPGEHPSWGAT 485
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSP 522
GGAWLC HLWEHY YT D +FL KR YP+L+G + F ++ E G+L T P++SP
Sbjct: 486 NTGGAWLCAHLWEHYQYTQDIEFL-KRIYPVLKGASEFFYSTMVREPKHGWLVTAPTSSP 544
Query: 523 EHE-FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
E+ F+ D V TMD+ ++ E+++ +I A +LE + D K+ ++L + P
Sbjct: 545 ENAFFVGNDPTPVSVCMGPTMDVQLLTELYTNVIEATSILECDAD-YAAKLREALDKFPP 603
Query: 582 TKIAEDGSIMEWVQ 595
+I++ G + EW++
Sbjct: 604 MQISKGGYLQEWLE 617
>gi|423227144|ref|ZP_17213608.1| hypothetical protein HMPREF1062_05794 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392624284|gb|EIY18376.1| hypothetical protein HMPREF1062_05794 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 825
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 193/621 (31%), Positives = 310/621 (49%), Gaps = 79/621 (12%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY NPDA ++L
Sbjct: 29 QLYYTAPATIWEETLPLGNGRLGMMPDGGIDKEHIVLNEISLWSGMEADYGNPDASRSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFD-DSHLKYAEE--- 122
++ L+ G+ EA F G YQ+L D+ L F K+A +
Sbjct: 89 AIQQLLFEGKNKEAQELMYNSFVPKKPESGGTYGNYQMLADLTLNFSIPVKKKFASDEVV 148
Query: 123 ---TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
YRR LDL A A ++ G +++ RE+++S V++ ++ S SL F SL
Sbjct: 149 PVTNYRRWLDLRDAVAYTTFTKGGIDYQREYYTSRDKDVMIIHLTVSRRRSLFFTASLSR 208
Query: 180 LLDNH-SYVNGNNQ----IIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKISDD 230
S V G+ + +++EG PG+ G+++ + +
Sbjct: 209 PQQGTVSLVPGSGKEAGTLLLEGALDSGKPGQ------------DGMKYRVAMRVVSKGG 256
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN-PSDSKKD----------PTSESM 279
+ ISA ED + +G++ A L++ A++S+ + P K+ P S +
Sbjct: 257 KQFISA-EDGIMLTQGTE-AWLIISATTSYAAAGTDFPGSRYKEVCDSLLNAATPPSSQL 314
Query: 280 SALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
S L S + N S+ +LY R V+ T D +P+ ER+
Sbjct: 315 SILNSPLTNASHRELYDR-----------------------VSLTLPATEDDALPTNERI 351
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
F E P+L L + +GRYLLISS+RPG+ NLQG+W + W+ H NIN++M
Sbjct: 352 VRFAERESPALAALYYNYGRYLLISSTRPGSLPPNLQGLWANGVQTPWNGDYHTNINIQM 411
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRG 456
N+W LSE +PL + L +G TA+ Y A GWV+H T++W +A
Sbjct: 412 NHWPLEQAGLSELYQPLTGLVERLVPSGKGTARTFYGNHAQGWVLHMMTNVW-NYTAPGE 470
Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLE 515
W GGAWLC HLWEHY YT D ++L K+ YP+L+G + F ++ E G+L
Sbjct: 471 HPSWGATNTGGAWLCAHLWEHYQYTQDIEYL-KKIYPILKGASEFFYSTMVREPKHGWLV 529
Query: 516 TNPSTSPEHE-FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK 574
T P++SPE+ F+ D V TMD+ ++ E+++ +I AA +LE ++D K+ +
Sbjct: 530 TAPTSSPENAFFVGDDPTPVSVCMGPTMDVQLLTELYTNVIEAASILECDDD-YAAKLRE 588
Query: 575 SLPRLRPTKIAEDGSIMEWVQ 595
+L + P +I++ G + EW++
Sbjct: 589 ALGKFPPMQISKGGYLQEWLE 609
>gi|227536429|ref|ZP_03966478.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33300]
gi|227243805|gb|EEI93820.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33300]
Length = 798
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 192/610 (31%), Positives = 310/610 (50%), Gaps = 62/610 (10%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ + L++ ++ PAK + + +P+GNG +G M GGV E + LNE ++W+G D N
Sbjct: 42 VAQSGSLRLWYDKPAKRWEETLPLGNGLIGMMPDGGVQKEKIVLNEISMWSGSEEDPNNY 101
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLF-------GH------PADVYQLLGDIELEFD 113
A K++ +++ L+ G+ EA K F GH P YQ LG + L+F
Sbjct: 102 AAYKSVGEIQKLLVEGKNDEAEQLVNKNFVTSGKGSGHGNGANVPFGCYQNLGFLNLQFK 161
Query: 114 DSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
++ A+ T Y R LDL A AR +++ V++TRE+F+S V V ++ S+ G+L+
Sbjct: 162 EA----AQSTDYERSLDLKDAVARTNFTINGVKYTREYFTSFDQNVGVVRLKSSKKGALN 217
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
F+ SL S + Y + N+ M G I P D GI FS+ +IK+ G
Sbjct: 218 FSASL-SREEGVQYSSKGNEFSMSG------ILPDGKGGD---GISFSS--KIKVFHRGG 265
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
+ A D L V + ++ A++S+ DP L+ + Y
Sbjct: 266 KVVA-SDTALTVSKASEVLIFFAAATSY---------FHADPLQYVDEQLKQANDTPYPQ 315
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLV 350
L+ +HL Y+ +F+RV +QL D + I T +R+++F + +D L
Sbjct: 316 LFKQHLSRYESVFNRVDLQLE--------DDADKSGITT---DKRLRAFYDNPAQDNGLA 364
Query: 351 ELLFQFGRYLLISSSRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L +QFGRYL ISS+ P + A NLQG+W + W+ H+NIN +MN+W N
Sbjct: 365 ALYYQFGRYLNISSTAPDVKGALPPNLQGLWAHQIQTPWNGDYHLNINAQMNHWGVEVNN 424
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE P + + ++ G KTA+ Y A GWV++ T++W S+ + W G
Sbjct: 425 LSEYHIPFIELIKKIAKTGEKTARAYYNAPGWVVYMMTNVWGYSAPGE-QASWGASTASG 483
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
WLC HLWEHY +T D +L K YP+++G A F ++ + G+L T+PS SPE+ F
Sbjct: 484 -WLCNHLWEHYQFTKDSVYL-KEVYPVMQGAARFYAHTMVTDPKTGWLVTSPSVSPENAF 541
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIA 585
+GK A V +D I+RE++ +I A +L ++ +A + + + +L P I+
Sbjct: 542 RMKNGKTAAVVMGPAIDNQIVRELYRNLIDADSILGQH-NAFTDTLRIQIQQLAPPVLIS 600
Query: 586 EDGSIMEWVQ 595
+ G + EW++
Sbjct: 601 KSGRVQEWLE 610
>gi|192360052|ref|YP_001983169.1| alpha-L-fucosidase [Cellvibrio japonicus Ueda107]
gi|190686217|gb|ACE83895.1| alpha-L-fucosidase, putative, afc95A [Cellvibrio japonicus Ueda107]
Length = 782
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 199/610 (32%), Positives = 309/610 (50%), Gaps = 60/610 (9%)
Query: 2 MNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M+AE + + PL I F+ PA + + +PIGNG +GA++ GGV + ++ NE TLWTG P
Sbjct: 1 MSAEVSRESVPLAIAFDRPATDWEREGLPIGNGAMGAVISGGVEQDIIQFNEKTLWTGGP 60
Query: 61 G-----DYTNPDAPKA--LSDVR-SLVDSGQYAEATAASV---KLFGHPADVYQLLGDIE 109
G D+ P +A L+ VR S+ G + AA + K+ G+ YQ GD+
Sbjct: 61 GSVRGYDFGIPAESQASALAKVRDSIRKDGSISPEKAAELMGRKILGYGD--YQTFGDLI 118
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L F ++ + Y R L L+ + Y V +TRE+F+S PD VIV ++S + G
Sbjct: 119 LSFPENDSGVIK--YNRRLSLDEGRVILGYQQEGVTYTREYFASYPDGVIVVRLSADKPG 176
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+ V L + N Q+ R G ++ D+ G F+A I +
Sbjct: 177 QIHLRVGLRT--------PDNRQVTT--RIEGNQLDIVGELQDNKLG--FAA--RIAVVA 222
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS-ALQSIRNL 288
+ G + + L+V+ +D ++ A++++ + + + + +S L +
Sbjct: 223 EGGNLDNSGQQSLQVKRADAVTIVFAAATNYAQRYPHYRQADASYAQQKISNTLAAALQK 282
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
+Y+ L RH DYQ L+ RV++ + + + T + K+ D S
Sbjct: 283 NYAQLLARHTQDYQSLYKRVALDIGQGVHSLATPALLAQ----------YKTGNAALDRS 332
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L + FQFGRYLLI+SSRPG+ ANLQG+WN ++P W++ HVNINL+MNYW + NL
Sbjct: 333 LEAIYFQFGRYLLIASSRPGSLPANLQGVWNNSITPPWNADYHVNINLQMNYWLAETANL 392
Query: 409 SECQEPLFDFLTYLSINGSKTAQ-VNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-P 464
E +P FDF+ L G+ +AQ + ++ GW + T+IW + G + W A W P
Sbjct: 393 PELMQPYFDFVDSLVEPGNISAQRIADVSKGWALFLNTNIWGFT----GVIDWPTAFWQP 448
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPE 523
GAWL H +EH+ ++ D+ FL RAYPL++G A F LD+L++ DG PS SPE
Sbjct: 449 EAGAWLAQHYYEHFLFSGDQAFLRNRAYPLMKGAAEFWLDFLVKDPRDGLWVVTPSFSPE 508
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
H P A +S D+ +R A AA V +K LV++ LK++ R +
Sbjct: 509 H---GPFTTGAAMSQQIVFDL--LRNTSEA---AALVGDKKFKRLVDQTLKNMD--RGIR 558
Query: 584 IAEDGSIMEW 593
I G + EW
Sbjct: 559 IGSWGQLQEW 568
>gi|325103050|ref|YP_004272704.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324971898|gb|ADY50882.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 938
Score = 282 bits (721), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 178/498 (35%), Positives = 267/498 (53%), Gaps = 55/498 (11%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
YQ GDI L F H +Y Y+RELDLN+A A+ YS +TR +F + P +V
Sbjct: 294 YQPFGDIYLNF--KHQEYT--NYKRELDLNSALAKTSYSHKGTNYTRTYFVNAPQNTLVI 349
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+ ++ +++F S DS S ++I + A D +++ A
Sbjct: 350 HLEANQPKNVTFTASFDSPHSQKSI---------------RKIDDRTIALDVK--VKYGA 392
Query: 222 ILE---IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
+ + + + G IS +++ +L VEG+D A L+L A+++F +N D P+ ++
Sbjct: 393 LFGESILHLKNKNGKIS-VKNNQLVVEGADEATLMLFAATNF----VNFHDVSGKPSVKN 447
Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
L S +NL Y L HL DY L++R S+ + ++ +P+ ER+
Sbjct: 448 QQTLASAKNLDYQTLKQNHLQDYTSLYNRFSLSFGGNSRE------------DLPTDERI 495
Query: 339 KSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLE 397
+ F +T DP+L+ L Q+GRYLLISSSR TQ ANLQGIWN L+P+W S NIN+E
Sbjct: 496 REFSKTANDPALLALYAQYGRYLLISSSRANTQPANLQGIWNHLLAPSWGSKYTTNINVE 555
Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK 457
MNYW S NLS+ +PLF + LS +G++TA+ Y GWV+HH TDIW + +A
Sbjct: 556 MNYWLSEMLNLSDLHQPLFGMIEDLSKSGAETAKNYYNLPGWVLHHNTDIW-RGAAPINN 614
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLET 516
+WP GGAWL THL EHY +T D+ FL K+ YP+++ F D+L ++ G L +
Sbjct: 615 SNHGIWPTGGAWLTTHLLEHYAFTKDQAFL-KKYYPIIKNSVLFYKDFLVVDPISGCLIS 673
Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
PS SPEH G L TMD IIR +F ++ + L +ED L +++
Sbjct: 674 TPSNSPEH------GGLVA---GPTMDHQIIRALFDGFVNVSAALGLDED-LRKEIQTKK 723
Query: 577 PRLRPTKIAEDGSIMEWV 594
++ P KI + G + EW+
Sbjct: 724 QQILPNKIGKYGQLQEWM 741
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 57/79 (72%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PAK +T+A+PIGNG++GAM++GGV + ++ NE+TLWTG P +Y PDA K L +R
Sbjct: 32 YKQPAKEWTEALPIGNGKIGAMIFGGVAQDRIQFNEETLWTGSPRNYNKPDAYKYLPQIR 91
Query: 77 SLVDSGQYAEATAASVKLF 95
+L+ G+ EA A +++ F
Sbjct: 92 TLLQQGKQREAEALAMQEF 110
>gi|254785612|ref|YP_003073041.1| alpha-L-fucosidase [Teredinibacter turnerae T7901]
gi|237683920|gb|ACR11184.1| alpha-L-fucosidase [Teredinibacter turnerae T7901]
Length = 814
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 194/605 (32%), Positives = 308/605 (50%), Gaps = 71/605 (11%)
Query: 15 ITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG------DYTNPD 67
+ F PA + + +PIGNG +GA++ G + E ++ NE +LW G PG P+
Sbjct: 44 LLFFSPASDWENQGLPIGNGAMGAVITGEINKELVQFNEKSLWEGGPGAQGYNFGLAAPN 103
Query: 68 APKALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAE-ETY 124
P L V+ + G A + +L P + YQ GD+ +E HL E + Y
Sbjct: 104 FPAKLKAVQQQLAKGAVLSAETVATQLGQDPTEYGNYQTFGDLIIE----HLHSTEVQDY 159
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
RR L++ A A V+Y++ V + RE+F+S PD+VIV +I+ + G+L+ NV L + +
Sbjct: 160 RRNLNIENALASVEYTITGVGYRREYFASFPDKVIVLQIASDKPGALNLNVGLHTSDNRS 219
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+N R+ N++ G++++A++E++ GT++ DK L++
Sbjct: 220 QLLNATTH----------RMSLSGALNNN--GLRYAAMVEVRTQS--GTVARTSDK-LQI 264
Query: 245 EGSDWAVLLLVASSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+D L+L ++ + P + P + + L S+ Y L +RH+ DY+
Sbjct: 265 RSADKVTLVLATATDYAPVYPTYRVASGAPSPLAVVETRLNSLTKKGYPLLKSRHITDYR 324
Query: 303 KLFHRVSIQLS--RSPKDIVTDTCSEENIDTVPSAERVKSFQTD---EDPSLVELLFQFG 357
LF RV++ L+ SP + DT P R++++ D +L L F +G
Sbjct: 325 SLFQRVTLNLTPNSSPNSVA---------DTKPLPARLEAYHKDTPENKRALETLYFNYG 375
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSR G+ ANLQG+WN +P W++ HVNINL+MNYW +L NLSE PL+D
Sbjct: 376 RYLLIASSRAGSLPANLQGVWNHSNTPPWNADYHVNINLQMNYWPALVTNLSETTPPLYD 435
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-PMGGAWLCTHL 474
F+ L G K+AQ +GW + T+I+ S G + W A W P AWL
Sbjct: 436 FVDALRAPGEKSAQTLGADAGWAVLLNTNIFGFS----GLISWPTAFWQPEANAWLMRLY 491
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
++ Y +T D+ FL +RAYP ++ + F + +L + DG NPS SPEH
Sbjct: 492 FDFYQFTGDKKFLRERAYPAMKSTSQFWMTFLTQ-RDGTYWVNPSYSPEH---------G 541
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT----KIAEDGSI 590
S ++M I+ E+F +AAE+L+ + A + LK P L+ T +I + G +
Sbjct: 542 PFSEGASMSQQIVSELFRNTHAAAEMLKDRQFA---RSLK--PFLQNTDDGLRIGKWGQL 596
Query: 591 MEWVQ 595
EW Q
Sbjct: 597 QEWQQ 601
>gi|189464509|ref|ZP_03013294.1| hypothetical protein BACINT_00851 [Bacteroides intestinalis DSM
17393]
gi|189438299|gb|EDV07284.1| hypothetical protein BACINT_00851 [Bacteroides intestinalis DSM
17393]
Length = 817
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 198/598 (33%), Positives = 297/598 (49%), Gaps = 72/598 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G V +E + LNE TLW G P DY N + L ++R
Sbjct: 64 SLPIGNGSLGANILGSVAAERITLNEKTLWRGGPNTSGGADYYWNVNKQSAPILKEIRQA 123
Query: 79 VDSGQYAEATAASVKLFG----------HPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
G +A + K F HP + +G++ +E D S L+ + YRR
Sbjct: 124 FTEGNGEKAAQLTRKNFNGLAAYEEKDEHPFRFGSFTTMGELYIETDLSELRM--KNYRR 181
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
L L++A A V++ V++ R++F S PD V+ + S ++G + +S + S
Sbjct: 182 ILSLDSAMAVVQFDKEGVQYRRKYFISYPDSVMAMEFSADKAGKQNLVLSYAPNPEAQSN 241
Query: 187 V--NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ +G + ++ G + G++F+ IK GT+ A D+ L V
Sbjct: 242 IRTDGTDGLVYTGVL-------------NNNGMKFA--FRIKAIAKGGTVIAQNDR-LIV 285
Query: 245 EGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+G+D V LL A + +F+ F NP DP + S + Y L H
Sbjct: 286 KGADRVVFLLTADTDYKMNFNPDFKNPKTYVGDDPELTTQSMMNQALLKGYETLANNHKA 345
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF+RV + L+ P +D +P+ +R+ +++ + D L EL +QFGR
Sbjct: 346 DYTALFNRVKLTLN--PDVTGSD---------LPTYQRLANYRKGQPDFRLEELYYQFGR 394
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ +L W H NIN++MNYW + P NLSEC PL DF
Sbjct: 395 YLLIASSRPGNLPANLQGMWHNNLDGPWRVDYHNNINIQMNYWPAGPTNLSECTWPLIDF 454
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-WALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ +S +++ W PM G WL TH+WE+
Sbjct: 455 IRGLVKPGEKTAQAYFAARGWTASISANIFGFTSPLSSEIMAWNFNPMAGPWLATHIWEY 514
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT DR+FL++ Y L++ A F +D+L DG PSTSPEH V
Sbjct: 515 YDYTRDRNFLKEVGYDLIKSSAQFTVDYLWHKPDGTYTAAPSTSPEH---------GPVD 565
Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+T A++RE+ I A++VL + E ++VL L P KI G ++EW
Sbjct: 566 EGATFVHAVVREILLDAIEASKVLGVDSRERKHWQEVLA---HLVPYKIGRYGQLLEW 620
>gi|383638758|ref|ZP_09951164.1| alpha-L-fucosidase [Streptomyces chartreusis NRRL 12338]
Length = 740
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 189/583 (32%), Positives = 286/583 (49%), Gaps = 61/583 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG--------DYTNPDAPKALSDVRSL 78
A+P+GNG LGAMV+G + SE ++ NE TLWTG PG D+ P P A+ V+
Sbjct: 15 ALPVGNGALGAMVFGSIASERVQFNEKTLWTGGPGSVQGYDHGDWREPR-PTAIDAVQDD 73
Query: 79 VDSGQYAEATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
+D+ + + +L G P YQ GD+ L+F + E YRREL L+T A
Sbjct: 74 LDTRRRLAPEDVAGRL-GQPRVGFGAYQTFGDLYLDFPGTP---TPEAYRRELALDTGVA 129
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
V Y+ RE F+S PD VIV +I ++F + S + + ++ +
Sbjct: 130 SVAYTHRQTRHRREFFASFPDGVIVGRIGADRPAGITFTLRYTSPRGDFTTTATGGRLTV 189
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
G K N G++F A ++++ D G +++ D + V G+D A +L
Sbjct: 190 RGAL-------KDN------GLRFEA--QVQVRSDGGAVTSGADGTITVTGADSAWFVLA 234
Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
A + + +P DP A+ + Y L RH+ D++ LF RV++ + +S
Sbjct: 235 AGTDYAD--THPDYRGADPHPAVTRAVDRASSRGYDSLRARHIADHRTLFARVTLDIGQS 292
Query: 316 -PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANL 374
P ++ TD +A+R +L L FQ+GRYLLI+SSR G+ ANL
Sbjct: 293 APAEVPTDRLLASYTGGTSAADR----------ALEALFFQYGRYLLIASSRAGSLPANL 342
Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
QG+WN SP W + HVNINL+MNYW + NL E P F+ L G TA+ +
Sbjct: 343 QGVWNHSTSPPWSADYHVNINLQMNYWLAEAANLPETTVPYDRFVQALRAPGRHTARQMF 402
Query: 435 LASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
+ GWV+H++T+ + + D W +P AWL L+EHY + D+L AYP
Sbjct: 403 GSRGWVVHNETNPYGFTGVHDWATAFW--FPEAAAWLTQQLYEHYRFGGSTDYLRTTAYP 460
Query: 494 LLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVF 551
+++ A F LD L + DG L PS SPEH +F A + M I+ ++F
Sbjct: 461 VMKEAAEFWLDNLRTDPRDGRLVVTPSYSPEHGDFTA----------GAAMSQQIVHDLF 510
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
+ + AA VL + D ++V ++L L P +I G + EW
Sbjct: 511 TNTLEAARVLGDSRD-FRQRVEQALAHLDPGLRIGSWGQLQEW 552
>gi|345881344|ref|ZP_08832866.1| hypothetical protein HMPREF9431_01530 [Prevotella oulorum F0390]
gi|343920009|gb|EGV30749.1| hypothetical protein HMPREF9431_01530 [Prevotella oulorum F0390]
Length = 834
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 189/598 (31%), Positives = 296/598 (49%), Gaps = 71/598 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT--------NPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE +LW G PG + N A L +R+
Sbjct: 82 SLPIGNGSLGANILGSIAAERITLNEKSLWRGGPGVSSDASYYWNVNKHAAPVLKAIRAA 141
Query: 79 VDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETYRR 126
+G A+A + + K F A + +G++ +E + ++++ YRR
Sbjct: 142 FLAGDKAKADSLTRKNFNGLAAYESYAEKPFRFGNFTTMGELTIETGLNDAQFSD--YRR 199
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNH 184
EL L++A V++ V + R F S PD V+V + + G +L F+ + + +
Sbjct: 200 ELSLDSARTLVQFVHDGVRYARTAFISYPDNVMVLRFKANAKGMQNLCFHYAPNPVSTGK 259
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+G N ++ G D G+Q+ ++ I+ GT+ + L +
Sbjct: 260 MQADGANGLVYRGAL-------------DSNGMQY--VVRIQAVTHSGTLEN-SGQTLTI 303
Query: 245 EGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRHLD 299
+G+D V L+ A + +FD F NP P + +Q Y+ L+ RH
Sbjct: 304 KGADEVVFLITADTDYRINFDPDFHNPKTYVGVQPEVTTEKWMQQAAERGYAQLFQRHFK 363
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF RV +QL+ ++ N VP+A+R+ +++ D L EL +QFGR
Sbjct: 364 DYSPLFQRVKLQLN----------AAQTNDKDVPTAQRLAAYRNGATDNYLEELYYQFGR 413
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ ++ W H NIN++MNYW NL+EC PL DF
Sbjct: 414 YLLIASSRPGNLPANLQGLWHNNVDGPWRVDYHNNINVQMNYWPVHTTNLNECALPLVDF 473
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G+ TA+ Y A GW ++I+ ++ + + W L PMGG WL THLWE+
Sbjct: 474 VRTLVKPGAVTAKAYYGARGWTTSVSSNIFGFTAPLASEDMSWNLCPMGGPWLATHLWEY 533
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y++T D+ FL Y +++ A+F +D+L DG PSTSPEH +
Sbjct: 534 YDFTRDKRFLRSTLYDIIKQSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPID 584
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALV--EKVLKSLPRLRPTKIAEDGSIMEW 593
T A+IRE+ I+A++VL+ +E A + VL LP P +I G + EW
Sbjct: 585 EGVTFVHAVIREILLDAIAASKVLQVDETARKQWQMVLLHLP---PYRIGRYGQLQEW 639
>gi|429852446|gb|ELA27582.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 796
Score = 280 bits (716), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 179/580 (30%), Positives = 280/580 (48%), Gaps = 35/580 (6%)
Query: 22 KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDS 81
+ F +A+PIGNGRLGAM+ G E ++LNE+++W G P D A AL +R +
Sbjct: 37 RDFYEALPIGNGRLGAMIHGYTDKELIRLNEESIWNGGPRDKIPTTALDALEPLREQILD 96
Query: 82 GQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVK 138
G+ EA V F D YQ G++ L+F+ H YR LD++ + +
Sbjct: 97 GRLTEADQNWVANFTPEYDDMRRYQPAGELRLDFN--HTLNETSGYRHSLDVSKGLSSLS 154
Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGR 198
Y G VE+TRE F + P V+ + S + SGSLS + SL N +
Sbjct: 155 YVFGGVEYTREAFGNAPKNVLAFRFSCNSSGSLSLDASLS---------RDRNVTELTAD 205
Query: 199 CPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
G+ + +D +F + ++ + D G I + L + + ++ A +
Sbjct: 206 AAGRILKLDGTGEEDDT-YRFVSQAQVLLPDGVGDIIS-NGTALHIRNATDVFIIYTAET 263
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
+F +P + + L++ + Y + + DY++ + R SI S
Sbjct: 264 AFR----HPDATMAQLETIVNGRLETAQEAGYETIQREAVKDYKQYYDRTSIDFGTS--- 316
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
+ S++ I + +R + TD P L+ L F G+YLLI SSRPG+ ANLQGIW
Sbjct: 317 --QEIGSKDTIARLEDWKRGSNITTD--PELMALQFNVGKYLLIQSSRPGSLPANLQGIW 372
Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASG 438
N D P WDS +N+NLEMNYW + P NL E P+ DFL L++ GS+ A+ Y A G
Sbjct: 373 NRDFGPPWDSKFTINVNLEMNYWPAQPLNLPEIAGPVVDFLDRLAVTGSEVAKGMYGADG 432
Query: 439 WVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
W HH TDI + + A +P+GGAWL E++ +T D + R P+L+G
Sbjct: 433 WCCHHNTDITGDCTPFHAITIAAPYPLGGAWLAFEAIEYFRFTGDTTYARDRILPILKGA 492
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDMAIIREVFSA 553
F+ W E DG+ TNPS SPE+ + P+ G+ + + D AI+ E+ S
Sbjct: 493 MDFIYSWATE-RDGWRITNPSCSPENSYYIPENMTVAGETTGIDAGAMNDRAIMWEIMSG 551
Query: 554 IISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +E L +E A + + +++P G ++E+
Sbjct: 552 FLEISEALSSDEGADRARSFRD--KIQPPVAGSFGQLLEY 589
>gi|336427807|ref|ZP_08607799.1| hypothetical protein HMPREF0994_03805 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336008767|gb|EGN38776.1| hypothetical protein HMPREF0994_03805 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 784
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 187/603 (31%), Positives = 289/603 (47%), Gaps = 95/603 (15%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
DA P+GNG LGAMV+G + ++LNED+LW G D NP+A + L +V+ L+ ++
Sbjct: 37 DATPMGNGFLGAMVYGHTARDRIQLNEDSLWHGKFRDRINPNAKEHLKEVQELILDRKFE 96
Query: 86 EATAASVKLFGH----PADV--YQLLGDIELEFDDS---HLKYAEET----YRRELDLNT 132
EA +F H P ++ + LG++ L + + + + E+ Y +L++
Sbjct: 97 EAEEL---MFSHMVSAPGNMRNFSPLGELNLALNTALPFQMGWLPESDGENYVSDLNMEE 153
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
+ + V++TRE F SNPD+V+ ++ + + + LD LL+ + + Q
Sbjct: 154 GILCISHEDKGVQYTREMFVSNPDRVLCIRLKSDKEKA----IRLDMLLNRVPFTD---Q 206
Query: 193 IIMEGRCPGKRIPPKA-----------------NANDDPKGIQFSAILEIKISDDRGTIS 235
+ + R PGK + D G +F+ L + ++D R
Sbjct: 207 RLPDDRRPGKFVSAGVWPVTRCERIYTENGHTLMMEGDENGTRFACGLTV-VTDGR---- 261
Query: 236 ALED--KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ED KL + V+ L ASS + ++D S+L + R Y+D+
Sbjct: 262 -IEDCYAKLVAHEAGEVVIYLAASSD---------NREEDFVGNVKSSLAAARAKGYADI 311
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
T H+ D+ R ++ L P E+ +
Sbjct: 312 RTDHIADFTSYMKRCTLAL--------------------PEDEKAGMY------------ 339
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQ+ RY+++S+ R G NLQGIWN + P+W+S NINL+MNYW + CNLS E
Sbjct: 340 FQYARYMMVSAGREGATAMNLQGIWNHEFCPSWESKYTTNINLQMNYWPAEICNLSTLHE 399
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLFD + + G A+ Y G + HH TDI+ A W MGGAW+ H
Sbjct: 400 PLFDLIHTVQERGRDVAKRMYGCRGTMCHHNTDIYGDCGTQDMYAAAAFWQMGGAWMAMH 459
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T+D DFL K YP++E A F +D+LI+ +GYL T PS SPE+ F+ DG
Sbjct: 460 LWEHYLFTLDEDFLRKE-YPVMEEFALFFVDFLIKDKEGYLVTCPSVSPENRFVLEDGSD 518
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ TMD IIR + SA + AA++L E A E++++ LRP +I G +
Sbjct: 519 TPICAGPTMDNQIIRGLMSACLEAAKILGIESPYKADFERIIRE---LRPNQIDSIGRLK 575
Query: 592 EWV 594
EW
Sbjct: 576 EWA 578
>gi|374384834|ref|ZP_09642351.1| hypothetical protein HMPREF9449_00737 [Odoribacter laneus YIT
12061]
gi|373227638|gb|EHP49951.1| hypothetical protein HMPREF9449_00737 [Odoribacter laneus YIT
12061]
Length = 780
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 180/585 (30%), Positives = 283/585 (48%), Gaps = 59/585 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG---DYTNPDAP---KALSDVRSLV 79
+A+P+GNG +GAM +GG + ++L E++ W G PG Y + K L +VR L+
Sbjct: 36 EALPVGNGYMGAMWFGGPVRDEIQLAEESFWAGGPGASKSYKGGNKEGSWKYLKEVRELL 95
Query: 80 DSGQYAEATAASVKLFGH---PADVYQLLGDIELEFDDSHLKYAEET-------YRRELD 129
+SG+ +A + + F P + GD L E YRR LD
Sbjct: 96 ESGEKEKAAELAGRYFVGEITPTEAGDQFGDFGGNQPFGSLGVTVEAADTSWTDYRRSLD 155
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L A +V+Y +G F +F+S P ++ V K + + G + V+ ++
Sbjct: 156 LERAMGKVEYDMGGTHFRNTYFASYPARMFVFKYTNNAPGGKDYRVTFETPHQGTKITVR 215
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ I++G+ +P + IK+ D G I + ++EG+
Sbjct: 216 KDLWIIQGKLASNGLPFEGR---------------IKVKTD-GKIR-FQKGVFRIEGAKN 258
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ +S++ + P D + A++ ++ DL H DY+ LF RV
Sbjct: 259 TEFYVSIASAYANTY--PLYRGNDYEEVNRKAIERAERGTWEDLQAEHETDYRSLFERVK 316
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPG 368
++L S ++ +P+ +R + DP L L FQ+GRYLLISSSRPG
Sbjct: 317 LELGHS------------GLEKLPTDKRQLRYSLGAYDPGLEALYFQYGRYLLISSSRPG 364
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T A+LQG WN L+ W H+NINL+M YW + NLSEC PL +++ L G
Sbjct: 365 TLPAHLQGRWNHQLNAPWACDYHMNINLQMIYWPAEVANLSECHLPLLEYIDKLREPGRV 424
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ + A GWV+H + + +A W P AWLC HLWEH+NYT DR+FL
Sbjct: 425 TAREYFNARGWVVHTMNNAFG-YTAPGWDFYWGYAPNSAAWLCAHLWEHFNYTRDREFLG 483
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
++AYP+++ A F +D+L+ DG+L ++PS SPEH IA +TMD I
Sbjct: 484 RKAYPIMKEVARFWMDYLVADEDGFLVSSPSYSPEHGDIA---------IGATMDQEIAW 534
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++F+ ++ A + + K + A + V RL P +I + G + EW
Sbjct: 535 DLFTNVLQAMDYV-KEDPAFADSVSDFRKRLLPLRIGKFGQLQEW 578
>gi|358368086|dbj|GAA84703.1| alpha-L-fucosidase 2 precursor [Aspergillus kawachii IFO 4308]
Length = 790
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 193/599 (32%), Positives = 290/599 (48%), Gaps = 62/599 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+N PA +FT +PIGNGRLGA +WG +E + LNE+++W G + NP + AL VR
Sbjct: 27 YNTPANNFTSTLPIGNGRLGAAIWG-TATENVTLNENSIWNGPFINRVNPRSYDALWPVR 85
Query: 77 SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G E ++ + G P + LG + L+F H + Y R LDL T
Sbjct: 86 SLLAQGNMTEGNDVTLANMVGIPDSPQSFSALGSLVLDF--GHDQAGISNYTRYLDLRTG 143
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNN 191
A V+Y+ V + RE+ +S PD V+ ++S S+ G L+ SL D + ++ ++
Sbjct: 144 VAVVEYTYREVHYRREYVASYPDGVVAVRLSSSQPGRLNVASSLARDRYVVSNQAAVSSD 203
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++ R K I DP IQF+ I +SD R T + V
Sbjct: 204 LGVLTLRAYSKNI-------SDP--IQFTTEARI-VSDGRATSNG--------------V 239
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFH 306
L+V ++S FI+ S + T E+ A L + + + + DY L
Sbjct: 240 SLVVRNASTVDIFIDTETSYRYTTRETREAEIKDKLDTASRSGFLTVKQNAIADYSTLAQ 299
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISS 364
RV + L S + +P+ R+ +++TD DP L L+F FGR+ LI+S
Sbjct: 300 RVDLNLG-----------SSGSAGNLPTDTRLVNYRTDPDSDPELAVLMFHFGRHSLIAS 348
Query: 365 SRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SR A NLQG+WN++ P W ++INLEMNYW + NL++ P D L
Sbjct: 349 SRATESPALPANLQGLWNQEFDPAWGGRFTIDINLEMNYWPAEVTNLADTFSPFIDLLDI 408
Query: 422 LSINGSKTAQVNYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ G A+ Y S G+V+HH TD+W ++ W +WPMGGAWL +L EHY
Sbjct: 409 VHGRGLDVAESMYHCSNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGGAWLSANLIEHYR 468
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLA 534
+T D L R +PLL+ A F +L +GY T S SPE +I PD G +
Sbjct: 469 FTRDETILRDRIWPLLQSAARFYYCYLFP-FEGYYSTGLSLSPEASYIVPDDMTTAGNVE 527
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ + TMD +++ E+F A+ +VL N K L +++ +I G I+EW
Sbjct: 528 GIDIAPTMDNSLLHELFQAVTETCDVLGINNTDCTTAA-KYLSKIKQPQIGSSGRILEW 585
>gi|384566468|ref|ZP_10013572.1| hypothetical protein SacglDRAFT_02625 [Saccharomonospora glauca
K62]
gi|384522322|gb|EIE99517.1| hypothetical protein SacglDRAFT_02625 [Saccharomonospora glauca
K62]
Length = 924
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 203/609 (33%), Positives = 306/609 (50%), Gaps = 58/609 (9%)
Query: 3 NAESTSTTNP----LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
A TS P L + ++ PA + ++ +P+GNG LG V+GGV +E L+ NE TLWT
Sbjct: 39 GAAETSDLRPSPEGLTLWYDEPASDWESEVLPVGNGALGVGVFGGVATERLQFNEKTLWT 98
Query: 58 GVPG-----DYTNPDAPK--ALSDVRSLVDSGQYAEATAASVKLFGHPA---DVYQLLGD 107
G PG D+ N P+ A+ +VR +D+ A+ KL G P YQ G+
Sbjct: 99 GGPGAADGYDFGNWREPRPGAIEEVRQRLDTELRADPEWVVSKL-GQPKRGYGAYQTFGE 157
Query: 108 IELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSE 167
I + + L+ + YRR L+L A A V Y V TRE+F+S D V+V + SG
Sbjct: 158 IRVS--GAELEEVAD-YRRYLNLADAVAGVSYEADGVHHTREYFASAADDVVVARFSGEV 214
Query: 168 SGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI 227
G++ V + + DN S N GR + A DD G+++ A +I++
Sbjct: 215 PGAVDVTVGV-TAPDNRS----KNLTARGGRIT------FSGALDD-NGLRYEA--QIQV 260
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
D G+ D + V +D L+L A + + + P +DP + + +
Sbjct: 261 LTDGGSRVDNPDGSVTVTDADTMTLVLAAGTDYSAEY--PVYRGEDPHAAVTERVDAAVA 318
Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP 347
Y L H+ D++ LF RVS+ L + D+ TD D +AE ++ +
Sbjct: 319 KGYDALRAAHVADHRGLFDRVSLDLGQRMPDLPTDELLARYRDGGLAAEERRALEV---- 374
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L FQ+GRYLLI+SSR G+ ANLQG+WN+ SP W + HVNINL+MNYW + N
Sbjct: 375 ----LYFQYGRYLLIASSRSGSLPANLQGVWNDSTSPPWSADYHVNINLQMNYWPAEVTN 430
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMG 466
LSE EPLFD++ L G+ TA+ + GWV+H++T + + D W +P
Sbjct: 431 LSETTEPLFDYVDSLVAPGTVTAKEMFGNRGWVVHNETTPFGYTGVHDWATSFW--FPEA 488
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE 525
GAWL WEHY +T D FL +RAYP+L+ + F +D L+ + DG L +PS SPE
Sbjct: 489 GAWLAQSYWEHYLFTRDETFLAERAYPMLKSLSRFWIDELVTDSRDGRLVVSPSYSPEQ- 547
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KI 584
S ++M I+ ++ + AAE++ ++E+ E + +L L P +I
Sbjct: 548 --------GDFSAGASMSQQIVWDLLTNTAEAAELVGEDEEFRAE-LAATLADLDPGLRI 598
Query: 585 AEDGSIMEW 593
G + EW
Sbjct: 599 GSWGQLQEW 607
>gi|375146879|ref|YP_005009320.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361060925|gb|AEV99916.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 943
Score = 278 bits (710), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 172/499 (34%), Positives = 257/499 (51%), Gaps = 64/499 (12%)
Query: 110 LEFDDSHLKYAEET----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG 165
L F D + ++A Y+R LDL+ A + V Y+ V + RE+F S P Q +V ++
Sbjct: 296 LPFGDLYFRFAHGNNSSDYQRSLDLDNAISTVSYTANGVSYNREYFISAPHQCVVMHVTA 355
Query: 166 SESGSLSFNVSLDS--------LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
S+ G+LS L++ +D+H+ + +E +N K +
Sbjct: 356 SKPGALSLQAVLNTPHKKYVVKKIDDHTL-----SLSLE------------VSNGVLKAV 398
Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE 277
+ L + R T++ D + ++ + LVA++SF N D DP +
Sbjct: 399 GY---LYATATGGRLTVN---DTAINLQQATEVNFYLVAATSFK----NYKDVSGDPVAA 448
Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
+AL ++ + Y+ + T HL++Y KLF S T +P+ ER
Sbjct: 449 CKAALARVKGVPYASIKTAHLNEYHKLFETFSF------------TVPAGKNSGLPTNER 496
Query: 338 VKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLE 397
++ F +D +LV L + RYLLISSSRPGTQ ANLQGIWN+ L+P W S NINLE
Sbjct: 497 IRQFNMKDDAALVPLFLMYSRYLLISSSRPGTQPANLQGIWNDLLTPPWGSKYTTNINLE 556
Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK 457
MNYW + NLS C +PLF+ + L++ G +TA+ +Y A GWV+HH TD+W + +A
Sbjct: 557 MNYWTAEVLNLSTCTQPLFNMINELAVAGHQTAKDHYNAPGWVLHHNTDLW-RGTAPINA 615
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLET 516
+W G AWL H+WEH+ YT D FL + YP L+G A F +L++ GYL +
Sbjct: 616 SNHGIWVTGAAWLTLHIWEHFLYTQDTAFLRAQ-YPNLQGAAQFFEHFLVKDPKTGYLIS 674
Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
PS SPEH G L TMD IIRE+F +AA VL K + A E++ +
Sbjct: 675 TPSNSPEH------GGLVA---GPTMDHQIIRELFRNCSAAAAVL-KTDAAFAERLKTLI 724
Query: 577 PRLRPTKIAEDGSIMEWVQ 595
P++ P KI + + EW++
Sbjct: 725 PQIAPNKIGKHNQLQEWME 743
Score = 94.0 bits (232), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 99/199 (49%), Gaps = 25/199 (12%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
++S + PL++ + PA +TDA+P+GNGRLGAMV+GGV E L+LNE+TLW+G P Y
Sbjct: 20 SQSYAQKQPLRLWYQQPAATWTDALPLGNGRLGAMVFGGVGEEHLQLNEETLWSGRPRSY 79
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEE 122
++P A + L +R L+ G+ AE+ A K F G A DDS + ++
Sbjct: 80 SHPGAAQYLQPMRQLLAEGKQAESEAMGEKYFMGLKAP------------DDSAYELQKD 127
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
T+ R + A V Y+ N + ++V + GS SFNV L
Sbjct: 128 TWFRSVRAQIEPAGVTYNDNNWPAMQLPTPEGWERVGLEGTDGSLWFRTSFNVPAKWLGK 187
Query: 183 N------------HSYVNG 189
N ++YVNG
Sbjct: 188 NLVLDLGRIRDLDYTYVNG 206
>gi|421235008|ref|ZP_15691623.1| large secreted protein [Streptococcus pneumoniae 2061617]
gi|395599385|gb|EJG59558.1| large secreted protein [Streptococcus pneumoniae 2061617]
Length = 764
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|148998038|ref|ZP_01825551.1| hypothetical protein CGSSp11BS70_05545 [Streptococcus pneumoniae
SP11-BS70]
gi|168576031|ref|ZP_02721936.1| large secreted protein [Streptococcus pneumoniae MLV-016]
gi|307068776|ref|YP_003877742.1| hypothetical protein SPAP_2209 [Streptococcus pneumoniae AP200]
gi|419472044|ref|ZP_14011900.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07914]
gi|419504884|ref|ZP_14044547.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47760]
gi|421315019|ref|ZP_15765603.1| hypothetical protein SPAR100_2088 [Streptococcus pneumoniae
GA47562]
gi|147756048|gb|EDK63091.1| hypothetical protein CGSSp11BS70_05545 [Streptococcus pneumoniae
SP11-BS70]
gi|183578125|gb|EDT98653.1| large secreted protein [Streptococcus pneumoniae MLV-016]
gi|306410313|gb|ADM85740.1| hypothetical protein SPAP_2209 [Streptococcus pneumoniae AP200]
gi|379543433|gb|EHZ08583.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07914]
gi|379604070|gb|EHZ68832.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47760]
gi|395911603|gb|EJH22468.1| hypothetical protein SPAR100_2088 [Streptococcus pneumoniae
GA47562]
Length = 764
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|410477499|ref|YP_006744258.1| hypothetical protein HMPREF1038_02170 [Streptococcus pneumoniae
gamPNI0373]
gi|421269340|ref|ZP_15720202.1| hypothetical protein SPAR95_2166 [Streptococcus pneumoniae SPAR95]
gi|444387345|ref|ZP_21185368.1| hypothetical protein PCS125219_00757 [Streptococcus pneumoniae
PCS125219]
gi|444391139|ref|ZP_21189052.1| hypothetical protein PCS70012_02198 [Streptococcus pneumoniae
PCS70012]
gi|444391645|ref|ZP_21189459.1| hypothetical protein PCS81218_00254 [Streptococcus pneumoniae
PCS81218]
gi|444395928|ref|ZP_21193466.1| hypothetical protein PNI0002_01943 [Streptococcus pneumoniae
PNI0002]
gi|444398446|ref|ZP_21195928.1| hypothetical protein PNI0006_02051 [Streptococcus pneumoniae
PNI0006]
gi|444399000|ref|ZP_21196473.1| hypothetical protein PNI0007_00277 [Streptococcus pneumoniae
PNI0007]
gi|444402193|ref|ZP_21199365.1| hypothetical protein PNI0008_00811 [Streptococcus pneumoniae
PNI0008]
gi|444404331|ref|ZP_21201289.1| hypothetical protein PNI0009_00393 [Streptococcus pneumoniae
PNI0009]
gi|444408063|ref|ZP_21204730.1| hypothetical protein PNI0010_01508 [Streptococcus pneumoniae
PNI0010]
gi|444415928|ref|ZP_21212144.1| hypothetical protein PNI0199_01872 [Streptococcus pneumoniae
PNI0199]
gi|444417791|ref|ZP_21213797.1| hypothetical protein PNI0360_01205 [Streptococcus pneumoniae
PNI0360]
gi|444419629|ref|ZP_21215476.1| hypothetical protein PNI0427_00501 [Streptococcus pneumoniae
PNI0427]
gi|395866259|gb|EJG77390.1| hypothetical protein SPAR95_2166 [Streptococcus pneumoniae SPAR95]
gi|406370444|gb|AFS44134.1| conserved hypothetical membrane protein [Streptococcus pneumoniae
gamPNI0373]
gi|444253440|gb|ELU59896.1| hypothetical protein PCS125219_00757 [Streptococcus pneumoniae
PCS125219]
gi|444255297|gb|ELU61653.1| hypothetical protein PCS70012_02198 [Streptococcus pneumoniae
PCS70012]
gi|444255745|gb|ELU62088.1| hypothetical protein PNI0002_01943 [Streptococcus pneumoniae
PNI0002]
gi|444259175|gb|ELU65491.1| hypothetical protein PNI0006_02051 [Streptococcus pneumoniae
PNI0006]
gi|444265102|gb|ELU71130.1| hypothetical protein PCS81218_00254 [Streptococcus pneumoniae
PCS81218]
gi|444266940|gb|ELU72867.1| hypothetical protein PNI0008_00811 [Streptococcus pneumoniae
PNI0008]
gi|444269354|gb|ELU75162.1| hypothetical protein PNI0007_00277 [Streptococcus pneumoniae
PNI0007]
gi|444271659|gb|ELU77410.1| hypothetical protein PNI0010_01508 [Streptococcus pneumoniae
PNI0010]
gi|444277109|gb|ELU82631.1| hypothetical protein PNI0009_00393 [Streptococcus pneumoniae
PNI0009]
gi|444278655|gb|ELU84090.1| hypothetical protein PNI0199_01872 [Streptococcus pneumoniae
PNI0199]
gi|444282561|gb|ELU87815.1| hypothetical protein PNI0360_01205 [Streptococcus pneumoniae
PNI0360]
gi|444286393|gb|ELU91377.1| hypothetical protein PNI0427_00501 [Streptococcus pneumoniae
PNI0427]
Length = 764
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|149012024|ref|ZP_01833172.1| hypothetical protein CGSSp19BS75_03168 [Streptococcus pneumoniae
SP19-BS75]
gi|418077389|ref|ZP_12714618.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47502]
gi|147763979|gb|EDK70912.1| hypothetical protein CGSSp19BS75_03168 [Streptococcus pneumoniae
SP19-BS75]
gi|353745563|gb|EHD26232.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47502]
Length = 764
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|387760237|ref|YP_006067215.1| hypothetical protein SPNINV200_19710 [Streptococcus pneumoniae
INV200]
gi|419515658|ref|ZP_14055280.1| putative alpha-L-fucosidase [Streptococcus pneumoniae England14-9]
gi|301802826|emb|CBW35604.1| conserved hypothetical protein [Streptococcus pneumoniae INV200]
gi|379633974|gb|EHZ98540.1| putative alpha-L-fucosidase [Streptococcus pneumoniae England14-9]
Length = 764
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 190/593 (32%), Positives = 300/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|225861978|ref|YP_002743487.1| large secreted protein [Streptococcus pneumoniae Taiwan19F-14]
gi|298229408|ref|ZP_06963089.1| large secreted protein [Streptococcus pneumoniae str. Canada
MDR_19F]
gi|298255588|ref|ZP_06979174.1| large secreted protein [Streptococcus pneumoniae str. Canada
MDR_19A]
gi|298501665|ref|YP_003723605.1| alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
gi|387789197|ref|YP_006254265.1| large secreted protein [Streptococcus pneumoniae ST556]
gi|417313623|ref|ZP_12100332.1| alpha-L-fucosidase [Streptococcus pneumoniae GA04375]
gi|418083982|ref|ZP_12721174.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44288]
gi|418086144|ref|ZP_12723319.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47281]
gi|418094961|ref|ZP_12732084.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49138]
gi|418119732|ref|ZP_12756683.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18523]
gi|418142694|ref|ZP_12779502.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13455]
gi|418151670|ref|ZP_12788412.1| alpha-L-fucosidase [Streptococcus pneumoniae GA14798]
gi|418153939|ref|ZP_12790673.1| alpha-L-fucosidase [Streptococcus pneumoniae GA16121]
gi|418199016|ref|ZP_12835468.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47778]
gi|418224372|ref|ZP_12851007.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5185-06]
gi|418228657|ref|ZP_12855270.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 3063-00]
gi|419430394|ref|ZP_13970551.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11856]
gi|419439146|ref|ZP_13979210.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13499]
gi|419502823|ref|ZP_14042501.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47628]
gi|419529128|ref|ZP_14068665.1| hypothetical protein SPAR51_2171 [Streptococcus pneumoniae GA17719]
gi|225728210|gb|ACO24061.1| large secreted protein [Streptococcus pneumoniae Taiwan19F-14]
gi|298237260|gb|ADI68391.1| possible alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
gi|327388899|gb|EGE87247.1| alpha-L-fucosidase [Streptococcus pneumoniae GA04375]
gi|353753506|gb|EHD34129.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44288]
gi|353754984|gb|EHD35594.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47281]
gi|353762498|gb|EHD43057.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49138]
gi|353788845|gb|EHD69241.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18523]
gi|353803816|gb|EHD84107.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13455]
gi|353811993|gb|EHD92229.1| alpha-L-fucosidase [Streptococcus pneumoniae GA14798]
gi|353815265|gb|EHD95485.1| alpha-L-fucosidase [Streptococcus pneumoniae GA16121]
gi|353859431|gb|EHE39382.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47778]
gi|353876904|gb|EHE56749.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5185-06]
gi|353878966|gb|EHE58794.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 3063-00]
gi|379138939|gb|AFC95730.1| large secreted protein [Streptococcus pneumoniae ST556]
gi|379535583|gb|EHZ00782.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13499]
gi|379548700|gb|EHZ13818.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11856]
gi|379562772|gb|EHZ27781.1| hypothetical protein SPAR51_2171 [Streptococcus pneumoniae GA17719]
gi|379598038|gb|EHZ62833.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47628]
Length = 764
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 189/593 (31%), Positives = 301/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFDPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + +++ G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTNYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|418101640|ref|ZP_12738719.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7286-06]
gi|353768739|gb|EHD49262.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7286-06]
Length = 764
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 190/592 (32%), Positives = 299/592 (50%), Gaps = 54/592 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFDPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ L L + +++ G PS SI + D H+ YQ+ F
Sbjct: 225 NATEVFLYLKSMTNYWGNIDIPS---------LQGEFSSIDYFTEKD---EHVKKYQEQF 272
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RV +L S KD ++ I T E K + L LLF +GRYLLISSS
Sbjct: 273 NRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSS 320
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 321 QPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREP 380
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 381 GRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDER 440
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 441 ILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQ 498
Query: 546 IIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 499 ILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|168484015|ref|ZP_02708967.1| large secreted protein [Streptococcus pneumoniae CDC1873-00]
gi|417697350|ref|ZP_12346525.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47368]
gi|418108816|ref|ZP_12745849.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41410]
gi|418111150|ref|ZP_12748165.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49447]
gi|418163224|ref|ZP_12799902.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17328]
gi|418168087|ref|ZP_12804735.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19077]
gi|418176974|ref|ZP_12813561.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41437]
gi|418219924|ref|ZP_12846585.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP127]
gi|419423904|ref|ZP_13964112.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43264]
gi|419461001|ref|ZP_14000923.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02270]
gi|419463323|ref|ZP_14003222.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02714]
gi|421273944|ref|ZP_15724780.1| alpha-L-fucosidase [Streptococcus pneumoniae SPAR55]
gi|172042696|gb|EDT50742.1| large secreted protein [Streptococcus pneumoniae CDC1873-00]
gi|332198777|gb|EGJ12859.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47368]
gi|353775273|gb|EHD55754.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41410]
gi|353780261|gb|EHD60720.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49447]
gi|353825359|gb|EHE05524.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17328]
gi|353837695|gb|EHE17777.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19077]
gi|353838933|gb|EHE19009.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41437]
gi|353871990|gb|EHE51859.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP127]
gi|379528874|gb|EHY94127.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02270]
gi|379529046|gb|EHY94298.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02714]
gi|379584326|gb|EHZ49194.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43264]
gi|395872020|gb|EJG83121.1| alpha-L-fucosidase [Streptococcus pneumoniae SPAR55]
Length = 764
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 189/593 (31%), Positives = 301/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD ++ ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFSDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|345517561|ref|ZP_08797030.1| hypothetical protein BSFG_03806 [Bacteroides sp. 4_3_47FAA]
gi|254837350|gb|EET17659.1| hypothetical protein BSFG_03806 [Bacteroides sp. 4_3_47FAA]
Length = 828
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 186/601 (30%), Positives = 294/601 (48%), Gaps = 74/601 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P N + L ++R
Sbjct: 72 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTSAGAAAYWNVNKQSAHILDEIR 131
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
+G A + K F + +G+ +E S + ++ Y
Sbjct: 132 QAFINGDEKRAMLLTQKNFNSEVPYESWKEKPFRFGNFTTMGEFYIETGLSTIGMSD--Y 189
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V+++ V + R +F S P+ V+ + ++ G +L F+ + +
Sbjct: 190 KRILSLDSALAIVQFNKDGVAYERNYFISYPNNVMTIRFKANKPGKQNLVFSYEPNPVST 249
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
NGNN ++ R Q ++ I + GT+S + KL
Sbjct: 250 GKMETNGNNGLVYTARLDNN---------------QMEYVIRIHATAKGGTLSN-QSGKL 293
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP--SDSKK----DPTSESMSALQSIRNLSYSDLYTR 296
V G+D + L+ A + + F NP +D K +P+ + + ++ L Y L+
Sbjct: 294 SVNGADEVIFLVTADTDYQINF-NPDFNDPKAYVGVNPSETTATWMKDAAALGYDALFDA 352
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQ 355
H DY LF+RVS+ L+ S K D +P+ +R+K+++ + D L EL +Q
Sbjct: 353 HYKDYASLFNRVSLSLNGSGK-----------TDNIPTPQRLKNYRKGKPDFYLEELYYQ 401
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 402 FGRYLLIASSRPGNLPANLQGIWHNNVDGPWRVDYHNNINVQMNYWPAGSTNLAECTLPL 461
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHL 474
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+
Sbjct: 462 IDFIKTLVKPGEKTAQAYFGARGWTASISGNIFGFTAPLESENMSWNFNPMAGPWLATHV 521
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
W++Y+YT D+ FL+K Y L++ A F +D+L + DG PSTSPEH
Sbjct: 522 WDYYDYTRDKQFLKKTGYGLIKSSAQFAVDYLWKKPDGTYTAAPSTSPEH---------G 572
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
+ +T A++RE+ I A+++L +K E E+VL+ +L P +I G +ME
Sbjct: 573 PIDQGATFIHAVVREILLNAIDASKILGVDKKERKQWEEVLE---KLAPYQIGRYGQLME 629
Query: 593 W 593
W
Sbjct: 630 W 630
>gi|67541006|ref|XP_664277.1| hypothetical protein AN6673.2 [Aspergillus nidulans FGSC A4]
gi|40738426|gb|EAA57616.1| hypothetical protein AN6673.2 [Aspergillus nidulans FGSC A4]
gi|259480257|tpe|CBF71222.1| TPA: alpha-fucosidase (Eurofung) [Aspergillus nidulans FGSC A4]
Length = 831
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 197/585 (33%), Positives = 278/585 (47%), Gaps = 46/585 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRL A ++GGV +E + LNE+T+W+G + T +A AL R L+ +G E
Sbjct: 45 ALPIGNGRLAATIYGGVRAEVITLNENTIWSGPFQERTPENALAALPIARELLLNGSITE 104
Query: 87 ATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + H D Y G++EL F H + E YRR LD A V+Y V
Sbjct: 105 AGEFIQREMMHEIDSMRAYSYFGNLELGF--GHDEAKVEGYRRWLDTRKGDAGVEYVVEG 162
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V++TRE+ +S P V+ + + SE G+L+ N + + D S Q + R P R
Sbjct: 163 VKYTREYIASFPAGVLAARFTASEKGALTLNATFCRVSDATSL-----QASVSDRAPWIR 217
Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
+ + + I FS G S + + L + L LV +++ D
Sbjct: 218 LSGTSGQPAEEYPIVFS-----------GQASFVAEGALFTSSN--GTLTLVNATTVD-I 263
Query: 264 FINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
F + + + P+ E++ A L N Y + L D L R SI S D
Sbjct: 264 FFDAETNYRYPSQEAIDAEIAHKLTDALNKGYDRIRDEALADSSSLLDRASIDFGIS-TD 322
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV----ANL 374
+D ++E I V SA + D D L L + +GR+LL++SSR T+ ANL
Sbjct: 323 ETSDLATDERIALVRSAGGL-----DGDLELATLAWNYGRHLLVASSRNTTEAIDLPANL 377
Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
QGIWN + W +NIN EMNYW + P NL E QEPLFD G K A+ Y
Sbjct: 378 QGIWNNQTTAAWGGKYTININTEMNYWPAGPTNLIETQEPLFDLFAVAYPRGQKLARDMY 437
Query: 435 LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
SG V HH D+W + ++WPMG AWL THL++ Y +T D+ L YP
Sbjct: 438 NCSGVVFHHNLDVWGDPAPVDNYTSSSMWPMGAAWLATHLYDQYRFTGDKALLADTIYPY 497
Query: 495 LEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDMAIIRE 549
L A F + E H+GY T PS SPE+ FI P+ G A + + MD II E
Sbjct: 498 LVDVAKFYQCYTFE-HEGYKVTGPSLSPENTFIIPENWTVAGNKAAMDVAIPMDDQIIWE 556
Query: 550 VFSAIISAA-EVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
V ++ AA E+ ++D V L ++ P +I G I EW
Sbjct: 557 VLHNLLDAASELGIADDDHTVSAAKSFLHKIHPPRIGFQGQIQEW 601
>gi|302540737|ref|ZP_07293079.1| alpha-L-fucosidase 2 [Streptomyces hygroscopicus ATCC 53653]
gi|302458355|gb|EFL21448.1| alpha-L-fucosidase 2 [Streptomyces himastatinicus ATCC 53653]
Length = 775
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 193/587 (32%), Positives = 294/587 (50%), Gaps = 63/587 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV----------PGDYTNPDAPKALSDV 75
+A+PIGNG LGAMV+GGV E ++ NE +LWTG G++ P P AL+ V
Sbjct: 18 EALPIGNGTLGAMVFGGVARERIQFNEKSLWTGGPGGPGSAPYDSGNWREPR-PGALAAV 76
Query: 76 RSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+ L+D A + +L G P YQ GD+ LE + + ++YRR L++
Sbjct: 77 QRLIDEHGAAAPEDVAARL-GQPRSRYGAYQPFGDLWLEIPGA--PESPDSYRRLLEIRK 133
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A VKY+ V RE F+S PD+VIV + + G++ F + S +V ++
Sbjct: 134 GVALVKYTAQGVRHRREFFASYPDRVIVGRFDAA-PGTVGFTLRHTSPRPGDHHVTAHD- 191
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
R+ + D+ G++F A ++++ D GT+++ ED L V G+ A
Sbjct: 192 ---------GRLTIRGALEDN--GLRFEA--QVRVMADGGTVTSGEDGTLTVTGAHSAWF 238
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+L A + + +P +DP + + + Y L +RH+ D++ LF R ++ L
Sbjct: 239 VLAAGTDYAD--THPHYRGEDPHRTVTGTVDAAADRGYLTLLSRHVRDHRALFDRTALDL 296
Query: 313 S-RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
R+P TD A+R +L EL F +GRYLLI+SSRPG +
Sbjct: 297 GGRTPPRTPTDRQRAAYTGGESPADR----------ALEELFFDYGRYLLIASSRPGAPL 346
Query: 372 -ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ + P W + H NINL+M YW + +L+E EPL F+T L G TA
Sbjct: 347 PANLQGIWNDSVRPAWSADYHTNINLQMAYWPAHALHLAETAEPLHRFITALRAPGRITA 406
Query: 431 QVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
+ + A GWV+H++T+ + + D W +P AWL HL+EHY +T+D FL
Sbjct: 407 REMFGARGWVVHNETNAYGFTGVHDWSTAFW--FPEAAAWLVHHLYEHYRFTLDTGFLRD 464
Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAII 547
AYP + A+F LD L + DG L +P SPEH +F A M I+
Sbjct: 465 TAYPAMREAAAFWLDTLRPDPRDGTLVVSPGYSPEHGDFTA----------GPAMSQQIV 514
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
++ +A + AA L ++ AL + ++L L P +I G + EW
Sbjct: 515 HDLLTATLEAARTL-GDDPALQAGLRRALDALDPGLRIGSWGQLQEW 560
>gi|15904007|ref|NP_359557.1| hypothetical protein spr1966 [Streptococcus pneumoniae R6]
gi|116517212|ref|YP_817374.1| hypothetical protein SPD_1988 [Streptococcus pneumoniae D39]
gi|148988800|ref|ZP_01820215.1| hypothetical protein CGSSp6BS73_06978 [Streptococcus pneumoniae
SP6-BS73]
gi|148991988|ref|ZP_01821762.1| hypothetical protein CGSSp9BS68_10895 [Streptococcus pneumoniae
SP9-BS68]
gi|149020072|ref|ZP_01835046.1| hypothetical protein CGSSp23BS72_08554 [Streptococcus pneumoniae
SP23-BS72]
gi|168494084|ref|ZP_02718227.1| large secreted protein [Streptococcus pneumoniae CDC3059-06]
gi|387627290|ref|YP_006063466.1| hypothetical protein INV104_18640 [Streptococcus pneumoniae INV104]
gi|417687620|ref|ZP_12336887.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41301]
gi|418075010|ref|ZP_12712256.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11184]
gi|418081811|ref|ZP_12719017.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6735-05]
gi|418090533|ref|ZP_12727683.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43265]
gi|418099496|ref|ZP_12736589.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6901-05]
gi|418103895|ref|ZP_12740963.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP070]
gi|418106297|ref|ZP_12743347.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44500]
gi|418115675|ref|ZP_12752658.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5787-06]
gi|418117845|ref|ZP_12754811.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6963-05]
gi|418135939|ref|ZP_12772788.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11426]
gi|418160899|ref|ZP_12797595.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17227]
gi|418174588|ref|ZP_12811195.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41277]
gi|418183706|ref|ZP_12820260.1| alpha-L-fucosidase [Streptococcus pneumoniae GA43380]
gi|418203403|ref|ZP_12839826.1| alpha-L-fucosidase [Streptococcus pneumoniae GA52306]
gi|418217614|ref|ZP_12844290.1| alpha-L-fucosidase [Streptococcus pneumoniae Netherlands15B-37]
gi|419432556|ref|ZP_13972681.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP05]
gi|419434785|ref|ZP_13974899.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40183]
gi|419456417|ref|ZP_13996371.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP04]
gi|419465661|ref|ZP_14005549.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA04175]
gi|419467835|ref|ZP_14007713.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA05248]
gi|419469963|ref|ZP_14009827.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA06083]
gi|419476555|ref|ZP_14016386.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA14688]
gi|419480975|ref|ZP_14020776.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19101]
gi|419487705|ref|ZP_14027464.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44128]
gi|419498536|ref|ZP_14038238.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47522]
gi|419500675|ref|ZP_14040366.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47597]
gi|419513550|ref|ZP_14053180.1| hypothetical protein SPAR149_2151 [Streptococcus pneumoniae
GA05578]
gi|419517761|ref|ZP_14057373.1| hypothetical protein SPAR154_2086 [Streptococcus pneumoniae
GA02506]
gi|419522113|ref|ZP_14061704.1| hypothetical protein SPAR7_2189 [Streptococcus pneumoniae GA05245]
gi|421207672|ref|ZP_15664716.1| large secreted protein [Streptococcus pneumoniae 2090008]
gi|421209866|ref|ZP_15666875.1| large secreted protein [Streptococcus pneumoniae 2070005]
gi|421221343|ref|ZP_15678174.1| large secreted protein [Streptococcus pneumoniae 2070425]
gi|421223600|ref|ZP_15680377.1| large secreted protein [Streptococcus pneumoniae 2070531]
gi|421226019|ref|ZP_15682753.1| large secreted protein [Streptococcus pneumoniae 2070768]
gi|421230717|ref|ZP_15687375.1| large secreted protein [Streptococcus pneumoniae 2061376]
gi|421241635|ref|ZP_15698176.1| large secreted protein [Streptococcus pneumoniae 2080913]
gi|421267146|ref|ZP_15718023.1| hypothetical protein SPAR27_2104 [Streptococcus pneumoniae SPAR27]
gi|421284302|ref|ZP_15735084.1| hypothetical protein SPAR151_2120 [Streptococcus pneumoniae
GA04216]
gi|421292975|ref|ZP_15743706.1| hypothetical protein SPAR159_2221 [Streptococcus pneumoniae
GA56348]
gi|444381684|ref|ZP_21179890.1| hypothetical protein PCS8106_00075 [Streptococcus pneumoniae
PCS8106]
gi|444384154|ref|ZP_21182250.1| hypothetical protein PCS8203_00020 [Streptococcus pneumoniae
PCS8203]
gi|15459667|gb|AAL00768.1| Conserved hypothetical protein [Streptococcus pneumoniae R6]
gi|116077788|gb|ABJ55508.1| conserved hypothetical protein [Streptococcus pneumoniae D39]
gi|147925611|gb|EDK76687.1| hypothetical protein CGSSp6BS73_06978 [Streptococcus pneumoniae
SP6-BS73]
gi|147929037|gb|EDK80048.1| hypothetical protein CGSSp9BS68_10895 [Streptococcus pneumoniae
SP9-BS68]
gi|147930750|gb|EDK81731.1| hypothetical protein CGSSp23BS72_08554 [Streptococcus pneumoniae
SP23-BS72]
gi|183575953|gb|EDT96481.1| large secreted protein [Streptococcus pneumoniae CDC3059-06]
gi|301795076|emb|CBW37545.1| conserved hypothetical protein [Streptococcus pneumoniae INV104]
gi|332071430|gb|EGI81924.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41301]
gi|353745184|gb|EHD25855.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11184]
gi|353750133|gb|EHD30775.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6735-05]
gi|353759533|gb|EHD40117.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43265]
gi|353767716|gb|EHD48248.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6901-05]
gi|353773458|gb|EHD53955.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP070]
gi|353774259|gb|EHD54752.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44500]
gi|353783638|gb|EHD64065.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5787-06]
gi|353787046|gb|EHD67455.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6963-05]
gi|353820164|gb|EHE00352.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17227]
gi|353835112|gb|EHE15207.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41277]
gi|353846724|gb|EHE26752.1| alpha-L-fucosidase [Streptococcus pneumoniae GA43380]
gi|353864851|gb|EHE44761.1| alpha-L-fucosidase [Streptococcus pneumoniae GA52306]
gi|353868852|gb|EHE48736.1| alpha-L-fucosidase [Streptococcus pneumoniae Netherlands15B-37]
gi|353899786|gb|EHE75353.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11426]
gi|379535787|gb|EHZ00985.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA04175]
gi|379536100|gb|EHZ01291.1| hypothetical protein SPAR7_2189 [Streptococcus pneumoniae GA05245]
gi|379542257|gb|EHZ07415.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA05248]
gi|379542673|gb|EHZ07828.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA06083]
gi|379557271|gb|EHZ22317.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA14688]
gi|379569141|gb|EHZ34115.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19101]
gi|379575027|gb|EHZ39964.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40183]
gi|379584597|gb|EHZ49463.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44128]
gi|379597600|gb|EHZ62398.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47522]
gi|379597787|gb|EHZ62584.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47597]
gi|379626380|gb|EHZ90998.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP04]
gi|379626589|gb|EHZ91206.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP05]
gi|379632837|gb|EHZ97407.1| hypothetical protein SPAR149_2151 [Streptococcus pneumoniae
GA05578]
gi|379637411|gb|EIA01967.1| hypothetical protein SPAR154_2086 [Streptococcus pneumoniae
GA02506]
gi|395571912|gb|EJG32514.1| large secreted protein [Streptococcus pneumoniae 2090008]
gi|395572036|gb|EJG32637.1| large secreted protein [Streptococcus pneumoniae 2070005]
gi|395584331|gb|EJG44724.1| large secreted protein [Streptococcus pneumoniae 2070425]
gi|395586059|gb|EJG46437.1| large secreted protein [Streptococcus pneumoniae 2070531]
gi|395588107|gb|EJG48442.1| large secreted protein [Streptococcus pneumoniae 2070768]
gi|395592519|gb|EJG52784.1| large secreted protein [Streptococcus pneumoniae 2061376]
gi|395605911|gb|EJG66022.1| large secreted protein [Streptococcus pneumoniae 2080913]
gi|395865531|gb|EJG76670.1| hypothetical protein SPAR27_2104 [Streptococcus pneumoniae SPAR27]
gi|395879316|gb|EJG90376.1| hypothetical protein SPAR151_2120 [Streptococcus pneumoniae
GA04216]
gi|395891223|gb|EJH02225.1| hypothetical protein SPAR159_2221 [Streptococcus pneumoniae
GA56348]
gi|429316926|emb|CCP36654.1| putative alpha-L-fucosidase [Streptococcus pneumoniae SPN034156]
gi|444252808|gb|ELU59268.1| hypothetical protein PCS8203_00020 [Streptococcus pneumoniae
PCS8203]
gi|444253936|gb|ELU60383.1| hypothetical protein PCS8106_00075 [Streptococcus pneumoniae
PCS8106]
Length = 764
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 189/593 (31%), Positives = 300/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|168491689|ref|ZP_02715832.1| large secreted protein [Streptococcus pneumoniae CDC0288-04]
gi|183574053|gb|EDT94581.1| large secreted protein [Streptococcus pneumoniae CDC0288-04]
Length = 764
Score = 275 bits (704), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 188/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFINRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|421212007|ref|ZP_15668985.1| large secreted protein [Streptococcus pneumoniae 2070035]
gi|421232851|ref|ZP_15689488.1| large secreted protein [Streptococcus pneumoniae 2080076]
gi|395571698|gb|EJG32309.1| large secreted protein [Streptococcus pneumoniae 2070035]
gi|395593380|gb|EJG53629.1| large secreted protein [Streptococcus pneumoniae 2080076]
Length = 764
Score = 275 bits (704), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 188/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|255035225|ref|YP_003085846.1| hypothetical protein Dfer_1435 [Dyadobacter fermentans DSM 18053]
gi|254947981|gb|ACT92681.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 790
Score = 275 bits (704), Expect = 4e-71, Method: Compositional matrix adjust.
Identities = 185/610 (30%), Positives = 297/610 (48%), Gaps = 67/610 (10%)
Query: 4 AESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG- 61
A+ TS T PL + ++ PAK + T A+PIGNG +GAM +GG E ++ +E +LW G G
Sbjct: 24 AQPTSKTAPLSLWYDQPAKEWMTQALPIGNGHVGAMFFGGTDEERIQFSEGSLWAGGKGA 83
Query: 62 --DYT---NPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGH--------PADVY---QL 104
DY +A K L +VR L+ +G+ EA A A+ +L G P+ + Q
Sbjct: 84 NADYNFGIKKEAHKHLPEVRELLAAGKLKEAHALANKELTGAIHEKKENTPSSDFGAQQT 143
Query: 105 LGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS 164
+GD+ ++ K A + YRREL+++ A +V+Y G F R +F + P +V+V + +
Sbjct: 144 VGDLFIKMPS---KGAAQNYRRELNISDALGKVQYEAGGTRFERSYFGNYPAKVMVYRFT 200
Query: 165 GSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
S + S D R GK+ + D+ + +F +
Sbjct: 201 SSTPETYSIRFETPHAKDYE-------------RFEGKQYTFGGHLKDNHQ--EFETVYR 245
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
I D +A D L V G+ VL+ ++ + F P D + + +
Sbjct: 246 I----DTDGKTAFSDGVLTVTGARSIVLIHTVATDYVMKF--PDYKGNDYKKANAATMAG 299
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
+ +Y+ L DY LF RV++ L + + +P+ +R K++
Sbjct: 300 VAGKNYASLVAAQQKDYHSLFDRVALTLGNA------------DAPAIPTDQRQKAYSAG 347
Query: 345 E-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
+ D L EL FQ+GRYL+ISS+RPGT +LQG WN+ +P W + H NIN++M YW +
Sbjct: 348 QADGRLEELYFQYGRYLMISSTRPGTMPMSLQGKWNDSTNPPWANDYHTNINIQMLYWPA 407
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
NLSEC PL DF + G A+ + A GW+++ + + +S W +
Sbjct: 408 EVTNLSECHVPLMDFTQSIVAPGRLAAKEFFNAKGWIVNTMLNAYGYTSPGW-DFPWGFF 466
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
P G AWL HLWEHY +T D+ FL+ AYP+++ + F +D+L + G L ++PS SPE
Sbjct: 467 PGGAAWLSQHLWEHYAFTNDKAFLKNTAYPIMKEASEFWMDYLTDDGRGRLVSSPSYSPE 526
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
H +S +TMD + +V + AA +L ++D +K + ++ P +
Sbjct: 527 H---------GGISTGATMDHEMAWDVLNNTAEAAAILGVDQD-FAQKARSTRDKILPLQ 576
Query: 584 IAEDGSIMEW 593
I + EW
Sbjct: 577 IGRWKQLQEW 586
>gi|419436976|ref|ZP_13977057.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 8190-05]
gi|379611263|gb|EHZ75990.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 8190-05]
Length = 764
Score = 275 bits (703), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 189/593 (31%), Positives = 300/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|149003007|ref|ZP_01827918.1| hypothetical protein CGSSp14BS69_00740 [Streptococcus pneumoniae
SP14-BS69]
gi|168489226|ref|ZP_02713425.1| large secreted protein [Streptococcus pneumoniae SP195]
gi|221232865|ref|YP_002512019.1| hypothetical protein SPN23F_21920 [Streptococcus pneumoniae ATCC
700669]
gi|225855653|ref|YP_002737165.1| large secreted protein [Streptococcus pneumoniae JJA]
gi|237650653|ref|ZP_04524905.1| large secreted protein [Streptococcus pneumoniae CCRI 1974]
gi|237822208|ref|ZP_04598053.1| large secreted protein [Streptococcus pneumoniae CCRI 1974M2]
gi|415701401|ref|ZP_11458355.1| large secreted protein [Streptococcus pneumoniae 459-5]
gi|415750467|ref|ZP_11478309.1| large secreted protein [Streptococcus pneumoniae SV35]
gi|415753360|ref|ZP_11480342.1| large secreted protein [Streptococcus pneumoniae SV36]
gi|417680132|ref|ZP_12329525.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17570]
gi|418124532|ref|ZP_12761459.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44378]
gi|418126808|ref|ZP_12763710.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44511]
gi|418129072|ref|ZP_12765961.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP170]
gi|418138272|ref|ZP_12775106.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11663]
gi|418144761|ref|ZP_12781556.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13494]
gi|418179304|ref|ZP_12815881.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41565]
gi|418192602|ref|ZP_12829101.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47388]
gi|418215362|ref|ZP_12842093.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA54644]
gi|418235345|ref|ZP_12861918.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA08780]
gi|419458700|ref|ZP_13998639.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02254]
gi|419474246|ref|ZP_14014091.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13430]
gi|419485376|ref|ZP_14025147.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43257]
gi|419494282|ref|ZP_14034004.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47210]
gi|419509243|ref|ZP_14048891.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49542]
gi|421279922|ref|ZP_15730725.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17301]
gi|421300242|ref|ZP_15750913.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19998]
gi|147759010|gb|EDK66005.1| hypothetical protein CGSSp14BS69_00740 [Streptococcus pneumoniae
SP14-BS69]
gi|183572159|gb|EDT92687.1| large secreted protein [Streptococcus pneumoniae SP195]
gi|220675327|emb|CAR69925.1| conserved hypothetical protein [Streptococcus pneumoniae ATCC
700669]
gi|225723250|gb|ACO19103.1| large secreted protein [Streptococcus pneumoniae JJA]
gi|332071597|gb|EGI82090.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17570]
gi|353794144|gb|EHD74502.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44378]
gi|353794344|gb|EHD74701.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44511]
gi|353797122|gb|EHD77459.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP170]
gi|353807227|gb|EHD87499.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13494]
gi|353840818|gb|EHE20880.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41565]
gi|353854436|gb|EHE34414.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47388]
gi|353867652|gb|EHE47543.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA54644]
gi|353885068|gb|EHE64858.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA08780]
gi|353899629|gb|EHE75198.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11663]
gi|379528696|gb|EHY93950.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02254]
gi|379549315|gb|EHZ14425.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13430]
gi|379580149|gb|EHZ45044.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43257]
gi|379591544|gb|EHZ56368.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47210]
gi|379609534|gb|EHZ74272.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49542]
gi|381309007|gb|EIC49850.1| large secreted protein [Streptococcus pneumoniae SV36]
gi|381313067|gb|EIC53859.1| large secreted protein [Streptococcus pneumoniae 459-5]
gi|381316317|gb|EIC57067.1| large secreted protein [Streptococcus pneumoniae SV35]
gi|395877150|gb|EJG88220.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17301]
gi|395899666|gb|EJH10605.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19998]
Length = 764
Score = 275 bits (703), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 188/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELCIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|213963750|ref|ZP_03392000.1| alpha-L-fucosidase 2 [Capnocytophaga sputigena Capno]
gi|213953630|gb|EEB64962.1| alpha-L-fucosidase 2 [Capnocytophaga sputigena Capno]
Length = 806
Score = 275 bits (703), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 195/603 (32%), Positives = 315/603 (52%), Gaps = 59/603 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA+HFT+++PIGNGRLGAM +G + + LNE +LW+G D +P+A L
Sbjct: 23 VSVVFHKPAEHFTESLPIGNGRLGAMFFGKTDVDRIVLNEISLWSGGTQDADDPNAHIHL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
++ L+ G+ EA A K F G+ A+ YQ+LG++ L++ +
Sbjct: 83 KTIQQLLLEGKNLEAQALLQKHFIAKGEGSCKGNGANCSYGCYQILGELLLDWKST---L 139
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
E Y+R L L+ ATA + GN + F+ + +I +I+ S+ L ++SL
Sbjct: 140 PTENYQRILRLDQATAFTSFKRGNNYIQQIAFADFKNDLIWIRITASQP--LDIDISLHR 197
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALE 238
+N + +N+I + G P N++ +G+QF++ ++++ + + T +A
Sbjct: 198 R-ENATTSYKSNKITLSGVLP----------NENTEGMQFASEIDVQTDGNLQNTTNATS 246
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+K K VL + A+++++ F ++ D ++ LQ + + +
Sbjct: 247 IQKAKE-----IVLKISAATNYN--FTKGGLTQNDVLQKANDYLQKA-TIPFENAIIESQ 298
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-QFG 357
YQ F+R +R + TDT S + + ER++ F + +L+ +L+ FG
Sbjct: 299 KAYQVFFNR-----NRWYSEANTDTSS------LSTFERLQRFYKGKKDALLPVLYYNFG 347
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSR G ANLQG+W E+ W+ H+NINL+MNYW + NLSE PL
Sbjct: 348 RYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINLQMNYWLAESTNLSELTTPLHK 407
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F L NG KTA+ Y A+GW+ H ++ W +S W GGAWLC H+W+H
Sbjct: 408 FTKNLVANGRKTARAYYNANGWMAHVISNPWFYTSPGE-SAEWGSTLTGGAWLCEHIWQH 466
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DGK- 532
Y YT++ DFL + YP+L+ A F LI+ GY T PS SPE+ +I P DGK
Sbjct: 467 YLYTLNTDFL-REYYPVLKEAADFFQSLLIKDPKTGYWVTAPSNSPENAYIMPQLKDGKK 525
Query: 533 -LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ + TMDM I+RE+FS + AA++L + + L + + + P +I + G +
Sbjct: 526 QIGNTCIAPTMDMQIVRELFSNTLQAAKILGVDNE-LYSQWQEIITHTVPNRIGKKGDLN 584
Query: 592 EWV 594
EW+
Sbjct: 585 EWL 587
>gi|418147412|ref|ZP_12784184.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13637]
gi|353810492|gb|EHD90743.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13637]
Length = 764
Score = 275 bits (702), Expect = 6e-71, Method: Compositional matrix adjust.
Identities = 188/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|417695030|ref|ZP_12344214.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47901]
gi|332198979|gb|EGJ13060.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47901]
Length = 764
Score = 274 bits (701), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 188/593 (31%), Positives = 299/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P NLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPVNLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|307705834|ref|ZP_07642675.1| alpha-fucosidase [Streptococcus mitis SK597]
gi|307620620|gb|EFN99715.1| alpha-fucosidase [Streptococcus mitis SK597]
Length = 764
Score = 274 bits (701), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 189/593 (31%), Positives = 299/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEVQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SSALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGDI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTATKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RVLTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|373958328|ref|ZP_09618288.1| glycoside hydrolase family 2 sugar binding [Mucilaginibacter
paludis DSM 18603]
gi|373894928|gb|EHQ30825.1| glycoside hydrolase family 2 sugar binding [Mucilaginibacter
paludis DSM 18603]
Length = 960
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 165/502 (32%), Positives = 266/502 (52%), Gaps = 49/502 (9%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
Y GD+ L F S Y+R+LD+ A A Y+ V FTRE+ +S+P + I+
Sbjct: 311 YLPFGDLILNFKTSS---QVMDYKRDLDIGKAVATTTYNSNGVNFTREYLASDPAKAIII 367
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+ S+ G +++ +LL ++ +Q+ ++ KG+ A
Sbjct: 368 HLKASKPG----QINMVALLQTSHKISSVHQVDANTIALDVKVQ---------KGV-LKA 413
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
+ + I GT+ + ++ + + +D + L A++SF N D P A
Sbjct: 414 VSYLYIKALSGTVKVINNQ-ISISKADDVTIYLTAATSFK----NYKDVSGKPDEICKQA 468
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
LQ+ + +++ L + + DYQ+ F+ S+ L D+ TD ER+K++
Sbjct: 469 LQAAKTKTFAQLKAQSITDYQQYFNTFSVNLGPGKVDVPTD-------------ERIKTY 515
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
DP L+ L Q+GRYLLIS SRP +++ ANLQGIWN+ + P+W S NINL+MNY
Sbjct: 516 SVAFDPGLLALYMQYGRYLLISCSRPNSKLPANLQGIWNDQMVPSWGSKFTTNINLQMNY 575
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + NL+ C++PLF ++ L++ G++TA+++Y A GW++HH TDIW +A
Sbjct: 576 WPAEELNLTPCEKPLFKMISQLAVTGAQTAKIHYDAPGWILHHNTDIWL-GTAPINASNH 634
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPS 519
+W G AWLC LWEHY YT D DFL+K Y ++G A F + L++ G+L + PS
Sbjct: 635 GIWQGGAAWLCHQLWEHYLYTGDIDFLKKH-YAEMKGAAEFFVSTLVKDPVTGFLISTPS 693
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH G L TMD IIR++F ISA+E+L K +DA + + + ++
Sbjct: 694 NSPEH------GGLVA---GPTMDRQIIRDLFKNCISASEIL-KTDDAFRKTLQEKYAQI 743
Query: 580 RPTKIAEDGSIMEWVQRRLNTS 601
P K+ + G + EW++ + +T+
Sbjct: 744 APNKVGKFGQLQEWMEDKDDTA 765
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 60/95 (63%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
++ A + LK+ + PA+ +TDA+PIGNG LGAM +GG+ S+ ++ NE TLW+G P
Sbjct: 14 LLAAAQNVFSQDLKLWYKKPAEKWTDALPIGNGTLGAMFYGGISSDRIQFNEQTLWSGSP 73
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF 95
Y A L ++R+L+ +G+ AEA A + K F
Sbjct: 74 RKYQRDGAATYLPEIRNLLFAGKQAEAEALAEKHF 108
>gi|225857727|ref|YP_002739238.1| large secreted protein [Streptococcus pneumoniae P1031]
gi|444410728|ref|ZP_21207248.1| hypothetical protein PNI0076_01706 [Streptococcus pneumoniae
PNI0076]
gi|444412459|ref|ZP_21208780.1| hypothetical protein PNI0153_00823 [Streptococcus pneumoniae
PNI0153]
gi|444422182|ref|ZP_21217843.1| hypothetical protein PNI0446_00530 [Streptococcus pneumoniae
PNI0446]
gi|225724930|gb|ACO20782.1| large secreted protein [Streptococcus pneumoniae P1031]
gi|444274421|gb|ELU80068.1| hypothetical protein PNI0153_00823 [Streptococcus pneumoniae
PNI0153]
gi|444276759|gb|ELU82299.1| hypothetical protein PNI0076_01706 [Streptococcus pneumoniae
PNI0076]
gi|444288455|gb|ELU93349.1| hypothetical protein PNI0446_00530 [Streptococcus pneumoniae
PNI0446]
Length = 764
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 187/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|393788805|ref|ZP_10376931.1| hypothetical protein HMPREF1068_03211 [Bacteroides nordii
CL02T12C05]
gi|392653911|gb|EIY47561.1| hypothetical protein HMPREF1068_03211 [Bacteroides nordii
CL02T12C05]
Length = 814
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 191/599 (31%), Positives = 299/599 (49%), Gaps = 70/599 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
T ++P+GNG LGA + G + +E + LNE TLW G P DY N + L ++R
Sbjct: 60 TSSLPLGNGSLGANIMGSIAAERITLNEKTLWKGGPNTSGGADYYWNVNKQSAPILKEIR 119
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
+G A + K F A + +G++ +E S + ++ Y
Sbjct: 120 QAFTAGDQKRAETLTRKNFNGLAAYEEKDETPFRFGSFTTMGEVYVETGLSEIGMSD--Y 177
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +++ R +F S PD V+V + + + G +L+F+ S ++
Sbjct: 178 KRILSLDSAMATVRFLKDGIKYQRNYFISYPDSVMVMRFTADKPGMQNLTFSYSPNTEAQ 237
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + G K N N ++F AI ++G +E+ KL
Sbjct: 238 GKIEADGTNGLYYAG---------KLNNNQMKFALRFRAI-------NKGGTVRVENGKL 281
Query: 243 KVEGSDWAVLLLVASSSFD---GPFINPSDS--KKDPTSESMSALQSIRNLSYSDLYTRH 297
++ ++ V LL A + + P N ++ +P+ + + ++ +Y LY RH
Sbjct: 282 VIKDANEVVFLLTADTDYKMNYNPDFNSPETYVGNNPSETTRNMMKQAEAKTYEVLYLRH 341
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQF 356
+DY LF+RV +LS +P+ + D +P+ +R+K + Q D L +L +Q+
Sbjct: 342 QNDYTALFNRV--KLSLNPQVPIAD---------LPTDQRLKHYRQGTPDYYLEQLYYQY 390
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ +L W H NIN++MNYW + NL EC PL
Sbjct: 391 GRYLLIASSRPGNMPANLQGIWHNNLDGPWRVDYHNNINIQMNYWPACSTNLDECMIPLI 450
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTA+ + A GW +I+ ++ ++ W PM G WL TH+W
Sbjct: 451 DFIRGLVKPGEKTAKAYFNARGWTASISANIFGFTAPLSSEQMEWNFNPMAGPWLATHIW 510
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL + YPL++ A F +D+L DG PSTSPEH
Sbjct: 511 EYYDYTRDKKFLSEIGYPLIKSSAQFTVDYLWHKPDGTYTAAPSTSPEH---------GP 561
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
V +T A++RE+ S ISA+++L DA K K L L P +I G +MEW
Sbjct: 562 VDQGATFVHAVVREILSDAISASKIL--GVDAKERKQWKDILKNLVPYQIGRYGQLMEW 618
>gi|15901970|ref|NP_346574.1| hypothetical protein SP_2160 [Streptococcus pneumoniae TIGR4]
gi|418131327|ref|ZP_12768207.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07643]
gi|418230992|ref|ZP_12857587.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP01]
gi|419478817|ref|ZP_14018636.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18068]
gi|421243935|ref|ZP_15700445.1| large secreted protein [Streptococcus pneumoniae 2081074]
gi|421248340|ref|ZP_15704814.1| large secreted protein [Streptococcus pneumoniae 2082170]
gi|14973671|gb|AAK76214.1| conserved hypothetical protein [Streptococcus pneumoniae TIGR4]
gi|353800742|gb|EHD81051.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07643]
gi|353884503|gb|EHE64302.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP01]
gi|379563089|gb|EHZ28094.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18068]
gi|395605861|gb|EJG65975.1| large secreted protein [Streptococcus pneumoniae 2081074]
gi|395612201|gb|EJG72246.1| large secreted protein [Streptococcus pneumoniae 2082170]
Length = 764
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 187/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|419443562|ref|ZP_13983582.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13224]
gi|379549113|gb|EHZ14224.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13224]
Length = 764
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 187/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L + + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPKVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 547
>gi|429847882|gb|ELA23431.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 798
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 193/592 (32%), Positives = 282/592 (47%), Gaps = 65/592 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLG VWGG +ETL +NEDT+W+G D T P+A L R L SG+ E
Sbjct: 42 ALPIGNGRLGGTVWGGA-NETLTINEDTIWSGPIQDRTPPNALATLPVARKLFLSGKITE 100
Query: 87 ATAASVKLFGHPAD----VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
++ PA+ + G+++L+F S E Y R LD + Y+
Sbjct: 101 GGQLVLREM-TPAEKSERQFGYFGNLDLDFGHSG---NLENYVRWLDTKQGNSGSSYAFD 156
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDNHSYVNGNNQIIMEGRC 199
V FTRE +S P V+ + + SE G+L+ S L ++L N + G +
Sbjct: 157 GVNFTREFVASYPAGVLAARFTSSEEGALNLKASFSRLANILVNVASTAGGVNSVTLMSS 216
Query: 200 PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSS 259
G+ + D I F+ K + +GS VL + +++
Sbjct: 217 SGQPL--------DENPILFTGQARF----------VAPGAKFENDGS---VLRITGATA 255
Query: 260 FDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
D F ++ S+ + +E L + YSDL L D L R SI L +S
Sbjct: 256 IDLFFDAETNYRFASQDEWEAEIDRKLNAALTKGYSDLRDEALKDSSSLLGRASIDLGKS 315
Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQV--- 371
P+ + +P+ ERV + + D L L + GR++L+ +SR T+
Sbjct: 316 PR----------GLSALPTDERVAIARNNSSDVELSTLTWNLGRHMLVGASR-NTEADID 364
Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWN + W +NIN EMNYW + P NL E QEPLFD + + G
Sbjct: 365 MPANLQGIWNNKTTAAWGGKYTININTEMNYWSAGPTNLIETQEPLFDLMKVANPRGKAM 424
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y G + HH D+W A +WPMG AWL H+ +HY++T D+ FL
Sbjct: 425 AKAMYGCDGTMFHHNLDVWGDPGATDNYTSSTMWPMGAAWLVQHMVDHYHFTGDKTFLAD 484
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDM 544
AYP L A+F + E H+GY T PS SPE+ F+ P G+ + MD
Sbjct: 485 VAYPFLIDVATFYECYTFE-HEGYRITGPSLSPENTFVVPSNFSVAGRSEPMDIDIPMDN 543
Query: 545 AIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++ +VFSAII AA++L + N+D ++K LPR++P +I G I+EW
Sbjct: 544 QLMHDVFSAIIEAADILGIDDTNQD--LKKAKDFLPRIKPAQIGSKGQILEW 593
>gi|238482887|ref|XP_002372682.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220700732|gb|EED57070.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 608
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 181/592 (30%), Positives = 293/592 (49%), Gaps = 60/592 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ P F ++P+GNGRLG ++ +P+E + NED++W+G D N +A VR
Sbjct: 34 YDTPGTRFNASLPVGNGRLGGTLYY-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+L+ +G A ++ + G D YQ+L ++ ++ + R LD
Sbjct: 93 NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQ---RGDATNLVRYLDTLEG 149
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+Y V +TRE +S P V+ +I + S +++ N + NG I
Sbjct: 150 YTACEYGFDGVSYTRELIASAPSGVLGFRIQANTSRAINLN----------AVANGIASI 199
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+M+ R + F+A + + + D G ++A DK L V G+ V
Sbjct: 200 VMKAR------------TGEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATTVVFF 244
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A SS+ + D +E L + L Y L + D++ L RV++ L
Sbjct: 245 LDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVTLDLG 298
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRPGTQV 371
S D + +P ER+ ++++ D D L+F +GR+LLI+SSR +
Sbjct: 299 SSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRRTRER 348
Query: 372 A---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ LQGIWN+D SP+W + VNINLEMNYW + NL+E PL+D L + G
Sbjct: 349 SLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQERGGD 408
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ + G+V+HH TD+W S +++WPMGGAWL H+ EHY +T D+ FL+
Sbjct: 409 VAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDKTFLK 468
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
++A P+ + F +L + DGYL T PS SPE+ F P GK ++ S T+D
Sbjct: 469 EQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMSPTLD 527
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+++ E+ +A+ ++LE + D L V L ++RP +I DG I+EW++
Sbjct: 528 NSMLFELLTALNETHQILEIDND-LSGSVQTYLGKIRPPRIGSDGQILEWIE 578
>gi|169834518|ref|YP_001695515.1| large secreted protein [Streptococcus pneumoniae Hungary19A-6]
gi|168997020|gb|ACA37632.1| large secreted protein [Streptococcus pneumoniae Hungary19A-6]
Length = 764
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 187/593 (31%), Positives = 298/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWIVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|223932290|ref|ZP_03624293.1| conserved hypothetical protein [Streptococcus suis 89/1591]
gi|386584235|ref|YP_006080638.1| hypothetical protein SSUD9_1198 [Streptococcus suis D9]
gi|223898971|gb|EEF65329.1| conserved hypothetical protein [Streptococcus suis 89/1591]
gi|353736381|gb|AER17390.1| conserved hypothetical protein [Streptococcus suis D9]
Length = 763
Score = 272 bits (696), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 188/593 (31%), Positives = 300/593 (50%), Gaps = 56/593 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSAVKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKVREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-PSALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVIFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG++F + K++D G ++ L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVRFKVVCHSKVTD--GEVNVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNL-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLEDTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RIL-REHFEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QILRYFCDSCIGIAKQLVDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 547
>gi|374992668|ref|YP_004968163.1| alpha-L-fucosidase [Streptomyces bingchenggensis BCW-1]
gi|297163320|gb|ADI13032.1| Alpha-L-fucosidase [Streptomyces bingchenggensis BCW-1]
Length = 789
Score = 272 bits (695), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 193/593 (32%), Positives = 278/593 (46%), Gaps = 46/593 (7%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL-S 73
I PA F D+ IGNG LG + G V +E + LN D+LW+G P + +P L
Sbjct: 6 IQLTEPATAFHDSFLIGNGSLGGTLRGAVGTERIDLNLDSLWSGGPVTAEDTGSPAGLLP 65
Query: 74 DVRSLVDSGQYAEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+R+ + + + + G + YQ LG +E + D+ Y+R L+L
Sbjct: 66 QLRAAIRAEDNVRVEKLAQAMMGPGWTESYQPLGWLEWHYADTSDATG---YQRRLNLAD 122
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A Y E F S PD V+V ++G G+ S V L + + H +
Sbjct: 123 AVATTGYGPAGAEVEMSSFVSAPDNVLVVTVTGP--GAASHPV-LPTFVSPHPVTTAAPR 179
Query: 193 ---IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ GR P + +P N D+ + + ++ G +
Sbjct: 180 PGLLVATGRVPARVLP---NYVDEEPAVVYGEDEPDGAGTVAAGAGFAVAVAVERTGPEA 236
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSI-RNLSYS--DLYTRHLDDYQKLFH 306
L+ A+S F G PS D + + SA +++ R L+ + L RH+ DY+ F
Sbjct: 237 LRLIAAAASGFRGYDRRPS---ADLAALARSAEETVTRALTRTAEQLVQRHVQDYRSYFD 293
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV + LS SP DP+ ELLF FGRYLLISSSR
Sbjct: 294 RVDLDLSASPA------------------------ADHGDPARAELLFHFGRYLLISSSR 329
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGT+ ANLQGIWN D+ P W + NIN+EMNYW + L + P+ L+ +G
Sbjct: 330 PGTEAANLQGIWNIDVRPGWSANYTTNINVEMNYWAAESTALEDVHGPMLTLADDLAESG 389
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+ TA Y A+G V+HH TDIW S+ +G WA WP G WL H+W+HY Y + DF
Sbjct: 390 TATAARYYGAAGAVVHHNTDIWRFSTPVKGDTQWATWPTGLYWLAAHVWDHYEYGGNDDF 449
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG-KLACVSYSSTMDMA 545
A + A F LD L+ DG L T+PSTSPEH F+ P + A VS +TMD
Sbjct: 450 GAGPALRVHRSAALFALDMLVPDDDGLLVTSPSTSPEHRFVLPPAPRGAAVSEGTTMDQE 509
Query: 546 IIREVFSAIISAAEVLEK-NEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
++ EV S ++ AE + ++D L+ + +L LR I G ++EW R
Sbjct: 510 LVHEVLSRYVTLAERFGRGDDDVLLARARHALGALRLPGIGASGELLEWKDER 562
>gi|393784536|ref|ZP_10372699.1| hypothetical protein HMPREF1071_03567 [Bacteroides salyersiae
CL02T12C01]
gi|392665517|gb|EIY59041.1| hypothetical protein HMPREF1071_03567 [Bacteroides salyersiae
CL02T12C01]
Length = 818
Score = 272 bits (695), Expect = 5e-70, Method: Compositional matrix adjust.
Identities = 190/604 (31%), Positives = 286/604 (47%), Gaps = 84/604 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G V +E + LNE TLW G P DY N + + ++R
Sbjct: 63 SLPIGNGSLGANILGSVAAERITLNEKTLWKGGPNTAGGADYYWKVNKQSASVMEEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETYRR 126
G Y +A + K F A + +G+I +E S + ++ Y R
Sbjct: 123 FTDGDYEKAELLTRKNFNGLAHYEEGDETPFRFGSFTTMGEIYVETGLSEIGMSD--YYR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
L L++A A V + N + R++F S PD V+ K + +++G
Sbjct: 181 ALSLDSAMAVVSFKKDNTRYMRKYFISYPDSVMAMKFTANKTGK---------------- 224
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE-------IKISD-DRGTISALE 238
Q ++ CP A DD G+ ++ +LE I+I +G + +E
Sbjct: 225 -----QNLVLRYCPNSEAKSSLCA-DDTDGLLYTGVLENNGMKFAIRIKAITKGGTTTVE 278
Query: 239 DKKLKVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDL 293
+L V+ +D V LL A + +F F +P DP + ++ Y +L
Sbjct: 279 QDRLIVKDADEVVFLLTADTDYKMNFQPDFKDPKTYVGSDPEQTTRKTMEGAIRKGYDEL 338
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVEL 352
Y H DY LF+RV +QL+ E +P+ R+ +++ + D L EL
Sbjct: 339 YRAHEADYTSLFNRVKLQLN-----------PEVTARNLPTNLRLANYRKGQADYRLEEL 387
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+Q+GRYLLI+ SR G ANLQG+W+ +L+ W H NIN++MNYW + NL EC
Sbjct: 388 YYQYGRYLLIACSRSGNMPANLQGMWHNNLNGPWRVDYHNNINIQMNYWPACSTNLGECT 447
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLC 471
PL DF+ L G++TA+ + A GW +I+ +S + + W PM G WL
Sbjct: 448 RPLVDFIRSLVKPGAETAKAYFNARGWTASISANIFGFTSPLSSEDMSWNFNPMAGPWLA 507
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
TH+WE+Y+YT D++FL+ Y LL+ A F +D+L DG PSTSPEH
Sbjct: 508 THIWEYYDYTRDKEFLKSTGYDLLKSSAQFTVDYLWHKPDGTYTAAPSTSPEH------- 560
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGS 589
V +T A++RE+ I A++VL +K E E VL L P KI G
Sbjct: 561 --GPVDEGTTFVHAVVREILLNAIEASKVLGVDKKERKEWEYVLAHLA---PYKIGRYGQ 615
Query: 590 IMEW 593
+MEW
Sbjct: 616 LMEW 619
>gi|121719440|ref|XP_001276419.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
gi|119404617|gb|EAW14993.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
Length = 781
Score = 271 bits (694), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 197/600 (32%), Positives = 299/600 (49%), Gaps = 64/600 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PAK F+ +PIGN RL A +WG + ++ + LNE+++W+G D NP + + + VR
Sbjct: 29 YTSPAKDFSSTLPIGNSRLAAAIWGSL-TDNITLNENSIWSGPFQDRVNPRSYEGFTQVR 87
Query: 77 SLVDSGQYAEATAAS-VKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
S++ G+ + A + V + G P Y LG ++L+F + Y R LDL
Sbjct: 88 SMLQDGKISAANQLTLVDMAGIPTSPRAYNPLGALKLDFGHDTVN----NYTRFLDLGMG 143
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V+Y NV ++RE+ +S+PD ++ ++ S GSL+ SL+ YV N
Sbjct: 144 VAGVEYEYDNVTYSREYVASHPDGILAVRLRASTPGSLNVACSLE----RSRYVKSNTAN 199
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+ R + KAN I F A E +I G +S+ + + + G+ +
Sbjct: 200 V---RKSWGTLTLKANTGQANDPISFVA--EAQIVSVGGHMSS-DGSSVVINGASTIDIF 253
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL- 312
A +S+ F DS+ S+ + A + TR DY L RV + L
Sbjct: 254 FDAQTSYR--FFE-EDSRAAQLSKQLDAAVKQGYPAVKKAATR---DYASLTSRVRLNLG 307
Query: 313 -SRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSRPGT 369
S + TD R+ +++ D DP L L+F FGR+LLI+SSR G
Sbjct: 308 SSGAAGGFSTDV-------------RLFNYKKDANSDPELATLMFNFGRHLLIASSRGGD 354
Query: 370 QV---ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
ANLQGIWNED P W V++NLEMNYW + NL+E P+ D + + +G
Sbjct: 355 TPGLPANLQGIWNEDYEPAWGGKYTVDVNLEMNYWPAQVTNLAETFGPVVDLMDTVVPHG 414
Query: 427 SKTAQVNYLA-SGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
AQ Y +G+V+HH TD+W ++ D G AW+ +L E Y +T D+
Sbjct: 415 KDVAQRMYHCDAGYVLHHNTDLWGDAAPVDNGT----------AWMSMNLIEQYRFTQDK 464
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYS 539
L++R +PLL+ A+F +L E H+G+ + PS SPEH FI PD GK A + S
Sbjct: 465 SLLKERIWPLLKEAANFYYCYLFE-HEGHYISGPSISPEHAFIVPDEMSVPGKEAGIDLS 523
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLN 599
TMD ++++E+F+A+I A L D ++K K L +L P I G I+EW +R N
Sbjct: 524 PTMDNSLLQELFAAVIEACTTLGITGDD-IDKAQKYLSKLPPPPIGSYGQILEW-RREYN 581
>gi|168071227|ref|XP_001787102.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162659703|gb|EDQ48084.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 319
Score = 271 bits (693), Expect = 7e-70, Method: Compositional matrix adjust.
Identities = 130/322 (40%), Positives = 191/322 (59%), Gaps = 9/322 (2%)
Query: 215 KGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDP 274
+G+ S +++ + GT E +L V G+ LL+ A++ F G P +P
Sbjct: 6 EGLGLSFEVQLLALTEGGTAKVDESGRLIVRGAQSVTLLVAAATDFAGYEKAPGSGGVNP 65
Query: 275 TSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPS 334
++AL Y L RH++D+++LF RV ++L + T + E + P+
Sbjct: 66 AERCLAALTKAAEFGYERLRERHVEDHRRLFERVELRLG-------SATAAAERA-SRPT 117
Query: 335 AERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
ER+++++ ED +L L F +GRYLL++SSRPGT+ A+LQGIWN + P W+ N
Sbjct: 118 DERLEAYRNGAEDLALEALYFHYGRYLLMASSRPGTEAAHLQGIWNPHVQPPWNCGYTTN 177
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
IN +MNYW + L EC EPLF+ + LS+ GS+TA+++Y A GWV HH D+W +S+
Sbjct: 178 INTQMNYWHAEVAGLPECHEPLFELIRDLSVTGSRTARIHYGARGWVAHHNVDLWRQSTP 237
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
G+ WA WP+GG WLC HLWEHY + + FL + AYPL++G A F DWL+ G DG
Sbjct: 238 SDGESSWAFWPLGGVWLCRHLWEHYQFAPNESFLLETAYPLMKGAAEFSQDWLVAGPDGR 297
Query: 514 LETNPSTSPEHEFIAPDGKLAC 535
L T PSTSPE++F+ PD C
Sbjct: 298 LVTAPSTSPENKFLTPDRGEPC 319
>gi|326799708|ref|YP_004317527.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326550472|gb|ADZ78857.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 943
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 163/501 (32%), Positives = 260/501 (51%), Gaps = 45/501 (8%)
Query: 96 GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNP 155
G + YQ GD+ L+F + Y+R LD+ A + Y V F R +FSS P
Sbjct: 287 GKYQESYQPFGDLLLDF---RAQAPFSNYKRTLDVEQAICKTSYVQNGVSFERTYFSSAP 343
Query: 156 DQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK 215
D + ++ +SF+ SL S ++ ++ I RI + +
Sbjct: 344 DACLAIHLTADRPRQISFDASLASPHKTYNVEKVDDSTI--------RISVQVKQGV-LR 394
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
G+ F + + + G + + D K+K+ G++ A L L A++++ + +D D
Sbjct: 395 GVGF-----LHVRHEGGELH-VGDGKIKILGANQATLFLTAATNYK----SYNDVSGDAE 444
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ S L ++N Y + H+ DYQ+ F + S++ ++E +++P+
Sbjct: 445 EIAKSQLNKVKNKPYDVIRLAHIQDYQQYFTKFSLKFE-----------ADEASNSLPTD 493
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
+R+ F DP+L+ L Q+GRYLLISSSR G NLQGIWN+ L+P W S NIN
Sbjct: 494 QRIAQFVKSRDPNLLALFVQYGRYLLISSSRSGGLAPNLQGIWNDLLTPPWGSKYTTNIN 553
Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADR 455
EMNYW + NLSE QEPLF + LS+ G +TA+ Y A GWV+HH TD+W + +A
Sbjct: 554 AEMNYWLAENTNLSELQEPLFQMIKELSVVGQETAKTYYDAPGWVLHHNTDLW-RGTAPI 612
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYL 514
+W GGAWLC HLWEH+ YT D FL ++AYP+++ A F +L+ + G+L
Sbjct: 613 NNPNHGIWVTGGAWLCQHLWEHFLYTQDESFLREQAYPIMKASALFFDHFLVSDPKTGWL 672
Query: 515 ETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK 574
+ PS SPE G L TMD +IR++F + +AA +L+ +++ + +L
Sbjct: 673 ISTPSNSPEQ------GGLVA---GPTMDHQLIRQLFRNVAAAATILKLDKE-FAQHILD 722
Query: 575 SLPRLRPTKIAEDGSIMEWVQ 595
++ P +I + G + EW++
Sbjct: 723 KGAKIAPNQIGKYGQLQEWLE 743
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 53/83 (63%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PA +T+A+PIGNG+LGAMV+GGV ++ ++ NE +LWTG P +Y P A L
Sbjct: 28 LTLWYQHPANTWTEALPIGNGKLGAMVFGGVQADRIQFNESSLWTGGPRNYNQPGAKNYL 87
Query: 73 SDVRSLVDSGQYAEATAASVKLF 95
++R L+ G+ A + + F
Sbjct: 88 GEIRKLLSEGKQQAAEELAGRHF 110
>gi|319902716|ref|YP_004162444.1| alpha-L-fucosidase [Bacteroides helcogenes P 36-108]
gi|319417747|gb|ADV44858.1| Alpha-L-fucosidase [Bacteroides helcogenes P 36-108]
Length = 832
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 193/600 (32%), Positives = 300/600 (50%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG +GA + G V +E + NE TLW G P DY N + L +R
Sbjct: 75 SQSLPIGNGSIGASIMGSVEAERITFNEKTLWRGGPNTSKGADYYWNVNKQSAHVLEQIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-YQLLGDIELEFDD--SHLKYAEET---------Y 124
G A+A + + F +DV Y+ + F + + ++ ET Y
Sbjct: 135 KAFVEGDQAKAEKLTRENFN--SDVPYEAARENPFRFGNFTTMGEFYVETGLNIIGMSGY 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V+++ V++ R +F S P V+V + + S +G +L F+ + + +
Sbjct: 193 KRALSLDSAMAVVQFTKDKVDYQRTYFISYPANVMVMRYTASRAGMQNLVFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G + ++ +A D G+++ ++ I + G +S D KL
Sbjct: 253 GSISADGMDGLVY-------------SAVLDNNGMKY--VVRIHAVVNGGKLSN-ADGKL 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
V+G+D V + A + +FD F NP+ +P + + S Y L H
Sbjct: 297 TVKGADEVVFYVTADTDYQINFDPDFANPATYVGVNPAETTRKWMDSAVAKGYDLLRKEH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ P TD +P+++R+K++++ + D L EL +QF
Sbjct: 357 YEDYATLFNRVKLVLN--PDAKATD---------LPTSQRLKNYRSGKPDYYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC EPL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINVQMNYWPACSTNLDECMEPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G +TAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGKRTAQAYFGARGWTASISGNIFGFTAPLESQDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSADFAVDYLWHKPDGTFTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
V +T A+IRE+ I A+ VL +K E E+VL RL P +I G +MEW
Sbjct: 577 VDQGTTFVHAVIREILLDAIEASRVLGVDKAERRQWEQVLA---RLLPYRIGRYGQLMEW 633
>gi|383115161|ref|ZP_09935919.1| hypothetical protein BSGG_2959 [Bacteroides sp. D2]
gi|313695424|gb|EFS32259.1| hypothetical protein BSGG_2959 [Bacteroides sp. D2]
Length = 829
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 188/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+KS++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKSYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
>gi|288801450|ref|ZP_06406903.1| fibronectin type III domain protein [Prevotella sp. oral taxon 299
str. F0039]
gi|288331661|gb|EFC70146.1| fibronectin type III domain protein [Prevotella sp. oral taxon 299
str. F0039]
Length = 827
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 186/600 (31%), Positives = 295/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++P+GNG LGA V G + +E + NE TLW G P DA L ++R
Sbjct: 72 SQSLPLGNGSLGANVMGSIEAERITFNEKTLWRGGPNTSAGADAYWNVNKQSAHYLKEIR 131
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + +E Y
Sbjct: 132 QAFIEGNEKKAALLTRKNFNSTVPYESWKDKPFRFGNFTTMGEFYIETGLSSIGMSE--Y 189
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R +F S P+ ++V + + G +L F+ + +
Sbjct: 190 KRALSLDSALAVVQFKKDGVRYERNYFISYPNNIMVVRFKADQPGKQNLVFSYETNPVST 249
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G+N ++ KA+ +++ Q ++ IK + GTI+ + KL
Sbjct: 250 GKMEADGSNGLVF-----------KAHLDNN----QMEYVVRIKALNQGGTINN-DKGKL 293
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRH 297
+ G++ V L+ A + +F+ + NP SE+ +A ++ Y+ L H
Sbjct: 294 TINGANEVVFLITADTEYKVNFNPDYKNPRTYVGVNPSETTAAWMKKAVAQGYNALLEAH 353
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQF 356
DY LF+RVS+ L+ SE+ +P+ +R+ +++ ED L EL +QF
Sbjct: 354 YKDYSSLFNRVSLTLN-----------SEQRTSDIPTPQRLINYRKGKEDFYLEELYYQF 402
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NLSEC PL
Sbjct: 403 GRYLLIASSRPGNLPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPAGSTNLSECTLPLI 462
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + W PM G WL TH+W
Sbjct: 463 DFIRTLVKPGEKTAQAYFDARGWTASISGNIFGFTAPLGSEDMSWNFNPMAGPWLATHVW 522
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
++Y+YT D+ FL++ Y L++ A F +D+L + DG PSTSPEH
Sbjct: 523 DYYDYTRDKKFLKEVGYDLIKSSAIFAVDFLWKKPDGTYTAAPSTSPEH---------GP 573
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL +K E E+VLK R+ P K+ G ++EW
Sbjct: 574 IDEGTTFVHAVIREILMNAIDASKVLDVDKKERKQWEEVLK---RIAPYKVGRYGQLLEW 630
>gi|419535657|ref|ZP_14075151.1| hypothetical protein SPAR46_2252 [Streptococcus pneumoniae GA17457]
gi|379561797|gb|EHZ26812.1| hypothetical protein SPAR46_2252 [Streptococcus pneumoniae GA17457]
Length = 746
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 186/578 (32%), Positives = 291/578 (50%), Gaps = 56/578 (9%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEA 87
+PIGNG LG M++G E ++LN++T+W D NPD+ L +R + G+ +A
Sbjct: 1 MPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKA 60
Query: 88 TA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVG 142
+ +F P D Y+LLG++ +E D A Y RELDL+TA + V + +
Sbjct: 61 EELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSC 119
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCP 200
N++ RE+F+S ++ +I S +L+ N++L + ++ ++ I+M
Sbjct: 120 NLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAG 179
Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
G+ KG+QF + K++D G +S L + + + + L L + + +
Sbjct: 180 GR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDY 224
Query: 261 DGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
G +S+LQ ++ Y H+ YQ+ F+RV +L S KD
Sbjct: 225 WGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDC 270
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
++ I T E K + L LLF +GRYLLISSS+P ANLQGIW
Sbjct: 271 LS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWC 319
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++L+P W S +NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+
Sbjct: 320 DELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGF 379
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
HH TD + ++ + A+W + WLCTH+WEHY Y D L + + +++
Sbjct: 380 TAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAF 438
Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
F D+L E DGYL T PS SPE+++ +G SST+D I+R + I A+
Sbjct: 439 LFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAK 497
Query: 560 VLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 532
>gi|90022148|ref|YP_527975.1| hypothetical protein Sde_2503 [Saccharophagus degradans 2-40]
gi|89951748|gb|ABD81763.1| a-L-fucosidase-like protein [Saccharophagus degradans 2-40]
Length = 803
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 194/614 (31%), Positives = 310/614 (50%), Gaps = 72/614 (11%)
Query: 6 STSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG--- 61
ST L I F PA + ++ +P+GNG +G +V G V ETL+LNE TLWTG PG
Sbjct: 26 STVAAKSLPIWFGAPALDWESEGLPMGNGAMGIVVTGEVARETLQLNEKTLWTGGPGAKG 85
Query: 62 -------DYTNPDAPKALSDV--RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF 112
D D + + +D A+ ++ +GH YQ G++++++
Sbjct: 86 YNFGLPTDSIKQDVAHVRQQITLHNGIDPQTAADKLGQNMHGYGH----YQSFGELDIQY 141
Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
+D A Y R LDL A V Y+ N + RE+F S P Q + K+S S S+S
Sbjct: 142 NDQ--TGAVSNYVRSLDLTQGVATVAYTRNNTHYKREYFVSYPQQAAIVKLSASNKQSIS 199
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
F++ + V+ N I + + K N+ +Q+ I +++I D G
Sbjct: 200 FDLGVR--------VHPNRTIETQVKRGVLTFSGKLFDNN----LQY--IGKVQIVVDGG 245
Query: 233 TISALEDK-KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
++ E +++V ++ AV+ +VA +++ + P + P L+ I+ YS
Sbjct: 246 ELTENEKTGRIQVSRANSAVISIVAGTNYAQAY--PHYRGRLPVKTLDKNLEKIKASEYS 303
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD---EDPS 348
L HL DY LF RV + L + +E + P+ E +K ++ + + +
Sbjct: 304 ALLAEHLTDYTALFGRVELSLIEN---------AESYLLAKPTPELLKQYKGEGSAPERA 354
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L +L FQFGRYLLI+SSR G+ ANLQG+WN +P W++ HVNINL+MNYW + NL
Sbjct: 355 LEQLYFQFGRYLLIASSRNGSLPANLQGVWNNSATPPWNADYHVNINLQMNYWPAQVTNL 414
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-PM 465
E P FDF+ L G ++AQ + A GW + T+I+ + G + W A W P
Sbjct: 415 GETALPFFDFIDSLVEPGKQSAQKVFGARGWTLFLNTNIFGYT----GLIEWPTAFWQPE 470
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH 524
AWL H +EHY + D FL++RAYP+++ A F +D L+ + + G L +PS SPE
Sbjct: 471 AAAWLAQHYFEHYQFYQDNTFLKERAYPVMKEAALFWVDVLVADPNTGLLVVSPSFSPEQ 530
Query: 525 -EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRP- 581
F++ + M I+ ++F+ ++ AA ++ DA +K++++ L +L P
Sbjct: 531 GPFVS----------GAAMSQQIVFDLFTNVVEAANLV---GDAEFKKLIQAKLAKLDPG 577
Query: 582 TKIAEDGSIMEWVQ 595
T+I G + EW Q
Sbjct: 578 TRIGSWGQLQEWQQ 591
>gi|423280895|ref|ZP_17259807.1| hypothetical protein HMPREF1203_04024 [Bacteroides fragilis HMW
610]
gi|404583536|gb|EKA88214.1| hypothetical protein HMPREF1203_04024 [Bacteroides fragilis HMW
610]
Length = 829
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 182/600 (30%), Positives = 292/600 (48%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P N + L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSKGAEAYWNVNKQSAHVLDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + + Y
Sbjct: 135 QAFSEGDQKKAAMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYVETGLSAVNMS--GY 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R +F S P V+ + G +L+F+ + + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERSYFISYPANVMAIRFKADRPGKQNLTFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G+N + A+ D G+Q+ ++ I + GT+S D K+
Sbjct: 253 GSMTTDGSNGLTY-------------TAHLDNNGMQY--VVRIHATTKGGTLSN-ADGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D AV L+ A + +FD F +P +P + + + ++ Y L+ +H
Sbjct: 297 TVKDADEAVFLITADTDYKINFDPDFKDPKTYVGVNPAETTRQWMDNAVSMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ ++ +P+A+R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLN-----------PDQQSANLPTAKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + P NL+EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACPTNLNECTLPLV 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTAPLESEDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P KI G +MEW
Sbjct: 577 IDEGTTFVHAVIREILLDAIEASKVLGVDSKERKQWQEVLA---HLAPYKIGRYGQLMEW 633
>gi|419441357|ref|ZP_13981397.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40410]
gi|421282151|ref|ZP_15732944.1| hypothetical protein SPAR155_2159 [Streptococcus pneumoniae
GA04672]
gi|421308367|ref|ZP_15759005.1| hypothetical protein SPAR166_2207 [Streptococcus pneumoniae
GA60132]
gi|421310565|ref|ZP_15761187.1| hypothetical protein SPAR168_2146 [Streptococcus pneumoniae
GA62681]
gi|421312927|ref|ZP_15763524.1| hypothetical protein SPAR167_2288 [Streptococcus pneumoniae
GA58981]
gi|379576014|gb|EHZ40943.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40410]
gi|395878598|gb|EJG89661.1| hypothetical protein SPAR155_2159 [Streptococcus pneumoniae
GA04672]
gi|395905170|gb|EJH16076.1| hypothetical protein SPAR166_2207 [Streptococcus pneumoniae
GA60132]
gi|395907679|gb|EJH18569.1| hypothetical protein SPAR167_2288 [Streptococcus pneumoniae
GA58981]
gi|395908180|gb|EJH19063.1| hypothetical protein SPAR168_2146 [Streptococcus pneumoniae
GA62681]
Length = 749
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 186/578 (32%), Positives = 291/578 (50%), Gaps = 56/578 (9%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEA 87
+PIGNG LG M++G E ++LN++T+W D NPD+ L +R + G+ +A
Sbjct: 1 MPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKA 60
Query: 88 TA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVG 142
+ +F P D Y+LLG++ +E D A Y RELDL+TA + V + +
Sbjct: 61 EELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSC 119
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCP 200
N++ RE+F+S ++ +I S +L+ N++L + ++ ++ I+M
Sbjct: 120 NLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAG 179
Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
G+ KG+QF + K++D G +S L + + + + L L + + +
Sbjct: 180 GR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDY 224
Query: 261 DGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
G +S+LQ ++ Y H+ YQ+ F+RV +L S KD
Sbjct: 225 WGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDC 270
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
++ I T E K + L LLF +GRYLLISSS+P ANLQGIW
Sbjct: 271 LS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWC 319
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++L+P W S +NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+
Sbjct: 320 DELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGF 379
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
HH TD + ++ + A+W + WLCTH+WEHY Y D L + + +++
Sbjct: 380 TAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAF 438
Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
F D+L E DGYL T PS SPE+++ +G SST+D I+R + I A+
Sbjct: 439 LFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAK 497
Query: 560 VLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 498 QLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 532
>gi|251795324|ref|YP_003010055.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
gi|247542950|gb|ACS99968.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
Length = 775
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 184/599 (30%), Positives = 307/599 (51%), Gaps = 52/599 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKA 71
+K+ + PA + +P+GNG+LGA++ GG+ SET + E T W+G P + +PDA +
Sbjct: 4 MKMIYTQPAAGWKQGLPLGNGQLGAVLHGGINSETWNMTEITFWSGKPERFGGSPDAKEK 63
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR--RELD 129
L +R +G Y KL G + + L D + Y +E + RELD
Sbjct: 64 LKTMREAFFNGNYVLGD----KLAGEQLEPVKGNFGTNLSLCDVLISYNDEGSQLVRELD 119
Query: 130 LNTATARVKYSVGN-VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-SYV 187
L A A V Y G+ RE F S+PD V+V++I G ++GS+S ++ ++ + +
Sbjct: 120 LEKAVAAVSYRSGSGAAMRRETFVSHPDGVLVSRIKGDQAGSVSLSLRIEGRTTTFDARL 179
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+G ++++ R N + D G+ L+ ++ R E + +E
Sbjct: 180 DGPDKLVF-------RTQATENIHSDGTCGVWSEGALKAVVTGGR---VFGEAGTVIIEQ 229
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPT--SESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+D VL L ++ + + D T ES L++ + L H+ DY+ L
Sbjct: 230 ADEVVLYLAVATDY---------GRMDDTWKVESTERLEAAEAKGFERLLRDHIADYRSL 280
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLI 362
+ RV + L S + D +P+ ER++ + E D L+ L +Q+GRYL I
Sbjct: 281 YGRVDLDLGGS-----------KAFDLLPTDERIRKLRAGEQTDNGLIALFYQYGRYLTI 329
Query: 363 SSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
+ +R +++ +LQG+WN E + W H+++N EMNY+ + NL+EC PL +++
Sbjct: 330 AGTRADSRLPLHLQGLWNDGEANAMAWSCDYHLDVNTEMNYYPTEISNLAECHIPLMNYI 389
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LS G A+ Y GWV H ++ W +S G+ W L GG W+ THL EHY
Sbjct: 390 EQLSFAGRTAAEDFYGCEGWVAHVFSNAWGFASPGWGRS-WGLNVTGGLWIATHLKEHYE 448
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFI-APDGK-LACV 536
Y+ DR FL ++AYP+++ A F LD++ I G+L T PSTSPE+ F P+ + +
Sbjct: 449 YSRDRGFLTRQAYPVMKEAALFFLDYMTIHPKYGWLVTGPSTSPENSFYPGPEEQGEQQL 508
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S STMD ++R++F ++ AAE+L +E+ L ++ ++ L P +I + G + EW++
Sbjct: 509 SMGSTMDQMLVRDLFGFVLEAAEMLAVDEE-LQHRLKDAMELLPPLQIGKRGQLQEWLE 566
>gi|299144684|ref|ZP_07037752.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298515175|gb|EFI39056.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 829
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 187/600 (31%), Positives = 298/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F++D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFVVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
>gi|332671290|ref|YP_004454298.1| alpha-L-fucosidase [Cellulomonas fimi ATCC 484]
gi|332340328|gb|AEE46911.1| Alpha-L-fucosidase [Cellulomonas fimi ATCC 484]
Length = 820
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 190/610 (31%), Positives = 290/610 (47%), Gaps = 54/610 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPG-------DYTNP 66
++F+GPA+ + +A P+GNGRLGAM+ GG +++N+ T W+G V G
Sbjct: 30 LSFDGPARRWVEAFPVGNGRLGAMLHGGTERALVQVNDATAWSGRVDGPARALAAVRAAG 89
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR- 125
P L+ R + +G++ EA G +Q D+ + S + A+ +R
Sbjct: 90 AGPDRLARARDALAAGRHDEAADLLAVFQGPWTQAFQPFVDLHVTVA-SAPRPAQVRHRD 148
Query: 126 ---RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
R LDL R + G VE E F+S D + + S +E + +S +
Sbjct: 149 DSPRTLDLRDGVVRERLPAG-VEV--EWFASAVDGALHGRWSAAEPFDVHVELSTPHHVR 205
Query: 183 NHSYVNGNNQIIME---GRCPGKRI-PPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ G +++E PG P DD + A+L + D G +
Sbjct: 206 TDHHAPGGRVLVLELPDDVAPGHEPDAPAVTRTDDGASLTGVAVL-LACGD--GEVGGTP 262
Query: 239 DKKLKVEGSDWAVLLLVASSSF----DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
L+VE + W ++L ++ DGP + + D + + AL R +
Sbjct: 263 GGALRVERATWVEVVLATGTTSPWPQDGPLRDREEVVADVLACARRALPGDRGTGDA-TR 321
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
RH+ D++++ + L P D+ D + I T P A +L + +F
Sbjct: 322 ARHVADHRRIADATVLALV--PHDL--DLRLPDAIGTTPHA------------ALAQAVF 365
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
GRYLLI+SSRPG+ ANLQG+WN D P W S +N+NLEM YW + L EC EP
Sbjct: 366 DHGRYLLIASSRPGSPPANLQGVWNADPRPPWSSNYTLNVNLEMAYWGAEAVGLGECHEP 425
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLC 471
L + L+ +G+ A+ Y GWV HH +D+W + A G WA W MGG WLC
Sbjct: 426 LLAHVGLLARHGAHVARELYGCQGWVAHHNSDVWGWALPVGAGHGDPSWAQWWMGGVWLC 485
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP-- 529
HLW+H + D FL A+PLL G A F LDWL+E DG L T+PSTSPE++F P
Sbjct: 486 RHLWDHADVGGDDAFLRDEAWPLLRGAALFCLDWLVEAPDGSLTTSPSTSPENQFRLPSS 545
Query: 530 ----DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
G + ++ STMD+A++R++ + + L+ +D L ++ +L RL +
Sbjct: 546 ADGTGGGVGALATGSTMDLALVRDLLERCLDTIDRLDL-DDPLEGRLRSALARLARPVVG 604
Query: 586 EDGSIMEWVQ 595
DG + EW
Sbjct: 605 PDGLLREWAH 614
>gi|313149260|ref|ZP_07811453.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
gi|313138027|gb|EFR55387.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
Length = 829
Score = 268 bits (686), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 182/600 (30%), Positives = 292/600 (48%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P N + L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSKGAEAYWNVNKQSAHVLDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + + Y
Sbjct: 135 QAFSEGDQKKAAMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYVETGLSAVNMS--GY 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R +F S P V+ + G +L+F+ + + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERSYFISYPANVMAIRFKADRPGKQNLTFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G+N + A+ D G+Q+ ++ I + GT+S D K+
Sbjct: 253 GSMTTDGSNGLTY-------------TAHLDNNGMQY--VVRIYATTKGGTLSN-ADGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D AV L+ A + +FD F +P +P + + + ++ Y L+ +H
Sbjct: 297 TVKDADEAVFLITADTDYKINFDPDFKDPKTYVGVNPAETTRQWMDNAVSMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ ++ +P+A+R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLN-----------PDQQSTNLPTAKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + P NL+EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACPTNLNECTLPLV 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTAPLESEDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P KI G +MEW
Sbjct: 577 IDEGTTFVHAVIREILLDAIEASKVLGVDSKERKQWQEVLA---HLAPYKIGRYGQLMEW 633
>gi|421295152|ref|ZP_15745870.1| alpha-L-fucosidase [Streptococcus pneumoniae GA56113]
gi|395891509|gb|EJH02504.1| alpha-L-fucosidase [Streptococcus pneumoniae GA56113]
Length = 749
Score = 268 bits (686), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 185/578 (32%), Positives = 289/578 (50%), Gaps = 56/578 (9%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEA 87
+PIGNG LG M++G E ++LN++T+W D NPD+ L +R + G+ +A
Sbjct: 1 MPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKA 60
Query: 88 TA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVG 142
+ +F P D Y+LLG++ +E D A Y RELDL+TA + V + +
Sbjct: 61 EELIKLTMFATPRDQSHYELLGELCIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSC 119
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCP 200
N++ RE+F+S ++ +I S +L+ N++L + ++ ++ I+M
Sbjct: 120 NLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAG 179
Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
G+ KG+QF + K++D G +S L + + + + L L + + +
Sbjct: 180 GR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDY 224
Query: 261 DGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
G +S+LQ ++ Y H+ YQ+ F+RV +L S +
Sbjct: 225 WGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL 271
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+I T E K + L LLF +GRYLLISSS+P ANLQGIW
Sbjct: 272 --------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWC 319
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++L+P W S +NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+
Sbjct: 320 DELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGF 379
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
HH TD + ++ + A+W + WLCTH+WEHY Y D L + + +++
Sbjct: 380 TAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAF 438
Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
F D+L E DGYL T PS SPE+++ +G SST+D I+R + I A+
Sbjct: 439 LFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAK 497
Query: 560 VLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
L N D + V+++ K LPR TKI +G I EW++
Sbjct: 498 QLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLE 532
>gi|336404392|ref|ZP_08585089.1| hypothetical protein HMPREF0127_02402 [Bacteroides sp. 1_1_30]
gi|335943224|gb|EGN05065.1| hypothetical protein HMPREF0127_02402 [Bacteroides sp. 1_1_30]
Length = 850
Score = 268 bits (686), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 191/606 (31%), Positives = 299/606 (49%), Gaps = 84/606 (13%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 95 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 154
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 155 QAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 213
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 214 RILSLDSAMAVVQFKKDHVAYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 273
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + N ++ +A+ D GI++ ++ I+ GT+S D KL
Sbjct: 274 NMASDSNKGLVY-------------SASLDNNGIKY--VVRIQAETKGGTLSN-ADGKLT 317
Query: 244 VEGSDWAVLLLVASS----SFDGPF--------INPSDSKKDPTSESMSALQSIRNLSYS 291
V+G+D V + A + +FD F +NP ++ K+ + ++S Y+
Sbjct: 318 VKGADEVVFYITADTDYKPNFDPDFKEPKTYVGVNPEETTKEWMNNAVSQ-------GYT 370
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLV 350
L+++H +DY LF+RV + L+ + K +P+ +R+K+++ + D L
Sbjct: 371 ALFSQHYNDYAALFNRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYDLE 419
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
EL FQFGRYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+E
Sbjct: 420 ELYFQFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNE 479
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAW 469
C PL DF+ L G KTA+ + A GW +I+ ++ + + W PM G W
Sbjct: 480 CMLPLVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPW 539
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
L TH+WE+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 540 LATHIWEYYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH----- 594
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAED 587
+ +T A++RE+ I A++VL +K E E VL + L P KI
Sbjct: 595 ----GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRY 647
Query: 588 GSIMEW 593
G +MEW
Sbjct: 648 GQLMEW 653
>gi|293371889|ref|ZP_06618293.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292633135|gb|EFF51712.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 829
Score = 268 bits (685), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 187/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFSSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
>gi|237718842|ref|ZP_04549323.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229451974|gb|EEO57765.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 829
Score = 268 bits (685), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 187/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
>gi|423293334|ref|ZP_17271461.1| hypothetical protein HMPREF1070_00126 [Bacteroides ovatus
CL03T12C18]
gi|392678277|gb|EIY71685.1| hypothetical protein HMPREF1070_00126 [Bacteroides ovatus
CL03T12C18]
Length = 829
Score = 268 bits (685), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 187/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
>gi|150003836|ref|YP_001298580.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149932260|gb|ABR38958.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
Length = 818
Score = 268 bits (685), Expect = 6e-69, Method: Compositional matrix adjust.
Identities = 198/639 (30%), Positives = 301/639 (47%), Gaps = 82/639 (12%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ LS++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLSEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ GR
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ L + +Y++L RH DY +LF RV +QL+ +P
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A++ L + + + VLK L P +I G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620
>gi|336412577|ref|ZP_08592930.1| hypothetical protein HMPREF1017_00038 [Bacteroides ovatus
3_8_47FAA]
gi|335942623|gb|EGN04465.1| hypothetical protein HMPREF1017_00038 [Bacteroides ovatus
3_8_47FAA]
Length = 799
Score = 268 bits (684), Expect = 7e-69, Method: Compositional matrix adjust.
Identities = 187/600 (31%), Positives = 297/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTEKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADRENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYAALFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
>gi|255693316|ref|ZP_05416991.1| fibronectin type III domain protein [Bacteroides finegoldii DSM
17565]
gi|260620891|gb|EEX43762.1| hypothetical protein BACFIN_08516 [Bacteroides finegoldii DSM
17565]
Length = 861
Score = 268 bits (684), Expect = 9e-69, Method: Compositional matrix adjust.
Identities = 193/647 (29%), Positives = 316/647 (48%), Gaps = 78/647 (12%)
Query: 1 MMNAESTSTT--NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
++NA++T PL+ T++ PAK + ++A+PIGNG +GAM++GGV + ++ NE TLW+
Sbjct: 21 VVNAKTTDRNFPPPLRATYDTPAKIWESEALPIGNGYMGAMIFGGVYVDVIQTNEHTLWS 80
Query: 58 GVP--------GDYTNPDAPK-ALSDVRSLV----------------------------- 79
G P G P+ K L R+L+
Sbjct: 81 GGPSENPGYNGGHLRTPEINKDNLQKARNLLQQKMIDFMADKAAHFDANGKLITYDYEGD 140
Query: 80 ----DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHL-KYAEETYRRELDLNTAT 134
D +Y + A + + FG YQ L +I + +++ A Y R LD++ +
Sbjct: 141 GEETDLRRYIDNIAGTKEHFGS----YQTLSNIVITSNNTKCPDCAYSDYNRTLDIDNSI 196
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
V Y + + RE+F S PD V+V +++ +S ++L+SL + ++ N I
Sbjct: 197 HTVSYKESGITYKREYFMSYPDNVMVIRLTSDSKDGISRTIALESLHKTKNIISEGNTIT 256
Query: 195 MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLL 254
M G P K + G++++ ++ + +D G ISA+ D +KV G+ V+L+
Sbjct: 257 MTGY-PTPVGGDKRVGDHWKNGLRYAQ--QVMVRNDGGKISAV-DGMIKVAGAKEIVILM 312
Query: 255 VASSSFDGPFINPSD--SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
A++++ + + SK+DP + + L+ SY L H DY+ L+ R+ I L
Sbjct: 313 SAATNYVQCMDDSYNFFSKEDPLDKVKAILKKASAKSYKKLLIAHQKDYRSLYDRMKINL 372
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVA 372
+ V T D + ++ ++ L L +QFGRYLLISSSR G+ A
Sbjct: 373 GNVKEAPVMTT------DKLLKGMDERTNLQADNLYLEMLYYQFGRYLLISSSREGSLPA 426
Query: 373 NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV 432
NLQG+W + L W+S H NIN++MNYW + P NLS C P+ +++ L G TAQ
Sbjct: 427 NLQGVWADRLQNAWNSDYHTNINVQMNYWPAQPTNLSPCHLPMVEYVKSLVPRGRYTAQH 486
Query: 433 NYL------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
Y GWV HH+ +IW ++ + K +P G W+C +WE+Y + DR F
Sbjct: 487 YYCRPDGKPVRGWVTHHENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNQDRKF 545
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
LE+ +L+ ++ + + DG L NPS SPEH + L C + A+
Sbjct: 546 LEEYYDTMLQAALFWVDNLWTDKRDGMLVANPSHSPEH----GEYSLGC-----STSQAM 596
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
I E+F+ +I A++ L + D ++++ SL +L KI G MEW
Sbjct: 597 IWEIFNIMIKASKELGRENDPEIKEISASLAKLSGPKIGLGGQFMEW 643
>gi|294775002|ref|ZP_06740531.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|294451046|gb|EFG19517.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 818
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 198/639 (30%), Positives = 300/639 (46%), Gaps = 82/639 (12%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWKVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ GR
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ L + +Y++L RH DY +LF RV +QL+ +P
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A+IRE+ I
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVIREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A++ L + + + VLK L P +I G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620
>gi|298480149|ref|ZP_06998348.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
gi|298273958|gb|EFI15520.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
Length = 837
Score = 267 bits (682), Expect = 1e-68, Method: Compositional matrix adjust.
Identities = 188/599 (31%), Positives = 296/599 (49%), Gaps = 70/599 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 82 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 141
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 142 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 200
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 201 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 260
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +GN ++ +A+ D G+++ ++ I+ GT+ D KL
Sbjct: 261 NMASDGNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLFN-ADGKLT 304
Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRHL 298
V+G+D V + A + +FD F +P +P + + + + Y+ L+++H
Sbjct: 305 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVNPEETTKEWMNNAVSQGYTALFSQHY 364
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
+DY LF+RV + L+ + K +P+ +R+K+++ + D L EL FQFG
Sbjct: 365 NDYAALFNRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYDLEELYFQFG 413
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL D
Sbjct: 414 RYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECMLPLVD 473
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWE 476
F+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH+WE
Sbjct: 474 FIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIWE 533
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH +
Sbjct: 534 YYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GPI 584
Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+T A++RE+ I A++VL +K E E VL + L P KI G +MEW
Sbjct: 585 DQGATFVHAVVREILLDAIEASKVLGIDKKERKQWEHVLAN---LVPYKIGRYGQLMEW 640
>gi|213693185|ref|YP_002323771.1| hypothetical protein Blon_2335 [Bifidobacterium longum subsp.
infantis ATCC 15697 = JCM 1222]
gi|384200416|ref|YP_005586159.1| glycosyl hydrolase [Bifidobacterium longum subsp. infantis ATCC
15697 = JCM 1222]
gi|213524646|gb|ACJ53393.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis ATCC 15697 = JCM 1222]
gi|320459368|dbj|BAJ69989.1| glycosyl hydrolase [Bifidobacterium longum subsp. infantis ATCC
15697 = JCM 1222]
Length = 782
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 189/609 (31%), Positives = 297/609 (48%), Gaps = 56/609 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + H+ + IP GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSHWEEGIPFGNGRMGAVLCSEPDADVLYLNDDTLWSGYPHAETSPLTPEIV 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL--EFDDSHLKYAEET-----YR 125
+ R G Y AT D Q D ++ F + ++Y+ E +
Sbjct: 61 AKARQASSRGDYVSATRII-------QDATQREKDEQIYEPFGTACIRYSSEAGERKHVK 113
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL A A + +G + + + S PD ++V ++S S S +V+ + L
Sbjct: 114 RSLDLARALAGESFRLGAADVHVDAWCSAPDDLLVYEMSSSAPVDASVSVT-GTFLKQTR 172
Query: 186 YVNGNNQ------IIMEGRCPGKRIPPKANANDDP-----KGIQFSAILEIKISDDRGTI 234
+G++ +++ G+ PG + A+ D+P GI + ++ G I
Sbjct: 173 ISSGSDSDARQATLVVMGQMPGLNVGSLAHVTDNPWEDERDGIGMAYAGAFSLTVTGGEI 232
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSK---KDPTSESMSALQSIRNLSYS 291
+ ++D L+ G L + S F G P D E+++A S
Sbjct: 233 TVIDDV-LQCSGVTGLSLRFRSLSGFKGSAEQPERDMTVLADRLGETIAAWPS----DSR 287
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP---- 347
+ RH+ DY++ F RV ++L + D EE VP AE ++S ++ P
Sbjct: 288 AMLDRHVADYRRFFDRVGVRLGPAHDD------DEE----VPFAEILRS--KEDTPHRLE 335
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
+L E +F FGRYLLISSSRP TQ +NLQGIWN P W SA NIN+EMNYW + PC
Sbjct: 336 TLSEAMFDFGRYLLISSSRPHTQPSNLQGIWNHKDFPNWYSAYTTNINIEMNYWMTGPCA 395
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
L E EPL L G A G + H DIW ++ G+ WA WP G
Sbjct: 396 LKELIEPLVAMNRELLEPGHDAAGAILGCGGSAVFHNVDIWRRALPANGEPTWAFWPFGQ 455
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
AW+C +L++ Y + D +L +P++ A F +D+L + G L P+TSPE+ F+
Sbjct: 456 AWMCRNLFDEYLFNQDESYLAS-IWPIMRDSARFCMDFLSDTEHG-LAPAPATSPENYFV 513
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEV---LEKNEDALVEKVLKSLPRLRPTKI 584
DG+ V+++S AI+R + +I AA+ L+ + ALV + + +L ++
Sbjct: 514 V-DGETIAVAHTSENTTAIVRNLLDDLIHAAQTMPDLDDGDKALVREAESTRAKLAAVRV 572
Query: 585 AEDGSIMEW 593
DG I+EW
Sbjct: 573 GSDGRILEW 581
>gi|345510592|ref|ZP_08790159.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|345454467|gb|EEO49096.2| glycoside hydrolase family 95 [Bacteroides sp. D1]
Length = 850
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 192/602 (31%), Positives = 297/602 (49%), Gaps = 76/602 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 95 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 154
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 155 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 213
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 214 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 273
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +GN ++ +A+ D G+++ ++ I+ GT+ D KL
Sbjct: 274 NMASDGNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLFN-ADGKLT 317
Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYT 295
V+G+D V + A + +FD F +P + ++ T E M+ S R Y+ L++
Sbjct: 318 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVNPEETTKEWMNNAVSQR---YTALFS 374
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
+H +DY LF RV + L+ + K +P+ +R+K+++ + D L EL F
Sbjct: 375 QHYNDYAALFDRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYYLEELYF 423
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW NL+EC P
Sbjct: 424 QFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWSVCSTNLNECMLP 483
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTH 473
L DF+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH
Sbjct: 484 LVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATH 543
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
+WE+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 544 IWEYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH--------- 594
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ +T A++RE+ I A++VL +K E E VL + L P KI G +M
Sbjct: 595 GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRYGQLM 651
Query: 592 EW 593
EW
Sbjct: 652 EW 653
>gi|298483252|ref|ZP_07001431.1| fibronectin type III domain protein [Bacteroides sp. D22]
gi|298270569|gb|EFI12151.1| fibronectin type III domain protein [Bacteroides sp. D22]
Length = 815
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 184/597 (30%), Positives = 289/597 (48%), Gaps = 70/597 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
+G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLNGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFDKDGIRYQRKYFISYPDSVMVMKFAADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLAKLVPYRIGRYGQLLEW 619
>gi|319639947|ref|ZP_07994674.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
gi|345516953|ref|ZP_08796433.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|254833732|gb|EET14041.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|317388225|gb|EFV69077.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
Length = 818
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 197/639 (30%), Positives = 300/639 (46%), Gaps = 82/639 (12%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ GR
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ L + +Y++L RH DY +LF RV +QL+ +P
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A++ L + + + VLK L P +I G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620
>gi|443630249|ref|ZP_21114539.1| putative Fibronectin type III domain-containing protein
[Streptomyces viridochromogenes Tue57]
gi|443336258|gb|ELS50610.1| putative Fibronectin type III domain-containing protein
[Streptomyces viridochromogenes Tue57]
Length = 744
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 186/586 (31%), Positives = 290/586 (49%), Gaps = 64/586 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-----DYTN-----PDAPKALSD- 74
+A+PIGNG LGAMV+G + SE L+ NE TLWTG PG D+ N PDA A+ D
Sbjct: 14 EALPIGNGALGAMVFGTLASERLQFNEKTLWTGGPGSAQGYDHGNWRTPRPDAITAVQDD 73
Query: 75 --VRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R+ +D + A+ +G +Q GD+ L+ + + YRRELDL+
Sbjct: 74 LDARTTLDPEEVADRLGQPRIGYG----AHQTFGDLHLDIPGAPTTPPAD-YRRELDLDK 128
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A V Y+ V R+ +S PD VI ++ GS++F + S + + +
Sbjct: 129 AVASVGYTYQGVRHQRDFLASYPDGVIAGRLHADRPGSVTFTLRYTSPRADFTATAADGT 188
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
+ + G A A++ G++F A ++++ GT+++ + + V G+D A
Sbjct: 189 LTVRG----------ALADN---GLRFEA--QVRVRSRGGTVTSDANGTITVTGADSAWF 233
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+L A + + + P DP + A++ + Y L RH+ D++ LF RV++ +
Sbjct: 234 VLAAGTDYADTY--PDYRGPDPHAAVGRAVRQAGD-RYEALLARHVRDHRALFRRVALDI 290
Query: 313 SRS-PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
+S P D+ TD +A+R L F++GRYLLI+SSRPG+
Sbjct: 291 GQSLPADVPTDRLLAAYAGGAGAADRALE----------ALYFEYGRYLLIASSRPGSLP 340
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQG+WN +P W + H NIN++MNYW + NL+E P F+ L G +TAQ
Sbjct: 341 ANLQGVWNNSTTPPWSADYHTNINIQMNYWPAEAANLAETTPPYDRFVEALRAPGRRTAQ 400
Query: 432 VNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ + GWV+H++T+ + + D W +P AWL L+EHY + D+L
Sbjct: 401 EMFGSRGWVVHNETNPYGFTGVHDWATAFW--FPEAAAWLTQQLYEHYRFAGSTDYLRTT 458
Query: 491 AYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIR 548
AYP ++ F LD L + DG L PS SPEH +F A + M I+
Sbjct: 459 AYPAMKEATEFWLDNLRTDPRDGTLVVTPSYSPEHGDFTA----------GAAMSQQIVH 508
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
++F++ + AA +L D +V +L RL P +I G + EW
Sbjct: 509 DLFTSTLEAARILGDAPD-FRRRVEAALNRLDPGLRIGSWGQLQEW 553
>gi|307565695|ref|ZP_07628164.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A]
gi|307345521|gb|EFN90889.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A]
Length = 771
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 187/598 (31%), Positives = 289/598 (48%), Gaps = 68/598 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVRSL 78
++PIGNG LGA + GG+ + LNE +LW G PG N + L +R
Sbjct: 64 SLPIGNGSLGANIMGGIACDRFTLNEKSLWRGGPGVKGGAAYYWDQNKQSAHFLKAIRKA 123
Query: 79 VDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD-----------SHLKYAEETYRRE 127
G A + F A Y + + F + H + Y+R
Sbjct: 124 FLQGNTKLAAKLTQDNFNGKA-AYSIATEPHFRFGNFTTMGEVTIQTGHKEQDISGYKRC 182
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS--GSESGSLSFNVSLDSLLDNHS 185
L L++A A V Y + R +F S PD V+V K + G++ +L+ + +
Sbjct: 183 LSLDSAIASVSYHTNTTYYKRSYFISYPDNVMVIKYTAKGADLLNLTLTYTPSPIAQGQV 242
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ + I +G+ ND+ ++F+ + IK + D GT S + D KL +
Sbjct: 243 VNDSTDGITYKGKL-----------NDN--NMRFT--IRIKANIDSGT-SKVIDGKLHIL 286
Query: 246 GSDWAVLLLVASSSFDGPFINPS--DSKK----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ L A + + NPS D K +P + ++ Y++L HL
Sbjct: 287 KAKTVTFFLTADTDYKQN-TNPSFTDPKTYIGVNPDKTTKKWIKHALQKGYNNLLNNHLA 345
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF RV + ++ KD C +P+ +R++ ++T + D L L FQ+GR
Sbjct: 346 DYTPLFKRVKLIINPDDKDTKEALC-------LPTNKRLQRYRTGKADYDLEALYFQYGR 398
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPGT ANLQG+W+ ++ W H NINL+MNYW +L NL+EC PL +F
Sbjct: 399 YLLIASSRPGTLPANLQGLWHNNVDGPWRVDYHNNINLQMNYWHALTTNLAECALPLNNF 458
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G +TA+ Y A GW ++I+ ++ K + W L P+ G WL THLWE+
Sbjct: 459 ICMLEKPGRRTAKAYYNARGWTTSISSNIFGFTAPLIDKDMTWNLSPISGPWLSTHLWEY 518
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y++T ++ +L AYP+L+G A F +D+L DG PSTSPEH +
Sbjct: 519 YDFTRNKTYLRNTAYPILKGSAQFAVDFLWHKPDGTYTAAPSTSPEH---------GSID 569
Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+T A++RE+ + I+A++VL ++ E EKVL +L P +I G +MEW
Sbjct: 570 QGATFVHAVVREILTDAIAASKVLDIDRKERKQWEKVLL---KLSPYRIGRYGQLMEW 624
>gi|423313025|ref|ZP_17290961.1| hypothetical protein HMPREF1058_01573 [Bacteroides vulgatus
CL09T03C04]
gi|392686239|gb|EIY79545.1| hypothetical protein HMPREF1058_01573 [Bacteroides vulgatus
CL09T03C04]
Length = 818
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 197/639 (30%), Positives = 300/639 (46%), Gaps = 82/639 (12%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ GR
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ L + +Y++L RH DY +LF RV +QL+ +P
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A++ L + + + VLK L P +I G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620
>gi|262406087|ref|ZP_06082637.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|294648155|ref|ZP_06725698.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294809712|ref|ZP_06768400.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|262356962|gb|EEZ06052.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|292636539|gb|EFF55014.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294443087|gb|EFG11866.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 830
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 192/602 (31%), Positives = 297/602 (49%), Gaps = 76/602 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 75 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 135 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 193
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 194 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 253
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +GN ++ +A+ D G+++ ++ I+ GT+ D KL
Sbjct: 254 NMASDGNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLFN-ADGKLT 297
Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYT 295
V+G+D V + A + +FD F +P + ++ T E M+ S R Y+ L++
Sbjct: 298 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVNPEETTKEWMNNAVSQR---YTALFS 354
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
+H +DY LF RV + L+ + K +P+ +R+K+++ + D L EL F
Sbjct: 355 QHYNDYAALFDRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYYLEELYF 403
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW NL+EC P
Sbjct: 404 QFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWSVCSTNLNECMLP 463
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTH 473
L DF+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH
Sbjct: 464 LVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATH 523
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
+WE+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 524 IWEYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH--------- 574
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ +T A++RE+ I A++VL +K E E VL + L P KI G +M
Sbjct: 575 GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRYGQLM 631
Query: 592 EW 593
EW
Sbjct: 632 EW 633
>gi|299145135|ref|ZP_07038203.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298515626|gb|EFI39507.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 815
Score = 266 bits (680), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 182/597 (30%), Positives = 290/597 (48%), Gaps = 70/597 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSAGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + +YRR
Sbjct: 123 FLDGDSQKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--SYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFDKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + ++ Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMNNVLKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF+RV ++++ E +P+ +R+ +++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEINQ-----------EIGSPNLPTYKRLANYKKGTPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NL EC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLPECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619
>gi|242815487|ref|XP_002486578.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218714917|gb|EED14340.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 787
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 180/596 (30%), Positives = 288/596 (48%), Gaps = 56/596 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ A F A+P+GNGRLG +++ P+E + LNE+++W+G + NP+A L++VR
Sbjct: 29 YTSAATDFNSALPVGNGRLGGLMYC-TPTERVSLNENSIWSGPFLNRLNPNAKSVLTEVR 87
Query: 77 SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
S+++SG A ++ + G+P Y LG + L+F S ++ + R LD
Sbjct: 88 SMLESGNITGAGQVALPNMAGNPNSPQHYTPLGQLNLDFGHS----SQGSLNRWLDTYQG 143
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD----NHSYVNG 189
+ Y V +TRE ++ P V+ ++ S++G L+ +SL L + S G
Sbjct: 144 NSGCSYIYNGVNYTREIIANYPTGVLAMRLQASQAGQLNIKISLSRLQNVISNTASTSGG 203
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N I+M+G G +P F+A ++ S + L V G+
Sbjct: 204 ANSIVMKGNSGGS----------NPY---FAAEAQVIASGGS---VSASGSTLSVSGATT 247
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ A +S+ ++ +E L S + Y L T + D L RVS
Sbjct: 248 VDIFFDAEASYR------YSTEAAAETELTRKLSSATSQGYQALRTAAIADNTALVGRVS 301
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR- 366
+ L S P+ +R+ +++++ D LV L++ GR+LL++SSR
Sbjct: 302 LNLGSSSGSAANQ----------PTDKRLSNYKSNPGNDVQLVTLMYNMGRHLLVASSRD 351
Query: 367 --PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
P + ANLQGIWNED +P W S +NINLEMNYW + NL+E +P +D L
Sbjct: 352 TGPLSLPANLQGIWNEDFNPAWGSKYTININLEMNYWHAETTNLAETTKPFWDLLAVAKT 411
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G A Y SG+V+HH D W + + +WP+GG WL THL EHY +T ++
Sbjct: 412 RGELAASSMYGCSGFVLHHNIDCWGDPAPVDYGTPYTIWPLGGVWLSTHLMEHYRFTGNK 471
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYS 539
FL++ A+P+L+ A F + +GY T PS SPE+ FI P G + S
Sbjct: 472 TFLQETAWPILQSAADFCFCYTFL-WNGYYTTGPSLSPENSFIVPSNESKAGNAEGIDIS 530
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
TMD +++ ++FS +I A ++L L +++P + G I+EW Q
Sbjct: 531 PTMDNSLLYQLFSDVIEACQILGLTSSE-CSNAKNYLSKIKPPQTGSYGQILEWRQ 585
>gi|345514340|ref|ZP_08793853.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|229437320|gb|EEO47397.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
Length = 818
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 197/639 (30%), Positives = 299/639 (46%), Gaps = 82/639 (12%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + V+G N+++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKVDGPNRLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAIDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A++ L + + + VL L P +I G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEW 620
>gi|423301304|ref|ZP_17279328.1| hypothetical protein HMPREF1057_02469 [Bacteroides finegoldii
CL09T03C10]
gi|408471905|gb|EKJ90434.1| hypothetical protein HMPREF1057_02469 [Bacteroides finegoldii
CL09T03C10]
Length = 802
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 200/637 (31%), Positives = 304/637 (47%), Gaps = 98/637 (15%)
Query: 4 AESTSTTNPLKITFNGP---AKHF---TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
A T T L I F+ P +H + ++PIGNG LGA + G V +E + NE TLW
Sbjct: 20 AGETEYTKGLSIWFDTPNVMEEHTAWESRSLPIGNGSLGANIIGSVDTERITFNEKTLWR 79
Query: 58 GVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH--PADV------ 101
G P +Y N + L ++R G +A + + F P +
Sbjct: 80 GGPNTAKGAEYYWNVNKQSAHVLDEIRKAFTEGDQQKAEMLTRQNFNSEVPYEANREKPF 139
Query: 102 ----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
+ ++G+ +E L ++ Y+R L L++A A V++ NV + R +F S P
Sbjct: 140 RFGNFTIMGEFYVETGLDTLGISD--YKRILSLDSALAVVQFKKNNVAYQRSYFISYPAN 197
Query: 158 VIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK 215
V+V + S +G +L F+ + +S I +G G D K
Sbjct: 198 VMVMRFSADRAGMQNLVFSYAPNS--------------ISQGSLSG----------DGDK 233
Query: 216 GIQFSA---------ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN 266
G+ FSA ++ I+ GT+S +L V+G+D V + A + + F N
Sbjct: 234 GLVFSASLNNNGMKYVVRIQAETKGGTLSN-AGCRLTVKGADEVVFYVTADTDYKMNF-N 291
Query: 267 P--SDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
P D K DP + + + Y+ L+ +H DY LF+R+ + L+ + K
Sbjct: 292 PDFKDPKTYVGVDPAETTCQWINNAVMQGYTALFQQHYSDYAALFNRLRLNLNPTVK--- 348
Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+P+ +R+K+++ + D L EL +QFGRYLLI+SSR G ANLQGIW+
Sbjct: 349 --------TSDIPTPQRLKNYRNGQPDYYLEELYYQFGRYLLIASSRAGNMPANLQGIWH 400
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
D+ W H NIN++MNYW + P NLSEC PL DF+ L G KTAQ + A GW
Sbjct: 401 NDVDGPWRVDYHNNINVQMNYWPACPTNLSECMLPLVDFIRTLVKPGEKTAQSYFGARGW 460
Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
++I+ ++ + + W PM G WL TH+WE+Y+YT D +FL++ Y L++
Sbjct: 461 TASISSNIFGFTTPLESQDMSWNFNPMAGPWLATHIWEYYDYTRDLNFLKETGYELIKSS 520
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
A F +D+L DG PSTSPEH V +T A++RE+ I A+
Sbjct: 521 ADFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLDAIEAS 571
Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+VL +K + VL +L P KI G +MEW
Sbjct: 572 KVLGVDKKKRKQWNDVLS---KLVPYKIGRYGQLMEW 605
>gi|224026224|ref|ZP_03644590.1| hypothetical protein BACCOPRO_02980 [Bacteroides coprophilus DSM
18228]
gi|224019460|gb|EEF77458.1| hypothetical protein BACCOPRO_02980 [Bacteroides coprophilus DSM
18228]
Length = 825
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 183/598 (30%), Positives = 285/598 (47%), Gaps = 70/598 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP------GDY--TNPDAPKALSDVRSL 78
++P+GNG +GA + G V E NE TLW G P Y N ++ L D+R
Sbjct: 70 SLPVGNGSIGANIMGSVSVERFTFNEKTLWRGGPRTVKNAASYWNVNKESAHVLKDIRQA 129
Query: 79 VDSGQYAEATAASVKLFG----HPADV--------YQLLGDIELEFDDSHLKYAEETYRR 126
G +AT + F + AD + G+ ++ KY+ Y R
Sbjct: 130 FADGNVEKATQLTQDNFNSEVPYEADAEEPFRFGSFTSCGEFRIQTGLDEQKYS--GYSR 187
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNH 184
L L++A V++ V + R+ F+S P V+V + + + +L N + + L +H
Sbjct: 188 SLLLDSALVTVRFEQEGVHYRRDFFTSYPHNVMVVRFTADQEKRQNLVLNYTPNPL--SH 245
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N+ +G C R+ Q ++ K + G + + V
Sbjct: 246 GKFKAENR---DGFCFDARLDNN----------QMHYVVRAKAVAEGGKVWTDRQGNIHV 292
Query: 245 EGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRHLD 299
EG+D L+ A + +FD F +P DP + ++ +LSY++L H
Sbjct: 293 EGADEVYFLITADTDYQINFDPDFKDPKTYVGVDPLRTTREWMKQAASLSYAELLGEHYT 352
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF R ++L+ K +T +P+ R++ ++T D SL L +QFGR
Sbjct: 353 DYAALFGRTQLELNPDQKGGMT----------LPTPRRLERYRTGAPDYSLESLYYQFGR 402
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ ++ W H NIN++MNYW + P NLSEC++PL DF
Sbjct: 403 YLLIASSRPGNLPANLQGMWHNNVDGPWRVDYHNNINVQMNYWPACPTNLSECEQPLIDF 462
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ G +TA+ + A GW ++I+ ++ R K + W P+ G WL TH+W +
Sbjct: 463 IRMQVKPGKETARAYFGARGWTTSISSNIFGFTTPLRDKDMSWNFSPVAGPWLATHVWNY 522
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D +FL Y L++G A F +D+L DG PSTSPEH +
Sbjct: 523 YDYTRDLEFLRTVGYDLIKGAADFSVDYLWHKPDGTYTAAPSTSPEH---------GPID 573
Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+T A+IRE+ I A+ L ++ E A E+VL+ +P P +I G +MEW
Sbjct: 574 QGATFSHAVIREILLDAIEASRTLNVDEQERARWEEVLQGMP---PYQIGRYGQLMEW 628
>gi|293373575|ref|ZP_06619926.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292631473|gb|EFF50100.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 815
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 184/597 (30%), Positives = 288/597 (48%), Gaps = 70/597 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619
>gi|423214546|ref|ZP_17201074.1| hypothetical protein HMPREF1074_02606 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692961|gb|EIY86197.1| hypothetical protein HMPREF1074_02606 [Bacteroides xylanisolvens
CL03T12C04]
Length = 815
Score = 266 bits (679), Expect = 3e-68, Method: Compositional matrix adjust.
Identities = 184/597 (30%), Positives = 288/597 (48%), Gaps = 70/597 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619
>gi|423214184|ref|ZP_17200712.1| hypothetical protein HMPREF1074_02244 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693129|gb|EIY86364.1| hypothetical protein HMPREF1074_02244 [Bacteroides xylanisolvens
CL03T12C04]
Length = 850
Score = 265 bits (678), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 188/599 (31%), Positives = 294/599 (49%), Gaps = 70/599 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 95 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 154
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 155 QAFMEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 213
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 214 RILSLDSAMAVVQFKKDYVAYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 273
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + N ++ +A+ D G+++ ++ I+ GT+S D KL
Sbjct: 274 NMASDSNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLSN-ADGKLT 317
Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRHL 298
V+G+D V + A + +FD F +P P + + + + Y+ L+++H
Sbjct: 318 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVKPEETTKEWMNNAVSQGYTALFSQHY 377
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
+DY LF+RV + L+ + K +P+ +R+K+++ + D L EL FQFG
Sbjct: 378 NDYAALFNRVKLNLNPAIKG-----------KNMPTPQRLKNYRAGQPDYDLEELYFQFG 426
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL D
Sbjct: 427 RYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECMLPLVD 486
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWE 476
F+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH+WE
Sbjct: 487 FIHTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIWE 546
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH +
Sbjct: 547 YYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GPI 597
Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+T A++RE+ I A++VL +K E E VL + L P KI G +MEW
Sbjct: 598 DQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRYGQLMEW 653
>gi|237722074|ref|ZP_04552555.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229448943|gb|EEO54734.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 815
Score = 265 bits (678), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 182/597 (30%), Positives = 289/597 (48%), Gaps = 70/597 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFT--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + ++ Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMNNVLKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF+RV ++++ E +P+ +R+ +++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEINQ-----------EIGSPNLPTYKRLANYKKGTPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NL EC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLPECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619
>gi|255693982|ref|ZP_05417657.1| fibronectin type III domain protein [Bacteroides finegoldii DSM
17565]
gi|260620198|gb|EEX43069.1| hypothetical protein BACFIN_09249 [Bacteroides finegoldii DSM
17565]
Length = 820
Score = 265 bits (678), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 191/608 (31%), Positives = 294/608 (48%), Gaps = 73/608 (12%)
Query: 18 NGPAKHFTDA-IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP------GDYTNPDAPK 70
N P K + ++ +PIGNG LGA + G + +E + LNE TLW G P G Y N +
Sbjct: 53 NNPDKAWENSSLPIGNGSLGANILGSISAERITLNEKTLWKGGPNTAKGAGYYWNVNKQS 112
Query: 71 A--LSDVRSLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSH 116
A L D+R G +A + + F A+ + +G++ +E S
Sbjct: 113 ANILKDIRQAFLDGNKEKAARLTQENFNGLAEYEERDETPFRFGSFTTMGELYIETGLSE 172
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+ + Y R L L++A A V++ E+ R++F S PD V+V K + ++ G + +S
Sbjct: 173 INM--KNYHRILSLDSAMAVVQFDKDGTEYQRKYFISYPDSVMVMKFTANKKGKQNLVLS 230
Query: 177 LDSLLDNHSYV--NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
+ SY+ +GNN + G N N + A+ +G I
Sbjct: 231 YCPNSEAESYLSADGNNGLGYTGVL---------NNNKMKFAFRIKAL-------HKGGI 274
Query: 235 SALEDKKLKVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLS 289
E+ ++ V+ +D V LL A + +F+ F +P KDP +++ + +
Sbjct: 275 LKTENSRIIVKDADEVVFLLTADTDYKINFNPDFNDPQTYVGKDPEQTTLAMMNNALEKG 334
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPS 348
Y L H DY LF+RV +Q++ E +P+ +R+ +++ D
Sbjct: 335 YDKLIRNHKTDYTALFNRVQLQIN-----------PEAGTPDLPTYKRLDNYRKGVPDYQ 383
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L +L +QFGRYLLI+SSRPG ANLQG+W+ +L W H NIN++MNYW + NL
Sbjct: 384 LEQLYYQFGRYLLIASSRPGNMPANLQGLWHNNLDGPWRVDYHNNINIQMNYWPACSANL 443
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-WALWPMGG 467
SEC PL DF+ L G KTAQ + A GW +I+ ++ K + W L P+ G
Sbjct: 444 SECTWPLIDFIRSLVKPGEKTAQSYFNARGWTASISANIFGFTAPLSSKSMEWNLNPIVG 503
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
WL TH+WE+Y+YT D+ FL + Y L++ A F +D L DG PSTSPEH
Sbjct: 504 PWLATHIWEYYDYTRDKRFLSEIGYELIKSSAQFTVDHLWHKPDGTYTAAPSTSPEH--- 560
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIA 585
V T A++RE+ I A++VL ++ E E + L +L P +I
Sbjct: 561 ------GPVDEGVTFAHAVVREILLDAIQASKVLGVDRKERRQWENI---LAKLVPYRIG 611
Query: 586 EDGSIMEW 593
G ++EW
Sbjct: 612 RYGQLLEW 619
>gi|220911208|ref|YP_002486517.1| twin-arginine translocation pathway signal [Arthrobacter
chlorophenolicus A6]
gi|219858086|gb|ACL38428.1| twin-arginine translocation pathway signal [Arthrobacter
chlorophenolicus A6]
Length = 781
Score = 265 bits (677), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 189/605 (31%), Positives = 287/605 (47%), Gaps = 56/605 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP-----DAPKA 71
+ GPA+ F +++P+GNG GA + G E +++NE + W+G P D + P +
Sbjct: 4 YRGPAEKFVESLPVGNGLAGATLRGLAGGERIQINEGSAWSG-PTDRSAPPLDPAEGTAR 62
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L VR VD+G A + G + Y L L D + R LDL
Sbjct: 63 LHAVREAVDAGDVRRAEELLLAFQGTHSQAY--LPFAVLSVDAEGTAAPADGPARWLDLR 120
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
T A +Y + E F+S+PD VIV I+ S L ++ D + G +
Sbjct: 121 TGVAGHRYLLDGAEARHRTFASHPDAVIVHDIAFSAPADLRIGIAPDKI-----TATGMD 175
Query: 192 QIIME-------GRCPGKRIPPKANANDDP----KGIQFSAILEIKISDDRGTISALEDK 240
+ + G + P D P G + A+ +D +
Sbjct: 176 AVTRDWGTELRLGLLLPADVAPAHEQADHPVVYGHGSRAGAVHAGVATDGD---AGFARG 232
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR------NLSYSDLY 294
L + G+ + +++ + + PF +++ D +++++ L S R +
Sbjct: 233 VLAIRGATFVRIVVATGTVLNHPFARHANTADD--ADALAGLLSARIAGVLEEEAVEPAL 290
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELL 353
RHL D+ +L+ RV+++L P+ ER+++F+TD+ D +L+ LL
Sbjct: 291 QRHLADHARLYSRVTLELG----------GGPAAAAGKPTDERIRAFETDKSDSALMALL 340
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
F +GRYLLI+SSR G ANLQGIWNE+L W S +NIN +MNYW +L +L+EC E
Sbjct: 341 FHYGRYLLIASSREGGFPANLQGIWNEELQAPWSSNYTININTQMNYWPALTTSLAECHE 400
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWL 470
PL + L+ A Y A GWV HH TD W A +G +WA W MGG WL
Sbjct: 401 PLLRLVDTLART-GAAAAGLYGARGWVAHHNTDPWGHPFAVGAGKGNAMWASWAMGGTWL 459
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
+W HY +T D LEK ++P LEG F LDW+ T+PSTSPE+ F+A D
Sbjct: 460 AEAVWRHYAFTGDLARLEK-SWPALEGACLFALDWITGEPGSGTHTSPSTSPENRFVADD 518
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE--KVLKSLPRLRPTKIAEDG 588
G A V S+TMD++++R + + AA VL L E + + +LP+ I G
Sbjct: 519 GGPAAVGRSATMDVSLLRALCGSARQAAAVLGAPVPWLDEFTRKVAALPQ---PAIGSRG 575
Query: 589 SIMEW 593
++EW
Sbjct: 576 EVLEW 580
>gi|317505590|ref|ZP_07963500.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
gi|315663302|gb|EFV03059.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
Length = 828
Score = 265 bits (677), Expect = 5e-68, Method: Compositional matrix adjust.
Identities = 181/600 (30%), Positives = 296/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++P+GNG LGA + G + +E + NE TLW G P DA L+++R
Sbjct: 72 SQSLPLGNGSLGANIMGSIEAERITFNEKTLWRGGPNTSAGADAYWNVNKQSAHYLNEIR 131
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + +E Y
Sbjct: 132 QAFIEGDEKKAALLTRKNFNSTVPYESWKENPFRFGNFTTMGEFYIETGLSSIGMSE--Y 189
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R +F S P+ V+V + + G +L F+ + +
Sbjct: 190 KRALSLDSALATVQFKKDGVRYERNYFISYPNNVMVVRFKADQPGKQNLVFSYESNPVST 249
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G+N ++ KA+ +++ Q ++ I+ + GTIS ++ KL
Sbjct: 250 GKMEADGSNGLVF-----------KAHLDNN----QMEYVVRIQALNQGGTISN-DNGKL 293
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
+ G++ V L+ A + +F+ F NP + +P+ + + ++ Y L H
Sbjct: 294 SINGANEVVFLITADTDYKVNFNPDFKNPRAYVGVNPSETTAAWMKKAVAQGYDALLQVH 353
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQF 356
DY LF+RVS+ L+ K +P+ +R+ +++ ED L EL +QF
Sbjct: 354 YKDYASLFNRVSLTLNDGQK-----------TQDIPTPQRLINYRKGKEDYYLEELYYQF 402
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NLSEC PL
Sbjct: 403 GRYLLIASSRPGNLPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPAGSTNLSECTLPLI 462
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 463 DFIRTLVKPGEKTAKAYFGARGWTASISGNIFGFTAPLESEDMSWNFNPMAGPWLATHVW 522
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
++Y+YT D+ FL++ Y L++ A F +D+L + DG PSTSPEH
Sbjct: 523 DYYDYTRDKKFLKEVGYDLIKSSAIFAVDYLWKKPDGTYTAAPSTSPEH---------GP 573
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL +K E E+VL+ ++ P K+ G ++EW
Sbjct: 574 IDEGTTFVHAVIREILMNAIDASKVLNVDKKERKQWEEVLR---KIAPYKVGRYGQLLEW 630
>gi|429749280|ref|ZP_19282413.1| hypothetical protein HMPREF9075_01080 [Capnocytophaga sp. oral
taxon 332 str. F0381]
gi|429168711|gb|EKY10529.1| hypothetical protein HMPREF9075_01080 [Capnocytophaga sp. oral
taxon 332 str. F0381]
Length = 805
Score = 265 bits (676), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 191/608 (31%), Positives = 307/608 (50%), Gaps = 70/608 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++ PIGNGR+GAM++GG ++ + LNE +LW+G + P A + L
Sbjct: 23 VSVVFHNPATHFTESAPIGNGRIGAMLYGGTSTDRIVLNEISLWSGGAQESDEPQAYEYL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
++ L+ + EA A + F G+ A+ YQ+ GD+ +++ D+
Sbjct: 83 PHIQQLLLERKNIEAEALLQQHFIAKGEGSCRGNGANCSYGCYQIFGDLLIKWKDTS--- 139
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y R L L+ ATA Y T+ F+ + +I KIS + F V++
Sbjct: 140 PVQNYSRILRLDEATAVTTYQRNGNTITQTAFADFKNDIIWVKISAQKP----FEVAVSL 195
Query: 180 LLDNHSYVNG-NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK----ISDDRGTI 234
++ V+ ++II+ G P N + +G+ F+ I+ ++ + D I
Sbjct: 196 TRKENAIVSYLPDRIILTGVLP----------NKEQQGMHFAGIVALESDGNMQKDEAAI 245
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ ++L LL S S + + N + P + + LQ+ N +
Sbjct: 246 TVQNAREL----------LLKVSMSTNYNYTNSGLTAVSPLETTKAYLQTA-NSDFESAL 294
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
T+ YQ+LF+R +R DT S + + +R+++F + +L+ +L+
Sbjct: 295 TKSKSAYQELFNR-----NRWYAKANADTQS------LSTLQRLENFSKGKKDALLPILY 343
Query: 355 -QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FGRYLLI SSR G ANLQG+W E+ W+ H+NINL+MNYW + NLS E
Sbjct: 344 YNFGRYLLICSSREGLLPANLQGLWAEEYQTPWNGDYHLNINLQMNYWLAEISNLSNLTE 403
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PL F L NG KTA+ Y A GWV H ++ W +S VW GGAWLC H
Sbjct: 404 PLHRFTKNLMPNGRKTAKSYYKAEGWVAHVISNPWFFTSPGES-AVWGSTLTGGAWLCQH 462
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP--D 530
+W+HY +T D DFL K YP+++ +F +LI+ Y T PS SPE+ ++ P
Sbjct: 463 IWQHYLFTHDLDFL-KNYYPVMKEATAFFQSFLIKDPTTDYWVTAPSNSPENAYLFPIDS 521
Query: 531 GK--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE--KVLKSLPRLRPTKIAE 586
GK A + TMDM I+RE+ + I AA +L+ +++ + E K++++ P P +I +
Sbjct: 522 GKKVAAHTCIAPTMDMQIVRELLNNTIKAATILKVDDEKITEWKKIVENTP---PNRIGK 578
Query: 587 DGSIMEWV 594
G + EW+
Sbjct: 579 KGDLNEWL 586
>gi|29348582|ref|NP_812085.1| hypothetical protein BT_3173 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340487|gb|AAO78279.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 815
Score = 265 bits (676), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 190/599 (31%), Positives = 294/599 (49%), Gaps = 74/599 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR-S 77
++PIGNG LGA + G V +E + LNE TLW G P +Y N + L ++R +
Sbjct: 63 SLPIGNGSLGANILGSVAAERITLNEKTLWKGGPNTSKGAEYYWDVNKQSAGVLKEIRQA 122
Query: 78 LVDSGQYAEATAASVKLFGHPA-----------DVYQLLGDIELEFDDSHLKYAEETYRR 126
+D + A G A + +G++ +E + L+ + YRR
Sbjct: 123 FLDEDKEKAAQLTRNNFNGLAAYEEKDETPFRFGSFTTMGELYVETGLNELRMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A V++ V++ R++F S PD V+V K + ++SG + +S +S ++
Sbjct: 181 ILSLDSAMVVVQFDKDGVQYQRKYFISYPDSVMVMKFTANQSGKQNLILSYCPNSEAKSN 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+G + ++ G D G++F+ IK GT+ A E+ +L V
Sbjct: 241 LRADGKDGLVYTGVL-------------DNNGMKFA--FRIKAIHKGGTLEA-ENDRLIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINP--SDSK----KDPTSESMSALQSIRNLSYSDLYTRHL 298
+G+D V LL A + + F NP D K DP + + Y +LY H
Sbjct: 285 KGADEVVFLLTADTDYKMNF-NPDFKDPKTYVGNDPEQTTRIMMDQAVQKGYDELYRNHE 343
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
D+ LF+RV +QL+ DI + +P+ +R+ +++ D L +L +QFG
Sbjct: 344 ADHTALFNRVRLQLN---PDISSPN--------LPTYQRLANYKKGTPDYQLEQLYYQFG 392
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSRPG ANLQG+W+ +L W H NIN++MNYW + NLSEC PL D
Sbjct: 393 RYLLIASSRPGNMPANLQGMWHNNLDGPWRVDYHNNINIQMNYWPACSANLSECTWPLID 452
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-WALWPMGGAWLCTHLWE 476
F+ L G +TAQ + A GW +I+ ++ ++ W L P G WL TH+WE
Sbjct: 453 FIRSLVKPGEQTAQAYFNARGWTASISANIFGFTAPLSSNMMSWNLNPTAGPWLATHIWE 512
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
+Y+YT D+ FL++ Y L++ A F +D L DG PSTSPEH +
Sbjct: 513 YYDYTRDKKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPI 563
Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
T A++RE+ I A++ L + E EK+L +L P +I G +MEW
Sbjct: 564 DEGVTFAHAVVREILLDAIQASKELGIDSKERKQWEKILD---KLVPYRIGRYGQLMEW 619
>gi|262405728|ref|ZP_06082278.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|294644470|ref|ZP_06722231.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294806820|ref|ZP_06765646.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|345510919|ref|ZP_08790478.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|229442942|gb|EEO48733.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|262356603|gb|EEZ05693.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|292640192|gb|EFF58449.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294445990|gb|EFG14631.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 815
Score = 265 bits (676), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 184/597 (30%), Positives = 288/597 (48%), Gaps = 70/597 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTVGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619
>gi|295085494|emb|CBK67017.1| Trehalose and maltose hydrolases (possible phosphorylases)
[Bacteroides xylanisolvens XB1A]
Length = 782
Score = 265 bits (676), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 188/606 (31%), Positives = 296/606 (48%), Gaps = 84/606 (13%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 75 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 135 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 193
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 194 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 253
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + N ++ +A+ D G+++ ++ I+ GT+S D KL
Sbjct: 254 NMASDSNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLSN-ADGKLM 297
Query: 244 VEGSDWAVLLLVASSSFDGPF------------INPSDSKKDPTSESMSALQSIRNLSYS 291
V+G+D V + A + + F +NP ++ K+ + ++S Y+
Sbjct: 298 VKGADEVVFYITADTDYKPDFDPDFKDPKTYVGVNPEETTKEWMNNAVSQ-------GYT 350
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLV 350
L+++H +DY LF RV + L+ + K +P+ +R+K+++ + D L
Sbjct: 351 ALFSQHYNDYAALFDRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYDLE 399
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
EL FQFGRYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+E
Sbjct: 400 ELYFQFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNE 459
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAW 469
C PL DF+ L G KTA+ + A GW +I+ ++ + + W PM G W
Sbjct: 460 CMLPLVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPW 519
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
L TH+WE+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 520 LATHIWEYYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH----- 574
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAED 587
+ +T A++RE+ I A++VL +K E E VL + L P KI
Sbjct: 575 ----GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRY 627
Query: 588 GSIMEW 593
G +MEW
Sbjct: 628 GQLMEW 633
>gi|336403471|ref|ZP_08584186.1| hypothetical protein HMPREF0127_01499 [Bacteroides sp. 1_1_30]
gi|335945801|gb|EGN07608.1| hypothetical protein HMPREF0127_01499 [Bacteroides sp. 1_1_30]
Length = 815
Score = 264 bits (675), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 184/597 (30%), Positives = 288/597 (48%), Gaps = 70/597 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTVGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTRFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEW 593
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEW 619
>gi|423230473|ref|ZP_17216877.1| hypothetical protein HMPREF1063_02697 [Bacteroides dorei
CL02T00C15]
gi|423240882|ref|ZP_17221996.1| hypothetical protein HMPREF1065_02619 [Bacteroides dorei
CL03T12C01]
gi|423244182|ref|ZP_17225257.1| hypothetical protein HMPREF1064_01463 [Bacteroides dorei
CL02T12C06]
gi|392630838|gb|EIY24820.1| hypothetical protein HMPREF1063_02697 [Bacteroides dorei
CL02T00C15]
gi|392642736|gb|EIY36499.1| hypothetical protein HMPREF1064_01463 [Bacteroides dorei
CL02T12C06]
gi|392643844|gb|EIY37593.1| hypothetical protein HMPREF1065_02619 [Bacteroides dorei
CL03T12C01]
Length = 818
Score = 264 bits (675), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 196/639 (30%), Positives = 298/639 (46%), Gaps = 82/639 (12%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNRLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAIDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A++ L + + + VL L P +I G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEW 620
>gi|318078709|ref|ZP_07986041.1| alpha-L-fucosidase [Streptomyces sp. SA3_actF]
Length = 769
Score = 264 bits (675), Expect = 9e-68, Method: Compositional matrix adjust.
Identities = 188/594 (31%), Positives = 290/594 (48%), Gaps = 59/594 (9%)
Query: 15 ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
+ + PA + T+++P+GNG LGA V+G +P+E ++ E TLWTG PG ++ NP
Sbjct: 32 LRYGAPATDWETESLPVGNGALGASVFGTLPTEHIQFAEKTLWTGGPGTPGYRYGNWENP 91
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
P AL+ VR+ +++ A+ +L G P Y Q GD+ ++ D + + E
Sbjct: 92 R-PDALASVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGDLLIDVDGA--PGSAEG 147
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R LDL A A V Y F R F+S PD+V+V + GS+ N+ S +
Sbjct: 148 YTRTLDLAQALATVSYPHDGTTFHRTVFASCPDKVLVGHFTADRGGSVGLNLRYTSPRQD 207
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +++ + G G++F A +I++ + GT++A D+ L
Sbjct: 208 FTATTNGDRLTVRGAL-------------QDNGMRFEA--QIRLLSEGGTVTANGDR-LT 251
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+D A +L A + + + P DP +A+ Y +L RH D+
Sbjct: 252 VSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVTTAVDQAAARPYRELLDRHTSDHAA 309
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF RV + L + D+ + D + A + +D +L L FQ+GRYLLI+
Sbjct: 310 LFSRVVLDLGQ-------DSAPDRTTDALLKA--YTGGNSADDRALEALFFQYGRYLLIA 360
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR G+ ANLQG WN +P W + HVNINL+MNYW + NL+E P F+ L
Sbjct: 361 SSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFVEALR 420
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G TA+ + A GWV+H +T + + D W +P AWL + L+EHY +
Sbjct: 421 APGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHYRFDG 478
Query: 483 DRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSS 540
D+L AYP ++ A F +D L + D L PS SPEH +F A +
Sbjct: 479 STDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA----------GA 528
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
M I+RE+F + AA+ L ++ A + ++L R+ P +I G +MEW
Sbjct: 529 AMSQQIVRELFLNTLEAAQTL-GDDPAFRATLKETLDRIDPGLRIGSWGQLMEW 581
>gi|265752589|ref|ZP_06088158.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|263235775|gb|EEZ21270.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
Length = 818
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 195/639 (30%), Positives = 300/639 (46%), Gaps = 82/639 (12%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFT---------DAIPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ +++PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPVTDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGIT--NYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNRLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L +G PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAIDYLWHKPEGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A++ L + + + VLK L P +I G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEW 620
>gi|317505420|ref|ZP_07963340.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
gi|315663460|gb|EFV03207.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
Length = 861
Score = 264 bits (675), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 186/616 (30%), Positives = 295/616 (47%), Gaps = 85/616 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG------------------------- 61
++PIGNG +GA ++G + +E + LNE +LW G PG
Sbjct: 79 SLPIGNGSVGANIFGSISAERITLNEKSLWRGGPGVSHDASYYWNVNDNNVFPVNIDDGH 138
Query: 62 --DY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQL------LGDI-- 108
Y N + L D+R+ +G A+A + + K F A Q G+
Sbjct: 139 DASYYWNVNKRSVSVLKDIRAAFLAGDKAKADSLTRKNFNGWASYEQRDEKPFRFGNFTT 198
Query: 109 --ELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS 166
EL + + YRREL L++A V+++ V + R F S PD V+V + +
Sbjct: 199 MGELFIETGLTEEGISHYRRELSLDSARTLVQFNQNGVCYQRTAFVSYPDNVLVLRFKAN 258
Query: 167 ESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
G +L+F+ + + + +G N ++ G D G+Q+ ++
Sbjct: 259 AEGRQNLNFSYAPNPVSTGQMQADGANGLVYRGAL-------------DDNGMQY--VVR 303
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESM 279
I+ G+++ D LK+ +D + L+ A + +F+ F NP P +
Sbjct: 304 IQAVTKGGSVTNEHDT-LKIRHADEVMFLITADTDYRINFNPDFTNPKTYVGVQPEVTTQ 362
Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
+ +Q Y+ L++RH DY LF RV ++L+ S D P+A+R++
Sbjct: 363 AWMQQAEKKDYNQLFSRHYRDYSALFQRVKLRLN----------PSNHAADDKPTAQRLE 412
Query: 340 SFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
+++ D +L EL +QFGRYLLI+SSRPGT ANLQG+W+ ++ W H NINL+M
Sbjct: 413 AYRNGTTDNALEELYYQFGRYLLIASSRPGTLPANLQGLWHNNVDGPWHVDYHNNINLQM 472
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK- 457
NYW +L EC PL DF+ L G++TA+ Y A GW ++I+ ++ +
Sbjct: 473 NYWPVHTTHLDECALPLIDFVRSLVKPGAETAKAYYGARGWTTSVSSNIFGFTAPLSSED 532
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
+ W L PMGG WL THLWE+Y++T D+ L Y L++ A F +D+L DG
Sbjct: 533 MSWNLCPMGGPWLATHLWEYYDFTRDKQLLRSTLYDLIKQSADFAVDYLWRKPDGTYTAA 592
Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
PSTSPEH + T A+IRE+ I+A++VL + +A ++ + L
Sbjct: 593 PSTSPEH---------GPIDEGVTFVHAVIREILLDAIAASKVLGVDVEAR-KQWQQVLN 642
Query: 578 RLRPTKIAEDGSIMEW 593
L P +I G + EW
Sbjct: 643 HLAPYRIGRYGQLQEW 658
>gi|424665546|ref|ZP_18102582.1| hypothetical protein HMPREF1205_01421 [Bacteroides fragilis HMW
616]
gi|404574619|gb|EKA79368.1| hypothetical protein HMPREF1205_01421 [Bacteroides fragilis HMW
616]
Length = 829
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 179/600 (29%), Positives = 291/600 (48%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESSREKPFRFGNFTTMGEFYIETGLSAVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ + + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I + GT+S D K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHATAKGGTLSN-ADGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
++ +D V L+ A + +FD F +P +P + + + + Y L+ +H
Sbjct: 297 TIKDADEVVFLVTADTDYKINFDPDFKDPKTYVGVNPAETTRQWMDNAVTMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ ++ ++P+A+R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLN-----------PDQQSPSLPTAKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 406 GRYLLITSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECTLPLV 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTAPLESEDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILLDAIEASKVLGVDSKERKQWQEVLA---HLAPYKVGRYGQLMEW 633
>gi|160884032|ref|ZP_02065035.1| hypothetical protein BACOVA_02006 [Bacteroides ovatus ATCC 8483]
gi|423291498|ref|ZP_17270346.1| hypothetical protein HMPREF1069_05389 [Bacteroides ovatus
CL02T12C04]
gi|156110374|gb|EDO12119.1| hypothetical protein BACOVA_02006 [Bacteroides ovatus ATCC 8483]
gi|392663498|gb|EIY57048.1| hypothetical protein HMPREF1069_05389 [Bacteroides ovatus
CL02T12C04]
Length = 829
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 186/600 (31%), Positives = 296/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-----DY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGVDYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVCIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLGELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESRDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKGRKQWEHVLAN---LVPYQIGRYGQLMEW 632
>gi|256818918|ref|YP_003140197.1| glycoside hydrolase [Capnocytophaga ochracea DSM 7271]
gi|256580501|gb|ACU91636.1| glycoside hydrolase family protein [Capnocytophaga ochracea DSM
7271]
Length = 835
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 191/605 (31%), Positives = 309/605 (51%), Gaps = 63/605 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G D +P+A L
Sbjct: 52 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQDADDPNAHNYL 111
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 112 KEIQKLLLEGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 168
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + N + F+ + VI KI + L+ ++SL
Sbjct: 169 PIQDYKRVLRLDEATAVTSFKRDNNSIEQTAFADFKNDVIWVKIKATSP--LNLDISLFR 226
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P ND +G+ F+++++++ G I +
Sbjct: 227 K-ENATITYQNNKISLNGVLP----------NDGKEGMHFASVVDVQTD---GKIESTH- 271
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 272 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 325
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
+Q+LF+R + N + + + ER++ F E +L+ +L+
Sbjct: 326 SSIVFQRLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYN 374
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 375 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 434
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 435 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 493
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T D +FL + YP+L+ +F LI+ GY T PS SPE+ ++ P DG
Sbjct: 494 QHYLFTKDINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 552
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I ++G
Sbjct: 553 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKEGD 611
Query: 590 IMEWV 594
+ EW+
Sbjct: 612 LNEWL 616
>gi|212692624|ref|ZP_03300752.1| hypothetical protein BACDOR_02121 [Bacteroides dorei DSM 17855]
gi|212664909|gb|EEB25481.1| hypothetical protein BACDOR_02121 [Bacteroides dorei DSM 17855]
Length = 818
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 195/639 (30%), Positives = 299/639 (46%), Gaps = 82/639 (12%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFT---------DAIPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ +++PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPVTDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNRLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAIDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A++ L + + + VL L P +I G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEW 620
>gi|318059330|ref|ZP_07978053.1| alpha-L-fucosidase [Streptomyces sp. SA3_actG]
Length = 783
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 188/594 (31%), Positives = 290/594 (48%), Gaps = 59/594 (9%)
Query: 15 ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
+ + PA + T+++P+GNG LGA V+G +P+E ++ E TLWTG PG ++ NP
Sbjct: 46 LRYGAPATDWETESLPVGNGALGASVFGTLPTEHIQFAEKTLWTGGPGTPGYRYGNWENP 105
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
P AL+ VR+ +++ A+ +L G P Y Q GD+ ++ D + + E
Sbjct: 106 R-PDALASVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGDLLIDVDGA--PGSAEG 161
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R LDL A A V Y F R F+S PD+V+V + GS+ N+ S +
Sbjct: 162 YTRTLDLAQALATVSYPHDGTTFHRTVFASCPDKVLVGHFTADRGGSVGLNLRYTSPRQD 221
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +++ + G G++F A +I++ + GT++A D+ L
Sbjct: 222 FTATTNGDRLTVRGAL-------------QDNGMRFEA--QIRLLSEGGTVTANGDR-LT 265
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+D A +L A + + + P DP +A+ Y +L RH D+
Sbjct: 266 VSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVTTAVDQAAARPYRELLDRHTSDHAA 323
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF RV + L + D+ + D + A + +D +L L FQ+GRYLLI+
Sbjct: 324 LFSRVVLDLGQ-------DSAPDRTTDALLKA--YTGGNSADDRALEALFFQYGRYLLIA 374
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR G+ ANLQG WN +P W + HVNINL+MNYW + NL+E P F+ L
Sbjct: 375 SSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFVEALR 434
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G TA+ + A GWV+H +T + + D W +P AWL + L+EHY +
Sbjct: 435 APGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHYRFDG 492
Query: 483 DRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSS 540
D+L AYP ++ A F +D L + D L PS SPEH +F A +
Sbjct: 493 STDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA----------GA 542
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
M I+RE+F + AA+ L ++ A + ++L R+ P +I G +MEW
Sbjct: 543 AMSQQIVRELFLNTLEAAQTL-GDDPAFRATLKETLDRIDPGLRIGSWGQLMEW 595
>gi|332882772|ref|ZP_08450383.1| hypothetical protein HMPREF9074_06194 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|332679274|gb|EGJ52260.1| hypothetical protein HMPREF9074_06194 [Capnocytophaga sp. oral
taxon 329 str. F0087]
Length = 805
Score = 264 bits (674), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 191/601 (31%), Positives = 300/601 (49%), Gaps = 56/601 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F PAKHFT+++PIGNGRLGA+++G ++ + LNE +LW+G + +P+A L
Sbjct: 23 VSVVFKQPAKHFTESLPIGNGRLGAILFGKTDTDRIVLNEISLWSGGYQEADDPEAHTYL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A K F G A+ YQ+ D+ L++ + +
Sbjct: 83 KEIQQLLLEGKNLEAQALLQKHFIARGKGSCHGQGANCSYGCYQVFADLLLDWKN---QT 139
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA Y+ + F+ + ++ KI+G++ N+SL
Sbjct: 140 PVKDYKRVLRLDEATAITTYTRDENSIEQVAFADFKNDLLWIKITGTKP--FDLNISLFR 197
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN I + G P +D +G+ F++ ++++ T E+
Sbjct: 198 K-ENATISYQNNHITLTGVLP----------DDKKEGMHFASAIDVQ------TDGKAEN 240
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
K+ +E L+L S + + + N S ++ S LQ + S+
Sbjct: 241 KEKAIEIQAAKELILKISMATNYQYKNGGLSNVSVKEKAESYLQRCTS-SFEAALAESKT 299
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGR 358
YQ LF++ +R + + N + + ER++ F + D+D L L + FGR
Sbjct: 300 IYQGLFNK-----NRWYGN------ANSNTSHLSTYERLEGFYKGDKDALLPILYYNFGR 348
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + NLSE EPL F
Sbjct: 349 YLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEATNLSELTEPLNRF 408
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L NG KTA+ Y A GWV H ++ W +S VW GGAWLC H+W+HY
Sbjct: 409 TKNLVPNGYKTAKAYYNADGWVAHVISNPWFYTSPGE-SAVWGSTLTGGAWLCEHIWQHY 467
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP----DGKL 533
+T D DFL K YP+L+ F LI E GY T PS SPE+ ++ P ++
Sbjct: 468 LFTHDIDFL-KEYYPVLKQATDFFKSLLIKEPKKGYWITAPSNSPENAYLLPSKDNKKQV 526
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ TMDM I+RE+FS + AA +L + D + + P +I + G + EW
Sbjct: 527 GNTCIAPTMDMQIVRELFSNTMQAATILGVDSDKFSQWT-DIIKHTAPNRIGKKGDLNEW 585
Query: 594 V 594
+
Sbjct: 586 L 586
>gi|423219674|ref|ZP_17206170.1| hypothetical protein HMPREF1061_02943 [Bacteroides caccae
CL03T12C61]
gi|392624879|gb|EIY18957.1| hypothetical protein HMPREF1061_02943 [Bacteroides caccae
CL03T12C61]
Length = 831
Score = 263 bits (673), Expect = 1e-67, Method: Compositional matrix adjust.
Identities = 184/600 (30%), Positives = 288/600 (48%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG +GA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G A+A + + F + +G+ +E + + ++ Y
Sbjct: 135 KAFTEGDQAKAERLTRQNFNSEVPYEGSREKPFRFGSFTTMGEFYVETGLNIIGMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R +F S P V+V + S G +L F+ + + +
Sbjct: 193 KRILSLDSAMAVVQFKKDDVAYQRNYFISYPANVLVMRFSADRPGKQNLIFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
G+N ++ +A D G+++ ++ I+ GT+ + KL
Sbjct: 253 GSMVAQGDNGLVY-------------SAALDNNGMKY--VVRIQAETKGGTLVN-RNGKL 296
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRH 297
V+G+D V + A + + F + K +P + L + YS L H
Sbjct: 297 TVKGADEVVFYVTADTDYKANFAPDFKNPKTYVGVNPVETTGQWLANAVAKGYSALLNEH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DY LF+RV + L+ + K +P+ +R+K+++ + D L EL FQF
Sbjct: 357 YQDYAALFNRVKLNLNPTVK-----------TGNLPTGQRLKNYRKGQPDYYLEELYFQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLEECMLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGEKTAQSYFGARGWTASISANIFGFTAPLESQDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A+F +D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDLKFLKETGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A+E L +K E E+VL + L P KI G +MEW
Sbjct: 577 IDQGATFVHAVVREILLDAIKASEELGVDKKERKEWEQVLAN---LVPYKIGRYGQLMEW 633
>gi|393782601|ref|ZP_10370784.1| hypothetical protein HMPREF1071_01652 [Bacteroides salyersiae
CL02T12C01]
gi|392672828|gb|EIY66294.1| hypothetical protein HMPREF1071_01652 [Bacteroides salyersiae
CL02T12C01]
Length = 804
Score = 263 bits (671), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 177/607 (29%), Positives = 297/607 (48%), Gaps = 67/607 (11%)
Query: 17 FNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD-------- 67
F PA+++++ A+ IGNG +GA +G V E + E T WTG P ++ PD
Sbjct: 35 FTYPARNWSEQALHIGNGYMGASFYGDVEKERFDIAEKTFWTGGP--HSVPDFNYGVVKG 92
Query: 68 APKALSDVRSLVDSGQYAEATAAS-VKLFGHPADV--YQLLGDIELEFDDSHLKYAEETY 124
++ +R + ++AEA + S + + G + + ++G++ ++F + + Y
Sbjct: 93 GKDKIAAIRRSITDRRFAEADSLSRLYMVGDYTNYGYFSMVGNLFVDFGKKNQPV--QNY 150
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R +DL+T+ V+Y+ G+V F RE+F S PD+++ + + G +SF++S +
Sbjct: 151 LRGIDLSTSRGFVEYTQGDVRFNREYFCSYPDKLMALHFTADQKGKISFSLSHSLVYQPE 210
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
G +++I G G G+ ++ + +K+ G+I + +++ V
Sbjct: 211 KVTEGKDELIFNGIIQGN-------------GLGYT--IRMKVLHQGGSIK-VGHQQITV 254
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
EG+D A + + + + P + P + ++S Y + H+ DYQ L
Sbjct: 255 EGADEATVFYTVDTEYSP--VYPLYKGEKPRQTTEKIIKSAITKGYETVKHTHISDYQTL 312
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLI 362
++RV LS DT SE+ +P+ RVK Q +D SL L F RYLLI
Sbjct: 313 YNRVKFTLS-------GDTASEK----LPTDIRVKQLQQGFTDDASLKVLWFNLSRYLLI 361
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
S+SRPGT +NLQG+WN W+ NINL+ YW P L EC+E +++ L
Sbjct: 362 SASRPGTLPSNLQGVWNTFEKAPWNGNFQSNINLQEMYWGCGPTQLPECEEAYLEWIEGL 421
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G KTA Y GWV H +IW + ++W L+P G AW C HLWEHY +
Sbjct: 422 VEPGRKTAGEYYGTKGWVSHSTGNIWGHTVPGD-DILWGLYPSGAAWHCRHLWEHYAFGG 480
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST- 541
D+ +LE + YP+++ A F L+ ++E + + PS S EH +G + V YS+
Sbjct: 481 DKSYLETKGYPIMKEAAEFWLENMVE-YQKHFIIAPSVSAEHGIEMKNG--SPVDYSTAN 537
Query: 542 --------------MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
D+ ++ ++++ +I A+E L + A EKV + +L P KI
Sbjct: 538 GEQTAGRIFTLPAYQDIEMVYDLYTHVIKASECL-GIDSAFREKVTIARNKLLPLKIGRY 596
Query: 588 GSIMEWV 594
G + EW+
Sbjct: 597 GQLQEWI 603
>gi|210613381|ref|ZP_03289701.1| hypothetical protein CLONEX_01908 [Clostridium nexile DSM 1787]
gi|210151223|gb|EEA82231.1| hypothetical protein CLONEX_01908 [Clostridium nexile DSM 1787]
Length = 1549
Score = 262 bits (669), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 192/611 (31%), Positives = 305/611 (49%), Gaps = 88/611 (14%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG----DYTNPDAPK------ALSDVRS 77
+PIGNG +GA V+G + SE L NE TLWTG P DY ++ + +L +++
Sbjct: 73 LPIGNGDMGANVYGEIASEHLTFNEKTLWTGGPSESRKDYMGGNSTEKGQDGASLKNIQK 132
Query: 78 LVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
L G+ +EATAA L G+ A YQ GDI ++ D K A E Y+R+LDL T
Sbjct: 133 LFAEGKTSEATAACNNLLVGISNGYGA--YQPWGDIYFDYKDITEKNATE-YQRDLDLKT 189
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A + V + ++TRE F S+ D V+V ++ S L+ +V S + GN+
Sbjct: 190 AISTVSFKEDGTQYTREFFMSHDDDVLVARLEAKGSEKLNLDVRFPSKQGGKTVAEGNDT 249
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
+ + G ++ ++++ L +K D G+++ DK L V+ + +
Sbjct: 250 LKLCGALTDNQM-------------KYASYLTVKA--DNGSVTGSGDK-LTVKDASAVTV 293
Query: 253 LLVASSSFDGPFINPSDSKKD---PTSESMSAL-QSIRNL-------SYSDLYTRHLDDY 301
L A++ + F N D +D T E+ AL + ++ Y ++ HL+DY
Sbjct: 294 YLSAATDYKNAFYN-EDKTEDYYYRTGETDEALAKRVKETVDKAVEKGYKEVKATHLEDY 352
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF+RVS+ + + T SE+ D + + S E L +LFQ+GRYL
Sbjct: 353 QELFNRVSLNIGQ--------TVSEKTTDDLLKTYKDGSASESEKRQLENMLFQYGRYLT 404
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
I+SSR +Q+ +NLQG+WN +P W S H+N+NL+MNYW + NLSEC PL D++
Sbjct: 405 IASSREDSQLPSNLQGVWNSLTNPPWSSDYHMNVNLQMNYWPTYSTNLSECALPLIDYVD 464
Query: 421 YLSINGSKTAQV-------NYLASGWVIHHKTD-------IWAKSSADRGKVVWALWPMG 466
L G TA+V + A+G++ H + WA S W P
Sbjct: 465 SLREPGRVTAKVYAGVESKDGEANGFMAHTQNTPFGWTCPGWAFS--------WGWSPAA 516
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
W+ + WE+Y +T D +F+E+ YP+L+ A+F L E DG L ++PS SPEH
Sbjct: 517 VPWILQNCWEYYEFTGDTEFMEENIYPMLKEEATFYNQILTEDKDGKLVSSPSYSPEH-- 574
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIA 585
+ +T + +I +++ AAEVL ++ + L K ++ +L+ P +I
Sbjct: 575 -------GPYTAGNTYEHTLIWQLYEDAAKAAEVLGQDTE-LAAKWKENQSKLKGPIEIG 626
Query: 586 EDGSIMEWVQR 596
+DG I EW +
Sbjct: 627 DDGQIKEWYEE 637
>gi|153805874|ref|ZP_01958542.1| hypothetical protein BACCAC_00113 [Bacteroides caccae ATCC 43185]
gi|149130551|gb|EDM21757.1| hypothetical protein BACCAC_00113 [Bacteroides caccae ATCC 43185]
Length = 833
Score = 262 bits (669), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 183/600 (30%), Positives = 287/600 (47%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG +GA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 77 SQSLPIGNGSIGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIR 136
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + +G+ +E + + ++ Y
Sbjct: 137 KAFTEGDQVKAERLTRQNFNSEVPYEGSREKPFRFGSFTTMGEFYVETGLNIIGMSD--Y 194
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R +F S P V+V + S G +L F+ + + +
Sbjct: 195 KRILSLDSAMAVVQFKKDDVAYQRNYFISYPANVLVMRFSADRPGKQNLIFSYAPNPVST 254
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
G+N ++ +A D G+++ ++ I+ GT+ + KL
Sbjct: 255 GSMVAQGDNGLVY-------------SAALDNNGMKY--VVRIQAETKGGTLVN-RNGKL 298
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRH 297
V+G+D V + A + + F + K +P + L + YS L H
Sbjct: 299 TVKGADEVVFYVTADTDYKANFAPDFKNPKTYVGVNPVETTGQWLANAVAKGYSALLNEH 358
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DY LF+RV + L+ + K +P+ +R+K+++ + D L EL FQF
Sbjct: 359 YQDYAALFNRVKLNLNPTVK-----------TGNLPTGQRLKNYRKGQPDYYLEELYFQF 407
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 408 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLEECMLPLI 467
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 468 DFIRTLVKPGEKTAQSYFGARGWTASISANIFGFTAPLESQDMSWNFNPMAGPWLATHIW 527
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A+F +D+L DG PSTSPEH
Sbjct: 528 EYYDYTRDLKFLKETGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------GP 578
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A+E L +K E E+VL + L P KI G +MEW
Sbjct: 579 IDQGATFVHAVVREILLDAIKASEELGVDKKERKEWEQVLAN---LVPYKIGRYGQLMEW 635
>gi|418113491|ref|ZP_12750487.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41538]
gi|353781702|gb|EHD62143.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41538]
Length = 535
Score = 261 bits (668), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 181/574 (31%), Positives = 286/574 (49%), Gaps = 53/574 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
I+R + I A+ L N D + +K L R
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISR--VKELKR 529
>gi|237709067|ref|ZP_04539548.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|229456763|gb|EEO62484.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
Length = 818
Score = 261 bits (668), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 195/639 (30%), Positives = 298/639 (46%), Gaps = 82/639 (12%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFT---------DAIPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ +++PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPVTDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N ++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNCLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAVDYLWYKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A++ L + + + VL L P +I G +MEW
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEW 620
>gi|393785852|ref|ZP_10373996.1| hypothetical protein HMPREF1068_00276 [Bacteroides nordii
CL02T12C05]
gi|392660966|gb|EIY54563.1| hypothetical protein HMPREF1068_00276 [Bacteroides nordii
CL02T12C05]
Length = 810
Score = 261 bits (668), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 178/610 (29%), Positives = 300/610 (49%), Gaps = 69/610 (11%)
Query: 15 ITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA----- 68
+ F PAK +++ A+ IGNG +GA +G V E L + E T W G P + PD
Sbjct: 35 VWFRYPAKSWSEQALHIGNGYMGASFYGEVEKERLDIAEKTFWAGGP--HAAPDFNYGII 92
Query: 69 ---PKALSDVRSLVDSGQYAEATAAS-VKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
++ +R L+ ++AEA + S + + G + + ++G++ ++F + K +
Sbjct: 93 KGDKDKIATIRQLIVERRFAEADSLSRIYMTGDYTNYGYFSMVGNLWIDFGKN--KQPVQ 150
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y R +DL+T+ V+Y+ G V+F RE+F S PD+++ + ++G +SF++S +
Sbjct: 151 NYLRGIDLSTSRGFVEYTQGGVQFNREYFCSYPDKLMALHFTADKAGKISFSLSHSLVYP 210
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ N + G + N S + IKI G++ + +++
Sbjct: 211 PEEVIESENGLTFNGII-------RKNG--------LSYTIRIKIVQQGGSVK-VAHQRI 254
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VE ++ A + + + P + P ++P + + Y + H+ DYQ
Sbjct: 255 VVEKANEATVFYAVDTEY-AP-VYPLYKGENPQQNTGKVITKAITKGYETVKNTHISDYQ 312
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYL 360
L++RV L+ DT SE+ +P+ RVK Q +D SL L F RYL
Sbjct: 313 TLYNRVRFTLT-------GDTASEQ----LPTNMRVKQLQKGFTDDASLKVLGFNLSRYL 361
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LIS+SRPGT + LQG+WN W+ NINL+ YW P +L EC+E +++
Sbjct: 362 LISASRPGTLPSTLQGVWNTFEKAPWNGNFQSNINLQEMYWGCGPTHLPECEEAYLEWIE 421
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L G +TA+ Y GWV H +IW + ++W L+P G AW C HLWEHY +
Sbjct: 422 GLVEPGRQTAREYYGTKGWVSHSTGNIWGHTVPGD-DILWGLYPSGAAWHCRHLWEHYAF 480
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
D+++L + YP+++ A F L+ ++E + G+ PS S EH +G + V YS+
Sbjct: 481 NGDKEYLRTKGYPIMKEAAEFWLENMVE-YQGHFIIAPSVSAEHGIEMKNG--SPVEYST 537
Query: 541 T---------------MDMAIIREVFSAIISAAEVLEKNEDALV-EKVLKSLPRLRPTKI 584
T D+ ++ +++S +I AAE L N D++ +K+L + +L P KI
Sbjct: 538 TNGEQTEGRLFTVPAYQDIEMVYDLYSHVIKAAECL--NTDSVFRQKLLIAKNKLLPLKI 595
Query: 585 AEDGSIMEWV 594
G + EW+
Sbjct: 596 GRYGQLQEWI 605
>gi|429755750|ref|ZP_19288384.1| hypothetical protein HMPREF9072_01113 [Capnocytophaga sp. oral
taxon 324 str. F0483]
gi|429173108|gb|EKY14643.1| hypothetical protein HMPREF9072_01113 [Capnocytophaga sp. oral
taxon 324 str. F0483]
Length = 799
Score = 261 bits (667), Expect = 7e-67, Method: Compositional matrix adjust.
Identities = 190/605 (31%), Positives = 306/605 (50%), Gaps = 63/605 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
D++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KDIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ A A + N + F+ + VI +I + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEAIAVTSFKRDNNSIEQTAFADFKNDVIWVRIKATSP--LNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P ND +G+ F++I++++ G I +
Sbjct: 191 K-ENATITYQNNKITLNGVLP----------NDGKEGMHFASIVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
+Q+LF+R + N + + + ER+ F E +L+ +L+
Sbjct: 290 SSIVFQRLFNRNRWYGK-----------ANANTEGLTTFERLGRFYKGEQDALLPILYYN 338
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T D +FL + YP+L+ +F LI+ GY T PS SPE+ ++ P DG
Sbjct: 458 QHYLFTKDINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I + G
Sbjct: 517 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575
Query: 590 IMEWV 594
+ EW+
Sbjct: 576 LNEWL 580
>gi|383123942|ref|ZP_09944612.1| hypothetical protein BSIG_4038 [Bacteroides sp. 1_1_6]
gi|251838825|gb|EES66910.1| hypothetical protein BSIG_4038 [Bacteroides sp. 1_1_6]
Length = 812
Score = 261 bits (667), Expect = 8e-67, Method: Compositional matrix adjust.
Identities = 190/637 (29%), Positives = 309/637 (48%), Gaps = 87/637 (13%)
Query: 3 NAESTSTTNPLKITFNGP---------------AKHFTDAIPIGNGRLGAMVWGGVPSET 47
+AEST T L I F+ P A + ++PIGNG +GA + G V +E
Sbjct: 19 HAESTDYTKGLSIWFDSPNTLQGKEVWHSAQQDASWESQSLPIGNGSIGANILGSVEAER 78
Query: 48 LKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH-- 97
+ NE TLW G P DY N + L ++R G +A + + F
Sbjct: 79 ITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIRKAFVEGDQKKAEKLTRENFNSEV 138
Query: 98 PADV----------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFT 147
P + + +G+ +E S + ++ Y+R L L++A A V++ +V +
Sbjct: 139 PYEFSREKPFRFGNFTTMGEFYVETGLSTIGMSD--YKRILSLDSAMAVVQFKKDDVAYQ 196
Query: 148 REHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
R +F S P V+V + S + +L+F + + + +GNN ++
Sbjct: 197 RNYFISYPANVMVMRFSADQPSKQNLTFRYAPNPVSTGQFSTDGNNGLVY---------- 246
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
A+ D G++++ + I+ + + GT++ D ++ V+ +D + + A + + F
Sbjct: 247 ---TASLDNNGMKYA--VRIQATVNGGTLNN-ADGRITVKEADEVIFYVTADTDYKMNFA 300
Query: 266 -NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
+ +D K +P + ++ Y++L H DY LF+RV ++L+ + K
Sbjct: 301 PDFTDPKTYVGVNPLETTQQWMKDAVAKGYANLLNEHYKDYASLFNRVKLELNPTVK--- 357
Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
I +P+A+R+K+++ + D L +L +QFGRYLLI+SSRPG ANLQGIW+
Sbjct: 358 --------IANLPTAQRLKNYRKGQPDYYLEKLYYQFGRYLLIASSRPGNMPANLQGIWH 409
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++ W H NIN++MNYW + NL EC PL DF+ L G KTAQ + A GW
Sbjct: 410 NNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLIDFIRTLVKPGEKTAQSYFGARGW 469
Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
+I+ ++ + + W PM G WL TH+WE+Y+YT D FL++ Y L++
Sbjct: 470 TASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVWEYYDYTKDLKFLKETGYELIKSS 529
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
A+F +D+L DG PSTSPEH V +T A++RE+ I A+
Sbjct: 530 ANFTVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLDAIQAS 580
Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ L +K E E VL + L P KI G ++EW
Sbjct: 581 KELGIDKKERKQWEHVLAN---LVPYKIGRYGQLLEW 614
>gi|336436142|ref|ZP_08615855.1| hypothetical protein HMPREF0988_01440 [Lachnospiraceae bacterium
1_4_56FAA]
gi|336008182|gb|EGN38201.1| hypothetical protein HMPREF0988_01440 [Lachnospiraceae bacterium
1_4_56FAA]
Length = 473
Score = 261 bits (667), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 157/489 (32%), Positives = 250/489 (51%), Gaps = 35/489 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PAK FT+A+P+GNG LGAMV+GGVP E + LN DT W+G K L
Sbjct: 4 KLKYITPAKSFTEALPLGNGSLGAMVYGGVPEEHITLNHDTFWSGTGRRPEKEIDAKILG 63
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L+ ++ EA + G + Y LG++ F++ E Y R LDL
Sbjct: 64 HARELLFEEKFWEAEQFIKEHMLGFYNESYMPLGELNYRFEEIG---EIEQYSRNLDLEN 120
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A ++ N + E F S P + ++ ++ S S L+ +V+L+S + + +
Sbjct: 121 AIFSSEFCSKNTLYQTEVFISYPAKALILRMKVSGSEKLNLSVNLNSKVRHDMKAEVSQD 180
Query: 193 IIMEGRCPGKRIPPKANANDDP-------KGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ + G P + P D P G+ F L I+ + G ++AL+D+ LKV+
Sbjct: 181 LYIFGNAPSN-VQPNYLTCDHPITYDEQNPGMAFGCYLHIE--NTGGEVTALKDE-LKVK 236
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDP---TSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+D + L A + G +KDP ++ +L+ ++N Y L H+ DY+
Sbjct: 237 NADEVLFYLTAEDGYRG---YKKRIEKDPEVCITQCRKSLEILKNRDYESLKQEHIIDYK 293
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLL 361
++ V ++L + D+ P +R+ F+ +D L+ L F + RYL+
Sbjct: 294 SVYKDVRLELEKEESDM-------------PLDQRLAEFRNGKQDLGLLCLFFHYNRYLM 340
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
++SSR G+Q ANLQGIWNE + P W S VNIN EMNYW + CNL + P +F++
Sbjct: 341 VASSRKGSQPANLQGIWNESIRPVWSSNWTVNINTEMNYWMNGSCNLLDSYLPFVEFVSE 400
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
LS G +TA+ Y SGW +H DIW ++ G+ +A WPMGG WLC +E++ Y+
Sbjct: 401 LSDAGKETARKQYHCSGWTANHNVDIWRQTGPVAGEPKYAYWPMGGIWLCAQSYEYFKYS 460
Query: 482 MDRDFLEKR 490
D ++L+++
Sbjct: 461 KDIEYLKQK 469
>gi|265767320|ref|ZP_06094986.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|263252625|gb|EEZ24137.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
Length = 829
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 179/600 (29%), Positives = 287/600 (47%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P +P + + + + Y L+ +H
Sbjct: 297 TVKNADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDSKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
>gi|423259841|ref|ZP_17240764.1| hypothetical protein HMPREF1055_03041 [Bacteroides fragilis
CL07T00C01]
gi|423267496|ref|ZP_17246477.1| hypothetical protein HMPREF1056_04164 [Bacteroides fragilis
CL07T12C05]
gi|387775879|gb|EIK37983.1| hypothetical protein HMPREF1055_03041 [Bacteroides fragilis
CL07T00C01]
gi|392696970|gb|EIY90157.1| hypothetical protein HMPREF1056_04164 [Bacteroides fragilis
CL07T12C05]
Length = 829
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 179/600 (29%), Positives = 288/600 (48%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P + +P + + + + Y L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDSKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
>gi|53715738|ref|YP_101730.1| hypothetical protein BF4459 [Bacteroides fragilis YCH46]
gi|60683673|ref|YP_213817.1| hypothetical protein BF4255 [Bacteroides fragilis NCTC 9343]
gi|336411650|ref|ZP_08592113.1| hypothetical protein HMPREF1018_04131 [Bacteroides sp. 2_1_56FAA]
gi|375360504|ref|YP_005113276.1| hypothetical protein BF638R_4337 [Bacteroides fragilis 638R]
gi|383119758|ref|ZP_09940496.1| hypothetical protein BSHG_3421 [Bacteroides sp. 3_2_5]
gi|423252289|ref|ZP_17233283.1| hypothetical protein HMPREF1066_04293 [Bacteroides fragilis
CL03T00C08]
gi|423252862|ref|ZP_17233793.1| hypothetical protein HMPREF1067_00437 [Bacteroides fragilis
CL03T12C07]
gi|52218603|dbj|BAD51196.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60495107|emb|CAH09926.1| conserved hypothetical exported protein [Bacteroides fragilis NCTC
9343]
gi|251944620|gb|EES85095.1| hypothetical protein BSHG_3421 [Bacteroides sp. 3_2_5]
gi|301165185|emb|CBW24755.1| conserved hypothetical exported protein [Bacteroides fragilis 638R]
gi|335941084|gb|EGN02944.1| hypothetical protein HMPREF1018_04131 [Bacteroides sp. 2_1_56FAA]
gi|392647562|gb|EIY41261.1| hypothetical protein HMPREF1066_04293 [Bacteroides fragilis
CL03T00C08]
gi|392659231|gb|EIY52857.1| hypothetical protein HMPREF1067_00437 [Bacteroides fragilis
CL03T12C07]
Length = 829
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 179/600 (29%), Positives = 287/600 (47%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P +P + + + + Y L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDSKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
>gi|339479496|gb|ABE95964.1| Conserved hypothetical protein (Glycosyl hydrolases family 95)
[Bifidobacterium breve UCC2003]
Length = 783
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 183/606 (30%), Positives = 294/606 (48%), Gaps = 49/606 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + + IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEEGIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVR--SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R SL D A L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQASLQDDYNAATRIIKDATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S S ++ +VS ++++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDASIDVNISVSGTFLKQSRASMETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
D H +++ GR PG I P N +D + G+ ++ + ++ G
Sbjct: 179 FDGHRAT-----LVVMGRMPGLNIGLLPHPSENPWEDEQDGTGMAYAGAFSLTVT---GG 230
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ D L+ L + S F G P S ++ + + +
Sbjct: 231 DVNVGDNSLQCSNITGLSLRFRSMSGFRGSDQQPERSMT-VIADHLEKTIDEWSTDLRTM 289
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL---V 350
+ RH+ DY++ F RV+I L + D DT +P + ++S + E L
Sbjct: 290 FDRHIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDENKEPHRLEMLA 339
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC L E
Sbjct: 340 EAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCALQE 399
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
EPL L + G A G + H D+W ++ G +W+ WP G AW+
Sbjct: 400 LIEPLVSMNEELLVPGHDAADRILGCRGSAVFHNVDLWRRALPANGDPMWSFWPFGQAWM 459
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+ +
Sbjct: 460 CRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETKHG-LAPSPATSPENCFLV-N 516
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKIAED 587
G+L V+ SS AI+R + +I A+ E L++ + LV + L T++ D
Sbjct: 517 GELVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDRDLVHEAESVRSPLAETRLGAD 576
Query: 588 GSIMEW 593
G I+EW
Sbjct: 577 GRILEW 582
>gi|253574361|ref|ZP_04851702.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
gi|251846066|gb|EES74073.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
Length = 793
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 187/604 (30%), Positives = 301/604 (49%), Gaps = 51/604 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDAPKA 71
K+ ++ PA +++ +P+GNGR+GA+V E L E T W+G + A
Sbjct: 12 KLWYDKPAAGWSEGLPVGNGRIGAIVMAAPEREVWNLTESTYWSGQADETASAASGGKAA 71
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEET----- 123
L+ +R + +G YA + + P + + D+ +EF S ET
Sbjct: 72 LAAIRERLFAGDYAGGDRLAKQALQPPKRNFGTHLAMCDVVIEFAPSGEPSETETGAVNG 131
Query: 124 ----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+RRELDL+TA RE F+S+ D V+V++I +G +SF + L
Sbjct: 132 ACSPFRRELDLSTALLTTTSGQPGSTLVRELFASHADDVLVSRIWSEAAGGVSFTLGLAG 191
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
L V+ + +E R GK + +D G++ +E+ D RG +++
Sbjct: 192 LTPEFE-VSASGMAALEFR--GKAT--ETVHSDGACGVRCRGRIEL---DTRGGSLYVQN 243
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+L V G+D A + L ++ + +S+ + + A ++ Y L HL
Sbjct: 244 DRLVVRGADEACIYLTVATDYR------CESRSWELAPRLQASLALSK-GYDQLKADHLA 296
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGR 358
DY+ LF RVSI+L S E +P+ +R++ Q DP L L Q+GR
Sbjct: 297 DYEPLFRRVSIELGPS-----------EEAAKLPTDQRIRLLRQGYSDPQLFALFLQYGR 345
Query: 359 YLLISSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
YL ++ SR + + +LQGIWN E W H+++N EMNY+ + +L E Q+PL
Sbjct: 346 YLTLAGSREDSPLPLHLQGIWNDGEACRMGWSCDYHLDVNTEMNYYPTEVVHLGESQQPL 405
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTHL 474
+L L+ G KTA+ Y + GWV H +++W + D G W L GG WL +
Sbjct: 406 MRYLEDLARAGQKTARDVYGSPGWVAHVFSNVWGFT--DPGWDTSWGLNVTGGLWLAMQM 463
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKL 533
EHY + +DR FLEK+AYP+L A F LD++ + G+L T PS SPE+ F +
Sbjct: 464 IEHYRFGLDRVFLEKQAYPVLREAALFFLDYMTVHPKYGWLVTGPSNSPENHFYPGRPEE 523
Query: 534 AC--VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
C +S STMD A++RE+F+ + AAE+LE++ + L ++ ++P L P +I + G +
Sbjct: 524 GCWQLSMGSTMDQALVRELFTFCLEAAELLEEDVE-LRSRLSSAIPLLPPLQIGKKGQLQ 582
Query: 592 EWVQ 595
EW++
Sbjct: 583 EWLE 586
>gi|29350090|ref|NP_813593.1| hypothetical protein BT_4682 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29342002|gb|AAO79787.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 812
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 189/637 (29%), Positives = 308/637 (48%), Gaps = 87/637 (13%)
Query: 3 NAESTSTTNPLKITFNGP---------------AKHFTDAIPIGNGRLGAMVWGGVPSET 47
+AE T T L I F+ P A + ++PIGNG +GA + G + +E
Sbjct: 19 HAEDTDYTKGLSIWFDSPNTLQGKEVWHSSKQDASWESQSLPIGNGSIGANILGSIEAER 78
Query: 48 LKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH-- 97
+ NE TLW G P DY N + L ++R G +A + + F
Sbjct: 79 ITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIRKAFVEGDQKKAEKLTRENFNSEV 138
Query: 98 PADV----------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFT 147
P + + +G+ +E S + ++ Y+R L L++A A V++ +V +
Sbjct: 139 PYEFSREKPFRFGNFTTMGEFYVETGLSTIGMSD--YKRILSLDSAMAVVQFKKDDVAYQ 196
Query: 148 REHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
R +F S P V+V + S + G +L+F + + + +GNN ++
Sbjct: 197 RNYFISYPANVMVMRFSADQPGKQNLTFRYAPNPVSTGQFSADGNNGLVY---------- 246
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
A+ D G++++ + I+ + GT++ D ++ V+ +D V + A + + F
Sbjct: 247 ---TASLDNNGMKYA--VRIQATVKGGTLNN-TDGRITVKEADEVVFYVTADTDYKMNFA 300
Query: 266 -NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
+ +D K +P + ++ + YS+L H DY LF+RV ++L+ + K
Sbjct: 301 PDFTDPKTYVGVNPLETTQQWMKDAVSKGYSNLLDEHYKDYASLFNRVKLELNPTVK--- 357
Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+P+A+R+K+++ + D L +L +QFGRYLLI+SSRPG ANLQGIW+
Sbjct: 358 --------TSNLPTAQRLKNYRNGQPDYYLEKLYYQFGRYLLIASSRPGNMPANLQGIWH 409
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++ W H NIN++MNYW + NL EC PL DF+ L G KTAQ + A GW
Sbjct: 410 NNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLIDFIRTLVKPGEKTAQSYFGARGW 469
Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
+I+ ++ + + W PM G WL TH+WE+Y+YT D FL++ Y L++
Sbjct: 470 TASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVWEYYDYTKDLKFLKETGYELIKSS 529
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
A+F +D+L DG PSTSPEH + +T A++RE+ I A+
Sbjct: 530 ANFTVDYLWHKPDGTYTAAPSTSPEH---------GPIDQGATFVHAVVREILLDAIQAS 580
Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ L +K E E VL + L P KI G ++EW
Sbjct: 581 KELGIDKKERKQWEHVLAN---LVPYKIGRYGQLLEW 614
>gi|298384410|ref|ZP_06993970.1| fibronectin type III domain protein [Bacteroides sp. 1_1_14]
gi|298262689|gb|EFI05553.1| fibronectin type III domain protein [Bacteroides sp. 1_1_14]
Length = 812
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 189/637 (29%), Positives = 308/637 (48%), Gaps = 87/637 (13%)
Query: 3 NAESTSTTNPLKITFNGP---------------AKHFTDAIPIGNGRLGAMVWGGVPSET 47
+AE T T L I F+ P A + ++PIGNG +GA + G + +E
Sbjct: 19 HAEDTDYTKGLSIWFDSPNTLQGKEVWHSSKQDASWESQSLPIGNGSIGANILGSIEAER 78
Query: 48 LKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH-- 97
+ NE TLW G P DY N + L ++R G +A + + F
Sbjct: 79 ITFNEKTLWRGGPNTTKGADYYWNVNKQSAHILDEIRKAFVEGDQKKAEKLTRENFNSEV 138
Query: 98 PADV----------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFT 147
P + + +G+ +E S + ++ Y+R L L++A A V++ +V +
Sbjct: 139 PYEFSREKPFRFGNFTTMGEFYVETGLSTIGMSD--YKRILSLDSAMAVVQFKKDDVAYQ 196
Query: 148 REHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
R +F S P V+V + S + G +L+F + + + +GNN ++
Sbjct: 197 RNYFISYPANVMVMRFSADQPGKQNLTFRYAPNPVSTGQFSADGNNGLVY---------- 246
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
A+ D G++++ + I+ + GT++ D ++ V+ +D V + A + + F
Sbjct: 247 ---TASLDNNGMKYA--VRIQATVKGGTLNN-TDGRITVKEADEVVFYVTADTDYKMNFA 300
Query: 266 -NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
+ +D K +P + ++ + YS+L H DY LF+RV ++L+ + K
Sbjct: 301 PDFTDPKTYVGVNPLETTQQWMKDAVSKGYSNLLDEHYKDYASLFNRVKLELNPTVK--- 357
Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+P+A+R+K+++ + D L +L +QFGRYLLI+SSRPG ANLQGIW+
Sbjct: 358 --------TSNLPTAQRLKNYRNGQPDYYLEKLYYQFGRYLLIASSRPGNMPANLQGIWH 409
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++ W H NIN++MNYW + NL EC PL DF+ L G KTAQ + A GW
Sbjct: 410 NNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLIDFIRTLVKPGEKTAQSYFGARGW 469
Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
+I+ ++ + + W PM G WL TH+WE+Y+YT D FL++ Y L++
Sbjct: 470 TASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVWEYYDYTKDLKFLKETGYELIKSS 529
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
A+F +D+L DG PSTSPEH + +T A++RE+ I A+
Sbjct: 530 ANFTVDYLWHKPDGTYTAAPSTSPEH---------GPIDQGATFVHAVVREILLDAIQAS 580
Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ L +K E E VL + L P KI G ++EW
Sbjct: 581 KELGIDKKERKQWEHVLAN---LVPYKIGRYGQLLEW 614
>gi|423282784|ref|ZP_17261669.1| hypothetical protein HMPREF1204_01207 [Bacteroides fragilis HMW
615]
gi|404581655|gb|EKA86351.1| hypothetical protein HMPREF1204_01207 [Bacteroides fragilis HMW
615]
Length = 829
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 179/600 (29%), Positives = 288/600 (48%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P + +P + + + + Y L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDGKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
>gi|423271952|ref|ZP_17250921.1| hypothetical protein HMPREF1079_04003 [Bacteroides fragilis
CL05T00C42]
gi|423276043|ref|ZP_17254986.1| hypothetical protein HMPREF1080_03639 [Bacteroides fragilis
CL05T12C13]
gi|392696307|gb|EIY89503.1| hypothetical protein HMPREF1079_04003 [Bacteroides fragilis
CL05T00C42]
gi|392699548|gb|EIY92724.1| hypothetical protein HMPREF1080_03639 [Bacteroides fragilis
CL05T12C13]
Length = 829
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 179/600 (29%), Positives = 288/600 (48%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P + +P + + + + Y L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDGKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
>gi|116624427|ref|YP_826583.1| hypothetical protein Acid_5349 [Candidatus Solibacter usitatus
Ellin6076]
gi|116227589|gb|ABJ86298.1| conserved hypothetical protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 718
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 190/591 (32%), Positives = 281/591 (47%), Gaps = 98/591 (16%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
L + + PA+ + + A+PIGNGRLGAM++G E L+LNE +LWTG
Sbjct: 23 LALWYQQPAEDWQSQALPIGNGRLGAMIFGDARREHLQLNEISLWTG------------- 69
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
D+G+ YQ LGD+ L+ + YRR LD++
Sbjct: 70 -----DEKDTGR------------------YQNLGDLFLDLTHG----PPQNYRRSLDID 102
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA V YS G + RE+F+S P QVIV + + + G+ + + L D H +
Sbjct: 103 TAIHTVDYSAGGAAWRREYFASAPRQVIVLRCTADKRGAYTGTLRLT---DAHG-----S 154
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ E G R+ ++A G++F +++ + R T S L +E +D A+
Sbjct: 155 PVSAE----GTRL---SSAGKLENGLEFETQIQVMATGGRITASG---DALHIENAD-AL 203
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ +A+ + P + P + L + + Y+ + H+ DYQ+LF RV++
Sbjct: 204 TIFIAAGTNYVPDRARAWRGDSPHARITRQLAAAAAMDYAGMRAAHIADYQQLFRRVTLN 263
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
L +P ++ TD ER+ ++ DP L L FQ+GRYLLISSSRPG+
Sbjct: 264 LGSTPGEMPTD-------------ERLLRYRDGSPDPELEALFFQYGRYLLISSSRPGSL 310
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQG+WN +P W S H NIN++MNYW + NL+EC P FD++ S+ G +T
Sbjct: 311 PANLQGLWNNSNNPPWRSDYHSNINIQMNYWPAEVTNLAECALPFFDYVN--SLRGVRTE 368
Query: 431 QVNYL---ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ GW + + +I+ G W P G AW H WEHY +T DRDFL
Sbjct: 369 ATHKYYPNVRGWTVQTENNIFGA-----GSFKWN--PPGSAWYAQHFWEHYAFTHDRDFL 421
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
K AYP+L+ F D L+ DG L T SPEH P T D ++
Sbjct: 422 SKMAYPVLKEITQFWEDHLVARPDGALVTPDGWSPEHGPEEP---------GVTYDQELV 472
Query: 548 REVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWVQRR 597
++F+ + AA VL N DA KV + RL K+ G + EW + R
Sbjct: 473 WDLFTNYLEAAAVL--NVDAGYRIKVTQLRQRLLKPKVGAWGQLQEWPEDR 521
>gi|380694581|ref|ZP_09859440.1| hypothetical protein BfaeM_11488 [Bacteroides faecis MAJ27]
Length = 812
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 181/600 (30%), Positives = 296/600 (49%), Gaps = 72/600 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG +GA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 56 SQSLPIGNGSIGANILGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIR 115
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + +G+ +E + +K +E Y
Sbjct: 116 KAFIEGDQQKAEKLTRENFNSEVPYEYSGEKPFRFGNFTTMGEFYIETGLNTVKMSE--Y 173
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ NV + R +F S P V+V + S + G +L F+ + + +
Sbjct: 174 KRILSLDSAMAVVQFKKDNVAYQRNYFISYPANVMVMRFSADQPGKQNLIFSYAPNPMST 233
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
++G+N ++ +A + G++++ + I+ + GT++ D KL
Sbjct: 234 GQIAIDGSNGLVY-------------SAFLENNGMKYA--VRIQATVKGGTLNN-SDGKL 277
Query: 243 KVEGSDWAVLLLVASSSFDGPFI-NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRH 297
++ +D AV + A + + F + +D K +P + ++ Y++L H
Sbjct: 278 TIKDADEAVFYVTADTDYKMNFAPDFTDPKTYVGVNPLETTQQWMEDAVAKGYTNLLDEH 337
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DY LF+RV ++L+ + K +P+ +R+K+++ + D L +L +QF
Sbjct: 338 YKDYAALFNRVKLELNPTVKTA-----------NLPTEQRLKNYRKGQPDYYLEKLYYQF 386
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 387 GRYLLIASSRPGNMPANLQGIWHNNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLI 446
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 447 DFIRTLVKPGEKTAQSYFGARGWTASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVW 506
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT + FL++ Y L++ A+F +D+L DG PSTSPEH
Sbjct: 507 EYYDYTQNLKFLKETGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------GP 557
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++ L +K E E VL + L P KI G +MEW
Sbjct: 558 IDQGATFVHAVIREILLDAIKASKELGIDKKERKQWEHVLAN---LTPYKIGRYGQLMEW 614
>gi|261408195|ref|YP_003244436.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261284658|gb|ACX66629.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 779
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 189/611 (30%), Positives = 297/611 (48%), Gaps = 64/611 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+K+ + PA+ ++ +PIGNGR+G +V E + E T W+G P KA
Sbjct: 4 MKLWYTKPAQGWSQGLPIGNGRMGNVVISAPDREIWNITETTYWSGQPEPAQGRSNSKAD 63
Query: 72 LSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDS--------H 116
L +R G Y E + K FG + Q++ LEFD +
Sbjct: 64 LERMRQHFFQGDYREGDRLAKKHLEPEKLNFGTNLGLCQVV----LEFDHNVKPSEGGRQ 119
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS-LSFNV 175
AE + RELDL A AR + E TRE F+S+ DQVIV++I S S +SF +
Sbjct: 120 EAAAEPLFYRELDLQEAVARSFCEIDGAEMTREVFASHADQVIVSRIRSSHGSSGVSFRI 179
Query: 176 SLDSLLDN---HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
S+ +N H+ V G + I G+ ++N + S +++++ + G
Sbjct: 180 SIRG--ENGPFHANVTGKDTIEFRGQAL-----EDVHSNGE---CGVSCQGQLRVAAEGG 229
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
+S D + V G+D A + ++ + + +S L+ L Y
Sbjct: 230 KVSCTADT-ISVSGADEAAIYFAVNTDY-------RQEGESWREKSAFQLEQAVLLGYDA 281
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLV 350
L +HL DYQ L+ RV + L S ++P+ ER+ F+ +DP+L
Sbjct: 282 LRAKHLADYQPLYARVRLDLGSSEHA------------SLPTDERIGRFKQGKQDDPALF 329
Query: 351 ELLFQFGRYLLISSSRPGTQV-ANLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L +Q+GRYL IS SRP + + +LQGIWN E W H++ N +MNY+ + N
Sbjct: 330 ALFYQYGRYLTISGSRPDSILPMHLQGIWNDGEANKMAWSCDYHLDTNTQMNYFPTEAAN 389
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE EPL ++ LS+ G A+ Y A GWV H ++ W +S + W L GG
Sbjct: 390 LSESHEPLMRYIQQLSVAGRSAARHYYDAEGWVAHVFSNAWGFASPGW-ETSWGLNVTGG 448
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEF 526
W+ TH+ EHY Y D+ FLE+ AYP+L+ A+F +D++ + G+L T PS SPE+ F
Sbjct: 449 LWIATHMMEHYAYNQDQAFLEELAYPVLKEAAAFFMDYMTVHPKYGWLVTGPSNSPENSF 508
Query: 527 IA--PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
P+ +S TMD ++R++ + + AA+ L +E+ L +K +L +L P I
Sbjct: 509 YTGNPEDGHQQLSMGPTMDQVLVRDLLAFCVKAAQTLGVDEE-LRQKWQTALDQLPPLMI 567
Query: 585 AEDGSIMEWVQ 595
+ G + EW++
Sbjct: 568 GKKGQLQEWLE 578
>gi|296453497|ref|YP_003660640.1| hypothetical protein BLJ_0327 [Bifidobacterium longum subsp. longum
JDM301]
gi|296182928|gb|ADG99809.1| hypothetical protein BLJ_0327 [Bifidobacterium longum subsp. longum
JDM301]
Length = 783
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 187/607 (30%), Positives = 294/607 (48%), Gaps = 51/607 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + + IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEEGIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVRSLVDSGQYAEATA--ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R Y AT L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQAASGDDYTAATRIIKEATLQEKDEQIYEPFGTARIQY--STPADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S ++ +VS L+++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASLETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRI-----PPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D H +I+ GR PG + P + D+ G + ++ G I+
Sbjct: 179 SDGHRAT-----LIVMGRMPGLNVGLLPHPSEHPWEDEQDGTGMAYAGAFSLTATGGDIN 233
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
++D L+ L + S F G P S + L+ + +DL T
Sbjct: 234 -VDDNSLQCSHITGLSLRFRSMSGFKGSDQQPERS----MTVIADHLEKTIDEWSTDLQT 288
Query: 296 ---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL--- 349
RH+ DY++ F RV+I L + D DT +P + ++S + E L
Sbjct: 289 MLDRHIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDENKEPHRLEML 338
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC L
Sbjct: 339 AEAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCALK 398
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
E EPL L G A G + H D+W ++ G+ +WA WP G AW
Sbjct: 399 ELIEPLVSMNEELLAPGHDAADKILGCRGSAVFHNVDLWRRALPANGEPMWAFWPFGQAW 458
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
+C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+
Sbjct: 459 MCRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETEHG-LAPSPATSPENCFLV- 515
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKIAE 586
+G+ V+ SS AI+R + +I A+ E L++ + ALV + +L T++
Sbjct: 516 NGEPVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDSALVREAESVRSQLAETRLGA 575
Query: 587 DGSIMEW 593
DG I+EW
Sbjct: 576 DGRILEW 582
>gi|429746943|ref|ZP_19280255.1| hypothetical protein HMPREF9078_01397 [Capnocytophaga sp. oral
taxon 380 str. F0488]
gi|429164651|gb|EKY06768.1| hypothetical protein HMPREF9078_01397 [Capnocytophaga sp. oral
taxon 380 str. F0488]
Length = 799
Score = 259 bits (663), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 188/605 (31%), Positives = 308/605 (50%), Gaps = 63/605 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPAD----VYQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + N + F+ + VI +I + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEATAVTSFKRDNNSIEQTAFADFKNDVIWVRIKATS--PLNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P ND +G+ F+++++++ G I +
Sbjct: 191 -KENATITYQNNKITLNGVLP----------NDGKEGMHFASVVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQ 355
+Q LF+R + N + + + ER++ F E +L+ +L +
Sbjct: 290 SSIVFQGLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYN 338
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T + +FL + YP+L+ +F + LI+ GY T PS SPE+ ++ P DG
Sbjct: 458 QHYLFTKNINFL-REYYPVLKEATTFFENLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I + G
Sbjct: 517 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575
Query: 590 IMEWV 594
+ EW+
Sbjct: 576 LNEWL 580
>gi|295835067|ref|ZP_06822000.1| fibronectin type III domain-containing protein [Streptomyces sp.
SPB74]
gi|197698025|gb|EDY44958.1| fibronectin type III domain-containing protein [Streptomyces sp.
SPB74]
Length = 790
Score = 259 bits (662), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 189/595 (31%), Positives = 286/595 (48%), Gaps = 61/595 (10%)
Query: 15 ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
+ + PA + T ++P+GNG LGA V+G +P+E ++ E TLWTG PG ++ NP
Sbjct: 53 LRYTAPATDWETQSLPVGNGALGASVFGTLPTEHVQFAEKTLWTGGPGTPGYRYGNWENP 112
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
P ALS VR+ +++ A+ +L G P Y Q GD L D + +
Sbjct: 113 R-PDALSSVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGD--LLIDVAGAPASANG 168
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R LDL A V Y F R F+S PD+V+V + GS+ ++ S +
Sbjct: 169 YSRTLDLAQGLAGVTYPHDGTTFRRTVFASYPDKVLVGHFTADRGGSVELSLRYTSPRQD 228
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +++ + G G++F A +I++ + GT+SA D+ L
Sbjct: 229 FTATASGDRLTLRGAL-------------QDNGMRFEA--QIRLLSEGGTVSANGDR-LT 272
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+D A +L A + + + P DP A+ Y +L RH D+
Sbjct: 273 VSGADSAWFVLSAGTDYADTY--PGYRGADPHDRVTGAVNQAAARPYRELLDRHTSDHGG 330
Query: 304 LFHRVSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
LF RV + L + S D TD + +A+R +L L FQ+GRYLLI
Sbjct: 331 LFSRVVLDLGQQSAPDQSTDALLKAYTGGNSAADR----------ALEALFFQYGRYLLI 380
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
+SSR G+ ANLQG WN +P W + HVNINL+MNYW + NL+E P F+ L
Sbjct: 381 ASSRAGSLPANLQGAWNNSTTPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFVEAL 440
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+ G TAQ + A GWV+H +T + + D W +P AWL + L+EHY +
Sbjct: 441 RVPGRTTAQSMFGARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHYRFD 498
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYS 539
D+L AYP ++ A F +D L + D L PS SPEH +F A
Sbjct: 499 GSTDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA----------G 548
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
+ M I+ E+F+ + AA+ L ++ A ++ ++L R+ P ++ G +MEW
Sbjct: 549 AAMSQQIVHELFTNTLEAAQTL-GDDPAFRGRLKETLDRIDPGLRVGSWGQLMEW 602
>gi|315224299|ref|ZP_07866133.1| possible alpha-L-fucosidase [Capnocytophaga ochracea F0287]
gi|420159534|ref|ZP_14666333.1| hypothetical protein HMPREF1319_1323 [Capnocytophaga ochracea str.
Holt 25]
gi|314945689|gb|EFS97704.1| possible alpha-L-fucosidase [Capnocytophaga ochracea F0287]
gi|394761875|gb|EJF44190.1| hypothetical protein HMPREF1319_1323 [Capnocytophaga ochracea str.
Holt 25]
Length = 799
Score = 259 bits (662), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 187/602 (31%), Positives = 305/602 (50%), Gaps = 57/602 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + N + F+ + VI KI + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEATAVTSFKRDNNSIEQTAFADFKNDVIWVKIKATSP--LNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P N +G+ F+++++++ G I +
Sbjct: 191 K-ENATITYQNNKITLNGVLP----------NGGKEGMHFASVVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
K + ++ + L + A ++++ F S T ++ LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYN--FDKGGLSDISVTKKANEYLQKAP-MSFDKAKAESSI 292
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-QFGR 358
+Q+LF+R + N + + + ER++ F E +L+ +L+ FGR
Sbjct: 293 VFQRLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYNFGR 341
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL F
Sbjct: 342 YLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPLQRF 401
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W+HY
Sbjct: 402 TKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIWQHY 460
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DGK-- 532
+T + +FL + YP+L+ +F + LI+ GY T PS SPE+ ++ P DGK
Sbjct: 461 LFTKNINFL-REYYPVLKEATTFFENLLIKDPKTGYWVTAPSNSPENAYVLPELKDGKKQ 519
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
+ + TMDM I+RE+F+ AA++L + E S + P +I + G + E
Sbjct: 520 IGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGDLNE 578
Query: 593 WV 594
W+
Sbjct: 579 WL 580
>gi|333022556|ref|ZP_08450620.1| putative fibronectin type III domain-containing protein
[Streptomyces sp. Tu6071]
gi|332742408|gb|EGJ72849.1| putative fibronectin type III domain-containing protein
[Streptomyces sp. Tu6071]
Length = 783
Score = 259 bits (662), Expect = 3e-66, Method: Compositional matrix adjust.
Identities = 188/598 (31%), Positives = 292/598 (48%), Gaps = 67/598 (11%)
Query: 15 ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
+ + PA + T+++P+GNG LGA V+G +P+E ++ E TLWTG PG ++ NP
Sbjct: 46 LRYGAPATDWETESLPVGNGALGASVFGTLPTEHIQFAEKTLWTGGPGTSGYRYGNWENP 105
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
P AL+ VR+ +++ A+ +L G P Y Q GD+ ++ D + + +
Sbjct: 106 R-PDALASVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGDLLIDVDGA--PGSADG 161
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R LDL A A V Y F R F+S PD+V+V + GS+ N+ S +
Sbjct: 162 YTRTLDLAQALATVSYPHDGTTFRRTVFASCPDKVLVGHFTADRGGSVGLNLRYTSPRQD 221
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +++ + G G++F A +I++ + G+++A D+ L
Sbjct: 222 FTATTDGDRLTVRGAL-------------QDNGMRFEA--QIRLLSEGGSVTANGDR-LT 265
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+D A +L A + + + P DP +A+ Y +L RH D+
Sbjct: 266 VSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVTTAVDQAAARPYRELLDRHTSDHAA 323
Query: 304 LFHRVSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSF---QTDEDPSLVELLFQFGRY 359
LF RV + L + S D TD +K++ + +D +L L FQ+GRY
Sbjct: 324 LFSRVVLDLGQGSAPDRTTDAL-------------LKAYTGGNSADDRALEALFFQYGRY 370
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLI+SSR G+ ANLQG WN +P W + HVNINL+MNYW + NL+E P F+
Sbjct: 371 LLIASSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFV 430
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHY 478
L G TA+ + A GWV+H +T + + D W +P AWL + L+EHY
Sbjct: 431 EALRAPGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHY 488
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACV 536
+ D+L AYP ++ A F +D L + D L PS SPEH +F A
Sbjct: 489 RFDGSTDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA-------- 540
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
+ M I+RE+F + AA+ L ++ A + ++L R+ P +I G +MEW
Sbjct: 541 --GAAMSQQIVRELFLNTLEAAQTL-GDDPAFRTTLKETLDRIDPGLRIGSWGQLMEW 595
>gi|393778744|ref|ZP_10367005.1| hypothetical protein HMPREF1321_0421 [Capnocytophaga sp. oral taxon
412 str. F0487]
gi|392611313|gb|EIW94052.1| hypothetical protein HMPREF1321_0421 [Capnocytophaga sp. oral taxon
412 str. F0487]
Length = 799
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 188/605 (31%), Positives = 307/605 (50%), Gaps = 63/605 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + N + F+ + VI KI + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEATAVTSFKRDNNSIGQTAFADFKNDVIWVKIKATSP--LNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P N +G+ F+++++++ G I +
Sbjct: 191 K-ENATITYQNNKITLNGVLP----------NGGKEGMHFASVVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
+Q+LF+R + N + + + ER++ F E +L+ +L+
Sbjct: 290 SSIVFQRLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYN 338
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T + +FL + YP+L+ +F LI+ GY T PS SPE+ ++ P DG
Sbjct: 458 QHYLFTKNINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I + G
Sbjct: 517 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575
Query: 590 IMEWV 594
+ EW+
Sbjct: 576 LNEWL 580
>gi|329923050|ref|ZP_08278566.1| hypothetical protein HMPREF9412_5028 [Paenibacillus sp. HGF5]
gi|328941823|gb|EGG38108.1| hypothetical protein HMPREF9412_5028 [Paenibacillus sp. HGF5]
Length = 767
Score = 258 bits (659), Expect = 7e-66, Method: Compositional matrix adjust.
Identities = 193/614 (31%), Positives = 300/614 (48%), Gaps = 70/614 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+K+ + PA+ ++ +PIGNGR+G +V E + E T W+G P KA
Sbjct: 4 MKLWYTKPAQGWSQGLPIGNGRMGNVVVSTPDREIWNITETTYWSGQPEPAQGRSNSKAD 63
Query: 72 LSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK------ 118
L +R G Y E + K FG + Q++ LEFD H+K
Sbjct: 64 LERMRQHFFQGDYREGDRLAKKHLEPEKLNFGTNLGLCQVV----LEFD-HHVKPSEGGR 118
Query: 119 ---YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS-LSFN 174
AE + RELDL A AR + E RE F+S+ DQVIV +I S S +SF
Sbjct: 119 QDAAAEPLFHRELDLQEAVARSLCEIDGAEMAREVFASHADQVIVARIRSSHGSSGVSFR 178
Query: 175 VSLDSLLDN---HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
+S+ +N H+ V G + I +G+ + I +G+ +++ +
Sbjct: 179 ISIRG--ENGPFHAVVTGKDTIDFQGQA-WEGIHSNGECGVSCQGL-------LRVVTEG 228
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN--LS 289
G +S ++D + V G+D A + +N ++ + SALQ + L
Sbjct: 229 GQVSCMDDTII-VSGADEAAIYFA---------VNTDYRQEGESWREKSALQLEQAVLLG 278
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDP 347
Y +L +HL DYQ L+ RV + L S ++P+ ER+ F+ +D
Sbjct: 279 YDELKAKHLADYQPLYARVRLDLGSSEHA------------SLPTDERIGRFKQGKRDDQ 326
Query: 348 SLVELLFQFGRYLLISSSRPGTQV-ANLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSL 404
+L L +Q+GRYL IS SR + + +LQGIWN E W H+++N +MNY+ +
Sbjct: 327 ALFALFYQYGRYLTISGSRQDSILPMHLQGIWNDGEANKMAWSCDYHLDVNTQMNYFPTE 386
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
NLSE EPL ++ LS+ G A+ Y A GWV H ++ W +S G W L
Sbjct: 387 AANLSESHEPLMRYIQQLSVAGCSAARHYYDAEGWVAHVFSNAWGFASPGWG-TSWGLNV 445
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPE 523
GG W+ THL EHY Y D+ FLE+ AYP+L+ A+F +D++ + G+L T PS SPE
Sbjct: 446 TGGLWIATHLIEHYAYNRDQAFLEELAYPVLKEAAAFFMDYMTVHPQYGWLVTGPSNSPE 505
Query: 524 HEFIA--PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
+ F P+ +S TMD ++R++ + + AA+ L +E+ L +K +L +L P
Sbjct: 506 NSFYTSKPEDGHQQLSMGPTMDQVLVRDLLAFCVKAAQTLGVDEE-LQQKWQTALDQLPP 564
Query: 582 TKIAEDGSIMEWVQ 595
I + G + EW++
Sbjct: 565 LIIGKKGQLQEWLE 578
>gi|375088282|ref|ZP_09734622.1| hypothetical protein HMPREF9703_00704 [Dolosigranulum pigrum ATCC
51524]
gi|374562320|gb|EHR33650.1| hypothetical protein HMPREF9703_00704 [Dolosigranulum pigrum ATCC
51524]
Length = 820
Score = 257 bits (657), Expect = 1e-65, Method: Compositional matrix adjust.
Identities = 185/611 (30%), Positives = 302/611 (49%), Gaps = 74/611 (12%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG--VPGD--YTNPD---APKALSDVRSL 78
+A+P+GNG +G+ V+G V E ++ NE TLW+G PGD Y + L ++R
Sbjct: 22 EALPVGNGTMGSKVFGWVGRERIQFNEKTLWSGGPKPGDDSYNGGNLEGKHSVLPEIRQA 81
Query: 79 VDSGQYAEATAASVKLFGHPAD----VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ G +A + + P Y GDI L+F + + T Y+R LD++TA
Sbjct: 82 LEDGNTEKAKQLAEEHLVGPNSPEYGRYLSFGDIYLDFTNQSKELESVTDYKRVLDMDTA 141
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL---DSLLDNHS-YVN- 188
T V+Y F R+ F S+PD+V+VT +S L FN L L+D S +VN
Sbjct: 142 TTSVRYKEDGTTFKRDTFISHPDKVMVTHLSKEGDKPLEFNAGLYLTKELVDGGSNHVNH 201
Query: 189 ------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
Q +E G + K D+ G++F++ +EI D G I L D L
Sbjct: 202 YAEKESDYKQATVEYTEKGALL--KGTVRDN--GLEFASYMEI---DTDGVIEVL-DGYL 253
Query: 243 KVEGSDWAVLLLVASSSF-DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+V G+ +A L+ A +++ P N D+ D + S +Q + +Y + H++D+
Sbjct: 254 RVTGATYATLMTHAVTNYAQNPETNYRDTTMDVAEVAQSTVQQAIDKTYEQVKVDHINDH 313
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q LFHRV + L + TD + ++ + +L EL +Q+GRYLL
Sbjct: 314 QDLFHRVQLDLGAKTSALFTDDL-------------LATYDKQDGRALEELFYQYGRYLL 360
Query: 362 ISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
I+SSRPG ANLQG+WN +P W+S H+N+NL+MNYW + N++E PL +F+
Sbjct: 361 ITSSRPGKNALPANLQGVWNAVDNPAWNSDYHMNVNLQMNYWPAYSANMAETALPLINFV 420
Query: 420 TYLSINGSKTAQVNYL--------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
L G + A Y +GW+ H + + ++ W P AW+
Sbjct: 421 DDLRYYG-RVAASEYANITSKEGEENGWLAHTQVTPFGWTTPGW-NYYWGWSPAANAWIM 478
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAP 529
+++E+Y YT D++FL+++ YP+L+ A F +L E D ++ ++PS SPEH
Sbjct: 479 QNVYEYYRYTQDKEFLQEKIYPMLKETAKFWNQFLHYDEASDRWV-SSPSYSPEH----- 532
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLE-----KNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F A EVL + +D L+ ++ + +L+P I
Sbjct: 533 ----GTITIGNTFDQSLVWQLFHDFKEATEVLRDVEGFRPDDTLLAEISEKFAKLKPLHI 588
Query: 585 AEDGSIMEWVQ 595
DG I EW +
Sbjct: 589 NNDGHIKEWYE 599
>gi|420150260|ref|ZP_14657420.1| hypothetical protein HMPREF1320_0993 [Capnocytophaga sp. oral taxon
335 str. F0486]
gi|394752319|gb|EJF36021.1| hypothetical protein HMPREF1320_0993 [Capnocytophaga sp. oral taxon
335 str. F0486]
Length = 799
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 188/605 (31%), Positives = 305/605 (50%), Gaps = 63/605 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ A A + N + F+ + VI KI + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEAIAVTLFKRDNNSIEQTAFADFKNDVIWVKIKATSP--LNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P ND +G+ F+++++++ G I +
Sbjct: 191 K-ENATITYQNNKITLNGALP----------NDGKEGMHFASVVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
+Q LF+R + N + + + ER+ F E +L+ +L+
Sbjct: 290 SSIVFQGLFNRNRWYGK-----------ANANTEGLTTFERLGRFYKGEQDALLPILYYN 338
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T + +FL + YP+L+ +F LI+ GY T PS SPE+ ++ P DG
Sbjct: 458 QHYLFTKNINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I + G
Sbjct: 517 KRQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575
Query: 590 IMEWV 594
+ EW+
Sbjct: 576 LNEWL 580
>gi|384196720|ref|YP_005582464.1| hypothetical protein HMPREF9228_0580 [Bifidobacterium breve
ACS-071-V-Sch8b]
gi|333110104|gb|AEF27120.1| conserved hypothetical protein [Bifidobacterium breve
ACS-071-V-Sch8b]
Length = 783
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 185/609 (30%), Positives = 296/609 (48%), Gaps = 55/609 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + ++IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEESIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVR--SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R SL D A L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQASLQDDYNAATRIIKDATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S ++ +VS ++++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASMETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
D H +++ GR PG I P N +D + G+ ++ + ++ G
Sbjct: 179 SDGHRAT-----LVVMGRMPGLNIGLLPHPSENPWEDEQDGTGMTYAGAFSLTVT---GG 230
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ D L+ L + S F G P S + L+ + +DL
Sbjct: 231 DVNVGDNSLQCSNITGLSLRFRSMSGFRGSDQQPERS----MTVIADHLEKTIDEWSTDL 286
Query: 294 YT---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL- 349
T RH+ DY++ F RV+I L + D DT +P + ++S + E L
Sbjct: 287 RTMLDRHIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDEKKEPHRLE 336
Query: 350 --VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC
Sbjct: 337 MLAEAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCA 396
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
L E EPL L + G A G + H D+W ++ G +W+ WP G
Sbjct: 397 LQELIEPLVSMNEELLVPGHDAADRILGCRGSAVFHNVDLWRRALPANGDPMWSFWPFGQ 456
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
AW+C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+
Sbjct: 457 AWMCRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETKHG-LAPSPATSPENCFL 514
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKI 584
+G+ V+ SS AI+R + +I A+ E L++ + LV + +L T++
Sbjct: 515 V-NGEPVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDRDLVHEAESVRSQLAETRL 573
Query: 585 AEDGSIMEW 593
DG I+EW
Sbjct: 574 GADGRILEW 582
>gi|225351622|ref|ZP_03742645.1| hypothetical protein BIFPSEUDO_03219 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
gi|225157966|gb|EEG71249.1| hypothetical protein BIFPSEUDO_03219 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
Length = 783
Score = 255 bits (651), Expect = 5e-65, Method: Compositional matrix adjust.
Identities = 187/606 (30%), Positives = 294/606 (48%), Gaps = 49/606 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + + IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEEGIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVRSLVDSGQYAEATA--ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R YA AT L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQASLHDDYATATRIIKEATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S ++ +VS L+++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASLETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
D H +I+ GR PG I P N +D + G+ ++ + ++ G
Sbjct: 179 SDGHRAT-----LIVMGRMPGLNIGLLPHPSENPWEDEQDGTGMAYAGAFSLTVTG--GD 231
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
I+ + D L+ L + S F G P S + L+ + +DL
Sbjct: 232 IN-VGDNSLQCSNITGLSLRFRSMSGFKGSDQQPERS----MTVIADHLEKTIDEWSTDL 286
Query: 294 YT---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV 350
T RH+ DY++ F RV+I L + D S + S E +S + + L
Sbjct: 287 QTMLDRHIADYRRYFDRVAIHLGSAHADDAELLFSA----ILRSDENKESHRLE---MLA 339
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC L E
Sbjct: 340 EAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCALQE 399
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
EPL L G A G + H D+W ++ G +W+ WP G AW+
Sbjct: 400 LIEPLVSMNEELLAPGHDAADRILGCRGSAVFHNVDLWRRALPANGDPMWSFWPFGQAWM 459
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+ +
Sbjct: 460 CRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETEHG-LAPSPATSPENCFLV-N 516
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKIAED 587
G+ V+ SS AI+R + +I A+ E L++ + LV + +L T++ D
Sbjct: 517 GEPVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDRDLVREAEAVRSQLAETRLGAD 576
Query: 588 GSIMEW 593
G I+EW
Sbjct: 577 GRILEW 582
>gi|367026916|ref|XP_003662742.1| glycoside hydrolase family 95 protein [Myceliophthora thermophila
ATCC 42464]
gi|347010011|gb|AEO57497.1| glycoside hydrolase family 95 protein [Myceliophthora thermophila
ATCC 42464]
Length = 834
Score = 254 bits (648), Expect = 1e-64, Method: Compositional matrix adjust.
Identities = 186/595 (31%), Positives = 286/595 (48%), Gaps = 54/595 (9%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
F +PIGNGRL A V+G +E L LNE+++W+G D NP++ A+ +R ++ SG
Sbjct: 36 FKSTLPIGNGRLAAAVYG-TGTEKLVLNENSVWSGPWLDRANPNSKDAVPKIREMLISGN 94
Query: 84 YAEATAASV-KLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
A A++ + G+P + L D H + Y R LD TA V Y+
Sbjct: 95 ITGAGQAALDNMAGNPISPRAYHPLVNLGIDFGHGSGISD-YTRWLDTFQGTAAVNYTYH 153
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGK 202
++RE+ +S P V+ ++S + G L+ N SL +V + +G G
Sbjct: 154 GTSYSREYVASYPHGVLAFRLSADQPGKLNANFSLS----RSQWVLSRRASVSDGEG-GH 208
Query: 203 RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
+ A++ I F + E +I + G ++ + + + G+D + A +S+
Sbjct: 209 TVALSADSGQPSDAITFWS--EARIVNSGGNATS-DGTTVFITGADTVDVFFDAETSYRH 265
Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
P +D+ + E L + Y + ++D+ L RV + L S
Sbjct: 266 P---DADAAQ---RELKRKLDAAVAAGYPAVRDGAVEDFSSLMGRVRLDLGSS------G 313
Query: 323 TCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR---PGTQVANLQGI 377
+ E+ + T R+ +F+ D DP L+ L+F FGR+LL +SSR P + ANLQGI
Sbjct: 314 SAGEQPVPT-----RLSNFRQDPDADPELMTLVFNFGRHLLAASSRDTGPRSLPANLQGI 368
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LA 436
WN+D P W S +NIN+EMNYW +L NL+E +PLFD + G A+ Y
Sbjct: 369 WNDDYDPPWQSKYTININIEMNYWPALVTNLAETHKPLFDLIDMAIPRGRDVARTMYGCE 428
Query: 437 SGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
G+V+HH TD+W ++ DRG + +WPMG AWL TH EHY +T +R FL + A+P+L
Sbjct: 429 RGFVLHHNTDLWGDAAPVDRG-TPYTVWPMGAAWLATHAMEHYRFTRNRTFLAEVAWPVL 487
Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC-----VSYSSTMDMAIIREV 550
A F +L E D Y T PS SPEH FI P G + S MD ++ ++
Sbjct: 488 RETARFYHCYLFE-WDSYWTTGPSLSPEHSFIVPPGMTTAGAAEGLDISPEMDNQLLHQL 546
Query: 551 FSAIISAAEVL-----------EKNEDALVEKVLKSLPRLRPTKI-AEDGSIMEW 593
F+ + A L + + + LPR+RP + G I EW
Sbjct: 547 FTDVTEACARLGLFSSSSSDDDDDDAETCTTTAETYLPRIRPPAVHPTTGRIQEW 601
>gi|325964568|ref|YP_004242474.1| hypothetical protein Asphe3_32320 [Arthrobacter phenanthrenivorans
Sphe3]
gi|323470655|gb|ADX74340.1| hypothetical protein Asphe3_32320 [Arthrobacter phenanthrenivorans
Sphe3]
Length = 863
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 197/641 (30%), Positives = 288/641 (44%), Gaps = 76/641 (11%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVP-----SETLKLNEDTLWTGVPGD------ 62
++ ++ PA + +A+P+GNGR GAMV+GG P S +LN+ + W+G P
Sbjct: 6 RLAYDAPAAEWLEALPLGNGRHGAMVFGGSPANGGMSHRFQLNDSSAWSGSPHSQDREPV 65
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
++ +A + LS R L+ SG +A A L + Y L F D HL A
Sbjct: 66 FSREEADRILSGSRRLISSGDFAGAAETLKGLQHRHSQAY-------LPFVDLHLTAAPA 118
Query: 123 T-------------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
Y R LDL TA + Y + E F S+ V+V +
Sbjct: 119 ATPTAGPAAGRPSDYHRGLDLATAVSTNTYCLEGHAVRVEAFISHDPSVLVISLLADAPE 178
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA-----NDDPKGIQFSAILE 224
++ ++ LDS L +E + P P D+ +Q +A +
Sbjct: 179 GVNLSLRLDSPLRVLRRTEDRGTCSLELKLPSDAAPAHDGGLVEYSEDESLSLQGAAAVS 238
Query: 225 IKISD---DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
D +A L G A + + A+++F G +P+ +E+
Sbjct: 239 WAHDGQDVDAPGGTAGHYGGLAATGVRRADVFVTAATTFAGLGRHPAGDAASAAAEARGV 298
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT---VPSAERV 338
L+ S S L RH + + +L+ I+L D + E DT + +A
Sbjct: 299 LELAHAASPSTLKERHQESHSRLYRAAQIEL---------DVPAWEGTDTGRRLLAANAH 349
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ-----------VANLQGIWNEDLSPTWD 387
D L LLF +GRYLLISSSRPG ANLQG+WN +L W
Sbjct: 350 PGGPLAADAGLAALLFNYGRYLLISSSRPGPAGSGKGSAWRGVPANLQGLWNAELPAPWS 409
Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
S NINL+MNYW + P L+EC PLF + + + G+ A+ Y A GW +HH +DI
Sbjct: 410 SNYTTNINLQMNYWGAEPTGLAECVVPLFALIEAMQVTGAAVAREYYGARGWTVHHNSDI 469
Query: 448 WAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNY---TMDRD---FLEKRAYPLLEGC 498
WA + W+ WPM G WL HLWEH + T+DRD F A+P + G
Sbjct: 470 WAYAKPVGHGAHSPEWSYWPMAGLWLVRHLWEHLQFGAATVDRDKAGFARDAAWPAIRGA 529
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---GKL--ACVSYSSTMDMAIIREVFSA 553
A F LD L E DG L T PSTSPE+ F A D G+ V+ SSTMD+ + +VF
Sbjct: 530 AEFALDLLAELPDGSLGTGPSTSPENTFAAVDPSSGRRIQGSVAQSSTMDLTLTGDVFRM 589
Query: 554 IISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+ + L + D ++++ ++LPRL + DG + EW+
Sbjct: 590 LDALGRDLGMDADPVLDEARRALPRLPAPEPGRDGKLREWL 630
>gi|418172315|ref|ZP_12808932.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19451]
gi|418196823|ref|ZP_12833294.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47688]
gi|419426112|ref|ZP_13966303.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7533-05]
gi|419445683|ref|ZP_13985694.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19923]
gi|419447843|ref|ZP_13987844.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7879-04]
gi|419449944|ref|ZP_13989937.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4075-00]
gi|419452089|ref|ZP_13992069.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP02]
gi|419519881|ref|ZP_14059484.1| alpha-L-fucosidase [Streptococcus pneumoniae GA08825]
gi|421288567|ref|ZP_15739325.1| alpha-L-fucosidase [Streptococcus pneumoniae GA58771]
gi|353833518|gb|EHE13628.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19451]
gi|353858855|gb|EHE38814.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47688]
gi|379569503|gb|EHZ34473.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19923]
gi|379611583|gb|EHZ76306.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7879-04]
gi|379616518|gb|EHZ81213.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7533-05]
gi|379620888|gb|EHZ85538.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4075-00]
gi|379621308|gb|EHZ85956.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP02]
gi|379638035|gb|EIA02581.1| alpha-L-fucosidase [Streptococcus pneumoniae GA08825]
gi|395885199|gb|EJG96226.1| alpha-L-fucosidase [Streptococcus pneumoniae GA58771]
Length = 739
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 179/568 (31%), Positives = 284/568 (50%), Gaps = 56/568 (9%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFDPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + +++ G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTNYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
V+++ K LP+ TKI +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522
>gi|418092776|ref|ZP_12729912.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44452]
gi|353761446|gb|EHD42013.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44452]
Length = 739
Score = 253 bits (647), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 179/568 (31%), Positives = 285/568 (50%), Gaps = 56/568 (9%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFDPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + +++ G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTNYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD ++
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFS 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
V+++ K LP+ TKI +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522
>gi|418079608|ref|ZP_12716827.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4027-06]
gi|418087855|ref|ZP_12725020.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47033]
gi|421286404|ref|ZP_15737176.1| hypothetical protein SPAR162_2099 [Streptococcus pneumoniae
GA60190]
gi|353745351|gb|EHD26021.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4027-06]
gi|353755532|gb|EHD36135.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47033]
gi|395884860|gb|EJG95894.1| hypothetical protein SPAR162_2099 [Streptococcus pneumoniae
GA60190]
Length = 739
Score = 253 bits (645), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 179/568 (31%), Positives = 283/568 (49%), Gaps = 56/568 (9%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
V+++ K LP+ TKI +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522
>gi|418239710|ref|ZP_12866256.1| putative alpha-L-fucosidase [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|419489948|ref|ZP_14029693.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44386]
gi|419526922|ref|ZP_14066473.1| hypothetical protein SPAR35_2240 [Streptococcus pneumoniae GA14373]
gi|353890745|gb|EHE70505.1| putative alpha-L-fucosidase [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|379555528|gb|EHZ20595.1| hypothetical protein SPAR35_2240 [Streptococcus pneumoniae GA14373]
gi|379584934|gb|EHZ49797.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44386]
Length = 739
Score = 253 bits (645), Expect = 3e-64, Method: Compositional matrix adjust.
Identities = 179/568 (31%), Positives = 284/568 (50%), Gaps = 56/568 (9%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD ++
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFS 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
V+++ K LP+ TKI +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522
>gi|291457532|ref|ZP_06596922.1| putative alpha-L-fucosidase 2 [Bifidobacterium breve DSM 20213 =
JCM 1192]
gi|291380585|gb|EFE88103.1| putative alpha-L-fucosidase 2 [Bifidobacterium breve DSM 20213 =
JCM 1192]
Length = 783
Score = 252 bits (644), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 184/609 (30%), Positives = 296/609 (48%), Gaps = 55/609 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + ++IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEESIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVR--SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R SL D A L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQASLQDDYNAATRIIKDATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S ++ +VS ++++
Sbjct: 119 ARALAGETFRMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASMETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
D H +++ GR PG I P N +D + G+ ++ + ++ G
Sbjct: 179 SDGHRAT-----LVVMGRMPGLNIGLLPHPSENPWEDEQDGTGMAYAGAFSLTVT---GG 230
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ D L+ L + S F G P S + L+ + +DL
Sbjct: 231 DVNVGDNSLQCSNITGLSLRFRSMSGFRGSDQQPERS----MTVIADHLEKTIDEWSTDL 286
Query: 294 YT---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL- 349
T R + DY++ F RV+I L + D DT +P + ++S + E L
Sbjct: 287 RTMLDRRIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDEKKEPHRLE 336
Query: 350 --VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC
Sbjct: 337 MLAEAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCA 396
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
L E EPL L + G A G + H D+W ++ G+ +W+ WP G
Sbjct: 397 LQELIEPLVSMNEELLVPGHDAADRILGCRGSAVFHNVDLWRRALPANGEPMWSFWPFGQ 456
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
AW+C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+
Sbjct: 457 AWMCRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETKHG-LAPSPATSPENCFL 514
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKI 584
+G+ V+ SS AI+R + +I A+ E L++ + LV + +L T++
Sbjct: 515 V-NGEPVSVAQSSENATAIVRNLLDDLIQASHDLEDLDEEDRDLVHEAESVRSQLAETRL 573
Query: 585 AEDGSIMEW 593
DG I+EW
Sbjct: 574 GADGRILEW 582
>gi|421290728|ref|ZP_15741475.1| alpha-L-fucosidase [Streptococcus pneumoniae GA54354]
gi|421306123|ref|ZP_15756774.1| alpha-L-fucosidase [Streptococcus pneumoniae GA62331]
gi|395885632|gb|EJG96654.1| alpha-L-fucosidase [Streptococcus pneumoniae GA54354]
gi|395903807|gb|EJH14730.1| alpha-L-fucosidase [Streptococcus pneumoniae GA62331]
Length = 739
Score = 251 bits (641), Expect = 8e-64, Method: Compositional matrix adjust.
Identities = 178/568 (31%), Positives = 282/568 (49%), Gaps = 56/568 (9%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P NLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPVNLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
V+++ K LP+ TKI +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522
>gi|418188158|ref|ZP_12824676.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47360]
gi|421271597|ref|ZP_15722447.1| hypothetical protein SPAR48_2212 [Streptococcus pneumoniae SPAR48]
gi|353847967|gb|EHE27986.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47360]
gi|395865736|gb|EJG76874.1| hypothetical protein SPAR48_2212 [Streptococcus pneumoniae SPAR48]
Length = 739
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 177/568 (31%), Positives = 281/568 (49%), Gaps = 56/568 (9%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTVFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S + +I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL--------SI 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
V+++ K LP+ TKI +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522
>gi|332880351|ref|ZP_08448029.1| hypothetical protein HMPREF9074_03804 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357047449|ref|ZP_09109054.1| hypothetical protein HMPREF9441_03090 [Paraprevotella clara YIT
11840]
gi|332681796|gb|EGJ54715.1| hypothetical protein HMPREF9074_03804 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355529520|gb|EHG98947.1| hypothetical protein HMPREF9441_03090 [Paraprevotella clara YIT
11840]
Length = 746
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 187/595 (31%), Positives = 277/595 (46%), Gaps = 109/595 (18%)
Query: 12 PLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
P+K+ ++ PAK + T A+P+GNG +GAM +GGV E L+ N+ TLW G
Sbjct: 25 PMKLWYDEPAKVWMTSALPVGNGGIGAMFFGGVAKEQLQFNDKTLWAG------------ 72
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
S R YQ +GD+ EFD YRREL L
Sbjct: 73 --STTRR----------------------GAYQNMGDLFFEFDTPE---TCTNYRRELSL 105
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG-SESGSLSFNVSLDSLLDNHSYVNG 189
+ A RV Y++ V++ RE+F+SNPD VIV +++ G L+F++ + + V+G
Sbjct: 106 DDAIGRVSYTIDGVDYLREYFASNPDSVIVVRLTTPGHKGKLNFSLRMQDGRQGMTRVDG 165
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ I D + A+L+ D G + D+ L+V+G+D
Sbjct: 166 HTMTI--------------KGTLDLLSYEAQALLQA----DGGMVETKSDR-LEVKGADA 206
Query: 250 AVLLLVASSSFD--GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
++L +++FD P D+ + S ++ R SY L HL DYQ LF R
Sbjct: 207 VTVVLTGATNFDLASPTYTRGDAYEIHRRVSARMDKATRK-SYKKLKAAHLADYQPLFAR 265
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V + L D TD E+ D + L L FQ+GRYL++ SSR
Sbjct: 266 VELDLDAEQPDYTTDVLVREHKD---------------NAYLDMLYFQYGRYLMLGSSRG 310
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI--- 424
G +NLQG+WN +P W+ H NIN++MNYW + NLSEC P F+TY+S
Sbjct: 311 GQLPSNLQGLWNNVNNPAWECDYHSNINVQMNYWPAEVTNLSECYAP---FITYVSTEAL 367
Query: 425 -NGSKTAQVNYL--ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+G QV GW +H + +I+ G W + AW CTHLW+HY YT
Sbjct: 368 KDGGAWQQVARKENCRGWAVHTQNNIF-------GYTDWLINRPANAWYCTHLWQHYAYT 420
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYS 539
+D+++L A+P+++ + D L E +G L SPEH P DG V+Y+
Sbjct: 421 LDKEYLRDTAWPVMKVTCQYWFDRLKENAEGRLVAPNEWSPEH---GPWEDG----VAYA 473
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
+ A+ E ++AA+VL +DA V ++ + RL I G I EW
Sbjct: 474 QQLVYALFEET----LAAADVLAV-DDAFVSELKEKFSRLDNGLHIGSWGQIKEW 523
>gi|402847334|ref|ZP_10895629.1| hypothetical protein HMPREF1323_1685 [Porphyromonas sp. oral taxon
279 str. F0450]
gi|402266647|gb|EJU16068.1| hypothetical protein HMPREF1323_1685 [Porphyromonas sp. oral taxon
279 str. F0450]
Length = 838
Score = 250 bits (638), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 180/603 (29%), Positives = 286/603 (47%), Gaps = 55/603 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKA 71
L F+ PA +A+P+GNGRLG + GGV + + LNE ++W+G V N +A K
Sbjct: 46 LTYFFDRPATSMMEALPLGNGRLGMLSDGGVQHQRITLNESSMWSGSVDSTAWNAEAYKQ 105
Query: 72 LSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLK 118
L +R L+ +G+ EA + F P YQ+ G + L +D +
Sbjct: 106 LPAIRKLLLAGRAKEAEDLIYRTFVCGGVGSGRGQGANTPYGSYQVGGFLHLNWDKAP-- 163
Query: 119 YAEETYRRELDLNTATARVKYSV-GNVEFTREHFSSNPDQVIVTKIS--GSESGSLSFNV 175
Y R L L+ +R + V G T+ +S +V V ++ E+ + +
Sbjct: 164 -ELSGYYRGLSLSEGVSRESFVVDGQAYRTKRLYSVLGREVQVVHLTNHSEEARRDTLRL 222
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
SL + H + + G+ P + +G+ + AI+ + GT+
Sbjct: 223 SLSRPENGHPAAEAGF-LTLSGQLPDGK---------GGRGMSY-AIVVRPVLPQGGTLI 271
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
D+ L V V L +A ++ N D + + S+ + + ++L+
Sbjct: 272 TRGDELLIVNAP--TVELYIAHNT------NYYDKRLPVMARSIEQTLQAKAVGEANLFA 323
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELL 353
H+ + RV + S+ + ++P R+ ++ + DP+L L
Sbjct: 324 EHVQRFTAQMDRVQARF----------LGSDPALSSLPIQRRLIAYYEHPERDPALAALY 373
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
Q GRYLLISS+RPG NLQGIW E + W+ H+NINL+MNYW + L E
Sbjct: 374 MQLGRYLLISSTRPGALPPNLQGIWTETIQAPWNGDYHLNINLQMNYWPAEKGALPETVG 433
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L D++ + +G +TA+ Y A GWV H ++W + +A W AWLC H
Sbjct: 434 ALTDWVESIVPSGERTARTFYRAKGWVTHVLGNVW-QFTAPGEHPSWGATNTSAAWLCEH 492
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGK 532
L+ HY Y+ DR +LE R YP+++G A F L L++ GYL P+TSPE+ + P GK
Sbjct: 493 LYNHYRYSQDRAYLE-RIYPVMQGAARFFLTTLVKDPKSGYLVNVPTTSPENSYYTPQGK 551
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
V+ STMD I+RE+FS AA L ++ V+ + +L +L+PT + DG IME
Sbjct: 552 AVAVAAGSTMDNQILRELFSTTREAAMTLGRDR-TFVDSLSTALRQLKPTTLGPDGRIME 610
Query: 593 WVQ 595
W++
Sbjct: 611 WME 613
>gi|317141175|ref|XP_001817567.2| hypothetical protein AOR_1_3054174 [Aspergillus oryzae RIB40]
Length = 770
Score = 249 bits (637), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 171/563 (30%), Positives = 277/563 (49%), Gaps = 59/563 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ P F ++P+GNGRLG ++ +P+E + NED++W+G D N +A VR
Sbjct: 34 YDTPGTRFNASLPVGNGRLGGTLYC-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+L+ +G A ++ + G D YQ+L ++ ++ Y L+ TA
Sbjct: 93 NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQRGDATNLVWYLDTLEGYTA 152
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+Y V +TRE +S P V+ +I + S +++ N + NG I
Sbjct: 153 ---CEYGFDGVSYTRELIASAPSGVLGFRIQTNTSRAINLN----------AVANGIASI 199
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+M+ R + F+A + + + D G ++A DK L V G+ V
Sbjct: 200 VMKART------------GEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATTVVFF 244
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A SS+ + D +E L + L Y L + D++ L RV++ L
Sbjct: 245 LDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVTLDLG 298
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRPGTQV 371
S D + +P ER+ ++++ D D L+F +GR+LLI+SSR +
Sbjct: 299 SSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRRTRER 348
Query: 372 A---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ LQGIWN+D SP+W + VNINLEMNYW + NL+E PL+D L + G
Sbjct: 349 SLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQERGGD 408
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ + G+V+HH TD+W S +++WPMGGAWL H+ EHY +T D+ FL+
Sbjct: 409 VAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDKTFLK 468
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
++A P+ + F +L + DGYL T PS SPE+ F P GK ++ S T+D
Sbjct: 469 EQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMSPTLD 527
Query: 544 MAIIREVFSAIISAAEVLEKNED 566
+++ E+ +A+ ++LE + D
Sbjct: 528 NSMLFELLTALNETHQILEIDND 550
>gi|405761776|ref|YP_006702372.1| hypothetical protein SPNA45_02013 [Streptococcus pneumoniae SPNA45]
gi|404278665|emb|CCM09296.1| conserved hypothetical protein [Streptococcus pneumoniae SPNA45]
Length = 739
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 177/568 (31%), Positives = 282/568 (49%), Gaps = 56/568 (9%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W + NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSNRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERIREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMIGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
V+++ K LP+ TKI +G I EW++
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLE 522
>gi|330998117|ref|ZP_08321945.1| hypothetical protein HMPREF9442_03053 [Paraprevotella xylaniphila
YIT 11841]
gi|329569206|gb|EGG50997.1| hypothetical protein HMPREF9442_03053 [Paraprevotella xylaniphila
YIT 11841]
Length = 746
Score = 249 bits (635), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 185/596 (31%), Positives = 274/596 (45%), Gaps = 111/596 (18%)
Query: 12 PLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
P+K+ ++ PAK + T A+P+GNG +GAM +GGV E L+ N+ TLW G
Sbjct: 25 PMKLWYDEPAKVWMTSALPVGNGGIGAMFFGGVAKERLQFNDKTLWAG------------ 72
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
S R YQ +GD+ EFD YRREL L
Sbjct: 73 --STTRR----------------------GAYQNMGDLFFEFDTPE---TCTNYRRELSL 105
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG-SESGSLSFNVSLDSLLDNHSYVNG 189
+ A RV Y++ V++ RE+F+SNPD VIV +++ G L+F++ + + V+G
Sbjct: 106 DDAIGRVSYTIDGVDYLREYFASNPDSVIVVRLTTPRHKGKLNFSLRMQDGRQGMTRVDG 165
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKKLKVEGS 247
+ I KG S + ++ D G + D+ L+V+G+
Sbjct: 166 HTMTI--------------------KGTLDLLSYEAQARLQADGGMVETKSDR-LEVKGA 204
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRHLDDYQKLFH 306
D ++L +++FD + D +SA + SY L HL DYQ LF
Sbjct: 205 DAVTVVLTGATNFDLASPTYTRGDADEIHRRVSARMDKAARKSYKKLKAVHLADYQPLFA 264
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV + L D TD E+ D + L L FQ+GRYL++ SSR
Sbjct: 265 RVELDLDAEQPDYTTDVLVREHKD---------------NAYLDMLYFQYGRYLMLGSSR 309
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-- 424
G +NLQG+WN +P W+ H NIN++MNYW + NLSEC P F+TY+S
Sbjct: 310 GGQLPSNLQGLWNNVNNPAWECDYHSNINVQMNYWPAEVANLSECYAP---FITYVSTEA 366
Query: 425 --NGSKTAQVNYL--ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+G QV GW +H + +I+ G W + AW CTHLW+HY Y
Sbjct: 367 LKDGGSWQQVARKENCRGWAVHTQNNIF-------GYTDWLINRPANAWYCTHLWQHYAY 419
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSY 538
T+D+++L A+P+++ + D L E +G L SPEH P DG V+Y
Sbjct: 420 TLDKEYLRDTAWPVMKVTCQYWFDRLKENTEGRLVAPNEWSPEH---GPWEDG----VAY 472
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
+ + A+ E ++AA VL +DA V ++ + RL + G I EW
Sbjct: 473 AQQLVYALFEET----LAAAGVLAV-DDAFVSELKEKFSRLDNGLHVGSWGQIKEW 523
>gi|29347187|ref|NP_810690.1| hypothetical protein BT_1777 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29339086|gb|AAO76884.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 1019
Score = 249 bits (635), Expect = 4e-63, Method: Compositional matrix adjust.
Identities = 166/497 (33%), Positives = 261/497 (52%), Gaps = 35/497 (7%)
Query: 109 ELEFDDS-HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGS 166
EL D S L Y++ Y R LD++ A V Y + F RE+F S PD V+V ++ S S
Sbjct: 324 ELSIDASTELPYSD--YTRTLDIDNAIHTVMYKENGITFKREYFMSYPDNVMVMRLTSDS 381
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
+ G LS +SL+SL + + + I M G P K + G++++ L +K
Sbjct: 382 KKGKLSRIISLESLHTDKTITADGHTITMTGY-PTPVSGDKRVGDAWKNGLKYAQQLVVK 440
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSALQS 284
+ G IS ++ KLKVE +D ++L+ A++++ + + S++DP + + L
Sbjct: 441 --NKGGKISVVDGTKLKVEDADEIIVLMSAATNYVQCMDDSYNYFSQEDPLEKVQATLHK 498
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE-ENIDTVPSAERVKSFQT 343
+ + Y+ L H DY L+ R+ + L P+ V T S + +D ++E+
Sbjct: 499 VADKKYTALLATHQKDYHSLYDRMRLNLGNLPEAPVAPTDSLLKGMDENTNSEQ------ 552
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W++ H NIN++MNYW +
Sbjct: 553 -ENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNADYHTNINIQMNYWPT 611
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGK 457
P NLS C P+ +++ L G TAQ Y GWV HH+ +IW ++A K
Sbjct: 612 QPTNLSPCHLPMVEYVRSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIW-DNTAPAKK 670
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
+P G W+C +WE+Y + +D+DFL+K +L+ ++ + + DG L N
Sbjct: 671 STPHHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAALFWVDNLWTDERDGTLVAN 730
Query: 518 PSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
PS SPEH EF L C + A+I E+F +I A++ L +++D + ++ ++
Sbjct: 731 PSHSPEHGEF-----SLGC-----STSQAMICEMFDMMIKASKELGRDKDPEIIEIATAM 780
Query: 577 PRLRPTKIAEDGSIMEW 593
+L KI G MEW
Sbjct: 781 SKLSGPKIGLGGQFMEW 797
Score = 58.5 bits (140), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 42/63 (66%), Gaps = 1/63 (1%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
MM LK T+N PAK++ ++A+PIGNG +GAM++G V + ++ NE TLW+G
Sbjct: 23 MMACSEQPHQPTLKATYNKPAKNWESEALPIGNGYMGAMIFGDVYVDVIQTNEHTLWSGG 82
Query: 60 PGD 62
PG+
Sbjct: 83 PGE 85
>gi|225016900|ref|ZP_03706092.1| hypothetical protein CLOSTMETH_00813 [Clostridium methylpentosum
DSM 5476]
gi|224950294|gb|EEG31503.1| hypothetical protein CLOSTMETH_00813 [Clostridium methylpentosum
DSM 5476]
Length = 1565
Score = 248 bits (634), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 188/640 (29%), Positives = 303/640 (47%), Gaps = 89/640 (13%)
Query: 6 STSTTNPLKITFNGPAKHFTDA-------IPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
S TNPL++ + PA TD+ +P+GNG +G MV+GG+ E + NE ++WTG
Sbjct: 38 SVRNTNPLRLWYTKPAPVNTDSKQWQYTVLPLGNGYMGGMVFGGISKERVHFNEKSMWTG 97
Query: 59 VPG---------DYTNPDAPKALSDVRSLVDSGQY----AEATAASVKLF----GHPAD- 100
P + T P + L + R+ +D ++A + KL G D
Sbjct: 98 GPSASRPNHNGSNRTEPVTTEWLDEFRAELDDKTNDVWGLSSSAGNNKLLDLIRGPKRDN 157
Query: 101 ------VYQLLGDIELEFDDSHLKY-AEETYRRELDLNTATARVKYSVGNVEFTREHFSS 153
+YQ GDI ++F + + E Y R+LDL TA + V Y +G V +TRE+F+S
Sbjct: 158 WDNGMGMYQDFGDIYMDFARAGITDDMAENYVRDLDLTTALSTVSYDIGGVHYTREYFNS 217
Query: 154 NPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD 213
PD V+ +++ SE+G L+F+ S+ S + N + EG R + N
Sbjct: 218 YPDNVLAMRLNASEAGKLTFDASITPA---SSTSSTNRTVTAEGDIITLRGQIRDNQ--- 271
Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
+Q+ A ++K+ ++ GT+ A ED + ++G+D L+L + + + P +D
Sbjct: 272 ---LQYEA--QLKVLNEGGTLKANEDGTISIDGADSVTLILACGTDYKNEW--PKYRGED 324
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P + + + + + LY HL+DYQ+LF RV + L E + +P
Sbjct: 325 PHEAISARIDNAADKGFDALYQTHLEDYQELFSRVDLDLG-------------EELPNIP 371
Query: 334 SAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN-EDLSPTWDSAPH 391
+ E +++++ E + SL L +Q GRYL I+ SR T NL G+W S W++ H
Sbjct: 372 TDELIQNYRDGEHNKSLEVLTYQMGRYLTIAGSRENTLPTNLNGVWMIGSASQFWNADYH 431
Query: 392 VNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS-----------GWV 440
N+N +MNYW ++ NL+EC P D++ L G TA S G+
Sbjct: 432 FNVNFQMNYWPTMAANLAECMLPYNDYMESLVEPGRVTAGATAGLSTEPGTPIGEGNGFN 491
Query: 441 IHHKTDIWAKSSADRGKVVWALWPMGGA-WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
H +I+ + +V W +GGA W + +++Y YT D D+L + YP+L+ A
Sbjct: 492 AHTVNNIFGTTGP--YQVQEFGWTLGGASWALENSYDYYAYTQDEDYLRDKIYPMLKEQA 549
Query: 500 SFLLDWLIEGHDGY---LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
+F +L H Y L PS SPE + ST D +I E F I+
Sbjct: 550 TFYSKFLW--HSDYQNRLVVGPSVSPEQ---------GPTTNGSTFDQSIAWEAFEEAIN 598
Query: 557 AAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQR 596
A+E L +ED L + +L P + ++G I EW +
Sbjct: 599 ASEALGVDED-LRATWAEMQSQLNPIIVGDEGQIKEWYEE 637
>gi|383125191|ref|ZP_09945845.1| hypothetical protein BSIG_4345 [Bacteroides sp. 1_1_6]
gi|382983436|gb|EES66608.2| hypothetical protein BSIG_4345 [Bacteroides sp. 1_1_6]
Length = 1019
Score = 248 bits (633), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 165/497 (33%), Positives = 260/497 (52%), Gaps = 35/497 (7%)
Query: 109 ELEFDDS-HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGS 166
EL D S L Y++ Y R LD++ A V Y + F RE+F S PD V+V ++ S S
Sbjct: 324 ELSIDASTELPYSD--YTRTLDIDNAIHTVMYKENGITFKREYFMSYPDNVMVMRLTSDS 381
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
+ G LS +SL+SL + + + I M G P K + G+ ++ L +K
Sbjct: 382 KKGKLSRIISLESLHTDKTITADGHTITMTGY-PTPVSGDKRVGDAWKNGLIYAQQLVVK 440
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSALQS 284
+ G IS ++ KLKVE +D ++L+ A++++ + + S++DP + + L
Sbjct: 441 --NKGGKISVVDGTKLKVEDADEIIVLMSAATNYVQCMDDSYNYFSQEDPLEKVQATLHK 498
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE-ENIDTVPSAERVKSFQT 343
+ + Y+ L H DY L+ R+ + L P+ V T S + +D ++E+
Sbjct: 499 VADKKYTALLATHQKDYHSLYDRMRLNLGNLPEAPVAPTDSLLKGMDENTNSEQ------ 552
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W++ H NIN++MNYW +
Sbjct: 553 -ENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNADYHTNINIQMNYWPT 611
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGK 457
P NLS C P+ +++ L G TAQ Y GWV HH+ +IW ++ + K
Sbjct: 612 QPTNLSPCHLPMVEYVRSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIWGNTAPAK-K 670
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
+P G W+C +WE+Y + +D+DFL+K +L+ ++ + + DG L N
Sbjct: 671 STPHHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAALFWVDNLWTDERDGTLVAN 730
Query: 518 PSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
PS SPEH EF L C + A+I E+F +I A++ L +++D + ++ ++
Sbjct: 731 PSHSPEHGEF-----SLGC-----STSQAMICEMFDMMIKASKELGRDKDPEIIEIATAM 780
Query: 577 PRLRPTKIAEDGSIMEW 593
+L KI G MEW
Sbjct: 781 SKLSGPKIGLGGQFMEW 797
Score = 58.9 bits (141), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 44/63 (69%), Gaps = 2/63 (3%)
Query: 2 MNAESTSTTNP-LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
M A S P LK T+N PAK++ ++A+PIGNG +GAM++G V + ++ NE TLW+G
Sbjct: 23 MTACSEQPYQPTLKATYNKPAKNWESEALPIGNGYMGAMIFGDVYVDVIQTNEHTLWSGG 82
Query: 60 PGD 62
PG+
Sbjct: 83 PGE 85
>gi|365122414|ref|ZP_09339317.1| hypothetical protein HMPREF1033_02663 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642654|gb|EHL82000.1| hypothetical protein HMPREF1033_02663 [Tannerella sp.
6_1_58FAA_CT1]
Length = 837
Score = 248 bits (633), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 185/602 (30%), Positives = 288/602 (47%), Gaps = 76/602 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR-S 77
+ PIGNG G + G V +E + LNE +LW G P Y N + K L +R S
Sbjct: 79 SFPIGNGSFGGNILGSVKTERITLNEKSLWKGGPNVSGGARYYWDANKEGYKVLDQIRHS 138
Query: 78 LVD-SGQYAEATAASVKLF----GHPADV--------YQLLGDIELEFDDSHLKYAE-ET 123
+ SG + AT + F G+ D + +G+ + D+ + +E
Sbjct: 139 FIQFSGINSVATELTRNNFNGKCGYEPDSEKSFRFGSFTTMGEFHI---DTGIAESEISD 195
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLL 181
YRR L L++A V+++ G F R+ FSS PD +++ + + G +L+F +
Sbjct: 196 YRRILSLDSALVVVQFNAGGDCFYRKFFSSYPDSIMIYRFECTRPGRQNLTFRYVANPQA 255
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+G I+ GR D G+QF ++ ++ + GT++ +E+
Sbjct: 256 SGSVEADGTAGIVYNGRL-------------DSNGMQF--VIRVRAVAESGTVT-VENGA 299
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINP--SDSKK----DPTSESMSALQSIRNLSYSDLYT 295
+KV G+D + + + + NP +D + DP + + L Y +Y
Sbjct: 300 IKVIGADNVTFYVAGDTDYKMNY-NPDFNDDRAYVGVDPVMTTQNNLDFALAKGYDAVYN 358
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLF 354
H DY LF RV I L+ S + V+D +P+ R+ +++ D L EL F
Sbjct: 359 AHRADYSALFDRVKIDLNES--NPVSD---------IPTDMRLSNYRNGISDHYLEELYF 407
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLI+SSR G ANLQG+W+ ++ W H NINL+MNYW + P NLSECQ P
Sbjct: 408 QFGRYLLIASSRAGNLPANLQGLWHNNVEGPWRVDYHNNINLQMNYWPACPANLSECQTP 467
Query: 415 LFDFLTYLSINGSKTAQVNYL--ASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLC 471
L +++ L G +TA+ Y GW ++I+ +S + + W + G WL
Sbjct: 468 LIEYIRTLVKPGERTAKAYYGPDTRGWTTSVSSNIFGFTSPLSSRDMSWNFSFVAGPWLA 527
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
TH+WE+Y+YT D DFL Y L++G A F +D L DG PSTSPEH
Sbjct: 528 THVWEYYDYTRDEDFLRTTGYELIKGSAEFAVDHLWHKPDGSYAAAPSTSPEH------- 580
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
V +T A++RE+ I +++L+ + E+ + L +L P +I G +M
Sbjct: 581 --GPVDQGATFAHAVVREILLDAIETSKILDVDASER-EEWQEVLNKLMPYEIGRYGQLM 637
Query: 592 EW 593
EW
Sbjct: 638 EW 639
>gi|417920435|ref|ZP_12563942.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
australis ATCC 700641]
gi|342829385|gb|EGU63741.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
australis ATCC 700641]
Length = 1209
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 192/636 (30%), Positives = 307/636 (48%), Gaps = 107/636 (16%)
Query: 14 KITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------- 60
++T+N PA D A+P+GNG +GA V+G + E ++ NE TLW+G P
Sbjct: 123 QLTYNQPATPSYDGWEKQALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYN 182
Query: 61 -GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDS 115
G+Y D K L+++R +++G +A + + P + Y GDI + F++
Sbjct: 183 GGNY--EDRHKVLAEIRKALEAGDRQKAKQLAEENLVGPNNAQYGRYLSFGDIFMVFNNQ 240
Query: 116 HLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF- 173
T Y R LD+ A YS F RE FSS PD V VT +S +L F
Sbjct: 241 KKGLENVTDYHRALDITQAITTTAYSQDGTTFKRETFSSYPDDVTVTHLSKKGDKTLDFT 300
Query: 174 --NVSLDSLLDNHSY------------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
N + LL N Y +N I+++G K N G+QF
Sbjct: 301 LWNSLTEDLLANGDYSWEYSNYKQGAVTTDSNGILLKGTV-------KDN------GLQF 347
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSES 278
++ L IK G ++A +D L V G+ +A LLL A ++F NP ++ +KD E+
Sbjct: 348 ASYLGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDVEN 400
Query: 279 M--SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
S +++ + Y L H++DYQ LF+RV + L S T + E
Sbjct: 401 TVKSIVEAAKAKDYETLKHDHIEDYQSLFNRVQLNLGGSKS-------------TQTTKE 447
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNI 394
++++ ++ L EL FQ+GRYL+ISSSR T ANLQG+WN +P W+S H+N+
Sbjct: 448 ALQTYNPEKGQQLEELFFQYGRYLIISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNV 507
Query: 395 NLEMNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHH 443
NL+MNYW + NL+E P+ +++ L G SK Q N GW++H
Sbjct: 508 NLQMNYWPAYMSNLAETARPMVNYIDDLRYYGRIAAKEYAGIESKEGQEN----GWLVHT 563
Query: 444 KTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASF 501
+ W D W P AW+ +++++Y +T D +L+++ YP+L+ A F
Sbjct: 564 QATPFGWTTPGWD---YYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKF 620
Query: 502 LLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
+L + D ++ ++PS SPEH ++ +T D +++ ++F + AA
Sbjct: 621 WNSFLHYDQTSDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAAN 670
Query: 560 VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
L ++D LV +V +L+P I ++G I EW +
Sbjct: 671 HLNVDQD-LVTEVKTKFDKLKPLHINQEGRIKEWYE 705
>gi|319946487|ref|ZP_08020723.1| alpha-L-fucosidase [Streptococcus australis ATCC 700641]
gi|319747318|gb|EFV99575.1| alpha-L-fucosidase [Streptococcus australis ATCC 700641]
Length = 1643
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 192/636 (30%), Positives = 307/636 (48%), Gaps = 107/636 (16%)
Query: 14 KITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------- 60
++T+N PA D A+P+GNG +GA V+G + E ++ NE TLW+G P
Sbjct: 148 QLTYNQPATPSYDGWEKQALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYN 207
Query: 61 -GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDS 115
G+Y D K L+++R +++G +A + + P + Y GDI + F++
Sbjct: 208 GGNYE--DRHKVLAEIRKALEAGDRQKAKQLAEENLVGPNNAQYGRYLSFGDIFMVFNNQ 265
Query: 116 HLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF- 173
T Y R LD+ A YS F RE FSS PD V VT +S +L F
Sbjct: 266 KKGLENVTDYHRALDITQAITTTAYSQDGTTFKRETFSSYPDDVTVTHLSKKGDKTLDFT 325
Query: 174 --NVSLDSLLDNHSY------------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
N + LL N Y +N I+++G K N G+QF
Sbjct: 326 LWNSLTEDLLANGDYSWEYSNYKQGAVTTDSNGILLKGTV-------KDN------GLQF 372
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSES 278
++ L IK G ++A +D L V G+ +A LLL A ++F NP ++ +KD E+
Sbjct: 373 ASYLGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDVEN 425
Query: 279 M--SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
S +++ + Y L H++DYQ LF+RV + L S T + E
Sbjct: 426 TVKSIVEAAKAKDYETLKHDHIEDYQSLFNRVQLNLGGSKS-------------TQTTKE 472
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNI 394
++++ ++ L EL FQ+GRYL+ISSSR T ANLQG+WN +P W+S H+N+
Sbjct: 473 ALQTYNPEKGQQLEELFFQYGRYLIISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNV 532
Query: 395 NLEMNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHH 443
NL+MNYW + NL+E P+ +++ L G SK Q N GW++H
Sbjct: 533 NLQMNYWPAYMSNLAETARPMVNYIDDLRYYGRIAAKEYAGIESKEGQEN----GWLVHT 588
Query: 444 KTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASF 501
+ W D W P AW+ +++++Y +T D +L+++ YP+L+ A F
Sbjct: 589 QATPFGWTTPGWD---YYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKF 645
Query: 502 LLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
+L + D ++ ++PS SPEH ++ +T D +++ ++F + AA
Sbjct: 646 WNSFLHYDQTSDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAAN 695
Query: 560 VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
L ++D LV +V +L+P I ++G I EW +
Sbjct: 696 HLNVDQD-LVTEVKTKFDKLKPLHINQEGRIKEWYE 730
>gi|429740665|ref|ZP_19274345.1| hypothetical protein HMPREF9134_00217 [Porphyromonas catoniae
F0037]
gi|429160458|gb|EKY02921.1| hypothetical protein HMPREF9134_00217 [Porphyromonas catoniae
F0037]
Length = 837
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 177/597 (29%), Positives = 293/597 (49%), Gaps = 54/597 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDV 75
F+ PA+ + +P+GNGRLG + G + + + LNE ++W+G + N DA K L +
Sbjct: 48 FDRPAESMMEELPLGNGRLGMLSDGALRHQRVTLNESSMWSGSIDSLALNRDAAKHLPKI 107
Query: 76 RSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKYAEE 122
R L+ +G++ +A K F P Y++ G + L++
Sbjct: 108 RELLFAGRHKDAEELIYKTFVCGGKGSGQGAGAKVPYGSYEVGGFLHLDWGRD---IPSP 164
Query: 123 TYRRELDLNTATARVKYSV-GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-SL 180
+Y+R LDL + G + +++S V V I + + + L S
Sbjct: 165 SYKRSLDLTYGISTETIETWGQPYRMKTYYTSYTHDVNVITIYNQAISARTDTLRLSLSR 224
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+N + + + + G P + +G+ ++ + + + G + + ++
Sbjct: 225 PENGTSTVSDGLLTLSGDLPNGK---------GGEGLHYAIVAKPYLLHG-GKVISRGNE 274
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L V S + +L+A ++ + NP S P + + + ++ + L H
Sbjct: 275 LLIVNAS--VIQILIAHNTN---YYNPQLS---PIAHGVEQIVKAAGITSAILERDHRAA 326
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGR 358
+ RVS+++ + EN+ P +R++++ D DP+L L QFGR
Sbjct: 327 FSSQMGRVSMRIGKG-------NAKAENL---PIDKRLEAYHKDPQSDPNLASLYMQFGR 376
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLL+SS+R G NLQGIW + W+S H+NINL+MNYW S NLSE PL +
Sbjct: 377 YLLLSSTRKGALPPNLQGIWTNLIQAPWNSDYHLNINLQMNYWPSEKGNLSETVLPLTSW 436
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L +G +TA+ Y GWV H ++W ++ W G AWLC HL+ HY
Sbjct: 437 VEGLLPSGRETARAFYGGKGWVTHILGNVWGFTAPGE-HPSWGATNTGAAWLCQHLFNHY 495
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
YT DR++L +R YP+L+G + F L L+ + ++GYL T P+TSPE+ ++APD + VS
Sbjct: 496 LYTQDREYL-RRIYPILKGASQFFLSTLVRDPNNGYLVTAPTTSPENHYLAPDSSVVAVS 554
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
STMD IIRE+F+ ++A L E + ++++L L PT IA DG IMEW+
Sbjct: 555 AGSTMDNQIIRELFTNTRTSALAL--GERVFADTLVRTLSELMPTTIAPDGRIMEWL 609
>gi|145251710|ref|XP_001397368.1| hypothetical protein ANI_1_1356144 [Aspergillus niger CBS 513.88]
gi|134082905|emb|CAK46741.1| unnamed protein product [Aspergillus niger]
Length = 497
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 168/506 (33%), Positives = 250/506 (49%), Gaps = 51/506 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA +FT +PIGNGRLGA +WG +E + LNE+++W+G + NP + AL VR
Sbjct: 28 YTTPANNFTSTLPIGNGRLGAAIWG-TATENVTLNENSIWSGPFINRVNPRSYDALWPVR 86
Query: 77 SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G E A++ + G P Y LG + L+F H + Y R LDL +
Sbjct: 87 SLLAEGNMTEGNDATLANMVGIPDSPQSYSALGSLVLDF--GHDEAGISNYTRYLDLRSG 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V+Y+ V + RE+ +S+PD V+ ++S SE G L NV+ S L YV NN
Sbjct: 145 MAVVEYTYRAVRYRREYLASHPDNVVAVRLSSSEPGGL--NVA--SSLVRDRYVVSNNAT 200
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+ G + +A +N+ IQF+A + +SD R T + L
Sbjct: 201 LSHD---GGLLTLRAYSNNVSNPIQFTAEARV-VSDGRATSNGTS--------------L 242
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+V ++S FI+ S + E+ A L + + + + + DY L RV
Sbjct: 243 VVRNASTIDIFIDTETSYRYSAQENWEAEIKSKLDTACSSGFVAVKKNAIADYSALAQRV 302
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR 366
+ L S + +P+ R+ +++ D DP LV L+F FGR+ LI+SSR
Sbjct: 303 DLNLG-----------SSGSAGNLPTDSRLVNYRIDPDSDPELVVLMFHFGRHSLIASSR 351
Query: 367 PGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
A NLQG+WN+D P W ++INLEMNYW + NL++ P D L +
Sbjct: 352 ATESPALPANLQGLWNQDFDPAWGGRFTIDINLEMNYWPAEVTNLADTFSPFIDLLDVVH 411
Query: 424 INGSKTAQVNYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G A+ Y S G+V+HH TD+W ++ W +WPMGGAWL +L EHY ++
Sbjct: 412 DRGLDVAESMYHCSNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGGAWLSANLIEHYRFS 471
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI 507
D L R +PLL+ A F +L
Sbjct: 472 RDESILRNRIWPLLQSAARFYYCYLF 497
>gi|359404666|ref|ZP_09197493.1| hypothetical protein HMPREF0673_00698 [Prevotella stercorea DSM
18206]
gi|357560094|gb|EHJ41501.1| hypothetical protein HMPREF0673_00698 [Prevotella stercorea DSM
18206]
Length = 838
Score = 246 bits (628), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 184/601 (30%), Positives = 277/601 (46%), Gaps = 74/601 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
+ ++PIGNG +G V G V +E + NE TLW G P N + + ++R
Sbjct: 75 SQSLPIGNGNIGGNVLGSVEAERITFNEKTLWRGGPNTARGAAYYWDVNKQSAHVVGEIR 134
Query: 77 SLVDSGQYAEATAASVKLFG----HPADV--------YQLLGDIELEFDDSHLKYAEETY 124
G + +A + K F + AD + G+ +E S + + Y
Sbjct: 135 EAFTKGDWQKAELLTRKNFNSVVPYEADAEEPFRFGSFTTAGEFYIETGLSSVGMTD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
RREL L++A A+V + V++ RE+F S+P V+ + + S+ G +L F+ + + +
Sbjct: 193 RRELSLDSALAKVSFCKDGVQYEREYFVSHPANVMAVRFAASQRGKQNLVFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G + + R D ++++ + IK G +S E KL
Sbjct: 253 GEMKADGTDALCWLARL-------------DNNSMEYA--VRIKAVAKGGAVSN-EGGKL 296
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK------DPTSESMSALQSIRNLSYSDLYTR 296
V+ +D V L+ A + + P +P S DP + L Y+ L
Sbjct: 297 TVKDADEVVFLITADTDYK-PNYDPDFSAPKAYVGVDPAQTTADWLAKAATKGYAYLLNE 355
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQ 355
H DY +LF+RV + ++ + D D +P R++++ Q D L +L +Q
Sbjct: 356 HYADYSELFNRVRLNINNATADA----------DDLPVNRRLEAYRQGKPDYYLEQLYYQ 405
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR ANLQG+W+ ++ W H NINL+MNYW + P LSEC+ PL
Sbjct: 406 FGRYLLISSSRADNLPANLQGLWHNNVDGPWRIDYHNNINLQMNYWLACPTGLSECELPL 465
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHL 474
F+F+ L G TA+ + GW +I+ +S + + W P G WL THL
Sbjct: 466 FNFIRTLVKPGRVTAKSYFGTRGWTTSVSGNIFGFTSPLSSEDMSWNFSPFAGPWLATHL 525
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
W +Y++T DR FL Y +L+ A F D+L DG PSTSPEH
Sbjct: 526 WNYYDFTRDRKFLADN-YEILKESADFASDYLWHRADGVYTAAPSTSPEH---------G 575
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAEDGSIME 592
V +T A+IREV + A VL K+ E E LK L P KI G +ME
Sbjct: 576 PVDEGATFAHAVIREVLLDAVEANRVLGKSAKERRQWEDALK---HLAPYKIGRYGQLME 632
Query: 593 W 593
W
Sbjct: 633 W 633
>gi|380696427|ref|ZP_09861286.1| hypothetical protein BfaeM_21066 [Bacteroides faecis MAJ27]
Length = 1014
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 157/494 (31%), Positives = 249/494 (50%), Gaps = 38/494 (7%)
Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGSESGSLS 172
D+ L+ Y R LD++ A V Y G + F RE+F S PD V+V ++ S + G LS
Sbjct: 328 DASLELPYSDYARTLDIDNAIHTVTYKEGGITFKREYFMSYPDHVMVMRLTSDANEGKLS 387
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
+SL+SL + N I M G P K + G++++ L +K + G
Sbjct: 388 RIISLESLHTDKVIAADGNTITMTGY-PTPVSGDKRVGDAWKNGLRYAQQLVVK--NKGG 444
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD------SKKDPTSESMSALQSIR 286
IS ++ KLKVE +D ++L+ A++++ + D S++DP + + L +
Sbjct: 445 KISVVDGAKLKVEDADEIIVLMSAATNY----VQCMDDSYCYFSEEDPLDKVRATLHKVA 500
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED 346
+ Y+ L H DY L+ R+ + L + T D++ + ++
Sbjct: 501 DKKYTSLLAAHQKDYHSLYDRMQLNLGEQLEAPAATT------DSLLKGMDANTNSEQDN 554
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
L L FQFGRYLLISSSR G+ ANLQG+W E L+ W++ H NIN++MNYW + P
Sbjct: 555 QYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLANPWNADYHTNINVQMNYWPTQPT 614
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGKVVW 460
NLS C P+ +++ L G TAQ Y GWV HH+ +IW ++ + K
Sbjct: 615 NLSPCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIWGNTAPAK-KSTP 673
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
+P G W+C +WE+Y + +D+DFL+K +L+ ++ + + DG L NPS
Sbjct: 674 HHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAALFWVDNLWTDERDGTLVANPSH 733
Query: 521 SPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH EF L C + A+I E+F +I A++ L + +D + ++ ++ +L
Sbjct: 734 SPEHGEF-----SLGC-----STSQAMICEMFGMMIKASKELGREKDPEIAEIATAMSKL 783
Query: 580 RPTKIAEDGSIMEW 593
KI G MEW
Sbjct: 784 SGPKIGLGGQFMEW 797
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/63 (49%), Positives = 45/63 (71%), Gaps = 2/63 (3%)
Query: 2 MNAESTSTTNP-LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
M A S P LK T+N PAK++ ++A+PIGNG +GAM++GGV + ++ NE TLW+G
Sbjct: 23 MTACSGQFHQPALKATYNKPAKNWESEALPIGNGYMGAMIFGGVYVDVIQTNEHTLWSGG 82
Query: 60 PGD 62
PG+
Sbjct: 83 PGE 85
>gi|391873203|gb|EIT82265.1| hypothetical protein Ao3042_00536 [Aspergillus oryzae 3.042]
Length = 580
Score = 246 bits (627), Expect = 3e-62, Method: Compositional matrix adjust.
Identities = 174/592 (29%), Positives = 278/592 (46%), Gaps = 88/592 (14%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ P F ++P+GNGRLG ++ +P+E + NED++W+G D N +A VR
Sbjct: 34 YDTPGTRFNASLPVGNGRLGGTLYC-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+L+ +G A ++ + G D YQ+L ++ ++ Y L+ TA
Sbjct: 93 NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQRGDATNLVWYLDTLEGYTA 152
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+Y V +T NG I
Sbjct: 153 ---CEYGFDGVSYT--------------------------------------VANGIASI 171
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+M+ R + F+A + + + D G ++A DK L V G+ V
Sbjct: 172 VMKAR------------TGEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATTVVFF 216
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A SS+ + D +E L + L Y L + D++ L RV++ L
Sbjct: 217 LDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVTLDLG 270
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRPGTQV 371
S D + +P ER+ ++++ D D L+F +GR+LLI+SSR +
Sbjct: 271 SSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRRTRER 320
Query: 372 A---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ LQGIWN+D SP+W + VNINLEMNYW + NL+E PL+D L + G
Sbjct: 321 SLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQERGGD 380
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ + G+V+HH TD+W S +++WPMGGAWL H+ EHY +T D+ FL+
Sbjct: 381 VAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDKTFLK 440
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
++A P+ + F +L + DGYL T PS SPE+ F P GK ++ S T+D
Sbjct: 441 EQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMSPTLD 499
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+++ E+ +A+ ++LE + D L V L ++RP +I DG I+EW++
Sbjct: 500 NSMLFELLTALNETHQILEIDND-LSGSVQTYLGKIRPPRIGSDGQILEWIE 550
>gi|298387491|ref|ZP_06997043.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
gi|298259698|gb|EFI02570.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
Length = 1036
Score = 245 bits (626), Expect = 4e-62, Method: Compositional matrix adjust.
Identities = 163/497 (32%), Positives = 261/497 (52%), Gaps = 35/497 (7%)
Query: 109 ELEFDDS-HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGS 166
EL D S L Y++ Y R LD++ A V Y + F RE+F S PD V+V ++ S S
Sbjct: 341 ELSIDASTELPYSD--YTRTLDIDNAIHTVMYKENGITFKREYFMSYPDNVMVMRLTSDS 398
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
+ G LS +SL+SL + + ++ I M G P K + G++++ L +K
Sbjct: 399 KKGKLSRIISLESLHTDKTITADSHTITMTG-YPTPVSGDKRIGDAWKNGLKYAQQLVVK 457
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSALQS 284
+ G +S ++ KLKVE +D ++L+ A++++ + + S++DP + + L
Sbjct: 458 --NKGGKVSVVDGTKLKVEDADEIIVLMSAATNYVQCMDDSYNYFSQEDPLEKVQATLHK 515
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE-ENIDTVPSAERVKSFQT 343
+ + Y+ L H DY L+ R+ + L P+ V T S + +D ++E+
Sbjct: 516 VADKKYTALLATHQKDYHSLYDRMRLNLGNLPEAPVAPTDSLLKGMDENTNSEQ------ 569
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W++ H NIN++MNYW +
Sbjct: 570 -ENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNADYHTNINIQMNYWPT 628
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGK 457
NLS C P+ +++ L G TAQ Y GWV HH+ +IW ++ + K
Sbjct: 629 QSTNLSPCHLPMVEYVRSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIWGNTAPAK-K 687
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
+P G W+C +WE+Y + +D+DFL+K +L+ ++ + + DG L N
Sbjct: 688 STPHHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAVLFWVDNLWTDERDGTLVAN 747
Query: 518 PSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
PS SPEH EF L C + A+I E+F +I A++ L +++D + ++ ++
Sbjct: 748 PSHSPEHGEF-----SLGC-----STSQAMICEMFDMMIKASKELGRDKDPEIIEIATAM 797
Query: 577 PRLRPTKIAEDGSIMEW 593
+L KI G MEW
Sbjct: 798 SKLSGPKIGLGGQFMEW 814
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 42/63 (66%), Gaps = 1/63 (1%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
MM LK T+N PAK++ ++A+PIGNG +GAM++G V + ++ NE TLW+G
Sbjct: 40 MMACSEQPYQPTLKATYNKPAKNWESEALPIGNGYMGAMIFGDVYVDVIQTNEHTLWSGG 99
Query: 60 PGD 62
PG+
Sbjct: 100 PGE 102
>gi|421276774|ref|ZP_15727594.1| alpha-L-fucosidase, putative, afc95A [Streptococcus mitis SPAR10]
gi|395876055|gb|EJG87131.1| alpha-L-fucosidase, putative, afc95A [Streptococcus mitis SPAR10]
Length = 922
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 188/631 (29%), Positives = 306/631 (48%), Gaps = 101/631 (16%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GD 62
P +++G K A+P+GNG +GA V+G + E ++ NE TLW+G P G+
Sbjct: 125 PTAPSYDGWEKQ---ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGN 181
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLK 118
Y D K LS++R ++ G +A + + P + Y GDI + F++
Sbjct: 182 YQ--DRYKVLSEIRKALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKG 239
Query: 119 YAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---N 174
T Y R LD++ A + Y+ F RE FSS PD V VT +S +L F N
Sbjct: 240 LENVTDYHRGLDISEAISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWN 299
Query: 175 VSLDSLLDNHSY------------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAI 222
+ L+ N Y +N I+++G K N G++F++
Sbjct: 300 SLTEDLIANGDYSWEYSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASY 346
Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM-- 279
L IK G ++A +D L V G+ +A LLL A ++F NP ++ +KD E
Sbjct: 347 LGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLEKTVK 399
Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
S +++ + Y L H+ DYQ LF+RV + L S + T E +
Sbjct: 400 SIVEASKAKDYETLKNNHIKDYQSLFNRVQLNLGGSRSNQTT-------------KEALH 446
Query: 340 SFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLE 397
++ ++ L EL FQ+GRYLLISSSR T ANLQG+WN +PTW+S H+N+NL+
Sbjct: 447 TYNPEKGQKLEELFFQYGRYLLISSSRDRTDALPANLQGVWNAVDNPTWNSDYHLNVNLQ 506
Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTD 446
MNYW + NL+E +P+ +++ + G SK Q N GW++H +
Sbjct: 507 MNYWPAYMNNLAETAKPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQAT 562
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ W P AW+ +++++Y +T D +L+++ YP+L+ A F +L
Sbjct: 563 PFGWTTPGW-NYYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFL 621
Query: 507 I--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKN 564
+ D ++ ++PS SPEH ++ +T D +++ ++F + AA L +
Sbjct: 622 HYDKASDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLNVD 671
Query: 565 EDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+D LV +V +L+P I +DG I EW +
Sbjct: 672 QD-LVTEVKAKFDKLKPLHINQDGRIKEWYE 701
>gi|333029856|ref|ZP_08457917.1| Alpha-L-fucosidase [Bacteroides coprosuis DSM 18011]
gi|332740453|gb|EGJ70935.1| Alpha-L-fucosidase [Bacteroides coprosuis DSM 18011]
Length = 816
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 190/615 (30%), Positives = 287/615 (46%), Gaps = 106/615 (17%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD------YTNPDA------------ 68
++PIGNG GA + G V + + LNE TLW G P Y N +
Sbjct: 62 SLPIGNGSFGANIMGSVSVDRVTLNEKTLWRGGPNTANGASYYWNVNKLSAKYLPIIRQA 121
Query: 69 --PKALSDVRSLVDS---GQYA-EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE- 121
K L VR+L ++ G A E T S FG + LG++ LE + L+ E
Sbjct: 122 FMDKDLDKVRTLTENNFNGLAAYEETDESPFRFGS----FTTLGELYLE---TGLEEKEI 174
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS--------------- 166
Y+R L L++A V + N ++R +F+S PD VIV + +
Sbjct: 175 SDYKRALSLDSAVVNVSFKEKNTMYSRSYFASYPDSVIVIRYTSEQKAKQNIKLFYAPNP 234
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
ES + D +L +N N Q +E +C IP + GI
Sbjct: 235 ESRGVCIKKGSDRILFKRELLNNNQQFALEIKC----IPIGGYYENIENGI--------S 282
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP--SDSKK----DPTSESMS 280
I D +D V +L A++ + F NP SD K P ++
Sbjct: 283 ICD-----------------ADEVVFVLSAATDYQMNF-NPDFSDPKTYVGLPPEIKTSQ 324
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
L + Y+ + HL DYQ LF+RV I L+ S + ++P+ R+
Sbjct: 325 RLLRLNGQDYNQMLNEHLQDYQSLFNRVHIDLN-----------SIHSFSSLPTDLRLAQ 373
Query: 341 FQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
++ + D + EL +Q+GRYLLI+SSR G+ ANLQG+W+ ++ W H NIN++MN
Sbjct: 374 YKEGKLDKAFEELYYQYGRYLLIASSRIGSMPANLQGLWHNNIDGPWRVDYHNNINIQMN 433
Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-V 458
YW + NLSEC PL DF+ L G TAQ Y A GW ++I+ ++ K +
Sbjct: 434 YWPASTANLSECIPPLIDFIKTLVKPGKVTAQSYYNARGWTASISSNIFGFTAPLSSKDM 493
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
W PM G WL TH+W++++YT D DFL++ Y L++ A+F +D+L + +G P
Sbjct: 494 SWNFNPMAGPWLATHVWDYFDYTQDLDFLKETGYELIKESANFAVDYLWKMPNGVYSAAP 553
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
STSPEH + +T A+IR+V S I A+++L +++D E + L
Sbjct: 554 STSPEH---------GPIDQGATFVHAVIRQVLSNAIEASKLLREDDDNRQEWI-AVLNN 603
Query: 579 LRPTKIAEDGSIMEW 593
L P ++ G +MEW
Sbjct: 604 LAPYQVGRYGQLMEW 618
>gi|169624315|ref|XP_001805563.1| hypothetical protein SNOG_15415 [Phaeosphaeria nodorum SN15]
gi|160705148|gb|EAT77080.2| hypothetical protein SNOG_15415 [Phaeosphaeria nodorum SN15]
Length = 792
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 180/606 (29%), Positives = 293/606 (48%), Gaps = 78/606 (12%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ FN P + ++PIGNGR+ A +G E + +NE+++W+G D N + ALS
Sbjct: 26 LYFNTPGSSLSSSLPIGNGRVAAAAYG-TTLERITINENSVWSGQWQDRGNSQSLNALSS 84
Query: 75 VRSLVDSGQYAEATAASV-KLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+R + G + A ++ + G+P Q +++ D H +Y R LD
Sbjct: 85 IRQKLMDGDMSSAGQQTLDAMAGNPQSPKQYHPTVDMTIDFGH-SGTLGSYTRILDTRQG 143
Query: 134 TARVKYSVGNVEFT-----------REHFSSNPDQVIVTKISGSESGSLSFNVSL---DS 179
TA Y +G V +T RE+ +S P V+ ++ +++G L+ +++L +
Sbjct: 144 TAMTTYILGGVNYTLMGAAHLTSFRREYVASYPAGVLAFRMMANQAGKLNVDIALARSQN 203
Query: 180 LLDNHSYVNGN-NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ N + +GN N I ++G GI F+A E ++ D G+IS +
Sbjct: 204 VASNAASSSGNINSITLKGNG----------------GIPFTA--EARVVSDTGSIS-VN 244
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+K + V+G+ + A +S+ S E + L + Y+ + T +
Sbjct: 245 EKTMSVKGATIVDIFFDAETSYR------YGSASAWELELKNKLDNAVKAGYNAVKTAAV 298
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQF 356
D + + RV+I L S + T P R+ +++ + DP LV L F +
Sbjct: 299 KDAEGILSRVNINLG-----------SSGSAGTQPIPSRLSNYKKNAGADPELVTLYFNY 347
Query: 357 GRYLLISSSRPG---TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
GR+LL++SSR + ANLQGIWN++ P W S VNIN EMNYW +L NL E +
Sbjct: 348 GRHLLLASSRDTGDRSLPANLQGIWNDNYDPPWQSKYTVNINTEMNYWHALTTNLDETHK 407
Query: 414 PLFDFLTYLSINGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PLFD + G A+ Y + G+V+HH TD+W ++ P+ T
Sbjct: 408 PLFDLVDMTRAQGRAMAKKMYGCNDGFVVHHNTDLWGDAA-----------PVDKGTPYT 456
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-- 530
HL EHY +T D++FL+ RA+P+L+ A+F +L ++G T PS SPE+ F+ P
Sbjct: 457 HLMEHYRFTQDKNFLQNRAWPVLKDAANFYYCYLFM-YNGSYVTGPSLSPENTFVVPSNM 515
Query: 531 ---GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
GK V + TMD ++ E+F+ +ISA + L D V K L +++ KI
Sbjct: 516 RTAGKTEGVDIAPTMDNELLWELFNNVISAGKALGIT-DITVSKAKDYLSKIKEPKIGSK 574
Query: 588 GSIMEW 593
G ++EW
Sbjct: 575 GQLLEW 580
>gi|418966542|ref|ZP_13518273.1| gram positive anchor [Streptococcus mitis SK616]
gi|383347120|gb|EID25122.1| gram positive anchor [Streptococcus mitis SK616]
Length = 1697
Score = 243 bits (621), Expect = 2e-61, Method: Compositional matrix adjust.
Identities = 185/611 (30%), Positives = 300/611 (49%), Gaps = 88/611 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSQDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 199 ALEDGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLENVTDYHRGLDISE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV--SLDSLL--------D 182
A Y+ F RE FSS PD V VT +S +L F + SL L D
Sbjct: 259 AITTTSYTQDGTSFKRETFSSFPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGQYSRD 318
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N +Y G + G I K D+ G++F++ L IK G ++A +D L
Sbjct: 319 NSNYKKGTISVDSNG------ILLKGTVKDN--GLKFASYLGIKTD---GQVTA-QDGYL 366
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
V+G+ +A LLL A ++F NP ++ +KD E S +++ + Y L H+
Sbjct: 367 TVKGASYATLLLSAKTNFAQ---NPETNYRKDIDVEKTVKSIVEAAKAKDYETLKNDHIK 423
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + L S + T E ++++ + L EL FQ+GRY
Sbjct: 424 DYQSLFNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFFQYGRY 470
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E +P+ +
Sbjct: 471 LLISSSRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETAKPMIN 530
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 531 YIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 585
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEH 524
AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS SPEH
Sbjct: 586 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYSPEH 644
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F + AA L+ ++D LV +V +L+P I
Sbjct: 645 ---------GNITIGNTFDQSLVWQLFHDYMEAANHLKIDQD-LVTEVKAKFDKLKPLHI 694
Query: 585 AEDGSIMEWVQ 595
+DG I EW +
Sbjct: 695 NQDGRIKEWYE 705
>gi|322377414|ref|ZP_08051905.1| fibronectin type III domain protein [Streptococcus sp. M334]
gi|321281614|gb|EFX58623.1| fibronectin type III domain protein [Streptococcus sp. M334]
Length = 803
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 182/614 (29%), Positives = 288/614 (46%), Gaps = 65/614 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEF-DDSHLK 118
D L+++R ++ Y A + + P +Y GDI +EF +
Sbjct: 72 NLQDQYVFLAEIRQDLEKRDYNRAKELAEQHLVGPKTSQYGIYLSFGDIHIEFSNQGKTL 131
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y Y+R+L+++ A A Y F RE F+S PD ++V + + S +L F + L
Sbjct: 132 YQVTDYQRQLNISKALATTSYVYKGTRFEREVFASFPDDLLVQRFTKEGSETLDFTMDLS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + + C I K D+ +QF++ L K G I
Sbjct: 192 LTRDLASDGKYEQEKLDYKECQLDISTSHILMKGRVKDND--LQFASCLAWKTD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
DK +++ G+ +A L LVA + F + K D + +++ + Y+ L
Sbjct: 247 RVWSDK-VQISGASYANLFLVAKTDFAQNPASNYRKKIDLEQQVKDLVETAKEEGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L N D + + +K++++ E L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDLG-------------ANGDISTTDDLLKNYKSQEGQDLEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW S NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPSYVTNLLETA 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A Y +GW++H + W D W
Sbjct: 413 FPVINYIDDLRVYG-RLAAAKYAGIISREGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F D+L + ++PS S
Sbjct: 469 SPASNAWMMQTVYEVYSFYRDQDYLREKIYPMLSETVRFWNDFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L + D L E V + L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDADLLTE-VKEKFDLLNP 578
Query: 582 TKIAEDGSIMEWVQ 595
+I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592
>gi|225019389|ref|ZP_03708581.1| hypothetical protein CLOSTMETH_03342 [Clostridium methylpentosum
DSM 5476]
gi|224947845|gb|EEG29054.1| hypothetical protein CLOSTMETH_03342 [Clostridium methylpentosum
DSM 5476]
Length = 1708
Score = 242 bits (617), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 160/511 (31%), Positives = 259/511 (50%), Gaps = 49/511 (9%)
Query: 96 GHPADVYQLLGDIELEFD-DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
G+ D QL EL FD S + Y+R LDL+ ATA+V+Y++ +V FTRE+F SN
Sbjct: 320 GNTTDGVQL---SELSFDLKSSTGPSYTNYKRTLDLDNATAKVEYTLDDVNFTREYFVSN 376
Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDP 214
PD + +++ + G++S +S+ + + + I M G+ +R
Sbjct: 377 PDNFMAIRLTADQPGAISKAISITTPQSKKTITAEGDTITMTGQPADQR----------E 426
Query: 215 KGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKK 272
G++F+ +IK+ G+++A + + VEG+D +LL+ A +++ + D + +
Sbjct: 427 DGLKFAQ--QIKVVPQGGSMTA-ANGTITVEGADSVLLLMTAGTNYQQCMDDTFDYFTDE 483
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP + ++ Y DL H+ DYQ LF+ + + L +P E+ D +
Sbjct: 484 DPLDAVSQRIATVAAKDYDDLLAAHVADYQSLFNNMKLNLCDAP-------MPEKPTDEL 536
Query: 333 PSAERVKSFQTD---EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
+A ++ + ED L L +QFGRYLLI+SSR G+ ANLQGIW + L+P WD+
Sbjct: 537 LAAYGGRTSNPNTALEDRYLETLYYQFGRYLLIASSRDGSLPANLQGIWADGLNPPWDAD 596
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS------GWVIHH 443
H NIN++MNYW + NL+EC P+ D++ L G TAQ + GW +H
Sbjct: 597 YHTNINVQMNYWLAESTNLTECHLPIVDYINSLVPRGEITAQRYHCTEDGGDVRGWTTYH 656
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
+ +IW ++ + +P GGAW+ +WE Y + D++FL + + L G A F +
Sbjct: 657 ENNIWGNTAPATSSAFY--FPAGGAWMTQDIWEIYAFNQDKEFLAEN-FDTLLGAALFWV 713
Query: 504 DWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
D L+ + DG L ++PS SPEH S + D II + F I AAE L
Sbjct: 714 DNLVTDTRDGTLVSSPSYSPEH---------GPYSLGAACDQGIIWDTFQNTIEAAEALG 764
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ + E + ++ +L +I G MEW
Sbjct: 765 IDTPEIAE-IREAQSKLAGPQIGLAGQFMEW 794
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 46/76 (60%), Gaps = 3/76 (3%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
L+ + PA + +A P+GNG LGAMV+GGV S+ +++NE +LW+G PG N D
Sbjct: 42 LQAFYTKPATDWEKEATPLGNGFLGAMVFGGVESDRIQINEHSLWSGGPGANENYDG--G 99
Query: 72 LSDVRSLVDSGQYAEA 87
+SD + V+ EA
Sbjct: 100 MSDTPAEVNRQNLMEA 115
>gi|419765946|ref|ZP_14292168.1| Gram-positive signal peptide protein, YSIRK family / gram positive
anchor multi-domain protein [Streptococcus mitis SK579]
gi|383354600|gb|EID32158.1| Gram-positive signal peptide protein, YSIRK family / gram positive
anchor multi-domain protein [Streptococcus mitis SK579]
Length = 1662
Score = 241 bits (616), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 183/611 (29%), Positives = 297/611 (48%), Gaps = 88/611 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSQDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 199 ALEDGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLESVTDYHRGLDISE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV--SLDSLL--------D 182
A Y+ F RE FSS PD V VT +S +L F + SL L D
Sbjct: 259 AITTTSYTQDGTSFKRETFSSFPDDVTVTHLSKKGDKNLDFTLWNSLTEDLIANGQYSRD 318
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N +Y G + G I K D+ G++F++ L IK G ++A +D L
Sbjct: 319 NSNYKKGTISVDSNG------ILLKGTVKDN--GLKFASYLGIKTD---GQVTA-QDGYL 366
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK---DPTSESMSALQSIRNLSYSDLYTRHLD 299
V+G+ +A LLL A ++F NP + + D S +++ + Y L H+
Sbjct: 367 TVKGASYATLLLSAKTNFAQ---NPETNYRKDIDVGKTVKSIVEAAKAKDYETLKNDHIK 423
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + L S + T E ++++ + L EL FQ+GRY
Sbjct: 424 DYQSLFNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFFQYGRY 470
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E +P+ +
Sbjct: 471 LLISSSRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETAKPMIN 530
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 531 YIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 585
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEH 524
AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS SPEH
Sbjct: 586 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYSPEH 644
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F + AA L+ ++D LV +V +L+P I
Sbjct: 645 ---------GNITIGNTFDQSLVWQLFHDYMEAANHLKIDQD-LVTEVKAKFDKLKPLHI 694
Query: 585 AEDGSIMEWVQ 595
+DG I EW +
Sbjct: 695 NQDGRIKEWYE 705
>gi|417935794|ref|ZP_12579111.1| gram positive anchor [Streptococcus infantis X]
gi|343402703|gb|EGV15208.1| gram positive anchor [Streptococcus infantis X]
Length = 1764
Score = 241 bits (615), Expect = 7e-61, Method: Compositional matrix adjust.
Identities = 183/617 (29%), Positives = 301/617 (48%), Gaps = 98/617 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 153 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--DRYKVLAEIRK 210
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 211 ALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKGLESVTDYHRGLDISE 270
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSY--- 186
A + Y+ F RE FSS PD V VT +S +L F N + L+ N Y
Sbjct: 271 AISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGDYSWE 330
Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+N I+++G K N G++F++ L IK G ++A
Sbjct: 331 YSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASYLGIKTD---GQVTA- 373
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D L V G+ +A LLL A ++F NP ++ +KD E+ S +++ + Y L
Sbjct: 374 QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLENTVKSIVEAAKAKDYETLK 430
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S + T E ++++ + L EL F
Sbjct: 431 NDHIKDYQSLFNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFF 477
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E
Sbjct: 478 QYGRYLLISSSRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETA 537
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 538 KPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWG 592
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 593 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 651
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + AA L+ ++D LV +V +L
Sbjct: 652 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLKIDQD-LVTEVKAKFNKL 701
Query: 580 RPTKIAEDGSIMEWVQR 596
+P I +DG I EW +
Sbjct: 702 KPLHINQDGRIKEWYEE 718
>gi|307707449|ref|ZP_07643931.1| alpha-fucosidase [Streptococcus mitis NCTC 12261]
gi|307616401|gb|EFN95592.1| alpha-fucosidase [Streptococcus mitis NCTC 12261]
Length = 803
Score = 241 bits (615), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 184/624 (29%), Positives = 291/624 (46%), Gaps = 87/624 (13%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN-- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 16 PASTTYKGWEE---EALPIGNGSLGAKVFGIIGAERIQFNEKSLWSGGPLPDSSDYQGGN 72
Query: 66 -PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKYA 120
D L+++R ++ Y A A L G Y GDI +EF +
Sbjct: 73 LQDQYGFLAEIRQALEKRDYNRAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLS 132
Query: 121 EET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD- 178
+ T Y+R+L+++ A A Y +F RE F+S PD ++V + + + +L F + L
Sbjct: 133 QVTDYQRQLNISKALATTSYVYKGTKFEREAFASFPDNLLVQRFTKEGAETLDFTIELSL 192
Query: 179 --SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
L + Y ++ I+M+GR ND +QF++ L
Sbjct: 193 SRDLASDGKYEEEKSDYKECKLDITDSHILMKGRVKD---------ND----LQFASCLA 239
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
+ G I DK ++ G+ +A L L A + F + K D + ++
Sbjct: 240 WETD---GDIRVWSDKA-QISGASYANLFLAAKTDFAQNPASNYRKKIDLEKQVKDLVEI 295
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
+ Y+ L +RH+ DYQ LF RV + L ++DT + +K+++
Sbjct: 296 AKEKGYAQLKSRHIQDYQALFQRVQLDLG-------------ADVDTSTTDNLLKNYKPQ 342
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
E +L EL FQ+GRYLLISSSR + ANLQG+WN +P W+S H+NINL+MNYW
Sbjct: 343 EGHALEELFFQYGRYLLISSSRDCSDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWP 402
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSS 452
+ NL E P+ +++ L + G + A Y +GW++H + W
Sbjct: 403 AYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPG 461
Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
D W P AW+ ++E Y++ D+D+L ++ YP+L F D+L E
Sbjct: 462 WD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNDFLHEDQQA 518
Query: 513 Y-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEK 571
++PS SPEH +S +T D ++I ++F I AA+ LE + D L E
Sbjct: 519 QRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELELDADLLTE- 568
Query: 572 VLKSLPRLRPTKIAEDGSIMEWVQ 595
V + L P +I + G I EW +
Sbjct: 569 VKEKFDLLNPLQITQSGRIREWYE 592
>gi|354604085|ref|ZP_09022078.1| hypothetical protein HMPREF9450_00993 [Alistipes indistinctus YIT
12060]
gi|353348517|gb|EHB92789.1| hypothetical protein HMPREF9450_00993 [Alistipes indistinctus YIT
12060]
Length = 777
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 174/589 (29%), Positives = 274/589 (46%), Gaps = 101/589 (17%)
Query: 17 FNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
+ PA ++ T+A+P+GNGR+GAM++GG+P E ++ N+ TLWTG
Sbjct: 42 YTRPATNWMTEALPVGNGRIGAMIFGGLPVERIQFNDKTLWTG----------------- 84
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDLNTA 133
S + G YQ GDI ++F + YRRELDL+ A
Sbjct: 85 -STTERG------------------AYQNFGDIFIDFGAAGGNNPRGPVDYRRELDLDDA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A+V Y V +TRE+ +S PD VI + + ++ G + F V +D N I
Sbjct: 126 LAKVVYKADGVTYTREYLASYPDDVIAMRFTANKKGKIGFTVRMDDAHTGGQRTVTGNSI 185
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+ G+ S ++ + ++ GT+ A D L + G+D A LL
Sbjct: 186 TISGKL-----------------TLLSYKAQLTVLNEGGTLQA-GDSTLTLTGADAATLL 227
Query: 254 LVASSSFDGP---FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L A + +D ++ SD K ++ + A Y+ L HLDDY L++R+S+
Sbjct: 228 LSAGTDYDPQSPDYLTRSDWKGKVSTVAARAGSK----GYAALRKAHLDDYHALYNRLSL 283
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ + ++ TD V+ + + DP+ L FQ+GRYL I+SSRPG
Sbjct: 284 NVGNTTPELPTDELF------------VRYSKGEYDPAADVLYFQYGRYLTIASSRPGLD 331
Query: 371 V-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS-INGSK 428
+ +NLQG+WN+ +P W S H NIN++MNYW + P NL+EC EP ++ S ++ S
Sbjct: 332 LPSNLQGLWNDSNTPPWQSDIHSNINVQMNYWPAEPTNLAECHEPFTRYIYNESQLHDSW 391
Query: 429 TAQVNYL-ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
L GW + + +I+ S W AW C H+W+ Y + RD+L
Sbjct: 392 KKMAGELDCGGWALKTQNNIFGYSD-------WNWNRPANAWYCMHVWDKYLFDPQRDYL 444
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA-- 545
E+ AYP+++ F LD LI DG L SPEH + S + A
Sbjct: 445 EQEAYPVMKSACRFWLDRLIVDDDGKLVAPNEWSPEHG-----------PWESGIPYAQQ 493
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
+I ++F+ + A +L ++ A V+++ L RL + G + EW
Sbjct: 494 LIWDLFNNTVRAGRILGTDQ-AFVDQLESKLERLDNGLTVGSWGQLREW 541
>gi|385260919|ref|ZP_10039057.1| gram positive anchor [Streptococcus sp. SK140]
gi|385190192|gb|EIF37641.1| gram positive anchor [Streptococcus sp. SK140]
Length = 1717
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 181/606 (29%), Positives = 298/606 (49%), Gaps = 78/606 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSQDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 199 ALEDGDRQKAKQLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLENVTDYHRGLDISE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSYVNG 189
A Y+ F RE FSS PD V VT ++ +L F N + L+ N Y +
Sbjct: 259 AITTTSYTQDGTSFKRETFSSYPDDVTVTHLTKKGDKTLDFTLWNSLTEDLIANGDY-SW 317
Query: 190 NNQIIMEGRCP--GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N +G I K D+ G++F++ L IK G ++A +D L V G+
Sbjct: 318 ENSKYKQGTVSVDSNGILLKGTVKDN--GLKFASYLGIKTD---GQVTA-QDGYLTVTGA 371
Query: 248 DWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLDDYQKL 304
+A LLL A ++F NP ++ +KD E S +++ + Y L H+ DYQ L
Sbjct: 372 SYATLLLSAKTNF---AQNPKTNYRKDIDVEKTVKSIVEAAKAKDYETLKNDHIKDYQSL 428
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV + L S + T E ++++ + L EL FQ+GRYLLISS
Sbjct: 429 FNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFFQYGRYLLISS 475
Query: 365 SRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E +P+ +++ +
Sbjct: 476 SRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETAKPMINYIDDM 535
Query: 423 SING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
G SK Q N GW++H + + ++ W P AW+
Sbjct: 536 RYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAANAWMM 590
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAP 529
+++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS SPEH
Sbjct: 591 QNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYSPEH----- 644
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
++ +T D +++ ++F + AA L+ +++ LV +V +L+P I +DG
Sbjct: 645 ----GTITIGNTFDQSLVWQLFHDYMEAANHLKVDQN-LVTEVKAKFDKLKPLHINQDGR 699
Query: 590 IMEWVQ 595
I EW +
Sbjct: 700 IKEWYE 705
>gi|419527991|ref|ZP_14067534.1| hypothetical protein SPAR51_0989 [Streptococcus pneumoniae GA17719]
gi|379566144|gb|EHZ31135.1| hypothetical protein SPAR51_0989 [Streptococcus pneumoniae GA17719]
Length = 803
Score = 239 bits (610), Expect = 3e-60, Method: Compositional matrix adjust.
Identities = 180/614 (29%), Positives = 291/614 (47%), Gaps = 65/614 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTNP 66
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 67 DAPKA---LSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
+ L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQNQHNFLAEIRQALEKRDYNRAKELAEQHLVGPKTSQYGTYLSFGDIFIEFSQQGTIL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A A Y+ F RE F+S PD ++V + + S +L F + L
Sbjct: 132 SQVTDYQRQLNVSKALATTSYAYKGTRFEREAFASFPDDLLVQRFTKEGSETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTRDLASDGKYEQEKTDYKECQLDITASHILMKGWVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
DK +++ G+ +A L L A + F + K D + + +++ + Y+ L
Sbjct: 247 RVWSDK-VQISGASYANLFLAAKTDFAQNPASNYRKKIDLEQQVKNLVETAKEKGYARLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L ++DT + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDLG-------------SDVDTSTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQGIWN +P W+S H+NINL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGIWNGVDNPPWNSDYHLNINLQMNYWPAYVTNLLETA 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A Y+ +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAATRYVGIVSREGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y + D+D+L ++ YP+L F +L E + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYLFYRDQDYLREKIYPILRETVRFWNAFLHEDNQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ LE + D L E V + L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELELDADLLTE-VKEKFDLLNP 578
Query: 582 TKIAEDGSIMEWVQ 595
+I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592
>gi|295085851|emb|CBK67374.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 729
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 159/503 (31%), Positives = 250/503 (49%), Gaps = 50/503 (9%)
Query: 101 VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIV 160
+ +G++ +E S + + YRR L L++A A V++ + + R++F S PD V+V
Sbjct: 71 AFTTMGELYVETGLSEINMS--NYRRILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMV 128
Query: 161 TKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ 218
K + + G + +S ++ +H +GN+ ++ G + G++
Sbjct: 129 MKFTADKGGKQNLVLSYCPNNEAKSHLEADGNDGLVYTGVL-------------NNNGMK 175
Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK-----D 273
F+ IK GT+ A E+ ++ V+ +D V LL A + + F K D
Sbjct: 176 FA--FRIKAIHKGGTLKA-ENDRIIVKDADEVVFLLTADTDYKMNFAPDFKDPKAYVGND 232
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P+ +++ + + Y +LY H DY LF+RV +++ E +P
Sbjct: 233 PSQTTLAMMDNALKKGYDELYRNHEADYTALFNRVRFEIN-----------PEIGTPNLP 281
Query: 334 SAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
+ +R+ S++ D L +L +QFGRYLLI+SSRPG ANLQG+W+ + W H
Sbjct: 282 TYKRLASYKKGVPDYQLEQLYYQFGRYLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHN 341
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
NIN++MNYW + P NLSEC PL DF+ L G KTAQ + A GW +I+ ++
Sbjct: 342 NINIQMNYWPACPTNLSECTWPLIDFIRSLVKPGEKTAQAYFNARGWTASISANIFGFTA 401
Query: 453 ADRGK-VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
K + W L P G WL TH+WE+Y+YT D FL++ Y L++ A F +D L D
Sbjct: 402 PLSSKSMAWNLNPTVGPWLATHIWEYYDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPD 461
Query: 512 GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEK 571
G PSTSPEH V T A++RE+ I A++VL DA K
Sbjct: 462 GTYTAAPSTSPEH---------GPVDEGVTFAHAVVREILLDAIQASKVL--GTDAKERK 510
Query: 572 VLKS-LPRLRPTKIAEDGSIMEW 593
++ L +L P +I G ++EW
Sbjct: 511 QWENVLTKLVPYRIGRYGQLLEW 533
>gi|423220535|ref|ZP_17207030.1| hypothetical protein HMPREF1061_03803 [Bacteroides caccae
CL03T12C61]
gi|392623612|gb|EIY17715.1| hypothetical protein HMPREF1061_03803 [Bacteroides caccae
CL03T12C61]
Length = 775
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 191/617 (30%), Positives = 284/617 (46%), Gaps = 113/617 (18%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+ E + T P+++ ++ PA ++ T A+PIGNG LGA+ +GGV SE + NE TLWTG
Sbjct: 21 VAGVEQKTETVPMRLWYDRPATNWMTSALPIGNGELGALFFGGVESEQILFNEKTLWTG- 79
Query: 60 PGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
S G YQ GD+ + FD
Sbjct: 80 -----------------STTTRG------------------AYQKFGDVWIHFDGQE--- 101
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS-LSFNVSLD 178
YRREL L+ A +V Y+ + RE+F+S PD+VIV ++S ++G L+F+VSL
Sbjct: 102 DVREYRRELSLDEAIGKVSYTSAGTHYLREYFASRPDEVIVLRLSTPKAGKKLNFSVSL- 160
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL-------EIKISDDR 231
+GR PG R + GI F L ++K+ ++
Sbjct: 161 ----------------ADGR-PGTRQEVTKD------GILFRRKLDLLSYEAQLKVINEG 197
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNL 288
GT+ A + KL V ++ ++LL A++++D ++ + + A S +
Sbjct: 198 GTLVA-DSNKLCVNAANSVLILLTAATNYDLSSATYVGETSGQLHKRLTDRLARASAK-- 254
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
Y L + HL+DYQ LF+RV L R+ + I +VP+ E V + E
Sbjct: 255 GYDQLKSTHLNDYQSLFNRVRFDL-RTAAKTGGKIGMKTEIPSVPTNELVHLHK--EALY 311
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L L FQ+GRYL+I+SSR NLQGIWN D +P W+ H NIN++MNYW + CNL
Sbjct: 312 LDMLYFQYGRYLMIASSRGMNLSNNLQGIWNGDNAPPWECDIHSNINIQMNYWPAEVCNL 371
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLA-----SGWVIHHKTDIWAKSSADRGKVVWALW 463
SEC EP ++ ++ + Q LA GW ++ + +I+ G W +
Sbjct: 372 SECHEPFIRYIATEALRPGGSWQ--QLARSEGLRGWTVNTQNNIF-------GYTDWNIN 422
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
AW C HLW+HY YT D ++L AYP++ + D L DG L SPE
Sbjct: 423 RPANAWYCMHLWKHYAYTQDINYLRSVAYPVMRSTCEYWFDRLQLTADGVLLAPAEWSPE 482
Query: 524 HEFIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL----VEKVLKSLP 577
H P DG V+Y+ + + ++FS + A VL L V K+ + L
Sbjct: 483 H---GPWEDG----VAYAQQL----VWQLFSETMQAVRVLRGAGIPLDADFVRKLSEKLK 531
Query: 578 RL-RPTKIAEDGSIMEW 593
RL + G I EW
Sbjct: 532 RLDNGVTLGAWGQIREW 548
>gi|167763307|ref|ZP_02435434.1| hypothetical protein BACSTE_01680 [Bacteroides stercoris ATCC
43183]
gi|167698601|gb|EDS15180.1| hypothetical protein BACSTE_01680 [Bacteroides stercoris ATCC
43183]
Length = 657
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 156/481 (32%), Positives = 239/481 (49%), Gaps = 50/481 (10%)
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLL 181
YRREL L++A A V++ V++ R F S P V+V + S +L F+ + + +
Sbjct: 18 YRRELSLDSARAIVQFCKDGVKYKRTSFISYPANVLVMRYSADRPAKQNLRFSYAPNPVS 77
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
G N ++ R D +++ ++ +++ GT++ D+
Sbjct: 78 AGSLQPEGKNGLVFRARL-------------DNNSMEY--VVRMRVLTQGGTVTNTHDQL 122
Query: 242 LKVEGSDWAVLLLVASS----SFDGPFINPSDSKK-DPTSESMSALQSIRNLSYSDLYTR 296
L +EG+D V L+ A + +F+ F NP +P + + Y LY
Sbjct: 123 L-IEGADEVVFLITADTDYLINFNPDFTNPKTYVGVNPEETTAYWINEAEKQGYEALYQA 181
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQ 355
H DY LF+RV + L+ S + +P +R+ ++ + D L +L +Q
Sbjct: 182 HYADYTALFNRVKLNLTNS-----------SDFRDMPITQRLSRYREGQKDFYLEQLYYQ 230
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLI+SSRPG ANLQGIW+ ++ W H NINL+MNYW + NLSEC +PL
Sbjct: 231 FGRYLLIASSRPGNFPANLQGIWHNNVDGPWRVDYHNNINLQMNYWPACSTNLSECMKPL 290
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHL 474
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+
Sbjct: 291 IDFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESENMSWNFNPMAGPWLATHI 350
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WE+Y+YT D FL++ Y L++ A+F +D+L DG PSTSPEH
Sbjct: 351 WEYYDYTRDVKFLKEIGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------G 401
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
V +T A++RE+ I A++VL + E E+VL+ +L P KI G +ME
Sbjct: 402 PVDQGATFVHAVVREILLDAIDASKVLRVDAKERKYWEQVLE---KLVPYKIGRYGQLME 458
Query: 593 W 593
W
Sbjct: 459 W 459
>gi|444414515|ref|ZP_21210772.1| hypothetical protein PNI0199_00487 [Streptococcus pneumoniae
PNI0199]
gi|444281657|gb|ELU86965.1| hypothetical protein PNI0199_00487 [Streptococcus pneumoniae
PNI0199]
Length = 803
Score = 238 bits (607), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 178/614 (28%), Positives = 289/614 (47%), Gaps = 65/614 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALSANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWVQ 595
+I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592
>gi|225016842|ref|ZP_03706034.1| hypothetical protein CLOSTMETH_00754 [Clostridium methylpentosum
DSM 5476]
gi|224950383|gb|EEG31592.1| hypothetical protein CLOSTMETH_00754 [Clostridium methylpentosum
DSM 5476]
Length = 1957
Score = 238 bits (607), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 178/638 (27%), Positives = 312/638 (48%), Gaps = 77/638 (12%)
Query: 4 AESTSTTNPLKITFNGPAKH-----FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
AE++ N L++ + PA T+++PIGNG +G+ V+GGV E L LNE TLW+G
Sbjct: 37 AEASVNDNDLRLWYTSPAPDTYNGWMTNSLPIGNGYMGSNVFGGVGRERLSLNEKTLWSG 96
Query: 59 VPG---DYTNPDAP------KALSDVRSLVDSGQYAEATAASVKLFGHPAD-------VY 102
P DY + + + ++ G + A + +L G D Y
Sbjct: 97 GPAEGRDYNGGNLESRGKNGETMKQIQQAFAEGNTSLANSLCNQLTGLSDDGGTQGYGYY 156
Query: 103 QLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTK 162
G++ LEF A+ Y R+LD+ TA A V Y V + RE+F+S PD ++V +
Sbjct: 157 LSYGNMYLEFPGMSDGNAQN-YVRDLDMKTAIASVNYDYDGVNYNREYFTSYPDNMMVAR 215
Query: 163 ISGSESGSLSFNVSLDSLLDNHS------YVNGNNQIIMEGRCPGKRIPPKANANDDPKG 216
++ SE+G L+FN+S++ DN S N Q G I + +D+
Sbjct: 216 LTASEAGKLTFNLSVNP--DNTSGKGQGPNTNNGYQRTWIQTADGGLITIQGQLSDNQ-- 271
Query: 217 IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG--PFINPSDSKKDP 274
++F++ + K+ + GT+ ED + V G+D V+L+ + +D P + +
Sbjct: 272 LKFAS--QTKVLNTGGTLVDNEDGTVSVTGADEVVILMTMGTDYDDNYPVYRTGQTDAEL 329
Query: 275 TSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPS 334
++ + + L Y L HL DYQ +F RV + L + I +P+
Sbjct: 330 LADIQGRIDAATELGYEGLLKSHLADYQGIFDRVHLDLGQE-------------ISQIPT 376
Query: 335 AERVKSFQTDED-PSLVE----LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
+ + +++ + P+L + LL+Q+GRYL I+SSR G+ +NLQG+W + W S
Sbjct: 377 NQLLTNYKNGSNTPALNQALEVLLYQYGRYLTIASSREGSLPSNLQGVWTGANNSPWHSD 436
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWV 440
H+N+NL+MNYW + N++EC PL +++ L G TA++ Y +G++
Sbjct: 437 YHMNVNLQMNYWPTYSTNMAECAIPLIEYVDALRAPGRVTAKI-YAGIESTEENPENGFM 495
Query: 441 IHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCAS 500
H + + + + W P W+ + WE+Y YT D D++++ YP+L+ A
Sbjct: 496 AHTQNNPYGWTCPGW-SFDWGWSPAATPWIIQNCWEYYEYTGDLDYMKENIYPMLKEEAR 554
Query: 501 FLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
LIE + G L +P+ SPEH + +T + ++I ++F+ I A +
Sbjct: 555 LYEQMLIEDPETGKLVCSPAYSPEH---------GPRTNGNTYEQSLIWQLFTDAIIAGK 605
Query: 560 VLEKNEDALVEKVLKSLPRLR-PTKIAEDGSIMEWVQR 596
++++++ A ++K + + L+ P +I + G I EW +
Sbjct: 606 LVDEDQ-ATLDKWQEIIDNLKGPIEIGDSGQIKEWYEE 642
>gi|335029650|ref|ZP_08523157.1| hypothetical protein HMPREF9967_1785 [Streptococcus infantis
SK1076]
gi|334268947|gb|EGL87379.1| hypothetical protein HMPREF9967_1785 [Streptococcus infantis
SK1076]
Length = 806
Score = 238 bits (607), Expect = 7e-60, Method: Compositional matrix adjust.
Identities = 183/622 (29%), Positives = 304/622 (48%), Gaps = 83/622 (13%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GD 62
P +++G K A+P+GNG +GA ++G + E ++ NE TLW+G P G+
Sbjct: 14 PTAPSYDGWEKQ---ALPVGNGEMGAKIFGLIGEERIQYNEKTLWSGGPQLDSTDYNGGN 70
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLK 118
Y D K L+++R +++G +A + + P + Y GDI + F++
Sbjct: 71 YQ--DRYKVLAEIRKALEAGDRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKG 128
Query: 119 YAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---N 174
T Y R+LD+ A YS F RE FSS PD V VT +S +L F N
Sbjct: 129 LENVTDYHRDLDITEAITTTSYSQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWN 188
Query: 175 VSLDSLLDNHSY---VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
++LL N Y + Q + G I K D+ G++F++ L IK
Sbjct: 189 SLTENLLANGDYSWEYSNYKQGAVTTDSNG--ILLKGTVKDN--GLKFASYLGIKTD--- 241
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNL 288
G ++A +D L V G+ +A LLL +++ NP ++ +KD E+ S +++ +
Sbjct: 242 GQVTA-QDGYLTVTGASYATLLLSVKTNYAQ---NPKTNYRKDIDVENTVKSIVEAAKAK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
Y L H+ DYQ LF+RV + L N + + E ++++ +
Sbjct: 298 DYETLKNNHIKDYQSLFNRVQLNLGG-------------NKSSQTTKEALQTYDPTKGQQ 344
Query: 349 LVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
L EL FQ+GRYLLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW +
Sbjct: 345 LEELFFQYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMN 404
Query: 407 NLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADR 455
NL+E +P+ +++ + G SK Q N GW++H + + ++
Sbjct: 405 NLAETAKPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW 460
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGY 513
W P AW+ +++++Y +T D +L+++ YP+L+ F +L + D +
Sbjct: 461 -NYYWGWSPAANAWMMQNVYDYYKFTKDEAYLKEKIYPMLKETTKFWNSFLHYDKSSDRW 519
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
+ ++PS SPEH ++ +T D +++ ++F + AA L ++D LV +V
Sbjct: 520 V-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLNVDQD-LVTEVK 568
Query: 574 KSLPRLRPTKIAEDGSIMEWVQ 595
+L+P I +DG I EW +
Sbjct: 569 AKFDKLKPLHINQDGRIKEWYE 590
>gi|444387033|ref|ZP_21185059.1| hypothetical protein PCS125219_00445 [Streptococcus pneumoniae
PCS125219]
gi|444389242|ref|ZP_21187159.1| hypothetical protein PCS70012_00258 [Streptococcus pneumoniae
PCS70012]
gi|444393004|ref|ZP_21190665.1| hypothetical protein PCS81218_01475 [Streptococcus pneumoniae
PCS81218]
gi|444400918|ref|ZP_21198254.1| hypothetical protein PNI0007_02076 [Streptococcus pneumoniae
PNI0007]
gi|444418365|ref|ZP_21214349.1| hypothetical protein PNI0360_01769 [Streptococcus pneumoniae
PNI0360]
gi|444419893|ref|ZP_21215727.1| hypothetical protein PNI0427_00756 [Streptococcus pneumoniae
PNI0427]
gi|444254243|gb|ELU60689.1| hypothetical protein PCS125219_00445 [Streptococcus pneumoniae
PCS125219]
gi|444257842|gb|ELU64175.1| hypothetical protein PCS70012_00258 [Streptococcus pneumoniae
PCS70012]
gi|444262591|gb|ELU68882.1| hypothetical protein PCS81218_01475 [Streptococcus pneumoniae
PCS81218]
gi|444264795|gb|ELU70844.1| hypothetical protein PNI0007_02076 [Streptococcus pneumoniae
PNI0007]
gi|444281712|gb|ELU87019.1| hypothetical protein PNI0360_01769 [Streptococcus pneumoniae
PNI0360]
gi|444285998|gb|ELU91006.1| hypothetical protein PNI0427_00756 [Streptococcus pneumoniae
PNI0427]
Length = 803
Score = 238 bits (606), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 178/614 (28%), Positives = 289/614 (47%), Gaps = 65/614 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWVQ 595
+I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592
>gi|221232393|ref|YP_002511546.1| hypothetical protein SPN23F_16560 [Streptococcus pneumoniae ATCC
700669]
gi|225857271|ref|YP_002738782.1| alpha-fucosidase [Streptococcus pneumoniae P1031]
gi|298254439|ref|ZP_06978025.1| alpha-fucosidase [Streptococcus pneumoniae str. Canada MDR_19A]
gi|298503399|ref|YP_003725339.1| alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
gi|410477028|ref|YP_006743787.1| hypothetical protein HMPREF1038_01639 [Streptococcus pneumoniae
gamPNI0373]
gi|415700118|ref|ZP_11457832.1| fibronectin type III domain protein [Streptococcus pneumoniae
459-5]
gi|415752860|ref|ZP_11479842.1| fibronectin type III domain protein [Streptococcus pneumoniae SV36]
gi|418079078|ref|ZP_12716300.1| fibronectin type III domain protein [Streptococcus pneumoniae
4027-06]
gi|418081275|ref|ZP_12718485.1| fibronectin type III domain protein [Streptococcus pneumoniae
6735-05]
gi|418083460|ref|ZP_12720657.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44288]
gi|418123978|ref|ZP_12760909.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44378]
gi|418128522|ref|ZP_12765415.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP170]
gi|418178700|ref|ZP_12815283.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41565]
gi|419427712|ref|ZP_13967893.1| fibronectin type III domain protein [Streptococcus pneumoniae
5652-06]
gi|419436452|ref|ZP_13976539.1| fibronectin type III domain protein [Streptococcus pneumoniae
8190-05]
gi|419450962|ref|ZP_13990948.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP02]
gi|419473709|ref|ZP_14013558.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13430]
gi|444394527|ref|ZP_21192078.1| hypothetical protein PNI0002_00522 [Streptococcus pneumoniae
PNI0002]
gi|444398107|ref|ZP_21195590.1| hypothetical protein PNI0006_01690 [Streptococcus pneumoniae
PNI0006]
gi|444402905|ref|ZP_21200052.1| hypothetical protein PNI0008_01502 [Streptococcus pneumoniae
PNI0008]
gi|444404353|ref|ZP_21201309.1| hypothetical protein PNI0009_00413 [Streptococcus pneumoniae
PNI0009]
gi|444407726|ref|ZP_21204393.1| hypothetical protein PNI0010_01147 [Streptococcus pneumoniae
PNI0010]
gi|444409151|ref|ZP_21205749.1| hypothetical protein PNI0076_00189 [Streptococcus pneumoniae
PNI0076]
gi|444412799|ref|ZP_21209118.1| hypothetical protein PNI0153_01181 [Streptococcus pneumoniae
PNI0153]
gi|444422007|ref|ZP_21217672.1| hypothetical protein PNI0446_00358 [Streptococcus pneumoniae
PNI0446]
gi|220674854|emb|CAR69429.1| conserved hypothetical protein [Streptococcus pneumoniae ATCC
700669]
gi|225726032|gb|ACO21884.1| alpha-fucosidase [Streptococcus pneumoniae P1031]
gi|298238994|gb|ADI70125.1| possible alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
gi|353746605|gb|EHD27265.1| fibronectin type III domain protein [Streptococcus pneumoniae
4027-06]
gi|353752014|gb|EHD32645.1| fibronectin type III domain protein [Streptococcus pneumoniae
6735-05]
gi|353754680|gb|EHD35292.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44288]
gi|353795798|gb|EHD76144.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44378]
gi|353799021|gb|EHD79344.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP170]
gi|353842759|gb|EHE22805.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41565]
gi|379550873|gb|EHZ15969.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13430]
gi|379612891|gb|EHZ77606.1| fibronectin type III domain protein [Streptococcus pneumoniae
8190-05]
gi|379617905|gb|EHZ82585.1| fibronectin type III domain protein [Streptococcus pneumoniae
5652-06]
gi|379622667|gb|EHZ87301.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP02]
gi|381308507|gb|EIC49350.1| fibronectin type III domain protein [Streptococcus pneumoniae SV36]
gi|381314814|gb|EIC55580.1| fibronectin type III domain protein [Streptococcus pneumoniae
459-5]
gi|406369973|gb|AFS43663.1| hypothetical protein HMPREF1038_01639 [Streptococcus pneumoniae
gamPNI0373]
gi|444259769|gb|ELU66078.1| hypothetical protein PNI0002_00522 [Streptococcus pneumoniae
PNI0002]
gi|444260764|gb|ELU67072.1| hypothetical protein PNI0006_01690 [Streptococcus pneumoniae
PNI0006]
gi|444265666|gb|ELU71662.1| hypothetical protein PNI0008_01502 [Streptococcus pneumoniae
PNI0008]
gi|444271322|gb|ELU77073.1| hypothetical protein PNI0010_01147 [Streptococcus pneumoniae
PNI0010]
gi|444274038|gb|ELU79693.1| hypothetical protein PNI0153_01181 [Streptococcus pneumoniae
PNI0153]
gi|444276986|gb|ELU82513.1| hypothetical protein PNI0009_00413 [Streptococcus pneumoniae
PNI0009]
gi|444280076|gb|ELU85452.1| hypothetical protein PNI0076_00189 [Streptococcus pneumoniae
PNI0076]
gi|444288631|gb|ELU93522.1| hypothetical protein PNI0446_00358 [Streptococcus pneumoniae
PNI0446]
Length = 803
Score = 238 bits (606), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 178/614 (28%), Positives = 289/614 (47%), Gaps = 65/614 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWVQ 595
+I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592
>gi|419767010|ref|ZP_14293181.1| alpha-L-fucosidase [Streptococcus mitis SK579]
gi|383353528|gb|EID31137.1| alpha-L-fucosidase [Streptococcus mitis SK579]
Length = 803
Score = 238 bits (606), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 284/610 (46%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSFDYQGGNLQDQYSFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GD+ +EF + T Y+R+L+++ A
Sbjct: 87 LEKRDYNTAKELAEQHLVGPKTSQYGTYLSFGDLLIEFSRQGKTLFQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNG- 189
A Y+ F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LATTSYAYKGTMFKREAFASFPDDLLVQRFTKEGAETLDFTIELSLTRDLTSDEKYEQKK 206
Query: 190 -----------NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ++F+ L + G I
Sbjct: 207 SDYKECQLEITDSHILMKGRVK-------------DNNLRFAGCLAWQTD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
DK +++ G+ +A L L A + F + K D + +++ + Y+ L +RH+
Sbjct: 251 DK-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLVQQVRDLVETAKEKGYAQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DYQ LF RV + L ++DT + + +K+++ E +L EL FQ+GR
Sbjct: 310 QDYQALFQRVQLDLG-------------ADVDTSTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQGIWN +P W+S H+NINL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGIWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A Y +GW++H + W D W P
Sbjct: 417 NYIDDLRVYG-RLAAARYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F D+L E ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNDFLHEDRQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L + D L E V + L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGMDADLLTE-VKEKFDLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|83765422|dbj|BAE55565.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 546
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 172/588 (29%), Positives = 274/588 (46%), Gaps = 88/588 (14%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ P F ++P+GNGRLG ++ +P+E + NED++W+G D N +A VR
Sbjct: 34 YDTPGTRFNASLPVGNGRLGGTLYC-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+L+ +G A ++ + G D YQ+L ++ ++ Y L+ TA
Sbjct: 93 NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQRGDATNLVWYLDTLEGYTA 152
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+Y V +T NG I
Sbjct: 153 ---CEYGFDGVSYT--------------------------------------VANGIASI 171
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+M+ R + F+A + + + D G ++A DK L V G+ V
Sbjct: 172 VMKAR------------TGEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATTVVFF 216
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A SS+ + D +E L + L Y L + D++ L RV++ L
Sbjct: 217 LDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVTLDLG 270
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRPGTQV 371
S D + +P ER+ ++++ D D L+F +GR+LLI+SSR +
Sbjct: 271 SSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRRTRER 320
Query: 372 A---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ LQGIWN+D SP+W + VNINLEMNYW + NL+E PL+D L + G
Sbjct: 321 SLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQERGGD 380
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ + G+V+HH TD+W S +++WPMGGAWL H+ EHY +T D+ FL+
Sbjct: 381 VAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDKTFLK 440
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
++A P+ + F +L + DGYL T PS SPE+ F P GK ++ S T+D
Sbjct: 441 EQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMSPTLD 499
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+++ E+ +A+ ++LE + D L V L ++RP +I DG I+
Sbjct: 500 NSMLFELLTALNETHQILEIDND-LSGSVQTYLGKIRPPRIGSDGQIL 546
>gi|322387111|ref|ZP_08060722.1| alpha-L-fucosidase [Streptococcus infantis ATCC 700779]
gi|321142098|gb|EFX37592.1| alpha-L-fucosidase [Streptococcus infantis ATCC 700779]
Length = 1840
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 181/616 (29%), Positives = 298/616 (48%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 230 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--DRYKVLAEIRK 287
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 288 ALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKGLENVTDYHRGLDISE 347
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSY--- 186
A + Y+ F RE FSS PD V VT +S +L F N + L+ N Y
Sbjct: 348 AISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGDYSWE 407
Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+N I+++G K N G++F++ L IK G ++A
Sbjct: 408 YSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASYLGIKTD---GQVTA- 450
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D L V G+ +A LLL A ++F NP ++ +KD E + +++ + Y L
Sbjct: 451 QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLEKTVKNIVETAKAKGYEKLK 507
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + S T E + ++ ++ L EL F
Sbjct: 508 EDHVKDYQSLFNRVQLNFGGSKSSQTT-------------KEALHTYNPEKGQKLEELFF 554
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E
Sbjct: 555 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMNNLAETA 614
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 615 KPMVNYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWG 669
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 670 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKSSDRWV-SSPS 728
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + AA L+ ++D LV +V +L
Sbjct: 729 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLKVDQD-LVTEVKAKFDKL 778
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I +DG I EW +
Sbjct: 779 KPLHINQDGRIKEWYE 794
>gi|418101115|ref|ZP_12738198.1| hypothetical protein SPAR128_1534 [Streptococcus pneumoniae
7286-06]
gi|418183181|ref|ZP_12819739.1| hypothetical protein SPAR78_1589 [Streptococcus pneumoniae GA43380]
gi|418196314|ref|ZP_12832790.1| hypothetical protein SPAR103_1510 [Streptococcus pneumoniae
GA47688]
gi|418223856|ref|ZP_12850496.1| hypothetical protein SPAR127_1557 [Streptococcus pneumoniae
5185-06]
gi|419447313|ref|ZP_13987318.1| fibronectin type III domain protein [Streptococcus pneumoniae
7879-04]
gi|353770615|gb|EHD51127.1| hypothetical protein SPAR128_1534 [Streptococcus pneumoniae
7286-06]
gi|353848164|gb|EHE28181.1| hypothetical protein SPAR78_1589 [Streptococcus pneumoniae GA43380]
gi|353860325|gb|EHE40270.1| hypothetical protein SPAR103_1510 [Streptococcus pneumoniae
GA47688]
gi|353878654|gb|EHE58484.1| hypothetical protein SPAR127_1557 [Streptococcus pneumoniae
5185-06]
gi|379614853|gb|EHZ79563.1| fibronectin type III domain protein [Streptococcus pneumoniae
7879-04]
Length = 778
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 180/625 (28%), Positives = 294/625 (47%), Gaps = 87/625 (13%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M+GR ND ++F++ L
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRV---------KDND----LRFASYL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ G I D+ +++ G+ +A L L A + F + K D + + +
Sbjct: 239 AWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVD 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ + Y+ L +RH++DYQ LF RV + L E N+D + + +K+++
Sbjct: 295 TAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW
Sbjct: 342 QEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
+ NL E P+ +++ L + G + A V Y +GW++H + W
Sbjct: 402 PAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAP 460
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
D W P AW+ ++E Y++ D+D+L ++ YP+L F +L +
Sbjct: 461 GWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQ 517
Query: 512 GY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
++PS SPEH +S +T D ++I ++F I AA+ L +ED L E
Sbjct: 518 AQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE 568
Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
KS L P +I + G I EW +
Sbjct: 569 VKEKS-DLLNPLQITQSGRIREWYE 592
>gi|419844091|ref|ZP_14367392.1| gram positive anchor [Streptococcus infantis ATCC 700779]
gi|385702207|gb|EIG39356.1| gram positive anchor [Streptococcus infantis ATCC 700779]
Length = 1757
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 181/617 (29%), Positives = 298/617 (48%), Gaps = 98/617 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 147 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--DRYKVLAEIRK 204
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 205 ALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKGLENVTDYHRGLDISE 264
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSY--- 186
A + Y+ F RE FSS PD V VT +S +L F N + L+ N Y
Sbjct: 265 AISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGDYSWE 324
Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+N I+++G K N G++F++ L IK G ++A
Sbjct: 325 YSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASYLGIKTD---GQVTA- 367
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D L V G+ +A LLL A ++F NP ++ +KD E + +++ + Y L
Sbjct: 368 QDGYLTVTGASYATLLLSAKTNF---AQNPKTNYRKDIDLEKTVKNIVETAKAKGYEKLK 424
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + S T E + ++ ++ L EL F
Sbjct: 425 EDHVKDYQSLFNRVQLNFGGSKSSQTT-------------KEALHTYNPEKGQKLEELFF 471
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E
Sbjct: 472 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMNNLAETA 531
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 532 KPMVNYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWG 586
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 587 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKSSDRWV-SSPS 645
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + AA L+ ++D LV +V +L
Sbjct: 646 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLKVDQD-LVTEVKAKFDKL 695
Query: 580 RPTKIAEDGSIMEWVQR 596
+P I +DG I EW +
Sbjct: 696 KPLHINQDGRIKEWYEE 712
>gi|419425597|ref|ZP_13965793.1| hypothetical protein SPAR131_1527 [Streptococcus pneumoniae
7533-05]
gi|379619058|gb|EHZ83732.1| hypothetical protein SPAR131_1527 [Streptococcus pneumoniae
7533-05]
Length = 778
Score = 237 bits (605), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 180/625 (28%), Positives = 294/625 (47%), Gaps = 87/625 (13%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M+GR ND ++F++ L
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRV---------KDND----LRFASYL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ G I D+ +++ G+ +A L L A + F + K D + + +
Sbjct: 239 AWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVD 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ + Y+ L +RH++DYQ LF RV + L E N+D + + +K+++
Sbjct: 295 TAKEEGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW
Sbjct: 342 QEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
+ NL E P+ +++ L + G + A V Y +GW++H + W
Sbjct: 402 PAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAP 460
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
D W P AW+ ++E Y++ D+D+L ++ YP+L F +L +
Sbjct: 461 GWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQ 517
Query: 512 GY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
++PS SPEH +S +T D ++I ++F I AA+ L +ED L E
Sbjct: 518 AQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE 568
Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
KS L P +I + G I EW +
Sbjct: 569 VKEKS-DLLNPLQITQSGRIREWYE 592
>gi|417846683|ref|ZP_12492676.1| hypothetical protein HMPREF9958_1458 [Streptococcus mitis SK1073]
gi|339458316|gb|EGP70859.1| hypothetical protein HMPREF9958_1458 [Streptococcus mitis SK1073]
Length = 803
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 179/625 (28%), Positives = 293/625 (46%), Gaps = 87/625 (13%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF +
Sbjct: 72 NLQDQYAFLAEIRQALEKRDYNTAKELAEQHLVGPKTSQYGTYLSFGDIHIEFSNQGKTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A A Y +F RE F+S PD +V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALATTSYVYKGTKFERETFASFPDDFLVQRFTKEGAETLDFTIELS 191
Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M+GR ND +QF++ L
Sbjct: 192 LSRDLASDGKYEQEKSDYKECKLDITDSYILMKGRV---------KDND----LQFASYL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ G I DK +++ G+ +A L L A + F + K D + +
Sbjct: 239 AWETD---GDIRVWSDK-VQISGASYANLFLAAKTDFAQNPASNYRKKIDLEQQVKDLVD 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ + Y+ L +RH++DYQ LF RV + L ++DT + + +K+++
Sbjct: 295 TAKEKGYAQLKSRHIEDYQALFQRVQLDLG-------------ADVDTSTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
E +L E+ FQ+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNYW
Sbjct: 342 QEGQALEEMFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
+ NL E P+ +++ L + G + A Y +GW++H + W
Sbjct: 402 PAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAP 460
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
D W P AW+ ++E Y++ D+D+L ++ YP+L F +L +
Sbjct: 461 GWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQ 517
Query: 512 -GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
++PS SPEH +S ++ D ++I ++F I AA+ L +ED L E
Sbjct: 518 VQRWVSSPSYSPEH---------GPISIGNSYDQSLIWQLFHDFIQAAQELSLDEDLLTE 568
Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
V + L P +I + G I EW +
Sbjct: 569 -VKEKFDLLNPLQITQSGRIREWYE 592
>gi|419445162|ref|ZP_13985177.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA19923]
gi|379572855|gb|EHZ37812.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA19923]
Length = 778
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 180/625 (28%), Positives = 294/625 (47%), Gaps = 87/625 (13%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M+GR ND ++F++ L
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRV---------KDND----LRFASYL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ G I D+ +++ G+ +A L L A + F + K D + + +
Sbjct: 239 AWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVD 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ + Y+ L +RH++DYQ LF RV + L E N+D + + +K+++
Sbjct: 295 TAKEEGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW
Sbjct: 342 QEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
+ NL E P+ +++ L + G + A V Y +GW++H + W
Sbjct: 402 PAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAP 460
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
D W P AW+ ++E Y++ D+D+L ++ YP+L F +L +
Sbjct: 461 GWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQ 517
Query: 512 GY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
++PS SPEH +S +T D ++I ++F I AA+ L +ED L E
Sbjct: 518 AQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE 568
Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
KS L P +I + G I EW +
Sbjct: 569 VKEKS-DLLNPLQITQSGRIREWYE 592
>gi|289167478|ref|YP_003445747.1| hypothetical protein smi_0630 [Streptococcus mitis B6]
gi|288907045|emb|CBJ21879.1| conserved hypothetical protein [Streptococcus mitis B6]
Length = 803
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 181/623 (29%), Positives = 288/623 (46%), Gaps = 83/623 (13%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEF-DDSHLK 118
D L+D+R ++ Y + + P Y GDI +EF +
Sbjct: 72 NLQDQHNFLTDIRQALEKRDYNRTKELAEQHLVGPKTSQYGTYLSFGDIHIEFSNQGKTL 131
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y Y+R+L+++ A A Y +F RE F+S PD ++V + + +L F + L
Sbjct: 132 YQVTDYQRQLNISKALATASYVYKGTKFERETFASFPDDLLVQRYTKEGLETLDFTIELS 191
Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M+GR ND +QF++ L
Sbjct: 192 LTHDLASDGKYEQEKSDYKECQLDISDSYILMKGRV---------KDND----LQFTSCL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ D S K+++ G+ +A L L A + F + K D + ++
Sbjct: 239 AWETDGDIRVWS----NKVQISGASYANLFLAAKTDFAQNPASNYRKKIDLEKQVKDLVE 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ Y+ L +RH+ DYQ LF RV + L ++DT + + +K+++
Sbjct: 295 IAKEKGYAQLKSRHIQDYQALFQRVQLDLG-------------ADVDTSTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
E L EL FQ+GRYLLISSSR P ANLQGIWN +P W+S H+NINL+MNYW
Sbjct: 342 QEGQVLEELFFQYGRYLLISSSRDCPDALPANLQGIWNAVDNPPWNSDYHLNINLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDIWAKSSA 453
+ NL E P+ +++ L + G + A Y +GW++H + + +A
Sbjct: 402 PAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFG-WTA 459
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
W P AWL ++E Y++ D+D+L ++ YP+L F D+L E
Sbjct: 460 PGWNYYWGWSPAANAWLMQTVYEAYSFYSDQDYLREKIYPMLRETVYFWNDFLHEDRQAQ 519
Query: 514 -LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
++PS SPEH +S +T D ++I ++F I AA+ L + D L E V
Sbjct: 520 RWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDGDLLTE-V 569
Query: 573 LKSLPRLRPTKIAEDGSIMEWVQ 595
+ L P ++ + G I EW +
Sbjct: 570 KEKFDLLNPLQLTQSGRIREWYE 592
>gi|332982836|ref|YP_004464277.1| alpha-L-fucosidase [Mahella australiensis 50-1 BON]
gi|332700514|gb|AEE97455.1| Alpha-L-fucosidase [Mahella australiensis 50-1 BON]
Length = 816
Score = 237 bits (604), Expect = 1e-59, Method: Compositional matrix adjust.
Identities = 175/597 (29%), Positives = 282/597 (47%), Gaps = 52/597 (8%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + DAIP GNG +GA+V+G + +E + LN + L+ N + LS +R ++
Sbjct: 13 PAIRWQDAIPCGNGSIGALVYGHIKNEIITLNHEALFLKSQKPQIN-SIYEYLSQLRKML 71
Query: 80 DSGQYAEATAASVKLFGH------PADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
G+Y E + D YQ DI++ DS A Y R LD T
Sbjct: 72 MEGKYNEGAQFFERKLKENYIGIARTDPYQPAFDIKI---DSETHEAFTGYCRYLDFETG 128
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V++S GN + R+ F S D ++ +I+ S ++ +SL V G +
Sbjct: 129 EAVVRWSEGNTNYHRDLFVSRVDDAVILRINAVGSEKVNCVISLVP-----CRVEGATGM 183
Query: 194 IMEGRCPGKRIPPKANANDD----------PKGIQFSAILEIKISDDRGTISALEDKKLK 243
G ++P + A+ + P G +F + + ++ G + +E +
Sbjct: 184 GSGKDVKGDKLPFEWQASSEENWISFEAQYPDGNEFGGVARLIVNG--GCMEGIEAQNNC 241
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ D +L++ F+N K T E+ + ++ Y L ++H+ +++
Sbjct: 242 IYIKDATEVLMMVKV-----FVN---EKSKTTIENTKSQLEKMDVCYEALLSKHVYQHRE 293
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L+ RV+I+ +D + E + ++S+ +L++ +F FGRYLLIS
Sbjct: 294 LYKRVNIEFHEQREDKLAKQKFNEEL-------LLESYNGQIPTALIQRMFYFGRYLLIS 346
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG ANLQGIWN D P W S H + N+EMNYW +LP NL E P FD+ +
Sbjct: 347 SSRPGGLPANLQGIWNGDYVPAWASDYHNDENIEMNYWAALPGNLPETTLPYFDYYMSML 406
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ A+V Y G + D +WA W G WL ++++ +T D
Sbjct: 407 EDFRTNAKVIYGCRGILAPIAQTTHGLVYTDP---IWATWTAGAGWLSQLFYDYWLFTGD 463
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
DFL+ +A P ++ A F D+L+EG DG PS SPE+ P+ L V+ ++TMD
Sbjct: 464 MDFLKNKAIPFMKEIALFYEDFLVEGEDGKFMFIPSLSPENTPPIPNASL--VTINATMD 521
Query: 544 MAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRL 598
+AI REV + + +A + L EK + + +L LP ++ EDG+I EW+ L
Sbjct: 522 IAIAREVLANLCAACKYLGIEKENVKIWKHMLSKLPEY---QVNEDGAIKEWIHSDL 575
>gi|418137717|ref|ZP_12774555.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11663]
gi|353900672|gb|EHE76223.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11663]
Length = 782
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 283/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
Y F RE F+S PD ++V + + + +L F + L D S +
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E N+D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571
>gi|189207008|ref|XP_001939838.1| hypothetical protein PTRG_09506 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187975931|gb|EDU42557.1| hypothetical protein PTRG_09506 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 742
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 185/606 (30%), Positives = 286/606 (47%), Gaps = 97/606 (16%)
Query: 4 AESTSTTNPLKITFNGPAKH--FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
A S ++ ++ + PA+ +T+A+PIGNGRLGAMV+G E + LNE+T+W+G
Sbjct: 14 ASLASASDNTRLWYKTPAQSSAWTNALPIGNGRLGAMVFGIPLQERIALNEETIWSGGQQ 73
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLK 118
D D+P+ +S+VR L+ G+ +A A++ + G P YQ LGD+++ FD +
Sbjct: 74 DRIGQDSPQTVSEVRDLLAQGRAGDAEKLANMGMMGTPQSCRNYQPLGDMDIFFDGT-TG 132
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y TY+R LD++TA A V++ V + RE F S PD V V + + SG LSF + +
Sbjct: 133 YDNATYKRWLDVDTALAGVQFQVNGTLYEREMFVSAPDDVFVHHLKATGSGKLSFQIRV- 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ GN E G DP I F+ L ++ SD G + L
Sbjct: 192 ----HRPDKGGNEAADHEWNANGLAYMTGGAGGIDP--IVFTTALAVQ-SD--GHVKNL- 241
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ VE + A + AS+S+ D + S +Q R +Y +L RH+
Sbjct: 242 GPFIVVENATEATAIFAASTSY---------RHNDTRAAVESTIQQARQHTYEELRQRHI 292
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFG 357
DY L++ + LS S+ ++P+ R+ + + DP+L L + +G
Sbjct: 293 ADYAPLYNASVLDLS----------GSDLKASSLPTDARINATREGASDPALTALSYNYG 342
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSR G +NLQGIWN++ +P W S VNINL+MNYW + +LS EPLFD
Sbjct: 343 RYLLIASSRAGNLPSNLQGIWNKEFAPQWGSKYTVNINLQMNYWPAEVTSLSSLHEPLFD 402
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
L + +TD EH
Sbjct: 403 LLDLM---------------------RTD-----------------------------EH 412
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKL 533
Y YT D+ FL + + E A F LD L I G YL TNPS SPE+ ++ D
Sbjct: 413 YWYTGDKAFLASKLDVVTEAIA-FYLDILQPYSINGTQ-YLVTNPSVSPENSYLDADNNT 470
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GS 589
+ T D+ I+ E+F+ ++A L + + ++ + +L P + ++ G+
Sbjct: 471 YHFDIAPTCDIEILNELFTNYLNAVATLPNYTVDSTFLTRIRDTQAQLPPYRYSKRYPGT 530
Query: 590 IMEWVQ 595
+ EW+Q
Sbjct: 531 LQEWMQ 536
>gi|418198481|ref|ZP_12834939.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47778]
gi|353861591|gb|EHE41526.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47778]
Length = 782
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 283/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
Y F RE F+S PD ++V + + + +L F + L D S +
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEEGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E N+D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571
>gi|419504391|ref|ZP_14044059.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47760]
gi|379605779|gb|EHZ70529.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47760]
Length = 803
Score = 236 bits (603), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDILVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|417849512|ref|ZP_12495432.1| hypothetical protein HMPREF9957_1083 [Streptococcus mitis SK1080]
gi|339456106|gb|EGP68701.1| hypothetical protein HMPREF9957_1083 [Streptococcus mitis SK1080]
Length = 803
Score = 236 bits (602), Expect = 2e-59, Method: Compositional matrix adjust.
Identities = 181/626 (28%), Positives = 291/626 (46%), Gaps = 87/626 (13%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN 65
T P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 14 TKPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQG 70
Query: 66 ---PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLK 118
D L+++R ++ Y A + + P Y GDI +EF +
Sbjct: 71 GNLQDQYGFLAEIRQALEKRDYNTAKELAEQHLVGPQTSQYGTYLSFGDIFIEFSNQGKT 130
Query: 119 YAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
++ T Y+R+L+++ A A Y +F RE F+S PD ++V + +L F + L
Sbjct: 131 LSQVTDYQRQLNISKALATTSYVYKGTKFEREAFASFPDDLLVQRFIKEGLETLDFTIEL 190
Query: 178 --------DSLLDNHSYVNGNNQ-------IIMEGRCPGKRIPPKANANDDPKGIQFSAI 222
D + Y Q I+M+GR ND +QF++
Sbjct: 191 SLTRDLASDGKYEQEKYDYKECQLNITASHILMKGRV---------KDND----LQFASY 237
Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL 282
L + G I DK +++ G+ +A L L A + F + K D + + +
Sbjct: 238 LTWQTD---GDIRVWSDK-IQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLV 293
Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
+ + Y+ L +RH++DYQ LF V + L ++D + + +K+++
Sbjct: 294 DTAKEKGYAQLKSRHIEDYQALFQSVQLDLG-------------SDVDASTTDDLLKNYK 340
Query: 343 TDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNY
Sbjct: 341 PQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNY 400
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAK 450
W + NL E P+ +++ L + G + A Y +GW++H + W
Sbjct: 401 WPAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTA 459
Query: 451 SSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH 510
D W P AW+ ++E Y++ D+D+L ++ YP+L F +L +
Sbjct: 460 PGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQ 516
Query: 511 DGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
++PS SPEH +S +T D ++I ++F I AA+ L +ED L
Sbjct: 517 QAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELSLDEDLLT 567
Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWVQ 595
E V + L P +I + G I EW +
Sbjct: 568 E-VKEKFDLLNPLQITQSGRIREWYE 592
>gi|419443014|ref|ZP_13983041.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA13224]
gi|379551714|gb|EHZ16808.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA13224]
Length = 612
Score = 236 bits (602), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 178/610 (29%), Positives = 289/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A A L G Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETN---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWSAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|421236745|ref|ZP_15693342.1| fibronectin type III domain protein [Streptococcus pneumoniae
2071004]
gi|395601508|gb|EJG61655.1| fibronectin type III domain protein [Streptococcus pneumoniae
2071004]
Length = 803
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 177/614 (28%), Positives = 288/614 (46%), Gaps = 65/614 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I A+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQVAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWVQ 595
+I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592
>gi|358383778|gb|EHK21440.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
Length = 794
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 173/596 (29%), Positives = 274/596 (45%), Gaps = 71/596 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
T +PIGN RLGA ++GG +E + +NEDT+W G D + AL VR ++ +
Sbjct: 39 TGVLPIGNSRLGAAIFGG-GNEVVTINEDTIWDGPLQDRIPANGLAALPKVRQMLMANNL 97
Query: 85 AEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
+A + PA + G++ L F Y R LD + V Y+
Sbjct: 98 TDAGNLVLSQM-TPASCCERQFSYFGNLNLNFGHGS---GISNYIRSLDTRQGNSSVSYT 153
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDNHSYVNGN-NQIIME 196
V +TRE+ +SNPD VI + + S++G+LS + + ++++L N + +G N + ++
Sbjct: 154 FNGVTYTREYVASNPDGVIAARYTASKAGALSVSATFSRINNILSNVASTSGGVNSVTLQ 213
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G G+ P I F+ + + T SA L +
Sbjct: 214 GTS-GQSTNP----------ILFTG--KARFVASGATFSA-----------SGGTLTITG 249
Query: 257 SSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+++ D F++ + + PT+ +++A L + + + ++ + D L R +I
Sbjct: 250 ATTID-VFVDVETNYRYPTASALAAEVDNKLNAAVSKGFPAVHNSAIADSSALLGRANIN 308
Query: 312 LSRSPK---DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRP 367
L SP D+ TD +RVKS ++ DP L+ L + +GR+LL++SSR
Sbjct: 309 LGTSPNGLADLSTD-------------QRVKSARSAFNDPQLIVLAWNYGRHLLVASSRD 355
Query: 368 GTQV----ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
+ NLQG+WN S W +NIN EMN W + NL E Q PLFD L
Sbjct: 356 TSAAIDMPPNLQGVWNNATSAPWGGKFTININTEMNLWPAGQTNLIETQLPLFDLLKVAQ 415
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G + AQ Y +G V HH D+W + +WPMG WL H+ E Y +T D
Sbjct: 416 PRGQEMAQKLYGCNGTVFHHNLDVWGDPAPTDNYTSSTMWPMGATWLVQHMMEQYRFTGD 475
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC-----VSY 538
+FL AYP L + FL + G T PS SPE+ ++ P G +
Sbjct: 476 LNFLRNTAYPYLLDISKFLQCYTFT-WQGNRVTGPSLSPENTYVVPSGANKAGTQEPMDM 534
Query: 539 SSTMDMAIIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ MD ++R+V ++I+ AA L + D+ V+ LP +R +I G I+EW
Sbjct: 535 APEMDNQLMRDVMTSILEAAAALGISSSDSNVQAATNFLPLIRTPRIGSYGQILEW 590
>gi|307068282|ref|YP_003877248.1| hypothetical protein SPAP_1662 [Streptococcus pneumoniae AP200]
gi|306409819|gb|ADM85246.1| hypothetical protein SPAP_1662 [Streptococcus pneumoniae AP200]
Length = 796
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|148997704|ref|ZP_01825268.1| hypothetical protein CGSSp11BS70_02314 [Streptococcus pneumoniae
SP11-BS70]
gi|168491464|ref|ZP_02715607.1| alpha-fucosidase [Streptococcus pneumoniae CDC0288-04]
gi|168575158|ref|ZP_02721121.1| alpha-fucosidase [Streptococcus pneumoniae MLV-016]
gi|225861483|ref|YP_002742992.1| alpha-fucosidase [Streptococcus pneumoniae Taiwan19F-14]
gi|387788703|ref|YP_006253771.1| hypothetical protein MYY_1579 [Streptococcus pneumoniae ST556]
gi|417313133|ref|ZP_12099845.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA04375]
gi|418142169|ref|ZP_12778982.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13455]
gi|418151161|ref|ZP_12787907.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA14798]
gi|418164950|ref|ZP_12801618.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17371]
gi|418171792|ref|ZP_12808416.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19451]
gi|418194221|ref|ZP_12830710.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47439]
gi|418228162|ref|ZP_12854779.1| fibronectin type III domain protein [Streptococcus pneumoniae
3063-00]
gi|419429855|ref|ZP_13970019.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11856]
gi|419438693|ref|ZP_13978761.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13499]
gi|419449442|ref|ZP_13989438.1| fibronectin type III domain protein [Streptococcus pneumoniae
4075-00]
gi|419471542|ref|ZP_14011401.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA07914]
gi|419502307|ref|ZP_14041991.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47628]
gi|419506539|ref|ZP_14046200.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA49194]
gi|419519366|ref|ZP_14058972.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA08825]
gi|421238983|ref|ZP_15695547.1| fibronectin type III domain protein [Streptococcus pneumoniae
2071247]
gi|421245493|ref|ZP_15701991.1| fibronectin type III domain protein [Streptococcus pneumoniae
2081685]
gi|421292526|ref|ZP_15743260.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA56348]
gi|421312462|ref|ZP_15763064.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58981]
gi|421314530|ref|ZP_15765117.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA47562]
gi|147756203|gb|EDK63245.1| hypothetical protein CGSSp11BS70_02314 [Streptococcus pneumoniae
SP11-BS70]
gi|183574240|gb|EDT94768.1| alpha-fucosidase [Streptococcus pneumoniae CDC0288-04]
gi|183578740|gb|EDT99268.1| alpha-fucosidase [Streptococcus pneumoniae MLV-016]
gi|225727028|gb|ACO22879.1| alpha-fucosidase [Streptococcus pneumoniae Taiwan19F-14]
gi|327389841|gb|EGE88186.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA04375]
gi|353806420|gb|EHD86694.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13455]
gi|353814371|gb|EHD94597.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA14798]
gi|353828782|gb|EHE08918.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17371]
gi|353835529|gb|EHE15623.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19451]
gi|353857799|gb|EHE37761.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47439]
gi|353880557|gb|EHE60372.1| fibronectin type III domain protein [Streptococcus pneumoniae
3063-00]
gi|379138445|gb|AFC95236.1| hypothetical protein MYY_1579 [Streptococcus pneumoniae ST556]
gi|379537100|gb|EHZ02285.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13499]
gi|379546258|gb|EHZ11397.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA07914]
gi|379550033|gb|EHZ15135.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11856]
gi|379600520|gb|EHZ65301.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47628]
gi|379608453|gb|EHZ73199.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA49194]
gi|379622060|gb|EHZ86696.1| fibronectin type III domain protein [Streptococcus pneumoniae
4075-00]
gi|379641203|gb|EIA05741.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA08825]
gi|395600626|gb|EJG60781.1| fibronectin type III domain protein [Streptococcus pneumoniae
2071247]
gi|395608020|gb|EJG68116.1| fibronectin type III domain protein [Streptococcus pneumoniae
2081685]
gi|395891833|gb|EJH02827.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA56348]
gi|395909316|gb|EJH20192.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58981]
gi|395913215|gb|EJH24068.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA47562]
Length = 803
Score = 236 bits (601), Expect = 3e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|418119103|ref|ZP_12756060.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA18523]
gi|419453708|ref|ZP_13993678.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP03]
gi|353791055|gb|EHD71436.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA18523]
gi|379625778|gb|EHZ90404.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP03]
Length = 782
Score = 236 bits (601), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 186 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 229
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 562 QSGRIREWYE 571
>gi|418160358|ref|ZP_12797057.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17227]
gi|353822091|gb|EHE02267.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17227]
Length = 809
Score = 236 bits (601), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|419521584|ref|ZP_14061179.1| hypothetical protein SPAR7_1605 [Streptococcus pneumoniae GA05245]
gi|379538884|gb|EHZ04064.1| hypothetical protein SPAR7_1605 [Streptococcus pneumoniae GA05245]
Length = 803
Score = 235 bits (600), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|417687098|ref|ZP_12336372.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41301]
gi|332073988|gb|EGI84466.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41301]
Length = 782
Score = 235 bits (600), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 186 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 229
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 562 QSGRIREWYE 571
>gi|418085647|ref|ZP_12722826.1| hypothetical protein SPAR90_1552 [Streptococcus pneumoniae GA47281]
gi|418149015|ref|ZP_12785777.1| hypothetical protein SPAR34_1507 [Streptococcus pneumoniae GA13856]
gi|421207087|ref|ZP_15664139.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2090008]
gi|353756356|gb|EHD36957.1| hypothetical protein SPAR90_1552 [Streptococcus pneumoniae GA47281]
gi|353811351|gb|EHD91593.1| hypothetical protein SPAR34_1507 [Streptococcus pneumoniae GA13856]
gi|395574423|gb|EJG35001.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2090008]
Length = 778
Score = 235 bits (600), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|417699038|ref|ZP_12348209.1| hypothetical protein SPAR69_1595 [Streptococcus pneumoniae GA41317]
gi|332199684|gb|EGJ13759.1| hypothetical protein SPAR69_1595 [Streptococcus pneumoniae GA41317]
Length = 757
Score = 235 bits (599), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 186 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 229
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 562 QSGRIREWYE 571
>gi|418167255|ref|ZP_12803910.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17971]
gi|353829247|gb|EHE09381.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17971]
Length = 803
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|340520176|gb|EGR50413.1| glycoside hydrolase family 95 [Trichoderma reesei QM6a]
Length = 794
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 174/594 (29%), Positives = 270/594 (45%), Gaps = 67/594 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
T +PIGN RLG ++GG +E + +NEDTLW G + + AL VR ++ +
Sbjct: 39 TGVLPIGNSRLGGAIFGG-GNEVITINEDTLWDGPLQNRIPANGLAALPKVRQMLLANNL 97
Query: 85 AEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
+A + PA + G++ L F Y R LD + V Y+
Sbjct: 98 TDAGNLVLSQM-MPAVGGERQFSYFGNLNLNFGHGS---GISNYIRSLDTRQGNSSVSYT 153
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDNHSYVNGN-NQIIME 196
V +TRE+ +S P VI + + S++G+LS + + + ++L N + +G N + ++
Sbjct: 154 FNGVTYTREYVASAPVGVIAARFTASKAGALSVSATFSRISNILSNVASTSGGVNSVTLQ 213
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G + P I F+ + + G++SA L +
Sbjct: 214 GTSGQAQNP-----------ILFTG--KARFVPQGGSVSA-----------SGGTLTITG 249
Query: 257 SSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+++ D FI+ + + PT+ +++A + + + + ++ + D L R +I
Sbjct: 250 ATTID-VFIDVETNYRYPTASALAAEVDNKINTAVSQGFQKVHDDAIADSSALLGRANIN 308
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQ 370
L SP I P+ +RVKS ++ DP L+ L + +GR+LL++SSR +
Sbjct: 309 LGTSPNGIANQ----------PTDQRVKSARSAFNDPQLIVLAWNYGRHLLVASSRDTSA 358
Query: 371 V----ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
NLQG+WN S W +NIN EMN W + NL E Q PLFD L G
Sbjct: 359 AIDMPPNLQGVWNNATSAPWGGKFTININTEMNLWPAGQTNLIETQLPLFDLLKVAQPRG 418
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+ AQ Y +G V HH D+W + ++WPMG WL H+ E Y +T D DF
Sbjct: 419 QEMAQKLYGCNGTVFHHNLDVWGDPAPTDNYPSSSMWPMGATWLVQHMMEQYRFTGDLDF 478
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA- 545
L AYP L + FL + G T PS SPE+ + P G MDMA
Sbjct: 479 LRNTAYPYLLDISKFLQCYTFT-WQGNRVTGPSLSPENTYAVPQGA-NVAGQQEPMDMAP 536
Query: 546 -----IIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++R+V SAI+ AA L + DA V+ LP +R +I G I+EW
Sbjct: 537 EMDNQLMRDVMSAIVEAAAALGISSSDANVKAASDFLPLIRTPRIGSYGQILEW 590
>gi|225859410|ref|YP_002740920.1| alpha-fucosidase [Streptococcus pneumoniae 70585]
gi|225721936|gb|ACO17790.1| alpha-fucosidase [Streptococcus pneumoniae 70585]
Length = 803
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 178/610 (29%), Positives = 287/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQSPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVCFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|149011485|ref|ZP_01832732.1| hypothetical protein CGSSp19BS75_08742 [Streptococcus pneumoniae
SP19-BS75]
gi|147764475|gb|EDK71406.1| hypothetical protein CGSSp19BS75_08742 [Streptococcus pneumoniae
SP19-BS75]
Length = 803
Score = 234 bits (598), Expect = 8e-59, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|257070006|ref|YP_003156261.1| hypothetical protein Bfae_29100 [Brachybacterium faecium DSM 4810]
gi|256560824|gb|ACU86671.1| hypothetical protein Bfae_29100 [Brachybacterium faecium DSM 4810]
Length = 762
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 179/596 (30%), Positives = 267/596 (44%), Gaps = 61/596 (10%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
T P + +GPA+ + +A+P+GNGRLGAM WG LNE TLW+G PG
Sbjct: 14 VTPPPALLRHGPAERWLEALPLGNGRLGAMAWGDPGRARFSLNESTLWSGAPGVDLPHRT 73
Query: 69 PK-----ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P+ AL R+L SG EA +L + Y +GD+ + D +
Sbjct: 74 PRAEAAAALERSRALFTSGAVQEAQEEIERLGASWSQAYLPVGDLTVRLDGDAGPEGGDG 133
Query: 124 YRRELDLNTATARVKYSVGNVEFTREH--FSSNPDQVIVTKISGSESGS--LSFNVSLDS 179
RRELDL RV + G EH F S D+V+V + E L + L
Sbjct: 134 -RRELDLQHGEHRVLAADG------EHLSFVSAADEVLVHCLPCPEGARAVLELDSPLVE 186
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+G+ + + R P +D P G QF +I + + +A+
Sbjct: 187 EQREEQPADGDAALTIVLRAP----------SDVPGG-QFRQQEQIAWESEGASRAAVVV 235
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ + G V +V +++ G P + + E+ + ++ +L+ RH D
Sbjct: 236 RTRREAGRLLVVCAIV--TTWQGLGRTPDRAVAEAVQEATAQAETALARGAEELHRRHRD 293
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
+ V +QL+ S + + TC F +GRY
Sbjct: 294 RPRPGADAVGLQLTGSEEAELLATC-----------------------------FAYGRY 324
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LL S+SRPG ANLQG+WN L W S VNINLEMN+W + + E L ++
Sbjct: 325 LLASASRPGLPPANLQGLWNAKLEAPWSSNYTVNINLEMNHWGAAIAQVPEAAGALEQYV 384
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
L G TA+ Y A GW +HH +D W + RG+ WA WPMGG WL L + +
Sbjct: 385 EMLREQGRDTARRLYGADGWTVHHNSDPWGYTDPVRGEPSWATWPMGGLWL-EQLLDTFA 443
Query: 480 YTMDRDFLE--KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
D E + +P L +F L L E DG+L T PSTSPE+ + DG + C+S
Sbjct: 444 ACSGSDPAEVARDRFPALREAVAFALGLLHESADGHLATFPSTSPENRWRTADGTVVCLS 503
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ MD ++RE ++ AA VL + +D +V++ +L + ++ DG I+EW
Sbjct: 504 EGTGMDRWLLRETAQHLVEAAAVLGREDDPVVQQAASALDLVPGPRVGADGRILEW 559
>gi|419495837|ref|ZP_14035554.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47461]
gi|421302877|ref|ZP_15753541.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA17484]
gi|379593923|gb|EHZ58734.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47461]
gi|395901499|gb|EJH12435.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA17484]
Length = 803
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 178/610 (29%), Positives = 287/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|358399331|gb|EHK48674.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
206040]
Length = 797
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 168/588 (28%), Positives = 266/588 (45%), Gaps = 52/588 (8%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
T +PIGN RLGA ++GG +E + +NEDTLW G + + AL VR ++++
Sbjct: 39 TGVLPIGNSRLGAAIFGGA-NEVVTINEDTLWDGPLQNRIPANGLAALPKVRQMLEANSL 97
Query: 85 AEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV 141
A + P + G++ L F H Y R LD + V Y+
Sbjct: 98 TAAGNLVLSQMTPPISGERQFSYFGNLNLNF--GHSSGGISNYIRSLDTRQGNSSVSYTY 155
Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDN-HSYVNGNNQIIMEG 197
V +TRE+ +S P VI + + S++G+LS + + + ++L N S G N + ++G
Sbjct: 156 NGVTYTREYVASTPAGVIAARFTASKAGALSVSATFSRISNILSNVASTSGGANTLTLQG 215
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
A+D+P I F+ + S G + L + G+ + +
Sbjct: 216 SS-------GQAASDNP--ILFTGTAQFVAS---GATFSTSGGTLTISGATTIDVFIDVE 263
Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
+S+ P S D ++ S L + + + ++ + D L R +I L SP
Sbjct: 264 TSYRYP------SASDLAAQVNSKLSAAVSQGFQKIHDGAIADASALLGRANINLGTSPN 317
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVA---- 372
+ + + + +RVK+ ++ DP L L + +GR+LL++SSR T A
Sbjct: 318 GLAS----------LSTDQRVKNARSSFNDPQLAVLAWNYGRHLLVASSR-NTSAAIDMP 366
Query: 373 -NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
NLQG+WN S W +NIN EMN W + NL E Q PLFD + G + AQ
Sbjct: 367 PNLQGVWNNQTSAPWGGKFTININTEMNLWPAGQTNLIETQLPLFDLMKVAQPRGQQMAQ 426
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
Y +G V HH D+W + +WPMG WL H+ E Y + D + L
Sbjct: 427 DLYGCNGTVFHHNLDVWGDPAPTDNYTSSTMWPMGATWLVQHMIEQYRFGGDLNLLRSAT 486
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP-----DGKLACVSYSSTMDMAI 546
YP L + FL + G L T PS SPE+ ++ P G+ + + MD +
Sbjct: 487 YPYLLDISKFLQCYTFS-WQGNLVTGPSLSPENTYVVPSNATVSGQQEPMDLAPEMDNQL 545
Query: 547 IREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+R+V II AA L + D+ V+ +P++R +I G I+EW
Sbjct: 546 MRDVMKGIIEAAAALGISSSDSNVQAATNFIPQIRTPRIGSYGQILEW 593
>gi|429764051|ref|ZP_19296381.1| f5/8 type C domain protein [Clostridium celatum DSM 1785]
gi|429188824|gb|EKY29689.1| f5/8 type C domain protein [Clostridium celatum DSM 1785]
Length = 1566
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 174/618 (28%), Positives = 294/618 (47%), Gaps = 85/618 (13%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA------------------P 69
+P+GNG LG+ V+GGV E + N+ TLWTG P NPD
Sbjct: 49 LPLGNGNLGSSVFGGVEKERIHFNDKTLWTGGP---DNPDGTMNDGTQYQGGNRLFEFNE 105
Query: 70 KALSDVRSLVDSGQY---AEATAASVKLFGHPADV--YQLLGDIELEFDD--SHLKYAEE 122
+ +++ S DS T S LF + ++ +Q GDI L+F + S+ K +
Sbjct: 106 EGYNNLISKFDSNDPLVPTGNTGVSSTLFSNRPNLGSWQDFGDIYLDFSEMGSNSKNVD- 164
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---- 178
Y R LD+ A + V Y + REHF S PD V+VT++S G L F+V L
Sbjct: 165 NYERSLDIKNAISEVIYDYNETTYLREHFVSYPDNVLVTRLSKDGDGKLDFDVELKKSSA 224
Query: 179 -SLLDNHSYVNGNNQII-MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
S D + ++ NN I + G G ++ ++SA L++ + T+
Sbjct: 225 LSSNDATTSIDDNNTTIKLIGTLNGNKM-------------KYSASLKVIVDGKESTVEP 271
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ +KV +D VL+ + + P ++ ++ T+ + Y+ L
Sbjct: 272 NGNSTIKVRNADEVVLIFSTGTDYKNIYPGYRTGETSEEVTNRVNKVINDAAKKGYNTLL 331
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DY++LF RVS+ L+ ++ TD E + + S +L L+F
Sbjct: 332 ENHVSDYKELFDRVSLDLNEIAPNVPTDELIENYRNGIYS------------KALEALVF 379
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYL I+SSR G+ +NL G+W+ SP W H N+N++MNYW + NL+EC +
Sbjct: 380 QYGRYLTIASSREGSLPSNLAGLWSIG-SPLWSGDYHFNVNVQMNYWPAFSTNLAECGKV 438
Query: 415 LFDFLTYLSINGSKTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWA 461
D+++ L I G K+A+++ A +G++IH + + K+ + G+ +
Sbjct: 439 FADYMSSLVIPGRKSAEMSIGAKTDDFETTPIGEGNGFMIHTANNPFGKTCPN-GEEYYG 497
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
P G W + +++Y +T D+++LE YP+++ A+ + LIE ++ ST
Sbjct: 498 WNPNGATWALQNAFDYYEFTKDKEYLESTIYPMVKEVANMWTNSLIESK---VQKIGSTE 554
Query: 522 PEHEFIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PR 578
+ +AP + ++ +T D +++ E+F I AA +LEK+ D + K+ + +
Sbjct: 555 EQRLVVAPSTSAEQGPMTVGTTYDQSLVWEIFEKAIKAANILEKDSDEI--KIWTEMQSK 612
Query: 579 LRPTKIAEDGSIMEWVQR 596
L P I E G I EW Q
Sbjct: 613 LDPVIIGEGGQIKEWYQE 630
>gi|148993776|ref|ZP_01823203.1| hypothetical protein CGSSp9BS68_02603 [Streptococcus pneumoniae
SP9-BS68]
gi|168488632|ref|ZP_02712831.1| alpha-fucosidase [Streptococcus pneumoniae SP195]
gi|418234852|ref|ZP_12861428.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA08780]
gi|421220741|ref|ZP_15677580.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070425]
gi|421222994|ref|ZP_15679776.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070531]
gi|421279430|ref|ZP_15730236.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17301]
gi|421294642|ref|ZP_15745363.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA56113]
gi|147927732|gb|EDK78756.1| hypothetical protein CGSSp9BS68_02603 [Streptococcus pneumoniae
SP9-BS68]
gi|183572723|gb|EDT93251.1| alpha-fucosidase [Streptococcus pneumoniae SP195]
gi|353886474|gb|EHE66256.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA08780]
gi|395586651|gb|EJG47018.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070425]
gi|395586974|gb|EJG47336.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070531]
gi|395878923|gb|EJG89985.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17301]
gi|395893211|gb|EJH04198.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA56113]
Length = 803
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 178/610 (29%), Positives = 287/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRYCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|336430063|ref|ZP_08610019.1| hypothetical protein HMPREF0994_06025 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336001234|gb|EGN31379.1| hypothetical protein HMPREF0994_06025 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 782
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 168/601 (27%), Positives = 277/601 (46%), Gaps = 54/601 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALS 73
+ + PA ++ +A+P+GNGRLGAM +GG ETL+L+E T W+G + N D+ + L+
Sbjct: 5 LMYKQPAGNWKEALPLGNGRLGAMDFGGAWRETLQLDESTYWSGEASEENNRADSRELLA 64
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLL------------GDIELEFDDSHLKYAE 121
+R + Y A G+ + L G E E++++
Sbjct: 65 QIREALLEEDYERADELGHGFVGNKNNYGTNLPVGNFYIDCFPEGRPEKEWEEAAGADTV 124
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+ R L L A + V + G + RE F SNP Q V + + + + +
Sbjct: 125 TDFVRRLYLEEARSEVSFKAGGSTYRREVFLSNPAQTAVIHMDTRPGKPFALRIRFEGIA 184
Query: 182 DNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
Q ++ G+ + +D G+ + I++ D L++
Sbjct: 185 SRVGITEERQQDYLIRGQA------RETLHSDGFTGVNLAG--RIRVVTD--GYHHLKES 234
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE + A LL+ + P DP + L+ Y L H+ D
Sbjct: 235 GIWVENATRATLLVDLETDMFQP---------DPEETAGRRLEEAWQKGYEQLRQEHIQD 285
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRY 359
L++R+ I L E++ +P+ ER+ K + EDP L LLFQ+GRY
Sbjct: 286 VSALYNRMDISL------------GAEDMRELPTDERLRKQTEGKEDPGLAALLFQYGRY 333
Query: 360 LLISSSRPGTQV-ANLQGIWNEDLSPTWDSAP--HVNINLEMNYWQSLPCNLSECQEPLF 416
LLISSSR + + ++ GIWN+++ D HV++NL+M YW + C L EC +P F
Sbjct: 334 LLISSSREDSPLPTHMGGIWNDNIYNNIDCTQDMHVDMNLQMQYWLAALCALPECYQPAF 393
Query: 417 DFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
++ + + +G KTA Y A GW H T+ W +S W +W +GG W +W
Sbjct: 394 AYMRDILVPSGEKTAAGVYGARGWTAHVVTNPWGFTSLG-WSYNWGVWSLGGVWCAALIW 452
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLA 534
++Y +T D+DFL + +P+L+G A F D++ + G+ T PS SPE+ F + +GK
Sbjct: 453 DYYEFTGDKDFL-REWWPVLKGAAEFAADYVFPDEKSGFYMTGPSYSPENMF-SVEGKEY 510
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
+S S+ D ++RE+ I + L D+ +EK ++ L P +I G + EW
Sbjct: 511 FLSLSTACDCILVREILDIIAKGYQELSLERDSFLEKCVEIRENLPPYRIGSRGQLQEWF 570
Query: 595 Q 595
Sbjct: 571 H 571
>gi|417679619|ref|ZP_12329015.1| hypothetical protein SPAR50_1655 [Streptococcus pneumoniae GA17570]
gi|332072484|gb|EGI82967.1| hypothetical protein SPAR50_1655 [Streptococcus pneumoniae GA17570]
Length = 778
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 178/610 (29%), Positives = 287/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRYCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|334337751|ref|YP_004542903.1| alpha-L-fucosidase [Isoptericola variabilis 225]
gi|334108119|gb|AEG45009.1| Alpha-L-fucosidase [Isoptericola variabilis 225]
Length = 879
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 190/642 (29%), Positives = 278/642 (43%), Gaps = 90/642 (14%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD----YTNPDAPK 70
+ ++ PA + +A+P+GNG AM G E L LN+ T W+G P D T P+
Sbjct: 49 LRYDRPASKWIEALPVGNGHRAAMCAGRPARERLWLNDVTAWSGPPPDDPLAGTRARGPE 108
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF--DDSHLKYAEETYR-RE 127
L VR VD G A L Y L ++E+ + + + T+ R
Sbjct: 109 HLDRVRRAVDEGDVRTAERLLQDLQTPWVQAYLPLAELEVSVVPGEGNGPTDDVTFAGRH 168
Query: 128 LDLNTATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TA A + S G +E ++ V+V + + V + SLL
Sbjct: 169 LDLRTAVATHAWTSPGTGRVVQETWADARGGVLVHVVRAERP--VRAEVRVSSLLRRADE 226
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPK--GIQFSAILEIKISDDRG------------ 232
V P A+ P G + A+L++ + G
Sbjct: 227 VR-----------------PDADRGAGPADGGARLHAVLDLPVDVAPGHEPVDDPVRYAP 269
Query: 233 -------TISALEDKKLKVE------GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM 279
++AL D + VE + +L VA+++ D P P+D +M
Sbjct: 270 DGRQGVVAVAALGDPEAVVEQDVLRTATARCHVLAVATATTDPPGDVPADRSAASRVAAM 329
Query: 280 -----------SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
A R +L H+ +++L+ R + L P+ +
Sbjct: 330 LREAGSVAVPGPAGDGARTALARELRAAHVAAHRRLYDRCRLVLPTPPEAL--------- 380
Query: 329 IDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
+P+ RV + Q DP L L F GRYLL +SSR G A LQGIWN +L W S
Sbjct: 381 --GLPTDVRVAAAQHRPDPGLAALAFHHGRYLLAASSRDGGLPATLQGIWNAELPGPWSS 438
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN-GSKTAQVNYLASGWVIHHKTDI 447
A +NIN +M YW + L+EC EPL + ++ G A+ Y GW HH +D
Sbjct: 439 AYTLNINTQMAYWPAEVTGLAECHEPLLRLVARIAAGPGGVVARELYGTDGWTAHHNSDA 498
Query: 448 WAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD---FLEKRAYPLLEGCASF 501
WA ++ A G WA W MGG WL HL EH+ + D D FL A+P+LEG A F
Sbjct: 499 WAHAAPVGAGHGDASWAAWAMGGLWLAQHLVEHHRFAADTDGDAFLRDVAWPVLEGAARF 558
Query: 502 LLDWLIEGHDG------YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
L W+ D T+PSTSPE+ F A DG A V+ S TMD+A++R + A
Sbjct: 559 ALGWVRTETDADSGRVVRAWTSPSTSPENRFTADDGAPAAVTTSVTMDVALVRWLAEACR 618
Query: 556 SAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
AAEVL + DA V+++++ L + G ++EW + R
Sbjct: 619 EAAEVLGRR-DAWVDRLVEVAAALPHPRAGARGELLEWDRER 659
>gi|418076872|ref|ZP_12714105.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47502]
gi|353747012|gb|EHD27670.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47502]
Length = 803
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 283/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y +F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTQFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|260588898|ref|ZP_05854811.1| putative fibronectin type III domain protein [Blautia hansenii DSM
20583]
gi|260540677|gb|EEX21246.1| putative fibronectin type III domain protein [Blautia hansenii DSM
20583]
Length = 744
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 163/585 (27%), Positives = 270/585 (46%), Gaps = 62/585 (10%)
Query: 21 AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVD 80
AK + +P+GNG+ GA++ GGV E + LNE++LW G + + L VR L++
Sbjct: 11 AKSWEQGLPVGNGQQGAVLLGGVQQERIVLNEESLWYGGKRERAVEAGKEKLEKVRELLE 70
Query: 81 SGQYAEATAASVKLF-GHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
G+ ++A + F G+P + Y + L F+ K E Y R +DL A V
Sbjct: 71 KGEASKAQTLCSRWFVGNPRYTNPYHPAAEAVLNFEPFG-KVKE--YFRGIDLEKGEAGV 127
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
K N + RE FSS QV ++ + +SF++ L+
Sbjct: 128 KICFDNCKTVREIFSSVKYQVTALRMETDKEQGMSFSLGLN------------------- 168
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
R P + NA + + I + + D D ++ VEG LLV
Sbjct: 169 -----RRPFEENAEVEDREISLNGHSGDGVCYDVRCRVGKTDGRVCVEGG----YLLVER 219
Query: 258 SSFDGPF--INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
+S+ F + K+ + L++ + + ++ H+++Y +L++ + +++ +
Sbjct: 220 ASYVEIFFCVRTDYESKECLDKCSRLLKAAAKVGFEEIKKAHIEEYGRLYNNMRLEIEGA 279
Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS----LVELLFQFGRYLLISSSRPGTQV 371
E + +P+ E +K E+P L+ L+F + RYLLISSS
Sbjct: 280 -----------EELAQIPADELLKRC---EEPKVQGYLIWLMFSYARYLLISSSYGCALP 325
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN +P W+S +NINL+MNYW + L C E F+ + + NG KTA+
Sbjct: 326 ANLQGIWNGSFTPPWESGYTININLQMNYWMADRAGLGVCYESFFNLIEKMLPNGRKTAK 385
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
Y G+V HH T++W + + LWPMGGAW+ L+ H + + + +R
Sbjct: 386 KVYACRGFVAHHNTNLWGDTDITGLWLPAFLWPMGGAWMANQLYHHSEFEENPKEIRERV 445
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+++ C F D+L D + P+ SPE+ + DG+ A V+ MD IIRE+
Sbjct: 446 LPVMKECILFFYDYLYRKSDKMWISGPTVSPENTYRLLDGQEASVAMGVAMDHQIIRELA 505
Query: 552 SAIISAAEVL-----EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ E + + +++L+ LP PTKI + G I+
Sbjct: 506 ENYLEGCRRYNTGSPEYETEKMAQEILEHLP---PTKIGKSGRIL 547
>gi|418096757|ref|ZP_12733868.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16531]
gi|353768478|gb|EHD49002.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16531]
Length = 782
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 176/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571
>gi|167749996|ref|ZP_02422123.1| hypothetical protein EUBSIR_00964 [Eubacterium siraeum DSM 15702]
gi|167657017|gb|EDS01147.1| hypothetical protein EUBSIR_00964 [Eubacterium siraeum DSM 15702]
Length = 796
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 176/612 (28%), Positives = 290/612 (47%), Gaps = 76/612 (12%)
Query: 14 KITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD--- 67
KI F P K PIGNG +GA +GG+ E + LNE TLW G P + + PD
Sbjct: 24 KILFTAPGEFDKWEQQCQPIGNGYMGASFFGGISKEKIVLNEKTLWAGGPSE-SRPDYNS 82
Query: 68 -----APKALSDVRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYA 120
+ + + V+ L+ G+Y EA A L G YQLL D+ L F + A
Sbjct: 83 GIIEGSYEYVKQVQQLLYDGKYDEAVALLPHLTGATDGFGAYQLLCDMMLTFSNIDETQA 142
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+ Y R LDL+ + +++ RE F++ P VI K+S + + +SLD+L
Sbjct: 143 TD-YTRTLDLDNSIFTTQFTYQGAVHKREAFANYPSNVICLKLSSDKPRRICVKLSLDNL 201
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
NG+ + EG G+++ I K+ + G + +D
Sbjct: 202 QCGSVTANGDT-LTYEGALW-------------DNGLRYCTIF--KVVNKGGELIDAKDS 245
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPS-DSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ VE +D + L AS+ + + P+ + +P++ +++ + + LY HL
Sbjct: 246 -IMVEHADEVYIYLTASTDYSNKY--PTFRTGVNPSAAVNQRIENAVSKGFDALYEEHLA 302
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE----LLFQ 355
DY+ LF RV+++++ DI+ P + + ++ + S+ L FQ
Sbjct: 303 DYKALFDRVTLKINEDTDDII------------PCDKLISEYKENGSRSIANRLETLYFQ 350
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRY+LISSSR G+ ANLQG+WNE P W H+N+NL+MNYW + NLSE PL
Sbjct: 351 FGRYMLISSSRAGSLPANLQGVWNESNCPPWCCDYHINVNLQMNYWGAYNTNLSETVPPL 410
Query: 416 FDFLTYLSINGSKTAQVNYL--------ASGWVIHHKTDI--WAKSSADRGKVVWALWPM 465
DFL + +G K+A+ Y +GW H ++ W D W
Sbjct: 411 VDFLDSMRPSGRKSAEAYYGIKSDEEHPENGWCAHTQSTPFGWTAPGWD---FYWGWSTA 467
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH 524
AWL +++EH+ +T D+++ + YP++ F WLI + L ++P+ SPEH
Sbjct: 468 AVAWLMQNIYEHFEFTGDKEYFAEHIYPIMRESVRFYTQWLIYDDKQKRLVSSPTYSPEH 527
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
V+ +T + ++I ++++ I+A+E L +E+ L V + +L+P I
Sbjct: 528 ---------GPVTIGNTYEQSLIEQLYNDFITASEALGTDEE-LRNIVKNQVVQLKPFSI 577
Query: 585 AED-GSIMEWVQ 595
++ G + EW +
Sbjct: 578 SKKTGLLKEWFE 589
>gi|418976823|ref|ZP_13524668.1| hypothetical protein HMPREF1048_1234 [Streptococcus mitis SK575]
gi|383350822|gb|EID28673.1| hypothetical protein HMPREF1048_1234 [Streptococcus mitis SK575]
Length = 803
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 176/613 (28%), Positives = 286/613 (46%), Gaps = 90/613 (14%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LG ++G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGVKIFGLIGAERIQFNEKSLWSGGPQPDSSDYQGGNLQDQYSFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF + ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNTAKELAEQHLVGPKTSQYGTYLSFGDIHIEFSNQGKTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y +F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTKFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLSRDLSSDGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND +QF++ L + G I
Sbjct: 207 SDYKECQLDISDSYILMKGRV---------KDND----LQFASCLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK---DPTSESMSALQSIRNLSYSDLYT 295
DK +++ G+ +A L L A + F NP+ + + D + +++ + Y L +
Sbjct: 251 DK-VQISGASYANLFLAAKTDFAQ---NPASNYRKELDLERQVKDLVETAKEKGYDQLKS 306
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
RH+ DYQ LF RV + L +D + + +K+++ E +L EL FQ
Sbjct: 307 RHIQDYQALFQRVQLDLG-------------AEVDASNTDDLLKNYKPQEGQALEELFFQ 353
Query: 356 FGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNYW + NL E
Sbjct: 354 YGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAF 413
Query: 414 PLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALW 463
P+ +++ L + G + A Y +GW++H + W D W
Sbjct: 414 PVINYIDDLRVYG-RLAAARYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWS 469
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSP 522
P AW+ ++E Y + D+D+L ++ YP+L F D+L E ++PS SP
Sbjct: 470 PAANAWMMQTVYEGYTFYRDKDYLREKIYPMLRETVRFWNDFLHEDRQAQRWVSSPSYSP 529
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
EH +S +T D ++I ++F I AA+ L +E L E V + L P
Sbjct: 530 EH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDESLLTE-VKEKFDLLNPL 579
Query: 583 KIAEDGSIMEWVQ 595
+I + G I EW +
Sbjct: 580 QITQSGRIREWYE 592
>gi|149006721|ref|ZP_01830407.1| hypothetical protein CGSSp18BS74_05667 [Streptococcus pneumoniae
SP18-BS74]
gi|147761636|gb|EDK68600.1| hypothetical protein CGSSp18BS74_05667 [Streptococcus pneumoniae
SP18-BS74]
Length = 803
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 176/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|418162701|ref|ZP_12799382.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17328]
gi|353826763|gb|EHE06920.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17328]
Length = 782
Score = 232 bits (592), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571
>gi|168483476|ref|ZP_02708428.1| alpha-fucosidase [Streptococcus pneumoniae CDC1873-00]
gi|418169754|ref|ZP_12806395.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19077]
gi|418219383|ref|ZP_12846048.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP127]
gi|418221685|ref|ZP_12848338.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47751]
gi|418239181|ref|ZP_12865732.1| fibronectin type III domain protein [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|419460465|ref|ZP_14000393.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02270]
gi|419462818|ref|ZP_14002721.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02714]
gi|419489320|ref|ZP_14029069.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44386]
gi|419526372|ref|ZP_14065930.1| hypothetical protein SPAR35_1634 [Streptococcus pneumoniae GA14373]
gi|421273311|ref|ZP_15724151.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR55]
gi|172043064|gb|EDT51110.1| alpha-fucosidase [Streptococcus pneumoniae CDC1873-00]
gi|353833733|gb|EHE13841.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19077]
gi|353873743|gb|EHE53602.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP127]
gi|353874995|gb|EHE54849.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47751]
gi|353892172|gb|EHE71921.1| fibronectin type III domain protein [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|379530250|gb|EHY95490.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02714]
gi|379530601|gb|EHY95840.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02270]
gi|379557012|gb|EHZ22059.1| hypothetical protein SPAR35_1634 [Streptococcus pneumoniae GA14373]
gi|379586862|gb|EHZ51712.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44386]
gi|395873742|gb|EJG84832.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR55]
Length = 803
Score = 232 bits (591), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|418230426|ref|ZP_12857025.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP01]
gi|419478291|ref|ZP_14018115.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA18068]
gi|421271063|ref|ZP_15721917.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR48]
gi|353885307|gb|EHE65096.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP01]
gi|379565727|gb|EHZ30719.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA18068]
gi|395867277|gb|EJG78401.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR48]
Length = 803
Score = 232 bits (591), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 283/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF+ ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFNQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDYP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL++NYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQLNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|417696816|ref|ZP_12345994.1| hypothetical protein SPAR93_1691 [Streptococcus pneumoniae GA47368]
gi|418176443|ref|ZP_12813034.1| hypothetical protein SPAR71_1678 [Streptococcus pneumoniae GA41437]
gi|421232359|ref|ZP_15689000.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2080076]
gi|332200214|gb|EGJ14287.1| hypothetical protein SPAR93_1691 [Streptococcus pneumoniae GA47368]
gi|353840514|gb|EHE20578.1| hypothetical protein SPAR71_1678 [Streptococcus pneumoniae GA41437]
gi|395594862|gb|EJG55097.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2080076]
Length = 778
Score = 232 bits (591), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|421211527|ref|ZP_15668509.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070035]
gi|395572635|gb|EJG33230.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070035]
Length = 803
Score = 232 bits (591), Expect = 5e-58, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDAFTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|419480494|ref|ZP_14020298.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19101]
gi|419500201|ref|ZP_14039895.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47597]
gi|379569663|gb|EHZ34630.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19101]
gi|379599509|gb|EHZ64292.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47597]
gi|429316503|emb|CCP36209.1| conserved hypothetical protein [Streptococcus pneumoniae SPN034156]
Length = 803
Score = 231 bits (590), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +E+ L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDENLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|418146897|ref|ZP_12783675.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13637]
gi|353812472|gb|EHD92707.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13637]
Length = 782
Score = 231 bits (590), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDYP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFYDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571
>gi|418130799|ref|ZP_12767682.1| hypothetical protein SPAR14_1598 [Streptococcus pneumoniae GA07643]
gi|418187633|ref|ZP_12824156.1| hypothetical protein SPAR92_1607 [Streptococcus pneumoniae GA47360]
gi|353802123|gb|EHD82423.1| hypothetical protein SPAR14_1598 [Streptococcus pneumoniae GA07643]
gi|353849618|gb|EHE29623.1| hypothetical protein SPAR92_1607 [Streptococcus pneumoniae GA47360]
Length = 778
Score = 231 bits (590), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 288/610 (47%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF+ ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFNQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L + Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL++NYW + NL E P+
Sbjct: 357 YLLISSSRDYPDALPANLQGVWNAVDNPPWNSDYHLNVNLQLNYWPAYVTNLLETVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|225855085|ref|YP_002736597.1| alpha-fucosidase [Streptococcus pneumoniae JJA]
gi|225723201|gb|ACO19054.1| alpha-fucosidase [Streptococcus pneumoniae JJA]
Length = 803
Score = 231 bits (590), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 179/625 (28%), Positives = 288/625 (46%), Gaps = 87/625 (13%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYGFLAEIRQALEKRDYNTAKELAEEHLVGPKTSQYGTYLSFGDIFIEFSQQGTIL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL- 177
++ T Y+R+L+++ A A Y+ F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALATTSYAYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIKLF 191
Query: 178 --DSLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M GR ND ++F+ L
Sbjct: 192 LTRDLASDGKYDQEKSDYKECQLDITDSHILMNGRV---------KDND----LRFAGCL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ G I DK +++ G+ +A L L A + F + K D + ++
Sbjct: 239 AWQTD---GDIRVWSDK-VQISGASYANLFLAAKTDFAQNPDSNYRKKIDLEKQVKDLVE 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ Y+ L +RH+ DYQ LF RV + L E ++DT + + +K+++
Sbjct: 295 IAKEKGYAQLKSRHIQDYQALFQRVQLDL-------------EADVDTFTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNYW
Sbjct: 342 QAGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
+ NL E P+ +++ L + G + A Y +GW++H + W
Sbjct: 402 PAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSREGEENGWLVHTQATPFGWTAP 460
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
D W P AW+ ++E Y++ D+D+L ++ YP+L F +L E
Sbjct: 461 GWD---YYWGWSPATNAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWTGFLHEDQQ 517
Query: 512 GY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
++PS SPEH +S +T D ++I ++F I A + L + D L E
Sbjct: 518 AQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFYDFIQATQELGLDGDLLTE 568
Query: 571 KVLKSLPRLRPTKIAEDGSIMEWVQ 595
V + L P +I + G I EW +
Sbjct: 569 -VKEKFDLLNPLQITQSGRIREWYE 592
>gi|405760473|ref|YP_006701069.1| hypothetical protein SPNA45_00586 [Streptococcus pneumoniae SPNA45]
gi|404277362|emb|CCM07874.1| conserved hypothetical protein [Streptococcus pneumoniae SPNA45]
Length = 803
Score = 231 bits (589), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 281/599 (46%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKMSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y +F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTQFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D ++F++ L K G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD--TDLRFASYLAWKTD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKDYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYL--------ASGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAEIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I A+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQVAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|418202865|ref|ZP_12839294.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA52306]
gi|419456006|ref|ZP_13995963.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP04]
gi|421285997|ref|ZP_15736773.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60190]
gi|421307849|ref|ZP_15758491.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60132]
gi|353867422|gb|EHE47317.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA52306]
gi|379627982|gb|EHZ92588.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP04]
gi|395885984|gb|EJG97005.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60190]
gi|395907234|gb|EJH18128.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60132]
Length = 803
Score = 231 bits (589), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|419491555|ref|ZP_14031293.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47179]
gi|379592917|gb|EHZ57732.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47179]
Length = 803
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|149021254|ref|ZP_01835500.1| hypothetical protein CGSSp23BS72_00470 [Streptococcus pneumoniae
SP23-BS72]
gi|147930355|gb|EDK81339.1| hypothetical protein CGSSp23BS72_00470 [Streptococcus pneumoniae
SP23-BS72]
Length = 803
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 283/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A ++F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTNFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|194398489|ref|YP_002038269.1| hypothetical protein SPG_1564 [Streptococcus pneumoniae G54]
gi|418121711|ref|ZP_12758654.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44194]
gi|419532855|ref|ZP_14072370.1| hypothetical protein SPAR107_1539 [Streptococcus pneumoniae
GA47794]
gi|421275369|ref|ZP_15726198.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA52612]
gi|194358156|gb|ACF56604.1| conserved hypothetical protein [Streptococcus pneumoniae G54]
gi|353792547|gb|EHD72919.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44194]
gi|379605375|gb|EHZ70126.1| hypothetical protein SPAR107_1539 [Streptococcus pneumoniae
GA47794]
gi|395873333|gb|EJG84425.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA52612]
Length = 803
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|421241117|ref|ZP_15697662.1| fibronectin type III domain protein [Streptococcus pneumoniae
2080913]
gi|395607495|gb|EJG67592.1| fibronectin type III domain protein [Streptococcus pneumoniae
2080913]
Length = 782
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571
>gi|116191887|ref|XP_001221756.1| hypothetical protein CHGG_05661 [Chaetomium globosum CBS 148.51]
gi|88181574|gb|EAQ89042.1| hypothetical protein CHGG_05661 [Chaetomium globosum CBS 148.51]
Length = 537
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 157/524 (29%), Positives = 255/524 (48%), Gaps = 45/524 (8%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
F A+PIGNGRLGA V+G P+E L LNE+++W+G D N + A+ +R ++ +G
Sbjct: 36 FKSALPIGNGRLGAAVFG-TPTEKLVLNENSVWSGGFLDRANSRSKDAVPKIRQMLIAGD 94
Query: 84 YAEATAASV-KLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
A +++ + +P + + D H +Y R LD TA V Y G
Sbjct: 95 ITGAGQSAMDNMAANPTSPRAYNPLVNMGIDLGH-GSGIGSYTRWLDTLEGTAGVNYLQG 153
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD------SLLDNHSYVNGNNQIIME 196
++RE+ +S P V+ +++ S G L+ +SL S G +++ +
Sbjct: 154 GTNYSREYVASYPHGVLAIRLTASAPGKLNAKISLSRSKWVTSQTAKTDSGTGGHKVTLS 213
Query: 197 GRCPGKRIPPKANAN-------DDPKGI-QFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G + + A PKG+ +S + + I GT ++ + K + + G++
Sbjct: 214 GNSGSDALAFWSEARVVNSGGVYHPKGLLPYSMVADSHI----GTATSPDGKSISISGAN 269
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ A +S+ + + + ++ +E L + Y + + ++D+ L RV
Sbjct: 270 TVDIFFDAETSYR--YADATAAQ----AELKQKLDAATAAGYPAVRSAAIEDFSSLMSRV 323
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR 366
+ L S + P R+++F+ + DP L+ L+F FGR+LL SSSR
Sbjct: 324 KLDLG-----------SSGDAGRQPVTTRLQNFKNNPNADPQLMTLMFNFGRHLLASSSR 372
Query: 367 ---PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
P + ANLQG+WN+D P W S +NINLEMNYW +L NL+E Q+P+FD L
Sbjct: 373 DTGPRSLPANLQGLWNQDYDPAWQSKYTININLEMNYWPALVTNLAETQKPVFDLLNEAI 432
Query: 424 INGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G A+ Y + G+V+HH TD+W ++ + +WPMG AWL EHY +T
Sbjct: 433 PRGKAVAKTMYGCNDGFVLHHNTDLWGDAAPVDKGTPYTIWPMGAAWLSADAMEHYRFTQ 492
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
++ FL A+P+L A F L + +G+ PS SPEH F
Sbjct: 493 NKTFLSTTAWPILRDAARFFHCHLFQ-WNGHWTAGPSLSPEHAF 535
>gi|419482693|ref|ZP_14022480.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40563]
gi|379579285|gb|EHZ44192.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40563]
Length = 803
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|336321550|ref|YP_004601518.1| alpha-L-fucosidase [[Cellvibrio] gilvus ATCC 13127]
gi|336105131|gb|AEI12950.1| Alpha-L-fucosidase [[Cellvibrio] gilvus ATCC 13127]
Length = 792
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 173/618 (27%), Positives = 288/618 (46%), Gaps = 73/618 (11%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNPD 67
+ F GPA + +A P+GNG +GAMV GG +++N+ T W+G P + D
Sbjct: 5 LRFAGPALRWDEAFPLGNGSVGAMVHGGHRRARVQVNDATAWSGHPAGPGLALAELRRRD 64
Query: 68 -APKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD------DSHLKYA 120
P+ LS +RS + G+ EA + + G A +Q D+ + D + A
Sbjct: 65 VGPRTLSALRSAIAEGRDDEAARLAQRFQGPYAQAFQPFVDLLVTLSPADPTGDDDVDAA 124
Query: 121 EETYRRELDLNTATAR--VKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
E R LDL V + F+S PD + + + + F++ L+
Sbjct: 125 YEG--RSLDLRDGLVHEAVTFESAGCRVMTTWFTSAPDGCLHARWRAPD---VPFSLELE 179
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRI----------------PPKANANDDPKGIQFSAI 222
+ G + +++E G ++ P + + ++ +
Sbjct: 180 L---RGAQPGGPSALVVEAGVVGAQVRVELPFDVAPGHEPDRPGRIAVGSHASLVGYATV 236
Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF----DGPFINPSDSKKDPTSES 278
L +D R T S ++V G+ W +L +++ GP +P++++ +
Sbjct: 237 L--VSTDGRATASP---GGVRVAGATWVEAVLATATTTRWPEPGPLAHPAEAEHASRERA 291
Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
+AL + + RH++D++ L ++L P D++ +P A
Sbjct: 292 RAALPP-SPAAGAVAQRRHVEDHRALADATRLELG-EPADLL-----------LPDA--- 335
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
T P+ F FGRYLL+++SRPG NLQG+WN++ P W S +NINL+M
Sbjct: 336 --LGTAPLPARARAAFAFGRYLLMAASRPGAPPVNLQGVWNDEARPPWSSGYTLNINLQM 393
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS---SADR 455
YW + P L C EPL D + L+ G+ A+ Y +GWV HH +D+W +
Sbjct: 394 AYWPAEPTGLGVCVEPLVDQVRVLAREGAAVARDLYGCAGWVAHHNSDVWGWALPVGDGH 453
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
G WA W MGGAWLC HLW+ Y Y++D D L + +PLL G A+F++DWL+ G L
Sbjct: 454 GDPSWASWWMGGAWLCRHLWDRYEYSLDEDVL-RDVWPLLRGAAAFVVDWLVPDGRGGLV 512
Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
+PS+SPE+ G+ + ST+D+A+ R++ S + A ++L +E L + + +
Sbjct: 513 PSPSSSPEN-VRERAGREVALCAGSTVDVALARDLLSHCLEAVDILGLDE-PLAARWVDA 570
Query: 576 LPRLRPTKIAEDGSIMEW 593
+ RL + DG + EW
Sbjct: 571 VARLPRPDVDADGLLREW 588
>gi|291557898|emb|CBL35015.1| hypothetical protein ES1_21610 [Eubacterium siraeum V10Sc8a]
Length = 796
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 174/612 (28%), Positives = 290/612 (47%), Gaps = 76/612 (12%)
Query: 14 KITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD--- 67
KI F P K PIGNG +GA +GG+ E + LNE TLW G P + + PD
Sbjct: 24 KILFTAPGEFDKWEQQCQPIGNGYMGASFFGGISKEKIVLNEKTLWAGGPSE-SRPDYNS 82
Query: 68 -----APKALSDVRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYA 120
+ + + V+ L+ G+Y EA A L G YQLL D+ L F + A
Sbjct: 83 GIIEGSYEYVKQVQQLLYDGKYDEAVALLPHLTGATDGFGAYQLLCDMMLTFSNIDETQA 142
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+ Y R LDL+ + +++ RE F++ P VI K+S + + +SLD+L
Sbjct: 143 TD-YTRTLDLDNSIFTTQFTYQGAVHKREAFANYPSNVICLKLSSDKPRRICVKLSLDNL 201
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
NG+ + EG G+++ I K+ + G + +D
Sbjct: 202 QCGSVTANGDT-LTYEGALW-------------DNGLRYCTIF--KVVNKGGELIDAKDS 245
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPS-DSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ VE +D + L AS+ + + P+ + +P++ +++ + + LY HL
Sbjct: 246 -IMVEHADEVYIYLTASTDYSNKY--PTFRTGVNPSAAVNQRIENAVSKGFDALYEEHLA 302
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE----LLFQ 355
DY+ LF RV+++++ DI+ P + + ++ + S+ L FQ
Sbjct: 303 DYKALFDRVTLKINEDTDDII------------PCDKLISEYKENGSRSIANRLETLYFQ 350
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRY+LISSSR G+ ANLQG+WNE P W H+N+NL+MNYW + NLSE PL
Sbjct: 351 FGRYMLISSSRAGSLPANLQGVWNESNCPPWCCDYHINVNLQMNYWGAYNTNLSETVPPL 410
Query: 416 FDFLTYLSINGSKTAQVNYL--------ASGWVIHHKTDI--WAKSSADRGKVVWALWPM 465
DFL + +G K+A+ Y +GW H ++ W D W
Sbjct: 411 VDFLDSMRPSGRKSAEAYYGIKSDEEHPENGWCAHTQSTPFGWTAPGWD---FYWGWSTA 467
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH 524
AWL +++E++ +T D+++ + YP++ F WLI + L ++P+ SPEH
Sbjct: 468 AVAWLMQNIYEYFEFTGDKEYFAEHIYPIMRESVRFYTQWLIYDDKQKRLVSSPTYSPEH 527
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
V+ +T + ++I ++++ I+A+E L +E+ L V + +L+P +
Sbjct: 528 ---------GPVTIGNTYEQSLIEQLYNDFITASEALGTDEE-LRNIVKNQVVQLKPYSV 577
Query: 585 AED-GSIMEWVQ 595
++ G + EW +
Sbjct: 578 SKKTGLLKEWFE 589
>gi|419487130|ref|ZP_14026892.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44128]
gi|421209422|ref|ZP_15666435.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070005]
gi|379585499|gb|EHZ50355.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44128]
gi|395573518|gb|EJG34108.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070005]
Length = 803
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|418092259|ref|ZP_12729399.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44452]
gi|353762959|gb|EHD43516.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44452]
Length = 803
Score = 230 bits (587), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|419475985|ref|ZP_14015821.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA14688]
gi|379558767|gb|EHZ23799.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA14688]
Length = 778
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 175/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A A L G Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|387626851|ref|YP_006063027.1| hypothetical protein INV104_14070 [Streptococcus pneumoniae INV104]
gi|444382288|ref|ZP_21180491.1| hypothetical protein PCS8106_00679 [Streptococcus pneumoniae
PCS8106]
gi|444385525|ref|ZP_21183597.1| hypothetical protein PCS8203_01377 [Streptococcus pneumoniae
PCS8203]
gi|301794637|emb|CBW37088.1| conserved hypothetical protein [Streptococcus pneumoniae INV104]
gi|444249595|gb|ELU56083.1| hypothetical protein PCS8203_01377 [Streptococcus pneumoniae
PCS8203]
gi|444252562|gb|ELU59024.1| hypothetical protein PCS8106_00679 [Streptococcus pneumoniae
PCS8106]
Length = 803
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|418103344|ref|ZP_12740416.1| hypothetical protein SPAR143_1623 [Streptococcus pneumoniae NP070]
gi|421225485|ref|ZP_15682223.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070768]
gi|353774645|gb|EHD55132.1| hypothetical protein SPAR143_1623 [Streptococcus pneumoniae NP070]
gi|395588972|gb|EJG49294.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070768]
Length = 757
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 571
>gi|331091988|ref|ZP_08340820.1| hypothetical protein HMPREF9477_01463 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330402887|gb|EGG82454.1| hypothetical protein HMPREF9477_01463 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1785
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 175/614 (28%), Positives = 290/614 (47%), Gaps = 82/614 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD----------APKALSDVR 76
++P+GNG LG +++GG+ E + NE TLWTG P + T PD K + R
Sbjct: 71 SLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSE-TRPDYQFGNKKTAYTDKEIEAYR 129
Query: 77 SLVDSGQY----------AEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAE-E 122
L+D + +K G YQ GDI ++F ++ ++ +
Sbjct: 130 KLLDDKSKNVFNDDTSLGKPGMSGKIKFPGEDNLNKGSYQDFGDIWIDFSETGIRDDNVK 189
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRRELDL T A +S V++ REHF S+PDQV+VT++S S+ L ++ ++
Sbjct: 190 NYRRELDLQTGVAATTFSHQGVDYKREHFVSSPDQVMVTELSASKEKKLDVSIKMEL--- 246
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N+S + G + E I K N G++F + KI G I+A E +L
Sbjct: 247 NNSGLEGTAKFDAEQNMY--TIFGKVKDN----GLKFRTTM--KIVQSGGDITADEKNQL 298
Query: 243 -KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
KVE +D ++++ A + + + D+KKD + ++ SY +L H++D+
Sbjct: 299 YKVENADKIMIVMAAETDYKNDYPTYRDTKKDLEKVVVERVKRASEKSYQELKENHIEDH 358
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYL 360
Q LF RVS+ L EN +P+ E + +++ +E+L FQ+GRYL
Sbjct: 359 QGLFDRVSLDLG-------------ENRSNIPTNELIDAYRKGSYSKYLEVLAFQYGRYL 405
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
I+ SR GT +NL G+W S W H N+N++MNYW NL+EC + D++
Sbjct: 406 TIAGSR-GTLPSNLVGLWTMGAS-AWTGDYHFNVNVQMNYWPVYVTNLAECGTTMVDYME 463
Query: 421 YLSINGSKTAQ-------VNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCT 472
L G TA+ +G+ +H + + + ++ + + W P G AW
Sbjct: 464 NLREPGRLTAERVHGIEDATTKKNGFTVHTENNPFGMTAPTNNQEYGWN--PTGAAWAIQ 521
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-------IEGHDGYLETNPSTSPEHE 525
+LW HY +T ++D+L+ YP+++ A F ++L + + + P
Sbjct: 522 NLWAHYEFTQNKDYLKNTIYPIMKEAAQFWDNYLWTSDYQKVHDKNSKYDGQPRLVVVPS 581
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS----LPRLRP 581
F A G A +T D +++ E+++ I A +++ ED E VLKS + RL P
Sbjct: 582 FSAEQGPTAV---GTTYDQSLVWELYNECIKAGKIV--GED---ETVLKSWEEKMQRLDP 633
Query: 582 TKIAEDGSIMEWVQ 595
++ I EW +
Sbjct: 634 IEMNATNGIKEWYE 647
>gi|15903541|ref|NP_359091.1| hypothetical protein spr1498 [Streptococcus pneumoniae R6]
gi|116515332|ref|YP_816923.1| hypothetical protein SPD_1467 [Streptococcus pneumoniae D39]
gi|421266644|ref|ZP_15717524.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR27]
gi|15459158|gb|AAL00302.1| Conserved hypothetical protein [Streptococcus pneumoniae R6]
gi|116075908|gb|ABJ53628.1| conserved hypothetical protein [Streptococcus pneumoniae D39]
gi|395866712|gb|EJG77840.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR27]
Length = 803
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF+ ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFNQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNTFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNSLQITQSGRIREWYE 592
>gi|418192091|ref|ZP_12828593.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA47388]
gi|353855177|gb|EHE35147.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA47388]
Length = 778
Score = 230 bits (586), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 175/610 (28%), Positives = 286/610 (46%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T +R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDCQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVMTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND + F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LWFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|417923725|ref|ZP_12567182.1| hypothetical protein HMPREF9959_1543 [Streptococcus mitis SK569]
gi|342836607|gb|EGU70818.1| hypothetical protein HMPREF9959_1543 [Streptococcus mitis SK569]
Length = 803
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 279/599 (46%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSFDYQGGNLQDQYSFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GD+ +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNTAEELAEQHLVGPKTSQYGTYLSFGDLLIEFSRQGKTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y+ F RE F+S PD ++V + + + +L F + L D S +
Sbjct: 147 LATTSYAYKGTMFKRESFASFPDDLLVQRFTKEGAETLDFTIELSLTRDLASDGKYEQKK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F+ L + G I DK +++ G+ +
Sbjct: 207 SDYKECQLEITDSHILMKGRVKDN--NLRFAGCLAWQTD---GDIRVWSDK-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + +++ + Y+ L +RH++D Q LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLVQQVRDLVETAKEKGYAQLKSRHIEDCQTLFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L +D + + +K+++ E SL EL FQ+GRYLLISSSR +
Sbjct: 321 LDLG-------------AEVDASTTDDLLKNYKPQEGQSLEELFFQYGRYLLISSSRDCS 367
Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+NINL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNGVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAFPVINYIDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTIYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L R YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLRDRIYPILRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E V + L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE-VKEKFELLNPLQITQSGRIREWYE 592
>gi|168486978|ref|ZP_02711486.1| alpha-fucosidase [Streptococcus pneumoniae CDC1087-00]
gi|237650661|ref|ZP_04524913.1| alpha-fucosidase [Streptococcus pneumoniae CCRI 1974]
gi|237822420|ref|ZP_04598265.1| alpha-fucosidase [Streptococcus pneumoniae CCRI 1974M2]
gi|418126305|ref|ZP_12763211.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44511]
gi|418214849|ref|ZP_12841583.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA54644]
gi|419458244|ref|ZP_13998186.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02254]
gi|419484883|ref|ZP_14024658.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43257]
gi|419510919|ref|ZP_14050560.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP141]
gi|419530550|ref|ZP_14070077.1| hypothetical protein SPAR62_1499 [Streptococcus pneumoniae GA40028]
gi|421213591|ref|ZP_15670545.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070108]
gi|421215753|ref|ZP_15672674.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070109]
gi|421301508|ref|ZP_15752178.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA19998]
gi|183570104|gb|EDT90632.1| alpha-fucosidase [Streptococcus pneumoniae CDC1087-00]
gi|353796245|gb|EHD76590.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44511]
gi|353869579|gb|EHE49460.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA54644]
gi|379529908|gb|EHY95149.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02254]
gi|379573458|gb|EHZ38413.1| hypothetical protein SPAR62_1499 [Streptococcus pneumoniae GA40028]
gi|379581636|gb|EHZ46520.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43257]
gi|379631522|gb|EHZ96099.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP141]
gi|395578822|gb|EJG39332.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070108]
gi|395579960|gb|EJG40455.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070109]
gi|395899068|gb|EJH10012.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA19998]
Length = 803
Score = 229 bits (585), Expect = 2e-57, Method: Compositional matrix adjust.
Identities = 175/610 (28%), Positives = 286/610 (46%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T +R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDCQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVMTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND + F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LWFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|418144603|ref|ZP_12781398.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13494]
gi|418185405|ref|ZP_12821946.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47283]
gi|353807069|gb|EHD87341.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13494]
gi|353848689|gb|EHE28701.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47283]
Length = 782
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 175/610 (28%), Positives = 286/610 (46%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T +R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDCQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 126 LVMTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND + F++ L + G I
Sbjct: 186 SDYKECKLDITDSHILMKGRV---------KDND----LWFASYLAWETD---GDIRVWS 229
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 562 QSGRIREWYE 571
>gi|67902324|ref|XP_681418.1| hypothetical protein AN8149.2 [Aspergillus nidulans FGSC A4]
gi|74593077|sp|Q5AU81.1|AFCA_EMENI RecName: Full=Alpha-fucosidase A; AltName: Full=Alpha-L-fucoside
fucohydrolase A; Flags: Precursor
gi|40739981|gb|EAA59171.1| hypothetical protein AN8149.2 [Aspergillus nidulans FGSC A4]
gi|95025957|gb|ABF50892.1| alpha-fucosidase [Emericella nidulans]
gi|259480915|tpe|CBF73981.1| TPA: Alpha-fucosidasePutative uncharacterized protein ;
[Source:UniProtKB/TrEMBL;Acc:Q5AU81] [Aspergillus
nidulans FGSC A4]
Length = 809
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 176/594 (29%), Positives = 289/594 (48%), Gaps = 65/594 (10%)
Query: 30 IGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KALSDVRSLVDSG 82
IGNG+LG + +G +E L LN D+LW+G P +YT NP +P AL +R +
Sbjct: 46 IGNGKLGVIPFGPPDTEKLNLNVDSLWSGGPFEVENYTGGNPSSPIYDALPGIRERI--- 102
Query: 83 QYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY 139
+ T +L G + Y ++LG+I + D A Y+R LDL+ R +
Sbjct: 103 -FENGTGGMEELLG-SGNHYGSSRVLGNITIALDGVE---AYSKYKRTLDLSDGVHRTSF 157
Query: 140 SVGN---VEFTREHFSSNPDQVIVTKISGSESGSL-SFNVSLDSLLDNHSYVNGNNQIIM 195
++ N F S PDQV V + + L +S+++LL N S ++
Sbjct: 158 TIANRTTAALKSSIFCSYPDQVCVYHLESASDARLPKVTISIENLLVNQS--------LL 209
Query: 196 EGRCP--GKRIPPKANA---NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ C KR + + P+G++++A+ E+ ++ + L + L++
Sbjct: 210 QTSCESEAKRAVLRHSGVTQAGPPEGMKYAAVAEV-VNPRSSVTTCLGEGALQISSRKKQ 268
Query: 251 VLLLV-ASSSFDGPFINPSD-----SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+ +++ A++++D N + KDP S + Y L RH+ DY+KL
Sbjct: 269 LTIIIGAATNYDQKAGNAKSGWSFKNAKDPASIVDGIASAAGWKGYQRLLDRHVKDYKKL 328
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
S++L DT + DT E+ +P L LL + R+LL+SS
Sbjct: 329 MGDFSLELP--------DTTDSASKDTSELIEKYSYASATGNPYLENLLLDYARHLLVSS 380
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRP + ANLQG W E L+P+W + H NINL+MNYW + L E Q L++++ +
Sbjct: 381 SRPNSLPANLQGRWTESLTPSWSADYHANINLQMNYWLADQTGLGETQHALWNYMADTWV 440
Query: 425 -NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G++TA++ Y ASGWV+H++ +I+ +A + WA +P AW+ H+W++++YT D
Sbjct: 441 PRGTETARLLYNASGWVVHNEINIFG-FTAMKEDAGWANYPAAAAWMMQHVWDNFDYTHD 499
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
+L + Y LL+G ASF L L E +DG L NP SPE P C Y
Sbjct: 500 TAWLVSQGYALLKGIASFWLSSLQEDKFFNDGSLVVNPCNSPE---TGPT-TFGCTHYQQ 555
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
+I +VF +++A E + +++ V+ V +L RL ++ G + EW
Sbjct: 556 -----LIHQVFETVLAAQEYIHESDTKFVDSVASALERLDTGLHLSSWGGLKEW 604
>gi|307709595|ref|ZP_07646048.1| alpha-fucosidase [Streptococcus mitis SK564]
gi|307619631|gb|EFN98754.1| alpha-fucosidase [Streptococcus mitis SK564]
Length = 803
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 177/614 (28%), Positives = 282/614 (45%), Gaps = 65/614 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLSNSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKY 119
D ++++R ++ Y A A L G Y GDI +EF
Sbjct: 72 NLQDQYAFIAEIRQDLEKRDYNRAKELAEQHLVGSKTSQYGTYLSFGDIHIEFSKQGKTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ Y+R+L+++ A A Y F RE F+S PD ++V + + +L F + L
Sbjct: 132 SQVMDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLLVQRFTKEGLETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
DK +++ G+ +A L L A + F + K D + +++ + Y+ L
Sbjct: 247 RVWSDK-VQISGASYANLFLAAKTDFAQNPASNYRKKIDLEQQVKDLVETAKEKGYAQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L +D + + +K++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDLG-------------AEVDASTTDDLLKNYNPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A Y +GW++H + W D W
Sbjct: 413 FPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L E ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHEDRQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +E L E V + L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDESLLTE-VKEKFDLLNP 578
Query: 582 TKIAEDGSIMEWVQ 595
+I + G I EW +
Sbjct: 579 LQITQSGRIREWYE 592
>gi|418966783|ref|ZP_13518495.1| hypothetical protein HMPREF1045_1205 [Streptococcus mitis SK616]
gi|383346450|gb|EID24496.1| hypothetical protein HMPREF1045_1205 [Streptococcus mitis SK616]
Length = 803
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 279/599 (46%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSFDYQGGNLQDQYSFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GD+ +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNTAEELAEQHLVGPKTSQYGTYLSFGDLLIEFSRQGKTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y+ F RE F+S PD ++V + + + +L F + L D S +
Sbjct: 147 LATTSYAYKGTMFKRESFASFPDDLLVQRFTKEGAETLDFTIELSLTRDLASDGKYEQKK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F+ L + G I DK +++ G+ +
Sbjct: 207 SDYKECQLEITDSHILMKGRVKDN--NLRFAGCLAWQTD---GDIRVWSDK-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + +++ + Y+ L +RH++D Q LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLVQQVRDLVETAKEKGYAQLKSRHIEDCQTLFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L +D + + +K+++ E SL EL FQ+GRYLLISSSR +
Sbjct: 321 LDLG-------------AEVDASTTDDLLKNYKPQEGQSLEELFFQYGRYLLISSSRDCS 367
Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+NINL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNGVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAFPVINYIDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTIYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L R YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLRDRIYPILRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E V + L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE-VKEKFELLNPLQITQSGRIREWYE 592
>gi|169604462|ref|XP_001795652.1| hypothetical protein SNOG_05244 [Phaeosphaeria nodorum SN15]
gi|160706577|gb|EAT87635.2| hypothetical protein SNOG_05244 [Phaeosphaeria nodorum SN15]
Length = 771
Score = 229 bits (585), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 170/606 (28%), Positives = 276/606 (45%), Gaps = 76/606 (12%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S STT I F P +TDA+P+GNGRLGA++ GG E + LNED++W+G N
Sbjct: 21 SASTT----IWFGKPGVIWTDALPVGNGRLGAVIHGGYGMEQVGLNEDSIWSGGLQKRIN 76
Query: 66 PDAPKALSDVRSLVDSGQYAEATAA---SVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
+A A + +G ++A ++K G YQ G++ +EF + +
Sbjct: 77 SNALAAFPGIPEAFTNGNISKADEIWHNNLKGTGTQVRQYQPAGNMMIEFGQN--VSSVS 134
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y R LDL T V Y+ +V + R+ +S P + + + ++G+L +SL
Sbjct: 135 GYNRSLDLTTGENHVSYTRNDVTYLRQALASYPHDTLGFRYTADKAGALDMKISLT---- 190
Query: 183 NHSYVNGNN------QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
+ V G I M G+ ND ++F + I++ D G
Sbjct: 191 RNESVTGLKVDLEKLSITMYGQ----------GTNDSS--LKF--VHSIRVVADTG---- 232
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K++++ A ++F + +++ + ++ A+ + + ++
Sbjct: 233 --GKEVRI--------YYGAETTFRHANVEAAEAAMNAKLDAAVAV------PWEEFKSK 276
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD----EDPSLVEL 352
++DY+ L RV + D S I + + +R+K++ T DP L+ L
Sbjct: 277 AIEDYKNLADRVQL-----------DVGSSGEIGRLDTGQRLKNWNTTGNATSDPELMAL 325
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+ +GR+LLI SSR G+ +NLQG+WN+ P W S +NIN EMNYW + NL+E
Sbjct: 326 TYNYGRFLLIGSSRIGSLPSNLQGVWNDKFKPPWGSRFTININTEMNYWPAETTNLAETH 385
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
P+FD L + G A+ Y SGWV HH TD+W + WA P+GGAWL
Sbjct: 386 LPVFDHLLRMQEQGRYVAKGMYNMSGWVCHHNTDLWGDCVPVDDQTYWAANPVGGAWLAL 445
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
HL EH+ + + + A P+L +F D+ I+ D Y +SPE+ + P K
Sbjct: 446 HLIEHFRFNGNTTWASSTALPILSDALTFFYDFSIKKGD-YNALIYDSSPENSYHIPSNK 504
Query: 533 -----LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
+ S ++ E+FS I +E + V K L + P +A D
Sbjct: 505 QVPNATTGIDQGSAHPRQVLHELFSGFIEMSEATGSIDG--VAKAKDYLAHIEPPNVATD 562
Query: 588 GSIMEW 593
G ++EW
Sbjct: 563 GHLLEW 568
>gi|421290215|ref|ZP_15740965.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA54354]
gi|421305607|ref|ZP_15756261.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA62331]
gi|395887900|gb|EJG98914.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA54354]
gi|395904565|gb|EJH15479.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA62331]
Length = 803
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYET 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L + KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTDVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|291530512|emb|CBK96097.1| hypothetical protein EUS_08620 [Eubacterium siraeum 70/3]
Length = 776
Score = 229 bits (584), Expect = 3e-57, Method: Compositional matrix adjust.
Identities = 172/610 (28%), Positives = 290/610 (47%), Gaps = 72/610 (11%)
Query: 14 KITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
KI F P K PIGNG +GA +GG+ E + LNE TLW G P + + PD
Sbjct: 4 KILFTAPGEFDKWEQQCQPIGNGYMGASFFGGISKEKIVLNEKTLWAGGPSE-SRPDYNG 62
Query: 71 ALSD--------VRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYA 120
+ D V+ L+ G+Y EA A L G YQLL D+ L F + A
Sbjct: 63 GIIDGSYEYVKQVQQLLYDGKYDEAVALLPHLTGATDGYGAYQLLCDMMLTFSNIDETQA 122
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+ Y R LDL+ + +++ RE F++ P VI K+S + + +SLD+L
Sbjct: 123 TD-YTRTLDLDNSIFTTQFTYQGAVHKREAFANYPSNVICIKLSSDKPRRICVKLSLDNL 181
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
NG+ + EG G+++ + K+ + G + +D
Sbjct: 182 QCGSVTANGDT-LTYEGALW-------------DNGLRYCTVF--KVVNKGGELIDAKDS 225
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPS-DSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ VE +D + L AS+ + + P+ + +P++ +++ + ++ LY HL
Sbjct: 226 -IMVEHADEVYIYLTASTDYSNKY--PTFRTGVNPSAAVNQRIENAVSKGFNALYEEHLA 282
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE----LLFQ 355
DY+ LF V+++++ DI+ P + ++ ++ + S+ L FQ
Sbjct: 283 DYKALFDSVTLKINEDTDDII------------PCDKLIREYKENGSRSIANRLETLYFQ 330
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRY+LISSSR G+ ANLQG+WNE P W H+N+NL+MNYW + NLSE PL
Sbjct: 331 FGRYMLISSSRAGSLPANLQGVWNESNCPPWCCDYHINVNLQMNYWGAYNTNLSETVPPL 390
Query: 416 FDFLTYLSINGSKTAQVNYL--------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
DFL + +G K+A+ Y +GW H ++ + +A W
Sbjct: 391 VDFLDSMRPSGRKSAEAYYGIKSDEEHPENGWCAHTQSTPFG-WTAPGWNFYWGWSTAAV 449
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
AWL +++E++ +T D+ + + YP++ F WLI + L ++P+ SPEH
Sbjct: 450 AWLMQNIYEYFEFTGDKKYFAEHIYPIMRESVRFYTQWLIYDDKQKRLVSSPTYSPEH-- 507
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
V+ +T + ++I ++++ I+A+E L +E+ L V + +L+P +++
Sbjct: 508 -------GPVTIGNTYEQSLIEQLYNDFITASEALGTDEE-LRNIVKNQVVQLKPFSVSK 559
Query: 587 D-GSIMEWVQ 595
G + EW +
Sbjct: 560 KTGLLKEWFE 569
>gi|419428224|ref|ZP_13968401.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5652-06]
gi|379616100|gb|EHZ80800.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5652-06]
Length = 707
Score = 229 bits (584), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 168/534 (31%), Positives = 265/534 (49%), Gaps = 56/534 (10%)
Query: 72 LSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRREL 128
L +R + G+ +A + +F P D Y+LLG++ +E D A Y REL
Sbjct: 3 LKKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYEREL 61
Query: 129 DLNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
DL+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 62 DLDTAISNVVFDPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDE 121
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 122 VSKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVI 166
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQK 303
+ L L + +++ G +S+LQ ++ Y H+ YQ+
Sbjct: 167 RNATEVFLYLKSMTNYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQE 213
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RV +L S KD ++ I T E K + L LLF +GRYLLIS
Sbjct: 214 QFNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLIS 261
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 262 SSQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMR 321
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 322 EPGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQD 381
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 382 ERILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTID 439
Query: 544 MAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW++
Sbjct: 440 NQILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLE 490
>gi|256376305|ref|YP_003099965.1| hypothetical protein Amir_2174 [Actinosynnema mirum DSM 43827]
gi|255920608|gb|ACU36119.1| conserved hypothetical protein [Actinosynnema mirum DSM 43827]
Length = 646
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 114/266 (42%), Positives = 157/266 (59%), Gaps = 5/266 (1%)
Query: 328 NIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWD 387
+D P+ + S E P+L LLFQ GR+LL++SSRPGT ANLQG+WN P W
Sbjct: 199 ELDLGPAPDGPPSTWPREHPALAALLFQHGRHLLVASSRPGTLPANLQGVWNPHAEPPWR 258
Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
S +NIN EMNYW + P L+EC EPL +FL L+ +G++ A+ Y GW HH TD
Sbjct: 259 SNYTLNINTEMNYWPAEPTALAECHEPLLEFLHGLAESGTRVARELYGLPGWCAHHNTDR 318
Query: 448 WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
W ++ +G WA WPM GAWL HLWE Y + D +L RA+PLL G A F L WL+
Sbjct: 319 WFLATPVQGDPAWANWPMAGAWLSLHLWERYEFGGDAVWLRGRAWPLLLGAAEFCLAWLV 378
Query: 508 EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA 567
E G L T PSTSPE+ ++ DG+ V +TMD+A+ E+ ++ A VL ++
Sbjct: 379 E-DRGELTTAPSTSPENHYLTADGREVAVGVGATMDLALTWELLDRVVRAGAVLGED--- 434
Query: 568 LVEKVLKSLPRLRPTKIAEDGSIMEW 593
V + ++L R+ + DG ++EW
Sbjct: 435 -VGRFAEALARIPEPPVGSDGRVLEW 459
Score = 46.6 bits (109), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 51/114 (44%), Gaps = 12/114 (10%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN---PDAPKALSDVR 76
PA + +A PIG+GR GAM WG LN+D LWT + AP+ + R
Sbjct: 15 PAARWEEAHPIGDGRFGAMCWG---DGRFDLNDDRLWTDPSPPDPSQPAAGAPEVVRAAR 71
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ +G A + G YQ LG + L + AE YRRELDL
Sbjct: 72 AAALAGDPERADELLRSVQGPDTASYQPLGTLVLGY------RAEGGYRRELDL 119
>gi|306824549|ref|ZP_07457895.1| possible alpha-L-fucosidase [Streptococcus sp. oral taxon 071 str.
73H25AP]
gi|304433336|gb|EFM36306.1| possible alpha-L-fucosidase [Streptococcus sp. oral taxon 071 str.
73H25AP]
Length = 1749
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 183/616 (29%), Positives = 301/616 (48%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 184 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 241
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 242 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 301
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSYV-- 187
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N +Y
Sbjct: 302 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 361
Query: 188 -----NGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
NG+ N I+++G K N G++F++ L IK G + A+
Sbjct: 362 YSHYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIKTD---GKV-AV 404
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E+ +++ + Y L
Sbjct: 405 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLENTVKGIVEAAKAKDYETLK 461
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++S+ ++ L EL F
Sbjct: 462 QDHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQSYNPEKGQKLEELFF 508
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NLSE
Sbjct: 509 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLSETA 568
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 569 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 623
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 624 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKVSDRWV-SSPS 682
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 683 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 732
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I +G I EW +
Sbjct: 733 KPLHINNEGRIKEWYE 748
>gi|270292150|ref|ZP_06198365.1| fibronectin type III domain protein [Streptococcus sp. M143]
gi|270279678|gb|EFA25520.1| fibronectin type III domain protein [Streptococcus sp. M143]
Length = 1747
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 181/616 (29%), Positives = 301/616 (48%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--ERYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEVGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDENGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + D T E ++ + D+ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGNKTDQTT-------------KEALQGYNPDKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRVAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706
>gi|418190394|ref|ZP_12826903.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47373]
gi|353851653|gb|EHE31644.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47373]
Length = 682
Score = 228 bits (582), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 165/511 (32%), Positives = 255/511 (49%), Gaps = 55/511 (10%)
Query: 94 LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTRE 149
+F P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE
Sbjct: 1 MFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKRE 59
Query: 150 HFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPK 207
+F+S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 60 YFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR----- 114
Query: 208 ANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP 267
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 115 -------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI--- 161
Query: 268 SDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE 326
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++
Sbjct: 162 ----------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS----- 205
Query: 327 ENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTW 386
I T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W
Sbjct: 206 --IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIW 259
Query: 387 DSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTD 446
S +NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD
Sbjct: 260 GSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTD 319
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L
Sbjct: 320 GFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYL 378
Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED 566
E DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D
Sbjct: 379 FEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSD 437
Query: 567 AL--VEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ V+++ K LPR TKI +G I EW++
Sbjct: 438 FISRVKELKKKLPR---TKIGSNGQIQEWLE 465
>gi|293364225|ref|ZP_06610951.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037]
gi|307702420|ref|ZP_07639376.1| alpha-fucosidase [Streptococcus oralis ATCC 35037]
gi|291317071|gb|EFE57498.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037]
gi|307624002|gb|EFO02983.1| alpha-fucosidase [Streptococcus oralis ATCC 35037]
Length = 1707
Score = 228 bits (582), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 181/616 (29%), Positives = 304/616 (49%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G+QF++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ ++ Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKSKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + T + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706
>gi|365119726|ref|ZP_09337619.1| hypothetical protein HMPREF1033_00965 [Tannerella sp.
6_1_58FAA_CT1]
gi|363648290|gb|EHL87470.1| hypothetical protein HMPREF1033_00965 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1009
Score = 228 bits (582), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 161/501 (32%), Positives = 244/501 (48%), Gaps = 45/501 (8%)
Query: 105 LGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI 163
L DIELE++ + + Y R LD++ A V Y FTRE F S PD V+V ++
Sbjct: 317 LSDIELEYEQLYEPLEPYSDYVRMLDIDNAVHSVIYKENGTTFTRECFMSYPDNVMVMRL 376
Query: 164 SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
+ G +S + S N + M G+ P N G++F+
Sbjct: 377 KADKGGCISRTFGITSPQPKKRIFASGNTLTMTGQ------PALHKEN----GLKFAQ-- 424
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSA 281
++K+ + G + +++KK++V+ +D +LL+ A++++ D S +DP +
Sbjct: 425 QVKVLNKGGYLEVIDNKKIRVKDADEVILLMSAATNYQQSMDEKFDYFSDEDPLTTVKRT 484
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
L + + +Y DL + H DY+ L+ R+S+ L T + + K
Sbjct: 485 LMAAESKTYEDLLSSHKKDYKALYDRMSLNLGNI-------TGMSTKTTDILLKDFYKGN 537
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+E+ L +QFGRYLLI+SSR + ANLQG+W E LS W++ H NIN++MNYW
Sbjct: 538 TVEENLYTEMLYYQFGRYLLIASSRENSLPANLQGVWGERLSNPWNADYHTNINVQMNYW 597
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADR 455
+ NLS C PL ++ L G TA+ Y GWV HH+ +IW ++
Sbjct: 598 PAQQTNLSPCHIPLISYINSLVPRGKITARHYYCKPDGGDVRGWVTHHENNIWGNTAPGT 657
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD--WLIEGHDGY 513
+ P G AW+C +WE+Y + D+ FLE+ Y L G A F +D W E DG
Sbjct: 658 SYGAFHF-PAGAAWMCQDIWEYYQFNCDKKFLEQN-YNTLLGAALFWVDNLWTDE-RDGT 714
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KV 572
L NPS SPEH + L C ST+ A+I E+F +I A+E L K+ + E K
Sbjct: 715 LVANPSHSPEH----GEYSLGC----STV-QAMIAEIFDIVIKASEDLGKDTKEVAEIKA 765
Query: 573 LKSLPRLRPTKIAEDGSIMEW 593
KS +L +I G MEW
Sbjct: 766 AKS--KLAGPQIGLGGQFMEW 784
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 42/56 (75%), Gaps = 3/56 (5%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
+K +N PAK + ++A+PIGNG +GAM++G V + +++NE +LW+G PG+ NPD
Sbjct: 40 MKAVYNKPAKVWESEALPIGNGYMGAMIFGDVYRDVIQVNEHSLWSGGPGE--NPD 93
>gi|417939732|ref|ZP_12583021.1| gram positive anchor [Streptococcus oralis SK313]
gi|343389927|gb|EGV02511.1| gram positive anchor [Streptococcus oralis SK313]
Length = 1727
Score = 228 bits (582), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 180/617 (29%), Positives = 301/617 (48%), Gaps = 98/617 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKTKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGSKTGQTT-------------KEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDQTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKTKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQR 596
+P I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYEE 707
>gi|419778183|ref|ZP_14304079.1| gram positive anchor [Streptococcus oralis SK10]
gi|383187500|gb|EIC79950.1| gram positive anchor [Streptococcus oralis SK10]
Length = 1707
Score = 228 bits (581), Expect = 6e-57, Method: Compositional matrix adjust.
Identities = 181/616 (29%), Positives = 304/616 (49%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G+QF++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ ++ Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKSKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + T + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706
>gi|406576906|ref|ZP_11052529.1| LPXTG cell surface protein, calx-beta domain-containing protein
[Streptococcus sp. GMD6S]
gi|404460587|gb|EKA06837.1| LPXTG cell surface protein, calx-beta domain-containing protein
[Streptococcus sp. GMD6S]
Length = 1707
Score = 228 bits (581), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 180/616 (29%), Positives = 302/616 (49%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ+LF+RV + L N + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQRLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706
>gi|400594907|gb|EJP62734.1| alpha-fucosidase [Beauveria bassiana ARSEF 2860]
Length = 798
Score = 228 bits (581), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 167/593 (28%), Positives = 272/593 (45%), Gaps = 63/593 (10%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
T + IGNGR+GA ++G +E + LNED++W+G + +AL +R +
Sbjct: 42 TGVLAIGNGRIGAAIFGS-GNEVITLNEDSIWSGPLQNRMPTRGLQALPKIRQQLVEDNI 100
Query: 85 AEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
EAT++ + P+ VY G++ L+F Y R LD A + Y+
Sbjct: 101 TEATSSIMNDM-MPSVSRERVYSYFGNLHLDFGHER---GMTNYVRWLDTRQGNAGISYT 156
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL---DSLLDNHSYVNGNNQIIMEG 197
+ +TRE+ +S P ++ + + S++G+LSFN + ++L N + N ++
Sbjct: 157 YNGINYTREYIASFPAGILAARFTASKAGALSFNTTFTRESNILANSASATTNGGLLTMR 216
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
G+ + +DP I F+ + I+D+ T ++ L + G+ L
Sbjct: 217 GSSGQ------STKNDP--ILFTGKGQF-IADNAHT--SVSGSTLSITGATEVDLFFDIE 265
Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
+S+ +++ +E L++ Y+D+ + D L R SI +SP
Sbjct: 266 TSYR------HQTQQKLEAEVDRKLKASIAKGYTDIRDGAIADATALLGRASINFGKSPN 319
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG----TQVA 372
+P+ +R+K + +D L L + +GR+LL++SSR + A
Sbjct: 320 GAAN----------LPTDKRIKMARKGLDDTQLAVLAWNYGRHLLVASSRHNDADVSLPA 369
Query: 373 NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV 432
NL G+WN + W +N+NLEMNYW + N+ E QE +F L G + AQ
Sbjct: 370 NLLGLWNNRTTSAWGGKFTINVNLEMNYWPAGQTNIIETQESMFSLLKIAKPRGEEMAQK 429
Query: 433 NYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
Y +G V HH D+W ++ +WPMG AW H+ +HY +T D FL AY
Sbjct: 430 LYGCNGTVFHHNLDLWGDAAPSDNNTSATMWPMGAAWTVQHMMDHYRFTGDAGFLLHTAY 489
Query: 493 PLLEGCASFL----LDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
P L ASF DW G T PS SPE+ FI P G + MD
Sbjct: 490 PFLTDVASFYRCYAFDW-----QGSKVTGPSVSPENSFIVPKNASVAGSRKAYDIAPEMD 544
Query: 544 MAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++R+V +++ AA+ L + +ED V++ K LP +R I G I+EW
Sbjct: 545 NQLMRDVMESLLEAAKALNIPQTDED--VKEATKFLPLIRRPAIGSYGQILEW 595
>gi|15901489|ref|NP_346093.1| hypothetical protein SP_1654 [Streptococcus pneumoniae TIGR4]
gi|111658563|ref|ZP_01409226.1| hypothetical protein SpneT_02000319 [Streptococcus pneumoniae
TIGR4]
gi|421243582|ref|ZP_15700095.1| fibronectin type III domain protein [Streptococcus pneumoniae
2081074]
gi|421247923|ref|ZP_15704402.1| fibronectin type III domain protein [Streptococcus pneumoniae
2082170]
gi|14973145|gb|AAK75733.1| conserved hypothetical protein [Streptococcus pneumoniae TIGR4]
gi|395606587|gb|EJG66691.1| fibronectin type III domain protein [Streptococcus pneumoniae
2081074]
gi|395612939|gb|EJG72972.1| fibronectin type III domain protein [Streptococcus pneumoniae
2082170]
Length = 803
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 281/599 (46%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL++NYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQLNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y + D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YLFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|336431570|ref|ZP_08611417.1| hypothetical protein HMPREF0991_00536, partial [Lachnospiraceae
bacterium 2_1_58FAA]
gi|336011929|gb|EGN41858.1| hypothetical protein HMPREF0991_00536 [Lachnospiraceae bacterium
2_1_58FAA]
Length = 1869
Score = 228 bits (580), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 176/643 (27%), Positives = 302/643 (46%), Gaps = 88/643 (13%)
Query: 6 STSTTNPLKITFNGPAKHFTD----------AIPIGNGRLGAMVWGGVPSETLKLNEDTL 55
+ S + LK+ + PA T ++P+GNG LG +++GG+ E + NE TL
Sbjct: 40 TESISQSLKLWYTSPANINTQETNGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTL 99
Query: 56 WTGVP---------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKL 94
WTG P G+ + + + R L+D G Y A +K
Sbjct: 100 WTGGPSPSRPGYQFGNKATAYTDEEIENYRKLLDDKSTKVFNDDQSLGGYG--MGAQIKF 157
Query: 95 FGHP---ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREH 150
G YQ GDI L+F L+ + YRRELDL T A ++S +V + REH
Sbjct: 158 PGENNLNKGSYQDFGDIWLDFSKMGLQDQNVKNYRRELDLQTGVASTEFSYEDVNYKREH 217
Query: 151 FSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGNNQIIMEGRCPGKRIPPK 207
F SNPDQ++VTK+S SESG L +V ++ + L+ + + NQ C I K
Sbjct: 218 FVSNPDQIMVTKLSASESGKLDLSVKMELNNNGLEGKTTFDSENQT-----CT---IEGK 269
Query: 208 ANANDDPKGIQFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFIN 266
ND ++F +++ + + G + E ++ ++E ++ ++++ A + + +
Sbjct: 270 VKDND----LKFYTTMKLVL--EGGDLEVDEKNQVYQIEDANQVMIVMAAETDYKNDYPT 323
Query: 267 PSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE 326
D +K+ + S SY L +H+ D+QKLF RVS+ L +I
Sbjct: 324 YRDKEKNLKKMVDDRVNSNAKKSYQKLKEKHIADHQKLFDRVSLDLGEQRTNI------- 376
Query: 327 ENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYLLISSSRPGTQVANLQGIWNEDLSPT 385
P+ + V ++ +E+L FQ+GRYL I+ SR GT +NL G+W S
Sbjct: 377 ------PTNQLVDEYRNGTYSHYLEVLAFQYGRYLTIAGSR-GTLPSNLVGLWTVGDSA- 428
Query: 386 WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLA 436
W H N+N++MNYW NL+EC D+ LT ++G + A N+
Sbjct: 429 WTGDYHFNVNVQMNYWPVYTTNLAECGVTFVDYMDKLREPGRLTAERVHGIEGAVENH-- 486
Query: 437 SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
+G+ +H + + + ++ + + P G AW +LW HY +T + D+L+ YP+++
Sbjct: 487 TGFTVHTENNPFGMTAPTNAQ-EYGWNPTGAAWAIQNLWWHYEFTQNEDYLKNTIYPIMK 545
Query: 497 GCASFLLD--WLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIREVFS 552
A F W E E++P + +AP + + +T D +++ E++
Sbjct: 546 EAAQFWDSYLWTSEYQKINDESSPYNGQDRLVVAPSFSEEQGPTAIGTTYDQSLVWELYK 605
Query: 553 AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I A +++ ++E AL++ +++ +L P +I E I EW +
Sbjct: 606 ECIQAGKIVGEDE-ALLKSWEENMQKLDPIEINETNGIKEWYE 647
>gi|315611778|ref|ZP_07886700.1| alpha-L-fucosidase [Streptococcus sanguinis ATCC 49296]
gi|315316193|gb|EFU64223.1| alpha-L-fucosidase [Streptococcus sanguinis ATCC 49296]
Length = 1707
Score = 228 bits (580), Expect = 9e-57, Method: Compositional matrix adjust.
Identities = 180/616 (29%), Positives = 302/616 (49%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKVKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L N + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706
>gi|418094446|ref|ZP_12731573.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA49138]
gi|353764942|gb|EHD45490.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA49138]
Length = 803
Score = 228 bits (580), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 176/610 (28%), Positives = 286/610 (46%), Gaps = 84/610 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +L +G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLCSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCYLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A + Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAALKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWVQ 595
+ G I EW +
Sbjct: 583 QSGRIREWYE 592
>gi|148988700|ref|ZP_01820133.1| hypothetical protein CGSSp6BS73_09249 [Streptococcus pneumoniae
SP6-BS73]
gi|182684597|ref|YP_001836344.1| hypothetical protein SPCG_1627 [Streptococcus pneumoniae CGSP14]
gi|303255977|ref|ZP_07342005.1| hypothetical protein CGSSpBS455_10785 [Streptococcus pneumoniae
BS455]
gi|303258584|ref|ZP_07344564.1| hypothetical protein CGSSp9vBS293_05634 [Streptococcus pneumoniae
SP-BS293]
gi|303262671|ref|ZP_07348611.1| hypothetical protein CGSSp14BS292_00767 [Streptococcus pneumoniae
SP14-BS292]
gi|303263611|ref|ZP_07349533.1| hypothetical protein CGSSpBS397_07359 [Streptococcus pneumoniae
BS397]
gi|303266372|ref|ZP_07352261.1| hypothetical protein CGSSpBS457_04537 [Streptococcus pneumoniae
BS457]
gi|303268245|ref|ZP_07354043.1| hypothetical protein CGSSpBS458_04243 [Streptococcus pneumoniae
BS458]
gi|387759769|ref|YP_006066747.1| hypothetical protein SPNINV200_14790 [Streptococcus pneumoniae
INV200]
gi|419515161|ref|ZP_14054786.1| fibronectin type III domain protein [Streptococcus pneumoniae
England14-9]
gi|421296489|ref|ZP_15747198.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58581]
gi|147925901|gb|EDK76976.1| hypothetical protein CGSSp6BS73_09249 [Streptococcus pneumoniae
SP6-BS73]
gi|182629931|gb|ACB90879.1| hypothetical protein SPCG_1627 [Streptococcus pneumoniae CGSP14]
gi|301802358|emb|CBW35112.1| conserved hypothetical protein [Streptococcus pneumoniae INV200]
gi|302597036|gb|EFL64154.1| hypothetical protein CGSSpBS455_10785 [Streptococcus pneumoniae
BS455]
gi|302636227|gb|EFL66722.1| hypothetical protein CGSSp14BS292_00767 [Streptococcus pneumoniae
SP14-BS292]
gi|302640085|gb|EFL70540.1| hypothetical protein CGSSpBS293_05634 [Streptococcus pneumoniae
SP-BS293]
gi|302642196|gb|EFL72545.1| hypothetical protein CGSSpBS458_04243 [Streptococcus pneumoniae
BS458]
gi|302644072|gb|EFL74330.1| hypothetical protein CGSSpBS457_04537 [Streptococcus pneumoniae
BS457]
gi|302646649|gb|EFL76874.1| hypothetical protein CGSSpBS397_07359 [Streptococcus pneumoniae
BS397]
gi|379635710|gb|EIA00269.1| fibronectin type III domain protein [Streptococcus pneumoniae
England14-9]
gi|395895362|gb|EJH06337.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58581]
Length = 803
Score = 228 bits (580), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQD 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYL---VWETDGDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ Y +L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYTMLRETVRFWNAFLRKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|421488290|ref|ZP_15935682.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
oralis SK304]
gi|400368666|gb|EJP21674.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
oralis SK304]
Length = 1687
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 181/616 (29%), Positives = 301/616 (48%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 199 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITD 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 259 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 318
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 319 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 361
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS-KKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP S +KD E +++ + Y L
Sbjct: 362 QDETLTVTGASYATLYLSAKTNFAQ---NPKTSYRKDIDLEKTVKGIVEAAKAKDYETLK 418
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L N + E ++ + ++ L EL F
Sbjct: 419 KAHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 465
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 466 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 525
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 526 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 580
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 581 WSPAANAWMMQNVYDYYKFTKDESYLKEKIYPMLKETAKFWNSFLHYDKTSDRWV-SSPS 639
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L
Sbjct: 640 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 689
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 690 KPLHINKEGRIKEWYE 705
>gi|418139981|ref|ZP_12776806.1| hypothetical protein SPAR28_1620 [Streptococcus pneumoniae GA13338]
gi|418181010|ref|ZP_12817579.1| hypothetical protein SPAR74_1621 [Streptococcus pneumoniae GA41688]
gi|353843082|gb|EHE23127.1| hypothetical protein SPAR74_1621 [Streptococcus pneumoniae GA41688]
gi|353904760|gb|EHE80210.1| hypothetical protein SPAR28_1620 [Streptococcus pneumoniae GA13338]
Length = 778
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 282/599 (47%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQD 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYL---VWETDGDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ Y +L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYTMLRETVRFWNAFLRKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|401683949|ref|ZP_10815833.1| LPXTG cell wall anchor domain protein [Streptococcus sp. BS35b]
gi|400186628|gb|EJO20836.1| LPXTG cell wall anchor domain protein [Streptococcus sp. BS35b]
Length = 1687
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 182/616 (29%), Positives = 299/616 (48%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 122 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPRPDSTDYNGGNYK--DRYKVLAEIRK 179
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 180 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 239
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + L F N + LL N
Sbjct: 240 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKKLDFTLWNSLTEDLLANGEYSWE 299
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 300 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 342
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 343 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 399
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++S+ + L EL F
Sbjct: 400 QDHIKDYQNLFNRVKLNLGGSKTAQTT-------------KEALQSYNPSKGQKLEELFF 446
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 447 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 506
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 507 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 561
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 562 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 620
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 621 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 670
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 671 KPLHINKEGRIKEWYE 686
>gi|210614863|ref|ZP_03290362.1| hypothetical protein CLONEX_02576 [Clostridium nexile DSM 1787]
gi|210150505|gb|EEA81514.1| hypothetical protein CLONEX_02576 [Clostridium nexile DSM 1787]
Length = 1797
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 181/645 (28%), Positives = 302/645 (46%), Gaps = 96/645 (14%)
Query: 8 STTNPLKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
S LK+ + PAK T ++P+GNG LG +++GG+ E + NE TLWT
Sbjct: 43 SINQELKLWYTSPAKIDTAETNGGEWMQQSLPLGNGNLGNLIFGGIAKERIHFNEKTLWT 102
Query: 58 GVPG----DYTNPDAPKALSDV-----RSLVDS------------GQYAEATAASVKLFG 96
G P +Y + A +D R L+D G Y A +K G
Sbjct: 103 GGPSSSRPNYQFGNKATAYTDTEIEEYRKLLDDKSTNVFNDDKSLGGYG--MGAKIKFPG 160
Query: 97 HP---ADVYQLLGDIELEF-----DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTR 148
YQ GDI L+F +D+++K YRRELD+ T A ++S +V + R
Sbjct: 161 ENNLNKGSYQDFGDIWLDFSKMGINDNNVK----DYRRELDIQTGIAATEFSCKDVTYKR 216
Query: 149 EHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGNNQIIMEGRCPGKRIP 205
EHF SNPDQV+VT++S SE G L NV ++ S L+ + + NQ C I
Sbjct: 217 EHFVSNPDQVMVTELSASEKGKLDLNVKMELNNSGLEGKTTFDEKNQT-----CT---IE 268
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPF 264
K ND ++F +++ ++ G +SA E ++ +++ +D ++++ A + + +
Sbjct: 269 GKVKDND----LKFCTTMKLVLTG--GKLSADEKNQVYQIQDADCVMIVMAAETDYKNDY 322
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
D KD + + SY +L H+ D+Q LF RVS+ L
Sbjct: 323 PTYRDKNKDLKKVVADRVNNGTKKSYDELKETHIADHQGLFDRVSLDLG----------- 371
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
E +VP+ + V ++ +E+L FQ+GRYL I+ SR GT +NL G+W S
Sbjct: 372 --EQRTSVPTNQLVDEYRNGNYSHYLEVLAFQYGRYLTIAGSR-GTLPSNLVGLWTVGNS 428
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNY 434
W H N+N++MNYW NL+EC D+ LT ++G + A N+
Sbjct: 429 A-WTGDYHFNVNVQMNYWPVYATNLAECGTTFVDYMDKLREPGRLTAERVHGIEGAVKNH 487
Query: 435 LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
+G+ +H + + + ++ + + P G AW +LW HY +T D +L+ YP+
Sbjct: 488 --TGFTVHTENNPFGMTAPTNAQ-EYGWNPTGAAWAIQNLWWHYEFTQDEAYLKNTIYPI 544
Query: 495 LEGCASFLLD--WLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIREV 550
++ A F W E E +P +AP + + +T D +++ E+
Sbjct: 545 MKEAALFWDSYLWTSEYQKINDENSPYNGQNRLVVAPSFSEEQGPTAVGTTYDQSLVWEL 604
Query: 551 FSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++ I A +++ ++E AL++ + + +L P +I + I EW +
Sbjct: 605 YNECIKAGKIVGEDE-ALLKSWEEKMQKLDPIEINDTNGIKEWYE 648
>gi|322375926|ref|ZP_08050437.1| fibronectin type III domain protein [Streptococcus sp. C300]
gi|321279194|gb|EFX56236.1| fibronectin type III domain protein [Streptococcus sp. C300]
Length = 1707
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 181/617 (29%), Positives = 302/617 (48%), Gaps = 98/617 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G+QF++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ ++ Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKSKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + T + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQR 596
+P I +G I EW +
Sbjct: 691 KPLHINNEGRIKEWYEE 707
>gi|418975961|ref|ZP_13523855.1| gram positive anchor [Streptococcus oralis SK1074]
gi|383346616|gb|EID24639.1| gram positive anchor [Streptococcus oralis SK1074]
Length = 1687
Score = 227 bits (578), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 183/611 (29%), Positives = 299/611 (48%), Gaps = 88/611 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATA-ASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A A LFG Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLFGPNNAQYGRCLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGDYSWE 319
Query: 184 -HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+Y NG+ G I K D+ G++F++ L IK GT++ ++++ L
Sbjct: 320 YSNYKNGHVTTDEHG------ILLKGTVKDN--GLKFASYLGIKTD---GTVT-VQNETL 367
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
V G+ +A L L A ++F NP ++ +KD E +++ + Y L H+
Sbjct: 368 TVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKKAHIK 424
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + LS S T E ++ + ++ L EL FQ+GRY
Sbjct: 425 DYQSLFNRVKLNLSGSKTAQTT-------------KEALQGYNPEKGQKLEELFFQYGRY 471
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E +P+ +
Sbjct: 472 LLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETAKPMIN 531
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 532 YIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 586
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEH 524
AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS SPEH
Sbjct: 587 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKTSDRWV-SSPSYSPEH 645
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F + A L+ ++D LV +V +L+P I
Sbjct: 646 ---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVEAKFDKLKPLHI 695
Query: 585 AEDGSIMEWVQ 595
+G I EW +
Sbjct: 696 NNEGRIKEWYE 706
>gi|317501845|ref|ZP_07960030.1| hypothetical protein HMPREF1026_01974 [Lachnospiraceae bacterium
8_1_57FAA]
gi|336439520|ref|ZP_08619132.1| hypothetical protein HMPREF0990_01526 [Lachnospiraceae bacterium
1_1_57FAA]
gi|316896735|gb|EFV18821.1| hypothetical protein HMPREF1026_01974 [Lachnospiraceae bacterium
8_1_57FAA]
gi|336015952|gb|EGN45750.1| hypothetical protein HMPREF0990_01526 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1802
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 177/638 (27%), Positives = 300/638 (47%), Gaps = 92/638 (14%)
Query: 13 LKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-- 60
LK+ + PAK T ++P+GNG LG +++GG+ E + NE TLWTG P
Sbjct: 47 LKLWYGSPAKINTAESSGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSS 106
Query: 61 -------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKLFGHP--- 98
G+ + + R L+D G Y A ++ G
Sbjct: 107 SRPNYQFGNKATAYTATEIENYRKLLDDKSSNVFNDDQSLGGYG--MGAKIRFPGEDNLN 164
Query: 99 ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
YQ GDI L+F + + YRREL+L T A ++S NV + REHF S+PDQ
Sbjct: 165 KGSYQDFGDIWLDFSAMGITDDNVQNYRRELNLQTGIASTEFSYKNVSYKREHFVSSPDQ 224
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
V+VT +S SE G L+F+ ++ L+N N ++ + R I K ND +
Sbjct: 225 VMVTNLSASEKGKLNFSAKME--LNND---NLEGKLTFDVRNQTCTIEGKVKDND----L 275
Query: 218 QFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
+F +++ ++ G I+A E ++ +++ +D +++ A + + + D +K+ ++
Sbjct: 276 KFRTTMKLLLTG--GEITADEKNQVYRIKNADQVTIIMAAETDYKNDYPTYRDKEKNLSN 333
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
+ + SY +L H++D+Q LF RVS+ L + TD ID +
Sbjct: 334 VIDTRINDSSKKSYDELKQTHIEDHQSLFDRVSLDLGEFQTSVPTDQL----IDEYRNGS 389
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPT-WDSAPHVNIN 395
+T L FQ+GRYL I+ SR GT +NL G+W + P+ W H N+N
Sbjct: 390 YSHYLET--------LAFQYGRYLTIAGSR-GTLPSNLVGLWT--VGPSAWTGDYHFNVN 438
Query: 396 LEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLASGWVIHHKTD 446
++MNYW NL+EC D+ LT ++G K A N+ +G+ +H + +
Sbjct: 439 VQMNYWPVYSTNLAECGTTFVDYMDKLREPGRLTAERVHGIKGAVDNH--TGFTVHTENN 496
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ + + P G AW +LW HY +T D +L+ YP+++ A F +L
Sbjct: 497 PFGMTAPTNAQ-EYGWNPTGAAWAVQNLWWHYEFTQDEAYLKNTIYPIMKEAAQFWDSYL 555
Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS---------STMDMAIIREVFSAIISA 557
Y + N TSP H + +A S+S +T D ++I E+++ I A
Sbjct: 556 WTSE--YQKINDETSPYH---GENRLVAAPSFSEEQGPTAIGTTYDQSLIWELYNECIQA 610
Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+++ ++E A+++ + + +L P +I I EW +
Sbjct: 611 GKIVGEDE-AVLQSWEEKMQKLDPIEINATNGIKEWYE 647
>gi|331088642|ref|ZP_08337553.1| hypothetical protein HMPREF1025_01136 [Lachnospiraceae bacterium
3_1_46FAA]
gi|330407599|gb|EGG87099.1| hypothetical protein HMPREF1025_01136 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 1802
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 177/638 (27%), Positives = 300/638 (47%), Gaps = 92/638 (14%)
Query: 13 LKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-- 60
LK+ + PAK T ++P+GNG LG +++GG+ E + NE TLWTG P
Sbjct: 47 LKLWYGSPAKINTAESSGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSS 106
Query: 61 -------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKLFGHP--- 98
G+ + + R L+D G Y A ++ G
Sbjct: 107 SRPNYQFGNKATAYTATEIENYRKLLDDKSSNVFNDDQSLGGYG--MGAKIRFPGEDNLN 164
Query: 99 ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
YQ GDI L+F + + YRREL+L T A ++S NV + REHF S+PDQ
Sbjct: 165 KGSYQDFGDIWLDFSAMGITDDNVQNYRRELNLQTGIASTEFSYKNVSYKREHFVSSPDQ 224
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
V+VT +S SE G L+F+ ++ L+N N ++ + R I K ND +
Sbjct: 225 VMVTNLSASEKGKLNFSAKME--LNND---NLEGKLTFDVRNQTCTIEGKVKDND----L 275
Query: 218 QFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
+F +++ ++ G I+A E ++ +++ +D +++ A + + + D +K+ ++
Sbjct: 276 KFRTTMKLLLTG--GEITADEKNQVYRIKNADQVTIIMAAETDYKNDYPTYRDKEKNLSN 333
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
+ + SY +L H++D+Q LF RVS+ L + TD ID +
Sbjct: 334 VIDTRINDSSKKSYDELKQTHIEDHQSLFDRVSLDLGEFQTSVPTDQL----IDEYRNGS 389
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPT-WDSAPHVNIN 395
+T L FQ+GRYL I+ SR GT +NL G+W + P+ W H N+N
Sbjct: 390 YSHYLET--------LAFQYGRYLTIAGSR-GTLPSNLVGLWT--VGPSAWTGDYHFNVN 438
Query: 396 LEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLASGWVIHHKTD 446
++MNYW NL+EC D+ LT ++G K A N+ +G+ +H + +
Sbjct: 439 VQMNYWPVYSTNLAECGTTFVDYMDKLREPGRLTAERVHGIKGAVDNH--TGFTVHTENN 496
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ + + P G AW +LW HY +T D +L+ YP+++ A F +L
Sbjct: 497 PFGMTAPTNAQ-EYGWNPTGAAWAVQNLWWHYEFTQDEAYLKNTIYPIMKEAAQFWDSYL 555
Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS---------STMDMAIIREVFSAIISA 557
Y + N TSP H + +A S+S +T D ++I E+++ I A
Sbjct: 556 WTSE--YQKINDETSPYH---GENRLVAAPSFSEEQGPTAIGTTYDQSLIWELYNECIQA 610
Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+++ ++E A+++ + + +L P +I I EW +
Sbjct: 611 GKIVGEDE-AVLQSWEEKMQKLDPIEINATNGIKEWYE 647
>gi|306830121|ref|ZP_07463305.1| possible alpha-L-fucosidase [Streptococcus mitis ATCC 6249]
gi|304427647|gb|EFM30743.1| possible alpha-L-fucosidase [Streptococcus mitis ATCC 6249]
Length = 1685
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 179/610 (29%), Positives = 295/610 (48%), Gaps = 86/610 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 199 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 259 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGDYSWE 318
Query: 184 -HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+Y NG+ G I K D+ G++F++ L IK GT++ ++++ L
Sbjct: 319 YSNYKNGHVTTDEHG------ILLKGTVKDN--GLKFASYLGIKTD---GTVT-VQNETL 366
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
V G+ +A L L A ++F NP ++ +KD E +++ + Y L H+
Sbjct: 367 TVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKKDHIK 423
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + L N T + E ++S+ + L EL FQ+GRY
Sbjct: 424 DYQSLFNRVKLNLGG-------------NKTTQTTKEALQSYNPSKGQKLEELFFQYGRY 470
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E +P+ +
Sbjct: 471 LLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETAKPMIN 530
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 531 YIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 585
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHE 525
AW+ +++++Y +T D +L+++ YP+L+ A F +L + ++PS SPEH
Sbjct: 586 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKESDRWVSSPSYSPEH- 644
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
++ +T D +++ ++F + A L+ ++D LV +V +L+P I
Sbjct: 645 --------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLKPLHIN 695
Query: 586 EDGSIMEWVQ 595
+G I EW +
Sbjct: 696 NEGRIKEWYE 705
>gi|187735615|ref|YP_001877727.1| hypothetical protein Amuc_1120 [Akkermansia muciniphila ATCC
BAA-835]
gi|187425667|gb|ACD04946.1| conserved hypothetical protein [Akkermansia muciniphila ATCC
BAA-835]
Length = 796
Score = 226 bits (576), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 182/608 (29%), Positives = 275/608 (45%), Gaps = 100/608 (16%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
+ PIGNGR+GAM++ E L LNE +LW+ G Y
Sbjct: 65 AEGYPIGNGRVGAMIFSAPGRERLALNEISLWS------------------GGANPGGGY 106
Query: 85 AEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDLNTATARVKYSVG 142
A FG+ Y GD+ ++F D + E + R LDL +V Y
Sbjct: 107 GYGPDAGTNQFGN----YLPFGDLFVDFKKGDQPASLSVEDFTRSLDLRDGIHKVNYKAD 162
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGK 202
V + RE FSS P V+V S+ G S + S++S L G+ I +G
Sbjct: 163 GVTYDREAFSSTPANVLVLNYKASKPGQFSADFSVNSQLGADISAKGS-VITWKGMLK-- 219
Query: 203 RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
G+ + + I GT+SA DK + V+ +D ++++ + +
Sbjct: 220 ------------NGMNYEG--RVLIRPKGGTLSASGDK-ISVKNADSCMVVIAMETDY-- 262
Query: 263 PFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
D KKD ES S + Y+ L H+ Y+ +F RV + ++
Sbjct: 263 ----LMDYKKDWKGESPSRKLDRYAAKAASADYAALKQAHISQYKSMFDRVKVNFGKT-- 316
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQG 376
EE++ +P+ +R+++++ + DP L E +FQFGRYLL+SSSRPGT ANLQG
Sbjct: 317 --------EEDVAKLPTPKRLEAYKKNPADPDLEETMFQFGRYLLLSSSRPGTLPANLQG 368
Query: 377 IWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN--- 433
+WN+ + P W H NIN++M YW + P NLSEC E L +++ ++ +Q N
Sbjct: 369 LWNDYVKPPWACDYHNNINVQMAYWGAEPANLSECHEALVNYVEAMAPGCRDASQANKGF 428
Query: 434 -----YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
GW + +I+ + W G AW H+WEHY +T DR +LE
Sbjct: 429 NTKDGKPVRGWTVRTSQNIFGGNG-------WQWNIPGAAWYALHIWEHYAFTGDRKYLE 481
Query: 489 KRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHE-----------FIAPDG--- 531
K+AYPL++ F D L E G +G+ +TN E E +AP+G
Sbjct: 482 KQAYPLMKEICHFWEDHLKELGAGGEGF-KTNGKDPSEEEKKDLADVKAGTLVAPNGWSP 540
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSI 590
+ D +I E+FS I AA +L K DA K L+ L RL KI ++G++
Sbjct: 541 EHGPREDGVMHDQQLIAELFSNTIKAARILGK--DAAWAKSLEGKLKRLAGNKIGKEGNL 598
Query: 591 MEWVQRRL 598
EW+ R+
Sbjct: 599 QEWMIDRI 606
>gi|419781688|ref|ZP_14307504.1| gram positive anchor [Streptococcus oralis SK610]
gi|383183996|gb|EIC76526.1| gram positive anchor [Streptococcus oralis SK610]
Length = 1707
Score = 226 bits (576), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 179/616 (29%), Positives = 299/616 (48%), Gaps = 96/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++ + ++ L EL F
Sbjct: 420 NAHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPST 520
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKESDRWVSSPSY 641
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L+
Sbjct: 642 SPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLK 691
Query: 581 PTKIAEDGSIMEWVQR 596
P I ++G I EW +
Sbjct: 692 PLHINKEGRIKEWYEE 707
>gi|153815077|ref|ZP_01967745.1| hypothetical protein RUMTOR_01294 [Ruminococcus torques ATCC 27756]
gi|145847645|gb|EDK24563.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
Length = 1812
Score = 226 bits (576), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 177/638 (27%), Positives = 300/638 (47%), Gaps = 92/638 (14%)
Query: 13 LKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-- 60
LK+ + PAK T ++P+GNG LG +++GG+ E + NE TLWTG P
Sbjct: 57 LKLWYGSPAKINTAESSGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSS 116
Query: 61 -------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKLFGHP--- 98
G+ + + R L+D G Y A ++ G
Sbjct: 117 SRPNYQFGNKATAYTATEIENYRKLLDDKSSNVFNDDQSLGGYG--MGAKIRFPGEDNLN 174
Query: 99 ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
YQ GDI L+F + + YRREL+L T A ++S NV + REHF S+PDQ
Sbjct: 175 KGSYQDFGDIWLDFSAMGITDDNVQNYRRELNLQTGIASTEFSYKNVSYKREHFVSSPDQ 234
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
V+VT +S SE G L+F+ ++ L+N N ++ + R I K ND +
Sbjct: 235 VMVTNLSASEKGKLNFSAKME--LNND---NLEGKLTFDVRNQTCTIEGKVKDND----L 285
Query: 218 QFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
+F +++ ++ G I+A E ++ +++ +D +++ A + + + D +K+ ++
Sbjct: 286 KFRTTMKLLLTG--GEITADEKNQVYRIKNADQVTIIMAAETDYKNDYPTYRDKEKNLSN 343
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
+ + SY +L H++D+Q LF RVS+ L + TD ID +
Sbjct: 344 VIDTRINDSSKKSYDELKQTHIEDHQSLFDRVSLDLGEFQTSVPTDQL----IDEYRNGS 399
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPT-WDSAPHVNIN 395
+T L FQ+GRYL I+ SR GT +NL G+W + P+ W H N+N
Sbjct: 400 YSHYLET--------LAFQYGRYLTIAGSR-GTLPSNLVGLWT--VGPSAWTGDYHFNVN 448
Query: 396 LEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLASGWVIHHKTD 446
++MNYW NL+EC D+ LT ++G K A N+ +G+ +H + +
Sbjct: 449 VQMNYWPVYSTNLAECGTTFVDYMDKLREPGRLTAERVHGIKGAVDNH--TGFTVHTENN 506
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ + + P G AW +LW HY +T D +L+ YP+++ A F +L
Sbjct: 507 PFGMTAPTNAQ-EYGWNPTGAAWAVQNLWWHYEFTQDEAYLKNTIYPIMKEAAQFWDSYL 565
Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS---------STMDMAIIREVFSAIISA 557
Y + N TSP H + +A S+S +T D ++I E+++ I A
Sbjct: 566 WTSE--YQKINDETSPYH---GENRLVAAPSFSEEQGPTAIGTTYDQSLIWELYNECIQA 620
Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+++ ++E A+++ + + +L P +I I EW +
Sbjct: 621 GKIVGEDE-AVLQSWEEKMQKLDPIEINATNGIKEWYE 657
>gi|154505582|ref|ZP_02042320.1| hypothetical protein RUMGNA_03121 [Ruminococcus gnavus ATCC 29149]
gi|153794240|gb|EDN76660.1| hypothetical protein RUMGNA_03121, partial [Ruminococcus gnavus
ATCC 29149]
Length = 1873
Score = 226 bits (576), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 170/612 (27%), Positives = 292/612 (47%), Gaps = 78/612 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
++P+GNG LG +++GG+ E + NE TLWTG P G+ + + + R
Sbjct: 4 SLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSPSRPGYQFGNKATAYTDEEIENYRK 63
Query: 78 LVDS------------GQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAE- 121
L+D G Y A +K G YQ GDI L+F L+
Sbjct: 64 LLDDKSTKVFNDDQSLGGYG--MGAQIKFPGENNLNKGSYQDFGDIWLDFSKMGLQDQNV 121
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD--- 178
+ YRRELDL T A ++S +V + REHF SNPDQ++VTK+S SESG L +V ++
Sbjct: 122 KNYRRELDLQTGVASTEFSYEDVNYKREHFVSNPDQIMVTKLSASESGKLDLSVKMELNN 181
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ L+ + + NQ C I K ND ++F +++ + + G + E
Sbjct: 182 NGLEGKTTFDPENQT-----CT---IEGKVKDND----LKFYTTMKLVL--EGGDLEVDE 227
Query: 239 DKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
++ ++E ++ ++++ A + + + D +K+ + S SY L +H
Sbjct: 228 KNQVYQIEDANQVMIVMAAETDYKNDYPTYRDKEKNLKKMVDDRVNSNAKKSYQKLKEKH 287
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQF 356
+ D+QKLF RVS+ L +I P+ + V ++ +E+L FQ+
Sbjct: 288 IADHQKLFDRVSLDLGEQRTNI-------------PTNQLVDEYRNGTYSHYLEVLAFQY 334
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYL I+ SR GT +NL G+W S W H N+N++MNYW NL+EC
Sbjct: 335 GRYLTIAGSR-GTLPSNLVGLWTVGDSA-WTGDYHFNVNVQMNYWPVYTTNLAECGVTFV 392
Query: 417 DF---------LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
D+ LT ++G + A N+ +G+ +H + + + ++ + + P G
Sbjct: 393 DYMDKLREPGRLTAERVHGIEGAVENH--TGFTVHTENNPFGMTAPTNAQ-EYGWNPTGA 449
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD--WLIEGHDGYLETNPSTSPEHE 525
AW +LW HY +T + D+L+ YP+++ A F W E E++P +
Sbjct: 450 AWAIQNLWWHYEFTQNEDYLKNTIYPIMKEAAQFWDSYLWTSEYQKINDESSPYNGQDRL 509
Query: 526 FIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
+AP + + +T D +++ E++ I A +++ ++E AL++ +++ +L P +
Sbjct: 510 VVAPSFSEEQGPTAIGTTYDQSLVWELYKECIQAGKIVGEDE-ALLKSWEENMQKLDPIE 568
Query: 584 IAEDGSIMEWVQ 595
I E I EW +
Sbjct: 569 INETNGIKEWYE 580
>gi|418098974|ref|ZP_12736071.1| fibronectin type III domain protein [Streptococcus pneumoniae
6901-05]
gi|353768956|gb|EHD49478.1| fibronectin type III domain protein [Streptococcus pneumoniae
6901-05]
Length = 795
Score = 226 bits (576), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 279/599 (46%), Gaps = 70/599 (11%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P +Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGIYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D ++++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSD-RVQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN D H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 418
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 419 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 475
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 476 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 527 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 584
>gi|415750047|ref|ZP_11477991.1| fibronectin type III domain protein [Streptococcus pneumoniae SV35]
gi|381318341|gb|EIC59066.1| fibronectin type III domain protein [Streptococcus pneumoniae SV35]
Length = 803
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 173/599 (28%), Positives = 281/599 (46%), Gaps = 62/599 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+ IGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALLIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQD 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 592
>gi|331265740|ref|YP_004325370.1| LPXTG cell surface protein, calx-beta domain-containing protein
[Streptococcus oralis Uo5]
gi|326682412|emb|CBZ00029.1| LPXTG cell surface protein, calx-beta domain protein [Streptococcus
oralis Uo5]
Length = 1707
Score = 226 bits (575), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 181/617 (29%), Positives = 300/617 (48%), Gaps = 98/617 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G+QF++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS-KKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E ++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKNNYRKDIDLEKTVKGIVEVAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + T + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQR 596
+P I +G I EW +
Sbjct: 691 KPLHINNEGRIKEWYEE 707
>gi|417794521|ref|ZP_12441772.1| gram positive anchor [Streptococcus oralis SK255]
gi|334269196|gb|EGL87623.1| gram positive anchor [Streptococcus oralis SK255]
Length = 1707
Score = 226 bits (575), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 179/614 (29%), Positives = 300/614 (48%), Gaps = 94/614 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSLV 79
A+P+GNG +GA V+G + E ++ NE TLW+G P DY D K L+++R +
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSSDYNGGNYKDRYKVLAEIRKAL 201
Query: 80 DSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNTAT 134
+ G +A + + P + Y GDI + F++ T Y R LD+ AT
Sbjct: 202 EDGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYYRGLDITEAT 261
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN-------H 184
Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 262 TTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWEYS 321
Query: 185 SYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+Y NG+ N I+++G K N G++F++ L IK GT++ +++
Sbjct: 322 NYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIKTD---GTVT-VQN 364
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTR 296
+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 365 ETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKKA 421
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ DYQ LF+RV + L N + E ++ + ++ L EL FQ+
Sbjct: 422 HIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFFQY 468
Query: 357 GRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E +P
Sbjct: 469 GRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMNNLAETAKP 528
Query: 415 LFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 529 MINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWS 583
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTS 521
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS S
Sbjct: 584 PAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYS 642
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH ++ +T D +++ ++F + A L+ ++D LV +V +L+P
Sbjct: 643 PEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLKP 692
Query: 582 TKIAEDGSIMEWVQ 595
I ++G I EW +
Sbjct: 693 LHINKEGRIKEWYE 706
>gi|385261489|ref|ZP_10039611.1| Gram-positive signal peptide protein, YSIRK family, partial
[Streptococcus sp. SK643]
gi|385193017|gb|EIF40405.1| Gram-positive signal peptide protein, YSIRK family, partial
[Streptococcus sp. SK643]
Length = 1474
Score = 225 bits (574), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 181/615 (29%), Positives = 293/615 (47%), Gaps = 96/615 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 152 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPRPDSTDYNGGNYQ--ERYKVLAEIRK 209
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 210 ALEEGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLENVTDYHRGLDITE 269
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV--SL-DSLLDNHSY--- 186
AT Y+ F RE FSS PD V VT ++ L F V SL + LL N +Y
Sbjct: 270 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTQKGDKKLDFTVWNSLTEDLLANGNYSAE 329
Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
N I+++G K N G++F++ L IK G ++
Sbjct: 330 YSHYKSGHVTTDPNGILLKGTV-------KDN------GLRFASYLGIKTD---GKVTVH 373
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
ED L V G+ +A LLL + ++F NP ++ +KD E +++ R Y L
Sbjct: 374 EDS-LTVTGASYATLLLSSKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAARGKDYETLK 429
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++++ + L EL F
Sbjct: 430 KNHIKDYQSLFNRVKLNLGGSNTAQTT-------------KEALQTYNPTKGQKLEELFF 476
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 477 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 536
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 537 KPMINYIDDMRYYGRIAAKEYAGIKSKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 591
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPST 520
P AW+ +++++Y +T D +L+++ YP+L+ A F +L D ++PS
Sbjct: 592 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKDSDRWVSSPSY 651
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L+
Sbjct: 652 SPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLK 701
Query: 581 PTKIAEDGSIMEWVQ 595
P I ++G I EW +
Sbjct: 702 PLHINKEGRIKEWYE 716
>gi|418134701|ref|ZP_12771558.1| hypothetical protein SPAR23_0971 [Streptococcus pneumoniae GA11426]
gi|419535112|ref|ZP_14074611.1| hypothetical protein SPAR46_1654 [Streptococcus pneumoniae GA17457]
gi|353901938|gb|EHE77468.1| hypothetical protein SPAR23_0971 [Streptococcus pneumoniae GA11426]
gi|379563273|gb|EHZ28277.1| hypothetical protein SPAR46_1654 [Streptococcus pneumoniae GA17457]
Length = 770
Score = 225 bits (573), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 278/599 (46%), Gaps = 70/599 (11%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D ++++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSD-RVQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN D H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 418
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 419 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 475
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 476 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 527 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 584
>gi|331092304|ref|ZP_08341132.1| hypothetical protein HMPREF9477_01775 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401736|gb|EGG81315.1| hypothetical protein HMPREF9477_01775 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1730
Score = 225 bits (573), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 169/596 (28%), Positives = 273/596 (45%), Gaps = 62/596 (10%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS- 77
+PIGN +GA V+G + E L N+ TLW G P G+ D K +SDV
Sbjct: 76 LPIGNSFMGANVYGEIGKERLTFNQKTLWNGGPSTSRPNYKGGNKDTADNGKKMSDVYKE 135
Query: 78 ---LVDSGQYAEATAASVKLFGHPAD--VYQLLGDI--ELEFDDSHLKYAEETYRRELDL 130
L G+ A+A + KL G A YQ GDI + +FD+S K Y R+L++
Sbjct: 136 IIELYKKGEDAKANELAKKLTGEVAGYGAYQSWGDIYVDFKFDESQAK----NYVRDLNM 191
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V + N + RE+F S PD V+ K + + L+ ++S +DN V G
Sbjct: 192 ENAVASVDFDYKNTKMHREYFVSYPDNVLAMKFTADGNEKLNLDISFP--IDNAEGVTG- 248
Query: 191 NQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ GK + N + + Q ++K+ + GT+ A + KL V
Sbjct: 249 -------KKLGKNVQTTVKDNTITVAGEMQDNQLKLNGKLKVETENGTVEAKDGDKLHVA 301
Query: 246 GSDWAVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ + + A + + D P ++K+ + Y + H+ DY +
Sbjct: 302 NASEVTVYVSADTDYKNDYPKYRTGETKEQLNDSVQKTIDKASKKGYEKVKEDHIADYTE 361
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
+F RV + L +S + T T D + + + K ED +L +LFQ+GRYL I+
Sbjct: 362 IFDRVDLDLGQS---VPTKTT-----DVLLNDYKAKKNTAAEDRALEVMLFQYGRYLTIA 413
Query: 364 SSRPGTQVANLQGIWNEDLSPT----WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
SSR G +NLQG+W + W S H+N+NL+MNYW + N++EC PL D++
Sbjct: 414 SSRAGDLPSNLQGVWQNRVGDHNRVPWASDYHMNVNLQMNYWPTYSTNMAECATPLVDYI 473
Query: 420 TYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L G TA+ + + +G H + + W P W+ + WE+Y
Sbjct: 474 NSLVEPGKVTAKTYFGVENGGFTAHTQNTPFGWTCPGWNFSWGWSPAALPWILQNCWEYY 533
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
YT D ++E+ YP+L+ A LIE G L + P+ SPEH V+
Sbjct: 534 EYTGDVKYMEEHIYPMLKEAALLYDQILIEDTKTGRLVSAPAYSPEH---------GPVT 584
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+T + ++I +++ +AAE+L ++D + + +L+P +I + G I EW
Sbjct: 585 AGNTYEQSLIWQLYEDAATAAEILNVDKDKAAQ-WRERQAKLKPIEIGDSGQIKEW 639
>gi|168493554|ref|ZP_02717697.1| alpha-fucosidase [Streptococcus pneumoniae CDC3059-06]
gi|418074476|ref|ZP_12711729.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11184]
gi|418087331|ref|ZP_12724500.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47033]
gi|418090009|ref|ZP_12727163.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43265]
gi|418105755|ref|ZP_12742811.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44500]
gi|418115170|ref|ZP_12752156.1| fibronectin type III domain protein [Streptococcus pneumoniae
5787-06]
gi|418117327|ref|ZP_12754296.1| fibronectin type III domain protein [Streptococcus pneumoniae
6963-05]
gi|418217095|ref|ZP_12843775.1| fibronectin type III domain protein [Streptococcus pneumoniae
Netherlands15B-37]
gi|419432029|ref|ZP_13972162.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP05]
gi|419433932|ref|ZP_13974050.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40183]
gi|419440837|ref|ZP_13980882.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40410]
gi|419464830|ref|ZP_14004721.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA04175]
gi|419469454|ref|ZP_14009322.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA06083]
gi|419498019|ref|ZP_14037726.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47522]
gi|421281641|ref|ZP_15732438.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA04672]
gi|183576395|gb|EDT96923.1| alpha-fucosidase [Streptococcus pneumoniae CDC3059-06]
gi|353748545|gb|EHD29197.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11184]
gi|353758347|gb|EHD38939.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47033]
gi|353761200|gb|EHD41772.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43265]
gi|353775931|gb|EHD56410.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44500]
gi|353785254|gb|EHD65673.1| fibronectin type III domain protein [Streptococcus pneumoniae
5787-06]
gi|353788008|gb|EHD68406.1| fibronectin type III domain protein [Streptococcus pneumoniae
6963-05]
gi|353870368|gb|EHE50241.1| fibronectin type III domain protein [Streptococcus pneumoniae
Netherlands15B-37]
gi|379536430|gb|EHZ01616.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA04175]
gi|379544258|gb|EHZ09403.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA06083]
gi|379576933|gb|EHZ41857.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40183]
gi|379577907|gb|EHZ42824.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40410]
gi|379598852|gb|EHZ63637.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47522]
gi|379629110|gb|EHZ93711.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP05]
gi|395880906|gb|EJG91957.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA04672]
Length = 795
Score = 224 bits (572), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 278/599 (46%), Gaps = 70/599 (11%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D ++++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSD-RVQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN D H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 418
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 419 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 475
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 476 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 527 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 584
>gi|358463765|ref|ZP_09173746.1| signal peptide protein, YSIRK family [Streptococcus sp. oral taxon
058 str. F0407]
gi|357067821|gb|EHI77905.1| signal peptide protein, YSIRK family [Streptococcus sp. oral taxon
058 str. F0407]
Length = 1707
Score = 224 bits (572), Expect = 8e-56, Method: Compositional matrix adjust.
Identities = 180/616 (29%), Positives = 299/616 (48%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRSLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDENGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+++ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QNETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++ + + L EL F
Sbjct: 420 KDHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQGYNPSKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 691 KPLHINKEGRIKEWYE 706
>gi|418174043|ref|ZP_12810655.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41277]
gi|353837999|gb|EHE18080.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41277]
Length = 774
Score = 224 bits (572), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 174/599 (29%), Positives = 278/599 (46%), Gaps = 70/599 (11%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D ++++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSD-RVQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN D H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 397
Query: 428 KTAQVNYLA--------SGWVIHHKTD--IWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 398 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 454
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 455 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 505
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 506 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 563
>gi|414159134|ref|ZP_11415425.1| YSIRK family Gram-positive signal peptide [Streptococcus sp. F0441]
gi|410868266|gb|EKS16233.1| YSIRK family Gram-positive signal peptide [Streptococcus sp. F0441]
Length = 1707
Score = 224 bits (572), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 178/612 (29%), Positives = 297/612 (48%), Gaps = 88/612 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEGGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF----NVSLDSLLDN----- 183
AT Y+ F RE FSS PD V VT ++ + +L F N++ D L +
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNNLTEDLLANGDYSWE 319
Query: 184 -HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+Y NG+ G I K D+ G++F++ L IK GT++ ++++ L
Sbjct: 320 YSNYKNGHVTTDEHG------ILLKGTVKDN--GLKFASYLGIKTD---GTVT-VQNETL 367
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
V G+ +A L L A ++F NP ++ +KD E +++ + Y L H+
Sbjct: 368 TVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKQDHIK 424
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + L S T E ++S+ + L EL FQ+GRY
Sbjct: 425 DYQSLFNRVKLNLGGSKTAQTT-------------KEALQSYNPSKGQKLEELFFQYGRY 471
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E +P+ +
Sbjct: 472 LLISSSRDKTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETAKPMIN 531
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 532 YIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 586
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEH 524
AW+ +++++Y +T D +L+++ YP+L+ F +L + D ++ ++PS SPEH
Sbjct: 587 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETTKFWNSFLHYDQASDRWV-SSPSYSPEH 645
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F + A L+ ++D LV +V +L+P I
Sbjct: 646 ---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLKPLHI 695
Query: 585 AEDGSIMEWVQR 596
+G I EW +
Sbjct: 696 NNEGRIKEWYEE 707
>gi|269955992|ref|YP_003325781.1| hypothetical protein Xcel_1192 [Xylanimonas cellulosilytica DSM
15894]
gi|269304673|gb|ACZ30223.1| conserved hypothetical protein [Xylanimonas cellulosilytica DSM
15894]
Length = 837
Score = 224 bits (572), Expect = 9e-56, Method: Compositional matrix adjust.
Identities = 185/641 (28%), Positives = 281/641 (43%), Gaps = 84/641 (13%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNPD 67
+ ++ PA + +A+P+GNG AM G E L LN+ W+G G D P
Sbjct: 4 LRYDSPATCWDEALPVGNGVRAAMCEGRAGGERLWLNDLRAWSGPVGAGPRGDVDAPVPA 63
Query: 68 A-----------------------PKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQL 104
A P+ L+ VR+ +D G A + Y
Sbjct: 64 AQDSASQDPAAEDPAAASRRAAAGPEHLAAVRAAIDDGDVRTAERLLQESQSPWVQAYLP 123
Query: 105 LGDIELEFD--DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTK 162
LG++E+ L + R LDL TA A Y++G E ++ +V
Sbjct: 124 LGELEVTVTAVGDELAAPGGAHARSLDLRTAVAAHSYALGAARVRHETWADAAGGALVHV 183
Query: 163 ISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGR-------CPGKR-------IPP-- 206
++ + SLL S P R +PP
Sbjct: 184 VTADRP--VRLTARFTSLLRAESDAGAVPVAAAAPDAAAPGVDAPAPRDVLLHRLVPPVD 241
Query: 207 -KANANDDPKGIQFSA-----ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
P+ +++ ++ ++ + D + +ED +L+ G+ A LLL+ +++
Sbjct: 242 VAPGHESAPEPVRYGPTTARLVVAVRAAGDPDAV--VEDGELRT-GAATAHLLLIGTATT 298
Query: 261 DGPFINPSDSKKDPTSESMSALQSIRNLS-YSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
P + ++ PT +AL + S H ++ L+ RV + L
Sbjct: 299 HDPA---AGTQATPTEAVAAALALVTGPEPASPRRAAHEAAHRALYDRVELTLP------ 349
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
S DT+P+ R+ + +DP L L F +GRYLL++SSRPG A LQGIWN
Sbjct: 350 -----SSSGADTLPTDARIAAAADVDDPGLTALAFHYGRYLLLASSRPGGLPATLQGIWN 404
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS-INGSKTAQVNYLASG 438
L W SA NINL+M YW + L EC EPL F+ L+ G + A+ Y A G
Sbjct: 405 PLLPGPWSSAYTTNINLQMAYWPAETTALPECHEPLLAFVERLATTTGPEAARRLYGARG 464
Query: 439 WVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
WV HH +D W + A G WA W +GG WL HLWE + + D FL +RA+P+L
Sbjct: 465 WVAHHNSDAWGHADPVGAGHGDPAWASWALGGVWLAHHLWERWLFGGDATFLRERAWPVL 524
Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
G F LDW ++ T+PSTSPE+ ++APDG+ V S+TMD ++R + +A
Sbjct: 525 RGAGLFALDW-VQSDGTRAWTSPSTSPENHYVAPDGRPTGVGTSATMDGELLRWLAAACR 583
Query: 556 SAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWV 594
+AA+ L +ED L + KV LP ++ G ++EW
Sbjct: 584 AAADALGVSEDWLDDLAKVTALLPA---PEVGPRGELLEWA 621
>gi|419779913|ref|ZP_14305766.1| gram positive anchor [Streptococcus oralis SK100]
gi|383185738|gb|EIC78231.1| gram positive anchor [Streptococcus oralis SK100]
Length = 1707
Score = 224 bits (571), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 180/616 (29%), Positives = 298/616 (48%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y N K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYKN--RYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++ ++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKLASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDKD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I +G I EW +
Sbjct: 691 KPLHINNEGRIKEWYE 706
>gi|417934856|ref|ZP_12578176.1| gram positive anchor [Streptococcus mitis bv. 2 str. F0392]
gi|340771426|gb|EGR93941.1| gram positive anchor [Streptococcus mitis bv. 2 str. F0392]
Length = 1668
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 178/616 (28%), Positives = 300/616 (48%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 103 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 160
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 161 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 220
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 221 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 280
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 281 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 323
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 324 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 380
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L N + E ++ + ++ L EL F
Sbjct: 381 KDHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 427
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 428 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 487
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 488 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 542
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ F +L + D ++ ++PS
Sbjct: 543 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETTKFWNSFLHYDQASDRWV-SSPS 601
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 602 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 651
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 652 KPLHINKEGRIKEWYE 667
>gi|417915380|ref|ZP_12558993.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
mitis bv. 2 str. SK95]
gi|342834366|gb|EGU68637.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
mitis bv. 2 str. SK95]
Length = 1686
Score = 223 bits (568), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 177/616 (28%), Positives = 300/616 (48%), Gaps = 98/616 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 199 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRSLDITE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 259 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 318
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 319 YSNYKNGHVTTDENGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 361
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+++ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 362 QNETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYKTLK 418
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L N + E ++ + ++ L EL F
Sbjct: 419 KAHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 465
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 466 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 525
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 526 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 580
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 581 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 639
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV ++ +L
Sbjct: 640 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDKD-LVTEIKAKFDKL 689
Query: 580 RPTKIAEDGSIMEWVQ 595
+P I ++G I EW +
Sbjct: 690 KPLHINKEGRIKEWYE 705
>gi|298351514|sp|A2R797.1|AFCA_ASPNC RecName: Full=Probable alpha-fucosidase A; AltName:
Full=Alpha-L-fucoside fucohydrolase A; Flags: Precursor
gi|134083134|emb|CAK48586.1| unnamed protein product [Aspergillus niger]
Length = 793
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 175/593 (29%), Positives = 273/593 (46%), Gaps = 63/593 (10%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPKA--LSDVRS 77
T A P+GNGRLGAM G E + LN D+LW G P + Y+ NP+ KA L +R
Sbjct: 36 TTAFPLGNGRLGAMPIGSYDKEIVNLNVDSLWRGGPFESPTYSGGNPNVSKAGALPGIRE 95
Query: 78 LVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDD-SHLKYAEETYRRELDLNTAT 134
+ + T L G +P YQ+L ++ ++ + S + + YRR LDL++A
Sbjct: 96 WI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGELSDI----DGYRRNLDLDSAV 147
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV--SLDSLLDNHSYVNGNN 191
+S G RE F S PD V V ++S + S ++F + L S N S +GN+
Sbjct: 148 YSDHFSTGETYIEREAFCSYPDNVCVYRLSSNSSLPEITFGLENQLTSPAPNVS-CHGNS 206
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV-EGSDWA 250
+ G+ P G+ ++A + + + T +KV EG
Sbjct: 207 ISLY-----GQTYPVI--------GMIYNARVTVVVPGSSNTTDLCSSSTVKVPEGEKEV 253
Query: 251 VLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
L+ A ++++ N S ++P + + + SYS L + H+ DYQ +F+
Sbjct: 254 FLVFAADTNYEASNGNSKASFSFKGENPYMKVLQTATNAAKKSYSALKSSHVKDYQGVFN 313
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
+ ++ L P+ E + S+ DP + LLF +GRYL ISSSR
Sbjct: 314 KFTLTLP-----------DPNGSADRPTTELLSSYSQPGDPYVENLLFDYGRYLFISSSR 362
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-N 425
PG+ NLQG+W E SP W H NINL+MN+W L E EPL+ ++ +
Sbjct: 363 PGSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVDQTGLGELTEPLWTYMAETWMPR 422
Query: 426 GSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TA++ Y S GWV H + + + +A + WA +P AW+ H+W+H++Y+ D
Sbjct: 423 GAETAELLYGTSEGWVTHDEMNTFGH-TAMKDVAQWADYPATNAWMSHHVWDHFDYSQDS 481
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
+ + YP+L+G A F L L++ DG L NP SPEH C Y
Sbjct: 482 AWYRETGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPEHGPTLTPQTFGCTHYQQ- 540
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
+I E+F ++ ++ + + L P I G I EW
Sbjct: 541 ----LIWELFDHVLQGWTASGDDDTSFKNAITSKFSTLDPGIHIGSWGQIQEW 589
>gi|429766026|ref|ZP_19298301.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
gi|429185266|gb|EKY26251.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
Length = 1927
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 170/611 (27%), Positives = 295/611 (48%), Gaps = 85/611 (13%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----------YTNPDAP---KALS 73
+PIGNG +G V+G + E + NE TLWTG P D Y N + L
Sbjct: 70 LPIGNGDIGGNVYGEIVHERITFNEKTLWTGGPSDKRPNYNGGNKEYANDGITPMYEILQ 129
Query: 74 DVRS----LVDSGQYAEATAASV--KLFG--HPADVYQLLGDIELEF---DDSHLKYAEE 122
VR D G +ATA+S+ +L G YQ G+I L+F D++++
Sbjct: 130 QVRENFALHTDEG---DATASSLCNQLVGISDGYGAYQAWGEINLDFIGIDENNVT---- 182
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y R+L+L A + V Y+ G+ E+ RE+F S+PD V+V ++ + L+F+VS S
Sbjct: 183 DYVRDLNLRNAISSVNYTYGDTEYIRENFVSHPDDVMVIRVEANGENKLNFDVSFPSKQG 242
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V N+ I +EG ++ K N+ ++KI D G ++ DK L
Sbjct: 243 ATTIVE-NDTITLEGEVSDNQL--KYNS-------------QLKIVSDDGEVTEGTDK-L 285
Query: 243 KVEGSDWAVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
VE + A + + A++ + D P ++ ++ + ++++ SY ++ H+ D
Sbjct: 286 TVENATSATIYISAATDYKNDYPEYRTGETAEELDARVGDVIEALDGKSYEEVKADHIAD 345
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ +F RV + L ++ +I TD + S E ++ + + FQ+GRYL
Sbjct: 346 YKSIFDRVDLDLGQALPNIPTDELLSGYGNNTVSEEARRALEV--------MFFQYGRYL 397
Query: 361 LISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
I+SSR +Q+ +NLQG+WN +P W S H+N+NL+MNYW + N++EC PL +++
Sbjct: 398 TIASSREDSQLPSNLQGVWNNKNNPAWSSDYHMNVNLQMNYWPTYSTNMAECATPLVEYI 457
Query: 420 TYLSINGSKTAQV------------NYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
L G +TA++ Y+ + + H + + W P
Sbjct: 458 DSLREPGRETARIYAGVESAKDENGEYIEANGFMAHTQNTPFGWTCPGWSFDWGWSPAAV 517
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
W+ ++WE Y YT D +++ YP+++ + + L+ + + ++P+ SPEH
Sbjct: 518 PWILQNVWEMYEYTGDVEYMRDVIYPMMKEEVNLYENMLVWDEVQQRMVSSPTYSPEH-- 575
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIA 585
+ +T + +I +++ I+AAE L + D +VE K +S +L P +I
Sbjct: 576 -------GPRTVGNTYEQTLIWQLYEDTITAAETLGVDADLVVEWKDTQS--KLDPIQIG 626
Query: 586 EDGSIMEWVQR 596
+DG I EW +
Sbjct: 627 DDGQIKEWFEE 637
>gi|242815430|ref|XP_002486567.1| alpha-L-fucosidase 2 precursor, putative [Talaromyces stipitatus
ATCC 10500]
gi|218714906|gb|EED14329.1| alpha-L-fucosidase 2 precursor, putative [Talaromyces stipitatus
ATCC 10500]
Length = 773
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 169/600 (28%), Positives = 283/600 (47%), Gaps = 58/600 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDAPKA 71
K+ ++ PA+ + D +PIGNG +GA++ SE N + W+G +A
Sbjct: 5 KLWYDQPAQKWQDGLPIGNGHMGAVIISQPSSEIWSFNNISFWSGRSESTPVIEYGGREA 64
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEETYRREL 128
L +R + Y + K Y ++ I L + + + +RREL
Sbjct: 65 LDKIRKEYFADNYEHGKRLTEKYLQPEKGNYGTNLMVARIYLALEHGGEEPSFTDFRREL 124
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
+L+ A R +Y +V F RE F+S P QV++ ++ ++ + + + S +
Sbjct: 125 NLDEAIVRTEYKSKSVLFRREVFASYPHQVLMARLRTECLEGMNLKLGVSGVTKEFSISD 184
Query: 189 GNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
G ++ E + + I +GI ++ G++ + D +L+V+
Sbjct: 185 GETTDCLVFETQAV-EEIHSNGTCGVRGRGI-------VQAHTVGGSVHIV-DGELRVKN 235
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ ++ + SF F + +D D + L ++ + SY +L H+ DYQ L+
Sbjct: 236 ASEVIIKV----SFQTDFRSLND---DWKLRVQTLLDNVWDTSYEELRALHVRDYQSLYR 288
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISS 364
RV I L + P +R SFQ DPSL YL IS
Sbjct: 289 RVHIDLGHTEDS------------NFPLNKRKASFQKSGYNDPSL---------YLTISG 327
Query: 365 SRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
+R + + +LQGIWN E + W H++IN +MNY+ + NL + Q PL + Y
Sbjct: 328 TRATSPLPLHLQGIWNDGEANAMNWSCDYHLDINTQMNYFPTETTNLGDLQGPLMRYCEY 387
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTHLWEHYNY 480
L+ +G K+A+ Y A GWV H +++W + D G + W L GG W+ TH+ EHY Y
Sbjct: 388 LASSGKKSARNFYGAGGWVAHVFSNVWGYT--DPGWETSWGLNITGGLWMATHMIEHYEY 445
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFI----APDGKLAC 535
++DR+FL +AYP+L A F LD++ I+ GYL T PS SPE+ F +P K
Sbjct: 446 SLDRNFLTTQAYPVLREAAEFFLDYMTIDPRTGYLVTGPSNSPENSFYPSTQSPREKQE- 504
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+S T+D+ ++R++F I + + L NE +V ++L +L P +I + G + EW +
Sbjct: 505 LSLGPTIDITLVRDLFKFCIFSVDELGLNESEFAARVHEALAKLPPFRIGKRGQLQEWFE 564
>gi|336433106|ref|ZP_08612933.1| hypothetical protein HMPREF0991_02052, partial [Lachnospiraceae
bacterium 2_1_58FAA]
gi|336017272|gb|EGN47037.1| hypothetical protein HMPREF0991_02052 [Lachnospiraceae bacterium
2_1_58FAA]
Length = 1786
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 190/649 (29%), Positives = 303/649 (46%), Gaps = 87/649 (13%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTL 55
+++AE + + LK+ + A D ++PIGN +GA V+GGV +E ++LNE +L
Sbjct: 32 VVHAEESQDRSELKLRYTSAAPDSYDGWEKWSLPIGNSGIGASVFGGVQTERIQLNEKSL 91
Query: 56 WTGVPGDYTNPD-----------APKALSDVRSLVDSGQYAEATAASVKLFGHPADV--- 101
W+G P + + PD + + +++ L +G A++ +L G D
Sbjct: 92 WSGGPSE-SRPDYNGGNLEEKGRNGQTVKEIQQLFANGDNDAASSKCGELVGLSDDAGVN 150
Query: 102 ----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
Y G++ L+F K E Y R LDLNTA A V+Y G+ +TRE+F S PD
Sbjct: 151 GYGYYLSYGNMYLDFKGISDKDVE-NYERTLDLNTAIAGVEYDNGDTHYTRENFVSYPDN 209
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM--------EGRCPGKRIPPKAN 209
V+VT+++ L+ +V ++ DN + N I E I
Sbjct: 210 VLVTRLTAEGGDKLNLDVRVEP--DNEAGGGSNKNTIQAQSYQREWETTVKDALISIDGQ 267
Query: 210 ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG----PFI 265
D+ ++FS+ + K+ + GT ED KV D + ++ S D P
Sbjct: 268 LKDNQ--MRFSS--QTKVLTEGGTT---EDGDEKVTVKDAKAVTIITSIGTDYKNDYPVY 320
Query: 266 NPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVT 321
+S++ S + A ++ N SY L H+DDY +F RV++ L + P
Sbjct: 321 RTGESQEQVASRVRAYVDKAADTVVNDSYDTLKQAHVDDYSSIFGRVNLDLGQVP----- 375
Query: 322 DTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP--------GTQVAN 373
SE+ D + A S E L +LFQ+GRYL I SSR T +N
Sbjct: 376 ---SEKTTDKLLKAYNDGSASEQERRYLEVILFQYGRYLTIESSRETPEDDPSRATLPSN 432
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIW S W S H+N+NL+MNYW + N++EC +PL ++ L G TA++
Sbjct: 433 LQGIWVGANSSAWHSDYHMNVNLQMNYWPTYSTNMAECAQPLISYVDSLREPGRVTAKIY 492
Query: 434 Y-LASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
+ G++ H + + + + S D W P W+ + WE+Y +T D +++
Sbjct: 493 AGVDQGFMAHTQNNPFGWTCPGWSFD-----WGWSPAAVPWILQNCWEYYEFTGDVSYMQ 547
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
YP+++ A F + LI+ G+L ++PS SPEH P + A +Y T+ I
Sbjct: 548 NYIYPMMKEEAIFYDNILIDDGTGHLVSSPSYSPEH---GP--RTAGNTYEQTL----IW 598
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIAEDGSIMEWVQR 596
+++ I AAE L + D LV RL+ P +I + G I EW +
Sbjct: 599 QLYEDTIKAAETLGVDAD-LVATWKDHQSRLKGPIEIGDSGQIKEWYEE 646
>gi|154503020|ref|ZP_02040080.1| hypothetical protein RUMGNA_00842 [Ruminococcus gnavus ATCC 29149]
gi|153796374|gb|EDN78794.1| fibronectin type III domain protein [Ruminococcus gnavus ATCC
29149]
Length = 2168
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 190/649 (29%), Positives = 303/649 (46%), Gaps = 87/649 (13%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTL 55
+++AE + + LK+ + A D ++PIGN +GA V+GGV +E ++LNE +L
Sbjct: 32 VVHAEESQDRSELKLRYTSAAPDSYDGWEKWSLPIGNSGIGASVFGGVQTERIQLNEKSL 91
Query: 56 WTGVPGDYTNPD-----------APKALSDVRSLVDSGQYAEATAASVKLFGHPADV--- 101
W+G P + + PD + + +++ L +G A++ +L G D
Sbjct: 92 WSGGPSE-SRPDYNGGNLEEKGRNGQTVKEIQQLFANGDNDAASSKCGELVGLSDDAGVN 150
Query: 102 ----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
Y G++ L+F K E Y R LDLNTA A V+Y G+ +TRE+F S PD
Sbjct: 151 GYGYYLSYGNMYLDFKGISDKDVE-NYERTLDLNTAIAGVEYDNGDTHYTRENFVSYPDN 209
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM--------EGRCPGKRIPPKAN 209
V+VT+++ L+ +V ++ DN + N I E I
Sbjct: 210 VLVTRLTAEGGDKLNLDVRVEP--DNEAGGGSNKNTIQAQSYQREWETTVKDALISIDGQ 267
Query: 210 ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG----PFI 265
D+ ++FS+ + K+ + GT ED KV D + ++ S D P
Sbjct: 268 LKDNQ--MRFSS--QTKVLTEGGTT---EDGDEKVTVKDAKAVTIITSIGTDYKNDYPVY 320
Query: 266 NPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVT 321
+S++ S + A ++ N SY L H+DDY +F RV++ L + P
Sbjct: 321 RTGESQEQVASRVRAYVDKAADTVVNDSYDTLKQAHVDDYSSIFGRVNLDLGQVP----- 375
Query: 322 DTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP--------GTQVAN 373
SE+ D + A S E L +LFQ+GRYL I SSR T +N
Sbjct: 376 ---SEKTTDKLLKAYNDGSASEQERRYLEVMLFQYGRYLTIESSRETPEDDPSRATLPSN 432
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIW S W S H+N+NL+MNYW + N++EC +PL ++ L G TA++
Sbjct: 433 LQGIWVGANSSAWHSDYHMNVNLQMNYWPTYSTNMAECAQPLISYVDSLREPGRVTAKIY 492
Query: 434 Y-LASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
+ G++ H + + + + S D W P W+ + WE+Y +T D +++
Sbjct: 493 AGVDQGFMAHTQNNPFGWTCPGWSFD-----WGWSPAAVPWILQNCWEYYEFTGDVSYMQ 547
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
YP+++ A F + LI+ G+L ++PS SPEH P + A +Y T+ I
Sbjct: 548 NYIYPMMKEEAIFYDNILIDDGTGHLVSSPSYSPEH---GP--RTAGNTYEQTL----IW 598
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIAEDGSIMEWVQR 596
+++ I AAE L + D LV RL+ P +I + G I EW +
Sbjct: 599 QLYEDTIKAAETLGVDAD-LVATWKDHQSRLKGPIEIGDSGQIKEWYEE 646
>gi|197302981|ref|ZP_03168031.1| hypothetical protein RUMLAC_01709 [Ruminococcus lactaris ATCC
29176]
gi|197297976|gb|EDY32526.1| LPXTG-motif cell wall anchor domain protein [Ruminococcus lactaris
ATCC 29176]
Length = 1960
Score = 221 bits (564), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 177/650 (27%), Positives = 303/650 (46%), Gaps = 83/650 (12%)
Query: 1 MMNAESTSTT-----NPLKITFNGPA---KHFT----DAIPIGNGRLGAMVWGGVPSETL 48
+NAE + T N LK+ + PA K++ ++PIGNG +G V+GG+ E +
Sbjct: 29 QVNAEPAAVTQQTGDNDLKLWYTSPADITKYYEGWQEKSLPIGNGAIGGTVFGGITRERI 88
Query: 49 KLNEDTLWTGVP---------GDYTNPDAPKA-LSDVRSLVDSGQYAEATA-ASVKLFGH 97
+LN+ +LW+G P G+ N A ++ + + +GQ + A + A+ L G
Sbjct: 89 QLNDKSLWSGGPSTSRPNYNGGNLENKGNNGATMTSIHNYFANGQDSSAISLANSNLVGV 148
Query: 98 PADV-------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREH 150
D Y G++ ++F + Y R+LDL TA A V Y G+ ++RE+
Sbjct: 149 SDDAGTNGYGYYLSWGNMYIDFKNVSSNNDVTNYTRDLDLKTAIAGVNYDKGSTHYSREN 208
Query: 151 FSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG----NNQIIMEGRCPGKRIPP 206
F+S PD VIVT I+ S +S +VS++ S +NG + Q + RI
Sbjct: 209 FTSYPDNVIVTHITADGSEKISLDVSVEPDNSRGSAINGIGDSSYQRTWDTTVSDGRISI 268
Query: 207 KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN 266
D+ ++FS+ ++ I+D+ GT++ D K+ V G+ ++ + + +
Sbjct: 269 NGQLTDNQ--MKFSSQTQV-ITDNAGTVTD-GDGKVSVSGASEVTIITSMGTDYKDEY-- 322
Query: 267 PSDSKKDPTSESMSALQ------SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
PS + SE + ++ +++ +Y +L H+ DYQ++F+RV + L +
Sbjct: 323 PSYRTGETASELTNRVKWYVDQAAVK--TYEELKANHVSDYQEIFNRVDLNLGQ------ 374
Query: 321 TDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG----------TQ 370
T S + D + SA + + E L +LFQ+GR++ I SSR T
Sbjct: 375 --TVSTKTTDALLSAYKAGTASEAERRQLEVMLFQYGRFMTIESSRETKTDGNGYVRETL 432
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
+NLQG+W + W S H+N+NL+MNYW + N++EC +PL D++ L G TA
Sbjct: 433 PSNLQGLWVGANNSPWHSDYHMNVNLQMNYWPTYSTNMAECAQPLVDYIDALREPGRVTA 492
Query: 431 QVNYLAS-------GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ S G++ H + + + + W P W+ + W +Y YT D
Sbjct: 493 AIYAGVSSADGEENGFMAHTQNNPFGWTCPGW-SFSWGWSPAAVPWILQNCWAYYEYTGD 551
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
+L YP+++ A L+ DG L ++P+ SPEH V+ +T +
Sbjct: 552 TSYLRDNIYPMMKEEAKLYDRMLVRDSDGKLVSSPAYSPEH---------GPVTSGNTYE 602
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+I +++ I AAEVL + D + P ++ + G I EW
Sbjct: 603 QTLIWQLYEDTIKAAEVLGTDADLVATWKANQADLKGPIEVGDSGQIKEW 652
>gi|302346987|ref|YP_003815285.1| hypothetical protein HMPREF0659_A7263 [Prevotella melaninogenica
ATCC 25845]
gi|302151004|gb|ADK97265.1| conserved hypothetical protein [Prevotella melaninogenica ATCC
25845]
Length = 1163
Score = 221 bits (563), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 155/525 (29%), Positives = 248/525 (47%), Gaps = 70/525 (13%)
Query: 11 NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N + + PA ++ T +PIGNG+ GA + G V + ++ N+ TLW+G G T
Sbjct: 341 NKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLWSGKLGGLT----- 395
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+TAA +G+ Y G++ + S Y R LD
Sbjct: 396 -----------------STAA----YGY----YLNFGNLYIR---SRGMSKVTDYVRYLD 427
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD-NHSY-V 187
+N A A VKY++ V ++R +F+SNPD +V + + S++G ++ ++L + N SY V
Sbjct: 428 INDAVAGVKYTMDGVAYSRTYFASNPDSCVVVRYTASQNGKINTTLTLKNQNGRNVSYTV 487
Query: 188 NGNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ NNQ I +G+ A +D S +I D GTI+ ++V
Sbjct: 488 DNNNQATITFDGQV--------ARQDDHGATTPESYYCAARIVTDGGTITKNAKGIIEVN 539
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G++ + L + FD + + + +N Y L H DY+ LF
Sbjct: 540 GANSMTVYLRGLTDFDPDAPTYVSGANLLAGRAAATVNDAQNKGYDALLAAHKADYKSLF 599
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLIS 363
R + LS +I P+ + + S++ ++ +L EL F +GRYLLIS
Sbjct: 600 DRCQLTLSDVKNNI-------------PTPQLISSYRDNQHDNLFLEELYFNYGRYLLIS 646
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL---T 420
SSR + ANLQGIWN++ +P W S H NIN++MNYW + P NLSE P D++
Sbjct: 647 SSRGVSLPANLQGIWNDNNTPAWHSDIHANINVQMNYWPAEPTNLSELHRPFLDYIYREA 706
Query: 421 YLSINGSKTAQ-VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ + AQ + ++ +GW + + +I+ G + + AW C HLW+HY
Sbjct: 707 CVKPTWRRFAQDMGHVNTGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHLWQHYT 761
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
YTMD+DFL +A+P ++ + L++ DG E SPEH
Sbjct: 762 YTMDKDFLRAKAFPAMKSAVDYWFKKLVKAADGTYECPNEWSPEH 806
>gi|350633541|gb|EHA21906.1| hypothetical protein ASPNIDRAFT_184037 [Aspergillus niger ATCC
1015]
Length = 758
Score = 221 bits (563), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 176/593 (29%), Positives = 275/593 (46%), Gaps = 67/593 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPKA--LSDVRS 77
T A P+GNGRLGAM G E + LN D+LW G P + Y+ NP+ KA L +R
Sbjct: 36 TTAFPLGNGRLGAMPIGSYDKEIVNLNVDSLWRGGPFESPTYSGGNPNVSKAGALPGIRE 95
Query: 78 LVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDD-SHLKYAEETYRRELDLNTAT 134
+ + T L G +P YQ+L ++ ++ + S + + YRR LDL++A
Sbjct: 96 WI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGELSDI----DGYRRNLDLDSAV 147
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV--SLDSLLDNHSYVNGNN 191
+S G RE F S PD V V ++S + S ++F + L S N S +GN+
Sbjct: 148 YSDHFSTGETYIEREAFCSYPDNVCVYRLSSNSSLPEITFGLENQLTSPAPNVS-CHGNS 206
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV-EGSDWA 250
+ G+ P G+ ++A + + + T +KV EG
Sbjct: 207 ISLY-----GQTYPVI--------GMIYNARVTVVVPGSSNTTDLCSSSTVKVPEGEKEV 253
Query: 251 VLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
L+ A ++++ N S ++P + + + SYS L + H+ DYQ +F+
Sbjct: 254 FLVFAADTNYEASNGNSKASFSFKGENPYMKVLQTATNAAKKSYSALKSSHVKDYQGVFN 313
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
+ ++ L P+ E + S+ DP++ LLF +GRYL ISSSR
Sbjct: 314 KFTLTLP-----------DPNGSADRPTTELLSSYSQPGDPNVENLLFDYGRYLFISSSR 362
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-N 425
PG+ NLQG+W E SP W H NINL+MN+W L E EPL+ ++ +
Sbjct: 363 PGSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVDQTGLGELTEPLWTYMAETWMPR 422
Query: 426 GSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TA++ Y S GWV H + + + +A + WA +P AW+ H+W+H++Y+ D
Sbjct: 423 GAETAELLYGTSKGWVTHDEMNTFGH-TAMKDVAQWADYPATNAWMSHHVWDHFDYSQDS 481
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
+ + YP+L+G A F L L++ DG L NP SPEH P C Y
Sbjct: 482 AWYRETGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPEH---GPT-TFGCTHYQQ- 536
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
+I E+F ++ ++ + + L P I G I EW
Sbjct: 537 ----LIWELFDHVLQGWTASGDDDTSFKNAITSKFSTLDPGIHIGSWGQIQEW 585
>gi|340514861|gb|EGR45120.1| glycoside hydrolase family 95 [Trichoderma reesei QM6a]
Length = 795
Score = 220 bits (561), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 172/630 (27%), Positives = 297/630 (47%), Gaps = 68/630 (10%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV-----PGDYTNPDA 68
++ + P+ F ++P+GNGR A V E L LNE + W+G G P+
Sbjct: 6 RLFYTTPSTAFPTSLPLGNGRFAASVLSSPSKEVLILNEVSFWSGKEQPAGAGLSHKPER 65
Query: 69 PK-ALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSH-LKY 119
K L + + SG YA+ + + FG V G +E+ + +
Sbjct: 66 AKDELRETQRCYLSGDYAQGKKRAERFLESRKTNFGTNLGV----GRLEIAVNGQETIDG 121
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ REL L+ A +Y++ +F R F S+P QV+V ++ G + L V +
Sbjct: 122 VVSGFERELRLDEAVTETRYTLSGRQFKRRCFLSHPHQVLVVQLEGDDLQGLEIEVDVQG 181
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N ++ + N +G+ + +D G++ ++ + D G + +
Sbjct: 182 --ENEAFTSNVN---ADGKLEFNVQALETVHSDGTCGVKGYGLIAATV--DEGKVQR-RN 233
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
KL + +L+ +F+ + P D+ + T M A LS SDL+ HL
Sbjct: 234 GKLVISAKKSITILV----TFNTDYAEPGDAWRRRTVAQMDA---ALELSASDLFQAHLQ 286
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFG 357
D+Q L+ RVSI L +++CS + P+ +R +SF+ D + L F +
Sbjct: 287 DFQPLYRRVSISLG-------SESCS---TASAPTDQRRQSFEASGYADAGMFALYFHYA 336
Query: 358 RYLLISSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
RYL I+ +R + + +LQG+WN E W H++IN +MNY+ + LS+ +P
Sbjct: 337 RYLTIAGTRHDSPLPLHLQGLWNDGEACKMGWSCDYHLDINTQMNYFAIMNSGLSDLMQP 396
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTH 473
L ++L L +G TA+V Y GWV H +++W + D G +V + L GG WL +H
Sbjct: 397 LINYLVRLGESGQDTARVCYGCPGWVAHVFSNVWGFT--DPGWEVSYGLNVTGGLWLASH 454
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF--IAPD 530
L E + Y++D F A+ +L G + F LD++IE G+L T PS SPE+ F + D
Sbjct: 455 LIEMFEYSLDDSFTRNEAWSVLLGASKFFLDYMIEDPKTGWLLTGPSVSPENSFFVVKED 514
Query: 531 GKLA--CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV---LKSLPRLRPTKIA 585
G+ + + T+D+ ++R++F+ A L+ E E V ++L +L P +I
Sbjct: 515 GEKEEHYAALAPTLDIVLVRDLFAFCEYALTKLDCQESNYKEDVRMYREALAKLPPFQIG 574
Query: 586 EDGSIMEWV---------QRRLNTSFSTCK 606
++G + EW+ R L+ + + C+
Sbjct: 575 KNGQLQEWLHDFEEAQPYHRHLSHTMALCR 604
>gi|302523529|ref|ZP_07275871.1| fibronectin type III domain-containing protein [Streptomyces sp.
SPB78]
gi|302432424|gb|EFL04240.1| fibronectin type III domain-containing protein [Streptomyces sp.
SPB78]
Length = 661
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 155/496 (31%), Positives = 237/496 (47%), Gaps = 46/496 (9%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
+Q GD+ ++ D + + E Y R LDL A A V Y F R F+S PD+V+V
Sbjct: 20 HQTFGDLLIDVDGA--PGSAEGYTRTLDLAQALATVSYPHDGATFHRTVFTSCPDKVLVG 77
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+ GS+ N+ S + + +++ + G G++F A
Sbjct: 78 HFTADRGGSVGLNLRYTSPRQDFTATTDGDRLTVRGAL-------------QDNGMRFEA 124
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
+I++ + GT++A D+ L V G+D A +L A + + + P DP +A
Sbjct: 125 --QIRLLSEGGTVTANGDR-LAVSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVATA 179
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
+ Y +L RH D+ LF RV + L + D+ + D + A S
Sbjct: 180 VDQAAARPYRELLDRHTSDHAALFSRVVLDLGQ-------DSAPDRTTDALLKAYTGGS- 231
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+ +D +L L FQ+GRYLLI+SSR G+ ANLQG WN +P W + HVNINL+MNYW
Sbjct: 232 -SADDRALEALFFQYGRYLLIASSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYW 290
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVW 460
+ NL+E P F+ L G TA+ + A GWV+H +T + + D W
Sbjct: 291 PAEATNLAETTAPYDRFVEALRAPGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW 350
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPS 519
+P AWL + L+EHY + D+L AYP ++ A F +D L + D L PS
Sbjct: 351 --FPEAAAWLTSQLYEHYRFDGSTDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPS 408
Query: 520 TSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
SPEH +F A + M I+RE+F + AA+ L ++ A + ++L R
Sbjct: 409 FSPEHGDFTA----------GAAMSQQIVRELFLNTLEAAQTL-GDDPAFRATLKETLDR 457
Query: 579 LRPT-KIAEDGSIMEW 593
+ P +I G +MEW
Sbjct: 458 IDPGLRIGSWGQLMEW 473
>gi|336429327|ref|ZP_08609294.1| hypothetical protein HMPREF0994_05300 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336002938|gb|EGN33035.1| hypothetical protein HMPREF0994_05300 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 779
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 178/594 (29%), Positives = 275/594 (46%), Gaps = 53/594 (8%)
Query: 21 AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALSDVRSLV 79
A+ + +A +GNGR+GA V+GGV ET+ L+E T ++G N A A ++RSL+
Sbjct: 11 AERWQEAYLLGNGRMGAAVYGGVFEETVDLSEITFFSGSSSSENNQKGAALAFQEMRSLL 70
Query: 80 DSGQYAEATAASVKLFGHPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
G+ A + G + L G +++ ++S K + Y R LDL T +
Sbjct: 71 QEGKEEAAMERASDFIGIRENYGTNLPVGRLKIMLENSGEK--PDGYVRRLDLQTGLFSM 128
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+Y R F S PDQV +I + SLS + ++ G N
Sbjct: 129 EYRQEGSTVVRNAFVSWPDQVFCYEIKTGKPESLSGRIWVE---------GGENPFSART 179
Query: 198 RCPGKRIPPKANA---NDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEGSDW 249
R +A +D G+ S +++ KIS GTI+ +L +
Sbjct: 180 EEEEYRFQVQAREKLHSDGSCGVDLSGMVKAWCEDGKISCSGGTIAFTGCSRLLIG---- 235
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
L + D K +S+ Y + +RH++D + RVS
Sbjct: 236 --LWMETDYEEKAGLTACKDKKAGCAKQSLPK-------EYDRIRSRHMEDVKSRMERVS 286
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L + +E+ VP+ ERV S Q EDP L L FQFGRYLL SSR
Sbjct: 287 LCLGTKEE--------QEDAAAVPTDERVLASRQGKEDPLLFALAFQFGRYLLQCSSRED 338
Query: 369 TQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI- 424
+ + A+LQG+WN++++ W H++IN +MNYW S P NL EC+ PLF ++ L I
Sbjct: 339 SPLPAHLQGVWNDNVACRIGWTCDMHLDINTQMNYWLSGPGNLPECRRPLFAWMEKLLIP 398
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +A+ +Y GW ++ W S+ + + + P GG W + EHY YT D
Sbjct: 399 SGRISARESYGRKGWSADLVSNAWGFSAPYWSRTI-SPCPTGGIWQASDYMEHYRYTRDE 457
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
F + AYP++ F ++ EG DG + PS SPE+ +I +G+ S T ++
Sbjct: 458 AFAREHAYPVIREAVEFFTGYVFEGEDGCYLSGPSISPENAYIK-EGEKRFFSNGCTYEI 516
Query: 545 AIIREVFSAIISAAEV---LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+IRE+ + A L + + ALV + K LPRL P +I DG++ EW
Sbjct: 517 LMIRELLEEFLELASFLPDLAEKDRALVMQAEKILPRLLPYRILPDGTLAEWAH 570
>gi|345882387|ref|ZP_08833873.1| hypothetical protein HMPREF0666_00049 [Prevotella sp. C561]
gi|345044169|gb|EGW48214.1| hypothetical protein HMPREF0666_00049 [Prevotella sp. C561]
Length = 1163
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 152/530 (28%), Positives = 248/530 (46%), Gaps = 80/530 (15%)
Query: 11 NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N + + PA ++ T +PIGNG+ GA + G V + ++ N+ TLW+G G T
Sbjct: 341 NKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLWSGKLGGLT----- 395
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET----YR 125
+TAA +G+ L F + +++ E T Y
Sbjct: 396 -----------------STAA----YGY-----------YLNFGNLYIRSRELTKVTDYV 423
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD-NH 184
R LD+N A A V+Y++ V + R +F++NPD +V + + SE G ++ ++L + N
Sbjct: 424 RYLDINDAVAGVRYTMDGVAYDRTYFATNPDSCLVIRYTASEKGRINTTLTLKNQNGRNV 483
Query: 185 SY-VNGNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+Y V+ NNQ I EG+ A ND S +I D G+++
Sbjct: 484 NYTVDNNNQATITFEGKV--------ARQNDKGATTPESYYCAARIVTDGGSVTKNAKGL 535
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
++V G++ + L + FD + + + + N Y L H DY
Sbjct: 536 IEVSGANSMTVYLRGLTDFDPDAAEYVSGADRLAGRATATVNNAENKGYDALLAAHKADY 595
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRY 359
+ LF R + L+ S +T+P+ + + +++ ++ +L EL F +GRY
Sbjct: 596 KSLFDRCQLTLADSK-------------NTIPTPQLISNYRDNQHDNLFLEELYFNYGRY 642
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSR + ANLQGIWN++ +P W S H NIN++MNYW + P NLSE P D++
Sbjct: 643 LLISSSRGVSLPANLQGIWNDNNTPAWHSDIHANINVQMNYWPAEPTNLSELHRPFLDYI 702
Query: 420 TYLSINGSKT-----AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
Y T + ++ +GW + + +I+ G + + AW C HL
Sbjct: 703 -YREACVKPTWRRFAKDMGHVNTGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHL 756
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
W+HY YTMD++FL +A+P ++ + L++ DG E SPEH
Sbjct: 757 WQHYTYTMDKEFLRTKAFPAMKTAVDYWFKKLVKAADGTYECPNEWSPEH 806
>gi|427404601|ref|ZP_18895341.1| hypothetical protein HMPREF9710_04937 [Massilia timonae CCUG 45783]
gi|425716772|gb|EKU79741.1| hypothetical protein HMPREF9710_04937 [Massilia timonae CCUG 45783]
Length = 764
Score = 219 bits (557), Expect = 4e-54, Method: Compositional matrix adjust.
Identities = 166/591 (28%), Positives = 271/591 (45%), Gaps = 87/591 (14%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
+A+PIGNGR+GAMV+G E L+ N+ TLWTG D +++
Sbjct: 46 EALPIGNGRIGAMVFGQPGREHLQFNDITLWTG---------------DDKTM------- 83
Query: 86 EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE 145
+Q GD+ +E + YRR LDL V Y+ G V
Sbjct: 84 --------------GAFQPFGDLLVELPGHESGVTD--YRRTLDLGRGVHTVTYTHGGVR 127
Query: 146 FTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
+ RE ++S P QVIV +++ G S VSL H V N ++ G G +P
Sbjct: 128 YRREAWASFPAQVIVLRLTADRPGRYSGAVSLTDRHGAHLAV-ANGRLHATGTLAGFALP 186
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
+A P G S + ++ D G ++A + +++ G+D L+L A +S+ +
Sbjct: 187 DQA-----PSGNVMSYASQAQVISDGGKLTA-DGQRIAFAGADGLTLILGAGTSY---VL 237
Query: 266 NPSD--SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
+ + P + + + + + L H++D+++L RV+I L +P
Sbjct: 238 DAARRFEGGHPLARVTAQVDQAAARAPAALLEEHVEDFRRLMQRVAIDLGETPA------ 291
Query: 324 CSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
+P+ R+ ++ + DP L FQ+GRYLL SSSR G+ ANLQG+WN L
Sbjct: 292 ----ARRALPTDARLLAYTKAGGDPELEAQYFQYGRYLLASSSR-GSLPANLQGLWNNSL 346
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS----- 437
+P W++ H NIN++MNYW + NL E P FDF+ ++ + + +
Sbjct: 347 TPPWNADYHTNINVQMNYWPAEVTNLGESALPFFDFVNGMAPVWRRATTEEFRRADGQPV 406
Query: 438 -GWVIHHKTDIWAKSSADRGKVVWALW-PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
GW + +++ + LW G AW H WEHY + D FL + AYP++
Sbjct: 407 RGWTLRTESNPFGAMD--------YLWNKTGNAWYAQHFWEHYAFNRDERFLREVAYPVM 458
Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
+ ++F D+L DG L SPEH + DG V+Y D I+ ++F+ +
Sbjct: 459 KEASAFWQDYLKALPDGRLVAPQGWSPEHGPVE-DG----VAY----DQQIVWDLFNNTV 509
Query: 556 SAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLNTSFSTCK 606
AA +L + D L ++ RL +I G ++EW++ + + T +
Sbjct: 510 EAAGILRVDPD-LRAQLAAMRDRLAGPRIGSWGQLLEWLEEKKDPVLDTPR 559
>gi|451852884|gb|EMD66178.1| glycoside hydrolase family 95 protein [Cochliobolus sativus ND90Pr]
Length = 805
Score = 219 bits (557), Expect = 5e-54, Method: Compositional matrix adjust.
Identities = 173/597 (28%), Positives = 277/597 (46%), Gaps = 63/597 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVRSLVDSGQ 83
A+P+GNGRL AM G +ETL LN D+LW+G P +YT + ++ +
Sbjct: 38 ALPVGNGRLAAMPIGSPSAETLTLNLDSLWSGGPFEASNYTGGNPESSIDSTLPGIRDWI 97
Query: 84 YAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV 141
+ T KL G + Y++L ++ + S + Y R+LDL ++
Sbjct: 98 FTNGTGNVTKLLGTNDNYGSYRVLANLTVTIP-SLVGIQVSNYTRKLDLTNGLHSTSFNT 156
Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-----DSLLDNHSYV-NGNNQIIM 195
+ + F S PDQV V I S S +F + L D+ L+N + V NG
Sbjct: 157 NDTQLESTVFCSYPDQVCVYTIQSSRSLP-AFELKLGNELVDAKLENITCVANGTGADSG 215
Query: 196 EGRCPG-KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV---EGSDWAV 251
R G ++ P P+G+ + I + + D T LKV G+ A
Sbjct: 216 HVRLRGVTQLGP-------PEGMLYDTIARLLPNSDVKTTCDSNTGILKVTPENGAKSAT 268
Query: 252 LLLVASSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
+++ A +++D S DP +Q + + +L + HL+D+ L R
Sbjct: 269 VIIGAETNYDMKKGTAEHQYSFRGNDPGPAVEETIQKVSMKTLEELKSSHLEDFTSLTGR 328
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ---TDEDPSLVELLFQFGRYLLISS 364
L P + N VP+ E + S+ T DP + LLF + +YLLISS
Sbjct: 329 FEFHL---PDPL--------NSAQVPTPELIASYDSNVTSGDPFVESLLFDYAQYLLISS 377
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+ NLQG W E ++P W + H NINL+MNYW + L+E Q PL+D++ +
Sbjct: 378 SRPGSLPTNLQGRWTEQMAPDWSADYHANINLQMNYWTADQTGLTETQTPLWDYMINTWV 437
Query: 425 -NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G +TA + Y A GWV+H++ +I+ + G+ WA +P AW+ H++++++YT D
Sbjct: 438 PRGHETAMLLYGAPGWVVHNEMNIFGHTGMKDGE-GWANYPAAPAWMMLHVFDYWDYTRD 496
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGH------DGYLETNPSTSPEHEFIAPDGKLACVS 537
+L + YPL++ A F WL + H D L NP +SPEH P C
Sbjct: 497 TTWLRTQGYPLIKSVAQF---WLSQLHADSFTNDNTLVVNPCSSPEH---GPT-TFGCAH 549
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
Y +I +VF A+++ + +++ + + +L RL + + I EW
Sbjct: 550 YQQ-----LIHQVFEAVLTTHSLAGESDTSFTSNISSTLSRLDKGFHVGSWSQIKEW 601
>gi|331082986|ref|ZP_08332105.1| hypothetical protein HMPREF0992_01029 [Lachnospiraceae bacterium
6_1_63FAA]
gi|330399723|gb|EGG79384.1| hypothetical protein HMPREF0992_01029 [Lachnospiraceae bacterium
6_1_63FAA]
Length = 1760
Score = 218 bits (555), Expect = 8e-54, Method: Compositional matrix adjust.
Identities = 170/594 (28%), Positives = 273/594 (45%), Gaps = 58/594 (9%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS- 77
+PIGN +GA V+G + E L N+ TLW G P G+ D + +SDV
Sbjct: 75 LPIGNSFMGANVYGEIGQERLTFNQKTLWNGGPSENRPDYDGGNKETADNGQKMSDVYKE 134
Query: 78 ---LVDSGQYAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLN 131
L G A+A + KL G + YQ GDI ++F LK + E Y R+L+L
Sbjct: 135 IIELYKEGNDAQANELAKKLTGEVNGYGAYQSWGDIYVDFG---LKEEQAENYVRDLNLE 191
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A A V + + + RE+F S PD V+ K + S L F++S +DN V
Sbjct: 192 NAVASVDFDYQDTKMHREYFISYPDNVLAMKFTAEGSEKLDFDISFP--IDNAEGVADKK 249
Query: 192 -QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+E I D+ Q ++K+ + G + + KL V G+ A
Sbjct: 250 LGKSVETTVEDDTITVSGEMQDN----QLQLNGKLKVETEGGKVQEKDGDKLHVSGASEA 305
Query: 251 VLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
V+ + A + + P ++ ++ + A+ Y + H+ DY ++F RV
Sbjct: 306 VVYVSADTDYLNKYPDYRTGETAQELDASVERAVDKASKKGYEKVKKEHIKDYSEIFSRV 365
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L ++ D TD + + + ++ E+ +L +LFQ+GRYL I+SSR G
Sbjct: 366 QLDLGQNVPDKTTDIL----LKDYNAGKNTEA----ENRALEVILFQYGRYLTIASSRAG 417
Query: 369 TQVANLQGIWNEDLSPT----WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
+NLQG+W + W S H+N+NL+MNYW + N++EC PL D++ L
Sbjct: 418 DLPSNLQGVWQNRVGDHNRIPWASDYHMNVNLQMNYWPTYSTNMAECATPLIDYINSLVE 477
Query: 425 NGSKTAQVNY-LASGWVIHHKTDI---WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
G TA+ + + +G H + W D W P W+ + WE+Y Y
Sbjct: 478 PGKVTAKTYFGVENGGFTAHTQNTPFGWTCPGWD---FSWGWSPAALPWILQNCWEYYEY 534
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYS 539
T D ++E+ YP+L+ A LIE G L + P+ SPEH V+
Sbjct: 535 TGDVKYMEEHIYPMLKEAALLYDQILIEDEKTGRLVSAPAYSPEH---------GPVTAG 585
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+T + ++I +++ +AAE+L K+E+ E + +L+P +I E G I EW
Sbjct: 586 NTYEQSLIWQLYEDAATAAEILSKDEEKAKEWRQRQ-QKLKPIEIGESGQIKEW 638
>gi|225018139|ref|ZP_03707331.1| hypothetical protein CLOSTMETH_02076 [Clostridium methylpentosum
DSM 5476]
gi|224949136|gb|EEG30345.1| hypothetical protein CLOSTMETH_02076 [Clostridium methylpentosum
DSM 5476]
Length = 1556
Score = 218 bits (554), Expect = 9e-54, Method: Compositional matrix adjust.
Identities = 165/617 (26%), Positives = 288/617 (46%), Gaps = 72/617 (11%)
Query: 11 NPLKITFNGPAKHFT-DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN---- 65
N L++ + PA ++T D + IGNG G +++ GV + + NE TLW G PG +N
Sbjct: 57 NTLRMWYTKPASNWTNDCLVIGNGSTGGVLFSGVGRDRVHFNEKTLWNGGPGSVSNYNGG 116
Query: 66 ----PDAPKALSDVRSLVD---SGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSH 116
P + L +R D + + T G+ + + YQ GD+ L+F +
Sbjct: 117 NRTIPTTKEQLDAIREQADDHSTSVFPLGTGGVRDFMGNGSGMGQYQDFGDLYLDFSKTG 176
Query: 117 LKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
+ A T Y R+LD+ TA + + Y V + RE+F S+PD+V+ +++ SE+G L+F+
Sbjct: 177 MTDANATNYVRDLDMRTAVSSLNYDYDGVHYEREYFVSHPDKVMAVRLTASEAGKLTFDA 236
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S V + + RI ++ + A ++ ++ GT++
Sbjct: 237 S----------VAAASGLTTTATAQDGRITLAGTVRNNGMKCEMQA----QVINEGGTLT 282
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+ +D + VEG+D ++L + + + P+ DP E + + + SY +L
Sbjct: 283 SNDDGTVSVEGADAVTIVLTTGTDYANDW--PTYRTDDPHDELTATVDAAAAKSYQELKD 340
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV-ELLF 354
HL DYQ+LF R+ I L C + VP+ E +K+++ E E+++
Sbjct: 341 AHLADYQELFSRLEIDLGGE--------CPQ-----VPTDEMMKAYRRGETSHAAEEMVY 387
Query: 355 QFGRYLLISSSRPGTQV-ANLQGIW-NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
QFGRYL I+ SR G ++ NL G+W W + H N+N++MNYW + NL+EC
Sbjct: 388 QFGRYLTIAGSREGDELPTNLCGLWLIGSAGSYWGADFHFNVNVQMNYWPAYQTNLAECG 447
Query: 413 EPLFDFLTYLSINGSKTAQVNYL-----------ASGWVIHHKTDIWAKSSADRGKVVWA 461
D++ L G TA + +G++++ + + + +A G +
Sbjct: 448 SVFTDYMESLVEPGRVTAGASAALPTEPGTPIGEGNGFLVNTQNNPFG-CTAPFGSQEYG 506
Query: 462 LWPMGG-AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPS 519
W +GG +W ++++ Y YT D++ L+ + YP+L+ A+F +L + G L PS
Sbjct: 507 -WNIGGSSWALQNVYDQYLYTGDKELLKNKIYPMLKEQANFWNQFLWYSDYQGRLVVGPS 565
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
S E +T D +I+ E++ I A+E+L +ED K +L
Sbjct: 566 VSAEQ---------GPTVNGTTYDQSIVWELYKMAIEASEILGVDEDQRAVWEDKQ-SQL 615
Query: 580 RPTKIAEDGSIMEWVQR 596
P I G + EW +
Sbjct: 616 NPIIIGSQGQVKEWYEE 632
>gi|288803110|ref|ZP_06408545.1| fibronectin type III domain protein [Prevotella melaninogenica D18]
gi|288334371|gb|EFC72811.1| fibronectin type III domain protein [Prevotella melaninogenica D18]
Length = 1163
Score = 217 bits (553), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 152/526 (28%), Positives = 248/526 (47%), Gaps = 72/526 (13%)
Query: 11 NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N + + PA ++ T +PIGNG+ GA + G V + ++ N+ TLW+G G T
Sbjct: 341 NKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLWSGKLGGLT----- 395
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+TAA +G+ Y G++ + S Y R LD
Sbjct: 396 -----------------STAA----YGY----YLNFGNLYIR---SRGMSKVTDYVRYLD 427
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD-NHSY-V 187
+N A A V+Y++ V ++R +F+SNPD +V + + S++G ++ ++L + N SY V
Sbjct: 428 INDAVAGVRYTMDGVAYSRTYFASNPDSCVVVRYTASQNGKINTTLTLKNQNGRNVSYTV 487
Query: 188 NGNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ NNQ I +G+ A +D S +I D GTI+ ++V
Sbjct: 488 DNNNQATITFDGQI--------ARQDDHGATTPESYYCVARIVTDGGTITKNAKGVIEVN 539
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G++ + L + FD + + + + +N Y L+ H DY+ LF
Sbjct: 540 GANSMTVYLRGLTDFDPDAPTYVSGANLLAARAAATVNGAQNKGYDALFAAHKTDYKSLF 599
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLIS 363
R + L +I P+ + + S++ ++ +L EL F +GRYLLIS
Sbjct: 600 DRCQLTLGDVKNNI-------------PTPQLISSYRNNQHDNLFLEELYFNYGRYLLIS 646
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR + ANLQGIWN++ +P W + H NIN++MNYW + P NLSE P D++ Y
Sbjct: 647 SSRGISLPANLQGIWNDNNTPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYI-YRE 705
Query: 424 INGSKTAQ-----VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
T + + ++ +GW + + +I+ G + + AW C HLW+HY
Sbjct: 706 ACVKPTWRRFAPDMGHVNTGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHLWQHY 760
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
YTMD+DFL +A+P ++ + L++ DG E SPEH
Sbjct: 761 TYTMDKDFLRTKAFPAMKSAVDYWFKKLVKAADGTYECPNEWSPEH 806
>gi|429860747|gb|ELA35469.1| glycoside hydrolase family 95 protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 797
Score = 217 bits (552), Expect = 1e-53, Method: Compositional matrix adjust.
Identities = 177/588 (30%), Positives = 274/588 (46%), Gaps = 64/588 (10%)
Query: 29 PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAPK--ALSDVRSLVDS 81
P+GNG+LGA+ +G SE + LN D+LW G P +YT NP PK AL ++R+ +
Sbjct: 44 PVGNGKLGAIPFGPPGSEKVNLNIDSLWAGGPFGASNYTGGNPTEPKYEALPEIRATI-- 101
Query: 82 GQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY 139
+ T L G D ++L ++ + Y++ YRR LDL T K+
Sbjct: 102 --FENGTGDVSPLLGVGDDYGSNRVLANLTVNIQGIS-DYSD--YRRTLDLKTGVHTTKF 156
Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN---GNNQIIME 196
+ F HF S PDQV V I+ SE + V ++ L N G++ +
Sbjct: 157 TANGAAFEISHFCSYPDQVCVYHIA-SEGALPAVEVGYENQLVEQDTFNVSCGDDHVRFA 215
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G + PP+ D I A + S + T++ +D+K +++
Sbjct: 216 GLT--QLGPPEGMKFDSIARINKGAAITNCTSANFLTVTPEKDQKA-------LTIIIGG 266
Query: 257 SSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+++D N S DP + S+ + H+ DYQKL + L
Sbjct: 267 ETNYDQKNGNAESDYSFKGGDPGPIVEKTTSDAASKSFHTILKDHIADYQKLESACELNL 326
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
DT E +T + + + TD DP + LLF + RYLLI+SSR +
Sbjct: 327 P--------DTQGSEEKET---GQLISDYVYTDGGDPYVEALLFDYSRYLLITSSRANSL 375
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NGSKT 429
ANLQG W E L P W + H NIN++MNYW + L E Q L+D++ + G++T
Sbjct: 376 PANLQGRWTEQLWPAWSADYHANINIQMNYWAADQTGLGETQTALWDYMEDTWVPRGAET 435
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A++ Y ASGWV+H++ + + ++ G WA +P AW+ H+W+++ YT D ++ +
Sbjct: 436 AKLLYNASGWVVHNEMNTFGHTAMKEGS-SWANYPAAAAWMMQHVWDNFEYTQDLEWFIR 494
Query: 490 RAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+ YPL++G A F L L E +DG L NP SPEH P C Y +
Sbjct: 495 QGYPLIKGVAEFWLSQLQEDLYFNDGTLVVNPCNSPEH---GPT-TFGCTHYHQ-----M 545
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
I +VF A++ A + +E V +L RL + + E G + EW
Sbjct: 546 IHQVFEAVLHGATFVSTK---FIEDVPPNLNRLDKGVHVTEWGGLKEW 590
>gi|359406206|ref|ZP_09198915.1| hypothetical protein HMPREF0673_02149 [Prevotella stercorea DSM
18206]
gi|357556624|gb|EHJ38213.1| hypothetical protein HMPREF0673_02149 [Prevotella stercorea DSM
18206]
Length = 1013
Score = 216 bits (550), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 172/568 (30%), Positives = 264/568 (46%), Gaps = 91/568 (16%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
+TT L G + A+PIG+G+ GA ++GGV + ++ NE TLW+G P
Sbjct: 216 ATTAKLYSGGQGYSNWMEYALPIGDGQFGACLFGGVYRDEIQFNEKTLWSGTP------- 268
Query: 68 APKALSDVRSLVDSGQYAEATAASVKLFGHPADVY--QLLGDIELEFDDSHLKYAEETYR 125
RS Y + + + +Y L G+ L D A Y
Sbjct: 269 -------ARSSQGGKGYGK--------YENFGSIYAKDLSGEFGLTTDK-----AASNYV 308
Query: 126 RELDLNTATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLD 182
R LDL TAT + + S VE+TRE+ +SNP +V+V + S+ G LSF ++ S+
Sbjct: 309 RLLDLTTATGKTMFKSAAGVEYTREYIASNPARVVVAHYTASKGGKLSFRFTMAAGSITA 368
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ +Y +G EG GK NA +K+ GT++ +D+ +
Sbjct: 369 DPTYADG------EGTFSGKLETISYNA-------------RMKVVPVGGTMTT-DDEGI 408
Query: 243 KVEGSDWAVLLLVASSSFDG---PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+V G+D +++L + FD + + + S+ ++A + S+ DLY H+
Sbjct: 409 EVIGADEIMVVLGGGTDFDAYESTYTKNTSALAQTISDRVAAAAA---KSWKDLYAEHVA 465
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ F+R L+ + D+ T+ IDT S + L +L F +GRY
Sbjct: 466 DYQSFFNRCEFDLAGTKNDMTTNRL----IDTYNSGRGADALM------LEQLYFAYGRY 515
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
L ISSSR +NLQGIWN W+S H NIN++MNYW + P NLSE P FL
Sbjct: 516 LEISSSRGVDSPSNLQGIWNNINGVAWNSDIHSNINVQMNYWPAEPTNLSEMHLP---FL 572
Query: 420 TYLSINGSKTAQVNYLAS------GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
Y+ K Q A GW + +I+ SA + V + AW TH
Sbjct: 573 NYIWAMAEKQPQWKQWAKLQGQDRGWTCFTENNIFGGVSAFKNNYV-----IANAWYTTH 627
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LW+HY YT+DR++L KR +P + + F +D L DG E SPEH + +G
Sbjct: 628 LWQHYRYTLDREYL-KRVFPAMLSASQFWMDRLKLASDGTYECPNEWSPEHGPESENG-- 684
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL 561
V+++ + + ++FS ++A +VL
Sbjct: 685 --VAHAQQL----VYDLFSNTLAAIDVL 706
>gi|317500980|ref|ZP_07959190.1| hypothetical protein HMPREF1026_01133 [Lachnospiraceae bacterium
8_1_57FAA]
gi|316897683|gb|EFV19744.1| hypothetical protein HMPREF1026_01133 [Lachnospiraceae bacterium
8_1_57FAA]
Length = 1966
Score = 216 bits (549), Expect = 4e-53, Method: Compositional matrix adjust.
Identities = 180/632 (28%), Positives = 300/632 (47%), Gaps = 87/632 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY----------TNPDAPKALSDVR 76
A+P+GN +GA V+GGV +E ++LNE +LW+G P D + K ++ ++
Sbjct: 68 ALPLGNSAIGASVFGGVQTERIQLNEKSLWSGGPSDSRPEYNGGNIESKGQNGKVMAQLK 127
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
+ SGQ ++ A +L G D Y G++ L+F + K Y R+LD
Sbjct: 128 EKLKSGQGFDSNLAG-QLIGVSDDAGVQGYGYYLSYGNMYLDFKNV-TKNNVSGYSRDLD 185
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V Y + +TRE+F S PD V+VT+++ ++ G+L F+V ++ +
Sbjct: 186 LRTAVAGVNYDLNGAHYTRENFVSYPDNVLVTRLTATDGGTLDFDVRVEP---DEEKGGS 242
Query: 190 NNQIIME--GRCPGKRIPPKANANDDP---KGIQFSAILEIKISDDRGTISALED--KKL 242
NQ + R K++ A A D ++FS+ ++ I DD GT ++D K
Sbjct: 243 QNQPGADSYARTFDKKVSDNAIAIDGQLTDNQLKFSSYTKV-IKDD-GTAGQIKDDSKNG 300
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL---------QSIRNLSYSDL 293
K+ S + ++ S D P + T E ++AL ++ Y L
Sbjct: 301 KITVSGAKAITIITSIGTDYKNDYPK-YRTGETKEQLAALVKGYVSGAEAKVKAGGYETL 359
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H++DY +F R+ + + ++ D TD E A + + E L +L
Sbjct: 360 KEDHVNDYDHIFGRLDLNIGQAVSDKTTDKLLE--------AYKKGTASETEKRYLELML 411
Query: 354 FQFGRYLLISSSRP-------------GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
FQ+GRYL + SSR T +NLQGIW + W S H+N+NL+MNY
Sbjct: 412 FQYGRYLTMGSSRETPVNEDGTKNERRATLPSNLQGIWVGANNSAWHSDYHMNVNLQMNY 471
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWVIHHKTDIWAKS 451
W + N++EC EPL D++ L G TA++ Y +G++ H + + + +
Sbjct: 472 WPTYTTNMAECAEPLIDYVDSLREPGRITAKI-YAGVESTEANPENGFMAHTQNNPYGWT 530
Query: 452 SADRGKVV-WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH 510
+ G V W P G W+ + WE+Y +T D ++++ YP+++ A+ L+ +
Sbjct: 531 NP--GWVFDWGWSPAGVPWILQNCWEYYEFTGDTEYMQTHIYPMMKEEATLYDQMLMRDN 588
Query: 511 DGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
DG L + PS SPEH + +T + ++I +++ I+AAE L +E A V
Sbjct: 589 DGKLVSVPSYSPEH---------GPRTAGNTYEHSLIWQLYEDTITAAETLGVDE-AKVA 638
Query: 571 KVLKSLPRLR-PTKIAEDGSIMEWV-QRRLNT 600
+ K+ L+ P ++ G I EW + LNT
Sbjct: 639 QWKKNQADLKGPIEVGASGQIKEWYNETTLNT 670
>gi|336439275|ref|ZP_08618890.1| hypothetical protein HMPREF0990_01284 [Lachnospiraceae bacterium
1_1_57FAA]
gi|336016192|gb|EGN45981.1| hypothetical protein HMPREF0990_01284 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1977
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 180/632 (28%), Positives = 300/632 (47%), Gaps = 87/632 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY----------TNPDAPKALSDVR 76
A+P+GN +GA V+GGV +E ++LNE +LW+G P D + K ++ ++
Sbjct: 68 ALPLGNSAIGASVFGGVQTERIQLNEKSLWSGGPSDSRPEYNGGNIESKGQNGKVMAQLK 127
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
+ SGQ ++ A +L G D Y G++ L+F + K Y R+LD
Sbjct: 128 EKLKSGQGFDSNLAG-QLIGVSDDAGVQGYGYYLSYGNMYLDFKNV-TKNNVSGYSRDLD 185
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V Y + +TRE+F S PD V+VT+++ ++ G+L F+V ++ +
Sbjct: 186 LRTAVAGVNYDLNGAHYTRENFVSYPDNVLVTRLTATDGGTLDFDVRVEP---DEEKGGS 242
Query: 190 NNQIIME--GRCPGKRIPPKANANDDP---KGIQFSAILEIKISDDRGTISALED--KKL 242
NQ + R K++ A A D ++FS+ ++ I DD GT ++D K
Sbjct: 243 QNQPGADSYARTFDKKVSDNAIAIDGQLTDNQLKFSSYTKV-IKDD-GTAGQIKDDSKNG 300
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL---------QSIRNLSYSDL 293
K+ S + ++ S D P + T E ++AL ++ Y L
Sbjct: 301 KITVSGAKAITIITSIGTDYKNDYPK-YRTGETKEQLAALVKGYVSGAEAKVKAGGYETL 359
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H++DY +F R+ + + ++ D TD E A + + E L +L
Sbjct: 360 KEDHVNDYDHIFGRLDLNIGQAVSDKTTDKLLE--------AYKKGTASETEKRYLELML 411
Query: 354 FQFGRYLLISSSRP-------------GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
FQ+GRYL + SSR T +NLQGIW + W S H+N+NL+MNY
Sbjct: 412 FQYGRYLTMGSSRETPVNEDGTKNERRATLPSNLQGIWVGANNSAWHSDYHMNVNLQMNY 471
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWVIHHKTDIWAKS 451
W + N++EC EPL D++ L G TA++ Y +G++ H + + + +
Sbjct: 472 WPTYTTNMAECAEPLIDYVDSLREPGRITAKI-YAGVESTEANPENGFMAHTQNNPYGWT 530
Query: 452 SADRGKVV-WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH 510
+ G V W P G W+ + WE+Y +T D ++++ YP+++ A+ L+ +
Sbjct: 531 NP--GWVFDWGWSPAGVPWILQNCWEYYEFTGDTEYMQTHIYPMMKEEATLYDQMLMRDN 588
Query: 511 DGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
DG L + PS SPEH + +T + ++I +++ I+AAE L +E A V
Sbjct: 589 DGKLVSVPSYSPEH---------GPRTAGNTYEHSLIWQLYEDTITAAETLGVDE-AKVA 638
Query: 571 KVLKSLPRLR-PTKIAEDGSIMEWV-QRRLNT 600
+ K+ L+ P ++ G I EW + LNT
Sbjct: 639 QWKKNQADLKGPIEVGASGQIKEWYNETTLNT 670
>gi|260589559|ref|ZP_05855472.1| putative fibronectin type III domain protein [Blautia hansenii DSM
20583]
gi|260540127|gb|EEX20696.1| putative fibronectin type III domain protein [Blautia hansenii DSM
20583]
Length = 1719
Score = 214 bits (546), Expect = 8e-53, Method: Compositional matrix adjust.
Identities = 169/600 (28%), Positives = 275/600 (45%), Gaps = 70/600 (11%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS- 77
+PIGN +GA V+G + E L N+ TLW G P G+ D + +S+V
Sbjct: 75 LPIGNSFMGANVYGEIGEERLTFNQKTLWNGGPSESRPNYDGGNKETADNGQKMSEVYKE 134
Query: 78 ---LVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE-ETYRRELDLN 131
L G +A + KL G YQ GDI ++F LK + E Y R+L+L
Sbjct: 135 IIKLYKEGNDTQANELAKKLTGEVEGYGAYQSWGDIYVDFG---LKEEQAENYVRDLNLE 191
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A A V + + + RE+F S PD V+ K + + L F++S +DN V
Sbjct: 192 NAVASVDFDYQDTKMHREYFISYPDNVLAMKFTADGNEKLDFDISFP--IDNAEGV---- 245
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGI-------QFSAILEIKISDDRGTISALEDKKLKV 244
+ GK + K DD + Q ++K+ + G + + KL V
Sbjct: 246 ----ADKKLGKSV--KTTVEDDMITVSGEMQDNQLKLNGKLKVETEGGKVQEKDGDKLHV 299
Query: 245 EGSDWAVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
G+ AV+ + A + + P ++ ++ + A+ Y + H+ DY
Sbjct: 300 SGASEAVVYVSADTDYLNKYPDYRTGETAQELDASVEKAVDKASKKGYEKVKKEHIKDYS 359
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
++F RV + L ++ + TD ++ + + ++ E+ +L +LFQ+GRYL I
Sbjct: 360 EIFSRVQLDLGQNVPEKTTDIL----LNDYNAGKNTEA----ENRALEVILFQYGRYLTI 411
Query: 363 SSSRPGTQVANLQGIWNEDLSPT----WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
+SSR G +NLQG+W + W S H+N+NL+MNYW + N++EC PL D+
Sbjct: 412 ASSRAGDLPSNLQGVWQNRVGDHNRIPWASDYHMNVNLQMNYWPTYSTNMAECATPLIDY 471
Query: 419 LTYLSINGSKTAQVNY-LASGWVIHHKTDI---WAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L G TA+ + + +G H + W D W P W+ +
Sbjct: 472 INSLVEPGKVTAKTYFGVENGGFTAHTQNTPFGWTCPGWD---FSWGWSPAALPWILQNC 528
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKL 533
WE+Y YT D ++E+ YP+L+ A LIE G L + P+ SPEH
Sbjct: 529 WEYYEYTGDVKYMEEHIYPMLKEAALLYDQILIEDEKTGRLVSAPAYSPEH--------- 579
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
V+ +T + ++I +++ +AAE+L K+ED E + +L+P +I E G I EW
Sbjct: 580 GPVTAGNTYEQSLIWQLYEDAATAAEILGKDEDKAKEWRQRQ-EKLKPIEIGESGQIKEW 638
>gi|153816042|ref|ZP_01968710.1| hypothetical protein RUMTOR_02288 [Ruminococcus torques ATCC 27756]
gi|331089120|ref|ZP_08338023.1| hypothetical protein HMPREF1025_01606 [Lachnospiraceae bacterium
3_1_46FAA]
gi|145846689|gb|EDK23607.1| LPXTG-motif cell wall anchor domain protein [Ruminococcus torques
ATCC 27756]
gi|330405897|gb|EGG85423.1| hypothetical protein HMPREF1025_01606 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 1966
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 176/636 (27%), Positives = 297/636 (46%), Gaps = 81/636 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY----------TNPDAPKALSDVR 76
A+P+GN +GA V+GGV +E ++LNE +LW+G P D + K ++ ++
Sbjct: 68 ALPLGNSAIGASVFGGVQTERIQLNEKSLWSGGPSDSRPEYNGGNIESKGQNGKVMAQLK 127
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
+ SGQ ++ A +L G D Y G++ L+F + K Y R+LD
Sbjct: 128 EKLKSGQGFDSNLAG-QLIGVSDDAGVQGYGYYLSYGNMYLDFKNV-TKNNVSGYSRDLD 185
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V Y + +TRE+F S PD V+VT+++ ++ G+L F+V ++ + N
Sbjct: 186 LRTAVAGVNYDLNGAHYTRENFVSYPDNVLVTRLTATDGGTLDFDVRVEPDEEKGGSQN- 244
Query: 190 NNQIIMEGRCPGKRIPPKANANDDP---KGIQFSAILEIKISDDRGTISALED--KKLKV 244
+ R K++ A A D ++FS+ ++ I DD GT ++D K K+
Sbjct: 245 KPEADSYARTFDKKVSDNAIAIDGQLTDNQLKFSSYTKV-IKDD-GTAGQIKDDSKNGKI 302
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL---------QSIRNLSYSDLYT 295
S + ++ S D P + T E ++AL ++ Y L
Sbjct: 303 TVSGAKAITIITSIGTDYKNDYPK-YRTGETKEQLAALVKGYVSGAEAKVKAGGYETLKE 361
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H++DY +F R+ + + ++ D TD E A + + E L +LFQ
Sbjct: 362 DHVNDYDHIFGRLDLNIGQAVSDKTTDKLLE--------AYKKGTASETEKRYLELMLFQ 413
Query: 356 FGRYLLISSSRP-------------GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
+GRYL + SSR T +NLQGIW + W S H+N+NL+MNYW
Sbjct: 414 YGRYLTMGSSRETPVNEDGTKNERRATLPSNLQGIWVGANNSAWHSDYHMNVNLQMNYWP 473
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWVIHHKTDIWAKSSA 453
+ N++EC EPL D++ L G TA++ Y +G++ H + + + ++
Sbjct: 474 TYTTNMAECAEPLIDYVDSLREPGRITAKI-YAGVESTEANPENGFMAHTQNNPYGWTNP 532
Query: 454 DRGKVV-WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
G V W P G W+ + WE+Y +T D ++++ YP+++ A+ L+ +G
Sbjct: 533 --GWVFDWGWSPAGVPWILQNCWEYYEFTGDTEYMQTHIYPMMKEEATLYDQMLMRDSEG 590
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
L + PS SPEH + +T + ++I +++ I+AAE L +E + +
Sbjct: 591 KLVSVPSYSPEH---------GPRTAGNTYEHSLIWQLYEDTITAAETLGVDEAKVAQWK 641
Query: 573 LKSLPRLRPTKIAEDGSIMEWV-QRRLNTSFSTCKL 607
P +I + G I EW + LNT + K+
Sbjct: 642 QNQADLKGPIEIGDSGQIKEWYNETTLNTDENGQKM 677
>gi|325269425|ref|ZP_08136042.1| fibronectin type III domain protein [Prevotella multiformis DSM
16608]
gi|324988346|gb|EGC20312.1| fibronectin type III domain protein [Prevotella multiformis DSM
16608]
Length = 847
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 141/511 (27%), Positives = 230/511 (45%), Gaps = 69/511 (13%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T +P+GNG+ GA V G + + ++ N+ TLW+G G T
Sbjct: 85 MTSCLPVGNGQFGATVMGQIVVDDVQFNDKTLWSGKLGGLT------------------- 125
Query: 84 YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
S +G Y G++ + S Y R LD+N A A V++S+
Sbjct: 126 -------STAAYGS----YLNFGNLLIR---SRGMKGVTDYVRYLDINDAVAGVRFSMDG 171
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-SYV---NGNNQIIMEGRC 199
V ++R +F+SNPD +V + + + G ++ ++L +H SY G I +G+
Sbjct: 172 VGYSRTYFASNPDSCVVIRYTATRGGMINTTLALKDQNGSHVSYTVDGPGRATITFDGQV 231
Query: 200 PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSS 259
ND+ + S +I D GT++ + ++V ++ + L +
Sbjct: 232 --------GRQNDEGEATPESYCCAARIVADGGTVTKNAEGLVEVSDANSMTVYLRGLTD 283
Query: 260 FDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
FD + +M+A+ R Y L H DY+ LF R + L + D
Sbjct: 284 FDAAAPEYVSGTEQLAGRAMAAVDGARRKGYDALLAAHKADYKSLFDRCLLTLCSTGSD- 342
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQGI 377
VP+ + + ++ D +L EL F +GRYLLISSSR + ANLQGI
Sbjct: 343 ------------VPTPQLISGYRADPQGNLFLEELYFSYGRYLLISSSRGVSLPANLQGI 390
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK----TAQVN 433
WN +P W + H NIN++MNYW + P NLSE P D++ + +
Sbjct: 391 WNNSNAPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVKPAWRRFARDMG 450
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
+ +GW + + +I+ G + + AW C HLW+HY YT+DR++L ++A+P
Sbjct: 451 KVDAGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHLWQHYAYTLDREYLRRQAFP 505
Query: 494 LLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
+++ + L L++G DG E SPEH
Sbjct: 506 VMKSAVDYWLRKLVKGADGTYECPEEWSPEH 536
>gi|46140003|ref|XP_391692.1| hypothetical protein FG11516.1 [Gibberella zeae PH-1]
Length = 798
Score = 214 bits (545), Expect = 1e-52, Method: Compositional matrix adjust.
Identities = 172/618 (27%), Positives = 308/618 (49%), Gaps = 64/618 (10%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+++A+ ++ P T G A++ P+GNG+LGA+ +G E + LN D+LW+G
Sbjct: 16 LVSAKELWSSKPASYTKQGSAEYLLRTGYPVGNGKLGAIHFGPPGREKINLNVDSLWSGG 75
Query: 60 PGD---YT--NPDAPK--ALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIEL 110
P + YT NP +PK L +R + + AT +L G + ++LG++ +
Sbjct: 76 PFEVDGYTGGNPSSPKFQYLPAIRDRI----FTNATGEMEELMGSGSHFGSNRVLGNLTI 131
Query: 111 EFDDSHLKYAEETYRRELDLNTATARVKYSV--GNVEFTREHFSSNPDQVIVTKISGSES 168
+FD +Y++ YRR LD+ T ++ G +F F S DQV V + + +
Sbjct: 132 QFDGLD-EYSD--YRRSLDMKTGIYETSFASKDGGSKFISSVFCSYSDQVCVYFLK-ANT 187
Query: 169 GSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP-GKRIPPKANANDDPKGIQFSAILEIKI 227
+ + +++ L Q +++ C G + P+G++++A L +
Sbjct: 188 RLPNIKIGIENKL--------VKQDLIKTTCKNGMALHTGMTQTGPPEGMKYAAALSVDR 239
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDS----KKDPTSESMSAL 282
S GT++ L D ++ V+ + + + A +++D N D DP A
Sbjct: 240 S--LGTVTCLNDGQIIVKPKNKRMAIFWAAETNYDQKAGNTDDGWAFKGPDPVPRVKKAS 297
Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
++ Y+ L H++D++KL ++ L DT + ++++T A+ +++++
Sbjct: 298 KTAATKGYAKLRKVHVEDFKKLEEAFTLNLP--------DTQNSKDVET---ADLIQAYK 346
Query: 343 TDE--DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
D DP L +LF RYLLI+SSR + ANLQG W E L W + H NINL+MNY
Sbjct: 347 YDGPGDPFLEGILFDLSRYLLITSSRENSLPANLQGRWTELLQAAWGADYHANINLQMNY 406
Query: 401 WQSLPCNLSECQEPLFDFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV 459
W + L+ Q+ +++++T + G++TA++ Y A+GWV+H++ +I+ +A +
Sbjct: 407 WVADQTGLAATQKSVWNYMTDTWVPRGTETAKLLYNATGWVVHNEMNIFGH-TAMKEVAG 465
Query: 460 WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLET 516
WA +P+ AW+ H+W+ ++YT D+ +L + YPL++G A F + L E DG L
Sbjct: 466 WANYPVAPAWMMQHVWDAFDYTQDKKWLSSQGYPLIKGVAEFWVSQLQEDAYTEDGSLVA 525
Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
P S E P CV Y +I +V + + AA+++ + + V+ V +L
Sbjct: 526 IPCNSAE---TGPT-TFGCVHYQQ-----LIHQVLDSTLIAADIVSEPDSDFVDSVSSTL 576
Query: 577 PRL-RPTKIAEDGSIMEW 593
RL + A G + EW
Sbjct: 577 KRLDKGLHFASWGGLKEW 594
>gi|374373770|ref|ZP_09631430.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
gi|373234743|gb|EHP54536.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
Length = 733
Score = 213 bits (542), Expect = 2e-52, Method: Compositional matrix adjust.
Identities = 172/589 (29%), Positives = 261/589 (44%), Gaps = 84/589 (14%)
Query: 15 ITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
I F P K + +PIGNGRLGAM+ GGV ++T++ NE +LW+G N D
Sbjct: 27 IWFAKPGLKWDAEGLPIGNGRLGAMMMGGVANDTIQFNEQSLWSGD----NNWDGAYETG 82
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
D H Y+ G + + FD + YRR L+L
Sbjct: 83 D----------------------HGFGSYRNFGALVVNFDGDK---SSSGYRRGLNLTDG 117
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
++ ++ RE F+S+PDQV+V + + +++G LS +SL S + GN+
Sbjct: 118 IYTASLTINKTQYKREAFASHPDQVMVFRYT-AQNGRLSGRISLHSAQGASARATGNSLQ 176
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
A P +Q++A ++ + + GT++ L D +L G L
Sbjct: 177 F---------------AGTMPNQLQYAA--KMLLQQEGGTVTTL-DSQLVFTGCKTLTLY 218
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A +++ P P L + +Y L H+ D+ L I +
Sbjct: 219 LDARTNYK-PDYTADWRGAAPRPVIEKELAAALRKTYEQLRAAHIKDFTALAAAAHIDVG 277
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVA 372
+P + +P+ R++ + DP L E +FQFGRYLLISSSRPG A
Sbjct: 278 TTPVAL----------RALPTDLRLQKYAAGGADPDLEETVFQFGRYLLISSSRPGGLPA 327
Query: 373 NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV 432
NLQG+WN +P W S H NIN++MNYW + NLS C PL D++ + +
Sbjct: 328 NLQGLWNNSNTPPWASDYHNNINIQMNYWAAENTNLSACHIPLIDYIVAQAEPCRIATRK 387
Query: 433 NYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ A+ GW I+ + W AW H++EH+ +T DRD+L+K
Sbjct: 388 AFGAATRGWTARTSQSIFGGNG-------WEWNIPASAWYAHHVFEHWAFTKDRDYLKKT 440
Query: 491 AYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIR 548
AYP+L+ +F D L + DG L SPEH P DG + D ++
Sbjct: 441 AYPVLKEICNFWEDRLKQLPDGSLVVPNGWSPEH---GPREDGVM--------HDQQLVW 489
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRR 597
++F + AA+ L + A KV RL P KI + G + EW + R
Sbjct: 490 DLFQNYLDAAKALN-TDPAYQLKVADMQRRLAPNKIGKWGQLQEWQEDR 537
>gi|189208288|ref|XP_001940477.1| alpha-fucosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187976570|gb|EDU43196.1| alpha-fucosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 814
Score = 213 bits (542), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 170/579 (29%), Positives = 269/579 (46%), Gaps = 59/579 (10%)
Query: 29 PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDA--PKALSDVRSLVDS 81
P+GNGRLGAM G +ETL LN D+LW+G P +YT NP AL +R +
Sbjct: 41 PLGNGRLGAMPVGPPAAETLTLNLDSLWSGGPFNISNYTGGNPHTLIASALPGIRDWI-- 98
Query: 82 GQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY 139
+ T L G + YQ+LG++ ++ Y R+LD++T T +
Sbjct: 99 --FTNGTGNVSALLGSNDNYGSYQVLGNLTVKIPSLSSDIVSN-YTRKLDMSTGTHTTTF 155
Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLS-FNVSLDSLL-----DNHSYVNGNNQI 193
+ F S PDQV V + + +G + V+LD++L N + V G+
Sbjct: 156 IANGNDLETTGFCSFPDQVCVYTVQSTGAGDVPPLEVTLDNVLVSPQLQNVTCVEGDTTK 215
Query: 194 IMEGRCPG-KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL----KVEGSD 248
R G ++ P P+G+++ +I + +S+ +S E+ L G+
Sbjct: 216 PAHLRLRGVTQLGP-------PEGMRYDSIARV-VSNSNTDVSCDENTGLLSIAPRSGTK 267
Query: 249 WAVLLLVASSSFDGPF----INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+++ A +++D N S +DP + + L RH+DD+ L
Sbjct: 268 SVSIVIGAGTNYDAKKGTAEHNYSFRGEDPALIVEATTLKAATKTLDQLRGRHIDDFTAL 327
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ L D + T R T DP L LL + RYL ISS
Sbjct: 328 TGLFELSLP--------DPLNSSQTQTSELINRYTVNNTSGDPYLESLLMENSRYLFISS 379
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+ NLQG W+E L W + H NIN +MN+W S L++ Q PL+D++T +
Sbjct: 380 SRPGSLPPNLQGRWSEGLETDWSADYHANINFQMNHWTSDQTGLTDLQSPLWDYMTDTWM 439
Query: 425 -NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G++TA + Y A GWV+H++ +I+ +A + WA +P+ AW+ H+++H++Y+ +
Sbjct: 440 PRGAETATLLYNAPGWVVHNEMNIFGH-TAMKSAAEWANYPIAAAWMMQHVFDHWDYSRN 498
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
+L K+ YPLL+G A F LD L + DG L NP SPEH C Y
Sbjct: 499 ATWLLKQGYPLLKGVAMFWLDQLQQDGYYKDGSLVVNPCNSPEHGGTT----FGCAHYQQ 554
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
+I +VF +I++ + + + + SL RL
Sbjct: 555 -----LIHQVFHSILAVQPTVADPDTVFLTNLTSSLHRL 588
>gi|323345397|ref|ZP_08085620.1| fibronectin type III domain protein [Prevotella oralis ATCC 33269]
gi|323093511|gb|EFZ36089.1| fibronectin type III domain protein [Prevotella oralis ATCC 33269]
Length = 801
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 161/562 (28%), Positives = 259/562 (46%), Gaps = 89/562 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNG+LGAM++GG+ + ++ NE TLWTG S + G Y
Sbjct: 49 ALPIGNGQLGAMIYGGIRQDIVQFNEKTLWTG------------------SAEERGSYQN 90
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV--GNV 144
A ++ G D + Y R LDL+ ATA +S G+
Sbjct: 91 FGALVIENIGGSYD-----------------RRGVYNYYRNLDLSNATAVASWSTADGDT 133
Query: 145 EFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
+TRE+ +SNP Q +V + S +++ L+ + +Y G EG GK
Sbjct: 134 VYTREYIASNPAQCVVIHMKASVPRAINNRFYLNDVHGRETYYQGK-----EGMFAGKLT 188
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
S +K++ GT++ D + V+ +D +++L A + ++
Sbjct: 189 T-------------VSYCARMKVAAVGGTVTTTNDG-IVVKHADEVMVILAAGTDYNA-- 232
Query: 265 INPSDSKKDPT--SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
+ PS S + + S ++ + LY+RH++DY+ + R +QL I TD
Sbjct: 233 VAPSYISHTTLLPSRIKNTVDSAVSMGWQALYSRHVEDYKAFYDRTDLQLGGVTNTIPTD 292
Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVE-LLFQFGRYLLISSSRPGTQVANLQGIWNED 381
ID ++++ D L+E L FQ+GRYLLISSSR NLQGIWN
Sbjct: 293 KL----IDGY-----AENYEHDNRYRLIEQLYFQYGRYLLISSSRGIDLPNNLQGIWNNS 343
Query: 382 LSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI---NGSKTAQVNYL-AS 437
P W H +IN++MNYW + NLSE E L +++ +++ A+V +
Sbjct: 344 NEPAWQCDMHADINVQMNYWLANSTNLSEMNEKLLNYIYNMALVQPQWKSYARVRLRQQN 403
Query: 438 GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
GW + +I+ +A + A GAWLC HLW+HY YT+DR+FL +A P++
Sbjct: 404 GWACFTENNIFGHCTAWQNNYCAA-----GAWLCAHLWQHYRYTLDREFLLHKALPVMVS 458
Query: 498 CASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY------SSTMDMAIIREVF 551
F L+ L++ DG E SPEH P + A Y ++ +++ +F
Sbjct: 459 QCEFWLERLVKATDGTYECPDEYSPEH---GPGTESAPGVYAIKPENATAHAQQLVKYLF 515
Query: 552 SAIISAAEVLEKNEDALVEKVL 573
SA + A ++ N+ A V+++
Sbjct: 516 SATLKAISIV-GNKAACVDRMF 536
>gi|358368279|dbj|GAA84896.1| similar to glycoside hydrolase family 95 protein [Aspergillus
kawachii IFO 4308]
Length = 810
Score = 211 bits (537), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 178/611 (29%), Positives = 274/611 (44%), Gaps = 87/611 (14%)
Query: 27 AIPIGNGRLG--------------------AMVWGGVPSETLKLNEDTLWTGVPGD---Y 63
A P+GNGRLG AM G E + LN D+LW G P + Y
Sbjct: 38 AFPLGNGRLGGSYFDQTSKGYYGRILKCSLAMPVGSYDKEIVNLNVDSLWRGGPFESPTY 97
Query: 64 T--NPDAPKA--LSDVRSLVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDD-SH 116
+ NP+ KA L +R + + T L G +P YQ+L ++ ++ S
Sbjct: 98 SGGNPNVSKAGALPGIREWI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGQLSD 153
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV 175
+ + YRR LDL++A +S G RE F S PD V V K+S + S ++F +
Sbjct: 154 I----DGYRRNLDLSSAVYSDHFSTGETYIEREAFCSYPDNVCVYKLSSNSSLPGITFGL 209
Query: 176 --SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
L S N S +GN+ + G+ P G+ ++A + + +
Sbjct: 210 ENQLTSPAPNVS-CHGNSISLY-----GQTYPVI--------GMIYNARVTVVVPGSSNA 255
Query: 234 ISALEDKKLKV-EGSDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNL 288
+KV EG L+ A +++D N S ++P ++ + A +
Sbjct: 256 SDLCSSLTIKVPEGEKEVFLVFAADTNYDASNGNSKASFSFKGENPYTKVLQAATNAAKK 315
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
+YS L + H+ DYQ +F+ ++ L P+ E + S+ DP
Sbjct: 316 TYSALKSSHVKDYQGVFNEFTLTLP-----------DPNGSADRPTTELLSSYSQPGDPY 364
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
+ LLF +GRYL ISSSRPG+ NLQG+W E SP W H NINL+MN+W L
Sbjct: 365 VENLLFDYGRYLFISSSRPGSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVEQTGL 424
Query: 409 SECQEPLFDFLTYLSI-NGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMG 466
E EPL+ ++ + G++TA++ Y S GWV H + + + +A + WA +P
Sbjct: 425 GELTEPLWTYMAETWMPRGAETAELLYGTSEGWVTHDEMNTFGH-TAMKDVAQWADYPAT 483
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPE 523
AW+ H+W+H++Y+ D + ++ YP+L+G A F L L++ DG L NP SPE
Sbjct: 484 NAWMSHHVWDHFDYSQDSTWYREKGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPE 543
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-T 582
H P C Y +I EVF ++ ++ + + L L P
Sbjct: 544 H---GPT-TFGCTHYQQ-----LIWEVFGHVLQGWTASGDDDTSFKNAITSKLSTLDPGI 594
Query: 583 KIAEDGSIMEW 593
I G I EW
Sbjct: 595 HIGSWGQIQEW 605
>gi|452002453|gb|EMD94911.1| glycoside hydrolase family 95 protein [Cochliobolus heterostrophus
C5]
Length = 805
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 176/602 (29%), Positives = 281/602 (46%), Gaps = 73/602 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KALSDVRSLV 79
A+P+GNGRL AM G +ETL LN D+LW+G P +YT NP + AL +R +
Sbjct: 38 ALPVGNGRLAAMPIGPPSAETLTLNLDSLWSGGPFEASNYTGGNPQSSIDSALPGIRDWI 97
Query: 80 DSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
+ T KL G + Y++L ++ + S + Y R+LDL
Sbjct: 98 ----FTNGTGNVTKLLGTNDNYGSYRVLANLTVAIP-SLVGSQVSNYTRKLDLANGLHST 152
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL-SFNVSL-----DSLLDNHSYV-NGN 190
++ + + F S PDQ+ V + SGSL +F + L D+ L+N + V NG
Sbjct: 153 SFNTNDTQLETTVFCSYPDQICVYTVQ--SSGSLPAFELKLGNELVDAKLENKTCVANGT 210
Query: 191 NQIIMEGRCPG-KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV---EG 246
R G ++ P P+G+ + I + + D L V +G
Sbjct: 211 GADSGHLRLRGVTQLGP-------PEGMLYDTIARLLPNSDVKATCDSNTGILTVTPGDG 263
Query: 247 SDWAVLLLVASSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+ A +++ A +++D S DP ++ + +L + HL+D+
Sbjct: 264 AKSATVIIGAETNYDMKKGTAEHQYSFRGNDPGPVVEETIRKASTKTLEELKSSHLEDFT 323
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ---TDEDPSLVELLFQFGRY 359
L R L P + N VP+ E + S+ T DP + LLF + +Y
Sbjct: 324 SLTGRFEFLL---PDPL--------NSAQVPTPELMASYDSNVTSGDPFVENLLFDYAQY 372
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSRPG+ NLQG W E ++P W + H NINL+MNYW + L+E Q PL+D++
Sbjct: 373 LLISSSRPGSLPTNLQGRWTEQMAPDWSADYHANINLQMNYWTADQTGLTETQTPLWDYM 432
Query: 420 TYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ G +TA + Y A GWV+H++ +I+ ++ G+ WA +P AW+ H+++++
Sbjct: 433 INTWVPRGHETAMLLYGAPGWVVHNEMNIFGHTAMKDGE-GWANYPAAPAWMMLHVFDYW 491
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH------DGYLETNPSTSPEHEFIAPDGK 532
+YT D +L + YPL+ A F WL + H D L NP +SPEH P
Sbjct: 492 DYTRDTTWLRTQGYPLIRSVAQF---WLSQLHADSFTNDNTLVVNPCSSPEH---GPT-T 544
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIM 591
C Y +I +VF A+++ ++ +++ V +L RL + + I
Sbjct: 545 FGCAHYQQ-----LIHQVFEAVLTTHSLVGESDTEFTSNVSSTLSRLDKGFHVGSWSQIK 599
Query: 592 EW 593
EW
Sbjct: 600 EW 601
>gi|358388157|gb|EHK25751.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
Length = 794
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 156/609 (25%), Positives = 285/609 (46%), Gaps = 55/609 (9%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDYTNPDAPKA-LSDVRSLVDS 81
+P+GNGR A V ET LNE + W+G G P+ PKA L + + +
Sbjct: 20 LPLGNGRFAASVLSSPAKETFILNEVSFWSGETQKAGGGLAERPEDPKAELRETQKCYLN 79
Query: 82 GQYAEATAASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEETYRRELDLNTATARVK 138
G YA+ + K + +G +++ + + REL L+ A A +
Sbjct: 80 GDYAKGKKRAEKYLESKKRNFGTNLGVGTLDIVVNGHESIGQVNGFERELRLDEAVAETR 139
Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGR 198
Y++ +F R F S+P+QV+V + G + L V + +N ++ + N +G+
Sbjct: 140 YTIDGRQFKRRSFLSHPNQVLVVQFDGDDLSGLEVVVGVQG--ENEAFTSKIND---DGK 194
Query: 199 CPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
+ +D G++ I+ + D G + D KL + +L+
Sbjct: 195 LEFNAQALETVHSDGTCGVKGYGIIAATV--DEGKVEH-RDTKLVISAKKNITILV---- 247
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
+F+ + P++ + T+ L+ LS +DL HL+D+Q L+ R+SI L
Sbjct: 248 TFNTDYSEPNEEWRKRTTLQ---LEEALKLSAADLLKAHLEDFQPLYRRMSISLGSKSST 304
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVA-NLQGI 377
+ + + PS DPS+ L F + RYL I+ +R + + +LQG+
Sbjct: 305 TASIRTDQRRQNFEPSGY--------ADPSMFALYFHYARYLTIAGTRHDSPLPLHLQGL 356
Query: 378 WN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
WN E W H++IN +MNY+ L S+ +PL ++L L+ +G A+ Y
Sbjct: 357 WNDGEACKMGWSCDYHLDINTQMNYFAILNGGFSDLMQPLINYLIRLAASGQHAARACYG 416
Query: 436 ASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
+ GWV H +++W AD G +V + L GG W+ HL E + Y++D F+ A+PL
Sbjct: 417 SEGWVAHVFSNVWG--FADPGWEVSYGLNVTGGLWMANHLIEMFEYSLDEGFMANDAWPL 474
Query: 495 LEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDG----KLACVSYSSTMDMAIIRE 549
L G + F L++++E G+L T PS SPE+ F +G + + + T+D+ ++R+
Sbjct: 475 LAGASKFFLNYMVEDPKTGWLLTGPSVSPENSFFVVNGDGEKEEHYAALAPTLDVVLVRD 534
Query: 550 VFS---AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV---------QRR 597
+ + +++ + N + +++ ++ +L P +I ++G + EW+ R
Sbjct: 535 LLAFCEYVVTKFNAGKSNWEDDIQQYQEAQAKLPPFQIGKNGQLQEWLHDFEEAQPYHRH 594
Query: 598 LNTSFSTCK 606
L+ + + C+
Sbjct: 595 LSHTMALCR 603
>gi|421218935|ref|ZP_15675822.1| large secreted protein [Streptococcus pneumoniae 2070335]
gi|395581532|gb|EJG42003.1| large secreted protein [Streptococcus pneumoniae 2070335]
Length = 458
Score = 210 bits (535), Expect = 2e-51, Method: Compositional matrix adjust.
Identities = 156/496 (31%), Positives = 241/496 (48%), Gaps = 56/496 (11%)
Query: 92 VKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFT 147
+ +F P D Y+LLG++ +E D A Y RELDL+TA + V + + N++
Sbjct: 4 LTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIK 62
Query: 148 REHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
RE+F+S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 63 REYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR--- 119
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 120 ---------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI- 166
Query: 266 NPSDSKKDPTSESMSALQSIRNLSYSDLYTR---HLDDYQKLFHRVSIQLSRSPKDIVTD 322
+S+LQ S D +T H+ YQ+ F+RV +L S +
Sbjct: 167 ------------DISSLQG--EFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL--- 209
Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
+I T E K + L LLF +GRYLLISSS+P ANLQGIW ++L
Sbjct: 210 -----SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDEL 260
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIH 442
+P W S +NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ H
Sbjct: 261 NPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAH 320
Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
H TD + ++ + A+W + WLCTH+WEHY Y D L + + +++ F
Sbjct: 321 HNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFF 379
Query: 503 LDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
D+L E DGYL T PS SPE+++ +G SST+D I+R + I A+ L
Sbjct: 380 EDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLG 438
Query: 563 KNEDALVEKVLKSLPR 578
N D + +K L R
Sbjct: 439 DNSDFISR--VKELKR 452
>gi|330819167|ref|YP_004348029.1| hypothetical protein bgla_2g00360 [Burkholderia gladioli BSR3]
gi|327371162|gb|AEA62517.1| hypothetical protein bgla_2g00360 [Burkholderia gladioli BSR3]
Length = 796
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 175/614 (28%), Positives = 283/614 (46%), Gaps = 103/614 (16%)
Query: 13 LKITFN---GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
L+++++ G + + +P+GNGRLGA+ G E L LNE TLW+G D +P
Sbjct: 65 LRLSYSQAAGESNILFEGLPLGNGRLGALTGGSPVREALYLNEITLWSGQK-DAVDP--- 120
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
Y A S YQ+LG + +E H + Y R LD
Sbjct: 121 -------------AYTAAGMGS----------YQMLGKLYVELP-GHAQ--ASGYSRSLD 154
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
++ A AR +Y G + RE F S+PD+V+V ++S S+ GS +SL + + V G
Sbjct: 155 ISNAVARTQYVAGGHTYRREVFCSHPDKVLVMRLS-SDGGSHDGTISL--VDGQGASVTG 211
Query: 190 NNQIIM-EGRCPG--KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+N I++ +G+ G +R A D +++ A +G ++ L
Sbjct: 212 SNGILLAQGKLDGVGERYATHVLAMPDSGTVKYDA--------SKGVLTMSRCPAL---- 259
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
L++ A +++ G DP + + + +L Y +L RHL DY LF
Sbjct: 260 ----TLIIAARTNYSGIEAEGYLGATDPAALARADASGAAHLPYRNLLERHLRDYTALFG 315
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSS 365
R S+ L +S + T+P + ++ D DP L L QFGRYL I+SS
Sbjct: 316 RFSLDLGKS--------SDAQRAMTIPDRLKARTASPDIADPELEALYVQFGRYLTIASS 367
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R G ANLQG+W+ + +P W + H +IN++MNYW + L ECQ+P D++ +
Sbjct: 368 R-GPLPANLQGLWSVNNTPPWMADYHTDINVQMNYWLADRAGLPECQKPFADYVLSQLPS 426
Query: 426 GSKTAQVNY-------------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
+++ Q ++ +GW I T I+ G + W P AW C
Sbjct: 427 WARSTQAHFNDAANSNYSNSSGKVAGWTIAISTGIY-------GGIGWDWSPPASAWYCR 479
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
LW HY YT+DRD+L + YP+L+ F LI + G L + SPEH D
Sbjct: 480 TLWNHYQYTLDRDYL-RAIYPVLKSACEFWQARLIVDPASGLLVDDRDWSPEHG----DH 534
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED-ALVEKVLKS---LPRLRPTKIAED 587
+ ++Y+ + + ++F+ +A+ L + D A L+S LP++ PT
Sbjct: 535 QELGITYAQEL----VWDLFTNYGTASGTLNLDTDFAATIAGLRSRLYLPKISPTT---- 586
Query: 588 GSIMEWVQRRLNTS 601
G + EW++ +++T
Sbjct: 587 GQLQEWMEDKVDTG 600
>gi|330915124|ref|XP_003296910.1| hypothetical protein PTT_07143 [Pyrenophora teres f. teres 0-1]
gi|311330715|gb|EFQ94998.1| hypothetical protein PTT_07143 [Pyrenophora teres f. teres 0-1]
Length = 755
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 163/588 (27%), Positives = 272/588 (46%), Gaps = 48/588 (8%)
Query: 29 PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVRSLVDSGQYA 85
P+GNGRLGAM G +ETL LN D+LW+G P +YT + +++ + +
Sbjct: 41 PLGNGRLGAMPVGPAAAETLTLNLDSLWSGGPFNISNYTGGNPHTSIASALPGIRDWIFI 100
Query: 86 EATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
T L G + YQ+LG++ ++ Y RELD++T ++
Sbjct: 101 NGTGNVSALLGSNDNYGSYQVLGNLTVKIPSLESSIISN-YTRELDISTGIHTTTFTANG 159
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLS-FNVSLDSLL-----DNHSYVNGNNQIIMEG 197
+ F S PDQV V + + +G + V+LD++L N + V+ N+
Sbjct: 160 NQLETTGFCSFPDQVCVYTVQSTGAGDIPPLEVTLDNVLVLPQLQNVTCVDRNSTQPAYL 219
Query: 198 RCPG--KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
R G + PP+ D + +A +++ + + G +S G+ +++
Sbjct: 220 RLRGVTQLGPPEGMRYDSIARVVSNAKIDMSCNHNAGLLSIAPRS-----GAKSVSIVVG 274
Query: 256 ASSSFDGPF----INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
A +++D N S +DP + L +RH+DD+ L +
Sbjct: 275 AGTNYDAKKGRAEHNYSFRGEDPAPIVEVTTLKAAAKTLDQLRSRHVDDFTALTGLFELS 334
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
L D + T R T DP L LL + RYL ISSSRPG+
Sbjct: 335 LP--------DPLNSSQTQTSELVNRYTVNNTGGDPYLESLLMENSRYLFISSSRPGSLP 386
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL--TYLSINGSKT 429
NLQG W+E L W + H NIN++MN+W + L++ Q PL+D++ T++ G++T
Sbjct: 387 PNLQGRWSEGLETDWSADYHANINIQMNHWTADQTGLTDLQSPLWDYMADTWMP-RGAET 445
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A + Y A GWV+H++ +I+ +A + WA +P+ AW+ H+++H++Y+ + +L
Sbjct: 446 ALLEYNAPGWVVHNEMNIFGH-TAMKSAAEWANYPISAAWMMQHVFDHWDYSRNATWLRT 504
Query: 490 RAYPLLEGCASFLLDWL---IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+AYP+L+G A+F L+ L + +D L NP SPEH C Y +
Sbjct: 505 QAYPMLKGVATFWLNQLQPDLYYNDNSLVVNPCNSPEHG----QTTFGCAHYQQ-----L 555
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
I +VF +I++ + + + + + SL RL I I EW
Sbjct: 556 IHQVFHSILAVQPTVADPDTSFLTTLTSSLARLDTGFHIGSFAQIKEW 603
>gi|225017021|ref|ZP_03706213.1| hypothetical protein CLOSTMETH_00943 [Clostridium methylpentosum
DSM 5476]
gi|224950188|gb|EEG31397.1| hypothetical protein CLOSTMETH_00943 [Clostridium methylpentosum
DSM 5476]
Length = 1158
Score = 209 bits (531), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 175/660 (26%), Positives = 303/660 (45%), Gaps = 101/660 (15%)
Query: 4 AESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
++S++ N L+I ++ PA + T+A+ IGNG +G MV+GGV + + +NE T+W G P +
Sbjct: 35 SQSSANDNLLRIWYDEPATDWQTEALAIGNGYMGGMVFGGVKRDKVHINEKTVWNGGPTE 94
Query: 63 ------YTNPDAPKALSDVRSLVD--SGQYAEATAASVKLFGHPADVYQ----------- 103
Y N + + D++ + D + + S +FG D YQ
Sbjct: 95 NNNRYNYGNTNPTETEEDLQKIKDDLNAIREKLDDKSEFVFGFDEDSYQSSGTSTRGEAM 154
Query: 104 -----LLGDIE-----LEFDDSHL------KYAEETYRRELDLNTATARVKYSVGNVEFT 147
L+GD+ ++ D + + A Y R+LD+ T A V Y V +T
Sbjct: 155 DWLNKLMGDLTGYSAPQDYADLFITNNAIDESAVTNYIRDLDMRTGLATVSYDYDGVHYT 214
Query: 148 REHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPK 207
RE+F+S PD V+V +++ + G ++FN +L GNN + G I K
Sbjct: 215 REYFNSYPDNVLVVRLTADQGGKINFNTNL------TDKTRGNN---LTNTAEGDTITMK 265
Query: 208 ANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP 267
++ + G++ A ++K+ + G IS ++ + V +D A L+L + + P
Sbjct: 266 SSLRSN--GLKVEA--QLKVVPEGGDIS-VDGSSINVANADAATLILACGTDYKMEL--P 318
Query: 268 SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEE 327
+ +DP + + + Y+DL H+ D+ LF R+ I + E
Sbjct: 319 TFRGEDPHAAVTGRISAAAEKGYADLKEDHVADHSALFSRMEIGFN-------------E 365
Query: 328 NIDTVPSAERVKSFQ-----------TDEDPSLVELL-FQFGRYLLISSSRPGTQVANLQ 375
I +P+ E +K ++ T+ + +E++ +QFGRYL I+ SR G+ NLQ
Sbjct: 366 EIPQIPTDELIKKYRNMVDNNGGEVPTEAEQRALEIICYQFGRYLTIAGSREGSLPTNLQ 425
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
G+W E S W H NIN++MNYW ++ NL+EC P D+L L G A +
Sbjct: 426 GVWGEG-SFAWGGDYHFNINVQMNYWPTMASNLAECHVPYNDYLNVLREAGRGAAAAAFG 484
Query: 436 -------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
+GW++ + + ++ + P G AW + +E+Y ++ D ++L+
Sbjct: 485 IKSEPGEENGWLVGCFSTPYMFATMGQKNNAAGWNPTGSAWALLNSYEYYLFSGDTEYLK 544
Query: 489 KRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
YP ++ A+F + L E Y+ + PS SPE+ + ++ D
Sbjct: 545 NELYPSMKEVANFWNEALYWSEYQQRYV-SGPSYSPEN---------GPIVNGASYDQQF 594
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLNTSFSTCK 606
I + F I AAE L +ED LV + +L P + +DG + EW + T+F +
Sbjct: 595 IWQHFENTIQAAETLGVDED-LVATWREKQSKLDPVIVGDDGQVKEWFEE---TTFGKAQ 650
>gi|291549437|emb|CBL25699.1| Uncharacterised Sugar-binding Domain [Ruminococcus torques L2-14]
Length = 1637
Score = 208 bits (530), Expect = 6e-51, Method: Compositional matrix adjust.
Identities = 172/655 (26%), Positives = 290/655 (44%), Gaps = 108/655 (16%)
Query: 5 ESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
E+ N L++ ++ PA + T ++ IGNG +G++V+GG+ + + +NE T+W G P Y
Sbjct: 38 ETAKNDNLLRVWYDEPATDWQTQSLAIGNGYMGSLVFGGINKDKIHINEKTVWEGGPTSY 97
Query: 64 ------------TNPDAPKALSDVRS----LVDSGQYA--------EATAASVKLFGHPA 99
T+ D K D+ + L D +Y EA+ + K G
Sbjct: 98 NGYSYGTTNKTETDADLQKIKDDLNAIREKLDDKSEYVFGFNEDSYEASGTNTK--GEAM 155
Query: 100 D-VYQLLGDI----------ELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTR 148
D + +L+GD+ L ++ Y R+LD+ TA A V Y V +TR
Sbjct: 156 DWLNKLMGDLVGYSAPKDYANLYISNNQDSSKVSNYVRDLDMRTALATVNYDYEGVHYTR 215
Query: 149 EHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY---VNGNNQIIMEGRCPGKRIP 205
E+F S PD V+ ++S + G ++F+ +L SL+ ++ V+G+ I M G +
Sbjct: 216 EYFDSYPDNVMAVRLSADQKGKINFDTNLQSLIGGRTHKSTVDGDT-ITMRDALGGNGLN 274
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISA---LEDKKLKVEGSDWAVLLLVASSSFDG 262
+A ++K+ ++ G++S+ + + V +D L+ + +
Sbjct: 275 IEA---------------QLKVINEGGSLSSNTNGSNPSITVSDADAVTLIFACGTDYKM 319
Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
PS +DP + + + Y L H+ D+ LF R+ + +
Sbjct: 320 EL--PSFRGEDPHDAVTARINAAAKKGYEALKKDHVADHDALFSRMELGFN--------- 368
Query: 323 TCSEENIDTVPSAERVKSFQT------------DEDPSLVELLFQFGRYLLISSSRPGTQ 370
E + T+P+ E +K ++ E +L + +QFGRYL I+ SR G
Sbjct: 369 ----EEVPTIPTDELIKKYRNMVDNNGGEVPTESEQRALEVICYQFGRYLTIAGSREGAL 424
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
NLQG+W E W H NIN++MNYW +L NL+ECQ D+L L G A
Sbjct: 425 PTNLQGVWGEGYFQ-WGGDYHFNINVQMNYWPTLASNLAECQTAYNDYLNVLKEAGRYAA 483
Query: 431 QVNY-------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ +GW++ + + S+ + P+G AW + +E+Y YT D
Sbjct: 484 AAAFGIKSDEGEENGWLVGCFSTPYMFSALGQKNNAAGWNPIGSAWALLNAYEYYLYTED 543
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+L+ YP L+ A+F + L E Y+ PS SPE+ + ++
Sbjct: 544 TDYLKNELYPSLKEVANFWNEALYWSEYQQRYVSA-PSYSPEN---------GPIVNGAS 593
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQR 596
D I + F I AAE L + D LVE+ + +L P + +DG + EW +
Sbjct: 594 YDQQFIWQHFENTIQAAETLGVDAD-LVEQWKEKQSKLDPVLVGDDGQVKEWYEE 647
>gi|187734699|ref|YP_001876811.1| glycoside hydrolase family protein [Akkermansia muciniphila ATCC
BAA-835]
gi|187424751|gb|ACD04030.1| glycoside hydrolase family 95 [Akkermansia muciniphila ATCC
BAA-835]
Length = 788
Score = 208 bits (529), Expect = 8e-51, Method: Compositional matrix adjust.
Identities = 165/596 (27%), Positives = 260/596 (43%), Gaps = 50/596 (8%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
P+++T + PA+ +T+ GNGRLG + +G P ET+ LNE +++ A +A
Sbjct: 28 PMQVTASTPARVWTEGYGTGNGRLGILSFGVFPKETVVLNEGSIFA-KKNFQMREGAAEA 86
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRREL 128
L R L G+Y A K P ++ YQ G +++EF + +Y+R L
Sbjct: 87 LDKARELCKEGKYRSADQLFRKNILPPGNIAGDYQQGGRLQVEFQGLP---SPSSYQRTL 143
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D+ A + G E T E ++ I+ + +++L+ + V
Sbjct: 144 DMRRGKATTRAQFGTGELTTEILAAPSSDCAAYHIACTMPSGCRVSLNLEHPDPSARIVA 203
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N ++EG+ +N + IL S R + + D +V
Sbjct: 204 QPNGWVLEGQ----------GSNGGTRFENTVVILAPGASVTRKGSTIILDSAREV---- 249
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQK 303
++++S S D P + P + S++A L + + L D + +
Sbjct: 250 ----MVLSSISTDYNIRKP----EAPLTHSLAAKNARILAKAQKAGWKKLAAETEDYFSR 301
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L R + L SP + T ++ ERVK Q +DP L+E LFQFGR+ I+
Sbjct: 302 LMTRCQVDLGDSPAGVSAMTTAQR-------LERVK--QGKKDPDLLEQLFQFGRFCTIA 352
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
+RPG LQG+WN +L W +NIN +MN W S L E Q DF+ L
Sbjct: 353 HTRPGQLPCGLQGLWNPELRAAWMGCYFLNINSQMNQWPSHVTGLGEFQSSYLDFVRSLR 412
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+G + A+ G+ H TD W ++ W M GAW C HL + Y +T D
Sbjct: 413 PHGEEFARF-IKRDGFCFGHYTDCWKRTYFSGNNPEWGASLMNGAWACAHLVDSYRFTGD 471
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK----LACVSYS 539
R+ L K++ P+LE A F++ W + +G + P SPE F APDG L+ VS
Sbjct: 472 REDL-KKSLPILESNARFIMSWFEDDGEGRYLSGPGVSPETGFYAPDGTGPNVLSYVSNG 530
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
++ D + RE I A L L+ K ++ L ++ I DG + EW Q
Sbjct: 531 TSHDQLLGREALRNYIYACGELGIRTPTLL-KAVQFLRKIPQPAIGPDGRVQEWRQ 585
>gi|393222962|gb|EJD08446.1| glycoside hydrolase family 95 protein [Fomitiporia mediterranea
MF3/22]
Length = 842
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 170/603 (28%), Positives = 280/603 (46%), Gaps = 74/603 (12%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP-------------DAPKALSD 74
+P+GNG LGAM+ GG E+ +LN ++LW+G P + +P + +A+
Sbjct: 56 LPVGNGFLGAMISGGTTQESTQLNIESLWSGGP--FADPGYNGGNKQLDEQSEIGQAMRS 113
Query: 75 VRSLVDSGQYA-----EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + ++ +A A + +G+ + L+ + + A Y R LD
Sbjct: 114 IRQKIFKSKHGTIDNVDALMAPIGAYGNYSSAGFLVSTLT-----NTPSSAISDYARFLD 168
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD---NHSY 186
L T AR ++ GN +FTRE F S P Q S + S +L +++ +
Sbjct: 169 LETGVARTIWTHGNYQFTRETFCSYPAQACAQNTSSTNPSGFSQTYALGAIIGLPPPNVT 228
Query: 187 VNGNNQIIMEGRC--PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N+ + G PG A + P GI +E + + L +
Sbjct: 229 CADNSTLRSSGLVSNPGMAYEILATVSVSPGGI-----IECNTVPNVNHTRKASNATLTI 283
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ ++ V +++D + + S DP S L S SYS+ H+ D
Sbjct: 284 SNATSMSIMWVGGTNYDAGAGDAAHSFSFRGSDPHEGLSSLLISASEKSYSEFVAEHISD 343
Query: 301 YQKLFH-RVSIQLSRSPKDIVTDTCSEENID-TVPSAERVKSFQTDE-DPSLVELLFQFG 357
++ + S+ L +NI+ VP+ + ++ D+ DP L LLF +G
Sbjct: 344 FKSALNPSFSLNLG-------------QNINLKVPTDKLKDVYRVDKGDPYLEWLLFNYG 390
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLL+SS+R G ANLQG W D W + HVNINL+MNYW + NL + + LFD
Sbjct: 391 RYLLVSSAR-GALPANLQGKWARDAGNPWSADYHVNINLQMNYWFAESTNL-DVTKSLFD 448
Query: 418 FL--TYLSINGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
F+ T++S G+ TAQV Y ++ GWV+H++ +I+ + +G WA +P AW+ H+
Sbjct: 449 FIEETWVS-RGTYTAQVLYNSTQGWVLHNEINIFGHTGMKQGDAEWADYPESNAWMMIHV 507
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDG 531
W+H+++T D + + + YPL++G ASF L+ LI DG L P SPE P
Sbjct: 508 WDHFDFTNDVAWWKAQGYPLVKGAASFHLNKLIPDERFKDGTLVVAPCNSPEQ----PPI 563
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSI 590
LAC +I ++F+A+ A + ++A + ++ R+ + I G +
Sbjct: 564 TLACAHAQQ-----VIWQLFNAVEKGAAAAGETDEAFLNEIKSKKGRMDKGIHIGSWGQL 618
Query: 591 MEW 593
EW
Sbjct: 619 QEW 621
>gi|309798858|ref|ZP_07693119.1| alpha-fucosidase [Streptococcus infantis SK1302]
gi|308117507|gb|EFO54922.1| alpha-fucosidase [Streptococcus infantis SK1302]
Length = 627
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 157/518 (30%), Positives = 254/518 (49%), Gaps = 63/518 (12%)
Query: 102 YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIV 160
Y GDI + F++ T Y R LD++ A Y+ F RE FSS PD V V
Sbjct: 12 YLAFGDIFMVFNNQKKGLENVTDYHRGLDISEAITTTSYTQDGTSFKRETFSSYPDDVTV 71
Query: 161 TKISGSESGSLSF---NVSLDSLLDNHSYVNGNNQIIMEGRCP--GKRIPPKANANDDPK 215
T ++ +L F N + L+ N Y + N +G I K D+
Sbjct: 72 THLTKKGDKTLDFTLWNSLTEDLIANGDY-SWENSKYKQGTVSVDSNGILLKGTVKDN-- 128
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDP 274
G+QF++ L IK G ++A +D L V G+ +A LLL A ++F NP ++ +KD
Sbjct: 129 GLQFASYLGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDI 181
Query: 275 TSESM--SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
E S +++ + Y L H+ DYQ LF+RV + L S + T
Sbjct: 182 DVEKTVKSIVEAAKAKDYETLKNDHIKDYQSLFNRVQLNLGGSKSNQTT----------- 230
Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAP 390
E ++++ + L EL FQ+GRYLLISSSR T ANLQG+WN +P W+S
Sbjct: 231 --KEALQTYNPTKGQKLEELFFQYGRYLLISSSRNRTDALPANLQGVWNAVDNPPWNSDY 288
Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGW 439
H+N+NL+MNYW + NL+E +P+ +++ + G SK Q N GW
Sbjct: 289 HLNVNLQMNYWPAYMSNLAETAKPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GW 344
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
++H + + ++ W P AW+ +++++Y +T D +L+++ YP+L+ A
Sbjct: 345 LVHTQATPFGWTTPGW-NYYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETA 403
Query: 500 SFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISA 557
F +L + D ++ ++PS SPEH ++ +T D +++ ++F + A
Sbjct: 404 KFWNSFLHYDKASDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEA 453
Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
A L+ ++D LV +V +L+P I +DG I EW +
Sbjct: 454 ANHLKVDQD-LVTEVKTKFDKLKPLHINQDGRIKEWYE 490
>gi|358400122|gb|EHK49453.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
206040]
Length = 788
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 171/604 (28%), Positives = 274/604 (45%), Gaps = 75/604 (12%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KAL 72
PA A P+GNG+LGAM G V + + LNE +LW+G P DY NP P AL
Sbjct: 29 PANIIMTAYPLGNGKLGAMPLGLVGEDIVVLNEHSLWSGGPFQNPDYIGGNPPGPVYTAL 88
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVY----QLLGDIELEFDDSHLKYAEETYRREL 128
+R + Q + L+G PAD Y + LG++ ++ +Y +Y R L
Sbjct: 89 PGIRDTIWQTQINNDIS---PLYGDPADYYYGNYETLGNLTVKIAGLS-QYT--SYNRAL 142
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-------------GSLSFNV 175
DL T + + FT F + PDQV V + +++ S + N+
Sbjct: 143 DLETGIHQTVFRSNGASFTTTTFCTFPDQVCVHNVQSTKALPAITIGLQDNARSSPASNL 202
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S D+ N ++ G Q + G + PKG +A EI I D T S
Sbjct: 203 SCDA---NGVHLRGQTQQDI-----GMIFDARVQVLSRPKGAACTASHEIVIPADSKTKS 254
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+ G+D+ +S++ S DP +S +++ SY+ LY
Sbjct: 255 V---TVIYAAGTDYDQKKGTKASNY-------SFKGVDPAPAVLSTIKAAAKESYNSLYN 304
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE-LLF 354
H+ D+ LF + ++ L S +N ++P+A+ ++ + D + +E LLF
Sbjct: 305 SHVKDHNALFSQFTLNLPDS-----------DNSASIPTAKLMEDYDDDIGNTFIENLLF 353
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
+GRYL I S RPG+ NLQGIW E L+P W + HV++N++MN+W + L + Q P
Sbjct: 354 DYGRYLFIGSCRPGSLPPNLQGIWTESLTPAWSADYHVDVNVQMNHWHTEQTGLGDIQGP 413
Query: 415 LFDFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L+DF+T + G++TA + Y A G+V + + + VW+ +P AWL +
Sbjct: 414 LWDFITDTWVPRGTETAALLYDAPGFVGFSNLNTFG-FTGQMNAAVWSDYPASAAWLMQN 472
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPD 530
+W+ Y+Y D + YPL++ A + + ++ +DG L P SPEH +
Sbjct: 473 VWDRYDYGRDTTWYRATGYPLMKAVAEYWIHEMVPDLYSNDGTLVAAPCNSPEHGWTT-- 530
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGS 589
C Y ++ E+F II + + +E V ++ +L P I G
Sbjct: 531 --FGCTHYQQ-----LVWELFDHIIQSWDATGDKNTTFLETVKETQAKLSPGIIIGWFGQ 583
Query: 590 IMEW 593
I EW
Sbjct: 584 IQEW 587
>gi|396466146|ref|XP_003837624.1| similar to glycoside hydrolase family 95 protein [Leptosphaeria
maculans JN3]
gi|312214186|emb|CBX94180.1| similar to glycoside hydrolase family 95 protein [Leptosphaeria
maculans JN3]
Length = 807
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 172/595 (28%), Positives = 271/595 (45%), Gaps = 64/595 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YTNPDAPKALSDVRSLVDSGQ 83
A P+GNGRLGAM +G ET+ LN D+LW+G P + YT + A++ +
Sbjct: 46 AYPLGNGRLGAMPFGPAGQETVNLNLDSLWSGGPFETVSYTGGNPTSAVAQALPGIRDWI 105
Query: 84 YAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYS 140
+ T +L G + Y++LG++ + + T + R LD+ +Y
Sbjct: 106 FTNGTGNVTELLGEDGNFGSYRVLGNLSVSIPSLQIGNVSITGFTRTLDIVNGIHTTRYK 165
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLS-FNVSLDSLLD-----------NHSYVN 188
V E F S PDQV V S SG L +SLD+ L +H +
Sbjct: 166 VDENEINTTVFCSYPDQVCV--YSAQSSGQLPVLQLSLDNELVTSELKTRTCEVDHVRMR 223
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFS-----AILEIKISDDRGTISALEDKKLK 243
G Q+ G G R A P+GI+ S AIL I ++ +++ + +
Sbjct: 224 GVTQV---GPPEGMRYDAIARVAS-PEGIKMSCINGTAILNITPNNGTNSVTVILGAETD 279
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ ++ FD F +DP + Q + +L H++D+
Sbjct: 280 YDQKK-------GTAEFDYSF-----RGEDPGPTVEATTQKAAAKTSVELVGAHVEDFTS 327
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L R + L TDT + T+ ER S T+ DP L LLF + YL IS
Sbjct: 328 LSERFKLSL--------TDTLNSLQTPTLDLIERYDSEDTNGDPYLESLLFDYSNYLFIS 379
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR G+ NLQG W+E L W H NINL+MN+W + L++ Q PL+D++
Sbjct: 380 SSRAGSLPPNLQGRWSEGLYAAWSGDYHANINLQMNHWTADQTGLTDLQSPLWDYMADTW 439
Query: 424 I-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ G++TA++ Y A GWV+H++ +I+ + G A + AW+ H+++H++Y+
Sbjct: 440 VPRGTETAELLYDAPGWVVHNEMNIFGHTGMKSGASW-ANYAAAAAWMMQHVYDHWDYSR 498
Query: 483 DRDFLEKRAYPLLEGCASFLLDWL---IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D +L+ + YPLL+G A F L L + +D L P SPEH P AC +
Sbjct: 499 DTAWLKSQGYPLLKGVAKFWLHQLQLDMFSNDNSLVVIPCNSPEH---GPT-TFACAHFQ 554
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW 593
+I ++F AI++ + ++ +++ A + SL L I G I EW
Sbjct: 555 Q-----VIHQLFDAILTLSPIVSESDTAFTTNISSSLKFLDTGFHIGSFGQIKEW 604
>gi|358396613|gb|EHK45994.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
206040]
Length = 793
Score = 206 bits (525), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 158/595 (26%), Positives = 274/595 (46%), Gaps = 52/595 (8%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVR 76
P IGNGR G + G + L LN+D++W G P YT + +L+
Sbjct: 28 PGNVLMTGYTIGNGRQGGLPLGIPGDDLLCLNDDSVWRGGPFSNSSYTGGNPSSSLAHFL 87
Query: 77 SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
+ + T L+G +D Y+ L ++ + KY+ Y+R LDL TA
Sbjct: 88 PGIQEFIFQNGTGDESALYGGSSDYGSYEALANLTVSIAGV-TKYSN--YKRTLDLETAL 144
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNVSLDSLLDNHSYVNGNNQI 193
+++ F F + PDQV V +S ++ ++F L+DN+ N
Sbjct: 145 HSAEFTANGASFQTVQFCTFPDQVCVYHVSSNKPLPDITF-----GLVDNYRT---NPAS 196
Query: 194 IMEGRCPGKRIPPKANANDDPK--GIQFSAILE-IKISDDRGTISALEDKKLKVEGSDWA 250
++ G + + A+D G++ A + S + T ++ L + A
Sbjct: 197 TVQCSSSGIWLSGRTVADDGEGLIGMKIDAQASALSSSGLKATCNSRGQTVLSTKSVKSA 256
Query: 251 VLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+++ + + +D N +++ DP + + ++ SY+ + RH+ D+ + F+
Sbjct: 257 TIVVASGTEYDAEKGNAANNYSFRGADPHPGVVKTINAVSKKSYNAILQRHVADHGEWFN 316
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
+ ++ L N V S E + ++ TD+ DP + LL +G+Y+ I+SS
Sbjct: 317 KFTLDLP-----------DPNNSAEVDSMELLTNYSTDKGDPFVEGLLIDYGKYMFIASS 365
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI- 424
RPG+ NLQG W D +P W S H+++N++MN+W L +PL+DF+TY +
Sbjct: 366 RPGSLPPNLQGSWAPDGNPAWSSDYHIDVNVQMNHWHVEKMGLGGLTDPLWDFMTYTWVP 425
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TA++ Y ASGWV T+I+ +A W+ AW+ H+W+ Y+Y D+
Sbjct: 426 RGTETARLWYNASGWVAFTNTNIFGH-TAQENDATWSDVAHDIAWMMAHVWDRYDYGRDK 484
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDG--KLACVSYS 539
++ YPL++G ASF +D L++ DG L NP SPEH P G C +
Sbjct: 485 NWYASVGYPLMKGVASFWMDLLVQDDYFKDGTLVANPCNSPEH---GPTGFQTFGCAQFQ 541
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
+I E+F II + + ++++ +S +L P + G I EW
Sbjct: 542 Q-----VIWELFDHIIKDWNASGDRDASFLKRLKESYGKLDPGVHVGSWGQIQEW 591
>gi|282881164|ref|ZP_06289851.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1]
gi|281304968|gb|EFA97041.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1]
Length = 1008
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 156/561 (27%), Positives = 257/561 (45%), Gaps = 91/561 (16%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T +PIGNG+ G V GGV + ++ N+ TLW G V ++V +
Sbjct: 206 MTSTLPIGNGQFGGCVMGGVKRDEVQFNDKTLWKG---------------HVGAVVGNPN 250
Query: 84 YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
Y Y G++ + DS L A YRR LD++ A A V Y+
Sbjct: 251 YGS---------------YLNFGNLYITSTDSRLN-AATNYRRWLDIDQAKAGVAYTANG 294
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDNHSY-VNGNNQII-MEGRCP 200
V++ RE+ S PD+VI SE G +S N+ L + +Y +NG +I +G P
Sbjct: 295 VDYQREYICSFPDKVIAIHYKASEKGKISNNIILFNQNGKTPTYNMNGTTGVITFQGEVP 354
Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
PKG + + ++ GTI+ +D + V+ +D + L +++F
Sbjct: 355 ---------RTGTPKGESY--YCKAYVTAKGGTIAVGKDGGIDVKNADEMFIYLYGTTNF 403
Query: 261 DGP---FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
D +I SD+ P S + + + Y+ + H++DY+ L+ R + ++++
Sbjct: 404 DASNDEYI--SDAALLP-SHVTGVVDAALSKGYAAICDAHVEDYKALYDRCQLNITKA-- 458
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQ 375
+ +V + + + F +L+ E+ F +GRYL+ISSSR +NLQ
Sbjct: 459 -----------MPSVTTRKLIADFAISPADNLLLEEIYFCYGRYLMISSSRGVDLPSNLQ 507
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING-------SK 428
GIWN +P W+S H NIN++MNYW + NLSE P FL Y+ +
Sbjct: 508 GIWNNVNNPAWNSDIHSNINVQMNYWPAEITNLSELHLP---FLKYIHREACERPQWRAN 564
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL-WPMGGAWLCTHLWEHYNYTMDRDFL 487
Q+ GW + + +I+ S W + + AW C HLW+HY +T+D+++L
Sbjct: 565 ARQIAGQTVGWTLTTENNIYGSGSN------WMQNYTIANAWYCMHLWQHYRFTLDKEYL 618
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ AYP + CA + L L++ DG E SPEH P + A + ++
Sbjct: 619 KNIAYPAMRSCAEYWLQRLVKAADGTYECPNEFSPEH---GPGSENA-----TAHSQQLV 670
Query: 548 REVFSAIISAAEVLEKNEDAL 568
++F+ + A L +EDA+
Sbjct: 671 WDLFNNTLQAIAELGISEDAI 691
>gi|340514441|gb|EGR44703.1| glycoside hydrolase family 95 [Trichoderma reesei QM6a]
Length = 755
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 166/589 (28%), Positives = 263/589 (44%), Gaps = 59/589 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KALSDVRSLV 79
A P+GNG+LGAM G V + + LNE +LW G P DY NP AP AL +R +
Sbjct: 3 AYPLGNGKLGAMPLGVVGEDIVVLNEHSLWAGGPFQSPDYIGGNPPAPVYTALPGIRETI 62
Query: 80 DSGQYAEATAASVKLFGHPADVY----QLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
Q +A L+G PA Y + LG++ + KY +Y R LDL T
Sbjct: 63 WKTQINNDISA---LYGDPAYYYYGNYETLGNLTVNIAGVS-KYT--SYNRALDLETGIH 116
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
++ +FT F + PDQV I S+ DSL N +
Sbjct: 117 TTEFKANGAKFTITTFCTFPDQVCAYNIQSSKPLPAVTIGLRDSLRSNPA---------S 167
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV-LLL 254
C + + D G+ F A ++ R T ++ + +G ++ ++
Sbjct: 168 NLTCDANGVHLRGQTQQD-IGMIFDARAQLINRPKRATCTSSHGLSVPSDGRTTSLTVVY 226
Query: 255 VASSSFD----GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
A +++D N S DP +S ++ + S++ +Y H+ D+ LF + S+
Sbjct: 227 AAGTNYDQKKGTKASNYSFKGVDPAPAVLSTIKKVSQKSFNSMYNAHIKDHNGLFSQFSL 286
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
L K +VP+A ++++ D DP + LLF +GRYL I S R G+
Sbjct: 287 DLPDPEKSA-----------SVPTATLMENYDYDLGDPFVENLLFDYGRYLFIGSCRDGS 335
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NGSK 428
NLQGIW E L+P W + HV++N++MN+W + L E Q PL+DF+ + G++
Sbjct: 336 LPPNLQGIWTESLTPAWSADYHVDVNVQMNHWHTEQTGLGEIQGPLWDFIIDTWVPRGTE 395
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA + Y A G+V + + + VW+ +P AWL ++W Y+Y+ D + +
Sbjct: 396 TAALLYDAPGFVGFSNLNTFG-FTGQMNAAVWSNYPASAAWLMQNVWNRYDYSRDTHWWK 454
Query: 489 KRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
YPL++ A + + ++ +DG L P SPEH + C Y
Sbjct: 455 TVGYPLMKSIAEYWIHEMVPDLYSNDGTLVAAPCNSPEHGWTT----FGCTHYQQ----- 505
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
++ EVF +I E +E V ++ +L P I G I EW
Sbjct: 506 LVWEVFDHVIEGWEASGDKNTTFLETVKETQSKLSPGIIIGWFGQIQEW 554
>gi|229829382|ref|ZP_04455451.1| hypothetical protein GCWU000342_01471 [Shuttleworthia satelles DSM
14600]
gi|229792545|gb|EEP28659.1| hypothetical protein GCWU000342_01471 [Shuttleworthia satelles DSM
14600]
Length = 1622
Score = 205 bits (521), Expect = 6e-50, Method: Compositional matrix adjust.
Identities = 175/665 (26%), Positives = 293/665 (44%), Gaps = 114/665 (17%)
Query: 6 STSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY- 63
+ + N L++ ++ PA + T ++ IGNG +G +V+GG+ + + +NE T+W G P
Sbjct: 39 NAKSDNLLRLWYDKPASDWQTQSLAIGNGYMGGLVFGGINQDRIHINEKTVWEGGPDGKS 98
Query: 64 ------TNPDAPKA--------LSDVRSLVDSGQYAEATAASVKLFGHPADVYQ------ 103
TNP + + L+++R +D S +FG + YQ
Sbjct: 99 TYSYGTTNPISTEEDLQKIKDNLNEIRQKLDD--------KSEHVFGFDENSYQASGTDT 150
Query: 104 ----------LLGDIELEFDDSHLKYAE------------ETYRRELDLNTATARVKYSV 141
L+GD L+ D+ YA Y R+LD+ TA A V Y
Sbjct: 151 KGEAMDALNKLMGD--LKGYDAPTDYANLYISNDQDPSKVTNYVRDLDMRTALATVSYDY 208
Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG 201
V + RE+F+S PD ++ ++S + G +SF +L++L+ +Y N ++ G
Sbjct: 209 EGVHYCREYFNSYPDNIMAVRLSADKDGKISFKTNLENLIGGDAYTN-----VVRGDTIT 263
Query: 202 KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK---KLKVEGSDWAVLLLVASS 258
R D +G A ++K+ ++ G+IS+ E+ ++V G++ L+ +
Sbjct: 264 MR--------DALRGNGLKAEAQLKVINEGGSISSDENDGKPAIRVSGANAVTLIFACGT 315
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
+ P+ +DP +Q+ Y L H++D+ LF R+ +
Sbjct: 316 DYKMEL--PNFRGEDPHKAVKKRIQAAAKKGYQVLKKDHVEDHSALFSRMELGFDEEIPQ 373
Query: 319 IVTD-------TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
I TD E N +P + E +L + +QFGRYL I+ SR G+
Sbjct: 374 IPTDELIRRYRNMVENNGGQIP--------MSAEQRALEVMCYQFGRYLTIAGSREGSLP 425
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
NLQG+W E TW H NIN++MNYW ++ NL EC +P DFL L G A
Sbjct: 426 TNLQGVWGEGFF-TWYGDYHFNINVQMNYWPTMASNLGECMKPYNDFLNVLKEAGRNAAA 484
Query: 432 VNY-------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+Y +GW++ + + S+ + P+G AW + +E+Y YT D
Sbjct: 485 ASYGIKSREGEENGWLVGCFSTPYMFSALGQKNNAAGWNPIGSAWALLNSYEYYLYTGDT 544
Query: 485 DFLEKRAYPLLEGCASF---LLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
+L ++ YP ++ A+F L W E Y+ + PS SPE+ + ++
Sbjct: 545 QYL-RQLYPSMKEVANFWNKALYW-SEYQQRYV-SAPSYSPEN---------GPIVNGAS 592
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRLNTS 601
D I + I AAE L + D LV + + +L P + + G + EW + TS
Sbjct: 593 YDQQFIWQHLENTIHAAETLGLDGD-LVAEWKEKQSKLDPVIVGKSGQVKEWFEE---TS 648
Query: 602 FSTCK 606
F +
Sbjct: 649 FGKAQ 653
>gi|383113206|ref|ZP_09933980.1| hypothetical protein BSGG_4923 [Bacteroides sp. D2]
gi|313697388|gb|EFS34223.1| hypothetical protein BSGG_4923 [Bacteroides sp. D2]
Length = 765
Score = 204 bits (520), Expect = 9e-50, Method: Compositional matrix adjust.
Identities = 179/612 (29%), Positives = 285/612 (46%), Gaps = 120/612 (19%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
+K+ + PA+++ T A+PIGNG LG + +GG+ E L+ NE TLWTG
Sbjct: 32 MKLWYTRPAQNWMTSALPIGNGELGGLFFGGIACERLQFNEKTLWTG------------- 78
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
S+ + YQ G++ ++F + + + + Y REL L+
Sbjct: 79 -SETKR----------------------GAYQSFGNLYIDFAEHNGEAVD--YCRELCLD 113
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISG-SESGSLSFNVSLDSLLDNHSYVNGN 190
A V Y + V++ RE+F+S PD+VIV +I+ G L+ +V L+ D+H
Sbjct: 114 NAIGSVSYEMNGVKYRREYFASYPDRVIVMRITTPGMKGRLNLSVRLE---DSHF----- 165
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQ-----FSAILEIKISDDRGTISALEDKKLKVE 245
+ + N + GIQ S ++K+ +++G +S + D +L V
Sbjct: 166 ---------------GQLSVNKNILGIQGQLDLLSYDAQVKVLNEKGQLSVV-DNRLTVC 209
Query: 246 GSDWAVLLLVASSSFDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+D +LLVA ++F+ I+ +D S +D E + L + +Y+ L HL DY
Sbjct: 210 DADAVTILLVAGTNFN---ISATDYLGTSSEDLHKELYTRLSNASRKNYAALKNIHLKDY 266
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q LF RV + L + ++ P+ E V++ + E L L FQ+GRYL+
Sbjct: 267 QSLFSRVKLDL-------------QADMPEYPTDELVRNHK--ESRYLDMLYFQYGRYLM 311
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
+ SSR NLQGIWN D +P W+ H NIN++MNYW + NL EC P FL Y
Sbjct: 312 LGSSRGMNLPNNLQGIWNADNTPPWECDIHSNINIQMNYWPAEITNLPECHLP---FLQY 368
Query: 422 LSI------NGS--KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
+++ NGS + AQ L GW I + +I+ S W + AW CTH
Sbjct: 369 IAVEAVGKPNGSWRRIAQGEGL-RGWTIKTQNNIFGYSD-------WNINRPANAWYCTH 420
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DG 531
LW+HY Y D ++L A+P+++ + D L E DG L SPE P DG
Sbjct: 421 LWQHYAYNNDLEYLRNIAFPVMQSTCKYWFDRLKENKDGKLVAPDEWSPEQ---GPWEDG 477
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSI 590
V+Y+ + + E A+ + +V + ++ V ++ +L + G I
Sbjct: 478 ----VAYAQQLVWQLFNETLHAVEALKKVDIQIDNVFVSELADKFRKLDNGVSVGSWGQI 533
Query: 591 MEWVQRRLNTSF 602
EW + + F
Sbjct: 534 KEWKEDKGKLDF 545
>gi|325855022|ref|ZP_08171738.1| hypothetical protein HMPREF9303_0271 [Prevotella denticola CRIS
18C-A]
gi|325484000|gb|EGC86940.1| hypothetical protein HMPREF9303_0271 [Prevotella denticola CRIS
18C-A]
Length = 753
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 148/513 (28%), Positives = 228/513 (44%), Gaps = 73/513 (14%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T +PIGNG+ GA + G V + ++ N+ TLW+G G T S D G
Sbjct: 1 MTSCLPIGNGQFGATLMGQVAVDDVQFNDKTLWSGKLGGLT------------STADYGS 48
Query: 84 YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
Y FG+ F SH Y R LD+N A A V++ +
Sbjct: 49 YLN--------FGNL-------------FISSHGMKKVTDYVRYLDINNAVAGVQFCMDG 87
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY----VNGNNQ--IIMEG 197
V + R +F+SNPD IV + + S+ G +S ++L + N Y V+ NQ I +G
Sbjct: 88 VAYRRTYFASNPDSCIVIRYTASQRGKISTTLAL--MDQNGGYVRYVVDKVNQATITFDG 145
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
+ A D S ++ + G + ++V +D + L
Sbjct: 146 QI--------ARQKDGGAATPESYCCTARVVTEGGKVRKNAKGLIEVSNADCMTIYLRGL 197
Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
+ FD S + + + S + Y+ L H DY+ LF R L S
Sbjct: 198 TDFDPDAPEYVAGSGRLASRAAATVDSAQRKGYAALLAAHKADYRSLFDRCQFTLGDSKA 257
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQ 375
DI T + + S++ + +L EL F +GRYLLISSSR + ANLQ
Sbjct: 258 DIST-------------PQLISSYRDNPHDNLFLEELYFSYGRYLLISSSRGISLPANLQ 304
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL---TYLSINGSKTAQ- 431
GIWN +P W + H NIN++MNYW + P NLSE P D++ + + + A+
Sbjct: 305 GIWNNSNTPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVRPSWHRFAKD 364
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
+ ++ +GW + + +I+ G + + AW C HLW+HY YTMDR++L RA
Sbjct: 365 MGHVDAGWTLPTENNIYGS-----GTTFADTYTVANAWYCQHLWQHYMYTMDREYLRTRA 419
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
+ +++ + L L++ DG E SPEH
Sbjct: 420 FSVMKSAVDYWLRKLVKASDGTYECPDEWSPEH 452
>gi|358383160|gb|EHK20828.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
Length = 791
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 154/593 (25%), Positives = 273/593 (46%), Gaps = 51/593 (8%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVR 76
P IGNGR G + G ++ L LN+D++W G P YT + +L+
Sbjct: 29 PGNVLMTGYTIGNGRQGGLPLGIPGNDLLCLNDDSIWRGGPFANSSYTGGNPSSSLAHFL 88
Query: 77 SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
+ + T +L+G AD Y+ L ++ + Y++ Y+R LDL TA
Sbjct: 89 PGIQEAIFQNGTGDESELYGGTADYGSYEALANLTVSIAGV-TNYSK--YKRTLDLETAL 145
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNVSLDSLLDNHSYVNGNNQI 193
+++ F+ F S PDQV V +S ++ ++F L+DN+ N
Sbjct: 146 HSAEFTANGATFSTVQFCSFPDQVCVYHVSSNKPLPQITF-----GLVDNYRT---NPPS 197
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK---LKVEGSDWA 250
++ G + + AND I + + G + + L + + A
Sbjct: 198 TVKCSSSGIWLSGRTVANDGEGLIGMKIDAQARALPSAGLKAICNSQGQTVLSTKSAKSA 257
Query: 251 VLLLVASSSFDGPFINPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+++ + + +D N + + DP + + ++ SY+ + H+ D+ + F+
Sbjct: 258 TIVVASGTEYDATKGNAAHNYSFRGVDPYPGVVKTINAVSKKSYNTILQSHVKDHGEWFN 317
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
+ ++ L D + ++DT+ E + ++ T++ DP + LL ++G+Y+ I+SS
Sbjct: 318 KFTLDLP--------DPHNSADVDTM---ELLTNYTTEKGDPFVENLLIEYGQYMFIASS 366
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI- 424
RPG+ NLQG W D +P W S H+++N++MN+W L +PL+DF+TY +
Sbjct: 367 RPGSLPPNLQGSWAPDGNPAWSSDYHIDVNVQMNHWHVEKMGLGGLTDPLWDFMTYTWVP 426
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TA + Y SGWV T+I+ +A W+ AW+ H+W+ Y+Y D+
Sbjct: 427 RGTETASLWYNVSGWVAFTNTNIFGH-TAQENDATWSNVAHDIAWMMAHVWDRYDYGRDK 485
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
+ YPL++G ASF +D ++ DG L NP SPEH P C +
Sbjct: 486 KWYASVGYPLMKGVASFWVDMMVPDEYFKDGTLVANPCNSPEH---GPT-TFGCAQFQQ- 540
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
++ E+F II + + A +++V +S +L P + G I EW
Sbjct: 541 ----VVWELFDHIIKDWDASGDTDTAFLKRVKESYSKLDPGVHVGSWGQIQEW 589
>gi|421287924|ref|ZP_15738687.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58771]
gi|395886487|gb|EJG97503.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58771]
Length = 717
Score = 204 bits (518), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 149/513 (29%), Positives = 241/513 (46%), Gaps = 51/513 (9%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + + + +L F + L D S + C I K D+
Sbjct: 87 VQRFTKEGAETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E N+D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLN 307
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 308 VNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423
Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 474
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ED L E KS L P +I + G I EW +
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|358381765|gb|EHK19439.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
Length = 788
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 163/596 (27%), Positives = 271/596 (45%), Gaps = 59/596 (9%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KAL 72
PA A P+GNG+LGAM G V + + LNE +LW+G P DY NP AP AL
Sbjct: 29 PANIIMTAYPLGNGKLGAMPLGLVGEDIVVLNEHSLWSGGPFESPDYIGGNPPAPVYTAL 88
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEETYRREL 128
+R + + Q +A L+G P Y+ LG++ ++ +Y+ +Y R L
Sbjct: 89 PGIRETIWNTQINNDISA---LYGDPTYYHYGNYETLGNLTVKIAGVS-RYS--SYNRAL 142
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL T + ++ +FT F + PDQV + ++ L DN
Sbjct: 143 DLETGIHQTAFTSNGAKFTITTFCTFPDQVCAYNVQSNKP----LPAVTIGLQDNQ---- 194
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+ C + + D G+ F A ++ + T ++ + + +G
Sbjct: 195 -RSSPSSNSSCDANGVRLRGQTQQD-IGMIFDARAQVLNRPRKATCTSSHELLVPSDGKT 252
Query: 249 WAV-LLLVASSSFD----GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+V ++ A +++D N S DP +S +Q++ S+S +Y H+ D+
Sbjct: 253 ASVTVVYAAGTNYDQKKGTKASNYSFKGVDPAPAVVSTIQAVEKKSFSSMYNAHVKDHNT 312
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLI 362
LF + ++ L S + +VP+A ++++ + DP + LLF +GRYL I
Sbjct: 313 LFSQFTLNLPDSEHSV-----------SVPTATLMENYDYNVGDPFVENLLFDYGRYLFI 361
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
S R G+ NLQGIW E+ P W S HV++N++MN+W + L + Q PL+DF+
Sbjct: 362 GSCRDGSLPPNLQGIWTENQFPAWSSDYHVDVNVQMNHWHTEQTGLGDIQGPLWDFIIDT 421
Query: 423 SI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+ G++TA++ Y A G+V + + + VW+ +P AWL ++W Y+Y
Sbjct: 422 WVPRGTETAELLYDAPGFVGFSNLNTFG-FTGQMNSAVWSNYPASAAWLMQNVWNRYDYG 480
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
D + + YPL++ A + + ++ +DG L P SPEH + C Y
Sbjct: 481 RDTHWWKTVGYPLMKSVAEYWIHEMVPDLYSNDGTLVAAPCNSPEHGWTT----FGCTHY 536
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
++ EVF II + E +E V ++ +L P I G I EW
Sbjct: 537 QQ-----LVWEVFDHIIDSWEDSGDTNTTFLETVKETQSKLSPGIIIGWFGQIQEW 587
>gi|327313293|ref|YP_004328730.1| hypothetical protein HMPREF9137_1029 [Prevotella denticola F0289]
gi|326946180|gb|AEA22065.1| conserved hypothetical protein [Prevotella denticola F0289]
Length = 753
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 147/513 (28%), Positives = 229/513 (44%), Gaps = 73/513 (14%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T +PIGNG+ GA + G V + ++ N+ TLW+G G T S D G
Sbjct: 1 MTSCLPIGNGQFGATLMGQVAVDDVQFNDKTLWSGKLGGLT------------STADYGS 48
Query: 84 YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
Y FG+ F SH Y R LD+N A A V++ +
Sbjct: 49 YLN--------FGNL-------------FISSHGMRKVTDYVRYLDINNAVAGVQFCIDG 87
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY----VNGNNQ--IIMEG 197
V + R +F+S+PD IV + + S+ G +S ++L + N Y V+ NQ I +G
Sbjct: 88 VAYRRTYFASSPDSCIVIRYTASQRGKISTTLAL--MDQNGGYVRYVVDKVNQATITFDG 145
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
+ A D S ++ + G + ++V +D + L
Sbjct: 146 QI--------ARQKDGGAATPESYCCTARVVTEGGKVRKNARGLIEVINADCMTVYLRGL 197
Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
+ FD + + + S + Y+ L H DY+ LF R + L S
Sbjct: 198 TDFDPDAPEYVAGAGRLAGRAAATVDSAQRRGYAALLAAHKADYRSLFDRCQLTLGDSKA 257
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQ 375
DI T + + S++ + +L EL F +GRYLLISSSR + ANLQ
Sbjct: 258 DIST-------------PQLISSYRDNPHDNLFLEELYFSYGRYLLISSSRGVSLPANLQ 304
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL---TYLSINGSKTAQ- 431
GIWN +P W + H NIN++MNYW + P NLSE P D++ + + + A+
Sbjct: 305 GIWNNSNTPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVRPSWHRFAKD 364
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
+ ++ +GW + + +I+ G + + AW C HLW+HY YTMDR++L RA
Sbjct: 365 MGHVDAGWTLPTENNIYGS-----GTTFADTYTVANAWYCQHLWQHYMYTMDREYLRTRA 419
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
+P+++ + L L++ DG E SPEH
Sbjct: 420 FPVMKSAVDYWLRKLVKASDGTYECPDEWSPEH 452
>gi|358390062|gb|EHK39468.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
206040]
Length = 797
Score = 202 bits (515), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 171/637 (26%), Positives = 293/637 (45%), Gaps = 81/637 (12%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDYTNPDA 68
++ + P+ F ++ +GNGR A V ET LNE T W+G G P+
Sbjct: 6 RLYYTTPSTSFPTSLALGNGRFAASVLSSPEHETFLLNEVTFWSGEARNAGEGLAERPED 65
Query: 69 PKA-LSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLKYA 120
PKA L ++ +G YA+ + K FG V +L DI + H A
Sbjct: 66 PKAELRKTQNCYLNGDYAQGKKRAEKYLESKKNNFGTNLGVGKL--DIAV---TGHGNPA 120
Query: 121 E-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ + + REL + A +Y V ++ R F S+P QV+V + G + L VS
Sbjct: 121 DIQDFERELRFDEAITETRYKVNGHQYKRRAFLSHPHQVLVIQFDGDDLSGLEVAVS--- 177
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANA-----NDDPKGIQFSAILEIKISDDRGTI 234
V G N+ R+ A A +D G++ I+ K+++ +
Sbjct: 178 -------VQGENEAFTSKVNSESRLEFDAQALETVHSDGTCGVKGFGIVAAKVNEGK--- 227
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+D KL + + + ++ ++ +S+ + ++ ++ + L DL
Sbjct: 228 VEQKDGKLTISAQKSITIFVAFNTDYN-------ESRNEWRERTLLQIEDVLQLPIDDLL 280
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVEL 352
HL DYQ L+ R+ I+L PK S N +P+ +R +F++ DP + L
Sbjct: 281 KEHLGDYQPLYRRMDIRLG--PK-------SNPN-SNIPTDQRRGNFESSGYADPGMFAL 330
Query: 353 LFQFGRYLLISSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
F + RYL I+ +R + + +LQG+WN E W H++IN +MNY+ L L+
Sbjct: 331 YFHYSRYLTIAGTREDSPLPLHLQGLWNDGEACKMGWSCDYHLDINTQMNYFAILNSGLA 390
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRG-KVVWALWPMGG 467
+ +PL+ ++ L++ G +TA+ Y + GWV H ++ W + D G ++ + L GG
Sbjct: 391 DLMKPLYKYIFKLAVKGQQTARTCYGSREGWVAHVFSNAWGFT--DPGWEISYGLNVTGG 448
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
W+ L E Y YT+D + +PLL G F LD++IE G+L T PS SPE+ F
Sbjct: 449 LWMAAPLIEMYEYTLDDGLMMTNLWPLLFGATKFWLDYMIEDPKTGWLLTGPSVSPENSF 508
Query: 527 --IAPDG--KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE----DALVEKVLKSLPR 578
+ DG + S T+D+ ++R++F+ A L+ D +++ K L +
Sbjct: 509 FVVNEDGTKEEHSADLSPTLDVVLLRDLFAFCEYFAGKLKTMTGFPWDEDIKEYQKVLAK 568
Query: 579 LRPTKIAEDGSIMEWV---------QRRLNTSFSTCK 606
L P +I ++G + EW+ R L+ + + C+
Sbjct: 569 LPPLQIGKNGQLQEWLHDYEEAQPYHRHLSHTMALCR 605
>gi|421230262|ref|ZP_15686926.1| fibronectin type III domain protein [Streptococcus pneumoniae
2061376]
gi|395593788|gb|EJG54030.1| fibronectin type III domain protein [Streptococcus pneumoniae
2061376]
Length = 717
Score = 202 bits (513), Expect = 6e-49, Method: Compositional matrix adjust.
Identities = 150/524 (28%), Positives = 246/524 (46%), Gaps = 73/524 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + + + +L F + L L + Y ++ I+M+GR
Sbjct: 87 VQRFTKEGAETLDFTIELSLTCDLASDEKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|210632036|ref|ZP_03297176.1| hypothetical protein COLSTE_01069 [Collinsella stercoris DSM 13279]
gi|210159752|gb|EEA90723.1| F5/8 type C domain protein [Collinsella stercoris DSM 13279]
Length = 1203
Score = 201 bits (511), Expect = 9e-49, Method: Compositional matrix adjust.
Identities = 162/607 (26%), Positives = 276/607 (45%), Gaps = 78/607 (12%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG------DYTNPD---APKALSDVR 76
DA+ IGNG+ GA+++G V + + NE TLWTG P D N D L +R
Sbjct: 72 DALVIGNGKTGAILFGQVAQDKVHFNEKTLWTGGPSKSRPNYDGGNKDQAVTKHQLDALR 131
Query: 77 SLVDSGQ---YAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAE-ETYRRELDL 130
+ +D + T +++G + YQ GD+E +F + + Y R+LD+
Sbjct: 132 AKMDDHSKDVFPMGTQIPTEVWGDGNGMGAYQDFGDLEFDFSPMGATNSNIQNYERDLDM 191
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TA + V Y V +TRE+ +S+P V+ ++ S+ G +SF++ + S + + +
Sbjct: 192 RTAVSTVSYDFNGVHYTREYLASHPAGVVAVRLDASKDGEISFDLGVGSAKGLNVRASAD 251
Query: 191 -NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+++ G + + A P+G G+I A E V +D
Sbjct: 252 AGDLVLAGNVADNGMLCEMRARVLPEG---------------GSIKASESGGFSVRDADA 296
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS----IRNLSYSDLYTRHLDDYQKLF 305
+L + ++ + PS + +AL+ +SY +L +H+DD++ LF
Sbjct: 297 VTVLYATETDYENAY--PSYRSGQTLEQVDAALKEKLDVAAGISYDELKKQHIDDHRSLF 354
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISS 364
RV I L P TD + +K ++ + DP + E+LFQFGRYL I+S
Sbjct: 355 ERVEIDLGGVPAQKPTD-------------QMMKDYRAGNNDPFIEEMLFQFGRYLTIAS 401
Query: 365 SRPGTQV-ANLQGIW-NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SR G ++ +NL GIW D W H N+N++MNYW + NLSEC D++ L
Sbjct: 402 SREGDELPSNLCGIWMMGDAGRFWGGDFHFNVNVQMNYWPAYMTNLSECGSVFTDYMESL 461
Query: 423 SINGSKTAQVNYL-------------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
+ G TA+ + G++++ + + + +A G + G +W
Sbjct: 462 VVPGRVTAERSAAMKTENHATTPVGQGKGFLVNTQNNPFG-CTAPFGSQEYGWNVTGSSW 520
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIA 528
++++ Y +T D + L R YP+L+ +F +L + L PS S E
Sbjct: 521 ALQNVYDEYLFTRDENLLRTRIYPMLKEMTTFWDGFLWWSDYQKRLVVGPSFSAEQ---- 576
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
ST D +++ E+++ I A+E L +ED L + K+ +L P I E+G
Sbjct: 577 -----GPTVNGSTYDQSLVWELYTMAIDASERLGVDED-LRAEWKKTRDKLNPIIIGEEG 630
Query: 589 SIMEWVQ 595
+ EW +
Sbjct: 631 QVKEWFE 637
>gi|148984088|ref|ZP_01817383.1| hypothetical protein CGSSp3BS71_02667 [Streptococcus pneumoniae
SP3-BS71]
gi|418232655|ref|ZP_12859241.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA07228]
gi|418237110|ref|ZP_12863676.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19690]
gi|147923377|gb|EDK74490.1| hypothetical protein CGSSp3BS71_02667 [Streptococcus pneumoniae
SP3-BS71]
gi|353885968|gb|EHE65752.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA07228]
gi|353891548|gb|EHE71302.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19690]
Length = 717
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 152/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVCKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|418157949|ref|ZP_12794665.1| hypothetical protein SPAR41_1732 [Streptococcus pneumoniae GA16833]
gi|353824397|gb|EHE04571.1| hypothetical protein SPAR41_1732 [Streptococcus pneumoniae GA16833]
Length = 692
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 152/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|421299116|ref|ZP_15749803.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60080]
gi|395900587|gb|EJH11525.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60080]
Length = 717
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 153/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L E ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVCFWNAFLHKEQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|418112994|ref|ZP_12749994.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41538]
gi|418153393|ref|ZP_12790131.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16121]
gi|418155639|ref|ZP_12792366.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16242]
gi|419513045|ref|ZP_14052677.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA05578]
gi|419517252|ref|ZP_14056868.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02506]
gi|421283791|ref|ZP_15734577.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA04216]
gi|353783356|gb|EHD63785.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41538]
gi|353816944|gb|EHD97152.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16121]
gi|353819888|gb|EHE00077.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16242]
gi|379634210|gb|EHZ98775.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA05578]
gi|379639325|gb|EIA03869.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02506]
gi|395880477|gb|EJG91529.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA04216]
Length = 717
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 152/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVCFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|320537187|ref|ZP_08037155.1| hypothetical protein HMPREF9554_01893 [Treponema phagedenis F0421]
gi|320145965|gb|EFW37613.1| hypothetical protein HMPREF9554_01893 [Treponema phagedenis F0421]
Length = 735
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 175/594 (29%), Positives = 267/594 (44%), Gaps = 74/594 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDA-----PKAL 72
++PIGNG +GA ++GG+ E L LNE TLWTG P G+ T D
Sbjct: 57 SLPIGNGFIGASIFGGIRREYLHLNEKTLWTGGPCKKRPNYSGGNKTGVDENGYTPADYF 116
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD----VYQLLGDIELEFDDS-HLKYAE-----E 122
+ +R+L G+ AEA A KL G A YQ G ++F S H +E +
Sbjct: 117 AKIRTLFSEGKDAEAAALCDKLVGEKASEGYGAYQSFGKFFIDFYYSAHTALSEPPAEIK 176
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRRELDLN A V+Y E+ R +F++ P V+ KI+ S L +V +S
Sbjct: 177 AYRRELDLNQALVEVRYQYNTTEYRRMYFANYPSNVLAGKITASNP-VLHCSVHFESD-Q 234
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
S N + G K ND ++F +L +I D I+ DK +
Sbjct: 235 GGSISYTQNGFTLSG---------KVEDND----LEF--LLRCRIRTD--GITTCSDKGI 277
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+ + + L +++ + + P P + L N S+ L H+ DY
Sbjct: 278 SITQASFLEFFLCSATDYSDSY--PKYRTGFPPHIDEANL----NKSFDALLAEHIKDYC 331
Query: 303 KLFHRVSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF R + + + S D+ TD E + S + L +LLFQ+GRYLL
Sbjct: 332 PLFDRCRLNIGQDSEPDMPTDVLLSEYKNGKFSRK------------LEDLLFQYGRYLL 379
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
+SSSR + ANLQG+WN SP W S H+NINL+MNYW + L EC PL ++
Sbjct: 380 LSSSREKNILPANLQGMWNNSNSPPWASDYHLNINLQMNYWLACVTGLPECCIPLVKYVA 439
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L +TA+ G ++ H + + W P W+ +LW++Y
Sbjct: 440 ALEKPAERTAKAYTGLDGGLMIHTQNTPFGWTCPGWSFDWGWSPAAFPWILQNLWQYYCA 499
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+ D L++ YPL + F L+ + L ++P+ SPEH P +
Sbjct: 500 SGDFTRLKEIIYPLFKKEIQFYTAVLVFDKKQNRLVSSPTYSPEH---GPR------TNG 550
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+T + ++I E+F I AA++ + + AL+ + K L+P I + I+EW
Sbjct: 551 NTYEQSLIWELFKQGIEAAKLCGEKK-ALIAQWKKVQENLKPIVIGKSRQILEW 603
>gi|386346135|ref|YP_006044384.1| alpha-L-fucosidase [Spirochaeta thermophila DSM 6578]
gi|339411102|gb|AEJ60667.1| alpha-L-fucosidase 2 precursor [Spirochaeta thermophila DSM 6578]
Length = 784
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 166/586 (28%), Positives = 259/586 (44%), Gaps = 74/586 (12%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + D P+GNGRL A+V GGV E + LN + LW G D + + VR
Sbjct: 13 PAGVWRDGYPVGNGRLAALVLGGVGEERIHLNHEWLWRGWYRDRVAEERAHLVGWVREAF 72
Query: 80 DSGQYAEATAASVKLFGHPADV---------YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+G + E T + + FG V YQ G + L ++ E YRRELDL
Sbjct: 73 FTGDWEEGTRRANEAFGGGGGVSGRTCRVGAYQPAGTLVLRWEGME----EAEYRRELDL 128
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
RV+ ++E P + ++SG G + + ++ G+
Sbjct: 129 EEGVVRVRRGE-SLEEVMAVLGGGP---VGVRVSGWGKGWVGLGREVQEGVEVRVEC-GD 183
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFS--AILEIKISDDRGTISALEDKKLKVEGSD 248
++ +EGR +GI + A++E + + G +E +++ V
Sbjct: 184 GRVRLEGRFE--------------EGIVWEVLAVVEGGVCREEGKGVWVEGEEVVVWVVV 229
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ S PS + E A++ RH++ Y +LF RV
Sbjct: 230 DVWEEVGGSRRR-----LPSYGPPEVPGEGWEAVRR-----------RHVEAYGQLFGRV 273
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ + EE + +P+ R + D DP L LLF +GRYLLISSS PG
Sbjct: 274 RLVVE-----------GEEPL--LPTGRR----RGDPDPLLPVLLFDYGRYLLISSSAPG 316
Query: 369 TQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
+ ANLQG WN L P WD+ H++INL+MNYW + L EC PL ++ + +
Sbjct: 317 CDLPANLQGKWNPLLEPPWDADYHMDINLQMNYWLAEGAGLGECVTPLVRYVVRMMPSAR 376
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ A+ + G +D WA+++ + W +W AW+ HL Y Y+ D FL
Sbjct: 377 EAARRLFGCRGIWFPLTSDAWARATPE--AYGWDVWVGAAAWMAQHLVWRYLYSGDEGFL 434
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ YP LE A F D+L+E +G L+ PS SPEH + +G + SS +D+ ++
Sbjct: 435 RETVYPFLEEVALFFEDFLVEDGEGVLQVVPSQSPEHRWEGLEGFPVGLCVSSAVDVQLV 494
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
R V + L +E + ++ L RLR + DG ++EW
Sbjct: 495 RWVLRMAVELGGRL-GDEVSRWREMEGRLARLR---VGRDGVLLEW 536
>gi|417677381|ref|ZP_12326788.1| hypothetical protein SPAR148_1583 [Streptococcus pneumoniae
GA17545]
gi|418226036|ref|ZP_12852664.1| hypothetical protein SPAR141_1569 [Streptococcus pneumoniae NP112]
gi|419467267|ref|ZP_14007148.1| BH0842-like protein [Streptococcus pneumoniae GA05248]
gi|332072822|gb|EGI83303.1| hypothetical protein SPAR148_1583 [Streptococcus pneumoniae
GA17545]
gi|353881233|gb|EHE61047.1| hypothetical protein SPAR141_1569 [Streptococcus pneumoniae NP112]
gi|379543014|gb|EHZ08166.1| BH0842-like protein [Streptococcus pneumoniae GA05248]
Length = 692
Score = 200 bits (509), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 152/524 (29%), Positives = 245/524 (46%), Gaps = 73/524 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVCFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|391873884|gb|EIT82888.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus oryzae 3.042]
Length = 513
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 119/304 (39%), Positives = 163/304 (53%), Gaps = 21/304 (6%)
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
D++ L RV + L+ S + N+ T ER K+ D DP LV L+FQFGRY
Sbjct: 15 DHEALAGRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRY 65
Query: 360 LLISSSR-PGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
LI+SSR GT NLQG+WNED P W VNINLEMNYW + NL+E PL
Sbjct: 66 SLIASSRETGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLI 125
Query: 417 DFLTYLSINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
L + G A+ Y G+V+HH TDIW + W +WPMGGAWL +L
Sbjct: 126 FLLETVKPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANL 185
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---- 530
E+Y +T D + L++R +PLL A F ++ +GYL T PS+SPE+ F+ P+
Sbjct: 186 MEYYRFTQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSK 244
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
G + + TMD ++ E+F +II +VL N + K SLP ++ +I G
Sbjct: 245 SGNEEGIDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQ 303
Query: 590 IMEW 593
I+EW
Sbjct: 304 ILEW 307
>gi|83764453|dbj|BAE54597.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 513
Score = 200 bits (508), Expect = 2e-48, Method: Compositional matrix adjust.
Identities = 119/304 (39%), Positives = 163/304 (53%), Gaps = 21/304 (6%)
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
D++ L RV + L+ S + N+ T ER K+ D DP LV L+FQFGRY
Sbjct: 15 DHEALAGRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRY 65
Query: 360 LLISSSRP-GTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
LI+SSR GT NLQG+WNED P W VNINLEMNYW + NL+E PL
Sbjct: 66 SLIASSRKTGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLI 125
Query: 417 DFLTYLSINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
L + G A+ Y G+V+HH TDIW + W +WPMGGAWL +L
Sbjct: 126 FLLETVKPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANL 185
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---- 530
E+Y +T D + L++R +PLL A F ++ +GYL T PS+SPE+ F+ P+
Sbjct: 186 MEYYRFTQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSE 244
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
G + + TMD ++ E+F +II +VL N + K SLP ++ +I G
Sbjct: 245 SGNEEGIDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQ 303
Query: 590 IMEW 593
I+EW
Sbjct: 304 ILEW 307
>gi|418189889|ref|ZP_12826401.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47373]
gi|419493782|ref|ZP_14033507.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47210]
gi|353853616|gb|EHE33597.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47373]
gi|379592355|gb|EHZ57171.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47210]
Length = 717
Score = 199 bits (507), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 151/524 (28%), Positives = 243/524 (46%), Gaps = 73/524 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + D S ++++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETDGDIRVWSY----RVQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVRDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
I AA+ L +ED L E KS L P +I + G I EW +
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|336427815|ref|ZP_08607806.1| hypothetical protein HMPREF0994_03812 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336008564|gb|EGN38577.1| hypothetical protein HMPREF0994_03812 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 377
Score = 199 bits (507), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 135/409 (33%), Positives = 206/409 (50%), Gaps = 46/409 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ ++ PA+ + +A+PIGNGR+G MV GG+ E ++LNED++W+G + NPDA + L
Sbjct: 1 MKLWYDKPARFWHEALPIGNGRMGGMVHGGITRELIQLNEDSVWSGKHLNRINPDAKENL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+R L+ G+ EA A L G P YQ G+ L+ H + YRREL+
Sbjct: 61 PVIRKLIREGRVEEAQQLAMYALSGVPNSQRSYQTAGECCLQM---HHGDEVQDYRRELE 117
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L +RV Y+V V + RE + S P+ +V + + + SF+ L H+ +
Sbjct: 118 LAEGISRVAYTVQGVRYIRESYVSYPENCMVMVLKTEDGTAFSFDCLLGRC---HNATDE 174
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ C + +GI F+A L K G + + L V
Sbjct: 175 VEKVDEHTIC--------FTVDGGQEGISFAAALCAKAV---GGFVRVIGEHLLVRDVQE 223
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY---SDLYTRHLDDYQKLFH 306
A L L +SF +D +K L IR + +D+ H +D+ +F+
Sbjct: 224 AYLYLDIETSF-----READYRK-------VCLDRIRTAAVKEEADIRALHKEDFGSVFN 271
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
R+++ + D+ + +P+ ER++ Q E D L+EL FQ+GRYLL+SSS
Sbjct: 272 RLALSFELTDADL----------EQIPTDERLRRVQAGERDMGLMELYFQYGRYLLMSSS 321
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
R G+ ANLQGIWN+ L P W+S +NIN EMNYW + NLSECQ P
Sbjct: 322 RKGSLPANLQGIWNDKLYPVWESKFTININTEMNYWIAGSGNLSECQLP 370
>gi|418108082|ref|ZP_12745119.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41410]
gi|419423496|ref|ZP_13963709.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43264]
gi|353778359|gb|EHD58827.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41410]
gi|379586068|gb|EHZ50922.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43264]
Length = 717
Score = 199 bits (506), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 149/513 (29%), Positives = 240/513 (46%), Gaps = 51/513 (9%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + +L F + L D S + C I K D+
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E ++D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLN 307
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 308 VNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423
Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 474
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ED L E KS L P +I + G I EW +
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|417694531|ref|ZP_12343718.1| hypothetical protein SPAR120_1586 [Streptococcus pneumoniae
GA47901]
gi|418110621|ref|ZP_12747640.1| hypothetical protein SPAR113_1694 [Streptococcus pneumoniae
GA49447]
gi|332201080|gb|EGJ15151.1| hypothetical protein SPAR120_1586 [Streptococcus pneumoniae
GA47901]
gi|353781242|gb|EHD61687.1| hypothetical protein SPAR113_1694 [Streptococcus pneumoniae
GA49447]
Length = 692
Score = 199 bits (506), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 149/513 (29%), Positives = 240/513 (46%), Gaps = 51/513 (9%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + +L F + L D S + C I K D+
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E ++D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLN 307
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 308 VNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423
Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 474
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ED L E KS L P +I + G I EW +
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|421218284|ref|ZP_15675178.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070335]
gi|395583053|gb|EJG43502.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070335]
Length = 692
Score = 199 bits (505), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 149/513 (29%), Positives = 240/513 (46%), Gaps = 51/513 (9%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + +L F + L D S + C I K D+
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E ++D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDYPDALPANLQGVWNAVDNPPWNSDYHLN 307
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 308 VNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423
Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFYDFIQAAQELG 474
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ED L E KS L P +I + G I EW +
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 506
>gi|336399821|ref|ZP_08580621.1| Alpha-L-fucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069557|gb|EGN58191.1| Alpha-L-fucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 1111
Score = 199 bits (505), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 140/541 (25%), Positives = 247/541 (45%), Gaps = 81/541 (14%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+++ S + N + + PA+++ T +PIG+G+ GA + G + + ++ N+ TLW+G
Sbjct: 334 VISIASYTPKNKYTLWYTQPAENWMTSCLPIGDGQFGATLMGQIAVDDIQFNDKTLWSGK 393
Query: 60 PGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
G T+ D +G Y G++ + H
Sbjct: 394 LGARTSSDN--------------------------YG----FYLNFGNLYIMSKGMH--- 420
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-- 177
+ Y R LD+N A A V ++ V++ R +F+SNPD IV + S++G ++ + L
Sbjct: 421 SATNYVRYLDINDAIAGVNFTSDGVDYQRSYFASNPDSCIVIRYKASQNGHINAVLRLKN 480
Query: 178 ----DSL--LDN--HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
DS +DN + ++ N I +G G + P+ S + ++
Sbjct: 481 QNGKDSCYNIDNSQQATISFNGTIARQGD-SGVTVEPE------------SYVCSARVVI 527
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
D G++ ++V G++ ++ L + +D + + +Q +
Sbjct: 528 DGGSLKKNSAGLIEVIGANSMIIYLRGLTDYDPDAPQYVSGAALLPTRVAAIVQKAQKKG 587
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL 349
Y L H DY++ F R + LS + +I P+ + +++ D +L
Sbjct: 588 YETLLAAHKADYKQWFDRCQLTLSNAKNNI-------------PTPTLIANYKNDPKANL 634
Query: 350 V--ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
EL F +GRYLLISSSR + ANLQGIWN + +P W + H NIN++MNYW + P N
Sbjct: 635 FLEELYFSYGRYLLISSSRGVSLPANLQGIWNNNNTPAWHADIHSNINVQMNYWPAEPTN 694
Query: 408 LSECQEPLFDFLTYLSINGSKTAQ----VNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
LSE P +++ + Q + + +GW + + +I+ G +
Sbjct: 695 LSELHMPFLNYIYREACVKPTWRQYAKDMGGVNAGWTLPTENNIYGS-----GTTFAPTY 749
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
+ AW C HLW+HY YT+D+D+L ++A+P ++ C + L++ +DG E SPE
Sbjct: 750 TIANAWYCQHLWQHYQYTLDKDYLRRQAFPAMKSCVEYWFQKLVKANDGTYECPDEWSPE 809
Query: 524 H 524
H
Sbjct: 810 H 810
>gi|225019012|ref|ZP_03708204.1| hypothetical protein CLOSTMETH_02963 [Clostridium methylpentosum
DSM 5476]
gi|224948237|gb|EEG29446.1| hypothetical protein CLOSTMETH_02963 [Clostridium methylpentosum
DSM 5476]
Length = 1657
Score = 198 bits (503), Expect = 7e-48, Method: Compositional matrix adjust.
Identities = 172/621 (27%), Positives = 270/621 (43%), Gaps = 120/621 (19%)
Query: 13 LKITFNGPAKHFTDA------IPIGNGRLGAMVWGGVPSETLKLNEDTLW--TGVPGDYT 64
LK+ ++ PA + +DA +P+G G +GA V+G +E ++L E++L G G
Sbjct: 53 LKLWYDEPAPN-SDAGWEQWSLPLGCGYMGANVFGITDTERIQLTENSLCGNNGFEGGLN 111
Query: 65 NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE-- 122
N F +++L + +
Sbjct: 112 N----------------------------------------------FSETYLDFGHDYS 125
Query: 123 ---TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--- 176
Y R+L LN ATA V+Y G V ++RE+F+S PD+V+ K+S SESG LSF +
Sbjct: 126 GVSNYTRDLILNDATAHVRYDYGGVTYSREYFTSYPDKVMAIKLSASESGKLSFTLRPTI 185
Query: 177 --LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
L+ G+ I + GR G + + P G S D GTI
Sbjct: 186 PYLNEKKSGTVSAQGDT-ITLSGRMHGYEVDFEGQYKVIPSGGSASMQAANDADGDNGTI 244
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFD---GPFINPSDSK----KDPTSESMSALQSIRN 287
+V G+D AV+L+ ++++ F+NP +K + P ++ ++
Sbjct: 245 --------QVTGADSAVILIAIGTNYEFDPQVFLNPDATKLEGFEHPHAKVTERIEQASA 296
Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DED 346
SY L + H DYQ LF R L + + TD E + +++ D
Sbjct: 297 QSYEQLRSNHTADYQNLFDRTRFDLGGAVPQLTTD-------------ELMNAYKAGSND 343
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
L EL FQ+GRYLLISSSR G NLQG+WN W + NIN++MNYW
Sbjct: 344 RYLEELYFQYGRYLLISSSRKGALPPNLQGVWNMYEQAPWTAGYWHNINIQMNYWPVFST 403
Query: 407 NLSECQEPLFDFL-TYLSINGSKTAQV-------NYLASGWVIHHKTDIWAKSSADRGKV 458
NL+E + D+ YL + + Q NY G + W+ +
Sbjct: 404 NLAELFDSYIDYYNAYLPAVRNSSNQFIAQQHPDNYDPGG------DNGWSIGTGAGPYS 457
Query: 459 VWALWPMG------GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
V+A G GA + WE+Y++T D D LE YP + G A+F + ++E H
Sbjct: 458 VYAPNGQGTDGNGTGALMAQVFWEYYDFTRDPDILENITYPAVSGAANF-MSRVMEPHGD 516
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
YL +PS SPE +G V+ + D + E+ + AAE+L + ++AL +++
Sbjct: 517 YLLADPSASPEQ---MENGNY-VVTVGTAWDQQLAYEMEQNTLEAAELLGRQDEALPQRL 572
Query: 573 LKSLPRLRPTKIAEDGSIMEW 593
+ +L P ++ G I E+
Sbjct: 573 ADQIDKLDPVQVGFSGQIKEF 593
>gi|302405797|ref|XP_003000735.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
gi|261360692|gb|EEY23120.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
Length = 652
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 153/489 (31%), Positives = 229/489 (46%), Gaps = 50/489 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ P + +PIGNGRLGA+V+G E + LNE+++W+G D NP + A VR
Sbjct: 30 YETPGQDLKSGLPIGNGRLGALVYGSAI-EKITLNENSVWSGPFQDRANPGSLSAFPVVR 88
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ G+Y EA +++ + G P D Y + D+ L+F H + Y R LD T
Sbjct: 89 DLLTKGKYTEAGQLTLRNMTGIPTDTQWYSVTADLFLDF--GHREEGWSGYERWLDTQTG 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL---DSLLDNHSYVNGN 190
++ V +TRE + I +++ S+ G+LSFN S +L N S +
Sbjct: 147 ITGTVFNWNGVNYTREAVAGADGGAIAMRLTASQHGALSFNTSWYREKGILKNTSSSCAS 206
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
++ G DD I FS + + D G+I D + VEG+
Sbjct: 207 TLVLDIG-------------GDDAGSIPFSTAVRLVAED--GSIRKGNDSMISVEGATTV 251
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ + +SF + + K++ T + A+++ + + ++ D+Q L RV +
Sbjct: 252 DIFVNVETSFR--WASTDKIKEELTRQLDVAVKT----GFDTIKSQAAKDHQSLMKRVEL 305
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP--- 367
L S + + T D +A RV + DP + L F FGR+LLISSSR
Sbjct: 306 DLGSSSEAGLLTT------DKRIAAYRVNA---TADPEFLTLNFNFGRHLLISSSRASAS 356
Query: 368 -GTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
G V ANLQGIWN+ P W S VNIN EMNYW + +L E PL+D L+
Sbjct: 357 SGMGVPANLQGIWNDMYFPPWGSKYSVNINTEMNYWLAEVTDLPETLPPLWDLLSRTRDK 416
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD-- 483
G TA+ Y GWV HH DIW S + ++LWP W+ L E Y ++ D
Sbjct: 417 GLITAKEMYGCPGWVSHHNLDIWGDSCPNANGTAYSLWPSSNLWMSQQLMERYRFSNDKI 476
Query: 484 ----RDFLE 488
RD++E
Sbjct: 477 QEWRRDYVE 485
>gi|152968134|ref|YP_001363918.1| twin-arginine translocation pathway signal [Kineococcus
radiotolerans SRS30216]
gi|151362651|gb|ABS05654.1| twin-arginine translocation pathway signal [Kineococcus
radiotolerans SRS30216]
Length = 808
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 183/617 (29%), Positives = 265/617 (42%), Gaps = 72/617 (11%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + GPA + +A+P+G+GRLGA+ WG E L LN+D W+G P
Sbjct: 5 RLRYEGPATTWLEALPVGDGRLGAVCWGLADGERLSLNDDRAWSG----------PVGGP 54
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE----------ET 123
+ D EA A+V L G P +LL + + + L +
Sbjct: 55 HHPTPPDHPDRVEAARAAV-LAGDPTRAGELLEPV-VHHTQAFLPVGDLLVTTAAAAAPG 112
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
R LDL TATA + V T H +S V+V +++ +G+ ++L S L
Sbjct: 113 VVRGLDLGTATAWSQRPVPG--GTVRHETSVGAGVLVHRVTAPGAGT-GLRLALASPLRP 169
Query: 184 HS---YVNGNNQIIMEGRC----PGKRIPPKANANDDP-----KGIQFSAILEIKISDDR 231
V + +E R P P + ++DP G + +
Sbjct: 170 AGSTLRVPDGDPGGLEWRTLLDLPEDVHPWHPDQHEDPVRWAAPGTPSRQVAVVVRVRCD 229
Query: 232 GTISALEDKKLKVEGSDWAVL----LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
GT A D VEG W + ++VA + D P +P+ P E+ +A +
Sbjct: 230 GTPRAAPDPAGPVEGPAWDGVREAHVVVAVETPD-PATDPTGR---PDVEAAAARAAAAV 285
Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKDIVTDTCSEENIDTVPSAERVKSFQTDED 346
+ RH ++ +LF R + L R P TD V + DED
Sbjct: 286 ADPGAVRERHRREHAELFGRSDLDLGGRVPAGTTTDAL-------------VGLAEHDED 332
Query: 347 PSLVELLFQFG--RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
+ V RYLL++ SRPGT LQGIWNE+L P W S +N+NL M YW
Sbjct: 333 AARVLAALAVAHARYLLVTGSRPGTLPLTLQGIWNEELQPPWSSNYTLNVNLPMAYWPVQ 392
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK---VVWA 461
P L EC EPL F L+ G+ TA Y A GWV HH +D WA++ + G W+
Sbjct: 393 PWGLPECAEPLLAFAERLAAAGTATAAEMYGARGWVAHHNSDGWAQTRSVGGGWNDPAWS 452
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
WP GG WL +L + ++ D L +R P++EG F LD L+ DG L T PSTS
Sbjct: 453 AWPYGGVWLSLNLLDALDFAADPGPLARRVLPVVEGAVRFCLDRLVVLPDGTLGTAPSTS 512
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA-----EVLEKNEDALVEKVLKSL 576
PE+ ++ G V SST D+ + R + + A + + A VE L L
Sbjct: 513 PENHWLDAAGNAQAVERSSTCDLELTRGLLTGWSRWAGRQTHAPVPADLRAEVEAALAGL 572
Query: 577 PRLRPTKIAEDGSIMEW 593
P G ++EW
Sbjct: 573 PH---PGTGARGELLEW 586
>gi|317036568|ref|XP_001397589.2| alpha-fucosidase A [Aspergillus niger CBS 513.88]
Length = 768
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 167/592 (28%), Positives = 262/592 (44%), Gaps = 86/592 (14%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPKA--LSDVRS 77
T A P+GNGRLGAM G E + LN D+LW G P + Y+ NP+ KA L +R
Sbjct: 36 TTAFPLGNGRLGAMPIGSYDKEIVNLNVDSLWRGGPFESPTYSGGNPNVSKAGALPGIRE 95
Query: 78 LVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
+ + T L G +P YQ+L ++ ++ + ++D
Sbjct: 96 WI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGE----------LSDID------ 135
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV--SLDSLLDNHSYVNGNNQ 192
RE F S PD V V ++S + S ++F + L S N S +GN+
Sbjct: 136 --------GYHNREAFCSYPDNVCVYRLSSNSSLPEITFGLENQLTSPAPNVS-CHGNSI 186
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV-EGSDWAV 251
+ G+ P G+ ++A + + + T +KV EG
Sbjct: 187 SLY-----GQTYPVI--------GMIYNARVTVVVPGSSNTTDLCSSSTVKVPEGEKEVF 233
Query: 252 LLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
L+ A ++++ N S ++P + + + SYS L + H+ DYQ +F++
Sbjct: 234 LVFAADTNYEASNGNSKASFSFKGENPYMKVLQTATNAAKKSYSALKSSHVKDYQGVFNK 293
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
++ L P+ E + S+ DP + LLF +GRYL ISSSRP
Sbjct: 294 FTLTLP-----------DPNGSADRPTTELLSSYSQPGDPYVENLLFDYGRYLFISSSRP 342
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NG 426
G+ NLQG+W E SP W H NINL+MN+W L E EPL+ ++ + G
Sbjct: 343 GSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVDQTGLGELTEPLWTYMAETWMPRG 402
Query: 427 SKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
++TA++ Y S GWV H + + + +A + WA +P AW+ H+W+H++Y+ D
Sbjct: 403 AETAELLYGTSEGWVTHDEMNTFGH-TAMKDVAQWADYPATNAWMSHHVWDHFDYSQDSA 461
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+ + YP+L+G A F L L++ DG L NP SPEH P C Y
Sbjct: 462 WYRETGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPEH---GPT-TFGCTHYQQ-- 515
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
+I E+F ++ ++ + + L P I G I EW
Sbjct: 516 ---LIWELFDHVLQGWTASGDDDTSFKNAITSKFSTLDPGIHIGSWGQIQEW 564
>gi|423281387|ref|ZP_17260298.1| hypothetical protein HMPREF1203_04515 [Bacteroides fragilis HMW
610]
gi|404583091|gb|EKA87774.1| hypothetical protein HMPREF1203_04515 [Bacteroides fragilis HMW
610]
Length = 402
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 135/400 (33%), Positives = 202/400 (50%), Gaps = 51/400 (12%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAP 69
N L + + PA ++ +A+P+GNG LGAMV+G E L+LNE TL++G P P
Sbjct: 22 NNLSLWYRQPAANWNEALPLGNGYLGAMVFGDAGREHLQLNESTLYSGEPFSGVGVPSIG 81
Query: 70 KALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
++V +L++ G YA A + + G + YQ L D+ L FD ++ E Y REL
Sbjct: 82 SVYNEVLALLNHGDYAGAHRLITRNWQGRLSQSYQPLADLFLSFD---VQGKVENYVREL 138
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
+L A ++Y G + +TRE+F SNPD+V+V +IS S ++ VS S
Sbjct: 139 NLQDAVHTIRYQAGGIRYTREYFISNPDRVMVIRISASRRSPVNVAVSYTSEHPTAKVDG 198
Query: 189 GNNQIIMEGRCP---------------------------GKRIPPKANANDDP---KGIQ 218
++I+ G+ P G+R K D KG+
Sbjct: 199 TGEELILSGQAPGCVERRTLDFLEKNRLTDRHPELFDSHGRRKTDKQVLYADEVGGKGMF 258
Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
F + +K+ T L+D +LKV G +LL+ A++S++G +PS D ++
Sbjct: 259 FQS--RVKVLKGNAT---LQDNQLKVSGEGEIILLVAAATSYNGFDRSPSQDGSDYQAKL 313
Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
+ L L Y DL RHL DYQ+LF RV++ L SE++ +P+ R+
Sbjct: 314 DTILSVSGQLPYEDLKKRHLADYQRLFGRVALTLK-----------SEKDYSGLPTDRRI 362
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
F+ + D +L LLFQ+GRYLLI+SSR G Q ANLQGIW
Sbjct: 363 IGFRDNPDNALAALLFQYGRYLLIASSREGGQPANLQGIW 402
>gi|225019811|ref|ZP_03709003.1| hypothetical protein CLOSTMETH_03764, partial [Clostridium
methylpentosum DSM 5476]
gi|224947447|gb|EEG28656.1| hypothetical protein CLOSTMETH_03764 [Clostridium methylpentosum
DSM 5476]
Length = 1411
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 177/644 (27%), Positives = 289/644 (44%), Gaps = 130/644 (20%)
Query: 4 AESTSTTNPLKITFNGPAKHFTD------AIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
AE + LK+ ++ PA +D +IP+GNG +G ++GGV +E +++ E++L
Sbjct: 38 AEPLAAAKQLKLWYDEPAPS-SDIGWREWSIPMGNGYMGVNLFGGVQTERIQITENSL-- 94
Query: 58 GVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHL 117
+ + SV + ++ Y I+ E D
Sbjct: 95 ----------------------------QDSNTSVGGLNNFSETY-----IDFEHSDP-- 119
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV-- 175
+ Y+REL+L+ A V Y V + R++F+ PD+V+V ++S SE+G LSF +
Sbjct: 120 ----QNYQRELNLSEGVASVVYDSDGVRYERQYFTDYPDKVMVIRLSASEAGKLSFTLRP 175
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAND-------DPKGIQFSAILEIKIS 228
++ L D H + G GK KA + + ++F + K+
Sbjct: 176 TIPYLCDYH---------VEPGDNRGKHGTVKAEGDTITLAGAMEYYNVEFEG--QYKVL 224
Query: 229 DDRGTISALEDKK-----LKVEGSDWAVLLLVASSSFD-GPFINPSDSKKD-------PT 275
GT++A D+ + V+ +D AV+L+ ++++ + ++++ D P
Sbjct: 225 PTGGTMTAQNDQNGDNGTISVQNADSAVILIGIGTNYELKSSVFTANNRLDKLKGNAHPH 284
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
++ +Q SY +L H +DY+ LF RVS+ + TD
Sbjct: 285 AKVTKIIQDASAKSYDELLASHQEDYKGLFDRVSVDFGGQMPTVTTD------------- 331
Query: 336 ERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNI 394
E +K++Q + DP L EL +QFGRY+LI SSR G NLQG+WN P W S NI
Sbjct: 332 ELLKNYQNGQSDPYLEELFYQFGRYMLICSSRKGALPPNLQGVWNVFNDPPWRSGYWHNI 391
Query: 395 NLEMNYWQSLPCNLSECQEPLFDFL-TYLSI------------NGSKTAQVNYLASGWVI 441
NL+MNYW + NL E E D+ YL N S +VN +GW +
Sbjct: 392 NLQMNYWPAFTGNLPELFEAYADYQKAYLEKAEQYAVSNIQKNNPSALDKVNTKENGWAL 451
Query: 442 HHKTDIW----AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
+ T W + S++ G GA+ W++Y+YT D LE AYP + G
Sbjct: 452 GNST--WPYNISGSASHSGFGT-------GAFTSIMFWDYYDYTRDASVLEDTAYPAVSG 502
Query: 498 CASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISA 557
A F L +++ DGYL +PS SPE++ K ++ D +I E + A
Sbjct: 503 MAKF-LSKIVQPIDGYLLASPSYSPENQHNGGSYKTVGCAF----DQQMIYENHLDTLKA 557
Query: 558 AEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQRRL 598
A+ L ++E AL + + LP L P ++ G I E+ + +
Sbjct: 558 ADALGLTAEDEPALA-TLEQQLPLLDPVQVGASGQIKEYREEKF 600
>gi|302883112|ref|XP_003040458.1| hypothetical protein NECHADRAFT_122680 [Nectria haematococca mpVI
77-13-4]
gi|256721342|gb|EEU34745.1| hypothetical protein NECHADRAFT_122680 [Nectria haematococca mpVI
77-13-4]
Length = 812
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 159/592 (26%), Positives = 263/592 (44%), Gaps = 50/592 (8%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPK--ALSDVRSLV 79
P+GNG L +G E + N D+LW+G P + YT NP K AL +R +
Sbjct: 45 GYPVGNGILAGTHFGDPGHEKIVFNVDSLWSGGPFENSAYTGGNPTTSKSTALPGIREYI 104
Query: 80 DSGQYAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
+ + T L G + Y++LG++ + + Y Y R LD +T
Sbjct: 105 ----FDQGTGNVSALLGSGNYYGSYRVLGNLSIIIGHA-TDYTN--YTRSLDPSTGVHTT 157
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
Y +V +T F SNP V +++ E + N+ ++L + S N
Sbjct: 158 TYLADSVNYTTTLFCSNPADACVYRVTSDED-LPNINIQFENLAVSSSLAN------PSC 210
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE---GSDWAVLLL 254
P R D P+G+++ AI + D +S + L + G +++
Sbjct: 211 NHPYTRFRGVTQLGD-PEGMKYEAIARFVDNRDGDGVSCATNGSLTIARSPGFKTVDVII 269
Query: 255 VASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
A +++D N + DP + S Y L H++DYQ LF ++
Sbjct: 270 SAGTNYDATKGNAENDYSFRGDDPAEAVQRSTSSGAQQGYDKLLKAHIEDYQSLFGTFTL 329
Query: 311 QLSRSPKDIVTDTC---SEENIDTVPSAE-RVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
L + K +T S + + + R+ DP L LLF + RYLLI+SSR
Sbjct: 330 TLPDAQKSAGHETAVLISNYSSNGIGDPYIRIYYISKSRDPYLESLLFDYSRYLLIASSR 389
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-N 425
+ ANLQG W E ++P+W S H NIN++MNYW + L + L++++ +
Sbjct: 390 ENSLPANLQGKWTEQMNPSWSSDYHANINIQMNYWAADQTGLGKTSVALWNYMRNTWVPR 449
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G++TA++ Y A GWV+H++ +I+ + +G WA +P+ AW+ H+W++Y Y
Sbjct: 450 GTETAKLLYDAPGWVVHNEMNIFGHTGM-KGSATWANYPVAAAWMMQHVWDNYEYGRSLT 508
Query: 486 FLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+L + YPLL+ A F + L E +DG L NP S EH P C Y
Sbjct: 509 WLRQEGYPLLKEVAQFWISQLQEDEFNNDGTLVVNPCNSAEH---GPT-TFGCTHYQQ-- 562
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
+I +V A +++ + +++ ++ L +L + G I EW
Sbjct: 563 ---LIHQVLEATLNSITYIGEDDQDFTSELKTVLKKLDKGLHYTSWGGIKEW 611
>gi|325261390|ref|ZP_08128128.1| fibronectin type III domain protein [Clostridium sp. D5]
gi|324032844|gb|EGB94121.1| fibronectin type III domain protein [Clostridium sp. D5]
Length = 1783
Score = 195 bits (495), Expect = 7e-47, Method: Compositional matrix adjust.
Identities = 167/614 (27%), Positives = 273/614 (44%), Gaps = 74/614 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD----------YTNPDAPKALSDVR 76
++PIGN +GA V+GGV E ++LNE +LW+G P D N + ++
Sbjct: 73 SLPIGNSAIGASVFGGVDIERIQLNEKSLWSGGPSDSRPDYNGGNIQQNGQDGATMKQIQ 132
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
L G + A+A KL G D Y G++ L+F D E Y R+L+
Sbjct: 133 ELFKEGNNSAASALCNKLIGVSDDAGDKGYGYYLSYGNMYLDFQDGASPDNVENYSRDLN 192
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L A + V Y + RE+F S PD V+VT+++ +E G+L F+V ++ D+
Sbjct: 193 LRNAVSSVDYDYKGTHYHREYFVSYPDNVLVTRLT-AEGGTLDFDVRVEP--DDQKGGGS 249
Query: 190 NNQIIME-GRCPGKRIPPKA---NANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
NN GR + N ++FS+ K+ D G +K+ V
Sbjct: 250 NNPSAESYGRSWDTDVKDGVISINGELTDNQMKFSS--HTKVVADEGGKVKDGTEKVSVS 307
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDD 300
G+ + + + + + + T+E +SA + Y + H D
Sbjct: 308 GAKEVTIYTSIGTDYKNEY---PEYRTGQTAEEVSARIKAYVDQAAVKGYEAVKEAHTKD 364
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTC-SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
+ +F RV + L ++ D TD+ + N ER + L +LFQ+GRY
Sbjct: 365 FDSIFGRVDLNLGQTVSDRATDSLLAAYNSGKASEGERRQ---------LEVMLFQYGRY 415
Query: 360 LLISSSR------PGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
L I SSR P + +NLQGIW + W + H+N+NL+MNYW + N++EC
Sbjct: 416 LTIESSRETPDDDPSRETLPSNLQGIWVGANNSAWHADYHMNVNLQMNYWPTYSTNMAEC 475
Query: 412 QEPLFDFLTYLSINGSKTAQV------NYLASGWVIHHKTD--IWAKSSADRGKVVWALW 463
+PL ++ L G TA++ +G++ H + + W D W
Sbjct: 476 AQPLISYVDSLREPGRVTAKIYAGIGDGKSETGFMAHTQNNPFGWTCPGWD---FSWGWS 532
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
P W+ + W++Y++T D ++L YP++ A L++ G L ++PS SPE
Sbjct: 533 PAAVPWILQNCWDYYDFTGDTEYLRNVIYPIMREEALLYDQMLVDDGTGKLVSSPSFSPE 592
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PT 582
H P + A +Y T+ I +++ I AAE+L + + VE RL+ P
Sbjct: 593 H---GP--RTAGNTYEQTL----IWQLYEDTIQAAEILGTDAEQ-VEVWKDKQSRLKGPI 642
Query: 583 KIAEDGSIMEWVQR 596
+I + G I EW +
Sbjct: 643 EIGDSGQIKEWYEE 656
>gi|111658272|ref|ZP_01408963.1| hypothetical protein SpneT_02000541 [Streptococcus pneumoniae
TIGR4]
Length = 576
Score = 195 bits (495), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 130/384 (33%), Positives = 194/384 (50%), Gaps = 36/384 (9%)
Query: 215 KGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDP 274
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 9 KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI---------- 55
Query: 275 TSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
+S+LQ ++ Y H+ YQ+ F+RV +L S + +I T
Sbjct: 56 ---DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL--------SIPTNL 104
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S +N
Sbjct: 105 LLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSKYTIN 160
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
IN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD + ++
Sbjct: 161 INTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFGDTAP 220
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
+ A+W + WLCTH+WEHY Y D L + + +++ F D+L E DGY
Sbjct: 221 QSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV-DGY 278
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEK 571
L T PS SPE+++ +G SST+D I+R + I A+ L N D + V++
Sbjct: 279 LMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFISRVKE 338
Query: 572 VLKSLPRLRPTKIAEDGSIMEWVQ 595
+ K LP+ TKI +G I EW++
Sbjct: 339 LKKKLPK---TKIGSNGQIQEWLE 359
>gi|146386783|pdb|2EAE|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complexes With Products
Length = 898
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 166/625 (26%), Positives = 287/625 (45%), Gaps = 82/625 (13%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 51 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 110
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 111 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 168
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 169 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 225
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 226 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 279
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+DD+ ++ RV I L +
Sbjct: 280 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 339
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 340 SGHSSDGAVAT----DALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 395
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 396 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 455
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 456 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 514
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 515 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 573
Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
G +Y S++ ++ + A + +A+ KN+ DA +
Sbjct: 574 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 629
Query: 572 ---VLKSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ + G I EW
Sbjct: 630 SWSCAKSL--LKPIEVGDSGQIKEW 652
>gi|146386777|pdb|2EAB|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum (Apo Form)
gi|146386778|pdb|2EAB|B Chain B, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum (Apo Form)
gi|146386779|pdb|2EAC|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complex With
Deoxyfuconojirimycin
gi|146386780|pdb|2EAC|B Chain B, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complex With
Deoxyfuconojirimycin
Length = 899
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 166/625 (26%), Positives = 287/625 (45%), Gaps = 82/625 (13%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 52 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 111
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 112 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 169
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 170 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 226
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 227 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 280
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+DD+ ++ RV I L +
Sbjct: 281 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 340
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 341 SGHSSDGAVAT----DALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 396
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 397 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 456
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 457 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 515
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 516 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 574
Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
G +Y S++ ++ + A + +A+ KN+ DA +
Sbjct: 575 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 630
Query: 572 ---VLKSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ + G I EW
Sbjct: 631 SWSCAKSL--LKPIEVGDSGQIKEW 653
>gi|34451973|gb|AAQ72464.1| alpha-fucosidase [Bifidobacterium bifidum JCM 1254]
Length = 1959
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 166/625 (26%), Positives = 287/625 (45%), Gaps = 82/625 (13%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+DD+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
G +Y S++ ++ + A + +A+ KN+ DA +
Sbjct: 1150 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 1205
Query: 572 ---VLKSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ + G I EW
Sbjct: 1206 SWSCAKSL--LKPIEVGDSGQIKEW 1228
>gi|310286736|ref|YP_003937994.1| cell wall protein containing Ig-like domains (group2 and 3)
[Bifidobacterium bifidum S17]
gi|309250672|gb|ADO52420.1| cell wall protein containing Ig-like domains (group2 and 3)
[Bifidobacterium bifidum S17]
Length = 1959
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 165/630 (26%), Positives = 286/630 (45%), Gaps = 92/630 (14%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q+ N Y+ + H+DD+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQAAANKGYTAVKKAHIDDHSAIYDRVKINLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGNTTDCSADNWAKGDNGNFTD 1200
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ + G I EW
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGDSGQIKEW 1228
>gi|393222468|gb|EJD07952.1| glycoside hydrolase family 95 protein [Fomitiporia mediterranea
MF3/22]
Length = 835
Score = 192 bits (489), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 170/612 (27%), Positives = 281/612 (45%), Gaps = 68/612 (11%)
Query: 17 FNGPAKHFTDA-IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDA 68
++ P + +T +P+GNG L AM GG E+ +LN ++LW+G P G PD
Sbjct: 36 YDAPGQIWTQHYLPLGNGFLAAMTPGGTLQESTQLNIESLWSGGPFADPAYNGGNKQPDE 95
Query: 69 PKALSDVRSLVDSGQYAEATAAS--VKLFGHPADVYQLL---GDIELEFDDSHLKYAEET 123
A++ + + +T + V + P D Y G + +S L +
Sbjct: 96 QAAMAQAMQSIRQSIFNSSTGITDNVDVLMTPIDAYGSYSGAGFLVSTLQNSSLSNISD- 154
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---L 180
+ R LDL++ + ++ N +F+RE F S+P Q V S + S + +L + L
Sbjct: 155 FGRFLDLDSGLTKTIWNEDNAQFSRETFCSHPTQACVQNTSTAASSGFTQTYALAAASGL 214
Query: 181 LDNHSYVNGNNQIIMEGRC--PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ N + + G PG A P G L+ + + T +
Sbjct: 215 PAPNVTCTDNATLRLNGLVAEPGMAYELLATVAVPPGGT-----LKCTVVPNMDTTDNVV 269
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDS-------KKDPTSESMSALQSIRNLSYS 291
+ + V A ++ V +++D IN D+ DP + + L S SYS
Sbjct: 270 NATITVSNVTSASVVWVGGTNYD---INAGDAVHNFSFRGPDPHDDLVPLLSSASKKSYS 326
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE 351
+L + H+ DY+ H S+ L + + ++DT + + + ++ D+ VE
Sbjct: 327 ELLSDHVADYEATLHAFSLDLGQ-----------KADLDT-STDKLINAYTVDKGDVYVE 374
Query: 352 -LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
LLF +GR+LL SSSR G ANLQG W D P W + H++IN+EMNYW + NL +
Sbjct: 375 WLLFNYGRHLLASSSR-GILPANLQGKWAVDAFPAWGADYHLDINVEMNYWLAEMTNL-D 432
Query: 411 CQEPLFDFL--TYLSINGSKTAQVNY-LASGWVIHHKT--DIWAKSSADRGKVVWALWPM 465
+PLF+++ TY + G+ TAQV Y + GWV+H + I+ + G+ W +P
Sbjct: 433 VSKPLFNYIAKTY-APRGAYTAQVLYNITQGWVVHTEVMFKIFGYTGMKVGEAEWYDYPE 491
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSP 522
AWL ++W+H++YT D + + + YPLL+G A F L+ LI DG L P SP
Sbjct: 492 PNAWLMLNVWDHFDYTNDVAWWKAQGYPLLKGVALFHLEKLIPDEHFLDGTLVVAPCNSP 551
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RP 581
E I LAC +I ++ +AI A + +++ + V + ++ +
Sbjct: 552 EQAPI----TLACAH-----SQQLIWQLLNAIEKGAAAAGETDESFLNDVRAKIAQMDKG 602
Query: 582 TKIAEDGSIMEW 593
I G + EW
Sbjct: 603 IHIGSWGQLQEW 614
>gi|146386781|pdb|2EAD|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complex With Substrate
gi|146386782|pdb|2EAD|B Chain B, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complex With Substrate
Length = 899
Score = 192 bits (487), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 165/625 (26%), Positives = 286/625 (45%), Gaps = 82/625 (13%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 52 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 111
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 112 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 169
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 170 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 226
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 227 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 280
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+DD+ ++ RV I L +
Sbjct: 281 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 340
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 341 SGHSSDGAVAT----DALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 396
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 397 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 456
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 457 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 515
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SP + D
Sbjct: 516 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPAQGPLGTD 574
Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
G +Y S++ ++ + A + +A+ KN+ DA +
Sbjct: 575 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 630
Query: 572 ---VLKSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ + G I EW
Sbjct: 631 SWSCAKSL--LKPIEVGDSGQIKEW 653
>gi|383115618|ref|ZP_09936374.1| hypothetical protein BSGG_2513 [Bacteroides sp. D2]
gi|313694978|gb|EFS31813.1| hypothetical protein BSGG_2513 [Bacteroides sp. D2]
Length = 793
Score = 191 bits (485), Expect = 9e-46, Method: Compositional matrix adjust.
Identities = 166/604 (27%), Positives = 259/604 (42%), Gaps = 108/604 (17%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
++PIGNG +GA ++G +E ++L E T GV G Y
Sbjct: 58 SLPIGNGYMGACIFGRTDTERIQLTEKTF--GVKGPYKKGG------------------- 96
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
G+ A++Y IE D L Y+R L LN A +RV Y V +
Sbjct: 97 --------IGNFAEIY-----IEGIHHDQPL-----NYKRSLRLNDAISRVNYQYEGVNY 138
Query: 147 TREHFSSNPDQVIVTKISGSESGSLSFNVS-----LDSLLDNHSYVNG-----NNQIIME 196
TRE+F++ P VIV K+ + G +SF + L D + G N+ I +
Sbjct: 139 TREYFANYPSNVIVVKLKADQPGKISFTLRPVLPYLHEYNDEGTGRTGKVSAQNDLITLT 198
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G R+P +A P G Q A+ +D+ G + ++++ +D VLL+ A
Sbjct: 199 GDIQFFRLPYEAQIKVIPSGGQLKAM-----NDELGN-----NGTIRIQQADSVVLLINA 248
Query: 257 -------SSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
SS F N + P +Q + Y L H+ DYQ LF RV
Sbjct: 249 QTAYQLKSSVFTASPENKFTGNEHPHRAVSQCIQKAADKGYEALCKEHIADYQSLFSRVD 308
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L I TD+ + +R K E + ELLFQ+GRYLLI+SSR G+
Sbjct: 309 LHLCNETPGIPTDSLLHD-------YQRGK-----ESLYMDELLFQYGRYLLIASSRKGS 356
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
+LQG W++ W NIN++MNYW + NL+E F+ Y+ N +
Sbjct: 357 LPPHLQGAWSQYEYAPWSGGYWHNINIQMNYWAAFNTNLAEV------FIPYVEYNEAFR 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW---------------LCTHL 474
N A+G++ + D + + G W + A+ T L
Sbjct: 411 QSANEKATGYIKKNNPDALSAIPEENG---WTIGTGANAFSIDSPGGHSGPGTGGFTTKL 467
Query: 475 -WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
W++Y++T D D L+K +YP + G A FL L + YL +PS+SPE +
Sbjct: 468 FWDYYDFTRDEDILKKHSYPAMLGMAKFLSKTLKPTEEEYLLADPSSSPEQYHNGTTYQT 527
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++ D +I E F ++ AA++L K E + + + + +L +I E G I E+
Sbjct: 528 KGCAF----DQGMIWESFHDVLKAADIL-KEESPFLRTIKEQIGKLDAIQIGESGQIKEY 582
Query: 594 VQRR 597
+ +
Sbjct: 583 REEK 586
>gi|149199701|ref|ZP_01876733.1| hypothetical protein LNTAR_25135 [Lentisphaera araneosa HTCC2155]
gi|149137218|gb|EDM25639.1| hypothetical protein LNTAR_25135 [Lentisphaera araneosa HTCC2155]
Length = 1754
Score = 191 bits (485), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 166/603 (27%), Positives = 270/603 (44%), Gaps = 117/603 (19%)
Query: 29 PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEAT 88
PIGNG GA ++G +E +++ + TL + G+Y +
Sbjct: 63 PIGNGYTGANIFGRTDTERIQITDKTL-----------------------HNRGKYNKGG 99
Query: 89 AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTR 148
S E++ D H K+++ YRR L+LN A V Y+ V +TR
Sbjct: 100 LTSF---------------AEIKLDFRHHKFSK--YRRSLNLNEGIAHVAYNYRGVNYTR 142
Query: 149 EHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKA 208
E+F+S PD VIV +++ + +LSF + + +G+
Sbjct: 143 EYFASYPDNVIVIRLTADKKAALSFEIRPEIPYLERKERSGS-----------------I 185
Query: 209 NANDDPKGIQFSAIL-------EIKISDDRGTISA-LEDKKLKVEGSDWAVLLLVASSSF 260
+A DD ++ S L +IK+ ++ GT+ A + ++V +D +L+ +++
Sbjct: 186 SAKDDLLTLKGSIALFSCNFDGQIKVLNEGGTLKANAKQGSIEVSKADAVTILIATGTNY 245
Query: 261 ---DGPFINPSDSKKDPT----SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
+ F N S K +P +E + +Q+ +N Y L RHL DYQ LF RV++ L+
Sbjct: 246 RLHEDTFRNTSAKKLNPKEFPHNEVSARIQAAQNRGYEQLKERHLKDYQNLFGRVAVNLN 305
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
P + T E+ K+ +T+ L EL+FQ+GRYLLISSSR + AN
Sbjct: 306 SRPSNDPTHIL----------LEKYKAGKTNN--WLEELMFQYGRYLLISSSREKSLPAN 353
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL-TYLSINGSKTAQV 432
LQG W++D W NIN++MNYW S+ NL+EC + +F YL I ++
Sbjct: 354 LQGAWSQDYYTPWSGGFWHNINVQMNYWGSMSTNLAECFQSYTNFYKAYLPI--AREHAT 411
Query: 433 NYLA------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+Y+ +GW+I + + SA G + L ++Y +
Sbjct: 412 DYVQKYNPSQVTKGGDNGWIIGTGANAYYIPSAGGHSGP-----GTGGFTAKLLMDYYLF 466
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD------GKLA 534
T D+ +LE+ AYP + + F LI H L PS SPE + P+ GKL
Sbjct: 467 TQDKQYLEEVAYPAMLSLSKFYSKVLIP-HGDKLLVEPSASPE-QLAKPEQVKNMPGKLK 524
Query: 535 CVSY----SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
Y T D + E F+ ++ A+ L +ED ++ + + + +L P I DG I
Sbjct: 525 GGKYYVTAGCTFDQGFVWESFADTLTLADAL-GSEDPFLDTIREQITKLDPILIGADGQI 583
Query: 591 MEW 593
E+
Sbjct: 584 KEY 586
>gi|311063634|ref|YP_003970359.1| 1,2-A-L-fucosidase [Bifidobacterium bifidum PRL2010]
gi|310865953|gb|ADP35322.1| 1,2-A-L-Fucosidase [Bifidobacterium bifidum PRL2010]
Length = 1959
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 164/630 (26%), Positives = 285/630 (45%), Gaps = 92/630 (14%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTRYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGADGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGKGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSANNWAKGDNGNFTD 1200
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ + G I EW
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGDSGQIKEW 1228
>gi|115384756|ref|XP_001208925.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114196617|gb|EAU38317.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1276
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 163/595 (27%), Positives = 263/595 (44%), Gaps = 96/595 (16%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T A P+GNGRLG + G G+ N A +AL +R +
Sbjct: 556 ITTAFPLGNGRLGEKAYAG------------------GNPNNCRA-EALPGIRDFI---- 592
Query: 84 YAEATAASVKLFGH-PA-DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV 141
+ T L G P+ YQ+LG++ ++ + YRR LD+ + ++V
Sbjct: 593 FQNGTGNVSALLGEFPSYGSYQVLGNLTIDLGELE---NVRGYRRRLDMKSGVYTDGFAV 649
Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG 201
GN + R F S PDQV V IS + + S + L+ NQ++ P
Sbjct: 650 GNALYNRTAFCSYPDQVCVYHISSANASLPSVEIGLE------------NQVV----SPA 693
Query: 202 KRIPPKANA-----NDDPK-GIQFSA----ILEIKISDD--RGTISALEDKKLKVEGSDW 249
+ AN+ P G+ ++A ++ K S D GT+ + + +V
Sbjct: 694 PNVTCHANSISLYGQTFPTIGMIYNARATVVVPGKSSGDFCAGTVVRVPSGQKEV----- 748
Query: 250 AVLLLVASSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
++L A +++D N S DP + + SY+ L + H+ D++ +
Sbjct: 749 -YIVLAADTNYDASKGNAAAKFSFRGSDPYEKVLQTASKAAKKSYAQLKSSHVKDFRAIS 807
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
++ L D+ + P+ E + ++ DP + LLF +GRYL +SSS
Sbjct: 808 DGFTLTLPDR-----RDSAGK------PTTELIAAYTQPGDPFIEGLLFDYGRYLFMSSS 856
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL--TYLS 423
R G+ NLQG+W E SP W + H NINL+MN+W L E EPL+ ++ T+L
Sbjct: 857 RAGSLPPNLQGLWTEQASPAWSADYHANINLQMNHWAVEQVGLGELTEPLWKYMADTWLP 916
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G +TA++ Y GWV H + +++ +A + WA +P AW+ H+W+H++YT D
Sbjct: 917 -RGQETARLLYGGEGWVTHDEMNVFGH-TAMKNDAQWANYPAVNAWMSQHVWDHFDYTQD 974
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
+ + YP+L+G A F L L++ +DG NP SPEH P C +Y
Sbjct: 975 AAWYQSMGYPILKGAAQFWLSQLVQDEHFNDGTWVVNPCNSPEH---GPT-TFGCTNYQQ 1030
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRL-RPTKIAEDGSIMEW 593
+I E+F ++ ++D L + + S L I G I EW
Sbjct: 1031 -----LIWELFDHVLRGWTA-SGDKDRLFRRAIASKFAALDNGIHIGSWGQIQEW 1079
>gi|149197418|ref|ZP_01874469.1| hypothetical protein LNTAR_00515 [Lentisphaera araneosa HTCC2155]
gi|149139436|gb|EDM27838.1| hypothetical protein LNTAR_00515 [Lentisphaera araneosa HTCC2155]
Length = 980
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 169/598 (28%), Positives = 272/598 (45%), Gaps = 65/598 (10%)
Query: 23 HFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSG 82
H+ DA PIG+GRLG MV+G V L + W T PD L VR L G
Sbjct: 197 HWRDAYPIGSGRLGGMVYGDVNEARFMLQDARHWFNHASSSTMPDLSGLLQQVRDLQKQG 256
Query: 83 QYAEATAASVKLF---GHPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
+YA+A F + A++ L GD+ + + ++ Y+R LDL +
Sbjct: 257 KYADANVLYRNAFKGKNYRANIGSPLSIGDLVIRSNAKNI----SQYQRTLDLKKSETHT 312
Query: 138 KYSVGNVEFTREHFSS---NPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN--- 191
+S V++TR+ F S + V+ +++ ++ +L +V L L N G+
Sbjct: 313 AWSNEGVDYTRKAFISRIGDSKDVLFVQLNAKQAKALDISVHLG--LHNPDKARGSRPKA 370
Query: 192 -----QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G K + P N F A+ + IS D G + E +K++G
Sbjct: 371 FRPSVNVDFAGHIQYKALNP----NTTSALKDFGAVARV-ISHD-GELKE-EIDHVKIKG 423
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
A +L+A +F+ +D D + + L+ +Y H+ +Q LF+
Sbjct: 424 ---ASQILIAVKTFNSA---DADEAIDRITRELYKLKG----TYQTYLNPHVKAHQGLFN 473
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
S+ L S D +EE I K+ + D + + VE L+ GRYL I SR
Sbjct: 474 AASVDLKASKDD--RALSNEELI--------AKARKLDLENAFVERLWAMGRYLSIVGSR 523
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHV-NINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
G +L G+WN D +PTW A H+ NIN+ M +W + NLSE P FD
Sbjct: 524 KGGHPVHLTGLWNGDYNPTW--AIHLMNINMPMIHWHLMDGNLSELMLPFFDMFDRQLPA 581
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV--WALWPMGG-AWLCTHLWEHYNYTM 482
+ A+ Y G I+ + + KV+ + MG AW+ H W++Y +T+
Sbjct: 582 SRENARKLYGLDGIYIN---PLLGNNEDGLLKVISPHLIHMMGNNAWVAQHYWDYYTFTL 638
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC------V 536
D+ FL +RA PL+E A+F +LIE DG+ + PS SPE+ + +G
Sbjct: 639 DKKFLAERAVPLMEEAATFYEGFLIENEDGFYDITPSNSPENSPLNAEGHRLIPNRHIDT 698
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWV 594
++T + A IRE+F+ +I A+ L N+ + + + + +LRP +I +G + EW+
Sbjct: 699 HINATWEYAAIREMFTNLIEASNTLAINQSKIAD-WKEVIAKLRPYEINAEGGVREWL 755
>gi|421310055|ref|ZP_15760680.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA62681]
gi|395909670|gb|EJH20545.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA62681]
Length = 709
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 147/513 (28%), Positives = 236/513 (46%), Gaps = 59/513 (11%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + +L F + L D S + C I K D+
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E ++D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN D H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNSDY--------HLN 299
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 300 VNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 358
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 359 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 415
Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 416 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 466
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ED L E KS L P +I + G I EW +
Sbjct: 467 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYE 498
>gi|313139434|ref|ZP_07801627.1| alpha-fucosidase [Bifidobacterium bifidum NCIMB 41171]
gi|313131944|gb|EFR49561.1| alpha-fucosidase [Bifidobacterium bifidum NCIMB 41171]
Length = 1959
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 164/630 (26%), Positives = 284/630 (45%), Gaps = 92/630 (14%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSTDNWAKGDNGNFAD 1200
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ G I EW
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGNSGQIKEW 1228
>gi|238482581|ref|XP_002372529.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
NRRL3357]
gi|220700579|gb|EED56917.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
NRRL3357]
Length = 785
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 162/614 (26%), Positives = 268/614 (43%), Gaps = 59/614 (9%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
++ S +T N L + +A +GNG+LG M +G +E L N D LW G P
Sbjct: 6 LLGMSSFATANSLWSSKAASWDTTNEAYTLGNGKLGVMPFGEPGAEKLNYNHDELWEGGP 65
Query: 61 -------GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELE 111
G N + LS+VR + + + T +L G + L ++ +
Sbjct: 66 FEVDGYRGGNPNSSMTEILSEVRDEI----WKKGTGNDSRLHGDTDGYGSFHSLANLTIA 121
Query: 112 FDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
D H K ++ Y R LDL T YS G ++T + + S P QV + K++ + + S
Sbjct: 122 IDGIH-KVSD--YTRSLDLGTGIHTTTYSTGKGKYTTDVYCSYPAQVCIYKLNSTAALS- 177
Query: 172 SFNVSLDSLLDNHSYVNGN-NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
+ D L++ S N + R + PP+ G+ + I I
Sbjct: 178 KVTIYFDQLVEESSLWNATCDSDFARLRGVTQEGPPR--------GMTYDTIARSSIPGR 229
Query: 231 RGTISALEDKKLKVEGSDWAVLLLV--ASSSFDG----PFINPSDSKKDPTSESMSALQS 284
+ + KL + + + L +V A + FDG + + +DP S
Sbjct: 230 CDSSTG----KLAINARNSSSLTIVIGAGTDFDGTKGTAATDYTFKGEDPAEYVEKITSS 285
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
+ S S L T H++DY L ++ L DT + + +TD
Sbjct: 286 ALSQSESKLRTEHIEDYSGLMSAFTLDLP--------DTQDSTGTELSTLITNYNANKTD 337
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
DP L +LLF +GR+L ISSSR + NLQG+W+ + W H NINL+MN W +
Sbjct: 338 GDPYLEKLLFDYGRHLFISSSRANSLPPNLQGVWSPTKNAAWSGDYHANINLQMNLWGAE 397
Query: 405 PCNLSECQEPLFDFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
+ E +F+++ + G++TA++ Y +GWV H + +I+ + + A +
Sbjct: 398 ATGIGELTVAVFNYMEQNWMPRGAETAELLYGGAGWVTHDEMNIFGHTGMKTYQTS-ANY 456
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPST 520
P AW+ H+W+ Y+Y+ ++ + K+ +PLL+G A F L +D L NP T
Sbjct: 457 PAAPAWMMQHVWDRYDYSHNKTWFIKQGWPLLKGVAEFWASQLQVDKFNNDSSLVVNPCT 516
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL- 579
SPE ++ T +I +V+ I AE+ + + L++ + LPRL
Sbjct: 517 SPEQ---------GPTTFGCTHWQQLIHQVYENAIQGAEIAGETDSTLLKDIKDQLPRLD 567
Query: 580 RPTKIAEDGSIMEW 593
+ I G I EW
Sbjct: 568 KGLHIGTWGQIKEW 581
>gi|421735948|ref|ZP_16174814.1| alpha-L-fucosidase, partial [Bifidobacterium bifidum IPLA 20015]
gi|407296769|gb|EKF16285.1| alpha-L-fucosidase, partial [Bifidobacterium bifidum IPLA 20015]
Length = 1935
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 164/630 (26%), Positives = 284/630 (45%), Gaps = 92/630 (14%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 622 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 681
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 682 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 739
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 740 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 796
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 797 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGADGASLKVSDAKAVTLYIAA 850
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 851 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 910
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 911 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 966
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 967 LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1026
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1027 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1085
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1086 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1144
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1145 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGNTTDCSADNWAKGDNGNFTD 1195
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ + G I EW
Sbjct: 1196 ANANRSWSCAKSL--LKPIEVGDSGQIKEW 1223
>gi|390936092|ref|YP_006393651.1| alpha-L-fucosidase [Bifidobacterium bifidum BGN4]
gi|389889705|gb|AFL03772.1| alpha-L-fucosidase [Bifidobacterium bifidum BGN4]
Length = 1959
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 164/630 (26%), Positives = 284/630 (45%), Gaps = 92/630 (14%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSADNWAKGDNGNFTD 1200
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ + G I EW
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGDSGQIKEW 1228
>gi|154305361|ref|XP_001553083.1| hypothetical protein BC1G_08975 [Botryotinia fuckeliana B05.10]
Length = 792
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 167/600 (27%), Positives = 271/600 (45%), Gaps = 78/600 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDAPKALSDVRSLV 79
A PIGNG+L A+ +G SE L LN+D+LW G P G N +LS +R +
Sbjct: 39 AYPIGNGQLAALPFGTPGSEKLNLNKDSLWNGGPFGDASYIGGNPNSSVSSSLSGIRDFI 98
Query: 80 DSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
+ T L G + YQ+L ++ + + A E Y+R LDLNT
Sbjct: 99 ----FQNGTGNVTALMGSDDNYGSYQVLANLSVSLQG--ISGATE-YKRSLDLNTGIHTT 151
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGS---LSF-NVSLDSLLDNHSYVNGNNQI 193
+ N +T F S PD V V +++ + + S + F NV DS L S +
Sbjct: 152 TFKTSNSSYTTAVFCSYPDSVCVYQVNSTTTLSKIDVHFDNVLTDSSLIKSSCSKSSKSA 211
Query: 194 IMEGRCP---GKRIPPKANANDDPKGIQFS---AILEIKISDDRGTISALEDKKLKVEGS 247
+ G G +A + K + S IL I S D+ ++S
Sbjct: 212 LFSGITQADIGMIYKAEARVLESTKSVSCSNTTGILSITPSHDQKSLS------------ 259
Query: 248 DWAVLLLVASSSFDGPFINPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
L++ A +++D +D+ DPT+ S + + + L +H+ D+
Sbjct: 260 ----LVISAGTNYDATKGTAADNYSFKGVDPTAYVSSTIAKAASKTVKTLRNKHVSDFSA 315
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE----DPSLVELLFQFGRY 359
L + ++ L D N +T A + ++ T + DP + LLF + RY
Sbjct: 316 LMNSFTLSLP--------DPLGSANKET---AAVIAAYNTTDNTHTDPWVENLLFDYSRY 364
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
L ISSSR + NLQG W L W + H NIN++MN+W ++ L + Q L+ ++
Sbjct: 365 LFISSSRDNSLPPNLQGKWAYGLYNAWGADYHANINIQMNHWGAVQTGLGDLQSALWTYM 424
Query: 420 T-YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ + G++TA++ Y A GWV+H + +I+ + G WA +P +WL H+ ++Y
Sbjct: 425 SETWAPRGAETAKLLYNAPGWVVHDEMNIFGHTGMKTGDEYWADYPAAASWLMQHVADYY 484
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLAC 535
+Y+ D+++L + YPLL+ + F L L + +DG L NP +SPEH P C
Sbjct: 485 DYSRDKNWLRETGYPLLKAVSEFWLSQLQKDEYFNDGTLVVNPCSSPEH---GPT-TFGC 540
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS--LPRLRPTKIAEDGSIMEW 593
Y +I +F+ + AA L D+ ++K L + L + I+ I EW
Sbjct: 541 THYQQ-----LIHSLFTTTLQAARTLSL--DSTLQKSLTTSLLSLDKGLHISPTTQIQEW 593
>gi|347826700|emb|CCD42397.1| glycoside hydrolase family 95 protein [Botryotinia fuckeliana]
Length = 792
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 167/600 (27%), Positives = 271/600 (45%), Gaps = 78/600 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDAPKALSDVRSLV 79
A PIGNG+L A+ +G SE L LN+D+LW G P G N +LS +R +
Sbjct: 39 AYPIGNGQLAALPFGTPGSEKLNLNKDSLWNGGPFGDASYIGGNPNSSVSSSLSGIRDFI 98
Query: 80 DSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
+ T L G + YQ+L ++ + + A E Y+R LDLNT
Sbjct: 99 ----FQNGTGNVTALMGSDDNYGSYQVLANLSVSLQG--ISGATE-YKRSLDLNTGIHTT 151
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGS---LSF-NVSLDSLLDNHSYVNGNNQI 193
+ N +T F S PD V V +++ + + S + F NV DS L S +
Sbjct: 152 TFKTSNSSYTTAVFCSYPDSVCVYQVNSTTTLSKIDVHFDNVLTDSSLIKSSCSKSSKSA 211
Query: 194 IMEGRCP---GKRIPPKANANDDPKGIQFS---AILEIKISDDRGTISALEDKKLKVEGS 247
+ G G +A + K + S IL I S D+ ++S
Sbjct: 212 LFSGITQADIGMIYKAEARVLESTKSVSCSNTTGILSITPSHDQKSLS------------ 259
Query: 248 DWAVLLLVASSSFDGPFINPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
L++ A +++D +D+ DPT+ S + + + L +H+ D+
Sbjct: 260 ----LVISAGTNYDATKGTAADNYSFKGVDPTAYVSSTIAKAASKTVKTLRNKHVSDFSA 315
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE----DPSLVELLFQFGRY 359
L + ++ L D N +T A + ++ T + DP + LLF + RY
Sbjct: 316 LMNSFTLSLP--------DPLGSANKET---AAVIAAYNTTDNTHTDPWVENLLFDYSRY 364
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
L ISSSR + NLQG W L W + H NIN++MN+W ++ L + Q L+ ++
Sbjct: 365 LFISSSRDNSLPPNLQGKWAYGLYNAWGADYHANINIQMNHWGAVQTGLGDLQSALWTYM 424
Query: 420 T-YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ + G++TA++ Y A GWV+H + +I+ + G WA +P +WL H+ ++Y
Sbjct: 425 SETWAPRGAETAKLLYNAPGWVVHDEMNIFGHTGMKTGDEYWADYPAAASWLMQHVADYY 484
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLAC 535
+Y+ D+++L + YPLL+ + F L L + +DG L NP +SPEH P C
Sbjct: 485 DYSRDKNWLRETGYPLLKAVSEFWLSQLQKDEYFNDGTLVVNPCSSPEH---GPT-TFGC 540
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS--LPRLRPTKIAEDGSIMEW 593
Y +I +F+ + AA L D+ ++K L + L + I+ I EW
Sbjct: 541 THYQQ-----LIHSLFTTTLQAARALSL--DSTLQKSLTTSLLSLDKGLHISPTTQIQEW 593
>gi|421734699|ref|ZP_16173762.1| alpha-L-fucosidase [Bifidobacterium bifidum LMG 13195]
gi|407077388|gb|EKE50231.1| alpha-L-fucosidase [Bifidobacterium bifidum LMG 13195]
Length = 1954
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 164/630 (26%), Positives = 283/630 (44%), Gaps = 92/630 (14%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 622 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 681
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 682 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 739
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 740 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 796
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 797 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGADGASLKVSDAKAVTLYIAA 850
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 851 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 910
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 911 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 966
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 967 LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1026
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1027 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1085
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE----GHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1086 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1144
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1145 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSADNWAKGDNGNFTD 1195
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW 593
KSL L+P ++ G I EW
Sbjct: 1196 ANANRSWSCAKSL--LKPIEVGNSGQIKEW 1223
>gi|379719129|ref|YP_005311260.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|378567801|gb|AFC28111.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
Length = 913
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 167/600 (27%), Positives = 270/600 (45%), Gaps = 71/600 (11%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
+ +A+P GNG +GA V G + SET+ L LWTG P+ L+++R L+D G
Sbjct: 120 WREALPSGNGLIGAAVHGAIGSETVLLTHAELWTGGT-KQELPEVSGTLAEIRRLMDEGA 178
Query: 84 YAEATAASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEET----YRRELDLNTATARVK 138
Y EA L G + Y+ + + L D + + YRRELDL T V+
Sbjct: 179 YREANGL---LEGRLREAGYEPVRETPLPLADLKVVRTAQAGFRRYRRELDLETGEVSVR 235
Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-------SLLDNHSYVNGNN 191
+ G + R+ F S D +IV ++ GS G + + L S D SYV+ +
Sbjct: 236 WEEGAAAYERKLFVSRSDDLIVYEL-GSRGGCVDVALLLQPHEKGTASRPDMPSYVSESL 294
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEG 246
+I A NDD G F A+L ++ +D+G +L V G
Sbjct: 295 EI-----TAADGFLRYAARNDD--GRDFGAVLRAVPAGGRLGEDQG--------RLSVTG 339
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS--YSDLYTRHLDDYQKL 304
+D VL+LV F G D + E +R ++ YS+L RH + L
Sbjct: 340 AD-KVLILV--KVFAG---------GDRSQEWTRLEAELREVAWTYSELLDRHTALHGPL 387
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ L + ++ + T ++E + ++++ P+L EL++ +GRYL IS
Sbjct: 388 MRSADVHLGGAGEE-ASCTYTDELLQ--------EAYEGGLSPALAELMWAYGRYLFISG 438
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
+RPG L G+W D W S N N++M YW + LSE P+ D+
Sbjct: 439 TRPGGLPFGLYGLWCGDYKAVW-SHFMANENVQMMYWHAAAGGLSELILPMLDYYESRLE 497
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
A+ Y G I T V+ W WL H +E+Y +T D
Sbjct: 498 IFRDNARKLYGCRGIFIPAGTTPGMAEPFQTVPVI-MHWTGAAGWLARHFYEYYRFTGDL 556
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH---EFIAPDGKLACVSYS-- 539
+FL +RA P ++ A F D+L+EG DG L + PS SPE+ +I+ +G ++++
Sbjct: 557 EFLRRRALPFMKEAALFYEDFLVEGEDGRLVSYPSVSPENTPGNYISEEGVFGAMAHAMP 616
Query: 540 ----STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ +D AI++E+ + ++ A E+ + E V + L R+ ++ DG++ EW+
Sbjct: 617 TAVNALLDFAILKELLTDLLEAVELTGEGEPEAVRRWSVLLERIPAYEVNGDGAVREWLH 676
>gi|156041112|ref|XP_001587542.1| hypothetical protein SS1G_11535 [Sclerotinia sclerotiorum 1980]
gi|154695918|gb|EDN95656.1| hypothetical protein SS1G_11535 [Sclerotinia sclerotiorum 1980
UF-70]
Length = 796
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 162/590 (27%), Positives = 257/590 (43%), Gaps = 58/590 (9%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDAPKALSDVRSLV 79
A PIGNG+L A+ +G SE L LN D+LW G P G N L +R +
Sbjct: 39 AYPIGNGQLAALPFGEPGSEKLNLNRDSLWNGGPFENASYNGGNPNFSVASTLPGIRDWI 98
Query: 80 DSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATAR 136
+ T L G + YQ+LG++ + ++ T Y+R LDL T
Sbjct: 99 ----FRNGTGNVTTLMGSDDNYGSYQVLGNLSVSLQG----ISDATGYKRSLDLGTGIHT 150
Query: 137 VKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIME 196
++ NV FT F S PD V V +++ S + ++ D+L + S V + +
Sbjct: 151 TTFNTANVSFTTAVFCSYPDSVCVYQVN-STATLPRIDIYFDNLQADSSLVKSSCSTSSK 209
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
+ + + S+ GT+S + L++ A
Sbjct: 210 SALFSGITQADIGMIYKAEARVIESAKSVSCSNTTGTLSIIPSNNQHS-----LSLVISA 264
Query: 257 SSSFDG----PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+++D N S +DP++ + + ++ L HL D+ L + ++ L
Sbjct: 265 GTNYDATKGTAAHNYSFKGEDPSNYVSKTVAKAASKTFKTLRKNHLADFSALINTFTLSL 324
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE----DPSLVELLFQFGRYLLISSSRPG 368
D N +T A + ++ T E DP L LLF + RYL ISSSR
Sbjct: 325 P--------DPLGSANKET---ATVISAYNTTENSHTDPWLESLLFDYSRYLFISSSRDN 373
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT-YLSINGS 427
+ NLQG W LS W H NINL+MN+W + L + Q L+ ++ + GS
Sbjct: 374 SLPPNLQGKWAYGLSNAWGGDYHSNINLQMNHWVADQTGLGDLQSALWSYMAETWAPRGS 433
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+TA++ Y A GWV+H + +I+ + G WA +P +WL H+ ++Y+Y+ D +L
Sbjct: 434 ETAKLLYNAPGWVVHDEMNIFGHTGMKTGDEYWADYPAAASWLMQHVADYYDYSRDETWL 493
Query: 488 EKRAYPLLEGCASFLLDWL---IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+ YPLL+ + F L L + +DG L NP +SPEH P C Y
Sbjct: 494 KNTGYPLLKAISEFWLSQLQKDVYFNDGTLVVNPCSSPEH---GPT-TFGCTHYQQ---- 545
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
+I VF++ + AA L ++ L + +L L + I+ I EW
Sbjct: 546 -LIHAVFTSTLQAARTLST-DNTLQNTLQSTLTTLDKGLHISPLTQIQEW 593
>gi|429725254|ref|ZP_19260100.1| hypothetical protein HMPREF9999_00368 [Prevotella sp. oral taxon
473 str. F0040]
gi|429150389|gb|EKX93300.1| hypothetical protein HMPREF9999_00368 [Prevotella sp. oral taxon
473 str. F0040]
Length = 1038
Score = 188 bits (477), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 183/618 (29%), Positives = 277/618 (44%), Gaps = 107/618 (17%)
Query: 11 NPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT- 64
NPL + + PA + ++PIGNG+LGA ++GGV ++ ++ NE TLW G P D
Sbjct: 201 NPLTLWYPSPANAGPNPWMEYSLPIGNGQLGACIFGGVKTDEIQFNEKTLWWGTPKDMQR 260
Query: 65 -NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
N D P V FG Y G + ++ +++L ++
Sbjct: 261 QNGDGP----------------------VSGFG----CYLNFGGLFVQNLNANLSQVKD- 293
Query: 124 YRRELDLNTATARVKYS-VGNVEFTREHFSSNPDQVIVT--KISGSESGSLSFN-VSLDS 179
Y R LD+ TA A VK++ ++TR + SS PD VI + +G L F +S D+
Sbjct: 294 YVRYLDIQTAVAGVKFTDEAGTQYTRRYLSSQPDGVIAALYEANGKNKLDLQFTLISGDT 353
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
L + + G+ P I A P G GT++A D
Sbjct: 354 LKTKKTEYTADGSGWFAGKLP--TIFHNARFKVVPVG---------------GTLTATAD 396
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL-QSIRNLSYSDLYTRHL 298
+ V+G++ +++L +SF + D + ++AL + S+ + ++
Sbjct: 397 G-IVVKGAEKVMVILAGGTSFAPTLPERTKGTADDLNARITALVDNAAKKSFEAIEAANI 455
Query: 299 DDYQKLFHRVSIQL-----SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
D+Q RV+ L R+ KD+V + N + T + L +L
Sbjct: 456 ADHQSYMSRVAFHLEGAASQRNTKDLVDYYSAAPN-----------NRNTADGLFLEQLY 504
Query: 354 FQFGRYLLISSSRPGTQVAN-LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
F FGRYL ISSSR V N LQGIWN W+S H NIN++MNYW + P NLS+C
Sbjct: 505 FNFGRYLSISSSRGSMPVPNNLQGIWNNRHDAPWNSDVHNNINVQMNYWPAEPTNLSDCH 564
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA-----------SGWVIHHKTDIWAKSSADRGKVVWA 461
P FL Y+ IN S++ A GW + +++I+ G W+
Sbjct: 565 MP---FLNYI-INNSQSEGWQRAAREFNKINGKSNKGWTVFTESNIFG------GMSTWS 614
Query: 462 L-WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
+ + AWL HLW+HY YT+D+DFL +RA+P + G A F + L + +DG E
Sbjct: 615 SNYCVANAWLVYHLWQHYRYTLDQDFL-RRAWPAIWGSAEFWIHRLKKANDGTYEAPNEW 673
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED-ALVEKVLKSLPR- 578
SPE+ DG +A T ++ I +V I+ A V +ED L+ L L +
Sbjct: 674 SPEYG-PKQDG-VAHAQQLITENLQIAHDVVE-ILGAKNVGISDEDLKLLNDRLTHLDKG 730
Query: 579 LRPTKIAEDGSIMEWVQR 596
LR K D W QR
Sbjct: 731 LRIEKYRND-----WAQR 743
>gi|427386362|ref|ZP_18882559.1| hypothetical protein HMPREF9447_03592 [Bacteroides oleiciplenus YIT
12058]
gi|425726402|gb|EKU89267.1| hypothetical protein HMPREF9447_03592 [Bacteroides oleiciplenus YIT
12058]
Length = 817
Score = 188 bits (477), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 162/605 (26%), Positives = 266/605 (43%), Gaps = 112/605 (18%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
++PIGNG +GA ++G E ++L E T+ G G Y
Sbjct: 84 SLPIGNGAMGACIFGRTDVERIQLAEKTM--GNKGAY----------------------- 118
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
S+ F + A++Y D H YA+ Y+R L LN A + V Y E+
Sbjct: 119 ----SMGGFTNFAEIYL----------DIHHNYAQ-NYKRTLRLNDAISTVSYIHEGTEY 163
Query: 147 TREHFSSNPDQVIVTKISGSESGSLSFNVS-----LDSLLDNHSYVNGNNQ-----IIME 196
RE+F+SNP VI K+ S+ G +SF V L S + + +G+ Q I +E
Sbjct: 164 NREYFASNPANVIAVKLKASQPGMISFTVRPVLPYLHSFNNEQTGRSGHVQAEKDLITLE 223
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE----DKKLKVEGSDWAVL 252
G +P + +IKI + GT+S++ + + V +D +L
Sbjct: 224 GEIQYFHLPYEG---------------QIKIINYGGTLSSVNKGDNNSFINVSKADSVIL 268
Query: 253 LLVASSSF---DGPFINPSDSK----KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ ++S+ D F+ P+ K P + ++ Y L ++H+ DYQ F
Sbjct: 269 YITVATSYELKDSVFLLPNAEKFKGNAHPHGQVSKRIREAIEKGYECLRSKHIADYQHFF 328
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISS 364
+RV +QL+ E+ ++P+ + + ++ + D L EL FQ+GRYLLISS
Sbjct: 329 NRVDLQLT-------------EHTPSIPTDKLLNQYRNGKHDTYLEELFFQYGRYLLISS 375
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF------ 418
SR G+ ANLQG+WN+ W N+N++MNYW + NL+E P D+
Sbjct: 376 SRQGSLPANLQGVWNQYEFAPWSGGYWHNVNVQMNYWPAFNTNLAELFIPYMDYNEAFRK 435
Query: 419 ------LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
+ Y++ N + +GW I + S G +
Sbjct: 436 AATGKAVDYITQNNPEALDPTVEENGWTIGTGATAFGISGPGGHSGP-----GTGGFTTK 490
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
W++Y++T D+ L+ YP L G A FL L DG L +PS SPE I G
Sbjct: 491 LFWDYYDFTRDKQLLKDHVYPALMGMAKFLSKTLKPQPDGTLLVDPSFSPEQ--IHQQGY 548
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
S D ++I E + ++ AA++L +++ ++ V + + +L +I E G I E
Sbjct: 549 YR--SKGCIFDQSMILETYRDLLIAAKILN-DKNPFLKTVKEQIGKLDAIQIGESGQIKE 605
Query: 593 WVQRR 597
+ + +
Sbjct: 606 FREEK 610
>gi|317139357|ref|XP_001817454.2| alpha-fucosidase A [Aspergillus oryzae RIB40]
Length = 777
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 155/585 (26%), Positives = 260/585 (44%), Gaps = 51/585 (8%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVRSLVDSG 82
+A +GNG+LG M +G +E L LN D LW G P Y + +++++ S V
Sbjct: 23 EAYTLGNGKLGVMPFGEPGAEKLNLNHDELWEGGPFEVNGYRGGNPNSSMTEILSEVRDE 82
Query: 83 QYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
+ + T +L G + L ++ + D K ++ Y R LDL T YS
Sbjct: 83 IWKKGTGNDSRLHGDTDGYGSFHSLANLTIAIDGID-KVSD--YTRSLDLGTGIHTTTYS 139
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN-NQIIMEGRC 199
G ++T + + S P QV + K++ + + S + D L++ S N + R
Sbjct: 140 TGKGKYTTDVYCSYPAQVCIYKLNSTATLS-KVTIYFDQLVEESSLWNATCDSDFARLRG 198
Query: 200 PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV--AS 257
+ PP+ G+ + I I + + KL + + + L +V A
Sbjct: 199 VTQEGPPR--------GMTYDTIARSSIPGRCDSSTG----KLAINARNSSSLTIVIGAG 246
Query: 258 SSFDG----PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
+ FDG + + +DP S + S S L T H++DY L ++ L
Sbjct: 247 TDFDGTKGTAATDYTFKGEDPAEYVEKITSSALSQSESKLRTEHIEDYSGLMSAFTLDLP 306
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
DT + + +TD DP L +LLF +GR+L ISSSR + N
Sbjct: 307 --------DTQDSTGTELSTLITNYNANKTDGDPYLEKLLFDYGRHLFISSSRANSLPPN 358
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NGSKTAQV 432
LQG+W+ + W H NINL+MN W + L E +F+++ + G++TA++
Sbjct: 359 LQGVWSPTKNAAWSGDYHANINLQMNLWGAEATGLGELTVAVFNYMEQNWMPRGAETAEL 418
Query: 433 NYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
Y +GWV H + +I+ + + A +P AW+ H+W+ Y+Y+ ++ + ++ +
Sbjct: 419 LYGGAGWVTHDEMNIFGHTGMKTYQTS-ANYPAAPAWMMQHVWDRYDYSHNKTWFIEQGW 477
Query: 493 PLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
PLL+G A F L +D L NP TSPE ++ T +I +
Sbjct: 478 PLLKGVAEFWASQLQVDKFNNDSSLVVNPCTSPEQ---------GPTTFGCTHWQQLIHQ 528
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW 593
V+ I AE+ + + L++ + LPRL + I G I EW
Sbjct: 529 VYENAIQGAEIAGETDSTLLKDIKDQLPRLDKGLHIGTWGQIKEW 573
>gi|225019386|ref|ZP_03708578.1| hypothetical protein CLOSTMETH_03339 [Clostridium methylpentosum
DSM 5476]
gi|224947849|gb|EEG29058.1| hypothetical protein CLOSTMETH_03339 [Clostridium methylpentosum
DSM 5476]
Length = 1796
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 143/499 (28%), Positives = 240/499 (48%), Gaps = 57/499 (11%)
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y+R LDLNTA V Y + V +TR+ F++ PD V+V K+ S+ G+L F V + + D
Sbjct: 185 YQRYLDLNTAVTGVSYDIDGVTYTRQMFANFPDNVMVYKMDASKEGALDFTVRPE-IPDM 243
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAIL---EIKISDDRGTISALED 239
S +GN G+ + + N +G ++ + +L + K+ D GT++A D
Sbjct: 244 VSKASGNYDKTTMGKE--GTVFAEENGLITLRGTLKHNGMLFEGQYKVIPDGGTMTASND 301
Query: 240 K-----KLKVEGSDWAVLLLVASSSFDGPFINPSDSK---KDPTSESMSALQSIRNLSYS 291
+ ++ V G++ A +++ +++ +N D +DP + + + + L +
Sbjct: 302 ENNDHGQITVSGANSAYIIIALGTNY----VNDYDKDYVGEDPHDDVTARIANAEALGFD 357
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL 349
+LY+RH DY LF R ++ L+ + P D TD +E + R + +
Sbjct: 358 ELYSRHKADYTALFDRATLSLNGATFPADKTTDQLLKE----YKAGSRSQYLE------- 406
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
+L FQFGRYLLI++SR T NLQG+WN+ +P+W S H NINL+MNYW ++ NLS
Sbjct: 407 -QLYFQFGRYLLIAASRGDTLPTNLQGVWNDSETPSWQSDYHTNINLQMNYWPAMETNLS 465
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNY--------LASGWVIHHKTDIWAKSSADRGKVVWA 461
E PL +++ L G T Q + SGW+++ + +
Sbjct: 466 ETAIPLVEYIDSLRKPGRVTFQKTWGIEPAEGDEESGWIVNCSNGPMGFTGNINSNA--S 523
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETN 517
G A++ +L+++Y +T D+D+L YP+L+ + + L E L
Sbjct: 524 FTATGAAFINQNLFDYYQFTQDKDYLRSTIYPILKESSKTYMQILEPGRTEADKDKLYMV 583
Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
PS S E G +Y D +I + F+ AA+ L + D E + + +P
Sbjct: 584 PSYSSEQ------GPWTVGAY---FDQQLIYQCFNDTALAADELGIDSDFAAE-LRELMP 633
Query: 578 RLRPTKIAEDGSIMEWVQR 596
+L P +I + G I EW Q
Sbjct: 634 KLDPIQIGDSGQIKEWQQE 652
>gi|402084812|gb|EJT79830.1| hypothetical protein GGTG_04913 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 819
Score = 187 bits (475), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 171/602 (28%), Positives = 272/602 (45%), Gaps = 71/602 (11%)
Query: 30 IGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDY--TNPDAPKALSDVRSLVDSGQY 84
+GNGRLGAM +G +E L N D+LW+G P DY NP A KA D + +
Sbjct: 50 LGNGRLGAMPFGPPGAERLVFNVDSLWSGGPFQSADYRGGNPVASKA--DALPAIRDQIW 107
Query: 85 AEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSV 141
T L G A+ Y++LG+ + D + + A T YRR LDL T +
Sbjct: 108 KNGTGDLSPLLGSSANYGSYRVLGNFTV--DIAGVADAPYTDYRRSLDLTTGVHTTTFKT 165
Query: 142 GNVEFTREHFSSNPDQVIV--TKISGSESGSL-SFNVSLD-SLLDNHSYVNGNNQIIMEG 197
GN F+ + PDQV V ++G +L +V D +L+ ++
Sbjct: 166 GNSSFSTWVYCGFPDQVCVYTVAVTGDRPAALPDVSVRFDNALVPAETFTRSCGDAFTRV 225
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEI---------KISDDRGTISALEDKKLKV---E 245
R + PP+ G+++ A+ + S T +D L + E
Sbjct: 226 RGVTQVGPPE--------GLRYDAMARVVSSGGGGGGGGSAASTTTRCGDDGTLVISTPE 277
Query: 246 GSDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
G +++ A + FD N + DP + + + ++L HLDDY
Sbjct: 278 GQRSVSVVIGAGTDFDQTKGNAASGYSFRGDDPAPLVEATTAAAAAKTQAELLKAHLDDY 337
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE---DPSLVELLFQFGR 358
L QL D + V + + + S++ D+ DP L LF + R
Sbjct: 338 AALMG--GFQL---------DIADAKGSAAVETRKLIASYRADDVTGDPYLEAALFDYSR 386
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP-LFD 417
+L +SSSR + NL G W E+L P W + H NINL+MNYW + L P L+D
Sbjct: 387 HLAVSSSRANSLPTNLAGRWTEELEPAWSADHHANINLQMNYWVNDQTGLGPATTPALWD 446
Query: 418 FLTY-LSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
++ + G++TA++ Y A +GWV+H++ +++ SA + + WA +P AW+ H+W
Sbjct: 447 YMELNWAPRGAETARLLYGADAGWVVHNEMNVFG-FSAMKEEASWANYPAANAWMMQHVW 505
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGK 532
+ + Y +D + ++ YPL++G A F L L E +DG L NP SPEH P
Sbjct: 506 DRWEYGLDAAWFRRQGYPLIKGTAQFWLSQLQEDKWFNDGSLVVNPCNSPEH---GPT-T 561
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIM 591
C + I R +F+A ++ AE + + A + V +L RL + ++ G +
Sbjct: 562 FGCTHFHQE----IHRTLFTA-LAGAEAGGETDAAFLGSVRAALARLDKGVHRSDFGGLK 616
Query: 592 EW 593
EW
Sbjct: 617 EW 618
>gi|393247026|gb|EJD54534.1| glycoside hydrolase family 95 protein [Auricularia delicata
TFB-10046 SS5]
Length = 861
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 158/602 (26%), Positives = 262/602 (43%), Gaps = 91/602 (15%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKALSDVRSLVDSGQYAE 86
+P+GNG +G M + + LN ++LWTG P N + L+ V + V E
Sbjct: 103 LPVGNGYMGMMQSSRPDFDDVVLNLESLWTGGPYNSANNYNGGNPLTAVNASVR-----E 157
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE---------------TYRRELDLN 131
A++ G P D+ D SH Y R LD N
Sbjct: 158 NIRATIWANGSP--------DLTPLVDGSHYGSLSSPGSLHISRSIGNDVTGYERALDFN 209
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL-DNHSYVNGN 190
T + G+ + R +F S PDQV V G+ + + + SLD+L +++ V
Sbjct: 210 DGTISATWKEGSNSYLRTYFCSFPDQVCVVNTEGTGNDTAIY--SLDTLRPRDYASVACL 267
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE-IKISDDRGTISALEDKKLKVEGSDW 249
++ + R + G+ + ++ I S D T S + L G+
Sbjct: 268 DKSTLAYR-----------GLAESSGMTYEILVRLISSSPDSVTCSGAGNATLTGSGARQ 316
Query: 250 AVLLLVASS------------SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
VL+ A++ SF GP DP + ++++L SY L +RH
Sbjct: 317 MVLITGATNYNIDAGTRAHNFSFAGP---------DPHASALNSLSKASRSSYEALLSRH 367
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE-LLFQF 356
+DDY LFH + L + P D+V P+ + V + T +E LLF
Sbjct: 368 IDDYSALFHGFELDLGQKP-DVVK-----------PTDQLVAEYVTGTGNVYLEWLLFNL 415
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GR+++I+ +R G + LQ +W L W H NINL+MNYW + NL PL+
Sbjct: 416 GRFMMITGAR-GVLPSGLQSVWTTGLEAPWGGDYHANINLQMNYWGAEETNLGAVTGPLW 474
Query: 417 DFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
+++ + GS+TAQ+ Y + G+V+H++ +I+ + G WA +P W+ H+W
Sbjct: 475 NYMRKTWVPRGSETAQLVYGSRGFVVHNEMNIFGHTGMKLGDPQWADYPAAATWMMLHVW 534
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGK 532
+H+++T D ++ + + LL+ A F LD L E DG L P SPE+ + P
Sbjct: 535 DHFDFTGDLNWFRSQGWSLLKAQAEFWLDNLFEDSASKDGTLVAVPCNSPENGIVGP--- 591
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIM 591
+Y +I E+F I ++ + + ++++ L +L R +I G +
Sbjct: 592 ----TYGCAHFQQLIWELFHNIQKGFKLSGDADQSFLKEIEAKLSKLDRGVRIGSWGQMQ 647
Query: 592 EW 593
EW
Sbjct: 648 EW 649
>gi|418200759|ref|ZP_12837202.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47976]
gi|353864300|gb|EHE44218.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47976]
Length = 477
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 142/499 (28%), Positives = 232/499 (46%), Gaps = 76/499 (15%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M+GR ND ++F++ L
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKD---------ND----LRFASYL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ G I D+ +++ G+ +A L L A + F + K D + + +
Sbjct: 239 AWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVD 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ + Y+ L +RH++DYQ LF RV + L E N+D + + +K+++
Sbjct: 295 TAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW
Sbjct: 342 QEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
+ NL E P+ +++ L + G + A V Y +GW++H + W
Sbjct: 402 PAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAP 460
Query: 452 SADRGKVVWALWPMGGAWL 470
D W P AW+
Sbjct: 461 GWD---YYWGWSPAANAWM 476
>gi|336378685|gb|EGO19842.1| glycoside hydrolase family 95 protein [Serpula lacrymans var.
lacrymans S7.9]
Length = 864
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 162/581 (27%), Positives = 267/581 (45%), Gaps = 76/581 (13%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDAPKALSDVRSLVD 80
+PIGNG L AM+ GG+ E +LN ++LW G P G P ++ +
Sbjct: 79 LPIGNGYLAAMIPGGIFQEVTQLNIESLWQGGPLQDPSYNGGNNLPSQQAQMAQDMQSIR 138
Query: 81 SGQYA--EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVK 138
+A T +++ P Y + Y R LDL+ AR
Sbjct: 139 QSIFASPNGTINNIEEICTPPGDYGSYSGAGYFISTLNNTGTTSNYGRWLDLDEGVARTT 198
Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSL-----SFNVSLDSLLDNHSYVNGNNQI 193
+S G+ F+RE F S+P Q V ++ S SL +F+VS ++ L + +N
Sbjct: 199 WSQGSSIFSREAFCSHPAQACVQYVNTSGQASLPTVTYAFSVSQETGLPAPNVTCLDNAT 258
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA-------LEDKKLKVEG 246
+ G P G+ + I ++ S+ GT+S + + V G
Sbjct: 259 L---NIRGYVTNP---------GMMYEIIGRVQASN--GTVSCNVVSGSTPTNATVSVSG 304
Query: 247 SDWAVLLLVASSSFDGPFINPSD-------SKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ A + V +++D I+ D DP S +S + S + SY++L + H+
Sbjct: 305 ASEAWITWVGGTNYD---IDAGDLAHNFTFQGVDPHSNLVSLVSSATSNSYTELLSEHIA 361
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE-LLFQFGR 358
DY L S+ L ++P D+ T P+ + V S+QT + +E +LF FGR
Sbjct: 362 DYTSLISPFSLSLGQTP-DLST-----------PTDQIVASYQTYVGNAYLEWVLFNFGR 409
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLL SS+R G ANLQG W + S +W + H NINL+MNYW + NL+ Q LFD+
Sbjct: 410 YLLTSSAR-GILPANLQGKWADGQSNSWGADYHANINLQMNYWFAEMANLNVTQS-LFDY 467
Query: 419 L-TYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSA--DRGKVVWALWPMGGAWLCTHL 474
+ + G++TA + Y ++ GWV H + +I+ + + WA +P AW+ H
Sbjct: 468 MEKTWAPRGAETALILYNISQGWVTHDEMNIFGHTGMKLEGNSAQWADYPESNAWMMIHA 527
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI---EGHDGYLETNPSTSPEHEFIAPDG 531
W+H++YT D ++ + + +PL++ ASF L+ LI +DG L T P SPE
Sbjct: 528 WDHFDYTNDVEWWKAQGWPLVKAVASFHLEKLIPDLHFNDGTLVTAPCNSPEQ------- 580
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
+++ +I ++F+A+ E + A ++ +
Sbjct: 581 --VPITFGCAHAQQLIWQLFNAVEKGYEAAGDTDTAFIQAI 619
>gi|337748035|ref|YP_004642197.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|336299224|gb|AEI42327.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
Length = 913
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 167/598 (27%), Positives = 268/598 (44%), Gaps = 67/598 (11%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
+ +A+P GNG +GA V G + SET+ L LWTG P+ L+++R L+D G
Sbjct: 120 WREALPSGNGLIGAAVHGAIGSETVLLTHAELWTGGT-KQELPEVSGTLAEIRRLMDEGA 178
Query: 84 YAEATAASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEET----YRRELDLNTATARVK 138
Y EA L G + Y+ + + L D + + YRRELDL T V+
Sbjct: 179 YREANGL---LEGRLREAGYEPVRETPLPLADLKVVRTAQAGFRRYRRELDLETGEVSVR 235
Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-------SLLDNHSYVNGNN 191
+ G + R+ F S D +IV ++ S GS+ ++ L S D SYV+ +
Sbjct: 236 WEEGAAAYERKLFVSRSDDLIVYELE-SRGGSVDVDLLLQLHEKGTASRPDIPSYVSESL 294
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEG 246
QI A NDD G F A+L ++ +D+G +L V G
Sbjct: 295 QI-----TAADGFLRYAARNDD--GRDFGAVLRAVPAGGRLGEDQG--------RLSVTG 339
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D VL+LV F G D ++ T + A +YS+L RH + L
Sbjct: 340 AD-KVLILV--KVFAG-----GDRSQEWTR--LEAELREAAWTYSELLDRHTALHGPLMR 389
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
+ L + ++ T ++E + ++++ P+L EL++ +GRYL IS +R
Sbjct: 390 SADLHLGGAGEEAAC-TYTDELLQ--------EAYEGGLSPALAELMWAYGRYLFISGTR 440
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG L G+W D W S N N++M YW + LSE P+ D+
Sbjct: 441 PGGLPFGLYGLWCGDYKAVW-SHFMANENVQMMYWHAAAGGLSELILPMLDYYESRLEIF 499
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y G I T V+ W WL H +E+Y +T D +F
Sbjct: 500 RDNARKLYDCRGIFIPAGTTPGMAEPFQTVPVI-MHWTGAAGWLARHFYEYYRFTGDLEF 558
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH---EFIAPDGKLACVSYS---- 539
L +RA P ++ A F D+L+ G DG L + PS SPE+ +I+ +G ++++
Sbjct: 559 LRRRALPFMKEAALFYEDFLVAGEDGRLVSYPSVSPENTPGNYISEEGVFGAMAHAMPTA 618
Query: 540 --STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWVQ 595
+ +D AI++E+ + ++ A E+ + E V + L R+ + DG++ EW+
Sbjct: 619 VNALLDFAILKELLTGLLEAVELTGEGEPEAVRRWSVLLERIPAYEANGDGAVREWLH 676
>gi|307718131|ref|YP_003873663.1| alpha-L-fucosidase [Spirochaeta thermophila DSM 6192]
gi|306531856|gb|ADN01390.1| alpha-L-fucosidase 2 precursor [Spirochaeta thermophila DSM 6192]
Length = 758
Score = 186 bits (471), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 168/586 (28%), Positives = 253/586 (43%), Gaps = 74/586 (12%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + D P+GNGRL A+V GG+ E + LN + LW G D L VR
Sbjct: 15 PAGVWRDGYPVGNGRLAALVVGGLGEERIHLNHEWLWRGRYRDRVAEGRAHLLGWVREAF 74
Query: 80 DSGQYAEATAASVKLF-------GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
G + E T + + F G P V YQ G + L ++ E Y RELDL
Sbjct: 75 FRGDWEEGTRRANEAFGGGGGVSGRPCRVGAYQPAGTLVLWWEGMD----GEGYERELDL 130
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
RV+ E P + ++SG G + + + G
Sbjct: 131 EEGVVRVRRGRSVEEVM-AVMGGGP---VGVRVSGWGRGWVGLEREVQEGVAVRVGAKGG 186
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ +EGR ++ G + A++ RG + E ++ VEG +
Sbjct: 187 -MVRLEGRF------------EEGIGWEVRAVV-------RGGVCRGEGGRVWVEGEEVV 226
Query: 251 VLLLVASSSFDGPFIN--PSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
V ++V G PS + E A++ RH++ Y LF RV
Sbjct: 227 VWVVVDVWEEVGGSRRRLPSYGPPEVPGEGWEAVRR-----------RHVEAYGGLFGRV 275
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ + EE + +P+ R + D DP L LLF +GRYLLI+SS PG
Sbjct: 276 RLVVE-----------GEEPL--LPTGRR----REDPDPLLPALLFDYGRYLLIASSAPG 318
Query: 369 TQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
+ ANLQG WN L P WD+ H++INL+MNYW + L EC PL ++ + +
Sbjct: 319 CDLPANLQGKWNPLLEPPWDADYHMDINLQMNYWLAEGAGLGECVRPLVRYVLRMVPSAR 378
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ A+ + G +D WA+++ + W +W AW+ HL Y Y D FL
Sbjct: 379 EAARRLFGCRGIWFPLTSDAWARATPE--AYGWDVWVGAAAWMAQHLVWRYLYGGDEGFL 436
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ AYP L+ A F D+L+E +G L+ PS SPEH + +G + SS +D+ ++
Sbjct: 437 RETAYPFLKEVALFFEDFLVEDGEGVLQVVPSQSPEHRWEGLEGFPVGLCVSSAVDVQLV 496
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
R V + L +E ++ L RLR + DG ++EW
Sbjct: 497 RWVLRMAVELGGRL-GDELGRWREMEGRLARLR---VGGDGVLLEW 538
>gi|395326583|gb|EJF58991.1| hypothetical protein DICSQDRAFT_65986 [Dichomitus squalens LYAD-421
SS1]
Length = 831
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 171/619 (27%), Positives = 272/619 (43%), Gaps = 81/619 (13%)
Query: 15 ITFNGPAKHFT----DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAP 69
I + P + F D +P+GNG L AMV G E +LN ++LW+G P D T
Sbjct: 33 IWYTQPGRDFDFWADDWLPVGNGYLAAMVNGQAAQEVTQLNIESLWSGGPFQDPTYNGGN 92
Query: 70 KALSDVRSLVDSGQYAE--------ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE 121
KA SD ++ Q T S G P + +G L L
Sbjct: 93 KAASDQATVAQEMQVIRQAIFQSPNGTIDSASTSGGPLSIGSYVGAGYL-LATLDLNGGF 151
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL---SFNVSLD 178
+ R LDL+ A R ++ GN F RE F S+P Q V +I+ +++ +L ++ S+D
Sbjct: 152 SDFVRWLDLDAAVQRTSWTQGNASFFRETFCSHPTQACVQRINTTDASTLPALTYAYSVD 211
Query: 179 S----LLDNHS-YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
+ L+ S + N QI PG + F + + S +
Sbjct: 212 AESGILIPTVSCFDNSTLQITGTASSPG---------------MAFEILARVSASGTNTS 256
Query: 234 I----SALEDKKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSAL---Q 283
I + + + V G+ A + V + +D G ++ K +++ AL
Sbjct: 257 IVCAPTGTNNATISVSGASDAFITWVGGTDYDADAGDAVHSFSFKGADPHDALVALIEPA 316
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ +Y H+ DY L + + L ++P D T P+ + ++QT
Sbjct: 317 TASATTYDGALAAHIADYAGLITKFELDLDQTP-DFAT-----------PTDQLHDAYQT 364
Query: 344 D-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
D +P L LLF FGRYLL S+R GT ANLQG W +D S W + H NIN++MNYW
Sbjct: 365 DVGNPYLEWLLFNFGRYLLAGSAR-GTLPANLQGKWAKDDSNPWSADYHSNINIQMNYWF 423
Query: 403 SLPCNLSECQEPLFDFL-TYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRG--KV 458
+ + + PLFD+ + G+ TAQ Y ++ GWV H+ +I+ + G
Sbjct: 424 AELTGM-DVVTPLFDYFEKTWAPRGALTAQYLYNISEGWVTHN--EIFGHTGMKGGGNTA 480
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI---EGHDGYLE 515
WA +P AW+ H+W+H+++T D D+ + + +PLL+ A F L L+ +D L
Sbjct: 481 SWADYPESNAWMMLHVWDHFDFTQDSDWFKAQGWPLLKSVAQFHLQKLVPDERFNDSTLV 540
Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
NP SPE I L C +I ++F+AI + + A +++V
Sbjct: 541 VNPCNSPEQVPI----TLGCAHAQQ-----LIWQLFNAIDKGFAISGDTDTAFLDEVRAK 591
Query: 576 LPRL-RPTKIAEDGSIMEW 593
++ + I G + EW
Sbjct: 592 REQMDKGIHIGSWGQLQEW 610
>gi|421234544|ref|ZP_15691162.1| fibronectin type III domain protein [Streptococcus pneumoniae
2061617]
gi|395600398|gb|EJG60555.1| fibronectin type III domain protein [Streptococcus pneumoniae
2061617]
Length = 477
Score = 184 bits (466), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 140/484 (28%), Positives = 226/484 (46%), Gaps = 73/484 (15%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A A L G Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWL 470
AW+
Sbjct: 473 NAWM 476
>gi|421249885|ref|ZP_15706342.1| fibronectin type III domain protein [Streptococcus pneumoniae
2082239]
gi|395613579|gb|EJG73607.1| fibronectin type III domain protein [Streptococcus pneumoniae
2082239]
Length = 456
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 138/484 (28%), Positives = 226/484 (46%), Gaps = 73/484 (15%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 186 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 229
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451
Query: 467 GAWL 470
AW+
Sbjct: 452 NAWM 455
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.315 0.132 0.395
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,012,891,469
Number of Sequences: 23463169
Number of extensions: 430173124
Number of successful extensions: 1042170
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1262
Number of HSP's successfully gapped in prelim test: 112
Number of HSP's that attempted gapping in prelim test: 1034349
Number of HSP's gapped (non-prelim): 2012
length of query: 613
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 464
effective length of database: 8,863,183,186
effective search space: 4112516998304
effective search space used: 4112516998304
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 80 (35.4 bits)