BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 003571
(810 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|224103687|ref|XP_002313154.1| predicted protein [Populus trichocarpa]
gi|222849562|gb|EEE87109.1| predicted protein [Populus trichocarpa]
Length = 803
Score = 1253 bits (3243), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 591/802 (73%), Positives = 692/802 (86%), Gaps = 11/802 (1%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M + ++ + LKITFNGPAKH+TDAIPIGNGRLGAM+WGGV ETL+LNEDTLWTG P
Sbjct: 1 MDDDDNGENSRSLKITFNGPAKHWTDAIPIGNGRLGAMIWGGVSLETLQLNEDTLWTGTP 60
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
G+YTNP AP+ALS VR LVD+GQYA+AT A+ KL P+DVYQLLGDI+LEFD+SHLKY
Sbjct: 61 GNYTNPHAPEALSVVRKLVDNGQYADATTAAEKLSHDPSDVYQLLGDIKLEFDNSHLKYV 120
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
E++Y RELDL+TATARVKYSVG+VE+TRE+F+SNP+QVI TKISGS+SGS+SF V LDS
Sbjct: 121 EKSYHRELDLDTATARVKYSVGDVEYTREYFASNPNQVIATKISGSKSGSVSFTVYLDSK 180
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ ++SYV G NQIIMEG CPGKRIPPK NA+D+PKGIQF+AIL ++IS+ RG + L+ +
Sbjct: 181 MHHYSYVKGENQIIMEGSCPGKRIPPKLNADDNPKGIQFTAILNLQISNSRGVVHVLDGR 240
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KLKVEGSDWA+LLLV+SSSFDGPF P DSKKDPTS+S+SAL+SI NLSY+DLY HLDD
Sbjct: 241 KLKVEGSDWAILLLVSSSSFDGPFTKPIDSKKDPTSDSLSALKSINNLSYTDLYAHHLDD 300
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQ LFHRVS+QLS+S K SE+N TV +AERVKSF+TDEDPSLVELLFQ+GRYL
Sbjct: 301 YQSLFHRVSLQLSKSSK-----RRSEDN--TVSTAERVKSFKTDEDPSLVELLFQYGRYL 353
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LIS SRPGTQVANLQGIWN+D+ P WD A H+NINL+MNYW +LPCNL ECQ+PLF++++
Sbjct: 354 LISCSRPGTQVANLQGIWNKDIEPPWDGAQHLNINLQMNYWPALPCNLKECQDPLFEYIS 413
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LSINGSKTA+VNY A GWV H +DIWAK+S DRG+ VWALWPMGGAWLCTHLWEHY Y
Sbjct: 414 SLSINGSKTAKVNYDAKGWVAHQVSDIWAKTSPDRGQAVWALWPMGGAWLCTHLWEHYTY 473
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
TMD+DFL+ +AYPLLEGC+ FLLDWLIEG GYLETNPSTSPEH FI PDGK A VSYSS
Sbjct: 474 TMDKDFLKNKAYPLLEGCSLFLLDWLIEGRGGYLETNPSTSPEHMFIDPDGKPASVSYSS 533
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMDM+II+EVFSAIISAAE+L KNED +V+KV ++ PRL PT+IA DGSIMEWA DF+DP
Sbjct: 534 TMDMSIIKEVFSAIISAAEILGKNEDEIVQKVREAQPRLLPTRIARDGSIMEWAVDFEDP 593
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
E+HHRH+SHLFGLFPGHTIT+EK PDLCKAA+ TL KRG+EGPGWS WKTALWARLH+
Sbjct: 594 EIHHRHVSHLFGLFPGHTITVEKTPDLCKAADYTLYKRGDEGPGWSTIWKTALWARLHNS 653
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
EHAYRMVK LF+LVDP+HE ++EGGLY NLF +HPPFQIDANFGF+AA+AEMLVQST+ D
Sbjct: 654 EHAYRMVKHLFDLVDPDHESNYEGGLYGNLFTSHPPFQIDANFGFSAAIAEMLVQSTVKD 713
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
LYLLPALP KW++GCVKGLKARGG TV++CWK+GDLHEVG++S +H S K LHYR
Sbjct: 714 LYLLPALPRYKWANGCVKGLKARGGVTVNVCWKEGDLHEVGLWS----KEHHSIKRLHYR 769
Query: 781 GTSVKVNLSAGKIYTFNRQLKC 802
GT V NLS G++YTFNRQL+C
Sbjct: 770 GTIVNANLSPGRVYTFNRQLRC 791
>gi|224103693|ref|XP_002313157.1| predicted protein [Populus trichocarpa]
gi|222849565|gb|EEE87112.1| predicted protein [Populus trichocarpa]
Length = 836
Score = 1248 bits (3230), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 597/818 (72%), Positives = 691/818 (84%), Gaps = 19/818 (2%)
Query: 1 MMNAEST--STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
M N ST + PLKIT GPAK++TDAIPIGNGRLGAMVWGGV SE ++LNEDTLWTG
Sbjct: 17 MWNPTSTYLEDSKPLKITSTGPAKYWTDAIPIGNGRLGAMVWGGVSSELIQLNEDTLWTG 76
Query: 59 VPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLK 118
P DYTNPDAP+AL++VR+LVDSG++AEA+ A+ KL G A+VYQLLGDI+LEFD +L
Sbjct: 77 TPIDYTNPDAPEALAEVRNLVDSGEFAEASDAAAKLSGTNANVYQLLGDIKLEFD-GYLM 135
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
AEETY RELDL+TATARVKYSVG+VEFTREHF+S PDQVIVTKI+GS+ GS+SF VSLD
Sbjct: 136 CAEETYYRELDLDTATARVKYSVGDVEFTREHFASYPDQVIVTKIAGSKEGSVSFTVSLD 195
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S LD+H Y+ +QI+MEGRCPGKRIPPK ANDDPKGI F+A+L ++ISD G +S L+
Sbjct: 196 SKLDHHCYITDESQIVMEGRCPGKRIPPKVKANDDPKGILFAAVLGLQISDGAGLMSVLD 255
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D +LKVEG++W VL +VASSSF+GPF PS+S+KDP S S+SAL+SI+N SYS+LY+RHL
Sbjct: 256 DGRLKVEGANWVVLHMVASSSFEGPFTKPSESEKDPASVSLSALKSIKNQSYSELYSRHL 315
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDT-------------CSEENIDTVPSAERVKSFQTDE 345
DDYQ LFHRVS+QL + + D C E N D VP+ +R++SFQ+DE
Sbjct: 316 DDYQNLFHRVSLQLCKGSDRNIGDRSLEIKNLMPSGKRCVEGNKDVVPTVDRIRSFQSDE 375
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN+DL P WDSAPH+NINLEMNYW SLP
Sbjct: 376 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNKDLEPKWDSAPHLNINLEMNYWPSLP 435
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
CNLSECQEPLF+F+ LSING KTAQVNY SGWV+HHK+DIWAK SAD+G+VVWA+WPM
Sbjct: 436 CNLSECQEPLFEFIKSLSINGCKTAQVNYKTSGWVVHHKSDIWAKPSADKGEVVWAIWPM 495
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
GGAWLCTHLWEHY+YTMD DFL +AYPLLEGCASFLLDWLIEGH GYLETNPSTSPEH
Sbjct: 496 GGAWLCTHLWEHYSYTMDEDFLRNKAYPLLEGCASFLLDWLIEGHGGYLETNPSTSPEHM 555
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
FIAPDGK A VSYSSTMDMA+I+EVFSAIISA+EVL +NEDA V+KV K+ PRL PTKI
Sbjct: 556 FIAPDGKSASVSYSSTMDMALIKEVFSAIISASEVLGRNEDAFVQKVHKAQPRLYPTKID 615
Query: 586 EDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGW 645
E+GSIMEWAQDFKDP+VHHRHLSHLFGLFPGH+ITI+KNP+LC+AAE +L KRGE+GPGW
Sbjct: 616 EEGSIMEWAQDFKDPDVHHRHLSHLFGLFPGHSITIDKNPELCEAAENSLYKRGEDGPGW 675
Query: 646 SITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
S TWK ALWA LH+ EH+YRMVK+L LVDP+HE FEGGLYSNLFAAHPPFQIDANFGF
Sbjct: 676 STTWKIALWAHLHNSEHSYRMVKQLIKLVDPDHEVAFEGGLYSNLFAAHPPFQIDANFGF 735
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
TA V+EMLVQS++ DLYLLPALP DKW++GCVKGLKARGG TVSICWK+GDLHEVG+
Sbjct: 736 TAGVSEMLVQSSIKDLYLLPALPRDKWANGCVKGLKARGGLTVSICWKEGDLHEVGV--- 792
Query: 766 YSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCT 803
+ + S + +HY GT+V VNLS KIYTFN QL+C
Sbjct: 793 WLKDGSSSLQRIHYGGTTVTVNLSCRKIYTFNTQLECV 830
>gi|224056204|ref|XP_002298754.1| predicted protein [Populus trichocarpa]
gi|222846012|gb|EEE83559.1| predicted protein [Populus trichocarpa]
Length = 808
Score = 1244 bits (3218), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 587/801 (73%), Positives = 683/801 (85%), Gaps = 5/801 (0%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M + ++ PL++TF+GPAKH+TDAIPIGNGRLGAM+WGGV ETL+LNEDTLWTG+PG
Sbjct: 1 MEDNNGESSKPLRVTFSGPAKHWTDAIPIGNGRLGAMIWGGVALETLQLNEDTLWTGIPG 60
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE 121
DYTNP+AP AL +VR LVD+GQYAEAT A+ KL G+ +DVYQLLGDI+LEFDDSHLKY E
Sbjct: 61 DYTNPNAPAALLEVRKLVDNGQYAEATTAAEKLSGNQSDVYQLLGDIKLEFDDSHLKYDE 120
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+TY+RELDL+TATARVKYSV ++E+TREHF+SNP+QVIVTKISGS+ GS+SF VSLDS +
Sbjct: 121 KTYKRELDLDTATARVKYSVADIEYTREHFASNPNQVIVTKISGSKPGSVSFTVSLDSKM 180
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+HSYV G NQII+EG CPG R K N ND P+GIQF+AIL++++S+ RG + ED K
Sbjct: 181 SHHSYVKGENQIIIEGSCPGNRYAQKLNENDSPQGIQFTAILDLQVSEARGLVRVSEDSK 240
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+VEGSDWAVLLLV+SSSFDGPF P DSKK+PTS+S+S L+SI NLSY DLY HLDDY
Sbjct: 241 LRVEGSDWAVLLLVSSSSFDGPFTKPIDSKKNPTSDSLSVLKSIGNLSYVDLYAHHLDDY 300
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q LFHRVS+QLS+S K+ E+ DTV +AERVK+FQTDEDPSLVELLFQ+GRYLL
Sbjct: 301 QSLFHRVSLQLSKSSKNSDISLNGSED-DTVSTAERVKAFQTDEDPSLVELLFQYGRYLL 359
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
IS SRPGTQVANLQGIWN+DL+P WD A H+NINL+MNYW SL CNL ECQEPLF++++
Sbjct: 360 ISCSRPGTQVANLQGIWNKDLTPPWDGAQHLNINLQMNYWPSLSCNLKECQEPLFEYISS 419
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
LSI+GS+TA+VNY A GWV H +D+WAK+S D G+ +WALWPMGGAWLCTHLWEHY Y
Sbjct: 420 LSISGSRTAKVNYEAKGWVAHQVSDLWAKTSPDAGQALWALWPMGGAWLCTHLWEHYTYA 479
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+DFL +AYPLLEGC SFLLDWLIEG GYLETNPSTSPEH FIAPDGK A VSYSST
Sbjct: 480 KDKDFLRDKAYPLLEGCTSFLLDWLIEGPGGYLETNPSTSPEHMFIAPDGKPASVSYSST 539
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MDM+II+EVFSAI+SAA++L +NED LV+KVL++LPRL PTKIA DGSIMEWAQDF+DPE
Sbjct: 540 MDMSIIKEVFSAIVSAAKILGRNEDELVQKVLEALPRLLPTKIARDGSIMEWAQDFQDPE 599
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
VHHRH+SHLFGLFPGHTIT+EK PDLCKAA TL KRGE+GPGWS WK ALWARLH+ E
Sbjct: 600 VHHRHVSHLFGLFPGHTITVEKTPDLCKAAGNTLYKRGEDGPGWSTMWKAALWARLHNSE 659
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAYRMVK LF LVDPE+E ++EGGLYSNLF AHPPFQIDANFGF AA+AEMLVQST DL
Sbjct: 660 HAYRMVKHLFVLVDPENEGNYEGGLYSNLFTAHPPFQIDANFGFPAAIAEMLVQSTAEDL 719
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRG 781
YLLPALP DKW++GCVKGLKARG TV+I WK+GDL EVG++SN N SFK LHYRG
Sbjct: 720 YLLPALPRDKWANGCVKGLKARGKLTVNIYWKEGDLREVGLWSNEQN----SFKRLHYRG 775
Query: 782 TSVKVNLSAGKIYTFNRQLKC 802
T+VK NLS G++YTFNR LKC
Sbjct: 776 TTVKANLSPGRVYTFNRTLKC 796
>gi|255573091|ref|XP_002527475.1| conserved hypothetical protein [Ricinus communis]
gi|223533115|gb|EEF34873.1| conserved hypothetical protein [Ricinus communis]
Length = 840
Score = 1231 bits (3185), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 586/772 (75%), Positives = 663/772 (85%), Gaps = 15/772 (1%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S PLK+TFNGPAKH+TD+IPIGNGR+GAM+ GG+ SE ++LNEDTLWTGVPG+YTNP+
Sbjct: 20 SYNKPLKVTFNGPAKHWTDSIPIGNGRIGAMISGGMQSEIIQLNEDTLWTGVPGNYTNPN 79
Query: 68 APKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
A +ALS+VR LVD G YAEATAASVK FG+PADVYQLLGD++LEFDDSHL YA+ETY RE
Sbjct: 80 ALEALSEVRKLVDDGLYAEATAASVKFFGNPADVYQLLGDVKLEFDDSHLTYADETYYRE 139
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL+TATARV+YSVG+V+FT+E+F+SNPDQV V KISGS+SGSLSF VSLDS LD+H YV
Sbjct: 140 LDLDTATARVQYSVGDVKFTKEYFASNPDQVAVIKISGSKSGSLSFTVSLDSKLDHHCYV 199
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N NQIIMEG CP KRIPPK +AN++PKGI+FSA+L++ +SD G I L++KKLKVEGS
Sbjct: 200 NVENQIIMEGSCPEKRIPPKMSANENPKGIKFSAVLDLHVSDGVGVIHVLDNKKLKVEGS 259
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
DW VLLL ASSSF+ P PSDSKKDPTSES+ AL++I NLSYSDLY RHL DYQKLFHR
Sbjct: 260 DWGVLLLAASSSFESPLTKPSDSKKDPTSESLRALKAITNLSYSDLYARHLHDYQKLFHR 319
Query: 308 VSIQLSRSPKDIVTDTCSEENI---------------DTVPSAERVKSFQTDEDPSLVEL 352
VS QL +S IV D N D VP+ ER+KSFQ+DEDPSLVEL
Sbjct: 320 VSFQLWKSSNRIVGDESQLTNNLIPSANALYVKGIKDDAVPTVERIKSFQSDEDPSLVEL 379
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
LFQFGRYLLIS SRPGTQVANLQG+WN+DL PTWDSAPH+NINLEMNYW SLPCNL+ECQ
Sbjct: 380 LFQFGRYLLISCSRPGTQVANLQGVWNKDLEPTWDSAPHLNINLEMNYWLSLPCNLNECQ 439
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
EPLFDF+ LS+NGSKTAQVNY ASGWVIHHK+DIWAKSSADRG VWALWP+GGAWLCT
Sbjct: 440 EPLFDFIKSLSVNGSKTAQVNYGASGWVIHHKSDIWAKSSADRGDAVWALWPIGGAWLCT 499
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
HLWEHYNYTMD++FLE AY LLEGC SFLLDWL+EG +GYLETNPSTSPEH FI PDGK
Sbjct: 500 HLWEHYNYTMDKEFLENEAYFLLEGCVSFLLDWLVEGSEGYLETNPSTSPEHMFITPDGK 559
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
ACVSYSSTMDMAIIREVFS+ +SA+EVL +N+D LV+ V +LPRLRPTKIAEDGSIME
Sbjct: 560 PACVSYSSTMDMAIIREVFSSFVSASEVLGRNKDVLVQNVHTALPRLRPTKIAEDGSIME 619
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W +DFKDPEVHHRHLS LFGLFPGHTITI+++P+LCKAAE TL KRGE GPGWS WK A
Sbjct: 620 WVRDFKDPEVHHRHLSPLFGLFPGHTITIDQDPELCKAAENTLYKRGENGPGWSTAWKIA 679
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL++ +HAY MVK L LVDP+HE FEGGLYSNLFAAHPPFQIDANFGFTAAVAEM
Sbjct: 680 LWARLYNSKHAYNMVKHLIKLVDPDHEVAFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 739
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
LVQS L DLYLLPALP DKW++GCVKGLKARGG TVSICWK+GDLHEVG+++
Sbjct: 740 LVQSRLEDLYLLPALPRDKWANGCVKGLKARGGLTVSICWKEGDLHEVGLWA 791
>gi|359475494|ref|XP_002270199.2| PREDICTED: alpha-L-fucosidase 2-like [Vitis vinifera]
Length = 817
Score = 1217 bits (3148), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 585/795 (73%), Positives = 677/795 (85%), Gaps = 13/795 (1%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F GPAKH+TDA+PIGNGRLGAMVWGGV SETL+LNE TLWTG PG+YTNPDAPKA
Sbjct: 34 PLKVRFFGPAKHWTDALPIGNGRLGAMVWGGVASETLQLNEGTLWTGTPGNYTNPDAPKA 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS+VR LVD+G Y AT A+VKL G+P+DVYQLLGDI LEF+DSHL YAEETY RELDL+
Sbjct: 94 LSEVRKLVDNGDYVAATEAAVKLSGNPSDVYQLLGDINLEFEDSHLAYAEETYSRELDLD 153
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT +KYSVG+VE+TREHF+S PDQVIVTKISGS+ GS+SF VSLDS +HS +G +
Sbjct: 154 TATVTIKYSVGDVEYTREHFASYPDQVIVTKISGSKPGSVSFTVSLDSKSHHHSNSSGKS 213
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
QIIMEG CPGKRIPPK ND+P+GI FSA+L+++ISD RG I+ L+DKKLKVEGSDWAV
Sbjct: 214 QIIMEGSCPGKRIPPKVYENDNPQGILFSAVLDLQISDGRGVINVLDDKKLKVEGSDWAV 273
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
L LVASSSFDGPF P DSK +PTSE++S L+SI N SYSDLY RHL+DYQ LFHRVS+Q
Sbjct: 274 LYLVASSSFDGPFTKPIDSKINPTSEALSTLKSIGNFSYSDLYARHLNDYQNLFHRVSLQ 333
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS+S K + ++ V +A RVKSF TDEDPSLVELLFQ+GRYLLIS SRPG+Q
Sbjct: 334 LSKSSKSV---------MNRVSTAARVKSFGTDEDPSLVELLFQYGRYLLISCSRPGSQP 384
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN+D+ P WD APH+NINL+MNYW SLPCNLSECQEPLFD+++ LSINGSKTA+
Sbjct: 385 ANLQGIWNKDIEPAWDGAPHLNINLQMNYWPSLPCNLSECQEPLFDYMSSLSINGSKTAK 444
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
VNY ASGWV H +DIWAK+S DRG+ VWALWPMGGAWLCTHLWEHY +TMD+DFL+ +A
Sbjct: 445 VNYEASGWVTHQVSDIWAKTSPDRGQAVWALWPMGGAWLCTHLWEHYTFTMDKDFLKNKA 504
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
YPLLEGCA FLLDWLIEG GYLETNPSTSPEH FIAPDGK A VSYS+TMD+AIIREVF
Sbjct: 505 YPLLEGCARFLLDWLIEGRGGYLETNPSTSPEHMFIAPDGKPASVSYSTTMDIAIIREVF 564
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLF 611
SA++SAAEVL KNED LV+KV ++ P+L PTKIA DGSIMEWAQDF+DPEVHHRH+SHLF
Sbjct: 565 SAVVSAAEVLGKNEDELVQKVRQAQPKLPPTKIARDGSIMEWAQDFEDPEVHHRHVSHLF 624
Query: 612 GLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLF 671
GL+PGHTIT+EK PDLCKA + TL KRGE+GPGWS TWKTALWARLH+ EHAYRMVK LF
Sbjct: 625 GLYPGHTITVEKTPDLCKAVDYTLYKRGEDGPGWSTTWKTALWARLHNSEHAYRMVKHLF 684
Query: 672 NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDK 731
+LVDP E FEGGLYSNLF AHPPFQIDANFGF AAVAEM+VQST DLYLLPALP DK
Sbjct: 685 DLVDPAREADFEGGLYSNLFTAHPPFQIDANFGFCAAVAEMIVQSTSKDLYLLPALPRDK 744
Query: 732 WSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAG 791
W++GCVKGLKARGG TV++CWK+G+LH++G++S D +S + LHYRG+ V + AG
Sbjct: 745 WANGCVKGLKARGGVTVNVCWKEGELHQIGVWS----KDQNSTRRLHYRGSIVTAKMLAG 800
Query: 792 KIYTFNRQLKCTNLH 806
++YTF+RQLKC +
Sbjct: 801 RVYTFDRQLKCVKTY 815
>gi|255573093|ref|XP_002527476.1| conserved hypothetical protein [Ricinus communis]
gi|223533116|gb|EEF34874.1| conserved hypothetical protein [Ricinus communis]
Length = 849
Score = 1199 bits (3102), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 571/810 (70%), Positives = 678/810 (83%), Gaps = 19/810 (2%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLKI F+GPAKH+TDAIPIGNGRLGAMV+GGV SETL++NEDTLWTG PG+YTNP+AP+A
Sbjct: 36 PLKIVFSGPAKHWTDAIPIGNGRLGAMVFGGVASETLRINEDTLWTGTPGNYTNPNAPEA 95
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L+ VR LV +YAEAT +VKL G P+++YQ+LGDI+LEFDDSHL Y E+TY+RELDL+
Sbjct: 96 LTQVRKLVGDRKYAEATTEAVKLSGLPSEIYQVLGDIKLEFDDSHLSYDEKTYQRELDLD 155
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TATARVKYS+G+VE+TREHF+SNP+QV+VTKI+ S+ GS+SF V LDS L +HSY G N
Sbjct: 156 TATARVKYSLGDVEYTREHFASNPNQVVVTKIAASKPGSVSFTVLLDSELHHHSYTKGEN 215
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
QI +EG CPGKR PP+ A+D PKGI+F+AIL+++IS+ RG I L+D+KLKVEGSDWAV
Sbjct: 216 QIFIEGSCPGKRAPPQIYASDGPKGIEFAAILKLQISEGRGKIHVLDDRKLKVEGSDWAV 275
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
L LVASSSFDGPF PS SKKDPTS + AL ++NLSY+DLY RHLDDYQ LFHRVS++
Sbjct: 276 LSLVASSSFDGPFTMPSASKKDPTSACLHALDLVKNLSYTDLYARHLDDYQTLFHRVSLR 335
Query: 312 LSRSPKDIVTD---------------TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
LS+S K I+ + + +E DT+ +AERVKSF+TDEDPSLVELLFQ+
Sbjct: 336 LSKSSKSILGNGPLNMKKFLSFKNYLSLNESKDDTISTAERVKSFRTDEDPSLVELLFQY 395
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLIS SRPGTQVANLQGIW++D +P WD A H+NINL+MNYW +L CNL EC EPLF
Sbjct: 396 GRYLLISCSRPGTQVANLQGIWSKDNAPPWDGAQHLNINLQMNYWPALSCNLHECHEPLF 455
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
++++ LSINGS TA+VNY A+GWV H +D+WAK+S DRG+ VWALWPMGGAWLC HLWE
Sbjct: 456 EYMSSLSINGSMTAKVNYEANGWVAHQVSDLWAKTSPDRGEAVWALWPMGGAWLCIHLWE 515
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY YTMD+DFL+ +AYPLLEGCA+FLLDWLIEG GYLETNPSTSPEH FIAPDGK A V
Sbjct: 516 HYTYTMDKDFLKNKAYPLLEGCATFLLDWLIEGPGGYLETNPSTSPEHMFIAPDGKPASV 575
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S S+TMD+ II+EVFS I+SAAEVL + ED L++KV ++ PRLRP KIA DGSIMEWAQD
Sbjct: 576 SNSTTMDVEIIQEVFSEIVSAAEVLGRKEDELIQKVREAQPRLRPIKIARDGSIMEWAQD 635
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
F+DPEVHHRH+SHLFGLFPGHTIT+EK PDLCKAA+ TL KRGEEGPGWS WK ALWAR
Sbjct: 636 FEDPEVHHRHVSHLFGLFPGHTITVEKTPDLCKAADYTLYKRGEEGPGWSSMWKAALWAR 695
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
LH+ EHAYRM+K LF+LVDP+ E FEGGLYSNLF AHPPFQIDANFGF AA+AEMLVQS
Sbjct: 696 LHNSEHAYRMIKHLFDLVDPDRESDFEGGLYSNLFTAHPPFQIDANFGFPAAIAEMLVQS 755
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKT 776
TL DLYLLPALP DKW++GCVKGLKARGG TV+ICW++GDLHEVG++S H+S
Sbjct: 756 TLKDLYLLPALPRDKWANGCVKGLKARGGVTVNICWREGDLHEVGLWS----KTHNSITR 811
Query: 777 LHYRGTSVKVNLSAGKIYTFNRQLKCTNLH 806
LHYRGT V + +S+GK+YTFNR+LKC N +
Sbjct: 812 LHYRGTIVNLTISSGKVYTFNRELKCINTY 841
>gi|356574288|ref|XP_003555281.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
Length = 876
Score = 1182 bits (3057), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 554/809 (68%), Positives = 664/809 (82%), Gaps = 18/809 (2%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+TF PA H+TDAIPIGNGRLGAMVWG VPSE L+LNEDTLWTG+PGDYTN A +A
Sbjct: 65 PLKVTFAEPATHWTDAIPIGNGRLGAMVWGAVPSEALQLNEDTLWTGIPGDYTNKSAQQA 124
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L++VR LVD +++EATAA+VKL G P+DVYQLLGDI+LEF DSHL Y++E+Y RELDL+
Sbjct: 125 LAEVRKLVDDRKFSEATAAAVKLSGDPSDVYQLLGDIKLEFHDSHLNYSKESYYRELDLD 184
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TATA++KYSVG+VEFTREHF+SNPDQVIVT++S S+ GSLSF V DS + + S V+G N
Sbjct: 185 TATAKIKYSVGDVEFTREHFASNPDQVIVTRLSTSKPGSLSFTVYFDSKMHHDSRVSGQN 244
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
QII+EGRCPG RI P N+ D+P+GIQFSA+L+++IS D+G I L+DKKL+VEGSDWA+
Sbjct: 245 QIIIEGRCPGSRIRPIVNSIDNPQGIQFSAVLDMQISKDKGVIHVLDDKKLRVEGSDWAI 304
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL ASSSFDGPF P DSKKDP SES+S + S++ +SY DLY RHL DYQ LFHRVS+Q
Sbjct: 305 LLLTASSSFDGPFTKPEDSKKDPASESLSRMVSVKKISYGDLYARHLADYQNLFHRVSLQ 364
Query: 312 LSRSPKDI----VTD----TCSEENI------DTVPSAERVKSFQTDEDPSLVELLFQFG 357
LS+S K + V D S+ NI DT+P++ RVKSFQTDEDPS VELLFQ+G
Sbjct: 365 LSKSSKTVSGKSVLDRRKLVSSQTNISQMGGDDTIPTSARVKSFQTDEDPSFVELLFQYG 424
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLIS SRPGTQVANLQGIWN+D+ P WD APH+NINL+MNYW SL CNL ECQEPLFD
Sbjct: 425 RYLLISCSRPGTQVANLQGIWNKDVEPAWDGAPHLNINLQMNYWPSLACNLHECQEPLFD 484
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F++ LS+ G KTA+VNY A+GWV+H +DIW K+S DRG+ VWALWPMGGAWLCTHLWEH
Sbjct: 485 FISSLSVIGKKTAKVNYEANGWVVHQVSDIWGKTSPDRGEAVWALWPMGGAWLCTHLWEH 544
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y YTMD+ FL+ +AYPLLEGC SFLLDWLIEG G LETNPSTSPEH F APDGK A VS
Sbjct: 545 YTYTMDKVFLKNKAYPLLEGCTSFLLDWLIEGRGGLLETNPSTSPEHMFTAPDGKTASVS 604
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
YSSTMD++II+EVFS IISAAEVL ++ D ++++V + +L PTK+A DGSIMEWA+DF
Sbjct: 605 YSSTMDISIIKEVFSMIISAAEVLGRHNDTIIKRVTEYQSKLPPTKVARDGSIMEWAEDF 664
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
DP+VHHRH+SHLFGLFPGHTI++EK PDLCKA E +L KRGE+GPGWS TWK +LWA L
Sbjct: 665 VDPDVHHRHVSHLFGLFPGHTISVEKTPDLCKAVEVSLIKRGEDGPGWSTTWKASLWAHL 724
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
H+ EH+YRM+K L LV+P+HE+ FEGGLYSNLF AHPPFQIDANFGF+ AVAEMLVQST
Sbjct: 725 HNSEHSYRMIKHLIVLVEPDHERDFEGGLYSNLFTAHPPFQIDANFGFSGAVAEMLVQST 784
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTL 777
+ DLYLLPALP DKW++GCVKGLKARGG TV+ICWK+GDL E G+++ N S L
Sbjct: 785 MKDLYLLPALPHDKWANGCVKGLKARGGVTVNICWKEGDLLEFGLWTENQN----SKVRL 840
Query: 778 HYRGTSVKVNLSAGKIYTFNRQLKCTNLH 806
HYRG V +LS G++Y+++ QLKC +
Sbjct: 841 HYRGNVVSASLSPGRVYSYDNQLKCAKTY 869
>gi|224056206|ref|XP_002298755.1| predicted protein [Populus trichocarpa]
gi|222846013|gb|EEE83560.1| predicted protein [Populus trichocarpa]
Length = 843
Score = 1173 bits (3034), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 558/806 (69%), Positives = 667/806 (82%), Gaps = 17/806 (2%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+TF+GPAK++TD IPIGNGRLGAMVWGGV SE ++LNEDTLWTG P D+T+P P
Sbjct: 28 SRPLKVTFSGPAKYWTDGIPIGNGRLGAMVWGGVSSELIQLNEDTLWTGTPTDFTDPAIP 87
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+ALS+VR+LVDSG+++EAT A+ ++FG +VY+LLGDI+LEF+ S YAE TY RELD
Sbjct: 88 QALSEVRNLVDSGKFSEATKAAARMFGKYTNVYKLLGDIKLEFNGS--TYAEGTYYRELD 145
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TAT RVKY+V +VEFTREHF+SNPDQVIVTKISGS++ S+SF VSLDS+L++ Y+
Sbjct: 146 LDTATGRVKYTVDDVEFTREHFASNPDQVIVTKISGSKAQSVSFAVSLDSILEHQCYLTD 205
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
NQ++MEG CPGKR+ + ANDDPKG++F+A+L+++IS+ + L+D KLKV G+DW
Sbjct: 206 ENQLVMEGICPGKRMTTEVKANDDPKGMKFTAVLDLQISNGARLVRLLDDNKLKVVGADW 265
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
AVLLLVASSSF+GPF++PSDSKK+PTS+S+ A+ SI+ LSYS LY+RHLDD+Q LFHRVS
Sbjct: 266 AVLLLVASSSFEGPFVDPSDSKKNPTSDSLQAMNSIKKLSYSQLYSRHLDDFQNLFHRVS 325
Query: 310 IQLSRSP---------KDIVTDTCS--EENIDTV-PSAERVKSFQTDEDPSLVELLFQFG 357
+QL +S K+++ E N D V P+ ER+KSF++DEDPSLVELLFQFG
Sbjct: 326 LQLEKSSAIGDGVSEIKNLMPSVIEDFEGNKDVVVPTVERIKSFESDEDPSLVELLFQFG 385
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLIS SRPGTQVANLQGIWN+DL P WDSAP +NINLEMNYW SLPCNL ECQEPLFD
Sbjct: 386 RYLLISCSRPGTQVANLQGIWNKDLYPAWDSAPTLNINLEMNYWPSLPCNLRECQEPLFD 445
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F+ LSINGSK AQVNY+ SGWV HH++DIW K+SAD G WA+WPM GAW+CTHLWEH
Sbjct: 446 FIKSLSINGSKVAQVNYITSGWVAHHRSDIWEKASADMGNPKWAIWPMAGAWVCTHLWEH 505
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y YT+D+DFL AYPLLEGCASFL+DWLIEG+DGYLETNPSTSPEH FIAPDG A VS
Sbjct: 506 YTYTLDKDFLINTAYPLLEGCASFLMDWLIEGNDGYLETNPSTSPEHMFIAPDGNSASVS 565
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
YSSTMDMAII EVFSAI+SA+EVL ++EDALV+KVLK+ PRL P KIA DGSIMEWA +F
Sbjct: 566 YSSTMDMAIINEVFSAIVSASEVLGRSEDALVQKVLKAQPRLYPPKIAPDGSIMEWALNF 625
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
KDPEV HRH+SHLFGLFPGH+IT++KNP+LCKAAE TL KRGE+GPGWS WKTA+WARL
Sbjct: 626 KDPEVKHRHISHLFGLFPGHSITLKKNPELCKAAENTLYKRGEDGPGWSTVWKTAVWARL 685
Query: 658 HDQEHAYRMVKRLFNLVDPEHEK-HFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
+ EHAY MVK L LVDP +K FEGGLYSNLFAAHPPFQIDAN GF AAV+EMLVQS
Sbjct: 686 QNSEHAYTMVKHLIRLVDPADQKIGFEGGLYSNLFAAHPPFQIDANLGFPAAVSEMLVQS 745
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKT 776
T+ DLYLLPALP DKW+ GCVKGL+ARGG TV+ICW GDL EVG++ + S +
Sbjct: 746 TMTDLYLLPALPRDKWAKGCVKGLQARGGNTVNICWDKGDLQEVGLW--LKKDGSCSLQR 803
Query: 777 LHYRGTSVKVNLSAGKIYTFNRQLKC 802
LHYRGT+V +LS+G IYTFN QL+C
Sbjct: 804 LHYRGTTVTTSLSSGIIYTFNSQLQC 829
>gi|356536151|ref|XP_003536603.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
Length = 877
Score = 1171 bits (3030), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 548/809 (67%), Positives = 657/809 (81%), Gaps = 18/809 (2%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+TF PA H+TDAIPIGNGRLGAMVWG VPSE L+LNEDTLWTG+PGDYTN AP+A
Sbjct: 66 PLKVTFAEPATHWTDAIPIGNGRLGAMVWGAVPSEALQLNEDTLWTGIPGDYTNKSAPQA 125
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L++VR LV+ ++AEATAA+VKL G P+DV+QLLGDI+LEF DSHL Y++E+Y RELDL+
Sbjct: 126 LAEVRKLVNDRKFAEATAAAVKLSGEPSDVFQLLGDIKLEFHDSHLNYSKESYYRELDLD 185
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TATA++KYSVG+VEFTREHF+SNPDQVIVT++S S+ GSLSF V DS + + S V+G N
Sbjct: 186 TATAKIKYSVGDVEFTREHFASNPDQVIVTRLSASKPGSLSFTVYFDSKMHHDSRVSGQN 245
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
QI +EGRCPG RI P+ N+ D+P+GIQFSA+L+++IS D+G I L+DKKL+VEGSD A+
Sbjct: 246 QIKIEGRCPGSRIRPRVNSIDNPQGIQFSAVLDMQISKDKGVIHVLDDKKLRVEGSDSAI 305
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL ASSSFDGPF P DSKKDP SES+S + S++ SY DLY RHL DYQ LFHRVS+Q
Sbjct: 306 LLLTASSSFDGPFTKPEDSKKDPASESLSRMVSVKKFSYDDLYARHLADYQNLFHRVSLQ 365
Query: 312 LSRSPK--------------DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
LS+S K T+ + DT+P++ RVKSFQTDEDPS VELLFQ+G
Sbjct: 366 LSKSSKTGSGKSVLEGRKLVSSQTNISQKRGDDTIPTSARVKSFQTDEDPSFVELLFQYG 425
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLIS SRPGTQVANLQGIWN+D+ P WD APH+NINL+MNYW SL CNL ECQEPLFD
Sbjct: 426 RYLLISCSRPGTQVANLQGIWNKDVEPAWDGAPHLNINLQMNYWPSLACNLHECQEPLFD 485
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F++ LS+ G KTA+VNY A+GWV H +DIW K+S DRG+ VWALWPMGGAWLCTHLWEH
Sbjct: 486 FISSLSVIGKKTAKVNYEANGWVAHQVSDIWGKTSPDRGEAVWALWPMGGAWLCTHLWEH 545
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y YTMD+DFL+ +AYPLLEGC +FLLDWLIEG G LETNPSTSPEH F APDGK A VS
Sbjct: 546 YIYTMDKDFLKNKAYPLLEGCTTFLLDWLIEGRGGLLETNPSTSPEHMFTAPDGKTASVS 605
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
YSSTMD++II+EVFS IISAAEVL ++ D ++++V K +L PTK+A DGSIMEWA+DF
Sbjct: 606 YSSTMDISIIKEVFSMIISAAEVLGRHNDTIIKRVTKYQSKLPPTKVARDGSIMEWAEDF 665
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
DP+VHHRH+SHLFGLFPGHTI++EK PDLCKA E +L KRG++GPGWS TWK +LWA L
Sbjct: 666 VDPDVHHRHVSHLFGLFPGHTISVEKTPDLCKAVEVSLIKRGDDGPGWSTTWKASLWAHL 725
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
H+ EHAYRM+K L LV+P+HE+ FEGGLYSNLF AHPPFQIDANFGF+ A+AEMLVQST
Sbjct: 726 HNSEHAYRMIKHLIVLVEPDHERDFEGGLYSNLFTAHPPFQIDANFGFSGAIAEMLVQST 785
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTL 777
DLYLLPALP DKW++GCVKGLKARGG TV+ICWK+GDL E G+++ N S L
Sbjct: 786 TKDLYLLPALPRDKWANGCVKGLKARGGVTVNICWKEGDLLEFGLWTENQN----SQLRL 841
Query: 778 HYRGTSVKVNLSAGKIYTFNRQLKCTNLH 806
HYRG V +LS G++Y++N LKC +
Sbjct: 842 HYRGNVVLTSLSPGRVYSYNNLLKCVKAY 870
>gi|449446103|ref|XP_004140811.1| PREDICTED: alpha-L-fucosidase 2-like [Cucumis sativus]
Length = 803
Score = 1169 bits (3023), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 553/800 (69%), Positives = 661/800 (82%), Gaps = 6/800 (0%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
+++PLK+TFN PAKH+TDAIPIGNGRLGAMVWGGV +E L+LNEDTLWTG P DYTNPDA
Sbjct: 4 SSDPLKLTFNAPAKHWTDAIPIGNGRLGAMVWGGVDTEILQLNEDTLWTGTPADYTNPDA 63
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
P+AL +VR LVD G+YAEAT A+VKL G P+DVYQLLGDI+LEF+ SH Y ETY REL
Sbjct: 64 PEALREVRKLVDDGKYAEATEAAVKLSGKPSDVYQLLGDIKLEFEVSHQSYTPETYHREL 123
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV- 187
DLNTATARVKYSVG+VEFTREHF+SNPDQ IVTKI+ S+ GSL+F VS+DS L + S+V
Sbjct: 124 DLNTATARVKYSVGDVEFTREHFASNPDQAIVTKIAASKPGSLTFIVSIDSKLHHSSHVV 183
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
+G + I++ G C G RIPPK + +D+PKGIQ+SA+L +++SD + L++KKLKV GS
Sbjct: 184 DGQSLIVLHGSCRGVRIPPKMDFDDNPKGIQYSAVLSLQVSDGSVVVHDLDEKKLKVNGS 243
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
DWAVL LVASSSF GPF PS S KDP+SES++ ++ I+ LSYS+LY RHL+DYQ LF R
Sbjct: 244 DWAVLRLVASSSFKGPFTQPSLSGKDPSSESLATMKKIKGLSYSNLYARHLNDYQSLFQR 303
Query: 308 VSIQLSRSPKDIVTDTC-SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
VS+ LS+S K+ + + + +AERVKSFQTDEDPSLVELLFQ+ RYLLIS SR
Sbjct: 304 VSLHLSKSSKNESSSPNSGGKEVRVASTAERVKSFQTDEDPSLVELLFQYSRYLLISCSR 363
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQVANLQGIWN+++ P WD APH+NINL+MNYW SL CNL ECQEPLFDF ++LS+NG
Sbjct: 364 PGTQVANLQGIWNKNVEPAWDGAPHLNINLQMNYWPSLSCNLKECQEPLFDFTSFLSVNG 423
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
KTA+ NY ASGWV H +DIWAKSS DRG+ VWALWPMGGAWLCTHLWEHY YTMD++F
Sbjct: 424 RKTAKANYEASGWVAHQVSDIWAKSSPDRGQAVWALWPMGGAWLCTHLWEHYTYTMDKNF 483
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L+ +AYPL+EGCASFLLDWLI+G DGYLETNPSTSPEH FIAPDGK A VSYS+TMDMAI
Sbjct: 484 LKNKAYPLMEGCASFLLDWLIDGKDGYLETNPSTSPEHMFIAPDGKPASVSYSTTMDMAI 543
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+EVFS+IISAAE+L K +D ++KV K+ RL P KIA+DGS+MEWA DF+D +VHHRH
Sbjct: 544 TKEVFSSIISAAEILGKTKDTFIDKVRKAQARLLPYKIAKDGSLMEWALDFEDQDVHHRH 603
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SHLFGLFPGHTIT+EK P++ +AA TL KRGEEGPGWS WK ALWARLH+ EHAY+M
Sbjct: 604 VSHLFGLFPGHTITVEKTPNISEAASNTLHKRGEEGPGWSTAWKIALWARLHNSEHAYQM 663
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
VK LF+LVDP+HE +EGGLYSNLF AHPPFQIDANFGF+AA+AEMLVQST+NDLYLLPA
Sbjct: 664 VKHLFDLVDPDHESDYEGGLYSNLFTAHPPFQIDANFGFSAAIAEMLVQSTINDLYLLPA 723
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKV 786
LP + W GCVKGLKARGG TV++CW GDL+EVG++S ++ S TLHYR T+V
Sbjct: 724 LPRNVWPDGCVKGLKARGGLTVNMCWTGGDLNEVGLWS----SEQISLTTLHYRETTVAA 779
Query: 787 NLSAGKIYTFNRQLKCTNLH 806
NLS+G +YTFN+ LKC +
Sbjct: 780 NLSSGTVYTFNKLLKCVRTY 799
>gi|356575686|ref|XP_003555969.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
Length = 874
Score = 1166 bits (3016), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 549/820 (66%), Positives = 664/820 (80%), Gaps = 20/820 (2%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ N ES PLK+TF PA H+TDAIPIGNGRLGAMVWG VPSE L+LNEDTLWTG+P
Sbjct: 54 LTNGESPP--RPLKVTFAEPATHWTDAIPIGNGRLGAMVWGAVPSEALQLNEDTLWTGIP 111
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
DYTN AP+AL++VR LVD +++EATAA+VKL G P++VYQLLGDI+LEF DSHL Y+
Sbjct: 112 RDYTNSSAPQALAEVRKLVDDRKFSEATAAAVKLSGDPSEVYQLLGDIKLEFHDSHLNYS 171
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+E+Y RELDL+TATA +KYSVG+VEFTREHF+SNPDQVIVT++S S+ GSLSF V DS
Sbjct: 172 KESYYRELDLDTATANIKYSVGDVEFTREHFASNPDQVIVTRLSTSKPGSLSFTVYFDSK 231
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + S V+G NQIIMEGRCPG RIPP+ N+ D+P+GIQFSA+L+++IS D+G I L+DK
Sbjct: 232 MHHDSRVSGQNQIIMEGRCPGSRIPPRVNSIDNPQGIQFSAVLDMQISKDKGFIHVLDDK 291
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL+VEGSDWA+LLL ASSSFDGPF P DSKKDP SES+S + S++ +SY DLY RHL D
Sbjct: 292 KLRVEGSDWAILLLTASSSFDGPFTKPEDSKKDPASESLSRMVSVKKISYGDLYARHLAD 351
Query: 301 YQKLFHRVSIQLSRSPKDI----VTD----TCSEENI------DTVPSAERVKSFQTDED 346
YQ LFHRVS+QLS+S K + V D S+ NI DT+P++ RVKSFQTDED
Sbjct: 352 YQNLFHRVSLQLSKSSKTVSGKSVLDRRKLVSSQTNISQMGGDDTIPTSARVKSFQTDED 411
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
PS VELLFQ+GRYLLIS SRPGTQVANLQGIWN+D+ P W+ APH+NINL++NYW SL C
Sbjct: 412 PSFVELLFQYGRYLLISCSRPGTQVANLQGIWNKDVEPAWEGAPHLNINLQINYWPSLAC 471
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
NL ECQEPLFDF++ LS+ G KTA+V+Y A+GWV HH +DIW K+S +G+ VWA+WPMG
Sbjct: 472 NLHECQEPLFDFISSLSVIGKKTAKVSYEANGWVAHHVSDIWGKTSPGQGQAVWAVWPMG 531
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
GAWLCTHLWEHY YT+D+DFL+ +AYPLLEGC SFLLDWLIEG G LETNPSTSPEH F
Sbjct: 532 GAWLCTHLWEHYTYTLDKDFLKNKAYPLLEGCTSFLLDWLIEGRGGLLETNPSTSPEHMF 591
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
APDGK A VSYSSTMD++II+EVFS IISAAEVL ++ D ++++ + +L PTK+A
Sbjct: 592 TAPDGKTASVSYSSTMDISIIKEVFSMIISAAEVLGRHNDTIIKRATEYQSKLPPTKVAR 651
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
DGSIMEWA+DFKDP VHHRH+SHLFGLFPGHTI++E PDLCKA E +L KRG++GPGWS
Sbjct: 652 DGSIMEWAEDFKDPTVHHRHVSHLFGLFPGHTISVENTPDLCKAVEVSLIKRGDDGPGWS 711
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFT 706
TWK +LWA LH+ EHAYRM+K L LV+P+H EGGL+SNLF AHPPFQIDANFGF+
Sbjct: 712 TTWKASLWAHLHNSEHAYRMIKHLIVLVEPDHGFGLEGGLFSNLFTAHPPFQIDANFGFS 771
Query: 707 AAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
AA+AEMLVQST DLYLLPALP DKW++GCVKGLKARGG TV+ICWK+GDL E G+++
Sbjct: 772 AAIAEMLVQSTTKDLYLLPALPRDKWANGCVKGLKARGGVTVNICWKEGDLLEFGLWTEN 831
Query: 767 SNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLH 806
N S LHYRG V +LS G++Y+++ QLKC +
Sbjct: 832 QN----SKVRLHYRGNVVLASLSPGRVYSYDNQLKCAKTY 867
>gi|158302693|dbj|BAF85832.1| alpha-1,2-fucosidase [Lilium longiflorum]
Length = 854
Score = 1142 bits (2954), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 545/825 (66%), Positives = 651/825 (78%), Gaps = 38/825 (4%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PLK+ F PAKH+TDA PIGNGRLGAMVWGGVP+ETL+LN+DTLWTGVPG+YTNPDAP
Sbjct: 31 QPLKLRFLEPAKHWTDAAPIGNGRLGAMVWGGVPTETLQLNDDTLWTGVPGNYTNPDAPT 90
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
LS VR LVD G+YAEA+ A+ L GHP+DVYQ LG + LEF DSH+ Y+ Y+RELDL
Sbjct: 91 VLSKVRKLVDDGKYAEASLAAFDLSGHPSDVYQPLGTMNLEFGDSHVAYS--NYQRELDL 148
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TATA+V YS+G+VEFTREHFSSNP QV+VTKIS ++SGSLSF VSLDS L + S +G
Sbjct: 149 TTATAKVTYSLGDVEFTREHFSSNPHQVLVTKISANKSGSLSFIVSLDSKLHHQSSADGV 208
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N+IIMEG CPG+RI PK N ++ KGIQFSA+L++KI + + LED KLKVEGSDWA
Sbjct: 209 NRIIMEGSCPGRRIAPKGNLFENNKGIQFSAVLDLKIGGNDSNVQVLEDMKLKVEGSDWA 268
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
VLLL ASSSF+GPFINPSDS+KDP S S+ L +I+ +S+S L+T H++DYQ LFH V++
Sbjct: 269 VLLLAASSSFEGPFINPSDSEKDPKSASLDTLNAIQKISFSQLFTHHVEDYQSLFHCVTL 328
Query: 311 QLSRSPKD---------------IVTDTCSEENIDTV----PS-------------AERV 338
QLS+ I+ TCS N++ V PS AERV
Sbjct: 329 QLSKGSNSGGRTTVPLSQSYDSSILGTTCSLNNMEKVNTSNPSYSDQLTEEVLISTAERV 388
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
KSF+ DEDPSLVELLF +GRYLLIS SRPGTQ+ANLQGIW++D+ P WD+APH+NINL+M
Sbjct: 389 KSFKVDEDPSLVELLFHYGRYLLISCSRPGTQIANLQGIWSKDIEPAWDAAPHLNINLQM 448
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
NYW SL CNLSECQEPLFD++ L+ING+KTA+VNY ASGWV H +DIWAK+S DRG
Sbjct: 449 NYWPSLSCNLSECQEPLFDYIASLAINGAKTAKVNYEASGWVAHQVSDIWAKTSPDRGDP 508
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
VWALWPMGGAWLCTHLWEHY ++MD+ FLE AYPLLEGCASFLLDWLIEG GYLETNP
Sbjct: 509 VWALWPMGGAWLCTHLWEHYTFSMDKVFLENTAYPLLEGCASFLLDWLIEGRGGYLETNP 568
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
STSPEH FIAPD K A VSYSSTMDMAIIREVFS IS+AE+L + E LV+++ K++PR
Sbjct: 569 STSPEHSFIAPDSKTASVSYSSTMDMAIIREVFSEFISSAEILGRVESKLVKQIKKAIPR 628
Query: 579 LRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
L PTKIA DG+IMEWAQ+F+DPEVHHRH+SHLFGLFPGHTIT+EK PDLCKAA +L KR
Sbjct: 629 LPPTKIARDGTIMEWAQNFEDPEVHHRHISHLFGLFPGHTITMEKTPDLCKAAANSLYKR 688
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G+ GPGWS TWK + WARL + EHAY+++K+L NLVDP+HE FEGG+YSNLF AHPPFQ
Sbjct: 689 GDVGPGWSTTWKMSCWARLREAEHAYKLIKQLINLVDPDHESDFEGGVYSNLFTAHPPFQ 748
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
IDANFGF+AA+AEML+QST DLYLLPALP KW GCVKGLKARG TVSI WK+G+LH
Sbjct: 749 IDANFGFSAAIAEMLIQSTEQDLYLLPALPRAKWGEGCVKGLKARGNVTVSISWKEGELH 808
Query: 759 EVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCT 803
E +++ + + + + LHY+G+ V +NL G +YTFNR L+C
Sbjct: 809 E----AHFLSKNQNLVRKLHYKGSVVTMNLCCGSVYTFNRFLRCV 849
>gi|356495827|ref|XP_003516773.1| PREDICTED: alpha-L-fucosidase 2-like [Glycine max]
Length = 802
Score = 1115 bits (2884), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 541/802 (67%), Positives = 637/802 (79%), Gaps = 12/802 (1%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
AE + N LKI F KH+TDA+PIGNGRLGAMV G V SET+ LNEDTLWTG P DY
Sbjct: 2 AEGRGSRN-LKIRFREGGKHWTDAVPIGNGRLGAMVCGHVHSETIHLNEDTLWTGTPADY 60
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA-EE 122
TN AP ALS VR+LV Y +ATAAS L G+P++ Y LLGDI+L+FD SHL ++
Sbjct: 61 TNSKAPPALSHVRNLVHRQHYPQATAASSALTGNPSEAYLLLGDIQLDFDYSHLTPGLQQ 120
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y RELDL+TAT +V+YSVG+V+FTREHF+S PDQ+IVT+IS S+ LSF VSL S +
Sbjct: 121 PYERELDLDTATVKVRYSVGDVQFTREHFASYPDQLIVTQISSSKPAKLSFTVSLLSKII 180
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N +YVN NQIIM+G CPGKRI +P GIQFSAIL++KI G I L++ KL
Sbjct: 181 NQTYVNAPNQIIMKGSCPGKRI------QHNPHGIQFSAILDLKIGGTDGVIHILDNNKL 234
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
KVE SDWAVLLLVASSSF GPF PSDSKKDPTS+ + L SI N+SYS LY RHL+DYQ
Sbjct: 235 KVEASDWAVLLLVASSSFSGPFTAPSDSKKDPTSQCFTTLSSISNVSYSHLYARHLNDYQ 294
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
LFHRVS+QL RS + +++ + + +++RVKSFQTDEDPSLVELLFQ+GRYLLI
Sbjct: 295 GLFHRVSLQLMRSTRPNISE---DSTVTQASTSDRVKSFQTDEDPSLVELLFQYGRYLLI 351
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSSRPGTQVANLQGIWN+DL P WD APH+NINLEMNYW +LPCNLSECQEPLFD+++ L
Sbjct: 352 SSSRPGTQVANLQGIWNKDLEPVWDGAPHLNINLEMNYWPALPCNLSECQEPLFDYISLL 411
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S+NGSKTA VNY A+GWV H K+DIWA++SA +G VVWALWPMGGAWLCTHLWEHY YTM
Sbjct: 412 SVNGSKTAHVNYQANGWVAHSKSDIWARTSAGQGDVVWALWPMGGAWLCTHLWEHYAYTM 471
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D DFL+ +AYPL+EGC SFLL WLIE +GYLETNPSTSPEH FIAP+G+ ACVS SSTM
Sbjct: 472 DEDFLKYKAYPLMEGCVSFLLSWLIEDSEGYLETNPSTSPEHYFIAPNGEPACVSQSSTM 531
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D+AII EVFS +SAAEV+ + +D +V +V K+ PRLRP IA+DGSIMEW +DFKDPEV
Sbjct: 532 DVAIINEVFSTFLSAAEVIGRTKDNIVGEVRKAQPRLRPINIAQDGSIMEWVKDFKDPEV 591
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HHRHLSHLFGLFPGHTIT ++ P L +AAEK+L KRGEEGPGWS TWKTA WARL + +
Sbjct: 592 HHRHLSHLFGLFPGHTITFKETPALIEAAEKSLYKRGEEGPGWSTTWKTACWARLQNSSN 651
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY+M+K L NLVDP+HE+ F+GGLYSNLFAAHPPFQIDANFGF AAVAEMLVQSTL+DL+
Sbjct: 652 AYKMIKHLINLVDPDHERPFQGGLYSNLFAAHPPFQIDANFGFAAAVAEMLVQSTLSDLF 711
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALPW+KW +G +KGLKARGG TV+I W++GDL EVGI+S K +HYRGT
Sbjct: 712 LLPALPWEKWPNGSLKGLKARGGTTVNIYWREGDLQEVGIWSE-DQTRTTLRKRIHYRGT 770
Query: 783 SVKVNLSAGKIYTFNRQLKCTN 804
V +L +G Y FN QLKC N
Sbjct: 771 MVTADLVSGLFYKFNGQLKCLN 792
>gi|297802554|ref|XP_002869161.1| hypothetical protein ARALYDRAFT_912968 [Arabidopsis lyrata subsp.
lyrata]
gi|297314997|gb|EFH45420.1| hypothetical protein ARALYDRAFT_912968 [Arabidopsis lyrata subsp.
lyrata]
Length = 844
Score = 1097 bits (2836), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 518/802 (64%), Positives = 636/802 (79%), Gaps = 22/802 (2%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+TF GP++++TDAIPIGNGRLGA +WGGV SETL +NEDT+WTGVP DYTNP+AP
Sbjct: 48 SRPLKLTFGGPSRNWTDAIPIGNGRLGATIWGGVSSETLNINEDTIWTGVPADYTNPNAP 107
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+AL++VR LVD YAEAT+ +VKL G P+DVYQL+GD+ LEF SH KY + +YRRELD
Sbjct: 108 EALAEVRRLVDEKNYAEATSEAVKLSGQPSDVYQLVGDLNLEFGSSHRKYTQTSYRRELD 167
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A+V YSVG V+F+RE F+SNPDQVIV KI S+ GSLSF VS DS L +HS N
Sbjct: 168 LETAVAKVSYSVGAVDFSREFFASNPDQVIVAKIYASKPGSLSFKVSFDSELHHHSETNP 227
Query: 190 N-NQIIMEGRCPGKRIP----PKANAN----DDPKGIQFSAILEIKISDDRGTISALEDK 240
NQI+M G C KR+P NA DD KG+QF++ILE+++S+ G++S+L K
Sbjct: 228 KANQILMRGSCRPKRLPVNLKKSINATNIPYDDHKGLQFASILEVRVSNG-GSVSSLGGK 286
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL VE +DWAVLLL ASS+FDGPF P+DSK+DP E + S++ SYSDLY RHL D
Sbjct: 287 KLSVEKADWAVLLLAASSNFDGPFTMPADSKRDPAKECAKRISSVQKYSYSDLYARHLGD 346
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQKLF+RVS+QLS S + + +AERV+SF+TDEDP+LVELLFQ+GRYL
Sbjct: 347 YQKLFNRVSLQLSGSSGNKTVQQAAS-------TAERVRSFKTDEDPALVELLFQYGRYL 399
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSRPGTQVANLQGIWN D+ P WD APH+NINL+MNYW SLP N+ ECQEPLFD+++
Sbjct: 400 LISSSRPGTQVANLQGIWNRDIQPPWDGAPHLNINLQMNYWHSLPGNIRECQEPLFDYMS 459
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+ING KTAQ+NY ASGWV H +DIWAK+S DRG+ VWALWPMGGAWLCTH WEHY Y
Sbjct: 460 ALAINGRKTAQMNYGASGWVAHQVSDIWAKTSPDRGEAVWALWPMGGAWLCTHAWEHYTY 519
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
TMD++FL+K+ YPLLEGC SFLLDWLI+G DG+L+TNPSTSPEH F AP+GK A VSYSS
Sbjct: 520 TMDKEFLKKKGYPLLEGCTSFLLDWLIKGKDGFLQTNPSTSPEHMFTAPNGKPASVSYSS 579
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD+AII+EVF+ I++A+E+L K D L+ KV+ + +L PT+I++DGSIMEWA+DF+DP
Sbjct: 580 TMDIAIIKEVFADIVTASEILGKTNDTLIGKVIAAQAKLPPTRISKDGSIMEWAEDFEDP 639
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
E+HHRH+SHLFGLFPGHTIT+EK+P+L KA E TL+KRGEEGPGWS TWK ALWARLH+
Sbjct: 640 EIHHRHVSHLFGLFPGHTITVEKSPELAKAVEATLKKRGEEGPGWSTTWKAALWARLHNS 699
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
EHAYRMV +F+LVDP +E+++EGGLYSN+F AHPPFQIDANFGF AAVAEMLVQST D
Sbjct: 700 EHAYRMVAHIFDLVDPLNERNYEGGLYSNMFTAHPPFQIDANFGFAAAVAEMLVQSTTKD 759
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
L+LLPALP DKW +G VKGL+ARGG TVSI W +G+L E G++S + + YR
Sbjct: 760 LHLLPALPADKWPNGIVKGLRARGGVTVSIKWMEGNLVEFGLWS-----EQIVSTRIVYR 814
Query: 781 GTSVKVNLSAGKIYTFNRQLKC 802
G S L GK++TF++ L+C
Sbjct: 815 GISAAAELLPGKVFTFDKDLRC 836
>gi|449531868|ref|XP_004172907.1| PREDICTED: alpha-L-fucosidase 2-like, partial [Cucumis sativus]
Length = 764
Score = 1089 bits (2817), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 519/758 (68%), Positives = 619/758 (81%), Gaps = 7/758 (0%)
Query: 52 EDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELE 111
EDTLWTG P DYTNPDAP+AL +VR LVD G+YAEAT A+VKL G P+DVYQLLGDI+LE
Sbjct: 7 EDTLWTGTPADYTNPDAPEALREVRKLVDDGKYAEATEAAVKLSGKPSDVYQLLGDIKLE 66
Query: 112 FDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
F+ SH Y ETY RELDLNTATARVKYSVG+VEFTREHF+SNPDQ IVTKI+ S+ GSL
Sbjct: 67 FEVSHQSYTPETYHRELDLNTATARVKYSVGDVEFTREHFASNPDQAIVTKIAASKPGSL 126
Query: 172 SFNVSLDSLLDNHSYV-NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
+F VS+DS L + S+V +G + I++ G C G RIPPK + +D+PKGIQ+SA+L +++SD
Sbjct: 127 TFIVSIDSKLHHSSHVVDGQSLIVLHGSCRGVRIPPKMDFDDNPKGIQYSAVLSLQVSDG 186
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
+ L++KKLKV GSDWAVL LVASSSF GPF PS S KDP+SES++ ++ I+ LSY
Sbjct: 187 SVVVHDLDEKKLKVNGSDWAVLRLVASSSFKGPFTQPSLSGKDPSSESLATMKKIKGLSY 246
Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC-SEENIDTVPSAERVKSFQTDEDPSL 349
S+LY RHL+DYQ LF RVS+ LS+S K+ + + + +AERVKSFQTDEDPSL
Sbjct: 247 SNLYARHLNDYQSLFQRVSLHLSKSSKNESSSPNSGGKEVRVASTAERVKSFQTDEDPSL 306
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
VELLFQ+ RYLLIS SRPGTQVANLQGIWN+++ P WD APH+NINL+MNYW SL CNL
Sbjct: 307 VELLFQYSRYLLISCSRPGTQVANLQGIWNKNVEPAWDGAPHLNINLQMNYWPSLSCNLK 366
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
ECQEPLFDF ++LS+NG KTA+ NY ASGWV H +DIWAKSS DRG+ VWALWPMGGAW
Sbjct: 367 ECQEPLFDFTSFLSVNGRKTAKANYEASGWVAHQVSDIWAKSSPDRGQAVWALWPMGGAW 426
Query: 470 LCTHLWEHYNYTMDR-DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
LCTHLWEHY YTMD+ F + +AYPL+EGCASFLLDWLI+G DGYLETNPSTSPEH FIA
Sbjct: 427 LCTHLWEHYTYTMDKVKFFKNKAYPLMEGCASFLLDWLIDGKDGYLETNPSTSPEHMFIA 486
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
PDGK A VSYS+TMDMAI +EVFS+IISAAE+L K +D ++KV K+ RL P KIA+DG
Sbjct: 487 PDGKPASVSYSTTMDMAITKEVFSSIISAAEILGKTKDTFIDKVRKAQARLLPYKIAKDG 546
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
S+MEWA DF+D +VHHRH+SHLFGLFPGHTIT+EK P++ +AA TL KRGEEGPGWS
Sbjct: 547 SLMEWALDFEDQDVHHRHVSHLFGLFPGHTITVEKTPNISEAASNTLHKRGEEGPGWSTA 606
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
WK ALWARLH+ EHAY+MVK LF+LVDP+HE +EGGLYSNLF AHPPFQIDANFGF+AA
Sbjct: 607 WKIALWARLHNSEHAYQMVKHLFDLVDPDHESDYEGGLYSNLFTAHPPFQIDANFGFSAA 666
Query: 709 VAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
+AEMLVQST+NDLYLLPALP + W GCVKGLKARGG TV++CW GDL+EVG++S
Sbjct: 667 IAEMLVQSTINDLYLLPALPRNVWPDGCVKGLKARGGLTVNMCWTGGDLNEVGLWS---- 722
Query: 769 NDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLH 806
++ S TLHYR T+V NLS+G +YTFN+ LKC +
Sbjct: 723 SEQISLTTLHYRETTVAANLSSGTVYTFNKLLKCVRTY 760
>gi|30689979|ref|NP_195152.2| alpha-L-fucosidase 2 [Arabidopsis thaliana]
gi|75245768|sp|Q8L7W8.1|FUCO2_ARATH RecName: Full=Alpha-L-fucosidase 2; AltName:
Full=Alpha-1,2-fucosidase 2; AltName:
Full=Alpha-L-fucoside fucohydrolase 2; Flags: Precursor
gi|21928117|gb|AAM78086.1| AT4g34260/F10M10_30 [Arabidopsis thaliana]
gi|27363438|gb|AAO11638.1| At4g34260/F10M10_30 [Arabidopsis thaliana]
gi|332660949|gb|AEE86349.1| alpha-L-fucosidase 2 [Arabidopsis thaliana]
Length = 843
Score = 1086 bits (2809), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 517/802 (64%), Positives = 631/802 (78%), Gaps = 22/802 (2%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+TF GP++++TDAIPIGNGRLGA +WGGV SE L +NEDT+WTGVP DYTN AP
Sbjct: 49 SRPLKLTFGGPSRNWTDAIPIGNGRLGATIWGGVSSEILNINEDTIWTGVPADYTNQKAP 108
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+AL++VR LVD YAEAT+ +VKL G P+DVYQ++GD+ LEFD SH KY + +YRRELD
Sbjct: 109 EALAEVRRLVDERNYAEATSEAVKLSGQPSDVYQIVGDLNLEFDSSHRKYTQASYRRELD 168
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A+V YSVG V+F+RE F+SNPDQVI+ KI S+ GSLSF VS DS L +HS N
Sbjct: 169 LETAVAKVSYSVGAVDFSREFFASNPDQVIIAKIYASKPGSLSFKVSFDSELHHHSETNP 228
Query: 190 N-NQIIMEGRCPGKRIP----PKANAN----DDPKGIQFSAILEIKISDDRGTISALEDK 240
NQI+M G C KR+P NA DD KG+QF++ILE+++S+ G++S+L K
Sbjct: 229 KANQILMRGSCRPKRLPVNLKKSINATNIPYDDHKGLQFASILEVRVSNG-GSVSSLGGK 287
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL VE +DWAVLLL ASS+FDGPF P DSK DP E ++ + S++ SYSDLY RHL D
Sbjct: 288 KLSVEKADWAVLLLAASSNFDGPFTMPVDSKIDPAKECVNRISSVQKYSYSDLYARHLGD 347
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQKLF+RVS+ LS S + +E +AERV+SF+TD+DPSLVELLFQ+GRYL
Sbjct: 348 YQKLFNRVSLHLSGS-------STNETVQQATSTAERVRSFKTDQDPSLVELLFQYGRYL 400
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSRPGTQVANLQGIWN D+ P WD APH+NINL+MNYW SLP N+ ECQEPLFD+++
Sbjct: 401 LISSSRPGTQVANLQGIWNRDIQPPWDGAPHLNINLQMNYWHSLPGNIRECQEPLFDYMS 460
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+ING KTAQVNY ASGWV H +DIWAK+S DRG+ VWALWPMGGAWLCTH WEHY Y
Sbjct: 461 ALAINGRKTAQVNYGASGWVAHQVSDIWAKTSPDRGEAVWALWPMGGAWLCTHAWEHYTY 520
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
TMD++FL+K+ YPLLEGC SFLLDWLI+G DG+L+TNPSTSPEH F AP GK A VSYSS
Sbjct: 521 TMDKEFLKKKGYPLLEGCTSFLLDWLIKGKDGFLQTNPSTSPEHMFTAPIGKPASVSYSS 580
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD+AII+EVF+ I+SA+E+L K D L+ KV+ + +L PT+I++DGSI EWA+DF+DP
Sbjct: 581 TMDIAIIKEVFADIVSASEILGKTNDTLIGKVIAAQAKLPPTRISKDGSIREWAEDFEDP 640
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
EVHHRH+SHLFGLFPGHTIT+EK+P+L KA E TL+KRGEEGPGWS TWK ALWARLH+
Sbjct: 641 EVHHRHVSHLFGLFPGHTITVEKSPELAKAVEATLKKRGEEGPGWSTTWKAALWARLHNS 700
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
EHAYRMV +F+LVDP +E+++EGGLYSN+F AHPPFQIDANFGF AAVAEMLVQST D
Sbjct: 701 EHAYRMVTHIFDLVDPLNERNYEGGLYSNMFTAHPPFQIDANFGFAAAVAEMLVQSTTKD 760
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
LYLLPALP DKW +G V GL+ARGG TVSI W +G+L E G++S + + YR
Sbjct: 761 LYLLPALPADKWPNGIVNGLRARGGVTVSIKWMEGNLVEFGLWS-----EQIVSTRIVYR 815
Query: 781 GTSVKVNLSAGKIYTFNRQLKC 802
G S L GK++TF++ L+C
Sbjct: 816 GISAAAELLPGKVFTFDKDLRC 837
>gi|296083105|emb|CBI22509.3| unnamed protein product [Vitis vinifera]
Length = 781
Score = 1069 bits (2764), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 537/833 (64%), Positives = 618/833 (74%), Gaps = 125/833 (15%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F GPAKH+TDA+PIGNGRLGAMVWGGV SETL+LNE TLWTG PG+YTNPDAPKA
Sbjct: 34 PLKVRFFGPAKHWTDALPIGNGRLGAMVWGGVASETLQLNEGTLWTGTPGNYTNPDAPKA 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPAD------------------------------- 100
LS+VR LVD+G Y AT A+VKL G+P+D
Sbjct: 94 LSEVRKLVDNGDYVAATEAAVKLSGNPSDDELPSLLLDSFFDCDHVGLEVCVKYAPLLMG 153
Query: 101 -------VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSS 153
VYQLLGDI LEF+DSHL YAEETY RELDL+TAT +KYSVG+VE+TREHF+S
Sbjct: 154 YLKFNFGVYQLLGDINLEFEDSHLAYAEETYSRELDLDTATVTIKYSVGDVEYTREHFAS 213
Query: 154 NPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD 213
PDQVIVTKISGS+ GS+SF VSLDS +IPPK
Sbjct: 214 YPDQVIVTKISGSKPGSVSFTVSLDS-----------------------KIPPKV----- 245
Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
G I+ L+DKKLKVEGSDWAV
Sbjct: 246 ------------------GVINVLDDKKLKVEGSDWAVF--------------------- 266
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
L+SI N SYSDLY RHL+DYQ LFHRVS+QLS+S K + ++ V
Sbjct: 267 -------TLKSIGNFSYSDLYARHLNDYQNLFHRVSLQLSKSSKSV---------MNRVS 310
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
+A RVKSF TDEDPSLVELLFQ+GRYLLIS SRPG+Q ANLQGIWN+D+ P WD APH+N
Sbjct: 311 TAARVKSFGTDEDPSLVELLFQYGRYLLISCSRPGSQPANLQGIWNKDIEPAWDGAPHLN 370
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
INL+MNYW SLPCNLSECQEPLFD+++ LSINGSKTA+VNY ASGWV H +DIWAK+S
Sbjct: 371 INLQMNYWPSLPCNLSECQEPLFDYMSSLSINGSKTAKVNYEASGWVTHQVSDIWAKTSP 430
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
DRG+ VWALWPMGGAWLCTHLWEHY +TMD+DFL+ +AYPLLEGCA FLLDWLIEG GY
Sbjct: 431 DRGQAVWALWPMGGAWLCTHLWEHYTFTMDKDFLKNKAYPLLEGCARFLLDWLIEGRGGY 490
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
LETNPSTSPEH FIAPDGK A VSYS+TMD+AIIREVFSA++SAAEVL KNED LV+KV
Sbjct: 491 LETNPSTSPEHMFIAPDGKPASVSYSTTMDIAIIREVFSAVVSAAEVLGKNEDELVQKVR 550
Query: 574 KSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
++ P+L PTKIA DGSIMEWAQDF+DPEVHHRH+SHLFGL+PGHTIT+EK PDLCKA +
Sbjct: 551 QAQPKLPPTKIARDGSIMEWAQDFEDPEVHHRHVSHLFGLYPGHTITVEKTPDLCKAVDY 610
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL KRGE+GPGWS TWKTALWARLH+ EHAYRMVK LF+LVDP E FEGGLYSNLF A
Sbjct: 611 TLYKRGEDGPGWSTTWKTALWARLHNSEHAYRMVKHLFDLVDPAREADFEGGLYSNLFTA 670
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
HPPFQIDANFGF AAVAEM+VQST DLYLLPALP DKW++GCVKGLKARGG TV++CWK
Sbjct: 671 HPPFQIDANFGFCAAVAEMIVQSTSKDLYLLPALPRDKWANGCVKGLKARGGVTVNVCWK 730
Query: 754 DGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLH 806
+G+LH++G++S D +S + LHYRG+ V + AG++YTF+RQLKC +
Sbjct: 731 EGELHQIGVWS----KDQNSTRRLHYRGSIVTAKMLAGRVYTFDRQLKCVKTY 779
>gi|4455171|emb|CAB36703.1| hypothetical protein [Arabidopsis thaliana]
gi|7270376|emb|CAB80143.1| hypothetical protein [Arabidopsis thaliana]
Length = 847
Score = 1056 bits (2732), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 511/807 (63%), Positives = 626/807 (77%), Gaps = 28/807 (3%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+TF GP++++TDAIPIGNGRLGA +WGGV SE L +NEDT+WTGVP DYTN AP
Sbjct: 49 SRPLKLTFGGPSRNWTDAIPIGNGRLGATIWGGVSSEILNINEDTIWTGVPADYTNQKAP 108
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+AL++VR LVD YAEAT+ +VKL G P+DVYQ++GD+ LEFD SH KY + +YRRELD
Sbjct: 109 EALAEVRRLVDERNYAEATSEAVKLSGQPSDVYQIVGDLNLEFDSSHRKYTQASYRRELD 168
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A+V YSVG V+F+RE F+SNPDQVI+ KI S+ GSLSF VS DS L +HS N
Sbjct: 169 LETAVAKVSYSVGAVDFSREFFASNPDQVIIAKIYASKPGSLSFKVSFDSELHHHSETNP 228
Query: 190 N-NQIIMEGRCPGKRIP----PKANAN----DDPKGIQFSAILEIKISDDRGTISALEDK 240
NQI+M G C KR+P NA DD KG+QF++ILE+++S+ G++S+L K
Sbjct: 229 KANQILMRGSCRPKRLPVNLKKSINATNIPYDDHKGLQFASILEVRVSNG-GSVSSLGGK 287
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL VE +DWAVLLL ASS+FDGPF P DSK DP E ++ + S++ SYSDLY RHL D
Sbjct: 288 KLSVEKADWAVLLLAASSNFDGPFTMPVDSKIDPAKECVNRISSVQKYSYSDLYARHLGD 347
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQKLF+RVS+ LS S + +E +AERV+SF+TD+DPSLVELLFQ+GRYL
Sbjct: 348 YQKLFNRVSLHLSGS-------STNETVQQATSTAERVRSFKTDQDPSLVELLFQYGRYL 400
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTW-----DSAPHVNINLEMNYWQSLPCNLSECQEPL 415
LISSSRPGTQVANLQ + L+P APH+NINL+MNYW SLP N+ ECQEPL
Sbjct: 401 LISSSRPGTQVANLQA-FVVSLTPLLLLRYCSGAPHLNINLQMNYWHSLPGNIRECQEPL 459
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
FD+++ L+ING KTAQVNY ASGWV H +DIWAK+S DRG+ VWALWPMGGAWLCTH W
Sbjct: 460 FDYMSALAINGRKTAQVNYGASGWVAHQVSDIWAKTSPDRGEAVWALWPMGGAWLCTHAW 519
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY YTMD++FL+K+ YPLLEGC SFLLDWLI+G DG+L+TNPSTSPEH F AP GK A
Sbjct: 520 EHYTYTMDKEFLKKKGYPLLEGCTSFLLDWLIKGKDGFLQTNPSTSPEHMFTAPIGKPAS 579
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
VSYSSTMD+AII+EVF+ I+SA+E+L K D L+ KV+ + +L PT+I++DGSI EWA+
Sbjct: 580 VSYSSTMDIAIIKEVFADIVSASEILGKTNDTLIGKVIAAQAKLPPTRISKDGSIREWAE 639
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
DF+DPEVHHRH+SHLFGLFPGHTIT+EK+P+L KA E TL+KRGEEGPGWS TWK ALWA
Sbjct: 640 DFEDPEVHHRHVSHLFGLFPGHTITVEKSPELAKAVEATLKKRGEEGPGWSTTWKAALWA 699
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RLH+ EHAYRMV +F+LVDP +E+++EGGLYSN+F AHPPFQIDANFGF AAVAEMLVQ
Sbjct: 700 RLHNSEHAYRMVTHIFDLVDPLNERNYEGGLYSNMFTAHPPFQIDANFGFAAAVAEMLVQ 759
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFK 775
ST DLYLLPALP DKW +G V GL+ARGG TVSI W +G+L E G++S +
Sbjct: 760 STTKDLYLLPALPADKWPNGIVNGLRARGGVTVSIKWMEGNLVEFGLWS-----EQIVST 814
Query: 776 TLHYRGTSVKVNLSAGKIYTFNRQLKC 802
+ YRG S L GK++TF++ L+C
Sbjct: 815 RIVYRGISAAAELLPGKVFTFDKDLRC 841
>gi|357146134|ref|XP_003573887.1| PREDICTED: alpha-L-fucosidase 2-like [Brachypodium distachyon]
Length = 857
Score = 1037 bits (2681), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 502/820 (61%), Positives = 615/820 (75%), Gaps = 33/820 (4%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
+ PLK+ F PAK+FTDA PIGNGRLGAMVWGGV SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 38 SRPLKVVFASPAKYFTDAAPIGNGRLGAMVWGGVASERLQLNHDTLWTGGPGNYTNPNAP 97
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
LS VRSLV G YAEATA + L G +YQ LGDI+L F H+KY Y+R LD
Sbjct: 98 TVLSKVRSLVGKGLYAEATAVAYDLSGDQTQIYQPLGDIDLAFGQ-HIKYTN--YKRYLD 154
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L +AT V Y+VG V ++REHFSSNP QVI TK+S ++ G++SF VSL + LD+ +V
Sbjct: 155 LESATVNVTYTVGEVVYSREHFSSNPHQVIATKVSANKPGAVSFTVSLATPLDHRIHVTD 214
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N+IIMEG C G+R +A+DDP GI+F AIL ++IS GT+ L D LK++G+D
Sbjct: 215 TNEIIMEGCCAGERPVGDDSASDDPTGIKFCAILYLQISGANGTLQVLNDNMLKLDGADS 274
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
AVLLL A++SF+GPF+ PS+S +P + + + L R +SYS L H+DDYQ LF RVS
Sbjct: 275 AVLLLAAATSFEGPFVKPSESTLNPKTSAFTTLNMARTMSYSQLKAYHMDDYQSLFQRVS 334
Query: 310 IQLSR-----------------SPKDIVTDTCSEE----------NIDTVPSAERVKSFQ 342
+QLSR S +DI C E+ N P+ +R+ SF
Sbjct: 335 LQLSRGSDNVLRGNSLPNSPENSCQDIAVSHCVEQISDRSWLKELNNSDKPTVDRIISFV 394
Query: 343 TDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D P WD+APH NINL+MNYW
Sbjct: 395 DDEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTRPPWDAAPHPNINLQMNYWP 454
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+LPCNLSECQEPLFDF+ LSING+KTA+VNY ASGWV H TD+WAK+S D G +WAL
Sbjct: 455 ALPCNLSECQEPLFDFIESLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWAL 514
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WPMGG+WL THLWEHY++T+D FLEK AYPLLEG ASFLL WLIEG G LETNPSTSP
Sbjct: 515 WPMGGSWLATHLWEHYSFTLDTQFLEKTAYPLLEGSASFLLSWLIEGQGGQLETNPSTSP 574
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
EH FIAPDGK ACVSYS+TMDM++IREVFSA++ +A++L K+ +V+++ K+LPRL P
Sbjct: 575 EHYFIAPDGKKACVSYSTTMDMSVIREVFSAVLLSADILGKSGTDVVQRIKKALPRLPPI 634
Query: 583 KIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG 642
KIA D +IMEWA+DF+DPEVHHRH+SHLFGL+PGHT+T+E+ PDLCKA +L KRG+EG
Sbjct: 635 KIARDITIMEWARDFQDPEVHHRHVSHLFGLYPGHTMTLEQTPDLCKAVGNSLYKRGDEG 694
Query: 643 PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDAN 702
PGWS WK ALWA LH+ EHAY+M+ +L +L+DP+HE EGGLYSNLFAAHPPFQIDAN
Sbjct: 695 PGWSTAWKMALWAHLHNSEHAYKMILQLISLIDPKHEVEKEGGLYSNLFAAHPPFQIDAN 754
Query: 703 FGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
FGF AA++EMLVQST +DLYLLPALP DKW GCVKGLKARGG TV+ICWK+G LHE +
Sbjct: 755 FGFPAALSEMLVQSTGSDLYLLPALPRDKWPHGCVKGLKARGGVTVNICWKEGSLHEALL 814
Query: 763 YSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKC 802
+S S N S LHY G +V +++SAG++Y+F+ LKC
Sbjct: 815 WSGSSQN---SLARLHYGGHNVMISVSAGQVYSFSSDLKC 851
>gi|78708252|gb|ABB47227.1| large secreted protein, putative, expressed [Oryza sativa Japonica
Group]
gi|222612646|gb|EEE50778.1| hypothetical protein OsJ_31136 [Oryza sativa Japonica Group]
Length = 851
Score = 1031 bits (2667), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 498/819 (60%), Positives = 620/819 (75%), Gaps = 35/819 (4%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL++ F P+++FTDA PIGNG LGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 34 PLEVVFASPSRYFTDAAPIGNGSLGALVWGGVASEKLQLNHDTLWTGGPGNYTNPKAPAV 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
LS VR LV+ GQYA+ATA + L G VYQ LGDI+L FD+ + E+T Y+R LDL
Sbjct: 94 LSKVRDLVNRGQYAKATAVAYGLSGDQTQVYQPLGDIDLAFDE----HVEDTNYKRNLDL 149
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TAT V Y++G V +REHFSSNP QVIVTKIS + G++SF VSL + L++ V
Sbjct: 150 RTATVNVSYTIGEVVHSREHFSSNPHQVIVTKISADKPGNVSFTVSLTTPLNHQIRVTNA 209
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N+IIMEG CPG+R NA+D P GI+FSAIL +++S GT+ L DK LK+ G+D A
Sbjct: 210 NEIIMEGYCPGERPTEYGNASDHPVGIKFSAILYLQMSGSNGTVEILNDKMLKLVGADSA 269
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
VLLL A++SF+GPF+NPS+SK DPT+ +++ L RN+SYS L H+DDYQ LF RVS+
Sbjct: 270 VLLLAAATSFEGPFVNPSESKLDPTASALTTLTVARNMSYSQLKAYHVDDYQNLFQRVSL 329
Query: 311 QLSRS-------------PKDIVTDT-----------CSEE---NIDTVPSAERVKSFQT 343
QLSR P++ + +T CS N P+ +R+ SF+
Sbjct: 330 QLSRDSNDALGGNGLVNLPENSLQETSVSDYAVQMVECSRFQGFNNSGKPTVDRILSFRD 389
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIWN++ SP WD+APH NINL+MNYW +
Sbjct: 390 DEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWNDETSPPWDAAPHPNINLQMNYWPA 449
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
LPCNLSECQEPLFDF+ LS+NG+KTA+VNY ASGWV H TD+WAK+S D G +WALW
Sbjct: 450 LPCNLSECQEPLFDFIGSLSVNGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWALW 509
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
PMGG WL THLWEHY+YTMD+ FLEK AYPLLEG ASFLLDWLIEG+ YLETNPSTSPE
Sbjct: 510 PMGGPWLATHLWEHYSYTMDKQFLEKTAYPLLEGSASFLLDWLIEGNGDYLETNPSTSPE 569
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
H FIAPDG+ ACVSYS+TMDM+IIREVFSA++ ++++L K++ +V+++ K++PRL P K
Sbjct: 570 HYFIAPDGRKACVSYSTTMDMSIIREVFSAVLMSSDILGKSDSDMVQRIKKAIPRLPPIK 629
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP 643
+A DG+IMEWAQDF+DPEVHHRH+SHLFGL+PGHT+++EK PDLCKA +L KRG+EGP
Sbjct: 630 VARDGTIMEWAQDFQDPEVHHRHVSHLFGLYPGHTMSLEKTPDLCKAVANSLYKRGDEGP 689
Query: 644 GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANF 703
GWS +WK ALWA LH+ EHAY+M+ +L LVDP+HE EGGLY NLF AHPPFQIDANF
Sbjct: 690 GWSTSWKMALWAHLHNSEHAYKMILQLITLVDPKHEVEKEGGLYCNLFTAHPPFQIDANF 749
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
GF AA++EMLVQST +DLYLLPALP DKW GCVKGLKARGG T++I W++G LHE ++
Sbjct: 750 GFPAALSEMLVQSTGSDLYLLPALPRDKWPQGCVKGLKARGGVTINIRWEEGSLHEALLW 809
Query: 764 SNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKC 802
S+ S N S LHY +++S ++Y F++ LKC
Sbjct: 810 SSSSQN---SRIKLHYGDQVGTISVSPCQVYRFSKDLKC 845
>gi|218184333|gb|EEC66760.1| hypothetical protein OsI_33136 [Oryza sativa Indica Group]
Length = 851
Score = 1028 bits (2658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 497/819 (60%), Positives = 619/819 (75%), Gaps = 35/819 (4%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL++ F P+++FTDA PIGNG LGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 34 PLEVVFASPSRYFTDAAPIGNGSLGALVWGGVASEKLQLNHDTLWTGGPGNYTNPKAPAV 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
LS VR LV+ GQYA+ATA + L G VYQ LGDI+L FD+ + E+T Y+R LDL
Sbjct: 94 LSKVRDLVNRGQYAKATAVAYGLSGDQTQVYQPLGDIDLAFDE----HVEDTNYKRNLDL 149
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TAT V Y++G V +REHFSSNP QVIVTKIS + G++SF VSL + L++ V
Sbjct: 150 RTATVNVSYTIGGVVHSREHFSSNPHQVIVTKISADKPGNVSFTVSLTTPLNHQIRVTNA 209
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N+IIMEG CPG+R NA+D P GI+FSAIL +++S GT+ L DK LK+ G+D A
Sbjct: 210 NEIIMEGYCPGERPTEYGNASDHPVGIKFSAILYLQMSGSNGTVEILNDKMLKLVGADSA 269
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
VLLL AS+SF+GPF+NPS+SK DPT+ +++ L RN+ YS L H+DDYQ LF RVS+
Sbjct: 270 VLLLAASTSFEGPFVNPSESKLDPTASALTTLTVARNMPYSQLKAYHVDDYQNLFQRVSL 329
Query: 311 QLSRS-------------PKDIVTDT-----------CSEE---NIDTVPSAERVKSFQT 343
QLS+ P++ + +T CS N P+ +R+ SF+
Sbjct: 330 QLSQDSNDALGGNGLVNLPENSLQETSVSDYAVQMVECSRFQGFNNSGKPTVDRILSFRD 389
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIWN++ SP WD+APH NINL+MNYW +
Sbjct: 390 DEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWNDETSPPWDAAPHPNINLQMNYWPA 449
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
LPCNLSECQEPLFDF+ LS+NG+KTA+VNY ASGWV H TD+WAK+S D G +WALW
Sbjct: 450 LPCNLSECQEPLFDFIGSLSVNGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWALW 509
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
PMGG WL THLWEHY+YTMD+ FLEK AYPLLEG ASFLLDWLIEG+ YLETNPSTSPE
Sbjct: 510 PMGGPWLATHLWEHYSYTMDKQFLEKTAYPLLEGSASFLLDWLIEGNGDYLETNPSTSPE 569
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
H FIAPDG+ ACVSYS+TMDM+IIREVFSA++ ++++L K++ +V+++ K++PRL P K
Sbjct: 570 HYFIAPDGRKACVSYSTTMDMSIIREVFSAVLMSSDILGKSDSDMVQRIKKAIPRLPPIK 629
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP 643
+A DG+IMEWAQDF+DPEVHHRH+SHLFGL+PGHT+++EK PDLCKA +L KRG+EGP
Sbjct: 630 VARDGTIMEWAQDFQDPEVHHRHVSHLFGLYPGHTMSLEKTPDLCKAVANSLYKRGDEGP 689
Query: 644 GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANF 703
GWS +WK ALWA LH+ EHAY+M+ +L LVDP+HE EGGLY NLF AHPPFQIDANF
Sbjct: 690 GWSTSWKMALWAHLHNSEHAYKMILQLITLVDPKHEVEKEGGLYCNLFTAHPPFQIDANF 749
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
GF AA++EMLVQST +DLYLLPALP DKW GCVKGLKARGG T++I W++G LHE ++
Sbjct: 750 GFPAALSEMLVQSTGSDLYLLPALPRDKWPQGCVKGLKARGGVTINIRWEEGSLHEALLW 809
Query: 764 SNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKC 802
S+ S N S LHY +++S ++Y F++ LKC
Sbjct: 810 SSSSQN---SRIKLHYGDQVGTISVSPCQVYRFSKDLKC 845
>gi|326508462|dbj|BAJ95753.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 857
Score = 1013 bits (2619), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 489/818 (59%), Positives = 605/818 (73%), Gaps = 33/818 (4%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PA++FTDA PIGNGRLGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 40 PLKVVFASPARYFTDAAPIGNGRLGALVWGGVTSEKLQLNHDTLWTGGPGNYTNPKAPTV 99
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS+VRSLVD G Y EATA + L G YQ LGDI+L F + H+KY Y R LDL
Sbjct: 100 LSEVRSLVDKGLYPEATAVAYGLSGDETQSYQPLGDIDLAFGE-HIKYTN--YTRYLDLE 156
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
+AT V YSVG V ++REHFSSNP QVI TKIS ++ G++S VSL + LD+ V N
Sbjct: 157 SATVNVTYSVGEVVYSREHFSSNPHQVIATKISANKPGAVSCTVSLATPLDHRIRVTDAN 216
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG++ NA+D P G++F AIL + +S G + L DK LK++G+D AV
Sbjct: 217 EIIMEGSCPGEKPAGDGNASDHPPGMRFCAILYLLMSGANGQVQVLNDKMLKLDGADSAV 276
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF+GPF+ P++S DP + + + L R++SY+ L H+DDYQ LF RVS+Q
Sbjct: 277 LLLAAATSFEGPFVKPTESTLDPVASAFTTLNMARSMSYAQLKAYHMDDYQSLFQRVSLQ 336
Query: 312 LSRS-------------PKDIVTDT----CSEENIDTV----------PSAERVKSFQTD 344
LSRS P++I DT C+ + +D P+ +R+ SF+ D
Sbjct: 337 LSRSSNDVLGGSTLARLPENISQDTAVSDCTVQMVDCSRLNELNNSEKPTVDRIISFRHD 396
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
EDPSLVELLFQFGRYLLIS SRPGTQV+NLQGIWN + + W +APH NINL+MNYW SL
Sbjct: 397 EDPSLVELLFQFGRYLLISCSRPGTQVSNLQGIWNNETNAPWGAAPHPNINLQMNYWPSL 456
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
PCNLSECQ+PLFDF+ LS+NG+KTA+VNY SGWV H TD+WAK+S D G WALWP
Sbjct: 457 PCNLSECQDPLFDFIGSLSVNGAKTAKVNYGVSGWVSHQVTDLWAKTSPDAGDPSWALWP 516
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
MGG WL THLWEHY++TMDR+FLE+ AYPLLEG ASFLL WLIEG +GYLETNPSTSPEH
Sbjct: 517 MGGPWLATHLWEHYSFTMDREFLERTAYPLLEGSASFLLSWLIEGQEGYLETNPSTSPEH 576
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
FIAPDGK A VSYS+TMDM+IIREVFSA++ +A++L K+ +V+++ +LPRL P KI
Sbjct: 577 YFIAPDGKRASVSYSTTMDMSIIREVFSAVLLSADILGKSSTDVVQRIKAALPRLPPIKI 636
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
DG+IMEWA+DF+D E HHRH+SHLFGL+PGHT+T+E+ PDLCKA TL KRG++GPG
Sbjct: 637 GRDGTIMEWARDFQDAEPHHRHVSHLFGLYPGHTMTLEQTPDLCKAVANTLYKRGDKGPG 696
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFG 704
WS +WK ALWA LH+ EHAY+M+ +L L+DP HE+ EGGLYSNLF AHPPFQIDANFG
Sbjct: 697 WSTSWKMALWAHLHNSEHAYKMILQLITLIDPNHERDKEGGLYSNLFTAHPPFQIDANFG 756
Query: 705 FTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
F AA+ EMLVQST +DLYLLPALP +KW G VKGL+ARGG TV+ICWK+G LHE ++S
Sbjct: 757 FPAALCEMLVQSTGSDLYLLPALPRNKWPHGSVKGLRARGGVTVNICWKEGSLHEALVWS 816
Query: 765 NYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKC 802
S N S +HY S ++ S G++Y FN +LKC
Sbjct: 817 GSSGN---SLARVHYGDRSAMISTSPGQVYRFNSELKC 851
>gi|218197301|gb|EEC79728.1| hypothetical protein OsI_21058 [Oryza sativa Indica Group]
Length = 815
Score = 1007 bits (2604), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 477/801 (59%), Positives = 615/801 (76%), Gaps = 11/801 (1%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F+ PA+HFTDA PIGNG LGAMVWG V SE L+LN DTLWTGVPG+YT+P+AP A
Sbjct: 21 PLKVVFDSPAEHFTDAAPIGNGSLGAMVWGSVASEKLQLNHDTLWTGVPGNYTDPNAPYA 80
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L+ VR LVD ++ +AT A+ LFG P +VYQ LGDI LEFD S L Y +Y+RELDL
Sbjct: 81 LAVVRKLVDGEKFVDATEAASGLFGGPTEVYQPLGDINLEFDSSSLGYT--SYKRELDLR 138
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT + Y++G V+++REHF SNP QV TKIS ++SG +SF +SL+S L+++ + N
Sbjct: 139 TATVCISYNIGEVQYSREHFCSNPHQVFATKISANKSGHVSFTLSLNSQLNHNVRITNAN 198
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++IM+G CPG+R N +D GI+F+ + ++I ++ ++D+KL+++ +DW V
Sbjct: 199 EMIMQGTCPGRRPALHHNGANDAIGIKFATAVGLQIGGTSAKVTIIDDQKLRIDAADWVV 258
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LL+ A+SSFDGPF+NPS+SK +P +++ L RN ++S L HL+DYQ LFHRV++Q
Sbjct: 259 LLVAAASSFDGPFVNPSESKLNPEVAALNTLNISRNATFSQLKAAHLEDYQGLFHRVTLQ 318
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS++ + D E + D +AER+ SF++DEDPSLVELLFQ+GRYLLISSSRPGTQV
Sbjct: 319 LSQASM-LEKDILEEVDHDVKTTAERINSFRSDEDPSLVELLFQYGRYLLISSSRPGTQV 377
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
+NLQGIWN+D +P W+++PH+NINLEMNYW +LPCNLSECQEPLFD + L++NG+KTA+
Sbjct: 378 SNLQGIWNQDFAPAWEASPHLNINLEMNYWPTLPCNLSECQEPLFDLIGSLAVNGTKTAK 437
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
VNY ASGWV HH TDIWAKSSA ++ALWPMGGAWLCTHLWE+Y Y++D++FLEKRA
Sbjct: 438 VNYQASGWVTHHVTDIWAKSSAYYVDAMYALWPMGGAWLCTHLWENYQYSLDKEFLEKRA 497
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIRE 549
YPLLEGCA FL+DWLI+G YLETNPSTSPEH FIAP G LA VSYS+TMD++IIRE
Sbjct: 498 YPLLEGCAMFLIDWLIKGPGDYLETNPSTSPEHPFIAPGTGGHLASVSYSTTMDISIIRE 557
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
VF A+IS+AEVL K++ LVE++ K+LP L P KI++DG+IMEWAQDF+DPEVHHRHLSH
Sbjct: 558 VFLAVISSAEVLGKSDTNLVERIKKALPMLPPVKISKDGTIMEWAQDFEDPEVHHRHLSH 617
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
LFGL+PGHTIT++KNP++CKA +L KRGE+GPGWS TWK ALWARL + E+AYRM+ +
Sbjct: 618 LFGLYPGHTITMQKNPEVCKAVANSLHKRGEDGPGWSTTWKMALWARLLNSENAYRMILK 677
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN--DLYLLPAL 727
L LV P + FEGGLY+NL+ AHPPFQIDANFGFTAA+AEML+QST DLYLLPAL
Sbjct: 678 LITLVPPGGKVDFEGGLYTNLWTAHPPFQIDANFGFTAAIAEMLLQSTHGDADLYLLPAL 737
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVN 787
P +KW G VKGL+ARG TV+I W+ G+L E + +S+N + + LHY V
Sbjct: 738 PREKWPKGYVKGLRARGNVTVNISWEKGELQEATV---WSSNPKCTLR-LHYGEQVAMVT 793
Query: 788 LSAGKIYTFNRQLKCTNLHQS 808
+ G +Y FN L+C + +
Sbjct: 794 VLGGNVYRFNGGLQCVETYMA 814
>gi|110288916|gb|ABG66022.1| large secreted protein, putative, expressed [Oryza sativa Japonica
Group]
gi|222612642|gb|EEE50774.1| hypothetical protein OsJ_31132 [Oryza sativa Japonica Group]
Length = 815
Score = 1006 bits (2601), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 476/801 (59%), Positives = 615/801 (76%), Gaps = 11/801 (1%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F+ PA+HFTDA PIGNG LGAMVWG V SE L+LN DTLWTGVPG+YT+P+AP A
Sbjct: 21 PLKVVFDSPAEHFTDAAPIGNGSLGAMVWGSVASEKLQLNHDTLWTGVPGNYTDPNAPYA 80
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L+ VR LVD ++ +AT A+ LFG P +VYQ LGDI LEFD S L Y +Y+RELDL
Sbjct: 81 LAVVRKLVDGEKFVDATEAASGLFGGPTEVYQPLGDINLEFDSSSLGYT--SYKRELDLR 138
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT + Y++G V+++REHF SNP QV TKIS ++SG +SF +SL+S L+++ + N
Sbjct: 139 TATVCISYNIGEVQYSREHFCSNPHQVFATKISANKSGHVSFTLSLNSQLNHNVRITNAN 198
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++IM+G CPG+R N +D GI+F+ + ++I ++ ++D+KL+++ +DW V
Sbjct: 199 EMIMQGTCPGRRPALHHNGANDAIGIKFATAVGLQIGGTSAKVTIIDDQKLRIDAADWVV 258
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LL+ A+SSFDGPF+NPS+SK +P +++ L RN ++S L HL+DYQ LFHRV++Q
Sbjct: 259 LLVAAASSFDGPFVNPSESKLNPEVAALNTLNISRNATFSQLKAAHLEDYQGLFHRVTLQ 318
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS++ + D E + D +AER+ SF++DEDPSLVELLFQ+GRYLLISSSRPGTQV
Sbjct: 319 LSQASM-LEKDILEEVDHDVKTTAERINSFRSDEDPSLVELLFQYGRYLLISSSRPGTQV 377
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
+NLQGIWN+D +P W+++PH+NINLEMNYW +LPCNL+ECQEPLFD + L++NG+KTA+
Sbjct: 378 SNLQGIWNQDFAPAWEASPHLNINLEMNYWPTLPCNLTECQEPLFDLIGSLAVNGTKTAK 437
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
VNY ASGWV HH TDIWAKSSA ++ALWPMGGAWLCTHLWE+Y Y++D++FLEKRA
Sbjct: 438 VNYQASGWVTHHVTDIWAKSSAYYVDAMYALWPMGGAWLCTHLWENYQYSLDKEFLEKRA 497
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIRE 549
YPLLEGCA FL+DWLI+G YLETNPSTSPEH FIAP G LA VSYS+TMD++IIRE
Sbjct: 498 YPLLEGCAMFLIDWLIKGPGDYLETNPSTSPEHPFIAPGTGGHLASVSYSTTMDISIIRE 557
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
VF A+IS+AEVL K++ LVE++ K+LP L P KI++DG+IMEWAQDF+DPEVHHRHLSH
Sbjct: 558 VFLAVISSAEVLGKSDTNLVERIKKALPMLPPVKISKDGTIMEWAQDFEDPEVHHRHLSH 617
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
LFGL+PGHTIT++KNP++CKA +L KRGE+GPGWS TWK ALWARL + E+AYRM+ +
Sbjct: 618 LFGLYPGHTITMQKNPEVCKAVANSLHKRGEDGPGWSTTWKMALWARLLNSENAYRMILK 677
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN--DLYLLPAL 727
L LV P + FEGGLY+NL+ AHPPFQIDANFGFTAA+AEML+QST DLYLLPAL
Sbjct: 678 LITLVPPGGKVDFEGGLYTNLWTAHPPFQIDANFGFTAAIAEMLLQSTHGDADLYLLPAL 737
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVN 787
P +KW G VKGL+ARG TV+I W+ G+L E + +S+N + + LHY V
Sbjct: 738 PREKWPKGYVKGLRARGNVTVNISWEKGELQEATV---WSSNPKCTLR-LHYGEQVAMVT 793
Query: 788 LSAGKIYTFNRQLKCTNLHQS 808
+ G +Y FN L+C + +
Sbjct: 794 VLGGNVYRFNGGLQCVETYMA 814
>gi|326518094|dbj|BAK07299.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 832
Score = 999 bits (2582), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 478/796 (60%), Positives = 607/796 (76%), Gaps = 10/796 (1%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F+ PA++FTDA PIGNG LGAMVWGGV S+ L+LN DTLWTGVPG+YT+P AP
Sbjct: 34 PLKVAFSSPAEYFTDAAPIGNGSLGAMVWGGVSSDKLQLNHDTLWTGVPGNYTDPKAPGV 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L++VR LVD G++A+ATA++ LFG ++VYQ LG++ +EF S Y ++Y+RELDL+
Sbjct: 94 LAEVRGLVDQGRFADATASAKGLFGGLSEVYQPLGELNIEFSTSEQVY--DSYKRELDLH 151
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TATA V Y++G V++TREHF SNP Q IVT+ S S G +S +SL S L++ V N
Sbjct: 152 TATALVTYNIGGVQYTREHFCSNPHQAIVTRFSASTPGHVSCTLSLSSQLNHSVTVINEN 211
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++IMEG CPG+R + N D+ GI+F+A L +++ + L D+KL+++ +DW V
Sbjct: 212 EMIMEGICPGQRPGMRENGGDNVTGIRFTAALGLQMGGSAAKSTVLNDQKLRLDSADWVV 271
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
++ A+SSF GP +NP+DSK DPTS ++S L RN ++ L HLDDYQ LF+RV++Q
Sbjct: 272 FVVAAASSFYGPHVNPADSKLDPTSLALSMLNHSRNFTFDQLKAAHLDDYQSLFNRVTLQ 331
Query: 312 LSRSPKDI---VTDTCSEENI--DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
LS+ D VT T +E + D SA+RVKSF +DEDPSLVELLFQ+GRYLLIS SR
Sbjct: 332 LSQGSNDACTSVTRTDIQEQVAEDIRTSADRVKSFSSDEDPSLVELLFQYGRYLLISCSR 391
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQV+NLQGIW++D++P WD+APH+NINL+MNYW +LPCNLSECQEPLFDFL L++NG
Sbjct: 392 PGTQVSNLQGIWSQDIAPEWDAAPHLNINLQMNYWPALPCNLSECQEPLFDFLGSLAVNG 451
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+KTA+VNY A GWV HH +DIWAKSSA A+WPMGGAWLCTHLWEHY +++D+DF
Sbjct: 452 TKTAKVNYQAGGWVTHHVSDIWAKSSAFLKNPKHAVWPMGGAWLCTHLWEHYQFSLDKDF 511
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
LE AYPLLEGCA+FL+DWLIEG GYLETNPSTSPEH F+APDGK A VSYS+TMD++I
Sbjct: 512 LENTAYPLLEGCANFLVDWLIEGPGGYLETNPSTSPEHAFVAPDGKPASVSYSTTMDVSI 571
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
IREVF A++S+AE+L K + LVE++ K+LPRL P +IA D ++MEWA DFKDPEV HRH
Sbjct: 572 IREVFLAVLSSAELLGKADIDLVERIKKALPRLPPIQIARDRTVMEWALDFKDPEVQHRH 631
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
LSHLFGL+PGHTI+++ +P++C+A +L KRGE+GPGWS TWK ALWARL D E+AYRM
Sbjct: 632 LSHLFGLYPGHTISMDNDPEICEAVANSLYKRGEDGPGWSTTWKMALWARLLDSENAYRM 691
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
V +L LV P + FEGGLYSNL+ AHPPFQIDANFGF AA+AEML+QST +DLYLLPA
Sbjct: 692 VLKLITLVPPGGKVAFEGGLYSNLWTAHPPFQIDANFGFAAAIAEMLIQSTQSDLYLLPA 751
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKV 786
LP DKW SG VKGLKARG TV I WK+G+LHE + +S+N+ +S LHY +
Sbjct: 752 LPRDKWPSGSVKGLKARGDVTVDIRWKEGELHEAVL---WSSNNQNSVARLHYGKEVAAL 808
Query: 787 NLSAGKIYTFNRQLKC 802
L G Y F L+C
Sbjct: 809 TLRHGIFYKFGSGLRC 824
>gi|357116946|ref|XP_003560237.1| PREDICTED: alpha-L-fucosidase 2-like [Brachypodium distachyon]
Length = 818
Score = 998 bits (2581), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 474/794 (59%), Positives = 595/794 (74%), Gaps = 8/794 (1%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PA+HFTDA PIGNG LGAMVWGGV SE L+LN DTLWTGVPG+YT+P P A
Sbjct: 20 PLKVVFASPAEHFTDAAPIGNGSLGAMVWGGVASEKLQLNLDTLWTGVPGNYTDPSVPSA 79
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
++ VR LV Q+ +AT A+ L+G P +VYQ LGD+ +EF S Y+ +Y+RELDL+
Sbjct: 80 VAVVRKLVHDRQFVDATNAASGLYGGPTEVYQPLGDVNIEFGTSSQDYS--SYKRELDLH 137
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT V Y++G V++TREHF SNP QVIVTK+S ++SG +S +SLDS L + V N
Sbjct: 138 TATVLVTYNIGEVQYTREHFCSNPHQVIVTKLSANKSGHISCTLSLDSKLTHSVRVTNAN 197
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++IM+G CPG+R + N +D GI+F+A+L +++ L D L+++ +DW +
Sbjct: 198 EMIMDGTCPGQRHVLQQNETNDATGIKFTAVLSLQMGGAMAKAEVLNDHNLRIDNADWVL 257
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LL+ A+SSF GPFINPS+SK DP S ++ L RN+++ L HL DYQ LFHRVS+
Sbjct: 258 LLVTAASSFSGPFINPSNSKIDPESVALRNLNMSRNVTFDQLKAAHLKDYQGLFHRVSLI 317
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS +P I +E +AERV SF+++EDPSLVELLFQ+GRYLLIS SRPGTQV
Sbjct: 318 LSHAPA-IEKTNLNETGEAIKITAERVNSFRSNEDPSLVELLFQYGRYLLISCSRPGTQV 376
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
+NLQGIWN+DLSP W SAPH+NINL+MNYW +LPCNL ECQEPL DF+ L++NG+KTA+
Sbjct: 377 SNLQGIWNQDLSPAWQSAPHLNINLQMNYWPTLPCNLGECQEPLIDFIAALAVNGTKTAK 436
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
+NY SGWV HH +DIWAKSSA +A+WPMGGAWLCTHLWEHY Y++D++FL+ A
Sbjct: 437 INYQTSGWVTHHVSDIWAKSSAFNEDAKYAVWPMGGAWLCTHLWEHYQYSLDKEFLKNTA 496
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD--GKLACVSYSSTMDMAIIRE 549
YPLLEGCA FL DWL EG +GYLETNPS SPEH FIAPD G+ A VSYS+TMD++IIRE
Sbjct: 497 YPLLEGCALFLADWLTEGRNGYLETNPSISPEHSFIAPDSGGQQASVSYSTTMDVSIIRE 556
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
+F AIIS+AEVL K++ LV K+ K+L RL P IA+D +IMEWAQDF+DPEVHHRHLSH
Sbjct: 557 IFMAIISSAEVLGKSDSTLVPKIKKALSRLTPIMIAKDHTIMEWAQDFEDPEVHHRHLSH 616
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
LFGL+PGHTIT++KNP +C+A +L KRGE+GPGWS TWK ALWARL + ++AYRM+ +
Sbjct: 617 LFGLYPGHTITMQKNPGICEAVANSLYKRGEDGPGWSSTWKMALWARLLNSQNAYRMILK 676
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPW 729
L LV P + FEGGLYSNL+ AHPPFQIDANFGFTAAVAEML+QS+L DLYLLPALP
Sbjct: 677 LITLVPPGDDVQFEGGLYSNLWTAHPPFQIDANFGFTAAVAEMLLQSSLTDLYLLPALPR 736
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLS 789
DKW GCVKGL+ARG TV+ICW +L E + +SNN + S LHY + ++
Sbjct: 737 DKWPEGCVKGLRARGDTTVNICWGKQELQEAVL---WSNNRNSSVIRLHYGERVTEATVA 793
Query: 790 AGKIYTFNRQLKCT 803
AG +Y FN L+C
Sbjct: 794 AGIVYKFNGDLQCV 807
>gi|326513306|dbj|BAK06893.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 815
Score = 976 bits (2522), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 469/803 (58%), Positives = 594/803 (73%), Gaps = 10/803 (1%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+ PLK+ F PA+HFTDA PIGNG LGAMVWGGV S+ L+LN DTLWTGVPGDY
Sbjct: 12 ADEAEEERPLKVVFASPAEHFTDAAPIGNGSLGAMVWGGVASDKLQLNLDTLWTGVPGDY 71
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
T+P AP AL+ VR LVD G++ +AT+A+ LFG +VYQ LGD+ LEFD S+ +Y+ +
Sbjct: 72 TDPKAPAALAAVRKLVDDGRFVDATSAASGLFGGQTEVYQPLGDMNLEFDISNQEYS--S 129
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y+RELDL+TAT + Y++G V+ TREHF SNP QVIVTKIS ++S +S +SL+S L++
Sbjct: 130 YKRELDLHTATTVITYNIGEVQHTREHFCSNPHQVIVTKISANKSEHVSLTLSLNSKLNH 189
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
V N++IMEG CP R+ N D GI F+A+L +++S + L D+KL+
Sbjct: 190 RVRVMNANEMIMEGSCPVHRL--HENEASDASGIGFAAVLSLQMSGAAAKVVVLNDQKLR 247
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
++ +DW +L + A+SSF+GP +NPSDSK DP S ++ A+ RNL++ L HL DYQ
Sbjct: 248 IDNADWVLLRVTAASSFNGPSVNPSDSKLDPESAALRAMNMSRNLTFDQLKASHLKDYQG 307
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LFHRVS++LS+SP I E +AERV F++DED SLVELLFQ+GRYLLIS
Sbjct: 308 LFHRVSLRLSQSPA-IEKINMKEVGEAIKTTAERVNGFRSDEDSSLVELLFQYGRYLLIS 366
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SRPGTQ++NLQGIWN+DL P W+ APH+NINL+MNYW +LPCNL ECQEPL DF+ L+
Sbjct: 367 CSRPGTQISNLQGIWNQDLLPQWECAPHLNINLQMNYWPTLPCNLIECQEPLLDFIASLA 426
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+NG+KTA++NY ASGWV HH TDIWAKSSA +++WPMGGAWLCTHLWEHY Y +D
Sbjct: 427 VNGTKTAKINYQASGWVTHHVTDIWAKSSAFNEDAKYSVWPMGGAWLCTHLWEHYQYLLD 486
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG--KLACVSYSST 541
+DFL+ AYPLLEGCA FL DWLIEG G LETNPSTSPEH FIAP A VSYS+T
Sbjct: 487 KDFLKNTAYPLLEGCALFLTDWLIEGPRGLLETNPSTSPEHAFIAPGSGDHQASVSYSTT 546
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MD+AIIRE+FSA+IS+AE+L K++ LV+K+ ++LPRL IA+D +++EWAQDFKDPE
Sbjct: 547 MDIAIIREIFSAVISSAEILGKSDTPLVQKIKEALPRLPQNTIAKDQTLVEWAQDFKDPE 606
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRHLSHLFGL+PGHTIT++ NP++C+A +L KRGE+GPGWS TWK ALWARL + E
Sbjct: 607 PSHRHLSHLFGLYPGHTITMQGNPEICEAISNSLHKRGEDGPGWSSTWKMALWARLLNSE 666
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
+AYRM+ +L LV P FEGGLY+NL+ AHPPFQID NFGFTAA+AEML+QST D+
Sbjct: 667 NAYRMILKLITLVPPGDTIKFEGGLYTNLWTAHPPFQIDGNFGFTAAIAEMLLQSTPTDV 726
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRG 781
YLLPALP DKW GCVKGL+ARG T++I W+ G+L E ++ N NN S LHY G
Sbjct: 727 YLLPALPRDKWPDGCVKGLRARGDTTINIFWEKGELQEAVLWFNNRNN---SVLWLHYGG 783
Query: 782 TSVKVNLSAGKIYTFNRQLKCTN 804
+ AG +Y FN L+C +
Sbjct: 784 QDAVATVEAGNVYRFNGVLQCVD 806
>gi|357479527|ref|XP_003610049.1| Macrophage migration inhibitory factor-like protein [Medicago
truncatula]
gi|355511104|gb|AES92246.1| Macrophage migration inhibitory factor-like protein [Medicago
truncatula]
Length = 855
Score = 971 bits (2509), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 460/704 (65%), Positives = 564/704 (80%), Gaps = 30/704 (4%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ NA+ + PLK+TF+ AK++TDAIPIGNGRLGAM+WGG+ SE L+LNEDTLWTG+P
Sbjct: 22 LANADDDEPSMPLKVTFSRSAKYWTDAIPIGNGRLGAMIWGGIQSEVLQLNEDTLWTGIP 81
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
G+YT+ +AP+AL++VR LVD +Y+EAT A++KL G P +VYQLLGDIEL+FDDSHLKY+
Sbjct: 82 GNYTDKNAPEALAEVRKLVDDRKYSEATTAALKLLGPPGEVYQLLGDIELQFDDSHLKYS 141
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
EE+Y RELDL+ AT HF+SNPDQV+VTK S S SGSLSF VSLDS
Sbjct: 142 EESYHRELDLDNAT---------------HFASNPDQVLVTKFSTSNSGSLSFTVSLDSK 186
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
L +++ ++ NQIIMEG CPGKRIPP+ N++D+PKGIQFSA+L+++IS+++G I L+DK
Sbjct: 187 LHHNTRLSSKNQIIMEGSCPGKRIPPQVNSSDEPKGIQFSAVLDVQISNEKGVIHVLDDK 246
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
KL+VEGSDWA+LLL ASSSFDGPF NP +SKKD TSES+S ++ + +L Y D+Y RHLDD
Sbjct: 247 KLRVEGSDWAILLLTASSSFDGPFTNPENSKKDLTSESLSKMKFVTSLKYDDIYARHLDD 306
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEE--------NI------DTVPSAERVKSFQTDED 346
YQ LFHRVS+QLS+S K ++ +E NI D VP++ R+KSFQ DED
Sbjct: 307 YQNLFHRVSLQLSKSSKTVLGKPILDEGKMVSCQTNISQLRGGDIVPTSSRIKSFQNDED 366
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
PS VELLFQ+GRYLLI+ SRPGTQVANLQGIWN+D+ P WD APH+NINL+MNYW SL C
Sbjct: 367 PSFVELLFQYGRYLLIACSRPGTQVANLQGIWNKDVVPKWDGAPHLNINLQMNYWPSLSC 426
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
NL ECQEPLFD ++ LS+NGSKTA+VNY A+GWV HH +D+WAK+S RG VWALWPMG
Sbjct: 427 NLHECQEPLFDCISSLSVNGSKTAKVNYDANGWVAHHVSDLWAKTSTYRGPAVWALWPMG 486
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEF 526
GAWLCTHLWEHY YT D++FL+ +AYPLLEGC SFLLDWLIEG G LETNPSTSPEH F
Sbjct: 487 GAWLCTHLWEHYTYTTDKEFLKNKAYPLLEGCTSFLLDWLIEGPGGLLETNPSTSPEHMF 546
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
IA D K A VSYSSTMD++II+EVFS +ISAAE+L + +DA++++V +S +L P KIA
Sbjct: 547 IASDQKRASVSYSSTMDISIIKEVFSIVISAAEILGRQDDAIIKRVFESQSKLPPIKIAR 606
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
DGSIMEWA+DF+DP+VHH H+SHLFGLFPGHTI IEK P+LCKA +L KRG+EGPGWS
Sbjct: 607 DGSIMEWAEDFQDPDVHHWHVSHLFGLFPGHTINIEKTPNLCKAVNYSLIKRGDEGPGWS 666
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEK-HFEGGLYSN 689
TWK ALWARLH+ EHAYRM+K L L DPE E FEGGL+S+
Sbjct: 667 TTWKAALWARLHNSEHAYRMIKHLVVLADPEQEAVGFEGGLHSH 710
>gi|242047972|ref|XP_002461732.1| hypothetical protein SORBIDRAFT_02g007180 [Sorghum bicolor]
gi|241925109|gb|EER98253.1| hypothetical protein SORBIDRAFT_02g007180 [Sorghum bicolor]
Length = 864
Score = 944 bits (2441), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 467/787 (59%), Positives = 583/787 (74%), Gaps = 37/787 (4%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL + F PA++FTDA PIGNG LG MVWGGV ++ L+LN DTLWTG PG YT+PDAP A
Sbjct: 47 PLTVVFASPAENFTDAAPIGNGSLGGMVWGGVATDKLQLNHDTLWTGAPGSYTDPDAPAA 106
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF--DDSHLKYAEETYRRELD 129
L+ VR LVD G++A+ATAA+ +LFG ++VYQ +GD+ LE S + A ++Y+RELD
Sbjct: 107 LAAVRELVDQGRFADATAAATRLFGGQSEVYQPMGDVNLELGGSGSDQQPAYDSYKRELD 166
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TAT V YSVG V++TREHF SNP QVI+T+I+ SE G +S +SL S L N V
Sbjct: 167 LHTATVLVTYSVGPVQYTREHFCSNPHQVIITRIAASEPGHVSCTLSLSSQLKNTVTVTN 226
Query: 190 NNQIIMEGRCPG-------------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
NQ++MEG CP + + GI+F+A+L +++ D+ +
Sbjct: 227 ANQVVMEGVCPRQRPPAPPRLMLLRNSSSGDDDDDLTTGGIKFAAVLGVQMGGDKAKAAV 286
Query: 237 LEDK-KLKVEGSDWAVLLLVASSSFDGPFINPSDSK-KDPTSESMSALQSIRNLSYSDLY 294
L D+ KL +E +DW VL++ ASSSFDGPF++PSDS+ DPTS +++ L +L+Y L
Sbjct: 287 LNDENKLSLESADWIVLIVAASSSFDGPFVSPSDSRLDDPTSAAVATLNRATSLTYEQLK 346
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTD-------------------TCSEENIDTVPSA 335
HLDDYQ+LFHRV+++LS ++ D +E I SA
Sbjct: 347 AAHLDDYQRLFHRVTLRLSPPGGGLLEDARGGGLMMTGGKETMLKRGVGGDEGIIRT-SA 405
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
+RVKSF TDEDPSLVELLFQ+GRYLLIS SRPGTQV+NLQGIWN++++P WD+APH+NIN
Sbjct: 406 DRVKSFATDEDPSLVELLFQYGRYLLISCSRPGTQVSNLQGIWNQEVAPAWDAAPHLNIN 465
Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADR 455
L+MNYW +LPCNLSECQEPLFDFL L++NG+KTA+VNY A GWV HH +DIWAKSSA
Sbjct: 466 LQMNYWPTLPCNLSECQEPLFDFLQSLAVNGTKTAKVNYQARGWVTHHVSDIWAKSSAFI 525
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
A+WPMGGAWLCTHLWEHY Y++D+DFLE AYPLLEGCA+FL+DWLIEG G+L+
Sbjct: 526 KNPKHAVWPMGGAWLCTHLWEHYQYSLDKDFLEYTAYPLLEGCATFLVDWLIEGPGGFLQ 585
Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
TNPSTSPEH F APDGK A VSYS+TMD++IIREV SA++ +AE+LEK++ LVEK+ K+
Sbjct: 586 TNPSTSPEHAFTAPDGKPASVSYSTTMDISIIREVSSAVLLSAEILEKSDTDLVEKIKKA 645
Query: 576 LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
LPRL P + A D +IMEWA DF+DPEVHHRHLSHLFGL+PGHTIT+E NPD+C A +L
Sbjct: 646 LPRLPPIQFARDNTIMEWALDFQDPEVHHRHLSHLFGLYPGHTITMENNPDVCGAVSNSL 705
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
KRGE+GPGWS TWK ALWARL + E+AYRMV +L LV P + FEGGLY+NL+ AHP
Sbjct: 706 YKRGEDGPGWSTTWKMALWARLMNSENAYRMVLKLITLVPPGEKVQFEGGLYNNLWTAHP 765
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQIDANFGFTAA+AEMLVQST DLYLLPALP DKW GC KGL+ARG TV+ICW +G
Sbjct: 766 PFQIDANFGFTAAIAEMLVQSTQTDLYLLPALPRDKWPRGCAKGLRARGDVTVNICWDEG 825
Query: 756 DLHEVGI 762
+L E +
Sbjct: 826 ELQEAMV 832
>gi|110288917|gb|ABG66023.1| large secreted protein, putative, expressed [Oryza sativa Japonica
Group]
Length = 708
Score = 887 bits (2291), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 419/712 (58%), Positives = 545/712 (76%), Gaps = 11/712 (1%)
Query: 101 VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIV 160
VYQ LGDI LEFD S L Y +Y+RELDL TAT + Y++G V+++REHF SNP QV
Sbjct: 3 VYQPLGDINLEFDSSSLGYT--SYKRELDLRTATVCISYNIGEVQYSREHFCSNPHQVFA 60
Query: 161 TKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFS 220
TKIS ++SG +SF +SL+S L+++ + N++IM+G CPG+R N +D GI+F+
Sbjct: 61 TKISANKSGHVSFTLSLNSQLNHNVRITNANEMIMQGTCPGRRPALHHNGANDAIGIKFA 120
Query: 221 AILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
+ ++I ++ ++D+KL+++ +DW VLL+ A+SSFDGPF+NPS+SK +P +++
Sbjct: 121 TAVGLQIGGTSAKVTIIDDQKLRIDAADWVVLLVAAASSFDGPFVNPSESKLNPEVAALN 180
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
L RN ++S L HL+DYQ LFHRV++QLS++ + D E + D +AER+ S
Sbjct: 181 TLNISRNATFSQLKAAHLEDYQGLFHRVTLQLSQASM-LEKDILEEVDHDVKTTAERINS 239
Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
F++DEDPSLVELLFQ+GRYLLISSSRPGTQV+NLQGIWN+D +P W+++PH+NINLEMNY
Sbjct: 240 FRSDEDPSLVELLFQYGRYLLISSSRPGTQVSNLQGIWNQDFAPAWEASPHLNINLEMNY 299
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W +LPCNL+ECQEPLFD + L++NG+KTA+VNY ASGWV HH TDIWAKSSA ++
Sbjct: 300 WPTLPCNLTECQEPLFDLIGSLAVNGTKTAKVNYQASGWVTHHVTDIWAKSSAYYVDAMY 359
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
ALWPMGGAWLCTHLWE+Y Y++D++FLEKRAYPLLEGCA FL+DWLI+G YLETNPST
Sbjct: 360 ALWPMGGAWLCTHLWENYQYSLDKEFLEKRAYPLLEGCAMFLIDWLIKGPGDYLETNPST 419
Query: 521 SPEHEFIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
SPEH FIAP G LA VSYS+TMD++IIREVF A+IS+AEVL K++ LVE++ K+LP
Sbjct: 420 SPEHPFIAPGTGGHLASVSYSTTMDISIIREVFLAVISSAEVLGKSDTNLVERIKKALPM 479
Query: 579 LRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
L P KI++DG+IMEWAQDF+DPEVHHRHLSHLFGL+PGHTIT++KNP++CKA +L KR
Sbjct: 480 LPPVKISKDGTIMEWAQDFEDPEVHHRHLSHLFGLYPGHTITMQKNPEVCKAVANSLHKR 539
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
GE+GPGWS TWK ALWARL + E+AYRM+ +L LV P + FEGGLY+NL+ AHPPFQ
Sbjct: 540 GEDGPGWSTTWKMALWARLLNSENAYRMILKLITLVPPGGKVDFEGGLYTNLWTAHPPFQ 599
Query: 699 IDANFGFTAAVAEMLVQSTLN--DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
IDANFGFTAA+AEML+QST DLYLLPALP +KW G VKGL+ARG TV+I W+ G+
Sbjct: 600 IDANFGFTAAIAEMLLQSTHGDADLYLLPALPREKWPKGYVKGLRARGNVTVNISWEKGE 659
Query: 757 LHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQS 808
L E + +S+N + + LHY V + G +Y FN L+C + +
Sbjct: 660 LQEATV---WSSNPKCTLR-LHYGEQVAMVTVLGGNVYRFNGGLQCVETYMA 707
>gi|15451592|gb|AAK98716.1|AC090483_6 Hypothetical protein [Oryza sativa Japonica Group]
Length = 872
Score = 869 bits (2246), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 454/855 (53%), Positives = 574/855 (67%), Gaps = 86/855 (10%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL++ F P+++FTDA PIGNG LGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 34 PLEVVFASPSRYFTDAAPIGNGSLGALVWGGVASEKLQLNHDTLWTGGPGNYTNPKAPAV 93
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
LS VR LV+ GQYA+ATA + L G VYQ LGDI+L FD+ + E+T Y+R LDL
Sbjct: 94 LSKVRDLVNRGQYAKATAVAYGLSGDQTQVYQPLGDIDLAFDE----HVEDTNYKRNLDL 149
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TAT V Y++G V +REHFSSNP QVIVTKIS + G++SF VSL + L++ V
Sbjct: 150 RTATVNVSYTIGEVVHSREHFSSNPHQVIVTKISADKPGNVSFTVSLTTPLNHQIRVTNA 209
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N+IIMEG CPG+R NA+D P GI+FSAIL +++S GT+ L DK LK+ G+D A
Sbjct: 210 NEIIMEGYCPGERPTEYGNASDHPVGIKFSAILYLQMSGSNGTVEILNDKMLKLVGADSA 269
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
VLLL A++SF+GPF+NPS+SK DPT+ +++ L RN+SYS L H+DDYQ LF RVS+
Sbjct: 270 VLLLAAATSFEGPFVNPSESKLDPTASALTTLTVARNMSYSQLKAYHVDDYQNLFQRVSL 329
Query: 311 QLSRS-------------PKDIVTDT-----------CSEE---NIDTVPSAERVKSFQT 343
QLSR P++ + +T CS N P+ +R+ SF+
Sbjct: 330 QLSRDSNDALGGNGLVNLPENSLQETSVSDYAVQMVECSRFQGFNNSGKPTVDRILSFRD 389
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
DEDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIWN++ SP WD+APH NINL+MNYW +
Sbjct: 390 DEDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWNDETSPPWDAAPHPNINLQMNYWPA 449
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
LPCNLSECQEPLFDF+ LS+NG+KTA+VNY ASGWV H TD+WAK+S D G +WALW
Sbjct: 450 LPCNLSECQEPLFDFIGSLSVNGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPMWALW 509
Query: 464 PMGGAWLCTHLWEHYNYTMD--------------------RDFLEKRAYPLLEGCASFLL 503
PMGG WL THLWEHY+YTMD + FLEK AYPLLEG ASFLL
Sbjct: 510 PMGGPWLATHLWEHYSYTMDKKENVFRPNKVDMIVLKDAKKQFLEKTAYPLLEGSASFLL 569
Query: 504 DWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK 563
DWLIEG+ YLETNPSTSPEH FIAPDG+ ACVSYS+TMDM+IIREVFSA++ ++++L K
Sbjct: 570 DWLIEGNGDYLETNPSTSPEHYFIAPDGRKACVSYSTTMDMSIIREVFSAVLMSSDILGK 629
Query: 564 NEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDPEVHHRHLSHLFGLFPGHTI 619
++ +V+++ K++PRL P K+A DG+IMEW + D R L ++ +
Sbjct: 630 SDSDMVQRIKKAIPRLPPIKVARDGTIMEWLFSECLLYVDRHRIFRILKFTTDMYLTCLV 689
Query: 620 TIE------------KNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
I+ P + ++ ++ G PG W +
Sbjct: 690 FIQDILCHLRKHLTFAKPLQIVSIKEVMKVLGGPLPG---RWPFG------------PIF 734
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
L LVDP+HE EGGLY NLF AHPPFQIDANFGF AA++EMLVQST +DLYLLPAL
Sbjct: 735 ITLITLVDPKHEVEKEGGLYCNLFTAHPPFQIDANFGFPAALSEMLVQSTGSDLYLLPAL 794
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVN 787
P DKW GCVKGLKARGG T++I W++G LHE ++S+ S N S LHY ++
Sbjct: 795 PRDKWPQGCVKGLKARGGVTINIRWEEGSLHEALLWSSSSQN---SRIKLHYGDQVGTIS 851
Query: 788 LSAGKIYTFNRQLKC 802
+S ++Y F++ LKC
Sbjct: 852 VSPCQVYRFSKDLKC 866
>gi|302773137|ref|XP_002969986.1| hypothetical protein SELMODRAFT_171005 [Selaginella moellendorffii]
gi|300162497|gb|EFJ29110.1| hypothetical protein SELMODRAFT_171005 [Selaginella moellendorffii]
Length = 791
Score = 837 bits (2162), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 402/784 (51%), Positives = 552/784 (70%), Gaps = 15/784 (1%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + F PA+++ +A+P+GNGRLGAMV+GG S+ ++LNEDTLW+G P D+ NP+A + L
Sbjct: 5 LSVDFFDPARYWVEALPVGNGRLGAMVFGGSSSDLIQLNEDTLWSGGPRDWNNPNAVQVL 64
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR LV +YAEA+ S ++ G +VYQ LGDI+L+F SH Y ++Y R+LDLNT
Sbjct: 65 PKVRQLVWDEKYAEASDLSKEMLGPYTEVYQPLGDIKLDFGASHATYDAQSYHRQLDLNT 124
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A V Y+VG + +TRE F+S P QVIV +I+ S++G++SF+ +LDS L ++YV +N
Sbjct: 125 ALVSVSYAVGGINYTREVFASYPHQVIVIRITSSKAGAVSFSATLDSPLQTNAYVKDSNF 184
Query: 193 IIMEGRCPGKRIPPKANA----NDDPKGIQFSAILEIKISDDRGT-ISALEDKKLKVEGS 247
I+++G+CP P ++ +D G+ F+A++E++ S G+ I+ L ++++VE
Sbjct: 185 IVVQGQCPLHVEEPTLSSPRCESDQKTGMSFAAVMEVRTSSGAGSVITKLGIQQVRVENV 244
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
DWA+L+L ASSSFDGPF +P+ + KDP + S++ L+ + LSY LY HL DYQ LFHR
Sbjct: 245 DWAMLVLAASSSFDGPFKDPTSTGKDPVAASLATLKLVEALSYKKLYAAHLKDYQALFHR 304
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
VS+Q+++ ++ + + + ER+++F ++EDP++V LLFQFGRYLLISSSRP
Sbjct: 305 VSLQINKKSRENSVVSSTSMSTQ-----ERIQAFASNEDPAMVVLLFQFGRYLLISSSRP 359
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT VANLQGIWN+DL P W PH+NINLEMNYW + CNL+EC EPLFDF++ ++INGS
Sbjct: 360 GTFVANLQGIWNKDLKPAWRCVPHLNINLEMNYWPAEVCNLAECHEPLFDFVSSMAINGS 419
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+VNY GWV HH DIW +++ G V+AL+PMGGAWLC HLWEHY +++D +FL
Sbjct: 420 HTAKVNYNMRGWVTHHNADIWVQTAPIGGDPVYALFPMGGAWLCLHLWEHYRFSLDMEFL 479
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+AYPLL GCA FL DWL + G L TNPSTSPEH FIAPDGK A VSY+S MDMAII
Sbjct: 480 RSKAYPLLTGCAQFLFDWLTGDNHGMLVTNPSTSPEHVFIAPDGKEASVSYASAMDMAII 539
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
R VF A SAA +L++ + + L P +I+ G +MEWA+DF+DP+V+HRH+
Sbjct: 540 RAVFDATSSAATILQEPNSQFTANLKHATENLFPPEISSSGLLMEWAKDFQDPDVNHRHM 599
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHLFGL+PGH+I+IE P+LC+AA +++ RG+ GPGWS+ WK ALW+RL ++AYR+V
Sbjct: 600 SHLFGLYPGHSISIESTPELCQAAVRSMYVRGDVGPGWSMAWKIALWSRLWSAQNAYRVV 659
Query: 668 KRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
KR+F L+D E+ GGLY NLF AHPPFQID NFGFTAA+AEML+QS ++YLLP
Sbjct: 660 KRMFTLMDATQTTERLDGGGLYGNLFNAHPPFQIDGNFGFTAAIAEMLLQSDETNIYLLP 719
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVK 785
+LP + W SG V GL+ARG +V I W+ G L I + H + +HYR S +
Sbjct: 720 SLP-EVWISGAVTGLRARGDTSVDIAWERGTLSSARIVPGPKCSSHT--RRIHYRWKSFE 776
Query: 786 VNLS 789
+ LS
Sbjct: 777 IRLS 780
>gi|302799394|ref|XP_002981456.1| hypothetical protein SELMODRAFT_178891 [Selaginella moellendorffii]
gi|300150996|gb|EFJ17644.1| hypothetical protein SELMODRAFT_178891 [Selaginella moellendorffii]
Length = 788
Score = 834 bits (2154), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 406/785 (51%), Positives = 553/785 (70%), Gaps = 20/785 (2%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + F PA+++ +A+P+GNGRLGAMV+GG S+ ++LN DTLW+G P D+ NP+A + L
Sbjct: 5 LSVDFFDPARYWVEALPVGNGRLGAMVFGGSSSDLIQLN-DTLWSGGPRDWNNPNAVQVL 63
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR LV +YAEA+ S ++ G +VYQ LGDI+L+F SH Y ++Y R+LDLN
Sbjct: 64 PKVRQLVWDEKYAEASDLSKQMLGPYTEVYQPLGDIKLDFGTSHATYDAQSYHRQLDLNA 123
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A V+Y++G V +TRE F+S P QVIV +IS S++G++SF+ +LDS L ++YV +N
Sbjct: 124 ALVSVRYAIGGVNYTREVFASYPHQVIVIRISSSKAGAVSFSATLDSPLQTNAYVKDSNF 183
Query: 193 IIMEGRCPGKRIPPKANA----NDDPKGIQFSAILEIKISDDRGT-ISALEDKKLKVEGS 247
I+++G+CP P ++ +D G+ F+A++E++ S G+ I+ L ++++VE
Sbjct: 184 IVVQGQCPLHVEEPTLSSPRCESDQKTGMSFAAVMEVRTSSGAGSVITKLGIQQVRVENV 243
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
DWA+L+L ASSSFDGPF NP+ KDP + S++ L+S+ LSY LY HL DYQ LFHR
Sbjct: 244 DWAMLVLAASSSFDGPFKNPTG--KDPVAASLATLKSVEALSYEKLYATHLKDYQALFHR 301
Query: 308 VSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
VS+++++ S ++ V T S + + ER+++F ++EDP++V LLFQFGRYLLISSSR
Sbjct: 302 VSLRINKKSGENSVASTTS------MSTQERIQAFASNEDPAMVSLLFQFGRYLLISSSR 355
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGT VANLQGIWN+DL P W PH+NINLEMNYW + CNL+EC EPLFDF++ ++ING
Sbjct: 356 PGTFVANLQGIWNKDLKPAWRCVPHLNINLEMNYWPAEVCNLAECHEPLFDFVSSMAING 415
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
S TA+VNY GWV HH DIW +++ G V+AL+PMGGAWLC HLWEHY +++D +F
Sbjct: 416 SHTAKVNYNMRGWVTHHNADIWVQTAPIGGDPVYALFPMGGAWLCLHLWEHYRFSLDMEF 475
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L +AYPLL GCA FL DWL + G L TNPSTSPEH FIAPDGK A VSY+S MDMAI
Sbjct: 476 LRSKAYPLLTGCAQFLFDWLTGDNHGMLVTNPSTSPEHVFIAPDGKQASVSYASAMDMAI 535
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
IR VF A SAA +L++ + + L P +I+ G +MEWA+DF+DP+V+HRH
Sbjct: 536 IRSVFDATSSAAAILQEPNSQFTANLKHATENLFPPEISSSGLLMEWAKDFQDPDVNHRH 595
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SHLFGL+PGH+I+IE P+LC+AA +++ RG+ GPGWS+ WK ALW+RL + AYR+
Sbjct: 596 MSHLFGLYPGHSISIESTPELCQAAVRSMYVRGDVGPGWSMAWKIALWSRLWSAQDAYRV 655
Query: 667 VKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
VKR+F L+D E+ GGLY NLF AHPPFQID NFGFTAA+AEML+QS ++YLL
Sbjct: 656 VKRMFTLIDATQTTERLDGGGLYGNLFNAHPPFQIDGNFGFTAAIAEMLLQSDETNIYLL 715
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
P+LP + W SG V GL+ARG +V I W+ G L I + H + +HYR S
Sbjct: 716 PSLP-EVWISGAVTGLRARGDTSVDIAWERGTLSSARIVPGPKCSSHT--RRIHYRWKSF 772
Query: 785 KVNLS 789
++ LS
Sbjct: 773 EIRLS 777
>gi|168043560|ref|XP_001774252.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162674379|gb|EDQ60888.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 818
Score = 807 bits (2085), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 401/802 (50%), Positives = 535/802 (66%), Gaps = 39/802 (4%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH 97
MV GGV SE ++LNEDTLW+G P D+ NP A + L VR LV G+YAEAT + K+ G
Sbjct: 1 MVHGGVKSELVQLNEDTLWSGGPTDWNNPKALETLPRVRELVKEGKYAEATTEAQKMLGP 60
Query: 98 PADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
+VYQ LGD++LEFDDSH Y +E+YRR+LDL+TA V Y +G+V + R+ F+S P Q
Sbjct: 61 DPEVYQPLGDLKLEFDDSHNTYDKESYRRQLDLDTAMTYVNYEIGDVSYLRQAFTSYPHQ 120
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP--GKRIPPKANANDDPK 215
V +I+GS+SGS+SF+V+LDS L V G+ I ++G+CP ++ A+ K
Sbjct: 121 VFAMRIAGSKSGSVSFSVTLDSQLMLGKEVVGSKYIALKGQCPIDSNKVTEVASPTRSSK 180
Query: 216 --GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
G++F A+L++++S + G + ++ + LKV +DWAVL L ASSSFDGPF +PS S +
Sbjct: 181 KQGMEFVAVLQVEVSGEAGRLQVVDKQTLKVHQADWAVLYLTASSSFDGPFKDPSISGIE 240
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD-----------IVTD 322
PTS + +AL ++ +LS+ D+ HL DYQ LFHRVS+ + KD IV
Sbjct: 241 PTSLAFAALANLVDLSFDDILAAHLADYQTLFHRVSLHVDNEEKDLGLWELIVPSEIVES 300
Query: 323 TCSEENI-----------------DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
E + + + +R+ +F DEDP LV LLFQFGRYLLI+SS
Sbjct: 301 KTVESGAQVSTGVDGEVYPQNAWKERISTRDRILNFDGDEDPDLVVLLFQFGRYLLIASS 360
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RP + V+NLQG+W+ L P W P +NINLEMNYW + C+L+EC PLFDFL +++
Sbjct: 361 RPNSFVSNLQGVWSNSLHPAWRCCPTLNINLEMNYWPAETCSLAECHLPLFDFLEQIAVT 420
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G+ TA+VNY GWV HH DIWA S+ G VWALWPM GAW+C HLWEHY ++ D +
Sbjct: 421 GATTAKVNYGLGGWVSHHNADIWAHSAPVSGDPVWALWPMSGAWICLHLWEHYTFSQDEE 480
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL RAYPL +GCA F ++WL+E G+L TNPSTSPEH FIAPDG+ ACVSY STMDMA
Sbjct: 481 FLRNRAYPLFKGCAEFFVNWLVEDGKGHLVTNPSTSPEHHFIAPDGQSACVSYGSTMDMA 540
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I+ F+A++SAA+++ ++E LV +V ++ RL P KI DG ++EW ++FKDPE HR
Sbjct: 541 ILHNFFNAVVSAAKIVGQDEAELVSEVKSAVGRLLPAKIGSDGRLLEWVEEFKDPEDTHR 600
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHLFGL+PGH+IT + P+LC AA +++ KRGE GPGWS WKTALWARL + +HAY
Sbjct: 601 HMSHLFGLYPGHSITPQSTPELCAAATQSILKRGEIGPGWSTAWKTALWARLWNSDHAYS 660
Query: 666 MVKRLFNLV-DPEHEKHFE-GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
M+KR+F LV E E+ F+ GGLYSNLF+AHPPFQID N GFTAAVAEML QS ++LYL
Sbjct: 661 MIKRMFTLVPSEEKEERFDGGGLYSNLFSAHPPFQIDGNLGFTAAVAEMLFQSDESNLYL 720
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTS 783
LPALP KW G + GL+ RG TV I W G+L EV + + + + LHY
Sbjct: 721 LPALPLRKWCDGLIAGLRGRGAVTVGIRWLGGNLQEVTV---QVEKNFSATRMLHYNTKV 777
Query: 784 VKV--NLSAGKIYTFNRQLKCT 803
V + + S ++YT++ L T
Sbjct: 778 VTLPKSTSGPQLYTYDGDLNLT 799
>gi|414868290|tpg|DAA46847.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
Length = 727
Score = 790 bits (2041), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 393/657 (59%), Positives = 485/657 (73%), Gaps = 30/657 (4%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 41 PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS VRSLV++G+Y EAT+A+ L G V+Q LGDI+L F + +KY YRRELDL+
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSGDQTQVFQPLGDIDLVFGED-IKYTN--YRRELDLH 157
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT V Y+VG++ +TREHFSSNP QVIVTKIS ++ G++SF VSL S LD+ V N
Sbjct: 158 TATVTVTYTVGDIVYTREHFSSNPHQVIVTKISANKPGNVSFTVSLTSPLDHKIRVTHAN 217
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG+R A D P GI+FSAIL ++I+ T+ L D LK++ +D V
Sbjct: 218 EIIMEGSCPGQRPEEIKTAADQPIGIKFSAILYLQINGANSTVEVLNDNMLKLDCADSVV 277
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF FI PS+SK DPT + + L R SYS L H+DDYQ LF RVS+Q
Sbjct: 278 LLLAATTSFQSAFIKPSESKLDPTVSAFTTLSIARRTSYSQLKAYHIDDYQTLFQRVSLQ 337
Query: 312 LS-------RSPKDIVTDTCSEENIDTV--------------------PSAERVKSFQTD 344
LS R + + + S + + P+ ER+ +F+ +
Sbjct: 338 LSQGSNYDLRRSRLVQSAETSSQGANVSDYGFQISGCTRLTSLNSFVKPTVERIVTFKDN 397
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
EDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+APH NINL+MNYW +L
Sbjct: 398 EDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDAAPHPNINLQMNYWPAL 457
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
PCNLSECQEPLFDF+ LSING+KTA+VNY ASGWV H TD+WAK+S D G VWALWP
Sbjct: 458 PCNLSECQEPLFDFIGSLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPVWALWP 517
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
MGG WL THLWEHY +T+D+ FLEK AYPLLEG A FLLDWLIEGH GYLETNPSTSPEH
Sbjct: 518 MGGPWLATHLWEHYCFTLDKHFLEKTAYPLLEGSARFLLDWLIEGHRGYLETNPSTSPEH 577
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
FIAPDGK ACVSYS+TMD++IIREVFSA+I +A++L K++ +V+++ K+LP L P K+
Sbjct: 578 YFIAPDGKEACVSYSTTMDISIIREVFSALILSADILGKSDTNVVQRIKKALPNLPPMKV 637
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
A DG+IMEWAQDF+DPE+HHRH+SHLFGL+PGHT+++E+ PDLC+A +L KRG +
Sbjct: 638 ARDGTIMEWAQDFQDPEIHHRHVSHLFGLYPGHTMSLEETPDLCRAVANSLYKRGSQ 694
>gi|326493958|dbj|BAJ85441.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 636
Score = 707 bits (1826), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 349/597 (58%), Positives = 434/597 (72%), Gaps = 30/597 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PA++FTDA PIGNGRLGA+VWGGV SE L+LN DTLWTG PG+YTNP AP
Sbjct: 40 PLKVVFASPARYFTDAAPIGNGRLGALVWGGVTSEKLQLNHDTLWTGGPGNYTNPKAPTV 99
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS+VRSLVD G Y EATA + L G YQ LGDI+L F + H+KY Y R LDL
Sbjct: 100 LSEVRSLVDKGLYPEATAVAYGLSGDETQSYQPLGDIDLAFGE-HIKYTN--YTRYLDLE 156
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
+AT V YSVG V ++REHFSSNP QVI TKIS ++ G++S VSL + LD+ V N
Sbjct: 157 SATVNVTYSVGEVVYSREHFSSNPHQVIATKISANKPGAVSCTVSLATPLDHRIRVTDAN 216
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG++ NA+D P G++F AIL + +S G + L DK LK++G+D AV
Sbjct: 217 EIIMEGSCPGEKPAGDGNASDHPPGMRFCAILYLLMSGANGQVQVLNDKMLKLDGADSAV 276
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF+GPF+ P++S DP + + + L R++SY+ L H+DDYQ LF RVS+Q
Sbjct: 277 LLLAAATSFEGPFVKPTESTLDPVASAFTTLNMARSMSYAQLKAYHMDDYQSLFQRVSLQ 336
Query: 312 LSRS-------------PKDIVTDT----CSEENIDTV----------PSAERVKSFQTD 344
LSRS P++I DT C+ + +D P+ +R+ SF+ D
Sbjct: 337 LSRSSNDVLGGSTLARLPENISQDTAVSDCTVQMVDCSRLNELNNSEKPTVDRIISFRHD 396
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
EDPSLVELLFQFGRYLLIS SRPGTQV+NLQGIWN + + W +APH NINL+MNYW SL
Sbjct: 397 EDPSLVELLFQFGRYLLISCSRPGTQVSNLQGIWNNETNAPWGAAPHPNINLQMNYWPSL 456
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
PCNLSECQ+PLFDF+ LS+NG+KTA+VNY SGWV H TD+WAK+S D G WALWP
Sbjct: 457 PCNLSECQDPLFDFIGSLSVNGAKTAKVNYGVSGWVSHQVTDLWAKTSPDAGDPSWALWP 516
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH 524
MGG WL THLWEHY++TMDR+FLE+ AYPLLEG ASFLL WLIEG +GYLETNPSTSPEH
Sbjct: 517 MGGPWLATHLWEHYSFTMDREFLERTAYPLLEGSASFLLSWLIEGQEGYLETNPSTSPEH 576
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
FIAPDGK A VSYS+TMDM+IIREVFSA++ +A++L K+ +V+++ +LPRL P
Sbjct: 577 YFIAPDGKRASVSYSTTMDMSIIREVFSAVLLSADILGKSSTDVVQRIKAALPRLPP 633
>gi|414868293|tpg|DAA46850.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
Length = 579
Score = 665 bits (1716), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 306/448 (68%), Positives = 367/448 (81%), Gaps = 3/448 (0%)
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+APH NINL+MNYW +LPCNLSECQEP
Sbjct: 129 QFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDAAPHPNINLQMNYWPALPCNLSECQEP 188
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LFDF+ LSING+KTA+VNY ASGWV H TD+WAK+S D G VWALWPMGG WL THL
Sbjct: 189 LFDFIGSLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPVWALWPMGGPWLATHL 248
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WEHY +T+D+ FLEK AYPLLEG A FLLDWLIEGH GYLETNPSTSPEH FIAPDGK A
Sbjct: 249 WEHYCFTLDKHFLEKTAYPLLEGSARFLLDWLIEGHRGYLETNPSTSPEHYFIAPDGKEA 308
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
CVSYS+TMD++IIREVFSA+I +A++L K++ +V+++ K+LP L P K+A DG+IMEWA
Sbjct: 309 CVSYSTTMDISIIREVFSALILSADILGKSDTNVVQRIKKALPNLPPMKVARDGTIMEWA 368
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
QDF+DPE+HHRH+SHLFGL+PGHT+++E+ PDLC+A +L KRG+EGPGWS +WK LW
Sbjct: 369 QDFQDPEIHHRHVSHLFGLYPGHTMSLEETPDLCRAVANSLYKRGDEGPGWSTSWKMVLW 428
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARLH+ +HAY+M+ +L LVDPEHE EGGLYSNLF AHPPFQIDANFGF AA++EMLV
Sbjct: 429 ARLHNSDHAYKMILQLITLVDPEHEVSREGGLYSNLFTAHPPFQIDANFGFPAALSEMLV 488
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSF 774
QST DLYLLPALP +KW G VKGLKARGG TV+I WK+G LHE ++S+ N +
Sbjct: 489 QSTGTDLYLLPALPRNKWPQGYVKGLKARGGVTVNISWKEGSLHEALLWSSGGQN---TL 545
Query: 775 KTLHYRGTSVKVNLSAGKIYTFNRQLKC 802
LHY V+LS+G++Y F+ LKC
Sbjct: 546 SRLHYGDQIATVSLSSGQVYRFSMDLKC 573
Score = 125 bits (313), Expect = 1e-25, Method: Compositional matrix adjust.
Identities = 59/85 (69%), Positives = 69/85 (81%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 41 PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100
Query: 72 LSDVRSLVDSGQYAEATAASVKLFG 96
LS VRSLV++G+Y EAT+A+ L G
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSG 125
>gi|386726157|ref|YP_006192483.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
gi|384093282|gb|AFH64718.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
Length = 801
Score = 644 bits (1661), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/771 (43%), Positives = 479/771 (62%), Gaps = 44/771 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+T++ PA+ +T+A+P GNGRLGAMV+GG+ E L+LNEDTLW+G PGD+ NP A + L
Sbjct: 1 MKLTYDKPARVWTEALPAGNGRLGAMVFGGMEHELLQLNEDTLWSGAPGDHNNPRAREVL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR L G+Y EA ++ G Y LGD+ L F H +A + Y R LD+
Sbjct: 61 PEVRRLALEGRYREADRLCKEMLGPYTQSYLPLGDLSLRF--HHGDHAGD-YERHLDVEG 117
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
+ R Y +G V +TRE F S+PDQV+V +++ G+LSF LDS L + + + +
Sbjct: 118 SILRTSYRIGAVTYTRELFVSHPDQVLVLRLTTDRPGALSFTAKLDSALKHRTAADAGD- 176
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
++++GR P K + P D+P G++F A L ++ G ++ L
Sbjct: 177 LVLKGRAPVK-VDPNYYRTDEPVRYAEGDAGGGMRFEARLRVQAD---GAELQVDGGALH 232
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE + LLL A++SF+G P++ +D + + + L++ L+Y +L RH DDY+
Sbjct: 233 VERATEVTLLLTAATSFNGYDKQPAEQGRDESRAAANDLRAASGLTYEELLQRHQDDYRA 292
Query: 304 LFHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF RV++ L SR+P+ + TD R+ + DP L ELLF +GRYLL
Sbjct: 293 LFGRVTLSLGASRAPEGMPTD-------------RRITEYGAS-DPGLAELLFHYGRYLL 338
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR GTQ ANLQGIWN+++ W S +NIN +MNYW + CNLSEC EPL F+
Sbjct: 339 ISSSREGTQPANLQGIWNKEVRAPWSSNYTLNINAQMNYWPAETCNLSECHEPLLGFIGR 398
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L++NG+KT VNY GW HH +DIWA+S+ G VWA WPM GAWL HLWEH
Sbjct: 399 LAVNGAKTVSVNYGLRGWTAHHNSDIWAQSAPVGAYGHGDPVWAYWPMAGAWLSAHLWEH 458
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + + D+L ++AYP+++ A F LDWL+E DG+L + PSTSPEH F+ +G+LA V+
Sbjct: 459 YAFCREEDYLREQAYPVMKEAALFCLDWLVEDADGFLVSAPSTSPEHRFVMAEGELAAVT 518
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
++TMD+A++ ++F+ I AA L + + + +L RL+P +I + G + EW +DF
Sbjct: 519 AAATMDLALVHDLFTNCIEAARTLGTDVE-FSAGLQDALDRLQPLQIGQYGQLQEWKRDF 577
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+D +VHHRH+SHL+G++PG +T E +PDL +AA ++L++RG+ G GWS+ WK LWAR
Sbjct: 578 EDEDVHHRHVSHLYGVYPGRQLTAEDSPDLFQAARQSLERRGDAGTGWSLAWKICLWARF 637
Query: 658 HDQEHAYRMVKRLFNLVDPEHEK----HFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
D A+R++ L +L E+E +GG+Y NLF AHPPFQID NFG+TA VAEML
Sbjct: 638 GDGNRAHRLIGNLLSLTS-EYEAGGQRGQQGGVYPNLFDAHPPFQIDGNFGYTAGVAEML 696
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
VQS + LLPALP D W G V GL+ARGG + + W+ G L E I S
Sbjct: 697 VQSHTGVIRLLPALP-DAWPDGEVSGLRARGGFEIGLSWQAGRLAEARIRS 746
>gi|379723425|ref|YP_005315556.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|378572097|gb|AFC32407.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
Length = 801
Score = 644 bits (1660), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/771 (43%), Positives = 479/771 (62%), Gaps = 44/771 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+T++ PA+ +T+A+P GNGRLGAMV+GGV E L+LNEDTLW+G PGD+ NP A + L
Sbjct: 1 MKLTYDKPARVWTEALPAGNGRLGAMVFGGVEHELLQLNEDTLWSGAPGDHNNPRAREVL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR L G+Y EA ++ G Y LGD+ L F H +A + Y R LD+
Sbjct: 61 PEVRRLALEGRYREADRLCKEMLGPYTQSYLPLGDLSLRF--HHGDHAGD-YERHLDVEG 117
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
+ R Y +G V +TRE F S+PDQV+V +++ G+LSF LDS L + + + +
Sbjct: 118 SILRTSYRIGAVTYTRELFVSHPDQVLVLRLTADRPGALSFTAKLDSALKHRTAADAGD- 176
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
++++GR P K + P D+P G++F A L ++ G ++ L
Sbjct: 177 LVLKGRAPAK-VDPNYYRTDEPVRYAEGDAGGGMRFEARLRVQAD---GAELQVDSGALH 232
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE + LLL A++SF+G P++ +D + + L++ L+Y +L RH DDY+
Sbjct: 233 VERATEVTLLLTAATSFNGYDKQPAEQGRDESRAAADDLRAASGLTYDELLQRHQDDYRA 292
Query: 304 LFHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF RV++ L SR+P+ + TD R+ + DP L ELLF +GRYLL
Sbjct: 293 LFGRVTLSLGASRAPEGMPTD-------------RRIAEYGAS-DPGLAELLFHYGRYLL 338
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR GTQ ANLQGIWN+++ W S +NIN +MNYW + CNLSEC EPL F+
Sbjct: 339 ISSSREGTQPANLQGIWNKEVRAPWSSNYTLNINAQMNYWPAETCNLSECHEPLLGFIGR 398
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L++NG+KT VNY GW HH +DIWA+S+ G VWA WPM GAWL HLWEH
Sbjct: 399 LAVNGTKTVSVNYGLRGWTAHHNSDIWAQSAPVGAYGHGDPVWAYWPMAGAWLSAHLWEH 458
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + + D+L ++AYP+++ A F LDWL+E DG+L ++PSTSPEH F+ +G+LA V+
Sbjct: 459 YAFCREEDYLREQAYPVMKEAALFCLDWLVEDADGFLVSSPSTSPEHRFVTAEGELAAVT 518
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
++TMD+A++ ++F+ I AA L + + + +L RL+P +I + G + EW +DF
Sbjct: 519 AAATMDLALVHDLFTNCIEAARTLGTDVE-FSAGLQDALDRLQPLQIGQYGQLQEWKRDF 577
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+D +VHHRH+SHL+G++PG +T E +PDL +AA ++L++RG+ G GWS+ WK LWAR
Sbjct: 578 EDEDVHHRHVSHLYGVYPGRQLTAEDSPDLFQAARQSLERRGDAGTGWSLAWKICLWARF 637
Query: 658 HDQEHAYRMVKRLFNLVDPEHEK----HFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
D A+R++ L +L E+E +GG+Y NLF AHPPFQID NFG+TA VAEML
Sbjct: 638 GDGNRAHRLIGNLLSLTS-EYEAGGQRGQQGGVYPNLFDAHPPFQIDGNFGYTAGVAEML 696
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
VQS + LLPALP D W G V GL+ARGG + + W+ G L E + S
Sbjct: 697 VQSHTGVIRLLPALP-DAWPDGEVSGLRARGGFEIGLSWQAGRLAEARVRS 746
>gi|337750325|ref|YP_004644487.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|336301514|gb|AEI44617.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
Length = 831
Score = 643 bits (1658), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 336/771 (43%), Positives = 479/771 (62%), Gaps = 44/771 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+T++ PA+ +T+A+P GNGRLGAMV+GGV E L+LNEDTLW+G PGD+ NP A + L
Sbjct: 31 MKLTYDKPARVWTEALPAGNGRLGAMVFGGVEHELLQLNEDTLWSGAPGDHNNPRAREVL 90
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR L G+Y EA ++ G Y LGD+ L F H +A + Y R LD+
Sbjct: 91 PEVRRLALEGRYREADRLCKEMLGPYTQSYLPLGDLSLRF--HHGDHAGD-YERHLDVEG 147
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
+ R Y +G V +TRE F S+PDQV+V +++ G+LSF LDS L + + + +
Sbjct: 148 SILRTSYRIGAVTYTRELFVSHPDQVLVLRLTADRPGALSFTAKLDSALKHRTAADAGD- 206
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
++++GR P K + P D+P G++F A L ++ G ++ L
Sbjct: 207 LVLKGRAPAK-VDPNYYRTDEPVRYAEGDAGGGMRFEARLRVQAD---GAELQVDGGALH 262
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE + LLL A++SF+G P++ +D + + L++ L+Y +L RH DDY+
Sbjct: 263 VERATEVTLLLTAATSFNGYDKQPAEQGRDESRAAADDLRAASGLTYDELLQRHQDDYRA 322
Query: 304 LFHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF RV++ L SR+P+ + TD R+ + DP L ELLF +GRYLL
Sbjct: 323 LFGRVTLSLGASRAPEGMPTD-------------RRIAEYGAS-DPGLAELLFHYGRYLL 368
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR GTQ ANLQGIWN+++ W S +NIN +MNYW + CNLSEC EPL F+
Sbjct: 369 ISSSREGTQPANLQGIWNKEVRAPWSSNYTLNINAQMNYWPAETCNLSECHEPLLGFIGR 428
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L++NG+KT VNY GW HH +DIWA+S+ G VWA WPM GAWL HLWEH
Sbjct: 429 LAVNGAKTVSVNYGLRGWTAHHNSDIWAQSAPVGAYGHGDPVWAYWPMAGAWLSAHLWEH 488
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + + D+L ++AYP+++ A F LDWL+E DG+L ++PSTSPEH F+ +G+LA V+
Sbjct: 489 YAFCREEDYLREQAYPVMKEAALFCLDWLVEDADGFLVSSPSTSPEHRFVTAEGELAAVT 548
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
++TMD+A++ ++F+ I AA L + + + +L RL+P +I + G + EW +DF
Sbjct: 549 AAATMDLALVHDLFTNCIEAARTLGTDVE-FSAGLQDALDRLQPLQIGQYGQLQEWKRDF 607
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+D +VHHRH+SHL+G++PG +T E +PDL +AA ++L++RG+ G GWS+ WK LWAR
Sbjct: 608 EDEDVHHRHVSHLYGVYPGRQLTAEDSPDLFQAARQSLERRGDAGTGWSLAWKICLWARF 667
Query: 658 HDQEHAYRMVKRLFNLVDPEHEK----HFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
D A+R++ L +L E+E +GG+Y NLF AHPPFQID NFG+TA VAEML
Sbjct: 668 GDGNRAHRLIGNLLSLTS-EYEAGGQRGQQGGVYPNLFDAHPPFQIDGNFGYTAGVAEML 726
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
VQS + LLPALP D W G V GL+ARGG + + W+ G L E + S
Sbjct: 727 VQSHTGVIRLLPALP-DAWPDGEVSGLRARGGFEIGLSWQAGRLAEARVRS 776
>gi|379721553|ref|YP_005313684.1| hypothetical protein PM3016_3718 [Paenibacillus mucilaginosus 3016]
gi|378570225|gb|AFC30535.1| hypothetical protein PM3016_3718 [Paenibacillus mucilaginosus 3016]
Length = 806
Score = 624 bits (1608), Expect = e-176, Method: Compositional matrix adjust.
Identities = 332/768 (43%), Positives = 464/768 (60%), Gaps = 39/768 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ I F PA ++T+A+P+GNGRLGAM++GGV E + LNEDTLW+G P D+ NP+A + L
Sbjct: 14 MNIQFQSPAVYWTEALPVGNGRLGAMIFGGVEKERIALNEDTLWSGYPTDWNNPEAREVL 73
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR L+ +Y EA + G Y GD+ + + H + Y R+LDL+T
Sbjct: 74 PKVRQLIAEQRYEEADRFCKFMMGPFTQSYLPFGDLHILME--HGQVCGRGYERKLDLST 131
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V Y +G+V +TRE F+S+PDQVIV +++ S+ G LSF LDS L + S + ++
Sbjct: 132 GIVTVTYDIGDVSYTREVFASHPDQVIVVRLTASKEGLLSFRAKLDSPLRSSSKPDADH- 190
Query: 193 IIMEGRCPGKRIPPKANANDD--------PKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ G P P N + PK ++F L + G +E L +
Sbjct: 191 YTLSGIAPEYVAPNYYNVKNPVHYGDQQAPKSLKFYGRLS---AVHEGGNMKVEADGLSI 247
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L A++SFD P I S + + P + A+Q+I YSD+ H+DD+ +L
Sbjct: 248 VGATSATLYFSAATSFD-PLIGASSTNRVPEQVTEEAIQAILGKKYSDIRKHHVDDHSRL 306
Query: 305 FHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
FHRV + L S +P+D+ TD +R+ + + DP LVELLF +GRYL+I
Sbjct: 307 FHRVDLHLGESSAPQDLPTD-------------QRIAEYGS-RDPGLVELLFHYGRYLMI 352
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
+SSRPGTQ ANLQGIWNED W S +NIN EMNYW + CN++E EPL DF+ L
Sbjct: 353 ASSRPGTQPANLQGIWNEDTRAPWSSNYTLNINAEMNYWPAETCNMAELHEPLIDFIGRL 412
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
++NG KTA+VNY A GWV HH +D+WA+++ G VWA WP+GG WL HLWEHY
Sbjct: 413 AVNGRKTAEVNYGARGWVAHHNSDVWAQTAPVGDYGHGDPVWAFWPLGGVWLTQHLWEHY 472
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
++ + FL AYP+++ A F LDWL DGY T+PSTSPEH+F+ D + A V
Sbjct: 473 AFSGNEAFLRDTAYPIMKQAALFCLDWLTPNEDGYWITSPSTSPEHKFMIGDQRYA-VGA 531
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
++TMD+A+I E+FS I++AE L+ +E+ +L++ +L P +I + G + EW++DF+
Sbjct: 532 AATMDLALIGELFSNCITSAETLQVDEE-FANTLLETKQKLLPMQIGKKGQLQEWSEDFE 590
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
D +VHHRH+SHL G++PG +T PDL AA ++L+ RG+ G GWS+ WK LWAR
Sbjct: 591 DEDVHHRHVSHLVGVYPGRLLTEHLAPDLFHAARRSLEIRGDGGTGWSLGWKIGLWARFK 650
Query: 659 DQEHAYRMVKRLFNLVDP-EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
+ A R++ L LV E GG+Y+NLF AHPPFQID NF TA +AEML+QS
Sbjct: 651 NGNRAERLLSNLLTLVKGDEPLNAHRGGVYANLFDAHPPFQIDGNFAATAGIAEMLLQSH 710
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
L LLPALP D W G V+GL+ RGG V + WK+G L + I S+
Sbjct: 711 QGFLELLPALP-DAWKDGYVRGLRGRGGYEVDLEWKNGLLSKAVITSS 757
>gi|337748528|ref|YP_004642690.1| hypothetical protein KNP414_04288 [Paenibacillus mucilaginosus KNP414]
gi|336299717|gb|AEI42820.1| hypothetical protein KNP414_04288 [Paenibacillus mucilaginosus KNP414]
Length = 806
Score = 623 bits (1607), Expect = e-175, Method: Compositional matrix adjust.
Identities = 332/768 (43%), Positives = 463/768 (60%), Gaps = 39/768 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ I F PA ++T+A+P+GNGRLGAM++GGV E + LNEDTLW+G P D+ NP+A + L
Sbjct: 14 MNIQFQSPAVYWTEALPVGNGRLGAMIFGGVEKERIALNEDTLWSGYPTDWNNPEAREVL 73
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR L+ +Y EA + G Y GD+ + + H + Y R+LDL+T
Sbjct: 74 PKVRQLIAEQRYEEADRFCKFMMGPFTQSYLPFGDLHIVME--HGQVCGRGYERKLDLST 131
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V Y +G+V +TRE F+S+PDQVIV +++ S+ G LSF LDS L + S + ++
Sbjct: 132 GIVTVTYDIGDVSYTREVFASHPDQVIVVRLTASKEGLLSFRAKLDSPLRSSSKPDADH- 190
Query: 193 IIMEGRCPGKRIPPKANANDD--------PKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ G P P N + PK ++F L + G +E L +
Sbjct: 191 YTLSGIAPEYVAPNYYNVKNPVHYGDQQAPKSLKFYGRLS---AVHEGGNMKVEADGLSI 247
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L A++SFD P I S + + P + A+Q+I YSD+ H+DD+ +L
Sbjct: 248 VGATSATLYFSAATSFD-PLIGASSTNRMPEQVTEEAIQAILGKKYSDIRKHHVDDHSRL 306
Query: 305 FHRVSIQL--SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
FHRV + L S +P+D+ TD R+ + + DP LVELLF +GRYL+I
Sbjct: 307 FHRVDLHLGESSAPQDLPTD-------------RRIAEYGS-RDPGLVELLFHYGRYLMI 352
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
+SSRPGTQ ANLQGIWNED W S +NIN EMNYW + CN++E EPL DF+ L
Sbjct: 353 ASSRPGTQPANLQGIWNEDTRAPWSSNYTLNINAEMNYWPAETCNMAELHEPLIDFIGRL 412
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
++NG KTA+VNY A GWV HH +D+WA+++ G VWA WP+GG WL HLWEHY
Sbjct: 413 AVNGRKTAEVNYGARGWVAHHNSDVWAQTAPVGDYGHGDPVWAFWPLGGVWLTQHLWEHY 472
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
++ + FL AYP+++ A F LDWL DGY T+PSTSPEH+F+ D + A V
Sbjct: 473 AFSGNEAFLRDTAYPIMKQAALFCLDWLTPNEDGYWITSPSTSPEHKFMIGDQRYA-VGA 531
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
++TMD+A+I E+FS I++AE L+ +E+ +L++ +L P +I + G + EW++DF+
Sbjct: 532 AATMDLALIGELFSNCITSAETLQVDEE-FANTLLETKQKLLPMQIGKKGQLQEWSEDFE 590
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
D +VHHRH+SHL G++PG +T PDL AA ++L+ RG+ G GWS+ WK LWAR
Sbjct: 591 DEDVHHRHVSHLVGVYPGRLLTEHLAPDLFHAARRSLEIRGDGGTGWSLGWKIGLWARFK 650
Query: 659 DQEHAYRMVKRLFNLVDP-EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
+ A R++ L LV E GG+Y+NLF AHPPFQID NF TA +AEML+QS
Sbjct: 651 NGNRAERLLSNLLTLVKGDEPLNAHRGGVYANLFDAHPPFQIDGNFAATAGIAEMLLQSH 710
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
L LLPALP D W G V+GL+ RGG V + WK+G L + I S+
Sbjct: 711 QGFLELLPALP-DAWKDGYVRGLRGRGGYEVDLEWKNGLLSKAVITSS 757
>gi|15613405|ref|NP_241708.1| hypothetical protein BH0842 [Bacillus halodurans C-125]
gi|10173457|dbj|BAB04561.1| BH0842 [Bacillus halodurans C-125]
Length = 795
Score = 619 bits (1596), Expect = e-174, Method: Compositional matrix adjust.
Identities = 332/767 (43%), Positives = 455/767 (59%), Gaps = 40/767 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+KI F+ PA +T+A+PIGNG LGAMV+G V E + LNEDTLW+G P D+ NP A + L
Sbjct: 1 MKIQFDFPASFWTEALPIGNGNLGAMVFGKVEKERIALNEDTLWSGYPKDWNNPKAKEVL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR L+ +Y EA S + G Y GD+ + D H + Y RELDL+T
Sbjct: 61 PKVRELIAQEKYEEADQLSRDMMGPYTQSYLPFGDLNIFMD--HGQVVAPHYHRELDLST 118
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V Y++G V++TRE F + PD+ IV +++ S+ G LSF LDSLL + S V G
Sbjct: 119 GIVTVTYTIGGVQYTRELFVTYPDRAIVVRLTASKEGFLSFRAKLDSLLRHVSSV-GAEH 177
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
+ G P + + P ++P +G+ F L + + G ++ L
Sbjct: 178 YTISGTAP-EHVSPSYYDEENPVRYGHPDMSQGMTFHGRL---AAVNEGGSLKVDADGLH 233
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+ A L AS+SFD P S ++DP+ ++ +++I Y ++ RHL+DY K
Sbjct: 234 VMGATCATLYFSASTSFD-PSTGASCLERDPSLRTIETIKAICKRGYKEIVNRHLEDYTK 292
Query: 304 LFHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF+RVS+ L S P D+ TD +R+K + + D LVELLFQ+GRYL+
Sbjct: 293 LFNRVSLHLGESIAPADMSTD-------------QRIKEYGS-RDLGLVELLFQYGRYLM 338
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I+SSRPGTQ ANLQGIWNE+ W S +NIN EMNYW + CNL+E +PL F+
Sbjct: 339 IASSRPGTQPANLQGIWNEETRAPWSSNYTLNINAEMNYWPAETCNLAELHKPLIHFIER 398
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L+ NG KTA++NY A GWV HH D+W +++ G VWA WPMGG WL HLWEH
Sbjct: 399 LAANGKKTAEINYGARGWVAHHNADLWGQTAPVGDFGHGDPVWAFWPMGGVWLTQHLWEH 458
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + D +L AYP+++ A F LDWLIE GYL T+PSTSPE F + A VS
Sbjct: 459 YTFGEDEAYLRDTAYPIMKEAALFCLDWLIENEAGYLVTSPSTSPEQRFRIGEKGYA-VS 517
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
++TMD+++I E F I AA+ L +ED V+ + + RL P +I + G + EW+ DF
Sbjct: 518 SATTMDLSLIAECFDNCIQAAKRLSIDED-FVKALSDAKQRLLPLQIGKRGQLQEWSNDF 576
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+D +VHHRH+SHL G++PG IT + P+L +AA+ +L+ RG+EG GWS+ WK +LWAR
Sbjct: 577 EDEDVHHRHVSHLVGIYPGRLITEQSAPNLFEAAKTSLEIRGDEGTGWSLGWKISLWARF 636
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D R++ + L+ + GG+Y+NLF AHPPFQID NF TA +AEML+QS
Sbjct: 637 KDGNRCERLLSNMLTLIKEDESMQHRGGVYANLFGAHPPFQIDGNFSATAGIAEMLLQSH 696
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L LPALP D W G VKGL+ RGG V + W +G L +V I S
Sbjct: 697 QGYLEFLPALP-DSWKDGYVKGLRGRGGYEVDLAWTNGALVKVEIVS 742
>gi|261406536|ref|YP_003242777.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261282999|gb|ACX64970.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 806
Score = 612 bits (1578), Expect = e-172, Method: Compositional matrix adjust.
Identities = 338/816 (41%), Positives = 489/816 (59%), Gaps = 48/816 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + PA +T+A+PIGNGRLG MV+G V ET+ LNEDTLW+G P D+ NP A +AL
Sbjct: 1 MKLQYVKPATVWTEALPIGNGRLGGMVYGCVERETISLNEDTLWSGYPRDWNNPSALEAL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
++R L G+Y EA K+ G + Y LGD+ L FD + + +YRR LD+
Sbjct: 61 PEIRELASQGRYMEADQLGRKMMGPYTESYLPLGDLCLRFDHGGVFH---SYRRTLDIAN 117
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A R +Y +G V +TRE F+S+PDQ+I +++ S + +L+F+ L+S L ++ +
Sbjct: 118 AVQRTEYRIGEVTYTRECFASSPDQMIALRLTSSAACALNFHAYLESPL-RYTVKTEEDM 176
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
M G P +R+ P ++D P + F+ L + +D R T+ + +
Sbjct: 177 YAMSGFAP-ERVEPSYVSSDHPIRYGDPDHTAAMAFNGRLAVAETDGRVTV---DSAGIH 232
Query: 244 VEGSDWAVLLLVASSSFDGPFINPS--DSKKDPTSE----SMSALQSIRNLSYSDLYTRH 297
V + AV+ A++SF+G P D P + + +++ + S+++L RH
Sbjct: 233 VLDASEAVIYFTAATSFNGFDQIPGHRDGGDHPAAAAAALTAGTMKAACSQSWTELRDRH 292
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++DY+ LF RVS++L +T + E++DT ER++ F DP LVELLF +G
Sbjct: 293 INDYRSLFDRVSLRLG--------ETLAAEDMDT---GERIERFGA-RDPGLVELLFHYG 340
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSRPGTQ ANLQGIWN P W S +NIN +MNYW + CNL+EC +PL +
Sbjct: 341 RYLLISSSRPGTQAANLQGIWNASTRPPWSSNWTLNINAQMNYWPAEVCNLAECHQPLLE 400
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTH 473
+ LS+NG++TA V+Y GW +HH TDIWA ++ G WALW MGG WL H
Sbjct: 401 LIRSLSVNGAETAAVHYGTRGWTVHHNTDIWAHTAPVGNYGDGDPSWALWQMGGIWLTQH 460
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY Y+ D +L AYPL++ + F LDWLIE G+L T+PSTSPEH+F +G +
Sbjct: 461 LWEHYAYSGDEAYLRSFAYPLMKEASLFALDWLIENDAGHLVTSPSTSPEHKFRTSEG-M 519
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A +S +TMD+++I E+F+ + AA +L +E+ E+ RL P K+ G + EW
Sbjct: 520 AAISEGATMDISLIWELFTNCMEAAGILGVDEE-FREEWSSKRERLLPLKVGRYGQLQEW 578
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D +D +V HRH SHL G++PG ++ E++PDL AA+ +L++RGEE GWS+ W+ AL
Sbjct: 579 SHDSEDEDVFHRHTSHLVGVYPGRQLSAEESPDLFAAAQTSLERRGEESTGWSLGWRVAL 638
Query: 654 WARLHDQEHAYRMVKRLFNLV-DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
W+R D A R++ + LV D + E++ GG+Y++L AHPPFQID NF TA +AEM
Sbjct: 639 WSRFGDGNRALRLLTNMLRLVRDGDSERYDHGGVYASLLGAHPPFQIDGNFAATAGIAEM 698
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHD 772
L+QS + L LLPALP D W G V+GL+ARGG V I WK+G L E I S N
Sbjct: 699 LLQSHRSLLMLLPALP-DAWQEGEVRGLRARGGFEVGIRWKNGRLTEAEIMSRLGNVCSV 757
Query: 773 SFKTLH----YRG-TSVKVNLSAGKIYTFNRQLKCT 803
S + Y+G TS+ V +SA + +F + T
Sbjct: 758 SIGNGNGIAVYQGDTSIPVPVSAKGVVSFETEQGLT 793
>gi|329926959|ref|ZP_08281359.1| hypothetical protein HMPREF9412_5205 [Paenibacillus sp. HGF5]
gi|328938789|gb|EGG35165.1| hypothetical protein HMPREF9412_5205 [Paenibacillus sp. HGF5]
Length = 812
Score = 600 bits (1547), Expect = e-168, Method: Compositional matrix adjust.
Identities = 339/818 (41%), Positives = 486/818 (59%), Gaps = 50/818 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + PA +T+A+PIGNGRLG MV+GGV ET+ LNEDTLW+G P D+ NP A +AL
Sbjct: 5 MKLQYVKPATVWTEALPIGNGRLGGMVYGGVERETISLNEDTLWSGYPRDWNNPSAREAL 64
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
++R L G+Y EA K+ G Y LGD+ L FD + + +YRR LD+
Sbjct: 65 PEIRELASQGRYMEADQLGRKMMGPYTQSYLPLGDLCLRFDHGGVFH---SYRRTLDIAN 121
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A R +Y +G V +TRE F+S+PDQ+I +++ S + SL+F+ L+S L ++ +
Sbjct: 122 AVQRTEYRIGEVTYTRECFASSPDQMIALRLTSSAACSLNFHAYLESPL-RYTVKTEEDM 180
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
M G P +R+ P ++D P + F L + +D R T+ A +
Sbjct: 181 YAMSGFAP-ERVEPSYVSSDRPIRYGDPEHTAAMAFDGRLAVAETDGRVTMDA---AGIH 236
Query: 244 VEGSDWAVLLLVASSSFDGPFINPS--DSKKDPTSESM----SALQSIRNLSYSDLYTRH 297
V + AV+ A++SF+G P D P + + +++ + S+++L RH
Sbjct: 237 VLEASEAVIYFTAATSFNGFDQIPGHRDGGDHPAAAAAAIAAGTMKAACSQSWTELRDRH 296
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++DY+ LF RVS++L +T + ++DT ER++ F DP LVELLF +G
Sbjct: 297 VNDYRSLFDRVSLRLG--------ETLAVGDMDT---EERIERFGA-RDPGLVELLFHYG 344
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSRPGTQ ANLQGIWN P W S +NIN +MNYW + CNL+EC +PL +
Sbjct: 345 RYLLISSSRPGTQAANLQGIWNASTRPPWSSNWTLNINAQMNYWPAEVCNLAECHQPLLE 404
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTH 473
+ LS+NG++TA V+Y GW +HH TDIWA ++ G WALW MGG WL H
Sbjct: 405 LIRSLSVNGAETAAVHYGTRGWTVHHNTDIWAHTAPVGNYGDGDPSWALWQMGGIWLTQH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY Y+ D +L AYPL++ + F +DWLIE G+L T+PSTSPEH+F +G L
Sbjct: 465 LWEHYAYSGDEAYLRSFAYPLMKEASLFAMDWLIENDAGHLLTSPSTSPEHKFRTSEG-L 523
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
A VS +TMD+++I E+F+ + AA +L +E+ E+ RL P ++ G + EW
Sbjct: 524 AAVSEGATMDISLIWELFTNCMEAAVILGVDEE-FREEWSSKRERLLPLQVGRYGQLQEW 582
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D +D +V+HRH SHL G++PG ++ E+NPDL AA+ +L++RGEE GWS+ W+ AL
Sbjct: 583 SHDSEDEDVYHRHTSHLVGVYPGRQLSAEENPDLFAAAQTSLERRGEESTGWSLGWRVAL 642
Query: 654 WARLHDQEHAYRMVKRLFNLV-DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
W R D A R++ + LV D + E++ GG+Y++L AHPPFQID NF A +AEM
Sbjct: 643 WGRFGDGNRALRLLTNMLRLVRDGDSERYDHGGVYASLLGAHPPFQIDGNFAAAAGIAEM 702
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN---- 768
L+QS L LLPALP D W G V+GL+ARGG V I WK+G L E I S N
Sbjct: 703 LLQSHRPLLMLLPALP-DAWPEGEVRGLRARGGFEVGIRWKNGRLTEAQIMSRLGNVCSV 761
Query: 769 ---NDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCT 803
N H + ++ TS+ V +SA +++F + T
Sbjct: 762 SIGNGHGNGIAVYQGDTSIPVQVSAKGVFSFETEQGLT 799
>gi|374374701|ref|ZP_09632359.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
gi|373231541|gb|EHP51336.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
Length = 855
Score = 599 bits (1545), Expect = e-168, Method: Compositional matrix adjust.
Identities = 319/780 (40%), Positives = 467/780 (59%), Gaps = 38/780 (4%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+ + ++ LK+ + PA + +A+P+GNG+ GAMV+GGV +E +LN++TLW+G P
Sbjct: 20 AQRSQSSQELKLWYTKPASIWEEALPLGNGKTGAMVFGGVGTERFQLNDNTLWSGAPNPG 79
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P P L+ VR LV +GQY A ++ G + Y + D+ L+ +
Sbjct: 80 NTPGGPAILAAVRKLVFAGQYDSAAVVWKQMHGPYSARYLPMADLWLKLKGADT--IASA 137
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R+LDL+TATA V Y++ V +TR+ F S PD+ +V +I+ + ++SF +L S L
Sbjct: 138 YYRDLDLHTATATVNYTLHGVRYTRQTFISYPDKAMVIRITADKKNAVSFTAALSSKLKY 197
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALE 238
+NG N ++++G+ P K + +A DD G + +++K+ GT++
Sbjct: 198 KVALNGKNGLLLKGKAP-KFVANRAYEKEQVVYDDWNGEGTNFEVQVKVIAQEGTVNG-A 255
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D++L V ++ + L ++SF+G +P KDP E+ + +Q ++ + + L H
Sbjct: 256 DEQLTVSNANAVTIYLTNATSFNGFDKSPGKEGKDPHVEATATMQRVQVMPFERLLQNHT 315
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFG 357
DY++LF+RVS + + +P+ ER+K F + +D L L +QFG
Sbjct: 316 TDYRRLFNRVSFAIENRSANA-----------KLPTNERLKVFTKAPDDFGLQTLYYQFG 364
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYL+I++SRPG+Q NLQGIWN+ + P W S VNIN EMNYW + NLSEC +PLFD
Sbjct: 365 RYLMIAASRPGSQPTNLQGIWNDQVQPPWGSNYTVNINTEMNYWPAENTNLSECHQPLFD 424
Query: 418 FLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRG--------KVVWALWPMGGA 468
F+ L++NG+ TA+VNY + GW +HH +DIWAK+S G K W+ WPM G
Sbjct: 425 FMKELAVNGAVTAKVNYGIKEGWTVHHNSDIWAKTSPPGGQGWVDPSAKTRWSCWPMAGG 484
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPSTSPEHEFI 527
W THLWEHY YT D FL AYPL++G A FL WL++ GY TNPSTSPE+ +
Sbjct: 485 WFSTHLWEHYLYTGDEAFLRNTAYPLMKGAAQFLQHWLVKDPVTGYWVTNPSTSPENT-M 543
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAE 586
+GK V+ +STMDM+IIRE+F+ +I AA VL+ DA L ++ +L P I +
Sbjct: 544 KVNGKEYEVAMASTMDMSIIRELFTDVIKAAAVLK--TDAAFAATLSTIKEKLYPFHIGQ 601
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
G + EW +D+ DP+ HRHLSHLFGL+PG IT+ + P+L AA+++L RG+ GWS
Sbjct: 602 YGQLQEWFKDWDDPKDQHRHLSHLFGLYPGSQITLSETPELAAAAKQSLIFRGDVSTGWS 661
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHF--EGGLYSNLFAAHPPFQIDANFG 704
+ WK WARLHD EHAY+++ F+ +DP ++ GG Y NLF AHPPFQID NFG
Sbjct: 662 MAWKINWWARLHDGEHAYKILSDAFHYIDPREKRAVMGGGGAYPNLFDAHPPFQIDGNFG 721
Query: 705 FTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
TA + E+L+QS L+LLPALP W G + G++ARG VSI W + L + IY+
Sbjct: 722 ATAGMTELLLQSHEGYLFLLPALP-SVWKKGSISGIRARGDFNVSIDWSNSRLSKAIIYA 780
>gi|158430814|pdb|2RDY|A Chain A, Crystal Structure Of A Putative Glycoside Hydrolase Family
Protein From Bacillus Halodurans
gi|158430815|pdb|2RDY|B Chain B, Crystal Structure Of A Putative Glycoside Hydrolase Family
Protein From Bacillus Halodurans
Length = 803
Score = 598 bits (1541), Expect = e-168, Method: Compositional matrix adjust.
Identities = 328/766 (42%), Positives = 442/766 (57%), Gaps = 38/766 (4%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LKI F+ PA +T+A+PIGNG LGA V+G V E + LNEDTLW+G P D+ NP A + L
Sbjct: 3 LKIQFDFPASFWTEALPIGNGNLGAXVFGKVEKERIALNEDTLWSGYPKDWNNPKAKEVL 62
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR L+ +Y EA S G Y GD+ + D H + Y RELDL+T
Sbjct: 63 PKVRELIAQEKYEEADQLSRDXXGPYTQSYLPFGDLNIFXD--HGQVVAPHYHRELDLST 120
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V Y++G V++TRE F + PD+ IV +++ S+ G LSF LDSLL + S V G
Sbjct: 121 GIVTVTYTIGGVQYTRELFVTYPDRAIVVRLTASKEGFLSFRAKLDSLLRHVSSV-GAEH 179
Query: 193 IIMEGRCP--------GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ G P + P + D +G F L + + G ++ L V
Sbjct: 180 YTISGTAPEHVSPSYYDEENPVRYGHPDXSQGXTFHGRL---AAVNEGGSLKVDADGLHV 236
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L AS+SFD P S ++DP+ ++ +++I Y ++ RHL+DY KL
Sbjct: 237 XGATCATLYFSASTSFD-PSTGASCLERDPSLRTIETIKAICKRGYKEIVNRHLEDYTKL 295
Query: 305 FHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
F+RVS+ L S P D TD +R+K + + D LVELLFQ+GRYL I
Sbjct: 296 FNRVSLHLGESIAPADXSTD-------------QRIKEYGS-RDLGLVELLFQYGRYLXI 341
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
+SSRPGTQ ANLQGIWNE+ W S +NIN E NYW + CNL+E +PL F+ L
Sbjct: 342 ASSRPGTQPANLQGIWNEETRAPWSSNYTLNINAEXNYWPAETCNLAELHKPLIHFIERL 401
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
+ NG KTA++NY A GWV HH D+W +++ G VWA WP GG WL HLWEHY
Sbjct: 402 AANGKKTAEINYGARGWVAHHNADLWGQTAPVGDFGHGDPVWAFWPXGGVWLTQHLWEHY 461
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
+ D +L AYP+ + A F LDWLIE GYL T+PSTSPE F + K VS
Sbjct: 462 TFGEDEAYLRDTAYPIXKEAALFCLDWLIENEAGYLVTSPSTSPEQRFRIGE-KGYAVSS 520
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
++T D+++I E F I AA+ L +ED V+ + + RL P +I + G + EW+ DF+
Sbjct: 521 ATTXDLSLIAECFDNCIQAAKRLSIDED-FVKALSDAKQRLLPLQIGKRGQLQEWSNDFE 579
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
D +VHHRH+SHL G++PG IT + P+L +AA+ +L+ RG+EG GWS+ WK +LWAR
Sbjct: 580 DEDVHHRHVSHLVGIYPGRLITEQSAPNLFEAAKTSLEIRGDEGTGWSLGWKISLWARFK 639
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D R++ L+ + GG+Y+NLF AHPPFQID NF TA +AE L+QS
Sbjct: 640 DGNRCERLLSNXLTLIKEDESXQHRGGVYANLFGAHPPFQIDGNFSATAGIAEXLLQSHQ 699
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L LPALP D W G VKGL+ RGG V + W +G L +V I S
Sbjct: 700 GYLEFLPALP-DSWKDGYVKGLRGRGGYEVDLAWTNGALVKVEIVS 744
>gi|375148572|ref|YP_005011013.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361062618|gb|AEW01610.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 850
Score = 597 bits (1540), Expect = e-168, Method: Compositional matrix adjust.
Identities = 324/768 (42%), Positives = 457/768 (59%), Gaps = 34/768 (4%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA + +A+P+GNG+ GAMV+GGV +E L+LN++TLW+G P NP+ P L
Sbjct: 25 LKLWYNKPADAWEEALPLGNGKTGAMVFGGVATERLQLNDNTLWSGYPEAGNNPNGPTVL 84
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
VR V G Y +A A K+ G + Y LGD+ A TY RELDLN
Sbjct: 85 PQVRQAVFEGDYEKAAALWKKMQGPYSARYLPLGDLWWRVQSKDTLPA--TYYRELDLNK 142
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A + V+Y +G V + RE F S P +++V +I+ + G + + L S L +
Sbjct: 143 AVSTVRYKIGEVTYQRETFISYPSKLLVMRITADKKGVIDGVLDLTSKLHFKVTTTDADY 202
Query: 193 IIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+++ G+ P + P+ D G + + +KI + G + + LKV G++
Sbjct: 203 LVLRGKAPKFVANRDYEPQQVGYDSANGEGMNFEVHVKIKTEGGKVEQ-SNNALKVSGAN 261
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ L ++SF+G +P KDP++E+ + LQ L+Y L H+ DYQ LF RV
Sbjct: 262 TVTIYLSEATSFNGFNKSPGLEGKDPSTEAKANLQKALRLTYEQLKAAHMRDYQNLFKRV 321
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRP 367
+ L +P+ ER+K + ++ D L L +QFGRYLLI+SSRP
Sbjct: 322 ELNLGPG-----------NGAAKLPTDERLKQYASNPTDQQLQVLYYQFGRYLLIASSRP 370
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G++ ANLQGIWN+ + P W S NIN EMNYW + NLSEC +PLFDF+ L++NG+
Sbjct: 371 GSRPANLQGIWNDHIQPPWGSNYTTNINTEMNYWLAENTNLSECHQPLFDFMKELAVNGA 430
Query: 428 KTAQVNY-LASGWVIHHKTDIWAKSSA-------DRGKVVWALWPMGGAWLCTHLWEHYN 479
+TA+VNY ++ GWV+HH +D+WAK+S +G W+ WPM GAWL THLWEHY
Sbjct: 431 QTAKVNYNISEGWVVHHNSDLWAKTSPPGGWDWDPKGMPRWSAWPMAGAWLSTHLWEHYL 490
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
YT D+ FL K A+PL++G A F++ WLI + +G L TNPSTSPE+ + GK V
Sbjct: 491 YTGDKTFL-KNAWPLMKGAAQFMIHWLITDPANGLLVTNPSTSPENT-MKIKGKEYQVGM 548
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
++TMDM+IIRE+F+A+I + VL + + ++V+K+ +L P I + G + EW +D+
Sbjct: 549 ATTMDMSIIRELFTAVIKTS-VLLQTDAVFRDQVIKAKEKLYPFHIGQYGQLQEWFKDWD 607
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
DP HRHLSHLFGL+PG I P+L AA+++L RG+ GWS+ WK WARL
Sbjct: 608 DPNDKHRHLSHLFGLYPGSQINPATTPELAAAAKQSLIFRGDVSTGWSMAWKINWWARLQ 667
Query: 659 DQEHAYRMVKRLFNLVDPE--HEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
D HAY+++ F +DP + GG Y NLF AHPPFQID NFG TA + E+L+QS
Sbjct: 668 DGNHAYKILSDAFTYIDPRVTRDAMSGGGTYPNLFDAHPPFQIDGNFGATAGITELLLQS 727
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+L LLPALP D W SG +KG+KARG TV+I WKDG L + I S
Sbjct: 728 HNGELALLPALP-DAWKSGSIKGIKARGNFTVAIDWKDGKLSKATITS 774
>gi|326800263|ref|YP_004318082.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326551027|gb|ADZ79412.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 855
Score = 597 bits (1539), Expect = e-168, Method: Compositional matrix adjust.
Identities = 327/775 (42%), Positives = 476/775 (61%), Gaps = 37/775 (4%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN + GAMV+GGV E +LN++TLW+G P NP+ PK L
Sbjct: 30 LKLWYTKPASVWEEALPLGNAKTGAMVFGGVQVERYQLNDNTLWSGFPNPGNNPNGPKIL 89
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDL 130
VR + G Y +A + ++ G + Y LGD+ L+F DS +Y+R+LDL
Sbjct: 90 PRVRRAIFDGDYEKAASLWKQMQGPYSARYLPLGDLLLDFHRPDS----LTTSYQRDLDL 145
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A + +KY+ V +TRE F S PD+ + +I+ ++ G+++F+V+L S L + + +
Sbjct: 146 DKALSTIKYTYRGVMYTRETFISRPDKTMAIRITANKPGAVAFDVALTSKLKHQTKAARH 205
Query: 191 NQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ +I++G+ P + P+ DD G + + +K+ G + +D +L V G
Sbjct: 206 DYLILQGKAPKFVANREYEPQQIVYDDRDGEGMNFEIHVKVQAIGGEVKT-DDNRLCVSG 264
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D +L L ++SF+G +P + KDP E+ + ++ SY ++ +RH+ D+ LF
Sbjct: 265 ADSVILWLTEATSFNGFDKSPGLNGKDPAVEAAACMERASKSSYQEVKSRHIADHAALFR 324
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSS 365
RVSI L + P+ + +P ER+ + + D +L L +Q+GRYLLI+SS
Sbjct: 325 RVSIDLGKDPEAV-----------RLPIDERMLRLAEGKSDNALQALYYQYGRYLLIASS 373
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG + ANLQGIWN+ + P W S NIN EMNYW + NLSEC +PLFDF+ L++N
Sbjct: 374 RPGGRPANLQGIWNDMVQPPWGSNYTTNINTEMNYWLAENTNLSECHQPLFDFMKELAVN 433
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGGAWLCTHLWEH 477
G+ TA+VNY + GWV HH +D+WAK+S +G W+ WPM GAW CTHLWEH
Sbjct: 434 GAVTAKVNYNIDDGWVTHHNSDLWAKTSPPGGYDWDPKGMPRWSAWPMAGAWFCTHLWEH 493
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACV 536
Y YT D+ FL++ AYPL++G ASF+L WLIE YL TNPSTSPE+ + GK +
Sbjct: 494 YLYTGDKKFLKEEAYPLMKGAASFMLHWLIEDPGSHYLITNPSTSPENT-VKIAGKEYQL 552
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +STMDMAIIRE+F+A I +A++L ++D EK++ + +L P I + G + EW QD
Sbjct: 553 SMASTMDMAIIRELFNACIRSADILGSDKD-FKEKLIMAKAKLYPYHIGQYGQLQEWYQD 611
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ DP HRH+SHLFGL+PG+ IT+ +P+L A +++L RG+ GWS+ WKT WAR
Sbjct: 612 WDDPADKHRHISHLFGLYPGNQITVLGSPELAAATKQSLIHRGDVSTGWSMAWKTNWWAR 671
Query: 657 LHDQEHAYRMVKRLFNLVDP--EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
L D HAY+++K +DP E E+ GG Y NLF AHPPFQID NFG TA + EML+
Sbjct: 672 LQDGNHAYKILKDALRYIDPNEEKEQMSGGGAYPNLFDAHPPFQIDGNFGATAGMTEMLL 731
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
QS ++ LLPALP D W +G +KG+KARG TV I W + +L I S N
Sbjct: 732 QSHAGEVQLLPALP-DAWPAGSIKGIKARGNFTVEINWANRNLTRALIRSELGGN 785
>gi|251797558|ref|YP_003012289.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
gi|247545184|gb|ACT02203.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
Length = 790
Score = 594 bits (1532), Expect = e-167, Method: Compositional matrix adjust.
Identities = 320/768 (41%), Positives = 460/768 (59%), Gaps = 40/768 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ +N + +TDA+P GNGRLGAM++GG E ++LNEDTLW+G P N +A K L
Sbjct: 1 MKLQYNRASVRWTDALPTGNGRLGAMMFGGSEMERIQLNEDTLWSGGPRYGDNDNAVKVL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR L++ GQYA A ++ G Y + D+ ++F + + YRR L L
Sbjct: 61 PEVRKLIEEGQYAAADRLCKQMMGTYTQSYLPMADLYIKFLHGNTM---KNYRRALHLGD 117
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
AT+ V+Y +GNV +TR F S PDQV+V ++ S+ G L+F L+S L + + +
Sbjct: 118 ATSTVEYQIGNVTYTRRLFVSYPDQVVVLRLEASQPGKLNFLARLESPLRYETAFD-QDA 176
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
+I+ G P +++ P D P ++F + ++ D G S D L+
Sbjct: 177 LILRGDAP-EQVDPSYYDTDMPVKYGEPGSANAMRFEGRMAARL--DEGQASYGHDG-LR 232
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+ L+ A++SF+G +P KD ++ + + L+ + LSY L RH++D++K
Sbjct: 233 VTGATAVTLIFSAATSFNGYDRSPGSEGKDESAAASAYLEQAKKLSYESLLQRHVEDHRK 292
Query: 304 LFHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF+RV + L S P D TD R++ + DP LVELL+ +GRYL+
Sbjct: 293 LFNRVELSLGESVAPPDYPTDA-------------RIRDYGA-SDPGLVELLYHYGRYLM 338
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSR GTQ ANLQGIWNE+ W +NIN EMNYW + CNL++C PL DF+
Sbjct: 339 IGSSRKGTQPANLQGIWNEETRAPWSGNYTLNINAEMNYWPAETCNLADCHTPLLDFIGN 398
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
LS NG KTA NY A+GW HH +DIW +S+ G WA WPMGG WLC HLWEH
Sbjct: 399 LSKNGRKTASTNYGAAGWTAHHNSDIWCQSAPAGDYGHGDPGWAFWPMGGVWLCQHLWEH 458
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + +D FL +AYP+++ A F LDWL E DG L T+PSTSPEH+F +G LA VS
Sbjct: 459 YAFGLDEAFLRDKAYPVMKEAALFCLDWLHEDKDGRLITSPSTSPEHKFRTAEG-LAAVS 517
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
+STMD+++I ++F+ +I A+ +L +E E++ + RL P +I E+G + EW++DF
Sbjct: 518 AASTMDLSLIWDLFTNLIEASTILGVDE-PFRERLADTRSRLHPLQIGENGRLQEWSKDF 576
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+D + HRH+SHLFG++PG +T + P+L AA+++L+ RG+ G GWS+ WK LWAR
Sbjct: 577 EDEDQFHRHVSHLFGVYPGRQLTWGETPELMAAAQRSLEIRGDGGTGWSLGWKVGLWARF 636
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
+ A ++ L LV+ + + GG+Y NLF AHPPFQID NF T+ +AE+LVQS
Sbjct: 637 GNGNRALGLLSNLLTLVEEGNTNYHHGGVYGNLFDAHPPFQIDGNFAATSGIAELLVQSH 696
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
L LLP+LP D W G V+GL+ARG VS+ W++G + I SN
Sbjct: 697 QGYLELLPSLP-DAWPQGYVRGLRARGHFDVSLQWEEGAVTTAEIVSN 743
>gi|338213674|ref|YP_004657729.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
gi|336307495|gb|AEI50597.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
Length = 880
Score = 593 bits (1528), Expect = e-166, Method: Compositional matrix adjust.
Identities = 327/782 (41%), Positives = 464/782 (59%), Gaps = 50/782 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ F PA+ + +A+P+GNG+ GAMV+G V E +LN++TLW+G P + NP+ P L
Sbjct: 43 LKLWFTQPARIWEEALPLGNGKTGAMVFGRVNRERYQLNDNTLWSGYPIEGNNPNGPTVL 102
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDL 130
+VR + G+Y +A + K+ G Y +GD+ L+F DS Y RELDL
Sbjct: 103 PEVRKAIFEGKYDKADSLWKKMQGPYCARYLPMGDLHLDFGFRDS----TATDYYRELDL 158
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
NTA A VKY+VG V +TRE F S+P V+V +I+ ++ S++ + +L S L
Sbjct: 159 NTAVAIVKYTVGGVTYTRETFISHPASVMVVRITANKKNSINMSAALSSRLRFSVLPGET 218
Query: 191 NQIIMEGRCPGKRI------PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N+I+++G+ P K + P + +DDPKG + L +K + G I+ ++ KL +
Sbjct: 219 NEIVLKGKAP-KHVAHRAAEPQQIVYDDDPKGEGTNFELRVKAQTEGGKITN-QNGKLLI 276
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G++ + ++SF+G +P KDP+ E+ + L+ + SY+ L + H+ DYQ+L
Sbjct: 277 SGANAVTYYVAGATSFNGFDKSPGREGKDPSVETNAILKKAGSQSYAQLKSAHISDYQRL 336
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER-VKSFQTDEDPSLVELLFQFGRYLLIS 363
F RVS+ L P+ + +P+ ER ++ D L L +QFGRYLLI+
Sbjct: 337 FQRVSLDLGTDPEAL-----------KLPTDERLIRQQNGPADTHLQTLYYQFGRYLLIA 385
Query: 364 SSRPGTQ-----VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
SSR G ANLQGIWN+ + P W S NIN EMNYW + NLSEC P+ F
Sbjct: 386 SSRNGASGAAGTPANLQGIWNDHIQPPWGSNFTTNINFEMNYWLAENANLSECHLPMLQF 445
Query: 419 LTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGGAWL 470
+ +L++NG+KTA+VNY + GW+ HH TDIWAK+SA R + W+ W M GAWL
Sbjct: 446 IGHLAVNGAKTAKVNYGINEGWITHHGTDIWAKTSAGGGYEWDPRSRGSWSSWLMAGAWL 505
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
THLWEHY +T D+ FL + YPL++ A F+L WL+E G+L TNPS+SPE+ +
Sbjct: 506 STHLWEHYQFTGDQTFLRDQGYPLMKSAAQFMLHWLLEDGQGHLITNPSSSPENT-VKIS 564
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
GK ++ +STMDMAIIRE+FS I AA+ L K + A ++ ++ RL P +I + G +
Sbjct: 565 GKEYQITMASTMDMAIIRELFSDCIQAAKQL-KTDAAFQTQLEQAKARLYPYQIGQYGQL 623
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW +D+ DP HRH+SHLFGL PGH I + P+L AA+K+L +RG+ GWS+ WK
Sbjct: 624 QEWYRDWDDPNDKHRHISHLFGLHPGHQINPRQTPELAAAAKKSLMQRGDVSTGWSMAWK 683
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEK--------HFEGGLYSNLFAAHPPFQIDAN 702
WARL D HAY++++ + V P+ GG Y NLF AHPPFQID N
Sbjct: 684 INWWARLEDGNHAYKILRDGLSYVGPKSSSRNGEVLTTQSGGGTYPNLFDAHPPFQIDGN 743
Query: 703 FGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
FG TA + EML+QS ++ LLPALP D W G V+GLKARG V I W+ G L + I
Sbjct: 744 FGGTAGITEMLLQSHTGEISLLPALP-DAWPKGSVRGLKARGNFDVDIRWEAGKLTQASI 802
Query: 763 YS 764
S
Sbjct: 803 VS 804
>gi|253574360|ref|ZP_04851701.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
gi|251846065|gb|EES74072.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
Length = 817
Score = 592 bits (1527), Expect = e-166, Method: Compositional matrix adjust.
Identities = 337/801 (42%), Positives = 480/801 (59%), Gaps = 53/801 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA +T+A+P+GNGRLGAM++GGV ET+ LNEDTLW+G P D+ NP A + L
Sbjct: 6 KLQYDRPATVWTEALPVGNGRLGAMIYGGVERETISLNEDTLWSGYPRDWNNPSARQVLP 65
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+VR LV G+Y EA ++ G + Y GD++L F+ A +YRR LDL A
Sbjct: 66 EVRKLVREGRYEEADQLGRQMLGPYTESYLPFGDLQLTFEHGA---ACRSYRRTLDLADA 122
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+Y+VG V + RE F S+PD++I +++ S+ G+L+F+ LDS L + + V +
Sbjct: 123 IHVTEYTVGKVSYKREIFVSHPDRIIAMRLTCSQPGALAFHARLDSPLRHIAAVE-DGIF 181
Query: 194 IMEGRCPGKRIPPKANAN-----DDPK---GIQFSAILEIKISDDRGTISALEDKKLKVE 245
+M G P + P NA+ DP + F L + +D R ++ + ++V
Sbjct: 182 VMRGTAPERVEPNYVNADRPIRYGDPAVSPAMAFEGRLAVTETDGRVSV---DGDGIRVL 238
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS------YSDLYTRHLD 299
+ AVL A++SFD P + + ++A ++ +L+ Y ++ RH++
Sbjct: 239 DATEAVLYFSAATSFDRFDQIPGAGRPESVPADVAAARARADLTGALANRYLEIRARHIE 298
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF RVS++L +T + E +DT ER DP LVELLF +GRY
Sbjct: 299 DYQALFSRVSLRLG--------ETAAPEGLDT----ERRIVEYGAADPGLVELLFHYGRY 346
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLI+SSRPGTQ ANLQGIWN P W S +NIN EMNYW + CNL+EC PL + +
Sbjct: 347 LLIASSRPGTQAANLQGIWNAMTRPPWSSNWTLNINAEMNYWPAEVCNLAECHWPLLEMI 406
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLW 475
L+ NG+KTA VNY GWV HH +DIW +++ G VWALWP+GG WL HLW
Sbjct: 407 GNLAENGAKTAAVNYGTRGWVAHHNSDIWGQTAPVGDFGGGDPVWALWPLGGVWLTQHLW 466
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY + D +L AYP+L+ A F LDWLIE G+L T+PSTSPEH+F +G +A
Sbjct: 467 EHYVFGGDVAYLHDFAYPILKDAALFALDWLIEDESGHLVTSPSTSPEHKFRTANG-VAA 525
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+S STMD+++I E+F+ I AA VL +E A E++ ++ RL P ++ + G + EW++
Sbjct: 526 ISEGSTMDLSLIWELFTNCIEAAGVLGIDE-AFREELRQARERLLPLQVGKYGQLQEWSR 584
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
DF+D +VHHRH SHL G++PG ++ E+ P+L AA + L++RG+E GWS+ W+ ALW+
Sbjct: 585 DFEDEDVHHRHTSHLVGVYPGRQLSAEETPELFAAARQVLERRGDESTGWSLGWRVALWS 644
Query: 656 RLHDQEHAYRMVKRLFNLV-DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
R D + A R++ + LV D E E++ GG+Y++L AHPPFQID NF +A +AEML+
Sbjct: 645 RFGDGDRALRLLGNMLRLVKDGETERYNHGGVYASLLGAHPPFQIDGNFAASAGIAEMLL 704
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSF 774
QS L L LLPALP W G V+GL+ARGG VS+ W +G L E I S +
Sbjct: 705 QSHLPALVLLPALP-QAWPDGEVRGLRARGGFEVSLRWANGKLTEAEIVSTLGH------ 757
Query: 775 KTLHYRGTSVKVNLSAGKIYT 795
V+V LS G+ T
Sbjct: 758 ------ACRVRVGLSGGEPLT 772
>gi|374603684|ref|ZP_09676661.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
gi|374390787|gb|EHQ62132.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
Length = 818
Score = 591 bits (1523), Expect = e-166, Method: Compositional matrix adjust.
Identities = 326/813 (40%), Positives = 457/813 (56%), Gaps = 52/813 (6%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M T+ LK+ + PA +T+A+P+GNGR GAMV+GGV E ++LNEDTLW G P
Sbjct: 1 MATSKTARDEDLKLWYTRPADKWTEALPLGNGRFGAMVFGGVRRERIQLNEDTLWAGHPV 60
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEAT----AASVKLFGHPADVYQLLGDIELEFDDSHL 117
NP A + L + R L+ +G+YAEA V GH YQ LG++ LEFD
Sbjct: 61 SEYNPAAGELLPEARQLLHAGKYAEAMELIGTRMVGTEGHGIQPYQPLGNVYLEFDGPEA 120
Query: 118 KYAEET-------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS 170
Y+REL L A A G+ R F S DQV+V ++
Sbjct: 121 TGGAAGGKPAAPAYKRELQLKQALAVTSCQAGDSLEKRTVFVSAADQVMVVRLESDSPYG 180
Query: 171 LSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGK-------RIPP------KANANDDPKGI 217
+ VSLDS L++ + ++M GRCP + +PP A + + + +
Sbjct: 181 VRVTVSLDSRLEHSVAADDEGGLVMTGRCPQRVRNHNNSAVPPIAYDGDGAESEESGRAL 240
Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE 277
+F+ + + D + + D +LK+ G LL A++SF G P ++ P
Sbjct: 241 RFAVKMAVLEEDGETRVRCI-DNRLKIGGGRAVTLLFAAATSFRGYDRMPDEAAVPPAER 299
Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
+ L+ SY L H+ DY++LF RVS++L D D + +P+ ER
Sbjct: 300 CHAVLKEALRRSYGQLLDAHIQDYRRLFERVSLEL-----DDADDAGRK-----LPTDER 349
Query: 338 VKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
++ D + LLFQ+GRYLLISSSRPGTQ ANLQGIWN+++ P W+ H+NINL
Sbjct: 350 LRRIGAGGSDNGIYALLFQYGRYLLISSSRPGTQAANLQGIWNDEVQPPWNCDYHLNINL 409
Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSAD-R 455
+MNYW + C+L EC +PLF + L++ G+ ++V+Y GW+ H TD W +
Sbjct: 410 QMNYWLAEVCHLQECHDPLFRLMEELAVTGAAASRVHYGCGGWMAHAMTDQWRNHNVGPS 469
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYL 514
G WA WPMGGAWLC HLWEHY YT DR FL +RA+PLL G A+FLLDW++ E DG L
Sbjct: 470 GDPSWAYWPMGGAWLCRHLWEHYEYTRDRAFLAERAWPLLRGAAAFLLDWVVQEDEDGRL 529
Query: 515 ETNPSTSPEHEFIAPDG----KLAC-VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
T+PS SPE+ F+ P K C VS SS MDM I +++ + A +VL + D
Sbjct: 530 MTSPSVSPENAFLIPGAEEGEKQTCTVSQSSAMDMQIAYDLWMIVKQANDVLGLD-DTFA 588
Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCK 629
+ RL +I G +MEW +D+ + + HRHLSHL+GL+PG +E NP+L +
Sbjct: 589 RACEAAALRLPQPRIGARGQLMEWERDYAEADPKHRHLSHLYGLYPGSQFALEDNPELLR 648
Query: 630 AAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHF-EGGLYS 688
A +T++ RG+EG GWS+ WK A+WARL D +HA R++ ++++ E ++ GG+Y
Sbjct: 649 AIARTMELRGDEGTGWSMGWKMAVWARLLDGDHALRILNNFLHVIEEEGSANYHHGGIYV 708
Query: 689 NLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV 748
NLF AHPPFQID NFG A +AEML+QS ++LLPALP +W SG V+GL+ARGG TV
Sbjct: 709 NLFCAHPPFQIDGNFGAAAGIAEMLLQSH-RGIHLLPALP-RQWPSGTVRGLRARGGFTV 766
Query: 749 SICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRG 781
S+ W+DG L + D D + YRG
Sbjct: 767 SLAWRDGALAAAEVAP-----DADGECLVRYRG 794
>gi|410456476|ref|ZP_11310337.1| alpha-L-fucosidase [Bacillus bataviensis LMG 21833]
gi|409928145|gb|EKN65268.1| alpha-L-fucosidase [Bacillus bataviensis LMG 21833]
Length = 789
Score = 582 bits (1501), Expect = e-163, Method: Compositional matrix adjust.
Identities = 306/751 (40%), Positives = 436/751 (58%), Gaps = 37/751 (4%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+++ A H+T+A+P+GNGR+GAM +GGV +E +LNEDTLW+G P + +L
Sbjct: 4 LSYKKAASHWTEALPLGNGRIGAMHFGGVETERFQLNEDTLWSGPPQHKREYNDQASLKK 63
Query: 75 VRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
VR L+D +Y +A + + +FG + Y LG++ + + A + Y+R LD+NTA
Sbjct: 64 VRKLLDEEKYEDAISETKNMFGPYTESYMPLGNLFIHYLHGD---AAQKYQRTLDINTAI 120
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
+ VKY+VG + +TRE F S+P QV+ +++ S + L+ N+SLDSLL + N +
Sbjct: 121 STVKYTVGKINYTREAFISHPHQVLAIQLTSSAANQLNVNISLDSLL-KYQTANSKEALS 179
Query: 195 MEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++G CP K P N ++ P K I F L + + D S + +L ++
Sbjct: 180 LQGVCPEKCAPVYFNESETPIVYGEFGETKAIHFEGRLGLVLEDGTALTS---NGRLSIQ 236
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ VL ++SF G P ++ ++ + L ++ Y L H+ DYQ L+
Sbjct: 237 DATRVVLYFSVATSFKGYDQLPGTDFEELIQKNEAILAKAMSIPYEQLRETHIQDYQTLY 296
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RV L + SEE +DT ERV + D D +VELLF +GRYLLI+SS
Sbjct: 297 NRVGFSLG--------NKQSEEMLDT---DERVTKYSAD-DLEMVELLFHYGRYLLIASS 344
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R GTQ ANLQGIWN+ W S +NIN EMNYW + NL+EC PL + LS+
Sbjct: 345 REGTQPANLQGIWNDITRAPWSSNYTLNINTEMNYWPAEVTNLAECHRPLLQAIKELSVT 404
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G Y GW HH TD+W + G WA WPM G WLC HLWEHY Y+
Sbjct: 405 GENMVNQRYGLHGWTAHHNTDLWRHAHPVGDERHGDPNWAFWPMSGPWLCRHLWEHYQYS 464
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
DRDFLEK A+P+++G A F L+WL+E +GYL T+PSTSPEH F DG+L V+ ST
Sbjct: 465 QDRDFLEKEAFPVMKGAAQFCLEWLVEDENGYLITSPSTSPEHHFYTEDGQLGSVTKGST 524
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MD+ II ++FS I AAE+ +E+ +++V ++ RL P +I + G + EW D++D E
Sbjct: 525 MDLQIIWDLFSNCIEAAEICGVDEE-WIQQVREAKDRLHPNQIGKYGQLQEWLMDYEDAE 583
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
+HHRH+SHL+G++PG+ IT +AA +TL +RG+ G GWS+ WK LWARL D E
Sbjct: 584 LHHRHVSHLYGVYPGNQIT---EGSFLEAARQTLNRRGDAGTGWSLGWKICLWARLKDGE 640
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
++ +LF + + E GGLY NL AHPPFQID NF +TA VAEM++QS +
Sbjct: 641 RVNALLHQLFKICTAKREVFVGGGLYPNLLGAHPPFQIDGNFSYTAGVAEMIIQSHKGYV 700
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICW 752
LLPALP W G + G++ RGG +I W
Sbjct: 701 ELLPALP-STWLQGSLSGVRVRGGFETNISW 730
>gi|325103197|ref|YP_004272851.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324972045|gb|ADY51029.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 868
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 322/796 (40%), Positives = 467/796 (58%), Gaps = 55/796 (6%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+S S N L + + P+K + +A+PIGNG GAMV+GGV E +LN TLW+G P
Sbjct: 20 AQSKSDPN-LVLWYKEPSKIWEEALPIGNGFQGAMVFGGVGKERFQLNNGTLWSGFPNPG 78
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEE 122
NP P AL VR +D G YA+A K P Y + D+ L+F+ H +
Sbjct: 79 NNPKGPAALPQVRKAIDDGDYAKAAEIWKKNNQGPYSARYLTMADLYLDFN--HKDSDVQ 136
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y+R LDLN+A V Y VG V + RE SNPD+V+ +++ + +LSF L S L
Sbjct: 137 AYKRSLDLNSAVHTVTYKVGGVTYKRETLMSNPDKVMAIRLTADKKNALSFTTDLISKLK 196
Query: 183 NHSYVNGNNQIIMEGRCPGKRI------PPKANANDDPKGIQFSAILEIKISDDRGTISA 236
+ G N +I++G+ P K + P + +++ +G+ F + +K+ ++ GT+
Sbjct: 197 YKTNAVGQNALILKGKAP-KHVAHRPTEPEQIIYDENGEGMTFE--VHLKVLNEGGTVKT 253
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+ +K + V+ ++ + L + +SF+G +P+ + K+P+ E+ + L + Y +
Sbjct: 254 VGNK-ITVQNANAVTIYLSSGTSFNGFDKSPTIAGKNPSIEASANLAAAVGKKYDVMKQA 312
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQ 355
H+ DY KLF+RV ++L P ++ +P+ R+ + Q D L L FQ
Sbjct: 313 HIADYSKLFNRVVLKLGNRP-----------DLANLPTNIRLSRQGQKGNDQELQVLYFQ 361
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYL+ISSSRPG+Q NLQG+WN+ + P W S VNIN EMNYW + NLSE PL
Sbjct: 362 FGRYLMISSSRPGSQATNLQGLWNDHVQPPWGSNYTVNINTEMNYWLAENTNLSELHYPL 421
Query: 416 FDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGG 467
FDFL L++NG +TA++NY + GWV+HH TDIWAK+S +G W+ WPMGG
Sbjct: 422 FDFLERLAVNGKETAKINYNINKGWVLHHNTDIWAKTSPTGGYDWDPKGSPRWSAWPMGG 481
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
AWL THL++HY +T D+ FL+++AYPL++G A FLL WL+ GYL TNPSTSPE+ F
Sbjct: 482 AWLSTHLYDHYLFTGDKRFLKEKAYPLMKGAAEFLLAWLVPDQSGYLITNPSTSPENTFT 541
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
+ K +S +TMD+ I+ E+F+A I +A+ L+ + + V+++ + +L P +I +
Sbjct: 542 I-NKKQYEISKGTTMDLGIMLELFNACIQSAKALDTDAN-FVKQLEAAKAKLYPYQIGKY 599
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
G + EW D DP+ HRH+SHL+GL+PG+ IT+E P+L AA+++L RG+ GWS+
Sbjct: 600 GQLQEWFFDIDDPKDTHRHISHLYGLYPGNQITLETTPELAAAAKQSLIHRGDVSTGWSM 659
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHE-----KHFE-------------GGLYSN 689
WK WARL D HA +++K L+DP KH GG Y N
Sbjct: 660 AWKINWWARLQDGNHALKILKDGLTLIDPAKTAEGDGKHSAGVNQQLTNVQMSGGGTYPN 719
Query: 690 LFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
L AHPPFQID NFG TA + EML+QS L+LLPALP D+W G VKG+K+RG TV
Sbjct: 720 LLDAHPPFQIDGNFGATAGIIEMLLQSHNGALHLLPALP-DEWKEGAVKGIKSRGNFTVD 778
Query: 750 ICWKDGDLHEVGIYSN 765
+ W L + I SN
Sbjct: 779 MEWNQNKLVKSVILSN 794
>gi|255532589|ref|YP_003092961.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
gi|255345573|gb|ACU04899.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
Length = 868
Score = 577 bits (1487), Expect = e-161, Method: Compositional matrix adjust.
Identities = 320/791 (40%), Positives = 466/791 (58%), Gaps = 55/791 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK + +A+P+GNG+ GAMV+G V E +LN++TLW+G P NP P L
Sbjct: 29 LKLWYTQPAKVWEEALPLGNGKTGAMVFGRVNKERFQLNDNTLWSGSPEAGNNPKGPANL 88
Query: 73 SDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
VR V G YA A A K L G + Y + D+ L+F+ LK + T Y RELD+
Sbjct: 89 PLVRQAVFEGDYARAAALWKKNLQGPYSARYLTMADLFLDFN---LKDSIPTAYHRELDI 145
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A + V Y+VG + + RE S PD+ +V +I+ + +L+F+ S+ S L + G
Sbjct: 146 DNAISTVTYTVGGITYKRESLISYPDKAVVIRITTDQKNALNFSTSISSKLKYTARAVGA 205
Query: 191 NQIIMEGRCPGKRIPPKAN-----ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ ++++G+ P K + +A DD +G+ F ++++I + GT +A + ++ V
Sbjct: 206 DLLVLKGKAP-KHVAHRATEAAQVVYDDKEGMTFE--VDVRIKAEGGTTTA-KGTEILVS 261
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
++ + L ++SF+G +P K+P +E+ L+ + YS + T H+ DY+ LF
Sbjct: 262 KANAVTIYLSGATSFNGYNKSPGLEGKNPATEAAGILKKVYPKPYSTIKTAHVADYKALF 321
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISS 364
RVS L S ++ +P+ R+ + D L L +QFGRYL+I+S
Sbjct: 322 DRVSFSLG-----------SNAELEGLPTNVRLSRQGAMGNDQGLQVLYYQFGRYLMIAS 370
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+Q NLQGIWN+ + P W S VN N +MNYW + NLSE +PLFDF+ +++
Sbjct: 371 SRPGSQATNLQGIWNDHVQPPWGSNYTVNANTQMNYWLAEQTNLSELHQPLFDFIGRMAV 430
Query: 425 NGSKTAQVNY-LASGWVIHHKTDIWAKSSAD-------RGKVVWALWPMGGAWLCTHLWE 476
NG+KTA++NY + GWV+HH TDIWAKSS +G W+ WPMGGAWL THL++
Sbjct: 431 NGAKTAKINYDIRQGWVVHHNTDIWAKSSPTGGYDWDPKGAPRWSAWPMGGAWLTTHLYD 490
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLAC 535
HY +T D+ FL+++ YPL++G A F+L WL++ YL TNPSTSPE+ F +GK
Sbjct: 491 HYLFTGDKQFLKEKGYPLMKGAAEFMLKWLVKDDKTEYLVTNPSTSPENIFKI-EGKEYE 549
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
VS ++TMDM II+E+F+ I+A+++L+ + D VE + K+ +L P I G + EW
Sbjct: 550 VSKATTMDMGIIKELFTDCIAASKILDMDADFRVE-LEKAKAKLYPFNIGRYGQLQEWFN 608
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D DP+ HRHLSHLF L+PG+ IT+ P+L AA+++L RG+ GWS+ WK WA
Sbjct: 609 DVDDPKDSHRHLSHLFALYPGNQITVYHTPELAAAAKQSLLHRGDLSTGWSMAWKINWWA 668
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFE-----------------GGLYSNLFAAHPPFQ 698
RL D HA +++K L+DP + GG Y NLF AHPPFQ
Sbjct: 669 RLQDGNHALKILKAGLTLIDPAKTTEPQKGPSASMAQLTNVQMSGGGTYPNLFDAHPPFQ 728
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG TA + EML+QS ++L LLPALP D W G +KG+KARG V I W +G L
Sbjct: 729 IDGNFGATAGMTEMLLQSNTDELSLLPALP-DDWEKGSIKGIKARGNFRVDISWAEGKLS 787
Query: 759 EVGIYSNYSNN 769
+ IYS N
Sbjct: 788 KALIYSGSGGN 798
>gi|182419971|ref|ZP_02951207.1| twin-arginine translocation pathway signal [Clostridium butyricum
5521]
gi|237666001|ref|ZP_04525989.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Clostridium butyricum E4 str.
BoNT E BL5262]
gi|182376222|gb|EDT73807.1| twin-arginine translocation pathway signal [Clostridium butyricum
5521]
gi|237658948|gb|EEP56500.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Clostridium butyricum E4 str.
BoNT E BL5262]
Length = 799
Score = 576 bits (1484), Expect = e-161, Method: Compositional matrix adjust.
Identities = 319/804 (39%), Positives = 464/804 (57%), Gaps = 56/804 (6%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDA 68
+ L++ + PA+ + +A+P+GNGR+GAMV+GGV E L+LNEDTLW+GVP + T+ +
Sbjct: 2 NDKLRLWYTKPAEKWVEALPLGNGRIGAMVFGGVYRERLQLNEDTLWSGVPITEETDENF 61
Query: 69 PKALSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
L R L+ G+Y ++ + KL G + Y LG++ +FD+ Y + Y R+
Sbjct: 62 IDDLEKARKLIFEGKYCKSENIINNKLLGPWNESYLPLGNLYFDFDNEG-DYVD--YERD 118
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L+L A++ VKY++ N+ + R F S D IV K S+ G +SF S DSLL
Sbjct: 119 LNLEDASSCVKYTMNNIRYKRTTFISKSDNAIVIKFESSKEGKISFKASFDSLLRYTVVT 178
Query: 188 NGNNQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N I + G+ P +P + DD +G+ F A+LE+ + G I + E+ L
Sbjct: 179 ENKNSISLLGKAPIHVLPSYEDGEKPVIYDDKRGMNFKAVLEV--NGINGDIKS-ENGIL 235
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
KV+ +D ++ +V +SF+G KD +++Q IR+ +Y +LY H +Y+
Sbjct: 236 KVKDADEVIIKIVVHTSFNGYKNEAGTQGKDVNDLCENSIQKIRDKTYVNLYNAHKIEYK 295
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLL 361
LF R+ L+ D ++ P+ +R+++F+ ++ D L+ L FQ+GRYLL
Sbjct: 296 SLFDRLQFTLNSDFTD-----------NSTPTDKRIENFKENKNDLGLISLYFQYGRYLL 344
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR GTQ ANLQGIWNEDL P W S NINLEMNYW + CNL EC EPLF F+
Sbjct: 345 ISSSRKGTQPANLQGIWNEDLRPAWSSNYTTNINLEMNYWLAEVCNLQECHEPLFKFIRE 404
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+S G +TA++ Y GW +H D+W ++S G WA WPM GAWLC+H+WEHY +T
Sbjct: 405 VSEVGKETAKIRYNCRGWTANHNIDLWRQTSPAGGSTEWAYWPMAGAWLCSHIWEHYEFT 464
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D FL K YP+++ CA FL+DWL+E +GYL T PS SPE+ FI +G+ +CVS +ST
Sbjct: 465 NDVKFL-KEMYPIMKSCAEFLVDWLMEDENGYLVTCPSISPENNFITEEGEKSCVSIAST 523
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MDM+I + +F I AA +LE ++ E + L P KI + G + EW +DF++ E
Sbjct: 524 MDMSITKNLFKNCIDAANILEIDKKFRSE-LKNYYNNLYPYKIGKFGQLQEWFKDFEEFE 582
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLH 658
HRHLSHLFGL+PG+ I + N ++ +A K+L++R G GWS +W L+ARL
Sbjct: 583 KGHRHLSHLFGLYPGNEINEDNNKEIFEACRKSLERRLTYGGGHTGWSCSWAVCLFARLK 642
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D E A + ++ L + +SNL PPFQID NFG TAA++EML+QS
Sbjct: 643 DSESANKYLEILLKKLT-----------FSNLLNVCPPFQIDGNFGGTAAISEMLIQSNK 691
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
+ +LP +P +W G VKG+KARGG + W G + E+ I SN L
Sbjct: 692 GYIEILPCIP-KEWKQGNVKGIKARGGFELDFEWNKGYIKEIYIKSN-----------LE 739
Query: 779 YRGTSVKVNLSAGKIYTFNRQLKC 802
Y +K+N K+Y+ +LKC
Sbjct: 740 YGICKIKLNTKIIKLYS---KLKC 760
>gi|298246864|ref|ZP_06970669.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
gi|297549523|gb|EFH83389.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
Length = 809
Score = 575 bits (1482), Expect = e-161, Method: Compositional matrix adjust.
Identities = 321/770 (41%), Positives = 454/770 (58%), Gaps = 61/770 (7%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ + PA + +A+P+GNG LGAMV GG+ E L+LNEDTLW+G P D NPDA
Sbjct: 15 PLKLWYRQPATQWLEALPVGNGHLGAMVHGGISEEVLQLNEDTLWSGEPYDTDNPDAVTH 74
Query: 72 LSDVRSLV-DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L ++R L+ + Y A + ++ G + YQ LG + L+F+ + + Y+R LDL
Sbjct: 75 LPELRRLILEERDYVAAQELAHRMQGPYNESYQPLGYVRLKFEQ---RGEVQAYQRALDL 131
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
NTA A V+Y G++ F+RE FSS D ++V +++ +LS L+SL G+
Sbjct: 132 NTALATVQYKAGDILFSREVFSSAADDLLVIRLTSDTPHALSLTAHLESLHPFTCAPAGS 191
Query: 191 NQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKK 241
N+I M GRCP + + P + DP G++F L+ + + G ISA D
Sbjct: 192 NKIRMTGRCP-RHVDPDYLSTSDPVIYDHGEDGHGMRFETQLQAMV--EGGRISADVDGA 248
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+VE + L A++S+ G P S + + L + + Y L H++DY
Sbjct: 249 LRVENAHAVTFFLSAATSYRGFASRPDLSAHVLEQQCTTRLAAGMSKGYEVLRAAHINDY 308
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
Q+LF RV++ L S + +P+ ER+ + Q D +L+ L FQ+GRYL
Sbjct: 309 QQLFQRVTLDLGTS------------DGQELPTDERLAAVQKGASDDALLALYFQYGRYL 356
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI+SSRPGTQ ANLQGIWN+ + P W S +NIN +MNYW + CNL+EC PLFD L
Sbjct: 357 LIASSRPGTQSANLQGIWNDHVRPAWSSNYTININTQMNYWLAETCNLAECHSPLFDLLE 416
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEH 477
S++G +TAQV Y GWV HH D+W ++ G WA W MGGAWLC HLWEH
Sbjct: 417 EASVSGERTAQVYYGCRGWVAHHNMDLWRNTAPVGNGSGGPQWANWNMGGAWLCQHLWEH 476
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y ++ DR FL +RAYP+++ A FLLD+L+E G+L T PST+PE+ FI G+L+ VS
Sbjct: 477 YAFSGDRSFLSQRAYPIMKKAAQFLLDFLVEDKQGHLTTCPSTAPENLFITESGELSGVS 536
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
STMD+AI E+F+ I+A++VL+ ++ ++ ++L RL I G + EW +DF
Sbjct: 537 AGSTMDIAITHELFTHCIAASQVLDIDQ-GFAHELAQALARLPQPGIGSYGQLQEWNEDF 595
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALW 654
+ E HRH+SHL+GL+PG IT+EK P+L +AA K+L++R G G GWS W +ALW
Sbjct: 596 AEHEPGHRHMSHLYGLYPGEQITLEKTPELLQAARKSLERRLEHGGGGTGWSQAWVSALW 655
Query: 655 ARLHD----QEHAYRMVK-----RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
ARL + EH +++K LF+L+D L S L FQID NFG
Sbjct: 656 ARLGEGDLAHEHMIQLLKYSTAANLFDLID----------LQSPLI-----FQIDGNFGA 700
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
TAA+AEMLVQS ++L +LPALP W+ G V+GL+ARGG V + W +G
Sbjct: 701 TAAIAEMLVQSHADELAILPALP-HTWNEGYVRGLRARGGLEVDVEWNNG 749
>gi|261406875|ref|YP_003243116.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261283338|gb|ACX65309.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 802
Score = 574 bits (1480), Expect = e-161, Method: Compositional matrix adjust.
Identities = 317/782 (40%), Positives = 451/782 (57%), Gaps = 55/782 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + DA+ +GNGRLG MV+GG+ E + LNEDTLW+G P D N +A L V+
Sbjct: 16 YRNPAAEWVDALAVGNGRLGGMVYGGIFRERISLNEDTLWSGHPYDPNNREAAAYLETVQ 75
Query: 77 SLVDSGQYAEATAA-SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
LV G+Y EA + G ++ YQ LGD+ LE +++ E YRRELDLN A
Sbjct: 76 KLVFEGKYPEAQRTIEEHMLGPWSESYQPLGDLYLELEETG---KAEHYRRELDLNDAVC 132
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
R ++++ V + RE F S DQV+V + + + G ++ + SLDS L + + +++ M
Sbjct: 133 RTRFTLNGVRYVRETFVSAVDQVMVVRFTADQPGRIAVSASLDSQLRHQALRVSADKLAM 192
Query: 196 EGRCPGKRIPPKANAND-----DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+GR P P A +ND + +GI+F A ++ + G + + ++++EG+D
Sbjct: 193 KGRSPSHVEPLHARSNDPVIYEEGRGIRFEA--QLLALPEGGATTEDGEGRIRIEGADAV 250
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
LL AS+SF+G NP ++P S L + LSY +L RH+ DY+ L+ RV +
Sbjct: 251 TFLLAASTSFNGFDKNPVLEGRNPAELCRSCLDAAAKLSYGELLDRHVQDYRALYGRVEL 310
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGT 369
+L +P + +P+ ER+++ + D+ D L L FQFGRYLL+SSSRPGT
Sbjct: 311 ELD-AP-----------GLQHLPTDERIRALREDKTDEQLAVLFFQFGRYLLLSSSRPGT 358
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ + P W VNIN +MNYW + CNL+EC EPLF L L I G +T
Sbjct: 359 QAANLQGIWNQSMRPPWSCNYTVNINTQMNYWPAEVCNLAECHEPLFRLLEDLRIAGRET 418
Query: 430 AQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
A +Y A GWV HH D+W ++ G WA WPMGGAWL H+WEHY + DR
Sbjct: 419 ASAHYKARGWVSHHAVDLWRITTPSGGPSGGPASWAYWPMGGAWLSQHVWEHYRFGGDRT 478
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL + YP+++ A F LD+L+E DGYL +NPSTSPE+ F PDG+ A VS +TMD+A
Sbjct: 479 FLSQVGYPIMKEAALFFLDYLVEDADGYLVSNPSTSPENTFALPDGRKAAVSMDATMDIA 538
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
++RE+F + A++ L + + +E + + RLRP +I G + EW DF++ E HR
Sbjct: 539 LLRELFGNCMEASDHLGIDRELRLE-LAAARARLRPFQIGRRGQLQEWFSDFEEAEPGHR 597
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKT----LQKRGEEGPGWSITWKTALWARLHDQE 661
H++HL+ L PG + + P+L A + LQ GE+ GW W +L+ARL D E
Sbjct: 598 HMAHLYPLHPGSELDHRRTPELANACRVSIDLRLQHEGEDAVGWCFAWLISLFARLDDGE 657
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP-------PFQIDANFGFTAAVAEMLV 714
A+R + +L L +P + NLF AH P I+AN G TA +AEML+
Sbjct: 658 MAHRYLTKL--LKNP----------FDNLFNAHRHPMLTFYPLTIEANLGATAGIAEMLL 705
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSF 774
QS +L LLPALP + W G V GL+ARGG TVS+ W D L E I S +N +H
Sbjct: 706 QSHAGELNLLPALP-EAWKGGRVSGLRARGGFTVSLAWTDRALSEAVIAS--ANGEHCRI 762
Query: 775 KT 776
+T
Sbjct: 763 RT 764
>gi|224539148|ref|ZP_03679687.1| hypothetical protein BACCELL_04050 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519233|gb|EEF88338.1| hypothetical protein BACCELL_04050 [Bacteroides cellulosilyticus
DSM 14838]
Length = 822
Score = 573 bits (1477), Expect = e-160, Method: Compositional matrix adjust.
Identities = 330/774 (42%), Positives = 452/774 (58%), Gaps = 36/774 (4%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-DYTNPDAPKA 71
L++ + PA + +A+P+GNG +GAMV+G V +E ++LNE TLWTGVP NPDA
Sbjct: 24 LRLWYEKPANTWVEALPLGNGYIGAMVYGKVENELIQLNEGTLWTGVPCVKSVNPDAYSY 83
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELE--FDDSHLKYAEETYRRELD 129
LS++R + +A A S K+ G+ + + LGD+E++ F D Y Y+RELD
Sbjct: 84 LSEMREALSRDDFAAAGTLSKKMQGYFSQSFLPLGDLEIKQSFGDRKAWYL--GYKRELD 141
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
LN A + G V++ RE F+S PD+V+V + + S+ G L+ + + S L + G
Sbjct: 142 LNEAILTTSFWEGGVQYVREMFTSAPDRVMVLRFTASQKGKLALDFTTKSRLSDAVEALG 201
Query: 190 NNQIIMEGRCPGKRIPPKANAN----------DDPKGIQFSAILEIKISDDRGTISALED 239
+N + M+G P + P N + G++F ++L K GT++ +
Sbjct: 202 DNCLAMDGAAPARLDPAYYNRKGREPMMRVDENGCSGMRFRSLL--KAIPVGGTVTT-DK 258
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
K + + G+D +++ A++SF+G P+ KD + L S+ +L H+
Sbjct: 259 KGIHINGADEILVIWTAATSFNGFDKCPACEGKDEKMLAGQYLAKASIKSFDELKDSHIR 318
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGR 358
D+ F RVS+QL TDT + +PS R+K + + DP L ELLFQ+GR
Sbjct: 319 DFASYFERVSLQL--------TDTVGSKVNAQLPSDFRLKLYSYGNYDPQLEELLFQYGR 370
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSR G ANLQGIWN+D P W S +NIN EMNYW + NLSE PL +
Sbjct: 371 YLLISSSRLGGTAANLQGIWNKDFRPPWSSNYTININTEMNYWLAETTNLSEMHTPLLSW 430
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHL 474
+ LS G TA+ Y A GWV HH +DIW S + G WA W MGG WLC HL
Sbjct: 431 IKDLSKAGRATAKEFYHAKGWVAHHNSDIWGLSNPVGNKGDGSPEWANWTMGGNWLCQHL 490
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WEHY +T D+ FL AYP+++ A F LDWL+E D YL T+PS SPE+ F+ DGK
Sbjct: 491 WEHYCFTGDKQFLADEAYPVMKEAALFCLDWLVERGD-YLITSPSVSPENLFVV-DGKKY 548
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
VS +STMDMAIIR++FS +I A+EVL + ++++ + +L P +I G + EW+
Sbjct: 549 AVSEASTMDMAIIRDLFSNLIEASEVLNIDRK-FRKQLVTAKNKLFPYQIGAKGQLQEWS 607
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
+D+ + + HHRHLSHLFGL PG I+ P+L KAA+KT + RG++G GWS WK
Sbjct: 608 KDYVENDPHHRHLSHLFGLHPGRDISPLLTPELAKAAQKTFELRGDDGTGWSKGWKINFA 667
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL D HAY+M++ + VDP + GG Y N F AHPPFQID NFG TA VAEML+
Sbjct: 668 ARLLDGNHAYKMIREIMRYVDPTLNTN-HGGTYPNFFDAHPPFQIDGNFGATAGVAEMLL 726
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
QS L +L+LLPALP W SG VKGLKARG V I W+ G L I SN N
Sbjct: 727 QSHLKELHLLPALP-VVWPSGKVKGLKARGNFEVDIVWEKGTLKSARIRSNLGN 779
>gi|403743768|ref|ZP_10953247.1| aliphatic sulfonates family ABC transporter, periplasmic
ligand-binding protein [Alicyclobacillus hesperidum
URH17-3-68]
gi|403122358|gb|EJY56572.1| aliphatic sulfonates family ABC transporter, periplasmic
ligand-binding protein [Alicyclobacillus hesperidum
URH17-3-68]
Length = 804
Score = 573 bits (1476), Expect = e-160, Method: Compositional matrix adjust.
Identities = 322/764 (42%), Positives = 439/764 (57%), Gaps = 34/764 (4%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP-DAPKA 71
+ + + PA +TDA+PIGNGRLG MV+GG+ E + LNEDTLW+G P P A +
Sbjct: 6 VALWYEKPAVAWTDALPIGNGRLGGMVFGGIEHERIHLNEDTLWSGYPRTLAVPRKAEET 65
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L VR LV +G+Y EA AS L G ++ Y LG +EL F+ L + YRR LDL
Sbjct: 66 LRQVRELVLAGRYQEAHEASRGLSGPYSESYLPLGWLELVFEHGDLAH---DYRRSLDLR 122
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA A V Y +G +FTRE F S+PD+ +V ++ L+F + + S L H+
Sbjct: 123 TAVATVSYRIGRTQFTREMFVSHPDEAMVIHLTADGPLPLAFTLCMGSKL-RHAIAEMAG 181
Query: 192 QIIMEGRCPGKRIPP--------KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + G+ P P + A DDP+ I+F+A + + D GT++ D L+
Sbjct: 182 DLALTGQAPIHVAPSYEVDDHPIQYAAPDDPRPIRFAARITVARCD--GTVAWCGDG-LR 238
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+EG+ LLL A ++F + P D D ++ L +R +++L +RH+ D+Q+
Sbjct: 239 IEGATRVTLLLGAGTNFRSFALRP-DEALDVSANLGRQLADLRTTPFAELKSRHVADHQR 297
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF RV L+ D E +P+ E + + LVELLF +GRYLLI+
Sbjct: 298 LFDRVEFVLADPRPD------ENEGYRDLPTDELIARYGVHAK-RLVELLFHYGRYLLIA 350
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPGTQ ANLQGIWN+ P W S +NIN EMN+W CN+ EC EPL + L+
Sbjct: 351 SSRPGTQPANLQGIWNDATRPPWSSNLTLNINAEMNFWPVEVCNIGECHEPLLRMIGELA 410
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYN 479
G + A+ Y GWV HH TDIW + A RG W++WPM G WLC HLWEHY
Sbjct: 411 QTGREVAK-RYGCRGWVAHHNTDIWRMAHAAGGDGRGDPSWSMWPMAGPWLCAHLWEHYL 469
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
++ D FL+ AYPL+ A F +DWL G PSTSPEH F+ DG+ A VS S
Sbjct: 470 FSRDHAFLQNVAYPLMRDAALFCIDWLASDPSGRGLAIPSTSPEHHFVTQDGQKAAVSAS 529
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
STMD+ ++RE+FS I AA L + + E RLRP +I DG + EW +D++D
Sbjct: 530 STMDVMLMRELFSHCIEAASTLGVDAELSAEWAAWQ-ERLRPLRIGRDGRLQEWMEDWQD 588
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
E HRHLSHL+ L+PG+ +T L +AA K+L RGE G GWS+ WK L+ARL +
Sbjct: 589 GEPQHRHLSHLYALYPGYQLTEPDCAKLREAARKSLIDRGESGTGWSLAWKVCLFARLGE 648
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
A+R++ ++ LV E + E GG+Y NLF AHPPFQID NFG A +AEMLVQS
Sbjct: 649 GNAAWRLLGKMLTLV--EDTAYGEGGGVYRNLFDAHPPFQIDGNFGVIAGIAEMLVQSHR 706
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
++++LPALP D W G V+GL+ RGG T+ I W+ G H V +
Sbjct: 707 GEIHVLPALP-DAWPRGRVRGLRCRGGYTIDIAWEGGRWHTVAL 749
>gi|255035537|ref|YP_003086158.1| hypothetical protein Dfer_1752 [Dyadobacter fermentans DSM 18053]
gi|254948293|gb|ACT92993.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 833
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 322/794 (40%), Positives = 456/794 (57%), Gaps = 39/794 (4%)
Query: 1 MMNAESTSTT----NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLW 56
++NA ST LK+ ++ PA + +A+P+GNG +GAMV+GGV E ++LNE TLW
Sbjct: 12 LLNALSTDVIAQKGQDLKLWYSKPASRWVEALPVGNGHIGAMVFGGVEEELMQLNESTLW 71
Query: 57 TGVP-GDYTNPDAPKALSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD 114
+G P NP + L VR +L++ Y +A K+ G + Y + D+++ D
Sbjct: 72 SGGPVKTNVNPASASYLPQVRKALLEEQDYQKANELLKKMQGLYTESYMPMADLKIVHD- 130
Query: 115 SHLK-YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
LK Y R+LD+ + A ++S G V++ RE F+S PD ++V K+S S+ +L+F
Sbjct: 131 --LKGQPASAYYRDLDIAHSKATTRFSAGGVDYKREVFTSAPDNIMVIKVSASKPNALNF 188
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA-------NDDPKGIQFSAILEIK 226
VSL S L +GN ++++ G+ P P N DDP G +
Sbjct: 189 TVSLSSQLRYRLEASGNKELLVNGKAPSHVAPNYYNPPGQEPIIYDDPNGCNGTRFQIRT 248
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
+ RG + ++ + V+ + V+ L A++SF+G P KD + + + L
Sbjct: 249 KAVSRGGTTVVDTAGIHVKNATEVVIFLSAATSFNGFDKCPDKDGKDEKALAKNYLDKAL 308
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDE 345
Y+ L T H DY F+RVS VTDT + +PS ER+ ++ + D
Sbjct: 309 AKGYATLATSHQHDYHSYFNRVSFS--------VTDTLTRNPNTALPSDERLMAYAKGDY 360
Query: 346 DPSLVELLFQFGRYLLISSSR------PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
DP L L +QFGRYLLISSSR P ANLQGIWN+++ P W S +NIN +MN
Sbjct: 361 DPGLETLYYQFGRYLLISSSRAALPGVPAGPPANLQGIWNKEMRPPWSSNYTININTQMN 420
Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS----ADR 455
YW + NLSE PL ++ LS G+ TA+ Y A GWV HH DIW S+
Sbjct: 421 YWPAEVANLSEMHRPLLSWIKDLSQTGAVTAKEFYDAKGWVAHHNADIWGMSNPVGNVGD 480
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
G VWA W MG WLC HLWEHY ++ D+ FL + YPL++ A F LDWL+E DGYL
Sbjct: 481 GDPVWANWYMGANWLCQHLWEHYRFSGDKAFLRDKGYPLMKEAALFTLDWLVEDKDGYLV 540
Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
T PSTSPE++F P G A VS ++TMD++II ++FS +I AAEVL +ED + +++
Sbjct: 541 TAPSTSPENKFKDPKGGEAAVSVATTMDISIIHDLFSNLIDAAEVLGTDED-FRKLLIEK 599
Query: 576 LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+L P KI G + EW +DF++ + HRH+SHLF L PG I+ E P+ +AA+KTL
Sbjct: 600 RAKLYPLKIDGRGRLQEWYKDFEETDTLHRHVSHLFALHPGRRISPE-TPEFFQAAKKTL 658
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
+ RG+ G GWS WK WARL D +HAY ++++L + + ++ GG Y N F AHP
Sbjct: 659 EVRGDHGTGWSKGWKINFWARLLDGDHAYLLIRQLMKYTNEGNSEYRGGGTYPNFFDAHP 718
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NF TA ++EML+QS LN++YLLPALP + W G VKGL+ARGG V++ WK+G
Sbjct: 719 PFQIDGNFAGTAGMSEMLIQSHLNEVYLLPALP-NAWKHGQVKGLRARGGFEVTMNWKNG 777
Query: 756 DLHEVGIYSNYSNN 769
L + S NN
Sbjct: 778 KLANASVKSENGNN 791
>gi|332668180|ref|YP_004450968.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332336994|gb|AEE54095.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 861
Score = 572 bits (1475), Expect = e-160, Method: Compositional matrix adjust.
Identities = 315/796 (39%), Positives = 458/796 (57%), Gaps = 53/796 (6%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNP 66
+ N L++ ++ PA +T+A+PIGNG +GAMV+G E L+LNE TL++G P G +T+
Sbjct: 17 AQNNHLQLWYDQPASVWTEALPIGNGYMGAMVFGDPLQEHLQLNEGTLYSGDPKGTFTSI 76
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
+ KA V +L+++ +Y EA K G +YQ +GD+ L D H K + + Y+
Sbjct: 77 NVRKAYPQVTALLEAKKYQEAQPLITKEWLGRNHQMYQPMGDLWL--DVEHDKSSIKAYK 134
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL TATA +Y G+ + R +F+S PD V+V K++ + G + N +L + +
Sbjct: 135 RGLDLQTATAFTEYQSGSTTYRRTYFTSYPDHVLVMKMTATGPGKI--NCTLRQSTPHTA 192
Query: 186 ---YVNGNNQIIMEGRCPG---------------------------KRIPPKANANDDPK 215
Y+ N + M+ R PG +R P AN D +
Sbjct: 193 PAKYLGQGNVLRMQSRAPGFALRRNFDLVEKLGDQHKYPELYEKTGERKPGAANFLYDQQ 252
Query: 216 --GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
G+ + +K+ GTIS + D K++V+ + V++L A++S++G +P+ KD
Sbjct: 253 IEGLGMAFESRLKVIHTGGTISNV-DGKIRVQNATELVIILSAATSYNGFDKSPAYEGKD 311
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P + ++I N +S LY RHL DYQ LF RV I L+ +E +P
Sbjct: 312 PAKLLDTYFRAIDNKPFSTLYQRHLLDYQNLFKRVEINLA-----------AETEQSKLP 360
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ RV+ F +DP+ L FQFGRYL+I+ SRPG Q NLQGIWN+ L+P W+ A +N
Sbjct: 361 TDRRVELFSNGQDPAFAALYFQFGRYLMIAGSRPGGQPLNLQGIWNDQLTPPWNGAYTIN 420
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
IN +MNYW + NL+ECQEP F + L+ING +TA+ Y +GWV HH DIW + +
Sbjct: 421 INAQMNYWPAEITNLAECQEPFFKAIKELAINGRETARNMYGNAGWVAHHNMDIW-RHAE 479
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
+ WPMGG WL +HLWEHY ++ D+ FL+ +PLL+G F WL++ GY
Sbjct: 480 PIDNCACSFWPMGGGWLVSHLWEHYLFSGDQQFLKNEVFPLLKGVVDFYQGWLVKNEAGY 539
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
L T SPE F+ K A S TMDMAI+RE F+ + AA+VL D V+ V
Sbjct: 540 LVTPVGHSPEQNFVYEGNKQATYSPGPTMDMAIVREAFARYLEAAQVLGV-ADKSVDSVR 598
Query: 574 KSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
++L +L P +I + G + EW+ DF+D +V HRH+SHL+ + PG+ I + NP+L A ++
Sbjct: 599 QNLAKLLPYQIGKYGQLQEWSADFEDGDVQHRHISHLYAIHPGNQINAQTNPELTAAVKR 658
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
+++RG+ GWS+ WK +WARL+D +HA +++ LF L+ GG Y NLF A
Sbjct: 659 VMERRGDFATGWSMGWKVNIWARLYDGDHALKLMTNLFKLIRSNVTTMQGGGTYPNLFDA 718
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
HPPFQID NFG TA +AEMLVQS +++LLPALP + W +G VKGLKARGG V + W
Sbjct: 719 HPPFQIDGNFGATAGIAEMLVQSHAGEIHLLPALP-EAWHTGKVKGLKARGGFVVDMEWA 777
Query: 754 DGDLHEVGIYSNYSNN 769
+G L + I S N
Sbjct: 778 NGKLTQATIRSTLGGN 793
>gi|256424518|ref|YP_003125171.1| alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
gi|256039426|gb|ACU62970.1| Alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
Length = 841
Score = 572 bits (1474), Expect = e-160, Method: Compositional matrix adjust.
Identities = 319/773 (41%), Positives = 451/773 (58%), Gaps = 40/773 (5%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAP 69
N LK+ + PA ++ A+P+GNGR+GAMV+GG E ++LNE TLW+G P NP A
Sbjct: 38 NNLKLWYKEPAIEWSQALPLGNGRVGAMVFGGTSEELIQLNEATLWSGGPVSKQVNPAAA 97
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL--EFDDSHLKYAEETYRRE 127
L VR+ + S +Y EA + K+ G + + LGDI + + D+ + Y R+
Sbjct: 98 SYLPAVRAALFSEKYHEADSLLRKMQGAFSQSFLPLGDIRIHQQLKDTLV----SQYSRD 153
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LD+ A + ++ G + +TRE F S PDQVIV ++ S+ G+L F S L + V
Sbjct: 154 LDIANAKSITRFVSGGITYTRELFISAPDQVIVIRLRSSKKGALQFKADPSSQLHYQNSV 213
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALE 238
G +I M G+ P + P N N +P KG+++ L ++ GT++ +
Sbjct: 214 TGAKEIAMRGKAPSQVDPSYINYNAEPIQYEAAGSCKGMRYE--LRMRAISPDGTVTT-D 270
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ V+ + A+LLL A++SF+G P D + + ++ LSY++L RH
Sbjct: 271 ATGITVKNATEAILLLTAATSFNGFDKCPDSEGLDEKAIAGGQMKKAAALSYANLLQRHE 330
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFG 357
DY K F+RVS+ LS ++ P+ ER++ + +D +L L FQFG
Sbjct: 331 QDYHKYFNRVSLNLS------------GDDQSAQPTDERLRRYTAGGKDQALESLYFQFG 378
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLIS SR + ANLQGIWN++L W S +NIN +MNYW + CNL E Q+PL+
Sbjct: 379 RYLLISCSRTPSAPANLQGIWNKELRAPWSSNYTININTQMNYWPAEVCNLMEMQQPLYQ 438
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTH 473
L LS+ G+ TA Y GWV HH TDIWA ++ D+GK WA W MGG WLC
Sbjct: 439 LLKELSVTGAATAGEFYNTRGWVAHHNTDIWAIANPVGDKGKGDPQWANWMMGGNWLCQF 498
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGK 532
LW+HY YT D FL AYP+++ A F LD+L++ GYL T P+TSPE++F+ +G
Sbjct: 499 LWQHYCYTGDEKFLRDTAYPIMKSAALFSLDFLVKDPASGYLVTAPATSPENKFLLANGT 558
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
VS +STMDM IIRE+F+ +I A EVL K ++ L + + + RL P KI +DGS+ E
Sbjct: 559 QESVSIASTMDMTIIRELFNNVIKAGEVL-KVDNGLRDSLQVAADRLYPFKIGKDGSLQE 617
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W +D+ E HRH+SHL+ LFPG I+ P+L A ++TL+ RG+ G GWS WK
Sbjct: 618 WYKDWPSGETEHRHISHLYALFPGDQISPSATPELANATKRTLEIRGDGGTGWSKAWKIN 677
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEH-EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WARL D HAY++++ L L + H GG Y+NLF AHPPFQID NFG T+ +A+
Sbjct: 678 TWARLEDGNHAYKLLRELLTLTGKGAVDMHNAGGTYANLFCAHPPFQIDGNFGGTSGIAQ 737
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ML+ N + LLPALP D W++G VKGL A GG T+ + WK+G L V IY+
Sbjct: 738 MLLNGQSNMIRLLPALP-DAWATGDVKGLLAYGGHTIDMSWKEGKLVRVTIYA 789
>gi|423342630|ref|ZP_17320344.1| hypothetical protein HMPREF1077_01774 [Parabacteroides johnsonii
CL02T12C29]
gi|409217547|gb|EKN10523.1| hypothetical protein HMPREF1077_01774 [Parabacteroides johnsonii
CL02T12C29]
Length = 844
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 312/796 (39%), Positives = 443/796 (55%), Gaps = 51/796 (6%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M E T PL + ++ PA+++ +A+PIGNGR GAM++G +E L+LNE+TL++G P
Sbjct: 14 MACEETPQKEPLVLWYDSPARNWDEALPIGNGRSGAMIFGRTDNEQLQLNENTLYSGEPS 73
Query: 62 DYTN--PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLK 118
P+ V L+ +G+Y EA+ K G YQ GD+ ++ ++ +
Sbjct: 74 VVFKDVKITPEMFDKVVGLMKAGKYTEASDLVCKNWLGRLHQYYQPFGDLHIQ---NNKQ 130
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+R L+++ A A Y G + RE F+S+PD VIV ++ + + +++
Sbjct: 131 GEANRYKRTLNISDAVATTVYEQGGTHYEREVFASHPDNVIVMRLKSNTPDGIDISLNFT 190
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGK----------------RIPPKANAND---------- 212
S ++++I+ G+ PG + P +AN
Sbjct: 191 SPHPTALQKGRDDRLILHGQAPGYVERRTFEQIEQWGDPYKHPELYDANGKRKFNKRMLY 250
Query: 213 ----DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPS 268
D KG+ F A L+ D + D + V +D +L ++SF+G +PS
Sbjct: 251 GEEIDGKGMFFEAQLKPVFPKD--GKCDITDSGIHVYDTDEVYFVLSMATSFNGFDKSPS 308
Query: 269 DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
DP++++ L + +Y L RH +DY+ LF+RV +L+ SP+
Sbjct: 309 REGIDPSAKAAGILDKALSYNYQTLKQRHTEDYRSLFNRVDFKLASSPEQ---------- 358
Query: 329 IDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
+P+ +R++ F DP L LLFQFGRYL+IS SRPG Q NLQG+WN+D P W+
Sbjct: 359 -KAMPTDKRIEQFAQTADPELAALLFQFGRYLMISGSRPGGQPLNLQGMWNKDTIPAWNC 417
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
+NIN EMNYW + NLSECQ+PLF + L+++G++TA+ Y GWV HH T IW
Sbjct: 418 GYTININTEMNYWPAELTNLSECQQPLFRMIRELAVSGAETARNMYNRRGWVAHHNTSIW 477
Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
+S + + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLIE
Sbjct: 478 RESLPNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIE 537
Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
+GYL T SPE+ FI DG+ A +S TMDMAIIRE F+ I A+E+ +E +L
Sbjct: 538 DENGYLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIEASEMFNLDE-SL 596
Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLC 628
++ L RL+P +I E G + EW DFK+ E HRH SHL+G P IT +K P+L
Sbjct: 597 RNELKNKLARLQPYQIGERGQLQEWIYDFKEAEPQHRHFSHLYGFHPSDQITPDKTPELF 656
Query: 629 KAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYS 688
A KTL+ RG+ GWS+ WK WARL D HAY+++ LFN V + H GGL+
Sbjct: 657 NAVRKTLELRGDLASGWSMGWKINCWARLLDGNHAYKIIANLFNPVGFGNSAHKGGGLFR 716
Query: 689 NLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV 748
NL AHPPFQID NFG+TA V EML+QS ++LLPALP D W G V GLKARG +
Sbjct: 717 NLLCAHPPFQIDGNFGYTAGVVEMLLQSHTGYIHLLPALP-DVWKEGSVSGLKARGNFEI 775
Query: 749 SICWKDGDLHEVGIYS 764
++ W+DG L EV I S
Sbjct: 776 AMNWQDGILTEVKIRS 791
>gi|218258383|ref|ZP_03474775.1| hypothetical protein PRABACTJOHN_00430 [Parabacteroides johnsonii
DSM 18315]
gi|218225510|gb|EEC98160.1| hypothetical protein PRABACTJOHN_00430 [Parabacteroides johnsonii
DSM 18315]
Length = 844
Score = 572 bits (1473), Expect = e-160, Method: Compositional matrix adjust.
Identities = 312/796 (39%), Positives = 443/796 (55%), Gaps = 51/796 (6%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M E T PL + ++ PA+++ +A+PIGNGR GAM++G +E L+LNE+TL++G P
Sbjct: 14 MACEETPQKKPLVLWYDSPARNWDEALPIGNGRSGAMIFGRTDNEQLQLNENTLYSGEPS 73
Query: 62 DYTN--PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLK 118
P+ V L+ +G+Y EA+ K G YQ GD+ ++ ++ +
Sbjct: 74 VVFKDVKITPEMFDKVVGLMKAGKYTEASDLVCKNWLGRLHQYYQPFGDLHIQ---NNKQ 130
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+R L+++ A A Y G + RE F+S+PD VIV ++ + + +++
Sbjct: 131 GEANRYKRTLNISDAVATTVYEQGGTHYEREVFASHPDNVIVMRLKSNTPDGIDISLNFT 190
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGK----------------RIPPKANAND---------- 212
S ++++I+ G+ PG + P +AN
Sbjct: 191 SPHPTALQKGRDDRLILHGQAPGYVERRTFEQIEQWGDPYKHPELYDANGKRKFNKRMLY 250
Query: 213 ----DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPS 268
D KG+ F A L+ D + D + V +D +L ++SF+G +PS
Sbjct: 251 GEEIDGKGMFFEAQLKPVFPKD--GKCDITDSGIHVYDTDEVYFVLSMATSFNGFDKSPS 308
Query: 269 DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
DP++++ L + +Y L RH +DY+ LF+RV +L+ SP+
Sbjct: 309 REGIDPSAKAAGILDKALSYNYRTLKQRHTEDYRSLFNRVDFKLASSPEQ---------- 358
Query: 329 IDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
+P+ +R++ F DP L LLFQFGRYL+IS SRPG Q NLQG+WN+D P W+
Sbjct: 359 -KAMPTDKRIEQFAQTADPELAALLFQFGRYLMISGSRPGGQPLNLQGMWNKDTIPAWNC 417
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
+NIN EMNYW + NLSECQ+PLF + L+++G++TA+ Y GWV HH T IW
Sbjct: 418 GYTININTEMNYWPAELTNLSECQQPLFRMIRELAVSGAETARNMYNRRGWVAHHNTSIW 477
Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
+S + + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLIE
Sbjct: 478 RESLPNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIE 537
Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
+GYL T SPE+ FI DG+ A +S TMDMAIIRE F+ I A+E+ +E +L
Sbjct: 538 DENGYLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIEASEMFNLDE-SL 596
Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLC 628
++ L RL+P +I E G + EW DFK+ E HRH SHL+G P IT +K P+L
Sbjct: 597 RNELKNKLARLQPYQIGERGQLQEWIYDFKEAEPQHRHFSHLYGFHPSDQITPDKTPELF 656
Query: 629 KAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYS 688
A KTL+ RG+ GWS+ WK WARL D HAY+++ LFN V + H GGL+
Sbjct: 657 NAVRKTLELRGDLASGWSMGWKINCWARLLDGNHAYKIIANLFNPVGFGNSAHKGGGLFR 716
Query: 689 NLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV 748
NL AHPPFQID NFG+TA V EML+QS ++LLPALP D W G V GLKARG +
Sbjct: 717 NLLCAHPPFQIDGNFGYTAGVVEMLLQSHTGYIHLLPALP-DVWKEGSVSGLKARGNFEI 775
Query: 749 SICWKDGDLHEVGIYS 764
++ W+DG L EV I S
Sbjct: 776 AMNWQDGILTEVKIRS 791
>gi|218263534|ref|ZP_03477615.1| hypothetical protein PRABACTJOHN_03302 [Parabacteroides johnsonii
DSM 18315]
gi|218222657|gb|EEC95307.1| hypothetical protein PRABACTJOHN_03302 [Parabacteroides johnsonii
DSM 18315]
Length = 811
Score = 570 bits (1470), Expect = e-160, Method: Compositional matrix adjust.
Identities = 315/773 (40%), Positives = 457/773 (59%), Gaps = 43/773 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPK 70
P + F PA + +A+PIGNG++GAM++GGV E ++LNE TLW+G P NP+A K
Sbjct: 22 PKTLWFEQPANQWVEALPIGNGQIGAMIFGGVEEELIQLNEGTLWSGSPLKKNVNPEAYK 81
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ VR + Y +AT K+ G + + LGD++++ D H K Y+R L L
Sbjct: 82 FLAPVREALAKEDYQQATKLCKKMQGFFTENFLPLGDLKIKQDFGH-KARVVDYKRILQL 140
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A A +++ V V +TR+ F+S PD V+V + + + L+ ++ L SLL +H NG
Sbjct: 141 DKAIASIEFVVDEVHYTRKMFTSAPDSVMVIQFTADKLRKLTLDIHLTSLLKHHVTANGK 200
Query: 191 NQIIMEGRCPG----------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ ++ G+ P R P D +G++F +L K D GTI + ++K
Sbjct: 201 DLFVLSGQAPACVDPIYYERPGREPIVQVDKDGLQGMRFQTVL--KAIPDGGTIVS-DEK 257
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ V+ ++ LLL A++SF+G +P KD S + I + ++ L RH+ D
Sbjct: 258 GIHVKDANSLTLLLSAATSFNGFNKHPDSEGKDEKVISCHRIDRIDKVDFAVLKKRHITD 317
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGRY 359
++ F RVS+ L TDT + +P+ R+K + + DP L EL FQ+GRY
Sbjct: 318 FKSYFDRVSLHL--------TDTLNSTINKKLPTDFRLKLYSYGNYDPQLEELYFQYGRY 369
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLIS+SRPG NLQG+W+ ++ P W S +NIN EMNYW + NLSE + L +F+
Sbjct: 370 LLISASRPGGSAINLQGLWSNEVRPPWASNYTININTEMNYWLAESTNLSEMHQSLLNFI 429
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLW 475
LSI G TA+ Y A GW+ HH +DIWA S++ G WA W MGG WL HLW
Sbjct: 430 KNLSITGEDTAKEYYHARGWMAHHNSDIWALSNSVGNCGDGNPSWASWYMGGNWLSLHLW 489
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY YT D++FL+ AYP+++G A F DWL+E +GYL T+PSTSPE+ F D +
Sbjct: 490 EHYCYTGDKEFLKNEAYPIMKGAALFCFDWLLE-KNGYLITSPSTSPENNFFV-DNNVYA 547
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
VS ++TMDMAII ++F+ +I A+E+L ++ E V+K RL P +I G + EW++
Sbjct: 548 VSEAATMDMAIIHDLFTNVIEASEILGIDKKFRSE-VIKKKERLFPYQIGSFGQLQEWSK 606
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D+K+ +++HRHLSHLFG++PG I+ P+L KA +TL+ RG++G GWS WK L A
Sbjct: 607 DYKETDMNHRHLSHLFGVYPGRQISPLITPELAKAVSRTLELRGDKGTGWSKAWKICLIA 666
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D HAY+M++ + + Y+NLF + PPFQID NFG TA EML+Q
Sbjct: 667 RLLDGNHAYKMIREM-----------LQYSTYANLFNSCPPFQIDGNFGATAGFVEMLLQ 715
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
S L +++LLPALP D W SGC+ GLK+RG V+I WK+ L + I SN N
Sbjct: 716 SQLKEIHLLPALP-DNWPSGCISGLKSRGNFEVAIAWKNHQLKQAEIKSNLGN 767
>gi|326801540|ref|YP_004319359.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326552304|gb|ADZ80689.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 801
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 318/768 (41%), Positives = 444/768 (57%), Gaps = 35/768 (4%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDA 68
N LK+ ++ PA F +A+P+GNGRLGAMV+GGV E L LNE TLW+G P D NP A
Sbjct: 26 NNLKLWYSKPAGKFEEALPLGNGRLGAMVYGGVQEERLSLNEATLWSGKPVDENKVNPQA 85
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L V+ + + Y A + + G + Y+ LG++ + F + +RREL
Sbjct: 86 KDHLPAVQEALFNEDYQTADSLIRFMQGAYSQSYEPLGNLLIHFKH---QGTPTHFRREL 142
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D++ A ARV Y + + RE F+S+PDQ+IV +++ L F +SLL + S
Sbjct: 143 DISQAIARVSYQLNGTSYRREIFASHPDQLIVIRLTAEGKDRLDFTCRFNSLLRSKS-KK 201
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKL 242
+ + M G P P N +P ++F+++L++ +D + ++ +D L
Sbjct: 202 QSTSLWMHGWAPIHTEPNYRNKEKNPVVYDTLNSMRFASMLKVLKNDGQ---TSWQDSSL 258
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+ + VLLL ++S+ G NP + K+ ++S L+ S++ L +H+ DY+
Sbjct: 259 AISNAKEVVLLLSMATSYSGFDKNPGRAGKNELDLALSYLKEAEKQSFASLQAKHIQDYR 318
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLL 361
F RVSI L K +P+ ER++ F + D D +LV L +Q+ RYLL
Sbjct: 319 HYFDRVSINLGHGEKA------------NLPTDERLERFAKGDGDNNLVALFYQYSRYLL 366
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSRPG Q NLQ +WNE + P W S NIN EMNYW + NL E +PLFDF+
Sbjct: 367 ISSSRPGGQPTNLQALWNEIVRPPWSSNYTTNINTEMNYWGTEVANLPEMHQPLFDFIGR 426
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
L+ G+ TA+ Y A GWV HH TDIWA + G WA W M G WL THLWEH
Sbjct: 427 LAQTGAITAKNYYNADGWVCHHNTDIWAMTHPVGHFGEGHPSWANWQMAGVWLSTHLWEH 486
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
+ +T D DFL K+AYPL++G F L +L DGYL T PSTSPE+ +I G V
Sbjct: 487 FAFTADADFLRKQAYPLMKGAVDFCLSFLTTNKDGYLVTAPSTSPENIYITDKGYKGAVL 546
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
Y ST D+A+IRE+F+ + AA +L+K++ E V +L +L P KI G++ EW D+
Sbjct: 547 YGSTADIAMIRELFADYLKAAVILKKDKKT-QEAVTNALAKLPPYKIGRKGNLREWYHDW 605
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+D E HRH+SHLFGL+PG TI+ P+L +A +K+L R E GW+ITW+ LWARL
Sbjct: 606 EDAEPQHRHVSHLFGLYPGTTISDASTPELARAVQKSLDIRTNESTGWAITWRINLWARL 665
Query: 658 HDQEHAYRMVKRLF-NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
H+ AY +K+LF N DPE K EGGLYSNLF+ PPFQIDANFG A ++EML+QS
Sbjct: 666 HNSAMAYDALKKLFRNANDPEIIKKGEGGLYSNLFSTCPPFQIDANFGGGAGISEMLLQS 725
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + LLPALP +W G V GL ARGG + + W++G + I S
Sbjct: 726 HEHYIELLPALP-KEWPDGEVNGLVARGGFVIDMQWRNGKIVHASIVS 772
>gi|333380580|ref|ZP_08472271.1| hypothetical protein HMPREF9455_00437 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826575|gb|EGJ99404.1| hypothetical protein HMPREF9455_00437 [Dysgonomonas gadei ATCC
BAA-286]
Length = 823
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 323/773 (41%), Positives = 457/773 (59%), Gaps = 38/773 (4%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAP 69
N L++ + PA +T+A+P+GNG +G M++GGV +E ++LNE +LW+G P NP+A
Sbjct: 22 NKLQLWYEKPAGKWTEALPVGNGFIGGMIFGGVDNELIQLNEGSLWSGGPQKKNVNPEAY 81
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELE---FDDSHLKYAEETYRR 126
K L +R + Y AT K+ G+ + + LGD+ ++ D+ LK YRR
Sbjct: 82 KYLQPIREALAKEDYKLATELCKKMQGYYGESFLPLGDLHIKQTYADNRRLK----NYRR 137
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL A A ++ + V++ RE F+S PD V+V I+ S G ++ VSL+S L
Sbjct: 138 TLDLENAIATTEFEINGVKYIREIFTSAPDSVLVMHITASMPGMINLEVSLNSQLSGTLS 197
Query: 187 VNGNNQIIMEGRCPGK----------RIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
+G N+I++ G+ P + R P + + G++F +++ + S D IS
Sbjct: 198 ADGKNRIVLRGKAPARVDPNYYNKPGRNPIEQTDAEGCNGMRFQTVVQAR-SKDGAIIS- 255
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
++ + ++ + LLL A++SF+G P KD S S + +++ Y DL T
Sbjct: 256 -DNNGIYIKNATSVTLLLSAATSFNGFDKCPDSEGKDEKRISESYIAHVQDKGYYDLKTT 314
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQ 355
H++DYQK F+RVS L P +T + + +PS R+K + + DP L L F
Sbjct: 315 HINDYQKYFNRVSFSL---PNTTITRDVNRK----LPSDMRLKLYSYGNYDPELESLFFH 367
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLIS+SRPG ANLQG+WN++ P W S +NIN +MNYW + NLSE +PL
Sbjct: 368 YGRYLLISASRPGGSAANLQGLWNKEFRPPWSSNYTININTQMNYWPAEIANLSEMHQPL 427
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA--DR--GKVVWALWPMGGAWLC 471
F+ LS G+ TAQ Y A GWV HH TDIW S+A DR G WA W MGG WLC
Sbjct: 428 LQFIQNLSKTGTITAQEYYRAKGWVAHHNTDIWGLSNAVGDRGDGDPNWANWYMGGNWLC 487
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
HLWEHY +T D+ FL+ AYP+++ A F DWLIE DGYL T+PSTSPE F+ DG
Sbjct: 488 QHLWEHYQFTGDKGFLKDIAYPVMKEAALFCFDWLIE-KDGYLITSPSTSPEAAFVTADG 546
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K V+ ++TMD+AIIR++F+ +I A++ L ++ E+++K +L P KI G +
Sbjct: 547 KRYSVTEAATMDIAIIRDLFTNLIEASQELNFDK-KFREQLIKKRDKLLPYKIGSQGQLQ 605
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW++D+KD + HHRH+SHLFGL PG I+ PDL A ++T + RG+EG GWS WK
Sbjct: 606 EWSKDYKDQDPHHRHISHLFGLHPGRQISPLITPDLAAACQRTFEIRGDEGTGWSKGWKI 665
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
ARL D HAY+M++ + V E GG Y N F AHPPFQID NFG TA E
Sbjct: 666 NFAARLLDGNHAYKMIREIMKYV--EEGGSSTGGTYPNFFDAHPPFQIDGNFGATAGFIE 723
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ML+QS LN+++LLPALP D W+ G +KG+ ARGG + I WK+ L I S
Sbjct: 724 MLLQSHLNEIHLLPALP-DVWTEGEIKGIMARGGFEIGIEWKNNVLDNAMIKS 775
>gi|408674119|ref|YP_006873867.1| hypothetical protein Emtol_2704 [Emticicia oligotrophica DSM 17448]
gi|387855743|gb|AFK03840.1| hypothetical protein Emtol_2704 [Emticicia oligotrophica DSM 17448]
Length = 785
Score = 569 bits (1466), Expect = e-159, Method: Compositional matrix adjust.
Identities = 333/811 (41%), Positives = 476/811 (58%), Gaps = 51/811 (6%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+ST+T + + PA++F + + +GNG+LGA V+GGV S+ + LN+ TLW+G P +
Sbjct: 8 AQSTNT-----LWYKQPAQYFEETLVLGNGKLGATVFGGVESDKIYLNDATLWSGEPVNA 62
Query: 64 T-NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
NP+A K L +R + + Y A + KL G ++ Y LG + L +D Y
Sbjct: 63 NMNPEAYKHLPAIREALRNENYKLADQLNKKLQGKFSESYAPLGTMYLT-NDKATNYT-- 119
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y RELD++ A ++V Y V V++TRE+F S PDQ++V K++ S+ G+LSF+V +SLL
Sbjct: 120 NYYRELDISKAISKVTYEVDGVKYTREYFVSYPDQIMVIKLTSSKKGALSFDVKFNSLLK 179
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISA 236
+ VN + + + G P P +D+P KGI+F+ + +IK +D G I +
Sbjct: 180 YKTIVN-DKTLKINGYAP-IHAEPNYRRSDNPVIFDENKGIRFTTLAKIKNTD--GAIVS 235
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
D L ++ + A++ + ++SF+G NP+ + + + ++L +Y +
Sbjct: 236 -TDTTLGIKNASEAIVYVSIATSFNGFDKNPATQGLNNQAIAATSLAKAYAKTYEQIRQS 294
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQ 355
HL DYQK F+RVS+ L ++ +P+ +R++ + + +ED +L L FQ
Sbjct: 295 HLLDYQKFFNRVSLDLGKT------------TAPNLPTDDRLRRYAKGEEDKNLEVLYFQ 342
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSR ANLQGIWN + P W S NIN E NYW + NLSE PL
Sbjct: 343 YGRYLLISSSRTMGVPANLQGIWNPYIRPPWSSNYTTNINAEENYWLAENTNLSEMHAPL 402
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLC 471
F+ ++ G+ TA+ Y A+GWV+ H +DIWA S+ G WA W MGG WL
Sbjct: 403 LGFIKNVAKTGAITAKTFYGANGWVVAHNSDIWAMSNPVGAFGEGDPGWANWNMGGTWLS 462
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
THLWEHY +T D++FL+ AYPL+ G A F L+W++E +G L T+PSTSPE+ +IAPDG
Sbjct: 463 THLWEHYIFTKDQNFLKNEAYPLMRGAAQFCLEWMVEDKNGKLITSPSTSPENIYIAPDG 522
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSI 590
Y + D+A+IRE F I A+++L N DA K+ +L +L P +I + G++
Sbjct: 523 YKGATMYGGSADLAMIRECFIQTIKASKIL--NTDANFRTKLETALAKLYPYQIGKKGNL 580
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW D++D E HRH SHLFGLFPG+ IT + PDL A +TL+ +G+E GWS W+
Sbjct: 581 QEWYYDWEDAEPKHRHQSHLFGLFPGNHITPNQTPDLANACRRTLEIKGDETTGWSKGWR 640
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEK---HFEGGLYSNLFAAHPPFQIDANFGFTA 707
LWARL D HAY+M++ L N V+P+ K GG Y NLF AHPPFQID NFG A
Sbjct: 641 INLWARLWDGNHAYKMIRELLNYVEPDGVKTNYARGGGTYPNLFDAHPPFQIDGNFGGAA 700
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
A AEMLVQS ++ LLPALP D WSSG VKG+ ARGG +S+ W + L +V I S
Sbjct: 701 AFAEMLVQSDEQEIRLLPALP-DAWSSGSVKGICARGGFELSLEWDNKLLKKVTISSKKG 759
Query: 768 NNDHDSFKTLHYRGTSVK-VNLSAGKIYTFN 797
N T G K ++L AG+ T N
Sbjct: 760 GN------TKLISGEKTKNISLKAGEKLTIN 784
>gi|410098957|ref|ZP_11293931.1| hypothetical protein HMPREF1076_03109 [Parabacteroides goldsteinii
CL02T12C30]
gi|409220088|gb|EKN13045.1| hypothetical protein HMPREF1076_03109 [Parabacteroides goldsteinii
CL02T12C30]
Length = 848
Score = 568 bits (1464), Expect = e-159, Method: Compositional matrix adjust.
Identities = 319/795 (40%), Positives = 443/795 (55%), Gaps = 52/795 (6%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN- 65
T L + +N P++++ +A+PIGNGR GAMV+GGV E L+LNE+TL++G P
Sbjct: 21 TQKKESLVLWYNEPSENWNEALPIGNGRAGAMVFGGVDKEQLQLNENTLYSGEPSTVFKD 80
Query: 66 -PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P+ V L+ + +Y EA+ K G YQ GD+ F +++
Sbjct: 81 IKITPEMFDKVVGLMKAQKYDEASDLVCKHWLGRLHQYYQPFGDL---FIENNKPGEVSG 137
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y+REL+++ A R + V++ RE F+S+PD VI+ + S L +++ S
Sbjct: 138 YKRELNISDAVTRTVFEQNGVQYEREIFASHPDDVIIVHLKSSTPDGLDLSLNFTSPHPT 197
Query: 184 HSYVNGNNQIIMEGRCPG----------------------------KRIPPKANAND--D 213
G +++++ G+ PG ++ + D D
Sbjct: 198 AKQSKGTDRLVLHGQAPGYVERRTFEQMEAWGDQYKHPELYDEKGNRKFDKRVLYGDEID 257
Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
KG+ F A ++K +G + D + V ++ +L ++SF+G +PS D
Sbjct: 258 NKGMFFEA--QLKPVLPKGGDYEITDAGVHVYNTNEVYFVLSMATSFNGFDKSPSREGVD 315
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P++++ L Y L RH+ DYQKLF RV +QL SP+ +P
Sbjct: 316 PSAKAAGILDKALAYDYKQLKQRHMADYQKLFDRVDLQLPSSPEQ-----------KAMP 364
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +R+ F+T DP L LLFQFGRYL+IS SRPG Q NLQGIWN+D+ P W+S +N
Sbjct: 365 TDQRIAQFETMGDPDLAALLFQFGRYLMISGSRPGGQPLNLQGIWNKDVVPAWNSGYTIN 424
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
IN EMNYW + NLSEC EPLF + L+++G++TA+ Y GWV HH T IW +S
Sbjct: 425 INTEMNYWPAEVTNLSECHEPLFRLIDELAVSGAETARNMYNRRGWVGHHNTSIWRESVP 484
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
+ + WPM WLC+HLWEHY YT D+DFL+ RAYPL++G A F DWLI+ +G
Sbjct: 485 NDNVPTASFWPMVQGWLCSHLWEHYLYTQDQDFLKNRAYPLMKGAAEFFADWLIDDGNGR 544
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
L T SPE+ FI +GK ++ TMDMAI+RE F+ + AAE+L +E +L ++
Sbjct: 545 LVTPVGVSPENRFIMDNGKQGAMTMGPTMDMAIVRETFTRTLQAAEMLGLDE-SLQAELK 603
Query: 574 KSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
LPRL P +I G + EW DFK+ E HRH SHL+GL PG+ IT + PDL A ++
Sbjct: 604 DKLPRLLPYQIGARGQLQEWMYDFKEWEPKHRHFSHLYGLHPGNQITADGTPDLFDAVKQ 663
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+E GWS+ WK WARL D HAY++V LFN V GGL+ N+ A
Sbjct: 664 TLILRGDEATGWSMGWKINCWARLQDGNHAYKIVSNLFNPVG-FGNGRKGGGLFKNMLDA 722
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
HPPFQID NFG+TA VAEML+QS + LLPALP D WS G V GLKARG V++ WK
Sbjct: 723 HPPFQIDGNFGYTAGVAEMLMQSHAGFIQLLPALP-DVWSEGSVSGLKARGNFEVAMNWK 781
Query: 754 DGDLHEVGIYSNYSN 768
G L E I S N
Sbjct: 782 QGHLSEATILSGSGN 796
>gi|298246866|ref|ZP_06970671.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
gi|297549525|gb|EFH83391.1| Alpha-L-fucosidase [Ktedonobacter racemifer DSM 44963]
Length = 809
Score = 567 bits (1461), Expect = e-159, Method: Compositional matrix adjust.
Identities = 319/777 (41%), Positives = 451/777 (58%), Gaps = 50/777 (6%)
Query: 1 MMNAESTSTTN---PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
M A++ + PLK+ + PA + +A+P+GNG LGAM+ GG+ E L+LNEDTLW+
Sbjct: 1 MYQAQAAGVSQDKPPLKLWYRQPATQWLEALPVGNGHLGAMIHGGIGEEVLQLNEDTLWS 60
Query: 58 GVPGDYTNPDAPKALSDVRSLV-DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSH 116
G P D NPDA L ++R L+ + Y A + ++ G + YQ LG + L+F+
Sbjct: 61 GEPYDTDNPDAVTLLPELRRLILEERDYVAAQELAHRMQGPYNESYQPLGYVRLKFEQ-- 118
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+ + Y+R LDLNTA A V+Y G++ F+RE FSS D ++V +++ +LS
Sbjct: 119 -RGEVQAYQRALDLNTALATVQYKAGDILFSREVFSSAADDLLVIRLTSDTPHALSLTAH 177
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKI 227
L+SL G+N+I M GRCP + + P DP G++F L+ +
Sbjct: 178 LESLHPFTCAPAGSNKIRMTGRCP-RHVDPDYLPTSDPVIYDHGEDGHGMRFETQLQAMV 236
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
+ G ISA D L+VE + L A++S+ G P S + + L +
Sbjct: 237 --EGGRISADVDGALRVENAHTVTFFLSAATSYRGFASRPDLSAHVLEQQCTTRLAVGMS 294
Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-ED 346
Y L H+ DYQ+LF RV++ L RS + + +P+ ER+ + Q D
Sbjct: 295 KGYEVLRAAHISDYQRLFQRVTLDLGRS------------DGENLPTDERLVAVQKGASD 342
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
+L+ L FQ+GRYLLISSSRPGTQ A+LQGIWN+ + P W S +N+N +MNYW + C
Sbjct: 343 DALLALFFQYGRYLLISSSRPGTQPAHLQGIWNDHVRPAWSSNWTINMNTQMNYWPAETC 402
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALW 463
NL+EC PLFD L S++G +TAQV Y GWV HH D+W ++ G WA W
Sbjct: 403 NLAECHSPLFDLLEEASVSGERTAQVYYGCRGWVAHHNMDLWRNTAPVGNGSGDPQWANW 462
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
MGGAWLC HLWEHY ++ DR FL +RAYP+++ A FLLD+L+E G+L T PS SPE
Sbjct: 463 NMGGAWLCQHLWEHYAFSGDRSFLSQRAYPIMKKAAQFLLDFLVEDRQGHLTTCPSMSPE 522
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
+ FI G+L+ VS STMD+AI E+F+ I+A++VL+ ++ ++ ++L RL
Sbjct: 523 NLFITESGELSGVSAGSTMDIAITHELFTHCIAASQVLDIDQ-GFAHELAQALARLPQPG 581
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG- 642
I G + EW +DF + E HRH+SHL+GL+PG IT+EK P+L +AA K+L++R E G
Sbjct: 582 IGSYGQLQEWNEDFAEHEPGHRHMSHLYGLYPGEQITLEKTPELLQAARKSLERRLEHGG 641
Query: 643 --PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP--FQ 698
GWS ALWARL + + A+ V +L K +L HPP FQ
Sbjct: 642 GATGWSRALVAALWARLGEGDLAHEHVIQLL--------KDLTATNLFDLIYQHPPIIFQ 693
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
ID NFG TAA+AEMLVQS ++L +LPALP W+ G V GL+ARGG V + W +G
Sbjct: 694 IDGNFGATAAIAEMLVQSHADELAILPALP-HAWNEGYVCGLRARGGLEVDVEWSNG 749
>gi|253574718|ref|ZP_04852058.1| twin-arginine translocation pathway signal [Paenibacillus sp. oral
taxon 786 str. D14]
gi|251845764|gb|EES73772.1| twin-arginine translocation pathway signal [Paenibacillus sp. oral
taxon 786 str. D14]
Length = 799
Score = 567 bits (1460), Expect = e-158, Method: Compositional matrix adjust.
Identities = 314/790 (39%), Positives = 451/790 (57%), Gaps = 43/790 (5%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA + +A+P+GNGR+G MV+GG+ E + LNEDTLW+G P D N DA + L
Sbjct: 13 KLWYDRPASRWEEALPVGNGRIGGMVFGGIHRERIALNEDTLWSGFPRDPQNYDALRHLG 72
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAE---ETYRRELD 129
R L+ +G+Y EA K+ G + YQ LGD+ LE DS + + +RRELD
Sbjct: 73 PARELIFAGKYKEAEKLIDAKMLGRRTESYQPLGDLWLEQGDSATEADGNELQGFRRELD 132
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN- 188
L T A Y +G E+ RE F S DQV+V +I+ S ++ SLDSLL + ++
Sbjct: 133 LATGIATTTYRIGGAEYRREVFISAVDQVMVLRITALGSEPVNMAASLDSLLRHQAFGGP 192
Query: 189 -GNNQIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+I M G+ P + P++ +D G+ F A L + + + GT+ A +
Sbjct: 193 AETARICMRGQAPSHIADNYRGDHPQSVLYEDGLGLTFEAQL-LALPEGGGTVQADASGR 251
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L V G+ LLL A++ + G P DP +AL + L Y L RH D+
Sbjct: 252 LTVSGAKAVTLLLAAATDYAGYDQAPGSGGIDPAERCQAALDAAAALGYEQLRQRHEADH 311
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
++LF RV ++L P+ ER+++++ E D L L F +GRYL
Sbjct: 312 RRLFGRVELRLG--------RAEEAAERAARPTDERLEAYRRGESDLGLESLYFHYGRYL 363
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
L++SSR GT+ A+LQGIWN + P W+ NIN +MNYW + L++C EPLF+ +
Sbjct: 364 LMASSRTGTEAAHLQGIWNPHVQPPWNCGYTTNINTQMNYWHAEVAGLADCHEPLFELIR 423
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LS+ G++TA+++Y A GWV HH D+W +S+ G+ WA WPMGG WLC HLWEHY +
Sbjct: 424 DLSVTGARTARIHYGARGWVAHHNVDVWRQSTPSDGEASWAFWPMGGVWLCRHLWEHYEF 483
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC-VSYS 539
+D FL + AYPL++G A F DWL+ G DG L T PSTSPE++F+ PDG C VS
Sbjct: 484 GLDEQFLRETAYPLMKGAAEFCQDWLVPGPDGQLVTAPSTSPENKFLTPDGGEPCSVSAG 543
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
STMD+ +IRE+ I A+E+L +E A +++ L R+ +I DG + EW++ F +
Sbjct: 544 STMDLFLIRELLEHTIQASEILGVDE-AWRQELSHMLARMAEPQIGPDGRLQEWSEPFAE 602
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWAR 656
E HRH+SHL G +PG+ IT+ + P+L +A +TL++R G GWS W L+AR
Sbjct: 603 AEPGHRHVSHLVGFYPGNAITVRQTPELAEAVRRTLEERIRNGGGHTGWSCAWLINLYAR 662
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D + A+R V L + Y NLF HPPFQID NFG A +AEML+QS
Sbjct: 663 LGDGDTAHRFVNTLLSRST-----------YPNLFDDHPPFQIDGNFGGAAGIAEMLLQS 711
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKT 776
+ + LLPALP W+ G V GL+ARGG TV + W++G L I S ++ + +
Sbjct: 712 HMGGIDLLPALP-AAWTRGQVSGLRARGGFTVDMTWEEGRLTSACITS--TSGGECTLRG 768
Query: 777 LHYRGTSVKV 786
LH G SV++
Sbjct: 769 LH--GLSVRL 776
>gi|255530725|ref|YP_003091097.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
gi|255343709|gb|ACU03035.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
Length = 786
Score = 566 bits (1458), Expect = e-158, Method: Compositional matrix adjust.
Identities = 310/762 (40%), Positives = 454/762 (59%), Gaps = 37/762 (4%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
+N PA+ F + + +GNG+LGA V+GG+ S+ + LN+ TLW+G P + Y NP+A K + +
Sbjct: 32 YNKPAQFFEETMVLGNGKLGAAVFGGIKSDKIFLNDATLWSGEPVNPYMNPEAYKQIPSI 91
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A + K+ G + Y LG + ++F+ + + YRRELD++ + +
Sbjct: 92 REALKNENYKLANELNRKVQGAFSQSYAPLGTMHIKFNHTD---SASMYRRELDISKSLS 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
++ Y+V V FTRE+F S P +V++ K++ S+ G+LSFNV +SLL N N + +
Sbjct: 149 KITYNVSGVTFTREYFISKPARVMMIKLTSSKKGALSFNVDFESLLK-FEITNQGNTLRV 207
Query: 196 EGRCPGKRIPP-KAN-AN----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+G P P + N AN D+ +G +FS++ IK +D + I + + ++
Sbjct: 208 KGYAPYHAEPVYRGNIANSVKFDENRGTRFSSLFRIKNTDGQVII---QHGSIGLKNGTE 264
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A+L + +SF+G NP+ K + S L+ + ++Y + H++DYQ F+RVS
Sbjct: 265 AILYIAIETSFNGFDKNPATEGKSDALLADSCLKKVVPVNYESVKHAHINDYQNYFNRVS 324
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
L ++ N +P+ ER+K + + ED +L L FQFGRYLLISSSR
Sbjct: 325 FNLGKT------------NAPELPTDERLKRYAEGKEDKNLEILYFQFGRYLLISSSRTA 372
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIWN + P W S NINL+ NYW + NLSE EPL F+ +++ G
Sbjct: 373 GVPANLQGIWNPYIRPPWSSNYTTNINLQENYWLAENTNLSELHEPLMKFIGHVAHTGKV 432
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA+ Y GW + H +DIWA S+ +G VWA W MGG WL THLWEHY +T+D+
Sbjct: 433 TAKTFYGVEGWALCHNSDIWAMSNPVGGFGQGDPVWANWNMGGTWLSTHLWEHYIFTLDK 492
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+FL+++AYPL++G A F L+WL++ G L T+PSTSPE FI DG Y T D+
Sbjct: 493 NFLKQKAYPLMKGAARFCLNWLVKDKKGNLITSPSTSPEASFITADGSKGSTLYGGTADL 552
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
A+IRE F I A+++L + ++V +L +L+P ++ ++G++ EW D+ D + H
Sbjct: 553 AMIRECFLQTIRASQIL-GTDITFRKEVESALRQLQPYQVGKNGNLQEWYYDWDDADPKH 611
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH SHLFGLFPGH IT P+L A +KTLQ +G+E GWS W+ LWARL D HAY
Sbjct: 612 RHQSHLFGLFPGHHITPGLTPELANACKKTLQIKGDETTGWSKGWRINLWARLLDGNHAY 671
Query: 665 RMVKRLFNLVDPEH----EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
+M + L + VDP+ +K GG Y NL AHPPFQID NFG AAVAEMLVQS N
Sbjct: 672 QMYRTLLSYVDPDQYKGPDKKTGGGTYPNLLDAHPPFQIDGNFGGAAAVAEMLVQSNENQ 731
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
+ LLPALP D W +G +KG+ ARGG + + W++ + + I
Sbjct: 732 IRLLPALP-DAWDTGKIKGICARGGFEIEMEWQNKSVKKYTI 772
>gi|261409383|ref|YP_003245624.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261285846|gb|ACX67817.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 799
Score = 565 bits (1457), Expect = e-158, Method: Compositional matrix adjust.
Identities = 311/781 (39%), Positives = 461/781 (59%), Gaps = 43/781 (5%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ +AE +TT + + PA + +A+P+GNGRLGAMV+GGV E ++ NEDTLW+G P
Sbjct: 3 LYSAEHRNTT----LWYRKPAAKWEEALPLGNGRLGAMVFGGVQEECMQWNEDTLWSGFP 58
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
D N +A + L+ R L+ SG+YAEA ++ G + + LGD+ + S +
Sbjct: 59 RDTNNYEALRYLAKARELIASGKYAEAEQLIEGRMVGRNTESFLPLGDLLIR--QSGIGD 116
Query: 120 AEETYRRELDLNTATARVKYSVG--NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ YRREL+L+ A ++ G N F+R+ F S DQV V + S SGS+ + L
Sbjct: 117 SCSEYRRELNLDMGIASTRFQGGGSNHIFSRDMFISAVDQVGVIRYECSGSGSIQLEIGL 176
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDR 231
S L + + + +++ G P + P + +D GI++ + + D
Sbjct: 177 RSPLQHRTRTEEDGTLVLHGHAPTHIADNYRGDHPGSVLYEDGLGIRYE--MRLLALTDS 234
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
G ++ ++D +++ + LL+ A+++F+G +P DP+ LQ +
Sbjct: 235 GQVT-VDDSGMRICAAGSVTLLIAAATNFEGFDRSPGSGGTDPSGICRERLQDAMRHGFE 293
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLV 350
L +RH+ D+Q LF RV +QL R P++ E +I + + ER+++++ ED +L
Sbjct: 294 QLRSRHVQDHQALFRRVELQLGR-PEN-------ERSIAALATDERMEAYREGREDSALE 345
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
L+FQFGRYLLI+SSRPGTQ A+LQGIWN + P W+S NIN EMNYW + L+E
Sbjct: 346 ALMFQFGRYLLIASSRPGTQPAHLQGIWNPHVQPPWNSDYTTNINTEMNYWPAETTRLNE 405
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
C EPL + LS++G++TA+++Y A GWV HH D+W +S G+ +WA WPMGGAWL
Sbjct: 406 CHEPLIQMIRELSVSGARTAKIHYGARGWVAHHNVDLWRMASPSDGRAMWAFWPMGGAWL 465
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
C HLWE Y + D ++L + AYPL+ G A F LD LIE +G+L T+PSTSPE++F+ +
Sbjct: 466 CRHLWERYQFQPDLEYLRETAYPLMRGAALFCLDLLIEDGEGHLVTSPSTSPENQFLTAE 525
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
G VS STMDMAIIR++F I A+++LE++ D L E+ ++ RL P I ++G +
Sbjct: 526 GLPCSVSAGSTMDMAIIRDLFHNCIEASQLLEQD-DELREEWKAAVARLLPYAIDDEGRL 584
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSI 647
MEW++ + + E HRH+SHL+GL+PG IT++ P L +AA +TL R + G GWS
Sbjct: 585 MEWSKPYPEAEPGHRHVSHLYGLYPGSDITLQDTPQLAEAAYRTLMSRIDHGGGHTGWSC 644
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
W L+ARL + AY V+ L + ++ NL HPPFQIDANFG +A
Sbjct: 645 VWLINLFARLQQPDKAYVYVRTLISR-----------SMHPNLLGDHPPFQIDANFGGSA 693
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
+ EML+QS L+ + LLPALP W+ G V+GLKARGG V + WKDG L I S +
Sbjct: 694 GLVEMLLQSHLDAIQLLPALP-KAWAEGSVRGLKARGGFIVDMEWKDGILASASITSTHG 752
Query: 768 N 768
Sbjct: 753 R 753
>gi|399029093|ref|ZP_10730146.1| hypothetical protein PMI10_01974 [Flavobacterium sp. CF136]
gi|398073115|gb|EJL64299.1| hypothetical protein PMI10_01974 [Flavobacterium sp. CF136]
Length = 802
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 310/764 (40%), Positives = 466/764 (60%), Gaps = 38/764 (4%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALSDV 75
++ PA+ F +++ +GNG+LGA V+GGV S+ + LN+ TLW+G P + NP+A K + V
Sbjct: 32 YDKPAEFFEESLVLGNGKLGATVFGGVNSDKIYLNDATLWSGEPVNANMNPEAYKNIPAV 91
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A + K+ G ++ + LG +E+ ++ K Y RELD++ A +
Sbjct: 92 REALKNENYKLAEELNKKIQGKNSESFAPLGTLEI---NNSEKGKAVNYHRELDISNAVS 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+V Y + +++TRE+F S PDQ+++ K++ + G+L+F+++L SLL ++ V NN ++M
Sbjct: 149 KVSYEMAGIKYTREYFVSAPDQIMIIKLTSDQKGALNFDINLKSLLKSNVEVR-NNILVM 207
Query: 196 EGRCP-----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
G P G + PK + +G +F+ +++IK +D + T S + L ++ + A
Sbjct: 208 TGSAPIHENAGYAVLPKY-LDIKERGTRFTTLIQIKKTDGKITNSR---ESLTLKDATEA 263
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
++ + ++SF+G NP+ D + ++ + S+ L H+ DYQK ++RVS+
Sbjct: 264 IIYVSVATSFNGFDKNPATEGLDDVAIALQNMNKAFAKSFDKLKQSHITDYQKFYNRVSL 323
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGT 369
L ++ T S +P+ ER+ + +ED +L L FQ+GRYLLISSSR
Sbjct: 324 DLGKT-------TAS-----NLPTDERLLRYADGNEDKNLEILYFQYGRYLLISSSRTLG 371
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWN L+P W S +NINLE NYW + NLSE PL F+ LSI G T
Sbjct: 372 VPANLQGIWNPYLNPPWSSNYTMNINLEENYWLAENTNLSEMHLPLLSFIKNLSITGKIT 431
Query: 430 AQVNY-LASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
A+ Y + GW H +DIWA ++ + + +WA WPM GAWL TH+WEHY +T D+
Sbjct: 432 AKTFYGVDKGWAAGHNSDIWAMTNPVGQFGKEEPMWACWPMAGAWLSTHIWEHYVFTQDK 491
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
++L+K YPL++G A F L W++ +G L T+PSTSPE+++IAPDG + Y T D+
Sbjct: 492 EYLKKEGYPLMKGAAEFCLGWMVTDKNGNLITSPSTSPENQYIAPDGFVGATMYGGTADL 551
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
A+IRE F I A++VL + D K+ +L +L P +I + G++ EW D++D + H
Sbjct: 552 AMIRECFDKTIKASKVLNIDAD-FRAKLETALSKLHPYQIGKKGNLQEWYHDWEDKDPKH 610
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH S LFGLFPG+ IT K PDL +A+ KTL+ +G++ GWS W+ LWARL D HAY
Sbjct: 611 RHQSQLFGLFPGNHITPLKTPDLAEASRKTLEIKGDQTTGWSKGWRINLWARLWDGNHAY 670
Query: 665 RMVKRLFNLVDPEHEKHFE----GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
+M + L VDP+ +K + GG Y NLF AHPPFQID NFG AAVAEMLVQS N+
Sbjct: 671 KMFRELLQYVDPDGKKTEKPRRGGGTYPNLFDAHPPFQIDGNFGGAAAVAEMLVQSDENE 730
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ LLPALP D W SG VKG+ ARGG +++ W + L++V + S
Sbjct: 731 IRLLPALP-DAWESGSVKGICARGGFEIAMEWNNKTLNKVVVSS 773
>gi|340617674|ref|YP_004736127.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339732471|emb|CAZ95739.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 807
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 316/767 (41%), Positives = 450/767 (58%), Gaps = 44/767 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
F+ PA+HF + + +GNG+ GA ++GGV ++++ LN+ TLW+G P D Y NP+A K L +
Sbjct: 37 FDRPAEHFEETLVLGNGKAGASIFGGVATDSIYLNDATLWSGEPVDPYMNPEAYKNLPAI 96
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A + KL G + Y LG + L F+ K ++Y R+L+L A +
Sbjct: 97 REALKNENYKLADSLQSKLQGSFSQSYMPLGTVYLNFEH---KNQPQSYHRQLELEKALS 153
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
V Y V V FTRE+F S+ DQ +V ++ S+ G+L+FN+ +SLL NG + +
Sbjct: 154 TVTYKVDGVTFTREYFISHADQAMVIRLKSSKKGALNFNIGFNSLLKYELATNGPT-LEV 212
Query: 196 EGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDR--GTISALEDKKLKVEGS 247
G P P P D +G +F+++ IK +D + GT D + ++ +
Sbjct: 213 NGYAPYHVEPSYRGKMPNPVQFDPNRGTRFTSLFRIKHTDGKLIGT-----DNTVALKDA 267
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
AV+ + ++SF+G NP+ D + + S L + + L+ HL D+QK F+R
Sbjct: 268 TEAVVYVSIATSFNGFDKNPATEGLDHKAMASSQLSKASSKPFDALFEAHLKDHQKYFNR 327
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSR 366
V + L +S + +P+ ER+K + + +ED +L L FQ+GRYLLISSSR
Sbjct: 328 VHLDLGKS------------TAEDLPTDERLKRYAKGEEDKNLEVLYFQYGRYLLISSSR 375
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
ANLQGIWN + P W S +NIN E NYW + NLSE +P+ F+ ++ G
Sbjct: 376 TPNVPANLQGIWNPYIRPPWSSNYTLNINAEENYWLAENANLSEMHQPMLGFIENIAQTG 435
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
TA+ Y A GW H +DIWA S+ +G + WA W MGG WL +HLWEHY ++
Sbjct: 436 KITAKTFYGAGGWAACHNSDIWAMSNPVGDFGQGGINWANWNMGGTWLSSHLWEHYTFSQ 495
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D DFL+ RAYPLL+G A F L+WL+E DG L T+P TSPE++FI PDG Y ST
Sbjct: 496 DLDFLKNRAYPLLKGAAEFCLEWLVEDKDGNLVTSPGTSPENKFITPDGYQGATLYGSTS 555
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D+A+IRE F I+A+E L K + A ++ K+L +L P ++ + G++ EW D++D +
Sbjct: 556 DLAMIRECFQQTIAASETL-KTDAAFRTQLEKALAKLYPYQVGKKGNLQEWYHDWEDVDP 614
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH SHL+GL+PGH I+ EK P+L A TL +G+E GWS W+ LWARL D
Sbjct: 615 KHRHQSHLYGLYPGHHISPEKTPELADATRTTLNIKGDETTGWSKGWRINLWARLLDGNR 674
Query: 663 AYRMVKRLFNLVDPE-----HEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
AY+ + L V P+ +EK GG Y NLF AHPPFQID NFG AAV EMLVQST
Sbjct: 675 AYKQYRELLRYVAPDGVRASYEK--GGGTYPNLFDAHPPFQIDGNFGGAAAVVEMLVQST 732
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L ++ LLPALP D W++G V+GLKARG V+I W + +V I+S
Sbjct: 733 LQEIRLLPALP-DVWANGSVEGLKARGNFEVAITWNNKVPTQVKIHS 778
>gi|436838082|ref|YP_007323298.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
gi|384069495|emb|CCH02705.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
Length = 801
Score = 563 bits (1450), Expect = e-157, Method: Compositional matrix adjust.
Identities = 318/792 (40%), Positives = 454/792 (57%), Gaps = 41/792 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
+ PA +F + + +GNG GA V+GGV S+ + LN+ TLW+G P D NP+A K + +
Sbjct: 29 YKQPAHYFEETLVLGNGTQGASVFGGVRSDKIYLNDATLWSGGPVDPNMNPEAYKNIPAI 88
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A KL G ++ Y LG + F D+ + Y R+L+L AT+
Sbjct: 89 REALQNENYQLADQFQKKLQGKFSESYAPLGTL---FIDTDAPADPQNYYRQLNLADATS 145
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+V+Y+V V FTR++F S PDQ++V ++ S G+L F V +S L N GN +
Sbjct: 146 QVRYTVNGVTFTRDYFISKPDQLMVIRLKSSRKGALGFTVRFNSQLRNQVSATGN-VLKA 204
Query: 196 EGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
G P K P P A D KG +F+ ++ IK D G A D L ++G
Sbjct: 205 TGYAPQKAEPNYRGNIPNAVVFDPAKGTRFTTLMGIKTQD--GGTVATTDTSLTLKGGTE 262
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A+L + ++SF+G +P+ + + + L + SY+ L H+ DYQ+LF+RVS
Sbjct: 263 ALLFVSIATSFNGFDKDPATNGLPHETIAAERLSRAMSKSYAQLLAAHVSDYQRLFNRVS 322
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
++L+ S E I +P+ ER++ + + D L +L F FGRYLLISSSR
Sbjct: 323 LRLT-----------SAETIPNLPTDERLQRYAEGKPDTDLEQLYFNFGRYLLISSSRTP 371
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIWN + P W S NINL+ NYW + NL E EP+ F+ L+ G+
Sbjct: 372 GVPANLQGIWNPYMRPPWSSNYTTNINLQENYWPAETANLPEMHEPMLSFIGNLAKTGTI 431
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA+ Y A+GW + H +DIWA ++ +G VWA W MGGAW+ THLWEH+ + D+
Sbjct: 432 TARTFYGANGWTVAHNSDIWAMTNPVGDFGQGDPVWANWNMGGAWISTHLWEHFTFGQDK 491
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+L + AYPLL+G A F LDWL+ G L T+P TSPE++++ P G + T D+
Sbjct: 492 TYLRETAYPLLKGAAQFCLDWLVRDKAGKLVTSPGTSPENQYLTPSGYKGATLFGGTADL 551
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
A++RE S + AA+VL N DA + LK +L L P +I + G++ EW D+ D +
Sbjct: 552 AMVRECLSQTLQAAQVL--NTDADFQATLKQTLADLHPYQIGKAGNLQEWYYDWADVDPK 609
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH SHLFGL+PGH I ++ P+L +A KTL+ +G+E GWS W+ LWARL D HA
Sbjct: 610 HRHQSHLFGLYPGHQIRPDRTPELAQACRKTLEIKGDETTGWSKGWRINLWARLWDGNHA 669
Query: 664 YRMVKRLFNLVDPEHEK---HFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
Y+M + L + V P+ K GG Y NLF AHPPFQID NFG TAAVAEML+QS+ N+
Sbjct: 670 YKMYRELLHFVLPDGVKTDYARGGGTYPNLFDAHPPFQIDGNFGGTAAVAEMLLQSSDNE 729
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
+ LLPALP D W +G V GL+ARGG +++ W++G + ++S TL
Sbjct: 730 IRLLPALP-DAWPAGSVSGLRARGGFELTLDWQNGRPVKATVFSKMGGQ-----TTLVGG 783
Query: 781 GTSVKVNLSAGK 792
G S +NL G+
Sbjct: 784 GKSQSLNLKPGQ 795
>gi|423346384|ref|ZP_17324072.1| hypothetical protein HMPREF1060_01744 [Parabacteroides merdae
CL03T12C32]
gi|409220202|gb|EKN13158.1| hypothetical protein HMPREF1060_01744 [Parabacteroides merdae
CL03T12C32]
Length = 844
Score = 561 bits (1445), Expect = e-157, Method: Compositional matrix adjust.
Identities = 312/791 (39%), Positives = 439/791 (55%), Gaps = 51/791 (6%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN- 65
T + PL + ++ PA+++ +A+PIGNGR GAMV+GGV E L+LNE+TL++G P
Sbjct: 19 TPSKAPLTLWYDKPAQNWDEALPIGNGRAGAMVFGGVEKEQLQLNENTLYSGEPSVVFKD 78
Query: 66 -PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P+ V L+ +G+Y A+ K G YQ GD+ ++ +
Sbjct: 79 VKITPEMFDKVVGLMKAGKYKTASDLVCKNWLGRLHQYYQPFGDLHIQNNKPG---DAAG 135
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y+R L+++ A A Y V++ RE F+S+PD VIV + + ++ S
Sbjct: 136 YKRALNISDAVATTVYKQNGVKYEREVFASHPDNVIVMHLKSDTPNGIDISLDFTSPHPT 195
Query: 184 HSYVNGNNQIIMEGRCPGK----------------RIPPKANAND--------------D 213
++++I+ G+ PG + P +AN D
Sbjct: 196 ALQKGTDDRLILHGQAPGYVERRTFEQIEQWGDQYKHPELYDANGKRKFDKRMLYGDEID 255
Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
KG+ F A L+ D + D + + +D +L ++SF+G +PS D
Sbjct: 256 GKGMFFEAQLKPVFPKD--GKCEITDAGIHIYNTDEVYFILSMATSFNGFDKSPSRDGID 313
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P++++ S L+ + Y L RH +DY LF RV +QL S SE+ +P
Sbjct: 314 PSAKAASILEKALSYDYQTLKQRHTEDYHSLFDRVDLQLVSS---------SEQK--AMP 362
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +R++ F DP+L LLFQFGRYL+IS SRPG Q NLQGIWN+D P W+ +N
Sbjct: 363 TDKRLEQFTQTADPALAALLFQFGRYLMISGSRPGGQPLNLQGIWNKDTIPAWNCGYTIN 422
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
IN EMNYW + NLSECQEPLF + LS++G++TA+ Y GWV HH T IW +S
Sbjct: 423 INTEMNYWPAELTNLSECQEPLFRMIRELSVSGAETARNMYNRRGWVAHHNTSIWRESLP 482
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
+ + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLI+ +G+
Sbjct: 483 NDNVPTASFWPMVQGWLCSHLWEHYQFTQDEAFLKNEAYPLMKGAAEFFADWLIDDGNGH 542
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
L T SPE+ FI DG+ A +S TMDMAIIRE F+ I+A+E+ +E + ++
Sbjct: 543 LVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIAASEMFNLDE-SFRNELK 601
Query: 574 KSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
L RL P +I + G + EW DFK+ E HRH SHL+G P IT +K P+L A K
Sbjct: 602 DKLARLLPYQIGKRGQLQEWIYDFKEWEPQHRHFSHLYGFHPSDQITPDKTPELFNAVRK 661
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL+ RG+ GWS+ WK WARL D HAY+++ LFN V + H GGL+ NL A
Sbjct: 662 TLELRGDLASGWSMGWKINCWARLLDGNHAYKIIANLFNPVGFGNSAHKGGGLFRNLLCA 721
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
HPPFQID NFG+TA V EML+QS ++LLPALP D W+ G V GLKARG +++ WK
Sbjct: 722 HPPFQIDGNFGYTAGVVEMLLQSHAGYIHLLPALP-DVWAEGSVYGLKARGNFEITMNWK 780
Query: 754 DGDLHEVGIYS 764
+G L E I+S
Sbjct: 781 NGKLTEANIHS 791
>gi|295132871|ref|YP_003583547.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
gi|294980886|gb|ADF51351.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
Length = 819
Score = 560 bits (1443), Expect = e-156, Method: Compositional matrix adjust.
Identities = 307/771 (39%), Positives = 450/771 (58%), Gaps = 45/771 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKALSDV 75
++ PA+ + +A+P+GNG++GAMV+G V E ++LNE +L++G P NPDA L +
Sbjct: 28 YDAPAREWVEALPLGNGKIGAMVFGRVTDELIQLNESSLYSGGPVPQRINPDAASYLQPL 87
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDI----ELEFDDSHLKYAEETYRRELDLN 131
R + YA+AT + K+ G+ Y +GD+ +L+ D H Y+R L++
Sbjct: 88 REAIFDKDYAQATLLAKKMQGYYTQSYMPMGDLLLHQDLQNDSVH------AYKRSLNIE 141
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A + V +TRE F+S PD V+V K++ + +L+ N+S +S L V N
Sbjct: 142 NAITTTSFESDGVNYTREFFTSAPDNVLVMKLTADSAKALNLNLSAESQLRAEVSVTKNQ 201
Query: 192 QIIMEGRCPGKRIPPKANAN-------DDPKG---IQFSAILEIKISDDRGTISALEDKK 241
++++ G+ P P N DDP+G ++F +++ +D + T +D
Sbjct: 202 ELVVSGKAPANVNPNYYNPEGVEPITYDDPEGCDGMRFQYRIKVLKTDGKLTT---QDTS 258
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L + + V+LL A++SF+G P D + +Q+ SY+ L + H+ D+
Sbjct: 259 LAIADASEVVILLTAATSFNGFDKCPDKDGLDEAKLASEFMQAASAKSYAQLKSDHIADF 318
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYL 360
RV++ L ++PKD + P+ R+K++ + DP L L FQ+GRYL
Sbjct: 319 STYMQRVALDLGKTPKDQLDQ----------PTDSRLKAYSEGANDPELEALYFQYGRYL 368
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
L+S+SRPG ANLQGIWN+++ P W S NIN EMNYW + NLSE +P ++
Sbjct: 369 LVSASRPGGIAANLQGIWNKEMRPPWSSNYTTNINAEMNYWPAETTNLSEMHQPFLAYIQ 428
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSS--ADR--GKVVWALWPMGGAWLCTHLWE 476
++ G + A+ Y A GWV+HH +DIWA ++ DR G +WA W MGG WL HLWE
Sbjct: 429 NAAVTGGRVAKEFYDAPGWVVHHNSDIWATANPVGDRGDGDPLWANWYMGGNWLTLHLWE 488
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D +L + YP+++ A F LDWL+E HDG L T PSTSPE+ F+ +GK V
Sbjct: 489 HYAFTQDTSYL-AQVYPVMKEAAVFTLDWLVE-HDGKLITAPSTSPENLFLV-NGKGYAV 545
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ +TMD+AIIRE+F+ I A+++L K D ++ + RL P +I G + EW D
Sbjct: 546 TEGATMDIAIIRELFNNTIKASKILGKEAD-FRHELSAAQDRLIPYQIGAKGQLQEWYLD 604
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
F++ + HHRH+SHLFGL PG +I+ P+L KA EKT + RG+EG GWS WK AR
Sbjct: 605 FEEEDPHHRHVSHLFGLHPGTSISPLTTPELAKATEKTFELRGDEGTGWSKAWKINFAAR 664
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D +HAY+M++ L + VDP ++H +GG Y NLF AHPPFQID NFG TA +AEML+QS
Sbjct: 665 LLDGDHAYKMIRELMHYVDPYSKEH-KGGTYPNLFDAHPPFQIDGNFGATAGIAEMLLQS 723
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
L +L+LLPALP W +G V GLKARG V + W + L I+S S
Sbjct: 724 HLGELHLLPALP-QAWDTGSVTGLKARGNFKVDLAWNNHKLQNARIHSESS 773
>gi|423722949|ref|ZP_17697102.1| hypothetical protein HMPREF1078_01162 [Parabacteroides merdae
CL09T00C40]
gi|409241779|gb|EKN34546.1| hypothetical protein HMPREF1078_01162 [Parabacteroides merdae
CL09T00C40]
Length = 864
Score = 558 bits (1438), Expect = e-156, Method: Compositional matrix adjust.
Identities = 308/791 (38%), Positives = 436/791 (55%), Gaps = 51/791 (6%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN- 65
T + PL + ++ PA+++ +A+PIGNGR GAMV+GGV E L+LNE+TL++G P
Sbjct: 39 TPSKAPLTLWYDKPAQNWDEALPIGNGRAGAMVFGGVEKEQLQLNENTLYSGEPSVVFKD 98
Query: 66 -PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P+ V L+ +G+Y A+ K G YQ GD+ ++ +
Sbjct: 99 VKITPEMFDKVVGLMKAGKYKTASDLVCKNWLGRLHQYYQPFGDLHIQNNKPG---DAAG 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y+R L+++ A A Y V++ RE F+S+PD VIV + + ++ S
Sbjct: 156 YKRALNISDAVATTVYKQNGVKYEREVFASHPDNVIVMHLKSDTPNGIDISLDFTSPHPT 215
Query: 184 HSYVNGNNQIIMEGRCPGK----------------RIPPKANANDDPK------------ 215
++++I+ G+ PG + P +AN K
Sbjct: 216 ALQKGTDDRLILHGQAPGYVERRTFEQIEQWGDQYKHPELYDANGKRKFDKRMLYGDEIG 275
Query: 216 --GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
G+ F A L+ D + D + + +D +L ++SF+G +PS D
Sbjct: 276 GKGMFFEAQLKPVFPKD--GKCEITDAGIHIYNTDEVYFILSMATSFNGFDKSPSRDGID 333
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P++++ S L+ + Y L RH +DY+ LF RV +L SP+ +P
Sbjct: 334 PSAKAASILEKALSYDYQTLKQRHTEDYRSLFDRVDFELFSSPEQ-----------KAMP 382
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +R++ F + DP L LLFQFGRYL+IS SRP Q NLQGIWN+D P W+ +N
Sbjct: 383 TDKRLEQFAGNADPDLAALLFQFGRYLMISGSRPDGQPLNLQGIWNKDTIPAWNCGYTIN 442
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
IN EMNYW + NLSECQEPLF + LS++G++TA+ Y GWV HH T IW +S
Sbjct: 443 INTEMNYWPAELTNLSECQEPLFRMIRELSVSGAETARNMYNRRGWVAHHNTSIWRESLP 502
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
+ + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLI+ +G+
Sbjct: 503 NDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIDDGNGH 562
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
L T SPE+ FI DG+ A +S TMDMAIIRE F+ I+A+E+ +E + ++
Sbjct: 563 LVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIAASEMFNLDE-SFRNELK 621
Query: 574 KSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
L RL P +I + G + EW DFK+ E HRH SHL+G P IT +K P+L A K
Sbjct: 622 DKLARLLPYQIGKRGQLQEWIYDFKEWEPQHRHFSHLYGFHPSDQITPDKTPELFNAVRK 681
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL+ RG+ GWS+ WK WARL D HAY+++ LFN V + H GGL+ NL A
Sbjct: 682 TLELRGDLASGWSMGWKINCWARLLDGNHAYKIIANLFNPVGFGNSAHRGGGLFRNLLCA 741
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
HPPFQID NFG+TA V EML+QS ++LLPALP D W+ G V GLKARG +++ WK
Sbjct: 742 HPPFQIDGNFGYTAGVVEMLLQSHAGYIHLLPALP-DVWAEGSVSGLKARGNFEITMNWK 800
Query: 754 DGDLHEVGIYS 764
+G L E I+S
Sbjct: 801 NGKLTEANIHS 811
>gi|154489941|ref|ZP_02030202.1| hypothetical protein PARMER_00170 [Parabacteroides merdae ATCC
43184]
gi|154089383|gb|EDN88427.1| hypothetical protein PARMER_00170 [Parabacteroides merdae ATCC
43184]
Length = 846
Score = 558 bits (1437), Expect = e-156, Method: Compositional matrix adjust.
Identities = 308/792 (38%), Positives = 437/792 (55%), Gaps = 51/792 (6%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
T + PL + ++ PA+++ +A+PIGNGR GAMV+GGV E L+LNE+TL++G P
Sbjct: 20 GTPSKAPLTLWYDKPAQNWDEALPIGNGRAGAMVFGGVEKEQLQLNENTLYSGEPSVVFK 79
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEE 122
P+ V L+ +G+Y A+ K G YQ GD+ ++ ++
Sbjct: 80 DVKITPEMFDKVVGLMKAGKYKTASDLVCKNWLGRLHQYYQPFGDLHIQ---NNKPGDAA 136
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y+R L+++ A A Y V++ RE F+S+PD VIV + + ++ S
Sbjct: 137 GYKRALNISDAVATTVYKQNGVKYEREVFASHPDNVIVMHLKSDTPNGIDISLDFTSPHP 196
Query: 183 NHSYVNGNNQIIMEGRCPGK----------------RIPPKANANDDPK----------- 215
++++I+ G+ PG + P +AN K
Sbjct: 197 TALQKGTDDRLILHGQAPGYVERRTFEQIEQWGDQYKHPELYDANGKRKFDKRMLYGDEI 256
Query: 216 ---GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
G+ F A L+ D + D + + +D +L ++SF+G +PS
Sbjct: 257 GGKGMFFEAQLKPVFPKD--GKCEITDAGIHIYNTDEVYFILSMATSFNGFDKSPSRDGI 314
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP++++ S L+ + Y L RH +DY+ LF RV +L SP+ +
Sbjct: 315 DPSAKAASILEKALSYDYQTLKQRHTEDYRSLFDRVDFELFSSPEQ-----------KAM 363
Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
P+ +R++ F + DP L LLFQFGRYL+IS SRP Q NLQGIWN+D P W+ +
Sbjct: 364 PTDKRLEQFAGNADPDLAALLFQFGRYLMISGSRPDGQPLNLQGIWNKDTIPAWNCGYTI 423
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
NIN EMNYW + NLSECQEPLF + LS++G++TA+ Y GWV HH T IW +S
Sbjct: 424 NINTEMNYWPAELTNLSECQEPLFRMIRELSVSGAETARNMYNRRGWVAHHNTSIWRESL 483
Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
+ + WPM WLC+HLWEHY +T D FL+ AYPL++G A F DWLI+ +G
Sbjct: 484 PNDNVPTASFWPMVQGWLCSHLWEHYQFTQDETFLKNEAYPLMKGAAEFFADWLIDDGNG 543
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
+L T SPE+ FI DG+ A +S TMDMAIIRE F+ I+A+E+ +E + ++
Sbjct: 544 HLVTPVGVSPENRFITEDGQTAAMSMGPTMDMAIIRETFTRTIAASEMFNLDE-SFRNEL 602
Query: 573 LKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAE 632
L RL P +I + G + EW DFK+ E HRH SHL+G P IT +K P+L A
Sbjct: 603 KDKLARLLPYQIGKRGQLQEWIYDFKEWEPQHRHFSHLYGFHPSDQITPDKTPELFNAVR 662
Query: 633 KTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFA 692
KTL+ RG+ GWS+ WK WARL D HAY+++ LFN V + H GGL+ NL
Sbjct: 663 KTLELRGDLASGWSMGWKINCWARLLDGNHAYKIIANLFNPVGFGNSAHRGGGLFRNLLC 722
Query: 693 AHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
AHPPFQID NFG+TA V EML+QS ++LLPALP D W+ G V GLKARG +++ W
Sbjct: 723 AHPPFQIDGNFGYTAGVVEMLLQSHAGYIHLLPALP-DVWAEGSVSGLKARGNFEITMNW 781
Query: 753 KDGDLHEVGIYS 764
K+G L E I+S
Sbjct: 782 KNGKLTEANIHS 793
>gi|315649545|ref|ZP_07902630.1| Alpha-L-fucosidase [Paenibacillus vortex V453]
gi|315275018|gb|EFU38393.1| Alpha-L-fucosidase [Paenibacillus vortex V453]
Length = 796
Score = 556 bits (1434), Expect = e-155, Method: Compositional matrix adjust.
Identities = 307/769 (39%), Positives = 449/769 (58%), Gaps = 39/769 (5%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA + +A+P+GNGRLGAMV+GGV E ++ NEDTLW+G P D N +A + L+
Sbjct: 10 KLWYREPAAKWEEALPLGNGRLGAMVFGGVEEERIQWNEDTLWSGFPRDTNNYEARRHLA 69
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L+ SG+Y EA K+ G + + LGD+ + H E YRRELDL+T
Sbjct: 70 AARKLITSGKYKEAEELIEDKMVGRGTESFLPLGDLLIRQSGIHGHRTE--YRRELDLDT 127
Query: 133 ATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-VNGN 190
A V++ S G+ + R+ F S DQV V + +G + ++ LDS L + + +
Sbjct: 128 GIASVRFQSGGSATYARDMFISAVDQVAVIRCAGPNYEDIRLDIRLDSPLRHGTRRCAED 187
Query: 191 NQIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+++ G P K P + ++ GI++ + + D G ++ ++D+ + +
Sbjct: 188 GSLVLYGHAPTHIADNYKGDHPGSVLYEEGLGIRYE--MRLLALPDSGQVT-VDDRGMHI 244
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
GS LL+ A+++F G +P DP+ LQ Y +L RH+ D+Q L
Sbjct: 245 NGSGPVTLLIAAATNFAGFDRSPGSGGIDPSVICRKRLQDAVQHGYEELRARHVKDHQAL 304
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLIS 363
F RV ++L + C E + ++ + ER+K++ + EDP+L L+FQFGRYLL++
Sbjct: 305 FRRVDLRLE-------SLDC-ERSTESAATDERMKAYREGQEDPALEALMFQFGRYLLMA 356
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPGTQ A+LQGIWN + P W+S NIN EMNYW + +LSEC EPL + LS
Sbjct: 357 SSRPGTQPAHLQGIWNPHVQPPWNSDYTTNINTEMNYWPAETTHLSECHEPLIQMIRELS 416
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
++G +TA+++Y A GWV HH D+W +S G+ +WA WPMGGAWLC HLWE Y + D
Sbjct: 417 VSGRRTAKIHYGARGWVAHHNVDLWRMASPSDGRAMWAFWPMGGAWLCRHLWERYQFQPD 476
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
++L AYPL+ A F LDWLIE G+L T+PSTSPE++F+ +G VS STMD
Sbjct: 477 LEYLRGTAYPLMREAALFCLDWLIEDGKGHLVTSPSTSPENQFLTAEGVPCSVSAGSTMD 536
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
MAIIR++F I A+++L ++ D L E+ + RL P + +G +MEW++ +++ E
Sbjct: 537 MAIIRDLFHNCIEASQLLGQDAD-LREEWESAAARLLPYGMDGEGKLMEWSEPYREAEPG 595
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQ 660
HRH+SHL+GL+PG IT++ P L +AA +TL R G GWS W L+ARL
Sbjct: 596 HRHVSHLYGLYPGSDITLQGTPQLAEAAYRTLSSRISNGGGHTGWSCVWLINLFARLRQA 655
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
+ AY ++ L + ++ NL HPPFQIDANFG TA + EML+QS L +
Sbjct: 656 DKAYGYIRMLISR-----------SMHPNLLGDHPPFQIDANFGGTAGLVEMLLQSHLGE 704
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
L LLPALP+ W G VKGLKARGG +++ W G L + S + +
Sbjct: 705 LQLLPALPY-AWREGSVKGLKARGGFIINMEWSQGLLISASLTSTHGQH 752
>gi|373958368|ref|ZP_09618328.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
gi|373894968|gb|EHQ30865.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
Length = 827
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 310/775 (40%), Positives = 452/775 (58%), Gaps = 42/775 (5%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-T 64
S+ N K+ ++ PAK +T+A+P+GNGRLGAM++G V E ++LNE TLW+G P +
Sbjct: 18 SSFAQNSSKLWYSHPAKVWTEALPLGNGRLGAMIFGRVDQELIQLNEGTLWSGGPVKHNV 77
Query: 65 NPDAPKALSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL--EFDDSHLKYAE 121
NPDA L R +L+ Y +A A + K+ G ++ ++ LGD+ + +F ++ +
Sbjct: 78 NPDAYSYLLQTREALLKEENYVKAAALARKMQGVYSESFEPLGDVMISQKFKEA----SP 133
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y R+LD++ A + ++++ +FTR+ F S PDQVIV ++ S+ G L+F VS S L
Sbjct: 134 SAYYRDLDISDAVSTTRFTIDGTQFTRQMFISAPDQVIVIRLKASKPGQLNFKVSTKSQL 193
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRG 232
+ V +QI M G P P N N P +G++++ +L+ + G
Sbjct: 194 KFGNSVINGSQIAMLGHAPLHADPSYVNYNKTPVIYQDSTGKQGMRYALLLK---AVGNG 250
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
TI+ + L V+ +L L A++SF+G +P +D + L + +
Sbjct: 251 TITT-DTSGLSVKNGSDIILFLSAATSFNGFDKSPDKDGQDEVRIATQYLNTALKKDWQS 309
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVE 351
L+ HL DY + ++RV+ L+ +PKD +P+ ER+ + + +DP+L
Sbjct: 310 LFDAHLADYHRYYNRVTFNLA-APKDNTNAL--------LPTDERLIGYTRGTKDPALET 360
Query: 352 LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
L + +GRYLLIS SRPG ANLQGIWN + P W S NIN +MNYW S NLSE
Sbjct: 361 LYYNYGRYLLISCSRPGGAAANLQGIWNNIVRPPWSSNFTTNINTQMNYWPSEMTNLSEL 420
Query: 412 QEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGA 468
EPLF+ + +L++ G TA+ Y A GW +HH +DIWA S+ RG WA W MG
Sbjct: 421 NEPLFEQIKHLAVTGKATAKEFYHAEGWAVHHNSDIWALSNPVGDKRGDPKWANWSMGSP 480
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
WL HLW HY +T D+ FL+ AYPL++G A F L WL+E DG L T PS SPE++FI
Sbjct: 481 WLSQHLWTHYQFTGDKLFLKDTAYPLMKGAAQFCLSWLVENKDGLLVTAPSVSPENDFID 540
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
G VS ++TMDM+II ++F+ +I A VL + D + ++ +L P I + G
Sbjct: 541 DRGHEGSVSIATTMDMSIIWDLFTNVIEACNVLNTDRD-FRDLIIAKRAKLFPLHIGKKG 599
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
++ EW +D++D + HHRH+SHLFGL PG I+ PD +AA+KTL+ RG+EG GWS+
Sbjct: 600 NLQEWYKDWEDVDPHHRHVSHLFGLHPGREISPLTTPDFAEAAKKTLELRGDEGTGWSLA 659
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEG------GLYSNLFAAHPPFQIDAN 702
WK WARL D HAY +++ L + + G G Y NLF AHPPFQID N
Sbjct: 660 WKINFWARLLDGNHAYGLIRDLLRAAGAKIDPSASGKPGNGSGAYPNLFDAHPPFQIDGN 719
Query: 703 FGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
FG A + E+L+QS ++++ LLPALP D+W+SG + GLKARG V+I WKD L
Sbjct: 720 FGGVAGMTELLLQSQMSEIDLLPALP-DEWASGSILGLKARGNFEVAIIWKDHRL 773
>gi|313149824|ref|ZP_07812017.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
gi|313138591|gb|EFR55951.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
Length = 824
Score = 555 bits (1431), Expect = e-155, Method: Compositional matrix adjust.
Identities = 320/797 (40%), Positives = 446/797 (55%), Gaps = 63/797 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAP 69
N L + + PA ++ +A+P+GNG LGAMV+G E L+LNE TL++G P P
Sbjct: 25 NNLSLWYRQPAANWNEALPLGNGYLGAMVFGDAGREHLQLNESTLYSGEPFSGVGVPSIG 84
Query: 70 KALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
++V +L++ G YA A + + G + YQ L D+ L FD ++ E Y REL
Sbjct: 85 SVYNEVLALLNHGDYAGAHRLITRNWQGRLSQSYQPLADLFLSFD---VQGKVENYVREL 141
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
+L A ++Y G + +TRE+F SNPD+V+V +IS S ++ VS S
Sbjct: 142 NLQDAVHTIRYQAGGIRYTREYFISNPDRVMVIRISASRRSPVNVAVSYTSEHPTAKVDG 201
Query: 189 GNNQIIMEGRCPG---------------------------KRIPPKANANDDP---KGIQ 218
++I+ G+ PG +R K D KG+
Sbjct: 202 TGEELILSGQAPGCVERRTLDFLEKNRLTDRHPELFDSHGRRKTDKQVLYADEVGGKGMF 261
Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
F + +K+ T L+D +LKV G +LL+ A++S++G +PS D ++
Sbjct: 262 FQS--RVKVLKGNAT---LQDNQLKVSGEGEIILLVAAATSYNGFDRSPSQDGSDYQAKL 316
Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
+ L L Y DL RHL DYQ+LF RV++ L SE++ +P+ R+
Sbjct: 317 DTILSVAGQLPYEDLKKRHLADYQRLFGRVALTLK-----------SEKDYSGLPTDRRI 365
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
F+ + D +L LLFQ+GRYLLI+SSR G Q ANLQGIWN+D+ P W S+ +NIN EM
Sbjct: 366 IGFRDNPDNALAALLFQYGRYLLIASSREGGQPANLQGIWNKDVVPAWSSSYTININTEM 425
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
NYW + L EC EPLF + L++NGS TA Y GW HH T IW +S G+
Sbjct: 426 NYWPAETTGLPECSEPLFRLIRELAVNGSATAAKMYNLPGWTSHHITSIWRESGPADGEP 485
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
W +W M WLC HLW+HY ++ D+ FL + AYPL+ A F WL+E DG +T
Sbjct: 486 TWFMWNMSAGWLCRHLWDHYLFSGDKKFLRETAYPLMRDAARFYNAWLVE-KDGMWQTPL 544
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE-----DALVEKVL 573
SPE++F+ P+ K + V+ + MDMAIIRE+FS AA +L + D L+ V+
Sbjct: 545 GVSPENQFLTPEKKTSAVAPAPAMDMAIIRELFSNTAEAAAILAADSILPPADTLLLHVM 604
Query: 574 KSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+ +L P +I + G IMEW++DF + E HHRHLSHL+G PG IT K P+L A +
Sbjct: 605 GA-KQLVPYRIGKRGQIMEWSEDFDEVEPHHRHLSHLYGFHPGCEITPGKTPELVSAVRR 663
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVD--PEHEKHFEGGLYSNLF 691
TL+ RG+E GWS+ WK +WAR+HD HAYR+++ LF D PE +H GGLY NLF
Sbjct: 664 TLELRGDEATGWSMGWKINMWARMHDGNHAYRIIRNLFTPTDFGPEVNRH--GGLYKNLF 721
Query: 692 AAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSIC 751
AHPPFQID NFG+TA VAEML+QS + +LPALP D W+ G V GL+ARGG + I
Sbjct: 722 DAHPPFQIDGNFGYTAGVAEMLLQSHDGVIDVLPALP-DVWAEGKVTGLRARGGFIIDIT 780
Query: 752 WKDGDLHEVGIYSNYSN 768
W V ++S N
Sbjct: 781 WSKSGKTVVKVFSEQGN 797
>gi|332662390|ref|YP_004445178.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332331204|gb|AEE48305.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 801
Score = 555 bits (1430), Expect = e-155, Method: Compositional matrix adjust.
Identities = 310/759 (40%), Positives = 448/759 (59%), Gaps = 34/759 (4%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAPKALSDVRSL 78
PAKHF +++ +GNGR+GA+V GGV S+ + LN+ TLW G P D NP A L +R
Sbjct: 34 PAKHFEESLVLGNGRIGAVVHGGVKSDKIFLNDATLWAGSPVDPDMNPAAHTHLPAIREA 93
Query: 79 VDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
+ Y +A + + + L G ++ Y LG + + D +H + A YRR+LDL+TA +
Sbjct: 94 LRQEDYRKADSLNRRHLQGKFSESYAPLGTMYI--DMAHTETASN-YRRQLDLSTAISTT 150
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
Y V +TRE+F S+P QV++ +++ S+ G LSFN+ +SLL H N + G
Sbjct: 151 SYQQAGVTYTREYFISHPQQVLLIRMTASQLGKLSFNLRFNSLL-RHQVNTSTNVLNASG 209
Query: 198 RCPGKRIP-----PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
R P P P DD K ++F ++++I +D + + D + V+G A++
Sbjct: 210 RAPAHAEPSYRRVPDPIQYDDQKSMRFLSLVKIIKTDGKIVRT---DSTIGVQGGKEAII 266
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
++ ++SF+G NP+ KD + + L+ + +SY+ + H+ D+Q+ F+RV QL
Sbjct: 267 MVSIATSFNGFDQNPALHGKDEVTLANEWLKKAQIISYATIKAAHIKDHQQFFNRVQFQL 326
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
+ + ++P+ ER+K F + +DP L L F FGRYLLI+SSR
Sbjct: 327 AGRSSNA-----------SLPTDERLKRFAEGAKDPDLELLYFNFGRYLLIASSRTPQVP 375
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN L P W S +NIN EMNYW + NLSE +PL FL L+ G+ TA+
Sbjct: 376 ANLQGIWNHHLQPPWSSNYTININTEMNYWPAESGNLSELHQPLLGFLGNLAKTGAVTAK 435
Query: 432 VNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
Y A GW H TDIWA S+ +G WA W MGGAWL THLWEH++YT D +L
Sbjct: 436 TFYNAGGWCAAHNTDIWAMSNPVGHFGQGSPSWANWNMGGAWLATHLWEHFDYTRDTIWL 495
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ Y L++G A F LD L++ G L T+PSTSPE+ FI P G Y +T D+ +I
Sbjct: 496 KTYGYGLMKGAAQFCLDILVDDGKGNLVTSPSTSPENIFITPSGYKGATLYGATADLGMI 555
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
RE+F I+AA+ L ++ D +++ SL +L P +I++ G + EW D++D + HRH
Sbjct: 556 RELFLQTIAAAKTLVQDAD-FQQQLEASLSKLYPYQISKKGHLQEWYHDWEDEDPKHRHQ 614
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHLFGL+PG+ I++++ P+L A ++TL+ +G+E GWS W+T LWARL D Y+M
Sbjct: 615 SHLFGLYPGNHISVDQTPELAAACKQTLEVKGDETTGWSKGWRTNLWARLRDGNRTYKMY 674
Query: 668 KRLFNLVDPEHEKHFE--GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
+ L VDP E + GG Y NL AHPPFQID NFG TAAV EMLVQS ++ LLP
Sbjct: 675 RELMRFVDPNPETRYNGGGGAYPNLMDAHPPFQIDGNFGGTAAVLEMLVQSRSEEITLLP 734
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP D W++G V+G+ ARGG +++ W G L + I S
Sbjct: 735 ALP-DAWATGSVRGVCARGGFVLNLTWSAGKLTKTEISS 772
>gi|333379822|ref|ZP_08471540.1| hypothetical protein HMPREF9456_03135 [Dysgonomonas mossii DSM
22836]
gi|332884726|gb|EGK04982.1| hypothetical protein HMPREF9456_03135 [Dysgonomonas mossii DSM
22836]
Length = 813
Score = 554 bits (1427), Expect = e-155, Method: Compositional matrix adjust.
Identities = 317/762 (41%), Positives = 451/762 (59%), Gaps = 47/762 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PAK + +A+P+GN RLGAMV+G E L+LNE+T+W G P +P+ K L
Sbjct: 24 KLLYKRPAKEWVEALPLGNSRLGAMVFGNPAREQLQLNEETMWGGGPHRNDSPNMLKVLD 83
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+VRSL+ +G+ EA A K P + YQ +G + L+F H KY+ Y R+LDL
Sbjct: 84 EVRSLIFAGKEKEAEALLEKNMRTPHNGMPYQTIGSLYLDFA-GHNKYS--NYSRQLDLT 140
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA A KY+V + +TRE FSS D VI+ +I+ + S+SF DS + ++ +
Sbjct: 141 TAVATTKYTVDGINYTREVFSSFTDNVIIMRITADKPNSISFTAGYDSPVKDYKVQAKGD 200
Query: 192 QIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
++I++G ++ KG I+F +IK G +E KL V+ ++
Sbjct: 201 KLILKGM---------GAEHEGIKGVIRFENQTQIKT---EGGSVKVESNKLSVKAANSV 248
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + +++F +N D + ++ + L++ + Y H+ Y+K F RVS+
Sbjct: 249 VIYISIATNF----VNYQDVSANESTSATHFLKTAISKPYEKALADHIKYYKKQFDRVSL 304
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L +S D+ EE + RV++F+ +D SLV LLFQFGRYLLISSS+PG Q
Sbjct: 305 DLGKS------DSILEE------TDVRVRNFKEGKDQSLVTLLFQFGRYLLISSSQPGGQ 352
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ L P WDS +NIN EMNYW + NLSE +PLF L L++ G +TA
Sbjct: 353 PANLQGIWNDQLVPPWDSKYTININTEMNYWPAEVTNLSETHQPLFQMLKELAVTGQETA 412
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+V Y A+GWV HH TD+W + G +WP GGAWL H+W+HY YT D+ FL K
Sbjct: 413 KVMYNANGWVAHHNTDLWRTTGPVDG-AFHGMWPNGGAWLSQHMWQHYLYTGDKSFL-KE 470
Query: 491 AYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
AYP+L+G A F LD+L+E H Y + T+PSTSPE P GK ++ STMD I+
Sbjct: 471 AYPVLKGAADFFLDFLVE-HPTYKWMVTSPSTSPEQ---GPPGKNTSITAGSTMDNQIVF 526
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
+V + + A++ L ++A +K+ + RL P +I + + EW D+ DP+ HRH+S
Sbjct: 527 DVLNNALEASKTLGVGDEAYNQKLEDMISRLAPMQIGKYNQLQEWLGDWDDPKNDHRHVS 586
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVK 668
HL+GL+P + I+ +P L +AA+ +L RG+ GWSI WK WARL D HAY+++
Sbjct: 587 HLYGLYPSNQISPYSHPTLFQAAKNSLLYRGDMATGWSIGWKINFWARLLDGNHAYKIIS 646
Query: 669 RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALP 728
+ +LV+P + +G Y NLF AHPPFQID NFGFTA VAEML+QS ++LLPALP
Sbjct: 647 NMLSLVEPGNN---DGRTYPNLFDAHPPFQIDGNFGFTAGVAEMLLQSHDGAIHLLPALP 703
Query: 729 WDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
DKW +G VKGL ARGG E S+ W DG++ V I S N
Sbjct: 704 -DKWKNGSVKGLMARGGFEISSMDWSDGEISSVTITSKLGGN 744
>gi|424665666|ref|ZP_18102702.1| hypothetical protein HMPREF1205_01541 [Bacteroides fragilis HMW
616]
gi|404573919|gb|EKA78670.1| hypothetical protein HMPREF1205_01541 [Bacteroides fragilis HMW
616]
Length = 821
Score = 551 bits (1421), Expect = e-154, Method: Compositional matrix adjust.
Identities = 319/797 (40%), Positives = 445/797 (55%), Gaps = 63/797 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAP 69
N L + + PA ++ +A+P+GNG LGAMV+G E L+LNE TL++G P P
Sbjct: 22 NNLSLWYRQPAANWNEALPLGNGYLGAMVFGDAGREHLQLNESTLYSGEPFSGVGVPSIG 81
Query: 70 KALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
++V +L++ G YA A + + G + YQ L D+ L FD ++ E Y REL
Sbjct: 82 SVYNEVLALLNHGDYAGAHRLITRNWQGRLSQSYQPLADLFLSFD---VQGKVENYVREL 138
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
+L A ++Y + +TRE+F SNPD+V+V +IS S ++ VS S
Sbjct: 139 NLQDAVHTIRYQAEGIRYTREYFISNPDRVMVIRISASRRSPVNVAVSYTSEHPTAKVDG 198
Query: 189 GNNQIIMEGRCPG---------------------------KRIPPKANANDDP---KGIQ 218
++I+ G+ PG +R K D KG+
Sbjct: 199 TGEELILSGQAPGCVERRTLDFLEKNRLTDRHPELFDSHGRRKTDKQVLYADEVGGKGMF 258
Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
F + +K+ T L+D +LKV G +LL+ A++S++G +PS D ++
Sbjct: 259 FQS--RVKVLKGNAT---LQDNQLKVSGEGEIILLVAAATSYNGFDRSPSQDGSDYQAKL 313
Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
+ L L Y DL RHL DYQ+LF RV++ L SE++ +P+ R+
Sbjct: 314 DTILSVAGQLPYEDLKKRHLADYQRLFGRVALTLK-----------SEKDYSGLPTDRRI 362
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
F+ + D +L LLFQ+GRYLLI+SSR G Q ANLQGIWN+D+ P W S+ +NIN EM
Sbjct: 363 IGFRDNPDNALAALLFQYGRYLLIASSREGGQPANLQGIWNKDVVPAWSSSYTININTEM 422
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
NYW + L EC EPLF + L++NGS TA Y GW HH T IW +S G+
Sbjct: 423 NYWPAETTGLPECSEPLFRLIRELAVNGSVTAAKMYNLPGWTSHHITSIWRESGPADGEP 482
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
W +W M WLC HLW+HY ++ D+ FL + AYPL+ A F WL+E DG +T
Sbjct: 483 TWFMWNMSAGWLCRHLWDHYLFSEDKKFLRETAYPLMRDAARFYNAWLVE-KDGMWQTPL 541
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE-----DALVEKVL 573
SPE++F+ P+ K + V+ + MDMAIIRE+FS AA +L + D L+ V+
Sbjct: 542 GVSPENQFLTPEKKTSAVAPAPAMDMAIIRELFSNTAEAAAILAADSILPPADTLLLHVM 601
Query: 574 KSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+ +L P +I + G IMEW++DF + E HHRHLSHL+G PG IT K P+L A +
Sbjct: 602 GA-KQLVPYRIGKRGQIMEWSEDFDEVEPHHRHLSHLYGFHPGCEITPGKTPELVSAVRR 660
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVD--PEHEKHFEGGLYSNLF 691
TL+ RG+E GWS+ WK +WAR+HD HAYR+++ LF D PE +H GGLY NLF
Sbjct: 661 TLELRGDEATGWSMGWKINMWARMHDGNHAYRIIRNLFTPTDFGPEVNRH--GGLYKNLF 718
Query: 692 AAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSIC 751
AHPPFQID NFG+TA VAEML+QS + +LPALP D W+ G V GL+ARGG + I
Sbjct: 719 DAHPPFQIDGNFGYTAGVAEMLLQSHDGVIDVLPALP-DVWAEGKVTGLRARGGFIIDIT 777
Query: 752 WKDGDLHEVGIYSNYSN 768
W V ++S N
Sbjct: 778 WSKSGKTVVKVFSEQGN 794
>gi|414868291|tpg|DAA46848.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
Length = 567
Score = 551 bits (1420), Expect = e-154, Method: Compositional matrix adjust.
Identities = 285/500 (57%), Positives = 350/500 (70%), Gaps = 30/500 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 41 PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS VRSLV++G+Y EAT+A+ L G V+Q LGDI+L F + +KY YRRELDL+
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSGDQTQVFQPLGDIDLVFGED-IKYTN--YRRELDLH 157
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT V Y+VG++ +TREHFSSNP QVIVTKIS ++ G++SF VSL S LD+ V N
Sbjct: 158 TATVTVTYTVGDIVYTREHFSSNPHQVIVTKISANKPGNVSFTVSLTSPLDHKIRVTHAN 217
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG+R A D P GI+FSAIL ++I+ T+ L D LK++ +D V
Sbjct: 218 EIIMEGSCPGQRPEEIKTAADQPIGIKFSAILYLQINGANSTVEVLNDNMLKLDCADSVV 277
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF FI PS+SK DPT + + L R SYS L H+DDYQ LF RVS+Q
Sbjct: 278 LLLAATTSFQSAFIKPSESKLDPTVSAFTTLSIARRTSYSQLKAYHIDDYQTLFQRVSLQ 337
Query: 312 LS-------RSPKDIVTDTCSEENIDTV--------------------PSAERVKSFQTD 344
LS R + + + S + + P+ ER+ +F+ +
Sbjct: 338 LSQGSNYDLRRSRLVQSAETSSQGANVSDYGFQISGCTRLTSLNSFVKPTVERIVTFKDN 397
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSL 404
EDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+APH NINL+MNYW +L
Sbjct: 398 EDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDAAPHPNINLQMNYWPAL 457
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
PCNLSECQEPLFDF+ LSING+KTA+VNY ASGWV H TD+WAK+S D G VWALWP
Sbjct: 458 PCNLSECQEPLFDFIGSLSINGAKTAKVNYEASGWVSHQVTDLWAKTSPDAGDPVWALWP 517
Query: 465 MGGAWLCTHLWEHYNYTMDR 484
MGG WL THLWEHY +T+D+
Sbjct: 518 MGGPWLATHLWEHYCFTLDK 537
>gi|284036792|ref|YP_003386722.1| alpha-L-fucosidase [Spirosoma linguale DSM 74]
gi|283816085|gb|ADB37923.1| Alpha-L-fucosidase [Spirosoma linguale DSM 74]
Length = 825
Score = 550 bits (1416), Expect = e-153, Method: Compositional matrix adjust.
Identities = 315/811 (38%), Positives = 453/811 (55%), Gaps = 42/811 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
LK+ + PA +T+A+P+GNGR+GAM++G V E ++LNE TLW+G P NP++P
Sbjct: 23 LKLWYTKPAAVWTEALPVGNGRIGAMIFGKVEDELIQLNESTLWSGGPVSGNVNPESPSY 82
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
L VR ++ Y +A K+ G Y LGD+ L+ +L A T Y R+LD+
Sbjct: 83 LPQVREALNREDYKQAVTLVKKMQGLYTQSYMPLGDLSLK---QNLNGATPTGYYRDLDI 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +++ V + RE F+S PD V+V +++ S+ G LSF+ S S L + N
Sbjct: 140 QKALATTRFTANGVTYKREMFTSAPDGVMVIRLTASKPGQLSFDASTSSQLRAENMRGSN 199
Query: 191 NQIIMEGRCPGKRIPPKANANDDP----------KGIQFSAILEIKISDDRGTISALEDK 240
++M+G+ P + P N D KG++F L +K + GT+ + +
Sbjct: 200 GDLVMKGKAPTQVDPNYYNPKDREHVIYEDATGCKGMRFQ--LRLKALNKGGTVQT-DKE 256
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ V + +L + A++SF+G P KD + ++ SY L RH D
Sbjct: 257 GIHVRNASEVLLFVAATTSFNGYDKCPDKDGKDENKLAEELIRKATATSYQALLNRHTAD 316
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRY 359
YQ F+R S Q +TDT S +PS ER++ + DP + L Q+GRY
Sbjct: 317 YQSYFNRFSFQ--------ITDTTSVNKNAALPSDERLEMYSKGVYDPGIETLYCQYGRY 368
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSR ANLQGIWN++L W S +NIN +MNYW NLSE PL F+
Sbjct: 369 LLISSSRVTNVPANLQGIWNKELRAPWSSNYTININTQMNYWPVEVTNLSELHRPLLSFI 428
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTHLW 475
L+ G+ TA+ Y +GWV+HH TDIWA S+ D+G+ WA W G WL HLW
Sbjct: 429 GELAKTGAVTAKEFYNMNGWVVHHNTDIWAISNPVGDKGQGDPKWANWNQGAGWLSQHLW 488
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY +T D+ FL + AYP+++G A F LDWL+ DGYL +PS SPE++FI G+ A
Sbjct: 489 EHYRFTGDKKFLRESAYPIMKGAAEFYLDWLVADKDGYLVVSPSVSPENDFIDAKGQPAS 548
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+S ++TMDM+I+ ++F+ +I A+ VL D + +++ + P I G++ EW++
Sbjct: 549 ISVATTMDMSIMWDLFTNLIDASTVLNIEPD-FRKMLIEKRSKFYPLHIGHKGNLQEWSK 607
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
DF+D + HRH+SHLFGL PG I+ P+ AA++TL+ RG+ G GWS WK WA
Sbjct: 608 DFEDVDPQHRHVSHLFGLHPGRQISPISTPEFAAAAKRTLELRGDAGTGWSRAWKVNFWA 667
Query: 656 RLHDQEHAYRMVKRLFNL---VDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
RL D HAY++++ L + + GG Y N F AHPPFQID NFG TA +AEM
Sbjct: 668 RLLDGNHAYKLLRELLRYTSQTNTNYSSQGGGGTYPNFFDAHPPFQIDGNFGGTAGMAEM 727
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHD 772
LVQS L+ ++LL ALP D W G V GL+ARGG +++ WK+ L + S + +
Sbjct: 728 LVQSHLDAIHLLAALP-DAWRDGRVSGLRARGGFELAMQWKNRRLTTATVKS--LDGEPC 784
Query: 773 SFKTLH-YRGTSVKVNLSA---GKIYTFNRQ 799
+ +T R VKV A G + TFN Q
Sbjct: 785 TLRTSEPIRIKGVKVESKATNLGYVTTFNTQ 815
>gi|389792551|ref|ZP_10195739.1| alpha/beta hydrolase domain-containing protein [Rhodanobacter
fulvus Jip2]
gi|388436250|gb|EIL93122.1| alpha/beta hydrolase domain-containing protein [Rhodanobacter
fulvus Jip2]
Length = 791
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 320/804 (39%), Positives = 456/804 (56%), Gaps = 53/804 (6%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ ++ PA + +A+P+GNG +GAMV+GGVP E ++LN TLW G P DY A L
Sbjct: 25 LVYDKPASQWNEALPLGNGLMGAMVFGGVPDERVQLNLGTLWGGAPNDYIAQGAASRLKP 84
Query: 75 VRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
++ L+ SG+ A+A A S G P + +Q GD+ L ++ K Y+REL L+
Sbjct: 85 IQKLIFSGKVAQAEALSAGFMGDPKLLMPFQPFGDLHLHVEN---KGKVSDYQRELRLDD 141
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-VNGNN 191
A + V Y+V V F RE F S PD+V+V +S + + +F V+L S + G +
Sbjct: 142 AISTVSYAVDGVHFRRETFMSYPDRVLVMHLSADQPAAQNFTVTLTSPQPGAKVALVGKD 201
Query: 192 QIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
I + G+ + P + K G+ ++ L IK G+I D L+V G+D
Sbjct: 202 TIALTGQIEPRTNPASSWTGSWSKPGMTYAGRLVIKTKG--GSIRQAGDH-LEVRGADAV 258
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L+ ++SF + D + + + + L SY L HL DY+ LF RV +
Sbjct: 259 TLVFSGATSFK----SYRDISGNAEAAARAPLDKAVQRSYEALKNAHLADYRALFDRVHL 314
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+L D S EN+ T +R++ F+T +DPSLV L +Q+GRYLLISSSR G Q
Sbjct: 315 RLG--------DDASRENVAT---DKRIRDFKTHDDPSLVALYYQYGRYLLISSSRAGGQ 363
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+DL P W S NINLEMNYW + L E Q PL+D + L + G+KTA
Sbjct: 364 PANLQGIWNQDLLPAWGSKWTTNINLEMNYWPAETGALWETQTPLWDLIDDLQVAGAKTA 423
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
Q Y A GWV+HH +D+W ++ G W LWPMGG WL +W+HY ++ D FL R
Sbjct: 424 QRYYGAHGWVLHHNSDLWRATTPVDGP--WGLWPMGGVWLSNQMWDHYTFSGDETFLRNR 481
Query: 491 AYPLLEGCASFLLDWLIEGHD-----GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
AYP ++G A F+LD+L+E G L TNPSTSPE+ ++ GK ++Y+ TMD+
Sbjct: 482 AYPAMKGAAEFVLDFLVEAPKGSPVAGKLVTNPSTSPENRYLL-GGKPVGLTYAPTMDIE 540
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+I ++F+ + +AA L + ALV ++ + PRL P +I G + EW +D+ + E HR
Sbjct: 541 LINDLFNHVRAAARHLGVDA-ALVSRIDAAQPRLPPLQIGHKGQLQEWIEDYPETEPDHR 599
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+ L+PG I+ ++ P L KAA ++L+ RG+ G GW+ WKTALWARL D +HAYR
Sbjct: 600 HVSHLYALYPGDAISPDRTPALAKAARRSLELRGDGGTGWARAWKTALWARLGDGDHAYR 659
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ H+ E L N+F PPFQID NFG TAA+AEML+QS + ++ +LP
Sbjct: 660 LL----------HDLIAENTL-PNMFDDCPPFQIDGNFGGTAAIAEMLMQSRIGEITVLP 708
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVK 785
ALP +W G V GL+ARGG V I W+ G EV + S + + H L Y+ +
Sbjct: 709 ALP-SRWQDGEVDGLRARGGLRVGITWRKGVPTEVRLLSTTATSVH-----LRYQHQRIV 762
Query: 786 VNLSAGKIYTFN--RQLKCTNLHQ 807
V L GK T R + TN Q
Sbjct: 763 VALEPGKELTVGAARLMPSTNGRQ 786
>gi|436835055|ref|YP_007320271.1| Alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
gi|384066468|emb|CCG99678.1| Alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
Length = 874
Score = 549 bits (1415), Expect = e-153, Method: Compositional matrix adjust.
Identities = 310/787 (39%), Positives = 436/787 (55%), Gaps = 55/787 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
L + ++ PA +T+A+PIGNG +GAM++GGV E L+LNE TL++G P G +T D K
Sbjct: 32 LTLWYDKPAAAWTEALPIGNGYMGAMLFGGVEQEHLQLNEGTLYSGDPSGTFTAIDVRKK 91
Query: 72 LSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
V SLV G Y EA + G YQ LGD+ + F + YRR LDL
Sbjct: 92 FKAVDSLVKQGNYKEAQNLVAADWLGRNHQDYQPLGDLWMAFTHTG---PVTKYRRSLDL 148
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKI--SGSES--GSLSFNVSLDSLLDNHSY 186
+T ++++Y+V N + RE F+S PD+VIV ++ G E+ G + F+ L Y
Sbjct: 149 STGISQIQYTVANTTYRREIFASYPDRVIVIRLLAEGKETINGEIRFSTPHKPLA---RY 205
Query: 187 VNGNNQIIMEGRCPG---------------KRIPPKANAND--------------DPKGI 217
+Q+IM G+ PG + P+ A D D G
Sbjct: 206 SASADQLIMAGKAPGFVLRRTVKLVQKLGDQHKYPEVFAKDGSVLPNASDVLYGADATGW 265
Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE 277
++ + GT+ A D+ +K+ G+ +L+L ++SF+G +P +P +
Sbjct: 266 GMGFEARLRATQQGGTLQA-TDQTIKISGAREVLLVLTCATSFNGFDKSPVTQGLNPAAS 324
Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
+ L S+ SY DL HL DYQ LF R +Q+ T S+++ T + +R
Sbjct: 325 TQKYLASVAGRSYDDLAKTHLSDYQHLFSRSQLQIG---------TVSDQSART--TDQR 373
Query: 338 VKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLE 397
+ F +D SLV LL+QFGRYL+I+ SRPG Q NLQGIWN+ + P W+ A VNIN +
Sbjct: 374 IALFANGKDQSLVGLLYQFGRYLMIAGSRPGGQPLNLQGIWNDKVIPPWNGAYTVNINAQ 433
Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK 457
MNYW + NLSEC EP + L+ING+ TA+ Y +GWV+HH TDIW + +
Sbjct: 434 MNYWPAELTNLSECHEPFLTAVRELAINGAVTARAMYGNNGWVVHHNTDIW-RHTEPVDY 492
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
A WPM G WL +H WE Y + D FL YPLL+G F DWLI DGYL T
Sbjct: 493 CNCAFWPMAGGWLTSHFWERYLFRGDTTFLRTDVYPLLKGVVLFYKDWLIPNKDGYLVTP 552
Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
SPEH F+ +G+ + +S TMDMAIIRE F+ I A++ L +E L +++ L
Sbjct: 553 IGHSPEHAFVYGNGQTSTLSPGPTMDMAIIRESFTRFIEASDKLGTSEQPLYDEIKAKLA 612
Query: 578 RLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQK 637
+L P +I + G + EW DF+D E HRH+SHL+G P + I P+L A ++++
Sbjct: 613 KLLPYQIGKYGQLQEWQFDFEDGEKEHRHISHLYGFHPSNQINPYTTPELTAAVATSMER 672
Query: 638 RGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
RG++ GWS+ WK ++ARL D + A++++ L +LV + K GGLY NLF AHPPF
Sbjct: 673 RGDKATGWSMGWKINVYARLQDGDKAHKLLTNLVHLVQEDGTKMVGGGLYPNLFDAHPPF 732
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID NFG TA +AEMLVQS D+ LLPALP W +G + GL+ARGG V I W + L
Sbjct: 733 QIDGNFGATAGIAEMLVQSHAGDIQLLPALP-KAWPNGKITGLRARGGFVVDIEWANSRL 791
Query: 758 HEVGIYS 764
+ I S
Sbjct: 792 RKATIRS 798
>gi|237722004|ref|ZP_04552485.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|423291145|ref|ZP_17269993.1| hypothetical protein HMPREF1069_05036 [Bacteroides ovatus
CL02T12C04]
gi|229448873|gb|EEO54664.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|392664179|gb|EIY57721.1| hypothetical protein HMPREF1069_05036 [Bacteroides ovatus
CL02T12C04]
Length = 792
Score = 549 bits (1414), Expect = e-153, Method: Compositional matrix adjust.
Identities = 306/770 (39%), Positives = 439/770 (57%), Gaps = 51/770 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ +N PA + +A+PIGNGR+GAMV+G E +LNE+++W+G P D+ NP A AL
Sbjct: 27 KLWYNAPATVWEEALPIGNGRIGAMVYGNPLQEVYQLNEESIWSGYPQDWNNPKAANALP 86
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VR VD G YA+A+ K + L L D A YR EL+++ A
Sbjct: 87 QVREAVDRGDYAKASEL-WKANAQGPYTARYLPMANLMLDQLTRGEARNLYR-ELNISNA 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+ V Y V++ R F S PDQV+V KI+ ++S ++ L+SLL G +
Sbjct: 145 LSTVTYEADGVKYRRTSFISYPDQVMVIKIAADRPQAVSLHIRLNSLLRYTVQTKGEKTL 204
Query: 194 IMEGRCPG----KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
I+ G+ P + P DD +G QF +++++ D G A D L V ++
Sbjct: 205 ILNGKAPAYVANRDYDPHQVVYDDKRGTQFK--VQVELLPDGGHCEA-NDSALTVRNANE 261
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
VLLL A + F + K+ Y +L RH DD+Q+LF+R+
Sbjct: 262 VVLLLSAVTDFGNKKMTLKKCKR----------------PYQELLQRHTDDHQQLFNRLQ 305
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
+ L T+ +E +P+ ER+KSF+ D D L EL +Q+GRYLLI+SSRPG
Sbjct: 306 LSLG-------TENLQKE---ALPTNERLKSFEQDPTDNGLTELYYQYGRYLLIASSRPG 355
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIWN + P W S NIN EMNYW + NL EC PL DF+ L++NG++
Sbjct: 356 GLPANLQGIWNRHVQPPWGSNYTTNINTEMNYWPAEITNLPECFLPLSDFIGRLAVNGAQ 415
Query: 429 TAQVNY-LASGWVIHHKTDIWAKS-------SADRGKVVWALWPMGGAWLCTHLWEHYNY 480
TA+VNY + GW+ HH +D+WA++ S +G W+ WPM G WLC HLWEHY +
Sbjct: 416 TAKVNYGINRGWLAHHNSDVWAQTAPTGGYDSDPKGAPRWSCWPMAGVWLCQHLWEHYAF 475
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEF--IAPDGKL--AC 535
D+ +L K AYPL++G A FLL WL + + GY TNPSTSPE+ F I +GK
Sbjct: 476 GGDKKYLSKTAYPLMKGAAEFLLQWLQKDPETGYWITNPSTSPENRFRYIDKEGKKQNGE 535
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+S SS MD+ + ++ + I A+ VL+ ++ A ++ + L+P +I G ++EW +
Sbjct: 536 ISRSSGMDLGLAWDLLTNCIEASTVLDTDK-AFRQQCMDVRANLQPFRIGSKGQLLEWDK 594
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
+F++ + +HRH+SHLF L PG I E+ P+L A ++TL+ RG+ G GW++ WK WA
Sbjct: 595 EFEETDPNHRHVSHLFALHPGRQIIPEQQPELAAACQRTLEIRGDGGTGWAMAWKINFWA 654
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D HA+ M+K VD GG Y+NLF AHPPFQID NFG TA + EML+Q
Sbjct: 655 RLRDGNHAFGMLKNGLRYVDATQVSVRGGGTYANLFDAHPPFQIDGNFGGTAGITEMLLQ 714
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
S ++LLPALP D W SG +KG++ARGG T+ + WK+ + + + S+
Sbjct: 715 SHAGYIHLLPALP-DNWQSGSIKGVRARGGFTIDMEWKESRITRLSVTSH 763
>gi|376260116|ref|YP_005146836.1| hypothetical protein [Clostridium sp. BNL1100]
gi|373944110|gb|AEY65031.1| hypothetical protein Clo1100_0760 [Clostridium sp. BNL1100]
Length = 775
Score = 548 bits (1413), Expect = e-153, Method: Compositional matrix adjust.
Identities = 313/801 (39%), Positives = 450/801 (56%), Gaps = 57/801 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA+ + +A+PIGNGRLG MV GG+ E + LN DTLW+G+PG + N + L V+
Sbjct: 7 YKSPARIWEEALPIGNGRLGGMVHGGISQECIDLNNDTLWSGLPGQHINKNILPVLPKVQ 66
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
LV+ G+ EA + + Y LG + L ++ L + Y R L LNTA
Sbjct: 67 RLVNQGKNYEAQKLIEENILTGYSQSYLPLGRLLLTYE---LSGDAKGYNRSLSLNTAVC 123
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+Y+ G V + RE S PD V+ I+ +SG+L+FN++LDS L + NN +IM
Sbjct: 124 ETRYTSGGVNYCREVICSYPDDVMAVHITADKSGALTFNITLDSQL-RYQIAKMNNTLIM 182
Query: 196 EGRCPGKRIPPKANAN--------DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
G CP IP A+ + + I+FS + + +G ++ ++ V +
Sbjct: 183 TGDCPSCMIPDYVEADKHLIYDHEEYSRSIRFSVGMRANV---KGGSLIVDADRISVTAA 239
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +L+L ++++F+G P S DP ++ M L + S+++L +RH D+ LF R
Sbjct: 240 DEVLLILSSTTNFEGFDKMPGSSGNDPLTKCMRILDNTVGYSWNELLSRHKADHAALFER 299
Query: 308 VSIQL-SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
V + L ++SP +P+ +R+ ++ DPSL LLF +GRYLLI+ S
Sbjct: 300 VCLDLGTQSP---------------MPTDKRLAAYAAGHHDPSLDSLLFAYGRYLLIACS 344
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGTQ ANLQGIWN++L+ W S NIN EMNYW + NL EC PLFD L +S
Sbjct: 345 RPGTQAANLQGIWNKELTAPWSSNYTTNINTEMNYWPAETANLPECHIPLFDLLKDVSKA 404
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
GS+ + V+Y G+V+HH TD+W +S+ G+ W WPMGGAWL H+ EHY ++ D D
Sbjct: 405 GSEISLVHYGCRGFVLHHNTDLWRMASSVSGQARWGFWPMGGAWLSIHIMEHYRFSCDTD 464
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL+ Y + E FLLD+L +GY TNPSTSPE+ FI DG++ ++ STMD+A
Sbjct: 465 FLKDYYYIMREAVL-FLLDYLKPDDNGYFLTNPSTSPENAFIDADGRICSITKGSTMDLA 523
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
IIRE+F + I A +L K + L + + L +L P +I G ++EW ++ + E HR
Sbjct: 524 IIRELFESCIEAQSIL-KIDSYLSGLLAQRLCKLPPFQIGSKGQLLEWLDEYVEEEPGHR 582
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEH 662
H+SHLFGL+PG I+ P+L +A K+L++R G GWS W L+ARL D +
Sbjct: 583 HMSHLFGLYPGSVISPLHTPELAEACRKSLEQRLANGGGHTGWSCAWLICLYARLGDGNN 642
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AYR V +L +Y NLF AHPPFQID NFGFT + EML+QS +L+
Sbjct: 643 AYRFVNQLLTR-----------SVYPNLFDAHPPFQIDGNFGFTTGIIEMLLQSHKGELH 691
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN----NDHDSF---K 775
LLPALP D W +G V G+KARG TV I W++ L I + + ++F K
Sbjct: 692 LLPALP-DNWKNGSVTGIKARGNYTVDISWQNHHLIRAKITAGQNGVCRIRISEAFTADK 750
Query: 776 TLHYRGTSVKVNLSAGKIYTF 796
+ + SV VNLSA + F
Sbjct: 751 YVERKENSVLVNLSANESVNF 771
>gi|375145718|ref|YP_005008159.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361059764|gb|AEV98755.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 825
Score = 548 bits (1413), Expect = e-153, Method: Compositional matrix adjust.
Identities = 310/774 (40%), Positives = 452/774 (58%), Gaps = 38/774 (4%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NP 66
S + L + +N PA+ + +A+P+GNG +G M++G V E ++LNE TL++G P + NP
Sbjct: 23 SAQSGLSLWYNKPAEAWVEALPVGNGHIGGMIFGRVEEELIQLNESTLYSGGPVKQSINP 82
Query: 67 DAPKALSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
DA + L+ +R +L+ Y++A + K+ G+ + Y LGD+ L+ S Y+
Sbjct: 83 DAFQYLAPIREALLKEQDYSKANELAKKMQGYFTESYLPLGDLLLK--QSFNGRTPSAYQ 140
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL TA A +++V VE+TRE F S P V+V +I G++ +V+L+S L
Sbjct: 141 RRLDLQTAIATTRFTVDGVEYTREVFCSAPANVMVIRIRAGVPGAIDLSVALNSPLHYTI 200
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDP----------KGIQFSAILEIKISDDRGTIS 235
NN++IM G+ P P N D G++F +K GT++
Sbjct: 201 SAKANNEVIMSGKAPAHVDPSYYNPKDRQPVIYEDTAGCNGMRFQC--RVKAITKTGTVT 258
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A + L V+ + VL++ A++SF+G P K+ + + + + SY+ L
Sbjct: 259 A-DTLGLHVQHATELVLIVSAATSFNGFDKCPDKEGKNEQAIAAGLIDAAAKRSYTGLQQ 317
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENID-TVPSAERVKSFQTDE-DPSLVELL 353
H++D+Q+ F+RVS I+ DT + N + T+P +R++++ DP+L L
Sbjct: 318 DHVNDHQRYFNRVSF--------ILKDTGAASNTNSTLPVDKRLQAYSAGAYDPALETLY 369
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
+Q+GRYLLI++SRPG ANLQGIWN++L W S +NIN +MNYW + NLSE
Sbjct: 370 YQYGRYLLIAASRPGGPPANLQGIWNKELRAPWSSNYTININTQMNYWPAESTNLSEMHL 429
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW--AKSSADRGK--VVWALWPMGGAW 469
PL +L LS+ G++ A+ Y GWV HH +DIW A DRG VWA W MGG W
Sbjct: 430 PLLQWLKILSVTGARVAREFYHCDGWVAHHNSDIWGCANPVGDRGAGDPVWANWYMGGNW 489
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
LC HLWEHY +T D+ FL AYP+++ A F L+WL++ GY T PSTSPE++F
Sbjct: 490 LCQHLWEHYAFTQDKKFLAT-AYPIMKQAAVFTLNWLVKDSSGYWVTAPSTSPENKFRDE 548
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDG 588
G+ VS ++TMDM+IIR++F+ +I A+E L N D L L + + L P + G
Sbjct: 549 KGRAQAVSVATTMDMSIIRDLFTNVIEASEAL--NTDQLFRNRLTEVRKHLYPLRKGSKG 606
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
++EW ++F + + HRH+SHLFGL PG I+ P+ +AA+KTL+ RG+ G GWS
Sbjct: 607 ELLEWYKEFAETDPQHRHVSHLFGLHPGRQISQHNTPEFFEAAKKTLEIRGDAGTGWSRG 666
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
WK WARL D +HAY+++++L N + GG Y NLF AHPPFQID NF TA
Sbjct: 667 WKINWWARLLDGDHAYKLIRQLLNY--SGADGKGGGGTYPNLFDAHPPFQIDGNFAGTAG 724
Query: 709 VAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
+ EM++QS L +++LLPALP W G VKGLKARGG TV I W G LH+ I
Sbjct: 725 MTEMMLQSHLGEVHLLPALP-AAWKEGAVKGLKARGGFTVDILWAKGKLHKAMI 777
>gi|390452435|ref|ZP_10237963.1| candidate alpha-l-fucosidase; glycoside hydrolase family 95
[Paenibacillus peoriae KCTC 3763]
Length = 826
Score = 548 bits (1412), Expect = e-153, Method: Compositional matrix adjust.
Identities = 303/782 (38%), Positives = 441/782 (56%), Gaps = 63/782 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PL++ + PA+ + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA +
Sbjct: 8 QPLRLWYRQPAEVWEEALPVGNGRLGAMVFGGIREERLQLNEDTLWSGFPRDGVQYDALR 67
Query: 71 ALSDVRSLVDSGQYAEAT-AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRREL 128
L VR L+ +G+Y +A + + G + YQ LGD+ + + E T Y REL
Sbjct: 68 YLKPVRELIAAGKYKDAEHLINTHMLGCDTEAYQPLGDLWI----AQEGLGEITHYEREL 123
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--------DSL 180
DL T TA V + + +TRE +S+PD +I+ ++ + +G ++ +V + ++
Sbjct: 124 DLPTGTAAVAFHSDGIRYTREVIASSPDGIIMVSLTANRAGQINASVRITTPHPCEDEAG 183
Query: 181 LDNHSYV---------------NGNNQIIMEGRCPGKRIP------PKANANDDPKGIQF 219
D H V N I + GR P P++ + G+ F
Sbjct: 184 EDEHFAVLSQWDSDVAEGPSDEAARNCITLTGRAPSHVESNYHGDHPQSVVYEHDLGMAF 243
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM 279
+ ++ ++ + G ++ D + V G+D + L A++ F G P +
Sbjct: 244 A--VQARMVSEGGIVTTKADGTVIVSGADTLTIYLAAATGFRGFHTMPDSDPAESAEVCQ 301
Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
L + +L + RH D++ LF RV+++L DT +EE+I +P+ R++
Sbjct: 302 VTLDKVISLGSEQVRQRHEQDHRALFDRVALELG-------GDTRTEESI--LPTDLRLE 352
Query: 340 SF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
+ Q + DP L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +M
Sbjct: 353 RYKQGEADPRLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQM 412
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKV 458
NYW + CNL+EC EPL + +S G + A VNY A GW HH D+W + G
Sbjct: 413 NYWPAEVCNLAECHEPLLHMIGEISRTGRRVASVNYGAQGWAAHHNVDLWRYAGPSGGHA 472
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
WA WP+GG WL HLW+ Y +T D +L ++AYPL++G A+F +DWL+EG +G+L T+P
Sbjct: 473 SWAFWPLGGVWLTAHLWDRYLFTQDTAYLAEQAYPLMKGAAAFCMDWLVEGPNGWLVTSP 532
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
STSPE++FI P G+ +S STMDM +IRE+ I AA++LE +E+ + ++ R
Sbjct: 533 STSPENKFITPSGEECSISMGSTMDMTLIRELLGNCIQAADLLELDEE-FRNRCEETQQR 591
Query: 579 LRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
L P ++ G + EW DF++ E HRH+SHL+GL+PG I I P+L +AA +L +R
Sbjct: 592 LLPYQMGRHGQLQEWFVDFEEAEPGHRHVSHLYGLYPGRQIHIRDTPELAEAARISLYRR 651
Query: 639 GEEG---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
+ G GWS W L+ARL D E A+R V+ L + Y NLF AHP
Sbjct: 652 LDHGGGYTGWSCAWLINLYARLEDGEAAHRYVRTLLSR-----------SAYPNLFDAHP 700
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG TA +AEML+QS ++ LLPALP WS G V GL+ RGG TVSI W
Sbjct: 701 PFQIDGNFGATAGIAEMLLQSRPGEITLLPALP-AAWSQGRVSGLRGRGGMTVSIEWSGS 759
Query: 756 DL 757
L
Sbjct: 760 RL 761
>gi|336429038|ref|ZP_08609009.1| hypothetical protein HMPREF0994_05015 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336003732|gb|EGN33810.1| hypothetical protein HMPREF0994_05015 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 779
Score = 548 bits (1411), Expect = e-153, Method: Compositional matrix adjust.
Identities = 308/763 (40%), Positives = 444/763 (58%), Gaps = 37/763 (4%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+++ + PA ++ +A+P+GNGRLGAMVW G E + LNED+LW+G P + A +
Sbjct: 1 MELWYKEPASYWEEALPLGNGRLGAMVWSGTDQEKISLNEDSLWSGYPQSHDISGAAEYY 60
Query: 73 SDVRSLVDSGQYAEATAA-SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
R L +Y EA A + G Y LG EL D +H + Y+R L+L
Sbjct: 61 LQARRLSMEKKYEEAQALLEQNVLGEYTQSYLPLG--ELTLDMAHPEGEIRNYKRALELE 118
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A +R++YS G+ +TRE F S PDQV+V IS G +S L + N
Sbjct: 119 KALSRLEYSAGDTNYTREMFISAPDQVMVMHISADRPGMVSLKAGFSCQLRAEVSIE-EN 177
Query: 192 QIIMEGRCPGKRIPPKANAND--------DPKGIQFSAILEIKISDDRGTISALEDKKLK 243
++I++G P + P ++ D + KG+QF A+LEI + + G + L + L+
Sbjct: 178 RMILDGIAPSQVDPSYIDSPDPVIYEDAPEKKGMQFCAVLEIDV--EGGEMKRLPEG-LE 234
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V +D L L A +SF+GPF +P K + LQ+ R + Y L RH+++YQ+
Sbjct: 235 VIHADSVTLFLAARTSFNGPFRHPFLEGKPYKEPCFAELQAAREMGYDRLLERHIEEYQQ 294
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L +++ P ER+ + D DP+ LLFQ+GRYLLIS
Sbjct: 295 YFNRVSMDLGPGREEL-------------PVPERLADWDKDVDPARFTLLFQYGRYLLIS 341
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPGTQ ANLQGIWN+ L W S VNIN EMNYW + NL E EPLFD + L
Sbjct: 342 SSRPGTQPANLQGIWNQHLRAPWSSNYTVNINTEMNYWGAETVNLPEMHEPLFDLIRNLR 401
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTHLWEHYN 479
I+G TA+++Y A G+V HH +DIW S+ +RGK V+A WP+ WL H+++HY
Sbjct: 402 ISGGNTARIHYNAGGFVSHHNSDIWCLSTPVGNRGKGTAVYAFWPLSAGWLSAHVYDHYL 461
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
++ D DFL + YP++ A F LD L E DG L PSTSPE++FI GK+ VS +
Sbjct: 462 FSGDLDFLRQTGYPVIHDAARFFLDVLTENEDGELIFAPSTSPENQFIY-HGKVCAVSQT 520
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
+TM MAI+REV + +L +++ L E ++L RL +I G ++EW ++ ++
Sbjct: 521 TTMTMAIVREVLENAAACCRLLGIDQEFLAE-AEEALGRLPSYRIGSRGELLEWNEELEE 579
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
E HRH SHL+ L+PG I++E+ P+L +A ++L+ RGEE GW++ W+ LWARLHD
Sbjct: 580 NEPTHRHTSHLYPLYPGRQISLEETPELAEACRRSLELRGEESTGWALAWRICLWARLHD 639
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFE--GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
E AY M+K+ VD + +++ GG Y N+F AHPPFQID+NFG A +AEML+QST
Sbjct: 640 GEKAYGMLKKQLRPVDGSNPMNYQQGGGCYPNMFGAHPPFQIDSNFGSCAGIAEMLMQST 699
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
+ LLPALP + +G V GL+ R G TV++ ++DG L +
Sbjct: 700 EETIDLLPALP-RAFGTGMVSGLRTRAGATVAVSFRDGRLEKA 741
>gi|146300857|ref|YP_001195448.1| hypothetical protein Fjoh_3112 [Flavobacterium johnsoniae UW101]
gi|146155275|gb|ABQ06129.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
[Flavobacterium johnsoniae UW101]
Length = 822
Score = 547 bits (1410), Expect = e-153, Method: Compositional matrix adjust.
Identities = 307/771 (39%), Positives = 450/771 (58%), Gaps = 39/771 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
LK+ +N PA +T+A+PIGNG LGAMV+G V SE ++LNE TLW+G P NP+A +
Sbjct: 26 LKLQYNQPAVEWTEALPIGNGTLGAMVFGRVDSELIQLNEATLWSGGPVQKNVNPNAFQN 85
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L+ +R + + + +A + + G ++ + LGD+ L D K + Y R LD+
Sbjct: 86 LALIREALKAEDFDKAYNLTKNMQGAYSESFMPLGDLLLTQDLGSKK--TDFYNRSLDIQ 143
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
T A + V + RE F+S P + IV K+S + LS ++ SLL N + N
Sbjct: 144 TGLAVTNFKADGVNYKREIFASAPAKCIVMKLSADQLKKLSVSIDASSLLKNQKEIQ-NQ 202
Query: 192 QIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKL 242
++++G+ P P + N +P +G++F I++ + D GT+S E K+
Sbjct: 203 SLVLKGKAPSHADPNYIDYNKEPVIYDDPAGCRGMRFELIVKPIVKD--GTVS-YEGNKI 259
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
++ + VL + A++SF+G P KD + + + ++ Y L HL D+Q
Sbjct: 260 VIKNASEIVLFISAATSFNGFDKCPDSQGKDEHAFAENPIKKASVKKYDILVKEHLQDFQ 319
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLL 361
K F+RVS+QL+ E + +P+ R++ + E D L L FQ+GRYLL
Sbjct: 320 KFFNRVSLQLNEK----------ETHKSNLPTDIRLEQYAKGEKDAGLEALFFQYGRYLL 369
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSSR ANLQGIWN L W S NINL+MNYW +LSE PL DF+
Sbjct: 370 ISSSRTHNAPANLQGIWNNKLRAPWSSNYTTNINLQMNYWPVESASLSELFFPLDDFVKN 429
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEH 477
+S+ G++TA+ Y A+GWV+HH +DIWA ++ +G +WA W MG WL HLWEH
Sbjct: 430 VSVTGAETAKSYYHANGWVLHHNSDIWATTNPVGDFGKGDPMWANWYMGANWLSRHLWEH 489
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y YT D ++L K+ YP+++G A F LDWL + +GYL T PSTSPE+++ K V+
Sbjct: 490 YQYTGDTEYL-KKVYPIIKGAAEFSLDWLQQDKNGYLVTMPSTSPENKYFYDGKKGGVVT 548
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
+STMD+ II+++F A+++L + D +KV K+ +L P +I G + EW +DF
Sbjct: 549 TASTMDIGIIKDLFENTSQASKILNIDAD-FRQKVDKAANQLLPFQIGAKGQLQEWYKDF 607
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+D + HHRH SHL+ L P + I+ P+L AA+KTL+ RG++G GWS+ WK +WARL
Sbjct: 608 EDEDPHHRHTSHLYALHPANLISPLNTPELAAAAKKTLELRGDDGTGWSLAWKVNMWARL 667
Query: 658 HDQEHAYRMVKRLFNLV---DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
D HAY++ K L DP++++ +GG Y NLF AHPPFQID NF TA V EML+
Sbjct: 668 LDGNHAYKLFKNQLRLTKDNDPKYKR--QGGCYPNLFDAHPPFQIDGNFAGTAGVIEMLM 725
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
QS N+++LLPALP D W G +KG+ A+G TV+I W DG + + I SN
Sbjct: 726 QSQNNEIHLLPALP-DDWKEGEIKGITAKGNFTVNIKWNDGKMSQTKIVSN 775
>gi|392965675|ref|ZP_10331094.1| alpha-L-fucosidase [Fibrisoma limi BUZ 3]
gi|387844739|emb|CCH53140.1| alpha-L-fucosidase [Fibrisoma limi BUZ 3]
Length = 846
Score = 547 bits (1409), Expect = e-152, Method: Compositional matrix adjust.
Identities = 308/775 (39%), Positives = 434/775 (56%), Gaps = 33/775 (4%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PL I + PA+++ +A+P+GNGRLGAMV+G V E ++LNE +LW+G P + NP A
Sbjct: 22 PLTIWYRQPARNWNEALPVGNGRLGAMVFGRVNDELIQLNEASLWSGGPVNLNPNPGAAT 81
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L VR + Y EA + G + YQ LGD+ + L Y R L++
Sbjct: 82 YLPQVREALFREDYKEADKLVRNMQGLYTEAYQPLGDLTIR---QILTGEPADYYRNLNI 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A+A ++ G V +TRE F S PDQVIV ++ + G L+ + S V
Sbjct: 139 TEASATTRFKSGGVGYTREIFVSAPDQVIVIRLRADQKGKLNVTLGTRSPHPISKVVVSR 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKK 241
+++ M G+ P P N N P +G +F L++K +D + A +
Sbjct: 199 DELAMRGKSPAHADPNYVNYNKVPVRYTDSSGCRGTRFDLRLKVKSTDGQ---VATDTAG 255
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+++ + AV+ L A++SF+G P K+ + S L S + H+ DY
Sbjct: 256 IRITNATEAVVYLSAATSFNGFDKCPDKDGKNEIQLAQSYLNKALAKSPDAIRKAHVADY 315
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
Q+ +RVS L+ D + N ++P ER+ + E DP+L L FQFGRYL
Sbjct: 316 QRYLNRVSFTLN--------DAQTPGNPASLPMDERLMRYAGGEPDPALETLYFQFGRYL 367
Query: 361 LISSSRPGTQVA-NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LISSSRPGT +A NLQGIWN + P W S NIN +MNYW + NLSE PL D +
Sbjct: 368 LISSSRPGTGIAANLQGIWNPMVRPPWSSNYTTNINAQMNYWPAEMTNLSEFHRPLIDQI 427
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLW 475
+ ++ G TA+ Y A GW +HH +DIWA S+ +G +WA W MGGAWL HLW
Sbjct: 428 KHAAVTGKATAKNFYGAGGWTVHHNSDIWAASNPVGDLGKGGPMWANWSMGGAWLAQHLW 487
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
EHY +T DR +L++ AYPL++ A F +DWL+E G+L T P+TSPE+ F+ G
Sbjct: 488 EHYAFTGDRTYLKQTAYPLMKDAAQFCVDWLVEDKQGHLVTAPATSPENVFVTEKGDKES 547
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
VS ++TMDM +I ++FS +I A+E L + D + + + +L P +I G++ EW +
Sbjct: 548 VSVATTMDMGLIWDLFSNVIEASEHLGIDVD-FRKMLTEKKSKLFPLQIGRKGNLQEWYK 606
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D++D + HRH+SHLF L PG I+ P +AA KTL+ RG+ G GWS +WK WA
Sbjct: 607 DWEDEDPQHRHVSHLFVLHPGREISPLTTPKYVEAARKTLEIRGDGGTGWSKSWKINFWA 666
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
RLHD HAY++++ L L E + GG Y NLF AHPPFQID NFG T+ + EML+
Sbjct: 667 RLHDGNHAYKLLRELLKLTGVEGTNYANGGGTYPNLFCAHPPFQIDGNFGGTSGIGEMLL 726
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
QS ++LLPA P D+W G VKGLKARGG + WKDG L + + S N
Sbjct: 727 QSHDGVVHLLPARP-DQWKDGSVKGLKARGGFELDYTWKDGKLTRLTVRSQQGGN 780
>gi|384427644|ref|YP_005637003.1| hypothetical protein XCR_1996 [Xanthomonas campestris pv. raphani
756C]
gi|341936746|gb|AEL06885.1| expressed protein [Xanthomonas campestris pv. raphani 756C]
Length = 764
Score = 546 bits (1408), Expect = e-152, Method: Compositional matrix adjust.
Identities = 317/793 (39%), Positives = 448/793 (56%), Gaps = 60/793 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + + L + + PA + A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D TN
Sbjct: 12 AVAPDDALHLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATN 71
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
P A AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 72 PQALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 128
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 129 EYRRQLDLDTAVATTTFRSGGAVQRREVFVSAQSQCIVVRLSCDRPGGISLRVGIDSPQS 188
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
V ++ GR + A D K ++F+ L + G+++A+ D+ L
Sbjct: 189 GEVTVE-QGSLLFSGRN-------GSFAGIDGK-LRFA--LRVLPQVKGGSVTAVRDR-L 236
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+++G+D VLLL A++S+ + DP + + ++LQ LSY+ L HL D+Q
Sbjct: 237 RIQGADEVVLLLTAATSYQ----RFDAVEGDPLALTAASLQKAGKLSYAALLRAHLADHQ 292
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+LF RV+I L S T+P+ ERV+ F DP+L L Q+GRYLLI
Sbjct: 293 RLFRRVAIDLGSS------------EAATLPTDERVQRFAEGNDPALAALYHQYGRYLLI 340
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L L
Sbjct: 341 CSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDL 400
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 401 ARTGAHTARAMYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGR 459
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C T
Sbjct: 460 DRAYLAK-IYPLFKGAAEFFVATLVRDPGTGAMVTNPSMSPENQH--PFGAAVCA--GPT 514
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KD 599
MD ++R++F+ I+ +++L+ + AL +++ +L P +I + G + EW QD+ +
Sbjct: 515 MDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQDWDMQA 573
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
PE+HHRH+SHL+ L P I + P+L AA ++L+ RG+ GW I W+ LWARL D
Sbjct: 574 PEIHHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGIGWRLNLWARLAD 633
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
EHAYR+++ L + PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 634 GEHAYRILQLLLS---PERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWGG 683
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHY 779
++LLPALP W G V+GL+ RGG +V + W G L + ++S D L Y
Sbjct: 684 SVFLLPALP-KAWPRGSVRGLRVRGGASVDLEWDAGRLQQARVHS-----DRGGRYQLSY 737
Query: 780 RGTSVKVNLSAGK 792
G ++ + L AG+
Sbjct: 738 AGQTLDLQLGAGR 750
>gi|423214472|ref|ZP_17201000.1| hypothetical protein HMPREF1074_02532 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692887|gb|EIY86123.1| hypothetical protein HMPREF1074_02532 [Bacteroides xylanisolvens
CL03T12C04]
Length = 792
Score = 546 bits (1407), Expect = e-152, Method: Compositional matrix adjust.
Identities = 305/770 (39%), Positives = 439/770 (57%), Gaps = 51/770 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ +N PA + +A+PIGNGR+GAMV+G E +LNE+++W+G P D+ NP A AL
Sbjct: 27 KLWYNAPATVWEEALPIGNGRIGAMVYGNPLQEVYQLNEESIWSGYPQDWNNPKAANALP 86
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VR VD G YA+A+ K + L L D A YR EL+++ A
Sbjct: 87 QVREAVDRGDYAKASEL-WKANAQGPYTARYLPMANLMLDQLTRGEARNLYR-ELNISNA 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+ V Y V++ R F S PDQV+V KI+ ++S ++ L+SLL G +
Sbjct: 145 LSTVTYEADGVKYRRTSFISYPDQVMVIKIAADRPQAVSLHIRLNSLLRYTVQTKGEKTL 204
Query: 194 IMEGRCPG----KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
I+ G+ P + P DD +G QF +++++ D G A D L V ++
Sbjct: 205 ILNGKAPAYVANRDYDPHQVVYDDKRGTQFK--VQVELLPDGGHCEA-NDSALTVRNANE 261
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
VLLL A + F + K+ Y +L RH DD+Q+LF+R+
Sbjct: 262 VVLLLSAVTDFGNKKMTLKKCKR----------------PYQELLQRHTDDHQQLFNRLQ 305
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
+ L T+ +E +P+ ER+KSF+ D D L EL +Q+GRYLLI+SSRPG
Sbjct: 306 LSLG-------TENLQKE---ALPTNERLKSFEQDPTDNGLTELYYQYGRYLLIASSRPG 355
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIWN + P W S NIN EMNYW + NL EC PL DF+ L++NG++
Sbjct: 356 GLPANLQGIWNRHVQPPWGSNYTTNINTEMNYWPAEITNLPECFLPLSDFIGRLAVNGAQ 415
Query: 429 TAQVNY-LASGWVIHHKTDIWAKS-------SADRGKVVWALWPMGGAWLCTHLWEHYNY 480
TA+VNY + GW+ HH +D+WA++ S +G W+ WPM G WLC HLWEHY +
Sbjct: 416 TAKVNYGINRGWLAHHNSDVWAQTAPTGGYDSDPKGAPRWSCWPMAGVWLCQHLWEHYAF 475
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEF--IAPDGKL--AC 535
D+ +L K AYPL++G A FLL WL + + GY TNPSTSPE+ F I +GK
Sbjct: 476 GGDKKYLSKTAYPLMKGAAEFLLQWLQKDPETGYWITNPSTSPENRFRYIDKEGKKQNGE 535
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+S SS MD+ + ++ + I A+ VL+ ++ A ++ + L+P +I G ++EW +
Sbjct: 536 ISRSSGMDLGLAWDLLTNCIEASTVLDTDK-AFRQQCMDVRANLQPFRIGSKGQLLEWDK 594
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
+F++ + +HRH+SHLF L PG I E+ P+L A ++TL+ RG+ G GW++ WK WA
Sbjct: 595 EFEETDPNHRHVSHLFALHPGRQIIPEQQPELAAACQRTLEIRGDGGTGWAMAWKINFWA 654
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D HA+ ++K VD GG Y+NLF AHPPFQID NFG TA + EML+Q
Sbjct: 655 RLRDGNHAFGILKNGLRYVDATQVSVRGGGTYANLFDAHPPFQIDGNFGGTAGITEMLLQ 714
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
S ++LLPALP D W SG +KG++ARGG T+ + WK+ + + + S+
Sbjct: 715 SHAGYIHLLPALP-DNWQSGSIKGVRARGGFTIDMEWKESRITRLSVTSH 763
>gi|294054095|ref|YP_003547753.1| alpha-L-fucosidase [Coraliomargarita akajimensis DSM 45221]
gi|293613428|gb|ADE53583.1| Alpha-L-fucosidase [Coraliomargarita akajimensis DSM 45221]
Length = 783
Score = 546 bits (1406), Expect = e-152, Method: Compositional matrix adjust.
Identities = 307/765 (40%), Positives = 441/765 (57%), Gaps = 54/765 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + ++ PA +TDA+P+GNG +GAMV+GG+ E ++ N+DTLW G P Y + DA L
Sbjct: 26 LTLRYDRPADAWTDALPVGNGSMGAMVFGGIEKERIQFNQDTLWAGEPRSYAHEDAVDVL 85
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R+L+ G+ AEAT A + P YQ GD+ ++F ++ + E Y R LD
Sbjct: 86 PEIRTLLFDGKQAEATKLAGERFMSEPLRQAAYQPFGDLWIQFP-AYGQAGE--YERSLD 142
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+ A A Y++G+VEFTR F+S PD VI +I S+ G ++F L + ++S V
Sbjct: 143 LDGALATTSYTIGDVEFTRTVFASYPDGVIAIRIEASKPGMVNFTAGLTTPHQSNSVVEP 202
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N+ + R K ++F A ++++ D G A ++V G+
Sbjct: 203 LNRNTLRLRGQVDAFTDKKETFTFEGAMRFEA--QLRVYTDGGMCQA-SGGVVEVGGATS 259
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L LVA++ F N +P S + L+++ + SY+D+ RH D++ LF R S
Sbjct: 260 ATLYLVAATDF----TNYKRLAGNPNSRCTTTLRALNSASYADVLQRHQADHRALFRRAS 315
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
I+L + + +T+P+ ER+ +Q DPSLV LLFQ+GRYLLI+SSRPG+
Sbjct: 316 IELGGT------------DANTMPTNERLNQYQAKPDPSLVALLFQYGRYLLIASSRPGS 363
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
+ ANLQG+WNE P W+S +NIN EMNYW + NLSEC EPLFD + LS+ G++
Sbjct: 364 EAANLQGLWNESQQPAWESKYTLNINAEMNYWPAELTNLSECHEPLFDLIEDLSVTGAEV 423
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+++Y A GWV HH TD+W + +A +WP GGAWLCTHLWEH+ YT DR FL+
Sbjct: 424 AELHYDARGWVAHHNTDLW-RGAAPINAANHGIWPTGGAWLCTHLWEHFLYTGDRQFLKS 482
Query: 490 RAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
RAYPL++G A F +D L+E +G+L + PS SPE + TMD I
Sbjct: 483 RAYPLMKGAAQFFVDTLVEDPVFDEGWLISGPSNSPER---------GGLVMGPTMDHQI 533
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
IR +F A AA+VL + DA L+ L ++ P+++ ++G + EW +DP+ HR
Sbjct: 534 IRSLFHATADAADVLGR--DAAFAAELRELAAKITPSQVGQEGQVKEWLYK-EDPKTSHR 590
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GL PG+ IT K P+L A+++TL RG+ G GW+ WK WARL D + +
Sbjct: 591 HVSHLWGLHPGNEIT-SKTPELFAASKRTLNLRGDGGSGWARAWKVNFWARLKDGDRMAK 649
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS------TLN 719
++ FN + G Y+NLF AHPPFQID NFG TA +AE LVQS +
Sbjct: 650 IIHGFFN----NSSEQGGAGFYNNLFDAHPPFQIDGNFGLTAGIAEALVQSHELTARGVR 705
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ +LPALP +W G V GL+ RGG +S W DG L V + S
Sbjct: 706 IVDILPALP-TEWGEGAVSGLRTRGGFELSFSWADGKLEAVELES 749
>gi|188991901|ref|YP_001903911.1| hypothetical protein xccb100_2506 [Xanthomonas campestris pv.
campestris str. B100]
gi|167733661|emb|CAP51866.1| conserved exported protein [Xanthomonas campestris pv. campestris]
Length = 790
Score = 545 bits (1405), Expect = e-152, Method: Compositional matrix adjust.
Identities = 316/795 (39%), Positives = 444/795 (55%), Gaps = 64/795 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + + L + + PA + A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D TN
Sbjct: 38 AVAPDDALHLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATN 97
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
P A AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 98 PQALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 154
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 155 EYRRQLDLDTAVATTTFRSGGAVHRREVFVSAQSQCIVVRLSCDRPGGISLRVGIDSPQS 214
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDK 240
V ++ GR N GI + L + G+++A+ D+
Sbjct: 215 GEVTVE-QGSLLFSGR------------NGSFAGIDGKLRFALRVLPQVKGGSVTAVRDR 261
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L+++G+D VLLL A++S+ + DP + + ++LQ LSY+ L HL D
Sbjct: 262 -LRIQGADEVVLLLTAATSYQ----RFDAVEGDPLALTAASLQKAGKLSYAALLRAHLAD 316
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
+Q+LF RV+I L S T+P+ ERV+ F DP+L L Q+GRYL
Sbjct: 317 HQRLFRRVAIDLGSS------------EAATLPTDERVQRFAEGNDPALAALYHQYGRYL 364
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 365 LICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLF 424
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 425 DLARTGAHTARALYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDY 483
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C
Sbjct: 484 GRDRAYLAK-IYPLFKGAAEFFVATLVRDPGTGAMVTNPSMSPENQH--PFGAAVCA--G 538
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF-- 597
TMD ++R++F+ I+ +++L+ + AL +++ +L P +I + G + EW QD+
Sbjct: 539 PTMDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQDWDM 597
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+ PE+HHRH+SHL+ L P I + P+L AA ++L+ RG+ GW I W+ LWARL
Sbjct: 598 QAPEIHHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGIGWRLNLWARL 657
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D EHAYR+++ L + PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 658 ADGEHAYRILQLLLS---PERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSW 707
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTL 777
++LLPALP W G V+GL+ RGG +V + W G L + ++S D L
Sbjct: 708 GGSVFLLPALP-KAWPRGSVRGLRVRGGASVDLEWDAGRLQQARVHS-----DRGGRYQL 761
Query: 778 HYRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 762 SYAGQTLDLQLGAGR 776
>gi|329926814|ref|ZP_08281220.1| hypothetical protein HMPREF9412_3407 [Paenibacillus sp. HGF5]
gi|328938931|gb|EGG35301.1| hypothetical protein HMPREF9412_3407 [Paenibacillus sp. HGF5]
Length = 764
Score = 545 bits (1403), Expect = e-152, Method: Compositional matrix adjust.
Identities = 298/745 (40%), Positives = 437/745 (58%), Gaps = 39/745 (5%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
MV+GGV E ++ NEDTLW+G P D N +A + L+ R L+ SG+YAEA ++ G
Sbjct: 1 MVFGGVQEECIQWNEDTLWSGFPRDTNNYEALRYLAKARELIASGKYAEAEQLIEGRMVG 60
Query: 97 HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE--FTREHFSSN 154
+ + LGD+ + S + + YRREL+L+T A ++ V + F+R+ F S
Sbjct: 61 RNTESFLPLGDLLIR--QSGIGDSCSEYRRELNLDTGIASTRFQVSGSDPIFSRDMFISA 118
Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG------KRIPPKA 208
DQV V + + S S+ + L S L + + + +++ G P + P +
Sbjct: 119 VDQVGVIRYESTGSSSVQLEIGLRSPLQHRTRTEEDGTLVLHGHAPTHIADNYRGDHPGS 178
Query: 209 NANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPS 268
+D GI++ + + D G ++ ++D +++ + LL+ A+++F+G P
Sbjct: 179 VLYEDGLGIRYE--MRLLALTDSGQVT-VDDSGMRISAAGSVTLLIAAATNFEGFDRFPG 235
Query: 269 DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
DP+ LQ + L +RH+ D+Q LF RV +QL R P++ E +
Sbjct: 236 SGGTDPSGICRERLQDAMRHGFEQLRSRHVQDHQALFRRVELQLGR-PEN-------ERS 287
Query: 329 IDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWD 387
I + + ER+++++ ED +L L+FQFGRYLLI+SSRPGTQ A+LQGIWN + P W+
Sbjct: 288 IAALATDERMEAYREGREDAALEALMFQFGRYLLIASSRPGTQPAHLQGIWNPHVQPPWN 347
Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
S NIN EMNYW + LSEC EPL + LS++G++TA+++Y A GWV HH D+
Sbjct: 348 SDYTTNINTEMNYWPAETTRLSECHEPLIQMIRELSVSGARTAKIHYGARGWVAHHNVDL 407
Query: 448 WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
W +S G+ +WA WPMGGAWLC HLWE Y + D ++L + AYPL+ G A F LDWLI
Sbjct: 408 WRMASPSDGRAMWAYWPMGGAWLCRHLWERYQFQPDIEYLRETAYPLMRGAALFCLDWLI 467
Query: 508 EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA 567
E +G+L T+PSTSPE++F+ +G VS STMDMAIIR++F I A+++LE++ D
Sbjct: 468 EDGEGHLVTSPSTSPENQFLTEEGLPCSVSAGSTMDMAIIRDLFHNCIEASQLLEQD-DE 526
Query: 568 LVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL 627
L E+ ++ RL P I +G +MEW++ + + E HRH+SHL+GL+PG IT++ P L
Sbjct: 527 LREEWKMAVERLLPYAIDNEGRLMEWSKPYPEAEPGHRHVSHLYGLYPGSDITLQDTPQL 586
Query: 628 CKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEG 684
+AA +TL R + G GWS W L+ARL E AY V+ L +
Sbjct: 587 AEAAYRTLMSRIDHGGGHTGWSCVWLINLFARLQQPEKAYDYVRTLISR----------- 635
Query: 685 GLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARG 744
++ NL HPPFQIDANFG +A + EML+QS L+ + LLPALP W+ G V+GLKARG
Sbjct: 636 SMHPNLLGDHPPFQIDANFGGSAGLVEMLLQSHLDAIQLLPALP-KAWAEGSVRGLKARG 694
Query: 745 GETVSICWKDGDLHEVGIYSNYSNN 769
G V + WKDG L I S + N
Sbjct: 695 GFIVDMEWKDGILASASITSTHGRN 719
>gi|21231206|ref|NP_637123.1| hypothetical protein XCC1756 [Xanthomonas campestris pv. campestris
str. ATCC 33913]
gi|66768787|ref|YP_243549.1| hypothetical protein XC_2479 [Xanthomonas campestris pv. campestris
str. 8004]
gi|21112850|gb|AAM41047.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. ATCC 33913]
gi|66574119|gb|AAY49529.1| conserved hypothetical protein [Xanthomonas campestris pv.
campestris str. 8004]
Length = 790
Score = 545 bits (1403), Expect = e-152, Method: Compositional matrix adjust.
Identities = 315/795 (39%), Positives = 444/795 (55%), Gaps = 64/795 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + + L + + PA + A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D TN
Sbjct: 38 AVAPDDALHLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATN 97
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
P A AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 98 PQALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 154
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 155 EYRRQLDLDTAVATTTFRSGGAVHRREVFVSAQSQCIVVRLSCDRPGGISLRVGIDSPQS 214
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDK 240
V ++ GR N GI + L + G+++A+ D+
Sbjct: 215 GEVTVE-QGSLLFSGR------------NGSFAGIDGKLRFALRVLPQVKGGSVTAVRDR 261
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L+++G+D VLLL A++S+ + DP + ++++LQ LSY+ L HL D
Sbjct: 262 -LRIQGADEVVLLLTAATSYQ----RFDAVEGDPLALTVASLQKAGKLSYAALLRAHLAD 316
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
+Q+LF RV+I L S +P+ ERV+ F DP+L L Q+GRYL
Sbjct: 317 HQRLFRRVAIDLGSS------------EAARLPTDERVQRFAEGNDPALAALYHQYGRYL 364
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 365 LICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLF 424
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 425 DLARTGAHTARALYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDY 483
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C
Sbjct: 484 GRDRAYLAK-IYPLFKGAAEFFVATLVRDPGTGAMVTNPSMSPENQH--PFGAAVCA--G 538
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF-- 597
TMD ++R++F+ I+ +++L+ + AL +++ +L P +I + G + EW QD+
Sbjct: 539 PTMDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQDWDM 597
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+ PE+HHRH+SHL+ L P I + P+L AA ++L+ RG+ GW I W+ LWARL
Sbjct: 598 QAPEIHHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGIGWRLNLWARL 657
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D EHAYR+++ L + PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 658 ADGEHAYRILQLLLS---PERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSW 707
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTL 777
++LLPALP W G V+GL+ RGG +V + W G L + ++S D L
Sbjct: 708 GGSVFLLPALP-KAWPRGSVRGLRVRGGASVDLEWDAGRLQQARVHS-----DRGGRYQL 761
Query: 778 HYRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 762 SYAGQTLDLQLGAGR 776
>gi|338214785|ref|YP_004658848.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
gi|336308614|gb|AEI51716.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
Length = 835
Score = 543 bits (1398), Expect = e-151, Method: Compositional matrix adjust.
Identities = 309/772 (40%), Positives = 440/772 (56%), Gaps = 37/772 (4%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALS 73
I + PA+++ +A+P+GNGRLG M +G V E L+LNE+TLW+G P + NPDA K L
Sbjct: 24 IHYKQPARNWNEALPVGNGRLGVMTFGRVNEELLQLNEETLWSGGPVEKNPNPDALKHLP 83
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VR ++ Y A+ K+ G + YQ LGD+ ++ + Y R+LDL A
Sbjct: 84 AVREALNREDYEMASKELQKIQGLYTEAYQPLGDVLIK---QPFEAQPTAYFRDLDLQNA 140
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
TA ++++ V ++RE F S PDQVIV +++ S+ G L+F+ S S + G N++
Sbjct: 141 TAHTQFTIEGVTYSRELFVSAPDQVIVLRLTASQKGKLNFSASTRSPHPFLKQITGKNEL 200
Query: 194 IMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLKV 244
M G+ P P N N P KG++F ++++ +D G ++A + + +
Sbjct: 201 SMRGKAPAHADPNYVNYNAKPVYYEDPSGCKGMRFDWRVKVQTTD--GKVTA-DTSGISI 257
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+ A+LL+ A++SF+G P +D + + L+ S + H+ DY+K
Sbjct: 258 SNATEAILLVTAATSFNGFDKCPDSQGRDEKALVEAYLKRASAKSMDLIRKAHIADYRKY 317
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLIS 363
F RV + L +S + +P R+ + Q DP L L F FGRYLLIS
Sbjct: 318 FDRVKLTLGQSGEAA-----------HLPMDARLARYAQLGNDPELEALYFDFGRYLLIS 366
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG ANLQGIWN P W S NIN EMNYW + NLSE D++ +
Sbjct: 367 SSRPGGIPANLQGIWNPMTRPPWSSNYTTNINAEMNYWPAEVANLSELHTTFTDWIAGAA 426
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSS--ADRGK--VVWALWPMGGAWLCTHLWEHYN 479
G +TA+ Y GW +HH +DIW S+ D+GK WA W MGGAWL HLWEHY
Sbjct: 427 ATGRETAKNFYGMKGWTVHHNSDIWGASNPVGDKGKGSPSWANWAMGGAWLSQHLWEHYV 486
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
Y+ D +L+ AYPL+ A F LDWL++ G T+PSTSPE+ FI G VS +
Sbjct: 487 YSGDEKYLKNYAYPLMRDAAQFCLDWLVKDAGGNWITSPSTSPENVFITEKGITQAVSVA 546
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWAQDFK 598
+TMDMA++ +VF+ +I A+E L+ DA + K L+ + L P +I + G++ EW +D++
Sbjct: 547 TTMDMALVYDVFTNVIHASEHLKV--DAELRKTLEDRVQHLFPLQIGKKGNLQEWYKDWE 604
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
D + HRH+SHLF + PG I+ + P AA KTL+ RG+ G GWS +WK WARLH
Sbjct: 605 DQDPQHRHVSHLFAVHPGRYISPLRTPKYTDAARKTLEIRGDGGTGWSKSWKINFWARLH 664
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D HA+++++ L L E + + GG Y NLF AHPPFQID NFG T+ +AEML+QS
Sbjct: 665 DGNHAHKLLQELLKLTGVEGTDYAKGGGTYLNLFCAHPPFQIDGNFGGTSGIAEMLIQSQ 724
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+ LLPALP D W++G +KGLKARGG + + WKDG + V I S N
Sbjct: 725 DGLVNLLPALP-DAWATGNIKGLKARGGFEIDMTWKDGKITRVIIKSLLGGN 775
>gi|354582995|ref|ZP_09001895.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
gi|353198412|gb|EHB63882.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
Length = 758
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 307/759 (40%), Positives = 426/759 (56%), Gaps = 65/759 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+PIGNGRLGAM++GG E L+LNED++W G P D N DA L
Sbjct: 12 RLWYRKPAAEWNEALPIGNGRLGAMIFGGTAEEKLQLNEDSVWYGGPRDRNNEDALPHLP 71
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ G+ EA A++ + G P Y LGD+ L F SH Y RELDL
Sbjct: 72 EIRKLIMEGRLREAEELAAMTMAGLPEAQRHYMPLGDLLLSF--SHHDLPAVDYVRELDL 129
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+RV Y +G + +TRE F+S PDQ IV +IS + G++S + N Y+
Sbjct: 130 ENGISRVSYRIGEIRYTRELFASYPDQAIVIRISADKQGTVSLKARFNR--RNWRYLEKT 187
Query: 191 NQ-----IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ + M G C G+ G FSA+L K D G L + L V+
Sbjct: 188 DKWKESGLAMRGDCGGE------------GGSSFSAVL--KAVPDGGVCRTL-GEYLLVD 232
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ LL+ A ++F P DP + L+ + + Y++L RH+ DY++L+
Sbjct: 233 GASSVTLLITAGTTFRHP---------DPELDGKRRLEMLSRVPYAELLARHVADYRELY 283
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISS 364
RV ++L SP V +P+ ER+ FQ ED L+ FQFGRYLLI+S
Sbjct: 284 GRVDLKLPESPDKTV-----------LPTDERLMQFQQGGEDHGLIATYFQFGRYLLIAS 332
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+ ANLQGIWN++ +P WDS +NIN +MNYW + CNL+EC EPLF+ + +
Sbjct: 333 SRPGSLPANLQGIWNDNFTPPWDSKFTININAQMNYWHAENCNLAECHEPLFELIERMRE 392
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA V Y G+ HH TDIWA ++ + + WPMG AWLC HLWEHY + DR
Sbjct: 393 PGRVTAHVMYGCRGFTAHHNTDIWADTAPQDTYLPASFWPMGAAWLCLHLWEHYRFGQDR 452
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL R Y ++ A FLLD+LIE +G L T PS SPE+ + P+G+ + + MD
Sbjct: 453 YFL-ARVYETMKEAALFLLDYLIEDAEGRLVTCPSVSPENRYKLPNGETGVLCVGAAMDF 511
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
II +F A I A+E++ ++E A +++ +L RL +I + G I EW +D+++ E H
Sbjct: 512 QIIEALFDACIRASEIIGRDE-AFRDELTGTLKRLPQPQIGKYGQIQEWMEDYEEVEPGH 570
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQE 661
RH+SHLF L+PG ++E+ PDL +AA+ TL++R G GWS W WARL D
Sbjct: 571 RHISHLFALYPGERFSVERTPDLAEAAKTTLERRLASGGGHTGWSRAWIINFWARLQDGA 630
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
AY V+ L + H NLF HPPFQID NFG TA +AEML+QS +
Sbjct: 631 TAYENVRALLD-----HST------LPNLFDDHPPFQIDGNFGGTAGIAEMLLQSHDGAI 679
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
LLPA+P D WS G VKGL+ARGG TV W +G + E
Sbjct: 680 RLLPAVP-DCWSEGSVKGLRARGGYTVDFVWAEGKVTEA 717
>gi|261416181|ref|YP_003249864.1| alpha-L-fucosidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|385791048|ref|YP_005822171.1| carbohydrate binding protein, CMB family 6 [Fibrobacter
succinogenes subsp. succinogenes S85]
gi|261372637|gb|ACX75382.1| Alpha-L-fucosidase [Fibrobacter succinogenes subsp. succinogenes
S85]
gi|302326443|gb|ADL25644.1| carbohydrate binding protein, CMB family 6 [Fibrobacter
succinogenes subsp. succinogenes S85]
Length = 999
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 324/805 (40%), Positives = 455/805 (56%), Gaps = 70/805 (8%)
Query: 8 STTNPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+T NPL + +N A FT+A+PIGNG +G +++GGV + + LNE T+W+G PGD
Sbjct: 30 TTDNPLTLWYNSDAGTEFTNALPIGNGYMGGLIYGGVEKDYIGLNESTVWSGGPGDNNKQ 89
Query: 67 DAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
A L D R + G Y A + S + G +Q +GD L SH YR
Sbjct: 90 GAASHLKDARDALWRGDYRTAESIVSQYMIGPGPASFQPVGD--LVISTSH--KGSSNYR 145
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
RELDL TA A+ Y+VG V+ TRE+F+S PD VIV +S + GS+SF ++ + N+
Sbjct: 146 RELDLKTAIAKTTYTVGGVKHTREYFASYPDHVIVVHLSADKDGSVSFGATMTTPHRNNR 205
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ N +I + I+F + + D GT+S + + + V+
Sbjct: 206 MTSSGNTLIYDVTV---------------NSIKFQN--RLTVVADGGTVS-VSNGNINVQ 247
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G++ A L+L +++F + +D DP + + + + SY DL HL DYQ +F
Sbjct: 248 GANSATLILTTATNFK----SYNDVSGDPGAIASEIMSKVAKKSYEDLLAAHLKDYQTIF 303
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RV + L + K S +I ++ RVK+F + DPSLVEL +Q+GRYLLI+SS
Sbjct: 304 NRVKLDLGTADK-------SAGDI----TSTRVKNFNSTNDPSLVELHYQYGRYLLIASS 352
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R G Q ANLQGIWN+D +P W S NINLEMNYW + NL EC PL D + +
Sbjct: 353 RKGGQPANLQGIWNKDTNPIWGSKYTTNINLEMNYWPAESGNLEECVWPLIDKIKSMVPQ 412
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT-MD 483
G KTA+V++ + GWV HH TD+W +S+ G W LWP G WL THLWEH+ Y D
Sbjct: 413 GEKTAKVHWGVDEGWVEHHNTDLWNRSAPIDG--AWGLWPTGAGWLTTHLWEHFLYNPTD 470
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
+ +L+ Y ++G A F ++ L+E + YL T PS SPE++ G C +
Sbjct: 471 KAYLQD-VYSTMKGAALFFVNSLVEEPTTGNKYLVTAPSDSPENDH---GGYNVC--FGP 524
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD IIR+V + I A+++L +ED + K+ ++ RL PTK + G I EW QD+ DP
Sbjct: 525 TMDNQIIRDVLNYTIEASKILGVDED-VRAKMEATVKRLPPTKTGKYGQITEWLQDWDDP 583
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
+RH+SHL+GLFP IT E+ PDL K A TLQ+RG++ GWS+ WK WAR+HD
Sbjct: 584 NNKNRHISHLYGLFPSAQITPEETPDLIKGAGVTLQQRGDDATGWSLAWKINFWARMHDG 643
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
+HAYRM++ L P Y+NLF AHPPFQID NFG + V EML+QS N
Sbjct: 644 DHAYRMIRMLLT---PSKT-------YNNLFDAHPPFQIDGNFGAVSGVNEMLMQSHNNR 693
Query: 721 LYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHY 779
+ LLPALP +W++G VKG++ARGG E S+ WK G L V I S + + T +
Sbjct: 694 INLLPALP-SQWANGSVKGIRARGGFEIDSMAWKGGKLTYVAIKSLVGSTLNVVSGTNKF 752
Query: 780 RGTSVKVNLSAGKIYTFNRQLKCTN 804
++V GK+Y F+ LK TN
Sbjct: 753 STSTV-----PGKVYEFDGNLKVTN 772
>gi|374320465|ref|YP_005073594.1| putative alpha-l-fucosidase; glycoside hydrolase family 95
[Paenibacillus terrae HPL-003]
gi|357199474|gb|AET57371.1| putative alpha-l-fucosidase; glycoside hydrolase family 95
[Paenibacillus terrae HPL-003]
Length = 829
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 304/778 (39%), Positives = 431/778 (55%), Gaps = 56/778 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PAK + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA + L
Sbjct: 10 LRLWYRQPAKVWEEALPVGNGRLGAMVFGGIGEERLQLNEDTLWSGFPRDGVQYDALRYL 69
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
VR L+ G+Y +A + + G + YQ LGD+ + + AE Y RELDL
Sbjct: 70 KPVRELIADGKYKDAEHLINANMLGRDTEAYQPLGDLWIT-QEGLGSIAE--YERELDLV 126
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-------------- 177
T TA V + G + +TRE +S PD +I+ +++ G ++ V +
Sbjct: 127 TGTAAVTFQGGGIRYTREVIASAPDGIIMVRLTADTPGKINATVRITTPHSCEAEAGEDA 186
Query: 178 ----DSLLDNHSYVNGNNQ-----IIMEGRCPGK------RIPPKANANDDPKGIQFSAI 222
S DN + + + I + GR P P++ +D G+ F+
Sbjct: 187 HFGDSSEWDNDKEDDSSGEPERDLITLTGRAPSHVESDYHGYHPQSVVYEDELGMAFA-- 244
Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL 282
++ +I + GT++ D ++V G+D + L A++ F G P + T L
Sbjct: 245 IQARIIAEGGTLTRGADGVIRVAGADKLTVYLAAATGFRGFDTQPDIDATESTGVCEVTL 304
Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
+L Y + RH D+ +LF RV ++L + TD ++ I T E+ + Q
Sbjct: 305 ARAVSLGYEQVRHRHEQDHWELFGRVELELGDEGR---TDPSTKRQIPTDLRLEQYREGQ 361
Query: 343 TDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
D D L LFQ+GRYLLI+SSR G+Q ANLQGIWN+ + P W+S NIN +MNYW
Sbjct: 362 ADLD--LEVTLFQYGRYLLIASSRSGSQPANLQGIWNDHVQPPWNSDYTTNINTQMNYWP 419
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+ CNL+EC EPL + +S G + A + Y A GW HH D+W + G WA
Sbjct: 420 AEICNLAECHEPLLHMVGEVSRTGRRVASIYYGAQGWTAHHNVDVWRYAGPSGGHASWAF 479
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WP+GG WL HLWE Y T D +L ++AYPL++G A+F +DWL+EG DG+L T+PSTSP
Sbjct: 480 WPLGGVWLTAHLWERYLLTQDTAYLAEQAYPLMKGAAAFCMDWLVEGPDGWLVTSPSTSP 539
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
E++FI PDG+ +S STMDM +IRE+ S I A E+LE + D + ++L RL P
Sbjct: 540 ENKFITPDGEHCSISMGSTMDMTLIRELLSNCIQATELLELD-DEFRNRCEETLQRLLPY 598
Query: 583 KIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG 642
+I G + EW DF++ E HRH+SHL+GL+PG I + P+L +AA +L++R + G
Sbjct: 599 QIGRHGQLQEWFADFEEAEPGHRHVSHLYGLYPGRQIHVRDTPELAEAARISLRRRLDHG 658
Query: 643 ---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
GWS W L+ARL D E A+R V+ L + Y NLF AHPPFQI
Sbjct: 659 GGHTGWSCAWLINLYARLEDGEAAHRYVRTLLSR-----------STYPNLFDAHPPFQI 707
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
D NFG T+ +AEML+QS +L LLPALP W G V GL+ GG TV + W L
Sbjct: 708 DGNFGATSGIAEMLLQSRPGELTLLPALP-SAWPEGRVSGLRGHGGMTVGMEWSGSRL 764
>gi|395804709|ref|ZP_10483944.1| hypothetical protein FF52_22604 [Flavobacterium sp. F52]
gi|395433097|gb|EJF99055.1| hypothetical protein FF52_22604 [Flavobacterium sp. F52]
Length = 823
Score = 541 bits (1393), Expect = e-151, Method: Compositional matrix adjust.
Identities = 308/778 (39%), Positives = 447/778 (57%), Gaps = 39/778 (5%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYT 64
S S LK+ + PA +T+A+P+GNG LGAMV+G V +E ++LNE TLW+G P
Sbjct: 20 SASAQKDLKLQYKQPAVEWTEALPVGNGTLGAMVFGRVEAEFIQLNEATLWSGGPVHKNV 79
Query: 65 NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY 124
NPDA K L+ +R + + + +A + + G ++ + LGD+ L+ D K A +Y
Sbjct: 80 NPDAFKNLALIREALKNEDFEKANVLTKNMQGPYSESFMPLGDLILKQDFGGQKAA--SY 137
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R LD+ T A ++ G V + RE F+S P Q IV K+S + LS + SLL N
Sbjct: 138 DRSLDIQTGLAVTSFNAGGVNYKREIFASAPAQCIVIKLSADQLKKLSVTIDAASLLKNQ 197
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTIS 235
V N ++++G+ P P + N +P +G++F I++ + D G IS
Sbjct: 198 KAVQ-NQTLVLKGKAPSHADPNYIDYNKEPVIYEDVTGCRGMRFELIIKPVVKD--GQIS 254
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+ E KL ++ + +L + A++SF+G P KD + + ++ + Y L
Sbjct: 255 S-EGDKLVIKNASEILLFVSAATSFNGFDKCPDSQGKDEHKFAEAPIKKVAGKKYDSLLK 313
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
H+ D+QK F+RVS+ L+ E + +P+ R++ + E D L L F
Sbjct: 314 EHIADFQKFFNRVSLMLNEK----------ETSKSDLPTDIRLEQYAKGEKDAGLEALFF 363
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISSSR ANLQGIWN L W S NINL+MNYW +LSE
Sbjct: 364 QFGRYLLISSSRTHNAPANLQGIWNNKLRAPWSSNYTTNINLQMNYWPVESGSLSELFFS 423
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWL 470
L +F+ S G++TA+ Y A+GWV+HH +DIWA ++ +G +WA W MG WL
Sbjct: 424 LDEFIKNASATGAETAKSYYHANGWVLHHNSDIWAMTNPVGDFGKGDPMWANWYMGANWL 483
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
HLWEHY YT D+++L K+ YP+++G A F LDWL + +G+L T PSTSPE+ F
Sbjct: 484 SRHLWEHYQYTGDKNYL-KKVYPIIKGAAEFSLDWLQKDKNGHLVTMPSTSPENIFYYDG 542
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
K V+ +STMD+AII+++F I A++VL + + +KV + L P +I G +
Sbjct: 543 KKQGTVTTASTMDIAIIKDLFENTIEASKVLYADLE-FRQKVNSAREELLPFQIGSKGQL 601
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW +DF++ + HHRH SHL+ L P + I+ + P+L AA+KTL+ RG++G GWS+ WK
Sbjct: 602 QEWYKDFEEEDPHHRHTSHLYALHPANLISPLQTPELAAAAKKTLELRGDDGTGWSLAWK 661
Query: 651 TALWARLHDQEHAYRMVKRLFNLV---DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
+WARL D HAY++ K L DP + +H GG Y NLF AHPPFQID NF TA
Sbjct: 662 VNMWARLLDGNHAYQLFKNQLRLTKDNDPNYSRH--GGCYPNLFDAHPPFQIDGNFAGTA 719
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
V EML+QS +++LLPALP D W G +KG+ A+G TV I W +G + + I SN
Sbjct: 720 GVIEMLMQSQNKEIHLLPALP-DSWKDGEIKGITAKGNFTVDIKWNEGKMSQTTIVSN 776
>gi|251795949|ref|YP_003010680.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
gi|247543575|gb|ACT00594.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
Length = 787
Score = 540 bits (1391), Expect = e-150, Method: Compositional matrix adjust.
Identities = 311/771 (40%), Positives = 435/771 (56%), Gaps = 53/771 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA + +A+PIGNGR+G MV+ G + + LNEDTLW G P D N +A + L+
Sbjct: 8 KLWYEQPASVWEEALPIGNGRIGGMVFAGTEIDQILLNEDTLWAGFPRDPINYEAQRYLA 67
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L+ SG+YAEA + G + Y LG + + + + A Y+REL LN
Sbjct: 68 KARQLIFSGKYAEAERLIESTMQGRDVEPYLPLGGLSIVRREDR-ESAVSQYKRELHLNE 126
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A Y G+V ++F S PDQ +V + + G+L+ ++ +DSLL G Q
Sbjct: 127 GIAAACYQDGDVTVQSQYFVSVPDQALVVRYEAA-GGTLNRDIVMDSLLQYRLEEAGERQ 185
Query: 193 IIMEGRCPGK------RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ + G+ P + P ++ G+ F + +K+ D GT+ E K L+V
Sbjct: 186 LHLIGQAPSHVAGNYHKDHPMDVLYEEGLGLPFE--IRVKVETD-GTVKNGE-KGLEVRN 241
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR-----NLSYSDLYTRHLDDY 301
+ + + L A + F G + P E+ SA SIR L + L +RH +D+
Sbjct: 242 AAYLHIYLTAETGFAG-------YDQSPDQEACSARCSIRLEKAAALGFEGLLSRHTEDH 294
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYL 360
++LF RVS L+ E + P+ R+ +QT +D L L F FGRYL
Sbjct: 295 RQLFDRVSFSLA-----------DETDGSDKPTDRRLADYQTTKQDSHLEALYFHFGRYL 343
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
L+ SSRPGTQ ANLQGIWN +SP W S +NIN +MNYW + CNLSEC EPLF L
Sbjct: 344 LMGSSRPGTQPANLQGIWNHHVSPPWHSDYTININTQMNYWPAEVCNLSECHEPLFTMLR 403
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+S GS+TA+++Y + GW HH DIW ++ G WA WP+GGAWL +WE Y Y
Sbjct: 404 EMSEAGSRTARIHYGSRGWTAHHNVDIWRMTTPTGGSASWAFWPLGGAWLVRQVWESYLY 463
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
MD+DFL ++AYPLL+G A F LDWL+EG +G L TNPSTSPE++F+ +G+ VSY S
Sbjct: 464 NMDKDFLGEKAYPLLKGAALFCLDWLVEGPNGDLVTNPSTSPENKFLTSEGEPCSVSYGS 523
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD+AIIR++F + A + L E +++L SL RL KI G + EW +DF++
Sbjct: 524 TMDIAIIRDLFQNCLEAIDALGVEEAEFRDELLASLDRLPAYKIGRHGQLQEWYEDFEES 583
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARL 657
E HRH+SHL+G++PG I EK P+L +A TL +R G GWS W L+ARL
Sbjct: 584 EPGHRHVSHLYGVYPGKEIN-EKKPELLEAVVATLDRRLANGGGHTGWSCAWLLNLFARL 642
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D++ AY V+ L Y NL AHPPFQID NFG +A +AE+L+QS
Sbjct: 643 KDEKQAYGAVQTLLAR-----------STYPNLLDAHPPFQIDGNFGGSAGIAELLLQSH 691
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
L+ + LLPALP W++G + GLKARGG V + W +G L + I + S
Sbjct: 692 LDTIDLLPALP-ASWTNGQISGLKARGGYVVDVEWANGTLKQAAIEARISG 741
>gi|392304738|emb|CCI71101.1| Alpha-L-fucosidase 2 [Paenibacillus polymyxa M1]
Length = 867
Score = 540 bits (1391), Expect = e-150, Method: Compositional matrix adjust.
Identities = 297/778 (38%), Positives = 441/778 (56%), Gaps = 61/778 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA+ + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA + L
Sbjct: 53 LRLWYRQPAEVWEEALPVGNGRLGAMVFGGIREEHLQLNEDTLWSGFPRDGVQYDALRYL 112
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDD-SHLKYAEETYRRELDL 130
R L+ G+Y EA + + G + YQ LGD+ + ++ + + Y RELD+
Sbjct: 113 EPARKLIADGKYKEAEQLITSNMLGRDTEAYQPLGDLWITQENLGEIAH----YERELDM 168
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS----------- 179
T TA V + V +TR+ +S PD VI+ ++ ++ G + +V + +
Sbjct: 169 QTGTAAVTFQSDGVRYTRKVIASAPDGVIMVSLTANKVGKIHASVRMTTPHSCDDEAGED 228
Query: 180 --LLDNHSYVNGNNQ--------IIMEGRCPGKRIP------PKANANDDPKGIQFSAIL 223
D+ + + N+ I + GR P P++ ++ G+ F+ +
Sbjct: 229 VHFSDSSQWASDNDPSEEPTRDFITLTGRAPSHVESNYHGDHPQSVVYENDLGMAFA--V 286
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ ++ + GT++ +D L + +D + L A++ F G P+ + L
Sbjct: 287 QARVIPEGGTLTTRDDGALIISDADKITVYLAAATGFRGFQAMPNSDATESAEACKVILD 346
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+L + RH D++KLF RV+++L +DT ++E++ +P+ R++ +Q
Sbjct: 347 GAISLGSEQVRQRHEQDHRKLFDRVALELG-------SDTLTDESV--LPTDLRLERYQK 397
Query: 344 DE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
+ D L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +MNYW
Sbjct: 398 GQADRGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNYWP 457
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+ CNL+EC EPL + +S G + A ++Y A GW HH D+W + G WA
Sbjct: 458 AEVCNLAECHEPLLHMIGEVSRTGRRVASIHYGAQGWTAHHNIDVWRYAGPSAGHASWAF 517
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WP+GG WL HLWE Y +T+D +L ++AYPL++G A+F LDWL EG DG L T+PSTSP
Sbjct: 518 WPLGGVWLTAHLWERYLFTLDTTYLAEQAYPLMKGAAAFCLDWLAEGPDGRLATSPSTSP 577
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
E++FI P G+ +S STMDM +IRE+ S I AA++LE + D ++ ++ RL P
Sbjct: 578 ENKFITPGGEDCSISMGSTMDMTLIRELLSNCIQAADLLELD-DEFRKRCEETRERLVPY 636
Query: 583 KIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG 642
+I G + EW DF++ E HRH+SHL+G++PG I I P+L +AA +L++R + G
Sbjct: 637 QIGRHGQLQEWLVDFEEAEPGHRHVSHLYGVYPGRQIHIRDTPELAEAARISLRRRLDHG 696
Query: 643 ---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
GWS W L+ARL D + A+R V+ L + Y NLF AHPPFQI
Sbjct: 697 GGHTGWSCAWLINLYARLEDGDTAHRYVRTLLSR-----------STYPNLFDAHPPFQI 745
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
D NFG TA +AEML+QS L +L LLPALP W G V GLK GG TVS+ W L
Sbjct: 746 DGNFGATAGIAEMLLQSRLGELTLLPALP-SAWPEGRVSGLKGCGGITVSMEWSGSRL 802
>gi|325923835|ref|ZP_08185445.1| hypothetical protein XGA_4498 [Xanthomonas gardneri ATCC 19865]
gi|325545691|gb|EGD16935.1| hypothetical protein XGA_4498 [Xanthomonas gardneri ATCC 19865]
Length = 795
Score = 540 bits (1390), Expect = e-150, Method: Compositional matrix adjust.
Identities = 315/795 (39%), Positives = 443/795 (55%), Gaps = 64/795 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + + L++ + PA + A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+
Sbjct: 43 AAAAGDALQLWYREPANEWVQALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDATS 102
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
PDA AL VR+L+ +G+YAEA A A K+ P YQ LGD+ L+FD +
Sbjct: 103 PDALAALPQVRALIFAGRYAEAEALADAKMLSRPLKQMPYQPLGDLLLDFDRAD---GIS 159
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+T + G RE F S Q IV ++S ++S V +DS
Sbjct: 160 EYRRQLDLDTGVVTTTFRSGGAVHKREVFVSAQSQCIVVRLSCDRPRAISLRVGIDSPQT 219
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDK 240
V ++ GR N GI + L + GT+S L D+
Sbjct: 220 GEVTVE-QGGLLFSGR------------NGSFAGIDGKLRFALRVLPQIKGGTVSDLRDR 266
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L++EG+D VLLL A++S+ + D DP + + ++L+ L Y+ L HL D
Sbjct: 267 -LRIEGADEVVLLLTAATSYQ--RFDAVDG--DPLALTAASLKKAGKLDYTALLRAHLAD 321
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
+Q+LF RV+I L S +P+ ERV++F DP+L L QFGRYL
Sbjct: 322 HQRLFRRVAIDLGTS------------EAAKLPTDERVQAFAKGNDPALAALYHQFGRYL 369
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI SSRPG+Q ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 370 LICSSRPGSQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLESMLF 429
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 430 DLAKTGAHTARAMYDAPGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDY 488
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
DR +L K YPL +G A F + L++ G + TNPS SPE++ P C
Sbjct: 489 GRDRAYLGK-IYPLFKGAAEFFVATLVKDPQTGAMVTNPSISPENQH--PFNAALCA--G 543
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF-- 597
TMD ++R++F+ I+ +++L K +DA + + +L P +I + G + EW QD+
Sbjct: 544 PTMDAQLLRDLFAQCIAMSKLL-KVDDAFAQHLSTLREQLPPNRIGKAGQLQEWQQDWDM 602
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+ PE+HHRH+SHL+ L P I + P+L AA++TL+ RG+ GW I W+ LWARL
Sbjct: 603 QAPEIHHRHVSHLYALHPSSQINLRDTPELAAAAKRTLETRGDNTTGWGIGWRLNLWARL 662
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D EHAYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 663 TDGEHAYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSW 712
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTL 777
++LLPALP W G V+GL+ RGG +V + W G L + ++S D L
Sbjct: 713 GGSVFLLPALP-SAWPRGSVRGLRIRGGASVDLEWDGGRLQQARVHS-----DRGGRYQL 766
Query: 778 HYRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 767 SYAGQTLDLELGAGR 781
>gi|146298534|ref|YP_001193125.1| hypothetical protein Fjoh_0772 [Flavobacterium johnsoniae UW101]
gi|146152952|gb|ABQ03806.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
[Flavobacterium johnsoniae UW101]
Length = 802
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 305/765 (39%), Positives = 451/765 (58%), Gaps = 40/765 (5%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALSDV 75
+ PA+ F +++ +GNG++G+ V+GGV S+ + LN+ TLW+G P + NP+A K + +
Sbjct: 32 YKQPAEFFEESLVLGNGKMGSTVFGGVNSDKIYLNDITLWSGEPVNANMNPEAYKNIPAI 91
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
R + + Y A + K+ G ++ Y LG +E+ ++ K YRRELD++ A +
Sbjct: 92 RETLQNENYKLAEELNKKVQGKNSESYAPLGTLEI---NNSEKGKAVNYRRELDISNAVS 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+V Y + +++TRE+F S DQ+++ K++ + G+L+F+++L SLL ++ V NN ++M
Sbjct: 149 KVSYEMAGIKYTREYFVSAQDQIMIIKLTADQKGALNFDINLKSLLKSNVEVR-NNILVM 207
Query: 196 EGRCP-----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
G P G + PK A D +G +F+ +++IK +D + T S + L ++ + A
Sbjct: 208 TGSAPIHENAGYNVLPKYLALKD-RGTRFTGLVQIKKTDGKITSSR---ETLTLKDATEA 263
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
++ + ++SF+G NP+ D + + L + + H+ DYQK ++RV +
Sbjct: 264 IIYVSVATSFNGFDKNPASEGLDDIAIAAQNLNKAFEKPFDKIKESHIADYQKFYNRVDL 323
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGT 369
L ++ +P+ ER+ + +ED +L L F +GRYLLISSSR
Sbjct: 324 NLGKT------------TAPDLPTDERLLRYADGNEDKNLEILYFNYGRYLLISSSRTLG 371
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQG+WN LSP W S +NINLE NYW + NLSE + L F+ LS+ G T
Sbjct: 372 VPANLQGLWNLHLSPPWSSNYTMNINLEENYWLAENTNLSEMHKSLLSFIKNLSVTGKVT 431
Query: 430 AQVNY-LASGWVIHHKTDIWAKSS--ADRGKV--VWALWPMGGAWLCTHLWEHYNYTMDR 484
A+ Y + GW H +DIWA ++ GK +WA WPM GAWL TH+WEHY +T D
Sbjct: 432 AKTFYGVDKGWAAAHNSDIWAMTNPVGQFGKEDPMWACWPMAGAWLSTHIWEHYIFTQDE 491
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+L+K YPL++G A F L WL+ G L T+PSTSPE+++ DG + Y T D+
Sbjct: 492 TYLKKEGYPLMKGAAEFCLGWLVTDKKGNLITSPSTSPENQYKLEDGFVGATFYGGTADL 551
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
A+IRE F I A++VL N DA L++ L +L P +I + G++ EW D+ D +
Sbjct: 552 AMIRECFDKTIKASKVL--NTDASFRVKLETVLSKLHPYQIGKKGNLQEWYFDWDDQDPK 609
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH S LFGLFPG IT K PDL +A++KTL+ +G+E GWS W+ LWARL D A
Sbjct: 610 HRHQSQLFGLFPGDHITPLKTPDLAEASKKTLEIKGDETTGWSKGWRINLWARLWDGNRA 669
Query: 664 YRMVKRLFNLVDPEHEKHFE----GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
Y+M + L VDP+ +K + GG Y NLF AHPPFQID NFG AAVAEMLVQS N
Sbjct: 670 YKMFRELLRYVDPDGKKTEKPRRGGGTYPNLFDAHPPFQIDGNFGGAAAVAEMLVQSDEN 729
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++ LLPALP D W+ G VKG+ ARGG + + W + +L V I S
Sbjct: 730 EIRLLPALP-DAWAEGSVKGICARGGFEIEMAWSNKNLTHVVISS 773
>gi|308070789|ref|YP_003872394.1| hypothetical protein PPE_04076 [Paenibacillus polymyxa E681]
gi|305860068|gb|ADM71856.1| Conserved hypothetical protein [Paenibacillus polymyxa E681]
Length = 822
Score = 539 bits (1389), Expect = e-150, Method: Compositional matrix adjust.
Identities = 308/838 (36%), Positives = 457/838 (54%), Gaps = 69/838 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PAK + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D + DA + L
Sbjct: 10 LRLWYRQPAKVWEEALPVGNGRLGAMVFGGIREEHLQLNEDTLWSGFPRDGVHYDALRYL 69
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
VR + G+Y EA + + G + YQ LGD+ + + E Y RELDL
Sbjct: 70 QPVRKRIADGKYKEAEQLINTNMLGRDTEAYQPLGDLWV----TQEGLGEIVHYERELDL 125
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
T TA V + V +TRE +S PD +++ ++ ++ G + +V + S V +
Sbjct: 126 LTGTAAVTFQSDGVRYTREVIASAPDGIMMVSLTANKLGRIHASVRITSPHPCEDEVGED 185
Query: 191 NQ----------------------IIMEGRCPGKRIP------PKANANDDPKGIQFSAI 222
I + GR P P++ ++ G+ F+
Sbjct: 186 AHFGDSSKWDSDNDDSSDESSGDFITLTGRAPSHVESNYHGDHPQSVVYENDLGMAFA-- 243
Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL 282
++ ++ + GT++ D L + G+D + L A++ F G P+ + L
Sbjct: 244 VQARVIPEGGTLTKGADGALIISGADKITVYLAAATGFQGFHAMPNSDATESVDACQVIL 303
Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
+L + RH D++KLF RV+++L DT + E++ +P+ +R++ +Q
Sbjct: 304 DGAISLGSEQVRQRHEQDHRKLFDRVALELG-------GDTLTNESV--LPTDQRLELYQ 354
Query: 343 TDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+ DP L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +MNYW
Sbjct: 355 KGQADPGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNYW 414
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+ CNL+EC EPL + ++ G + A ++Y A GW HH D+W + G WA
Sbjct: 415 PAEVCNLAECHEPLLHMIGEVARTGRRVASIHYGAQGWAAHHNVDVWRYAGPSGGHASWA 474
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
WP+GG WL HLWE Y +T+D +L ++AYPL++G A+F +DWL+EG G L T+PSTS
Sbjct: 475 FWPLGGVWLTAHLWERYLFTLDTAYLAEQAYPLMKGAAAFCMDWLVEGPKGRLVTSPSTS 534
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PE++F PDG+ +S STMDM +IRE+ S I AA++LE ++D + + RL P
Sbjct: 535 PENKFKTPDGEECSISMGSTMDMTLIRELLSNCIQAADLLELDDD-FRNRCEGTRARLMP 593
Query: 582 TKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
+I G + EW DF++ E HRH+SHL+GL+PG I I P+L +AA +L++R +
Sbjct: 594 YQIGRHGQLQEWFVDFEEAEPGHRHVSHLYGLYPGRQIHIRDTPELAEAARISLRRRLDH 653
Query: 642 G---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G GWS W L+ARL D + A+R V+ L + +Y NLF AHPPFQ
Sbjct: 654 GGGHTGWSCAWLINLYARLEDGDAAHRYVRTLLSR-----------SIYPNLFDAHPPFQ 702
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG TA +AEML+QS +L LLPALP WS G V GLK GG TV + W L
Sbjct: 703 IDGNFGATAGIAEMLLQSRPGELTLLPALP-TAWSEGRVSGLKGHGGMTVGMEWSGSRLV 761
Query: 759 EVGIYSNYSNNDHDSFKTLHYRGTSVKVNLS----AGKI--YTFNRQLKCTNLHQSIV 810
+ ++ S + ++ H + L G I + F ++ + TN H I+
Sbjct: 762 RAQLATSISAGSC-TIRSAHPFSADARQALPDPEYGGFILSWIFTKEQEITNGHTIII 818
>gi|310644025|ref|YP_003948783.1| candidate alpha-l-fucosidase; glycoside hydrolase family 95
[Paenibacillus polymyxa SC2]
gi|309248975|gb|ADO58542.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
[Paenibacillus polymyxa SC2]
Length = 824
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 297/778 (38%), Positives = 441/778 (56%), Gaps = 61/778 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA+ + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA + L
Sbjct: 10 LRLWYRQPAEVWEEALPVGNGRLGAMVFGGIREEHLQLNEDTLWSGFPRDGVQYDALRYL 69
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDD-SHLKYAEETYRRELDL 130
R L+ G+Y EA + + G + YQ LGD+ + ++ + + Y RELD+
Sbjct: 70 EPARKLIADGKYKEAEQLITSNMLGRDTEAYQPLGDLWITQENLGEIAH----YERELDM 125
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS----------- 179
T TA V + V +TR+ +S PD VI+ ++ ++ G + +V + +
Sbjct: 126 QTGTAAVTFQSDGVRYTRKVIASAPDGVIMVSLTANKVGKIHASVRMTTPHSCDDEAGED 185
Query: 180 --LLDNHSYVNGNNQ--------IIMEGRCPGKRIP------PKANANDDPKGIQFSAIL 223
D+ + + N+ I + GR P P++ ++ G+ F+ +
Sbjct: 186 VHFSDSSQWASDNDPSEEPTRDFITLTGRAPSHVESNYHGDHPQSVVYENDLGMAFA--V 243
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ ++ + GT++ +D L + +D + L A++ F G P+ + L
Sbjct: 244 QARVIPEGGTLTTRDDGALIISDADKITVYLAAATGFRGFQAMPNSDATESAEACKVILD 303
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+L + RH D++KLF RV+++L +DT ++E++ +P+ R++ +Q
Sbjct: 304 GAISLGSEQVRQRHEQDHRKLFDRVALELG-------SDTLTDESV--LPTDLRLERYQK 354
Query: 344 DE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
+ D L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +MNYW
Sbjct: 355 GQADRGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNYWP 414
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+ CNL+EC EPL + +S G + A ++Y A GW HH D+W + G WA
Sbjct: 415 AEVCNLAECHEPLLHMIGEVSRTGRRVASIHYGAQGWTAHHNIDVWRYAGPSAGHASWAF 474
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WP+GG WL HLWE Y +T+D +L ++AYPL++G A+F LDWL EG DG L T+PSTSP
Sbjct: 475 WPLGGVWLTAHLWERYLFTLDTTYLAEQAYPLMKGAAAFCLDWLAEGPDGRLATSPSTSP 534
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
E++FI P G+ +S STMDM +IRE+ S I AA++LE + D ++ ++ RL P
Sbjct: 535 ENKFITPGGEDCSISMGSTMDMTLIRELLSNCIQAADLLELD-DEFRKRCEETRERLVPY 593
Query: 583 KIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG 642
+I G + EW DF++ E HRH+SHL+G++PG I I P+L +AA +L++R + G
Sbjct: 594 QIGRHGQLQEWLVDFEEAEPGHRHVSHLYGVYPGRQIHIRDTPELAEAARISLRRRLDHG 653
Query: 643 ---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
GWS W L+ARL D + A+R V+ L + Y NLF AHPPFQI
Sbjct: 654 GGHTGWSCAWLINLYARLEDGDTAHRYVRTLLSR-----------STYPNLFDAHPPFQI 702
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
D NFG TA +AEML+QS L +L LLPALP W G V GLK GG TVS+ W L
Sbjct: 703 DGNFGATAGIAEMLLQSRLGELTLLPALP-SAWPEGRVSGLKGCGGITVSMEWSGSRL 759
>gi|337748853|ref|YP_004643015.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|336300042|gb|AEI43145.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
Length = 762
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 314/758 (41%), Positives = 424/758 (55%), Gaps = 57/758 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
F PAK + +A+P+GNGRLGAMV+G E ++LNEDT+W G P D NPDA + L ++R
Sbjct: 8 FRQPAKDWNEALPLGNGRLGAMVFGLPRKERIQLNEDTVWYGGPVDRHNPDALRYLPEIR 67
Query: 77 SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ SG+ AEA A++ L G P Y LGD+ + D H E YRRELDL+ +
Sbjct: 68 EKLLSGRLAEAHKLAAMALSGIPESQRHYMPLGDLWITMD--HPPGEAEEYRRELDLSKS 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGN 190
A + Y +G+ F RE F S+PDQ +V ++ G++ LD S + G
Sbjct: 126 VAGLHYRIGDTAFIRETFISHPDQALVLRLRADRPGAIGLTARLDRGKSRYLDEIEAAGP 185
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N ++M G C GK G F A L +D G + + L VEG+D
Sbjct: 186 NVLVMRGNCGGK------------GGSDFRAALR---ADAEGGSVRIIGEHLIVEGADAV 230
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L L A+++F ++DP + ++ L S Y+ L RH +DY+ L+ RV +
Sbjct: 231 TLYLSAATTF---------RQEDPEAYCLNTLSSAAARGYASLLERHTEDYRGLYDRVQL 281
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
L ++ TD + + +P+ ER++ + EDP L+ L FQ+GRYLLISSSRPG+
Sbjct: 282 SL-----ELQTDEAAAAAV--LPTDERLELVKKGGEDPGLIPLYFQYGRYLLISSSRPGS 334
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWNE + P WDS +NIN +MNYW + C+LSEC EPLFD + +S GS+T
Sbjct: 335 LPANLQGIWNEQMRPPWDSKYTININTQMNYWPAESCHLSECHEPLFDLIQRMSERGSRT 394
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+V Y GW HH TD+W ++ + WP+GGAWLC HLWEHY + D L +
Sbjct: 395 AEVMYGCRGWTAHHNTDLWGDTAPQDIYLPATHWPLGGAWLCLHLWEHYRFGGDTQRLAE 454
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YP+++G A FLLD++IE DG+L T PS SPE+ +I P+G+ + MD I RE
Sbjct: 455 -FYPVMKGAARFLLDYMIEAKDGHLITCPSVSPENTYILPNGESGTLCAGPAMDSQIARE 513
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
+F A AA L +ED E L +L R+ ++AE G + EW +D+K+ + HRH+SH
Sbjct: 514 LFQACREAARELGTDEDFRSELEL-ALQRIPLPQLAEGGYLQEWLEDYKEKDPGHRHISH 572
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRM 666
LF L PG IT + P+ AA +TL +R G GWS W WARL D E AY
Sbjct: 573 LFALHPGTQITPARTPEWAAAARQTLVRRLANGGGHTGWSRAWIINFWARLGDGEEAYGH 632
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
+ LF NLF HPPFQID NFG AAVAEML+QS L+LLPA
Sbjct: 633 MLGLFR-----------KSTLPNLFDNHPPFQIDGNFGAAAAVAEMLLQSHDGALHLLPA 681
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
LP W +G + GL+ARGG V + W DG L E I S
Sbjct: 682 LP-KAWPAGRISGLRARGGFEVDLVWSDGSLTEAVIRS 718
>gi|322512626|gb|ADX05719.1| putative carbohydrate-active enzyme [uncultured organism]
Length = 999
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 322/806 (39%), Positives = 453/806 (56%), Gaps = 72/806 (8%)
Query: 8 STTNPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+T NPL + +N A FT+A+PIGNG +G +++GGV + + LNE T+W+G PGD
Sbjct: 30 TTDNPLTLWYNSDAGSEFTNALPIGNGYMGGLIYGGVTKDFIGLNESTVWSGGPGDNNKQ 89
Query: 67 DAPKALSDVRSLVDSGQY--AEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY 124
A L D R + G Y AE+ + PA +Q +GD+ + S Y
Sbjct: 90 GAASHLKDARDALFRGDYRAAESIVNQYMIGPGPAS-FQPVGDLIISTSHS----GASDY 144
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
RRELDL TA A+ Y+ V+ TRE+F+S PD VIV +S +SGS+SF ++ + ++
Sbjct: 145 RRELDLKTAIAKTTYTHSGVKHTREYFASYPDHVIVVYLSADKSGSVSFGATMTTPHNSK 204
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N N +I + I+F L + + ++S + + V
Sbjct: 205 RMSNDGNTLIYDVTV---------------NSIKFQNRLTVVTDGGKASVS---NGNINV 246
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
EG++ A L+L +++F +D DP + + + + SY DL HL DYQ +
Sbjct: 247 EGANSATLILTTATNFKAY----NDVSGDPGAIAAEIMSKVAKKSYEDLLAAHLKDYQTI 302
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV + L + K S +I ++ RVK+F + DPSLVEL +Q+GRYLLI+S
Sbjct: 303 FNRVKLDLGTADK-------SAGDI----TSTRVKNFNSTNDPSLVELHYQYGRYLLIAS 351
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SR G Q ANLQGIWN+D +P W S NINLEMNYW + NL EC PL D + +
Sbjct: 352 SRKGGQPANLQGIWNKDTNPIWGSKYTTNINLEMNYWPAESGNLEECVWPLIDKIKSMVP 411
Query: 425 NGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT-M 482
G KTA+V++ + GWV HH TD+W +S+ G W LWP G WL THLWEH+ Y
Sbjct: 412 QGEKTAKVHWGVDEGWVEHHNTDLWNRSAPIDG--AWGLWPSGAGWLSTHLWEHFLYNPT 469
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D+ +L+ YP ++G A F ++ L+E + YL T PS SPE++ G C +
Sbjct: 470 DKAYLQD-VYPTMKGAALFFVNSLVEEPETGNKYLVTAPSDSPENDH---GGYNVC--FG 523
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
TMD IIR+V + I A+++L +ED + K+ ++ RL PTK + G I EW QD+ D
Sbjct: 524 PTMDNQIIRDVLNYTIEASKILGVDED-VRAKMEATVKRLPPTKTGKYGQITEWLQDWDD 582
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
P +RH+SHL+GLFP IT E+ PDL K A TLQ+RG++ GWS+ WK WAR+HD
Sbjct: 583 PNNKNRHISHLYGLFPSAQITPEETPDLIKGAGVTLQQRGDDATGWSLAWKINFWARMHD 642
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
+HAYRM++ L P Y+NLF AHPPFQID NFG + V EML+QS N
Sbjct: 643 GDHAYRMIRMLLT---PSKT-------YNNLFDAHPPFQIDGNFGAVSGVNEMLMQSHNN 692
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
+ LLPALP +W++G VKG++ARGG E S+ WK G L V I S + + T
Sbjct: 693 RINLLPALP-SQWANGSVKGIRARGGFEIDSMAWKGGKLTYVAIKSLVGSTLNVVSGTNK 751
Query: 779 YRGTSVKVNLSAGKIYTFNRQLKCTN 804
+ ++V GK+Y F+ LK TN
Sbjct: 752 FSTSTV-----PGKVYEFDGNLKITN 772
>gi|284039852|ref|YP_003389782.1| alpha-L-fucosidase [Spirosoma linguale DSM 74]
gi|283819145|gb|ADB40983.1| Alpha-L-fucosidase [Spirosoma linguale DSM 74]
Length = 864
Score = 539 bits (1388), Expect = e-150, Method: Compositional matrix adjust.
Identities = 299/784 (38%), Positives = 429/784 (54%), Gaps = 49/784 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKA 71
L + +N PA +++A+P+GNG +GAMV+G E L+LNE TL++G P + + K
Sbjct: 25 LTLWYNKPATVWSEALPLGNGYMGAMVFGDPAKEHLQLNEGTLYSGDPASTFKAINVRKD 84
Query: 72 LSDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
V +L+ + QY EA + K G +YQ +GD ++ D H A YRR+ D+
Sbjct: 85 FKQVSALLAAKQYQEAQSLIAKEWLGRNHQLYQPMGDFWIDVD--HKNEAITDYRRQFDI 142
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-YVNG 189
TATA +Y VGN +TR +F+S PD VIV K++ + G ++ L + ++ + Y
Sbjct: 143 ATATATTRYKVGNTTYTRTYFASYPDHVIVVKLTANGPGKINCTFHLSTPHESTARYAAQ 202
Query: 190 NNQIIMEGRCPG---------------------------KRIPPKANANDDPK--GIQFS 220
N + M G+ PG +R P N D + G+ +
Sbjct: 203 GNTLTMRGKVPGFGLRRTFEQIEKAGDQYKYPEVYEKNGQRKPGIDNMLYDRQINGLGMA 262
Query: 221 AILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
+K+ G I ++ L V+ + V +L A++S++G +P+ DP
Sbjct: 263 FETRVKVQHTGGRIRQ-DNNALTVQDASEVVFVLSAATSYNGFDKSPAYEGVDPKPILDQ 321
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
++I SY+ LY HL DY+KLF RV IQL+ +E P+ +RV+
Sbjct: 322 RFKAIEKKSYAALYQTHLADYKKLFDRVDIQLA-----------AETEQSQRPTDQRVEL 370
Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
F DPS L FQ+GRYL+I+ SRPG Q NLQG+WN+ + P W+ +NIN +MNY
Sbjct: 371 FSNGLDPSFAALYFQYGRYLMIAGSRPGGQPLNLQGMWNDLMVPPWNGGYTININAQMNY 430
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + NLSECQEP F + L+ING +TA+ Y GWV HH DIW + +
Sbjct: 431 WPAELTNLSECQEPFFKAVKELAINGHETARSMYGNDGWVAHHNMDIW-RHAEPVDLCNC 489
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
+ WPM WL +H WE Y ++ D FL+K +PLL+G F WL++ GYL T
Sbjct: 490 SFWPMAAGWLTSHFWERYLFSGDPIFLKKEVFPLLKGAVQFYQGWLVKNEQGYLVTPVGH 549
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPE F+ D K A S TMDMAI+RE FS + A + L +D V ++L +L
Sbjct: 550 SPEQNFLYDDKKQATFSPGPTMDMAIVRESFSRYLEACKTLGITDD-FTAGVKQNLSQLL 608
Query: 581 PTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE 640
P +I + G + EW DF D +V HRH SHL+ + P + I+++ P+L AA + +++RG+
Sbjct: 609 PYQIGKYGQLQEWQTDFDDADVQHRHFSHLYAMHPSNQISLQSTPELAAAARRVMERRGD 668
Query: 641 EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
GWS+ WK +WARL D +HA +++ LF LV GG Y NLF AHPPFQID
Sbjct: 669 GATGWSMGWKVNVWARLLDGDHALKLITNLFKLVRTNSTSMQGGGTYPNLFCAHPPFQID 728
Query: 701 ANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
NFG TA +AEMLVQS +++LLPALP W +G VKGLKARGG + + WK G L +
Sbjct: 729 GNFGATAGIAEMLVQSHAGEVHLLPALP-QAWHTGHVKGLKARGGYEIDLEWKAGKLTKA 787
Query: 761 GIYS 764
++S
Sbjct: 788 VVHS 791
>gi|325915867|ref|ZP_08178165.1| hypothetical protein XVE_2098 [Xanthomonas vesicatoria ATCC 35937]
gi|325537923|gb|EGD09621.1| hypothetical protein XVE_2098 [Xanthomonas vesicatoria ATCC 35937]
Length = 776
Score = 538 bits (1387), Expect = e-150, Method: Compositional matrix adjust.
Identities = 308/795 (38%), Positives = 448/795 (56%), Gaps = 64/795 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + T+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+
Sbjct: 24 AVAPTDALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTS 83
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
P+ AL VR+L+ G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 84 PEGLAALPQVRALIFGGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GIS 140
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR+LDL+TA A + G R+ F Q IV ++S ++S V +DS
Sbjct: 141 EYRRQLDLDTAVATTSFRSGGALHQRDVFVCAQSQCIVVRLSCDRPRAISLRVGIDSPQS 200
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD--DRGTISALEDK 240
V ++ GR N GI+ +++ G ++AL D+
Sbjct: 201 GEVTVE-QGGLLFTGR------------NGSFAGIEGKLRFALRVVPRVKGGAVTALRDR 247
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L++EG+D VLLL A++S+ + D DP + + ++L+ + L Y+ L HL D
Sbjct: 248 -LRIEGADEVVLLLTAATSYR--RFDAVDG--DPLALAAASLRKAQALDYAALLRAHLAD 302
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
+Q+LF RV+I L S + +P+ +RV+ F DP+L L Q+GRYL
Sbjct: 303 HQRLFRRVAIDLGTS------------DAAALPTDQRVRQFAGGNDPALAALYHQYGRYL 350
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI SSRPGTQ ANLQGIWN+ + P W+S +N+N EMNYW S L EC EPL +
Sbjct: 351 LICSSRPGTQPANLQGIWNDLMQPPWESKYTINVNTEMNYWPSEANALHECVEPLESMVF 410
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L+I G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 411 DLAITGAHTARALYGAPGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDY 469
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C
Sbjct: 470 GRDRAYLSK-IYPLFKGAAEFFVATLVRDPQTGAMVTNPSISPENQH--PFGAAICA--G 524
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF-- 597
TMD ++R++F+ I+ +++L+ + AL +++ +L P +I + G + EW QD+
Sbjct: 525 PTMDAQLLRDLFAQCIAMSKLLDVDA-ALAQQLATLREQLPPNRIGKAGQLQEWQQDWDM 583
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
PE+HHRH+SHL+ L P I + P+L AA++TL+ RG+ GW I W+ LWARL
Sbjct: 584 DAPEIHHRHVSHLYALHPSSQINLRDTPELAAAAKRTLETRGDNTTGWGIGWRLNLWARL 643
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D EHAYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 644 ADGEHAYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSW 693
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTL 777
++LLPALP + W G V+G++ RGG ++ + W G L + ++S D L
Sbjct: 694 GGSVFLLPALP-NAWPRGSVRGVRVRGGASIDLEWDGGRLQQARLHS-----DRGGRYQL 747
Query: 778 HYRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 748 SYAGQTLDLELGAGR 762
>gi|261407087|ref|YP_003243328.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261283550|gb|ACX65521.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 755
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 308/798 (38%), Positives = 442/798 (55%), Gaps = 69/798 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+PIGNGRLGAM++GG E L+LNED++W G P D N DA L
Sbjct: 12 RLWYRKPAAEWNEALPIGNGRLGAMIFGGTAEEKLQLNEDSVWYGGPRDRNNEDALPHLP 71
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ G+ EA A++ + G P Y LGD+ L F H + AE+ Y RELDL
Sbjct: 72 EIRKLIMEGRLQEAEELAAMTMAGLPEAQRHYVPLGDLLLSFG-QHGQLAED-YMRELDL 129
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+RV Y +G + +TRE F+S PDQ +V +I+ + +++F + N YV
Sbjct: 130 ERGVSRVSYRIGGIRYTRELFASYPDQAVVIRITADKQEAVTFKARFNR--RNWRYVEKT 187
Query: 191 NQ-----IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ ++M G C G+ G FSA+L+ + G + + L V+
Sbjct: 188 DKWEASGLVMRGDCGGE------------GGSSFSAVLK---AVPEGGVCRTLGEYLLVD 232
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ LLL A ++F P DP + L+ + + Y++L RH+ DY++L+
Sbjct: 233 GASSVTLLLAAGTTFRHP---------DPELDGKRRLEELSRVPYAELLARHVADYRELY 283
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGRYLLISS 364
RV ++L +P +P+ ER+K FQ +ED L+ FQFGRYLLI+S
Sbjct: 284 GRVELKLPENPDKAA-----------LPTDERLKRFQHGEEDHGLIATYFQFGRYLLIAS 332
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+ ANLQGIWN+ +P WDS +NIN +MNYW + CNL+EC EPLF+ + +
Sbjct: 333 SRPGSLPANLQGIWNDSFTPPWDSKFTININAQMNYWHAENCNLAECHEPLFELIERMRE 392
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA V Y G+ HH TDIWA ++ + + WPMG AWLC HLWEHY + DR
Sbjct: 393 PGRVTAGVMYGCRGFTAHHNTDIWADTAPQDTYLPASFWPMGAAWLCLHLWEHYRFGQDR 452
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL RAY ++ A FLLD+LIE +G L T PS SPE+ + P+G+ + +TMD
Sbjct: 453 YFL-ARAYETMKEAALFLLDYLIEDGEGRLVTCPSVSPENRYKLPNGETGVLCTGATMDF 511
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
II +F A + +AE+ ++E A E++ +L RL +I + G I EW +D+++ E H
Sbjct: 512 QIIEALFDACMQSAEIFGRDE-AFREELAAALKRLPKPQIGKYGQIQEWMEDYEEVEPGH 570
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQE 661
RH+SHLF L+PG + ++ P+L AA TL++R G GWS W WARL D +
Sbjct: 571 RHISHLFALYPGEGMNVDSTPELAAAARTTLERRLANGGGHTGWSRAWIINFWARLLDAD 630
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
AY V+ + + H NLF HPPFQID NFG TA +AEML+QS +
Sbjct: 631 KAYENVRAMLH-----HST------LPNLFDNHPPFQIDGNFGGTAGIAEMLLQSHAGLI 679
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRG 781
LLPALP + WS G V+GL+ARGG T++ W G + EV + + S L
Sbjct: 680 RLLPALP-NSWSDGEVRGLRARGGFTLNFTWTKGQVTEVVVSCSVSGPCRLQAPGL---- 734
Query: 782 TSVKVNLSAGKIYTFNRQ 799
V AG+ Y F ++
Sbjct: 735 DPVSFTGEAGRSYMFTKK 752
>gi|295689298|ref|YP_003592991.1| alpha-L-fucosidase [Caulobacter segnis ATCC 21756]
gi|295431201|gb|ADG10373.1| Alpha-L-fucosidase [Caulobacter segnis ATCC 21756]
Length = 781
Score = 538 bits (1386), Expect = e-150, Method: Compositional matrix adjust.
Identities = 304/796 (38%), Positives = 448/796 (56%), Gaps = 66/796 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
T +PL++ + PAK + +A+P+G GRLGAMV+GGV E L+LNEDTLW G P + NP
Sbjct: 27 TPKASPLRLWYRQPAKTWVEALPVGTGRLGAMVFGGVDVERLQLNEDTLWAGGPYEPINP 86
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE-E 122
+A AL ++R L+D+G YA+A A K G P YQ +GD++L+F AE
Sbjct: 87 EAGAALPEIRRLIDTGDYAKAAQLAETKFVGVPKQQMSYQTIGDLKLDFPG----LAEPA 142
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
+Y REL+L+ A A ++ G V+ RE +S PD VI +++ S G++S ++ S L
Sbjct: 143 SYVRELNLDGAIATTRFKAGGVDHVREVIASAPDGVIAVRLTASRRGAISVDLGFASPLK 202
Query: 183 NH--SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALED 239
+ + V G + ++ A AND +GI E ++ +G + +
Sbjct: 203 SAPAARVEGRSLVL-------------AGANDSQQGIPAKLRFECRVDVRAKGGRVSGQG 249
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ L + +D +LL+ A++S+ +D DPT+ + + L + N ++ + H
Sbjct: 250 ETLSIRDADEVILLIAAATSYR----RYNDVSGDPTALNKATLARLSNKPWAKILAGHQA 305
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
D+ LF RV + R+ ++ P+ ER+K+ +DPSL L +Q+GRY
Sbjct: 306 DHHALFRRVEVDFGRTRAELS------------PTDERIKASPMTDDPSLAALYYQYGRY 353
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLI+ SRPGTQ ANLQG+WN+ S W +NIN EMNYW + P +L E EPL +
Sbjct: 354 LLIACSRPGTQPANLQGVWNDKPSAPWGGKYTININTEMNYWPAEPTSLPELVEPLIALV 413
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LS G++TA+ Y A GWV HH TD+W +++A W +WP GGAWLC HLW+HY+
Sbjct: 414 RDLSETGARTAKAMYGARGWVAHHNTDLW-RATAPVDGAPWGVWPTGGAWLCKHLWDHYD 472
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
Y DR +L R YPL++G A F LD L ++ G L TNPS SPE++ G A +
Sbjct: 473 YGRDRAYL-ARVYPLMKGSARFFLDTLVVDPKFGVLVTNPSLSPENDH----GHGASIVA 527
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF- 597
TMD AIIR++F + A VL ++ V ++ + +L P K+ +DG + EW +D+
Sbjct: 528 GPTMDQAIIRDLFDNCLKAEAVLGADQ-TFVAELKTARDKLAPYKVGKDGQLQEWQEDWD 586
Query: 598 -KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
P++HHRH+SHL+GLFP I I+ P L AA +TL RG+ GW+I W+ LWAR
Sbjct: 587 ADAPDIHHRHVSHLYGLFPSDQIAIDTTPKLAAAARQTLVTRGDLSTGWAIAWRLNLWAR 646
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L + +HA+ +++ L PE Y N+F AHPPFQID NFG + + EM++QS
Sbjct: 647 LGEGDHAHGILRLLLG---PERT-------YPNMFDAHPPFQIDGNFGGASGMTEMILQS 696
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKT 776
+ +YLLPALP W +G +KGL+ARG V + W G L E + + D
Sbjct: 697 RNDRIYLLPALP-SAWPTGHIKGLRARGAVGVDVRWTGGKLAEAVLRAKV-----DGRHV 750
Query: 777 LHYRGTSVKVNLSAGK 792
+ G+S+ V L G+
Sbjct: 751 VVLGGSSLTVELRRGQ 766
>gi|189465240|ref|ZP_03014025.1| hypothetical protein BACINT_01585 [Bacteroides intestinalis DSM 17393]
gi|189437514|gb|EDV06499.1| hypothetical protein BACINT_01585 [Bacteroides intestinalis DSM 17393]
Length = 826
Score = 538 bits (1385), Expect = e-150, Method: Compositional matrix adjust.
Identities = 302/775 (38%), Positives = 451/775 (58%), Gaps = 45/775 (5%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+++ ++ T ++ ++ PA+ + +A+PIGNGR+GAMV+GG+ E ++LNE+T+WTG P
Sbjct: 20 LLSCQNNPDTTIWRLWYDQPAEKWEEALPIGNGRIGAMVFGGITKEKIQLNEETVWTGEP 79
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATA---ASVKLFGHPADVYQLLGDIELEFDDSHL 117
+NPDA A+ D+R L+ G+Y EA V + +YQ +GD+ L F
Sbjct: 80 NSNSNPDALNAIPDIRKLIFQGKYKEAQKLVDEKVISKTNHGMIYQPVGDLNLTFPGHE- 138
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ Y RELD+ +A A+ +Y+V +VE+ RE F+S DQVIV ++ S G + F+ L
Sbjct: 139 --TAKNYYRELDIESAIAKTRYTVNDVEYQREIFTSFTDQVIVIHLTASRKGKIVFSAEL 196
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISA 236
+S + + + N + ++G G ++ +G I FS + +KI ++G +
Sbjct: 197 NSPQKSQT-ITLENGLSLQGSTEG---------HEGLEGKISFSTL--VKIVPEKGQMKT 244
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
E ++ V +D AV + V+ ++ F+N ++ +P + S LQ Y+ L T
Sbjct: 245 -EASRITVSNAD-AVTIYVSIAT---NFVNYANLSGNPDQKVKSYLQHATQKDYAKLKTD 299
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+D Y+ F+RV +L VT+ + + R+ F +DP+L L FQF
Sbjct: 300 HMDYYRDYFNRVKFKLD------VTEAIQKT------TDVRIAEFAQGKDPNLAALYFQF 347
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLIS S+PGTQ ANLQGIWNE + P WDS NINLEMNYW + NLSE EPL
Sbjct: 348 GRYLLISCSQPGTQPANLQGIWNERMKPAWDSKYTTNINLEMNYWPTEITNLSELHEPLI 407
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
+ L++ G TA++ Y A GW++HH TD+W + A DR +WP GAWL HLW
Sbjct: 408 QMIKELAVTGGHTAKIMYGARGWMLHHNTDLWRTTGAVDRSGP--GMWPTCGAWLSRHLW 465
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLA 534
EH+ Y+ D+ +LE+ YP+++G A FLLD+ +E + +L PS+SPE+ F + KL
Sbjct: 466 EHFLYSGDKTYLEE-VYPIMKGAALFLLDFAVEEPEHHWLVIAPSSSPENTFDKKN-KLT 523
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
+ TMD ++ E+FS +ISA E+LE+++ + + + R+ P +I + EW
Sbjct: 524 NTA-GVTMDNQLMFELFSNLISATEILERDQH-FADTLRQIRTRIPPMQIGRYSQLQEWM 581
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
D DP HRH+SHL+GLFPG+ I+ + PDL AA +L RG+ GWS+ WK LW
Sbjct: 582 HDLDDPNDKHRHISHLYGLFPGNQISPYRTPDLFNAARNSLNHRGDASTGWSMGWKVCLW 641
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
AR D + AY+++ L ++ ++ GG Y NL AHPPFQID NFG TA +AEML+
Sbjct: 642 ARFMDGDRAYKLITEQLRLTGDKNTEYDGGGTYPNLLDAHPPFQIDGNFGCTAGIAEMLL 701
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
QS L++LPALP W +G ++GLKARGG I WK+G + + I SN N
Sbjct: 702 QSHDGALHILPALP-SAWRNGIIQGLKARGGFLTDIEWKNGQVKTIKIKSNLGGN 755
>gi|408671641|ref|YP_006870551.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
gi|387857648|gb|AFK05740.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
Length = 868
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 302/791 (38%), Positives = 432/791 (54%), Gaps = 61/791 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKALSDV 75
++ PA +T+A+PIGN +GAM++G E ++LNE TL++G P + N K V
Sbjct: 31 YDKPASVWTEALPIGNSYMGAMIFGDSRQEHIQLNESTLYSGEPDATFKNISVRKYYQQV 90
Query: 76 RSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
L+ +G+Y EA A K L G VYQ LGD F+ A Y+R LD+++AT
Sbjct: 91 TELLKAGKYQEADAIVAKELLGRNHQVYQPLGDFWANFEHGQ---AVSAYKRWLDISSAT 147
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSYVNGNNQI 193
A +Y VGN +F R++F+S PD +IV K S + ++ + + + Y N +
Sbjct: 148 AYTEYVVGNTKFKRQYFASYPDHIIVVKFSTEGTDKINCTLRFTTPHISTAKYEANGNML 207
Query: 194 IMEGRCP---------------------------GKRIPPKANAND-------DPKGIQF 219
M G+ P G R KANA + +GI F
Sbjct: 208 KMMGKAPYFVQRREFEQVESVGDQYKYPELYENDGTR---KANAKNILYDSTKGGRGISF 264
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM 279
+ + KI + G + D +KVE + V++L A++S++G +PS K+ +
Sbjct: 265 ES--QAKILNLGGKLIRTGD-SIKVENASEIVVVLTAATSYNGFDKSPSKQGKNSSFLVN 321
Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
S L+SI ++ LY+ HL DY+KLF RV +L+ E +P+ +RV
Sbjct: 322 SYLKSIEKKIFTQLYSTHLTDYKKLFDRVDFELAE-----------ETEQSKLPTDQRVS 370
Query: 340 SFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
F +DPS L FQ+ RYL+I+ SRP Q NLQGIWN+ + P W+ NIN EMN
Sbjct: 371 LFSNGKDPSFPSLYFQYSRYLMIAGSRPNGQPLNLQGIWNDQIVPPWNGGYTTNINTEMN 430
Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV 459
YW + NLSEC EPLF + L++NG TA+ Y GW HH DIW +++ + +
Sbjct: 431 YWIAESTNLSECHEPLFKAIKELAVNGKNTAKFMYGNEGWTSHHNMDIW-RNAEPIDRCL 489
Query: 460 WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNP 518
+ WPMG WL +H WE Y +T D+ FL+ YP+L+G F WL+ + GYL T
Sbjct: 490 CSFWPMGAGWLTSHFWERYLHTGDKVFLKNEVYPVLKGVVEFYQGWLVKDAKTGYLITPI 549
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
SPE F+ D K A +S TMDM I+RE F+ + + L N D LV+ + + LP+
Sbjct: 550 GHSPESYFLYEDNKRATISQGPTMDMGIVREAFARYVEMCQTLGIN-DELVKNIKQQLPQ 608
Query: 579 LRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
L P +I + G + EW +DF+D + HRH SHL+ L P + I P+L A++K +++R
Sbjct: 609 LLPYQIGKYGQLQEWKEDFEDADPKHRHFSHLYALHPSNQINNFTTPELAAASKKVIERR 668
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G+ GWS+ WK +WARL D +HA +++ LF LV + GG YSNLF AHPPFQ
Sbjct: 669 GDLATGWSMGWKVNVWARLLDGDHALKLLTNLFTLVKTQETNMTGGGTYSNLFCAHPPFQ 728
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG A +A+MLVQS +L+LLPALP W SG + GLKARGG TV + W++G L
Sbjct: 729 IDGNFGAAAGIAQMLVQSHAGELHLLPALP-STWQSGKINGLKARGGFTVDLEWENGKLT 787
Query: 759 EVGIYSNYSNN 769
+ I+S N
Sbjct: 788 KARIHSALGGN 798
>gi|404484444|ref|ZP_11019648.1| hypothetical protein HMPREF9448_00050 [Barnesiella intestinihominis YIT 11860]
gi|404339449|gb|EJZ65880.1| hypothetical protein HMPREF9448_00050 [Barnesiella intestinihominis YIT 11860]
Length = 802
Score = 537 bits (1384), Expect = e-150, Method: Compositional matrix adjust.
Identities = 310/797 (38%), Positives = 460/797 (57%), Gaps = 41/797 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
+K+ ++ PA++F +A+ IGNG +GA ++GGV + + N+ TLWTG P + ++PDA
Sbjct: 25 MKLHYDRPAEYFEEALVIGNGTMGATLYGGVKKDKISFNDITLWTGEPESENSSPDAFNV 84
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ ++R+L+D+ Y A A K+ GH ++ YQ LG + +E+ D ++ Y R LD+
Sbjct: 85 IPEIRALLDNEDYEGADKAQYKVQGHYSENYQPLGTLTIEYLDDTAGISD--YHRWLDIG 142
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
ATAR +Y FT ++F+S PD VIV ++ + +S DS L + S V +N
Sbjct: 143 NATARTQYLKDGKLFTSDYFASAPDSVIVIRLKSENKEGIHALLSFDSPLPHSSQV-ADN 201
Query: 192 QIIMEGRCPGKRIPPKANAND----DP-KGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+I +EG P A D DP +GI F ++ + +S D + D +++++G
Sbjct: 202 EISVEGYAAYHSFPVYYKAEDKHRYDPERGIHFKTLVRV-LSVDGSVKNRYSDSRIEIDG 260
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
S ++L+ +SF+G +P ++ S ++ +Y L H+ DY+ F
Sbjct: 261 STEVLILIANVTSFNGFDKDPVKEGRNYRSHVEKRMKCAIGKTYDALREAHIRDYKYYFD 320
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD---EDPSLVELLFQFGRYLLIS 363
RV + L + DI +P+ +++ F TD ++P L EL FQFGRYLLIS
Sbjct: 321 RVKLDLGNTDDDIAA----------LPTDKQL-LFYTDCKQQNPDLEELYFQFGRYLLIS 369
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR ANLQG+WNE + P W S VNINLE NYW S NL E Q PL +F+ LS
Sbjct: 370 SSRTPGVPANLQGLWNESVLPPWSSNYTVNINLEENYWASGTTNLIEMQYPLIEFIANLS 429
Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLCTHLWEHYN 479
G KTA+ Y + GW + H +D+WA + + G WA W MGG WL TH+WEHY
Sbjct: 430 KTGRKTAKDYYGVERGWCLGHNSDVWAMTCPVGLNEGDPSWACWTMGGTWLSTHIWEHYL 489
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+T+D+ FL K YP+L+G A F +DWL+E DG L T+P TSPE+++I PDG + SY
Sbjct: 490 FTLDKGFLCK-FYPVLKGAAEFCMDWLVE-KDGKLVTSPGTSPENKYITPDGYVGATSYG 547
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
+T D+A+IRE A++VL ++ + +++ K+L RL P +I DG++ EW D++D
Sbjct: 548 NTSDLAMIRECLIDAAEASKVLGVDK-SFRKRIKKTLSRLYPYQIGTDGNLQEWYYDWQD 606
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
+ +HRH SHLFGL+PGH +++E+ P+L A +TLQ +G++ GWS W+ L ARL D
Sbjct: 607 QDPYHRHQSHLFGLYPGHHLSVEETPELAAACARTLQIKGDDTTGWSTGWRVNLLARLRD 666
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFE----GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
E AY M +RL V P++ K + GG Y NL AH PFQID NFG + V EML+Q
Sbjct: 667 GEKAYHMYRRLLRYVSPDNYKGEDARRGGGTYPNLLDAHSPFQIDGNFGGCSGVIEMLMQ 726
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFK 775
S+ N + LLPALP + W+ G V+G+ ARGG V + WK+ ++ + + S F
Sbjct: 727 SSTNKIVLLPALP-ESWADGRVQGICARGGFVVDMEWKNREVVSLIVSSLKGGRTEICFN 785
Query: 776 TLHYRGTSVKVNLSAGK 792
G S KV AG+
Sbjct: 786 -----GVSKKVVFKAGE 797
>gi|294675358|ref|YP_003575974.1| hypothetical protein PRU_2729 [Prevotella ruminicola 23]
gi|294473191|gb|ADE82580.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 821
Score = 537 bits (1383), Expect = e-149, Method: Compositional matrix adjust.
Identities = 302/765 (39%), Positives = 438/765 (57%), Gaps = 53/765 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA + +A+PIGN LGAMV+GG+ +E ++LNE+T W+G P + NPDA A+
Sbjct: 23 KLWYSKPAAQWLEALPIGNSHLGAMVYGGIGTEQIQLNEETFWSGSPHNNNNPDAKVAMK 82
Query: 74 DVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
DVR L+ G+ EA A K F G Y LGD+ L FD + AE + YRREL+L
Sbjct: 83 DVRRLIFEGKEKEAEALIDKTFFKGPHGQKYLPLGDLMLSFD--YQNGAEPSNYRRELNL 140
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A + V +V++ R F+S D I+ +++ S+ +L+F VS
Sbjct: 141 GDALCTTSFDVADVKYIRTAFASQADNAIIIQLTASKKKALNFGVSYQ-----------R 189
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDKKLKVEGSD 248
NQ +EG K N + +GI + A + +K+ D GT++ + ++V +
Sbjct: 190 NQQAVEGGAVAKNEHAYIINNVEHEGIAGKLQAEVRVKVVAD-GTVTDM-GSDMQVRNAT 247
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A + + A++++ +N DP +++ +Q ++ +Y L RHLD YQ + RV
Sbjct: 248 NATIFITAATNY----VNYQTINGDPVAKNNLTMQLLKGKNYKQLLKRHLDKYQDQYDRV 303
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGRYLLISSSRP 367
S+ L++S + +P+ ER+ +F TD D +V L+ Q+GRYLLISSS+P
Sbjct: 304 SLSLAKSAQS------------ELPTDERLAAFDGTDLD--MVSLMMQYGRYLLISSSQP 349
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQG+WN + P WDS +NIN EMNYW + NL+E QEPLF + LS+ G+
Sbjct: 350 GGQPANLQGVWNHKMDPAWDSKYTININAEMNYWPANVGNLAETQEPLFSMIRDLSVTGA 409
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y GWV HH TD+W + G W ++P GGAWL THLW++Y YT D+ FL
Sbjct: 410 KTARTMYNCPGWVAHHNTDLWRIAGPVDG-TSWGMFPTGGAWLTTHLWQYYLYTGDKRFL 468
Query: 488 EKRAYPLLEGCASFLLDWL--------IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+ YP+L+G + FLL ++ ++ G+L T P+ SPEH P GK V+
Sbjct: 469 DA-CYPILKGASDFLLSYMQEYPKNGEVKQAAGWLVTVPTVSPEH---GPVGKNTTVTAG 524
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
STMD I+ +V S+ + A ++L N + ++ +L P +I G + EW D D
Sbjct: 525 STMDNQIVFDVLSSTLRAHQILGYNNVVYTTMLSNAIAKLPPMQIGRYGQLQEWLIDGDD 584
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
P+ HRH+SHL+GL+P + I+ +PDL AA TL +RG+ GWS+ WK WAR+ D
Sbjct: 585 PKDEHRHISHLYGLYPSNQISPYSHPDLFTAASNTLNQRGDMATGWSLGWKINFWARMQD 644
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
HA++++K + N++ E GG Y NLF AHPPFQID NFG +A V EML+QS
Sbjct: 645 GNHAFKIIKNMLNVIPSTTEWGRSGGTYPNLFDAHPPFQIDGNFGCSAGVCEMLLQSHDG 704
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++LLPALP D W G V GL ARG TVS+ W G+L E IYS
Sbjct: 705 AVHLLPALP-DSWKDGEVSGLVARGAFTVSMKWHQGELTEATIYS 748
>gi|379721830|ref|YP_005313961.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|378570502|gb|AFC30812.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
Length = 781
Score = 536 bits (1382), Expect = e-149, Method: Compositional matrix adjust.
Identities = 313/758 (41%), Positives = 423/758 (55%), Gaps = 57/758 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
F PAK + +A+P+GNGRLGAMV+G E ++LNEDT+W G P D NPDA + L ++R
Sbjct: 8 FRQPAKDWNEALPLGNGRLGAMVFGLPRKERIQLNEDTVWYGGPVDRHNPDALRYLPEIR 67
Query: 77 SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ SG+ AEA A++ L G P Y LGD+ + D H E YRRELDL+ +
Sbjct: 68 EKLLSGRLAEAHKLAAMALSGIPESQRHYMPLGDLWITMD--HPPGEAEEYRRELDLSKS 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGN 190
A + Y +G+ F RE F S+PDQ +V ++ G++ LD S + G
Sbjct: 126 VAGLHYRIGDTAFIRETFISHPDQALVLRLRADRPGAIGLTARLDRGKSRYLDEIEAAGP 185
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N ++M G C GK G F A L +D G + + L VEG+D
Sbjct: 186 NVLVMRGNCGGK------------GGSDFRAALR---ADAEGGSVRIIGEHLIVEGADAV 230
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L L A+++F ++DP + ++ L S Y+ L RH +DY+ L+ RV +
Sbjct: 231 TLYLSAATTF---------RQEDPEAYCLNTLSSAAARGYASLLERHTEDYRGLYDRVQL 281
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
L ++ TD + + +P+ ER++ + EDP L+ L FQ+GRYLLISSSRPG+
Sbjct: 282 SL-----ELQTDEAAAAAV--LPTDERLELVKKGGEDPGLIPLYFQYGRYLLISSSRPGS 334
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWNE + P WDS +NIN +MNYW + C+LSEC EPLFD + +S GS+T
Sbjct: 335 LPANLQGIWNEQMRPPWDSKYTININTQMNYWPAESCHLSECHEPLFDLIKRMSERGSRT 394
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+V Y GW HH TD+W ++ + WP+GGAWLC HLWEHY + L +
Sbjct: 395 AEVMYGCRGWTAHHNTDLWGDTAPQDIYLPATHWPLGGAWLCLHLWEHYRFGGGTARLAE 454
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YP+++G A FLLD++IE DG+L T PS SPE+ +I P+G+ + MD I RE
Sbjct: 455 -FYPVMKGAARFLLDYMIEAKDGHLITCPSVSPENTYILPNGESGTLCAGPAMDSQIARE 513
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
+F A AA L +ED E L +L R+ ++AE G + EW +D+K+ + HRH+SH
Sbjct: 514 LFQACREAARELGTDEDFRSELEL-ALQRIPLPQVAEGGYLQEWLEDYKEKDPGHRHISH 572
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRM 666
LF L PG IT + P+ AA +TL +R G GWS W WARL D E AY
Sbjct: 573 LFALHPGTQITPARTPEWAAAARQTLVRRLANGGGHTGWSRAWIINFWARLGDGEEAYGH 632
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
+ LF NLF HPPFQID NFG AAVAEML+QS L+LLPA
Sbjct: 633 MLELFR-----------KSTLPNLFDNHPPFQIDGNFGAAAAVAEMLLQSHDGTLHLLPA 681
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
LP W +G + GL+ARGG V + W DG L E I S
Sbjct: 682 LP-KAWPAGRISGLRARGGFEVDLFWSDGSLTEAVIRS 718
>gi|198277528|ref|ZP_03210059.1| hypothetical protein BACPLE_03750 [Bacteroides plebeius DSM 17135]
gi|198270026|gb|EDY94296.1| hypothetical protein BACPLE_03750 [Bacteroides plebeius DSM 17135]
Length = 809
Score = 536 bits (1381), Expect = e-149, Method: Compositional matrix adjust.
Identities = 303/797 (38%), Positives = 447/797 (56%), Gaps = 41/797 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-DYTNPDAPKA 71
L + +N PA+ F +A+ IGNG +GA+++GG + L LN+ TLWTG P T P+A KA
Sbjct: 32 LVLHYNRPAEFFEEALVIGNGTMGAILYGGTDKDVLSLNDITLWTGEPDRKVTTPNAYKA 91
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ ++R+L+D Y A A K+ GH ++ YQ LG + + + K + Y+R LD++
Sbjct: 92 IPEIRALLDKEDYRGADRAQRKVQGHYSENYQPLGQLSITYSAEPAKVSH--YQRTLDIS 149
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A AR Y +F ++F+S PD VIV ++ + L +S +SLL + + NGN
Sbjct: 150 RAMARTAYQRNGADFACDYFASAPDSVIVLRLQTESTEGLQATLSFNSLLPHATTANGN- 208
Query: 192 QIIMEGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+I EG P + D +G F + I++ + + + +LKV+
Sbjct: 209 EISAEGYAAYHSYPVYFDGVNNKHLYDPERGTHFRTL--IRVIAPQSEVKSFPSGELKVK 266
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G A++L+ +SF+G +P +D + ++ ++ +L H+ DY+ F
Sbjct: 267 GGKEALILIANVTSFNGFDKDPMKEGRDYRNLVTRRMERAAQKTFEELENAHVADYKSFF 326
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELLFQFGRYLLIS 363
RV + L ++ ++ I +P+ E++ + ++ +P L L FQ+GRYLLIS
Sbjct: 327 DRVELHLGKT----------DQAIAALPTDEQLLQYTDKSQRNPELEALYFQYGRYLLIS 376
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR ANLQG+WNE L P W NINLE NYW + NLSE PL DF+ L
Sbjct: 377 SSRTPGVPANLQGLWNERLLPPWSCNYTSNINLEENYWAAETANLSEMHRPLMDFIANLQ 436
Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYN 479
G ++A+ Y + GW + TDIWA + + G WA W MGGAWL TH+WE Y
Sbjct: 437 HTGEESAKAYYGVQKGWCLGQNTDIWAMTCPVGLNVGDPSWACWTMGGAWLSTHIWERYT 496
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+T D++FL+K YP+L+G A F L+WLIE DG L T+P TSPE++F+ PDG SY
Sbjct: 497 FTQDKEFLQKY-YPVLKGAAEFCLNWLIE-KDGKLITSPGTSPENKFLTPDGYAGATSYG 554
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
T D+A+ RE AAE L ++D +++ K+LPRL P ++ + G++ EW D++D
Sbjct: 555 CTSDLAMTRECLIDAAKAAEALGTDKD-FRKQIEKTLPRLLPYQVGKKGNLQEWFHDWED 613
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
E HRH SHLFGL+PGH +++++ P+L KA +TL+ +G+ GWS W+ L+ARL D
Sbjct: 614 QEPQHRHQSHLFGLYPGHHLSVKETPELAKACARTLEIKGDNTTGWSTGWRVNLYARLQD 673
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFE----GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
++AY + +RL V P+ K + GG Y NL AH PFQID NFG A V EML+Q
Sbjct: 674 SKNAYHIYRRLLRYVSPDGYKGKDARRGGGTYPNLLDAHSPFQIDGNFGGCAGVIEMLMQ 733
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFK 775
S+ N + LLPALP +W G VKG+ ARGG V + WK+G + + I S F
Sbjct: 734 SSENSITLLPALP-AEWKDGSVKGICARGGFIVDMEWKNGKVTSLYIQSRKGGKTKVCFD 792
Query: 776 TLHYRGTSVKVNLSAGK 792
G S + L AGK
Sbjct: 793 -----GKSKNITLKAGK 804
>gi|315500396|ref|YP_004089199.1| alpha-l-fucosidase [Asticcacaulis excentricus CB 48]
gi|315418408|gb|ADU15048.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
Length = 783
Score = 536 bits (1381), Expect = e-149, Method: Compositional matrix adjust.
Identities = 304/767 (39%), Positives = 434/767 (56%), Gaps = 58/767 (7%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
+ +N L + + PA +T+A+P+GNGRLGAMV+GG+ E L+LNEDTL+ G P NPD
Sbjct: 32 TASNDLTLWYREPANEWTEALPLGNGRLGAMVFGGIARERLQLNEDTLYAGAPYQPANPD 91
Query: 68 APKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
P AL ++R L+ G+Y EA A K G+P YQ +G++ L F S A Y
Sbjct: 92 GPAALPEIRKLIFEGKYLEAQALIQAKFMGNPMRQVSYQTIGEMTLTFGPSSNASA---Y 148
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
RRELDL A + V Y V +TRE F S DQV+V ++S + G +SF + ++
Sbjct: 149 RRELDLTKALSTVTYRQDGVTYTRETFISPVDQVLVMRLSADKPGKVSFQLGFETPQLGA 208
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ +I++ GR G N ++F + +++ G S D+ L V
Sbjct: 209 VTIESPQEIVLSGRNGGH--------NGKDGALRFES--RVRVVASGGQQSTGTDE-LVV 257
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+D A++ + A++++ + D D T+ + + + S+ LY+ HLD ++ +
Sbjct: 258 SGADSALVFMAAATNYK----SFRDVSGDATAITKDQITRAASRSFGALYSAHLDAHKAV 313
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F RVS+ R+ + +P+ ER+ T DP+L L FQ+GRYLLI+
Sbjct: 314 FDRVSVDFGRT------------EVADLPTNERIAKSLTLNDPALAALYFQYGRYLLIAC 361
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPGTQ ANLQG+WNE L+ W +NIN EMNYW + P L E EPL + +SI
Sbjct: 362 SRPGTQPANLQGLWNEKLNAPWGGKYTININTEMNYWPAEPTALPELTEPLIRMVREISI 421
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TA++ Y A GWV HH TD+W +++A + WP GGAWLC HLW+ Y+Y D
Sbjct: 422 TGAETAKIMYGARGWVAHHNTDLW-RATAPIDAAFYGTWPTGGAWLCLHLWDRYDYGRDP 480
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPE--HEFIAPDGKLACVSYSST 541
+L + YP+L+G + F LD L++ GY+ T PS SPE H+F G C T
Sbjct: 481 AYL-REIYPILKGASQFFLDTLVKDPASGYMVTAPSISPENQHKF----GTSICA--GPT 533
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ--DFKD 599
MDM IIR++F+ AAE+L K + + +VL +L P +I + G + EW D +
Sbjct: 534 MDMQIIRDLFANTARAAEIL-KTDKSFRAEVLAMRNKLVPNQIGKAGQLQEWKDDWDMEA 592
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
++HHRH+SHL+GLFP H IT K P+L AA+K+L+ RG+ GW+I W+ LWARL +
Sbjct: 593 ADMHHRHVSHLYGLFPSHQITTRKTPELAAAAKKSLELRGDMSTGWAIGWRINLWARLGE 652
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
E + ++K L PE Y N+F AHPPFQID NFG T+ + EML+QS +
Sbjct: 653 GERTHSILKLLLG---PERT-------YPNMFDAHPPFQIDGNFGGTSGMTEMLMQSYDD 702
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
++ LLPALP W G V GLKARGG TV + W D L V I S +
Sbjct: 703 EIILLPALP-TAWPKGRVTGLKARGGFTVDLHWADMTLERVTIRSAF 748
>gi|374605049|ref|ZP_09677992.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
gi|374389319|gb|EHQ60698.1| alpha-L-fucosidase [Paenibacillus dendritiformis C454]
Length = 779
Score = 535 bits (1379), Expect = e-149, Method: Compositional matrix adjust.
Identities = 300/764 (39%), Positives = 435/764 (56%), Gaps = 59/764 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+PIGNGRLGAM +GGV S+ L+LNED++W G P NPDA L
Sbjct: 12 RLWYRQPAGQWVEALPIGNGRLGAMQFGGVDSDRLQLNEDSVWYGGPAARENPDAAAYLP 71
Query: 74 DVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELD 129
+R + G+ EA AS+ L P YQ LG++++ F H + E + Y REL
Sbjct: 72 VIRQYLLEGKPEEAERIASLALASVPKHFGPYQTLGELKMFF---HGEEGEVSGYSRELS 128
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVN 188
L ARV+Y+ + ++RE SS PDQVI +++ S + LS ++ L+ ++ + V
Sbjct: 129 LPDGLARVEYTRNGIAYSRELLSSVPDQVIALRLTASAAKRLSLSLYLNRRSFEDGTTVI 188
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
++ I M+G+C G+++ L K D G ++A+ D L ++ +D
Sbjct: 189 ASDTIAMQGQC-------------GAGGVRYCVAL--KALADNGEVTAIGDC-LSIDAAD 232
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
L + A+++F + +P + +++ Y + + H+ D++ L+ RV
Sbjct: 233 AVTLYVAAATTF---------RESNPLQTCLRQVEAAAAKGYQQVRSDHVRDHRALYERV 283
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRP 367
+++L SE+++ +P+ ER+K Q DP L L FQ+GRYLL+ SSRP
Sbjct: 284 ALRLG---------ATSEDSLCRLPTDERLKRVRQGQADPGLFALFFQYGRYLLMGSSRP 334
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQGIWN ++P W+S H+NINL+MNYW + NL+EC EP+FD L L NG
Sbjct: 335 GTLPANLQGIWNPHMTPPWESDFHLNINLQMNYWPAEAANLAECHEPVFDLLDRLRTNGR 394
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA V Y A G+V HH T++WA ++ V WPMGGAWL H WEHY Y D FL
Sbjct: 395 HTAAVMYGADGFVAHHATNLWADTAPVSDVVSATFWPMGGAWLALHAWEHYQYGGDETFL 454
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+RAYP+++ A FLL++L+E G T+PS SPE+ + P+G+ + +MD I+
Sbjct: 455 RERAYPVMKDAALFLLNYLVENAQGEWVTSPSISPENRYRLPNGQQGTLCMGPSMDTQIM 514
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
R +F A + A+ EDA E++ ++ RL P +I DG ++EWA+D + ++ HRH+
Sbjct: 515 RALFQACLDAS-AGRTEEDAFRERLQAAMTRLPPHRIGRDGQLLEWAEDVDEVDLGHRHI 573
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAY 664
SHLF LFPG IT P+ +AA +TL++R G GWS W WARL D E AY
Sbjct: 574 SHLFALFPGGDITPFTAPEAAQAARRTLERRLAHGGGHTGWSRAWIILFWARLEDAEQAY 633
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
++ L + ++ NLF HPPFQIDANFG TAA+AEML+QS L LL
Sbjct: 634 ANLEAL-----------LQKSVHPNLFGDHPPFQIDANFGGTAAIAEMLLQSHAGTLALL 682
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
PALP D W SG V+GL+ARGG V I W+ G L E I + S
Sbjct: 683 PALPGD-WPSGAVRGLRARGGYEVDIAWEAGRLTEARITAARSG 725
>gi|255035049|ref|YP_003085670.1| hypothetical protein Dfer_1256 [Dyadobacter fermentans DSM 18053]
gi|254947805|gb|ACT92505.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 768
Score = 535 bits (1378), Expect = e-149, Method: Compositional matrix adjust.
Identities = 309/809 (38%), Positives = 440/809 (54%), Gaps = 85/809 (10%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PL + + PA+ + +A+PIGNG L AM++GGV +E ++ NE+TLWTG P Y + A
Sbjct: 25 PLTLWYEQPARQWEEALPIGNGALAAMIFGGVETEQIQFNEETLWTGEPRSYAHKGASAY 84
Query: 72 LSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRREL 128
L +R L++ G+ EA A A+ + P YQ GD+ L+F H+++ Y REL
Sbjct: 85 LEQIRRLLNEGKQKEAEALANEQFMSQPMRQMAYQAFGDVYLDFP-GHVQH--RAYHREL 141
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL AT + Y G V +TRE F+S P + I I+ S+ L F V + ++
Sbjct: 142 DLRAATVKSSYESGGVRYTREAFASYPAKAIYYHINSSQKSKLDFTVRMSTI-------- 193
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE---------- 238
PK NA + +E+++ + G + L
Sbjct: 194 --------------HAKPKVNAEKN--------TIELEVQVENGALHGLARLKLLTDGKL 231
Query: 239 ---DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
D K++V G+ A ++L A++++ IN + DP ++ +ALQ+ + Y +
Sbjct: 232 KTADGKIEVTGATSATIVLSAATNY----INYKNVNGDPRAKVTAALQNAPD-DYKKAAS 286
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLF 354
HL DYQKLF+R ++ L S +P+ +R+ F+ + +DP+L+ L
Sbjct: 287 GHLADYQKLFNRFALDLPASKGS------------ALPTDQRLSQFKHNPDDPALLALYV 334
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QF RYLLI+SSRPGT ANLQG WN L+P+WDS VNIN EMNYW + NLSEC +P
Sbjct: 335 QFARYLLITSSRPGTHPANLQGKWNHKLNPSWDSKYTVNINTEMNYWPAELTNLSECHQP 394
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LF + +S G++ A+ +Y A+GWV+HH TD+W + +A +W GGAWL HL
Sbjct: 395 LFQMVKEVSETGAEVAKEHYNANGWVLHHNTDVW-RGAAPINASNHGIWVTGGAWLSLHL 453
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKL 533
WEHY +T D+ FL+ AYPL++G A F LD+L++ G+L ++PS SPE +G L
Sbjct: 454 WEHYRFTEDKAFLQNTAYPLMKGAAQFFLDFLVKDPKTGHLVSSPSNSPE------NGGL 507
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
TMD IIR +F A A +L K + +K+ ++ ++ P +I G + EW
Sbjct: 508 VA---GPTMDHQIIRALFKACAETAGIL-KTDAVFAQKLTETAKQIAPNQIGRHGQLQEW 563
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
D D HHRH+SHL+G++PG IT PDL KAA K+L+ RG++G GWS+ WK
Sbjct: 564 MTDIDDTTNHHRHVSHLWGVYPGEEITPTGTPDLLKAAIKSLEYRGDDGTGWSLAWKINY 623
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WAR D EHAY M+++LFN V K GG Y NLF AHPPFQID NFG + + E L
Sbjct: 624 WARFLDGEHAYTMIRKLFNPVFESGRKMSGGGSYPNLFDAHPPFQIDGNFGGASGILETL 683
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
VQS L ++ LLPALP G V GL ARGG + + WK+G L + I S N
Sbjct: 684 VQSHLGEINLLPALP-KALPDGRVSGLCARGGFEMDMDWKNGKLTGLSIRSKAGNE---- 738
Query: 774 FKTLHYRGTSVKVNLSAGKIYTFNRQLKC 802
+ Y + + GK Y F LK
Sbjct: 739 -CKVRYGAQVISIPTEKGKTYRFGPDLKV 766
>gi|192359217|ref|YP_001984046.1| alpha-L-fucosidase [Cellvibrio japonicus Ueda107]
gi|190685382|gb|ACE83060.1| alpha-L-fucosidase, putative, afc95B [Cellvibrio japonicus Ueda107]
Length = 839
Score = 535 bits (1377), Expect = e-149, Method: Compositional matrix adjust.
Identities = 303/773 (39%), Positives = 440/773 (56%), Gaps = 45/773 (5%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
++ S++T L++ +N PA + A+PIGNGRLGAMV+G E L+LNEDT+W G P +
Sbjct: 37 SSHSSATKQDLRLWYNTPASDWNQALPIGNGRLGAMVFGQPAQEQLQLNEDTIWAGGPNN 96
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHL 117
NP A + + V L+ GQ+ +A + + G P YQ LG++ L+F H
Sbjct: 97 NVNPAAAQTIEQVTRLLLQGQHQQAQTLADQQIRSLNNGMP---YQTLGNLRLDFA-GHG 152
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ + Y R+LDL A ARV Y V FTRE FSS DQVIV ++S S+ G ++ +
Sbjct: 153 QV--DDYYRDLDLANAIARVSYVKAGVTFTRELFSSLSDQVIVVRLSASKPGQINTRIGF 210
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
DS + + V+ + ++GR ++ D K I+F+A++ ++ RG
Sbjct: 211 DSPMQHQLSVH-ERWLQVDGRG-------GSHEGLDGK-IRFTALIAPEL---RGGTLRR 258
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
+DK L++EG+D ++ + A+++F + +D D + + + L + ++ L H
Sbjct: 259 DDKALRIEGADEVLIRIAAATNF----VRYNDLGGDSLARAQAYLSAAEGKGFAQLQQAH 314
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
+ YQ F+RVS+ L S P+ +R+ F +DP L L FQ+G
Sbjct: 315 VAAYQAQFNRVSLDLGTSAAM------------ARPTDQRIAEFAHSQDPHLAMLYFQYG 362
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSS+PGTQ ANLQGIWN SP WDS VNIN EMNYW + L E +PLF
Sbjct: 363 RYLLISSSQPGTQPANLQGIWNPHTSPPWDSKYTVNINTEMNYWPAEVTQLPELHQPLFA 422
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
L L++ G +AQ Y A GW++HH TD+W + + K + W GGAWLC H+W H
Sbjct: 423 MLEDLALTGRASAQQLYGARGWMMHHNTDLW-RITGQVDKAFYGQWQTGGAWLCQHIWYH 481
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y ++ DRDFL+ R YP+L + F +D L +E + G L PS SPE+ + G +
Sbjct: 482 YLHSGDRDFLQ-RYYPVLREASRFFVDSLTLEPNSGALVVVPSNSPENTY-ERAGYPTSI 539
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +TMD ++ ++FS I AA +L + D L ++ + RL P +I G + EW +D
Sbjct: 540 SAGTTMDNQLVFDLFSITIDAAHILGVDSD-LAAQLRQKRERLAPMRIGHFGQLQEWLED 598
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ P+ HHRH+SHL+GL+PG+ I+ + P L +AA +L +RG++ GWS+ WK WAR
Sbjct: 599 WDHPDDHHRHVSHLYGLYPGNQISPYRTPALFEAARVSLMQRGDKSTGWSMGWKINWWAR 658
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
HD AY++++ NL + +GG Y+N+ AHPPFQID NFG TA +AEMLVQS
Sbjct: 659 FHDGNRAYQLLQEQINLTEETQAVSEKGGTYANMLDAHPPFQIDGNFGVTAGIAEMLVQS 718
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
++LLPALP D W G VKGL RGG V I W++G L +YS N
Sbjct: 719 HDGVIHLLPALP-DAWPKGEVKGLVTRGGFVVDIAWENGQLTRASLYSRLGGN 770
>gi|354584579|ref|ZP_09003473.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
gi|353194100|gb|EHB59603.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
Length = 761
Score = 534 bits (1376), Expect = e-149, Method: Compositional matrix adjust.
Identities = 290/743 (39%), Positives = 425/743 (57%), Gaps = 38/743 (5%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
MV+GG+ E ++ NEDTLW+G P D N +A + L R L+ S +YAEA ++ G
Sbjct: 1 MVFGGIQEERIQWNEDTLWSGFPRDTNNYEALRYLQAARELIASEKYAEAEKLIEERMVG 60
Query: 97 HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE-FTREHFSSNP 155
+ + LGD+ +E + + + YRRELDL A V + G E F RE F S
Sbjct: 61 RNTEAFLPLGDLLIE--QTGIDDWQSNYRRELDLGNGVASVVFRTGRGEHFQREMFISAA 118
Query: 156 DQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG------KRIPPKAN 209
DQ+ V + +GS GS+ + L S L + + + + G P + P++
Sbjct: 119 DQIAVIRYTGSAEGSIHLKLKLQSPLRYETEITPGGVMRLFGHAPTHIADNYRGDHPQSV 178
Query: 210 ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD 269
++ G+++ +++ + D G I + L V G+ L + A++ F+G + P
Sbjct: 179 LYEEGSGLRYE--MQVAVRADGGRI-GINGDVLTVTGASAVTLHVAAATDFEGFDVMPGA 235
Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
DP + L++ L RH +++ LF RV+++L D +
Sbjct: 236 KGSDPARLCSARLEAAAGYDDEALRLRHTEEHWALFGRVAVELG--------DAEHRARM 287
Query: 330 DTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
+ +P+ +R+ ++ EDPSL L+FQ+GRYLL++SSRPGTQ A+LQG+WN + P W+S
Sbjct: 288 EAIPTDQRLAAYAGGQEDPSLEALMFQYGRYLLMASSRPGTQPAHLQGLWNPHVQPPWNS 347
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
NIN EMNYW + NLSEC EPL + L+++G++TA+++Y A GW HH D+W
Sbjct: 348 NYTTNINTEMNYWAAETGNLSECHEPLIQMVRELAVSGARTAKIHYNARGWAAHHNVDLW 407
Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
++ G+ +WA WPM G WLC HLWEHY + D ++L AYPL+ A F LDWLIE
Sbjct: 408 RMANPSNGRAMWAFWPMAGPWLCRHLWEHYVFNPDPEYLRNTAYPLMREAALFCLDWLIE 467
Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
+G+L T+PSTSPE++F+ +G VS STMDMA+IRE+F + A+E+LE + + L
Sbjct: 468 NGEGHLVTSPSTSPENQFLTKEGVPCSVSAGSTMDMALIRELFRHCLEASELLEIDRE-L 526
Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLC 628
E++ +L RL P +I +DG +MEW++ F + E HRH+SHL+GL+PG I + P+L
Sbjct: 527 QEELRSALERLLPYQIDDDGRLMEWSKPFAEAEPGHRHVSHLYGLYPGTDINLRDTPELA 586
Query: 629 KAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGG 685
+AA ++L R G GWS W L+ARL E AY+ V+ L
Sbjct: 587 EAALQSLMSRIRSGGGHTGWSCVWLINLFARLQQPELAYQYVRTLLTR-----------S 635
Query: 686 LYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG 745
++ NLF HPPFQIDANFG A +AEML+QS L ++ LLPALP WSSG V+GLKARGG
Sbjct: 636 VHPNLFGDHPPFQIDANFGGAAGLAEMLLQSHLGEIVLLPALP-AAWSSGAVRGLKARGG 694
Query: 746 ETVSICWKDGDLHEVGIYSNYSN 768
+ + WKDG L I S +
Sbjct: 695 FLIDMEWKDGALASASITSTHGQ 717
>gi|409198450|ref|ZP_11227113.1| alpha-L-fucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 767
Score = 533 bits (1374), Expect = e-148, Method: Compositional matrix adjust.
Identities = 301/793 (37%), Positives = 444/793 (55%), Gaps = 70/793 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
+PL + ++ PA + +A+PIGNG +GAM++GG+ E ++LNE+T+WT PD K
Sbjct: 25 SPLTLWYDQPASQWEEALPIGNGHMGAMIFGGIDKERIQLNEETIWTKRDEFTDKPDGHK 84
Query: 71 ALSDVRSLVDSGQYAEATAASVK-----LFGHPADVYQLLGDIELEFDDSHLKYAE-ETY 124
++ +R+L+ QY EA + + + YQ LGD+ L+F+ K+ + Y
Sbjct: 85 YINKIRTLLFEEQYEEAEKLVRRHLLEDRMPNNTNTYQTLGDLHLDFE----KFEQISQY 140
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
RR+L+L ATA V + V ++RE FSSNP K+S + G +SF SL+ +
Sbjct: 141 RRQLNLENATASVSFISDGVHYSRESFSSNPANATFMKLSADKPGRISFTASLNRPGEGE 200
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ + IIM + D+ G+ + ++I+ GT+ A +DK +K+
Sbjct: 201 NISVDGHTIIMNQKV------------DNKDGVTYETRIQIRAKG--GTLEA-KDKSIKI 245
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ VL+ VA++ + G ++PT L+ I SY DL H+ DYQ L
Sbjct: 246 SGAAEVVLIQVAATDYRG---------ENPTQSCKKYLKDIAEKSYDDLRKEHISDYQSL 296
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLIS 363
F+RVS+ L S D + P ER+ + + EDP+L L +QFGRYLLIS
Sbjct: 297 FNRVSLDLGTS--DAIY----------FPVDERLTALRKGAEDPALFSLYYQFGRYLLIS 344
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG+ ANLQG+W L+P W++ H+NIN++MNYW ++ NL EC P +F+ L
Sbjct: 345 SSRPGSLPANLQGLWESTLTPPWNADYHININIQMNYWPAVVTNLPECHLPFLNFIGQLR 404
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
NG KTA Y A G+ HH TD W ++A +G+ WA+WPMG AW TH+WEH+ +T D
Sbjct: 405 ENGRKTANTLYGARGFTAHHTTDAWHFTTA-QGQPQWAMWPMGAAWASTHIWEHFLFTRD 463
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL + +++ A FL D+L++ + G L + PS SPE+ F P G A V +M
Sbjct: 464 TTFLRNYGFDVMKEAALFLSDFLVKDPETGRLVSGPSMSPENTFFTPRGNRASVVMGPSM 523
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D II +FS++I AA+VL ED K+ + L +L P++I EDG I+EW++D K+ E
Sbjct: 524 DHQIIHHLFSSVIEAAKVLNA-EDHFTRKITRQLKQLTPSEIGEDGRILEWSEDLKEAEP 582
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHD 659
HRH+SHL+GL+P + +K P+L +AA K ++KR + G GWS W +ARL D
Sbjct: 583 GHRHMSHLYGLYPSSQFSWQKTPELMEAARKVIEKRLKHGGGHTGWSRAWMVNFYARLKD 642
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
AY+ ++ L + NLF HPPFQID NFG TA + EML+QS
Sbjct: 643 SNEAYQNMRALLT-----------KSTHPNLFDNHPPFQIDGNFGGTAGLTEMLLQSHQG 691
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHY 779
++ LLPALP+ +W G VKGLKARGG T++I W DG L I D+ + Y
Sbjct: 692 NIELLPALPF-QWREGSVKGLKARGGYTINISWSDGALTTAEIIGPV-----DTDVPVVY 745
Query: 780 RGTSVKVNLSAGK 792
G ++ V ++ G+
Sbjct: 746 NGQAINVTINKGE 758
>gi|399028921|ref|ZP_10730010.1| hypothetical protein PMI10_01838 [Flavobacterium sp. CF136]
gi|398073242|gb|EJL64421.1| hypothetical protein PMI10_01838 [Flavobacterium sp. CF136]
Length = 820
Score = 533 bits (1372), Expect = e-148, Method: Compositional matrix adjust.
Identities = 297/768 (38%), Positives = 440/768 (57%), Gaps = 33/768 (4%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKA 71
+K+ ++ PA + +A+P+GNGR+GAMV+G V E ++LNE +LW+G P NP A +
Sbjct: 23 IKLWYDKPAAQWVEALPLGNGRIGAMVFGSVEDELIQLNEGSLWSGGPMKKNVNPKAYQY 82
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L +R + + + +A K+ G+ ++ + +GD+ + D K + Y R+L L+
Sbjct: 83 LQPLREALYAEDFQKADELCRKMQGYFSESFLPMGDLVIHHDFGSDK--SQNYYRDLKLD 140
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A + ++V V+++RE F S P +++ K+ S+ G+L+F+ L S+L N V ++
Sbjct: 141 QAVSTTNFTVKGVKYSREIFISAPANIMIVKMKASKKGALTFDAKLSSVLTNSVSVLADD 200
Query: 192 QIIMEGRCPGKRIPPKANA-NDDP---------KGIQFSAILEIKISDDRGTISALEDKK 241
+++++G+ P + P N N P G++F L+ + D G++ +
Sbjct: 201 RLVLDGKAPARVDPSYYNKKNRQPIILEDTTGCNGMRFRMDLKASLKD--GSVKT-DANG 257
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ V + +L A++SF+G P K+ + S +++ Y L H+ DY
Sbjct: 258 IHVTNATEVILYFAAATSFNGFDKCPDSEGKNEKVITDSIIKNSTAQKYESLKKDHIADY 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
QK F+RV++ L + + +N +P ER+K++ +DP L + +Q+GRYL
Sbjct: 318 QKYFNRVNLDLE--------EENTNKNTSVLPWDERLKAYTAGGKDPILEQTFYQYGRYL 369
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G Q ANLQGIWN++L W S +NIN +MNYW + NLSE +PL D++
Sbjct: 370 LISSSRLGGQPANLQGIWNKELRAPWSSNYTININTQMNYWPAEQTNLSEMHQPLLDWIG 429
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWE 476
LS G A Y A+GWV HH +DIWA S+A G WA W MGG WLC HLWE
Sbjct: 430 NLSQTGRTAASEYYHANGWVAHHNSDIWALSNAVGNKGDGSPTWANWYMGGNWLCQHLWE 489
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D++FL K AYP+++ A F DWL E DGYL T PS+SPE+E I +GK V
Sbjct: 490 HYIFTGDKEFLRKTAYPVMKEAALFSFDWLQE-KDGYLVTAPSSSPENE-IHINGKNYGV 547
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ +STMDM+I R++F +I A+E+L +ED E +K +L P KI G ++EW ++
Sbjct: 548 TVASTMDMSICRDLFGNLIKASEILNIDEDFRKELEVKK-AKLFPLKIGSKGQLLEWNKE 606
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
F++ RH S LFGL PG I+ PD A +K+L+ RG+EG GWS WK WAR
Sbjct: 607 FEEATPKQRHASQLFGLHPGAEISPITTPDFANACKKSLELRGDEGTGWSKAWKINFWAR 666
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY+M++ + + GG Y N F AHPPFQID NFG TA + EML+QS
Sbjct: 667 LFDGNHAYKMIRDILKYTNSSASGVTGGGTYPNFFDAHPPFQIDGNFGATAGMTEMLLQS 726
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++LLPALP + W +G V GL+AR G + I W DG L I S
Sbjct: 727 QSGFIHLLPALP-EAWKNGKVSGLRARNGFELDIKWSDGKLKSARIKS 773
>gi|333381508|ref|ZP_08473190.1| hypothetical protein HMPREF9455_01356 [Dysgonomonas gadei ATCC
BAA-286]
gi|332830478|gb|EGK03106.1| hypothetical protein HMPREF9455_01356 [Dysgonomonas gadei ATCC
BAA-286]
Length = 813
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 305/763 (39%), Positives = 444/763 (58%), Gaps = 48/763 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + PAK + +A+P+GN RLGAMV+G E L+LNE+T+W G P +P +L
Sbjct: 23 IKLQYKRPAKEWVEALPLGNSRLGAMVFGSPVRERLQLNEETMWGGGPHRNDSPALLGSL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++VRSL+ +G+ EA A K P + YQ +G++ L+F H Y++ Y R LDL
Sbjct: 83 NEVRSLIFAGKEKEAEALLDKTMRTPHNGMPYQTIGNLYLDFT-GHDNYSD--YSRNLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TA A +Y+V V +TRE F+S D VI+ +I+ ++ S++F+ S DS + +S
Sbjct: 140 KTAVATTRYAVDGVTYTREVFTSFTDNVIIMRITADKANSINFSASYDSQVKGYSVSVKG 199
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSD 248
N+++++G D +GI+ E +I + GT+ A +D + +
Sbjct: 200 NRLVLKG------------TGSDHEGIKGVVRFENQTEIKTEGGTVKAGKDNIVVKNANT 247
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ + +A++ D ++ ++++K T L+S Y T H+ YQK F+RV
Sbjct: 248 ATIYISIATNFIDYKNVSGNEARKAET-----ILKSALTKPYQTALTDHIKYYQKQFNRV 302
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L SE D S RV++F+ +D +LV LLFQFGRYLLISSS+PG
Sbjct: 303 ELDLG----------TSERMNDETDS--RVRNFKDGKDQNLVTLLFQFGRYLLISSSQPG 350
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q + LQGIWN+ L P WDS +NIN EMNYW + NLSE PLF+ + ++ G +
Sbjct: 351 GQPSTLQGIWNDQLVPPWDSKYTININTEMNYWPAEVTNLSETHFPLFEMVKEIAETGKE 410
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+V Y A+GWV HH TDIW + G + +WP GGAWL H+W+HY YT D+ FL
Sbjct: 411 TAKVMYNANGWVTHHNTDIWRTTGPVDG-AFYGMWPDGGAWLSRHMWQHYLYTGDKAFLS 469
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+ YP+L+G A F LD+L+E H Y + + PSTSPE P G ++ STMD I
Sbjct: 470 E-VYPVLKGAADFFLDFLVE-HPKYKWMVSAPSTSPEQ---GPPGTGTSITAGSTMDNQI 524
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+ +V S ++A+ L+ ++A +++ + RL P +I + + EW D DP+ HRH
Sbjct: 525 VFDVLSDALNASRALQLADNAYEKRLEDMISRLAPMQIGKYNQLQEWLDDVDDPKNDHRH 584
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SHL+GL+P + I+ +P L +AA+ +L RG+ GWSI WK WARL D H Y++
Sbjct: 585 VSHLYGLYPSNQISPYSHPALFQAAKNSLLYRGDMATGWSIGWKINFWARLLDGNHTYKI 644
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
+ + +LV+P + +G Y NLF AHPPFQID NFGFTA VAEML+QS L+LLPA
Sbjct: 645 ISNMLSLVEPGNN---DGRTYPNLFDAHPPFQIDGNFGFTAGVAEMLLQSHDGALHLLPA 701
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
LP D W G VKGL ARGG VS+ W +G+L V + S N
Sbjct: 702 LP-DVWKKGTVKGLIARGGFEVSMEWDNGELLTVSVLSKLGGN 743
>gi|325927089|ref|ZP_08188358.1| hypothetical protein XPE_2359 [Xanthomonas perforans 91-118]
gi|325542534|gb|EGD14007.1| hypothetical protein XPE_2359 [Xanthomonas perforans 91-118]
Length = 790
Score = 532 bits (1371), Expect = e-148, Method: Compositional matrix adjust.
Identities = 312/795 (39%), Positives = 440/795 (55%), Gaps = 66/795 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISVRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
++ GR N GI+ +++ G +S + D +
Sbjct: 216 EVTAE-QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVRGGKLSQVRD-R 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLASTAACLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAIDLGSSAA------------TQLPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G++TA+ Y A GWV+H+ TD+W ++ G W+LWP+GG WL LW+ ++Y
Sbjct: 426 LAQTGARTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPLGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWAQDF-- 597
MD ++R++F+ I+ +++L DA + L +L +L P +I + G + EW QD+
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLL--GIDAEFAQQLAALREQLPPNRIGKAGQLQEWQQDWDM 597
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+ PE+HHRH+SHL+ L P I + PDL AA ++L+ RG+ GW I W+ LWARL
Sbjct: 598 QAPEIHHRHVSHLYALHPSSQINLRDTPDLAAAARRSLEIRGDNATGWGIGWRLNLWARL 657
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D EHAYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 658 ADGEHAYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSW 707
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTL 777
++LLPALP W G V+GL+ RGG +V + W+ G L + ++S D L
Sbjct: 708 GGSVFLLPALP-KAWPRGSVRGLRVRGGASVDLEWEGGRLQQARLHS-----DRGGRYQL 761
Query: 778 HYRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 762 SYAGQTLDLELGAGR 776
>gi|346724703|ref|YP_004851372.1| hypothetical protein XACM_1797 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346649450|gb|AEO42074.1| hypothetical protein XACM_1797 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 790
Score = 532 bits (1370), Expect = e-148, Method: Compositional matrix adjust.
Identities = 309/794 (38%), Positives = 439/794 (55%), Gaps = 64/794 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISVRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
++ GR N GI+ +++ G +S + D +
Sbjct: 216 EVTAE-QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVRGGKLSQVRD-R 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLASTAACLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAIDLGSSAA------------TQLPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECAEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWP+GG WL LW+ ++Y
Sbjct: 426 LAQTGAHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPLGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--K 598
MD ++R++F+ I+ +++L + + L +++ +L P +I + G + EW QD+ +
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAE-LAQQLAALREQLPPNRIGKAGQLQEWQQDWDMQ 598
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
PE+HHRH+SHL+ L P I + PDL AA ++L+ RG+ GW I W+ LWARL
Sbjct: 599 APEIHHRHVSHLYALHPSSQINLRDTPDLAAAARRSLEIRGDNATGWGIGWRLNLWARLA 658
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D EHAYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 659 DGEHAYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWG 708
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
++LLPALP W G V+GL+ RGG +V + W+ G L + ++S D L
Sbjct: 709 GSVFLLPALP-KAWPRGSVRGLRVRGGASVDLEWEGGRLQQARLHS-----DRGGRYQLS 762
Query: 779 YRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 763 YAGQTLDLELGAGR 776
>gi|334144837|ref|YP_004538046.1| alpha/beta hydrolase domain-containing protein [Novosphingobium sp.
PP1Y]
gi|333936720|emb|CCA90079.1| alpha/beta hydrolase domain-containing protein [Novosphingobium sp.
PP1Y]
Length = 806
Score = 531 bits (1368), Expect = e-148, Method: Compositional matrix adjust.
Identities = 309/799 (38%), Positives = 444/799 (55%), Gaps = 64/799 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA+ +T+A+P+GNGR+GAMV+GG E L+LNEDTLWTG P + NP A +AL
Sbjct: 63 RLWYCQPAREWTEALPVGNGRIGAMVFGGTGLERLQLNEDTLWTGGPYNPVNPSAREALP 122
Query: 74 DVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEE-TYRRELD 129
+R L++ G + +A T A +L P YQ GD+ + HL E+ +Y RELD
Sbjct: 123 QIRRLIEQGHFTQAQTLADARLMARPLSQMAYQTFGDLTIAM--PHLGTIEQGSYLRELD 180
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+ A A + V ++R+ +S QVI +S G + V L + D ++G
Sbjct: 181 LDAALAATTFKADGVSWSRKVIASPDHQVIAVHLSADRPGRMHCLVGLGAPHDGVLSIDG 240
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK--ISDDRGTISALEDKKLKVEGS 247
+I GR N+ G++ + E + + G IS + D KL VEG+
Sbjct: 241 GT-LIFGGR------------NNAAHGVEGALRFEARARVLPQGGRIS-VSDNKLAVEGA 286
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +L+ ++S+ D DP+ + S +++ S++ + +++L+ R
Sbjct: 287 DAVTILIAMATSYR----QFDDVGGDPSQITRSQIEAASRHSFARIAADTAASHRRLYRR 342
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
VS+ L +P P+ ER+++ +T +D +L L FQ+GRYLLI SSRP
Sbjct: 343 VSLDLGETPAA------------HRPTDERIRTSETSQDSALAALYFQYGRYLLICSSRP 390
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G+Q ANLQGIWN+ P W S +NIN EMNYW + P L EC PL + L+ G+
Sbjct: 391 GSQPANLQGIWNDSDDPPWGSKYTININTEMNYWPAEPTALGECVAPLVALVRDLAQTGA 450
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y A GWV HH TD+W +++A W LWPMGGAWLCTHLW+HY+Y D FL
Sbjct: 451 STAREMYGARGWVAHHNTDLW-RATAPIDGAAWGLWPMGGAWLCTHLWDHYDYHRDTAFL 509
Query: 488 EKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+ YPLL G A F LD L + GYL TNPS SPE+E P G C S +D I
Sbjct: 510 -RSVYPLLRGAALFFLDTLQRDPASGYLVTNPSISPENEH--PGGASVCAGPS--VDRQI 564
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD--PEVHH 604
+R++F+ AA +L ++D L ++L + RL P +I G + EW +D+ PE HH
Sbjct: 565 LRDLFAQTARAATILGLDDD-LSAQILDTSRRLAPDEIGAQGQLQEWLEDWDSSAPEPHH 623
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GLFP H I +++ PDL AA K+L+ RG+E GW+ W+ LWARL + +HA+
Sbjct: 624 RHVSHLYGLFPSHQINLDETPDLAMAARKSLELRGDESTGWATAWRANLWARLREGDHAH 683
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
R+++ L P+ Y N+F AHPPFQID NFG AA+AEMLVQ +++ LL
Sbjct: 684 RILRYLLG---PDRT-------YPNMFDAHPPFQIDGNFGGAAAIAEMLVQCRDDEIRLL 733
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
PALP W G V+GL+ RG VS+ W+ G+L + S + + +H S
Sbjct: 734 PALP-RAWPDGSVRGLRIRGACKVSLEWRAGELVCARLVSRIAG-----MRIVHLNERSA 787
Query: 785 KVNLSAGKIYTFNRQLKCT 803
+V L G+ T N L T
Sbjct: 788 EVELVPGRPVTLNGPLLRT 806
>gi|326800280|ref|YP_004318099.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326551044|gb|ADZ79429.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 826
Score = 531 bits (1367), Expect = e-148, Method: Compositional matrix adjust.
Identities = 307/782 (39%), Positives = 455/782 (58%), Gaps = 57/782 (7%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ +A + + ++ K+ ++ PA H+ +A+PIGNGRLGAM++GGV + L+LNE+T+W+G P
Sbjct: 21 IYSAVNATGSDSYKLWYDKPAAHWNEALPIGNGRLGAMLFGGVKQDHLQLNEETIWSGGP 80
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFD 113
G+ ++ D + ++R L+ +G+Y EA S K + YQ GD+ ++F
Sbjct: 81 GNNSSKDLYSTMQEIRRLLFAGKYKEAQDLSNKEMPREPEANNNYGMSYQPAGDLWIDF- 139
Query: 114 DSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
L E YRRELD+ A + V Y VG V + RE+ ++ DQVI+ +++ +GS+S
Sbjct: 140 ---LHEGETVAYRRELDIADALSTVTYRVGEVTYKREYLATAHDQVIMMRVTADRAGSIS 196
Query: 173 FNVSLDS--LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISD 229
N+ L++ L+ ++ N+I + G K+ + KG ++FS +E K+
Sbjct: 197 CNLKLNTPHLIHQQPFIG--NRIYVNGTSGDKQ---------NKKGQVKFSIAVEPKV-- 243
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
+G E + L+V +D + + ++F+ N D D + L + S
Sbjct: 244 -KGGALQAEGEMLRVRQADELTVYIAIGTNFN----NYHDLGGDARERADDYLNTALKKS 298
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL 349
Y + ++H++DY++ F RVS+ L ++ + + +++ RV F DP L
Sbjct: 299 YRKIKSKHVEDYRRYFDRVSLDLGQT---VAMNKATDQ---------RVADFHLGNDPQL 346
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
V L FQFGRYLLISSSRPGTQ ANLQGIWN+ LSP W S VNIN EMNYW + NLS
Sbjct: 347 VSLYFQFGRYLLISSSRPGTQPANLQGIWNDKLSPPWSSKYTVNINTEMNYWPAEVTNLS 406
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
E EPLF L LS+ G ++A Y A GW +HH TDIW + G + +WPMGGAW
Sbjct: 407 EMHEPLFAMLEDLSVTGKESAWNYYRARGWNMHHNTDIWRVTGIIDGG-FYGMWPMGGAW 465
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIA 528
L H+W+HY + D FL K YP+L+G F +D L E +L PS SPE+ + +
Sbjct: 466 LSQHIWQHYLFNGDNAFLAKY-YPILKGVTQFYVDVLQEEPKHKWLVVAPSMSPENSYQS 524
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
G +S +TMD ++ +VFS + AA VL+ +ED ++ V L RL P +I + G
Sbjct: 525 GVG----ISAGTTMDNQLVFDVFSNFLEAAHVLQVDED-FMDTVASKLKRLPPMQIGKLG 579
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
+ EW +D+ + HHRH+SHL+GL+P I+ ++P L +AA+K+L RG++ GWS+
Sbjct: 580 QLQEWMEDWDRADDHHRHISHLYGLYPAAQISPIRHPTLFEAAKKSLVFRGDKSTGWSMG 639
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANFGFTA 707
WK WARL D AY+++ L ++ + E GG Y+NL AHPPFQID NFG TA
Sbjct: 640 WKVNWWARLLDGNRAYKLIAD--QLSPAANDGNGEAGGTYANLLDAHPPFQIDGNFGCTA 697
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
+AEML+QS L++LPALP D+W +G VKGLKARGG V I WKDG L ++ ++S
Sbjct: 698 GIAEMLIQSHDGCLHILPALP-DQWQNGEVKGLKARGGFIVDIAWKDGKLQKLKVHSRLG 756
Query: 768 NN 769
N
Sbjct: 757 GN 758
>gi|395803591|ref|ZP_10482835.1| hypothetical protein FF52_16993 [Flavobacterium sp. F52]
gi|395434145|gb|EJG00095.1| hypothetical protein FF52_16993 [Flavobacterium sp. F52]
Length = 816
Score = 530 bits (1366), Expect = e-147, Method: Compositional matrix adjust.
Identities = 299/768 (38%), Positives = 445/768 (57%), Gaps = 41/768 (5%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + N LK+ ++ PA + +A+P+GNGRLGAMV+G E L+LNE+T+W G P +
Sbjct: 18 TATAQNDLKLWYDKPASIWNEALPLGNGRLGAMVFGDPAVERLQLNEETIWAGSPNSNAH 77
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEE 122
+ +AL VR L+ G++ EA + K + D YQ G + + F+ H KY +
Sbjct: 78 TKSIEALPKVRQLIFEGKFDEAQDLATKDIMSQTNDGMPYQTFGSVYISFN-GHQKYTD- 135
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y R+LD++ ATA+VKY V VEFTRE ++ DQVIV K+S S+ G ++ NV ++S +D
Sbjct: 136 -YYRDLDISNATAKVKYKVNGVEFTREILTAFSDQVIVMKLSASKPGQITCNVFMNSPID 194
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
NQII+ G N + ++F L K + G I A + L
Sbjct: 195 KTVTSTEGNQIILSGTG--------TNFENVKGKVKFQGRLTAK--NKGGEIDA-SNGVL 243
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+ +D +L + +++F N D D ++S L + ++ H+D YQ
Sbjct: 244 SINKADEVILYISIATNFK----NYKDISGDEIAKSKDYLAKAEIKDFENIKKAHVDYYQ 299
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
K F+RV++ L S E + P+ ER++ F DP L L FQFGRYLLI
Sbjct: 300 KFFNRVALDLG-----------SNELVKK-PTNERIRDFSKQFDPQLASLYFQFGRYLLI 347
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSS+PG Q ANLQGIWN+ ++P WDS NIN EMNYW + NL E EP L
Sbjct: 348 SSSQPGGQPANLQGIWNDMVTPPWDSKYTTNINAEMNYWPAQVTNLQELHEPFVQMAKEL 407
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+I G++TA++ Y A+GWV+HH TDIW + +A +WP GGAW+C LWE Y YT
Sbjct: 408 AITGAETARMMYNANGWVLHHNTDIW-RVTAPVDSAASGMWPTGGAWVCQDLWERYLYTG 466
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+ +L + YP+++G A F LD++I + + GYL PS+SPE+ GK + ++ +T
Sbjct: 467 DKKYLAE-IYPIMKGAADFFLDFMIVDPNTGYLVVVPSSSPENTHAGGTGK-STIASGTT 524
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MD +I ++F+ ++ A+ ++ + A V+KV ++L ++ P KI + + EW D+ +P+
Sbjct: 525 MDNQLIFDLFTHVMEASALISPDA-AYVKKVSEALAKMPPMKIGKHSQLQEWQDDWDNPK 583
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
+HRH+SHL+GL+P + I+ K P+L +AA+++L R +E GWS+ WK LWARL +
Sbjct: 584 DNHRHVSHLYGLYPSNQISPIKTPELFEAAKQSLIYRTDESTGWSMGWKVNLWARLLEGN 643
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY++++ +LV + K GG Y N+ AH PFQID NFG TA AEML+QS + +
Sbjct: 644 HAYKLIQDQLHLVTADQRKG--GGTYPNMLDAHQPFQIDGNFGCTAGFAEMLMQSQEDAI 701
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
LLPALP W G +KGL ARGG + + WK+ + E+ IYS N
Sbjct: 702 QLLPALP-TVWKDGSIKGLVARGGFVIDMTWKNNKVSELKIYSKIGGN 748
>gi|182416090|ref|YP_001821156.1| alpha/beta hydrolase domain-containing protein [Opitutus terrae
PB90-1]
gi|177843304|gb|ACB77556.1| Alpha/beta hydrolase fold-3 domain protein [Opitutus terrae PB90-1]
Length = 1094
Score = 530 bits (1365), Expect = e-147, Method: Compositional matrix adjust.
Identities = 313/777 (40%), Positives = 443/777 (57%), Gaps = 66/777 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
+A + T LK+ + PA + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW G P D
Sbjct: 337 SAPEEAATAALKLWYRQPAAQWVEALPVGNGRLGAMVFGGIQQERLQLNEDTLWAGGPYD 396
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKY 119
+P+A AL ++R L+ +G YA A + K G P YQ +GD+ + S
Sbjct: 397 PASPEARAALPEIRRLISAGNYAAAQQLTQGKFMGRPIVQMPYQTVGDLMITQAGSE--- 453
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-------GSLS 172
YRRELDL+TA AR +Y +G V F RE F+S DQVIV +++ S + G LS
Sbjct: 454 QVANYRRELDLDTAIARTEYVLGGVTFVREVFASPVDQVIVIRLTASRNPPRPEWGGPLS 513
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK--ISDD 230
F ++ S + +G ++++ G +N D GI+ E + + +
Sbjct: 514 FTLAFQSPQRATAAADGA-ELVLSG------------SNSDAAGIKGRLKFEARARLIVE 560
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
G + A + L+V+G+ A +LL A++S+ D DP + + + L ++ Y
Sbjct: 561 GGAVVA-DGTDLQVQGAHAATILLAAATSYR----RYDDVSGDPAALNRATLAAVATKPY 615
Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV 350
+ H+ ++Q+LF RVS+ D+ T ++ +P+ ERV+ T DP+L
Sbjct: 616 EAIRAAHVAEHQRLFRRVSL-------DLGTSYAAQ-----LPTDERVRLSTTSVDPALA 663
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
L FQ+ RYLLISSSRPG+Q ANLQG+WN+ ++P W S +NIN EMNYW + NL+E
Sbjct: 664 ALYFQYARYLLISSSRPGSQPANLQGLWNDHVTPPWGSKYTININTEMNYWPAEVANLAE 723
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
C EP+F + L+ G+K AQ Y A GWV+HH TD+W +++A W +WP GGAWL
Sbjct: 724 CTEPVFSMIRDLTETGTKMAQAQYGARGWVVHHNTDLW-RAAAPIDGAFWGMWPTGGAWL 782
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP 529
C WEHY Y+ DR+FL R YP L+G A F LD L+ E +L T+PS SPE+
Sbjct: 783 CRTAWEHYLYSGDREFL-ARIYPWLKGAAEFFLDTLVEEPRHRWLVTSPSISPENAH--- 838
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+S TMD IIR++FS +I+A+E L + D +KV + RL P +I G
Sbjct: 839 -HPGVTISAGPTMDEQIIRDLFSEVITASEQLGVDAD-FRQKVAAARARLAPNQIGAQGQ 896
Query: 590 IMEWAQDFK--DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
+ EW +D+ PE HRH+SHL+GLFP I P+L AA+KTL+ RG+ GW+I
Sbjct: 897 LQEWVEDWDAIAPEQDHRHVSHLYGLFPSDQIDPRTTPELAAAAKKTLETRGDISTGWAI 956
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
W+ LW RL D E AY++++ L+ PE Y NLF AHPPFQID NFG
Sbjct: 957 AWRLNLWTRLADAERAYKILR---ALLAPERT-------YPNLFDAHPPFQIDGNFGGAN 1006
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+AEML+QS ++ LLPALP W +G VKGL+ARGG V + W + L V + S
Sbjct: 1007 GIAEMLLQSHRGEIELLPALP-KAWPTGSVKGLRARGGFEVDLAWANQQLVRVELRS 1062
>gi|380694480|ref|ZP_09859339.1| alpha-L-fucosidase [Bacteroides faecis MAJ27]
Length = 804
Score = 530 bits (1364), Expect = e-147, Method: Compositional matrix adjust.
Identities = 307/815 (37%), Positives = 448/815 (54%), Gaps = 61/815 (7%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
T+ N L + + PAK + +A+P+GNGRLGAM++G E ++ NE+TL++G P N
Sbjct: 11 TNAQNHLTLWYKSPAKAWEEALPVGNGRLGAMIFGDTQKERIQFNENTLYSGEPETPKNI 70
Query: 67 DAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
+ L+ +R L+ G+ AEA T K G + YQ GD+ ++FD K A Y
Sbjct: 71 NIVPDLAHIRQLLGEGKNAEAGTIMQEKWIGRLNEAYQPFGDLYIDFDS---KEAVTDYM 127
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
LD+ A Y V+ +RE F+S P Q IV + S+ L+F L S +
Sbjct: 128 HSLDMENAVVTTSYKQNGVDISREVFASYPAQAIVIHLKSSKP-VLNFTAYLAS--PHPV 184
Query: 186 YVNGNNQII-MEGRCPG---------------KRIPPK--------------ANAND-DP 214
++Q++ ++G+ P +R+ P+ N+ D
Sbjct: 185 TKESDSQVVYLKGQAPAHAQRRDTDHMKRFNTQRLHPEYFDASGHIIQKKQVIYGNEMDG 244
Query: 215 KGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDP 274
KG F A L + +G ++ D ++ L+L A++S++GP +PS K+P
Sbjct: 245 KGTFFEACL---LPTHKGGQLSISDNQITARNCSEVTLMLYAATSYNGPRKSPSKEGKNP 301
Query: 275 TSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPS 334
M+ + +Y +L +H DYQ LF+RVS L + + +P+
Sbjct: 302 HQAIMNYRRISEGETYKELKRQHTTDYQALFNRVSFDLPANKQQ-----------KELPT 350
Query: 335 AERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNI 394
ER+K F+ +ED +L+ LFQFGRYL+I+ SR Q NLQG+WN+ + P W+S +NI
Sbjct: 351 DERLKRFKDEEDQALIAQLFQFGRYLMIAGSRGEGQPLNLQGLWNDQILPPWNSGYTLNI 410
Query: 395 NLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSAD 454
NLEMNYW + NLSEC +PLF + ++ G A+ Y +GW IHH IW ++
Sbjct: 411 NLEMNYWPAEVTNLSECHQPLFKLIEEIADKGKDLARDMYGLNGWAIHHNISIWREAYPS 470
Query: 455 RGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYL 514
G V W W M G WLC HLWEHY +T D +FL K+ YP+L+G A+F +WL++ G L
Sbjct: 471 DGFVYWFFWNMSGPWLCNHLWEHYLFTKDANFL-KKYYPILKGAATFCSEWLVKNSKGEL 529
Query: 515 ETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK 574
T STSPE+ ++ D A V STMD+AIIR +FS I AAE+L+ + D E ++K
Sbjct: 530 VTPVSTSPENAYLMGDHTPASVCEGSTMDIAIIRSLFSNTIQAAEILQTDMDFRSE-LIK 588
Query: 575 SLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKT 634
+L+ +I G ++EW +++K+ E HRH+SHLFGL+PG IT + P++ KAA K+
Sbjct: 589 KRNKLKKYQIGSKGQLLEWDKEYKESEPQHRHVSHLFGLYPGCDIT-DSTPEVFKAARKS 647
Query: 635 LQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH 694
L RG + GWS+ WK +LW+RL+D +AY + L N +DP + GGLY NL A
Sbjct: 648 LDDRGNKTTGWSMAWKISLWSRLYDSSNAYEALSNLINYIDPHMKAENRGGLYRNLLNA- 706
Query: 695 PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
PFQID NFG TA +AEML+QS +++LLPALP W G +KGLKARGG TV + WK+
Sbjct: 707 LPFQIDGNFGATAGIAEMLLQSHKGNIHLLPALP-PTWKEGNIKGLKARGGFTVDMEWKE 765
Query: 755 GDLHEVGIYSNYSNN----DHDSFKTLHYRGTSVK 785
G + I S Y ++S K H+ K
Sbjct: 766 GKITVANITSPYEQTVEIVYNNSIKKTHFNAGERK 800
>gi|427384395|ref|ZP_18880900.1| hypothetical protein HMPREF9447_01933 [Bacteroides oleiciplenus YIT
12058]
gi|425727656|gb|EKU90515.1| hypothetical protein HMPREF9447_01933 [Bacteroides oleiciplenus YIT
12058]
Length = 809
Score = 529 bits (1363), Expect = e-147, Method: Compositional matrix adjust.
Identities = 298/760 (39%), Positives = 428/760 (56%), Gaps = 45/760 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK +T+A+P+GN RLGAM++GGV +E ++LNE+T+W G P +P A L
Sbjct: 23 LKLWYSQPAKVWTEALPLGNSRLGAMLYGGVVNEQIQLNEETVWGGGPHRNDSPKALGVL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G +Q +G + LEFD H Y++ YRRELDL
Sbjct: 83 PQVRELLFTGREKEAEKMIADNFFTGQHGMPFQTIGSLMLEFD-GHADYSD--YRRELDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V+Y +G V +TR F+S D ++ +I + G++SF + ++
Sbjct: 140 EKAIASVRYKIGEVNYTRTVFTSLADNALIVRIEADKPGAVSFTTRYSTPYKEYAVKKSG 199
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++ G P A I+F +IK ++G +S D ++V+G+D A
Sbjct: 200 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVSVTNDC-IEVKGADAA 248
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + A+++F +N D + T + L Y+ + H + YQKLF RVS+
Sbjct: 249 VIYVTAATNF----VNYKDVSANETRRATEFLAKAMKRPYAQALSAHEEAYQKLFGRVSL 304
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ S K+ ++ R+K F +DP LV L+FQFGRYLLISSS+PG Q
Sbjct: 305 NVGASAKE--------------ETSYRIKHFNEGKDPGLVALMFQFGRYLLISSSQPGGQ 350
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
A LQGIWN +L WD +NIN EMNYW + NL+E EPLF + LS + TA
Sbjct: 351 PAGLQGIWNHELFAPWDGKYTININTEMNYWPAEVTNLTEMHEPLFQMVKELSESAQGTA 410
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
Y GW +HH TD+W + G +WP+GGAWL HLW+HY YT D+ FL+
Sbjct: 411 HTLYDCRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFLQT- 467
Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
AYP L+G A F LD+L+E G++ PS SPE P G ++ TMD I+ +
Sbjct: 468 AYPALKGAADFFLDFLVEHPKYGWMVCAPSMSPEQ---GPPGTGTMLTAGCTMDTQIVLD 524
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
++++SA ++L + + + + + RL P +I + + EW D DP HRH+SH
Sbjct: 525 ALTSVLSATKLLYPDHTSYCDSLQSMIKRLPPMQIGKHNQLQEWLADVDDPRNDHRHVSH 584
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
L+GL+P + I+ +P L +AA+++L RG+ GWSI WK LWARL D +HAY+++K
Sbjct: 585 LYGLYPSNQISPYAHPQLFQAAKRSLLYRGDMATGWSIGWKINLWARLLDGDHAYKIIKN 644
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPW 729
+ NLV+ + + G Y N+F AHPPFQID NFGFTA VAEML+QS L+LLPALP
Sbjct: 645 MLNLVE---DGNPNGRTYPNMFDAHPPFQIDGNFGFTAGVAEMLLQSHDEALHLLPALPG 701
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
D WS G VKGL ARG V + W G+L + S N
Sbjct: 702 D-WSKGSVKGLVARGAFEVDMDWDGGELTTATVTSRIGGN 740
>gi|392964290|ref|ZP_10329711.1| Alpha-L-fucosidase [Fibrisoma limi BUZ 3]
gi|387847185|emb|CCH51755.1| Alpha-L-fucosidase [Fibrisoma limi BUZ 3]
Length = 821
Score = 529 bits (1363), Expect = e-147, Method: Compositional matrix adjust.
Identities = 306/768 (39%), Positives = 434/768 (56%), Gaps = 53/768 (6%)
Query: 13 LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
K+ +N PA + + +A+PIGNGRLGAMV+G V ET++LNE T+W+G P NPDA A
Sbjct: 25 FKLWYNQPAGQTWENALPIGNGRLGAMVYGNVARETIQLNEHTVWSGGPNRNDNPDALAA 84
Query: 72 LSDVRSLVDSGQYAEATAASVKLF----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
L ++R+L+ G+ EA + K H ++Q +G++ L F+ H Y Y R+
Sbjct: 85 LPEIRTLIFDGKQKEAEKLANKAIITKKAH-GQMFQPVGNLHLTFN-GHDNYTN--YYRD 140
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LD+ A A+ Y+V V +TRE F+S PDQVIV ++ S+ G + F S +
Sbjct: 141 LDIERAIAKTTYTVDGVAYTREVFTSFPDQVIVVHLTASKPGRIDFTASYST-------- 192
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDP--KG-IQFSAILEIKISDDRGTISALEDKKLKV 244
Q P K + +D KG ++F I IK ++GT+++ D L V
Sbjct: 193 ---QQKADRKTTPAKDLTIAGTTSDHEGVKGMVRFKGITRIKT--EKGTLAS-TDTTLTV 246
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+G++ A + + +++F+ + D D + + S L SY+ + T H+ YQ
Sbjct: 247 KGANAATIYISIATNFN----SYKDVSGDENARAESYLNKAYPKSYAAMLTPHVAAYQNY 302
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV + L +P + +P+ ER+K+F+T DP L +Q+GRYLLISS
Sbjct: 303 FNRVRLDLGSTPTEAAK----------LPTDERLKNFRTATDPEFATLYYQYGRYLLISS 352
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN + P WDS +NIN +MNYW + NL+E EP + LS
Sbjct: 353 SQPGGQPANLQGIWNHRMRPPWDSKYTININAQMNYWPAEKTNLAELHEPFLRMVNELSE 412
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G +TA+V Y A GW+ HH TDIW + A G W +W GG W HLWEHY Y D+
Sbjct: 413 AGQETARVMYGARGWMAHHNTDIWRTTGAIDG-ATWGMWIAGGGWTAQHLWEHYLYNGDK 471
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+L YP+L+G A F +D+LIE H Y L NP TSPE+ A G + + +TM
Sbjct: 472 AYLAS-VYPILKGAAQFYVDYLIE-HPKYHWLVVNPGTSPENAPKAHGG--SSLDAGTTM 527
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D I +VFS I AAE+L K + A V+ + + +L P + + G + EW +D DP
Sbjct: 528 DNQIAFDVFSTAIRAAEIL-KTDVAFVDTLKQKRSQLPPMHVGQHGQLQEWLEDIDDPND 586
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH+SHL+GLFP + I+ + PDL AA+ +L RG+ GWS+ WK WARL D H
Sbjct: 587 KHRHISHLYGLFPSNQISPYRTPDLYSAAQTSLIHRGDVSTGWSMGWKVNWWARLQDGNH 646
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY +++ N + P GG Y+NLF AHPPFQID NFG T+ + EML+QS ++
Sbjct: 647 AYTLIQ---NQLTPLGVNKEGGGTYNNLFDAHPPFQIDGNFGCTSGITEMLLQSADGAIH 703
Query: 723 LLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
+LPALP D W +G V GL+ARGG E V + WK G L ++ + SN N
Sbjct: 704 ILPALP-DVWPTGSVTGLRARGGFEVVDMQWKAGKLTKLTVKSNLGGN 750
>gi|346225024|ref|ZP_08846166.1| alpha-L-fucosidase [Anaerophaga thermohalophila DSM 12881]
Length = 828
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 309/776 (39%), Positives = 438/776 (56%), Gaps = 55/776 (7%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A+ + LK+ ++ PA + +A+PIGNGRLGAMV+G +E ++LNE+T W+G P
Sbjct: 20 AKEMAQKTDLKLWYDKPANVWNEALPIGNGRLGAMVFGDPANEKIQLNEETFWSGGPSHN 79
Query: 64 TNPDAPKALSDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHL 117
NP A KAL VR L+ G+Y EA + + +L G +YQ +G++ L FD H
Sbjct: 80 DNPKALKALPKVRQLIFEGKYYEAEKMVNESMVAEQLHG---SMYQTIGNLNLSFD-GHE 135
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
Y Y RELD+ A Y+V +V F RE F+S P+Q+I K+S + GSLSF SL
Sbjct: 136 NYT--NYYRELDIENALFSTTYTVNDVNFKREVFASFPNQIIAVKLSSDQHGSLSFTASL 193
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISA 236
+ L ++ V N + M G +++++ +G ++F+ KI +D G I
Sbjct: 194 NGPLAKNTQVLDTNILEMTGI---------SSSHEGVEGQVKFNT--RAKILNDGGKIKT 242
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+ K+ V +D V+L+ +++F ++ + + L S+++L
Sbjct: 243 -DGNKITVTKADEVVILISMATNF----VDYKTLSANENEQCQKFLSEASQKSFAELKNA 297
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ DY+K F R S+ L +P SE P+ R+K+F DP+LV L +QF
Sbjct: 298 HIKDYRKYFTRSSLNLGTTP-------ASE-----YPTDVRIKNFSQTNDPALVALYYQF 345
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISSSRPG Q ANLQGIWN P WDS +NIN EMNYW + CNL+E EPL
Sbjct: 346 GRYLLISSSRPGGQPANLQGIWNNSTHPAWDSKYTININTEMNYWPAEKCNLTELHEPLI 405
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ LS GS TAQ Y GWV HH TDIW G W +WPMGGAWL HLWE
Sbjct: 406 QMVRELSETGSHTAQTMYGCDGWVTHHNTDIWRICGVVDG-AFWGMWPMGGAWLSQHLWE 464
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPSTSPEHEFIAPDGKLAC 535
+ Y D +L Y +++ F ++LIE +G+L +PS SPE+ AP G+
Sbjct: 465 KFLYNGDMKYLAS-VYSIMKSACRFYQNFLIEEPVNGWLVVSPSVSPEN---APAGR-PS 519
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEW 593
++ +TMD I+ ++FS I AA +L ++E+ + +L SLP P +I + G + EW
Sbjct: 520 ITAGATMDNQILFDLFSKTIKAATLLNQDENLISDFRNILDSLP---PMQIGQYGQLQEW 576
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+D PE HRH+SHL+GL+P + I+ +P+L +AA TLQ RG+ GWS+ WK
Sbjct: 577 MEDLDSPEDKHRHISHLYGLYPSNQISPYSSPELFEAARTTLQHRGDVSTGWSMAWKVNF 636
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WAR+ D HA +++K +LVDP + GG Y NL AHPPFQID NFG TA +AEML
Sbjct: 637 WARMLDGNHARKLIKDQLSLVDPGKDGR-NGGTYPNLLDAHPPFQIDGNFGCTAGIAEML 695
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+QS ++ LPALP D+W +G + GL+ GG VS W++G L + I S N
Sbjct: 696 LQSHDGAIHFLPALP-DEWKNGEITGLRTPGGFEVSCKWENGQLIKAEIKSTLGGN 750
>gi|399073647|ref|ZP_10750601.1| hypothetical protein PMI01_01669 [Caulobacter sp. AP07]
gi|398041300|gb|EJL34368.1| hypothetical protein PMI01_01669 [Caulobacter sp. AP07]
Length = 783
Score = 529 bits (1362), Expect = e-147, Method: Compositional matrix adjust.
Identities = 302/787 (38%), Positives = 453/787 (57%), Gaps = 60/787 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PAK + +A+P+G GR+GAMV+GGV E L+LN+DTLW G P D NP A AL
Sbjct: 35 RLWYRQPAKEWVEALPVGTGRIGAMVFGGVAEERLQLNDDTLWAGGPYDPVNPQARAALP 94
Query: 74 DVRSLVDSGQYAEAT-AASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ +G AEAT A + P YQ +GD+ L F L + Y R+LDL
Sbjct: 95 EIRRLIAAGDIAEATKVADARFLATPRYQMSYQTIGDLRLAF--PGLPETADDYVRDLDL 152
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH--SYVN 188
+ A A ++S G FTRE +S PD+VI +++ ++ +LS ++S S L++ +
Sbjct: 153 DGAIATTRFSAGATRFTREVIASAPDRVIAVRLTADKAKALSLDLSFASPLNSRPTARAE 212
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G + +++ G + N ++F +++ + GT+ A + L V G+D
Sbjct: 213 GADTLVLAGTGEAQ--------NGVEAALKFEC--RVRVLNKGGTVVA-DGAGLAVRGAD 261
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
VLLL+AS++ F D DP + + +A+++ + DL RH D++KLF RV
Sbjct: 262 -EVLLLIASATSYRRF---DDVGGDPAAINRTAVEAASARPWRDLLARHQADHRKLFRRV 317
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
++ L + + P+ ER+K+ T +DP+L L +Q+GRYLLI+ SRPG
Sbjct: 318 AVDLGTTSAALK------------PTDERIKASPTTDDPALAALYYQYGRYLLIACSRPG 365
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQG+WN+ +P W S +NIN EMNYW + P L+EC PL + + LS+ G++
Sbjct: 366 GQPANLQGLWNDQAAPPWGSKYTININTEMNYWPAEPTGLAECVAPLVEMVRDLSVTGAR 425
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TAQ Y A GWV HH TD+W +++A + +WP GGAWLC HLW+HY+Y D+ +L
Sbjct: 426 TAQAMYGARGWVAHHNTDLW-RATAPIDGAKYGVWPTGGAWLCKHLWDHYDYGRDQAYLA 484
Query: 489 KRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
YPL+ G A F +D L+ + G + T+PS SPE++ G + TMD AII
Sbjct: 485 D-VYPLMRGAALFFVDTLVRDPRTGQVVTSPSISPENDH----GHGGSLVAGPTMDQAII 539
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHHR 605
R++FS+ I+AA +L + L + + RL P KI +DG + EW D+ E+HHR
Sbjct: 540 RDLFSSCIAAAAIL-GTDAPLAAILAAARDRLAPYKIGKDGQLQEWQDDWDADAKEIHHR 598
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFP I I+K P L AA ++L+ RG+ GW+I W+ LWARL + +HA+
Sbjct: 599 HVSHLYGLFPSDQIAIDKTPALAAAARRSLEIRGDLSTGWAIAWRLNLWARLGEGDHAHG 658
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
+ L L+ PE Y N+F AHPPFQID NFG T+ + EM++QS ++ LLP
Sbjct: 659 I---LGLLLGPERT-------YPNMFDAHPPFQIDGNFGGTSGMTEMILQSRNGEILLLP 708
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVK 785
ALP W SG + GL+ARG V + W G L E +++ ++ H + Y G ++
Sbjct: 709 ALP-SAWPSGRLTGLRARGAVGVDVVWARGRL-ESAVFTAAADGRHH----VRYAGGAID 762
Query: 786 VNLSAGK 792
++L AG+
Sbjct: 763 LDLKAGQ 769
>gi|430749774|ref|YP_007212682.1| hypothetical protein Theco_1545 [Thermobacillus composti KWC4]
gi|430733739|gb|AGA57684.1| hypothetical protein Theco_1545 [Thermobacillus composti KWC4]
Length = 845
Score = 528 bits (1361), Expect = e-147, Method: Compositional matrix adjust.
Identities = 316/832 (37%), Positives = 455/832 (54%), Gaps = 87/832 (10%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+PIGNGRLGAM++GGV + + LNEDTLW G P + + +A + L+
Sbjct: 7 RLWYRRPAGVWEEALPIGNGRLGAMLFGGVRLDRILLNEDTLWAGYPRETVDCEARRHLA 66
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L+ +G+ EA ++ G Y LG++ +E+ D + Y R L +
Sbjct: 67 RARELIFAGRLTEAQRLIESRMTGRNVQPYLPLGELAIEWLDGEDDAPD--YVRSLRIFD 124
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-VNGNN 191
A V+++ G + R +++S PDQVIV + +E G ++ +L S + + ++
Sbjct: 125 GVADVRFASGGLRMRRAYWASAPDQVIVVRYE-AEGGMMNLAAALSSPVRSSVSVMDDGR 183
Query: 192 QIIMEGRCPG------KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+++ GR P + P+ ++ +G++F A +++ D G + A E ++L V
Sbjct: 184 TLVLAGRAPSHVADNWRGDHPEPVLYEEGRGMRFEA--RVRLETD-GVVEA-EGERLIVR 239
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ + A+++F + P D ++ + L+ Y L RHL D++
Sbjct: 240 GASRLTAYIAAATAFVD-WRTPPDESGAHSARCEAWLREAERSGYEALLERHLADHRAFM 298
Query: 306 HRVSIQLSR----------SP------KDIV-TDTCSEENIDT----------------- 331
RVS++L+ SP KD +DT + + +
Sbjct: 299 GRVSLRLAGGEAAGLPDADSPGSHAAGKDATGSDTAGSDAVGSAAATAESGQAGMDRSEA 358
Query: 332 ---------------VPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQ 375
+P+ ER+K++Q+ + DP+L L FQ+GRYLL++SSRPGTQ ANLQ
Sbjct: 359 GWTASFGLNRVSMNDLPTDERLKAYQSGNPDPALEALYFQYGRYLLLASSRPGTQPANLQ 418
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
GIWN + P W S +NIN EMNYW + CNLSEC EPLF L L+ +G++TA+++Y
Sbjct: 419 GIWNPHVQPPWFSDYTININTEMNYWPAEVCNLSECHEPLFAMLGELAESGTRTARIHYG 478
Query: 436 ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
GW HH D+W S+ G WA WPMGGAWL THLWE Y + D DFL AYPL+
Sbjct: 479 CRGWTAHHNVDLWRMSTPSDGSASWAFWPMGGAWLATHLWERYLFEPDLDFLRGTAYPLM 538
Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
G A F LDWL+ G DG L TNPSTSPE+ F+ P+G+ V++ STMDMAIIRE+F+A I
Sbjct: 539 RGAAQFCLDWLVPGPDGTLVTNPSTSPENVFLTPEGEPCSVTWGSTMDMAIIRELFAACI 598
Query: 556 SAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFP 615
A+ +L +E L ++ +L +L P +I G + EWA D+ + E HRH+SHLFGLFP
Sbjct: 599 EASRLLGTDE-PLRGELEAALAKLPPYRIGRHGQLQEWAVDYDEHEPGHRHVSHLFGLFP 657
Query: 616 GHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMVKRLFN 672
G + E P+L +AA TL++R + G GWS W L+ARL D E A ++ L
Sbjct: 658 GSHLN-ETTPELLEAARVTLERRLKHGGGHTGWSCAWLILLYARLKDAETARGFIRTLLA 716
Query: 673 LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKW 732
Y NL AHPPFQID NFG A +AE+LVQS L + LLPALP D W
Sbjct: 717 R-----------STYPNLLDAHPPFQIDGNFGGAAGIAELLVQSHLGSVDLLPALPAD-W 764
Query: 733 SSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
SG V+GL ARGG T+ I W DG L E I S Y + H R +V
Sbjct: 765 RSGEVRGLHARGGFTIDIAWADGTLREARITSRYGK----PLRVRHARPVAV 812
>gi|78047362|ref|YP_363537.1| hypothetical protein XCV1806 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78035792|emb|CAJ23483.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 856
Score = 528 bits (1361), Expect = e-147, Method: Compositional matrix adjust.
Identities = 311/795 (39%), Positives = 438/795 (55%), Gaps = 66/795 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D +P
Sbjct: 105 VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSNSP 164
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 165 DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 221
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS
Sbjct: 222 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISVRVGIDSPQTG 281
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
++ GR N GI+ +++ G +S + D +
Sbjct: 282 EVTAE-QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVRGGKLSQVRD-R 327
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 328 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLASTAACLRKAAKLDFPALLRAHLADH 383
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 384 QRLFRRVAIDLGSSAA------------TQLPTDERVQRFAEGNDPALAALYHQYGRYLL 431
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 432 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 491
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWP+GG WL LW+ ++Y
Sbjct: 492 LAQAGAHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPLGGVWLLQQLWDRWDYG 550
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 551 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 606
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWAQDF-- 597
MD ++R++F+ I+ +++L DA + L +L +L P +I + G + EW QD+
Sbjct: 607 -MDAQLLRDLFAQCIAMSKLL--GIDAEFAQQLAALREQLPPNRIGKAGQLQEWQQDWDM 663
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+ PE+HHRH+SHL+ L P I + PDL AA ++L+ RG+ GW I W+ LWARL
Sbjct: 664 QAPEIHHRHVSHLYALHPSSQINLRDTPDLAAAARRSLEIRGDNATGWGIGWRLNLWARL 723
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D EHAYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 724 ADGEHAYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSW 773
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTL 777
++LLPALP W G V+GL+ RGG +V + W+ G L + ++S D L
Sbjct: 774 GGSVFLLPALP-KAWPRGSVRGLRVRGGASVDLEWEGGRLQQARLHS-----DRGGRYQL 827
Query: 778 HYRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 828 SYAGQTLDLELGAGR 842
>gi|312792729|ref|YP_004025652.1| alpha-L-fucosidase [Caldicellulosiruptor kristjanssonii 177R1B]
gi|312179869|gb|ADQ40039.1| Alpha-L-fucosidase [Caldicellulosiruptor kristjanssonii 177R1B]
Length = 752
Score = 528 bits (1361), Expect = e-147, Method: Compositional matrix adjust.
Identities = 310/797 (38%), Positives = 445/797 (55%), Gaps = 66/797 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ FN PA+ + +A+PIGNG LGAM++GGV ET++LNE+++W+ P NPDA K L
Sbjct: 6 LKVIFNKPARCWEEALPIGNGSLGAMIYGGVKYETIQLNEESIWSCGPRRRENPDAFKYL 65
Query: 73 SDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R + G A SV H Y+ LG +++ F++ + Y R LD
Sbjct: 66 PEIRKTILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEEVESDKVK-NYTRYLD 124
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS----FNVSLDSLLDNHS 185
++ A +V++ V N+ + + +FSS PD+VIV KI S++G++S F +D
Sbjct: 125 ISNAICKVEFDVDNIRYKKIYFSSYPDKVIVVKICSSKTGAVSLRAKFRREYQEDIDKCG 184
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
V+ N++I E C + +G+ FSA+L+ +S D G + + D L V+
Sbjct: 185 KVD-NDKIFFE--CLA----------GEGRGVSFSAVLK-AVSKD-GDVYTIGDN-LFVK 228
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ +LL+ +++S+ +KD + + ++ + +LY RH +DY+ LF
Sbjct: 229 NATEVMLLITSTTSY---------KEKDYFNWCLKTVEQASKYVFENLYKRHTEDYKSLF 279
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + T+ + E I+ + + D L+ LLFQFGRYLLISSS
Sbjct: 280 SRVEFYIDTKDSSKCTELTTPERINLLREGYK--------DEELIVLLFQFGRYLLISSS 331
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC PLFD L + N
Sbjct: 332 RPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMPLFDLLEKMYEN 391
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TAQ Y G+ HH TDIW ++ + WPMG AWLC H+WEHY YT D +
Sbjct: 392 GKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYLPATYWPMGAAWLCLHIWEHYEYTGDIN 451
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL KR Y L++ A FLLD+LIE +GYL T PS SPE+ + +G++ ++Y TMD+
Sbjct: 452 FL-KRYYYLMKEAALFLLDYLIEDKNGYLVTCPSCSPENRY-KLNGEVYSLTYMPTMDIQ 509
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
II +F + A VL+ N D +VEK+ +L +L P KI + G I EW +D+++ E HR
Sbjct: 510 IITALFEKVKKANNVLKLN-DEIVEKIEYALNKLPPIKIGKHGQIQEWIEDYEEAEPGHR 568
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWARLHDQEH 662
H+SHLFGL+P IT EK P L KAA+KTLQ+R + G GWS W WARL +
Sbjct: 569 HISHLFGLYPEDQITFEKTPHLFKAAKKTLQRRLDYGSGHTGWSRAWIICFWARLKEGNK 628
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + L + NL HPPFQID NFG TA +AEML+QS+ +
Sbjct: 629 AYENILEL-----------LKKSTLPNLLDNHPPFQIDGNFGATAGIAEMLMQSSDETIE 677
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALP D W G +KGLKARGG T+ + W++G I + + + Y+ +
Sbjct: 678 LLPALP-DSWERGYIKGLKARGGHTIDLYWENGTFKMARIVIGFRES-----VAIKYKDS 731
Query: 783 SVKVNLSAG--KIYTFN 797
V + S G KI ++N
Sbjct: 732 FVVIKGSQGEEKIISYN 748
>gi|189462578|ref|ZP_03011363.1| hypothetical protein BACCOP_03268 [Bacteroides coprocola DSM 17136]
gi|189430739|gb|EDU99723.1| intein C-terminal splicing region [Bacteroides coprocola DSM 17136]
Length = 866
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 310/766 (40%), Positives = 434/766 (56%), Gaps = 45/766 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK + +A+P+GN +GAMV+GG E L+LNE+TLW G P NP A ++L
Sbjct: 68 LKLWYQQPAKTWVEALPVGNSSMGAMVYGGTSREELQLNEETLWGGGPYRNDNPKALESL 127
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++VR+L+ SG+ +A + F G YQ +G + +E H K + Y R+L+L
Sbjct: 128 AEVRNLIFSGKTMDAQNLIDQTFYTGRNGMPYQTIGSLIIE-APGHEK--AKNYYRDLNL 184
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+VI+ + + + G L+F VS DS L + G
Sbjct: 185 ERAVATTRYQVDGVNFQREVFASFPDRVIIVRFTTDKPGELNFKVSYDSPLQSTVRKQGK 244
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD---RGTISALEDKKLKVEGS 247
++++ G+ D +G++ ++E++ G +L DK + VE +
Sbjct: 245 -KLVLRGK------------GGDHEGVK--GVIEVETQSQVIAEGGKVSLTDKYISVEHA 289
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
A L + A+++F +N + K + + ++ + L YS+ H D YQ F+R
Sbjct: 290 TAATLYIAAATNF----VNYHNVKGNESKKASALLAGAMKKEYSEALKAHTDYYQSQFNR 345
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
VS+ L T T +E + +R+ F DP+L L+FQ+GRYLLISSS+P
Sbjct: 346 VSLSLGGEN----TKTARQETV------KRIAGFSQGNDPALAALMFQYGRYLLISSSQP 395
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQGIWN L+ WD +NIN EMNYW + NLSE EPLF + LS+ G
Sbjct: 396 GGQPANLQGIWNHQLNAPWDGKYTININTEMNYWPAEVTNLSETHEPLFGLVQDLSVTGR 455
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+TA+ Y +GWV HH TDIW + + K + WP+GGAWL THLW+HY YT D+DFL
Sbjct: 456 ETARTMYGCNGWVAHHNTDIW-RVTGPVDKAFYGTWPVGGAWLTTHLWQHYLYTGDKDFL 514
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMA 545
K +YP ++G A F L ++I G+ T PS SPEH D K A S TMD
Sbjct: 515 RK-SYPAMKGAADFFLGYMIPHPKYGWKVTAPSMSPEHGPKGEDTKKASTIVSGCTMDNQ 573
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
II +V S ++A+E+LE + A + + L + P +I + EW +D DP+ HR
Sbjct: 574 IIFDVLSNTLAASEILELSA-AYRDSLRTLLSEMAPMQIGRYNQLQEWLEDLDDPKDGHR 632
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SH +GLFP + I+ +P L +A + TL +RG++ GWSI WK LWARL D HAY+
Sbjct: 633 HVSHAYGLFPSNQISPFTHPQLFQAVKNTLLQRGDKATGWSIGWKINLWARLLDGNHAYK 692
Query: 666 MVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
M+ L L+ D E++ EG Y NLF AHPPFQID NFGFTA VAEML+QS ++L
Sbjct: 693 MISNLLVLLPNDEVKEEYPEGRTYPNLFDAHPPFQIDGNFGFTAGVAEMLLQSHDGAVHL 752
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
LPALP DKW G VKGL A GG V + W L I+S N
Sbjct: 753 LPALP-DKWEEGKVKGLVAHGGFVVDMDWNGVQLDTAKIHSRIGGN 797
>gi|260910947|ref|ZP_05917588.1| fibronectin type III domain protein [Prevotella sp. oral taxon 472
str. F0295]
gi|260634938|gb|EEX52987.1| fibronectin type III domain protein [Prevotella sp. oral taxon 472
str. F0295]
Length = 792
Score = 528 bits (1360), Expect = e-147, Method: Compositional matrix adjust.
Identities = 308/796 (38%), Positives = 453/796 (56%), Gaps = 47/796 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP-- 69
P K+ ++ PA F +A+PIGNG+LGAMV+G V ++ L LN+ TLW+G P D N DA
Sbjct: 24 PQKLWYDKPATFFEEALPIGNGKLGAMVYGDVWNDNLFLNDLTLWSGQPID-PNEDAGAH 82
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSH--LKYAEETYRRE 127
K + ++R + Y A + +++ GH + YQ L + ++ +S + + + YRRE
Sbjct: 83 KWIPEIRKALFEENYKLADSLQLRVQGHNSAWYQPLSIVSIQPINSQGSSQASIKNYRRE 142
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL++A A+V Y + V + RE+ +++PD+ I+ +++ S+ +L+ +SL S+L +
Sbjct: 143 LDLDSALAKVSYEIDGVTYRREYLATHPDRAILLRLTASKPRALNLRLSLTSILSH---- 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ R G I +A P + F +L+ K +D GTI+A +D L +
Sbjct: 199 --------QLRAEGDLIRLTGHAMGHPDSTVHFCNLLQAKATD--GTITA-QDTTLLINN 247
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ VL LV +S++G +P + + L+S+++ S+ L HLDDYQ LF
Sbjct: 248 ATQVVLYLVNETSYNGFDKHPVTQGAPYVQLAEADLKSLQDCSFEQLKQNHLDDYQALFG 307
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+QL + D T ++ +D E +P L L FQFGRYLLISSSR
Sbjct: 308 RVSLQLGGAQFD-TNRTTEQQLLDYTDKCE--------ANPYLEALYFQFGRYLLISSSR 358
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
ANLQG+WN L W S VNINLE NYW + NL+E PL + LS+NG
Sbjct: 359 TPGVPANLQGLWNPHLKAQWRSNYTVNINLEENYWPAQVANLAEMTMPLTGMVKALSVNG 418
Query: 427 SKTAQVNY-LASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
A+ Y + GW H TD+WA ++ R WA W +GGAWL ++LWE Y++T
Sbjct: 419 RYAARNYYGINEGWCSSHNTDLWAMTNPVGEKRESPEWADWNLGGAWLLSNLWEQYDFTR 478
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR++L + +PL++G F+L WLI G L T PSTSPE+E++ P+G Y
Sbjct: 479 DRNYLRETLFPLMKGACDFMLQWLIGNPKKPGELITAPSTSPENEYVTPEGYHGTTMYGG 538
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
T D+AI+RE+F+ +A E L A +K+ +++ RL P I ++G + EW D++D
Sbjct: 539 TADLAILRELFANTATADETLNGRPTAYSKKLRQTIARLHPYTIGKEGDLNEWYYDWRDF 598
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
+ HRH +HL GL+PGH +++ P+L +AA K+L ++G+ GWS W+ LWARL++
Sbjct: 599 DPQHRHQTHLIGLYPGHHLSLGTTPELAEAARKSLIQKGDISTGWSTGWRINLWARLYNG 658
Query: 661 EHAYRMVKRLFNLVDPEH----EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
E AY++ +RL V P+ +K GG Y N F AHPPFQID NFG TA + EML+QS
Sbjct: 659 EKAYQIFRRLLTYVSPDKYKGPDKRVSGGTYPNFFDAHPPFQIDGNFGGTAGICEMLIQS 718
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKT 776
+ + LLPALP W+SG VKGL ARGG + W DG + +V I S T
Sbjct: 719 S-RGIKLLPALP-SAWTSGSVKGLCARGGFVLDFSWHDGRITQVRIKSTVGGQ-----TT 771
Query: 777 LHYRGTSVKVNLSAGK 792
L+Y G KVNL AG+
Sbjct: 772 LYYNGKVQKVNLKAGE 787
>gi|294666331|ref|ZP_06731579.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292603880|gb|EFF47283.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 830
Score = 528 bits (1359), Expect = e-147, Method: Compositional matrix adjust.
Identities = 312/790 (39%), Positives = 439/790 (55%), Gaps = 68/790 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+PDA AL
Sbjct: 85 LQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSPDALAAL 144
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +G+YAEA A KL P YQ LGD+ L+FD + YRR+LD
Sbjct: 145 PQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISDYRRQLD 201
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA A + G RE F S Q IV ++S + G +S V +DS N
Sbjct: 202 LDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCNRPGGISLRVGIDSP-QNGEVTAE 260
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKKLKVEGS 247
++ GR N GI+ +++ G +S + D+ L++E +
Sbjct: 261 QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVSGGKLSQVRDR-LRIEAA 307
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D VLLL A++S+ + D DP + + ++L+ L + L HL D+Q+LF R
Sbjct: 308 DEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRRAAKLDFPALSRAHLADHQRLFRR 363
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V+I L S D P+ ERV+ F DP+L L Q+GRYLLI SSRP
Sbjct: 364 VAIDLGSS------DALQR------PTDERVQRFAEGNDPALAALYHQYGRYLLICSSRP 411
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L L+ G+
Sbjct: 412 GTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDLAQTGA 471
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y DR +L
Sbjct: 472 HTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGRDRAYL 530
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YPL +G A F + L+ + G + TNPS SPE++ P G C S MD +
Sbjct: 531 SK-IYPLFKGAAEFFVATLMRDPQTGAMVTNPSISPENQH--PFGAAVCAGPS--MDAQL 585
Query: 547 IREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEV 602
+R++F+ I+ +++L + + + + LP P +I + G + EW QD+ + PE+
Sbjct: 586 LRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQDWDMQAPEI 642
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HHRH+SHL+ L P I + P+L AA ++L+ RG+ GW I W+ LWARL D EH
Sbjct: 643 HHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGIGWRLNLWARLADGEH 702
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS ++
Sbjct: 703 AYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWGGSVF 752
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALP W G V+GL+ RGG +V + W+ G L + ++S D L Y G
Sbjct: 753 LLPALP-KAWPRGSVRGLRVRGGASVDLEWEGGRLRQARLHS-----DRGGRYQLSYAGQ 806
Query: 783 SVKVNLSAGK 792
++ + L AG+
Sbjct: 807 TLDLELGAGR 816
>gi|390989152|ref|ZP_10259452.1| hypothetical protein XAPC_114 [Xanthomonas axonopodis pv. punicae
str. LMG 859]
gi|372556186|emb|CCF66427.1| hypothetical protein XAPC_114 [Xanthomonas axonopodis pv. punicae
str. LMG 859]
Length = 790
Score = 527 bits (1357), Expect = e-147, Method: Compositional matrix adjust.
Identities = 311/792 (39%), Positives = 438/792 (55%), Gaps = 60/792 (7%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDS---- 211
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+I E PG + N + + L + G +S + D+ L+
Sbjct: 212 ----PQTGEITAE---PGGLLFSGRNGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR-LR 263
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
++ +D VLLL A++S+ + D DP + + + L+ NL + L HL D+Q+
Sbjct: 264 IDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAANLDFPALLRAHLADHQR 319
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF RV+I D S E + +P+ ERV+ F DP+L L Q+GRYLLI
Sbjct: 320 LFRRVAI-----------DLGSSEAVQ-LPTNERVQRFAEGNDPALAALYHQYGRYLLIC 367
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L L+
Sbjct: 368 SSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDLA 427
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y D
Sbjct: 428 QTGTHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGRD 486
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
R +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S M
Sbjct: 487 RAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS--M 541
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDP 600
D ++R++F+ I+ +++L + + +L P +I + G + EW QD+ + P
Sbjct: 542 DAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALR-EQLPPNRIGKAGQLQEWQQDWDMQAP 600
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
E+HHRH+SHL+ L P I + P+L AA ++L+ RG+ GW I W+ LWARL D
Sbjct: 601 EIHHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGIGWRLNLWARLADG 660
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
EHAYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 661 EHAYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWGGS 710
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
++LLPALP W G V+GL+ RGG +V + W+ G L +V ++S D L Y
Sbjct: 711 VFLLPALP-KAWPRGSVRGLRVRGGASVDLEWEGGRLQQVRLHS-----DRGGRYQLSYA 764
Query: 781 GTSVKVNLSAGK 792
G ++ + L AG+
Sbjct: 765 GQTLDLELGAGR 776
>gi|344997079|ref|YP_004799422.1| alpha-L-fucosidase [Caldicellulosiruptor lactoaceticus 6A]
gi|343965298|gb|AEM74445.1| alpha-L-fucosidase [Caldicellulosiruptor lactoaceticus 6A]
Length = 752
Score = 527 bits (1357), Expect = e-146, Method: Compositional matrix adjust.
Identities = 308/797 (38%), Positives = 444/797 (55%), Gaps = 66/797 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ FN PA+ + +A+PIGNG LGAM++GGV ET++LNE+++W+ P NPDA K L
Sbjct: 6 LKVIFNKPARCWEEALPIGNGSLGAMIYGGVKYETIQLNEESIWSCGPRRRENPDAFKYL 65
Query: 73 SDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R + G A SV H Y+ LG +++ F++ + Y R LD
Sbjct: 66 PEIRKTILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEEVESDKVK-NYTRYLD 124
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS----FNVSLDSLLDNHS 185
++ A +V++ V N+ + + +FSS PD+VIV KI S++G++S F +D
Sbjct: 125 ISNAICKVEFDVDNIRYKKIYFSSYPDKVIVVKICSSKTGAVSLRAKFRREYQEDIDKCG 184
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
V+ N++I E C + +G+ FSA+L+ +S D G + + D L V+
Sbjct: 185 KVD-NDKIFFE--CLA----------GEGRGVSFSAVLK-AVSKD-GDVYTIGDN-LFVK 228
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ +LL+ +++S+ +KD + + ++ + +LY RH +DY+ LF
Sbjct: 229 NATEVMLLITSTTSY---------KEKDYFNWCLKTVEQASKYVFENLYKRHTEDYKSLF 279
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + T+ + E I+ + + D L+ LLFQFGRYLLISSS
Sbjct: 280 SRVEFYIDTKDSSKCTELTTPERINLLREGYK--------DEELIVLLFQFGRYLLISSS 331
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC PLFD L + N
Sbjct: 332 RPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMPLFDLLEKMYEN 391
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TAQ Y G+ HH TDIW ++ + WPMG AWLC H+W+HY YT D +
Sbjct: 392 GKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHIWDHYEYTGDLE 451
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL K Y L+ A FLLD+LIE +GYL T PS SPE+ + +G++ ++Y TMD+
Sbjct: 452 FL-KEYYYLMREAALFLLDYLIEDRNGYLVTCPSCSPENRY-KLNGEVYSLTYMPTMDIQ 509
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
II +F + A VL+ N D +VEK+ +L +L P KI + G I EW +D+++ E HR
Sbjct: 510 IITALFEKVKKANNVLKLN-DEIVEKIEYALNKLPPIKIGKHGQIQEWIEDYEEAEPGHR 568
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWARLHDQEH 662
H+SHLFGL+P IT EK P L KAA+KTLQ+R + G GWS W WARL + +
Sbjct: 569 HISHLFGLYPEDQITFEKTPHLFKAAKKTLQRRLDYGSGHTGWSRAWIICFWARLKEGDK 628
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + L + NL HPPFQID NFG TA +AEML+QS+ +
Sbjct: 629 AYENILEL-----------LKKSTLPNLLDNHPPFQIDGNFGVTAGIAEMLMQSSDETIE 677
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALP D W G +KGLKARGG T+ + W++G I + + + Y+ +
Sbjct: 678 LLPALP-DSWERGYIKGLKARGGHTIDLYWENGTFKMARIVIGFRES-----VAIKYKDS 731
Query: 783 SVKVNLSAG--KIYTFN 797
V + S G KI ++N
Sbjct: 732 FVVIKGSQGEEKIISYN 748
>gi|330467858|ref|YP_004405601.1| cellulose-binding family ii [Verrucosispora maris AB-18-032]
gi|328810829|gb|AEB45001.1| cellulose-binding family ii [Verrucosispora maris AB-18-032]
Length = 998
Score = 527 bits (1357), Expect = e-146, Method: Compositional matrix adjust.
Identities = 301/739 (40%), Positives = 409/739 (55%), Gaps = 54/739 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G +E L+LNEDT+W G P D +NP +L+++R LV + Q+ +
Sbjct: 61 ALPIGNGRLGAMVFGNSDTERLQLNEDTVWAGGPHDSSNPRGQGSLAEIRRLVFANQWTQ 120
Query: 87 A-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G+P YQ +G++ L F + Y R+LDL TAT V Y +
Sbjct: 121 AQNLINQTMLGNPVGQLAYQTVGNLRLAFASAS---GTSQYNRQLDLTTATTSVSYVMNG 177
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V F RE F+S PDQVI +++ S S++F + DS I ++G
Sbjct: 178 VRFQREVFASAPDQVIAMRLTADRSASITFTATFDSPQRTTVSSPDGATIALDG------ 231
Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
N ++F L + + G + L+V G+ LL+ SS+
Sbjct: 232 --VSGNQEGVTGAVRF---LALAHATVSGGTVSSSGGTLRVTGATSVTLLVSIGSSY--- 283
Query: 264 FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
+N + D + L + R SY L RH+ DYQ LF RVS+ L R+ +
Sbjct: 284 -VNFRNVGGDYQGIARRHLTAARASSYDQLRARHVADYQALFGRVSLDLGRT-------S 335
Query: 324 CSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
+++ P+ R+ + DP LLFQ+GRYLLISSSRPGTQ ANLQGIWN+ L+
Sbjct: 336 AADQ-----PTDVRIAQHNSVNDPQFSTLLFQYGRYLLISSSRPGTQPANLQGIWNDSLT 390
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
P WDS +N NL MNYW + NLSEC +P+F + L+++G++TAQV Y A GWV HH
Sbjct: 391 PAWDSKYTINANLPMNYWPADTTNLSECYQPVFSMIQDLTVSGARTAQVQYGAGGWVTHH 450
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
TD W SS G W +W GGAWL T +W+HY +T D DFL YP ++G A F L
Sbjct: 451 NTDAWRGSSVVDG-AFWGMWQTGGAWLATMIWDHYLFTGDLDFLRAN-YPAMKGAAQFFL 508
Query: 504 DWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
D L+ E GYL TNPS SPE A A V TMD I+R++F A+E+L
Sbjct: 509 DTLVTEPSLGYLVTNPSNSPEIGHHAD----ASVCAGPTMDNQILRDLFDGCARASEIL- 563
Query: 563 KNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITI 621
N DA +V + RL PT+I G+IMEW D+ + E +HRH+SHL+GL P + IT
Sbjct: 564 -NTDATFRAQVRATRDRLAPTRIGSRGNIMEWLYDWVETERNHRHVSHLYGLAPSNQITR 622
Query: 622 EKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH 681
P L +AA +TL+ RG++G GWS+ WK WARL + A+ +++ L
Sbjct: 623 RGTPQLFEAARRTLEIRGDDGTGWSLAWKINFWARLEEGNRAHDLIRYLATTAR------ 676
Query: 682 FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLK 741
L N+F HPPFQID NFG TA +AEML+ S +L+LLPALP W SG V GL+
Sbjct: 677 ----LAPNMFDLHPPFQIDGNFGATAGIAEMLLHSHAGELHLLPALP-AAWPSGSVSGLR 731
Query: 742 ARGGETVSICWKDGDLHEV 760
RGG TV I W +G E+
Sbjct: 732 GRGGHTVGITWSNGQATEI 750
>gi|430742223|ref|YP_007201352.1| hypothetical protein Sinac_1268 [Singulisphaera acidiphila DSM
18658]
gi|430013943|gb|AGA25657.1| hypothetical protein Sinac_1268 [Singulisphaera acidiphila DSM
18658]
Length = 806
Score = 526 bits (1356), Expect = e-146, Method: Compositional matrix adjust.
Identities = 295/750 (39%), Positives = 433/750 (57%), Gaps = 48/750 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ I + PA+ +T+A+PIGNG+LGAMV+GG SE + LNEDT+W G D TNPDA K+L
Sbjct: 38 MVIHYRRPAEAWTEALPIGNGQLGAMVFGGTGSERIALNEDTVWAGERRDRTNPDALKSL 97
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
++R L+ G+ EA A A + P + YQ LGD+ + F + YRRELD
Sbjct: 98 PEIRRLLRVGKPDEAEALAERTMIAVPKRLPPYQPLGDLRILFPGHD---QADDYRRELD 154
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L++A RV Y VG+ F RE F+S DQV+V +++ G L+F+ +LD D +
Sbjct: 155 LDSAMVRVSYRVGDATFRREVFASAKDQVLVVRLTCDRPGRLAFSATLDRERDARAEAVA 214
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+++++ G + + + ++ G++FSA L + R E +++V +D
Sbjct: 215 PDRVLLRGEAIAR---DERHEDERKVGVKFSAFLRVVTEGGR---VFTEGDRVEVRDADA 268
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L LVA++ F KDP + AL + + Y L + H DD++ F RVS
Sbjct: 269 ATLRLVAATDF---------RSKDPDAACERALAAA-DRPYEPLRSEHEDDHRSFFRRVS 318
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPG 368
++ + +P D +++ +P+ R+ + E DP+L+ FQFGRYLLI+SSRPG
Sbjct: 319 LEFA-APGD-------KDDRAALPTDVRLARVRKGESDPALIAQYFQFGRYLLIASSRPG 370
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T ANLQGIWNE L+P W+S +NIN +MNYW + NL+E +PLFD + + +G +
Sbjct: 371 TMPANLQGIWNESLTPPWESKYTININTQMNYWPAEVANLAELHQPLFDLIEAMRPSGRQ 430
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y A G++ HH TD+WA + KV LWPMG AWL HLW+HY++ DRDFL
Sbjct: 431 TAKALYGARGFMAHHNTDLWAH-TVPVDKVGSGLWPMGAAWLSLHLWDHYDFGRDRDFLA 489
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
+RAYP+++ A FLLD+L++ G L PS SPE+ + DGK+A + TMD+ I
Sbjct: 490 QRAYPVMKEAAEFLLDYLVDDGQGQLIPGPSISPENRYRTADGKVAKLCMGPTMDVEIAH 549
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
+F ++ A+E+L+ + D ++V ++ RL +I + G + EW +D+ +P+ HRH+S
Sbjct: 550 ALFGRVVEASELLDLDPD-FRKRVAEARRRLPSLRIGKHGQLQEWLEDYDEPDPGHRHIS 608
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYR 665
HLF L PG I++ P+L AA TL++R G GWS W WARL D E A+
Sbjct: 609 HLFALHPGDQISLRGTPELAVAARTTLERRLAHGGGRTGWSRAWIINFWARLGDGEQAHE 668
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
V L NL HPPFQID NFG TA +AEML+QS ++ LLP
Sbjct: 669 NVVALLR-----------KSTLPNLLDTHPPFQIDGNFGGTAGIAEMLLQSHSGEISLLP 717
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDG 755
LP W +G +GL+ARGG V++ W++G
Sbjct: 718 TLP-RAWPTGQFRGLRARGGVDVALSWQNG 746
>gi|436835731|ref|YP_007320947.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
gi|384067144|emb|CCH00354.1| alpha-L-fucosidase [Fibrella aestuarina BUZ 2]
Length = 821
Score = 526 bits (1356), Expect = e-146, Method: Compositional matrix adjust.
Identities = 311/772 (40%), Positives = 437/772 (56%), Gaps = 57/772 (7%)
Query: 13 LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
LK+ +N P+ + + +A+PIGNGRLGAMV+G VP ET++LNE TLW+G P NP+A +
Sbjct: 24 LKLWYNTPSGQTWENALPIGNGRLGAMVYGNVPRETIQLNEHTLWSGGPNRNDNPEALAS 83
Query: 72 LSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L ++R L+ + + EA A + K ++Q +G + L FD H Y Y REL
Sbjct: 84 LPEIRQLIFTNKQKEAEALANKTIITKKSHGQMFQPVGSLHLTFD-GHENYTN--YYREL 140
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY-V 187
D+ A A+ Y+V V +TRE +S PDQV+V +++ S+ G L+F S +
Sbjct: 141 DIERAVAKTTYTVDGVTYTREILASLPDQVLVMQLTASKPGRLAFRASYATPQAKPVIKT 200
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
N N++ + G A+ +D KG +++ I IK G++SA +D L V+G
Sbjct: 201 NSTNELTIAG---------TASDHDGVKGLVRYKGIARIKTQG--GSVSA-DDSTLTVKG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ A + L +++F I +D D + + + L + +Y+ + T H+ YQ+ F
Sbjct: 249 ATTATIYLSVATNF----IKYNDVSGDENARAATYLNNAFPKTYAAILTPHVAAYQRYFK 304
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS L + +P+ ER+K+F+T DP LV L +Q+GRYLLISSS+
Sbjct: 305 RVSFDLGST------------EAANLPTDERLKNFRTANDPQLVTLYYQYGRYLLISSSQ 352
Query: 367 PGT-----QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
PG Q ANLQGIWN + P WDS +NIN +MNYW + NL+E EP +
Sbjct: 353 PGRDGVMGQPANLQGIWNNKMRPPWDSKYTININAQMNYWPAEKTNLAELHEPFLQMVRD 412
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
LS G +TA+V Y A GW+ HH TDIW + A G W +W GG W HLWEHY Y+
Sbjct: 413 LSETGQETARVMYGARGWMAHHNTDIWRATGAIDG-AFWGMWIAGGGWTSQHLWEHYLYS 471
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYS 539
D+ +L YP+L+G A F D+L+E H Y L NP +SPE+ A G + +
Sbjct: 472 GDKTYLAS-VYPILKGAALFYADFLVE-HPTYHWLVANPGSSPENAPKAHGG--SSLDAG 527
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWAQDFK 598
+TMD I +VF+ I AA++L+ DA LK L +L P + + G + EW D
Sbjct: 528 TTMDNQIAFDVFTTTIRAADILKT--DAAFADTLKQLRSKLPPMHVGQYGQLQEWLDDVD 585
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
DP HHRH+SHL+GLFP I+ + P+L AA TL RG+ GWS+ WK WARL
Sbjct: 586 DPNDHHRHVSHLYGLFPAVQISPYRTPELFNAARTTLTHRGDVSTGWSMGWKVNWWARLQ 645
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D HAY +++ N + P GG Y+NLF AHPPFQID NFG T+ + EML+QS
Sbjct: 646 DGNHAYTLIQ---NQLTPLGVTKEGGGTYNNLFDAHPPFQIDGNFGCTSGITEMLMQSAD 702
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
++LLPALP D WS+G + GL+A GG E V++ WKDG L +V I SN N
Sbjct: 703 GAIHLLPALP-DVWSAGSIGGLRAIGGFEVVNMAWKDGKLTKVAIKSNLGGN 753
>gi|313203234|ref|YP_004041891.1| alpha-L-fucosidase [Paludibacter propionicigenes WB4]
gi|312442550|gb|ADQ78906.1| Alpha-L-fucosidase [Paludibacter propionicigenes WB4]
Length = 822
Score = 526 bits (1356), Expect = e-146, Method: Compositional matrix adjust.
Identities = 298/758 (39%), Positives = 429/758 (56%), Gaps = 50/758 (6%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
+A+PIGNG LGAMV+G V E ++LNE TLW+G P D NP A +ALS +R+ + G+Y
Sbjct: 55 NALPIGNGFLGAMVYGNVNQELIQLNEKTLWSGSPDDNNNPQAAEALSQIRNFLFEGKYK 114
Query: 86 EATAASVK-------------LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
EA + K P YQ LG++ +F + E Y RELDLN
Sbjct: 115 EANELTNKTQICKGVGSGTGSGTNVPYGSYQTLGNLFFDFGKTA---PFENYVRELDLNR 171
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
V YS V + RE F+S PD+ ++ ++ + G+LSF L + V N+
Sbjct: 172 GVVTVSYSQNGVRYKREIFASYPDRALIIHLTADKKGALSFTTELTRPERFETRVE-NDH 230
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
++M G + G++++A L+ + RG ++ +++VEG+D ++
Sbjct: 231 LLMTGALTNGQ---------GGDGMKYAARLK---ATTRGGKLNYKNNEIRVEGADEVIM 278
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+L AS+++ + PS DP + + L + Y L H DY LF +VS+ L
Sbjct: 279 ILTASTNYKQEY--PSFVGDDPRLTTQNQLSKASSKPYPTLLKNHTVDYAALFGKVSLNL 336
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKS-FQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
S + + DT+P+ R+++ + +D L E+ FQFGRYLLISSSR G+
Sbjct: 337 S------------DNDPDTIPTDRRLRNQTKNPDDLHLQEVYFQFGRYLLISSSREGSLP 384
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIW + W+ H NIN++MNYW + NLSEC PL + L G +A
Sbjct: 385 ANLQGIWCNKIQAPWNCDYHSNINVQMNYWGADIVNLSECFSPLSRLIESLVKPGEISAA 444
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
V Y ASGW + T++W +S G + W L+ GG WLC HLW+HY +T+DR++L+ R
Sbjct: 445 VQYNASGWCVQPITNVWGYTSPGEG-INWGLYVAGGGWLCRHLWDHYTFTLDRNYLQ-RV 502
Query: 492 YPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
YP++ A F LDWL+ + G L + PSTSPE+ FIAPDG + + D II E+
Sbjct: 503 YPVMLNAARFYLDWLVTDPKTGKLVSGPSTSPENSFIAPDGSRGSICMGPSHDQEIIHEL 562
Query: 551 FSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHL 610
F+ +++A++VL KN D L+ K+ +L L KI DG +MEW+++FK+ E++HRH+SHL
Sbjct: 563 FTNVLTASKVL-KNTDPLLAKIDIALRNLATPKIGSDGRLMEWSEEFKETEINHRHVSHL 621
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRL 670
+ L+PG I + P+L AA K+L R + G GWS+ WK LWARL D AY+++K L
Sbjct: 622 YMLYPGSQIDPNRTPELAAAARKSLDVRTDIGTGWSLAWKVNLWARLKDGNRAYQLLKNL 681
Query: 671 FNLVD-PEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPW 729
D + GG Y NLF AHPPFQID NFG TA +AEML+QS + LLPALP
Sbjct: 682 LKSTDNADLNMSNGGGTYPNLFCAHPPFQIDGNFGGTAGIAEMLLQSHNGYIELLPALP- 740
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
D W SG VKGL ARGG + I W++G ++ + N +
Sbjct: 741 DVWKSGEVKGLVARGGFVLDIEWRNGKPQKIVVKPNLT 778
>gi|357008575|ref|ZP_09073574.1| alpha-L-fucosidase [Paenibacillus elgii B69]
Length = 765
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 294/764 (38%), Positives = 429/764 (56%), Gaps = 55/764 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + ++ PA+ + +A+PIG GRLG MV+G V + ++LNED++W G P NPDA +
Sbjct: 8 LALWYSAPARRWEEALPIGGGRLGGMVFGTVGQDKIQLNEDSVWYGGPKKANNPDARANV 67
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
++R L+ G+ EA A + L P + YQ LGD+ L + H K + Y RELD
Sbjct: 68 PEIRRLLMEGKQQEAEHLARMALMSAPKYLHPYQPLGDLLL-YMLGHDK-PPQAYERELD 125
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDNHSYVN 188
L A RV+Y + V +TRE+FSS QV+ +++ + GSL+F+ + D S
Sbjct: 126 LERALVRVRYDMDGVRYTREYFSSAVHQVLAVRLTAARPGSLTFSTHMMRRPFDMGSQKY 185
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G + +IM G C +G++FS +L+ D ++ + D + VEG+D
Sbjct: 186 GEDTMIMYGEC-------------GTEGVRFSVVLKAVAEGD--SVKPIGDF-ISVEGAD 229
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
LLL A ++F DP + + + +L Y +L H +D+ + F RV
Sbjct: 230 AVTLLLAAGTTF---------RHDDPKAVCLEQIARAASLPYEELKRAHTEDHDRYFRRV 280
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
++L++ D ++E + ERVK + +DP LVE FQFGRYLL+S SRPG
Sbjct: 281 GLELAKPEPDAAASLPTDERL------ERVK--EGHDDPGLVETFFQFGRYLLLSCSRPG 332
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ A LQGIWN++ +P W+S +NIN +MNYW + C+L EC EPLFD + + NG
Sbjct: 333 SLAATLQGIWNDNYTPPWESKYTININTQMNYWPAEVCHLQECLEPLFDLIERMRENGRV 392
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G++ HH T++W + + V ++WPMG AWL HLWEHY + +DR FL
Sbjct: 393 TAREVYGCGGFMAHHNTNLWGDTHVEGIPVSASIWPMGAAWLSLHLWEHYRFGLDRSFLA 452
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
RAYP+++ A FLLD+L+E G L T PS SPE++F+ +G + + +MD I
Sbjct: 453 DRAYPVMKEAAQFLLDYLLEDEQGRLLTGPSISPENKFVLSNGVTGNLCMAPSMDSQIAF 512
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
+F A AA VL +E A +++ +++ +L +I G IMEW +D+++ + HRH+S
Sbjct: 513 TLFDACREAAAVLGLDE-AFRQRLAEAMAKLPQPQIGRHGQIMEWLEDYEEADPGHRHIS 571
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYR 665
LF L PG I + + P+L +AA++TL++R G GWS W WARL + + A+
Sbjct: 572 QLFALHPGEMIHLHRTPELAEAAKRTLERRLAHGGGHTGWSRAWIINFWARLGEGDKAFD 631
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
V L Y NLF AHPPFQID NFG TA +AEML+QS +L LLP
Sbjct: 632 NVAALLAQ-----------STYPNLFDAHPPFQIDGNFGGTAGIAEMLLQSHGGELALLP 680
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ALP W SGCV GL+ARGG V++ W D L E I + YS
Sbjct: 681 ALP-KAWPSGCVYGLRARGGYEVAMTWDDHRLTEATIRAGYSGT 723
>gi|372210566|ref|ZP_09498368.1| alpha-L-fucosidase [Flavobacteriaceae bacterium S85]
Length = 793
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 304/796 (38%), Positives = 447/796 (56%), Gaps = 51/796 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + +N P+ + DA+P+GNGRLGAMV+GG E ++ NE+TLW+G P DY N A K+L
Sbjct: 30 LTLWYNQPSNTWNDALPVGNGRLGAMVYGGKTKEVIQFNEETLWSGQPHDYVNRRAFKSL 89
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELD 129
+ +++ + G+ EA A+ K +P + YQ ++ ++F + H + Y+R LD
Sbjct: 90 AKIKNSLWDGKRKEAEEIANKKFMSNPINQSSYQSFANVLIDFKN-HSNVTD--YKRSLD 146
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L A A Y + RE F+S+PDQVIV ++ S G L+F+++LDS ++
Sbjct: 147 LERAIASTVYKLDKAVIKREVFASHPDQVIVVHLTSSVKGILNFDITLDSNHSDYKVSIE 206
Query: 190 NNQIIMEGRCPGKRIPPKANANDDP-KGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N+I+++G+ + N N P I+F A L++ +G ++ K+ ++ +
Sbjct: 207 ENEIVIKGKADNFKRDLDINKNKFPLSKIKFEARLKLV---QKGGELISKNNKVTIKNAT 263
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
LV +++F +N D +P + + N Y+ + H+ D+QK F+R+
Sbjct: 264 EVTCYLVGATNF----VNFKDISGNPHKRCKEYFKKLNNKPYNLVKENHIKDFQKYFNRL 319
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
I L E I P+ ER+ SF D DP+LV LL+Q+GRYLLISSSR G
Sbjct: 320 HIDLG------------ETKISRRPTNERLMSFSQDMDPNLVALLYQYGRYLLISSSRKG 367
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
TQ ANLQGIWN+ +SP W S +NINLEMNYW + NLSE EPL + LS G K
Sbjct: 368 TQPANLQGIWNDRISPPWGSKYTLNINLEMNYWITEVTNLSELSEPLIKLIDDLSNTGEK 427
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ +Y GWV HH TDIW + +A + +WP GGAWL HLW HY +T ++DFL+
Sbjct: 428 IAKEHYNMPGWVAHHNTDIW-RGAAPINRSNHGIWPTGGAWLSQHLWWHYEFTQNKDFLK 486
Query: 489 KRAYPLLEGCASFLLDWLIEGHDG--YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K AYP+L+ + F ++L+E D L + PS SPEH + TMD I
Sbjct: 487 KMAYPILKKASLFFSNYLLEFPDNKELLISGPSNSPEH---------GGLVMGPTMDHQI 537
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
IR +F I A+++L + K+ K + R+ P KI + G + EW +D +P+ HRH
Sbjct: 538 IRNLFRVTIEASKILNVDR-GFRMKLEKKMNRIMPNKIGKHGQLQEWVKDIDNPKDKHRH 596
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SHL+GL PG I P+L +A + TLQ RG+ G GWS WK WARL D +H++++
Sbjct: 597 ISHLWGLHPGSEIHPLTTPELAEACKITLQNRGDGGTGWSKAWKINFWARLLDGDHSFQL 656
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND------ 720
+K L V +K+ +GGLY NLF AHPPFQID NFG T+ + EM++Q+ L +
Sbjct: 657 LKELVVPVKKSVDKNKKGGLYLNLFDAHPPFQIDGNFGITSGITEMILQNHLKNSKGETI 716
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
+ +LPALP + S G + GLKARG VSI WK+ +L +V + S + L Y+
Sbjct: 717 IDILPALP-SRISKGEIFGLKARGNFEVSILWKERELSKVVVKS-----INGGKLNLRYK 770
Query: 781 GTSVKVNLSAGKIYTF 796
+ N + G + TF
Sbjct: 771 KNVITKNTNRGDVLTF 786
>gi|336414990|ref|ZP_08595333.1| hypothetical protein HMPREF1017_02441 [Bacteroides ovatus
3_8_47FAA]
gi|335941851|gb|EGN03702.1| hypothetical protein HMPREF1017_02441 [Bacteroides ovatus
3_8_47FAA]
Length = 815
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 314/765 (41%), Positives = 429/765 (56%), Gaps = 50/765 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA +T+A+P+GN RLG MV+GG SE L+LNE+T+W G P NP A AL
Sbjct: 25 LKLWYSRPATVWTEALPLGNSRLGVMVYGGAGSEELQLNEETVWGGGPHRNDNPKALAAL 84
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R LV G+Y EA + F P + YQ +G + L+F H K + Y R+LD+
Sbjct: 85 PQIRQLVFEGRYREAQEMVAQNFETPRNGMPYQTIGSLMLDFP-GHEKATD--YYRDLDI 141
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y VG V + RE F+S D VI+ +++ ++ G+LSF S S L +
Sbjct: 142 ERAIATTRYKVGEVTYNREVFTSFVDNVIIVRLTANKQGTLSFTASYKSPLQH------- 194
Query: 191 NQIIMEGRCPGKRIPPKANANDD---PKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
E R GKR+ + P I+ E+K + G + + ++V G+
Sbjct: 195 -----EVRKSGKRLVLIGKGTEHEGVPGAIRVETQTEVK---NEGGHVVVTGENIQVNGA 246
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D L + A+++F +N D D +S S L R Y H+ YQ F+R
Sbjct: 247 DAVTLYISAATNF----VNYKDVSGDAHRKSKSYLDIARKKKYEQAREAHIAYYQNQFNR 302
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V + L T E +T RVK F +D SL L+FQ+GRYLLISSS+P
Sbjct: 303 VKLDLG---------TSEEAKRET---HLRVKHFNKGKDVSLATLMFQYGRYLLISSSQP 350
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQGIWN++L WD VNINLEMNYW S NLSE PL L LS G
Sbjct: 351 GGQPANLQGIWNDNLLAPWDGKYTVNINLEMNYWPSEVTNLSETHLPLMQMLKELSETGR 410
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+TA+ Y GWV+HH TDIW + + K W +WP GGAWLC HLW+HY +T D+ FL
Sbjct: 411 ETARTMYGCDGWVLHHNTDIW-RCTGLVDKAFWGMWPNGGAWLCQHLWQHYLFTGDKAFL 469
Query: 488 EKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMA 545
K+AYP+++G + F L +L+E G++ T PS SPEH + K A + + TMD
Sbjct: 470 -KKAYPIMKGASDFFLHFLVEHPKYGWMVTCPSNSPEHGPEGDEKKNAPSTVAGCTMDNQ 528
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I+ ++FS + A ++L EDA+ K L K + RL P +I + EW +D DP H
Sbjct: 529 IVFDLFSNTLQACKILM--EDAVYAKHLQKMIDRLPPMQIGRYNQLQEWLEDVDDPTSEH 586
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHLFGL+P + I+ +P L +AA+ +L RG++ GWSI WK LWARL D A+
Sbjct: 587 RHVSHLFGLYPSNQISPYTDPLLFQAAKNSLIYRGDQATGWSIGWKINLWARLLDGNRAF 646
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+++ + LV+P EG Y NLF AHPPFQID NFG+TA VAEML+QS N ++LL
Sbjct: 647 KIINNMLVLVEPGKS---EGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDNAIHLL 703
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP D W G V+GL ARGG + W L +V I++ N
Sbjct: 704 PALP-DAWRKGRVEGLVARGGFVTDMEWDGAQLSKVIIHARLGGN 747
>gi|424794811|ref|ZP_18220740.1| exported protein [Xanthomonas translucens pv. graminis ART-Xtg29]
gi|422795776|gb|EKU24406.1| exported protein [Xanthomonas translucens pv. graminis ART-Xtg29]
Length = 775
Score = 526 bits (1354), Expect = e-146, Method: Compositional matrix adjust.
Identities = 309/785 (39%), Positives = 439/785 (55%), Gaps = 60/785 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P+A AL
Sbjct: 30 LTLWYPRPATQWVEALPLGNGRLGAMVWGGIAHERLQLNEDTLYAGQPYDATSPEALAAL 89
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +G+Y EA A A KL P YQ L D+ L++D + + YRRELD
Sbjct: 90 PQVRALIFAGRYVEAEALADAKLLSRPRKQMPYQPLADLLLDYDRAD---GIDGYRRELD 146
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA A ++ RE F S +Q I+ ++S G ++ + +DS +
Sbjct: 147 LDTALASTRFVSDGATHLREVFVSATEQCILVRLSCDHPGRIALRIGIDSP-QAGEVTHE 205
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ GR A G++F+ + + S G + +E +++++G+D
Sbjct: 206 QGALLFAGR--------NAGFAGIEGGLRFALRVLPRAS---GGSTRIERGRIRIDGADE 254
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
VLLL A++S+ D DP + S + L++ LSY+ L RHL ++++LF RV+
Sbjct: 255 VVLLLTAATSYR----RYDDVGGDPLALSAAQLRTAAALSYAQLRERHLAEHRRLFRRVA 310
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
I L S +P+ ERV+ + DP+L L Q+GRYLLISSSRPG+
Sbjct: 311 IDLGSSAAA------------QLPTDERVRRYADGNDPALAALYHQYGRYLLISSSRPGS 358
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQG+WNE + P W S VNIN EMNYW S L EC EPL L L+ G+ T
Sbjct: 359 QPANLQGVWNELMQPPWQSKYTVNINTEMNYWPSEANALHECVEPLEAMLFDLAETGAHT 418
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
AQ Y A GWV+H+ TD+W ++ G V W+LWPMGG WL LW+ ++Y DR +L +
Sbjct: 419 AQAMYAAPGWVVHNNTDLWRQAGPVDG-VKWSLWPMGGVWLLQQLWDRWDYGRDRAYL-R 476
Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
R YPL +G A F + L+ + G + TNPS SPE+ P G C MD ++R
Sbjct: 477 RIYPLFKGAAEFFVATLVRDPQSGAMVTNPSLSPENRH--PFGAALCA--GPAMDAQLLR 532
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHHRH 606
++F+ I +L + A E++ +L P +I G + EW QD+ + PE+HHRH
Sbjct: 533 DLFAQCIKMGALLGVDA-AFGERLATLRTQLPPDRIGRAGQLQEWQQDWDMQAPELHHRH 591
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SHL+ L P I + P L AA ++LQ+RG+ GW + W+ LWARLHD EHA+R+
Sbjct: 592 VSHLYALHPSSQINLRDTPALAAAARRSLQRRGDSATGWGLGWRLNLWARLHDGEHAHRI 651
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
L L+ PE Y NLF AHPPFQID NFG TA + EML+QS + ++LLPA
Sbjct: 652 ---LALLLSPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWGDSIWLLPA 701
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKV 786
LP W G V+GL+ RG V + W+DG L Y+ S+ + TL Y G ++
Sbjct: 702 LP-QAWPQGQVRGLRVRGAAGVDLAWRDGRLQ----YARLSSERGGHY-TLAYGGQTLTA 755
Query: 787 NLSAG 791
+LS G
Sbjct: 756 DLSPG 760
>gi|149199357|ref|ZP_01876394.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
gi|149137599|gb|EDM26015.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
Length = 840
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 300/776 (38%), Positives = 419/776 (53%), Gaps = 48/776 (6%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
++ E+ + N L + + PA H+ +A+P+GNGRLGAMV+GG+ E L+LNEDT+W+G P
Sbjct: 60 LSGEAVAPANDLSLWYRKPASHWVEALPVGNGRLGAMVYGGINKEWLQLNEDTMWSGEPV 119
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATAASVKL-----FGHPADVYQLLGDIELEFDDSH 116
+ P+ +++ R L+ +Y EA + G YQ++ D+EL F
Sbjct: 120 ERDKPNVQAGIAEARKLLFDEKYVEAQKVVEEKVMGTSLGRGTHNYQMMADLELIFPK-- 177
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+ YRR+L+L A + V+Y + RE FSS DQ I ++S E +SF+ S
Sbjct: 178 -RDEVSNYRRDLNLENAISSVQYEFAGTTYKRELFSSAVDQAIYLRLSSDEKAKISFSAS 236
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
L + + N ++++G+ + KG+ F +K+ ++ G I
Sbjct: 237 LTRPQSSQLKMMENGALVLKGQARTSKKKVIEQFPSAAKGVAFET--HLKVLNEGGKIFY 294
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
ED ++VE +D L+LVASS + G K T+ L SY T
Sbjct: 295 EEDS-IRVENADAVTLVLVASSDYYG--------DKKLTASCQKQLNHATQKSYHQARTD 345
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ DYQKLF RV + L SP + ID + + D L E FQ+
Sbjct: 346 HIQDYQKLFKRVDLDLGASPS--AHKPTDQRLIDLI---------KGQYDAQLFEQYFQY 394
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISSSRPGT ANLQG+W + L P W+S H+NIN +MNYW + NLSEC P F
Sbjct: 395 GRYLLISSSRPGTMPANLQGLWTDGLMPAWNSDFHININFQMNYWHAETTNLSECHMPAF 454
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
L L G + AQ N+ GW H TD W +S GK + +WP+GGAW HLWE
Sbjct: 455 YLLERLQERGREVAQKNFGCRGWTAGHTTDAWFFASLI-GKPQYGMWPVGGAWCSRHLWE 513
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLAC 535
HY + D+DFL RAYP+++G A F +DWL+E G L + PSTSPE+ F PDGK A
Sbjct: 514 HYEFNGDKDFLRNRAYPIMKGAALFCMDWLVENPATGLLVSGPSTSPENRFKTPDGKEAN 573
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
++ TMD I+R++F+ I +AE+L +++ E L L +L PTKIA+DG IMEWA+
Sbjct: 574 LTMGPTMDHQIMRDLFTNTIKSAEILNIDQEFRKELNL-ILQKLSPTKIAKDGRIMEWAE 632
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTA 652
+ ++ + HRH+SHL+GL+P I + P L +AA K+L R G GWS W
Sbjct: 633 ELEEVDPGHRHISHLYGLYPAKEINTARTPKLAQAARKSLDHRLSSGGGHTGWSRAWIIN 692
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
ARL+D E ++ + L NLF HPPFQID NFG TA +AEM
Sbjct: 693 FLARLNDGEKSHENLLALLT-----------KSTLPNLFDNHPPFQIDGNFGGTAGIAEM 741
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
L+QS + LPALP W +G VKGL+ARG V + WK+G L++ I S N
Sbjct: 742 LLQSHAGAIEFLPALP-AVWKNGSVKGLRARGAFEVDVDWKEGALYKAKIKSLKGN 796
>gi|365118140|ref|ZP_09336940.1| autotransporter-associated beta strand [Tannerella sp.
6_1_58FAA_CT1]
gi|363651034|gb|EHL90117.1| autotransporter-associated beta strand [Tannerella sp.
6_1_58FAA_CT1]
Length = 1402
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 307/785 (39%), Positives = 453/785 (57%), Gaps = 60/785 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA ++ +A+P+GNGRL AMV+G + +T+++NEDT W+G P + NP+A L
Sbjct: 26 LKLWYDRPADYWVEALPLGNGRLAAMVYGTILQDTIQINEDTYWSGSPYNNANPNAKTHL 85
Query: 73 SDVRSLVDSGQYAEATA-------ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
+ +R ++ G+YAEA A + GH +Y+ +G++ L+F +SH Y
Sbjct: 86 NQIREYINDGEYAEAQKIALANIIADRNITGHGM-IYESIGNLLLDFPESH--KTPTNYY 142
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH- 184
RELDL+ A A+V Y+V V++TRE F+S D +I+ KIS S+ G ++FN S L ++
Sbjct: 143 RELDLSNAIAKVTYTVDGVDYTREAFTSFTDDLIIIKISASKQGMVNFNTSFVGPLKSNR 202
Query: 185 -----SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA-LE 238
V+G N I PGK A ++ + I++ + GT SA
Sbjct: 203 VKASTEIVSGTNNTIRVKNTPGKT------AEENIPNL-LRPTTYIRVVAEGGTQSADSS 255
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+K LKV +D A + + ++++F IN D D ++++S L + Y H+
Sbjct: 256 NKILKVSDADVAYIYISSATNF----INYKDISGDSDAKALSYLNKF-DKDYEQAKNDHI 310
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
YQ+ F RVS+ D+ ++ E+ P+ +R++ F DPSL L FQFGR
Sbjct: 311 TRYQEQFGRVSL-------DLGNNSVQEKK----PTDKRIEEFSNTNDPSLASLYFQFGR 359
Query: 359 YLLISSSRPGTQVANLQGIWNEDLS--PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSS+PG+Q ANLQGIWN + P WDS NIN+EMNYW + NLSEC +P
Sbjct: 360 YLLISSSQPGSQPANLQGIWNPNAGQYPAWDSKYTTNINVEMNYWPAEVTNLSECHQPFL 419
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ + +S+ G ++A+ Y GW +HH TD+W +S+ K +WP AW C+HLWE
Sbjct: 420 EMVKDVSVTGQESAETMYGCRGWTLHHNTDLW-RSTGAVDKSACGIWPTCNAWFCSHLWE 478
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH-----EFIAPD 530
HY +T D++FL + YP+L+ F D+LI + GY +PS SPE+ ++
Sbjct: 479 HYLFTGDKEFLSE-VYPILKSACEFYQDFLITDPKTGYKVVSPSNSPENHPGLFSYVDDS 537
Query: 531 GKLACVSYSS--TMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAE 586
G V+ S TMD ++ ++ I AAE+L K+ D A ++K+ LP P + +
Sbjct: 538 GNKQNVALFSGVTMDNQMVFDLLKNTIDAAEILGKDADFAADLKKLKDQLP---PMHVGK 594
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
G + EW +D+ HRH+SHL+G+FPG+ I+ NP L +AA+K+L+ RG+ GWS
Sbjct: 595 YGQLQEWLEDWDKETSGHRHVSHLWGMFPGNQISPYTNPQLFQAAKKSLEGRGDASRGWS 654
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHE-KHFEGGLYSNLFAAHPPFQIDANFGF 705
+ WK LWARL D HAY++++ L DP +GG Y+N+F AHPPFQID NFG
Sbjct: 655 MGWKVCLWARLLDGNHAYKLIQNQLKLKDPNATIDDPDGGTYANMFDAHPPFQIDGNFGC 714
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYS 764
A +AEML+QS ++LLPALP D WS G VKGLKARGG E V + WK G++ V I S
Sbjct: 715 CAGIAEMLLQSHDGTVHLLPALP-DAWSEGNVKGLKARGGFEIVDMQWKWGEIVSVTIKS 773
Query: 765 NYSNN 769
+ N
Sbjct: 774 SIGGN 778
>gi|146301819|ref|YP_001196410.1| hypothetical protein Fjoh_4083 [Flavobacterium johnsoniae UW101]
gi|146156237|gb|ABQ07091.1| Candidate alpha-L-fucosidase; Glycoside hydrolase family 95
[Flavobacterium johnsoniae UW101]
Length = 816
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 301/761 (39%), Positives = 436/761 (57%), Gaps = 41/761 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA + +A+P+GNGRLGAMV+G E L+LNE+T+W G P + + KAL
Sbjct: 25 LKLWYDKPASIWNEALPLGNGRLGAMVFGDPAVERLQLNEETIWAGSPNGNAHNKSIKAL 84
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELD 129
VR L+ G++ EA A+ + D YQ G + + F H KYA+ Y R+LD
Sbjct: 85 PIVRQLIFDGKFDEAQDLATQDIMSQTNDGMPYQTFGSVYISFA-GHQKYAD--YYRDLD 141
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
++ ATA+VKY V VEFTRE ++ DQVIV K+S S+ G ++ NV ++S +D
Sbjct: 142 ISNATAKVKYKVNGVEFTREILTAFSDQVIVVKLSASQPGQITCNVFMNSPIDKTVASTE 201
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
NQII+ G N ++F L K + G I A + L + +D
Sbjct: 202 GNQIILSGVG--------TNFEGVKGKVKFQGRLTAK--NKGGEIDA-SNGVLSINKADE 250
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
L + +++F N D D ++S L + + H+D YQK F+RVS
Sbjct: 251 VTLYISIATNFK----NYQDISGDEIAKSKDYLAKAEVKDFETIKKAHVDYYQKFFNRVS 306
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + D+V P+ ER++ F DP L L FQFGRYLLISSS+PG
Sbjct: 307 LNLGSN--DLVKK----------PTNERIRDFSKQFDPQLASLYFQFGRYLLISSSQPGG 354
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ ++P WDS NIN EMNYW + NL E EP L++ G++T
Sbjct: 355 QPANLQGIWNDMVTPPWDSKYTTNINAEMNYWPAQVTNLQEMHEPFVQMAKELAVTGAET 414
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y ASGWV+HH TDIW + +A +WP GGAW+C LWE Y YT D+ +L +
Sbjct: 415 AKTMYNASGWVLHHNTDIW-RVTAPVDSAASGMWPTGGAWVCQDLWERYLYTGDKKYLVE 473
Query: 490 RAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
YP+++G A F LD++ I+ + YL PS+SPE+ GK A ++ +TMD ++
Sbjct: 474 -IYPIMKGAADFFLDFMVIDPNTKYLVVVPSSSPENTHAGGTGK-ATIASGTTMDNQLVF 531
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
++F+ +I A+ ++ + A +KV +L ++ P KI + + EW D+ +P+ +HRH+S
Sbjct: 532 DLFTHVIEASALVSPDV-AYAKKVSDALAKMPPMKIGKYNQLQEWQDDWDNPKDNHRHVS 590
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVK 668
HL+GL+P + I+ K P+L +AA+++L R +E GWS+ WK LWARL D HAY++++
Sbjct: 591 HLYGLYPSNQISAIKTPELFEAAKQSLIYRTDESTGWSMGWKVNLWARLLDGNHAYKLIQ 650
Query: 669 RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALP 728
+LV + K GG Y N+ AH PFQID NFG TA AEML+QS ++LLPALP
Sbjct: 651 DQLHLVTADQRKG--GGTYPNMLDAHQPFQIDGNFGCTAGFAEMLMQSQEEAIHLLPALP 708
Query: 729 WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
W G +KGL ARGG + + WK+ + E+ IYS N
Sbjct: 709 -TVWKDGSIKGLVARGGFVIDMTWKNNKVSELKIYSKIGGN 748
>gi|294626600|ref|ZP_06705197.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292599020|gb|EFF43160.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 830
Score = 525 bits (1353), Expect = e-146, Method: Compositional matrix adjust.
Identities = 311/790 (39%), Positives = 437/790 (55%), Gaps = 68/790 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+PDA AL
Sbjct: 85 LQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSPDALAAL 144
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +G+YAEA A KL P YQ LGD+ L+FD + YRR+LD
Sbjct: 145 PQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISDYRRQLD 201
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA A + G RE F S Q IV ++S G +S V +DS N
Sbjct: 202 LDTAVATTTFRSGGAVHRREVFVSAQAQCIVVRLSCDRPGGISLRVGIDSP-QNGEVTAE 260
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKKLKVEGS 247
++ GR N GI+ +++ G +S + D+ L++E +
Sbjct: 261 QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVSGGKLSQVRDR-LRIEAA 307
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D VLLL A++S+ + D DP + + ++L+ L + L HL D+Q+LF R
Sbjct: 308 DEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRRAAKLDFPALSRAHLADHQRLFRR 363
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V+I L S D P+ ERV+ F DP+L L Q+GRYLLI SSRP
Sbjct: 364 VAIDLGSS------DALQR------PTDERVQRFAEGNDPALAALYHQYGRYLLICSSRP 411
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L L+ G+
Sbjct: 412 GTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFDLAKTGA 471
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y DR +L
Sbjct: 472 HTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYGRDRAYL 530
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YPL +G A F + L+ + G + TNPS SPE++ P G C S MD +
Sbjct: 531 SK-IYPLFKGAAEFFVATLVRDPQTGAMVTNPSISPENQH--PFGAAVCAGPS--MDAQL 585
Query: 547 IREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEV 602
+R++F+ I+ +++L + + + + LP P +I + G + EW QD+ + PE+
Sbjct: 586 LRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQDWDMQAPEI 642
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HHRH+SHL+ L P I + P+L AA ++L+ RG+ GW I W+ LWARL D EH
Sbjct: 643 HHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGIGWRLNLWARLADGEH 702
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS ++
Sbjct: 703 AYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWGGSVF 752
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALP W G V+GL+ RGG +V + W+ G L + ++S L Y G
Sbjct: 753 LLPALP-KAWPRGSVRGLRVRGGASVDLEWEGGRLRQARLHSERGGR-----YQLSYAGQ 806
Query: 783 SVKVNLSAGK 792
++ + L AG+
Sbjct: 807 TLDLELGAGR 816
>gi|254472686|ref|ZP_05086085.1| large secreted protein [Pseudovibrio sp. JE062]
gi|211958150|gb|EEA93351.1| large secreted protein [Pseudovibrio sp. JE062]
Length = 835
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 304/794 (38%), Positives = 436/794 (54%), Gaps = 66/794 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDV 75
++ PA H+ +A+P+GNGRLGAMV+G S + LNEDTL++G P Y P+ + V
Sbjct: 17 YDTPAAHWNEALPLGNGRLGAMVYGNPLSARIHLNEDTLYSGEPTRIYPVPEIAHQIDHV 76
Query: 76 RSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEF-DDSHLKYAEETYRRELDLNTA 133
+L+ G+ EA K + G YQ +G++ + DDS + YRR LD+ +
Sbjct: 77 EALLRDGKLFEAQEFVRKNWTGRQGQAYQPVGNLFITMADDSPVS----NYRRALDIRHS 132
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL--LDNHSYVNGNN 191
Y +F R F+S PD VIV +++ + +LSFN+ DS ++ N
Sbjct: 133 LHHESYEQNGTKFERTSFASFPDNVIVVRLTADKPCALSFNLRYDSPHPTCRTTHEGENT 192
Query: 192 QIIMEGRCP---------------------------GKRIPPKANANDDPKG-------- 216
++ + G+ P GK P N D +G
Sbjct: 193 RLHLRGQAPAFTSSRVIERIEHDLEQHRNPEIFGADGKLRPFAENEQDGHRGGIVYSEDG 252
Query: 217 ----IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
F A L +++ R E +L +EG+ L + ++SF+GP +PS K
Sbjct: 253 LGEGTYFEAGLSVELEGGR---IRPERGELHIEGATAVTLRIAMATSFNGPDKSPSREGK 309
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP S L + ++SY+D+ +H DD +LF R+S++L D ++D +
Sbjct: 310 DPAPIVKSILNAAGSVSYADMLQKHSDDVLRLFDRISLKLG---NDAISD---------L 357
Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
P++ R++ FQ DP+L L FQ+GRYLLI+SSR G+Q NLQGIWN P W S +
Sbjct: 358 PTSTRLEQFQEKGDPALAALQFQYGRYLLIASSRAGSQPPNLQGIWNNLRRPQWSSNYTM 417
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
NINLEMNYW + LS+ EPLF + L+++G++TA+ + A GW H T IW S
Sbjct: 418 NINLEMNYWPAEITGLSDLHEPLFMLIEELAVSGARTAKKMFNAPGWCAFHNTTIWRDSV 477
Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
A WPM WL +H+WEH+ YT D++FL+ RAYPL++ A F WL E DG
Sbjct: 478 PSPCDPASAFWPMAAGWLLSHMWEHFLYTGDKEFLKNRAYPLMKSAAEFYEWWLCENKDG 537
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
YL STSPE+ ++ DG + V STMD AIIRE F+ +AA++L + + L +
Sbjct: 538 YLVPKVSTSPENRYLDEDGHVITVDQGSTMDCAIIRETFANTATAAKLLGLDAE-LANTL 596
Query: 573 LKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAE 632
+ RL P +I G + EW+QDFK+ HRHLSHL+GLFP I + PDL KA+
Sbjct: 597 EEKAARLLPYQIGAQGQVQEWSQDFKEFMPTHRHLSHLYGLFPCDQIG-KDTPDLLKASV 655
Query: 633 KTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFA 692
++L+ RG+ GWS+ WK LWAR+ D +HAY+++ +FN V+ E K +GGLY NL
Sbjct: 656 RSLEIRGDLATGWSMGWKICLWARVGDGDHAYKIIHNMFNRVENEAPKSEDGGLYGNLMI 715
Query: 693 AHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
AHPPFQID NFG+T VAEML+ +T N + LLPALP W G V+GL+ARGG V + W
Sbjct: 716 AHPPFQIDGNFGYTRGVAEMLMNTTHNGIELLPALP-SAWPEGEVRGLRARGGFEVDLNW 774
Query: 753 KDGDLHEVGIYSNY 766
+ + I S++
Sbjct: 775 QHSKPTQAKIISHH 788
>gi|227538538|ref|ZP_03968587.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33300]
gi|227241457|gb|EEI91472.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33300]
Length = 826
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 310/769 (40%), Positives = 438/769 (56%), Gaps = 51/769 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N LK+ ++ PA ++ +A+PIGNGRLGAMV+G E ++LNE+T+W G PG+ + +A
Sbjct: 28 NSLKLEYDKPAGNWNEALPIGNGRLGAMVFGQPDLEQIQLNEETIWAGGPGNNVSKNAYD 87
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEET 123
+ +R L+ G+ EA S F PA YQ GD+ + F D H +Y+ +
Sbjct: 88 KIQQIRRLLFEGKAKEAQDLSNATFPRPAPTGIDYGMPYQTFGDLRISFPD-HKQYS--S 144
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y RELD+ A R +Y G V +TRE F+S D V++ K+S SLSF++ L S DN
Sbjct: 145 YSRELDIQDAITRTRYKAGAVNYTREVFASLKDDVVIIKLSADTKKSLSFSIGLTSPHDN 204
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
N Q+ + G + +++ G IQF+ I+ + +G +D +L
Sbjct: 205 THITVENKQLTLSG---------ISGSHEGKTGQIQFTGIVRPIL---KGGKLIQKDNQL 252
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+V +D +L + ++F N +D + T+++++ L Y H+ YQ
Sbjct: 253 EVTHADEVILYISIGTNFK----NYNDITGNATAKALNILNKASGNKYGKAKADHIQKYQ 308
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ F+RVS+ L SP+ S++ D R++ F +DP LV L FQFGRYLLI
Sbjct: 309 QYFNRVSLYLGESPQ-------SKKMTDI-----RIREFGGADDPELVTLYFQFGRYLLI 356
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSS+PG Q A LQGIWN+ LSP WDS VNIN EMNYW + NL E EPLF L L
Sbjct: 357 SSSQPGGQPATLQGIWNDKLSPPWDSKYTVNINTEMNYWPAEVTNLKELHEPLFAMLKDL 416
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
++ G ++A+ Y A GW IHH TD+W S G + +WPMGGAWL HLW+H+ Y+
Sbjct: 417 AVTGQESAKELYHARGWNIHHNTDLWRISGVVDGG-FYGMWPMGGAWLSQHLWQHFLYSG 475
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR FL K Y +L+G A F LD L E H +L PS SPE+ ++ G VS +
Sbjct: 476 DRSFL-KEYYHVLKGKALFYLDVLQEEPTHQ-WLVVAPSMSPENSYLPGVG----VSAGT 529
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD ++ +VF I A+ VL+++ D L + V +L RL P +I + + EW QD P
Sbjct: 530 TMDNQLVFDVFHNFIQASAVLKQDAD-LRDSVQVALDRLPPMQIGQHNQLQEWLQDLDKP 588
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
HRH+SHL+GLFP I+ +NP+L +AA+ ++ RG++ GWS+ WK WARL D
Sbjct: 589 ADKHRHISHLYGLFPSGQISPFRNPELLEAAKNSMIYRGDKSTGWSMGWKVNWWARLLDG 648
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
+ AY+++K + P E GG Y NL AHPPFQID NFG T+ +AEML+QS +
Sbjct: 649 DQAYKLIKDQLSPA-PMEESGQSGGTYPNLLDAHPPFQIDGNFGCTSGIAEMLLQSYDGN 707
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+YLLPALP ++G V GLKARGG V + WKD + +V I S N
Sbjct: 708 IYLLPALP-RALANGKVTGLKARGGFEVDMEWKDNKVKKVVIRSALGGN 755
>gi|312621675|ref|YP_004023288.1| alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
gi|312202142|gb|ADQ45469.1| Alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
Length = 752
Score = 525 bits (1352), Expect = e-146, Method: Compositional matrix adjust.
Identities = 316/816 (38%), Positives = 451/816 (55%), Gaps = 74/816 (9%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
MN++S LKI F+ PA + +A+PIGNG LGAM++GGV ET++LNE+++W+ P
Sbjct: 1 MNSQS------LKIIFDKPASCWEEALPIGNGSLGAMIYGGVEYETIQLNEESIWSCGPR 54
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLK 118
NPDA K L ++R + G A SV H Y+ LG +++ F+
Sbjct: 55 RRENPDAIKYLPEIRKSILEGNIKRAEELSVFALSGTPHSQGNYEPLGYLDIYFEGIEAD 114
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL----SFN 174
E Y R LD++ AT +V++ V ++ + + +FSS PD+VIV KI ++ G+L F
Sbjct: 115 KVER-YTRYLDISNATCKVEFDVDDIRYEKIYFSSYPDKVIVVKICCNKKGALFLRAKFR 173
Query: 175 VSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
+D V+ N++I +E R G+ FSA+L+ +S D G +
Sbjct: 174 REYQEDIDRCGRVD-NDKIFIECSAGSGR------------GVSFSAVLK-AVSKD-GDV 218
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ D L V+ + VLL+ +++S+ KD + + L+ + +LY
Sbjct: 219 YTIGDN-LFVKDATEVVLLITSTTSYKA---------KDYFNWCVKTLEQASKHDFEELY 268
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
RH +DY+ LF RV + + T+ + E I+ + ER K D L+ LLF
Sbjct: 269 KRHTEDYKSLFDRVEFYIDTENTNKRTELTTPERINLL--KERYK------DEELIVLLF 320
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISSSRPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC P
Sbjct: 321 QFGRYLLISSSRPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMP 380
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LFD L + NG TAQ Y G+ HH TDIW ++ + WPMG AWLC H+
Sbjct: 381 LFDLLEKMYENGKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHI 440
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
+HY YT D DFL K+ Y L+ A FLLD+LIE +GYL T PS SPE+ + +G +
Sbjct: 441 LDHYEYTGDLDFL-KKYYYLMREAALFLLDYLIEDKNGYLVTCPSCSPENSY-KLNGDVY 498
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
++Y TMD+ II +F I A +VL+ N D +VEK+ +L +L P KI + G I EW
Sbjct: 499 SMTYMPTMDIQIITALFDKIKKANDVLKLN-DEIVEKIEYALNKLPPLKIGKYGQIQEWI 557
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKT 651
+D+++ E HRH+SHLFGL+P + IT EK P L +AA+KTLQ+R E G GWS W
Sbjct: 558 EDYEEAEPGHRHISHLFGLYPENQITFEKTPQLFEAAKKTLQRRLEHGSGHTGWSRAWII 617
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WARL + AY + L + NL HPPFQID NFG TA +AE
Sbjct: 618 CFWARLKEGNKAYENILEL-----------LKKSTLPNLLDNHPPFQIDGNFGTTAGIAE 666
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDH 771
M++QS + + LLPALP D W SG +KGL+ARGG + I W++G L + I +
Sbjct: 667 MIMQSCDDTIELLPALPSD-WKSGYIKGLRARGGHIIDIYWENGVLKKAEIILGFRET-- 723
Query: 772 DSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQ 807
L Y+G+ +++ + G+ + + C N +
Sbjct: 724 ---VVLKYKGSYIEIKGNIGE----EKVISCDNFSK 752
>gi|289669688|ref|ZP_06490763.1| hypothetical protein XcampmN_14597 [Xanthomonas campestris pv.
musacearum NCPPB 4381]
Length = 790
Score = 525 bits (1351), Expect = e-146, Method: Compositional matrix adjust.
Identities = 304/794 (38%), Positives = 442/794 (55%), Gaps = 64/794 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAQALQLWYREPANQWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
A AL VR+L+ +G+YAEA A L P YQ LGD+ L+FD +
Sbjct: 99 GALAALPQVRALIFAGRYAEAEQLADATLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q IV ++S G +S V +DS +
Sbjct: 156 YRRQLDLDTAVAITTFRSGEAVHRREVFVSAQAQCIVVRLSCDHPGGISLRVGIDSP-QS 214
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKK 241
++ GR N GI+ +++ G +S + D+
Sbjct: 215 GDVTAEQGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVTGGKLSQVRDR- 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L++E +D VLLL A++S+ + D DP + + ++L+ +L + L HL D+
Sbjct: 262 LRIEAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRKAASLDFPALLHAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S + +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAIDLGSS------------DAAQLPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S + EC EPL +
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANAMHECVEPLESMVFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y ASGWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 426 LAKTGAHTAKAIYDASGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C
Sbjct: 485 RDRAYLSK-IYPLFKGAAEFFVATLLRDPQTGAMVTNPSMSPENQH--PFGAAVCA--GP 539
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--K 598
TMD ++R++F+ I+ +++L + + L +++ +L P +I + G + EW QD+ +
Sbjct: 540 TMDAQLLRDLFAQCIAMSKLLGVDAE-LAQQLATLREQLPPNRIGKAGQLQEWQQDWDMQ 598
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
PE+HHRH+SHL+ L P I + P+L AA ++L+ RG+ GW + W+ LWARL
Sbjct: 599 APEIHHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGLGWRLNLWARLA 658
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D EHAYR+++ L+ P+ Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 659 DGEHAYRILQL---LISPDRT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWG 708
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
++LLPALP W G V+G++ RGG +V + W+ G L + ++S D L
Sbjct: 709 GSVFLLPALP-KAWPRGSVRGMRVRGGASVDLEWEGGRLQQARLHS-----DRGGRYQLS 762
Query: 779 YRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 763 YAGQTLDLELGAGR 776
>gi|311746497|ref|ZP_07720282.1| alpha-L-fucosidase 2 [Algoriphagus sp. PR1]
gi|126575394|gb|EAZ79726.1| alpha-L-fucosidase 2 [Algoriphagus sp. PR1]
Length = 826
Score = 525 bits (1351), Expect = e-146, Method: Compositional matrix adjust.
Identities = 293/766 (38%), Positives = 450/766 (58%), Gaps = 48/766 (6%)
Query: 12 PLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PL + + PA +T+A+PIGNG+LGAMV+G V +E ++LNE T+W+G P NPDA
Sbjct: 32 PLTLWYEQPAGEVWTNALPIGNGKLGAMVYGNVENELIQLNEHTVWSGGPNRNDNPDALA 91
Query: 71 ALSDVRSLVDSGQYAEA---TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+ EA + +++ +Q +GD+ + F+ H + YRRE
Sbjct: 92 ALPEIRRLIFEGKQKEAEELASKTIQTKKSNGQKFQPVGDLNIAFE-GHTTFT--NYRRE 148
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY- 186
LD+ A ++V Y V V +TRE +S + VI ++ S+ G +SF S+ + N S
Sbjct: 149 LDIERAVSKVTYEVDGVVYTREAIASFAENVIAVHLTASKPGMISFIASMTTPQPNASIA 208
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVE 245
+N +N++ + G ++ KG I+F ++ +IK + T + + V+
Sbjct: 209 LNSDNELAISGTT---------TDHEGVKGKIKFKSLTKIKNIGGKLTSTG---TSIAVK 256
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+D A + + +++F+ N D + D S + L + S++DL +L DYQ F
Sbjct: 257 NADEATIYIAIATNFN----NYLDLEGDENSRAKGFLVNATTQSFNDLLKTNLVDYQNYF 312
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RVS+ L E + +P+ ER+++F+T DPSLV L +Q+GRYLLISSS
Sbjct: 313 NRVSLSLG------------ETDASKLPTDERLRNFRTGNDPSLVSLYYQYGRYLLISSS 360
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PG Q ANLQGIWN+++SP WDS +NIN +MNYW + NL+E EP ++ ++
Sbjct: 361 QPGGQPANLQGIWNKEMSPPWDSKYTININAQMNYWPAEKTNLAELHEPFLKMVSEMAEA 420
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+V Y A GW+ HH TDIW + + + W +W GGAW HLW+H+ Y+ D +
Sbjct: 421 GEETARVMYGARGWMAHHNTDIW-RITGPVDAIFWGIWSGGGAWTSQHLWDHFQYSGDME 479
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+L K YP+L+G A F +D+L+E D +L NP TSPE+ A DG + + +TMD
Sbjct: 480 YL-KSIYPILKGAAMFYVDFLVEHPDKPWLVVNPGTSPENAPAAHDG--SSLDAGTTMDN 536
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
++ + FS +I A+E+L K + A + + +L P +I + G + EW D DP HH
Sbjct: 537 QLVFDAFSTVIQASELL-KIDQAFADTLQLMRDQLPPMQIGKHGQLQEWLDDIDDPNDHH 595
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ + P+L A++ TL +RG+ GWS+ WK WAR+ D HAY
Sbjct: 596 RHISHLYGLYPSNQISPLRTPELYSASKNTLIQRGDVSTGWSMGWKVNWWARMLDGNHAY 655
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
++++ N + P GG Y+NLF AHPPFQID NFG T+ + EMLVQS +++LL
Sbjct: 656 KLIQ---NQLSPVGSNQGGGGSYNNLFDAHPPFQIDGNFGCTSGITEMLVQSANGEIHLL 712
Query: 725 PALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
PALP D W G + G++A+GG E V + W+DG + ++ I SN N
Sbjct: 713 PALP-DVWQDGSITGIRAKGGFEVVELDWEDGQIEKLVIKSNIGGN 757
>gi|224536536|ref|ZP_03677075.1| hypothetical protein BACCELL_01411 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521792|gb|EEF90897.1| hypothetical protein BACCELL_01411 [Bacteroides cellulosilyticus
DSM 14838]
Length = 811
Score = 523 bits (1348), Expect = e-145, Method: Compositional matrix adjust.
Identities = 296/764 (38%), Positives = 430/764 (56%), Gaps = 49/764 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+ LS++R
Sbjct: 19 YDSPAEYWEEALPLGNGRLGAMVYGNPVNEEIQLNEETISAGAPYKNYNPETKNYLSEIR 78
Query: 77 SLVDSGQYAEA-TAASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ G+Y EA T A +L FG P YQ G + L F D +RRELDL
Sbjct: 79 QLIFEGKYPEAQTLAGERLLSKNGFGMP---YQTAGSLRLRFQDQE---GYTNFRRELDL 132
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V V++ RE F+S DQ+++ +++ S+ G L+F +L D +G
Sbjct: 133 EKAVASTTYTVDGVDYKREVFTSFADQLVIIRLTASQPGKLTFTTALTCPQDVDVTTSGK 192
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ + MEG G A ++F L++ + +G ++ D L V ++ A
Sbjct: 193 DAMTMEGVTKGNEFVEGA--------VRFRTDLKLNV---QGGKTSANDSTLVVTRANSA 241
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ L S++F IN D DP + L++ +Y+ H+ +YQK ++RVS+
Sbjct: 242 TIYLAISTNF----INYKDISGDPVKRNKVYLKNAGK-NYTKALQAHISEYQKYYNRVSL 296
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L R+ + P+ RVK F T DP LV L FQFGRYLLISSS+PG Q
Sbjct: 297 DLGRTAQA------------DKPTDIRVKEFATANDPHLVALYFQFGRYLLISSSQPGGQ 344
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG + A
Sbjct: 345 PANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLPEMHEPFLQMIKELYENGQEAA 404
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y GW++HH TD+W + A K WP AWLC HLW+ Y Y+ D+DFL +
Sbjct: 405 REMYGCRGWMLHHNTDLWRMNGA-VDKAYCGPWPTCNAWLCHHLWDRYLYSGDKDFLAQ- 462
Query: 491 AYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMAIIR 548
AYP+++ + F +D+L++ + GY+ PS SPE+ P + ++ TMD ++
Sbjct: 463 AYPIMKSASEFFVDFLVKDPNTGYMVVTPSNSPENS--PPQWRTKANLFAGITMDNQLVF 520
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
++F+ AA +LEK+E + +L +L P ++ + G + EW +D+ +P+ HHRH+S
Sbjct: 521 DLFTNTERAARLLEKDE-LFCDTILSLRKQLPPMQVGQYGQLQEWFEDWDNPKDHHRHIS 579
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVK 668
HL+G FPG I+ +P L +AA TL +RG+ GWS+ WK WAR D HA++++
Sbjct: 580 HLWGFFPGFQISPYSSPVLFEAARNTLIQRGDPSTGWSMGWKVCFWARCLDGNHAFKLIT 639
Query: 669 RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALP 728
NLV PE +K GG Y NLF AHPPFQID NFG TA +AEML+QS ++LLPALP
Sbjct: 640 DQLNLVSPEIQKGQGGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLMQSHDEAIHLLPALP 699
Query: 729 WDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNNDH 771
D W G +KGL+ARGG E +S+ WK+G + I S N H
Sbjct: 700 -DVWKDGEIKGLRARGGFEIISLKWKNGQIESAVIKSTLGGNLH 742
>gi|312135763|ref|YP_004003101.1| alpha-L-fucosidase [Caldicellulosiruptor owensensis OL]
gi|311775814|gb|ADQ05301.1| Alpha-L-fucosidase [Caldicellulosiruptor owensensis OL]
Length = 752
Score = 523 bits (1348), Expect = e-145, Method: Compositional matrix adjust.
Identities = 307/802 (38%), Positives = 449/802 (55%), Gaps = 68/802 (8%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
++ LKI F+ PA + +A+PIGNG LGAM++GGV ETL+LNE+++W+ P NPDA
Sbjct: 2 SSQNLKIIFDKPASCWEEALPIGNGSLGAMIYGGVEYETLQLNEESIWSCGPRRRENPDA 61
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEETYR 125
K L +R + G A SV H Y+ LG +++ F+ E+ Y
Sbjct: 62 LKYLQVIRKSILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEGVKTDKVEK-YT 120
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL----SFNVSLDSLL 181
R LD++ AT +V+++V ++ + + +FSS PD+VIV KI S+ G++ F +
Sbjct: 121 RYLDISNATCKVEFNVDDIRYEKTYFSSYPDKVIVVKICCSKKGAIFLRAKFRREYQEDI 180
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
D V+ N++I E R G+ FSA+L+ +S D G + + D
Sbjct: 181 DRCGRVD-NDKIFFECSAGSGR------------GVSFSAVLK-AVSKD-GDVYTIGDN- 224
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L V+ + +LL+ +++S+ +KD + + L+ + + +LY RH +DY
Sbjct: 225 LFVKNATEVMLLITSTTSY---------KEKDYFNWCLKTLEQVSKHDFEELYKRHTEDY 275
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
+ LF RV + DT + N + + ER+ + +D L+ LLFQFGRYL
Sbjct: 276 KSLFDRVEFYI---------DTANTNNRIELTTPERINLLKEGYKDEELIVLLFQFGRYL 326
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSRPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC LFD L
Sbjct: 327 LISSSRPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEVCNLSECHMSLFDLLE 386
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ NG TAQ Y G+ HH TDIW ++ + WPMG AWLC H+W+HY Y
Sbjct: 387 KMYENGKITAQRMYGCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHIWDHYEY 446
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
T D DFL K+ Y L+ A FLLD+LIE +GYL T PS SPE+ + +G + ++Y
Sbjct: 447 TGDLDFL-KKYYYLMREAALFLLDYLIEDENGYLVTCPSCSPENSY-KLNGDVYSLTYMP 504
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD+ +I +F + A ++L+ N D +VEK+ +L + P KI + G I EW +D+++
Sbjct: 505 TMDIQVISALFEKVKKANDILKLN-DEIVEKIEYALNKFPPIKIGKYGQIQEWIEDYEEA 563
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWARL 657
E HRH+SHLFGL+P + IT EK P L +AA+KTLQ+R E G GWS W WARL
Sbjct: 564 EPGHRHISHLFGLYPENQITPEKTPQLFEAAKKTLQRRLEHGSGHTGWSRAWIICFWARL 623
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
+ AY + L + NL HPPFQID NFG TA++AEM++QS
Sbjct: 624 KEGNKAYENILEL-----------LKKSTLPNLLDNHPPFQIDGNFGVTASIAEMIMQSY 672
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTL 777
+ + LLPALP + W SG +KGLKARGG TV I W++G + + + + L
Sbjct: 673 DDTIELLPALPRN-WESGYIKGLKARGGHTVDIYWENGIFKKAKVILGFKES-----VVL 726
Query: 778 HYRGTSVKVNLSAG--KIYTFN 797
Y+ + +++ + G K+ ++N
Sbjct: 727 KYKKSCIEIRGNQGEEKVISYN 748
>gi|289664854|ref|ZP_06486435.1| hypothetical protein XcampvN_17740 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 792
Score = 523 bits (1348), Expect = e-145, Method: Compositional matrix adjust.
Identities = 304/788 (38%), Positives = 441/788 (55%), Gaps = 64/788 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P A AL
Sbjct: 47 LQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSPGALAAL 106
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +G+YAEA A L P YQ LGD+ L+FD + YRR+LD
Sbjct: 107 PQVRALIFAGRYAEAEQLADATLLSRPLKQMPYQPLGDLLLDFDRAD---GISDYRRQLD 163
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA A + G RE F S Q IV ++S G +S V +DS +
Sbjct: 164 LDTAVAITTFRSGEAVHRREVFVSAQAQCIVVRLSCDHPGGISLRVGIDSP-QSGDVTAE 222
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD--RGTISALEDKKLKVEGS 247
++ GR N GI+ +++ G +S + D+ L++E +
Sbjct: 223 QGGLLFSGR------------NGSFAGIEGKLRFALRVLPQVTGGKLSQVRDR-LRIEAA 269
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D VLLL A++S+ + D DP + + ++L+ +L + L HL D+Q+LF R
Sbjct: 270 DEVVLLLSAATSYQ--RFDAVDG--DPLALTAASLRKAASLDFPALLHAHLADHQRLFRR 325
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V+I L S + +P+ ERV+ F DP+L L Q+GRYLLI SSRP
Sbjct: 326 VAIDLGSS------------DAAQLPTDERVQRFAEGNDPALAALYHQYGRYLLICSSRP 373
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ ANLQGIWN+ + P W+S +NIN EMNYW S + EC EPL + L+ G+
Sbjct: 374 GTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANAMHECVEPLESMVFDLAKTGA 433
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y ASGWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y DR +L
Sbjct: 434 HTAKAIYDASGWVVHNNTDLWRQAGPIDG-AKWSLWPMGGVWLLQQLWDRWDYGRDRAYL 492
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YPL +G A F + L+ + G + TNPS SPE++ P G C TMD +
Sbjct: 493 SK-IYPLFKGAAEFFVATLLRDPQTGAMVTNPSMSPENQH--PFGAAVCA--GPTMDAQL 547
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHH 604
+R++F+ I+ +++L + + L +++ +L P +I + G + EW QD+ + PE+HH
Sbjct: 548 LRDLFAQCIAMSKLLGVDAE-LAQQLATLREQLPPNRIGKAGQLQEWQQDWDMQAPEIHH 606
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+ L P I + P+L AA ++L+ RG+ GW + W+ LWARL D EHAY
Sbjct: 607 RHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGLGWRLNLWARLADGEHAY 666
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
R+++ L+ P+ Y NLF AHPPFQID NFG TA + EML+QS ++LL
Sbjct: 667 RILQL---LISPDRT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWGGSVFLL 716
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
PALP W G V+G++ RGG +V + W+ G L + ++S D L Y G ++
Sbjct: 717 PALP-KAWPRGSVRGMRVRGGASVDLEWEGGRLQQARLHS-----DRGGRYQLSYAGQTL 770
Query: 785 KVNLSAGK 792
+ L AG+
Sbjct: 771 DLELGAGR 778
>gi|418518724|ref|ZP_13084861.1| hypothetical protein MOU_18224 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|418519757|ref|ZP_13085808.1| hypothetical protein WS7_01835 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410702673|gb|EKQ61175.1| hypothetical protein MOU_18224 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|410704417|gb|EKQ62899.1| hypothetical protein WS7_01835 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
Length = 790
Score = 523 bits (1348), Expect = e-145, Method: Compositional matrix adjust.
Identities = 308/794 (38%), Positives = 436/794 (54%), Gaps = 64/794 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
++ GR N GI+ +++ G +S + D+
Sbjct: 216 EVTAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I D S E + +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 426 LAQTGAHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--K 598
MD ++R++F+ I+ +++L + + +L P +I + G + EW QD+ +
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALR-EQLPPNRIGKAGQLQEWQQDWDMQ 598
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
PE+HHRH+SHL+ L P I + P+L AA ++L+ RG+ GW I W+ LWARL
Sbjct: 599 APEIHHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGIGWRLNLWARLA 658
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D EHAYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 659 DGEHAYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWG 708
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
++LLPALP W G V+GL+ RGG +V + W+ G L + ++S D L
Sbjct: 709 GSVFLLPALP-KAWPRGSVRGLRVRGGASVDLEWEGGRLQQARLHS-----DRGGRYQLS 762
Query: 779 YRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 763 YAGQTLDLELGAGR 776
>gi|423223594|ref|ZP_17210063.1| hypothetical protein HMPREF1062_02249 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638219|gb|EIY32066.1| hypothetical protein HMPREF1062_02249 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 823
Score = 523 bits (1347), Expect = e-145, Method: Compositional matrix adjust.
Identities = 296/764 (38%), Positives = 430/764 (56%), Gaps = 49/764 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+ LS++R
Sbjct: 31 YDSPAEYWEEALPLGNGRLGAMVYGNPVNEEIQLNEETISAGAPYKNYNPETKNYLSEIR 90
Query: 77 SLVDSGQYAEA-TAASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ G+Y EA T A +L FG P YQ G + L F D +RRELDL
Sbjct: 91 QLIFEGKYPEAQTLAGERLLSKNGFGMP---YQTAGSLRLRFQDQE---GYTNFRRELDL 144
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V V++ RE F+S DQ+++ +++ S+ G L+F +L D +G
Sbjct: 145 EKAVASTTYTVDGVDYKREVFTSFADQLVIIRLTASQPGKLTFTTALTCPQDVDVTTSGK 204
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ + MEG G A ++F L++ + +G ++ D L V ++ A
Sbjct: 205 DAMTMEGVTKGNEFVEGA--------VRFRTDLKLNV---QGGKTSANDSTLIVTRANSA 253
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ L S++F IN D DP + L++ +Y+ H+ +YQK ++RVS+
Sbjct: 254 TIYLAISTNF----INYKDISGDPVKRNKVYLKNAGK-NYTKALQAHISEYQKYYNRVSL 308
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L R+ + P+ RVK F T DP LV L FQFGRYLLISSS+PG Q
Sbjct: 309 NLGRTAQA------------DKPTDIRVKEFATANDPHLVALYFQFGRYLLISSSQPGGQ 356
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG + A
Sbjct: 357 PANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLPEMHEPFLQMIKELYENGQEAA 416
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y GW++HH TD+W + A K WP AWLC HLW+ Y Y+ D+DFL +
Sbjct: 417 REMYGCRGWMLHHNTDLWRMNGA-VDKAYCGPWPTCNAWLCHHLWDRYLYSGDKDFLAQ- 474
Query: 491 AYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMAIIR 548
AYP+++ + F +D+L++ + GY+ PS SPE+ P + ++ TMD ++
Sbjct: 475 AYPIMKSASEFFVDFLVKDPNTGYMVVTPSNSPENS--PPQWRTKANLFAGITMDNQLVF 532
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
++F+ AA +LEK+E + +L +L P ++ + G + EW +D+ +P+ HHRH+S
Sbjct: 533 DLFTNTERAARLLEKDE-LFCDTILSLRKQLPPMQVGQYGQLQEWFEDWDNPKDHHRHIS 591
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVK 668
HL+G FPG I+ +P L +AA TL +RG+ GWS+ WK WAR D HA++++
Sbjct: 592 HLWGFFPGFQISPYSSPVLFEAARNTLIQRGDPSTGWSMGWKVCFWARCLDGNHAFKLIT 651
Query: 669 RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALP 728
NLV PE +K GG Y NLF AHPPFQID NFG TA +AEML+QS ++LLPALP
Sbjct: 652 DQLNLVSPEIQKGQGGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLMQSHDEAIHLLPALP 711
Query: 729 WDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNNDH 771
D W G +KGL+ARGG E +S+ WK+G + I S N H
Sbjct: 712 -DVWKDGEIKGLRARGGFEIISLKWKNGQIESAVIKSTLGGNLH 754
>gi|442803588|ref|YP_007371737.1| alpha-1,2-L-fucosidase [Clostridium stercorarium subsp.
stercorarium DSM 8532]
gi|442739438|gb|AGC67127.1| alpha-1,2-L-fucosidase [Clostridium stercorarium subsp.
stercorarium DSM 8532]
Length = 761
Score = 523 bits (1347), Expect = e-145, Method: Compositional matrix adjust.
Identities = 307/750 (40%), Positives = 430/750 (57%), Gaps = 63/750 (8%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+P+GNGR+GAM++GGV +E ++LNED++W G P D NP+A + L +R L+ G+ E
Sbjct: 30 ALPLGNGRIGAMIYGGVENELIQLNEDSIWYGGPRDRNNPEAVRYLPTIRKLISEGRIRE 89
Query: 87 A-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A A++ L G P YQ LG++ L F++ YRRELD++ A ARV+Y + +
Sbjct: 90 AENLAAIALSGIPESQRHYQPLGELYLNFENHK---NPSYYRRELDIDNAVARVEYKIVD 146
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPG 201
+TRE F S P QV+ KI S S+SF L + +N +N + M G C G
Sbjct: 147 TLYTREMFVSAPQQVLAIKIKAEGSKSISFRTKLRRSRYFEKVDALN-HNTLKMAGSCGG 205
Query: 202 KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFD 261
+ I + A+L +I + G++ A+ + L V+ S V+ L +++F
Sbjct: 206 E------------GAINYCALL--RIIPENGSVEAI-GEHLVVKNSKSVVIFLSVATTF- 249
Query: 262 GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVT 321
++P ES+ L+ L Y +L H++DY+ LF RV + +T
Sbjct: 250 --------RHEEPEKESLRILEEAEKLRYDELLQNHIEDYRSLFDRVDL--------YIT 293
Query: 322 DTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNE 380
+ +++N+D++P+ ER++ + ++DP LV L FQFGRYLLISSSRPGT ANLQGIWN+
Sbjct: 294 NHSADKNVDSLPTDERLERVKAGNDDPGLVSLYFQFGRYLLISSSRPGTLPANLQGIWNK 353
Query: 381 DLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWV 440
D P WDS +NIN +MNYW + CNLSEC PLFD + + G KTA+V Y G+
Sbjct: 354 DYLPPWDSKYTININTQMNYWPAEVCNLSECHLPLFDLIERMREPGRKTARVMYGCRGFC 413
Query: 441 IHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCAS 500
HH TDIWA ++ WPMG AWLC HLWEHY +T D++FL + AY ++
Sbjct: 414 AHHNTDIWADTAPQDIYFGATYWPMGAAWLCLHLWEHYEFTRDKEFLAQ-AYLTMKEAVE 472
Query: 501 FLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
FLLD+L E G L T+PS SPE+ +I P+G+ + +MD II E+F I A +
Sbjct: 473 FLLDFLTEDDKGRLVTSPSVSPENTYILPNGESGRLCQGPSMDSQIIHELFGVCIKATSI 532
Query: 561 LEKNEDALVE--KVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHT 618
L + + E KVL+ +P+ +I + G I EWA+++++ E HRH+SHLF L+PG
Sbjct: 533 LNIDGEFAAELGKVLERVPK---PEIGKYGQIKEWAEEYEEAEPGHRHISHLFALYPGKQ 589
Query: 619 ITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVD 675
I++ K P+L KAA TL++R G GWS W LWARL D E AY V L
Sbjct: 590 ISVHKTPELVKAARVTLERRLAHGGGHTGWSRAWIINLWARLEDAEKAYENVMAL----- 644
Query: 676 PEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSG 735
NL HPPFQID NFG TA +AEML+QS + LLPALP + WS G
Sbjct: 645 ------LRKSTLPNLLDNHPPFQIDGNFGGTAGIAEMLIQSHEGMITLLPALP-EAWSDG 697
Query: 736 CVKGLKARGGETVSICWKDGDLHEVGIYSN 765
VKGL+ARGG V + WK G L + I S+
Sbjct: 698 YVKGLRARGGFEVEMEWKQGRLVKACIVSD 727
>gi|386819251|ref|ZP_10106467.1| hypothetical protein JoomaDRAFT_1168 [Joostella marina DSM 19592]
gi|386424357|gb|EIJ38187.1| hypothetical protein JoomaDRAFT_1168 [Joostella marina DSM 19592]
Length = 818
Score = 523 bits (1346), Expect = e-145, Method: Compositional matrix adjust.
Identities = 296/764 (38%), Positives = 440/764 (57%), Gaps = 55/764 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA ++ +A+PIGNGR+GAM++GG + ++LNE+T+W G PG+ D + + +R
Sbjct: 27 YDEPADNWNEALPIGNGRIGAMLYGGEKVDQIQLNEETVWAGSPGNNIAKDYYQDVESIR 86
Query: 77 SLVDSGQYAEATAASVKLF----------GHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
L+ +G+Y EA ++++F G P YQ +G+I+L F + H K + +RR
Sbjct: 87 ELLFNGKYTEAQQKALEVFPKNTPDNTNYGMP---YQTVGNIKLAFKN-HNKIS--NFRR 140
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
EL++ A A+V Y V++ R++F S PDQV+ + ++S L+F++ + S H
Sbjct: 141 ELNIENAVAKVSYLADGVQYNRQYFVSYPDQVMAIHLQANKSEKLNFDIEIQSA-QKHVA 199
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
NN + ++G + + P ++FS ++ KI + +S + KL VE
Sbjct: 200 SIENNILHLKGVSETRE--------NKPGKVKFSTLIYPKIIGEGKIVS--REGKLSVEK 249
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +L + ++F +D ++ L +++N S L H++DYQ LF
Sbjct: 250 AQEVLLFISIGTNFK----KYNDLSNAEDEVALKFLNNVKNKSIEALLESHIEDYQDLFK 305
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV ++L + EN+ + + ER+K+F + D SL+ L FQFGRYLLISSSR
Sbjct: 306 RVDLKLGK------------ENLSNLTTDERLKTFSKNHDLSLISLYFQFGRYLLISSSR 353
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
G Q ANLQGIWN LSP WDS VNIN EMNYW + NLSE PLF L LS G
Sbjct: 354 EGGQPANLQGIWNNKLSPPWDSKYTVNINTEMNYWPAEVTNLSELHAPLFSMLEDLSETG 413
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A Y A GW +HH TDIW S G + WPMGGAWL HLW+H+ +T D +F
Sbjct: 414 KESAHKMYHARGWNMHHNTDIWRISGIVDGG-FYGFWPMGGAWLSQHLWQHFLFTGDINF 472
Query: 487 LEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L K+ YP+L+ A F +D L E +G+L PS SPE+++I DG V+Y +TMD
Sbjct: 473 L-KKYYPILKETALFYVDVLQKEPKNGWLVVTPSISPENKYI--DG--VGVTYGTTMDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
++ +VF+ +I+AA+ L + D ++ V + +L P +I + + EW +D+ +P HR
Sbjct: 528 LVFDVFNNVITAAKTLNIDAD-FIKVVEEKKSKLPPMQIGKHAQLQEWIEDWDNPNNKHR 586
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GL+P I+ KNP+L +A+ TL +RG++ GWS+ WK WAR+ + AY+
Sbjct: 587 HISHLYGLYPSAQISPFKNPELFQASRNTLNQRGDKSTGWSMGWKVNFWARMLNGNRAYK 646
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
+++ +V+ + GG Y NLF AHPPFQID NFG TA +AEML+QS L+LLP
Sbjct: 647 LIQEQLTMVE---DGTTSGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLIQSHDEALFLLP 703
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ALP D W G VKGL ARGG V + W L V + S N
Sbjct: 704 ALPSD-WDKGGVKGLMARGGFEVDLNWTHNKLVSVKVKSKLGGN 746
>gi|337748987|ref|YP_004643149.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|336300176|gb|AEI43279.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
Length = 827
Score = 523 bits (1346), Expect = e-145, Method: Compositional matrix adjust.
Identities = 305/770 (39%), Positives = 427/770 (55%), Gaps = 52/770 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + +A+P GNGRLGAMV+GG E + LNEDTLW+G P D DA L R
Sbjct: 12 YEQPAGSWFEALPAGNGRLGAMVFGGTCRERIALNEDTLWSGEPRDTVREDAHLHLDPAR 71
Query: 77 SLVDSGQYAEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTAT 134
L+ G++AEA + P + Y LGD+EL+ D K E T YRREL L+ A
Sbjct: 72 KLIFEGRHAEAEEIIEQYMQGPDIESYLPLGDLELQSD----KEGEITDYRRELILDDAV 127
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
R +Y RE F S DQV+ +I + L+ +SL S L G++ +
Sbjct: 128 IRTQYRTDGALQIRELFVSAADQVLALRIESEQP--LNLTISLGSPLQYAVRRTGSSGMA 185
Query: 195 MEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+ GRCP R+ P +D+P +GI F A L + + ++G I + +++V
Sbjct: 186 LSGRCP-VRVLPNTVRSDEPARYEEGRGIAFEAALHV--TAEKGRIES-SGGRIRVVSGR 241
Query: 249 WAVLLLVASSSFDGPFINPSDSK--KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
LLL A++S+DG +P+ + P + L+ L YS L RHL ++ + +
Sbjct: 242 GVTLLLAAATSYDGFDRDPAAASLAGRPRALCAERLREAAGLGYSRLKERHLREHAEKYG 301
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSS 365
RV ++L + S + D +P+ R+++ Q +DP L L FQ+GRYLL+SSS
Sbjct: 302 RVDLELG------GSAADSGADADALPTDARIRAAAQGADDPGLAALFFQYGRYLLLSSS 355
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGTQ ANLQGIWN+ L P W S+ NIN++MNYW + NL+EC EPL F+ L +
Sbjct: 356 RPGTQPANLQGIWNDKLQPPWCSSWTANINVQMNYWPAEAANLAECHEPLLRFVDDLRES 415
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G + A V+Y GW HH D+W ++ G WA WPM GAWLC HLWEHY ++ D +
Sbjct: 416 GRRAASVHYRCRGWTAHHNIDLWRTATPVGGSPSWAFWPMAGAWLCEHLWEHYAFSRDEE 475
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
+L R YP+L+ A F LDWL+EG DG+L T PSTSPE+ F+ DG CV+Y+STMD+A
Sbjct: 476 YL-ARVYPVLKEAAQFGLDWLVEGPDGFLVTCPSTSPENHFLTADGSQGCVTYASTMDIA 534
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
++R +F + A+ L+K+ A E + ++L R+ P +I G + EWA+DF + E HR
Sbjct: 535 LLRNLFGRCMEASRQLQKD-TAFRELLEQTLRRMPPYRIGRHGQLQEWAEDFGEAEPGHR 593
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEH 662
H +HL L P IT E P+L +A K L++R G GWS W +LWARL + E
Sbjct: 594 HTAHLAALHPLEEITPEGEPELAEACRKALERRLAHGGAHTGWSCAWMISLWARLGEPET 653
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP-------FQIDANFGFTAAVAEMLVQ 715
A+R + L GL+ NL AH FQID + TA + EML+Q
Sbjct: 654 AHRFLGELL------------AGLHPNLTNAHRHPKVKMDIFQIDGSLAGTAGILEMLLQ 701
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
S + LLPALP + W G V+GL+ARGG + + WKDG L + S
Sbjct: 702 SHRGTVRLLPALP-ENWREGRVRGLRARGGFEIDMEWKDGRLIRAALISR 750
>gi|345513950|ref|ZP_08793465.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|423230895|ref|ZP_17217299.1| hypothetical protein HMPREF1063_03119 [Bacteroides dorei
CL02T00C15]
gi|423244606|ref|ZP_17225681.1| hypothetical protein HMPREF1064_01887 [Bacteroides dorei
CL02T12C06]
gi|229435764|gb|EEO45841.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|392630015|gb|EIY24017.1| hypothetical protein HMPREF1063_03119 [Bacteroides dorei
CL02T00C15]
gi|392641455|gb|EIY35231.1| hypothetical protein HMPREF1064_01887 [Bacteroides dorei
CL02T12C06]
Length = 824
Score = 522 bits (1345), Expect = e-145, Method: Compositional matrix adjust.
Identities = 300/763 (39%), Positives = 439/763 (57%), Gaps = 51/763 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+A AL+ +R
Sbjct: 31 YDKPARYWEEALPLGNGRLGAMVYGNPVTEEIQLNEETVSAGSPYKNYNPEAKGALATIR 90
Query: 77 SLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ +G+Y EA A A K+ FG P YQ +G + L+F SH Y +RRELDL
Sbjct: 91 QLIFAGRYPEAQALAGEKILSKNGFGMP---YQTVGSLRLDFP-SHENYT--NFRRELDL 144
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V +++ RE F+S DQ+++ +++ S+ G L+F+ SL V+G
Sbjct: 145 EKAVATTAYTVNGIDYKREVFTSFVDQLVIVRLTASQPGKLTFSASLTCPQKVDVTVSGK 204
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N +I+EG G +D KG I F A L++ D +G S D L V ++
Sbjct: 205 NALILEGTTKG---------DDFTKGSICFRADLKL---DLQGGKSVAGDTLLSVTNANS 252
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A + + +++F +N D +P+ + ++++ +Y+ H+ YQK ++RVS
Sbjct: 253 ATIYIAMATNF----VNYKDISGNPSGRNKVSMKNAGK-NYARALQAHISAYQKYYNRVS 307
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L R+ + P+ R+K F +DP LV L FQFGRYLLISSS+PG
Sbjct: 308 LNLGRTSQA------------DKPTDVRIKEFAISDDPHLVALYFQFGRYLLISSSQPGG 355
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG +
Sbjct: 356 QPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLREMHEPFLQMVKELYENGQEA 415
Query: 430 AQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ Y GWV+HH TD+W + A DR WP AWLC HLW+ Y Y+ D+++L
Sbjct: 416 AREMYGCRGWVLHHNTDLWRMNGAVDRAYC--GPWPTCNAWLCQHLWDRYLYSGDKEYLA 473
Query: 489 KRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
YP+L+ + F +D+L+ + + GYL PS SPE+ GK A + TMD ++
Sbjct: 474 S-VYPILKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPSIWKGK-ANLFAGITMDNQLV 531
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
++FS SAA++L ++ + +L +L P ++ + G + EW +D+ +P HHRH+
Sbjct: 532 SDLFSNTRSAAQILNLDKQ-FCDTILSLKRQLPPMQVGQYGQLQEWFEDWDNPNDHHRHI 590
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHL+GLFPG+ I+ +P L +AA TL +RG+ GWS+ WK WAR D HA++++
Sbjct: 591 SHLWGLFPGYQISPYSSPILFEAARNTLIQRGDPSTGWSMGWKVCFWARCLDGNHAFKLI 650
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
N V PE +K GG Y NLF AHPPFQID NFG A +AEML+QS ++LLPAL
Sbjct: 651 ANQLNFVSPEVQKGQGGGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQSHDGAVHLLPAL 710
Query: 728 PWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
P D W +G ++GL+ARGG E VS+ WKDG + I S N
Sbjct: 711 P-DTWKNGEIRGLRARGGFEIVSLKWKDGKVESAIIKSTIGGN 752
>gi|393788377|ref|ZP_10376507.1| hypothetical protein HMPREF1068_02787 [Bacteroides nordii
CL02T12C05]
gi|392656050|gb|EIY49691.1| hypothetical protein HMPREF1068_02787 [Bacteroides nordii
CL02T12C05]
Length = 809
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 310/791 (39%), Positives = 429/791 (54%), Gaps = 57/791 (7%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ N L + + PA+ + +A+P+GNGRLGAMV+G E ++ NE+TL++G P
Sbjct: 17 VNAQNDLTLWYTTPARVWEEALPLGNGRLGAMVFGDTQKERIQFNENTLYSGEPAALNRS 76
Query: 67 DA--PKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P+ VR L+ G+ AEA + G +VYQ GD+ +F +K
Sbjct: 77 TCILPQ-YEKVRDLLKQGKNAEAEKIMQYEWIGRLNEVYQPFGDVCFDFK---MKGEVTE 132
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y LD+ A +Y G E RE F+S P Q IV + +E L F + L SL
Sbjct: 133 YVHSLDMEQAVVTTRYKQGGTEILREVFASFPGQAIVIHLK-AEKPVLHFEMQLASLHPV 191
Query: 184 HSYVNGNNQIIMEGRCP---------------------------GKRIPPKANANDDPKG 216
H G ++ MEGR P GK I + + G
Sbjct: 192 HLSCEGE-RLQMEGRAPAHVQRRTIEGMRKYNTERLHPEYFDEKGKVIRTEQVIYAEDAG 250
Query: 217 IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
+ F A + + + D G I+ +D +L V+ + LL A++S++G +PS + K+
Sbjct: 251 MAFEAYV-VPLKKD-GVIT-FKDNRLVVKDASEITFLLYAATSYNGFDKSPSKAGKNIAK 307
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
E + + + Y + H+ DYQ LF RV + L SP N P+
Sbjct: 308 ELQAQRKKLAGKEYQQIRNEHVADYQSLFKRVDLALPSSP-----------NQKDKPTDI 356
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
R+K FQT D SL+ LFQ+GRYL+IS SRPG Q NLQG+WN+ + P W+S NINL
Sbjct: 357 RLKEFQTKTDLSLIAQLFQYGRYLMISGSRPGGQPLNLQGLWNDKIIPPWNSGYTTNINL 416
Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG 456
+MNYWQ+ NLSEC +PLF F+ ++ +G + A Y +GW+ HH IW ++ G
Sbjct: 417 QMNYWQAEVTNLSECHQPLFTFIEEIAQSGKEAAHNMYGRNGWIAHHNMSIWREAYPADG 476
Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLET 516
V W W M G WLC+H+WEHY YT D FL + Y +L+ A F +WL++ G T
Sbjct: 477 FVHWFFWNMSGPWLCSHIWEHYLYTKDVAFL-REYYSILKESARFCSEWLVQNTKGEWVT 535
Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
STSPE+ F PDG+ A V STMDMAIIR +F I AAE+L D K+L+
Sbjct: 536 PVSTSPENAFRMPDGREAAVCEGSTMDMAIIRNLFGNTIHAAELL--GVDVEFRKMLEQK 593
Query: 577 PR-LRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+ L +I G ++EW +++K+ E HRHLSHLFGL+PG I I P++ KAA +TL
Sbjct: 594 SKYLAGYRIGSHGQLLEWDKEYKETEPQHRHLSHLFGLYPGCDI-IPDTPEVFKAARQTL 652
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG + GWS+ WKTALWAR ++ E +Y +K L + +DP E GGLY N+ A
Sbjct: 653 IDRGNKTTGWSMAWKTALWARQYEGEQSYAALKNLMSFIDPLVESKKGGGLYRNMLNA-L 711
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG TA +AEML+QS L +++LLPALP + W G V GLKARG TV++ W+DG
Sbjct: 712 PFQIDGNFGITAGIAEMLLQSHLGNIHLLPALPIE-WKKGKVTGLKARGNFTVNMEWEDG 770
Query: 756 DLHEVGIYSNY 766
L I S Y
Sbjct: 771 KLQTATIQSEY 781
>gi|381169519|ref|ZP_09878684.1| hypothetical protein XMIN_121 [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380690109|emb|CCG35171.1| hypothetical protein XMIN_121 [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 790
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 308/794 (38%), Positives = 435/794 (54%), Gaps = 64/794 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEKLADAKLLSRPLKKMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
++ GR N GI+ +++ G +S + D+
Sbjct: 216 EVTAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I D S E + +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 426 LAQTGTHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--K 598
MD ++R++F+ I+ +++L + + +L P +I + G + EW QD+ +
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALR-EQLPPNRIGKAGQLQEWQQDWDMQ 598
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
PE+HHRH+SHL+ L P I + P+L AA ++L+ RG+ GW I W+ LWARL
Sbjct: 599 APEIHHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGIGWRLNLWARLA 658
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D EHAYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 659 DGEHAYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQSWG 708
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
++LLPALP W G V+GL+ RGG +V + W+ G L ++S D L
Sbjct: 709 GSVFLLPALP-KAWPRGSVRGLRVRGGASVDLEWEGGRLQHARLHS-----DRGGRYQLS 762
Query: 779 YRGTSVKVNLSAGK 792
Y G ++ + L AG+
Sbjct: 763 YAGQTLDLELGAGR 776
>gi|300725824|ref|ZP_07059290.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
gi|299776871|gb|EFI73415.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
Length = 802
Score = 522 bits (1344), Expect = e-145, Method: Compositional matrix adjust.
Identities = 305/779 (39%), Positives = 438/779 (56%), Gaps = 59/779 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAPKA 71
+++ ++ PA +F +++PIGNG+LG +V+G +T+ LN+ TLWTG P D A
Sbjct: 23 MQLLYHEPAHYFEESLPIGNGKLGGLVYGNPKHDTIYLNDITLWTGKPVDLDEGKGASLW 82
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL-EFDDSHLKYAEETYRRELDL 130
L ++R + + Y +A + + L G + YQ LG ++L D +Y++ Y+R+LDL
Sbjct: 83 LPEIRKALFAENYRKADSLQLHLQGKNSAFYQPLGTLQLTSLTDE--RYSD--YQRQLDL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN-- 188
+++ ++ Y G V + RE+F+ NPD ++ +ISG + GS+S ++S+ SLL +
Sbjct: 139 DSSLVKISYRQGGVLYQREYFADNPDNMLAIRISGDKKGSVSMDISIGSLLPVQVKASLT 198
Query: 189 -------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
Q+ M G G + F +L+ + GT+ + K
Sbjct: 199 RSLQANTAQGQLTMLGHAQGV----------SSESTHFCTMLQARAQG--GTVQVIHGK- 245
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+VE +D ++ +V +SF G +P ++ L ++N SY +L +RH+ DY
Sbjct: 246 LRVEHADTLIIYIVNETSFAGADKHPVQDGAPYLAQVTDDLWHLQNYSYDELRSRHVADY 305
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYL 360
QK ++RV ++L T + + +DT + K+ Q D L L FQ+GRYL
Sbjct: 306 QKFYNRVKLRLG-------TVDHAPQTVDTWSLLKNYGKNHQAYLDRYLETLYFQYGRYL 358
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LIS SR ANLQG+WN L W VNINLE NYW + NLSE +EP+ DF+
Sbjct: 359 LISCSRTSGVPANLQGLWNHYLEAPWRGNYTVNINLEENYWPAEVANLSEMEEPIHDFMA 418
Query: 421 YLSINGSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWE 476
L+ NG TA Y + GW H +DIWAK++ R W+ W MGGAWL + LWE
Sbjct: 419 SLAQNGHFTAHHFYGIDRGWCSSHNSDIWAKTAPVGEGRESPEWSNWNMGGAWLSSTLWE 478
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLA 534
HY YT D DFL + AYP+L G + F+L WL++ G L T PSTSPE+E++ G
Sbjct: 479 HYLYTQDLDFLRRTAYPILNGASQFVLRWLVDNPQKSGELITAPSTSPENEYVTDKGYHG 538
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL---EKNEDAL-VEKVLKSLPRLRPTKIAEDGSI 590
Y T D+AIIRE+ + A +VL EK ED V ++L RL P + +DG +
Sbjct: 539 TTCYGGTADLAIIRELLLNTLHARQVLGLKEKKEDQKGYPTVSEALARLHPYTVGKDGDL 598
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW D+KD ++HHRH SHL GL+PGH ITI++ P L AAEKTL ++GEE GWS W+
Sbjct: 599 NEWYYDWKDYDIHHRHQSHLIGLYPGHHITIDQQPQLAAAAEKTLLQKGEETTGWSTGWR 658
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEH----EKHFEGGLYSNLFAAHPPFQIDANFGFT 706
LWARLH + AYR +RL V P+ ++ GG Y NLF AHPPFQID NFG T
Sbjct: 659 INLWARLHRADMAYRTFQRLLQYVTPDQYQGKDRMHRGGTYPNLFDAHPPFQIDGNFGGT 718
Query: 707 AAVAEMLVQSTLN--------DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
A V EML+QS ++ +YLLPALP ++W G V GL ARGG V++ W++G +
Sbjct: 719 AGVCEMLLQSEVDYSKRKPQYHVYLLPALP-EEWKDGEVSGLCARGGIVVNMKWRNGKV 776
>gi|302872475|ref|YP_003841111.1| alpha-L-fucosidase [Caldicellulosiruptor obsidiansis OB47]
gi|302575334|gb|ADL43125.1| Alpha-L-fucosidase [Caldicellulosiruptor obsidiansis OB47]
Length = 753
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 306/790 (38%), Positives = 440/790 (55%), Gaps = 64/790 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LKI FN PA + +A+PIGNG LGAM++GGV ET++LNE+++W+ P NPDA + L
Sbjct: 6 LKILFNHPANCWEEALPIGNGSLGAMIYGGVEYETIQLNEESIWSCGPRRRENPDALRYL 65
Query: 73 SDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R + G A SV H Y+ LG +++ F+ K E Y R LD
Sbjct: 66 QEIRKSILEGNIKRAEELSVFALSGTPHSEGNYEPLGYLDIYFEGIE-KDKIENYCRYLD 124
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS----FNVSLDSLLDNHS 185
++ A +V++SVG + + +FSS PD+VIV KIS SE ++ F +D
Sbjct: 125 ISNAICKVEFSVGKARYDKLYFSSFPDKVIVIKISCSEKCGVTLRAKFRREFQEDIDRCG 184
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ GN++I E R G+ FSA+L+ +S D G + + D L ++
Sbjct: 185 KI-GNDKIFFECTAGSGR------------GVSFSAMLK-AVSKD-GDVYTIGDN-LFIK 228
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ +LL+ +++S+ +KD + + L+ + + +LY RH +DY+ LF
Sbjct: 229 NATEVMLLITSTTSY---------KEKDYFNWCLKTLEQVSKHDFEELYKRHTEDYKSLF 279
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + + + + E I+ + R D L+ LLFQFGRYLLISSS
Sbjct: 280 DRVEFYIDTANTNDRIGLTTPERINLLKKGYR--------DEELIVLLFQFGRYLLISSS 331
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG NLQGIWN+++ P W S +NINL+MNYW + CNLSEC PLF L + N
Sbjct: 332 RPGCLPPNLQGIWNKEMKPPWGSKYTININLQMNYWPAEICNLSECHLPLFTLLERMYEN 391
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TAQ Y G+ HH TDIW ++ + WPMG AWLC H+WEHY YT D D
Sbjct: 392 GKITAQKMYNCRGFCAHHNTDIWGDTAPQDIYIPATYWPMGAAWLCLHIWEHYEYTGDLD 451
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL K+ Y L+ A FLLD+LIE +GYL T PS SPE+ + +G + ++Y T+D+
Sbjct: 452 FL-KKYYYLMREAALFLLDYLIEDKNGYLVTCPSCSPENSY-KLNGNVYSLTYMPTIDIQ 509
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
II +F + A ++L+ N D ++EK+ +L +L P KI + G I EW +D+++ E HR
Sbjct: 510 IISVLFEKVKKANDILKLN-DEIIEKIDYALEKLPPIKIGKYGQIQEWIEDYEEAEPGHR 568
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWARLHDQEH 662
H+SHLFGL+P + IT EK P L +AA+KTLQ+R E G GWS W + ARL + +
Sbjct: 569 HISHLFGLYPENQITFEKTPQLFEAAKKTLQRRLEHGSGHTGWSRAWVICILARLKEGDK 628
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY+ + L + NL HPPFQID NFG TA +AEML+QS + +
Sbjct: 629 AYKNILEL-----------LKRSTLPNLLDNHPPFQIDGNFGATAGIAEMLMQSYDDTIE 677
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALP D W SG +KGLKARGG TV I W++G + + + + L Y+ +
Sbjct: 678 LLPALPSD-WKSGYIKGLKARGGHTVDIYWENGIFKKAKVILGFKES-----VILKYKKS 731
Query: 783 SVKVNLSAGK 792
+++ G+
Sbjct: 732 CIEIRGCEGE 741
>gi|372220893|ref|ZP_09499314.1| alpha-L-fucosidase [Mesoflavibacter zeaxanthinifaciens S86]
Length = 805
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 285/761 (37%), Positives = 442/761 (58%), Gaps = 35/761 (4%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAP 69
N +I F+ PA +F + + +GNG++GA ++GG+ +E + LN+ TLW+G P ++ N P+A
Sbjct: 30 NSDEIWFDKPATYFEETLVLGNGKMGASIFGGIQTEKIFLNDITLWSGEPMNHNNNPEAY 89
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
K L ++R+ + + Y A + + KL G + Y LG + L F + + Y+R LD
Sbjct: 90 KNLPEIRAALKAENYKLADSLNKKLQGQFSQSYAPLGTLWLHFKN---ETNITNYKRSLD 146
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V Y V++ RE+F SNP +V+V +++ ++SF++ +S L
Sbjct: 147 LTTAIADVSYESNGVKYKREYFISNPKKVMVVRLTSDRKKAISFDLKFESQL-RFKIKEL 205
Query: 190 NNQIIMEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKLK 243
++++I G P P + +P KG +F++ IK +D GT+ ++D L
Sbjct: 206 DSKLIATGYAPVHVEPSYRGSIKNPIVFDADKGTRFTSAFSIKQTD--GTVK-IQDSVLS 262
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+ + LL+ ++SF+G NP+ + + ++ ++S + +Y++L H+ DY +
Sbjct: 263 VQNATEVELLVAVATSFNGFDKNPATEGLNHENIALEQIKSSKKETYANLKKEHVADYSE 322
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYLLI 362
L++RV +LS + + VP+ +R+ ++T + +E+L F +GRYLLI
Sbjct: 323 LYNRVDFKLSH------------KELPNVPTDQRLLRYETGANDQNLEILYFNYGRYLLI 370
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
+SSR ANLQG+WN + P W S +NINL+ NYW + NLSE +PL F+ L
Sbjct: 371 ASSRTKEVPANLQGLWNPHIRPPWSSNYTININLQENYWLAETANLSELHQPLLSFIGNL 430
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHY 478
S G+ TA+ Y +GW H +DIWA ++ +G WA W MGG WL +HLWEHY
Sbjct: 431 SKTGAITAKTYYGTNGWAAGHNSDIWALTNPVGDFGQGNPNWANWNMGGVWLTSHLWEHY 490
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
YT D +L++ AYP+++G A+F +WLI+ G ++PSTSPE+ + P+G + Y
Sbjct: 491 LYTKDTTYLKEYAYPIIKGAATFASEWLIKDQHGQFISSPSTSPENLYKTPEGYVGATLY 550
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
+T DMA+I+E+F + ++A++ L +D K+ +L L P KI + G++ EW D++
Sbjct: 551 GATADMAMIKELFYSYLNASKTLAIQDD-FTRKIKFNLENLSPYKIGQKGNLQEWYYDWE 609
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
D HRH +HL+GL PG+ IT P L +AA+ TL+ +G+E GWS W+ LWARL
Sbjct: 610 DQNPKHRHQTHLYGLHPGNQITPYDTPKLAEAAKTTLEIKGDETTGWSKGWRINLWARLW 669
Query: 659 DQEHAYRMVKRLFNLVDPEHEK--HFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
D AY+M + L V+P+ K GG Y NLF AHPPFQID NFG A V EML+QS
Sbjct: 670 DGNRAYKMYRELLRYVNPDTSKPNSKRGGTYPNLFDAHPPFQIDGNFGGAAGVIEMLMQS 729
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
+YLLPALP D W G +KG+KARGG + + W+ L
Sbjct: 730 NPETIYLLPALP-DAWQKGSIKGIKARGGFEIDLDWEQHKL 769
>gi|423241477|ref|ZP_17222590.1| hypothetical protein HMPREF1065_03213 [Bacteroides dorei
CL03T12C01]
gi|392641370|gb|EIY35147.1| hypothetical protein HMPREF1065_03213 [Bacteroides dorei
CL03T12C01]
Length = 824
Score = 521 bits (1342), Expect = e-145, Method: Compositional matrix adjust.
Identities = 300/763 (39%), Positives = 439/763 (57%), Gaps = 51/763 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+A AL+ +R
Sbjct: 31 YDKPARYWEEALPLGNGRLGAMVYGNPVTEEIQLNEETVSAGSPYKNYNPEAKGALATIR 90
Query: 77 SLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ + +Y EA A A K+ FG P YQ +G + L+F SH Y +RRELDL
Sbjct: 91 QLIFADRYPEAQALAGEKILSKNGFGMP---YQTVGSLRLDFP-SHENYT--NFRRELDL 144
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V V++ RE F+S DQ+++ +++ S+ G L+F+ SL V+G
Sbjct: 145 EKAVATTAYTVNGVDYKREVFTSFVDQLVIVRLTASQPGKLTFSASLTCPQKVDVTVSGK 204
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N +I+EG G +D KG I+F A L++ D +G S D L V ++
Sbjct: 205 NALILEGTTKG---------DDFTKGSIRFRADLKL---DLQGGKSVAGDTLLSVTNANS 252
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A + + +++F +N D +P+ + ++++ +Y+ H+ YQK ++RVS
Sbjct: 253 ATIYIAMATNF----VNYKDISGNPSGRNKVSMKNAGK-NYARALQAHISAYQKYYNRVS 307
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L R+ + P+ R+K F +DP LV L FQFGRYLLISSS+PG
Sbjct: 308 LNLRRTSQA------------DKPTDVRIKEFAISDDPHLVALYFQFGRYLLISSSQPGG 355
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG +
Sbjct: 356 QPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLREMHEPFLQMVKELYENGQEA 415
Query: 430 AQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ Y GWV+HH TD+W + A DR WP AWLC HLW+ Y Y+ D+++L
Sbjct: 416 AREMYGCRGWVLHHNTDLWRMNGAVDRAYC--GPWPTCNAWLCQHLWDRYLYSGDKEYLA 473
Query: 489 KRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
YP+L+ + F +D+L+ + + GYL PS SPE+ GK A + TMD ++
Sbjct: 474 S-VYPILKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPSIWKGK-ANLFAGITMDNQLV 531
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
++FS SAA++L ++ + +L +L P ++ + G + EW +D+ +P HHRH+
Sbjct: 532 SDLFSNTRSAAQILNLDKQ-FCDTILSLKRQLPPMQVGQYGQLQEWFEDWDNPNDHHRHI 590
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHL+GLFPG+ I+ +P L +AA TL +RG+ GWS+ WK WAR D HA++++
Sbjct: 591 SHLWGLFPGYQISPYSSPILFEAARNTLIQRGDPSTGWSMGWKVCFWARCLDGNHAFKLI 650
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
N V PE +K GG Y NLF AHPPFQID NFG A +AEML+QS ++LLPAL
Sbjct: 651 TNQLNFVSPEVQKGQGGGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQSHDGAVHLLPAL 710
Query: 728 PWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
P D W +G ++GL+ARGG E VS+ WKDG + I S N
Sbjct: 711 P-DTWKNGEIRGLRARGGFEIVSLKWKDGKVESAIIKSTIGGN 752
>gi|332882277|ref|ZP_08449905.1| hypothetical protein HMPREF9074_05703 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357046479|ref|ZP_09108106.1| hypothetical protein HMPREF9441_02131 [Paraprevotella clara YIT
11840]
gi|332679661|gb|EGJ52630.1| hypothetical protein HMPREF9074_05703 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355530718|gb|EHH00124.1| hypothetical protein HMPREF9441_02131 [Paraprevotella clara YIT
11840]
Length = 807
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 309/770 (40%), Positives = 430/770 (55%), Gaps = 62/770 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S + LK+ ++ PAK +T+A+P+GN RLGAMV+GG E L+LNE+T W G P D N
Sbjct: 15 SVAWAGELKLWYSKPAKDWTEALPVGNSRLGAMVYGGTGREELQLNEETFWAGGPYDNNN 74
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEET 123
+A L VR+L+ G+ EA H + Y +G + L+F H + E
Sbjct: 75 TNALYVLPVVRNLIFQGKTREAQQLVDANFLAHKDGMSYLTMGSLFLDFP-GHEEATE-- 131
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
+ R+L++ ATA +Y V V +TR F+S D VIV ++ ++G+L+F VS D+ L +
Sbjct: 132 FYRDLNIEDATATTRYKVDGVTYTRRVFASFTDSVIVVRLQADKAGALAFTVSYDAPLKH 191
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKK 241
G+ I C GK D +G++ A +K+ D TI+ E K
Sbjct: 192 EVSAEGDLLTIT---CEGK----------DQEGVKAALRAECRVKVVSDGQTIT--EGKN 236
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
LKV G+ A L L A++++ +N D D + + LQ + Y H+ Y
Sbjct: 237 LKVTGATEATLYLSAATNY----VNYHDVSGDAAARADCCLQRAVQIPYKKALENHVAYY 292
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
+KLF RV + L VT S+E + R++ F DPSL LLFQ+GRYLL
Sbjct: 293 RKLFGRVQLDLG------VTAASSKE------TTLRIRDFSQGNDPSLATLLFQYGRYLL 340
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISSS+PG Q ANLQGIWN + WDS +NIN EMNYW + NLSE +PLF L
Sbjct: 341 ISSSQPGGQPANLQGIWNRSTNAPWDSKYTININTEMNYWLAEVANLSEMHQPLFSMLED 400
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHY 478
LS+ G+KTA+ Y GWV HH TD+W G V +A +WP GGAWL HLW+HY
Sbjct: 401 LSVTGAKTAREMYGCGGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHLWQHY 456
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACV 536
+T D+DFL K YP+L+G A F LD+L+E H Y PS SPEH V
Sbjct: 457 LFTADKDFL-KTYYPVLKGTARFFLDFLVE-HPSYKWWVVAPSVSPEH---------GPV 505
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ TMD I+ + + A+E++ ++ A + + + L +L P ++ G + EW QD
Sbjct: 506 TAGCTMDNQIVFDALRNTLLASEIV-GDDAAFRDSLAQMLDKLPPMQVGRHGQLQEWLQD 564
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
DP+ HRH+SHL+GL+P + ++ P+L +AA TL++RG++ GWSI WK WAR
Sbjct: 565 VDDPKDEHRHISHLYGLYPSNQVSPFLYPELFRAARTTLEQRGDKATGWSIGWKINFWAR 624
Query: 657 LHDQEHAYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
+ D HAYR++ + L+ D ++ EG Y N+F AHPPFQID NFG A +AEML+
Sbjct: 625 MLDGNHAYRLISNMLQLLPSDAVANEYPEGRTYPNMFDAHPPFQIDGNFGAAAGIAEMLL 684
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
QS ++LLPALP D W G VKGL+ARGG V + W DG L E + S
Sbjct: 685 QSHDGAVHLLPALP-DVWKEGSVKGLRARGGYEVDMEWTDGRLSEATVRS 733
>gi|237710563|ref|ZP_04541044.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|265750338|ref|ZP_06086401.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|345516324|ref|ZP_08795817.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|423232070|ref|ZP_17218472.1| hypothetical protein HMPREF1063_04292 [Bacteroides dorei
CL02T00C15]
gi|423238857|ref|ZP_17219973.1| hypothetical protein HMPREF1065_00596 [Bacteroides dorei
CL03T12C01]
gi|423246621|ref|ZP_17227674.1| hypothetical protein HMPREF1064_03880 [Bacteroides dorei
CL02T12C06]
gi|229433914|gb|EEO43991.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|229455285|gb|EEO61006.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|263237234|gb|EEZ22684.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|392625607|gb|EIY19671.1| hypothetical protein HMPREF1063_04292 [Bacteroides dorei
CL02T00C15]
gi|392635319|gb|EIY29221.1| hypothetical protein HMPREF1064_03880 [Bacteroides dorei
CL02T12C06]
gi|392647735|gb|EIY41433.1| hypothetical protein HMPREF1065_00596 [Bacteroides dorei
CL03T12C01]
Length = 819
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 308/765 (40%), Positives = 437/765 (57%), Gaps = 45/765 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A K+L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDRPEALKSL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQNLIQDNFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+VIV +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVIVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ G+ D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGK------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
AYP L+G A F LD+LIE + G++ T PS SPEH D K A + TMD II
Sbjct: 470 -AYPALKGAADFYLDYLIEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+V S + A+ +L+ + A + L+S L RL P +I + + EW +D +P HRH
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLEDLDNPNDKHRH 586
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SH++GLFP + I+ +P L +AA+ TL +RG+E GWSI WK LWARL D HA+R+
Sbjct: 587 ISHVYGLFPSNQISPYTHPLLFQAAKNTLLQRGDEATGWSIGWKVNLWARLLDGNHAFRI 646
Query: 667 VKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+ + L+ D E + +G Y NLF AHPPFQID NFG+TA VAEML+QS ++LL
Sbjct: 647 INNMLKLLPGDEVKEAYPQGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVHLL 706
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP D W++G V+GL ARGG V + W L + I+S N
Sbjct: 707 PALP-DAWATGSVQGLVARGGFVVDMNWNGVQLDKAKIHSRLGGN 750
>gi|21242520|ref|NP_642102.1| hypothetical protein XAC1774 [Xanthomonas axonopodis pv. citri str.
306]
gi|21107972|gb|AAM36638.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 790
Score = 521 bits (1341), Expect = e-145, Method: Compositional matrix adjust.
Identities = 308/796 (38%), Positives = 438/796 (55%), Gaps = 68/796 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG+ E L+LNEDTL+ G P D T+P
Sbjct: 39 VAAAEALQLWYREPANEWVEALPVGNGRLGAMVWGGIAHERLQLNEDTLYAGGPYDSTSP 98
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G+YAEA A KL P YQ LGD+ L+FD +
Sbjct: 99 DALAALPQVRALIFAGRYAEAEKLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GISD 155
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F Q IV ++S G +S V +DS
Sbjct: 156 YRRQLDLDTAVATTTFRSGGAVHRREVFVCAQAQCIVVRLSCDRPGGISLRVGIDSPQTG 215
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALEDKK 241
++ GR N GI+ +++ G +S + D+
Sbjct: 216 EVTAEPGG-LLFSGR------------NGSFAGIEGRLRFALRVLPQVSGGKLSQVRDR- 261
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S+ + D DP + + + L+ L + L HL D+
Sbjct: 262 LRIDAADEVVLLLSAATSYQ--RFDAVDG--DPLALTAARLRKAAKLDFPALLRAHLADH 317
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I D S E + +P+ ERV+ F DP+L L Q+GRYLL
Sbjct: 318 QRLFRRVAI-----------DLGSSEAVQ-LPTDERVQRFAEGNDPALAALYHQYGRYLL 365
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRPGTQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL L
Sbjct: 366 ICSSRPGTQPANLQGIWNDLMQPPWESKYTININTEMNYWPSEANALHECVEPLEAMLFD 425
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A GWV+H+ TD+W ++ G W+LWPMGG WL LW+ ++Y
Sbjct: 426 LAQTGTHTARAIYDAPGWVVHNNTDLWRQAGPIDG-AQWSLWPMGGVWLLQQLWDRWDYG 484
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR +L K YPL +G A F + L+ + G + TNPS SPE++ P G C S
Sbjct: 485 RDRAYLSK-VYPLFKGAAEFFVATLMRDPQTGAMVTNPSMSPENQH--PFGAAVCAGPS- 540
Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF- 597
MD ++R++F+ I+ +++L + + + + LP P +I + G + EW QD+
Sbjct: 541 -MDAQLLRDLFAQCIAMSKLLGIDAQLAQQLAALREQLP---PNRIGKAGQLQEWQQDWD 596
Query: 598 -KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ PE++HRH+SHL+ L P I + P+L AA ++L+ RG+ GW I W+ LWAR
Sbjct: 597 MQAPEINHRHVSHLYALHPSSQINLRDTPELAAAARRSLEIRGDNATGWGIGWRLNLWAR 656
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D EHAYR+++ L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 657 LADGEHAYRILQL---LISPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQS 706
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKT 776
++LLPALP W G V+GL+ RGG +V + W+ G L + ++S D
Sbjct: 707 WGGSVFLLPALP-KAWPRGSVRGLRVRGGASVDLEWEGGRLQQARLHS-----DRGGRYQ 760
Query: 777 LHYRGTSVKVNLSAGK 792
L Y G ++ + L AG+
Sbjct: 761 LSYAGQTLDLELGAGR 776
>gi|212695001|ref|ZP_03303129.1| hypothetical protein BACDOR_04539 [Bacteroides dorei DSM 17855]
gi|212662454|gb|EEB23028.1| hypothetical protein BACDOR_04539 [Bacteroides dorei DSM 17855]
Length = 819
Score = 520 bits (1340), Expect = e-144, Method: Compositional matrix adjust.
Identities = 308/765 (40%), Positives = 437/765 (57%), Gaps = 45/765 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A K+L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDRPEALKSL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQNLIQDNFYAGKHGMPYQTIGSLIIE-APGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+VIV +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVIVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ G+ D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGK------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
AYP L+G A F LD+LIE + G++ T PS SPEH D K A + TMD II
Sbjct: 470 -AYPALKGAADFYLDYLIEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+V S + A+ +L+ + A + L+S L RL P +I + + EW +D +P HRH
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLEDLDNPNDKHRH 586
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SH++GLFP + I+ +P L +AA+ TL +RG+E GWSI WK LWARL D HA+R+
Sbjct: 587 ISHVYGLFPSNQISPYTHPLLFQAAKNTLLQRGDEATGWSIGWKVNLWARLLDGNHAFRI 646
Query: 667 VKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+ + L+ D E + +G Y NLF AHPPFQID NFG+TA VAEML+QS ++LL
Sbjct: 647 INNMLKLLPGDEVKEAYPQGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVHLL 706
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP D W++G V+GL ARGG V + W L + I+S N
Sbjct: 707 PALP-DAWATGSVQGLVARGGFVVDMNWNGVQLDKAKIHSRLGGN 750
>gi|300770084|ref|ZP_07079963.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33861]
gi|300762560|gb|EFK59377.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33861]
Length = 826
Score = 520 bits (1338), Expect = e-144, Method: Compositional matrix adjust.
Identities = 305/775 (39%), Positives = 434/775 (56%), Gaps = 49/775 (6%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A N LK+ ++ PA ++ +A+PIGNGRLGAMV+G E ++LNE+T+W G PG+
Sbjct: 21 ATCLQAQNSLKLQYDKPAGNWNEALPIGNGRLGAMVFGQPDQEQIQLNEETIWAGGPGNN 80
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSH 116
+ +A + +R L+ G+ EA S F PA YQ GD+ + F H
Sbjct: 81 VSKNAYDKIQQIRRLLFEGKAKEAQDLSNATFPRPAPSGIDYGMPYQTFGDLRISFP-GH 139
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+Y +Y RELD+ A R +Y G V +TRE F+S D V++ K+S SLSF++
Sbjct: 140 KQYT--SYSRELDIQDAITRTRYKAGAVTYTREVFASLKDDVVIIKLSADTKKSLSFSIG 197
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTIS 235
L S DN N Q+ + G + +++ G IQFS I+ + +G
Sbjct: 198 LTSPHDNTHITVENKQLTLSG---------ISGSHEGKTGRIQFSGIVRPVL---KGGTL 245
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+D +L++ +D +L + ++F +D + ++++ L Y
Sbjct: 246 IQKDNQLEITNADEVILYISIGTNFK----KYNDITSNAAAKALDILNKATARKYEKAKA 301
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ YQ+ F+RVS+ L SP+ S++ D R++ F +DP LV L FQ
Sbjct: 302 DHIQKYQQYFNRVSLYLGESPQ-------SKKMTDI-----RIREFGGADDPELVTLYFQ 349
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSS+PG+Q A LQGIWN+ LSP WDS VNIN EMNYW + NL E EPL
Sbjct: 350 FGRYLLISSSQPGSQPATLQGIWNDKLSPPWDSKYTVNINTEMNYWPAEVTNLKELHEPL 409
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L L++ G ++A+ Y A GW IHH TD+W S G + +WPMGGAWL HLW
Sbjct: 410 FAMLKDLAVTGQESAKELYHARGWNIHHNTDLWRISGVVDGG-FYGIWPMGGAWLSQHLW 468
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLA 534
+H+ Y+ DR FL K Y +L+G A F LD L E +L PS SPE+ + G
Sbjct: 469 QHFLYSGDRSFL-KEYYHVLKGKALFYLDVLQEEPTHKWLVVAPSMSPENSYQPGVG--- 524
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
VS +TMD ++ +VF I A+E+L+++ D L + V +L RL P +I + + EW
Sbjct: 525 -VSAGTTMDNQLVFDVFHNFIQASEILKEDAD-LRDSVQVALHRLPPMQIGQHNQLQEWL 582
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
QD P HRH+SHL+GLFP I+ +NP+L +AA+ ++ RG++ GWS+ WK W
Sbjct: 583 QDLDKPTDKHRHISHLYGLFPSGQISPFRNPELLEAAKNSMIYRGDKSTGWSMGWKVNWW 642
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL D + AY+++K + P E GG Y NL AHPPFQID NFG T+ +AEML+
Sbjct: 643 ARLLDGDQAYKLIKDQLSPA-PLEESGQSGGTYPNLLDAHPPFQIDGNFGCTSGIAEMLL 701
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
QS ++YLLPALP ++G V GLKARGG V + WKD + ++ + S N
Sbjct: 702 QSYDGNIYLLPALP-RALANGKVTGLKARGGFEVDMEWKDNKVKKLVVRSTLGGN 755
>gi|423223718|ref|ZP_17210187.1| hypothetical protein HMPREF1062_02373 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638093|gb|EIY31946.1| hypothetical protein HMPREF1062_02373 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 809
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 295/760 (38%), Positives = 423/760 (55%), Gaps = 45/760 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK +T+A+P+GN RLGAMV+GGV +E ++LNE+T+W G P +P A L
Sbjct: 23 LKLWYQQPAKVWTEALPLGNSRLGAMVYGGVVNEQIQLNEETVWGGGPHRNDSPKAFGVL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G +Q +G + LEFD H Y+ YRR+LDL
Sbjct: 83 PKVRELIFAGREKEAEKVMADNFFTGQHGMPFQTIGSLMLEFD-GHADYS--NYRRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V+Y +G V +TR F+S D ++ +I + G+++F + +
Sbjct: 140 ERAVASVRYKIGEVNYTRTIFTSLVDNALIIRIETDKPGAVNFTTRYSTPYKEYEIKKNG 199
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++ G P A I+F +IK ++G ++ D ++V+G+D A
Sbjct: 200 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVNVTNDC-IEVKGADAA 248
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + A+++F +N D + T + L Y+ T H + YQKLF RVS+
Sbjct: 249 VIYVTAATNF----VNYKDVSANETRRATEFLAKAMKRPYAQALTAHEEAYQKLFGRVSL 304
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ S ++ ++ R+K F +D LV L+FQFGRYLLISSS+PG Q
Sbjct: 305 NIGPSSQE--------------ETSYRIKHFNERKDLGLVALMFQFGRYLLISSSQPGGQ 350
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
A LQGIWN +L WD +NIN EMNYW + NL E EPLF + LS + TA
Sbjct: 351 PAGLQGIWNHELLAPWDGKYTININTEMNYWPAEVTNLPEMHEPLFQMVKELSESAQGTA 410
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y GW +HH TD+W + G +WP+GGAWL HLW+HY YT D+ FL K
Sbjct: 411 RTLYECRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFL-KT 467
Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
AYP L+G A F LD+L+E G++ PS SPE P G ++ TMD I+ +
Sbjct: 468 AYPALKGAADFFLDFLVEHPKYGWMVCTPSMSPEQ---GPPGTGTMITAGCTMDTQIVLD 524
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
++++SA ++L + + + + RL P +I + + EW D DP HRH+SH
Sbjct: 525 ALTSVLSATQLLYPANTSYRDSLQSMIKRLPPMQIGKHNQLQEWLADVDDPNNDHRHVSH 584
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
L+GL+P + I+ +P L +AA+++L RG+ GWSI WK LWARL D +HAY+++K
Sbjct: 585 LYGLYPSNQISPYAHPQLFQAAKRSLLYRGDMATGWSIGWKINLWARLLDGDHAYKIIKN 644
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPW 729
+ LV+ ++ +G Y N+F AHPPFQID NFGFTA VAEML+QS L+LLPALP
Sbjct: 645 MLKLVEKDNP---DGRTYPNMFDAHPPFQIDGNFGFTAGVAEMLLQSHDEALHLLPALPQ 701
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
D W+ G VKGL ARG V + W G+L I S N
Sbjct: 702 D-WNKGSVKGLVARGAFEVDMDWDGGELTTATITSRIGGN 740
>gi|374324082|ref|YP_005077211.1| alpha-L-fucosidase [Paenibacillus terrae HPL-003]
gi|357203091|gb|AET60988.1| alpha-L-fucosidase [Paenibacillus terrae HPL-003]
Length = 772
Score = 519 bits (1337), Expect = e-144, Method: Compositional matrix adjust.
Identities = 311/777 (40%), Positives = 436/777 (56%), Gaps = 64/777 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N I FN PA+ + +AIPIGNG LG M++G E ++LNED+LW G P D NP + +
Sbjct: 2 NEKMIWFNQPAEKWEEAIPIGNGTLGGMIFGKTSIERIQLNEDSLWYGGPMDRNNPHSFE 61
Query: 71 ALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
L ++RSL+ SGQ +A ASV L G P Y+ LGD+ L D + + YRR+
Sbjct: 62 YLDEIRSLLFSGQIKQAEELASVALVGVPDGQRHYESLGDLYLNIGDGEEEIKD--YRRQ 119
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV------------ 175
LDL+ V Y V V + RE+FSS PDQV+V +++ SE G+LSF+
Sbjct: 120 LDLDHGIVSVNYRVNQVNYCREYFSSFPDQVLVVRLNSSEYGALSFSALFGRGIVLEPTP 179
Query: 176 ---SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
L + H+Y++ +E R P I + ++ GI+F + I+I + G
Sbjct: 180 WSDVLKHPVGLHAYLDR-----IETRSPADLIIRGRSGGEE--GIRFCCV--IRIVTEEG 230
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
IS + +L ++ + A +L+ A + F P K+ +E + L SY
Sbjct: 231 QIS-YSNGQLSLKDVNAATILVSACTDFRIP-------KEQMEAECICRLDRAAGKSYDQ 282
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
L T H++DYQ LF RV + L + V T + + T ER+K+ ED L+ L
Sbjct: 283 LRTGHIEDYQALFGRVELSLQGN----VDSTSTSSFLTTDQRLERIKN--GAEDNELISL 336
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
FQFGRYLLISSSRPG+ ANLQGIWN+D+ P WDS +NIN +MNYW + CNL+EC
Sbjct: 337 YFQFGRYLLISSSRPGSLPANLQGIWNKDMLPIWDSKYTININTQMNYWPAEICNLAECH 396
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PL DF+ + G +TA++ Y G+V HH +DIWA ++ + W MG AWL
Sbjct: 397 IPLIDFIDRMQERGKETARIMYRCRGFVAHHNSDIWADTAPQDVCITSTFWTMGAAWLSL 456
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
HLW+HY + D FL K AY ++ A FLLD+LIE G L +PS+SPE+ ++ P+G+
Sbjct: 457 HLWDHYEFGQDASFL-KEAYDTMKEAAFFLLDYLIEDPYGNLVISPSSSPENRYVLPNGE 515
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSI 590
+ Y ++MD IIRE+F I + +L+++++ A++ K LK +P+L + + G I
Sbjct: 516 SGALCYGASMDSQIIRELFERCIKSTIILQEDQEFGAMLRKALKRIPKL---AVGKHGQI 572
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSI 647
EW+ D+++ E HRH+SHLF L PG IT E P L +AA TL++R G GWS
Sbjct: 573 QEWSIDYEELEPGHRHISHLFALHPGSQITPESTPALAEAARVTLRRRLTHGGGHTGWSR 632
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
W +WARL + E AY ++ L NLF HPPFQID NFG TA
Sbjct: 633 AWILNMWARLEESELAYENIQEL-----------LRSSTLPNLFCDHPPFQIDGNFGGTA 681
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+AEML+QS ++ LLPALP W +G V+GL+ARGG V I W DG L I S
Sbjct: 682 GIAEMLLQSHGGEIRLLPALP-SVWPNGSVRGLRARGGFEVDIEWSDGRLQNARIRS 737
>gi|298482732|ref|ZP_07000916.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
gi|298271195|gb|EFI12772.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
Length = 823
Score = 519 bits (1336), Expect = e-144, Method: Compositional matrix adjust.
Identities = 306/765 (40%), Positives = 438/765 (57%), Gaps = 55/765 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+++ +A+P+GNGRLGAMV+G +E ++LNE+T+ G P NP+A AL+ +R
Sbjct: 32 YDKPARYWEEALPLGNGRLGAMVYGNPVAEEIQLNEETVSAGSPYKNYNPEAKGALATIR 91
Query: 77 SLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L+ +G+Y EA A K+ FG P YQ +G + L+F SH Y +RRELDL
Sbjct: 92 QLIFAGRYPEAQELAGEKILSKNGFGMP---YQTVGSLCLDFP-SHENYT--NFRRELDL 145
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A Y+V V++ RE F+S DQ+++ +++ S+ G L+F+ SL V+G
Sbjct: 146 EKAVATTAYTVNGVDYKREVFTSFVDQLVIVRLTASQPGKLTFSASLTCPQKVDVTVSGK 205
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N + +EG G +D KG I+F A L++ D +G S D L V ++
Sbjct: 206 NALTLEGTTKG---------DDFTKGSIRFRADLKL---DLQGGKSVAGDTLLSVTNANS 253
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A + + +++F +N D +P+ + ++++ +Y H+ YQK ++RVS
Sbjct: 254 ATIYIAMATNF----VNYKDISGNPSGRNKVSMKNAGK-NYVRALQAHISAYQKYYNRVS 308
Query: 310 IQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L R S D TD R+K F +DP LV L FQFGRYLLISSS+PG
Sbjct: 309 LNLGRTSQADKPTDV-------------RIKEFAISDDPHLVALYFQFGRYLLISSSQPG 355
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQGIWN+ L+P W NIN EMNYW + NL E EP + L NG +
Sbjct: 356 GQPANLQGIWNQKLNPAWKCRYTTNINAEMNYWPAEVTNLREMHEPFLQMVKELYENGQE 415
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
A+ Y GWV+HH TD+W + A DR WP AWLC HLW+ Y Y+ D+++L
Sbjct: 416 AAREMYGCRGWVLHHNTDLWRMNGAVDRAYC--GPWPTCNAWLCQHLWDRYLYSGDKEYL 473
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
YP+L+ + F +D+L+ + + GYL PS SPE+ GK A + TMD +
Sbjct: 474 AS-VYPILKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPSIWKGK-ANLFAGITMDNQL 531
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+ ++FS SAA++L N+D + SL R L P ++ + G + EW +D+ +P HHR
Sbjct: 532 VSDLFSNTRSAAQIL--NQDKQFCDTILSLKRQLPPMQVGQYGQLQEWFEDWDNPNDHHR 589
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFPG+ I+ +P L +AA TL +RG+ GWS+ WK WAR D HA++
Sbjct: 590 HISHLWGLFPGYQISPYSSPVLFEAARNTLIQRGDPSTGWSMGWKVCFWARCLDGNHAFK 649
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ NLV PE +K GG Y NLF AHPPFQID NFG A +AEML+QS ++LLP
Sbjct: 650 LITNQLNLVSPEVQKGQGGGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQSHDGAVHLLP 709
Query: 726 ALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
ALP D W +G ++GL+ARGG E VS+ WK G + I S N
Sbjct: 710 ALP-DTWKNGEIRGLRARGGFEIVSLKWKGGKIESAVIKSTIGGN 753
>gi|224536380|ref|ZP_03676919.1| hypothetical protein BACCELL_01254 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522018|gb|EEF91123.1| hypothetical protein BACCELL_01254 [Bacteroides cellulosilyticus
DSM 14838]
Length = 793
Score = 518 bits (1335), Expect = e-144, Method: Compositional matrix adjust.
Identities = 293/760 (38%), Positives = 424/760 (55%), Gaps = 45/760 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK +T+A+P+GN RLGAMV+GGV +E ++LNE+T+W G P +P A L
Sbjct: 7 LKLWYQQPAKVWTEALPLGNSRLGAMVYGGVVNEQIQLNEETVWGGGPHRNDSPKAFGVL 66
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA F G +Q +G + LEFD H Y+ YRR+LDL
Sbjct: 67 PKVRELIFAGREKEAEKVMADNFFTGQHGMPFQTIGSLMLEFD-GHADYS--NYRRDLDL 123
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V+Y +G V +TR F+S D ++ +I + G+++F + +
Sbjct: 124 ERAVASVRYKIGEVNYTRTIFTSLVDNALIIRIEADKPGAVNFTTRYSTPYKEYEIKKNG 183
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++ G P A I+F +IK ++G ++ + + ++V+G+D A
Sbjct: 184 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVN-VTNNCIEVKGADAA 232
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + A+++F +N D + T + L Y+ T H + YQKLF RVS+
Sbjct: 233 VIYVTAATNF----VNYKDVSANETRRATEFLVKAMKRPYAQALTAHEEAYQKLFGRVSL 288
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ S ++ ++ R+K F +D LV L+FQFGRYLLISSS+PG Q
Sbjct: 289 NIGPSSQE--------------ETSYRIKHFNERKDLGLVALMFQFGRYLLISSSQPGGQ 334
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
A LQGIWN +L WD +NIN EMNYW + NL E EPLF + LS + TA
Sbjct: 335 PAGLQGIWNHELLAPWDGKYTININTEMNYWPAEVTNLPEMHEPLFQMVKELSESAQGTA 394
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y GW +HH TD+W + G +WP+GGAWL HLW+HY YT D+ FL K
Sbjct: 395 RTLYECRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFL-KT 451
Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
AYP L+G A F LD+L+E G++ PS SPE P G ++ TMD I+ +
Sbjct: 452 AYPALKGAADFFLDFLVEHPKYGWMVCAPSMSPEQ---GPPGTGTMITAGCTMDTQIVLD 508
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
++++SA ++L + + + + RL P +I + + EW D DP HRH+SH
Sbjct: 509 ALTSVLSATQLLYPANTSYRDSLQSMIKRLPPMQIGKHNQLQEWLADVDDPNNDHRHVSH 568
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
L+GL+P + I+ +P L +AA+++L RG+ GWSI WK LWARL D +HAY+++K
Sbjct: 569 LYGLYPSNQISPYAHPQLFQAAKRSLLYRGDMATGWSIGWKINLWARLLDGDHAYKIIKN 628
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPW 729
+ LV+ ++ +G Y N+F AHPPFQID NFGFTA VAEML+QS L+LLPALP
Sbjct: 629 MLKLVEKDNP---DGRTYPNMFDAHPPFQIDGNFGFTAGVAEMLLQSHDEALHLLPALPQ 685
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
D W+ G VKGL ARG V + W G+L + S N
Sbjct: 686 D-WNKGSVKGLVARGAFEVDMDWDGGELTTATVTSRIGGN 724
>gi|319640719|ref|ZP_07995432.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
gi|345517731|ref|ZP_08797196.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|254836837|gb|EET17146.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|317387531|gb|EFV68397.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
Length = 819
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 306/765 (40%), Positives = 436/765 (56%), Gaps = 45/765 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A ++L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDKPEALESL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA + F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQDLIQENFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+V+V +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVVVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ GR D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGR------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
AYP L+G A F LD+L E + G++ T PS SPEH D K A + TMD II
Sbjct: 470 -AYPALKGAADFYLDYLTEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+V S + A+ +L+ + A + L+S L RL P +I + + EW +D +P HRH
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLEDLDNPNDKHRH 586
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SH++GLFP + I+ +P L +AA+ TL +RG+E GWSI WK LWARL D HA+R+
Sbjct: 587 ISHVYGLFPSNQISPYTHPLLFQAAKNTLLQRGDEATGWSIGWKVNLWARLLDGNHAFRI 646
Query: 667 VKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+ + L+ D E + +G Y NLF AHPPFQID NFG+TA VAEML+QS ++LL
Sbjct: 647 INNMLKLLPGDEVKEAYPQGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVHLL 706
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP D W +G V+GL ARGG V + W L + I+S N
Sbjct: 707 PALP-DAWVTGSVQGLVARGGFVVDMSWNGVQLDKAKIHSRLGGN 750
>gi|261406666|ref|YP_003242907.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261283129|gb|ACX65100.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 775
Score = 518 bits (1334), Expect = e-144, Method: Compositional matrix adjust.
Identities = 311/797 (39%), Positives = 429/797 (53%), Gaps = 69/797 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
I F+ PA+++ +A+PIGNGRLG MV+G E ++ NED++W G P D NPDA + L
Sbjct: 9 IWFDQPAQNWNEALPIGNGRLGGMVFGCAQQEKIQFNEDSVWYGGPRDRNNPDALRHLPL 68
Query: 75 VRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R L+ G+ EA S F G P Y GD ++ D H + YRRELDL
Sbjct: 69 IRKLLFEGRLKEAHRLSETAFSGTPRSQRPYLTAGDFCIQVD--HPQGELSHYRRELDLE 126
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---YVN 188
A A Y G V FTRE F S PDQV+V ++ G L+ + H + +
Sbjct: 127 KAIAVTSYQYGGVTFTREVFCSYPDQVMVIRLEADRPGVLTLTARFERQKGKHMDAVHRH 186
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G + ++M C GK G+ +SA + + GT+ + + L V+ +D
Sbjct: 187 GTDTVVMTNDCGGK------------DGLTYSAAAKAITAG--GTVRVV-GEHLLVDQAD 231
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
V++L A+S+F DP L+ N Y+ L RH+ DYQ LF RV
Sbjct: 232 EVVIILAAASTF---------RVDDPKLRCAELLEHAANQGYAALKKRHIADYQPLFERV 282
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS-LVELLFQFGRYLLISSSRP 367
+ L R+P D + +P+ +R++ + ED + L L F FGRYLLI+ SRP
Sbjct: 283 KLDL-RAPAD--------QERHLLPTPKRLERVRAGEDDAGLYTLYFHFGRYLLIACSRP 333
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G+ ANLQGIWN+ ++P WDS +NIN +MNYW + CNLSEC EPLF+ + + NG
Sbjct: 334 GSLPANLQGIWNDSMAPPWDSKFTININTQMNYWPAESCNLSECHEPLFELIERMRDNGR 393
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y G+V HH TDIWA ++ W MG AWL HLWEHY + + DFL
Sbjct: 394 VTARTMYGCRGFVAHHNTDIWADTAPQDIYPPATQWVMGAAWLTLHLWEHYKFNPNPDFL 453
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
KRAY ++ A F D+L+E +GYL TNPS SPE+ ++ +G+ + Y +MD II
Sbjct: 454 -KRAYETMKEAALFFTDFLVESPEGYLVTNPSVSPENRYLLRNGESGTLCYGPSMDTQII 512
Query: 548 REVFSAIISAAEVLEKNEDALVE--KVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
E++SA I A+ L+ +E+A E ++ LP + K+ G + EW +D+++ + HR
Sbjct: 513 SELYSACIQASLELDIDENARQEWAAIMDRLPEM---KVGRHGQLQEWLEDYEEADPGHR 569
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEH 662
H+SHLFGL PG T++ + PDL +AA TL++R G GWS W WARL D E
Sbjct: 570 HISHLFGLHPGTTVSPDSTPDLAEAARVTLRRRLAHGGGHTGWSRAWIINFWARLLDGEQ 629
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY +K L NLF HPPFQID NFG A +AEML+QS L+ +
Sbjct: 630 AYVHLKELLR-----------QSTLPNLFDNHPPFQIDGNFGAAAGIAEMLIQSHLDHIR 678
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALP + W G V+GL+ARGG V I W+DG L E I S LH +
Sbjct: 679 LLPALP-EAWPQGRVQGLRARGGFQVDIDWRDGSLAEAVITSVSGRK-----LRLHAK-R 731
Query: 783 SVKVNLSAGKIYTFNRQ 799
SV+V S G+ R
Sbjct: 732 SVRVTTSDGREVPMERH 748
>gi|220928453|ref|YP_002505362.1| hypothetical protein Ccel_1020 [Clostridium cellulolyticum H10]
gi|219998781|gb|ACL75382.1| conserved hypothetical protein [Clostridium cellulolyticum H10]
Length = 759
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 310/781 (39%), Positives = 446/781 (57%), Gaps = 63/781 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PAK + +A+PIGNGRLGAMV+G V +E ++LNED++W G P D NPDA L+
Sbjct: 4 KLWYKSPAKEWNEALPIGNGRLGAMVYGCVKNENIQLNEDSIWYGDPIDRNNPDALANLA 63
Query: 74 DVRSLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEF--DDSHLKYAEETYRREL 128
++R+ + G+ EA +V L G P YQ LG+++L F D+S ++ Y REL
Sbjct: 64 EIRNFLSDGRIKEAEKLAVLSLSGVPESQRPYQTLGNLKLNFEIDESDIR----DYSREL 119
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF--NVSLDSLLDNHSY 186
D+ A A VK+ V +TRE+F+S DQVIV ++ G +SF N+ LDN
Sbjct: 120 DIENACASVKFVSKGVMYTREYFASAVDQVIVVRLFADAPGKISFTANMRRGRFLDNSGA 179
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
++G K I A+ D KG++F ++ ++ + G ++ + + L VE
Sbjct: 180 IDG------------KTIGMFASCGSD-KGVRFCSM--VRAVSEGGKVNTI-GENLIVEE 223
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D LL+ ++SF K+ ++ + L + +Y++L + H++DY +L+
Sbjct: 224 ADAVTLLISTATSF---------YHKEYETQCLKYLDGVEEKTYTELMSNHIEDYSQLYG 274
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
RV +++ + + + I ++ +AER++ ++ + D L L F FGRYLLIS S
Sbjct: 275 RVELEIGNAEE--------HDKIQSLDTAERLERLESGKPDHQLECLYFSFGRYLLISCS 326
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG+ ANLQGIWN+D+ P WDS +NIN EMNYW + CNLSEC PLFD + +
Sbjct: 327 RPGSLPANLQGIWNQDILPAWDSKYTININTEMNYWPAETCNLSECHFPLFDHIERMRAP 386
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+V Y SG+V HH TDIW ++ + WPMG AWL HLWEHY + +D++
Sbjct: 387 GRRTARVMYGCSGFVAHHNTDIWGDTAPQDIYIPATYWPMGAAWLSLHLWEHYEFGLDKE 446
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL K AYP+++ A F LD+LIE G L T+PS SPE+ +I +G+ C+ +MD
Sbjct: 447 FL-KDAYPVMKEAAQFFLDFLIEDSKGRLVTSPSVSPENTYILENGEKGCLCIGPSMDSQ 505
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I+ +FS I A+ +L+ + + EK++K L +I G I EW++D+++ E HR
Sbjct: 506 ILYALFSGCIEASNILD-TDISFAEKLIKVRDSLPKPQIGRYGQIQEWSEDYEEEEPGHR 564
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEH 662
H+SHLFGL PG + K P+L AA KTL++R G GWS W +WARL D E
Sbjct: 565 HISHLFGLHPGKQFSTRKTPELATAARKTLERRLANGGGHTGWSRAWIINMWARLKDGEK 624
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY N+VD + NLF HPPFQID NFG A +AEML+QS +
Sbjct: 625 AYE------NVVD-----LLKKSTLPNLFDNHPPFQIDGNFGGAAGIAEMLLQSHEGGIE 673
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LPALP WS G VKGL ARG V + WKDG L+ I S S + F +L YR T
Sbjct: 674 FLPALP-GAWSEGRVKGLVARGNFEVEMEWKDGKLNRATILSR-SGGNCKIFTSLKYRVT 731
Query: 783 S 783
S
Sbjct: 732 S 732
>gi|423311596|ref|ZP_17289533.1| hypothetical protein HMPREF1058_00145 [Bacteroides vulgatus
CL09T03C04]
gi|392690241|gb|EIY83511.1| hypothetical protein HMPREF1058_00145 [Bacteroides vulgatus
CL09T03C04]
Length = 819
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 306/765 (40%), Positives = 437/765 (57%), Gaps = 45/765 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A ++L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDKPEALESL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA + F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQNLIQENFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+V+V +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVVVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ GR D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGR------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA-CVSYSSTMDMAII 547
AYP L+G A F LD+L E + G++ T PS SPEH D K A + TMD II
Sbjct: 470 -AYPALKGAADFYLDYLTEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVAGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+V S + A+ +L+ + A + L+S L RL P +I + + EW +D +P HRH
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLEDLDNPNDKHRH 586
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SH++GLFP + I+ +P L +AA+ TL +RG+E GWSI WK LWARL D HA+R+
Sbjct: 587 ISHVYGLFPSNQISPYTHPLLFQAAKNTLLQRGDEATGWSIGWKVNLWARLLDGNHAFRI 646
Query: 667 VKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+ + L+ D E + +G Y NLF AHPPFQID NFG+TA VAEML+QS ++LL
Sbjct: 647 INNMLKLLPGDEVKEAYPQGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVHLL 706
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP D W++G V+GL ARGG V + W L + I+S N
Sbjct: 707 PALP-DAWATGSVQGLVARGGFVVDMNWNGVQLDKAKIHSRLGGN 750
>gi|345013386|ref|YP_004815740.1| large hypothetical protein [Streptomyces violaceusniger Tu 4113]
gi|344039735|gb|AEM85460.1| large secreted protein [Streptomyces violaceusniger Tu 4113]
Length = 805
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 302/757 (39%), Positives = 416/757 (54%), Gaps = 54/757 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PL + + PA + A+P+GNGRLGAMV+G +E L+LN DTLW G P Y N
Sbjct: 44 RPLALWYREPAADWLSALPLGNGRLGAMVFGATETERLQLNADTLWAGGPHSYDNHKGLA 103
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
AL +R LV G++ EA T + G P YQ +G + L A YRRE
Sbjct: 104 ALPRIRQLVFDGKWPEAETLINSDFLGVPGGQAQYQTVGSLLLSLPTGG---AVTGYRRE 160
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL++A A Y+ V FTRE F+S PD+VIV ++S S+ G+LSF + +S L
Sbjct: 161 LDLDSAVATTTYTRDGVTFTREAFASAPDRVIVVRLSASKKGALSFGATFESPLRTSLSS 220
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++G +A G + F A++ + + + V G
Sbjct: 221 PDPLTAALDG---------TGDATGGVDGAVGFRALVRVLAEG---GTTTSAGGTVTVRG 268
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D A +L+ +++ +N ++ D ++ + L N Y L +RH+DD++ LF
Sbjct: 269 ADAATVLVAIGTTY----VNWENANGDAAGQAAADLNPAANRPYGQLRSRHVDDHRALFR 324
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R S+ + + +P+ ERV F + DP LVEL FQ+GRYLLI++SR
Sbjct: 325 RTSLDVGSG------------DAAALPTDERVSRFASGGDPQLVELHFQYGRYLLIAASR 372
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ A LQGIWN+ SP W S +NIN EMNYW + P NL EC EP+F L L++ G
Sbjct: 373 PGTQPATLQGIWNDLTSPPWGSKYTININTEMNYWPAAPANLLECWEPVFALLDELAVAG 432
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TA+ Y A GWV HH TD+W + +A W +WPMGGAW+ +WEHY YT D +
Sbjct: 433 RSTARTQYGADGWVTHHNTDVW-RGTAPVDGAFWGMWPMGGAWMSMAIWEHYRYTRDTEK 491
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L R YP+L+G A F LD L+ + G L T PS SPE+ + G C TMDM
Sbjct: 492 LRAR-YPVLKGAAQFFLDALVTDPATGALVTCPSVSPENAHHSGGGGSLCA--GPTMDMQ 548
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVH 603
++R++F A+ SAA+ L + AL ++VL + RL P KI G + EW QD+ PE
Sbjct: 549 LLRDLFGAVASAADTL-GTDAALRDQVLAARGRLAPMKIGAQGRLQEWQQDWDAGAPEQE 607
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH+SHL+GL P + I+ PDL AA TL +RG+ G GWS+ WK WARL + + +
Sbjct: 608 HRHVSHLYGLHPSNQISRTGTPDLFTAARTTLVRRGDAGTGWSLAWKVNFWARLEEGDRS 667
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
Y++ L +L+ PE NLF HPPFQID NFG A V E L+QS ++L+L
Sbjct: 668 YKL---LADLLTPERTA-------PNLFDLHPPFQIDGNFGACAGVTEWLLQSQHDELHL 717
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
LPALP + G V+GL ARGG V + W+ G L+E
Sbjct: 718 LPALP-SQLPDGSVRGLLARGGFEVDMSWRGGALNEA 753
>gi|325103216|ref|YP_004272870.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324972064|gb|ADY51048.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 822
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 289/768 (37%), Positives = 435/768 (56%), Gaps = 50/768 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
NP+++ +N PA ++ +A+PIGNG L MV+GGV + ++LNE+T+W G PG+ P+
Sbjct: 27 NPMELWYNQPAANWNEALPIGNGFLAGMVFGGVQKDRIQLNEETIWAGEPGNNIIPNVYP 86
Query: 71 ALSDVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLKYAEET 123
A++++R L+ G+Y EA S K F G+ YQ G++ L+F
Sbjct: 87 AIAEIRKLLVEGKYKEAQDLSNKAFPRQAPKGGNYGMQYQTAGNLFLDFGHGGFI----N 142
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR LD+ ATA + Y +++ RE+ + P +VI +++ S++ S+SF + +D+
Sbjct: 143 YRRNLDIEKATASISYQANGIDYKREYIALIPKKVIAIRLTASKTKSISFTIDMDAPFKE 202
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+ ++++++ +++ D KG ++F + K+ + GT+ ++D KL
Sbjct: 203 FQKIALTDRLLLKAV---------SSSVDGKKGRVKFETQVVPKL--EGGTLE-IKDNKL 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
V+ ++ L + ++F+ N D + L + SY L H+ YQ
Sbjct: 251 VVKEANAVTLFISIGTNFN----NYQDISANENIRVKQRLAEVTGQSYKKLKANHIKSYQ 306
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ F+RV + L VT + P+ +RV F+ DP+LV L FQFGRYLLI
Sbjct: 307 QYFNRVKLDLG------VTSVMDK------PTNQRVIDFKEGNDPALVSLYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS PG+Q ANLQG WNE LSP WDS VNIN EMNYW + NL E +PLF L L
Sbjct: 355 CSSFPGSQPANLQGKWNEKLSPPWDSKYTVNINTEMNYWPAEVTNLPEMHQPLFKMLKEL 414
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G ++A Y A GW +HH TD+W + G + +WPMGGAWL H+W+HY Y
Sbjct: 415 SETGKESAGQMYKARGWNLHHNTDLWRITGPVDGG-FYGMWPMGGAWLSQHIWQHYLYNG 473
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D DFL + Y +L+G A F +D L E +L PS SPE+ ++ G V +T
Sbjct: 474 DNDFL-REYYDVLKGAAMFYVDVLQEEPKHKWLVVAPSMSPENTYLPSVG----VGAGTT 528
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MD ++ +VF+ I +E+L K + + + V + RL P ++ + + EW QD+
Sbjct: 529 MDNQLVFDVFANFIRTSEIL-KQDQSFADTVRNMINRLPPMQVGQHAQLQEWLQDWDKVN 587
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GLFPG+ I+ ++P+L +AA +L RG++ GWS+ WK LWARL D
Sbjct: 588 DKHRHVSHLYGLFPGNQISPYRHPELFEAARNSLIYRGDKSTGWSMGWKVNLWARLLDGN 647
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
AY++++ + P+ EK GG Y NLF AHPPFQID NFG T+ +AEML+QS D+
Sbjct: 648 RAYKLIEDQLSPA-PQEEKGQNGGTYPNLFDAHPPFQIDGNFGCTSGIAEMLMQSHDGDI 706
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+LLPALP DKW SG + GL ARGG + + W+DG++ + I+S N
Sbjct: 707 HLLPALP-DKWRSGSISGLIARGGFVIDMAWQDGEITNLKIHSKLGGN 753
>gi|150005495|ref|YP_001300239.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|294778696|ref|ZP_06744115.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|149933919|gb|ABR40617.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
gi|294447352|gb|EFG15933.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 819
Score = 518 bits (1333), Expect = e-144, Method: Compositional matrix adjust.
Identities = 306/765 (40%), Positives = 437/765 (57%), Gaps = 45/765 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN R+GAMV+GG E L+LN++T+W G P P+A ++L
Sbjct: 23 LKLWYKQPAGTWVEALPVGNSRMGAMVYGGTAREELQLNDETMWGGSPYRNDKPEALESL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ +G+ EA + F G YQ +G + +E H K + Y R+LDL
Sbjct: 83 PQVRELIFAGKNMEAQNLIQENFYAGKHGMPYQTIGSLIIE-TPGHEKVTD--YYRDLDL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V F RE F+S PD+V+V +++ G L+F V S L+ H
Sbjct: 140 ERAVATTRYKVDGVTFQREVFASFPDKVVVVRLTADRPGKLNFKVGYVSPLE-HKVSRKG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALEDKKLKVEGSDW 249
++++ G+ D +G++ +E + D G ++D+ + VEG+D
Sbjct: 199 KKLVLTGK------------GRDHEGVKGLIRMETQTQADVDGGKVKIDDQNITVEGAD- 245
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+V L V+S + FIN D + + ++ L YS + H+ Y++ F RV
Sbjct: 246 SVTLYVSSGT---NFINYHDISGNESKKASGYLSLALGRPYSQVLQEHIALYKEQFDRVR 302
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T ++TV +R++ F +D SL LLFQ+GRYLLISSS+PG
Sbjct: 303 LDLG---------TSERAKLETV---KRIELFNEGKDVSLAVLLFQYGRYLLISSSQPGG 350
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN L+ WD +NIN EMNYW + NLSE +PLF+ + LS+ G +T
Sbjct: 351 QPANLQGIWNNKLAAPWDGKYTININTEMNYWPAEVTNLSETHQPLFEMVKELSVTGRET 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y +GWV HH TDIW +++ K + WPMGGAWL THLW+HY Y+ D+ FL +
Sbjct: 411 ARTMYGCNGWVAHHNTDIW-RATGPVDKAFYGTWPMGGAWLTTHLWQHYLYSGDKLFLSE 469
Query: 490 RAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSS-TMDMAII 547
AYP L+G A F LD+L E + G++ T PS SPEH D K A S TMD II
Sbjct: 470 -AYPALKGAADFYLDYLTEHPEYGWMVTAPSMSPEHGPSGEDTKKASTIVSGCTMDNQII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+V S + A+ +L+ + A + L+S L RL P +I + + EW +D +P HRH
Sbjct: 529 FDVLSNALHASRILKMS--ASYQDSLRSMLNRLAPMQIGKYNQLQEWLEDLDNPNDKHRH 586
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+SH++GLFP + I+ +P L +AA+ TL +RG+E GWSI WK LWARL D HA+R+
Sbjct: 587 ISHVYGLFPSNQISPYTHPLLFQAAKNTLLQRGDEATGWSIGWKVNLWARLLDGNHAFRI 646
Query: 667 VKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+ + L+ D E + +G Y NLF AHPPFQID NFG+TA VAEML+QS ++LL
Sbjct: 647 INNMLKLLPGDEVKEAYPQGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVHLL 706
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP D W++G V+GL ARGG V + W L + I+S N
Sbjct: 707 PALP-DAWATGSVQGLVARGGFVVDMNWNGVQLDKAKIHSRLGGN 750
>gi|326204164|ref|ZP_08194024.1| Alpha-L-fucosidase [Clostridium papyrosolvens DSM 2782]
gi|325985675|gb|EGD46511.1| Alpha-L-fucosidase [Clostridium papyrosolvens DSM 2782]
Length = 775
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 309/801 (38%), Positives = 436/801 (54%), Gaps = 55/801 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA+ + +A+P+GNG LG MV GG+ E + LN DTLW+G+PG N + L +V+
Sbjct: 7 YKSPARIWEEALPVGNGGLGGMVHGGISHECIDLNNDTLWSGLPGQLINKNILPLLPEVQ 66
Query: 77 SLVDSGQ-YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
LVD G Y + + Y LG + L + L Y R L LNTA
Sbjct: 67 CLVDEGNNYDAQKLIEENILTGYSQSYLPLGRLLLTCE---LSGEINNYSRSLSLNTAVC 123
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
+Y+ G V RE S PD V+ ++ +S S + +LDS L G +IM
Sbjct: 124 ETRYTCGGVNHCREVICSYPDNVMAVHMTADKSESFTLTATLDSQLRYQVNKKGRT-LIM 182
Query: 196 EGRCPGKRIPPKANAN--------DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
G CP IP A + + I FS + I +G +E+ + + +
Sbjct: 183 TGDCPSCMIPDYVEAGKHIVYDSEEHSRSIGFSVGMRAYI---KGGSVIVEENGISINAA 239
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +L+L +S++F+G I P S DP S+ + L S+++L +RH DD+ LF R
Sbjct: 240 DEVLLVLSSSTNFEGFDIMPGSSGVDPLSKCIRTLDKAAGYSWNELLSRHKDDHSSLFKR 299
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
V + L + +P+ ER+ ++ + DPSL L+F +GRYLLI+ SR
Sbjct: 300 VCLDLGTQSQ--------------LPTDERLAAYAKGQYDPSLDSLMFAYGRYLLIACSR 345
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+DL+ W S NINLEMNYW + NLSEC +PLFD L +S G
Sbjct: 346 PGTQAANLQGIWNKDLAAPWSSNYTTNINLEMNYWPAETANLSECHKPLFDLLKDVSKAG 405
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
S+ ++ NY G+V+HH TD+W +SA G+ W WPMGGAWL H+ EHY ++ D F
Sbjct: 406 SEISRENYGCRGFVLHHNTDLWRMASAVSGQARWGFWPMGGAWLSLHIMEHYRFSCDVVF 465
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L+ Y + E F LD++ GY TNPSTSPE+ FI +G++ ++ STMD+ I
Sbjct: 466 LQNHYYIMREA-VLFFLDYMKPDKKGYYITNPSTSPENAFIDKEGRICSITKGSTMDLFI 524
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
IRE+F + + A +L K + L +++ L +L P +I + G ++EW ++ + E HRH
Sbjct: 525 IRELFESCVEAQSIL-KIDSELSGLLVQRLCKLPPFRIGKKGQLLEWPDEYVEEEPGHRH 583
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHA 663
+SHLFGLFPG I+ P+L +A K+L++R G GWS W L+ARL D ++A
Sbjct: 584 ISHLFGLFPGSVISPWHTPELAEACRKSLEQRLANGGGHTGWSCAWLICLYARLGDGDNA 643
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
YR V +L +Y NLF AHPPFQID NFGFT + EML+QS +L+L
Sbjct: 644 YRFVNQLLTR-----------SVYPNLFDAHPPFQIDGNFGFTTGIIEMLLQSHNGELHL 692
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN----NDHDSF---KT 776
LPALP + W G GLKARG TV I W++ +L +V I + SN ++SF K
Sbjct: 693 LPALP-NSWKDGSATGLKARGNYTVDILWRNHNLLKVRITAGNSNVCRIRINESFTADKY 751
Query: 777 LHYRGTSVKVNLSAGKIYTFN 797
G V V LS + FN
Sbjct: 752 FEKTGNLVFVYLSENESVNFN 772
>gi|408671718|ref|YP_006875526.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
gi|387857567|gb|AFK05662.1| alpha-L-fucosidase [Emticicia oligotrophica DSM 17448]
Length = 818
Score = 517 bits (1332), Expect = e-144, Method: Compositional matrix adjust.
Identities = 299/769 (38%), Positives = 438/769 (56%), Gaps = 53/769 (6%)
Query: 12 PLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
PLK+ + P+ + + +A+PIGNGRLGAM++G V E ++LNE T+W+G P NP A +
Sbjct: 22 PLKLWYKQPSGNTWENAMPIGNGRLGAMIYGNVEQEIIQLNEHTVWSGSPNRNDNPLALE 81
Query: 71 ALSDVRSLVDSGQYAEA----TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
L+++R L+ G + EA A + H ++ +G++ L F + Y R
Sbjct: 82 KLAEIRKLIFEGNHKEAEKLANQAIISKTSH-GQKFEPVGNLNLVFAGQE---NYKNYYR 137
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
ELD+ A ++ Y VG+V +TRE F+S D+VI+ KIS +++G++SFN ++ S +
Sbjct: 138 ELDIERAISKTTYQVGDVTYTREAFASLADRVIIMKISANKAGNVSFNANISSPQKRKTI 197
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDP--KG-IQFSAILEIKISDDRGTISALEDKKLK 243
P K + +D KG + F I IK+ + G++ + D L
Sbjct: 198 AT----------TPNKDLTLSGITSDHETVKGMVAFKGISRIKL--EGGSLQS-TDTSLV 244
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+G++ A++ + +++F+ N D D + L + +Y+ L + H+ YQK
Sbjct: 245 VKGANSAIIFISIATNFN----NYQDLSGDENKRANDYLNNAFAKTYTTLLSSHILAYQK 300
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF+RV I L E + +P+ ER+++F+ DP +V L +QFGRYLLIS
Sbjct: 301 LFNRVKIDLG------------ETDAAKLPTDERLRNFRNINDPQMVALYYQFGRYLLIS 348
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN ++P WDS +NIN EMNYW + NLSE EP + LS
Sbjct: 349 SSQPGGQPANLQGIWNNRINPPWDSKYTININAEMNYWPAEKTNLSELHEPFLKMVKELS 408
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
I G KTA+ Y A GW+ HH TDIW + A G W +W GG W+ HLWEHY YT D
Sbjct: 409 ITGQKTAKDMYGARGWMAHHNTDIWRATGAIDG-AFWGMWTAGGGWVSQHLWEHYLYTGD 467
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
+ FL AYP L G A F D+L+ + +L NP SPE+ A DG + + T
Sbjct: 468 KAFLAS-AYPALRGAAQFYADFLVPHPNKNNWLVVNPGNSPENAPAAHDG--SSLDAGVT 524
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MD I+ +VF+ ISAAE+L+ + + V+ + K +L P I + + EW D DP
Sbjct: 525 MDNQIVFDVFNKAISAAEILKIDAN-FVDSLKKLRAKLPPMHIGQHNQLQEWLDDIDDPN 583
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GL+P + I+ + P+L +A++ +L RG+ GWS+ WK WA+L D
Sbjct: 584 DTHRHISHLYGLYPSNQISAYRTPELFEASKNSLIYRGDVSTGWSMGWKVNWWAKLQDGN 643
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY++++ N + P + GG Y+NLF AHPPFQID NFG T+ + EML+QS+ +
Sbjct: 644 HAYQLIQ---NQLTPISGERGAGGTYNNLFDAHPPFQIDGNFGCTSGITEMLMQSSDGAV 700
Query: 722 YLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
+LLPALP D W +G + GLKA GG E V + WKD L ++ I SN N
Sbjct: 701 HLLPALP-DVWPTGKIAGLKAIGGFEIVEMQWKDAKLVKLVIKSNLGGN 748
>gi|21218886|ref|NP_624665.1| large hypothetical protein [Streptomyces coelicolor A3(2)]
gi|5912520|emb|CAB56146.1| putative large secreted protein [Streptomyces coelicolor A3(2)]
Length = 809
Score = 516 bits (1330), Expect = e-143, Method: Compositional matrix adjust.
Identities = 293/767 (38%), Positives = 424/767 (55%), Gaps = 59/767 (7%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
P+++ + PA+ + +A+P+GNGRLGAMV+GG +E L+LNED+LW G PGDY PDA +
Sbjct: 50 PMRLWYRAPAQEWLEALPVGNGRLGAMVFGGTDTERLQLNEDSLWAGGPGDYARPDAVRH 109
Query: 72 LSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRREL 128
L+++R LV ++ A + G P++ YQ+LGD+EL + Y REL
Sbjct: 110 LAEIRRLVVEEKWNRAQRLIDAEFLGSPSEQAAYQVLGDLELTLAG---EGEAADYEREL 166
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL TA AR Y+ G V RE F+S PDQV+V ++S G++ F S +
Sbjct: 167 DLETAVARTTYTRGGVRHVREVFASAPDQVLVVRLSADTPGAVGFTARFTSPQRSGGSAV 226
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLK 243
+ I ++G + P ++F + ++S D GT L
Sbjct: 227 DAHTIALDGVG--------GDWYGRPGSVRFRGLARAESEGGRVSTDGGT--------LT 270
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VEG+D A L++ ++S+ N D DP S + + L Y+ L TRH+ D+++
Sbjct: 271 VEGADAATLVISLATSYR----NYLDVGADPASRARNHLAPAARKPYAHLRTRHVADHRR 326
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF RV++ L S + +P+ ER+ F +DP L L FQ+GRYLL S
Sbjct: 327 LFGRVALDLGPSERA------------ELPTDERIPLFADGKDPQLAALYFQYGRYLLAS 374
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SR Q ANLQG+WN+ L+P W+S VNIN EMNYW + P NL+EC +P + L+
Sbjct: 375 CSRSPGQPANLQGLWNDSLNPAWESKYTVNINFEMNYWPAGPGNLAECWDPAVRMVHELA 434
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+G++TA+ Y A GWV+HH TD W + +A + +WP GGAWLC LW+HY +T D
Sbjct: 435 ESGTRTAKALYDAPGWVLHHNTDGW-RGTAPVDAAQYGMWPTGGAWLCVMLWDHYRFTGD 493
Query: 484 RDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
L R YP+++G F LD L ++ G+L TNPS SPE +G+ + TM
Sbjct: 494 TGAL-SRNYPVMKGAVEFFLDTLQVDAETGWLVTNPSQSPEVTHHQDEGESVSICAGPTM 552
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE- 601
DM ++R++F A AAEVL+++ LV +V + RL PT++ G I EW D+++
Sbjct: 553 DMQLLRDLFDAYRQAAEVLDRDSR-LVGRVTEVRDRLAPTRVGHLGQIQEWLVDWEEAAL 611
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
V RH+SHL+G+FP IT P+L AA+K+L+ RG G GWS+ WK +WARL +
Sbjct: 612 VRSRHVSHLYGVFPSAQITPRGTPELAAAAKKSLELRGTAGQGWSLAWKINMWARLLEPA 671
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
AY + L +L+ P NLF HPPFQID NFG + + EML+QS ++
Sbjct: 672 RAY---QHLADLLTPARTA-------PNLFDLHPPFQIDGNFGGVSGITEMLLQSHAGEI 721
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
LLPALP + W +G +GL+ARGG V + W + + S N
Sbjct: 722 ELLPALP-EAWPTGSFRGLRARGGFEVDLEWTGAGITRAEVRSLLGN 767
>gi|399031123|ref|ZP_10731262.1| hypothetical protein PMI10_03140 [Flavobacterium sp. CF136]
gi|398070592|gb|EJL61884.1| hypothetical protein PMI10_03140 [Flavobacterium sp. CF136]
Length = 821
Score = 516 bits (1330), Expect = e-143, Method: Compositional matrix adjust.
Identities = 293/770 (38%), Positives = 443/770 (57%), Gaps = 41/770 (5%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A + + + LK+ +N PA + +A+P+GNGRLGAMV+G E L+LNE+T+W G P
Sbjct: 18 ASTAQSKSELKLWYNKPATIWNEALPLGNGRLGAMVFGDPAVERLQLNEETIWAGSPNSN 77
Query: 64 TNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYA 120
+ + +AL VR LV G++ EA A+ + D YQ G + F H KY
Sbjct: 78 AHTKSIEALPKVRKLVFEGKFDEAQDLATRDIMSQTNDGMPYQTFGSAYISFP-GHQKYT 136
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y R+LD+ A+A+VKY+V +EFTRE +S DQVIV K+S S+ G ++ NV ++S
Sbjct: 137 --NYYRDLDIENASAKVKYTVNGIEFTREILTSFSDQVIVVKLSASQPGQITANVFMNSP 194
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+D NQII+ G N ++F +E K + G +SA +
Sbjct: 195 IDKTVPSTEGNQIILSGVG--------TNFEGVKGKVKFQGRIEAK--NKGGEVSA-SNG 243
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L + +D L + +++F N D +D ++S L+ + + + H+
Sbjct: 244 ILIINKADEVTLYISIATNFK----NYQDITEDEVAKSKVYLEKAISKDFETIKKAHVAY 299
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQK F+RV++ L + D + P+ ER++ F+ + DP L L FQFGRYL
Sbjct: 300 YQKFFNRVALDLGSN------DAIKK------PTNERIRDFKKEFDPQLASLYFQFGRYL 347
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSS+PG Q ANLQGIWN+ ++P WDS NIN EMNYW + NL+E EP
Sbjct: 348 LISSSQPGGQPANLQGIWNDMVTPPWDSKYTTNINAEMNYWPAEVTNLTEMHEPFIQMAK 407
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LS+ G++TA+ Y A+GWV+HH TDIW + +A +W GGAW+ LWE Y Y
Sbjct: 408 ELSVAGAETAKTMYNANGWVLHHNTDIW-RVTAPVDSAASGMWMTGGAWVSQDLWERYLY 466
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
T D ++L K YP+++G A F LD++I + + GYL PS+SPE+ GK + ++
Sbjct: 467 TGDINYL-KEIYPVIKGAADFFLDFMITDPNTGYLVVVPSSSPENTHAGGTGK-STIASG 524
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
+TMD ++ ++FS +I A++++ +E+ +K+ +L ++ P KI + + EW D+ +
Sbjct: 525 TTMDNQLVFDLFSNVIKASKLVAPDEN-YTKKLSDALAKMPPMKIGKHSQLQEWQDDWDN 583
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
P+ +HRH+SHL+GLFP + I+ K P+L + A+++L R +E GWS+ WK LWARL D
Sbjct: 584 PKDNHRHVSHLYGLFPSNQISPIKTPELFEGAKQSLIYRTDESTGWSMGWKVNLWARLLD 643
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
HAY++++ +LV + K GG Y N+ AH PFQID NFG TA +AEML+QS +
Sbjct: 644 GNHAYKLIQDQLHLVTADQRKG--GGTYPNMLDAHQPFQIDGNFGCTAGIAEMLMQSQED 701
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
++LLPALP W G ++GL RGG + + WK+ + + +YS N
Sbjct: 702 AIHLLPALP-TVWKDGSIQGLVTRGGFVIDMTWKNNKVSTLKVYSKLGGN 750
>gi|390943730|ref|YP_006407491.1| hypothetical protein Belba_2169 [Belliella baltica DSM 15883]
gi|390417158|gb|AFL84736.1| hypothetical protein Belba_2169 [Belliella baltica DSM 15883]
Length = 836
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 293/759 (38%), Positives = 442/759 (58%), Gaps = 46/759 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ ++ PA+ + +A+PIGNGRLGAMV+G E ++LNE+T + G P NP+A KAL
Sbjct: 45 MKLWYDRPAQQWVEALPIGNGRLGAMVFGNPQEEVIQLNENTFYAGHPYRNDNPNALKAL 104
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPADV-YQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+Y +A FG P + YQ +G+++L++ D E Y RELDL
Sbjct: 105 EGVRKLIFDGEYVQAQDTIDQNFFGGPHGMPYQTIGNLKLKYQDES---EVENYYRELDL 161
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A ++ V F+ + SS PDQVIV KI+ + S+SF+ ++D G
Sbjct: 162 EYAVVSNRFKKSGVNFSTKIISSFPDQVIVAKITADKPKSISFSATMDRPGPFEITTTGE 221
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSD 248
+Q+IM G + D +GI+ + + +K + G+I + E+K++ + +D
Sbjct: 222 DQLIMSG------------ISSDHEGIKGAVKFQANVKFVNKNGSIKS-ENKEIIISEAD 268
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ + +++F +N D D + +S S L+ + +Y +H+ DY+ LF RV
Sbjct: 269 EVTIYISIATNF----VNYKDISADASEKSTSLLEKAIENDFERIYKKHVTDYRNLFDRV 324
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L +S D V +P+ +R+ F D L L FQFGRYLLI++SRPG
Sbjct: 325 QLDLGKS--DAVN----------LPTDKRIAQFAEGNDAHLAALYFQFGRYLLIAASRPG 372
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQGIWN ++P WDS VNIN EMNYW + NLSE EP LS +G +
Sbjct: 373 GQPANLQGIWNHQMNPAWDSKYTVNINAEMNYWPAEITNLSELHEPFIQMAKDLSESGQQ 432
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y A GWV+HH TD+W + + +WP+GGAW+ HL+E Y+++ D +L
Sbjct: 433 TARNMYGARGWVLHHNTDLW-RVTGPIDFAAAGMWPLGGAWVSQHLFEKYDFSGDEKYL- 490
Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
K YP+ + A+F LD+L++ G+ +PS SPE+ I + V+ +TMD ++
Sbjct: 491 KSVYPVAKEAATFFLDFLVKDPQTGFWVVSPSVSPEN--IPYQFHNSAVAAGNTMDNQLV 548
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
++F+ I AAE+L +ED L+ ++ + L L P +I + G + EW D+ +P+ +HRH+
Sbjct: 549 FDLFTKTIRAAEIL-GDEDDLINEMKEKLSMLPPMQIGKWGQLQEWMGDWDNPQDNHRHV 607
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHL+GL+P + I+ + P+L AA+ +L RG+E GWS+ WK LWAR D HAY+++
Sbjct: 608 SHLYGLYPSNQISPYRTPELFGAAKTSLLARGDESTGWSMGWKVNLWARFLDGNHAYKLI 667
Query: 668 K-RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
K +L + P+ ++ GG Y NLF +HPPFQID NFG TA +AEMLVQS +++LPA
Sbjct: 668 KDQLSPAILPDGKER--GGTYPNLFDSHPPFQIDGNFGCTAGIAEMLVQSHDGAIHILPA 725
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LP D W +G V GL+ARGG VS+ WK+ +V I SN
Sbjct: 726 LP-DAWENGSVCGLRARGGFEVSVDWKNAKPEKVSILSN 763
>gi|408378982|ref|ZP_11176577.1| alpha-L-fucosidase [Agrobacterium albertimagni AOL15]
gi|407747109|gb|EKF58630.1| alpha-L-fucosidase [Agrobacterium albertimagni AOL15]
Length = 805
Score = 516 bits (1329), Expect = e-143, Method: Compositional matrix adjust.
Identities = 304/811 (37%), Positives = 434/811 (53%), Gaps = 64/811 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + P ++F +A+P+GNG LGAM+ GG + + LN+D W G P L
Sbjct: 27 RLWYTAPGRNFNEALPLGNGSLGAMIRGGTAEDLVCLNDDRFWAGRDAPAPVATGPLVLE 86
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+VR + +G A A A KL Y D+ +++D A E Y R+LDLNT
Sbjct: 87 EVRRRLFAGDVAGAEALVEQKLLTDFNQPYLTAADLVIQWDHD----AVERYTRQLDLNT 142
Query: 133 ATARVKYSVGNVEFTREH-FSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A A V Y V R FSS PDQV V ++ +SL S + S ++ +
Sbjct: 143 AVAEVNYVASRVGGVRRRAFSSFPDQVFVLDAGFADPSQARTVLSLSSKTRHVSRMSARD 202
Query: 192 QIIM-------EGRCPGKRIPPKANA--NDDP--KGIQFSAILEIKISDDRGTISALEDK 240
I++ + R RI N DP + + + +L +S + +
Sbjct: 203 LIVVADAPSMVDWRGIDDRIRDGENIFYEVDPPRRCLTVACVLAASVS--------VHGE 254
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L V G D+ VL+ + S G + + ++ L++ + +S L RH+
Sbjct: 255 GLVV-GGDFTVLVATSVGSDVGLLLE----------DCLARLEAAESRGFSALLERHVAA 303
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRY 359
++ L+ R ++ L RSP + +P+ ER+ + DP+L LLF +GRY
Sbjct: 304 HRALYDRAALTL-RSPV----------GLSALPTDERLHRQASKMRDPALEALLFNYGRY 352
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
L+I+SSRPG++ NLQGIWN+ + P W S +NINL+MNYW + PCNL+EC EPLFDF+
Sbjct: 353 LMIASSRPGSRAINLQGIWNDKVQPPWWSNYTININLQMNYWPAEPCNLAECHEPLFDFV 412
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG--------KVVWALWPMGGAWLC 471
LS+ G++TA V Y GWV HH+ D +++A + + LW MGGAWLC
Sbjct: 413 KNLSLAGARTASVQYGMRGWVAHHQVDGRFQTTAIGALNGRAYDFPIRYGLWTMGGAWLC 472
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
H W+HY + D FL + A+P+L A F LDW++E DG L T PSTSPE+ ++ PDG
Sbjct: 473 QHFWQHYLFNGDTKFLRETAWPILRNAAEFYLDWVVELPDGSLTTAPSTSPENSYLLPDG 532
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+S +TMD+AI+RE FS I+ AA VL +D + +LPRL IA DG ++
Sbjct: 533 TRHALSIGATMDIAILREFFSTIVDAASVLGIPDDPIAISASAALPRLPGYGIAADGQLL 592
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D E HRH+SHL+G+FP I+ + P+L AA + L++RG+ G GWS WK
Sbjct: 593 EWREDLPQAEHPHRHVSHLYGVFPAAQISPTETPELAAAAARVLEERGDTGTGWSFAWKA 652
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDP--EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
ALWARL E AYR + L N VDP E + GGLY+NL A PPF IDANFG+T AV
Sbjct: 653 ALWARLGRPEMAYRNIGHLLNPVDPAIELQADLGGGLYTNLLTACPPFNIDANFGYTGAV 712
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
AEMLVQS ++ +LPALP W+ G +GL+ RG + + W+ G L E+ I S
Sbjct: 713 AEMLVQSQSGEIVILPALP-KAWADGEARGLRCRGQVEIDMVWRSGRLAELRIKSQIMQA 771
Query: 770 DHDSFKTLHYRGTSVKVNLSAGKIYTFNRQL 800
+T G + + L AG+ R L
Sbjct: 772 -----RTFRLDGEPLALMLPAGREVRLLRTL 797
>gi|224538426|ref|ZP_03678965.1| hypothetical protein BACCELL_03320 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519961|gb|EEF89066.1| hypothetical protein BACCELL_03320 [Bacteroides cellulosilyticus
DSM 14838]
Length = 828
Score = 515 bits (1326), Expect = e-143, Method: Compositional matrix adjust.
Identities = 297/778 (38%), Positives = 436/778 (56%), Gaps = 50/778 (6%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M N + +P+ + ++ PA+++ +A+P+GNGRLGAMV+G E ++LNE+T+ G P
Sbjct: 22 MGNVNVYAQKHPI-LWYDKPAQYWEEALPLGNGRLGAMVYGNPVHEEIQLNEETVSAGSP 80
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKL-----FGHPADVYQLLGDIELEFDD 114
+ NP+A ALS +R L+ G+Y EA A A K+ FG P YQ +G + L+F
Sbjct: 81 YNNYNPEAKNALSTIRQLIFDGKYPEAQALAETKILSKNGFGMP---YQTVGSLRLDFQG 137
Query: 115 SHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
Y+ +RRELDL A YSV V++ RE F+S DQ+I+ +++ S++G L+F+
Sbjct: 138 QE-NYS--NFRRELDLERAVTTTTYSVDGVKYKREVFASLTDQLIIIRLTASQAGKLTFS 194
Query: 175 VSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
+L G N++IMEG G P A + F A +E+ D +G
Sbjct: 195 AALTCPQKVDVSTLGKNRLIMEGTTKGDGFTPGA--------VCFRADVEL---DLQGGK 243
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
S D L + + A + + +++F IN D +P + L++ R Y+
Sbjct: 244 SVANDTLLSITNATSATIYIAMATNF----INYKDISGNPVERNKVYLKNARK-PYTKAL 298
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H++ YQK + RV++ L +P+ P+ RVK F T DP LV L F
Sbjct: 299 QAHVNMYQKYYRRVALDLGYTPQA------------DKPTDIRVKEFATSNDPHLVALYF 346
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYLLIS S+PG Q ANLQGIWN +P W NIN EMNYW + NL E EP
Sbjct: 347 QYGRYLLISCSQPGGQPANLQGIWNHKTNPAWRCRYTTNINAEMNYWPAEVTNLREMHEP 406
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTH 473
+ L NG + A+ Y GW++HH TD+W + A DR WP AWLC H
Sbjct: 407 FLQMIRELYENGQEAAREMYGCRGWMLHHNTDLWRMNGAVDRPYC--GPWPTCNAWLCQH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGK 532
LW+ Y Y+ D+++L YP+++ + F +D+L++ + GY+ PS SPE+ GK
Sbjct: 465 LWDRYLYSGDKEYLNS-IYPIMKSASEFFVDFLVKDPNTGYMVVTPSNSPENSPKLWKGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
+ TMD ++ ++FS +AA++L +++ + +L RL P ++ + G + E
Sbjct: 524 SNLFA-GVTMDNQLVFDLFSNTNAAAQILNRDKQ-FCDTILSLKKRLPPMQVGQYGQLQE 581
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W +D+ +P+ HHRH+SHL+GLFPG+ I+ +P L +AA TL +RG+ GWS+ WK
Sbjct: 582 WFEDWDNPKDHHRHISHLWGLFPGYQISPYSSPVLFEAARNTLIQRGDPSTGWSMGWKVC 641
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
WAR D HA++++ NLV PE +K GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 642 FWARCLDGNHAFKLITNQLNLVSPEIQKGQGGGTYPNLFDAHPPFQIDGNFGCVAGIAEM 701
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
L+QS ++LLPALP D W G + GL+ARGG E +S+ WK+G + V I S N
Sbjct: 702 LMQSHDGAVHLLPALP-DVWKDGEIAGLRARGGFEIISLKWKNGRIESVTIKSTIGGN 758
>gi|340616355|ref|YP_004734808.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339731152|emb|CAZ94416.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 791
Score = 515 bits (1326), Expect = e-143, Method: Compositional matrix adjust.
Identities = 310/795 (38%), Positives = 455/795 (57%), Gaps = 53/795 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+ + +A+P+GNG LGAMV+G E ++ NEDT W G P + P+ L
Sbjct: 37 LKLWYDRPAEIWEEALPVGNGSLGAMVFGRPVMERIQFNEDTFWAGGPITPSKPETKSYL 96
Query: 73 SDVRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELE---FDDSHLKYAEETYRREL 128
+VR LV G+Y EA A K + G Y +GD+ +E DD +RREL
Sbjct: 97 PEVRKLVFDGKYKEADALINKHIIGPKMMPYLPMGDVVIEMKGLDDI------TDFRREL 150
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL TA ++V +S + + RE FS+ + IV ++ S+ SL+F+++LD+ + S V
Sbjct: 151 DLRTAISKVGFSSKGIAYKREVFSAVEENAIVIRLEASKEKSLNFSIALDNQIGATSQVL 210
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N + + G P + AN + ++F + L I +D I+ D + V G+
Sbjct: 211 DANNLELSGTAPDR-----ANRKSE---LRFVSRLNIGENDGHTIIN---DSTITVSGAS 259
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
LLL A+++F N D +P + + L + S+ + +H+ ++Q+LF R+
Sbjct: 260 KVTLLLFAATNFK----NYKDVSGNPDFKCKTLLDLVHLKSFEQIREQHITNHQRLFERL 315
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
D+ T++ S +P+ ER++ FQ + DPSLV L +QFGRYLL+SSSR
Sbjct: 316 DF-------DMPTNSNS-----GLPTNERLEKFQEETDPSLVALYYQFGRYLLMSSSRGN 363
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+Q ANLQGIWN++ +P WDS NINLEMNYW + NL+EC PLF + L+ G+
Sbjct: 364 SQPANLQGIWNQNPTPPWDSKYTTNINLEMNYWPAEASNLAECAIPLFTSIRQLAEAGAV 423
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ NY A GWV+HH TDIW ++ G W +WP GGAWL THLWEHY ++ D FL
Sbjct: 424 TAKNNYGADGWVLHHNTDIWKTTTPLDG-AAWGIWPTGGAWLTTHLWEHYLFSEDEAFL- 481
Query: 489 KRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ YP+++G A F ++ L+ + GYL TNPS SPE+ + +G ++ V MD +I
Sbjct: 482 RLHYPVIKGAAEFFVNTLVAHPEYGYLVTNPSISPENRHM--EGNIS-VCAGPAMDTQLI 538
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHHR 605
R++F+ I A+E+L + D E ++++ +L P KI +G + EW D+ K PE+ HR
Sbjct: 539 RDLFAQCIKASEILNVDSD-FRELLVETRSKLAPDKIGSEGQLQEWLDDWDMKVPELQHR 597
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GL+PG T EK P AA K+L+ RG+ G GWS+ WK ALWARL+D +HA++
Sbjct: 598 HVSHLYGLYPGAQFTPEKTPKEWNAARKSLEIRGDGGTGWSLGWKVALWARLNDGDHAFK 657
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++K L D GG Y NLF A PPFQID NFG A + EML+QS N+ LL
Sbjct: 658 ILKTLLKSTDFVGHGG-PGGTYPNLFDACPPFQIDGNFGALAGINEMLLQSQ-NNRVLLL 715
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVK 785
+ G ++G++ARGG +SI WK+G L V I S N + L Y S+
Sbjct: 716 PALPAELKDGSIQGIRARGGFELSIAWKEGKLMAVKILSKKGNTCN-----LVYGDKSMA 770
Query: 786 VNLSAGKIYTFNRQL 800
+ AGK Y + +L
Sbjct: 771 LETEAGKSYLLDGEL 785
>gi|393781509|ref|ZP_10369704.1| hypothetical protein HMPREF1071_00572 [Bacteroides salyersiae
CL02T12C01]
gi|392676572|gb|EIY70004.1| hypothetical protein HMPREF1071_00572 [Bacteroides salyersiae
CL02T12C01]
Length = 827
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 307/771 (39%), Positives = 441/771 (57%), Gaps = 64/771 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E L+LNE+TLW G P + NP+ K +
Sbjct: 38 KLWYDRPAQVWTEALPLGNGRLGAMVFGNPAVEQLQLNEETLWAGRPNNNANPEGLKYIP 97
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +Y + Y RE
Sbjct: 98 KVRELVFAGKYLEAQTLATEKVMSKTNSGMP---YQSFGDLRISFP-GHTRYRD--YYRE 151
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-----DSLLD 182
L+L++A +V Y V +V + RE F+S DQVI+ +++ G ++FN L D+L+D
Sbjct: 152 LNLDSACVKVGYRVDDVNYLREMFTSFTDQVIMVRLTADRPGKITFNAVLTTPHQDALVD 211
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKK 241
+G C + ++ ++ KG ++F L ++ +G + D
Sbjct: 212 T------------DGEC--VTLSGVSSWHEGLKGKVEFQGRLATRV---QGGAVSCRDGV 254
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L VEG+D AV+ + +++F IN D D + L+ +Y++ H+D +
Sbjct: 255 LTVEGADEAVVYVSLATNF----INYKDISADQVERARQYLEKAMQKNYTEAKQSHVDFF 310
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
+ RVS+ L T S E + P+ +RV+ F+T D LV FQFGRYLL
Sbjct: 311 KAYMDRVSLNLG---------TGSTEQL---PTDKRVEKFKTTHDAGLVATYFQFGRYLL 358
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SS+PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPLF
Sbjct: 359 ICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLFRMTRE 418
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+S G +TA++ Y A GWV+HH TDIW + + K +WP GGAWLC HLWE Y YT
Sbjct: 419 VSETGKETAEIMYGAKGWVLHHNTDIW-RITGPLDKAPSGMWPSGGAWLCRHLWERYLYT 477
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
D +FL + AYP+++ F + ++ E +L PS SPE+ GK A +
Sbjct: 478 GDVEFL-RSAYPIMKEAGRFFDETMVKEPLHNWLVVCPSNSPENTHAGSGGK-ATTAAGC 535
Query: 541 TMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
TMD ++ +++++II+ A +L + + + +E+ LK +P P +I G + EW D+
Sbjct: 536 TMDNQLVFDLWTSIIATARLLGVDTEYASHLEERLKEMP---PMQIGRWGQLQEWMFDWD 592
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL
Sbjct: 593 DPDDIHRHVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLL 652
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D HAY+++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 653 DGNHAYKLITEQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLMQSHD 709
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+YLLPALP D W G +KG+ ARGG + I WK G + +V I S + N
Sbjct: 710 GFIYLLPALP-DVWEEGEIKGIVARGGFEMDIRWKKGKVEQVVIRSRHGGN 759
>gi|189464329|ref|ZP_03013114.1| hypothetical protein BACINT_00670 [Bacteroides intestinalis DSM
17393]
gi|189438119|gb|EDV07104.1| hypothetical protein BACINT_00670 [Bacteroides intestinalis DSM
17393]
Length = 794
Score = 514 bits (1325), Expect = e-143, Method: Compositional matrix adjust.
Identities = 293/760 (38%), Positives = 422/760 (55%), Gaps = 45/760 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK +T+A+P+GN RLGAMV+GGV +E ++LNE+T+W G P +P A L
Sbjct: 8 LKLWYKQPAKVWTEALPLGNSRLGAMVYGGVVNEQIQLNEETVWGGGPHRNDSPKALGVL 67
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ SG+ EA F G +Q +G + LEF+ H Y++ YRRELDL
Sbjct: 68 PTVRELLFSGREKEAEKVIADNFFTGQHGMPFQTIGSLMLEFE-GHADYSD--YRRELDL 124
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V+Y +G V +TR F+S D ++ +I + G+++F + +
Sbjct: 125 EKAIASVRYKIGEVNYTRTVFTSLADNALIVRIEADKPGAVNFTTRYSTPYKEYEIKKNG 184
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++ G P A I+F +IK ++G ++ D ++V+G+D A
Sbjct: 185 KSLLLSGHGSAHEGIPGA--------IRFETRTQIKA--EKGKVNVTNDC-IEVKGADAA 233
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
V+ + A+++F +N D + T + L Y+ H + YQKLF RVS+
Sbjct: 234 VIYVTAATNF----VNYKDVSANETRRATEFLSQAMKRPYAQALAAHEEAYQKLFGRVSL 289
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ S K+ ++ R+K F +D LV L+FQFGRYLLISSS+PG Q
Sbjct: 290 NVGASSKE--------------ETSYRIKHFNEGKDLGLVALMFQFGRYLLISSSQPGGQ 335
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
A LQGIWN +L WD +NIN EMNYW + NL E +PLF + LS + TA
Sbjct: 336 PAGLQGIWNHELLAPWDGKYTININTEMNYWPAEVTNLPEMHQPLFQMVKELSESAQGTA 395
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y GW +HH TD+W + G +WP+GGAWL HLW+HY YT D+ FL+
Sbjct: 396 RTLYDCRGWTVHHNTDLWRMAGPVDGASY--VWPLGGAWLSQHLWQHYLYTGDQAFLQT- 452
Query: 491 AYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
AYP L+G A F LD+L+E G++ PS SPE P G ++ TMD I+ +
Sbjct: 453 AYPALKGAADFFLDFLVEHPKYGWMVCAPSMSPEQ---GPPGTGTMLTAGCTMDTQIVLD 509
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
++++SA ++L + + + + + RL P +I + + EW D DP HRH+SH
Sbjct: 510 ALTSVLSATKLLYPDHTSYCDSLQGMIKRLPPMQIGKHNQLQEWLADVDDPHNDHRHVSH 569
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
L+GL+P + I+ +P L +AA+++L RG+ GWSI WK LWARL D +HAY ++K
Sbjct: 570 LYGLYPSNQISPYAHPQLFQAAKRSLLYRGDMATGWSIGWKINLWARLLDGDHAYTIIKN 629
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPW 729
+ LV+ + + +G Y N+F AHPPFQID NFGFTA VAEML+QS L+LLPALP
Sbjct: 630 MLKLVE---KGNPDGRTYPNMFDAHPPFQIDGNFGFTAGVAEMLLQSHDEALHLLPALP- 685
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
WS G VKGL ARG V + W G+L + S N
Sbjct: 686 TAWSKGSVKGLVARGAFEVDMDWDGGELTTAIVTSRIGGN 725
>gi|409196602|ref|ZP_11225265.1| alpha-L-fucosidase [Marinilabilia salmonicolor JCM 21150]
Length = 823
Score = 514 bits (1323), Expect = e-142, Method: Compositional matrix adjust.
Identities = 297/770 (38%), Positives = 431/770 (55%), Gaps = 60/770 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK + +A+PIGNGRLGAMV+G E ++LNE+T W+G P NP A +AL
Sbjct: 30 LKLWYDKPAKVWNEALPIGNGRLGAMVFGDPTLENIQLNEETFWSGSPSRNDNPKAIEAL 89
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+VR+L+ G+Y EA + +L G +YQ +G++ L F+ H Y+ Y R
Sbjct: 90 PEVRNLIFEGKYHEAEKIVNENMVAEQLHG---SMYQTIGNLNLTFE-GHENYS--NYSR 143
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
ELD+ A Y+V +V F RE F+S PDQVIV K+S + SLSF +L L ++
Sbjct: 144 ELDIEKALHTTSYTVDDVNFKREIFASFPDQVIVVKLSADQPESLSFTANLIGPLAKNTK 203
Query: 187 VNGNNQIIMEG------RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + M G R GK ++F+ + +I +D G SA DK
Sbjct: 204 AVDASTLEMTGISGNHERVEGK--------------VEFNTLAKILNTD--GATSADGDK 247
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ S+ +L+ +A++ F++ D + L + + YS++ H+ D
Sbjct: 248 ITVKDASEVVILISMATN-----FVDYKTLTADENEKCRKFLTAAQTKEYSEIKEAHIRD 302
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+K F R S+ L +P P+ R+K+F DP+LV L +QFGRYL
Sbjct: 303 YRKYFTRSSLDLGTTPAS------------QRPTDVRIKNFSHTNDPALVSLYYQFGRYL 350
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSRPG Q ANLQGIWN +P WDS +NIN EMNYW + NL E EPL + +
Sbjct: 351 LISSSRPGGQPANLQGIWNNSTNPAWDSKYTININTEMNYWPAEKTNLPELHEPLIEMVK 410
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LS GS+TA+ Y +GWV HH TDIW + G W +WPMGGAWL HLW+ Y Y
Sbjct: 411 DLSEAGSQTARNMYGCNGWVTHHNTDIWRITGVVDG-AFWGMWPMGGAWLTQHLWDKYLY 469
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+ +R++L YP+++ F D+L+E +G+L NPS SPE+ AP G+ V+
Sbjct: 470 SGNREYLAS-VYPIMKSACKFYQDFLVEEPSNGWLVVNPSNSPEN---APVGR-PSVTAG 524
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
+TMD I+ ++F+ AA +L ++E L+ + + RL P +I + G + EW +D
Sbjct: 525 ATMDNQILFDLFTKTKKAATLLNEDE-KLINDFQRIIDRLPPMQIGQHGQLQEWMEDLDS 583
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
P+ HRH+SHL+GL P + I+ +P+L +AA T++ RG+ GWS+ WK WAR+ D
Sbjct: 584 PDDKHRHISHLYGLHPSNQISPYSSPELFEAARTTMKHRGDISTGWSMGWKVNFWARMLD 643
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
HA+++++ LV ++ GG Y NL AHPPFQID NFG +AEML+QS
Sbjct: 644 GNHAFKLIQDQLTLVGTDNNSGEGGGTYPNLLDAHPPFQIDGNFGCAVGIAEMLLQSHDG 703
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
++ LPALP D W +G + GL+ GG VS W++G L + I S N
Sbjct: 704 TIHFLPALP-DDWKNGEITGLRTPGGFEVSFKWQNGHLIKAEIKSTLGGN 752
>gi|304404820|ref|ZP_07386481.1| Alpha-L-fucosidase [Paenibacillus curdlanolyticus YK9]
gi|304346627|gb|EFM12460.1| Alpha-L-fucosidase [Paenibacillus curdlanolyticus YK9]
Length = 769
Score = 514 bits (1323), Expect = e-142, Method: Compositional matrix adjust.
Identities = 299/814 (36%), Positives = 451/814 (55%), Gaps = 66/814 (8%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N + + PA+ + +A PIGNG+LGAMV+G E ++LNE+++W G P N +A
Sbjct: 2 NNTTLRYKKPAQEWVEAFPIGNGKLGAMVFGRPFEERIQLNEESVWHGGPLQRDNVEALP 61
Query: 71 ALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
L ++R L+ +GQ EA + + + P D+ YQ LG++ ++FD + Y RE
Sbjct: 62 NLPEIRRLLFAGQPDEAEKLAFQTMISTPEDLGPYQTLGELAIQFDRED-QGEPSDYVRE 120
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL T V Y G V F R+ F+S PD VIV ++S L F +L S +
Sbjct: 121 LDLATGVVSVHYEAGGVRFRRDSFASGPDGVIVYRLSADRQRRLFFTSTLSREEGTVSPL 180
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
G++ ++++G+C P+G+Q++A+L +I + G +SA E + + +
Sbjct: 181 -GSDTLVLQGQC-------------GPEGVQYAAVL--RIVCEGGRLSA-EGNTIMISDA 223
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D A + + A+++F + D + S L + + ++ H+ +++ LF R
Sbjct: 224 DTATIYIAAATTF---------READLLAVSEQKLNAAIAKGFEEVRRSHIAEHRGLFDR 274
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSR 366
V+++L ++ D +E +++P+ ER+ F+ D + L+EL F FGRYLL+SSSR
Sbjct: 275 VALELRKA-----GDHPAEH--ESLPTDERLARFRNGDRESGLIELFFHFGRYLLLSSSR 327
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
G+ ANLQGIWN+ ++P W+S H NIN++MNYW + NL+EC EPLFD++ L +NG
Sbjct: 328 RGSLPANLQGIWNDSMTPPWESDFHTNINIQMNYWPAEVTNLAECHEPLFDYIDQLRVNG 387
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+TAQ Y A G+ +HH +++WA +S + WPMGGAWL H+WEHY Y D F
Sbjct: 388 RRTAQAMYGARGFCVHHTSNLWADASITSRWLPAMFWPMGGAWLTLHMWEHYLYGGDIAF 447
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L RAYP + A F LD++++ G T PS SPE+ + P+G + +MD +
Sbjct: 448 LRDRAYPAMRESALFFLDFMVQDPQGRWVTAPSVSPENSYRLPNGNEGALCAGPSMDTQM 507
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
IR +F A ++A E+LE++ D + ++ + L + IA +G++MEWA ++++PE HRH
Sbjct: 508 IRMLFEACLTALELLEES-DEIASELRERLAGMPEQGIASNGTLMEWADEYEEPEPGHRH 566
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHA 663
+SHLF L P IT+E P L AA KTL++R G GWS W WARLHD E A
Sbjct: 567 ISHLFALHPADQITLEGTPALAAAARKTLERRLSHGGGHTGWSRAWIIHFWARLHDGEEA 626
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
Y L L+D ++ NLF HPPFQIDANFG T+AVAEML+QS + L
Sbjct: 627 Y---ANLAGLLDKS--------VHPNLFGDHPPFQIDANFGGTSAVAEMLLQSHAGIIEL 675
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDL------------HEVGIYSNYSNNDH 771
LPALP W G V GL+ RGG I W +G L + +N+S +
Sbjct: 676 LPALPM-AWPDGRVAGLRVRGGAETDIAWSEGQLSSAELRVTRDGAFRIRTAANWSIRCN 734
Query: 772 DSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNL 805
DS + G+ V+V++ AG T + NL
Sbjct: 735 DSVVSPSSDGSIVQVSVRAGDRITIHAHELNINL 768
>gi|374296937|ref|YP_005047128.1| hypothetical protein [Clostridium clariflavum DSM 19732]
gi|359826431|gb|AEV69204.1| hypothetical protein Clocl_2638 [Clostridium clariflavum DSM 19732]
Length = 742
Score = 513 bits (1321), Expect = e-142, Method: Compositional matrix adjust.
Identities = 304/788 (38%), Positives = 439/788 (55%), Gaps = 68/788 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA + +A+PIGNGR+GAM++G + +E ++LNED++W G D NPDA K L
Sbjct: 3 KLWYTKPAGCWEEALPIGNGRMGAMIFGSIETEHIQLNEDSVWYGAFVDRNNPDALKNLP 62
Query: 74 DVRSLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R L+ GQ EA V L G P YQ LGD+ + F ++ + Y R L L
Sbjct: 63 KIRELIIKGQIPEAEELMVYALSGIPQSQRPYQSLGDLTIRFKG--MEGDKSGYIRCLSL 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVN 188
+ A VK V + RE F S D V+V +I+ +SF+ L + D V
Sbjct: 121 DDAIHTVKVKVAENTYKRETFLSAADDVLVMRITSDGDKKISFSALLTRERFYDRVIKV- 179
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G + ++++G N G+ F ++ +K + G+ + + L V +D
Sbjct: 180 GQDAVMLDG-------------NLGKGGLDF--VMMLKAVAEGGSCDVV-GEHLIVNDAD 223
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
LL A ++F F N + K L N SY DL RH++DY L++RV
Sbjct: 224 AVTLLFTAGTTFR--FQNLKEQLK-------KILNDAANRSYDDLRKRHVEDYMSLYNRV 274
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
S +L+ + E + + + ER+K + E D L +L F FGRYLLIS SR
Sbjct: 275 SFELNGT-----------EKYEELTTEERLKKAKEGEVDKGLAKLYFDFGRYLLISCSRE 323
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G+ ANLQG+WN+D++P WDS +NIN +MNYW + CNLSEC +PLFD + + NG
Sbjct: 324 GSLPANLQGVWNKDMNPAWDSKYTININTQMNYWPAEVCNLSECHKPLFDLIKRMVPNGQ 383
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y G+V HH TDIW ++ + + W MG AWLCTHLW HY YT D+DFL
Sbjct: 384 KTARTMYNCRGFVAHHNTDIWGDTAVQDHWIPASYWVMGAAWLCTHLWMHYEYTQDKDFL 443
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
K A+P++ F LD+LIE GYL+T PS SPE+ +I P+G V+ +TMD I+
Sbjct: 444 -KEAFPIMREAVLFFLDFLIE-DKGYLKTCPSVSPENTYILPNGVQGSVTIGATMDNQIL 501
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
R++FS I AAE+L + D + + +++ +L PT+I G+IMEW +D+ + E HRH+
Sbjct: 502 RDLFSQCIKAAEIL-RVCDQMNRDIEETVKKLEPTRIGSRGNIMEWTEDYDEAEPGHRHI 560
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAY 664
SHL+GL P IT++ P+L +AA +TL+ R G GWS W L+A+L D E AY
Sbjct: 561 SHLYGLHPSTQITVDGTPELAEAARRTLELRLAHGGGHTGWSRAWIINLYAKLWDGEEAY 620
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+ +++L + N+F HPPFQID NFG TAA+AEMLVQST + LL
Sbjct: 621 KNLEQLIS-----------KSTLPNMFCNHPPFQIDGNFGGTAAIAEMLVQSTEQRIVLL 669
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
PALP W +G +KGL RGG +S+ W+D +L + I + H + Y+ +
Sbjct: 670 PALP-KVWKNGSIKGLCVRGGAEISLHWQDCELTKCIIKAK-----HKIQTDVVYKQKRI 723
Query: 785 KVNLSAGK 792
K++L AG+
Sbjct: 724 KISLEAGE 731
>gi|332662485|ref|YP_004445273.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332331299|gb|AEE48400.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 819
Score = 513 bits (1321), Expect = e-142, Method: Compositional matrix adjust.
Identities = 301/765 (39%), Positives = 435/765 (56%), Gaps = 48/765 (6%)
Query: 13 LKITFN-GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
LK+ +N + +A+PIGNGRLGAMV+G V ET++LNE T+W+G P NP A +
Sbjct: 24 LKLWYNQSSGTKWENALPIGNGRLGAMVYGNVDKETIQLNEHTVWSGSPNRNDNPAALDS 83
Query: 72 LSDVRSLVDSGQY--AEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L+++R L+ G++ AE A V + ++Q +G + L F H Y+ Y REL
Sbjct: 84 LAEIRKLIFEGKHKAAERLANRVIITKKSHGQMFQPVGSLHLSFP-GHENYSN--YYREL 140
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D+ A A+ Y+V V +TRE +S PD+VIV +++ S++GSLSF+ + S +
Sbjct: 141 DIEKAVAKTSYTVDGVTYTREALASFPDRVIVVRLTASKAGSLSFSANYSSPQRKKVFAT 200
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGS 247
+ + I + ++ KG ++F I IK+ D G++S+ D L V+G+
Sbjct: 201 TATKDLT--------ISGTTSDHEGVKGMVEFKGITRIKL--DGGSLSS-NDTSLTVKGA 249
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
+ A L + +++F+ N D D + L +Y+ + T H+ YQK F R
Sbjct: 250 NSATLFISIATNFN----NYKDVSGDEEKRAADYLNKAYPKAYATILTGHIAAYQKYFKR 305
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V + L +P +P ER+K+F + DP LV L +QFGRYLLISSS+P
Sbjct: 306 VKLDLGTTPAA------------NLPIDERLKNFSSSNDPHLVSLYYQFGRYLLISSSQP 353
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQGIWN L+P WDS +NIN EMNYW + NL+E PL + + LSI G
Sbjct: 354 GGQPANLQGIWNNRLNPPWDSKYTININTEMNYWPAERTNLAELHRPLLEMVKELSITGQ 413
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+TA+ Y GW+ HH TDIW + A G W +W GGAWL HLWEHY Y D+ +L
Sbjct: 414 ETARTMYGTRGWMAHHNTDIWRMNGAIDG-AFWGMWTAGGAWLTQHLWEHYLYNGDKTYL 472
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
YP L+G A F +D+LIE H Y L +P SPE+ A G + + +TMD
Sbjct: 473 AS-VYPALKGAALFYVDFLIE-HPQYKWLVVSPGNSPENAPKAHGG--SSLDAGTTMDNQ 528
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I+ +VFS+ I A++L K+ A V+ + + RL P I + + EW D P+ HHR
Sbjct: 529 IVYDVFSSTIRTAQLLGKDA-AFVDTLKQLRSRLAPMHIGQHNQLQEWLDDVDAPDDHHR 587
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFP + I+ + P+L A+ TL +RG+ GWS+ WK WA+L D HAY+
Sbjct: 588 HVSHLYGLFPSNQISPYRTPELFAASRNTLLQRGDVSTGWSMGWKVNWWAKLQDGNHAYK 647
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
+++ N + P GG Y+NLF AHPPFQID NFG T+ + EML+QS+ +++LP
Sbjct: 648 LIQ---NQLTPLGVNPDGGGTYNNLFDAHPPFQIDGNFGCTSGITEMLLQSSDAAVHVLP 704
Query: 726 ALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
ALP D W +G + GL+A GG E V + WKDG + ++ + S N
Sbjct: 705 ALP-DVWPNGSIGGLRAWGGFEVVDLQWKDGKVVKLVVKSTLGGN 748
>gi|336402504|ref|ZP_08583239.1| hypothetical protein HMPREF0127_00552 [Bacteroides sp. 1_1_30]
gi|335948353|gb|EGN10067.1| hypothetical protein HMPREF0127_00552 [Bacteroides sp. 1_1_30]
Length = 822
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 303/778 (38%), Positives = 449/778 (57%), Gaps = 59/778 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VEG+D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGVLSVEGADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDG 531
LWE Y YT D +FL + YP+L+ F + +++ H+ +L PS SPE+ +G
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKDPVHN-WLVVCPSNSPENVHSGSNG 522
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G +
Sbjct: 523 K-ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQ 580
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 581 EWMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKV 640
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
LWARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AE
Sbjct: 641 CLWARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAE 697
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ML+QS +YLLPALP W +G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 698 MLMQSYDGFIYLLPALP-TVWKAGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|315607320|ref|ZP_07882320.1| possible alpha-L-fucosidase [Prevotella buccae ATCC 33574]
gi|315251023|gb|EFU31012.1| possible alpha-L-fucosidase [Prevotella buccae ATCC 33574]
Length = 787
Score = 513 bits (1320), Expect = e-142, Method: Compositional matrix adjust.
Identities = 298/780 (38%), Positives = 435/780 (55%), Gaps = 53/780 (6%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M A S +P K+ + PA+ +TDA+P+GNGRLGAMV+G +E ++LNE+T+W G P
Sbjct: 16 MLASLFSQAHPYKLWYREPAQVWTDALPLGNGRLGAMVYGIPATERIQLNEETIWAGQPN 75
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEAT------AASVKLFGHPADVYQLLGDIELEFDDS 115
N A KA+ ++ L+ G+Y +A S +G P YQ G++ +
Sbjct: 76 GNANAKALKAIPVIQDLIWKGEYKKAQDLATSDVMSATNWGMP---YQAFGNVYISMPGM 132
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
Y REL L++A A +++ V + RE +S D V+ + + + G ++FN
Sbjct: 133 G---NYTNYYRELSLDSARAITRWTANGVTYRREVITSLADNVVTVRFTADQRGCITFNA 189
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSA-ILEIKISDDRGT 233
+ D+ I+++ + + ++ KG ++F + + G
Sbjct: 190 YFTTPHDD---------IMIKSEGDEATLFGVTSKHEGLKGKVRFMGRMAAVAKGKGEGA 240
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++ +D + V+G+D AVL + +++F+ N D D S L++ Y+
Sbjct: 241 VTHSKDGIVSVKGADEAVLYISIATNFN----NYKDISGDEAVRSEQILRAAMARDYAQS 296
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+ +++L HRV++ L E+ +P+ ER+ F +D LV
Sbjct: 297 KAEHISRFRQLMHRVTLNLG------------EDQYKDLPTDERIIRFADRDDNYLVATY 344
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P WDS NINLEMNYW + P L+E E
Sbjct: 345 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPAWDSKYTTNINLEMNYWPAEPTQLTELTE 404
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCT 472
PLF + +S G+KTA+ Y SGWV+HH TDIW + D + +W GGAWLC
Sbjct: 405 PLFRLIREVSETGAKTARTMYGKSGWVLHHNTDIWCVTGGIDHAQS--GMWMTGGAWLCR 462
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLWEHY YTMD+DFL +R YP+++G A FL LI E G+L +PS SPE+ + DG
Sbjct: 463 HLWEHYLYTMDKDFL-RRYYPVMKGAAEFLDQMLIPEPQHGWLVISPSVSPENSHPSKDG 521
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K+A +S +TMD+ ++ E+F +++A++VL ++ AL + L + P ++ + G +
Sbjct: 522 KVA-ISAGTTMDVQLVNELFREVMAASKVLGEDA-ALAAHYAERLKLMPPMQVGKWGQLQ 579
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+ DP HRH+SHL+GL+PG IT+ P L AA +L RG+ GWS+ WK
Sbjct: 580 EWMEDWDDPNDTHRHVSHLYGLYPGCQITLSGTPRLFDAARTSLIHRGDPSTGWSMGWKV 639
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHF----EGGLYSNLFAAHPPFQIDANFGFTA 707
LWARL D HAY++++ +L D + +GG Y NLF AHPPFQID NFG TA
Sbjct: 640 CLWARLFDGNHAYKLIRNQLSLTDDRFVAYGTDKKKGGTYRNLFDAHPPFQIDGNFGCTA 699
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGC-VKGLKARGG-ETVSICWKDGDLHEVGIYSN 765
+AEMLVQS + LLPALP D W +G VKGL ARG E + WKDG + + I SN
Sbjct: 700 GIAEMLVQSHEGYINLLPALP-DAWKAGGEVKGLMARGAFEIEHLAWKDGRVVRLAIRSN 758
>gi|408371866|ref|ZP_11169623.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
gi|407742715|gb|EKF54305.1| glycoside hydrolase [Galbibacter sp. ck-I2-15]
Length = 803
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 298/765 (38%), Positives = 436/765 (56%), Gaps = 64/765 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA+ + +IP+GNGRLGAM GGV E + LN+ TLW+G P D +P+A K L
Sbjct: 26 LKLWYKQPAELWEGSIPLGNGRLGAMPDGGVSQENIVLNDITLWSGGPQDADDPNAIKYL 85
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
++R L+ G+ ++A A K F G+ ADV YQ+LG++ + HL
Sbjct: 86 PEIRRLLFEGKNSQAEALMYKTFVSKGPGSGKGNGADVPYGSYQILGNLHFNY---HLPN 142
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+RELD+ ATA +SV VE+TRE+F+S D VIV K++ S++ +SF++ +D
Sbjct: 143 KAQDYKRELDITNATATTTFSVDGVEYTREYFTSFSDDVIVFKLTASKAAQISFDLGVDR 202
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+ + +++M+G+ N D G++++ L +++ + GT+ A +D
Sbjct: 203 P-ERFTTTTQGEELLMQGQL---------NNGTDGNGMKYA--LRVRVIPEGGTLKA-KD 249
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
L+V G++ AV+L+ A++ + P + + L Y+ L H+D
Sbjct: 250 GTLQVNGANSAVILISAATDYFVPNVE---------QWVETQLDKAEKKPYNTLKETHID 300
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELLFQFGR 358
Y+ +F R SI+L SE + +P+ ER+K F+ T +DP L EL FQ+GR
Sbjct: 301 FYKNMFDRASIELG-----------SETQAEALPTDERLKRFEITKDDPGLAELYFQYGR 349
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YL ISS+RPG NLQG+W + W+ H+NINL+MN+W NL +P +
Sbjct: 350 YLAISSTRPGLLPPNLQGLWANTVQTPWNGDYHLNINLQMNHWPIDVVNLPMLNQPYYKL 409
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L G KTA+ Y GWV H T+IW +S W G W+C LW HY
Sbjct: 410 IKGLVEPGEKTAKTYYGGDGWVAHVITNIWGYTSPGE-HPSWGSTNSGSGWMCQMLWRHY 468
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVS 537
+ D D+L K+ YP+L+G A F L+E D +L T PS SPE+ F +G+ A V+
Sbjct: 469 AFNQDMDYL-KKIYPILKGSAQFYNSTLVEHPDRDWLVTAPSNSPENAFFLTNGEKANVA 527
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWAQD 596
+ T+D IIR +F +I A+++L+ D K LK + +L P +IA++G +MEW +D
Sbjct: 528 IAPTIDNQIIRSLFQNVIEASQLLDV--DKQFRKQLKHRITKLPPNQIAKNGRLMEWIKD 585
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+K+PE HRH+SHL+GL+PG+ I++EK P+L +AA+KTL KRG+ GWS+ WK WAR
Sbjct: 586 YKEPEPTHRHVSHLWGLYPGNEISLEKTPELAQAAKKTLLKRGDISTGWSLAWKINFWAR 645
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFE----GGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L D EHAY++ L +L+ P E F GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 646 LADGEHAYKL---LGDLLKPSTETGFNMSDGGGTYPNLFCAHPPFQIDGNFGAAAGIAEM 702
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LVQS + LPALP W G +GL+ RGG V W+ G L
Sbjct: 703 LVQSHEGFINFLPALP-KVWKDGNFEGLRVRGGAEVGAAWERGKL 746
>gi|329925668|ref|ZP_08280486.1| hypothetical protein HMPREF9412_3835 [Paenibacillus sp. HGF5]
gi|328939695|gb|EGG36038.1| hypothetical protein HMPREF9412_3835 [Paenibacillus sp. HGF5]
Length = 767
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 315/797 (39%), Positives = 426/797 (53%), Gaps = 69/797 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
I F+ PA+++ +A+PIGNGRLG MV+G V E ++ NED++W G P D NPDA L
Sbjct: 9 IWFDQPAQNWNEALPIGNGRLGGMVFGSVMQEKIQFNEDSVWYGGPRDRNNPDALLHLPL 68
Query: 75 VRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R L+ G+ EA S F G P Y GD ++ D H + YRRELDL
Sbjct: 69 IRKLLFEGRLKEAHRLSETAFSGTPRSQRPYMTAGDFCIQVD--HPQGELSHYRRELDLE 126
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---YVN 188
A Y G V FTRE F S PDQV+V ++ G+L+ + H +
Sbjct: 127 KAITVTSYQYGGVTFTREVFCSYPDQVMVIRLEADRPGALTLTSRFERQKGKHMDAVHRA 186
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE-IKISDDRGTISALEDKKLKVEGS 247
G + ++M C GK G+ +SA + I + GT+ + + L V+ +
Sbjct: 187 GTDTVVMTNDCGGK------------DGLTYSAAAKAIAVG---GTVRVV-GEHLLVDQA 230
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D V++L A+S+F +D K +E L+ N Y+ L RH+ DYQ LF R
Sbjct: 231 DEVVIILAAASTFR------ADDSKLRCNE---LLEHAANQGYAALKKRHIADYQPLFDR 281
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSR 366
V + L ++ VP+ +R++ + D+D L L F FGRYLLI+ SR
Sbjct: 282 VKLDLG---------AAADREHHLVPTPKRLERVRAGDDDAGLYTLYFHFGRYLLIACSR 332
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG+ ANLQGIWN+ ++P WDS +NIN +MNYW + CNL EC EPLF+ + + NG
Sbjct: 333 PGSLPANLQGIWNDSMAPPWDSKFTININTQMNYWPAESCNLPECHEPLFELIERMKDNG 392
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TA+ Y G+V HH TDIWA ++ W MG AWL HLWEHY + + DF
Sbjct: 393 RVTARKMYGCRGFVAHHNTDIWADTAPQDIYPPATQWVMGAAWLTLHLWEHYKFNPNPDF 452
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L +RAY ++ A F D+L+E +GYL TNPS SPE+ ++ +G+ + Y +MD I
Sbjct: 453 L-RRAYETMKEAALFFTDFLVESPEGYLVTNPSVSPENRYMLRNGESGTLCYGPSMDTQI 511
Query: 547 IREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I E+FSA I A+ L+ +E A E +K RL K+ G + EW +D+++ + HR
Sbjct: 512 ISELFSACIEASLELDTDESARREWAAIKD--RLPEMKVGRHGQLQEWLEDYEEADPGHR 569
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEH 662
H+SHLFGL PG TI+ + PDL +AA TL++R G GWS W WARL D E
Sbjct: 570 HISHLFGLHPGTTISPDSTPDLAEAARVTLRRRLAHGGGHTGWSRAWIINFWARLLDGEQ 629
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY +K L NLF HPPFQID NFG A VAEML+QS L+ +
Sbjct: 630 AYVHLKELLR-----------QSTLPNLFDNHPPFQIDGNFGAAAGVAEMLIQSHLDHIR 678
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALP D W G VKGL+ARGG V I W+DG L E I S LH +
Sbjct: 679 LLPALP-DAWPQGRVKGLRARGGFEVDIDWRDGSLAEAMITSVSGQK-----LRLHAK-P 731
Query: 783 SVKVNLSAGKIYTFNRQ 799
SV+V S G+ R
Sbjct: 732 SVRVTTSDGREVPMERH 748
>gi|237720803|ref|ZP_04551284.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229449638|gb|EEO55429.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 822
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 303/778 (38%), Positives = 449/778 (57%), Gaps = 59/778 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VEG+D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGVLSVEGADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDG 531
LWE Y YT D +FL + YP+L+ F + +++ H+ +L PS SPE+ +G
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKDPVHN-WLVVCPSNSPENVHSGSNG 522
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G +
Sbjct: 523 K-ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQHLKEMAPMQVGHWGQLQ 580
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 581 EWMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKV 640
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
LWARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AE
Sbjct: 641 CLWARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAE 697
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ML+QS +YLLPALP W +G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 698 MLMQSYDGFIYLLPALP-TVWKAGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|319786653|ref|YP_004146128.1| alpha-L-fucosidase [Pseudoxanthomonas suwonensis 11-1]
gi|317465165|gb|ADV26897.1| Alpha-L-fucosidase [Pseudoxanthomonas suwonensis 11-1]
Length = 805
Score = 512 bits (1319), Expect = e-142, Method: Compositional matrix adjust.
Identities = 312/793 (39%), Positives = 437/793 (55%), Gaps = 61/793 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ + PL++ + PA + +A+P+GNGRLGAMVWGG SE L+LNEDTL+ G P D
Sbjct: 47 TAAPGRPLRLWYPRPATRWVEALPLGNGRLGAMVWGGGRSERLQLNEDTLYAGRPYDPVP 106
Query: 66 PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDD-SHLKYAE 121
A +AL +VR L+ +G++AEA A A + G P YQ LGD+ L+F + S L
Sbjct: 107 DGALEALPEVRRLLFAGRHAEAEALADATMMGAPRKQMPYQPLGDLCLDFVEVSDL---- 162
Query: 122 ETYRRELDLNTATARVKYSVG-NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+ YRRELDL+ A A + G +E TRE F S DQ + ++ S+ G + + LDS
Sbjct: 163 DDYRRELDLDRAVATTSFGSGWKLEHTREAFVSAEDQCLAVRLRTSQPGRVRVRIGLDSD 222
Query: 181 LDNHSYV-NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
V +G+ +++ GR +A G++F+A L +++ RG
Sbjct: 223 HAQAEVVPDGDAGLLLRGR--------NGDAFGIEGGLRFAARLGVQV---RGGTLRRRG 271
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+++VEG+D VLLL A++SF D DP + + + L++ S+ L H
Sbjct: 272 DRIEVEGADEVVLLLTAATSFR----RYDDIGGDPEATTRTQLEAAARRSWDALLAAHEA 327
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
+Q+LF RV+I L RS E + +P ERV F DP L L QFGRY
Sbjct: 328 AHQRLFRRVAIDLGRS----------AEEVAALPIDERVARFAEGHDPELAALYHQFGRY 377
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LL+ SSRPGTQ ANLQGIWN+ L+P W+S +NIN EMNYW + L EC EPL +
Sbjct: 378 LLVCSSRPGTQPANLQGIWNDLLAPPWESKYTININTEMNYWPAEANALPECVEPLERMV 437
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
L+ G+ A+ Y A GWV+HH TD+W +++ G W LWP+GGAWL HLW+ ++
Sbjct: 438 AELAQTGADVARRMYGAPGWVVHHNTDLWRQAAPIDG-AKWGLWPLGGAWLLQHLWDRWD 496
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSY 538
Y + +LEK +PL G A F L+E G + T PS SPE+E P G C
Sbjct: 497 YGREPGYLEK-VWPLFRGAAEFFAATLVEDPTTGAMVTAPSISPENEH--PHGAALCAGP 553
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF- 597
S MD I+R++F I A +L + D L ++ + RL P +I G + EW QD+
Sbjct: 554 S--MDAQILRDLFGQCIEIAGLLGVDAD-LAARLARLRERLPPHRIGRAGQLQEWQQDWD 610
Query: 598 -KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
PE+ HRH+SHL+ L P I + P+L AA ++L+ RG+E GW I W+ LWAR
Sbjct: 611 MDAPEMDHRHVSHLYALHPSSQINMRDTPELAAAARRSLEIRGDEATGWGIGWRLNLWAR 670
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY++ L L+ PE Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 671 LRDAGHAYKV---LGMLLSPERT-------YPNLFDAHPPFQIDGNFGGTAGITEMLLQS 720
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKT 776
++LLPALP W G V GL+ RG V++ W G L + +++ F+
Sbjct: 721 WGGTVFLLPALP-QAWPRGRVSGLRVRGAAEVALEWDAGRLRQARLHAWRGGR----FR- 774
Query: 777 LHYRGTSVKVNLS 789
L YR ++++ L
Sbjct: 775 LEYRDQALELALG 787
>gi|281421059|ref|ZP_06252058.1| putative large secreted protein [Prevotella copri DSM 18205]
gi|281404977|gb|EFB35657.1| putative large secreted protein [Prevotella copri DSM 18205]
Length = 790
Score = 512 bits (1318), Expect = e-142, Method: Compositional matrix adjust.
Identities = 296/809 (36%), Positives = 449/809 (55%), Gaps = 58/809 (7%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
+ A S PLK+ +N PA F +++PIGNG+LGA+++GG ++++ LN+ TLWTG P
Sbjct: 17 LQAVPKSNIPPLKLWYNKPATAFEESLPIGNGKLGALIYGGANNDSIYLNDITLWTGKPV 76
Query: 62 DYT-NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
+ DA K + +R + Y A + + + GH ++ YQ L I ++ D + +++
Sbjct: 77 NREEGGDAYKWIPKIREALFKEDYKAADSLQLHVQGHNSEYYQPLAIINIK-DANKGQFS 135
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y+REL L+ ATA + Y+ G +++ RE+F+S+PD++I ++ ++ +++ ++SL SL
Sbjct: 136 --NYKRELSLDNATAALSYTRGGIQYQREYFASHPDKMIAIHLTATQKKAINCDISLTSL 193
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ H N Q+ + G GK I F +IL IK D GTI+A D
Sbjct: 194 IP-HQVKASNKQLTITGHAMGK----------PENSIHFCSILSIKNQD--GTITA-SDS 239
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR-------NLSYSDL 293
L ++G AV+ LV +S++G K P E ++ + N +Y +L
Sbjct: 240 ILHLQGVSEAVIYLVNETSYNG-------FDKHPVKEGAPYIEKVNDNAWHLVNYTYPEL 292
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
RH+ DYQ +F+R L + D T ++ D E ++P L L
Sbjct: 293 KQRHITDYQNIFNRAKFALKGAKFD-NKRTTDQQLFDYTEKEE--------QNPYLEMLY 343
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQ+GRYLLIS SR ANLQG+W W +NINLE NYW + N+SE
Sbjct: 344 FQYGRYLLISCSRTPGIPANLQGLWAPARKSPWRGNYTININLEENYWPAEVTNMSELVM 403
Query: 414 PLFDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAW 469
P+ + +S+ G TA+ Y + +GW H TD WA ++ + W+ W MGGAW
Sbjct: 404 PVDGLVKAMSVTGKYTAKHYYGIENGWCGGHNTDAWAMTNPVGTKKESPKWSNWNMGGAW 463
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFI 527
L LW+HY+YT D+++L + AYPL++G A F+LDW+IE G L T P TSPE E+I
Sbjct: 464 LVQTLWDHYDYTRDKEYLRQTAYPLMKGAADFMLDWIIENPKKPGELLTAPCTSPEAEYI 523
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
G C Y T D+ I+RE+F + A++L+ ++ A K+ ++ RL P +I +
Sbjct: 524 TDKGYQGCSFYGGTADLTILRELFKNTLKGAQILDIDQ-AYQAKLQDAINRLHPYQIGKR 582
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
G++ EW D+ D + HHRH SHL GL P + I+++K PDL AA KTL+ +G+ GWS
Sbjct: 583 GNLQEWYYDWDDQDWHHRHQSHLLGLHPFYQISLDKTPDLAAAAAKTLEIKGDFSTGWST 642
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEH----EKHFEGGLYSNLFAAHPPFQIDANF 703
W+ +LWARLH + +Y M+++L N V P + + GG Y NLF AHPPFQID NF
Sbjct: 643 GWRISLWARLHRADKSYSMIRKLLNYVHPGNYNNPKNRPSGGTYPNLFDAHPPFQIDGNF 702
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
G TA V EML+Q ++LLPALP +W +G +KG+KARG +++ W +G + + I
Sbjct: 703 GGTAGVCEMLMQCDGETMHLLPALP-KEWPAGEIKGIKARGNYEINLVWNNGKVSKASIT 761
Query: 764 SNYSNNDHDSFKTLHYRGTSVKVNLSAGK 792
S + N T+ Y G +N AG+
Sbjct: 762 SKNAGN-----LTVKYNGKQKALNFKAGE 785
>gi|240144516|ref|ZP_04743117.1| alpha-L-fucosidase 2 [Roseburia intestinalis L1-82]
gi|257203465|gb|EEV01750.1| alpha-L-fucosidase 2 [Roseburia intestinalis L1-82]
Length = 741
Score = 511 bits (1317), Expect = e-142, Method: Compositional matrix adjust.
Identities = 306/794 (38%), Positives = 439/794 (55%), Gaps = 82/794 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PAK + +A+P+GNGR+GAM++GGV E +++NE+++W G P D NPDA L ++R
Sbjct: 6 YKEPAKVWEEALPLGNGRIGAMIFGGVEQERIQVNEESIWYGGPVDRNNPDAKAHLEEIR 65
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIEL---EFDDSHLKYAEETYRRELDL 130
+ G+ EA ++ + G P + YQ LGDI + +D E Y+R L+L
Sbjct: 66 QHIFEGRLKEAQRLMNLTMSGCPDSMHPYQTLGDINIYSSGIEDV------ENYKRSLNL 119
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A V++ +V F RE F S P +V + + +S +SF +L Y +G
Sbjct: 120 EEAVCLVEFDSRSVHFKREMFLSYPKDCLVIRFTADKSSQISFQANLS----RGRYFDGI 175
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N++ G C N G F ++ IK G SA+ L V+G+D
Sbjct: 176 NKLGENGIC--------LYGNLGRGGSDF--VMGIKAWAKGGVASAV-GGNLCVQGADEV 224
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN----LSYSDLYTRHLDDYQKLFH 306
+L A+SSF K E + ++ N L+Y +L+ H +DY+ LF
Sbjct: 225 LLTFCAASSF---------RNKKKCDELLREIEEKMNNAAMLTYEELFEEHKEDYRTLFA 275
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSS 365
RV QL E D +P+ ER+ ++ + D L ++LF +GRYLLIS S
Sbjct: 276 RVEFQLD-----------GVEKFDVIPTNERIERAAKETPDIGLSKMLFDYGRYLLISCS 324
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG A LQGIWN+D +P W+S +NIN EMNYW + CNLSEC PLFD L + N
Sbjct: 325 RPGGLPATLQGIWNQDFTPPWESKYTININTEMNYWLAESCNLSECHMPLFDLLERMVEN 384
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+ Y G+V HH TDI ++ W MG AWLCTHLW HY YT+DR+
Sbjct: 385 GRRTAEKMYGCRGFVAHHNTDIHGDTAPQDTWYPATYWVMGAAWLCTHLWTHYEYTLDRE 444
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FLE R+YP++ A F +D+L+E DGYL T PS SPE+ + P+G++ VSY +TMD
Sbjct: 445 FLE-RSYPIMCEAALFFIDFLVE-KDGYLVTCPSLSPENTYCLPNGEMGAVSYGATMDNQ 502
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I+R++FS ++A ++L+ A +EK L +L PT+I DG IMEW +++++ E HR
Sbjct: 503 ILRDLFSQCLAAGKILQATNSAFLEKAEYVLQKLLPTRIGSDGRIMEWMEEYEECEPGHR 562
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEH 662
H+SHL+GL P IT++ P L +AA KTL+ R + G GWS W +A+L D E
Sbjct: 563 HISHLYGLHPSEQITVDNTPKLAEAARKTLETRLKNGGGHTGWSRAWIINHYAKLWDGEI 622
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + E+ +Y NLF HPPFQID NFG TAA+AEMLVQST +
Sbjct: 623 AYHNI-----------EQMLASSIYPNLFDRHPPFQIDGNFGVTAAIAEMLVQSTAERII 671
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH---- 778
LLPALP W++G VKGL+ +G +S+ W++ L E I+ +++ LH
Sbjct: 672 LLPALP-VAWTTGSVKGLRIKGNAEISLKWEEHKLTECTIH---------AYEKLHTRII 721
Query: 779 YRGTSVKVNLSAGK 792
YR ++K+ L G+
Sbjct: 722 YRNKTMKIILEKGE 735
>gi|423213429|ref|ZP_17199958.1| hypothetical protein HMPREF1074_01490 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693889|gb|EIY87119.1| hypothetical protein HMPREF1074_01490 [Bacteroides xylanisolvens
CL03T12C04]
Length = 822
Score = 511 bits (1315), Expect = e-142, Method: Compositional matrix adjust.
Identities = 302/777 (38%), Positives = 445/777 (57%), Gaps = 57/777 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VEG+D A + + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGVLSVEGADEATVYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 582 WMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVC 641
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 642 LWARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEM 698
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
L+QS +YLLPALP W G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 699 LMQSYDGFIYLLPALP-TVWKEGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|294648173|ref|ZP_06725715.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
gi|292636492|gb|EFF54968.1| putative lipoprotein [Bacteroides ovatus SD CC 2a]
Length = 822
Score = 510 bits (1314), Expect = e-141, Method: Compositional matrix adjust.
Identities = 302/777 (38%), Positives = 445/777 (57%), Gaps = 57/777 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 582 WMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVC 641
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 642 LWARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEM 698
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
L+QS +YLLPALP W G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 699 LMQSYDGFIYLLPALP-TVWKEGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|423289667|ref|ZP_17268517.1| hypothetical protein HMPREF1069_03560 [Bacteroides ovatus
CL02T12C04]
gi|423298161|ref|ZP_17276220.1| hypothetical protein HMPREF1070_04885 [Bacteroides ovatus
CL03T12C18]
gi|392663702|gb|EIY57249.1| hypothetical protein HMPREF1070_04885 [Bacteroides ovatus
CL03T12C18]
gi|392667378|gb|EIY60888.1| hypothetical protein HMPREF1069_03560 [Bacteroides ovatus
CL02T12C04]
Length = 810
Score = 510 bits (1313), Expect = e-141, Method: Compositional matrix adjust.
Identities = 309/774 (39%), Positives = 437/774 (56%), Gaps = 59/774 (7%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
T LK+ ++ PA+ + +A+P+GN RLGAM++G E ++LNE+T+W G P NP
Sbjct: 16 TVRAEELKLWYSHPAEEWVEALPLGNSRLGAMIYGNPFEEEIQLNEETVWGGSPYRNDNP 75
Query: 67 DAPKALSDVRSLVDSGQYAEATA-------ASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
+A LS+VR L+ +G+ E TA A K G P YQ +G ++L F H KY
Sbjct: 76 EAYGVLSEVRKLIFAGR--EITAEKLWKEHAFTKQNGMP---YQTVGSLKLHFP-GHEKY 129
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y R+L++ A A V Y VG+V +TR F+S D ++ + S++F S +
Sbjct: 130 TD--YYRDLNIENAVATVSYKVGDVTYTRTLFTSLADNALIIHLEADRPHSIAFEASYST 187
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD-PKGIQFSAILEIKISDDRGTISALE 238
+ + + N++ + KA+A+++ P I+ + IK S G + + +
Sbjct: 188 PFEESAVIASKNRLTLSA---------KASAHEEVPAAIRLESQARIKTSG--GKVES-D 235
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ KL V +D + + A+++F +N D + + L + SY L H+
Sbjct: 236 NGKLIVTEADVVTIYVSAATNF----VNYQDVSANESKRVDVILNQVGKKSYRQLLDSHI 291
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
YQ+ F RV + L S S++ R+K F+ +DP+LV L+FQFGR
Sbjct: 292 GKYQQQFGRVKLDLGHS-------LASQKETPV-----RLKEFREGKDPALVTLMFQFGR 339
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSS+PG Q ANLQGIWN+ L WD +NIN EMNYW + NL E EPLF
Sbjct: 340 YLLISSSQPGGQPANLQGIWNQHLLAPWDGKYTININTEMNYWPAEITNLPETHEPLFRL 399
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ G KTAQ Y +GWV HH TDIW + G + WP GGAWL HLW+HY
Sbjct: 400 VNELAETGKKTAQTMYHCNGWVAHHNTDIWRATGPVDGP-FYGTWPNGGAWLSQHLWQHY 458
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACV 536
YT D+DFL K YP+L+G A F +D+L+E H Y L T PS SPE AP GK +
Sbjct: 459 LYTGDKDFLIKN-YPVLKGAADFYMDFLVE-HPQYHWLVTIPSISPEQG--AP-GKETSL 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWAQ 595
+ TMD I+ +V S + AA+++ ED + + +V K L RL P +I + + EW +
Sbjct: 514 TAGCTMDNQIVFDVLSNTLQAAKIV--GEDIVYQDRVKKVLDRLPPMQIGKYNQLQEWLE 571
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D DP+ HRH+SHL+GL+P + I+ +P L +AA+++L RG+ GWSI WK LWA
Sbjct: 572 DVDDPQSDHRHVSHLYGLYPSNQISPYAHPGLFQAAKRSLLYRGDMATGWSIGWKINLWA 631
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D +HAY+++ + NLV+ E + +G Y NLF AHPPFQID NFGFTA VAEML+Q
Sbjct: 632 RLLDGDHAYKIIGNMLNLVE---EGNPDGRTYPNLFDAHPPFQIDGNFGFTAGVAEMLLQ 688
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
S N L+LLPALP W G + GL ARG V + W+ G+L I S N
Sbjct: 689 SHDNALHLLPALP-TAWQKGHISGLVARGAFEVDMSWEGGELLAATILSRIGGN 741
>gi|295084327|emb|CBK65850.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 822
Score = 510 bits (1313), Expect = e-141, Method: Compositional matrix adjust.
Identities = 302/777 (38%), Positives = 446/777 (57%), Gaps = 57/777 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 582 WMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVC 641
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 642 LWARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEM 698
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
L+QS +YLLPALP W+ G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 699 LMQSYDGFIYLLPALP-TVWTEGSIKGIIARGGFELDLSWKNGRVSRLVVKSHKGGN 754
>gi|298481330|ref|ZP_06999523.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
gi|298272534|gb|EFI14102.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
Length = 822
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 302/777 (38%), Positives = 445/777 (57%), Gaps = 57/777 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 582 WMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVC 641
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 642 LWARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEM 698
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
L+QS +YLLPALP W G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 699 LMQSYDGFIYLLPALP-TVWKEGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|383113013|ref|ZP_09933793.1| hypothetical protein BSGG_0144 [Bacteroides sp. D2]
gi|382948895|gb|EFS29444.2| hypothetical protein BSGG_0144 [Bacteroides sp. D2]
Length = 822
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 299/777 (38%), Positives = 445/777 (57%), Gaps = 57/777 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E+ + K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 23 ETNVSAQEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F SH +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-SHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARVIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D A++ + +++F+ N D + + + L+ + +
Sbjct: 242 EIACADGILSVEKADEAIVYVSIATNFN----NYQDITGNQIERAKNYLEKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L + + VP+ +RV++F+ D LV
Sbjct: 298 KKNHIDFYRQYLTRVSLDLGK------------DQYSNVPTDKRVENFKNTNDAHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA+V Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKVMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDIEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD ++ ++++ IISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLVFDLWTTIISASQILDTDQE-FATHLAQRLKEMAPMQVGHWGQLQE 581
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 582 WMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVC 641
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 642 LWARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEM 698
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
L+QS +YLLPALP W G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 699 LMQSYDGFIYLLPALP-TVWQEGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|261406479|ref|YP_003242720.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261282942|gb|ACX64913.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 783
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 294/766 (38%), Positives = 426/766 (55%), Gaps = 53/766 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ PA+ +T+A P+GNGRLGAMV+GGV +E + LNED++W G P + NP+A + L
Sbjct: 7 KLVERRPAQVWTEAFPVGNGRLGAMVFGGVSTERIGLNEDSVWYGGPKQHDNPEAIEKLD 66
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRRELDL 130
D+RSL+ G+ EA ++ F + YQ LGD+ L+F + YRREL+L
Sbjct: 67 DIRSLLRCGELREAEQLALTHFTNAPPYFGPYQPLGDLLLQFKSGTSEVNH--YRRELNL 124
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVNG 189
T A V + + + RE F+S QV+V +IS SE ++ + L D +
Sbjct: 125 RTGVASVSWEENGILYEREVFASAVHQVLVIRISSSEPAAIHLSARLSRRPFDGNIKREN 184
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ MEG C P G+ ++ +L+ + G L ++ +D
Sbjct: 185 ERTLAMEGIC-------------GPDGVTYATVLQ---AHTIGGKCHTVGNYLDIQSADA 228
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
LLL A +SF DP E++ +S L Y+ L H+ D+ L RVS
Sbjct: 229 VTLLLAAQTSF---------RCDDPYREALRQAESAVLLPYASLLEEHITDHCALLERVS 279
Query: 310 IQL-----SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLIS 363
+++ S +P + + +E P++ER++ + Q DP L L +Q+GRYL+++
Sbjct: 280 LEIEAADTSIAPVSEESASEAEAVAVDRPTSERLQLYRQGGNDPGLEALFYQYGRYLMMA 339
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG+ ANLQGIWNE +P W+S H+NINL+MNYW + NL EC EPLFDF+ L
Sbjct: 340 SSRPGSLPANLQGIWNESFTPPWESDYHLNINLQMNYWIAETGNLPECHEPLFDFIDRLV 399
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
ING KTA Y A G+ H +++WA+S WPMGGAWL HLWEHY Y +
Sbjct: 400 INGRKTAASLYGARGFTAHASSNLWAESGLFGAWTPAIFWPMGGAWLALHLWEHYRYNLS 459
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
FL +RAYP+L+ + F LD+L+ +G L T+PS SPE+ +I G++ +S +MD
Sbjct: 460 ESFLSERAYPVLKEASLFFLDFLVFDENGSLVTSPSLSPENSYINEKGQIGSLSSGPSMD 519
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
+I + +A I AAE+L +++ + + + +L +I G +MEWA D+++ E
Sbjct: 520 SQMIYALLTACIEAAEILGLDKE-WSRQWMDTRAKLPQPQIGRYGQVMEWAVDYEEFEPG 578
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQ 660
HRH+SHLF L PG I + P+L KA+ TL++R + G GWS W W RL +
Sbjct: 579 HRHISHLFALHPGEQIIPHRMPELGKASRVTLERRLKYGGGHTGWSQAWIANFWTRLGEG 638
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
E A+ ++ L ++ NLF HPPFQIDANFG AA+ EML+QS +
Sbjct: 639 EKAHDSLREL-----------LAKAVHPNLFGDHPPFQIDANFGGAAAIQEMLLQSHGGE 687
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
+ LLPALP W+SG VKGL+ARGG TV+I WK+G L IYS +
Sbjct: 688 IRLLPALP-SSWASGSVKGLRARGGYTVNIWWKEGKLEAAEIYSGH 732
>gi|262408009|ref|ZP_06084557.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|345511517|ref|ZP_08791057.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|229444055|gb|EEO49846.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|262354817|gb|EEZ03909.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
Length = 822
Score = 509 bits (1312), Expect = e-141, Method: Compositional matrix adjust.
Identities = 302/778 (38%), Positives = 448/778 (57%), Gaps = 59/778 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDG 531
LWE Y YT D +FL + YP+L+ F + +++ H+ +L PS SPE+ +G
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKDPVHN-WLVVCPSNSPENVHSGSNG 522
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G +
Sbjct: 523 K-ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQ 580
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 581 EWMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKV 640
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
LWARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AE
Sbjct: 641 CLWARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAE 697
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ML+QS +YLLPALP W +G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 698 MLMQSYDGFIYLLPALP-TVWKAGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|320107748|ref|YP_004183338.1| alpha-L-fucosidase [Terriglobus saanensis SP1PR4]
gi|319926269|gb|ADV83344.1| Alpha-L-fucosidase [Terriglobus saanensis SP1PR4]
Length = 814
Score = 509 bits (1310), Expect = e-141, Method: Compositional matrix adjust.
Identities = 298/762 (39%), Positives = 423/762 (55%), Gaps = 53/762 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + PA + DA+P+GNGRLGAMV+G E + LNEDTLW G P D TNPDA L
Sbjct: 35 LTLWMETPAAQWADALPLGNGRLGAMVFGEPLKERIALNEDTLWAGQPRDTTNPDAKNHL 94
Query: 73 SDVRSLV-DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY-RRELDL 130
VR LV + Y A K+ G ++ LGD+ +E HL E T+ +R LDL
Sbjct: 95 PIVRKLVLEDKNYVAADKECQKMQGPENFAFEPLGDLHIE----HLGLTEATHLKRSLDL 150
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+TA A+ + V F+RE F S PDQV+ +I+ S+ SL+ +SL + + + +
Sbjct: 151 DTAVAKTSFQSSGVTFSREVFVSFPDQVVALRITASKPSSLNLRLSLTCEMPAKTSAHAD 210
Query: 191 NQIIMEGRCPGKRIPPKANA----NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+++ G+ P + P +++ D +G++F+A+L K + GT+ E L +
Sbjct: 211 GTLLLAGKVPTENNPQISDSIRYSEVDGEGMRFAAVLSAKA--EGGTVQP-EGDTLAISK 267
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ LLL A++ F G F P D+ E + ++ +Y+ L T+H+ D++ LF
Sbjct: 268 ATSVTLLLTAATGFRG-FAFPPDTPAAALEEKCRKGLAGKS-AYAVLKTKHVADHRALFR 325
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV L+ + D +P+ R+K+F T +DP+L+ L FQ+GRYLLI+SSR
Sbjct: 326 RVGANLNSTVPDGAN----------LPTDARLKNFPTTQDPALLALYFQYGRYLLIASSR 375
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+ + P W S NIN++MNYW NL+E PL D +++ G
Sbjct: 376 PGTQPANLQGIWNDLVRPPWSSNWTANINIQMNYWPVFTANLAELNGPLVDLTQDMTVTG 435
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+KTA VNY A GW HH D+W ++S G WA + M G WLC HL+EH+ +T D
Sbjct: 436 AKTASVNYGARGWCSHHNIDLWRQASPVGMGSGDPTWANFAMSGPWLCQHLYEHFQFTGD 495
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
D+L KR YP+L A F LDWL+ DG L T PS S E+ F P + A VS T+D
Sbjct: 496 VDYLRKRVYPILRSSALFCLDWLVPAGDGTLTTCPSFSTENNFFTPQHQKAVVSAGCTLD 555
Query: 544 MAIIREVFSAIISAAEVLEKNED-ALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
+A+I E+F ISA++VL NED A +K+ +L +L P K+ G + EW+++F++
Sbjct: 556 LALIHELFGNCISASQVL--NEDQAFADKLKAALAKLPPYKVGSAGELQEWSENFEEATP 613
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHD 659
RH+SHL+ L+PG T P A+ ++L++R E G GWS W LWARL D
Sbjct: 614 GQRHMSHLYPLYPGAQFT-RDTPKWMAASRRSLERRLENGGAYTGWSRAWAIGLWARLGD 672
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP------FQIDANFGFTAAVAEML 713
+ A+ + L +H G +NLF +HP FQID NFG TAA+ EML
Sbjct: 673 GDKAWESLGMLM--------QHSTG---NNLFDSHPAGPNRSIFQIDGNFGATAAMIEML 721
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
+QS + L PALP W SG GL+ARGG + W G
Sbjct: 722 LQSHAGKIILFPALP-KAWPSGNFTGLRARGGLQCDLIWTGG 762
>gi|330996330|ref|ZP_08320214.1| hypothetical protein HMPREF9442_01297 [Paraprevotella xylaniphila
YIT 11841]
gi|329573380|gb|EGG54991.1| hypothetical protein HMPREF9442_01297 [Paraprevotella xylaniphila
YIT 11841]
Length = 809
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 298/762 (39%), Positives = 423/762 (55%), Gaps = 58/762 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK +T+A+P+GN +LGAMV+GG E L+LNE+T W G P D NP+A L
Sbjct: 22 LKLWYGKPAKDWTEALPVGNSKLGAMVYGGTGREELQLNEETFWAGGPYDNNNPNALYVL 81
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR+L+ G+ EA F D Y +G + L+F H K + + R+LD+
Sbjct: 82 PVVRNLIFQGKTREAQRLVDANFFTRKDGMSYLTMGSLFLDFP-GHDKATD--FYRDLDI 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
ATA +Y V V + R F+S D VIV ++ ++G+L+F V D+ L + +G+
Sbjct: 139 GNATATTRYKVDGVAYARTVFASFTDSVIVVRLQADKAGALAFTVGYDAPLKHEVSADGD 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
++ C GK D +G++ + E ++ + + KKL+V G+ A
Sbjct: 199 ---MLSIACEGK----------DQEGVKAALCAECRVKVVSDGKTTADGKKLEVVGATKA 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L L A++++ ++ D D + + LQ + Y +H+ Y+ LF RV +
Sbjct: 246 TLYLSAATNY----VDYHDVSGDAAARADRCLQRAVQIPYKKALEKHVAYYRNLFGRVEL 301
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L T+ + E + R++ F DPSL LLFQ+GRYLLISSS+PG Q
Sbjct: 302 DLGE------TEAAARE------TPLRIRDFSQGGDPSLAALLFQYGRYLLISSSQPGGQ 349
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN + WDS +NIN EMNYW + NLSE +PLF L LS+ G+KTA
Sbjct: 350 PANLQGIWNRSTNAPWDSKYTININTEMNYWLAEVANLSEMHQPLFSMLEDLSVTGAKTA 409
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ Y GWV HH TD+W S G V +A +WP GGAWL HLW+HY +T D+ FL
Sbjct: 410 RDMYNCGGWVAHHNTDLWRIS----GVVDFAAAGMWPSGGAWLAQHLWQHYLFTADKKFL 465
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
K YP+L+G A F LD+L E H Y PS SPEH V+ TMD
Sbjct: 466 -KAYYPVLKGTARFFLDFLTE-HPSYKWWVVAPSVSPEH---------GPVTAGCTMDNQ 514
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I+ + + A+E++ ++ A + + + L RL P ++ G + EW QD DP+ HR
Sbjct: 515 IVFDALYNTLQASEIV-GDDAAFRDSLAQMLDRLPPMQVGRHGQLQEWLQDVDDPKDEHR 573
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GL+P + ++ +P L +AA TL++RG++ GWSI WK WAR+ D HAYR
Sbjct: 574 HISHLYGLYPSNQVSPFSHPGLFRAARTTLEQRGDKATGWSIGWKINFWARMLDGNHAYR 633
Query: 666 MVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
++ + L+ D ++ EG Y N+F AHPPFQID NFG A +AEML+QS ++L
Sbjct: 634 LISNMLQLLPSDAVAGEYPEGRTYPNMFDAHPPFQIDGNFGAAAGIAEMLLQSHDGAVHL 693
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LPALP D W G VKGL+ARGG V + W DG L + S
Sbjct: 694 LPALP-DVWREGRVKGLRARGGYEVDMEWADGRLSSATVRST 734
>gi|383114822|ref|ZP_09935584.1| hypothetical protein BSGG_1004 [Bacteroides sp. D2]
gi|313693469|gb|EFS30304.1| hypothetical protein BSGG_1004 [Bacteroides sp. D2]
Length = 812
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 293/796 (36%), Positives = 438/796 (55%), Gaps = 55/796 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PAK + +A+P+GNGRLGAM++G E ++ NE+TL++G P + + L
Sbjct: 24 LTLWYKSPAKVWEEALPVGNGRLGAMIFGEPQKERIQFNENTLYSGEPETPKDINVASDL 83
Query: 73 SDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R L++ G+ EA K G + YQ GD+ +EF K A Y LD+N
Sbjct: 84 GHIRQLLNEGKNTEAGNIIQQKWIGRLNEAYQPFGDLYIEFAS---KGAITDYIHSLDMN 140
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
+ Y + RE F+S P Q I+ +S S+ L+F L+S H ++
Sbjct: 141 NSIVTTSYKQNGIAIRREVFASYPAQAIIIHLSASKP-VLNFTAHLES---PHPVTQDSD 196
Query: 192 Q--IIMEGRCPG---------------KRIPPKANANDDPKGIQFSAILEIKISDDRGT- 233
I ++G+ P +R+ P+ + IQ ++ +GT
Sbjct: 197 SQAIYLKGQAPAHAQRRDIEHMKRFNTQRLHPEY-FDQTGHVIQKKQVIYGNELGGKGTF 255
Query: 234 -----ISALEDKKLKVEGSDW-------AVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
+S+ +D KL +E + + L+L A++S++G +PS K+P E +
Sbjct: 256 FEACLLSSHKDGKLVIENNQFIAQDCSEVTLVLYAATSYNGLHKSPSKEGKNPHQEINNY 315
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
+ SY L H+ DYQ LF RVS L + + + P+ +R+K F
Sbjct: 316 RKISEKHSYKKLKEEHITDYQSLFKRVSFNLH-----------TNKQLKKTPTDQRLKLF 364
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+ ED +++ LFQFGRYL+I+ SR Q NLQG+WN ++ P W+S +NINLEMNYW
Sbjct: 365 KKKEDQTIITQLFQFGRYLMIAGSRGEGQPLNLQGLWNNEVLPPWNSGYTLNINLEMNYW 424
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+ NLSEC +PLF + ++ G A+ Y +GW IHH IW ++ G V W
Sbjct: 425 PAEVTNLSECHQPLFKLIEEIADKGKNLARDMYGLNGWAIHHNISIWREAYPSDGFVYWF 484
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
W M G WLC H+WEHY YT D DFL K+ YP+L+G A+F +WL+E +G L T STS
Sbjct: 485 FWNMSGPWLCNHIWEHYLYTKDIDFL-KKYYPILKGSATFCSEWLVENSEGELVTPVSTS 543
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PE+ ++ PDG A V STMD+AIIR +FS I+A++VL+ + ++ + + +L+
Sbjct: 544 PENAYLMPDGISASVCEGSTMDIAIIRSLFSNTINASKVLQ-TDSLFCAELTQKVNKLKK 602
Query: 582 TKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
+I G ++EW +++ + E HRH+SHLFGL+PG IT + P+L AA K+L RG +
Sbjct: 603 YQIGSKGQLLEWDKEYMENEPQHRHVSHLFGLYPGCDIT-DYTPELFDAARKSLNARGNK 661
Query: 642 GPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDA 701
GWS+ WK +LW+RL++ AY + L N VD + + +GGLY NL A PFQID
Sbjct: 662 TTGWSMAWKISLWSRLYNSLKAYEALSNLINYVDSDTKAENQGGLYRNLLNA-LPFQIDG 720
Query: 702 NFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVG 761
NFG TA +AEML+QS +++LLPALP W G +KGLKARGG TV + W+ G +
Sbjct: 721 NFGATAGIAEMLLQSHKGNIHLLPALP-PTWEKGNIKGLKARGGFTVDMEWEKGKITVAY 779
Query: 762 IYSNYSNNDHDSFKTL 777
+ S Y + ++K +
Sbjct: 780 VTSPYEQTTNITYKDM 795
>gi|402306106|ref|ZP_10825157.1| hypothetical protein HMPREF1146_1457 [Prevotella sp. MSX73]
gi|400379873|gb|EJP32702.1| hypothetical protein HMPREF1146_1457 [Prevotella sp. MSX73]
Length = 785
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 297/780 (38%), Positives = 434/780 (55%), Gaps = 53/780 (6%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M A S +P K+ + PA+ +TDA+P+GNGRLGAMV+G +E ++LNE+T+W G P
Sbjct: 14 MLASLFSQAHPYKLWYREPAQVWTDALPLGNGRLGAMVYGIPATERIQLNEETIWAGQPN 73
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEAT------AASVKLFGHPADVYQLLGDIELEFDDS 115
N A KA+ ++ L+ G+Y +A S +G P YQ G++ +
Sbjct: 74 GNANAKALKAIPVIQDLIWKGEYKKAQDLATSDVMSATNWGMP---YQAFGNVYISMPGM 130
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
Y REL L++A A +++ V + RE +S D V+ + + + G ++FN
Sbjct: 131 G---NYTNYYRELSLDSARAITRWTADGVTYRREVITSLADNVVTVRFTADQRGCITFNA 187
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSA-ILEIKISDDRGT 233
+ D+ II++ + + ++ KG ++F + + G
Sbjct: 188 YFTTPHDD---------IIIKSEGDEATLFGVTSKHEGLKGKVRFMGRMAAVAKGKGEGA 238
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++ +D + V+G+D AVL + +++F+ N D D S L++ Y+
Sbjct: 239 VTHSKDGIVSVKGADEAVLYISIATNFN----NYKDISGDEAVRSEQILRAAMARDYAQS 294
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+ +++L HRV++ L E+ +P+ ER+ F +D LV
Sbjct: 295 KAEHISRFRQLMHRVTLNLG------------EDQYKDLPTDERIIRFAAHDDNYLVATY 342
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P WDS NINLEMNYW + L+E E
Sbjct: 343 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPAWDSKYTTNINLEMNYWPAELTQLTELNE 402
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCT 472
PLF + +S G++TA+ Y SGWV+HH TDIW + D + +W GGAWLC
Sbjct: 403 PLFRLIREVSETGAETARTMYGKSGWVLHHNTDIWRVTGGIDHAQS--GMWMTGGAWLCR 460
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLWEHY YTMD+DFL +R YP+++G A FL LI E G+L +PS SPE+ + DG
Sbjct: 461 HLWEHYLYTMDKDFL-RRYYPVMKGAAEFLDQMLIPEPQHGWLVISPSVSPENSHPSKDG 519
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K+A +S +TMD+ ++ E+F +++A++VL ++ AL + L + P ++ + G +
Sbjct: 520 KVA-ISAGTTMDVQLVNELFREVMAASKVLGEDA-ALAAHYAERLKLMPPMQVGKWGQLQ 577
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+ DP HRH+SHL+GL+PG IT+ P L AA +L RG+ GWS+ WK
Sbjct: 578 EWMEDWDDPNDTHRHVSHLYGLYPGCQITLSGTPRLFDAARISLIHRGDPSTGWSMGWKV 637
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHF----EGGLYSNLFAAHPPFQIDANFGFTA 707
LWARL D HAY++++ +L D + +GG Y NLF AHPPFQID NFG TA
Sbjct: 638 CLWARLFDGNHAYKLIRNQLSLTDDRFVAYGTDKKKGGTYRNLFDAHPPFQIDGNFGCTA 697
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGC-VKGLKARGG-ETVSICWKDGDLHEVGIYSN 765
+AEMLVQS + LLPALP D W +G VKGL ARG E + WKDG + + I SN
Sbjct: 698 GIAEMLVQSHEGYINLLPALP-DAWKAGGEVKGLMARGAFEIEHLAWKDGRVVRLAIRSN 756
>gi|329930748|ref|ZP_08284172.1| Alpha-L-fucosidase 2 family protein [Paenibacillus sp. HGF5]
gi|328934680|gb|EGG31180.1| Alpha-L-fucosidase 2 family protein [Paenibacillus sp. HGF5]
Length = 673
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 285/706 (40%), Positives = 398/706 (56%), Gaps = 65/706 (9%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + +A+PIGNGRLGAM++GG+ E L+LNED++W G P D N DA L +R LV
Sbjct: 21 PATDWNEALPIGNGRLGAMIFGGIAEEKLQLNEDSVWYGGPRDRNNEDALPHLPVIRELV 80
Query: 80 DSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATAR 136
+G+ EA A A + + G P Y LGD+ + FD + + Y RELDL +R
Sbjct: 81 MNGRLHEAEALAGMAMAGLPESQRHYLPLGDLLISFDRHEMA---KDYERELDLEHGVSR 137
Query: 137 VKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ---- 192
Y +G + +TRE F+S PDQ I+ +IS + G++S + N Y+ ++
Sbjct: 138 SSYRIGEIRYTRELFASYPDQAIIMRISADKPGAVSLKARFNR--RNWRYMEKTDKWDQQ 195
Query: 193 -IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++M+G C GK G F AI++ + G + + L VE +D
Sbjct: 196 GLVMQGECGGK------------GGSSFCAIVK---ALSEGGVCKTIGEYLLVENADAVT 240
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A ++F P DP L+ + +SY++L RH+ DY +LF RV++
Sbjct: 241 LLLTAGTTFRHP---------DPELYGKRRLEELSQVSYTELLVRHIKDYTELFGRVTLS 291
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
LS SP +T+P+ +R+K + + +ED L+E FQFGRYLLISSSRPG+
Sbjct: 292 LSESPGK-----------NTLPTDDRLKRYREGEEDNGLIETYFQFGRYLLISSSRPGSL 340
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ +P WDS +NIN +MNYW + CNL+EC EPLF+ + + G TA
Sbjct: 341 PANLQGIWNDSYTPPWDSKFTININTQMNYWPAENCNLAECHEPLFELIERMREPGRVTA 400
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
V Y G+ HH TDIWA ++ + + WPMG AWLC HLWEHY + DR FL R
Sbjct: 401 GVMYGCRGFTAHHNTDIWADTAPQDTYLPASFWPMGAAWLCLHLWEHYRFGQDRYFL-AR 459
Query: 491 AYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
AY ++ A FLLD+LIE +G L T PS SPE+ + P+G+ + +TMD II +
Sbjct: 460 AYETMKEAALFLLDYLIEDGEGRLVTCPSVSPENRYKLPNGETGVLCAGATMDFQIIEAL 519
Query: 551 FSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHL 610
F A I + E++EK+E A E++ +L RL +I + G I EW +D+++ E HRH+SHL
Sbjct: 520 FEACIRSGEIIEKDE-AFREELAAALKRLPKPQIGKYGQIQEWMEDYEEVEPGHRHISHL 578
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMV 667
F L+PG I ++ P+L AA TL++R G GWS W WARL D + AY V
Sbjct: 579 FALYPGEGINVDSTPELAAAARTTLERRLANGGGHTGWSRAWIINFWARLLDADKAYENV 638
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
+ + H+ NLF HPPFQID NFG TA +AEML
Sbjct: 639 RAML---------HYS--TLPNLFDNHPPFQIDGNFGGTAGIAEML 673
>gi|293369104|ref|ZP_06615699.1| putative lipoprotein [Bacteroides ovatus SD CMC 3f]
gi|292635816|gb|EFF54313.1| putative lipoprotein [Bacteroides ovatus SD CMC 3f]
Length = 822
Score = 507 bits (1306), Expect = e-141, Method: Compositional matrix adjust.
Identities = 302/777 (38%), Positives = 443/777 (57%), Gaps = 57/777 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ + + + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDLE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 582 WMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVC 641
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 642 LWARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEM 698
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
L+QS +YLLPALP W G +KG+ ARGG + + WK+G + + + S N
Sbjct: 699 LMQSYDGFIYLLPALP-TVWKEGSIKGIIARGGFELDLSWKNGKVSRLVVKSYKGGN 754
>gi|288925248|ref|ZP_06419183.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
gi|288338013|gb|EFC76364.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
Length = 787
Score = 507 bits (1305), Expect = e-140, Method: Compositional matrix adjust.
Identities = 296/780 (37%), Positives = 434/780 (55%), Gaps = 53/780 (6%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M A S +P K+ + PA+ +TDA+P+GNGRLGAMV+G +E ++LNE+T+W G P
Sbjct: 16 MLASLFSQAHPYKLWYREPAQVWTDALPLGNGRLGAMVYGIPATERIQLNEETIWAGQPN 75
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEAT------AASVKLFGHPADVYQLLGDIELEFDDS 115
N A KA+ ++ L+ G+Y +A S +G P YQ G++ +
Sbjct: 76 GNANAKALKAIPVIQDLIWKGEYKKAQDLATSDVMSATNWGMP---YQAFGNVYISMPGM 132
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
Y REL L++A A +++ V + RE +S D V+ + + + G ++FN
Sbjct: 133 G---NYTNYYRELSLDSARAITRWTADGVTYRREVITSLADNVVTVRFTADQRGCITFNA 189
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSA-ILEIKISDDRGT 233
+ D+ II++ + + ++ KG ++F + + G
Sbjct: 190 YFTTPHDD---------IIIKSEGDEATLFGVTSKHEGLKGKVRFMGRMAAVAKGKGEGA 240
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++ +D + V+G+D AVL + +++F+ N D D S L++ Y+
Sbjct: 241 VTHSKDGIVSVKGADEAVLYISIATNFN----NYKDISGDEAVRSEQILRAAMARDYAQS 296
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+ +++L HRV++ L E+ +P+ ER+ F +D LV
Sbjct: 297 KAEHISRFRQLMHRVTLNLG------------EDQYKDLPTDERIIRFADHDDNYLVATY 344
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P WDS NINLEMNYW + P L+E E
Sbjct: 345 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPAWDSKYTTNINLEMNYWPAEPTQLTELNE 404
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCT 472
PLF + +S G++TA+ Y SGWV+HH TDIW + D + +W GGAWLC
Sbjct: 405 PLFRLIREVSETGAETARTMYGKSGWVLHHNTDIWRVTGGIDHAQS--GMWMTGGAWLCR 462
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLWEHY YTMD+DFL +R YP+++G A FL LI E G+L +PS SPE+ + DG
Sbjct: 463 HLWEHYLYTMDKDFL-RRYYPVMKGAAEFLDQMLIPEPQHGWLVISPSVSPENSHPSKDG 521
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
K+A ++ +TMD+ ++ E+F +++A++VL ++ AL + L + P ++ + G +
Sbjct: 522 KMA-IAAGTTMDVQLVNELFREVMAASKVLGEDA-ALAAHYAERLKLMPPMQVGKWGQLQ 579
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+ DP HRH+SHL+GL+PG IT+ L AA +L RG+ GWS+ WK
Sbjct: 580 EWMEDWDDPNDTHRHVSHLYGLYPGCQITLSGTHRLFDAARTSLIHRGDPSTGWSMGWKV 639
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHF----EGGLYSNLFAAHPPFQIDANFGFTA 707
LWARL D HAY++++ +L D + +GG Y NLF AHPPFQID NFG TA
Sbjct: 640 CLWARLFDGNHAYKLIRNQLSLTDDRFVAYGTDKKKGGTYRNLFDAHPPFQIDGNFGCTA 699
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGC-VKGLKARGG-ETVSICWKDGDLHEVGIYSN 765
+AEMLVQS + LLPALP D W +G VKGL ARG E + WKDG + + I SN
Sbjct: 700 GIAEMLVQSHEGYINLLPALP-DAWKTGGEVKGLMARGAFEIEHLAWKDGRVVRLAIRSN 758
>gi|340347371|ref|ZP_08670480.1| alpha-L-fucosidase [Prevotella dentalis DSM 3688]
gi|433651138|ref|YP_007277517.1| hypothetical protein Prede_0088 [Prevotella dentalis DSM 3688]
gi|339609463|gb|EGQ14335.1| alpha-L-fucosidase [Prevotella dentalis DSM 3688]
gi|433301671|gb|AGB27487.1| hypothetical protein Prede_0088 [Prevotella dentalis DSM 3688]
Length = 784
Score = 506 bits (1304), Expect = e-140, Method: Compositional matrix adjust.
Identities = 303/794 (38%), Positives = 422/794 (53%), Gaps = 44/794 (5%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PL++ ++ PA F +++PIGNG+LGA+++GG + LN+ T W+G P D T + DA
Sbjct: 26 PLRLWYDRPATCFEESLPIGNGKLGAIIYGGPDDNVIHLNDITFWSGKPVDLTIDSDAHV 85
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELD 129
+ +R + Y A + + G + YQ LG + + L+ E + Y R+L
Sbjct: 86 WIPKIREALFREDYRLADSLQHHVQGANSQYYQPLGTLRIR----DLQPGEASGYHRQLS 141
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L++A +Y G V +TRE+F+S PD+VI ++ S G LS ++ L S +D H
Sbjct: 142 LDSAVCHDRYVRGGVTYTREYFASAPDKVIAVRLRASRPGMLSCSIGLGSQVD-HGTKTS 200
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+ QIIM G NA DP+ I F +L ++S+D G++ D L V G++
Sbjct: 201 DRQIIMTG-----------NAAGDPQETIHFCTVL--RVSNDGGSVER-TDSSLVVTGAN 246
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A + LV +SF+G +P +M + N S L RHLDDYQ +FHRV
Sbjct: 247 GATIYLVNETSFNGYDKHPVTQGTPYIENAMDDAWHLANYSCDSLLRRHLDDYQPIFHRV 306
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
S L S + T S R Q D L L FQFGRYLLISSSR
Sbjct: 307 SFTLDGSRYNATQPT---------DSMLRAYGSQPAYDRYLEALYFQFGRYLLISSSRTP 357
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQG+WNE W +NINLE NYW N+ E PL F L+ G++
Sbjct: 358 GVPANLQGLWNEKKKAPWRGNYTININLEENYWPCDVANMPEMFAPLATFCQNLAQTGAQ 417
Query: 429 TAQVNY-LASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
A+ Y + GW H +DIWA ++ R W+ W MGGAWL ++++HY YT DR
Sbjct: 418 NARNYYGIGRGWSCGHNSDIWAMTNPVGEKRESPTWSNWNMGGAWLMQNVYDHYLYTQDR 477
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D+L AYPL+ G + F+LDWL+ + L T PSTSPE ++ G Y T
Sbjct: 478 DYLSGTAYPLMRGASDFILDWLVPNPRNPEELITAPSTSPEAYYVTDKGYKGATLYGGTA 537
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D+AIIRE+ + + AA L ++ A + + +L RL P + G + EW D+ D +
Sbjct: 538 DLAIIRELLTNTLEAARTLNRDR-AYQDTLRHTLARLHPYTVGRQGDLNEWYYDWADEDT 596
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH SHL GL+PGH IT+ P L +AA ++L+ +G GWS W+ LWARLH+
Sbjct: 597 CHRHQSHLIGLYPGHQITVGATPQLAQAAARSLEMKGGRTTGWSTGWRINLWARLHNASQ 656
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AYR+ ++L VDP H + GG + NLF AHPPFQID NFG TA V EML+QS +
Sbjct: 657 AYRIYQKLLAYVDPAHTQKQHGGTFPNLFDAHPPFQIDGNFGGTAGVCEMLMQSDGKTIE 716
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALP + W +G + GL+ARGG VS+ WKDG + I S + S Y G
Sbjct: 717 LLPALP-EAWPAGEICGLRARGGFEVSMGWKDGRVTWAEISSGKGGKVNVS-----YNGR 770
Query: 783 SVKVNLSAGKIYTF 796
+++ GK T
Sbjct: 771 VKPISVGKGKTKTL 784
>gi|325105420|ref|YP_004275074.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324974268|gb|ADY53252.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 768
Score = 506 bits (1303), Expect = e-140, Method: Compositional matrix adjust.
Identities = 306/798 (38%), Positives = 433/798 (54%), Gaps = 66/798 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA + +A+P+GNG LGAM++G +E L+LNE ++W G D+ NP A +L
Sbjct: 28 LKLWYNKPALDWNEALPVGNGSLGAMIFGNTFNEVLQLNESSVWAGKDEDFVNPRAKASL 87
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETYRRELD 129
VR+L+ +Y EA A L G YQ LG++ L+F S+ + Y REL+
Sbjct: 88 KKVRNLLFQEKYTEAQDLADSSLMGDKKIWSSYQELGNLRLDFKKSNRSVS--NYNRELN 145
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
+ A A ++V F RE FSS + K+S +++ +S + +D +
Sbjct: 146 IENAIATTTFNVDGTLFEREVFSSAVANTVFIKLSSNKTKQISLTIGMDRAGNLAKISAS 205
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++QI + ++ G+ +I I R ++S + K+ VE +D
Sbjct: 206 DHQIYLTEHV------------NNGVGVILHSIANIANKGGRLSVS---NNKIIVENADE 250
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
V+ L A+++F+ NP ++ K SES++ +Y H+ DYQ+ F+RV
Sbjct: 251 VVITLAAATNFN--HTNPLETVKSRISESLAK-------AYQQHKEEHIKDYQQYFNRVK 301
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPG 368
+ L + N P+ R+ + + DPSL+ L +Q+GRYLLISSSRPG
Sbjct: 302 LNLGNN------------NSSLFPTDARLSALKNGNFDPSLITLFYQYGRYLLISSSRPG 349
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQGIW E L W+ H+NIN +MNYW + NLSE P D+LT L +G K
Sbjct: 350 GLPANLQGIWAEGLQVPWNGDYHININAQMNYWLAENTNLSEMHMPFLDYLTNLGKDGKK 409
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y SG V H +DI+ + GK WA+WP G AW H WEHY YT D+ FLE
Sbjct: 410 TAKDMYGLSGEVAHFASDIFYYTEP-WGKPKWAMWPTGLAWCSQHAWEHYLYTQDKAFLE 468
Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
K+ Y +L+ + F LDWL++ G L + PS SPE+ F PDGK+A V MD II
Sbjct: 469 KQGYEILKQSSIFFLDWLVKNPKTGLLVSGPSISPENTFKTPDGKIATVIMGPAMDHMII 528
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
RE+F ISAA++L K++ LV K+ K+L +L PT+I DG I+EW+++ + E HRH+
Sbjct: 529 RELFGNTISAAQILGKDKK-LVTKLQKALKQLTPTQIGSDGRILEWSEELPEAEPGHRHI 587
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAY 664
SHLFGL+PG IT +KNP+ AA+KT+ R G GWS W +ARLHD E AY
Sbjct: 588 SHLFGLYPGREIT-DKNPETFNAAKKTIDYRLSHGGGHTGWSRAWIINFFARLHDGEKAY 646
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
++ L + LY NLF HPPFQID NFG TA + EML+QS N + LL
Sbjct: 647 ENLELLLK----------KSTLY-NLFDNHPPFQIDGNFGATAGITEMLMQSHTNQINLL 695
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
PALP W G + G+ ARGG + I W + +L EV + S N L Y+G
Sbjct: 696 PALP-SVWKDGEICGIVARGGFELDIVWGNNELKEVVVTSKTGNT-----LNLEYKGKVH 749
Query: 785 KVNLSAGKIYTFNRQLKC 802
+ S G Y FN+ L+
Sbjct: 750 QTATSKGNTYRFNKNLEL 767
>gi|56962910|ref|YP_174637.1| hypothetical protein ABC1138 [Bacillus clausii KSM-K16]
gi|56909149|dbj|BAD63676.1| conserved hypothetical protein [Bacillus clausii KSM-K16]
Length = 782
Score = 505 bits (1301), Expect = e-140, Method: Compositional matrix adjust.
Identities = 280/799 (35%), Positives = 437/799 (54%), Gaps = 44/799 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+++T PA+ +T+A PIGNGR+GAMV+GGV E + LN D+LW+G P +
Sbjct: 1 MQLTEQQPAQTWTEAYPIGNGRIGAMVYGGVEHEKIALNVDSLWSGPPAKRKQAPVKGTV 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+D+R+ + + + A+ + + G Y LGD+ + F ++ Y R L L T
Sbjct: 61 ADMRAAIAARDFQAASRYAKDMQGPYTQSYLPLGDLHILF--PLCTHSSTRYERTLQLET 118
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
AT V+ + + R F+S PD+ I+ ++ LSF+ L S L + + +
Sbjct: 119 ATVTVEDGL----YKRSVFASKPDEAIILRLEAVAELPLSFSAWLTSPLRTIGWPD-QDH 173
Query: 193 IIMEGRCPGKRIPPKANANDDP---------KGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + G CP + + P + +P I+F++ +++ +D +A+++ KL
Sbjct: 174 VGLAGWCP-EYVAPNYVPSSEPIRYTSYETSSAIRFASAVQLLETDGN---AAVKNNKLV 229
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE + +A +L+ +SF + K+P + L +Y L +RHL DYQ
Sbjct: 230 VEDARYATVLVHMETSFASA---QAPQGKEPITLIRKRLSETVTSTYETLQSRHLQDYQS 286
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF R++ L+ + ++ ++ ++ER+ + + D LVELLFQ GRYLLI+
Sbjct: 287 LFQRMTFTLNETEREKLS------------TSERLAKYGAN-DGKLVELLFQMGRYLLIA 333
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR GT+ ANLQGIWNE + P W S +NIN +MNYW + L EC +P F+ LS
Sbjct: 334 SSREGTEAANLQGIWNEHIRPPWSSNYTLNINAQMNYWPAETAALPECHQPFLTFIEELS 393
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA----DRGKVVWALWPMGGAWLCTHLWEHYN 479
G AQ Y GW HH +DIW ++ G VWA WPM WL HLWEHY
Sbjct: 394 EQGKAVAQNYYQCRGWTAHHNSDIWRQAEPVGGFGGGDPVWAFWPMAAPWLTRHLWEHYL 453
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
++ DR +L +RAYP+++G F LDWL++ G + T+PSTSPEH F+ G+ VS
Sbjct: 454 FSADRAYLTERAYPVMKGAILFCLDWLVQDESGAVYTSPSTSPEHRFLY-KGQPYPVSEG 512
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
+ MD+A++ +VF ++A E++ ++ L V +L +L+ ++ +G++ EW F
Sbjct: 513 AVMDLALLEDVFHLFLAANELVGGDQQ-LATDVKDALNQLKKPPLSAEGALQEWTHGFPG 571
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
++HHRHLSHL+G++PG + +AA+++L +RG+ G GWS+ WK LWAR D
Sbjct: 572 EDMHHRHLSHLYGVYPGSQWSSNHQQKRYQAAKQSLSERGDGGTGWSLAWKLCLWARFLD 631
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
+ ++ R LV E+H GG+Y NLF+AHPPFQID NFGF A V E LVQS
Sbjct: 632 GDRTDALISRSMQLVREGDEQHESGGVYPNLFSAHPPFQIDGNFGFVAGVIETLVQSHEG 691
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSF-KTLH 778
+ LLPALP +W G + G++ RGG T+ + W++ + +Y++ N F +
Sbjct: 692 FIRLLPALP-RRWKQGAITGVRCRGGFTIDLKWQNSSVLACTVYASCENACVVVFPNAMS 750
Query: 779 YRGTSVKVNLSAGKIYTFN 797
++ + AGK+Y F
Sbjct: 751 TTENGERMAIDAGKLYAFK 769
>gi|395776471|ref|ZP_10456986.1| large protein [Streptomyces acidiscabies 84-104]
Length = 802
Score = 505 bits (1300), Expect = e-140, Method: Compositional matrix adjust.
Identities = 300/744 (40%), Positives = 418/744 (56%), Gaps = 60/744 (8%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+P+GNGRLGAMV+G +E L+LNEDTLW G P +Y NP AL +R LV + Q+ +
Sbjct: 46 ALPVGNGRLGAMVFGNTDTERLQLNEDTLWAGGPHNYDNPRGAAALGRIRQLVFADQWGQ 105
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G PA YQ +GD+ L F A Y R LDL TAT V Y+ N
Sbjct: 106 AQDLINQTMLGDPAAQLAYQPVGDLRLTFPAGS---AVSAYERLLDLTTATTAVTYTANN 162
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+S PDQVIV +++ GS++F+ + S I ++G
Sbjct: 163 VSYRREVFASAPDQVIVMRLTARTPGSITFSATFASPQRTSLSSPDGTTIALDG------ 216
Query: 204 IPPKANANDDPKGI----QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSS 259
+ D +GI +F A+ K + G++++ L+V G+D LL+ +S
Sbjct: 217 ------VSGDMRGIAGTVRFLAL--AKAVAEGGSVTS-SGGTLRVTGADSVTLLVSIGTS 267
Query: 260 FDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
+ ++ D + + L + + ++Y L RH+ DYQ LF RVS+ + R+P
Sbjct: 268 Y----VDYRTVDGDYQGIARTHLDAAQGVAYDTLRARHVADYQALFGRVSLDVGRTP--- 320
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+++ P+ R+ + +DP LLFQ+GRYLLISSSRPGTQ ANLQGIWN
Sbjct: 321 ----AADQ-----PTDVRIAQHGSADDPQFSALLFQYGRYLLISSSRPGTQPANLQGIWN 371
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
+ L+P+WDS +N NL MNYW + NL+EC P+F + L+ G++TAQ Y A GW
Sbjct: 372 DQLTPSWDSKYTINANLPMNYWPADTTNLAECLAPVFAMIDDLTATGARTAQAQYGARGW 431
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
V HH TD W +S G VW +W GGAWL + +W+HY +T D +FL +R YP L+G A
Sbjct: 432 VTHHNTDAWRGTSVVDG-AVWGMWQTGGAWLASLIWDHYRFTGDVEFL-RRNYPALKGAA 489
Query: 500 SFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
F LD L+ G+L TNPS SPE PD V TMDM I+R +F SA+
Sbjct: 490 RFFLDTLVPHPGLGHLVTNPSNSPELTH-HPD---VSVCAGPTMDMQILRSLFDGCASAS 545
Query: 559 EVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHT 618
EVL + A +V + RL P KI G+I EW D+ + E HRH+SHL+GL PG+
Sbjct: 546 EVLGVDA-AFRAQVRSARRRLAPMKIGSRGNIQEWLHDWVETEPGHRHISHLYGLHPGNE 604
Query: 619 ITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEH 678
IT P L +AA +TL+ RG+ G GWS+ WK WAR+ + A+ +++ +LV +
Sbjct: 605 ITRRGTPQLFEAARRTLELRGDAGTGWSLAWKINYWARMEEGARAHELLR---DLVTTDR 661
Query: 679 EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVK 738
L N+F HPPFQID NFG T+ +AEML+ S +L++LPALP W +G V
Sbjct: 662 -------LAPNMFDLHPPFQIDGNFGATSGIAEMLLHSHHGELHVLPALP-PAWPTGSVT 713
Query: 739 GLKARGGETVSICWKDGDLHEVGI 762
GL+ RGG TV W DG L E+ +
Sbjct: 714 GLRGRGGHTVGAVWHDGRLTELTV 737
>gi|284036403|ref|YP_003386333.1| alpha-L-fucosidase [Spirosoma linguale DSM 74]
gi|283815696|gb|ADB37534.1| Alpha-L-fucosidase [Spirosoma linguale DSM 74]
Length = 842
Score = 505 bits (1300), Expect = e-140, Method: Compositional matrix adjust.
Identities = 292/777 (37%), Positives = 433/777 (55%), Gaps = 61/777 (7%)
Query: 13 LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
LK+ +N PA K +T A+P+GNGRLGAMV+G E +KLNE T+W+G P NPDA A
Sbjct: 37 LKLWYNQPAGKVWTSALPVGNGRLGAMVYGNPEQELIKLNEATVWSGGPNRNDNPDALAA 96
Query: 72 LSDVRSLVDSGQYAEA---TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L ++R L+ +G+ AEA AA+++ + YQ +G+++L F + Y REL
Sbjct: 97 LPEIRRLIFAGKQAEAQKLAAANIETKKNNGMKYQPVGNLQLSFTGHQ---SVTNYYREL 153
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D+ A A Y+V V + R+ +S PDQVI +++ + G LSF L+S V
Sbjct: 154 DIEKAIATTMYTVDGVRYMRQVIASVPDQVIAVRLTADKPGKLSFTAFLNSPQKVQRSVE 213
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGS 247
+++M G + ++ KG + F+A + + + T + D + + G+
Sbjct: 214 ETTKLVMTGTT---------SDHEGVKGQVNFNAHVRVVAEGGQTTKT---DTSVVISGA 261
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
+ L + +++ ++ DP + + S L S++ + H+ YQ+ F R
Sbjct: 262 NATTLYVSMATNV----VDYKTLTADPKTRADSYLTPAAKRSFNAVLAAHVAAYQRYFKR 317
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V++ L S + +P+ ER++ F + DP LV L FQFGRYLLIS+S+P
Sbjct: 318 VNLDLGTS------------DAAKLPTDERIRQFASGNDPQLVSLYFQFGRYLLISASQP 365
Query: 368 GT-----QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
QVA LQG+WN+ + P WDS +NIN EMNYW + NL+E EPL + L
Sbjct: 366 SRNGVVGQVATLQGLWNDRMDPPWDSKYTININTEMNYWPAEVTNLTELHEPLVQMVKEL 425
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA+V Y ASGW+ HH TD+W + + + +++WPMGGAWL HLWE Y Y+
Sbjct: 426 SQTGQETARVMYGASGWLAHHNTDLW-RITGPVDPIYYSMWPMGGAWLSQHLWEKYQYSG 484
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLAC-VSYSS 540
D+ +L K YP ++G A F +D+L+E + YL P SPE+ AP + +
Sbjct: 485 DKAYL-KSVYPAMKGAAQFFVDYLVEDPNHHYLVVCPGMSPEN---APSTRPGVSIDAGV 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD ++ ++F+ I AA+ L + D V+ V L +L P ++ + G + EW D P
Sbjct: 541 TMDNQLVFDIFTNTIRAAQALGTDAD-FVKIVASKLAQLPPMQVGKHGQLQEWIDDLDSP 599
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
+ HRH+SHL+GL+P ++ + P L +AA TL++RG+ GWS+ WK WARL D
Sbjct: 600 DDKHRHISHLYGLYPSAQLSAYRTPQLFRAARNTLEQRGDASTGWSMGWKVNWWARLLDG 659
Query: 661 EHAYRMVKRLFNLVDPEHEKHFE-------GGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
AYR++ N + P E GG Y+NLF AHPPFQID NFG TA +AEML
Sbjct: 660 NRAYRLIT---NQLSPVSEGGRNRPGGTGVGGTYNNLFDAHPPFQIDGNFGCTAGIAEML 716
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
+QS ++LLPALP D+W +G + GL+ARGG E VS+ WK+G + V I S N
Sbjct: 717 MQSHDEAIHLLPALP-DRWPTGRISGLRARGGFEIVSLDWKEGKVASVTIKSTLGGN 772
>gi|260642325|ref|ZP_05415419.2| alpha-L-fucosidase 2 [Bacteroides finegoldii DSM 17565]
gi|260622630|gb|EEX45501.1| hypothetical protein BACFIN_06792 [Bacteroides finegoldii DSM
17565]
Length = 824
Score = 505 bits (1300), Expect = e-140, Method: Compositional matrix adjust.
Identities = 312/775 (40%), Positives = 453/775 (58%), Gaps = 53/775 (6%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E+ ++T K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 25 ETNASTQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGIPGTEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ S G ++FN L
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTASRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V +++ EG C + ++ ++ KG ++F L + +RG A
Sbjct: 199 S---PHQDVMISSE---EGNC--VTLSGVSSWHEGLKGKVEFQGRLTAR---NRGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A++ + +++F+ N D + + L + + H
Sbjct: 248 ADGILSVEGADEAIIYVSIATNFN----NYLDITGNQIERTKDYLSKAMKHPFPEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
D Y++ RVS+ L ++ ENI T +RV++F+ D LV FQFG
Sbjct: 304 TDFYRRYLTRVSLNLGKN---------RYENITT---DKRVENFKDTNDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 412 LIKEVSETGKETAKIMYGANGWVLHHNTDIWRVTGAI-DKAPSGMWPSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D DFL + YP+L+ F + ++ E +L PS SPE+ +GK A
Sbjct: 471 YLYTGDTDFL-RSIYPILKESGRFFDEIMVKEPIHNWLVVCPSNSPENVHSGNNGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
+ TMD +I ++++AIISA+E+L+ ++D +++ LK +P P +I G + EW
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASEILDTDKDFATHLKQRLKEMP---PMQIGHWGQLQEWM 585
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LW
Sbjct: 586 FDWDDPKDVHRHVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLW 645
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+
Sbjct: 646 ARLLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLM 702
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
QS +YLLPALP W G VKG+ ARGG + + WKDG ++ + + S+ N
Sbjct: 703 QSYDGFIYLLPALP-TLWKEGSVKGIIARGGFELDLSWKDGKVNHLIVKSHKGGN 756
>gi|430751368|ref|YP_007214276.1| hypothetical protein Theco_3223 [Thermobacillus composti KWC4]
gi|430735333|gb|AGA59278.1| hypothetical protein Theco_3223 [Thermobacillus composti KWC4]
Length = 768
Score = 504 bits (1298), Expect = e-140, Method: Compositional matrix adjust.
Identities = 297/768 (38%), Positives = 423/768 (55%), Gaps = 70/768 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + ++ PA + +A+PIGNGR+GAMV+G SE L+LNED+LW G P D NPDA K L
Sbjct: 1 MVMKYDRPAAEWNEALPIGNGRMGAMVFGHPVSERLQLNEDSLWYGGPRDRNNPDAAKVL 60
Query: 73 SDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
++R L+ G+ EA +V L G P Y+ LG + L F+ A E Y+R LD
Sbjct: 61 PEIRRLIFEGKPREAERLAVTGLSGIPETQRHYEPLGQLLLHFEGIDPD-AVEQYQRSLD 119
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN- 188
L A A V++ V RE+++S PDQ I+ + + G +S L+ YV+
Sbjct: 120 LERAVASVEFLHRGVRHRREYYASCPDQAIIVRATADRPGQISLTARLERA--RWRYVDA 177
Query: 189 ----GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
G + I M G A+ +G+ F+A + + G++ A+ + L V
Sbjct: 178 TGRSGTDAIYMTG------------ASGGAEGVSFAAAVTARTEG--GSLDAI-GEHLVV 222
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
E +D L++ A++SF +K+P + ++ +++ + Y RH+ DY++L
Sbjct: 223 EHADSVTLVISAATSF---------REKEPLAHCLAHARTVCAAPDDERYARHVRDYREL 273
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLIS 363
F RVS+ L +E +P ER++ + +EDP+L L FQ+GRYLLI+
Sbjct: 274 FGRVSLALG-----------GDEERSVLPVPERLERLRKGEEDPALAALYFQYGRYLLIA 322
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG+ ANLQGIWN+ P WDS +NIN +MNYW + C L EC EPLFD + L
Sbjct: 323 SSRPGSLPANLQGIWNDHFLPPWDSKYTININAQMNYWPAESCALPECHEPLFDLIERLR 382
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G +TA+V Y G+ HH TDIWA ++ + + WP+G AWLC HLWEHY +T D
Sbjct: 383 EPGRRTARVMYGCRGFAAHHNTDIWADTAPQDTYIPASYWPLGAAWLCLHLWEHYRFTQD 442
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
FLE R+ ++ A F++D+L+EG G L T PS SPE+ ++ P+G+ + TMD
Sbjct: 443 LPFLE-RSLETMKEAARFVMDYLVEGPSGELVTCPSVSPENSYVLPNGETGVLCAGPTMD 501
Query: 544 MAIIREVFSAIISAAEVL-----EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
IIR + SA + A VL + +++A + + L RL KI + G+I EW +D+
Sbjct: 502 TQIIRALLSACVEAERVLSDRTGKASDEAFIREAELVLKRLPKEKIGKLGTIQEWYEDYD 561
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWA 655
+ E HRH+SHLF L PG IT + P+L +AA +TL++R G GWS W WA
Sbjct: 562 EAEPGHRHISHLFALHPGDQITPRRTPELAQAARRTLERRLSHGGGHTGWSRAWIINFWA 621
Query: 656 RLHDQEHAYR-MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
RL D E A+ +V L P NL HPPFQID NFG TA +AEML+
Sbjct: 622 RLEDGELAHENLVALLCKSTLP------------NLLDNHPPFQIDGNFGGTAGIAEMLL 669
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
QS ++LLPALP W +G V GL+ RGG V I W +G L E I
Sbjct: 670 QSHDGVIHLLPALP-KAWPAGEVAGLRTRGGYEVDIRWAEGVLVEAWI 716
>gi|294146663|ref|YP_003559329.1| hypothetical protein SJA_C2-02340 [Sphingobium japonicum UT26S]
gi|292677080|dbj|BAI98597.1| hypothetical protein SJA_C2-02340 [Sphingobium japonicum UT26S]
Length = 777
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 303/794 (38%), Positives = 434/794 (54%), Gaps = 61/794 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
+PL + + PA +T+A+PIGNGRLGAM++GGV E L+LNE TLW G P D NP+A
Sbjct: 33 HPLTLWYRQPAAAWTEALPIGNGRLGAMLFGGVARERLQLNEGTLWAGQPYDPVNPEAKA 92
Query: 71 ALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRE 127
L VR L+ +G+ AEA A + K L P YQ LGD+ L+F A Y RE
Sbjct: 93 NLPQVRELIFAGRIAEAEALADKTLMAKPLAQMPYQTLGDLILDFPGVGQATA---YHRE 149
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDNHSY 186
LDL++ATA +++ G V R+ +S D VI +S +G L ++SL S +
Sbjct: 150 LDLDSATATTRFTAGGVAHVRQAIASPADNVIAVHLS--STGRLDVDISLRSSQIGVQVA 207
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+G N +++ GR R + N ++F+A L ++ T SA D L + G
Sbjct: 208 ADGPNGLLLTGRNGASR---GIDGN-----LRFAARLAARVEGGHATHSA--DGSLSIRG 257
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ LLL ++ F D DP + + + L R+ S++ + T D +++LF
Sbjct: 258 AKSVTLLLAMATGFR----RFDDVGGDPVAGTAATLARARDRSFATIATDAADAHRRLFR 313
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L +P +P+ R+ QT +DP+L L F + RYLLI SSR
Sbjct: 314 RVTLDLGSTPAA------------QLPTDRRIADSQTSDDPALAALYFHYARYLLICSSR 361
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG+WN+ L P W S +NIN +MNYW + P L EC PL + + L++ G
Sbjct: 362 PGGQPANLQGLWNDSLDPPWGSKYTININTQMNYWPAEPAALGECVAPLVEMVRDLAVTG 421
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++TA+ Y A GWV HH TD+W +++A + LWP GGAWLC HLW+HY+Y DR +
Sbjct: 422 ARTARSMYGARGWVAHHNTDLW-RATAPIDGAQFGLWPTGGAWLCMHLWDHYDYHRDRAY 480
Query: 487 LEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L YPL+ G A F LD L + G+L TNPS SPE+ P G + TMDMA
Sbjct: 481 LAS-VYPLMAGAARFFLDTLQRDPASGFLVTNPSMSPEN----PHGHGGTICAGPTMDMA 535
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVH 603
I+R++F+ + AA +L+++ +LV ++ + RL P +I G + EW QD+ PE +
Sbjct: 536 ILRDLFTRTMEAAAILDRDA-SLVAEMRAARDRLAPYRIGRQGQLQEWQQDWDADAPEQN 594
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH+SHL+GL P IT + P L AA +TL+ RG+ GW+ W+ LWARL + + A
Sbjct: 595 HRHVSHLYGLHPSRQITPDGTPALAAAARRTLEIRGDRATGWATAWRINLWARLREGDRA 654
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
+ +++ L PE Y N+F AHPPFQID NFG A + E+L+ S + + L
Sbjct: 655 HDILRFLLG---PERT-------YPNMFDAHPPFQIDGNFGGAAGIVEILMDSHGDIIDL 704
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTS 783
LPALP W +G V GL+ARG V + W++G L + +TL S
Sbjct: 705 LPALP-RAWPAGRVTGLRARGRCAVDLHWREGRLDRAILRPELGGP-----RTLRLGAGS 758
Query: 784 VKVNLSAGKIYTFN 797
+ L AG T
Sbjct: 759 RTLVLKAGTPVTLT 772
>gi|365122610|ref|ZP_09339511.1| hypothetical protein HMPREF1033_02857 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642358|gb|EHL81716.1| hypothetical protein HMPREF1033_02857 [Tannerella sp.
6_1_58FAA_CT1]
Length = 852
Score = 503 bits (1296), Expect = e-139, Method: Compositional matrix adjust.
Identities = 299/761 (39%), Positives = 422/761 (55%), Gaps = 47/761 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA + +A+P+GN RLGAMV+G +E ++LNE+T+W G P NP+A L
Sbjct: 64 LKLWYKQPATQWVEALPLGNSRLGAMVYGIPDNEEIQLNEETVWGGGPHRNDNPEAKDIL 123
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+VR L+ G+ EA K F P + YQ +G ++L FD H Y + Y R+LDL
Sbjct: 124 PEVRRLIFEGKSKEAKPIMEKKFRTPRNGMPYQTIGSLKLHFD-GHENYTD--YYRDLDL 180
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V +TRE F+S D V++ +I+ + G+L+F S L H+
Sbjct: 181 TRAVATTRYKVNGVTYTRELFTSFADNVVIMQITSDKQGALNFTADYVSPL-KHTVSTKK 239
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
++I+ G+ A+ P I+ IK +D + S D K+ V + A
Sbjct: 240 GKLILSGKG--------ADHEGVPGVIRLENQTFIKTTDGKVKTS---DNKISVSDATTA 288
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ + A+++F +N +D + + + +++ Y H+ Y+KLF RV++
Sbjct: 289 TIYISAATNF----VNYNDVSANEHKRADAYMKAALKKPYEKALADHIAYYKKLFDRVTL 344
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L S + EE + RVK+F+ D SL L+FQFGRYLLISSS+PG Q
Sbjct: 345 DLGTSKE------AQEE------THLRVKNFKNGNDVSLAVLMFQFGRYLLISSSQPGGQ 392
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWNE L WD +NIN EMNYW + NLSE EPL + LS++G +TA
Sbjct: 393 PANLQGIWNEKLQAPWDGKYTININTEMNYWPAEVTNLSETHEPLIQMVKELSVSGQETA 452
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y +GWV HH TD+W G +WP GGAWL H+W+HY YT D+++L+
Sbjct: 453 KEMYGCNGWVTHHNTDLWRSCGPVDGADY--VWPNGGAWLSQHVWQHYLYTGDKEYLQD- 509
Query: 491 AYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
YP L+G A F LD+L E H Y + T PS+SPEH P G + TMD I
Sbjct: 510 VYPALKGVADFFLDFLTE-HPTYKWMVTVPSSSPEH---GPRGNGNSIVAGCTMDNQIAF 565
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
+ S + A ++L + D K+ + RL P +I + + EW QD DP HRH+S
Sbjct: 566 DALSNALQATKILNGDAD-YCNKLQNMIDRLAPMQIGQYNQLQEWLQDVDDPNNDHRHVS 624
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVK 668
HL+GL+P + I+ +P+L +AA +L RG++ GWSI WK LWARL D HAY++++
Sbjct: 625 HLYGLYPSNQISPYNHPELFQAARNSLVYRGDKATGWSIGWKINLWARLLDGNHAYKIIQ 684
Query: 669 RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALP 728
+ LV+ + +G Y NLF AHPPFQID NFG+TA VAEML+QS ++LLPALP
Sbjct: 685 NMLMLVEKGNN---DGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVHLLPALP 741
Query: 729 WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
D W G V GL ARGG VS+ W L++ I S N
Sbjct: 742 -DVWRRGSVNGLMARGGFEVSMDWDGVQLNKARILSKLGGN 781
>gi|337748975|ref|YP_004643137.1| alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|379721944|ref|YP_005314075.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|386724687|ref|YP_006191013.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
gi|336300164|gb|AEI43267.1| Alpha-L-fucosidase [Paenibacillus mucilaginosus KNP414]
gi|378570616|gb|AFC30926.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|384091812|gb|AFH63248.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
Length = 786
Score = 503 bits (1296), Expect = e-139, Method: Compositional matrix adjust.
Identities = 287/752 (38%), Positives = 410/752 (54%), Gaps = 55/752 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA+ +TDA+P+GNGRLGAMV+G V E L++NED++W G P + NPD K L
Sbjct: 11 KLWYEKPARAWTDALPVGNGRLGAMVFGKVNQERLQINEDSVWYGGPLNGDNPDGRKYLP 70
Query: 74 DVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+VR L+ G+ EA AA + L P + YQ LGD+ + D K Y R+LD+
Sbjct: 71 EVRRLLLKGKQLEAEEAAQMGLMSIPKSMRPYQPLGDLHIYHDGE--KKMISNYYRDLDI 128
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVNG 189
A V Y + V RE FSS D V+ +I+ L+ +++ D +
Sbjct: 129 EEGIAHVSYCLNEVPHVREVFSSAVDGVLAVRITCGPDAKLNLRMNVSRRPFDEGTQQLA 188
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ I M G + G+ + + +K + G ++A D L V ++
Sbjct: 189 HDTIAMCG-------------ENGKNGVTYC--MAVKAVPEGGWVNAFGDF-LAVRDANA 232
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ + ++F DP +E + L+ Y + H+ D++ L+ RV+
Sbjct: 233 VTIYIAGGTTF---------RSDDPLAECVRQLEQAERKGYEAVRRDHVADHRSLYRRVN 283
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
++L P S + T+P+ R++ F + EDP L L FQ+GRYL+++SSRPG
Sbjct: 284 LELDPEP-------VSGPDPSTLPTDARLQRFREGGEDPGLFRLYFQYGRYLMMASSRPG 336
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ ANLQGIWNE +P W+S +NIN EMNYW + CNL EC EPLFD + + NG K
Sbjct: 337 SNPANLQGIWNESFTPPWESKYTININTEMNYWPAESCNLPECHEPLFDLIDRMRPNGRK 396
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G+V HH TD+W + + + ++WPMG AWL HLWEHY Y ++ FL
Sbjct: 397 TAEQLYGCRGFVAHHNTDMWGSTQVEGNYMPGSIWPMGAAWLSLHLWEHYRYGLEETFLR 456
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
+RAYP+++ A F LD+L E +G L T PSTSPE++FI PDG + ++ +MD+ I+
Sbjct: 457 ERAYPVMKEAAEFFLDYLFEDKEGRLVTGPSTSPENKFIMPDGSVGTLTIGPSMDIQIVY 516
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
+ SA AAE+L + +D L EK + L RL P +I G + EW D+ + HRH+S
Sbjct: 517 SLLSACTDAAEIL-RTDDLLREKWEEVLRRLPPPQIGRHGQLQEWTGDWDEVHPGHRHIS 575
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYR 665
HLF L PG I + P+ +AA TL +R E G GWS W +ARL D +AY
Sbjct: 576 HLFALHPGEIIHVRHTPEWAQAARVTLDRRLENGGGHTGWSRAWILNFYARLEDGVNAYA 635
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ L + NLF HPPFQID NFG TA +AEML+QS ++ LLP
Sbjct: 636 HLRALLSQ-----------STLPNLFDNHPPFQIDGNFGGTAGIAEMLLQSHRGEIALLP 684
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
ALP W SG V GL+ARGG V + W DG L
Sbjct: 685 ALP-PVWRSGRVSGLRARGGFEVDLEWADGAL 715
>gi|260066219|gb|ACX30659.1| Fuc19 [Sphingobacterium sp. TN19]
Length = 821
Score = 503 bits (1294), Expect = e-139, Method: Compositional matrix adjust.
Identities = 296/767 (38%), Positives = 434/767 (56%), Gaps = 60/767 (7%)
Query: 13 LKITFNGPA--KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
LK+ ++ P + + A+PIGNGRLGAMV+G E L+LNE+T++ G P NP+A
Sbjct: 33 LKLWYDQPVVDQIWEQALPIGNGRLGAMVYGIPEREELQLNEETIYAGGPYRNDNPNALN 92
Query: 71 ALSDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYR 125
AL ++ L+ +G+ EA + + F G P YQ G + L F D H Y + Y
Sbjct: 93 ALPQIQQLIFAGKTEEADRLTNQSFFTKTHGMP---YQTAGSVILNFPD-HKHY--QHYY 146
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
RELDL A R +Y+V V +TR+ FSS D VIV +I+ S+ G+L+F++ + +
Sbjct: 147 RELDLEKAVVRSRYTVEGVTYTRQVFSSFADDVIVMEITASKKGALNFDLEYANPSECKV 206
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKV 244
Y +G + +I+EG +++ +G I++ +K D R T L D KL V
Sbjct: 207 YKSGQS-LILEG---------SGTSHEGIEGKIRYQKHTAVKNKDGRVT---LTDNKLTV 253
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ V+ + +++F +N ++ ++ S L + ++ +H+ Y K
Sbjct: 254 SGATSVVIYMAVATNF----VNYKTVDQNAGVKAASTLALAQKKAFQTALKQHIAMYSKQ 309
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F R + L + T +EN+ T +R++SF+T +DP+LV LL QFGRYLLI S
Sbjct: 310 FARFKLDLGQ--------TAGQENLTTT---KRIESFKTTQDPALVALLVQFGRYLLICS 358
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN ++P WDS VNIN EMNYW + NLSE EPLF + LS
Sbjct: 359 SQPGGQPANLQGIWNRSMNPPWDSKYTVNINTEMNYWPAEVTNLSETHEPLFQLIKELSE 418
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +TA+V Y A GWV HH TD+W +S +WP GG WL HLWEHY YT D+
Sbjct: 419 SGRETARVLYGADGWVTHHNTDLWRVTSPIDFAAA-GMWPTGGTWLTQHLWEHYLYTGDQ 477
Query: 485 DFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL + YP+++G A F+L LI H +L PS SPEH +S TM
Sbjct: 478 KFLTE-VYPVMKGAADFILSILIAHPKHKDWLVIAPSISPEH---------GPISTGITM 527
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D + ++ + A+E+++++ A K++K+ +L P ++ + EW +D DP+
Sbjct: 528 DNQLAFDILTRTALASEIVDQDA-AYKAKLIKTARKLPPMQVGRYAQLQEWLEDLDDPKS 586
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH+SHL+GL+PG+ I+ + P L +AA +LQ RG+ GWSI WK LWARL +
Sbjct: 587 DHRHVSHLYGLYPGNQISAYRTPQLFEAAANSLQYRGDFATGWSIGWKINLWARLLNGNK 646
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY+++ + L + K+ +G Y N+F AHPPFQID NFG +A VAEML+QS ++
Sbjct: 647 AYQIIDNMLTLAN---HKNPDGRTYPNMFTAHPPFQIDGNFGLSAGVAEMLLQSHDGAVH 703
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+LPAL + W G V G+ ARGG TV + WKDG + + + S N
Sbjct: 704 VLPALS-ELWRDGAVSGIVARGGFTVDMNWKDGQIRNIAVTSKIGGN 749
>gi|443622308|ref|ZP_21106841.1| putative Large secreted protein [Streptomyces viridochromogenes
Tue57]
gi|443344193|gb|ELS58302.1| putative Large secreted protein [Streptomyces viridochromogenes
Tue57]
Length = 973
Score = 503 bits (1294), Expect = e-139, Method: Compositional matrix adjust.
Identities = 292/740 (39%), Positives = 409/740 (55%), Gaps = 56/740 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N ++++R V + Q+
Sbjct: 60 ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANIAEIRRRVFADQWGP 119
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G PA YQ +G++ L F + Y R LDL TATA Y +
Sbjct: 120 AQDLINQTMLGSPAGQLAYQPVGNLRLSFGSA---TGASQYNRTLDLTTATAITTYVLNG 176
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+ PDQVIV +++ + S++F + DS I ++G
Sbjct: 177 VRYQREVFAGAPDQVIVVRLTADRANSIAFIATFDSPQRTTVSSPDGATIALDG------ 230
Query: 204 IPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
+ A + G ++F A+ ++ GT+S+ L+V G+ +L+ SS+
Sbjct: 231 ---ISGAMEGIAGRVRFLALANAAVTG--GTVSS-SGGTLRVSGATSVTMLVSIGSSY-- 282
Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
+N + D + S L + R++ L +RHL DYQ LF+RVS+ L R
Sbjct: 283 --VNFRKADGDYQGIARSHLNAARDVGIDVLRSRHLADYQALFNRVSVDLGR-------- 332
Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
T + + P+ R+ DP LLFQFGRYLLISSSRPGTQ ANLQGIWN+ +
Sbjct: 333 TAAADQ----PTDVRIAQHAQVNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDQM 388
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIH 442
+P+WDS +N NL MNYW + NLSEC P+FD + L++ G++ AQ Y A GWV H
Sbjct: 389 APSWDSKFTINANLPMNYWPADTTNLSECFRPVFDMINDLTVTGARVAQAQYGAGGWVTH 448
Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
H TD W +S G W +W GGAWL T +W+HY +T D DFL YP L+G A F
Sbjct: 449 HNTDAWRGASVVDG-AQWGMWQTGGAWLATLIWDHYLFTGDIDFLRSN-YPALKGAAQFF 506
Query: 503 LDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEV 560
LD L+ H G+L TNPS SPE A V TMD I+R++F+++ A E+
Sbjct: 507 LDTLVA-HPALGHLVTNPSNSPELAHHTN----ATVCAGPTMDNQILRDLFNSVARAGEI 561
Query: 561 LEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTIT 620
L + + L + RL PT++ G+I EW D+ + E HRH+SHL+GL P + IT
Sbjct: 562 LGADA-TFRAQALAARDRLPPTRVGSRGNIQEWLADWVETERTHRHVSHLYGLHPSNQIT 620
Query: 621 IEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEK 680
P L +AA +TL+ RG+EG GWS+ WK WAR+ D A+++++ +LV +
Sbjct: 621 KRGTPQLHEAARRTLELRGDEGTGWSLAWKINFWARMEDGARAHKLIR---DLVRTDR-- 675
Query: 681 HFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGL 740
L N+F HPPFQID NFG T+ +AEML+QS +L++LPALP W +G V GL
Sbjct: 676 -----LAPNMFDLHPPFQIDGNFGATSGIAEMLLQSHNGELHVLPALP-AAWPTGRVSGL 729
Query: 741 KARGGETVSICWKDGDLHEV 760
+ RGG TV W G + V
Sbjct: 730 RGRGGHTVGAEWSSGRIEVV 749
>gi|325105288|ref|YP_004274942.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324974136|gb|ADY53120.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 826
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 301/769 (39%), Positives = 426/769 (55%), Gaps = 64/769 (8%)
Query: 13 LKITFNGPA--KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
LK+ +N P + A+PIGNGRLGAMV+G E L+LNE+T+W G P N A +
Sbjct: 39 LKLWYNKPVIDNVWEQALPIGNGRLGAMVYGIPQREQLQLNEETIWGGGPYRNDNNKALE 98
Query: 71 ALSDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYR 125
L V+ +V GQ EA + F G P +Q G + L F H +Y E Y
Sbjct: 99 VLPLVQKMVFDGQTQEADKLINQSFFTQTHGMP---FQTAGSLILNFP-GHNQY--ENYY 152
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
RELDLN A + Y+V V++TRE FSS D VI+ +++ SE G L+F++ + H+
Sbjct: 153 RELDLNKAVVKTTYTVNGVKYTREVFSSFTDDVIIMQLTSSEKGGLNFDIGYVNP-SQHT 211
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK--ISDDRGTISALEDKKLK 243
+N +++EGR D +GI+ +I +S G + A+ D K+
Sbjct: 212 VSKKDNSLVLEGR------------GSDHEGIEGKIRYQIHTLVSHADGHV-AVSDHKIN 258
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ + A + + ++F N +P + S L + ++ +H Y K
Sbjct: 259 ITEASSATIYISIGTNF----TNYKSVDANPAERAASKLAVAKKKNFKSALQQHSATYYK 314
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F R + L D EE P+ R+++F+ +DP+LV LL QFGRYLLIS
Sbjct: 315 QFGRFKLNLGSQ------DISKEE-----PTDVRIRNFKETQDPALVTLLTQFGRYLLIS 363
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q +NLQGIW + P WDS +NIN EMNYW + NLS+ EPLF L LS
Sbjct: 364 SSQPGGQPSNLQGIWCNSMHPAWDSKYTININTEMNYWPAEVTNLSDTHEPLFQMLKDLS 423
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+G +TA+ Y A GWV HH TDIW +S +WP GGAWL HLWEHY +T D
Sbjct: 424 ESGRETAKTLYGADGWVAHHNTDIWRVTSPIDFAAA-GMWPTGGAWLSQHLWEHYLFTGD 482
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
R FL + AYP+L+G A F L +LIE + G++ +PS SPEH ++ T
Sbjct: 483 RKFLAE-AYPILKGSADFFLSFLIEHPKYKGWMVVSPSISPEH---------GPITAGVT 532
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWAQDFKDP 600
MD ++ +V + + A E+L K+ + + LKS+ R+ P +I + + EW +D DP
Sbjct: 533 MDNQLVFDVLTRTVVAGEMLGKDTNYIAR--LKSMAKRIPPMQIGKYTQLQEWLEDIDDP 590
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
+ HRH+SHL+GL+PG+ I+ P+L +A+ +L RG+ GWSI WK LWARL +
Sbjct: 591 KNEHRHVSHLYGLYPGNQISPYTTPELFEASRNSLIYRGDFATGWSIGWKINLWARLLEG 650
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
AY+++ + LVD E+ +G Y N+F AHPPFQID NFG TA VAEMLVQS +
Sbjct: 651 NRAYKIINNMLTLVDKENR---DGRTYPNMFTAHPPFQIDGNFGLTAGVAEMLVQSHDSA 707
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
L+LLPALP D W +G V G+ ARGG + + W++G + EV + S N
Sbjct: 708 LHLLPALP-DVWDTGSVSGIVARGGFEIDMKWQEGAVQEVKVLSKIGGN 755
>gi|383642312|ref|ZP_09954718.1| alpha-l-fucosidase [Sphingomonas elodea ATCC 31461]
Length = 788
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 296/797 (37%), Positives = 441/797 (55%), Gaps = 60/797 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
S +PL + + PA+ + +A+P+GNGRLGAMV+GG +E +LNEDT + G P D
Sbjct: 33 GGAGASPRDPLTLWYRQPAQEWVEALPLGNGRLGAMVFGGTTTERFQLNEDTFFAGSPYD 92
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKY 119
TNP A A+ +R LV G+ EA A + K + G PA YQ +GD+ L F
Sbjct: 93 ATNPAAGPAIRRIRQLVFEGKGKEAQALADKDVIGRPAGQMPYQPIGDLLLLFPGLE--- 149
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS-GSESGSLSFNVSLD 178
Y R LDL+ A A ++ G+ RE +S DQVI +++ G G ++ ++L
Sbjct: 150 GIRGYERSLDLDGAIATTRFRTGSTTHVREAIASAVDQVIAIRLTAGQGRGGVTTTLALT 209
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + S+V G + +++ G PG R P GI+F + + +D G ++A +
Sbjct: 210 SPQQSESFVEGGDTLVLRGIGPGAR--------GVPGGIRFETRVRMIATD--GIVTAGK 259
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
L VE + VLLLVA+++ + D DP++ + + + ++ L H
Sbjct: 260 -SDLSVEQAS-EVLLLVATAT---SYRRWDDIGGDPSAIVRAQIDAAAGKGWARLLADHQ 314
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
D+++LF R+++ L R+P +P+ ER++ +DP+L L QFGR
Sbjct: 315 ADHRRLFRRMTLDLGRTPAA------------ALPTDERIRRSTELDDPALATLYHQFGR 362
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI++SRPGTQ ANLQGIWNE + P+WDS +NIN EMNYW + L E EPL
Sbjct: 363 YLLIAASRPGTQPANLQGIWNERVHPSWDSKWTLNINAEMNYWPADMTGLGELTEPLLRL 422
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ LS+ G +TA+ ++ A GW+ +H D++ ++ G VW LWPM GAWL + LW+H+
Sbjct: 423 VKELSVAGQRTARNDWGARGWMSYHNVDLFRNTALIDG-AVWGLWPMAGAWLLSSLWDHW 481
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
+Y+ DR FL + YPL+ G F LD L+ G L NPS SPE++ A V+
Sbjct: 482 DYSRDRTFLAE-LYPLMAGACDFYLDALVPHPTTGELVMNPSNSPENQHHAG----ISVT 536
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
+ MD ++R++F AA +L ++E + + +I + G + EW D+
Sbjct: 537 AGAAMDSQLLRDLFGRTAEAARLLGRDESRARAVLAARARLPK-DRIGKAGQLQEWLDDW 595
Query: 598 --KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
+ PE+HHRH+SHL+ L+PG IT+ + P L AA ++L+ RG++ GW I W+ LWA
Sbjct: 596 DMEAPEIHHRHVSHLYALYPGDQITVHETPALAAAARRSLEIRGDDATGWGIGWRINLWA 655
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D EHA+R+VK L++P Y N+F AHPPFQID NFG TA + +ML+Q
Sbjct: 656 RLEDGEHAHRVVK---MLLEPRRT-------YPNMFDAHPPFQIDGNFGGTAGITQMLLQ 705
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFK 775
S + ++LLPALP WS G + G++ARGG V + W+ G L E + + S
Sbjct: 706 SYRDTIHLLPALP-SAWSDGSITGVRARGGVRVDLRWRGGKLVEAVLLPDVSGT-----T 759
Query: 776 TLHYRGTSVKVNLSAGK 792
TL Y G +V L G+
Sbjct: 760 TLRYAGKRKQVKLVRGQ 776
>gi|329849976|ref|ZP_08264822.1| alpha-fucosidase [Asticcacaulis biprosthecum C19]
gi|328841887|gb|EGF91457.1| alpha-fucosidase [Asticcacaulis biprosthecum C19]
Length = 806
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 297/787 (37%), Positives = 421/787 (53%), Gaps = 68/787 (8%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A++ S + L + + PA + +A+P+GNGRLGAMV+G V E L+LNEDTLW G P D
Sbjct: 25 AQAKSRPSDLTLWYAQPAGPWVEALPVGNGRLGAMVFGRVAQERLQLNEDTLWAGSPYDP 84
Query: 64 TNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYA 120
NP + L+ R+L+D+ ++ +A+ + + P Y GD+ L+F H
Sbjct: 85 NNPGCLENLAKCRALIDAEKFKDASDLVNASMMAQPKTQMPYGAAGDLLLDF---HGLAQ 141
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---- 176
YRR LDL+TA A + +G +TRE FSS DQV+V +++ G L F++
Sbjct: 142 PSDYRRSLDLDTAVATTTFKIGATTYTREVFSSAVDQVLVVRLTAKGKGRLDFDLGYRHP 201
Query: 177 -------------LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKAN------ANDDPKGI 217
+ L + + + E R +N AN GI
Sbjct: 202 DQVDYGAPVYDGKVTDTLSQGAAWDKREGLSRERRPQSLAFAASSNELLVTGANIASAGI 261
Query: 218 QFSAILEIKI-SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
++I + G I+A D L V G+ LL+ A++SF + D+ DP +
Sbjct: 262 PAGLTYAVRIRAIGDGNITAAGDS-LTVRGATTVTLLIAAATSF----VRFDDTGGDPIA 316
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
+ +AL + Y+ L H+ ++ LF R++I L + + C+ +I
Sbjct: 317 RT-AALNTAAAKPYAALKADHIAAHRALFRRMTIDLGNT-----SAACAATDI------- 363
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
R+ +DP L L QF RYL+ISSSRPGTQ ANLQGIWNE ++P W S +NIN
Sbjct: 364 RIGKSLASDDPQLAALYVQFARYLMISSSRPGTQPANLQGIWNEGVNPPWGSKYTININT 423
Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG 456
EMNYW P N+ C EPL + LS+ G+KTA+V Y ASGW+ HH TD+W ++SA
Sbjct: 424 EMNYWLVEPANIGVCVEPLVRMVEDLSMTGAKTAKVMYGASGWMAHHNTDLW-RASAPID 482
Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LE 515
W +WP GGAWLC LW+HY+Y D +FL KR YPLL+G + F D L+E G L
Sbjct: 483 GAWWGMWPTGGAWLCKTLWDHYDYNRDPEFL-KRIYPLLKGASQFFADTLVEDPKGRGLV 541
Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
T+PS SPE+E + G C MD IIR++F++ I+A ++L +D K+
Sbjct: 542 TSPSISPENEHM--KGVATCA--GPAMDSQIIRDLFASTIAAQKLLANGDDGFTAKLAAM 597
Query: 576 LPRLRPTKIAEDGSIMEWAQDF--KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
RL +I G + EW +D+ + P+ HRH+SHL+GL+P I + PDL AA+
Sbjct: 598 HARLPADRIGAQGQLQEWLEDWDARAPDQQHRHVSHLYGLYPSEQINVRDTPDLVAAAKV 657
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ GW W+ ALWAR+ + EHA+ + L L+ P+ Y NLF A
Sbjct: 658 TLNTRGDLATGWGTAWRLALWARMGEAEHAHSI---LMGLMGPQRT-------YPNLFDA 707
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
HPPFQID NFG + EML+QS ++ +LPALP W SG V GL ARGG T + W
Sbjct: 708 HPPFQIDGNFGGATGILEMLLQSWGGEILVLPALP-AAWPSGRVTGLMARGGITADLAWN 766
Query: 754 DGDLHEV 760
G L ++
Sbjct: 767 GGRLTKL 773
>gi|424878767|ref|ZP_18302405.1| hypothetical protein Rleg8DRAFT_5620 [Rhizobium leguminosarum bv.
trifolii WU95]
gi|392520277|gb|EIW45007.1| hypothetical protein Rleg8DRAFT_5620 [Rhizobium leguminosarum bv.
trifolii WU95]
Length = 747
Score = 502 bits (1292), Expect = e-139, Method: Compositional matrix adjust.
Identities = 299/787 (37%), Positives = 428/787 (54%), Gaps = 64/787 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G E L++NE T W G P NPDA L VR
Sbjct: 8 YDAPAQLWTDALPLGNGRLGAMVFGDPLREHLQINESTFWAGGPYQPVNPDAFGHLGTVR 67
Query: 77 SLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEET--YRRELDLN 131
L+ G YA+A A A +L P YQ +GD+ LEF K+AE YRR LDL+
Sbjct: 68 QLIFDGHYADAEALAEKRLMARPIKQMSYQPIGDLRLEF-----KFAESVSGYRRALDLD 122
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA A Y+ + + RE F S D V+V ++S ++S +S+DS + +
Sbjct: 123 TAIATSSYTANGIAYLREAFVSPVDGVLVLRLSADRKRAISCRISIDSPQQGEMSIGEGS 182
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
Q+ G+ GK A A ++F+ +++ + GT+ A L VEG+D +
Sbjct: 183 QLSFSGK--GKAESGIAAA------LRFA--FGVRLINSGGTVKA-SGGGLSVEGADEVL 231
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ L A++SF D P + + L+ + + L H+ ++++LF +I
Sbjct: 232 VFLDAATSFR----RYDDVLGHPERDIVDRLERAASRDFVSLRDDHIAEHRRLFSAFAID 287
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
L +P ++P+ +R+ F +DP+L L QFGRYL+I+SSRPGTQ
Sbjct: 288 LGSTPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQP 335
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN P W S NINL+MNYW P NL EC EPL + L+ G A
Sbjct: 336 ANLQGIWNAQTDPPWGSKYTANINLQMNYWLPAPANLRECLEPLVEMAEELAETGKAMAH 395
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
V+Y ASGWV+HH TD+W + G W LWPMGG WL L + +Y D + + +R
Sbjct: 396 VHYRASGWVMHHNTDLWRATGPIDG-AKWGLWPMGGIWLMAQLLDACDYLDDAEAMRRRL 454
Query: 492 YPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
+P+ A FL D L+ G D YL TNPS SPE+ P G C MD +IR+
Sbjct: 455 FPIAREAAHFLFDVLVPFPGTD-YLVTNPSLSPENAH--PYGASICA--GPAMDSQLIRD 509
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHHRHL 607
F ++ V E LV + + L RL P +I +G + EW +D+ + PE+HHRH+
Sbjct: 510 -FLGLLRPLAVSIGGEPELVADIDRVLSRLAPDRIGANGQLQEWLEDWDMQAPEMHHRHV 568
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHL+GL+P I +++ PDL AA ++L+ RG+E GW I W+ LWARL D HA+ ++
Sbjct: 569 SHLYGLYPSWQIDMDRTPDLAAAARRSLEIRGDEATGWGIGWRINLWARLRDGNHAHNVL 628
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
K L PE Y NLF AHPPFQID NFG A + EMLVQS +++LLPAL
Sbjct: 629 KLLLT---PERS-------YKNLFDAHPPFQIDGNFGGAAGIVEMLVQSRPGEIHLLPAL 678
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVN 787
P W G ++GL+ RGG + + W+DG+ + + ++ + + L + T KV+
Sbjct: 679 P-TAWPGGSIRGLRLRGGMLLDLDWEDGEPLTIRLTASRNVS-----SILRFGQTRRKVD 732
Query: 788 LSAGKIY 794
L+AG+ +
Sbjct: 733 LAAGESF 739
>gi|241518404|ref|YP_002979032.1| Alpha-L-fucosidase [Rhizobium leguminosarum bv. trifolii WSM1325]
gi|240862817|gb|ACS60481.1| Alpha-L-fucosidase [Rhizobium leguminosarum bv. trifolii WSM1325]
Length = 747
Score = 501 bits (1290), Expect = e-139, Method: Compositional matrix adjust.
Identities = 299/787 (37%), Positives = 429/787 (54%), Gaps = 64/787 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G E L++NE T W G P NPDA L VR
Sbjct: 8 YDAPAQLWTDALPLGNGRLGAMVFGDPLREHLQINESTFWAGGPYQPVNPDAFGHLGTVR 67
Query: 77 SLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEET--YRRELDLN 131
L+ G YA+A A A +L P YQ +GD+ LEF K+AE YRR LDL+
Sbjct: 68 QLIFDGHYADAEALAEKRLMARPIKQMSYQPIGDLRLEF-----KFAESVSGYRRALDLD 122
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA A Y+ + + RE F S D V+V ++S ++S +S+DS + +
Sbjct: 123 TAIATSSYTANGIAYLREAFVSPVDGVLVLRLSADRKRAISCRISIDSPQQGEMSIGERS 182
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ G+ GK A A ++F+ +++ + GT++A L VEG+D +
Sbjct: 183 LLSFSGK--GKAESGIAAA------LRFA--FGVRLINSGGTVNA-SGGGLSVEGADEVL 231
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ L A++SF D P + + L+ + + L H++++++LF +I
Sbjct: 232 VFLDAATSFR----RYDDILGHPERDIIDRLERAASRDFVSLRDDHIEEHRRLFSAFAID 287
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
L +P ++P+ +R+ F +DP+L L QFGRYL+I+SSRPGTQ
Sbjct: 288 LGSTPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQP 335
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN P W S NINL+MNYW P NL EC EPL + L+ G A
Sbjct: 336 ANLQGIWNAQTDPPWGSKYTANINLQMNYWLPAPANLRECLEPLVEMAEELAETGKVMAH 395
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
V+Y A GWV+HH TD+W + G W LWPMGG WL L E +Y D + + +R
Sbjct: 396 VHYRARGWVMHHNTDLWRATGPIDG-AKWGLWPMGGIWLMAQLLEACDYLDDAEAMRRRL 454
Query: 492 YPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
+P+ A FL D L+ G D YL TNPS SPE+ P G C MD +IR+
Sbjct: 455 FPIALEAAHFLFDVLVPFPGTD-YLVTNPSLSPENAH--PYGASICA--GPAMDSQLIRD 509
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHHRHL 607
F ++ V E LV + + LPRL P +I +G + EW +D+ + PE+HHRH+
Sbjct: 510 -FLGLLRPLAVSIGGEPELVADIDRVLPRLAPDRIGANGQLQEWLEDWDMQAPEMHHRHV 568
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHL+GL+P I +++ PDL AA ++L+ RG+E GW I W+ LWARL D HA+ ++
Sbjct: 569 SHLYGLYPSWQIDMDRTPDLAAAARRSLEIRGDEATGWGIGWRINLWARLRDGNHAHNVL 628
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
K L PE Y NLF AHPPFQID NFG A + EMLVQS +++LLPAL
Sbjct: 629 KLLLT---PERS-------YKNLFDAHPPFQIDGNFGGAAGIVEMLVQSRPGEIHLLPAL 678
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVN 787
P W G ++GL+ RGG + + W+DG+ + + ++ + + L + T KV+
Sbjct: 679 P-TAWPGGSIRGLRLRGGMLLDLDWEDGEPLTIRLTASRNVS-----SILRFGQTRRKVD 732
Query: 788 LSAGKIY 794
L+AG+ +
Sbjct: 733 LAAGESF 739
>gi|288929797|ref|ZP_06423640.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella sp. oral taxon 317
str. F0108]
gi|288328898|gb|EFC67486.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella sp. oral taxon 317
str. F0108]
Length = 792
Score = 501 bits (1289), Expect = e-139, Method: Compositional matrix adjust.
Identities = 309/800 (38%), Positives = 446/800 (55%), Gaps = 55/800 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP-- 69
PL+I N P F +++PIGNG+LGAMV G + LKLN+ TLW+G P D N DA
Sbjct: 24 PLRIWDNRPGSFFENSMPIGNGKLGAMVDGNPHCDYLKLNDITLWSGKPID-PNEDAGAH 82
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLG-----DIELEFD-DSHLKYAEET 123
K + +R + YA A + +++ GH + YQ L D++ + D+ LK
Sbjct: 83 KWIPQIRKALFEENYALADSLQLRVQGHNSAWYQPLSTLCICDVKAAANADAPLK----N 138
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRRELDL+++ +V Y V + RE+F+S+P + I+ +++ ++ ++S +SL SLL++
Sbjct: 139 YRRELDLDSSLVKVSYESEGVSYRREYFASHPGRAIMVRLTANKPHAISLQLSLTSLLNH 198
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+ V GN +M +A P + F +L+ K + GTI+A +D L
Sbjct: 199 QTRVEGNTIRLM------------GHAEGHPDSTVHFCNLLQAKATG--GTITA-QDSTL 243
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+ + VL +V +S++G +P + + L++++N ++ L H DDYQ
Sbjct: 244 LISNATQVVLYIVNETSYNGFDKHPVTQGAPYVQLAETDLKNLQNCTFEQLKQNHTDDYQ 303
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
LF R+++ L + D+ T ++ D E +P L L FQFGRYLLI
Sbjct: 304 ALFGRLALHLDGTKLDM-HRTTEQQLQDYTKRGE--------TNPYLETLYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSSR ANLQG+WN + W S VNINLE NYW + NL+E PL + L
Sbjct: 355 SSSRTPGVPANLQGLWNPHVRAPWRSNYTVNINLEENYWPAQVANLAELTTPLVGMVKAL 414
Query: 423 SINGSKTAQVNY-LASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHY 478
S+NG A+ Y + GW H TD+WA ++ R WA W +GGAWL ++LWE Y
Sbjct: 415 SVNGRYAARNYYGINEGWCSSHNTDLWAMTNPVGEKRESPEWANWNLGGAWLLSNLWEQY 474
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACV 536
++T DR +L YPL++G F+L WL+E G L T PSTSPE+E++ PDG
Sbjct: 475 DFTRDRHYLRHTLYPLMKGACDFMLQWLVENPKQPGELITAPSTSPENEYVTPDGYHGTT 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
Y T D+AI+RE+F+ +A E+L A + + +++ RL P I ++G + EW D
Sbjct: 535 VYGGTADLAILRELFANTATADEILNGRPTAYSKILRQTIGRLHPYTIGKEGDLNEWYYD 594
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ D + HRH +HL GL+PGH I E P+L +AA KTL ++G+ GWS W+ LWAR
Sbjct: 595 WNDFDPQHRHQTHLIGLYPGHHIAPETTPELAEAARKTLVQKGDISTGWSTGWRINLWAR 654
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFE----GGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L++ E AY++ ++L V P+ + + GG Y NLF AHPPFQID NFG TA V EM
Sbjct: 655 LYNGEKAYQIYRKLLTYVAPDAIRKSDAGPGGGTYPNLFDAHPPFQIDGNFGGTAGVCEM 714
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHD 772
L+QS + LLPALP W SG VKGL ARGG V W++G + +V I SN
Sbjct: 715 LMQSA-RGIRLLPALP-AAWPSGSVKGLCARGGFVVDFSWRNGSVTQVRIKSNVGGQ--- 769
Query: 773 SFKTLHYRGTSVKVNLSAGK 792
TL+Y G + KV L AGK
Sbjct: 770 --TTLYYNGKAHKVKLKAGK 787
>gi|340619498|ref|YP_004737951.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339734295|emb|CAZ97672.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 792
Score = 500 bits (1288), Expect = e-138, Method: Compositional matrix adjust.
Identities = 300/779 (38%), Positives = 431/779 (55%), Gaps = 74/779 (9%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N K+ + PA + +A+PIGNG+LGAMV+GGV SE L+LNE+++W G P A K
Sbjct: 34 NGNKLWYTQPAADWMEALPIGNGKLGAMVFGGVESERLQLNEESVWAGPPIPENRVGAFK 93
Query: 71 ALSDVRSLVDSGQYAEATAASV-KLFGH--PADVYQLLGDIELEFDDSHLKYAEETYRRE 127
++ R+L+ G Y EA + G YQ LG++ L F+ LK + YRRE
Sbjct: 94 SIEKARALIFQGDYLEANKVMQDNVMGERIAPRSYQPLGNLILNFN---LKGSPTDYRRE 150
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL A A+ ++V V +TRE+FSS + IV ++ ++ ++S + +D D
Sbjct: 151 LDLKRAIAKTDFTVNGVRYTREYFSSAIENTIVVVLTANQPKAISLELKMDRKADFEVAG 210
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTISALEDKKLKVEG 246
G N++ M G+ KG E ++ + +G + E+ +K+
Sbjct: 211 VGKNRLRMWGQA-------------SQKGKHLGVKYETQVMALPKGGKMSSENGNIKITA 257
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDP--------TSESMSALQSIRNLSYSDLYTRHL 298
++ VLL+ A + ++ KKDP ++ S L+ S L H+
Sbjct: 258 ANSVVLLVSAKTDYN---------KKDPFSPFTENLSTACASVLKKTARKSVKKLKEEHI 308
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DDYQ F+RV + L P + D + E ++ V + +DP L+EL FQ+GR
Sbjct: 309 DDYQHYFNRVVLDLGSFPGE---DKPTNERLEAVINGA--------DDPGLMELYFQYGR 357
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPG+ ANLQGIWN+ L+ W+S H NIN++MNYW + NLSEC EP F+F
Sbjct: 358 YLLISSSRPGSLPANLQGIWNDHLAAPWNSDYHTNINMQMNYWPAEVANLSECHEPFFEF 417
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L +G KTA+ Y + G+V+HH TD+W +S GKV + +WPMGGAW H EHY
Sbjct: 418 IESLVPSGKKTAKEVYDSEGFVVHHTTDVWHWTSP-IGKVQYGMWPMGGAWCTRHFMEHY 476
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG--KLAC 535
++T D FL ++AYP+++ A FLLDWL+ + G L + PSTSPE++F P K A
Sbjct: 477 SFTGDTTFLAEQAYPIMKESAKFLLDWLVTDPRSGKLVSGPSTSPENKFYTPKNGEKFAN 536
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
V + MD II + FS ++ AA++L K EDA V++V +L L KI DG +MEW+Q
Sbjct: 537 VDMGNAMDQEIIWDNFSNVLEAAKIL-KIEDAFVDEVKAALSNLSLPKIGSDGRLMEWSQ 595
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTA 652
+F + + HRHLSHL+GL+PG +K P A ++++ R G GWS W
Sbjct: 596 EFDEVDKGHRHLSHLYGLYPGKQFDKKKTPYYIDAINRSIEHRLSNGGGHTGWSRAWIIN 655
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
+ARL + + AY +K L +NLF HPPFQID NFG TA +AEM
Sbjct: 656 FYARLGNADKAYENMKVL-----------LAKSTATNLFDYHPPFQIDGNFGGTAGIAEM 704
Query: 713 LVQSTLND------LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
++QS D + LLPALP +W +G V GLKARGG VS W++G L V + S+
Sbjct: 705 ILQSHETDENGNTIINLLPALP-SEWPTGSVSGLKARGGFEVSFAWENGVLKSVSLISS 762
>gi|427383711|ref|ZP_18880431.1| hypothetical protein HMPREF9447_01464 [Bacteroides oleiciplenus YIT
12058]
gi|425728416|gb|EKU91274.1| hypothetical protein HMPREF9447_01464 [Bacteroides oleiciplenus YIT
12058]
Length = 1074
Score = 500 bits (1288), Expect = e-138, Method: Compositional matrix adjust.
Identities = 295/767 (38%), Positives = 431/767 (56%), Gaps = 53/767 (6%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
TS N +K+ + PA+ + +A+P+GN RLGAMV+GG E L+LNE+T W G P + NP
Sbjct: 278 TSAQN-MKLWYGRPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 336
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
+ L ++R L+ G+ EA + + P Y +G + L F H +E Y
Sbjct: 337 RGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTMGSLFLNFP-GHENPSE--Y 393
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+L+L ATA ++Y V V+F R F+S D VI+ +I ++ +L+F +S +S L ++
Sbjct: 394 YRDLNLENATATIRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAISYNSPLKSN 453
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
V G II C G A P ++ +++K G +S E+ L V
Sbjct: 454 VQVKGGKLII---SCQG------AEHEGVPAAMRAECQVQVKTD---GKVSK-EESSLAV 500
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L + A+++F +N D + + + + LQ + Y H+ Y+K
Sbjct: 501 NGATEATLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 556
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RV++ L + + + + RV+ F D ++ L+FQ+GRYLLISS
Sbjct: 557 YDRVALTLEST------------KVSALETPVRVQRFMEGNDMAMAALMFQYGRYLLISS 604
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN WDS +NIN EMNYW + NLSE EPLFD + L++
Sbjct: 605 SQPGGQPANLQGIWNHSPYAPWDSKYTININAEMNYWPAEVTNLSETHEPLFDMVADLAV 664
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
GS+TA+V Y A GWV HH TDIW ++ + +WP GGAWL HLW+HY +T D+
Sbjct: 665 AGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAAYFGMWPNGGAWLAQHLWQHYLFTGDK 723
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL K+ YP+L+G A F L L+E H Y + T PS SPEH + G ++ TM
Sbjct: 724 EFL-KKYYPVLKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 778
Query: 543 DMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
D I + + + A+ +L+ + ED+L + +L LP P +I + + EW D +
Sbjct: 779 DNQIAFDALYSTLQASRILDGDKQYEDSL-QTMLDKLP---PMQIGKHNQLQEWLIDADN 834
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
P HRH+SHL+GL+PG+ I+ NP+L +AA TL +RG+ GWSI WK WAR+ D
Sbjct: 835 PLDDHRHISHLYGLYPGNQISPTTNPELFQAARNTLIQRGDMATGWSIGWKINFWARMLD 894
Query: 660 QEHAYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
HAY++++ + +L+ D +++ EG Y NLF AHPPFQID NFG+TA VAEML+QS
Sbjct: 895 GNHAYKIIQNMLHLLPNDKVQKEYPEGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSH 954
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ LLPALP + W G VKGL ARGG V + W L++ I+S
Sbjct: 955 DGAVQLLPALP-EAWKKGSVKGLVARGGFVVDMEWDGAQLNKTKIHS 1000
>gi|224538524|ref|ZP_03679063.1| hypothetical protein BACCELL_03418 [Bacteroides cellulosilyticus
DSM 14838]
gi|224519862|gb|EEF88967.1| hypothetical protein BACCELL_03418 [Bacteroides cellulosilyticus
DSM 14838]
Length = 1061
Score = 500 bits (1288), Expect = e-138, Method: Compositional matrix adjust.
Identities = 302/767 (39%), Positives = 429/767 (55%), Gaps = 53/767 (6%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
TS N +K+ +N PA+ + +A+P+GN RLGAMV+GG E L+LNE+T W G P + NP
Sbjct: 265 TSAQN-MKLWYNRPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 323
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
+ L ++R L+ G+ EA + + P Y LG + L F H +E Y
Sbjct: 324 KGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTLGSLFLNFP-GHENPSE--Y 380
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+L+L ATA +Y V V+F R F+S D VI+ +I ++ +L+F VS S L +
Sbjct: 381 YRDLNLENATATTRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAVSYSSPLKSD 440
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
V G II C G A P ++ A ++++ D G +S E+ L V
Sbjct: 441 VQVKGGKLII---SCQG------AEHEGIPAAMR--AECQVQVRTD-GKVSK-EESTLAV 487
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L + A+++F +N D + + + + LQ + Y H+ Y+K
Sbjct: 488 NGATEATLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 543
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RVS+ L + + + + RV+ F D ++ L+FQ+GRYLLISS
Sbjct: 544 YDRVSLTLEST------------GVSALETPVRVQRFMEGNDMAMAALMFQYGRYLLISS 591
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN WDS VNIN EMNYW + NLSE EPLFD +T L++
Sbjct: 592 SQPGGQPANLQGIWNHSPYAPWDSKYTVNINAEMNYWPAEVTNLSETHEPLFDMVTDLAV 651
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
GS+TA+V Y A GWV HH TDIW ++ + +WP GGAWL HLW+HY +T D+
Sbjct: 652 TGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAAYFGMWPNGGAWLAQHLWQHYLFTGDK 710
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL K YPLL+G A F L L+E H Y + T PS SPEH + G ++ TM
Sbjct: 711 EFLRKY-YPLLKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 765
Query: 543 DMAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
D I + + A+ +L ++ ED+L + +L LP P +I + + EW D +
Sbjct: 766 DNQIAFDALYNTLQASRILGGDKQYEDSL-QVMLSKLP---PMQIGKHNQLQEWLIDADN 821
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
P HRH+SHL+GL+P + I+ NP+L +AA TL +RG+ GWSI WK WAR+ D
Sbjct: 822 PLDDHRHISHLYGLYPSNQISPTTNPELFQAARNTLIQRGDMATGWSIGWKINFWARMLD 881
Query: 660 QEHAYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
HAY++++ + +L+ D +++ EG Y NLF AHPPFQID NFG+TA VAEML+QS
Sbjct: 882 GNHAYKIIQNMLHLLPNDKVQKEYPEGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSH 941
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++LLPALP + W G VKGL ARGG V + W L + I+S
Sbjct: 942 DGAVHLLPALP-EAWKKGSVKGLVARGGFVVDMEWDGVQLKKAKIHS 987
>gi|423221840|ref|ZP_17208310.1| hypothetical protein HMPREF1062_00496 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645258|gb|EIY38987.1| hypothetical protein HMPREF1062_00496 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 1074
Score = 500 bits (1287), Expect = e-138, Method: Compositional matrix adjust.
Identities = 300/767 (39%), Positives = 430/767 (56%), Gaps = 53/767 (6%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
TS N +K+ +N PA+ + +A+P+GN RLGAMV+GG E L+LNE+T W G P + NP
Sbjct: 278 TSAQN-MKLWYNRPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 336
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
+ L ++R L+ G+ EA + + P Y LG + L F H +E Y
Sbjct: 337 KGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTLGSLFLNFP-GHENPSE--Y 393
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+L+L ATA +Y V V+F R F+S D VI+ +I ++ +L+F VS S L +
Sbjct: 394 YRDLNLENATATTRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAVSYSSPLKSD 453
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
V G II C G A P ++ A ++++ D G +S E+ L V
Sbjct: 454 VQVKGGKLII---SCQG------AEHEGIPAAMR--AECQVQVRTD-GKVSK-EESTLAV 500
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ A L + A+++F +N D + + + + LQ + Y H+ Y+K
Sbjct: 501 NGATEATLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 556
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RV++ L + + + + RV+ F D ++ L+FQ+GRYLLISS
Sbjct: 557 YDRVALTLEST------------GVSALETPVRVQRFMEGNDMAMAALMFQYGRYLLISS 604
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN WDS +NIN EMNYW + NLSE EPLFD +T L++
Sbjct: 605 SQPGGQPANLQGIWNHSPYAPWDSKYTININAEMNYWPAEVTNLSETHEPLFDMVTDLAV 664
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
GS+TA+V Y A GWV HH TDIW ++ + +WP GGAWL HLW+HY +T D+
Sbjct: 665 TGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAAYFGMWPNGGAWLAQHLWQHYLFTGDK 723
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL K+ YPLL+G A F L L+E H Y + T PS SPEH + G ++ TM
Sbjct: 724 EFL-KKYYPLLKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 778
Query: 543 DMAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
D I + + A+ +L ++ ED+L + +L LP P +I + + EW D +
Sbjct: 779 DNQIAFDALYNTLQASRILGGDKQYEDSL-QVMLSKLP---PMQIGKHNQLQEWLIDADN 834
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
P HRH+SHL+GL+P + I+ NP+L +AA TL +RG+ GWSI WK WAR+ D
Sbjct: 835 PLDDHRHISHLYGLYPSNQISPTTNPELFQAARNTLIQRGDMATGWSIGWKINFWARMLD 894
Query: 660 QEHAYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
HAY++++ + +L+ D +++ EG Y NLF AHPPFQID NFG+TA VAEML+QS
Sbjct: 895 GNHAYKIIQNMLHLLPNDKVQKEYPEGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSH 954
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++LLPALP + W G VKGL ARGG V + W L + I+S
Sbjct: 955 DGAVHLLPALP-EAWKKGSVKGLVARGGFVVDMEWDGVQLKKAKIHS 1000
>gi|423299820|ref|ZP_17277845.1| hypothetical protein HMPREF1057_00986 [Bacteroides finegoldii
CL09T03C10]
gi|408473629|gb|EKJ92151.1| hypothetical protein HMPREF1057_00986 [Bacteroides finegoldii
CL09T03C10]
Length = 824
Score = 500 bits (1287), Expect = e-138, Method: Compositional matrix adjust.
Identities = 311/775 (40%), Positives = 449/775 (57%), Gaps = 53/775 (6%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E+ ++T K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 25 ETNASTQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ S G ++FN L
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTASRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V +++ EG C + ++ ++ KG ++F L + +RG A
Sbjct: 199 S---PHQDVMISSE---EGNC--VTLSGVSSWHEGLKGKVEFQGRLTAR---NRGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D AV+ + +++F+ N D + + L + + H
Sbjct: 248 ADGILSVEGADEAVIYVSIATNFN----NYLDITGNQIERAKDYLSKAMKHPFPEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
Y++ RVS+ L ++ ENI T +RV++F+ D LV FQFG
Sbjct: 304 TGFYRRYLTRVSLNLGKN---------RYENITT---DKRVENFKDTNDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVSNLSELNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S G +TA++ Y A+GWV+HH TDIW + A K +W GGAWLC HLWE
Sbjct: 412 LIKEVSETGKETARIMYGANGWVLHHNTDIWRVTGAI-DKAPSGMWSSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D DFL + YP+L+ F + ++ E +L PS SPE+ +GK A
Sbjct: 471 YLYTGDTDFL-RSIYPILKESGRFFDEIMVKEPIHNWLVVCPSNSPENVHSGSNGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
+ TMD +I ++++AIISA+E+L+ ++D +++ LK +P P +I G + EW
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASEILDTDKDFATHLKQRLKEMP---PMQIGHWGQLQEWM 585
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
D+ DP HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LW
Sbjct: 586 FDWDDPNDVHRHVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLW 645
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL D HAY+++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+
Sbjct: 646 ARLLDGNHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLM 702
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
QS +YLLPALP W G VKG+ ARGG + + WKDG ++ + + S+ N
Sbjct: 703 QSYDGFIYLLPALP-TLWKEGSVKGIIARGGFELDLSWKDGKVNHLIVKSHKGGN 756
>gi|408371030|ref|ZP_11168802.1| alpha-L-fucosidase [Galbibacter sp. ck-I2-15]
gi|407743587|gb|EKF55162.1| alpha-L-fucosidase [Galbibacter sp. ck-I2-15]
Length = 821
Score = 499 bits (1286), Expect = e-138, Method: Compositional matrix adjust.
Identities = 300/776 (38%), Positives = 430/776 (55%), Gaps = 65/776 (8%)
Query: 10 TNPLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
T+PLK+ ++ P+ + +A+P+GNG +GAMV+G V E +LNE T+W+G P NP A
Sbjct: 21 TDPLKLWYDEPSGDVWENALPLGNGNIGAMVYGNVSKEIFQLNESTVWSGSPNRNDNPAA 80
Query: 69 PKALSDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETYR 125
+AL +R L+ QY A A+ K+ + ++Q +G++EL F+ H + Y
Sbjct: 81 LEALPKIRQLIFDKQYKAAEDLANEKIITKKSHGQMFQPVGNLELTFE-GHQDF--HNYS 137
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
REL++ A ++ Y+V V +TRE F+S D+V+V KIS + G +SF +
Sbjct: 138 RELNIGNAVSKTTYTVDGVTYTREAFTSLTDKVLVIKISADQPGKISFKADFTTPHKKQK 197
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGI----QFSAILEIK-----ISDDRGTISA 236
+N + + G D +G+ +F A+L IK I+ R TI
Sbjct: 198 IAIMDNNLSLWG------------VTSDHEGVLGKVEFQALLRIKTLNGDITQGRNTI-- 243
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+V +D A L + +S+F N D D T + + L +Y +L
Sbjct: 244 ------EVTNADSATLYISIASNFK----NYDDLSADETLRAKNDLDKAFIENYENLKDA 293
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ YQ F+RVS+QL T N P+ ER+++F+ ++DPS V L FQ+
Sbjct: 294 HIKAYQNYFNRVSLQLG---------TIEASN---QPTDERLENFRKNQDPSFVSLYFQY 341
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISSS+PG Q ANLQGIWN+ L+P WDS +NIN +MNYW + NLSE EP
Sbjct: 342 GRYLLISSSKPGGQAANLQGIWNKSLTPPWDSKYTININAQMNYWPAEKTNLSELHEPFL 401
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ + LS G KTA Y A GW+ HH TDIW + A G W +W GGAWL H+WE
Sbjct: 402 NMVQELSQTGKKTANDMYGARGWMAHHNTDIWRVTGAIDG-AFWGIWNGGGAWLSQHIWE 460
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLAC 535
HY YT D +FL + Y LL+G A F +D+L + D YL P SPE+ G
Sbjct: 461 HYLYTGDTEFL-RENYDLLKGAALFYVDFLAQHPDHPYLVVAPGNSPENAAQGRQG--TS 517
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWA 594
++ STMD ++ ++F+A+ISA+E L N D LK + +L P +I + + EW
Sbjct: 518 ITAGSTMDNQLVEDIFNAVISASEAL--NTDTAFTDSLKVIKNKLPPMQIGKHNQLQEWL 575
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
+D P +HRH+SHL+GL+P + I+ + P L AA TL +RG+ GWS+ WK W
Sbjct: 576 EDLDSPTDNHRHISHLYGLYPSNLISPYRTPLLFAAARNTLIQRGDVSTGWSMGWKVNWW 635
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
A++ D HA+ ++K N + P + +GG Y+NLF AHPPFQID NFG T+ + EML+
Sbjct: 636 AKMQDGNHAFELIK---NQLTPVAGEQSQGGSYANLFDAHPPFQIDGNFGCTSGITEMLM 692
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
QS+ L+LLPA+ D G V GLK+RGG E +++ WKD L V I S N
Sbjct: 693 QSSDGALHLLPAIA-DALKDGEVTGLKSRGGFEIINMKWKDKKLESVTIKSELGGN 747
>gi|423241186|ref|ZP_17222300.1| hypothetical protein HMPREF1065_02923 [Bacteroides dorei
CL03T12C01]
gi|392642334|gb|EIY36101.1| hypothetical protein HMPREF1065_02923 [Bacteroides dorei
CL03T12C01]
Length = 825
Score = 499 bits (1286), Expect = e-138, Method: Compositional matrix adjust.
Identities = 289/765 (37%), Positives = 434/765 (56%), Gaps = 47/765 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
KI ++ PA ++ +AIPIGNGR+ AMV+G E L+LNE+T+ G P N + AL
Sbjct: 27 KIWYDTPAHYWEEAIPIGNGRIAAMVFGNPQLEQLQLNEETISAGSPYQNYNKEGKGALK 86
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ G Y EA + K P YQ +G++ + + + + Y RELDL
Sbjct: 87 EIRRLIFDGHYEEAQNMAEKKILSPVGREMPYQTVGNLNIRYKNHK---QIKKYYRELDL 143
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN-HSYVNG 189
A A +Y + +VE T E F+S DQ+I+ I S+ GS++ + + +D G
Sbjct: 144 TRAIATTRYQIKDVEITEETFASFTDQLIIKHIKSSKKGSINCELFFQTPMDAPKRSACG 203
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ +EG G N P + + A L +K SD G + AL D +KVE +
Sbjct: 204 KKKLRLEGITSGN--------NHIPGKVHYCADLSVKNSD--GKVFALNDTLIKVEKATE 253
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ-SIRNLSYSDLYTRHLDDYQKLFHRV 308
L + +++F +N D +P + L+ S+++ + + H+ Y+K+F+RV
Sbjct: 254 ICLYVSMATNF----VNYKDISANPYERNEKYLKNSMKDFEKAKI--EHVAAYKKMFNRV 307
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+++L SP+ P+ R+K F++ DP LV L FQFGRYLLISSS+PG
Sbjct: 308 TLELGHSPQI------------NKPTNIRLKEFESSYDPHLVSLYFQFGRYLLISSSQPG 355
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQG WN + P W S NIN EMNYW + NLSE EPL + S +G +
Sbjct: 356 CQPANLQGKWNAKVRPPWSSNYTTNINTEMNYWPAEVTNLSELHEPLIQIIQDWSQSGRE 415
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA Y GWV+HH +D+W + A DR +WP GAW+C HLW+ Y ++ ++++L
Sbjct: 416 TADQMYGCRGWVLHHNSDLWRVTGAVDRAYC--GVWPTAGAWMCQHLWDRYLFSGNKEYL 473
Query: 488 EKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K+ YP++ + F +D+L++ + GY PS SPE+ K + S +TMD +
Sbjct: 474 -KKIYPIMRSASKFFIDFLVQNPNTGYWVVGPSPSPENSPKKIKQKASLFS-GNTMDNQL 531
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I ++FS AA++L ++D+ + LK++ +L P ++ E G + EW +D+ P HHR
Sbjct: 532 IFDLFSNTCEAAKIL--SQDSTLCDTLKTMRNQLPPMQVGEYGQLQEWFEDWDSPNDHHR 589
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFPG+ I+ ++P L +AA TL +RG+ GWS+ WK LWAR+ D +HAY+
Sbjct: 590 HVSHLWGLFPGYQISPYRSPILLEAARNTLIQRGDLSTGWSMGWKVCLWARMLDGDHAYK 649
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++K+ V P+++K GG Y NLF AHPPFQID NFG TA +AEMLVQS ++LLP
Sbjct: 650 LIKKQLTFVSPQNQKGPGGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLVQSHDEAVHLLP 709
Query: 726 ALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSNYSNN 769
ALP + G VKGL+ RGG + + W+DG + + I S N
Sbjct: 710 ALP-SNFKQGKVKGLRIRGGFILEELNWQDGKIKKAVIRSTIGGN 753
>gi|431796298|ref|YP_007223202.1| hypothetical protein Echvi_0919 [Echinicola vietnamensis DSM 17526]
gi|430787063|gb|AGA77192.1| hypothetical protein Echvi_0919 [Echinicola vietnamensis DSM 17526]
Length = 813
Score = 499 bits (1285), Expect = e-138, Method: Compositional matrix adjust.
Identities = 288/765 (37%), Positives = 441/765 (57%), Gaps = 54/765 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA ++ +A+P+GNGRLGAMV+G E L+LNE+T+W G P + A +A+
Sbjct: 26 KLWYDQPASNWNEALPLGNGRLGAMVFGVPAMERLQLNEETIWAGSPNSNAHTSAKEAIP 85
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G Y A A+ K+ D Y+ G++ + F H Y + Y R+L+L
Sbjct: 86 YVRRLIFDGDYQAAQELANEKIMSQTNDGMPYETFGNVYISFP-GHQDY--QDYYRDLNL 142
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT+ V+YSV V++TRE S+ D VI+ K++ GS++ NV + S DN
Sbjct: 143 EDATSTVRYSVDGVQYTREVLSAFEDDVIMVKLTADRPGSITCNVHMTSPHDNAEARVRG 202
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+Q+ + G + +D +G ++F IK ++ G + A++D + V+G+D
Sbjct: 203 DQLTLSG---------VSQTHDHQRGGVKFQG--RIKATNKGGQL-AVKDGLISVDGADE 250
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
L + +++F N +D + ++ + L + ++ + H++ YQ+ + RV+
Sbjct: 251 VTLYISIATNFK----NYNDLSVEYERKAEALLDAALQKDFAAIKREHIEHYQQFYDRVA 306
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
I D+ + +E+ P+ +R++ F DP L L FQF RYLLIS S+PG
Sbjct: 307 I-------DLGSTEAAEK-----PTDQRIQQFSEVHDPQLAALYFQFARYLLISCSQPGG 354
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN+ L P W+S VNIN EMNYW + NLSE EP + +S G +T
Sbjct: 355 QPANLQGIWNDMLFPPWESKYTVNINAEMNYWPAELTNLSEMHEPFLQMVREVSETGQQT 414
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A++ Y A GWV+HH TDIW + G + +A +WP GGAWL HLWE Y Y+ D DF
Sbjct: 415 AKMMYGARGWVLHHNTDIWRIT----GPIDYAASGMWPSGGAWLSQHLWERYLYSGDEDF 470
Query: 487 LEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L K AYP+++G A F LD LIE +G+L +PS+SPE+ + A ++ TMD
Sbjct: 471 L-KEAYPIMKGAAQFFLDVLIEEPVNGWLVVSPSSSPENSHVHG----ATIAAGVTMDNQ 525
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
++ ++FS +I ++E+L +++ A + + + +L P ++ + G + EW D+ DP HR
Sbjct: 526 LLFDLFSNLIRSSEILGEDQ-AFADTLKATRSKLAPMQVGQYGQLQEWMHDWDDPADKHR 584
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+G+FP + I+ + P+L AA +L RG+ GWS+ WK LWAR D +HAY+
Sbjct: 585 HVSHLYGVFPSNQISPFRTPELFDAARTSLMFRGDPSTGWSMGWKVNLWARFLDGDHAYK 644
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
+++ +LV P GG Y+N+F AHPPFQID NFG A +AEML+QS ++LLP
Sbjct: 645 LLQNQLSLVTPSTRG---GGTYANMFDAHPPFQIDGNFGCAAGIAEMLMQSQEGAIHLLP 701
Query: 726 ALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
ALP W G ++GL+ARGG E V + WKD + ++ I S N
Sbjct: 702 ALP-SVWGKGSIEGLRARGGFEIVELTWKDNKVDKLVIKSTLGGN 745
>gi|373952811|ref|ZP_09612771.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
gi|373889411|gb|EHQ25308.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
Length = 833
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 294/762 (38%), Positives = 428/762 (56%), Gaps = 56/762 (7%)
Query: 6 STSTTNP-----LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+ + TNP L++ +N P+ K + +A+PIGNGRLGAM++G V ET++LNE TLW+G
Sbjct: 26 AKAQTNPKDQTTLRLWYNKPSGKVWENALPIGNGRLGAMIYGNVGVETIQLNEHTLWSGG 85
Query: 60 PGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSH 116
P NP A +L+ +R L+ +G+ +A + K+ +++ G++ L F++
Sbjct: 86 PNRNDNPLALDSLAAIRKLIFNGKQKQAEQLANKVIISKKSQGQIFEPAGELYLAFNNQE 145
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
Y RELD+ A ++ Y VG+V FTRE F+S PD+VIV ++ S+ GS+SF
Sbjct: 146 ---NYTNYYRELDIEKAISKTSYQVGDVSFTREAFASIPDRVIVMHLTASKPGSISFTAF 202
Query: 177 LDSLLDNHSYVNGN-NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTI 234
S + + QI G ++ KG +++ I E K + GT
Sbjct: 203 YSSPQHDVAVATFQARQITFAGTTID---------HEGVKGMVRYKGIAEFKT--NGGTK 251
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
SA D + + G++ + + +++F+ N D + T + + L SY++L
Sbjct: 252 SA-TDTSVTIYGANDVTIYISIATNFN----NYHDLGGNETERAANYLNKASGKSYTELQ 306
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ YQK F+RV L + +I +P+ ER+K+F +DP L F
Sbjct: 307 KTHIAAYQKYFNRVRFSLGAA------------DISKLPTDERLKNFNQGQDPQFAALYF 354
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYLLISSS+PG Q ANLQGIWN L P WDS +NIN EMNYW + NL E EP
Sbjct: 355 QYGRYLLISSSQPGGQPANLQGIWNNKLYPAWDSKYTININAEMNYWPAEKTNLPEIHEP 414
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L++NG +TA+V Y A GW+ HH TDIW + A G W +W GG W HL
Sbjct: 415 FLQMVKELAVNGEQTAKVMYGARGWMAHHNTDIWRATGAVDG-AFWGIWNQGGGWTSEHL 473
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGK 532
WEHY Y D+D+L + Y +L G A F +D+L+E H +L NP SPE+ A G
Sbjct: 474 WEHYLYNGDKDYL-RSVYGVLRGAALFYVDFLVEQPVHH-WLVINPDMSPENAPAAHQG- 530
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
+ + +TM I+ +VFS+ I AAE+L ++ V+ + + +L P I + G + E
Sbjct: 531 -SSLDAGTTMSNQIVFDVFSSTIRAAEILNIDK-PFVDTLKQMRSKLSPMHIGQFGQLQE 588
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D DP+ +HRH+SHL+GLFP I+ + P L AA+ TL +RG+ GWS+ WK
Sbjct: 589 WLDDIDDPKDNHRHISHLYGLFPSGQISAYRTPQLFNAAKNTLLQRGDVSTGWSMGWKVN 648
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
WAR+ D HAY++++ N + P GG Y+NLF AHPPFQID NFG T+ +AEM
Sbjct: 649 WWARMLDGNHAYKLIQ---NQLTPLGVNKGGGGTYNNLFDAHPPFQIDGNFGCTSGMAEM 705
Query: 713 LVQSTLNDLYLLPALPWDKW-SSGCVKGLKARGG-ETVSICW 752
L+QS ++LLPALP D W + G + GL+A GG E VS+ W
Sbjct: 706 LMQSADGAVFLLPALP-DAWENEGSISGLRAIGGFEIVSMDW 746
>gi|182413173|ref|YP_001818239.1| hypothetical protein Oter_1354 [Opitutus terrae PB90-1]
gi|177840387|gb|ACB74639.1| hypothetical protein Oter_1354 [Opitutus terrae PB90-1]
Length = 1139
Score = 499 bits (1284), Expect = e-138, Method: Compositional matrix adjust.
Identities = 302/803 (37%), Positives = 420/803 (52%), Gaps = 74/803 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ F+ PA+HFT A P+GNGRLG M +GGV E + LNE +W+G P D P+A AL +
Sbjct: 321 VRFDAPARHFTAATPLGNGRLGLMPFGGVDEERVVLNEAGMWSGSPQDADRPNAAAALPE 380
Query: 75 VRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKYAE 121
+R L+ +GQ AEA + F P YQ+LG++ L F S
Sbjct: 381 IRRLLLAGQNAEAEKVVAENFTCAGAGSGRGRGANVPYGSYQVLGELRLAFASSASGTEV 440
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y RELDL A +RV Y V F RE F S PD+V V +++ ++ G++SF ++L+
Sbjct: 441 TNYARELDLADAVSRVSYERDGVRFEREAFVSAPDEVAVIRLTANKRGAISFELALERPE 500
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+ V +++M GR R + + F+ I I +RG D
Sbjct: 501 RATTRVLEGGRLLMSGRLSDGR---------GGENVGFATIARIV---NRGGSVESGDGV 548
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNL--SYSDLYTRHLD 299
L+V +D ++L+ A++ I +K + + + R+ S+ L HL
Sbjct: 549 LRVRAADEVLVLVTAATD-----IKSFAGRKVEDAAATAMADMDRSAQKSFGALRAAHLA 603
Query: 300 DYQKLFHRVSIQLSR----------SPKDIVTD-TCSEENIDTVPSAERVKSFQTDEDPS 348
Y+ LF RV ++LS SP + TD +E N A V DP
Sbjct: 604 HYRGLFDRVLLRLSEDGTEGGRRVPSPPQMTTDDRGAERNPRPTTQARLVAQAAGANDPG 663
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L +L F FGRYLLISS+RP NLQGIW + + W+ H+NIN++MN+W + C L
Sbjct: 664 LAQLYFDFGRYLLISSTRPDGFPPNLQGIWADGVQTPWNGDWHLNINVQMNFWPAEICGL 723
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGA 468
E + LF F L+ G++TA+ Y A GWV H + W +S G W G A
Sbjct: 724 PELHDSLFSFTQSLTEPGARTARAYYGARGWVAHVLANPWGFTSPGEG-ASWGATTTGSA 782
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFI 527
WLC HLW+HY +T DR FLE RAYP+++G A F LD LIE G+L T P+ SPE+EF+
Sbjct: 783 WLCQHLWDHYLFTGDRAFLE-RAYPMMKGSAEFYLDMLIEEPTHGWLVTAPANSPENEFV 841
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
DG A V T D I+R +F+A AA VL+ + + L ++ RL PT+IA D
Sbjct: 842 LADGTKAHVCLGPTFDNQILRSLFTATAEAARVLDVDAE-LQRELGAKTARLPPTRIAPD 900
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
G +MEW +++ + + HHRH+SHL+GL+PG I++ P+L AA KTL RG+ G GW +
Sbjct: 901 GRVMEWLENYGEADPHHRHISHLWGLYPGDEISVAGTPELAAAARKTLDARGDGGTGWCL 960
Query: 648 TWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFT 706
K LWARLHD A +++ L V + GG Y NLF AHPPFQID NFG T
Sbjct: 961 AHKLTLWARLHDGARAADLLRSLLKPAVGADQITTTGGGTYPNLFDAHPPFQIDGNFGGT 1020
Query: 707 AAVAEMLVQSTLN-------------------------DLYLLPALPWDKWSSGCVKGLK 741
A +AE+L+QS ++ LLPALP W G V+GL+
Sbjct: 1021 AGIAELLLQSRALPAAGSADQSGVTGVSPDRSAQSAGWEIELLPALP-PTWRGGEVRGLR 1079
Query: 742 ARGGETVSICWKDGDLHEVGIYS 764
ARGG V + W+DG L I+S
Sbjct: 1080 ARGGFVVDLRWRDGALERAVIHS 1102
>gi|388259826|ref|ZP_10136995.1| hypothetical protein O59_004218 [Cellvibrio sp. BR]
gi|387936552|gb|EIK43114.1| hypothetical protein O59_004218 [Cellvibrio sp. BR]
Length = 836
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 294/806 (36%), Positives = 447/806 (55%), Gaps = 67/806 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S+ + +P + + A+H+ +A+P+GNGRLGAMV+GGV + +++NE+T W G P + N
Sbjct: 29 SSPSVSPHTLWYEQAAQHWEEALPLGNGRLGAMVYGGVTRDNIQINENTFWAGGPHNNVN 88
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEE 122
P A ++L ++R L+ +G+Y A A + K G YQ G++ LEF +H +++
Sbjct: 89 PKALESLPEIRRLITAGEYLAAEALAEKTITSQGSNGMPYQTAGNLHLEFP-AHKQFSH- 146
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y R+LD+ A A +Y VG+V +TRE FSS DQV+V K+S S+ G LSF L
Sbjct: 147 -YYRDLDIGKAIATTRYQVGDVVYTREVFSSFVDQVVVVKLSASKPGQLSFTAHLSHPAT 205
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDK 240
N+ ++M+G + D +GI+ L + ++ G++S +
Sbjct: 206 MQFAQENNHTLLMQG------------MSKDHEGIKGQVKLATLVDVNTSGGSLSQ-NNN 252
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR---- 296
++ V +D A++L+ +++F +N D D + + + L S +N + YT
Sbjct: 253 RIAVSNADSALILISMATNF----VNYKDISGDALARARNYLASAKNQFTHNQYTARKHV 308
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H + Y++ F RV++QL +S ++E P+ +R++ F + DP L L FQF
Sbjct: 309 HSNFYKQYFDRVALQLGKS-------EFAQE-----PTDQRIRLFASRHDPELASLYFQF 356
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLIS S+PG Q NLQGIWN + P WDS +NIN EMNYW S L+E EP
Sbjct: 357 GRYLLISGSQPGGQPTNLQGIWNHRMDPPWDSKYTLNINAEMNYWPSEVTQLNELNEPFI 416
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
+ L+ G +TA+ Y A GW+ HH TDIW + D+ W WP AWL HLW
Sbjct: 417 QMVKELAQTGQQTAKEMYGARGWMAHHNTDIWRITGGIDK---TWGSWPTSNAWLSQHLW 473
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLA 534
E Y Y+ D+ +L YP+++ +F D+LIE D +L +PS SPE+ AP
Sbjct: 474 EKYLYSGDKTYLAD-VYPVMKSAVTFFEDFLIESPDKKWLIVSPSMSPEN---APTATGV 529
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
++ TMD ++ ++ S I+AAE+L +K + + +K+L LP P +I + + E
Sbjct: 530 KIAAGVTMDNQLLFDLLSNTIAAAEILGQDKTQIPVWKKILSRLP---PMQIGKHHQLQE 586
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W +D+ +P+ HRH+SHL+GL+P + I+ P+L AA T+++RG+ GWS+ WK
Sbjct: 587 WLEDWDEPQDKHRHVSHLYGLYPSNQISPLTAPELFSAARVTMEQRGDPSTGWSMNWKIN 646
Query: 653 LWARLHDQEHAYRMVK-RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
LWARL D + A ++++ ++ + + + GG Y N+F AHPPFQID NFGFT+ +AE
Sbjct: 647 LWARLLDGDRALKLMREQISPAMTLDGSVNESGGTYPNMFDAHPPFQIDGNFGFTSGMAE 706
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDH 771
ML QS ++LLPALP W G VKGL RGG V + W +G + E+ I+S N
Sbjct: 707 MLAQSHDGAVHLLPALP-QAWPEGEVKGLLMRGGFVVDMRWANGQIRELKIHSRLGGNLR 765
Query: 772 ----------DSFKTLHYRGTSVKVN 787
FKT RGT N
Sbjct: 766 LRTHSELPAVSDFKTKKVRGTKANPN 791
>gi|374991896|ref|YP_004967391.1| hypothetical protein SBI_09142 [Streptomyces bingchenggensis BCW-1]
gi|297162548|gb|ADI12260.1| hypothetical protein SBI_09142 [Streptomyces bingchenggensis BCW-1]
Length = 822
Score = 498 bits (1281), Expect = e-138, Method: Compositional matrix adjust.
Identities = 306/770 (39%), Positives = 435/770 (56%), Gaps = 58/770 (7%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A T+ L + + PA + +A+PIGNGRLGAMV+GG +E L+LNEDT+W G P D
Sbjct: 49 AGGTTLPGELTLWYPRPASEWLEALPIGNGRLGAMVFGGTDTERLQLNEDTVWAGGPYDP 108
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLF-GHPAD--VYQLLGDIELEFDDSHLKYA 120
NP L ++R V +G++ +A A F G+P YQ +GD+ L F +
Sbjct: 109 ANPQGLSNLPEIRRRVFAGEWGDAQALIDSTFMGNPLSELPYQTVGDLRLTFSS---QGE 165
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRRELD+++AT V+Y+ V + RE +S+PDQVI +++ GS+SF + DS
Sbjct: 166 VSDYRRELDIDSATTSVRYTQSGVTYRREIIASHPDQVIALRLTADTPGSISFTAAFDSP 225
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALED 239
I ++G G ++F A+ + + GT+ + ED
Sbjct: 226 QSVTGSSPDRITIAIDG---------TGQTRSGITGQVRFRAL--ARACAEGGTVGS-ED 273
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
KL V G+D A LL+ +S+ F NP+ D T+ + + L + ++ ++ L RH D
Sbjct: 274 GKLTVAGADSATLLVSIGTSYTD-FGNPT---GDHTARAAAPLNAASDVPFTTLRKRHTD 329
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DY++LF RV++ L + + +P+ ERVK+F + DP LV L +QFGRY
Sbjct: 330 DYRRLFRRVTLDLGST------------DAAKLPTDERVKNFASASDPQLVSLHYQFGRY 377
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLIS SRPGTQ ANLQGIWN+ LSP W +NIN EMNYW + NL EC EP+FD L
Sbjct: 378 LLISCSRPGTQPANLQGIWNDLLSPPWSCRYTININTEMNYWPAPVTNLLECWEPVFDML 437
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LS++G++TA+ Y A GWV HH D W + +A + + WP GGAWL T +W+HY
Sbjct: 438 ADLSVSGARTARTQYGARGWVAHHNVDGW-RGTAPCDQAFYGTWPTGGAWLATSIWDHYL 496
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
+T D++ L KR YP+L G F LD L+ + G+L T PS SPEH PD A V
Sbjct: 497 FTGDKEALRKR-YPVLRGAVLFFLDTLVTDPSSGHLVTCPSMSPEHAH-HPD---ASVCA 551
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWAQDF 597
TMD I+R+VF + A+E+L ++ D E + ++ +L P KI G + EW +D+
Sbjct: 552 GPTMDNQILRDVFDGFVIASELLGEDADMRAEARTVRG--KLPPMKIGAQGQLQEWQEDW 609
Query: 598 K--DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
PE +HRH+SHL+GL P + IT P+L AA KT+++RG+ G GWS+ WK WA
Sbjct: 610 DAIAPEQNHRHISHLYGLHPSNQITKRGTPELFAAARKTMEQRGDAGTGWSLAWKINFWA 669
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL + + ++++ L +L+ PE NLF HPPFQID NFG T+ + E L+Q
Sbjct: 670 RLLEGDRSFKL---LGDLLTPERTA-------PNLFDLHPPFQIDGNFGATSGITEWLLQ 719
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
S +L+LLPALP G + GL ARGG V + W D L + + S
Sbjct: 720 SHAGELHLLPALP-PALPDGRIHGLVARGGFEVDLTWSDAALADCRLRSR 768
>gi|300726579|ref|ZP_07060021.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
gi|299776147|gb|EFI72715.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
Length = 803
Score = 497 bits (1279), Expect = e-137, Method: Compositional matrix adjust.
Identities = 294/772 (38%), Positives = 433/772 (56%), Gaps = 54/772 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ +N PA+ +TDA+P+GNGRLGAMV+G +E ++LNE+T+WTG P N A A+
Sbjct: 6 KLWYNEPAQVWTDALPLGNGRLGAMVYGIPSTEHIQLNEETIWTGQPNHNANKKALNAIP 65
Query: 74 DVRSLVDSGQY--AEATAASVKLFG-HPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++ L+ G+Y A+ A + G + YQ GD+ + ++ L+Y YRREL L
Sbjct: 66 KIQQLLFEGRYHTADKMANDNVMSGTNWGMAYQTFGDVYITTPNA-LRYT--NYRRELSL 122
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
++A A Y+V V + RE +S VI ++ S+ G L+F + + +
Sbjct: 123 DSAIAVTTYTVDGVTYRREVITSFDSNVITIHLTASKPGKLTFGAHYSTPQEEILIRSEK 182
Query: 191 NQIIMEG------RCPGK-RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
N+ I+EG C GK R + G++ A + D ++
Sbjct: 183 NEAILEGVSGKLEGCKGKVRFMGRMLCETMKNGVRQEA--------------SSRDGEIT 228
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE +D A + + +++F +N D D ++S L+ +Y H+ +Q
Sbjct: 229 VENADEATIYISIATNF----VNYKDISGDEVAKSEQILRQAIAKNYEQSKKTHIAKFQS 284
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
+RVS+ L KD+ + P+ +R+ +F +D L+ F FGRYLLI
Sbjct: 285 FMNRVSLSLG---KDLYQNE---------PTDQRIINFAHRDDNGLIATYFNFGRYLLIC 332
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN + P+WDS NINLEMNYW S NLS+ EPLF + +S
Sbjct: 333 SSQPGGQAANLQGIWNHRVWPSWDSKYTTNINLEMNYWPSEIANLSDLNEPLFRLIREVS 392
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+GS +A++ Y GWV+HH TDIW + + +W +GGAWLC HLW+HY YT D
Sbjct: 393 ESGSISAKMMYGKDGWVLHHNTDIW-RVTGGIDHASSGMWMLGGAWLCAHLWQHYLYTGD 451
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
++FL K+AYPL++G A FL + LI E G+L +PS SPE+ + DGK+A ++Y +TM
Sbjct: 452 KEFL-KKAYPLMKGAAIFLDEMLIPEPEHGWLVISPSVSPENYHPSKDGKIA-ITYGTTM 509
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D ++ E+F+++ A+++L +D L + L ++ P +I + G + EW +D+ DPE
Sbjct: 510 DNTLLHELFNSVSVASQILGV-DDTLKSYYAERLKKMAPMQIGKWGQLQEWLKDWDDPED 568
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH+SHL+G+FPG+ I+ + P+L AA +L RG+ GWS+ WK LWAR D H
Sbjct: 569 THRHVSHLYGVFPGNLISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARFLDGNH 628
Query: 663 AYRMVKRLFNLVDPEH----EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
AY+++ L + +GG Y NLF AHPPFQID NFG TA + EML+QS
Sbjct: 629 AYKLIHNQLTLTNDRFVAFGTNKKKGGTYRNLFDAHPPFQIDGNFGCTAGIVEMLMQSHD 688
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
+ LLPALP D W G VKG+ ARGG E V + WK+G L ++ I S N
Sbjct: 689 GCVALLPALP-DAWKDGEVKGIVARGGFEIVDMAWKNGKLTKLVIKSKVGGN 739
>gi|268608709|ref|ZP_06142436.1| hypothetical protein RflaF_04322 [Ruminococcus flavefaciens FD-1]
Length = 772
Score = 496 bits (1278), Expect = e-137, Method: Compositional matrix adjust.
Identities = 291/773 (37%), Positives = 427/773 (55%), Gaps = 70/773 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ +N PA +F +A+P+GNGR+GAM++G E + LNED++W+G NPDA + L +
Sbjct: 7 LRYNDPAANFNEALPLGNGRIGAMIYGDAAFEKIPLNEDSVWSGGLRHRVNPDAAEGLEE 66
Query: 75 VRSLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
VR L+ G EA + KL G ++ Y LGD+ ++ + L Y R LD+
Sbjct: 67 VRRLIKEGNIPEAERIAFDKLQGVTPNMRRYMPLGDLHIDLE---LSGRARNYNRRLDIG 123
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A A V ++V +V + +E+F S PD+V+ +IS +E G ++ + +Y++G
Sbjct: 124 NAVADVTFTVNDVLYRKEYFISAPDEVMAVRISCAERGMINLS----------AYIDGRE 173
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ R GK + + GI F+A+L K G+I L ++ VE +D +
Sbjct: 174 DYYDDNRPCGKNMILFTGGSGSRDGIFFAAVLGAKARG--GSIRTL-GGRIAVEKADEVI 230
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
L+ +SF G + +K ++ AL++ Y +L H++DY+ +F RV
Sbjct: 231 LIFSVRTSFYG-----DNYEKSALIDAEMALKT----EYDELRLHHVNDYKDMFDRVDFS 281
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-----------DPSLVELLFQFGRYL 360
L + +EEN+D + +AER+K + DE D L+EL F FGRYL
Sbjct: 282 LCDN---------TEENLDRLDTAERIKRLKGDELDNKDCERLIHDNKLIELYFNFGRYL 332
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
+IS+SRPGTQ NLQGIWNE++ W S VNIN EMNYW + CNLSEC PLFD L
Sbjct: 333 MISASRPGTQPMNLQGIWNEEMIAPWGSRYAVNINTEMNYWPAESCNLSECHLPLFDLLE 392
Query: 421 YLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ NG TA+ Y + G+V HH TDIW ++ V LWP GGAWL H++EHY
Sbjct: 393 RVCENGHITAREMYGVNKGFVCHHNTDIWGDTAPQDMWVPGTLWPTGGAWLALHIFEHYE 452
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
YT+D++FL ++ Y +L+ A F ++LIE G L T PS SPE+ + PDG C+
Sbjct: 453 YTLDKEFLAEK-YHILKQAAEFFTEFLIEDESGMLVTCPSVSPENTYKLPDGTKGCLCMG 511
Query: 540 STMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
+MD II +F+ +I AAE+L+K++ A ++++LK +P+ ++ + G I EW D+
Sbjct: 512 PSMDSQIITVLFTDVIRAAEILDKDKTFAAKLKRMLKKIPQ---PEVGKYGQIKEWLVDY 568
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALW 654
+ E+ HRH+S LF L P IT K P L AA TL +R G GWS W T +W
Sbjct: 569 DEVEIGHRHISQLFALHPADLITPSKTPKLADAARATLVRRLIHGGGHTGWSCAWITNMW 628
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL+D Y +K+L H N+ HPPFQID NFG +A+AE L+
Sbjct: 629 ARLYDSRMVYENLKKLL-----AHSTS------PNMMDTHPPFQIDGNFGGISAIAESLL 677
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
QS ++ LLPALP + W +G + GL+A+GG V I WK+ L I S++
Sbjct: 678 QSVAGEIVLLPALPVE-WETGHIHGLRAKGGFGVDIEWKNSRLSSAVITSDFG 729
>gi|374333663|ref|YP_005086791.1| hypothetical protein PSE_p0242 [Pseudovibrio sp. FO-BEG1]
gi|359346451|gb|AEV39824.1| hypothetical protein PSE_p0242 [Pseudovibrio sp. FO-BEG1]
Length = 798
Score = 496 bits (1278), Expect = e-137, Method: Compositional matrix adjust.
Identities = 294/773 (38%), Positives = 415/773 (53%), Gaps = 66/773 (8%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGD-YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF- 95
MV+G S + LNEDTL++G P Y P+ + V +L+ G+ EA K +
Sbjct: 1 MVYGNPLSARIHLNEDTLYSGEPTRIYPVPEIAHQIDHVEALLRDGKLFEAQEFVRKNWT 60
Query: 96 GHPADVYQLLGDIELEF-DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
G YQ +G++ + DDS + YRR LD+ + Y F R F+S
Sbjct: 61 GRQGQAYQPVGNLFITMADDSPVS----NYRRALDIRHSLHHESYEQNRTTFERTSFASF 116
Query: 155 PDQVIVTKISGSESGSLSFNVSLDSL--LDNHSYVNGNNQIIMEGRCP------------ 200
PD VIV +++ + G+LSF++ DS ++ N ++ + G+ P
Sbjct: 117 PDNVIVVRLTADKPGTLSFSLRYDSPHPTCRTTHEAENTRLHLRGQAPAFTSSRVIERIE 176
Query: 201 ---------------GKRIPPKANANDDPKG------------IQFSAILEIKISDDRGT 233
GK P N D +G F A L +++ R
Sbjct: 177 HDQEQSRNPEIFGADGKLRPFAENEQDGHRGGIVYSEDGLGEGTYFEAGLSVELEGGR-- 234
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
E +L +EG+ L + ++SF+GP +PS KDP SAL + ++SY D
Sbjct: 235 -IRPERGELHIEGATAVTLRIAIATSFNGPDKSPSREGKDPAPIVKSALDTAGSVSYEDT 293
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
+H DD +LF RVS++L + I +P++ R++ FQ DP+L L
Sbjct: 294 LQKHSDDVLRLFDRVSLKLGNNA------------IPDLPTSTRLEQFQEKGDPALAALQ 341
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQ+GRYLLI+SSR G+Q NLQGIW+ P W S +NINLEMNYW + LS+ E
Sbjct: 342 FQYGRYLLIASSRGGSQPPNLQGIWSNLRRPQWSSNYTMNINLEMNYWPAEITGLSDLHE 401
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + L+++G++TA+ + A GW H T IW S A WPM WL +H
Sbjct: 402 PLFMLIEELAVSGARTAKKMFNAPGWCAFHNTTIWRDSVPSPCDPASAFWPMAAGWLLSH 461
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
+WEH+ YT D++FL+ RAYPL++ A F WL E DGYL STSPE+ ++ DG +
Sbjct: 462 MWEHFLYTGDKEFLKNRAYPLMKSAAEFYEWWLCENKDGYLVPKVSTSPENRYLDEDGHV 521
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
V STMD AIIRE F+ +AA++L + + L + RL P +I G + EW
Sbjct: 522 ITVDQGSTMDCAIIRETFTNTAAAAKLLGLDAE-LANTLEAKAARLLPYQIGAQGQVQEW 580
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+QDFK+ HRHLSHL+GLFP I + PDL KA+ ++L+ RG+ GWS+ WK L
Sbjct: 581 SQDFKEFMPTHRHLSHLYGLFPCDQIG-KDTPDLLKASVRSLEIRGDLATGWSMGWKICL 639
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WAR+ D +HAY+++ +FN V+ E K EGGLY NL AHPPFQID NFG+T VAEML
Sbjct: 640 WARVGDGDHAYKIIHNMFNRVENEAPKSEEGGLYGNLMIAHPPFQIDGNFGYTRGVAEML 699
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
+ +T N + LLPALP W G V+GL+ARGG V + W+ G + I S++
Sbjct: 700 MNTTHNGIELLPALP-SAWPEGEVRGLRARGGFEVDLNWQRGKPTQAKIISHH 751
>gi|224535714|ref|ZP_03676253.1| hypothetical protein BACCELL_00578 [Bacteroides cellulosilyticus
DSM 14838]
gi|224522669|gb|EEF91774.1| hypothetical protein BACCELL_00578 [Bacteroides cellulosilyticus
DSM 14838]
Length = 822
Score = 496 bits (1278), Expect = e-137, Method: Compositional matrix adjust.
Identities = 301/773 (38%), Positives = 443/773 (57%), Gaps = 49/773 (6%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
ES + K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 23 ESRLSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPATEQIQLNEETIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPNALEYIPRVRDLVFAGKYLEAQTLATEKVMAKSNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y Y REL L++A V+Y V V++ RE +S DQVI+ +++ + G ++FN L
Sbjct: 139 YT--NYYRELSLDSARTLVRYEVDGVQYRRETITSFTDQVIMVRLTANRPGRITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V ++ EG C + ++ ++ KG ++F L + + R T +
Sbjct: 197 S---PHQDVVITSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTARNTGGRMTCA-- 246
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A++ + +++F+ N D +P + L S+++ H
Sbjct: 247 -DGVLSVEGADEAIVYVSIATNFN----NYQDITGNPAERAKDYLVRAMTHSFTEARKNH 301
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
D Y++ RVS+ L + + V + +RV++F+ D LV FQFG
Sbjct: 302 TDFYRRYLTRVSLDLG------------DNRYEHVTTDKRVENFKQTNDAHLVATYFQFG 349
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF
Sbjct: 350 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFR 409
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S G +TA++ Y A+GWV+HH TDIW + A K LWP GGAWLC HLWE
Sbjct: 410 LIREVSETGKETARIMYGANGWVLHHNTDIWRITGA-VDKAPSGLWPSGGAWLCRHLWER 468
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L F + ++ E +L PS SPE+ +GK +
Sbjct: 469 YLYTGDTEFL-RSVYPILRESGRFFDEIMVKEPAHNWLVVCPSNSPENVHSGSNGK-STT 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ T+D +I ++++AII+A+++L+ + A ++ + L + P ++ G + EW D
Sbjct: 527 AAGCTLDNQLIFDLWTAIIAASDILDTDR-AFAARLSQRLREMAPMQVGRWGQLQEWMFD 585
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ DP+ HRH+SHL+GLFP + I+ ++P+L AA +L RG+ GWS+ WK LWAR
Sbjct: 586 WDDPKDVHRHVSHLYGLFPSNQISPYRSPELFDAARTSLIHRGDPSTGWSMGWKVCLWAR 645
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEML+QS
Sbjct: 646 LLDGNHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQS 702
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+YLLPALP W G VKG+ ARGG + + WK+G + + + S+ N
Sbjct: 703 HDGFIYLLPALP-TVWKDGTVKGIIARGGFELELSWKNGKVERLVVKSHKGGN 754
>gi|443292342|ref|ZP_21031436.1| Alpha-L-fucosidase [Micromonospora lupini str. Lupac 08]
gi|385884621|emb|CCH19587.1| Alpha-L-fucosidase [Micromonospora lupini str. Lupac 08]
Length = 1000
Score = 496 bits (1277), Expect = e-137, Method: Compositional matrix adjust.
Identities = 295/748 (39%), Positives = 409/748 (54%), Gaps = 55/748 (7%)
Query: 19 GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSL 78
G + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D +N AL+++R L
Sbjct: 53 GAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPHDPSNTRGAAALAEIRRL 112
Query: 79 VDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
V++ Q+ +A + + G+P YQ +G++ L F + + R LDL TAT
Sbjct: 113 VNANQWTQAQDLINQTMMGNPGGQLAYQTVGNLRLAFGSAS---GASQHNRTLDLTTATT 169
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
Y + + + RE F+S PDQVI +++ S S+SF + DS I +
Sbjct: 170 TTSYVLNGIRYQREVFASAPDQVIAMRLTADRSNSISFTATFDSPQRTTVSSPDGATIGL 229
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
+G N ++F L + + G + L+V + +L+
Sbjct: 230 DG--------VSGNMEGVTGQVRF---LALANATVSGGTVSSSGGTLRVTNATSVTVLVS 278
Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR- 314
SS+ +N + D + L + R SY L +RH+ DYQ LF RV++ L R
Sbjct: 279 IGSSY----VNYRNVGGDYGGIARQRLSAARASSYDQLRSRHVADYQALFGRVTLDLGRT 334
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANL 374
S D TD R+ + DP LLFQFGRYLLISSSRPGTQ ANL
Sbjct: 335 SAADQTTDV-------------RIAQHNSVNDPQFSALLFQFGRYLLISSSRPGTQPANL 381
Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
QGIWN+ L+P+WDS +N NL MNYW + NL+EC P+FD + L++ G++TAQV Y
Sbjct: 382 QGIWNDSLAPSWDSKYTINANLPMNYWPANTTNLAECHNPVFDLVRDLAVTGTRTAQVQY 441
Query: 435 -LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
ASGWV HH TD W +++A W +W GGAWL T +W+HY + D +FL YP
Sbjct: 442 GAASGWVTHHNTDAW-RATAVVDGAFWGMWQTGGAWLSTLIWDHYLFNGDIEFLRTN-YP 499
Query: 494 LLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFS 552
++G A F L+ L+ E GYL TNPS SPE A A V TMD I+R++F
Sbjct: 500 AMKGAAQFFLNTLVTEPTLGYLVTNPSNSPELSHHAN----ASVCAGPTMDNQILRDLFD 555
Query: 553 AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFG 612
A A+E+L+ + +V + RL P K+ G+IMEW D+ + E +HRH+SHL+G
Sbjct: 556 ACARASEILDV-DSTFRAQVRATRDRLPPMKVGSRGNIMEWLYDWVETEPNHRHISHLYG 614
Query: 613 LFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFN 672
L P + IT P L +AA +TL RG++G GWS+ WK WAR+ + + A+ +++ L
Sbjct: 615 LAPSNQITKRGTPQLFEAARRTLALRGDDGTGWSLAWKINFWARMEEGKRAHDLIRYLAT 674
Query: 673 LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKW 732
L N+F HPPFQID NFG TA +AEML+QS +L++LPALP W
Sbjct: 675 TAR----------LAPNMFDLHPPFQIDGNFGATAGIAEMLLQSHAGELHILPALP-PAW 723
Query: 733 SSGCVKGLKARGGETVSICWKDGDLHEV 760
SG V GL+ RGG TVSI W +G EV
Sbjct: 724 PSGRVAGLRGRGGHTVSITWSNGLASEV 751
>gi|261878761|ref|ZP_06005188.1| fibronectin type III domain protein [Prevotella bergensis DSM
17361]
gi|270334768|gb|EFA45554.1| fibronectin type III domain protein [Prevotella bergensis DSM
17361]
Length = 814
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 294/758 (38%), Positives = 420/758 (55%), Gaps = 48/758 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ ++ PA + +A+PIGNG LGAMV+GG ETL LNE T W+G P D + ++ L
Sbjct: 23 RLWYHQPASKWVEALPIGNGFLGAMVYGGTRQETLALNETTFWSGGPHDNNSTESLSYLP 82
Query: 74 DVRSLVDSGQYAEATAASVK--LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
++R + G+ EA + + G + LGD+ + F++ H + + Y R L+L
Sbjct: 83 EIRQKIFEGKENEAQKLIDQHVVKGPHGMRFLPLGDVRIRFEE-HGEVGQ--YSRSLNLE 139
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A V Y++G V+ R F+S PD+VI +I S SF +S+ SL + + +GN
Sbjct: 140 KALHEVSYTIGGVKIQRVSFASLPDRVIGMRIKSSRR--TSFTISVHSLFQSEAQTHGN- 196
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+EG G D +G+ + A I + + G + D L+VE +
Sbjct: 197 --ALEGTVYG----------DSQEGVAGRLRAHYRIVVKGN-GKVVPTGDS-LRVERASN 242
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ + A+++F +N D D + + + S+ L RH+ Y+ + RVS
Sbjct: 243 TEIYMAAATNF----VNFKDVSGDEKAVVNRLMAGVSGQSFDRLLKRHVRAYRCQYDRVS 298
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + S +P+ ER++ F +D +V L+F +GRYLLISSS+PG
Sbjct: 299 LTL---------NGASPSPHAQLPTDERLRQFAGSQDMGMVALIFNYGRYLLISSSQPGG 349
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN + + WDS +NIN EMNYW + CNL E +PLF + LS+ G KT
Sbjct: 350 QPANLQGIWNGERNAPWDSKYTININTEMNYWPAETCNLREAVKPLFSLIGDLSLTGEKT 409
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y GWV HH TD+W + G W ++P GG WL THLW+HY YT DR FL +
Sbjct: 410 ARQMYGCRGWVAHHNTDLWRIAGPVDG-AYWGMFPNGGGWLSTHLWQHYLYTGDRVFL-R 467
Query: 490 RAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
Y +L+G A F LD++ + GYL PS SPEH P GK + V TMD I
Sbjct: 468 LWYSVLKGAADFYLDYMQTDPRTGYLVVVPSVSPEH---GPHGK-SPVGAGCTMDNQIAF 523
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
+V S + A E+L N A + + K++ L P KI G + EW +D DP+ HRH+S
Sbjct: 524 DVLSNCLQATEILNGNR-AYADSLRKAIAALPPMKIGRHGQLQEWQEDADDPKDEHRHIS 582
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVK 668
HL+GL+P + I+ NP+L AA TL +RG+ GWS+ WK WAR+HD HA++++
Sbjct: 583 HLYGLYPSNQISPYTNPELFGAARNTLLQRGDMATGWSLAWKMNFWARMHDGNHAFKILS 642
Query: 669 RLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
L ++ D ++ G +Y NLF AHPPFQID NFG TA + EML+QS L+LLPA
Sbjct: 643 NLLRILPHDGVTRQYPNGRMYPNLFDAHPPFQIDGNFGCTAGIVEMLMQSHDGALHLLPA 702
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
LP D W+SG V+GL ARGG VS+ WKDG L E + S
Sbjct: 703 LP-DAWASGHVRGLCARGGFEVSMSWKDGRLTEAKVLS 739
>gi|379721956|ref|YP_005314087.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
gi|378570628|gb|AFC30938.1| alpha-L-fucosidase [Paenibacillus mucilaginosus 3016]
Length = 768
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 291/735 (39%), Positives = 406/735 (55%), Gaps = 55/735 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + +A+P GNGRLGAMV+GG E + LNEDTLW+G P D DA L R
Sbjct: 12 YEQPAGSWFEALPAGNGRLGAMVFGGTCRERIALNEDTLWSGEPRDTVREDAHLHLDPAR 71
Query: 77 SLVDSGQYAEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTAT 134
L+ G++AEA + P + Y LGD+EL+ D K E T YRREL L+ A
Sbjct: 72 KLIFEGRHAEAEEIIQQYMQGPDIESYLPLGDLELQSD----KEGEITDYRRELILDEAV 127
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
R +Y TRE F S DQV+ +I + L+ +SL S L G++ +
Sbjct: 128 VRTQYRTDGALQTRELFVSAADQVLALRIESEQP--LNLTISLGSPLQYAVRRTGSSGMA 185
Query: 195 MEGRCPGKRIPPKANANDDP------KGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+ GRCP R+ P +D+P +GI F A L + + ++G I + +++V
Sbjct: 186 LSGRCP-VRVLPNTVRSDEPARYEEGRGIAFEAALHV--TAEKGRIES-SGGRIRVVSGR 241
Query: 249 WAVLLLVASSSFDGPFINPSDSK--KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
LLL A++S+DG +P+ + P + L+ L YS L RHL ++ + +
Sbjct: 242 GVTLLLAAATSYDGFDRDPAAASLAGRPRALCAERLREAAGLGYSRLKERHLREHAEKYG 301
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSS 365
RV ++L + S + D +P+ R+++ Q +DP L L FQ+GRYLL+SSS
Sbjct: 302 RVDLELG------GSAADSGADADALPTDARIRAAAQGADDPGLAALFFQYGRYLLLSSS 355
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGTQ ANLQGIWN+ L P W S+ NIN++MNYW + NL+EC EPL F+ L +
Sbjct: 356 RPGTQPANLQGIWNDKLQPPWCSSWTANINVQMNYWPAEAANLAECHEPLLRFVDDLRES 415
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G + A V+Y GW HH D+W ++ G WA WPM GAWLC HLWEHY ++ D
Sbjct: 416 GRRAASVHYRCRGWTAHHNIDLWRTATPVGGSPSWAFWPMAGAWLCEHLWEHYAFSRDEK 475
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
+L R YP+L+ A F LDWL+EG DG+L T PSTSPE+ F+ DG CV+Y+STMD+A
Sbjct: 476 YL-ARVYPVLKEAAQFGLDWLVEGPDGFLVTCPSTSPENHFLTADGSQGCVTYASTMDIA 534
Query: 546 IIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
++R +F + A+ L+K+ L+E+ L+ +P P +I G + EWA+DF + E
Sbjct: 535 LLRNLFGRCMEASRQLQKDTAFRVLLEQTLRRMP---PYRIGRHGQLQEWAEDFGEAEPG 591
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQ 660
HRH +HL L P IT E P+L +A K L++R G GWS W +LWARL +
Sbjct: 592 HRHTAHLAALHPLEEITPEGEPELAEACRKALKRRLAHGGAHTGWSCAWMISLWARLCEP 651
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP-------FQIDANFGFTAAVAEML 713
E A+R + L GL+ NL AH FQID + TA + EML
Sbjct: 652 ETAHRFLDELL------------AGLHPNLTNAHRHPKVKMDIFQIDGSLAGTAGILEML 699
Query: 714 VQSTLNDLYLLPALP 728
+QS + LLPALP
Sbjct: 700 LQSHRGTVRLLPALP 714
>gi|380695292|ref|ZP_09860151.1| hypothetical protein BfaeM_15197 [Bacteroides faecis MAJ27]
Length = 824
Score = 496 bits (1276), Expect = e-137, Method: Compositional matrix adjust.
Identities = 301/773 (38%), Positives = 447/773 (57%), Gaps = 49/773 (6%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E + K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P +
Sbjct: 25 EKKVSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGIEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V N++ +G C + ++ ++ KG ++F L ++ ++G A
Sbjct: 199 S---PHQDVMINSE---KGNC--VILSGVSSLHEGLKGKVEFQGRLTVR---NQGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A + + +++F+ N D + T + S L +++ H
Sbjct: 248 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVHPFAEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++ Y++ RVS+ L E+ V + +RV++F+ D LV FQFG
Sbjct: 304 VEFYRRYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLS+ EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S +G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 412 LIKEVSESGKETARIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L+G F + ++ E +L PS SPE+ DGK A
Sbjct: 471 YLYTGDTEFL-RSVYPILKGSGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGSDGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ TMD +I ++++AIISA+ +L+ +++ + + L + P ++ G + EW D
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASRILDTDKE-FAAHLEQRLKEMAPMQVGHWGQLQEWMFD 587
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ DP HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWAR
Sbjct: 588 WDDPNDVHRHVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWAR 647
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEML+QS
Sbjct: 648 LLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQS 704
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+YLLPALP W G V G+ ARGG + + WK+G ++ + + S+ N
Sbjct: 705 YDGFIYLLPALP-TLWKDGSVTGIIARGGFELDLSWKNGKVNRLVVKSHKGGN 756
>gi|298385755|ref|ZP_06995313.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
gi|298261896|gb|EFI04762.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
Length = 824
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 301/773 (38%), Positives = 446/773 (57%), Gaps = 49/773 (6%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E + K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 25 EKKVSVQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 141 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V N++ EG C + ++ ++ KG ++F L + ++G A
Sbjct: 199 S---PHQDVMINSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTAR---NQGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A + + +++F+ N D + T + S L +++ H
Sbjct: 248 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVRPFAEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++ Y++ RVS+ L E+ V + +RV++F+ D LV FQFG
Sbjct: 304 VEFYRRYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLS+ EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S +G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 412 LIKEVSESGKETAKIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ DGK A
Sbjct: 471 YLYTGDTEFL-RSVYPILKESGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGSDGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ TMD +I ++++AIISA+ +L+ +++ + + L + P ++ G + EW D
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASRILDTDKE-FAAHLEQRLKEMAPMQVGHWGQLQEWMFD 587
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ DP HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWAR
Sbjct: 588 WDDPNDVHRHVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWAR 647
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEML+QS
Sbjct: 648 LLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQS 704
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+YLLPALP W G V G+ ARGG + + WK+G ++ + + S+ N
Sbjct: 705 YDGFIYLLPALP-TLWKDGSVTGIIARGGFELDLSWKNGKVNRLVVKSHKGGN 756
>gi|300785873|ref|YP_003766164.1| large protein [Amycolatopsis mediterranei U32]
gi|384149183|ref|YP_005531999.1| large protein [Amycolatopsis mediterranei S699]
gi|399537756|ref|YP_006550418.1| large protein [Amycolatopsis mediterranei S699]
gi|299795387|gb|ADJ45762.1| large secreted protein [Amycolatopsis mediterranei U32]
gi|340527337|gb|AEK42542.1| large protein [Amycolatopsis mediterranei S699]
gi|398318526|gb|AFO77473.1| large protein [Amycolatopsis mediterranei S699]
Length = 949
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 302/759 (39%), Positives = 417/759 (54%), Gaps = 57/759 (7%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N L + ++ PA + A+PIGNGRLGAMV G +E L+LNEDT+W G P DY+N
Sbjct: 39 NDLALWYDKPAGTEWLRALPIGNGRLGAMVSGNTDTERLQLNEDTVWAGGPHDYSNAQGA 98
Query: 70 KALSDVRSLVDSGQYAEATA-ASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETYRR 126
ALS +R LV + Q+ +A + K+ G PA YQ +G + L + +Y+R
Sbjct: 99 GALSQIRQLVFANQWTQAQSLIDQKMLGTPAAQQPYQPVGTLSLALPGNS---GVSSYQR 155
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TAT V Y NV + RE F+S DQVIV +++ GS+SF+ SL + +
Sbjct: 156 WLDLTTATTVVTYVANNVRYRREVFASAADQVIVLRLTAETPGSISFSASLGTPQRATTS 215
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA-ILEIKISDDRGTISALEDKKLKVE 245
I ++G + D +GI S L + + G ++ L+V
Sbjct: 216 SPNGTTIALDG------------ISGDSRGIAGSVRFLALAGATAEGGSTSSSGGTLRVS 263
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+D LL+ +S+ ++ D + S L + + L + L RHL DYQKLF
Sbjct: 264 GADAVTLLISIGTSY----VDYRTVNGDYQGIARSRLAAAQALPHDTLRGRHLADYQKLF 319
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R ++ L R T + + P+ R+ + DP LLFQFGRYLLISSS
Sbjct: 320 GRTTLDLGR--------TAAADQ----PTDVRIAQHNSVNDPQFAALLFQFGRYLLISSS 367
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGTQ ANLQGIWN+ L+P+W+S +N NL MNYW + NL+EC EP+F + L++
Sbjct: 368 RPGTQPANLQGIWNDQLNPSWESKYTLNANLPMNYWPADVTNLAECYEPVFAMIGDLAVT 427
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TAQV Y A GWV HH TD W SS D + +W GGAWL T +W+HY +T D
Sbjct: 428 GARTAQVEYGARGWVTHHNTDGWRGSSIVDFAQA--GMWQTGGAWLATMIWDHYRFTGDV 485
Query: 485 DFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
+FL R YPLL+G A F LD L+ E GYL TNP+ SPE A A V TMD
Sbjct: 486 EFLRAR-YPLLKGAAQFFLDTLVTEPSLGYLVTNPANSPELNHHAN----ASVCAGPTMD 540
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
M I+R++F A +VL + ++V + RL P K+ G+I EW D+ + E
Sbjct: 541 MQILRDLFDGCAGACQVLGVDA-TFADQVTAARQRLAPMKVGSRGNIQEWLYDWVETEQT 599
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH+SHL+GL+P + I+ P L AA +TL+ RG++G GWS+ WK WAR+ + A
Sbjct: 600 HRHISHLYGLYPSNQISKRGTPQLFTAARRTLELRGDDGTGWSLAWKINYWARMEEGAKA 659
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
+ ++ RL D L N+F HPPFQID NFG T+ +AE+L+ S +L+L
Sbjct: 660 HDLL-RLLVRTDR---------LAPNMFDLHPPFQIDGNFGATSGIAELLLHSHNGELHL 709
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
LPALP W +G V GL+ RGG TV W G ++ I
Sbjct: 710 LPALP-PAWPAGSVTGLRGRGGYTVGAAWSSGAATQLTI 747
>gi|383641029|ref|ZP_09953435.1| hypothetical protein SchaN1_11878 [Streptomyces chartreusis NRRL
12338]
Length = 953
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 296/756 (39%), Positives = 413/756 (54%), Gaps = 55/756 (7%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N + ++ PA + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D NP
Sbjct: 23 NDFALWYDKPAGTEWLRALPIGNGRLGAMVFGNVDNERLQLNEDTVWAGGPYDSANPRGA 82
Query: 70 KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
++++R V + Q+ A + + G PA YQ +G++ L + Y R
Sbjct: 83 ANIAEIRRRVFADQWGPAQDLINQTMLGSPAGQLAYQPVGNLLLSLGSA---TGASQYNR 139
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TATA Y +G V + RE F+S PDQVIV +++ + S++FN + DS
Sbjct: 140 TLDLTTATAVTTYVLGGVRYQREVFASAPDQVIVVRLTADRANSIAFNATFDSPQRTTVS 199
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
I ++G ++F A+ ++ GT+S+ L+V G
Sbjct: 200 SPDGATIALDGVS--------GTMEGITGRVRFLALAHAAVTG--GTVSS-SGGTLRVSG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +L+ S + ++ D + L + R++ L RHL DYQ LF+
Sbjct: 249 ATSVTVLVSIGSGY----VDFRRVDGDYQGIARRHLNAARDIGIDQLRKRHLADYQALFN 304
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L R T + + P+ R+ DP L LLFQFGRYLLISSSR
Sbjct: 305 RVSVDLGR--------TAAADQ----PTDVRIAQHAQANDPQLSALLFQFGRYLLISSSR 352
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+ ++P+WDS +N NL MNYW + NLSEC P+FD + L++ G
Sbjct: 353 PGTQPANLQGIWNDQMAPSWDSKFTINANLPMNYWPADTTNLSECFLPVFDMIDDLTVTG 412
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
++ AQ Y A GWV HH TD W +S D + W +W GGAWL T +W+HY +T D D
Sbjct: 413 ARVAQAQYGAGGWVTHHNTDAWRGASVVDEAR--WGMWQTGGAWLATLIWDHYLFTGDTD 470
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL YP L+G A F LD L+ GYL TNPS SPE A A V TMD
Sbjct: 471 FLRSN-YPALKGAAQFFLDTLVAHPSLGYLVTNPSNSPELAHHAN----ATVCAGPTMDN 525
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I+R++F+++ A EVL + + L + RL PTK+ G++ EW D+ + E H
Sbjct: 526 QILRDLFNSVARAGEVLGVDA-GFRAQALAARDRLAPTKVGSRGNVQEWLADWVETERTH 584
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL P + IT P L +AA +TL+ RG++G GWS+ WK WARL D A+
Sbjct: 585 RHVSHLYGLHPSNQITKRGTPQLHEAARRTLELRGDDGTGWSLAWKINFWARLEDGARAH 644
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
++++ +LV + L N+F HPPFQID NFG T+ +AEML+QS +L++L
Sbjct: 645 KLIR---DLVRTDR-------LAPNMFDLHPPFQIDGNFGATSGIAEMLLQSHNGELHVL 694
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
PALP W +G V GL+ RGG TV W G + V
Sbjct: 695 PALP-AAWPTGRVSGLRGRGGYTVGAEWSSGRIEFV 729
>gi|399025527|ref|ZP_10727523.1| hypothetical protein PMI13_03496 [Chryseobacterium sp. CF314]
gi|398077904|gb|EJL68851.1| hypothetical protein PMI13_03496 [Chryseobacterium sp. CF314]
Length = 820
Score = 495 bits (1275), Expect = e-137, Method: Compositional matrix adjust.
Identities = 289/769 (37%), Positives = 425/769 (55%), Gaps = 59/769 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ + +A+PIGNGRL AMV+G E L+LNE T W+G P NPD PK L
Sbjct: 27 KLWYDKPARQWVEALPIGNGRLAAMVFGDPFKEKLQLNESTFWSGGPSRNDNPDGPKVLD 86
Query: 74 DVRSLVDSGQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R + + Y +A + K +Q +GD+ LEF++ E Y RELD+
Sbjct: 87 SIRYYLFNENYKKAEILANKGLTAKTLHGSAFQNIGDLNLEFNNPG---DIENYYRELDI 143
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A +S + + RE F+S PD VI+ K+S + +L+FN +S L +
Sbjct: 144 EKALITTTFSSNGIHYKREAFASVPDNVIIIKLSSDKKNALNFNAGFNSELKKNVKTIDA 203
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N + M+G ++ D +G ++F+ + + +G +++ D ++ V +D
Sbjct: 204 NTLQMDGI---------SSTLDGVQGQVKFNVLAKFIT---KGGTNSVSDNRISVANADE 251
Query: 250 AVLLLVASSSF-DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
++L+ +++F D +N D S+S + +++ L+ HL+ YQK F R+
Sbjct: 252 VLILISIATNFTDYKTLN-----TDEVSKSKKYISQSETKNFNTLFKNHLNAYQKYFKRI 306
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
L SP P+ RVK+F + DP L+ L +QFGRYLLISSS+PG
Sbjct: 307 DFSLGTSPAA------------QFPTDLRVKNFASGYDPELISLYYQFGRYLLISSSQPG 354
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q ANLQGIWN P WDS +NIN EMNYW + NL+E EPL + LS+ G +
Sbjct: 355 GQPANLQGIWNNSNKPAWDSKYTININTEMNYWPAEKTNLAEMHEPLVQLVKDLSVTGVE 414
Query: 429 TAQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA++ Y + GWV HH TDIW + A+ G+ WPMGGAWL HLWE Y Y D+
Sbjct: 415 TARIMYKSRGWVAHHNTDIWRITGVVDFANAGQ-----WPMGGAWLSQHLWEKYLYGGDK 469
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
++L K Y +L+ A F D+LIE H +L +PS SPE+ I + + +S +TM
Sbjct: 470 NYL-KSIYTVLKSAALFYEDFLIEEPVHQ-WLVVSPSISPEN--IPKRNRGSALSAGNTM 525
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALV--EKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
D +I ++FS AA++L + D + ++ LP P KI G + EW +D+ +P
Sbjct: 526 DNQLIFDLFSKTKKAAQILNVDSDKIPVWNTIISKLP---PMKIGRYGQLQEWMEDWDNP 582
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
+ +HRH+SHL+GLFPG+ I P+L A++ L RG+ GWS+ WK LWA+L D
Sbjct: 583 KDNHRHVSHLYGLFPGNQINPITTPELFDASKTVLIHRGDVSTGWSMGWKINLWAKLLDG 642
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
HA +++K L++ + GG Y NLF AHPPFQID NFG T+ + EML+Q+
Sbjct: 643 NHANKLIKDQLTLIEKDGRSE-SGGTYPNLFDAHPPFQIDGNFGCTSGITEMLLQTQNGS 701
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+ +LPALP D+W +G + GLKA GG +SI WKD E+ I SN N
Sbjct: 702 IDILPALP-DEWKNGNISGLKAYGGFEISIVWKDHQATEIMIRSNLGGN 749
>gi|430751376|ref|YP_007214284.1| hypothetical protein Theco_3231 [Thermobacillus composti KWC4]
gi|430735341|gb|AGA59286.1| hypothetical protein Theco_3231 [Thermobacillus composti KWC4]
Length = 765
Score = 495 bits (1274), Expect = e-137, Method: Compositional matrix adjust.
Identities = 300/802 (37%), Positives = 444/802 (55%), Gaps = 67/802 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + PA + +A+PIGNGRLGAMV GG+ E L++NE+T W+G P DY P A + L
Sbjct: 1 MKLWYAKPASDWLEALPIGNGRLGAMVHGGMERERLQINEETFWSGGPHDYRRPGASRYL 60
Query: 73 SDVRSLVDSGQYAEATAA-SVKLFGHPADVYQLL--GDIELEFDDSHLKYAEETYRRELD 129
VR L+ + EA ++ G P ++ L D+ L F H Y RELD
Sbjct: 61 RQVRELIFQDKVEEAQQLFDERMKGDPELLHAFLPCCDMMLHFP-GHAD--GRDYYRELD 117
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVN 188
L+ A A +Y V V +TRE F S PDQ I+ +IS G + L + +
Sbjct: 118 LDRAVATTRYRVNGVTYTREVFCSYPDQAIIMRISSDCPGKIDMAGELAAANGEQRVRFA 177
Query: 189 GNNQIIMEGRCPGKR--IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
G++ +++ G+ GKR P + NA D G++F A ++ + G + E + L+V G
Sbjct: 178 GDDTLVLTGQA-GKREARPRRLNAGWDGPGVRFEA--RLRAFSEGGRVLRGE-QALEVRG 233
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D L+ A++SF +N DP +++ ++ ++ +Y +L RHL+DY L+
Sbjct: 234 ADAVTLIFSAATSF----VNYRSIDGDPGAKAAGVIERLQGKTYGELLGRHLEDYTALYR 289
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV ++L D P+ ERV+ + EDP L L +Q+GRYLLI+SSR
Sbjct: 290 RVELELGDGAGD------------GTPTDERVRMYAETEDPGLAALFYQYGRYLLIASSR 337
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+D P W S NIN++MNYW + NL EC PLFD + L I G
Sbjct: 338 PGGQPANLQGIWNDDPWPLWGSKWTTNINVQMNYWPAESGNLRECHLPLFDLIDDLRITG 397
Query: 427 SKTAQVNYLASGWVIHHKTDIW-AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
++TA+ +Y G+V+HH TD+W A + D A+WPMGG WL HLW+HY Y D+
Sbjct: 398 AETAETHYGCRGFVVHHNTDLWRAATPVDYDA---AVWPMGGVWLVQHLWDHYEYCPDQA 454
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGY-----LETNPSTSPEHEFIAPDGKLACVSYSS 540
FL R YP L A F+LD+L E +G L TNPS SPE+ +I G+ ++ ++
Sbjct: 455 FLRNRVYPALREAALFVLDYLTEAPEGTRLAGKLVTNPSYSPENHYIDDKGRRRYLTCAA 514
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD+ +IR++F + AAE+L +ED E + +++ RL +I + G + EWA+D+ P
Sbjct: 515 TMDIQLIRDLFQRCMKAAEMLGVDEDFRGE-LEEAMARLPGMQIGKYGQLQEWAEDWDRP 573
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG-EEGPGWSITWKTALWARLHD 659
+ H+ H+SHL+GL+PG+ I+++ P+L +A ++L+ RG + W W+ AL A L D
Sbjct: 574 DDHNSHVSHLYGLYPGNQISVKDTPELAEAVGRSLELRGTHDFRAWPAAWRIALHAHLRD 633
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF--QIDANFGFTAAVAEMLVQS- 716
A+R RL NL+ NL PP QID NFG TAA+AEML+QS
Sbjct: 634 ARMAHR---RLVNLIALSAN--------PNLLNEKPPLPMQIDGNFGGTAAIAEMLLQSR 682
Query: 717 -------TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+ ++ LLPALP +WS G VKGL+ARGG ++ W++ L E +++
Sbjct: 683 SRYDGTAAVYEIELLPALP-AQWSRGRVKGLRARGGFELAFAWENERLTEASLHALCG-- 739
Query: 770 DHDSFKTLHYRGTSVKVNLSAG 791
++Y SV++ S G
Sbjct: 740 ---GICRIYYGDRSVQLETSKG 758
>gi|443289925|ref|ZP_21029019.1| Extracellular cellulase [Micromonospora lupini str. Lupac 08]
gi|385886837|emb|CCH17093.1| Extracellular cellulase [Micromonospora lupini str. Lupac 08]
Length = 947
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 293/744 (39%), Positives = 413/744 (55%), Gaps = 54/744 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N L+++R V + Q+ +
Sbjct: 61 ALPIGNGRLGAMVFGNVDTERLQLNEDTIWAGGPYDSANTRGAANLAEIRRRVFADQWTQ 120
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G+P YQ +G++ L F + Y R LDL TAT Y +
Sbjct: 121 AQDLINQTMMGNPGGQLAYQPVGNLRLAFGSAS---GASQYNRTLDLTTATVTTTYVLNG 177
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+S PDQVIV +++ +GS++FN + DS I ++G
Sbjct: 178 VRYQRESFASAPDQVIVIRLTADRAGSITFNATFDSPQRTTVSSPDAATIGVDG------ 231
Query: 204 IPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
+ A + G ++F A+ + GT+S+ L+V G+ +L+ SS+
Sbjct: 232 ---ISGAMEGVNGSVRFLALAHAVATG--GTVSS-SGGTLRVSGATSVTVLISIGSSY-- 283
Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
+N D + + L + R +++ L +RHL DYQ LF+RV+I L R
Sbjct: 284 --VNFRTVNGDYQGIARTRLNAARGVAFDQLRSRHLADYQALFNRVTIDLGR-------- 333
Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
T + + P+ R+ + DP LLFQFGRYLLISSSRPGTQ ANLQGIWN+ +
Sbjct: 334 TAAADQ----PTDVRIAQHASTNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDSM 389
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIH 442
+P WDS +N NL MNYW + NL EC P+FD + L++ G++ AQ Y A GWV H
Sbjct: 390 TPPWDSKYTINANLPMNYWPADTTNLPECFLPVFDMIKDLTVTGARVAQAQYGAGGWVTH 449
Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
H TD W +S G +W +W GGAWL T +WEHY +T D FL YP L+G A F
Sbjct: 450 HNTDGWRGASVVDG-ALWGMWQTGGAWLSTLIWEHYLFTGDVGFLSAN-YPALKGAAQFF 507
Query: 503 LDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
LD L+ GYL TNPS SPE P A V TMD I+R++F A+ A EVL
Sbjct: 508 LDTLVAHPTLGYLVTNPSNSPE----LPHHSNASVCAGPTMDNQILRDLFDAVAQAGEVL 563
Query: 562 EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITI 621
+ +V + RL P+++ G++ EW D+ + E +HRH+SHL+GL P + IT
Sbjct: 564 GVDA-TFRSQVRTARDRLAPSRVGSRGNVQEWLADWVETERNHRHVSHLYGLHPSNQITK 622
Query: 622 EKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH 681
P L +AA +TL+ RG++G GWS+ WK WARL D A+++++ +LV +
Sbjct: 623 RGTPALYEAARRTLELRGDDGTGWSLAWKINYWARLEDGTRAHKLIR---DLVRTDR--- 676
Query: 682 FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLK 741
L N+F HPPFQID NFG T+ +AEML+ S +L+LLPALP W +G V GL+
Sbjct: 677 ----LAPNMFDLHPPFQIDGNFGATSGIAEMLLHSHTGELHLLPALP-SGWPTGQVAGLR 731
Query: 742 ARGGETVSICWKDGDLHEVGIYSN 765
RGG TV + W G E+ + ++
Sbjct: 732 GRGGYTVGVRWTSGQADEISVRAD 755
>gi|189467819|ref|ZP_03016604.1| hypothetical protein BACINT_04211 [Bacteroides intestinalis DSM
17393]
gi|189436083|gb|EDV05068.1| hypothetical protein BACINT_04211 [Bacteroides intestinalis DSM
17393]
Length = 1061
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 295/765 (38%), Positives = 425/765 (55%), Gaps = 49/765 (6%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
TS N +K+ + PA+ + +A+P+GN RLGAMV+GG E L+LNE+T W G P + NP
Sbjct: 265 TSAQN-MKLWYARPAQDWLEALPLGNSRLGAMVFGGTAREELQLNEETFWAGGPYNNNNP 323
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
+ L ++R L+ G+ EA + + P Y +G + L F H +E Y
Sbjct: 324 KGLQVLPEIRRLIFEGKTLEAQKLIDENYMTPQHGMRYLTMGSLFLNFP-GHENPSE--Y 380
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+L+L ATA +Y V V+F R F+S D VI+ +I ++ +L+F VS S L +
Sbjct: 381 YRDLNLENATATTRYEVDGVKFVRTAFASLSDDVIIVRIQADKAKALNFAVSYSSPLKSD 440
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
V G II C G A P ++ +++K G +S E L V
Sbjct: 441 VQVKGGKLII---SCQG------AEHEGIPAAMRAECQVQVKTD---GKVSKAESA-LAV 487
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ L + A+++F +N D + + + + LQ + Y H+ Y+K
Sbjct: 488 NGATEVTLYISAATNF----VNYHDVSANESKRAATYLQKATRIPYEQALKSHIASYRKQ 543
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RV++ L + + + + RV+ F D ++ L+FQ+GRYLLISS
Sbjct: 544 YDRVALTLEST------------GVSALETPVRVQRFIEGNDMAMAALMFQYGRYLLISS 591
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+PG Q ANLQGIWN L WDS +NIN EMNYW + NLSE EPLFD +T L++
Sbjct: 592 SQPGGQPANLQGIWNHSLYAPWDSKYTININAEMNYWPAEVTNLSETHEPLFDMVTDLAV 651
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
GS+TA+V Y A GWV HH TDIW ++ + +WP GGAW+ HLW+HY +T D+
Sbjct: 652 TGSETAKVLYDAKGWVAHHNTDIW-RACGPVDAASFGMWPNGGAWVAQHLWQHYLFTGDK 710
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL K+ YP+L+G A F L L+E H Y + T PS SPEH + G ++ TM
Sbjct: 711 EFL-KKYYPILKGTADFYLSHLVE-HPKYKWMVTVPSMSPEHGY---RGSQTTITAGCTM 765
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPE 601
D I + + + A+ +L D L E L++ L +L P +I + + EW D +P
Sbjct: 766 DNQIAFDALYSTLLASRIL--GGDKLYEDSLQAMLDKLPPMQIGKHNQLQEWLIDADNPL 823
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GL+P + I+ NP+L +AA TL +RG+ GWSI WK WAR+ D
Sbjct: 824 DDHRHISHLYGLYPSNQISPITNPELFQAARNTLIQRGDMATGWSIGWKINFWARMLDGN 883
Query: 662 HAYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
HAY++++ + +L+ D +++ EG Y NLF AHPPFQID NFG+TA VAEML+QS
Sbjct: 884 HAYKIIQNMLHLLPSDKVQKEYPEGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDG 943
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++LLPALP + W G VKGL ARGG V + W L + I+S
Sbjct: 944 AVHLLPALP-EAWKKGSVKGLVARGGFVVDMEWDGVQLKKAKIHS 987
>gi|238062935|ref|ZP_04607644.1| large secreted protein [Micromonospora sp. ATCC 39149]
gi|237884746|gb|EEP73574.1| large secreted protein [Micromonospora sp. ATCC 39149]
Length = 932
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 291/739 (39%), Positives = 409/739 (55%), Gaps = 54/739 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N L+++R V + Q+ +
Sbjct: 42 ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANLAEIRRRVFADQWTQ 101
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G+PA YQ +G++ L F + Y R LDL TATA Y +
Sbjct: 102 AQDLINQTMVGNPAGQLAYQPVGNLRLAFGSAS---GASQYNRALDLTTATATTTYVLNG 158
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+S PDQVIV +++ + S++FN + DS + I ++G
Sbjct: 159 VRYQREVFASAPDQVIVIRLTADRANSITFNATFDSPQRTTVSSPDSATIGLDG------ 212
Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
AN + ++F A+ ++ GT+S+ L+V G+ +L+ +S+
Sbjct: 213 --ISANMDGVTGQVRFLALANASVTG--GTVSS-SGGTLRVSGATSVTVLVSIGTSY--- 264
Query: 264 FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
+N D + + L + R + L RHL DYQ LF+RV+I L R+
Sbjct: 265 -VNYRTVNGDYQGIARTRLNAARTAGFDQLRARHLADYQALFNRVTIDLGRT-------A 316
Query: 324 CSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
+++ D R+ DP LLFQFGRYLLISSSRPGTQ ANLQGIWN+ ++
Sbjct: 317 AADQTTDV-----RIAQHANTNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDQMA 371
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
P+WDS +N NL MNYW + NLSEC P+FD + L++ G++ AQ Y A GWV HH
Sbjct: 372 PSWDSKYTINANLPMNYWPADTTNLSECFLPVFDMIKDLTVTGARVAQAQYGAGGWVTHH 431
Query: 444 KTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
TD W +S D + +W GGAWL T +W+HY +T D +FL YP ++G A F
Sbjct: 432 NTDAWRGASVVDYAQS--GMWQTGGAWLATMIWDHYLFTGDLEFLRAN-YPAMKGAAQFF 488
Query: 503 LDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
LD L+ YL TNPS SPE + A V TMD I+R++F+ + A+EVL
Sbjct: 489 LDTLVAHPTLSYLVTNPSNSPELSHHSN----AFVCAGPTMDNQILRDLFNGVALASEVL 544
Query: 562 EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITI 621
+ +V + RL PTK+ G++ EW D+ + E HRH+SHL+GL P + IT
Sbjct: 545 GVDA-TFRTQVRTAKDRLPPTKVGSRGNVQEWLADWVETERTHRHVSHLYGLHPSNQITK 603
Query: 622 EKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH 681
P L +AA +TL+ RG++G GWS+ WK WARL D A++++K +LV +
Sbjct: 604 RGTPQLYEAARRTLELRGDDGTGWSLAWKINFWARLEDAARAHKLLK---DLVRTDR--- 657
Query: 682 FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLK 741
L N+F HPPFQID NFG T+ +AEML+QS N+L+LLPALP W +G V GL+
Sbjct: 658 ----LAPNMFDLHPPFQIDGNFGATSGIAEMLLQSHNNELHLLPALP-SAWPTGSVTGLR 712
Query: 742 ARGGETVSICWKDGDLHEV 760
RGG TV W + V
Sbjct: 713 GRGGYTVGAAWSSSRIELV 731
>gi|424876717|ref|ZP_18300376.1| hypothetical protein Rleg5DRAFT_1090 [Rhizobium leguminosarum bv.
viciae WSM1455]
gi|393164320|gb|EJC64373.1| hypothetical protein Rleg5DRAFT_1090 [Rhizobium leguminosarum bv.
viciae WSM1455]
Length = 747
Score = 494 bits (1272), Expect = e-137, Method: Compositional matrix adjust.
Identities = 296/785 (37%), Positives = 427/785 (54%), Gaps = 60/785 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G E L++NE T W G P NPDA L VR
Sbjct: 8 YDTPAQLWTDALPLGNGRLGAMVFGDPLREHLQINEATFWAGGPYQPVNPDAFGHLETVR 67
Query: 77 SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ +YA+A A + K L P YQ +GD+ LEFD + + YRR LDL+TA
Sbjct: 68 QLIFDCRYADAEALAEKHLMARPIKQMSYQPIGDLHLEFDH---RESVSGYRRALDLDTA 124
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y+ + + RE F S D V+V ++S ++S +S+DS + +Q+
Sbjct: 125 IATSSYTADGIAYLREAFVSPVDGVLVLRLSADRKRAISCRISIDSPQQGEMRIGQGSQL 184
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
G+ GK A A ++F+ +++ + GT++A L VEG+D ++
Sbjct: 185 SFSGK--GKAESGIAAA------LRFT--FGVRMVNSGGTVNA-SRGALSVEGADEVLVF 233
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A++SF D P + + L+ + ++ L H++++++LF +I L
Sbjct: 234 LDAATSFR----RYDDVLGHPERDIVDRLERAASRDFASLRDDHIEEHRRLFSAFAIDLG 289
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
+P ++P+ +R+ F +DP+L L QFGRYL+I+SSRPGTQ AN
Sbjct: 290 STPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQPAN 337
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIWN + P W S NINL+MNYW P NL EC EPL + L+ G A ++
Sbjct: 338 LQGIWNAETDPPWGSKYTANINLQMNYWLPAPANLPECLEPLVEMAEELAETGKAMAHIH 397
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
Y A GWV+HH TD+W + G W LWP GG WL L + +Y D + + +R +P
Sbjct: 398 YRARGWVMHHNTDLWRATGPIDG-AKWGLWPTGGIWLMAQLLDACDYLDDAEAMRRRLFP 456
Query: 494 LLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
+ A FL D L+ G D YL TNPS SPE+ P G C MD +IR+ F
Sbjct: 457 VAREAAHFLFDVLVPFPGTD-YLVTNPSLSPENAH--PHGASICA--GPAMDSQLIRD-F 510
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHHRHLSH 609
++ V E LV + + LPRL P +I +G + EW +D+ + PE+HHRH+SH
Sbjct: 511 LGLLRPLAVSIGGEPDLVADIDRVLPRLAPDRIGANGQLQEWLEDWDMQAPEMHHRHVSH 570
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
L+GL+P I ++K P+L AA ++L+ RG++ GW I W+ LWARL D HA+ ++K
Sbjct: 571 LYGLYPSWQIDMDKTPELAAAARRSLEIRGDDATGWGIGWRINLWARLRDGNHAHNVLKL 630
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPW 729
L PE Y NLF AHPPFQID NFG A + EMLVQS +++LLPALP
Sbjct: 631 LLT---PERS-------YKNLFDAHPPFQIDGNFGGAAGIVEMLVQSRPGEIHLLPALP- 679
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLS 789
W G ++GL+ RGG + + W+DG + I S N L + T KV+L+
Sbjct: 680 TAWPGGRIRGLRLRGGILLDLDWEDG--RPLAIRLTASRN---VSSILRFGETRRKVDLA 734
Query: 790 AGKIY 794
AG+ +
Sbjct: 735 AGESF 739
>gi|254446849|ref|ZP_05060324.1| hypothetical protein VDG1235_197 [Verrucomicrobiae bacterium
DG1235]
gi|198256274|gb|EDY80583.1| hypothetical protein VDG1235_197 [Verrucomicrobiae bacterium
DG1235]
Length = 800
Score = 494 bits (1271), Expect = e-137, Method: Compositional matrix adjust.
Identities = 289/783 (36%), Positives = 423/783 (54%), Gaps = 48/783 (6%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T+ T + F+ T++IP+GNGRLGA +G V ET+ LNE +W+G P +
Sbjct: 21 ATAQTPERSVWFDSAGASLTESIPLGNGRLGASFFGMVEEETVILNESGMWSGSPQEADR 80
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEF 112
DA KAL +++ L+ G+ AEA A F P YQ+L + +
Sbjct: 81 MDAHKALPEIKRLLLEGRNAEAEALVNANFTCAGRGSGYGGGANDPYGSYQILAKLHIVD 140
Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
+ YRRELDL TAT R + G V + RE F+S PD+ +V + + SE+G L
Sbjct: 141 RSESSDTVVKNYRRELDLATATYRHSFERGGVGYIRESFASRPDEALVVRFTASEAGGLD 200
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
+ SL G + ++M G+ + G++++ +L+ + RG
Sbjct: 201 LDFSLSREERMQVEPLGADALLMTGQL--------NDGYGGEDGVRYAGVLK---ASARG 249
Query: 233 TISALEDKKLKVEGSDWAVLLL-----VASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
E+ +L+V G+D ++ +A SF G + +DP + + L + +
Sbjct: 250 GEVRSEEGRLEVRGADEVIVYFTTANDIAKRSFAGRMV------EDPIATAKLDLAGVES 303
Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP 347
S+ +L RH+ +++ + RVS+QL ++ + V ++ +DP
Sbjct: 304 YSFEELKRRHVAAFREYYGRVSLQLG-------SEELAASRAKVATPQRLVDHWEGVDDP 356
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L L F FGRYLLISSSRPG Q ANLQGIW++ + W+ H NIN++MNYW + CN
Sbjct: 357 DLAALYFDFGRYLLISSSRPGGQPANLQGIWSDTIQTPWNGDWHANINVQMNYWPAELCN 416
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE EP+F + L G KTA+ Y A GWV + W +S W
Sbjct: 417 LSELHEPMFKLIESLVEPGRKTAKAYYDAEGWVSFLLANPWGFTSPGE-SASWGSTVSCS 475
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWLC HLW+HY +T D FL + AYP+L+ A F L+E G+L T PS SPE F
Sbjct: 476 AWLCQHLWDHYLFTKDEAFL-RWAYPILKDSAVFYSQMLMEDTRTGWLVTCPSNSPESAF 534
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
+G+ VS T+D ++R +F A I AAE+L ++ + E KS RL PT+I
Sbjct: 535 KLANGETVHVSMGPTIDQQLLRYLFGACIEAAEILGQDPEFAAELAEKS-ARLAPTQIGS 593
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
DG +MEW +++++ + HHRH+SHL+GL+PG+ I E P L AA KTL++RG+ G GWS
Sbjct: 594 DGRVMEWLEEYEEVDPHHRHISHLWGLYPGNEIHPETTPQLAAAARKTLERRGDGGTGWS 653
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEH-EKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ K LWARL D + +++++ L D + E +F GG Y NL+ AHPPFQID NFG
Sbjct: 654 LAHKLNLWARLGDGDRVHKLMRALLKPADVKTPEFNFSGGTYPNLYDAHPPFQIDGNFGG 713
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
TAA+AE L+QS + LLPALP +W G V GL+ARGG VS+ W +G L + + S+
Sbjct: 714 TAAIAESLLQSDGKRIVLLPALP-SEWKEGYVSGLRARGGFEVSLIWSEGMLKQAEVRSD 772
Query: 766 YSN 768
+S
Sbjct: 773 FSG 775
>gi|325298118|ref|YP_004258035.1| alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
gi|324317671|gb|ADY35562.1| Alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
Length = 820
Score = 494 bits (1271), Expect = e-136, Method: Compositional matrix adjust.
Identities = 302/771 (39%), Positives = 421/771 (54%), Gaps = 44/771 (5%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+S LK+ + PA + +A+P+GN +G MV+GG E L+LNE+T+W G P NP
Sbjct: 18 SSWAESLKLWYRQPAHVWVEALPLGNSNMGVMVYGGTGVEQLQLNEETMWGGGPHRNDNP 77
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETY 124
A +AL +VR L+ + EA K F G YQ +G + +E H ++A + Y
Sbjct: 78 KALQALPEVRKLIFDNRNMEAQQLIDKTFYSGRNGMPYQTIGSLMIE-QPGH-EHATDYY 135
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R +LDL A A V+Y V V + RE F+S D+VI ++ G L+F + S L H
Sbjct: 136 R-DLDLERAVATVRYQVDGVTYRREVFASLVDKVIRVHLTADRPGMLTFTLGYQSPLTRH 194
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKL 242
C GK + N +D +G++ +E ++ G + A DK L
Sbjct: 195 QVT-----------CKGKTLVLTGNG-EDHEGVKGVIRMETGTQVMAKGGKVKAQGDK-L 241
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VEG+D V L VAS++ F + +D +P L+ SY+ H Y+
Sbjct: 242 CVEGAD-EVTLYVASAT---NFRSYNDVSGNPHRSVQELLKKAVKTSYTQALADHEAYYR 297
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
K F RV + L E D + ER++ F +D SL L+FQ+GRYLLI
Sbjct: 298 KQFDRVRLDLG------------EGQGDQWETTERIRRFNEGKDVSLAALMFQYGRYLLI 345
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSS+PG Q ANLQGIWN+ L WD +NIN EMNYW + NL E +PLF+ + L
Sbjct: 346 SSSQPGGQAANLQGIWNDKLLAPWDGKYTININTEMNYWPAEVTNLPETHQPLFELVKEL 405
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA+V Y A+GWV HH TDIW + + K + WP GGAWL THLW+HY YT
Sbjct: 406 SQTGQETARVMYGANGWVAHHNTDIW-RCTGPVDKAFYGTWPNGGAWLTTHLWQHYLYTG 464
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPD-GKLACVSYSS 540
D++FLE+ YP L+G A F L +LI G++ PS SPEH + GK + +
Sbjct: 465 DKEFLEE-VYPALKGAADFYLSYLIPHPKYGWMVEAPSMSPEHGPQGENTGKASTIVAGC 523
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD I+ +V + + A +L+ + A + + + +L P +I + + EW +D +P
Sbjct: 524 TMDNQIVFDVLNNALHATRILDGSV-AYQDSLRWMIEQLPPMQIGQYNQLQEWLEDLDNP 582
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
HRH+SH +GLFP + I+ +P L +A + T+ +RG+E GWSI WK LWARL D
Sbjct: 583 RDRHRHISHAYGLFPSNQISPYAHPLLFQAIKNTMLQRGDEATGWSIGWKINLWARLLDG 642
Query: 661 EHAYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
HAY+M+ + L+ D ++ EG Y NLF AHPPFQID NFG+TA VAEML+QS
Sbjct: 643 NHAYKMIGNMLKLLPSDSVKTQYPEGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLMQSHD 702
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
++LLPALP D W G VKGL ARGG V + W L + I+S N
Sbjct: 703 GAVHLLPALP-DVWVKGSVKGLVARGGFVVDMEWDGVQLAKAKIHSRLGGN 752
>gi|380693852|ref|ZP_09858711.1| alpha-L-fucosidase [Bacteroides faecis MAJ27]
Length = 772
Score = 493 bits (1270), Expect = e-136, Method: Compositional matrix adjust.
Identities = 290/741 (39%), Positives = 406/741 (54%), Gaps = 48/741 (6%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEAT-AASVKL-- 94
MV+G +E ++LNE+T+ G P N +A +AL +R L+ G YAEA A K+
Sbjct: 1 MVYGDPVNEEIQLNEETVSAGSPYKNYNSEAKEALPAIRKLIFDGNYAEAQLMAGEKILS 60
Query: 95 ---FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHF 151
FG P YQ +G + L F YRRELD++ A A Y V VE+ RE F
Sbjct: 61 KNGFGMP---YQTVGSLRLHFQGQE---NHTDYRRELDIDKALAITTYRVNGVEYKRETF 114
Query: 152 SSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
+S DQ+++ +++ S+ G L+F +L V+G N I M G G + A
Sbjct: 115 TSFTDQLVIVRLTASKPGMLTFTAALTCPQAVEVSVSGKNTIKMSGITKGDQFTEGA--- 171
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSK 271
I+F+A L++++ +G S +D L V +D AVL + +++F +N D
Sbjct: 172 -----IRFAADLKLEL---QGGKSIAQDSVLSVSNADSAVLYIAMATNF----VNYKDIS 219
Query: 272 KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKDIVTDTCSEENID 330
D + L++ +YS H+ YQK +HRVS+ L S D TD
Sbjct: 220 ADAVKRNQVYLRNAGK-NYSKALQEHIAAYQKYYHRVSLDLGYTSQADKPTDV------- 271
Query: 331 TVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAP 390
RVK F +DP L+ L FQ+GRYLLISSS+PG Q ANLQGIWN+ L+P W
Sbjct: 272 ------RVKEFAVSDDPQLISLYFQYGRYLLISSSQPGRQPANLQGIWNDKLNPVWKCRY 325
Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAK 450
N+N EMNYW + NLSE EP + L NG + A+ Y GWV+HH TD+W
Sbjct: 326 TTNVNAEMNYWPAEVTNLSEMHEPFLQMIRELYENGQEAAREMYGCRGWVLHHNTDLWRM 385
Query: 451 SSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EG 509
+ A K WP AWLC HLWE Y Y+ D+DFL YP+++ + F +D+L+ +
Sbjct: 386 NGA-VDKAYCGTWPTCNAWLCHHLWERYLYSGDKDFLAS-VYPIMKSASEFFVDFLVRDP 443
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
+ GY+ PS SPE+ GK A + TMD ++ ++F+ +AA +L ++
Sbjct: 444 NTGYMVVTPSNSPENAPRQWKGK-ANLFAGITMDNQLVFDLFTNTEAAAHILNGKDEQFC 502
Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCK 629
+ + +L P ++ + G + EW +D+ +P HHRHLSHL+GLFPG I+ +P L +
Sbjct: 503 DTIRSLKKQLPPMQVGQYGQLQEWFEDWDNPNDHHRHLSHLWGLFPGFQISPYSSPILFE 562
Query: 630 AAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSN 689
A TL +RG+ GWS+ WK WAR D HA +++ NLV P +K GG Y N
Sbjct: 563 ATRNTLMQRGDPSTGWSMGWKVCFWARCLDGNHALKLITNQLNLVSPLVQKGQGGGTYPN 622
Query: 690 LFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG-ETV 748
LF AHPPFQID NFG TA +AEMLVQS + ++LLPALP D W +G VKGL+ RGG E V
Sbjct: 623 LFDAHPPFQIDGNFGCTAGIAEMLVQSHDDAVHLLPALP-DAWRNGEVKGLRTRGGFEIV 681
Query: 749 SICWKDGDLHEVGIYSNYSNN 769
S+ WKDG + V + S N
Sbjct: 682 SLKWKDGKIESVVVKSTIGGN 702
>gi|160886122|ref|ZP_02067125.1| hypothetical protein BACOVA_04129 [Bacteroides ovatus ATCC 8483]
gi|423286896|ref|ZP_17265747.1| hypothetical protein HMPREF1069_00790 [Bacteroides ovatus
CL02T12C04]
gi|156108935|gb|EDO10680.1| hypothetical protein BACOVA_04129 [Bacteroides ovatus ATCC 8483]
gi|392674434|gb|EIY67882.1| hypothetical protein HMPREF1069_00790 [Bacteroides ovatus
CL02T12C04]
Length = 822
Score = 493 bits (1270), Expect = e-136, Method: Compositional matrix adjust.
Identities = 300/777 (38%), Positives = 445/777 (57%), Gaps = 57/777 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 582 WMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVC 641
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL D +HAY+++ LV E +K G Y NLF AHPPFQID NFG A +AEM
Sbjct: 642 LWARLLDGDHAYKLITDQLTLVRNEKKK---GSTYPNLFDAHPPFQIDGNFGCAAGIAEM 698
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
L+QS +YLLPALP W+ G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 699 LMQSYDGFIYLLPALP-TVWTEGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|315506426|ref|YP_004085313.1| cellulose-binding family II [Micromonospora sp. L5]
gi|315413045|gb|ADU11162.1| cellulose-binding family II [Micromonospora sp. L5]
Length = 936
Score = 493 bits (1270), Expect = e-136, Method: Compositional matrix adjust.
Identities = 294/760 (38%), Positives = 417/760 (54%), Gaps = 53/760 (6%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N L + ++ PA + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N
Sbjct: 44 NDLALWYDEPAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGA 103
Query: 70 KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
L+++R V + Q+ A + + G P YQ +GD+ L F + Y R
Sbjct: 104 ANLAEIRRRVFADQWTSAQDLINQTMMGSPGGQLAYQTVGDLRLAFGSAS---GATQYNR 160
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TAT Y G V + RE F+S PDQV+V +++ + +++F+ + DS
Sbjct: 161 TLDLTTATITTTYVQGGVRYQREMFASAPDQVMVLRLTADRANAITFSAAFDSPQRTTVS 220
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
I ++G + ++F A+ ++ GT+S+ L+V G
Sbjct: 221 SPDGATIALDG--------VSGSMEGVTGSVRFLALANAAVTG--GTVSS-SGGTLRVSG 269
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +L+ +S+ +N D + + L + ++++ L TRH DYQ LF+
Sbjct: 270 ATSVTVLVSIGTSY----VNYRTVNGDYQGIARNRLNAAKSVAVDQLRTRHRADYQALFN 325
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV+I L R T + + P+ R+ + DP LLFQFGRYLLISSSR
Sbjct: 326 RVTIDLGR--------TAAADQ----PTDVRIAQHASTNDPQFAALLFQFGRYLLISSSR 373
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+ L+P+WDS VN NL MNYW + NLSEC P+FD + L++ G
Sbjct: 374 PGTQPANLQGIWNDSLTPSWDSKYTVNANLPMNYWPADTTNLSECFLPVFDMVKDLTVTG 433
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++ AQ Y A GWV HH TD W +S G W +W GGAWL T +W+HY +T D F
Sbjct: 434 ARVAQAQYGAGGWVTHHNTDAWRGASVVDG-AFWGMWQTGGAWLSTLIWDHYLFTGDSGF 492
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L+ YP L+G A F LD L+ GYL TNPS SPE A A V TMD
Sbjct: 493 LQAN-YPALKGAAQFFLDTLVAHPTLGYLVTNPSNSPELAHHAN----ASVCAGPTMDNQ 547
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I+R++F A A+EVL + +V + RL P+++ G++ EW D+ + E HR
Sbjct: 548 ILRDLFDAAARASEVLGV-DTTFRSQVRTARDRLPPSRVGSRGNVQEWLADWVETERTHR 606
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GL P + IT P L +AA +TL+ RG++G GWS+ WK WARL D A++
Sbjct: 607 HVSHLYGLHPSNQITRRGTPALYEAARRTLELRGDDGTGWSLAWKINFWARLEDGARAHK 666
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
+++ +LV + L N+F HPPFQID NFG T+ +AEML+ S +L+LLP
Sbjct: 667 LLR---DLVRTDR-------LAPNMFDLHPPFQIDGNFGATSGIAEMLLHSHTGELHLLP 716
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
ALP W +G V GL+ RGG TVS+ W G E+ + ++
Sbjct: 717 ALP-TAWPAGQVAGLRGRGGYTVSLTWSSGQADEITVRAD 755
>gi|116248791|ref|YP_764632.1| hypothetical protein pRL120117 [Rhizobium leguminosarum bv. viciae
3841]
gi|115253441|emb|CAK11831.1| conserved hypothetical protein [Rhizobium leguminosarum bv. viciae
3841]
Length = 747
Score = 493 bits (1269), Expect = e-136, Method: Compositional matrix adjust.
Identities = 293/785 (37%), Positives = 430/785 (54%), Gaps = 60/785 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G E L++NE T W G P NPDA L VR
Sbjct: 8 YDTPAQLWTDALPLGNGRLGAMVFGDPLREHLQINEATFWAGGPYQPVNPDAFGHLETVR 67
Query: 77 SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ +YA+A A + K L P YQ +GD+ LEFD + + YRR LDL+TA
Sbjct: 68 QLIFDCRYADAEALAEKHLMARPIKQMSYQPIGDLHLEFDH---RESVSGYRRALDLDTA 124
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y+ + + RE F S D V+V ++S +++ +S+DS + +Q+
Sbjct: 125 IATSSYTADGIAYLREAFVSPVDGVLVLRLSADRKRAINCRISIDSPQQGEMRIGQGSQL 184
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
G+ GK A A ++F+ +++ + GT++A L VEG+D ++
Sbjct: 185 SFSGK--GKAESGIAAA------LRFA--FGVRLINSGGTVNA-SGGALSVEGADEVLVF 233
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A++SF D P + + L+S + + L H++++++LF +I L
Sbjct: 234 LDAATSFR----RYDDVLGHPERDIVDRLESAVSRDFVSLRDDHIEEHRRLFSAFAIDLR 289
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
+P ++P+ +R+ F +DP+L L QFGRYL+I+SSRPGTQ AN
Sbjct: 290 STPAA------------SLPTDQRIAGFAGGDDPALAALYVQFGRYLMIASSRPGTQPAN 337
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIWN + P W S NINL+MNYW P NL EC EPL + L+ G A V+
Sbjct: 338 LQGIWNAETDPPWGSKYTANINLQMNYWLPAPANLPECLEPLVEMAEELAETGKAMAHVH 397
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
Y A GWV+HH TD+W + G W LWP GG WL L + +Y D + + +R +P
Sbjct: 398 YRARGWVMHHNTDLWRATGPIDG-AKWGLWPTGGIWLMAQLLDACDYLDDAEAMRRRLFP 456
Query: 494 LLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
+ A FL D L+ G D +L TNPS SPE+ P G C MD +IR+ F
Sbjct: 457 IAREAAHFLFDVLVPFPGTD-HLVTNPSLSPENAH--PHGASICA--GPAMDSQLIRD-F 510
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHHRHLSH 609
++ V E LV + + LPRL P +I +G + EW +D+ + PE+HHRH+SH
Sbjct: 511 LGLLRPLAVSIGGEPDLVADIDRVLPRLAPDRIGANGQLQEWLEDWDMQAPEMHHRHVSH 570
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
L+GL+P I ++K P+L AA ++L+ RG++ GW I W+ LWARL D HA+ ++K
Sbjct: 571 LYGLYPSWQIDMDKTPELAAAARRSLEIRGDDATGWGIGWRINLWARLRDGNHAHNVLKL 630
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPW 729
L PE Y NLF AHPPFQID NFG A + EMLVQS +++LLPALP
Sbjct: 631 LLT---PERS-------YKNLFDAHPPFQIDGNFGGAAGIVEMLVQSRPGEIHLLPALP- 679
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLS 789
W G ++GL+ RGG + + W+DG+ + + ++ + + L + T KV+L+
Sbjct: 680 TAWPGGSIRGLRLRGGMLLDLDWEDGEPLTIRLTASRNVS-----SILRFGQTRRKVDLA 734
Query: 790 AGKIY 794
AG+ +
Sbjct: 735 AGESF 739
>gi|383122650|ref|ZP_09943342.1| hypothetical protein BSIG_0605 [Bacteroides sp. 1_1_6]
gi|382984352|gb|EES70332.2| hypothetical protein BSIG_0605 [Bacteroides sp. 1_1_6]
Length = 822
Score = 493 bits (1269), Expect = e-136, Method: Compositional matrix adjust.
Identities = 300/773 (38%), Positives = 445/773 (57%), Gaps = 49/773 (6%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E + K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P +
Sbjct: 23 EKKVSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGIEQIQLNEETIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR L+ +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPNALEYIPKVRELIFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YSD--YYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H N++ EG C + ++ ++ KG ++F L + ++G A
Sbjct: 197 S---PHQDAMINSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTAR---NQGGKIAC 245
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A + + +++F+ N D + T + S L +++ H
Sbjct: 246 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVHPFAEAKKNH 301
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++ Y++ RVS+ L E+ V + +RV++F+ D LV FQFG
Sbjct: 302 VEFYRQYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 349
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLS+ EPLF
Sbjct: 350 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 409
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S +G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 410 LIKEVSESGKETARIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 468
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L+G F + ++ E +L PS SPE+ DGK A
Sbjct: 469 YLYTGDTEFL-RSVYPILKGSGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGNDGK-ATT 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ TMD +I ++++AIISA+ +L+ +++ + + L + P ++ G + EW D
Sbjct: 527 AAGCTMDNQLIFDLWTAIISASRILDTDKE-FAAHLEQRLKEMAPMQVGHWGQLQEWMFD 585
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ DP HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWAR
Sbjct: 586 WDDPNDVHRHVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWAR 645
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEML+QS
Sbjct: 646 LLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQS 702
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+YLLPALP W G V G+ ARGG + + WK+G ++ + + S+ N
Sbjct: 703 YDGFIYLLPALP-TLWKDGSVTGIIARGGFELDLSWKNGKVNRLVVKSHKGGN 754
>gi|423346901|ref|ZP_17324588.1| hypothetical protein HMPREF1060_02260 [Parabacteroides merdae
CL03T12C32]
gi|409218562|gb|EKN11530.1| hypothetical protein HMPREF1060_02260 [Parabacteroides merdae
CL03T12C32]
Length = 809
Score = 493 bits (1268), Expect = e-136, Method: Compositional matrix adjust.
Identities = 293/777 (37%), Positives = 427/777 (54%), Gaps = 52/777 (6%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T PL F+ PA + + P+GNGRLG M GGV +E + LNE ++W+G D
Sbjct: 17 NMPEVKTGKPLSYHFDAPAGIWEASFPLGNGRLGLMPDGGVDTENIVLNEISMWSGSKQD 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIE 109
NP A +L +R L+ G+ EA F G A+V YQLLG++
Sbjct: 77 TDNPQAYHSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSGQGQGANVPYGSYQLLGNLV 136
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L +D + YRREL+L+ A A + G V + RE F+S D + V ++
Sbjct: 137 LNYDYQGTSDSIFGYRRELNLDNAIATASFRRGKVTYNREVFTSFADDLGVIHLTADADR 196
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F+ ++ +++ N ++M+G+ P + KG+++++ +++
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYAS--RVRVIL 247
Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
+G D + V + A+LL+ +A+ FD KD + S L +
Sbjct: 248 PKGGNVTPGDSTVSVRNASEAILLVSMATDYFD----------KDLAGKVSSLLANAEKK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
++ L H+ Y+ LF RV + L S S EN+ P ER+ +F + +DP
Sbjct: 298 DFASLKKGHIAAYRSLFGRVELDLGHS---------SRENL---PMDERLAAFHENPDDP 345
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
SL L FQFGRYLLISS+R G NLQG+W ++ W+ H+NINL+MN+W + N
Sbjct: 346 SLAALYFQFGRYLLISSTRVGLLPPNLQGLWCNTINTPWNGDYHLNINLQMNHWPAEVAN 405
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 LSELHLPLVEWTKQQVASGERTAKAYYNARGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWLC HL+ HY YT+D+++L K YP+L+G + F +D L+E + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASLFFVDMLVEDPRNKYLVTAPTTSPENGY 523
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
P+GK A + STMD I+RE+F+ I AA++L + A ++ RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAADILGL-DSAFAGELAAKRARLMPTTIGK 582
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
DG IMEW + +++ E HHRH+SHL+GL+PG+ I+ E+ P+L +AA K+L RG++ GWS
Sbjct: 583 DGCIMEWLEPYEEVEPHHRHVSHLYGLYPGNEISTERTPELAEAARKSLIARGDKSTGWS 642
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ WK WARLHD +HAY++ L VD + GG Y NLF AHPPFQID NFG
Sbjct: 643 MGWKMNFWARLHDGDHAYKLFADLLRPCVDRKTNMTNGGGTYPNLFCAHPPFQIDGNFGG 702
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
A +AEMLVQS ++ LLPALP W SG KGLK RGG VS WK+G L E G+
Sbjct: 703 CAGIAEMLVQSQTGEIELLPALP-SAWKSGSFKGLKVRGGGEVSAKWKEGRLAEAGL 758
>gi|393782187|ref|ZP_10370376.1| hypothetical protein HMPREF1071_01244 [Bacteroides salyersiae
CL02T12C01]
gi|392674221|gb|EIY67670.1| hypothetical protein HMPREF1071_01244 [Bacteroides salyersiae
CL02T12C01]
Length = 1400
Score = 492 bits (1267), Expect = e-136, Method: Compositional matrix adjust.
Identities = 296/783 (37%), Positives = 435/783 (55%), Gaps = 58/783 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA ++ +A+P+GNGRLGAMV+G +T+++NEDT W+G P + NP+A L
Sbjct: 27 LKLWYDRPADYWVEALPLGNGRLGAMVYGIASQDTIQINEDTYWSGSPYNNANPNALTHL 86
Query: 73 SDVRSLVDSGQYAEATA-------ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR 125
D+R+ +++G+YAEA A + GH +Y+ +G++ L+F ++H Y
Sbjct: 87 EDIRNYINNGEYAEAQKLALANIIADRNITGHGM-IYESIGNLLLDFPENH--KTPSNYY 143
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH- 184
RELDL+ A A++ Y+V V +TRE F+S DQ+I+ KIS + G ++F S L +
Sbjct: 144 RELDLSNAVAKITYTVDGVNYTREVFTSLADQLIIIKISADQPGKVTFKTSFVGPLKTNR 203
Query: 185 -----SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
V G + ++ GK+ P + ++ IK+ D G+ +A +
Sbjct: 204 TKVTVKLVEGADNMLSVYTEGGKKTEENI-----PNLLHAHSL--IKVVADGGSQTA-AN 255
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
L V ++ A + + +++F ++ D D + + L + Y H+
Sbjct: 256 SSLNVTNANSACIYISTATNF----VSYKDISADSEARAKEYLDKF-DKDYEQAKADHIA 310
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
YQ+ F RV++ L + SE+ + P+ R++ F T DPSL L FQFGRY
Sbjct: 311 KYQEQFGRVTLNLGNN---------SEQ--EKKPTDVRIEEFSTVNDPSLAALYFQFGRY 359
Query: 360 LLISSSRPGTQVANLQGIWNEDLS--PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSS+PGTQ ANLQGIWN + P WDS NIN+EMNYW + NLSEC P
Sbjct: 360 LLISSSQPGTQPANLQGIWNPNAGQYPAWDSKYTANINVEMNYWPAEVTNLSECHNPFLQ 419
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S+ G ++A Y GW +HH TDIW +S+ K +WP AW C HLWEH
Sbjct: 420 MVKDVSVTGEESAGKMYGCRGWTLHHNTDIW-RSTGAVDKSACGVWPTCNAWFCFHLWEH 478
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE---FIAPD--- 530
Y +T D++FL + YP+L+ + F D+LI + + GY +PS SPE+ F D
Sbjct: 479 YLFTGDKEFLAE-IYPVLKSASEFYQDFLITDPNTGYKVVSPSNSPENHPGLFSYTDDSG 537
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDG 588
+ A + TMD ++ ++ I AAE+L ++ + + LK L +L P + + G
Sbjct: 538 SKQNAAIFSGVTMDNQMVYDLLRNTIEAAEILNTDKGFVAD--LKELKEQLPPMHVGKYG 595
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
+ EW +D+ HRH+SHL+G+FPG I+ N L +A +K+L RG+E GWS+
Sbjct: 596 QLQEWLEDWDRESSGHRHVSHLWGMFPGTQISPYTNSALFQAVKKSLVGRGDESRGWSMG 655
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHE-KHFEGGLYSNLFAAHPPFQIDANFGFTA 707
WK LWARL D HAY++++ L DP GG Y+N+F AHPPFQID NFG A
Sbjct: 656 WKVCLWARLQDGNHAYQLIQNQLKLKDPNVTISDANGGTYANMFDAHPPFQIDGNFGCCA 715
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNY 766
+AEMLVQS ++LLPALP D WS G V GLKARGG E V + WK G + V + S
Sbjct: 716 GIAEMLVQSHDGAVHLLPALP-DVWSEGKVTGLKARGGFEIVDMQWKWGKIVSVTVKSGI 774
Query: 767 SNN 769
N
Sbjct: 775 GGN 777
>gi|333381846|ref|ZP_08473525.1| hypothetical protein HMPREF9455_01691 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829775|gb|EGK02421.1| hypothetical protein HMPREF9455_01691 [Dysgonomonas gadei ATCC
BAA-286]
Length = 808
Score = 492 bits (1267), Expect = e-136, Method: Compositional matrix adjust.
Identities = 296/766 (38%), Positives = 409/766 (53%), Gaps = 61/766 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PAK + +A+P+GN RLG MV+G E L+LNE+T+W G P NP A AL
Sbjct: 24 LKLWYNTPAKIWEEALPLGNSRLGVMVYGIPEKEELQLNEETIWGGGPYRNDNPKALGAL 83
Query: 73 SDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
+ R L+ G+ EA + F G P +Q G + L F H Y + Y RE
Sbjct: 84 PEARELIFKGKSREADQLINRTFFTKTHGMP---FQTAGSVILNFP-GHQNY--QDYSRE 137
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL+ A A +Y+V V++TRE FSS D VI+ +I+ G+L+F + H+
Sbjct: 138 LDLDKALAITRYTVNGVKYTREVFSSFADDVIIMRITAGRKGTLNFETEYTNN-SQHTIS 196
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
+N +I+EG+ D +GI E KI T+ D K++V GS
Sbjct: 197 KKDNILILEGK------------GSDHEGI------EGKIRYQIHTLIRNHDGKIEVTGS 238
Query: 248 DWAVLLLVASS---SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
++ ++ S F+N + DP ++ AL Y H D Y K
Sbjct: 239 KISISGATVATIYISIGTNFLNYKSVEGDPAKKASDALAKALKTDYRSALKNHSDIYGKQ 298
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F R + L P+ + T +R+ FQ + DP+LV LL QFGRYLLI S
Sbjct: 299 FKRFKLDLGNVPEAMKLTTT-----------QRIIDFQKNHDPALVTLLTQFGRYLLICS 347
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+ G Q ANLQGIW + P WDS +NIN EMNYW + NLSE P+ + LS
Sbjct: 348 SQLGGQPANLQGIWCNSMHPAWDSKYTININAEMNYWPAEVTNLSETHLPMIQMVKDLSE 407
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +TA+ Y A GWV HH TDIW +S +WP GGAWL HLWEHY +T D+
Sbjct: 408 SGQQTAKTMYGARGWVAHHNTDIWRVTSPVDFAAA-GMWPTGGAWLVQHLWEHYLFTGDK 466
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
+L YP ++G A + L L+E G++ PS SPEH +S TMD
Sbjct: 467 KYLAD-VYPAMKGAADYFLSSLVEHPQYGWMVVCPSVSPEH---------GPMSAGCTMD 516
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
++ +V + A +L +NE+ ++L + +L P I + + EW +D DP+
Sbjct: 517 NQLVFDVLTRTAQANNILGENEE-YRNQLLAMVSKLPPMHIGKYSQLQEWLEDKDDPQNE 575
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH+SHL+GL+PG+ I+ NP+L +AA +L RG+ GWSI WK LWARL HA
Sbjct: 576 HRHVSHLYGLYPGNQISPYTNPELFEAARNSLIYRGDMATGWSIGWKVNLWARLLHGNHA 635
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
Y++V + L +E +G Y N+F AHPPFQID NFG TA +AEMLVQS ++L
Sbjct: 636 YKIVSNMLTLAGKGNE---DGRTYPNMFTAHPPFQIDGNFGLTAGIAEMLVQSHDGAVHL 692
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
LPALP D W +G V G+ ARGG +S+ WKDG++ E+ I S N
Sbjct: 693 LPALP-DVWKNGSVSGIMARGGFEISMKWKDGEVSEISILSKLGGN 737
>gi|282878225|ref|ZP_06287021.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
gi|281299643|gb|EFA92016.1| conserved hypothetical protein [Prevotella buccalis ATCC 35310]
Length = 793
Score = 492 bits (1267), Expect = e-136, Method: Compositional matrix adjust.
Identities = 298/804 (37%), Positives = 437/804 (54%), Gaps = 53/804 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
P+++ ++ PA++F +++PIGNGR+GA+V+GG + LN+ TLWTG P D + +A +
Sbjct: 23 PMQLWYDKPAQYFEESMPIGNGRMGALVYGGTRDNLIYLNDITLWTGQPVDPNLDQNAHQ 82
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ +R + Y +A + +++ G + YQ L + L D + Y R LD+
Sbjct: 83 WIPAIREALFKEDYRKADSLQLRVQGPNSQYYQPLATLHL-LDPRGGQ--ATNYTRTLDI 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A YS+ V+ RE+F+S+PD VI I+ ++ S+S V+L + + HS
Sbjct: 140 DKALLTDSYSLNGVKIKREYFASHPDSVICIHITANKPRSISLEVNLSAQIP-HSVKAAG 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N I M+G G + I F ++L + +G I A + L ++ ++ A
Sbjct: 199 NLITMKGHAMG----------NPENSIHFCSVL--RAVTKQGKIQATDSTLLIIDATE-A 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L V +SF+G +P K +++ +++ Y + +H+ DY + R+ +
Sbjct: 246 TLFFVNETSFNGFDKHPVRQGKPCEQLALAHQKALEKKDYQTIKKQHVADYTHYYDRMKL 305
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELLFQFGRYLLISSSRPG 368
L S VTD CS + +++K + Q +P L L Q+GRYLLI+SSR
Sbjct: 306 FLGGS----VTD-CSRT------TEQQLKDYTDQGGHNPYLETLYMQYGRYLLIASSRTK 354
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
ANLQG+W+ L W S VNINLE NYW + NL E +PLF F+ L+ NG
Sbjct: 355 GIPANLQGLWSHYLRAPWRSNYTVNINLEENYWLAEVANLGEMAKPLFTFMQALAANGRH 414
Query: 429 TAQVNY-LASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA+ Y + GW H +D+WA ++ R W+ W MGGAWL +LWEHY + D
Sbjct: 415 TAKNYYGINRGWCSSHNSDVWAMTNPVGEKRESPEWSNWNMGGAWLTQNLWEHYRFNPDA 474
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL A PLLEG ++F+LDWL+E + L T PSTSPE+E+ P+G Y T
Sbjct: 475 QFLNDTALPLLEGASAFMLDWLVENPKNPSELITAPSTSPENEYKTPEGYHGTTCYGGTA 534
Query: 543 DMAIIREVFSAIISAAEVLEKN------EDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
D+AIIRE+F I+ AE + K + L++ + SL RL P I G + EW D
Sbjct: 535 DLAIIRELF---INTAEAINKKGADYARQSQLLKDIEASLKRLHPYTIGHLGDLNEWYYD 591
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ D ++ HRH SHL GLFPGH +++++ P L AAEKTL ++G+ GWS W+ LWAR
Sbjct: 592 WDDWDIKHRHQSHLIGLFPGHHLSLKETPQLALAAEKTLLQKGDHTTGWSTGWRINLWAR 651
Query: 657 LHDQEHAYRMVKRLFNLVDPEH----EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L + AY M ++L V P+ +K GG Y NL AHPPFQID NFG TA V EM
Sbjct: 652 LRKAKQAYHMYQKLLTYVSPDQYQGADKRSSGGTYPNLMDAHPPFQIDGNFGGTAGVCEM 711
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHD 772
L+QST N+LYLLPALP D W G V+G++ARGG VS+ W++G + V + H
Sbjct: 712 LLQSTDNELYLLPALP-DAWKDGEVRGIRARGGYEVSMKWRNGQVEWVQLKP--GTQHHV 768
Query: 773 SFKTLHYRGTSVKVNLSAGKIYTF 796
T++ G +V L K T
Sbjct: 769 KTVTVYMNGKLTRVGLKRDKTTTI 792
>gi|257053761|ref|YP_003131594.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
utahensis DSM 12940]
gi|256692524|gb|ACV12861.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
utahensis DSM 12940]
Length = 784
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 291/766 (37%), Positives = 416/766 (54%), Gaps = 60/766 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + +A+PIGNGRLG M++G E ++ N DTLW G D TNPDA + + +VR
Sbjct: 13 YDEPASAWLEALPIGNGRLGGMIFGRPGCERVQFNADTLWAGGHEDRTNPDAREHVEEVR 72
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ G+ A A A KL G P + YQ GD+ ++ A YRRELDL+
Sbjct: 73 RLLFDGEVQRAQALADEKLMGDPIRLRPYQTFGDLSIDVGHD----AVTDYRRELDLSAG 128
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
ARV+Y + RE+F+S PD IV +++ E G+++ V LD D V + +
Sbjct: 129 VARVRYDHEGTTYVREYFASAPDDAIVIRLTAEEPGAVTATVGLDREQDADDSVR-DGTL 187
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS-----D 248
+ GR + +G+ F A ++ D G + + E S +
Sbjct: 188 QLRGRVVDDPDDDRGAGG---EGMAFEA--RASVTADAGNVQRVTGADAPEESSVGFRAE 242
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A + + + F G +DP + S L ++ + SY DL H+ D+++LF RV
Sbjct: 243 AADAMTIVLTGFTG------HETEDPGAACESVLDAVADQSYDDLRDTHVADHRELFDRV 296
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L P D TD E +D V + E DP+L L QFGRYLLI+SSRPG
Sbjct: 297 ELDLG-EPLDRPTD----ERLDRVATGE--------ADPNLTALYAQFGRYLLIASSRPG 343
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T+ ANLQG+WN++ P W+S +NINLEMNYW +L NL+EC PL+DF+ L G +
Sbjct: 344 TEPANLQGVWNQEFDPPWNSGYTLNINLEMNYWPALQTNLAECAAPLYDFVDDLREPGRR 403
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ +Y +G+ +HH +D+W +++A W LWPMG AWL +++HY +T D D L
Sbjct: 404 VAETHYDCAGFAVHHNSDLW-RNAAPVDGAHWGLWPMGAAWLSRLVFDHYAFTRDEDHLR 462
Query: 489 KRAYPLLEGCASFLLDWLIE--GHDG----YLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+ A P+L A+F+ D+L+E +G +L T PS SPE+ ++ DG+ A V+Y+ TM
Sbjct: 463 ETAEPILREAAAFVADFLVEHPAEEGEAEDWLVTAPSNSPENAYVTDDGQEATVTYAPTM 522
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D+ + R++F I+AAE+LE ED + + +L RL P ++ E G + EW +D+ + +
Sbjct: 523 DVQLTRDLFEHTIAAAEILEV-EDEFHDDLRAALDRLPPMQVGEHGQLQEWIEDYDEADP 581
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHD 659
HRH+SHL+G P IT P L A E TL +R E G GWS W +ARL D
Sbjct: 582 GHRHISHLYGAHPSDQITSRNTPKLADAVETTLDRRLEHGGGHTGWSAAWLVNQFARLED 641
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
E A+ V+ L L D NLF HPPFQID NFG TA + EML+ S +
Sbjct: 642 AERAHEWVRTL--LAD---------STAPNLFDLHPPFQIDGNFGATAGITEMLLGSHAD 690
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
++ LLPALP D W+ G V GL+ARG V I W G L I S
Sbjct: 691 EIRLLPALP-DAWAEGSVSGLRARGDFGVDIEWSGGSLDSATIRSG 735
>gi|224538245|ref|ZP_03678784.1| hypothetical protein BACCELL_03136 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520142|gb|EEF89247.1| hypothetical protein BACCELL_03136 [Bacteroides cellulosilyticus
DSM 14838]
Length = 827
Score = 492 bits (1266), Expect = e-136, Method: Compositional matrix adjust.
Identities = 285/764 (37%), Positives = 428/764 (56%), Gaps = 52/764 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++GPA + +A+P+GNGR+GAMV+G E +LNE+T+W G P + TNP A AL
Sbjct: 28 LKLWYDGPATQWVEALPLGNGRIGAMVFGDPVHEQFQLNEETVWGGSPYNNTNPKAKDAL 87
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA T S G P YQ +G + L+FD Y + Y R
Sbjct: 88 PRIRQLIFEGKNKEAQELCGPTICSPSANGMP---YQTVGSLHLDFDGIS-NYND--YYR 141
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
+LD+ A A +++ V +TRE ++S PDQV+V +++ S+ S+SF + ++
Sbjct: 142 DLDIAKAIATTRFTTNGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAKYTTPYKSNVV 201
Query: 187 --VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
++ ++ + G KAN ++ KG ++F+A+ +I + G++ A D L+
Sbjct: 202 RSISSRKELQLSG---------KANDHEGIKGKVEFTAL--TRIENSGGSLEATSDSTLQ 250
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+ ++ +V L V S F+N D + S + L+ + N +Y+ H++ YQK
Sbjct: 251 VKNAN-SVTLYV---SIGTNFVNYKDVSGNALSTAQKYLKQV-NKNYAKSKAAHINAYQK 305
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L R+ + P+ RVK F T DP + L FQFGRYLLI
Sbjct: 306 YFNRVSLDLGRNAQA------------DKPTDVRVKEFSTSFDPQMAALYFQFGRYLLIC 353
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN L WD +IN+EMNYW + +L E EP + +
Sbjct: 354 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHEPFLQLVKEAA 413
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
I G ++A + Y GW +HH TDIW + A G + +WP AW C HLW+ Y ++ D
Sbjct: 414 IQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-SYGVWPTCNAWFCQHLWDRYLFSGD 471
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+++L + YPL+ G F LD+L+ E + +L PS SPE+ + + V +TM
Sbjct: 472 KNYLAE-VYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENSPVVNGKRTFVVVAGTTM 530
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D ++ ++F I+AA ++ +N A + + + L P ++ G + EW D+ +P+
Sbjct: 531 DNQMVYDLFYNTIAAAGLMNENT-AFTDSLQTVVNNLAPMQVGRWGQLQEWMHDWDNPKD 589
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH+SHL+GL+PG I+ +P L +AA+K+L RG+ GWS+ WK LWARL D H
Sbjct: 590 RHRHVSHLWGLYPGRQISAYNSPILFEAAKKSLIGRGDHSTGWSMGWKVCLWARLLDGNH 649
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY+++ L EK GG Y NLF AHPPFQID NFG +A +AEM VQS ++
Sbjct: 650 AYKLITE--QLHPTTDEKGQNGGTYPNLFDAHPPFQIDGNFGCSAGIAEMFVQSHDGAIH 707
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVS-ICWKDGDLHEVGIYSN 765
LLPALP D W G +KG++ RGG TV + W++G+L I SN
Sbjct: 708 LLPALP-DVWKQGTLKGIRCRGGFTVKEMKWENGELQTAVITSN 750
>gi|423221590|ref|ZP_17208060.1| hypothetical protein HMPREF1062_00246 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392645917|gb|EIY39637.1| hypothetical protein HMPREF1062_00246 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 826
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 285/764 (37%), Positives = 428/764 (56%), Gaps = 52/764 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++GPA + +A+P+GNGR+GAMV+G E +LNE+T+W G P + TNP A AL
Sbjct: 27 LKLWYDGPATQWVEALPLGNGRIGAMVFGDPVHEQFQLNEETVWGGSPYNNTNPKAKDAL 86
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA T S G P YQ +G + L+FD Y + Y R
Sbjct: 87 PRIRQLIFEGKNKEAQELCGPTICSPSANGMP---YQTVGSLHLDFDGIS-NYND--YYR 140
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
+LD+ A A +++ V +TRE ++S PDQV+V +++ S+ S+SF + ++
Sbjct: 141 DLDIAKAIATTRFTTNGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAKYTTPYKSNVV 200
Query: 187 --VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
++ ++ + G KAN ++ KG ++F+A+ +I + G++ A D L+
Sbjct: 201 RSISSRKELQLSG---------KANDHEGIKGKVEFTAL--TRIENSGGSLEATSDSTLQ 249
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+ ++ +V L V S F+N D + S + L+ + N +Y+ H++ YQK
Sbjct: 250 VKNAN-SVTLYV---SIGTNFVNYKDVSGNALSTAQKYLKQV-NKNYAKSKAAHINAYQK 304
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L R+ + P+ RVK F T DP + L FQFGRYLLI
Sbjct: 305 YFNRVSLDLGRNAQA------------DKPTDVRVKEFSTSFDPQMAALYFQFGRYLLIC 352
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN L WD +IN+EMNYW + +L E EP + +
Sbjct: 353 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHEPFLQLVKEAA 412
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
I G ++A + Y GW +HH TDIW + A G + +WP AW C HLW+ Y ++ D
Sbjct: 413 IQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGP-SYGVWPTCNAWFCQHLWDRYLFSGD 470
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+++L + YPL+ G F LD+L+ E + +L PS SPE+ + + V +TM
Sbjct: 471 KNYLAE-VYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENSPVVNGKRTFVVVAGTTM 529
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D ++ ++F I+AA ++ +N A + + + L P ++ G + EW D+ +P+
Sbjct: 530 DNQMVYDLFYNTIAAAGLMNENT-AFTDSLQTVVNNLAPMQVGRWGQLQEWMHDWDNPKD 588
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH+SHL+GL+PG I+ +P L +AA+K+L RG+ GWS+ WK LWARL D H
Sbjct: 589 RHRHVSHLWGLYPGRQISAYNSPILFEAAKKSLIGRGDHSTGWSMGWKVCLWARLLDGNH 648
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY+++ L EK GG Y NLF AHPPFQID NFG +A +AEM VQS ++
Sbjct: 649 AYKLITE--QLHPTTDEKGQNGGTYPNLFDAHPPFQIDGNFGCSAGIAEMFVQSHDGAIH 706
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVS-ICWKDGDLHEVGIYSN 765
LLPALP D W G +KG++ RGG TV + W++G+L I SN
Sbjct: 707 LLPALP-DVWKQGTLKGIRCRGGFTVKEMKWENGELQTAVITSN 749
>gi|295132887|ref|YP_003583563.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
gi|294980902|gb|ADF51367.1| six-hairpin glycosidase [Zunongwangia profunda SM-A87]
Length = 820
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 289/765 (37%), Positives = 435/765 (56%), Gaps = 50/765 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + ++ PAK + +A+P+GNGRLGAMV+G ET++LNE+T+W G PG+ + + L
Sbjct: 27 MTLNYDEPAKVWEEALPVGNGRLGAMVFGRTGMETIQLNEETVWAGEPGNNVVTLSEEQL 86
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD----VYQLLGDIELEFDDSHLKYAEETYRREL 128
++R + +Y +A + K + YQ +G++ L F +S+ A Y+REL
Sbjct: 87 EEIRKAIFQEEYQKAQQLADKYLSKKDNNSGMSYQTVGNLILNFPNSN---AVRDYKREL 143
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D++ A + V Y G V + R SS PD VI+ +++ ++ GS+SF + L S +H
Sbjct: 144 DISKAVSTVTYKTGGVAYKRRIISSFPDDVIMVELTANKPGSISFEMGLKSPHKSHDIQI 203
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N+++ + G ++ ++ KG ++F I + KI + G I E++ LK+ G+
Sbjct: 204 KNDEVWLSGT---------SSDQENKKGKVKFLVIAKPKI--EGGRIETTENR-LKITGA 251
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
+ AV+ + +S+F N D +D S++++ L ++ + H+ +YQ+ F+R
Sbjct: 252 NRAVIYISIASNFK----NYKDLSEDAESKAIALLNAVYIKEFGKCLDAHIAEYQQYFNR 307
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V + D+ T + D R++ F +DP L+ L FQFGRYLLISSS P
Sbjct: 308 VQL-------DLGTSNAINKTTDI-----RLEEFNDSDDPQLIALYFQFGRYLLISSSMP 355
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ ANLQGIWN++++ WDS VNIN EMNYW + NLSE +PLF + +S G
Sbjct: 356 GTQPANLQGIWNKEINAPWDSKYTVNINTEMNYWPAEVANLSEMHKPLFGLIKDISETGK 415
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
++A+ Y A GW +HH TDIW + S + LWP GG WL HLW+HY +T D FL
Sbjct: 416 ESAEKMYHARGWNMHHNTDIW-RISGVVDPPFYGLWPHGGGWLSQHLWQHYLFTGDTKFL 474
Query: 488 EKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YP+L+G A F D L E + ++ NPS SPE+ + ++ +TM I
Sbjct: 475 -KEVYPILKGTALFYKDILQQEPENKWMVVNPSNSPENGHTGG----SSLAAGTTMGNQI 529
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+++VFS + A+++L NED +K++ P L P +I + G + EW +D+ + HR
Sbjct: 530 VQDVFSNFLEASQIL--NEDKKFSDSIKNVTPNLAPMQIGKWGQLQEWMKDWDRQDDKHR 587
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFP + I+ + P L AA+ +L RG+E GWS+ WK LWARL D +HA
Sbjct: 588 HVSHLYGLFPSNLISPYRTPKLFAAAKNSLLARGDESTGWSMGWKVNLWARLLDGDHALA 647
Query: 666 MVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
++ L H E GG Y NLF AHPPFQID NFG TA +AEML+QS +++L
Sbjct: 648 LIHD--QLTPSRQAGHGEKGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLLQSQDGAVHIL 705
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP W+ G VKGLKARG + I W++ +V I S N
Sbjct: 706 PALP-STWNKGEVKGLKARGNFEIDIAWEENKPVKVNITSAIGGN 749
>gi|325299782|ref|YP_004259699.1| alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
gi|324319335|gb|ADY37226.1| Alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
Length = 826
Score = 491 bits (1265), Expect = e-136, Method: Compositional matrix adjust.
Identities = 290/782 (37%), Positives = 433/782 (55%), Gaps = 55/782 (7%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ N KI ++ PA ++ +A+P+GNGR+ AMV+G E L+LNE+T+ G P
Sbjct: 15 VCNVTGLCAQESYKIWYDKPAAYWEEALPVGNGRIAAMVFGNARMERLQLNEETVSAGSP 74
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEAT----AASVKLFGHPADVYQLLGDIELEFDDSH 116
NP+A AL ++R L+ G+ EA A + G+ YQ +G++ + + + H
Sbjct: 75 YQNYNPEAKAALPEIRRLIFEGKNEEAQLLAGKAIISQVGNEMP-YQTVGNLNIRYKN-H 132
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
++ Y R+LD++ A A +Y VG+ E+T E F+S DQ+IV I S++G++ +V
Sbjct: 133 ENVSD--YYRDLDISRAVATTRYRVGSTEYTEETFASFTDQLIVKHIKASKAGAIDCDVF 190
Query: 177 LDSLLDN-HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D+ + G + +EG G + P + + A L++K+ + S
Sbjct: 191 FDTPMKRPQRSAIGKKGLRLEGMADGTKFFPGK--------VHYCADLQVKLKGGKAETS 242
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
D L V+G+ L + +++F +N D DP + L++ Y +
Sbjct: 243 --NDTLLSVKGATELTLYISMATNF----VNYKDVSADPYVRNRVYLKNAGK-EYEKAKS 295
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ Y++ F RV++ + +P+ +++ +D R+K F + DP L+ L FQ
Sbjct: 296 AHIAAYREQFDRVTLDMGTTPQ-------ADKPMDV-----RIKEFASSYDPHLIALYFQ 343
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSS+PG Q ANLQG WN P W+ NIN EMNYW + NL E EPL
Sbjct: 344 YGRYLLISSSQPGCQPANLQGKWNAKTKPAWNCNYTTNINTEMNYWPAEVTNLPELHEPL 403
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL---WPMGGAWLCT 472
+ LS NG + A Y GWV+HH TD+W + G V +A WP+ AWLC
Sbjct: 404 IRMIRELSENGKEAASKMYGCRGWVLHHNTDLWRMT----GAVDYAYCGTWPVCNAWLCQ 459
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPD- 530
HLW+ Y Y+ D+ +L K YP+++ + F +D+L+ + + GYL PS SPE+ AP
Sbjct: 460 HLWDRYLYSGDKQYL-KEVYPIMKSASQFFVDFLVRDPNTGYLVVTPSNSPEN---APRW 515
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDG 588
K A + TMD ++ ++FS AA VL NED L L+S+ R L P ++ + G
Sbjct: 516 IKKKANLFAGITMDNQLVFDLFSNTCRAASVL--NEDTLFCDTLRSMRRQLPPMQVGQYG 573
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
+ EW +D+ P+ HHRH+SHL+GLFPG+ I+ ++P L +AA TL +RG+ GWS+
Sbjct: 574 QLQEWFEDWDRPDDHHRHISHLWGLFPGYQISPYRSPVLFEAARNTLIQRGDPSTGWSMG 633
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
WK WAR+ D +HAY+++K V PE +K GG Y NLF AHPPFQID NFG TA
Sbjct: 634 WKVCFWARMLDGDHAYKLIKNQLTYVSPESQKGQGGGTYPNLFDAHPPFQIDGNFGCTAG 693
Query: 709 VAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSNYS 767
+AEMLVQS + LLPALP +W SG +KGL+ RGG + + W++G L + I S
Sbjct: 694 IAEMLVQSHDGAVQLLPALP-SEWKSGTIKGLRVRGGFLLEELSWENGKLKKAVIRSVIG 752
Query: 768 NN 769
N
Sbjct: 753 GN 754
>gi|116622997|ref|YP_825153.1| hypothetical protein Acid_3901 [Candidatus Solibacter usitatus
Ellin6076]
gi|116226159|gb|ABJ84868.1| conserved hypothetical protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 759
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 293/776 (37%), Positives = 416/776 (53%), Gaps = 99/776 (12%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
+ +PL + + PA +TDA+P+GNGR+GAMV+GG E ++ NE T+WTG P DY + A
Sbjct: 15 SQSPLTLWYTHPADIWTDALPVGNGRMGAMVFGGAAHERIQFNEQTVWTGEPHDYAHKGA 74
Query: 69 PKALSDVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYR 125
K+L +R L+ +G+ EA A A + P YQ LGD+ +E + A Y+
Sbjct: 75 SKSLQQIRELLWAGKQKEAEALAMTEFMSEPLHQKAYQALGDLIIETPGAETPTA---YK 131
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL+T A +++ + + RE F+S+P IV ++ S+ S +L H+
Sbjct: 132 RSLDLDTGIAVTEFTANGITYRREVFASHPASAIVVHLTSSQPAEFS-----ATLKCAHA 186
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
G M G+ + I+F + LE I
Sbjct: 187 ACKGG--ATMSGQV-------------ENSAIRFDSRLEKHIDSPTS------------- 218
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
A LLL A+++F D DP +++ L +I N SY L H+ D+Q LF
Sbjct: 219 ----ATLLLTAATNFK----TYQDVTADPVQRNLATLVAIGNKSYDALRAEHIRDHQSLF 270
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV++ L + +P+ ER+ +F DP+L+ LLFQFGRYL+I SS
Sbjct: 271 RRVTLDLGATAAS------------QLPTDERIAAFAKGSDPALITLLFQFGRYLMIGSS 318
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG Q ANLQG+WNE +P WDS NIN EMNYW NLSEC PLFD L L+ +
Sbjct: 319 RPGGQPANLQGLWNESNTPAWDSKYTDNINTEMNYWPVEETNLSECHLPLFDALKDLAQS 378
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G+ TA+ Y A GWV+HH D+W + +A +W GGAWL THLWEHY +T DR+
Sbjct: 379 GAITAREQYNARGWVLHHNFDLW-RGTAPINASNHGIWQTGGAWLSTHLWEHYLFTGDRE 437
Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL AYPL++G ++F +D L++ G+L T PS SPE + TMD
Sbjct: 438 FLRAAAYPLMKGASTFFIDALVKDPKTGFLYTGPSNSPEQ---------GGLVMGPTMDR 488
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFKDPEVH 603
I+R +F I+AA++L N D +++ L +L + + P +I + G + EW +D DP+
Sbjct: 489 EIVRSLFGETIAAAKIL--NLDPALQEQLATLRKQIAPLQIGKYGQLQEWMEDVDDPKNE 546
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH+SHL+ ++PG +T P+L KAA ++L RG+ GWS+ WK LWAR D +HA
Sbjct: 547 HRHVSHLWAVYPGSEVTPYGTPELFKAARQSLIFRGDAATGWSMGWKLNLWARFLDGDHA 606
Query: 664 YRMVKRLFNLVDPEHEKH------FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS- 716
Y++++ NL+ P ++ + G++ N+F AHPPFQID NFG TA + EML+QS
Sbjct: 607 YKILQ---NLLAPANDGNRALKIPAHPGVFKNMFDAHPPFQIDGNFGATAGITEMLLQSD 663
Query: 717 ---------------TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+LLPALP G V GL ARGG VS+ WK G L
Sbjct: 664 DPYATPTSLTPVQSGAAGFLHLLPALP-SALPDGKVTGLLARGGFEVSLNWKAGKL 718
>gi|255532590|ref|YP_003092962.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
gi|255345574|gb|ACU04900.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
Length = 825
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 286/775 (36%), Positives = 434/775 (56%), Gaps = 49/775 (6%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
++ T LK+ ++ PA ++ +A+PIGNGRLGAMV+G E L+LNE+T+W+G P
Sbjct: 21 GQAKKTDGTLKLWYDRPAANWNEALPIGNGRLGAMVFGNPAKEQLQLNEETVWSGGPNSN 80
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATA-ASVKLF--GHPADVYQLLGDIELEFDDSHLKYA 120
+ A+ +R L+ G++ EA A A V++F + +YQ +G++ LEF+ +
Sbjct: 81 VTAASGAAIPALRKLIFEGKFEEAQALADVEMFPKKNSGMIYQPVGNLFLEFEGTE---K 137
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y R+L++ A A V Y G + + RE FSS DQV++ +++ + G ++F +D+
Sbjct: 138 ARNYYRDLNIEKALATVTYEAGGIRYKREIFSSFTDQVLIVRLTADKPGKITFRALMDTE 197
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ +++++ G A+ + I+F++ ++K+ + G S L++
Sbjct: 198 QKGGLRME-KDRLLLSGLT--------ADHEGEQGKIRFAS--QVKVVAEGGKAS-LQNN 245
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
V+ ++ A + + +++F N D D ++ S L +Y++ H+
Sbjct: 246 AWIVKAANSATVYVSIATNFK----NYHDVSADAGLKAASFLDRAVKKNYAEALAAHIKF 301
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
YQ+ F+RV + +TD ++ P+ ER+ +F DP L L FQFGRYL
Sbjct: 302 YQQYFNRVKFDIG------ITDAVNK------PTDERIAAFARSNDPHLTALYFQFGRYL 349
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSS+PG Q LQGIWN+ + WDS +NIN EMNYW + NLSE +PLF L
Sbjct: 350 LISSSQPGNQPPTLQGIWNDKMLAPWDSKYTININTEMNYWPAEVTNLSELHDPLFKMLK 409
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LS+ G +TA++ Y A GWV HH TD+W + + + LWPMGG WL HLW+HY +
Sbjct: 410 DLSVTGRETAKLMYGAKGWVTHHNTDLW-RITGPVDRPYAGLWPMGGNWLSQHLWDHYMF 468
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
T D+ FL K YP+L+G + F LD L E +L +PS SPE+ ++ GK ++
Sbjct: 469 TGDKQFL-KEYYPVLKGASEFYLDVLQEEPTHKWLVVSPSNSPENTYVP--GKRVSIAAG 525
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFK 598
+TMD ++ ++F+ AAE+L DA +LK+ L RL P +I + + EW D
Sbjct: 526 TTMDNQLLFDLFTRTGKAAELL--GMDAEFRGLLKTALGRLAPMQIGKYSQLQEWMHDSD 583
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
+ HRH+SHL+GL+P + I+ + P+L AA +L RG+ GWS+ WK WAR
Sbjct: 584 RTDDKHRHVSHLYGLYPSNQISPTRTPELFDAARTSLMYRGDPATGWSMGWKVNFWARFL 643
Query: 659 DQEHAYRMVKRLFNL----VDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
D HAY+++ L VD + K GG Y N+F AHPPFQID NFG TA +AEML+
Sbjct: 644 DGNHAYKLITDQLKLVGGRVDSVNTKG--GGTYPNMFDAHPPFQIDGNFGCTAGIAEMLL 701
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
QS +++LPALP D+W SG VKGL ARGG V I WKD + + + S N
Sbjct: 702 QSHDGAIHILPALP-DQWPSGEVKGLVARGGYVVDISWKDKVITHLKVLSRLGGN 755
>gi|423298609|ref|ZP_17276665.1| hypothetical protein HMPREF1070_05330 [Bacteroides ovatus
CL03T12C18]
gi|392662352|gb|EIY55913.1| hypothetical protein HMPREF1070_05330 [Bacteroides ovatus
CL03T12C18]
Length = 822
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 298/768 (38%), Positives = 441/768 (57%), Gaps = 57/768 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P + NP+A + +
Sbjct: 32 KLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNANPNALEYIP 91
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +Y+ Y RE
Sbjct: 92 KVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTRYS--NYYRE 145
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L S
Sbjct: 146 LSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLTS-------- 197
Query: 188 NGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+Q +M EG C + ++ ++ KG ++F L K ++G A D L
Sbjct: 198 --PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGGEIACADGIL 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VE +D A++ + +++F+ N D + + + L + + H+D Y+
Sbjct: 251 SVEKADEAIIYVSIATNFN----NYQDITGNQIERAKNYLAKAMVHPFIESKRNHVDFYR 306
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ RVS+ L E+ V + +RV++F+ D LV FQFGRYLLI
Sbjct: 307 QYLTRVSLDLG------------EDQYANVTTDKRVENFKNTNDTHLVATYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF + +
Sbjct: 355 CSSQPGGQPANLQGIWNDKLVPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFRLIKEV 414
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE Y YT
Sbjct: 415 SDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRHLWERYLYTG 473
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D +FL + YP+L+ F + ++ E +L PS SPE+ +GK A + T
Sbjct: 474 DVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK-ATTAAGCT 531
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MD ++ ++++AIISA+++L+ + + + + L + P ++ G + EW D+ DP+
Sbjct: 532 MDNQLVFDLWTAIISASQILDTDRE-FASHLTQRLKEMAPMQVGHWGQLQEWMFDWDDPK 590
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +
Sbjct: 591 DVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGD 650
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEML+QS + +
Sbjct: 651 HAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQSYDSFI 707
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
YLLPALP W G +KG+ ARGG + + WK+G + + I S+ N
Sbjct: 708 YLLPALP-AVWKEGSIKGIIARGGFELDLSWKNGKVSRLVIKSHKGGN 754
>gi|154494326|ref|ZP_02033646.1| hypothetical protein PARMER_03681 [Parabacteroides merdae ATCC
43184]
gi|423725485|ref|ZP_17699622.1| hypothetical protein HMPREF1078_03511 [Parabacteroides merdae
CL09T00C40]
gi|154085770|gb|EDN84815.1| hypothetical protein PARMER_03681 [Parabacteroides merdae ATCC
43184]
gi|409234609|gb|EKN27437.1| hypothetical protein HMPREF1078_03511 [Parabacteroides merdae
CL09T00C40]
Length = 809
Score = 491 bits (1263), Expect = e-136, Method: Compositional matrix adjust.
Identities = 291/777 (37%), Positives = 427/777 (54%), Gaps = 52/777 (6%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T PL F+ PA + + P+GNGRLG M GGV +E + LNE ++W+G D
Sbjct: 17 NMPEVKTGKPLSYHFDAPAGIWEASFPLGNGRLGLMPDGGVDTENIVLNEISMWSGSKQD 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIE 109
NP A +L +R L+ G+ EA F G A+V YQLLG++
Sbjct: 77 TDNPQAYHSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSGQGQGANVPYGSYQLLGNLV 136
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L +D + YRREL+L+ A A + G V + RE F+S D + V ++
Sbjct: 137 LNYDYQGTSDSIFGYRRELNLDNAIATASFRRGKVTYNREVFTSFADDLGVIHLTADADR 196
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F+ ++ +++ N ++M+G+ P + KG+++++ +++
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYAS--RVRVIL 247
Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
+G D + V + A+LL+ +A+ FD KD + S L +
Sbjct: 248 PKGGNVTPGDSTVSVRNASEAILLVSMATDYFD----------KDLEGKVSSLLANAEKK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
++ L H+ Y+ LF RV + L S ++ +P ER+ +F + +DP
Sbjct: 298 DFASLKKGHIAAYRSLFGRVELDLGHSSRE------------DLPMDERLAAFHENPDDP 345
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
SL L FQFGRYLLISS+R G NLQG+W ++ W+ H+NINL+MN+W + N
Sbjct: 346 SLAALYFQFGRYLLISSTRVGLLPPNLQGLWCNTINTPWNGDYHLNINLQMNHWPAEVAN 405
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 LSELHLPLVEWTKQQVASGERTAKAYYNARGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWLC HL+ HY YT+D+++L K YP+L+G + F +D L+E + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASLFFVDMLVEDPRNKYLVTAPTTSPENGY 523
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
P+GK A + STMD I+RE+F+ I AA++L + A ++ RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAADILGL-DSAFAGELAAKRARLMPTTIGK 582
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
DG IMEW + +++ E HHRH+SHL+GL+PG+ I+ E+ P+L +AA K+L RG++ GWS
Sbjct: 583 DGRIMEWLEPYEEVEPHHRHVSHLYGLYPGNEISTERTPELAEAARKSLIARGDKSTGWS 642
Query: 647 ITWKTALWARLHDQEHAYRM-VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ WK WARLHD +HAY++ V L VD + GG Y NLF AHPPFQID NFG
Sbjct: 643 MGWKMNFWARLHDGDHAYKLFVDLLRPCVDRKTNMTNGGGTYPNLFCAHPPFQIDGNFGG 702
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
A +AEMLVQS ++ LLPALP W SG KGLK RGG VS WK+G L E G+
Sbjct: 703 CAGIAEMLVQSQTGEIELLPALP-SAWKSGSFKGLKVRGGGEVSAKWKEGRLAEAGL 758
>gi|251798253|ref|YP_003012984.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
gi|247545879|gb|ACT02898.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
Length = 767
Score = 490 bits (1262), Expect = e-135, Method: Compositional matrix adjust.
Identities = 287/754 (38%), Positives = 417/754 (55%), Gaps = 59/754 (7%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ ++ PAK + +A+PIGNGRLGAM++G +E ++LNED+LW G P D NPDA L++
Sbjct: 12 LLYHSPAKQWEEALPIGNGRLGAMIFGDPRAERVQLNEDSLWYGGPRDRHNPDALPNLAE 71
Query: 75 VRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAE-ETYRRELDL 130
+R L+ G+ EA AS+ L P Y LGD+ L F+ + AE Y R LDL
Sbjct: 72 IRKLIFEGKLQEAERLASLALTAIPESQRHYVPLGDLFLRFEHA----AEIRNYERRLDL 127
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+ A V Y+ G +F RE F+S PD+ IV +++ G +SF + + YV+
Sbjct: 128 SEAIVHVSYTAGETKFAREIFASYPDRAIVLRLTADSPGQISFTARMGR--ERFRYVD-- 183
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
E R RI N+ G+++ +L + G++ + + L V +D
Sbjct: 184 -----EIRAEEGRIVMCGNSGG---GVRYCGVL--ACVPEGGSMRTI-GEHLVVSNADAV 232
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+L++ AS+ F + DP + ++ + +YS+L H+ DY+ L+ R +
Sbjct: 233 LLVVTASTDF---------READPEAAALGDAGRVAAAAYSELKASHISDYRSLYDRTRL 283
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
+ S + ++ER+ + + EDP L L F +GRYLLI+SSRPG+
Sbjct: 284 WIGAE---------SGLKPEISETSERLVNVKAGREDPGLTALYFHYGRYLLIASSRPGS 334
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWN+D+ P WDS +NIN +MNYW + C L EC PLF+ + + NG T
Sbjct: 335 LPANLQGIWNKDMLPAWDSKFTININTQMNYWPAESCYLPECHLPLFELIERMIPNGRHT 394
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y G HH TDIWA ++ WP+G AWL HLWEHY Y D FLE
Sbjct: 395 ARSMYGCRGSAAHHNTDIWADTAPQDLWPSSTYWPLGLAWLSLHLWEHYRYGGDTAFLE- 453
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
R YP+++ A FLLD+L+E G T+PS SPE+ + P+G+ + Y +MD I RE
Sbjct: 454 RVYPMMKEAAVFLLDYLVELPSGEWVTSPSVSPENTYRLPNGETGVLCYGPSMDSQIARE 513
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
+F A +A E + N D L+ ++ +++ +L P +I G ++EW +D+++ E HRH+SH
Sbjct: 514 LFQACAAAGERIGSN-DELLGELRQAIDKLPPPRIGRYGQLLEWYEDYEEVEPGHRHISH 572
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRM 666
LF L PG IT +K P+L AA +TL++R G GWS W WARL + E A+
Sbjct: 573 LFALHPGTQITPDKTPELSAAARRTLERRLANGGGHTGWSRAWIINFWARLQEAEEAHAN 632
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
V L + NL HPPFQID NFG TA +AE+L+QS + ++LLPA
Sbjct: 633 VTALLS-----------HSTLPNLLDNHPPFQIDGNFGGTAGIAELLLQSHEDTIHLLPA 681
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
LP W +G V+GL+ARGG TV I WKDG +H+
Sbjct: 682 LP-KAWPAGEVRGLRARGGVTVDIAWKDGLIHQA 714
>gi|336416256|ref|ZP_08596592.1| hypothetical protein HMPREF1017_03700 [Bacteroides ovatus
3_8_47FAA]
gi|335938987|gb|EGN00866.1| hypothetical protein HMPREF1017_03700 [Bacteroides ovatus
3_8_47FAA]
Length = 822
Score = 490 bits (1262), Expect = e-135, Method: Compositional matrix adjust.
Identities = 297/768 (38%), Positives = 440/768 (57%), Gaps = 57/768 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P + NP+A + +
Sbjct: 32 KLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNANPNALEYIP 91
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +Y+ Y RE
Sbjct: 92 KVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTRYS--NYYRE 145
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L S
Sbjct: 146 LSLDSARAIVRYEVDGVQYQREMITSFTDQVVMVRLTANRPGQITFNAQLTS-------- 197
Query: 188 NGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+Q +M EG C + ++ ++ KG ++F L K ++G A D L
Sbjct: 198 --PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGGEIACADGIL 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VE +D A++ + +++F+ N D + + + L + + H+D Y+
Sbjct: 251 SVEKADEAIIYVSIATNFN----NYQDITGNQIERAKNYLAKAMVHPFIESKRNHIDFYR 306
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ RVS+ L E+ V + +RV++F+ D LV FQFGRYLLI
Sbjct: 307 QYLTRVSLDLG------------EDQYANVTTDKRVENFKNTNDAHLVATYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF + +
Sbjct: 355 CSSQPGGQPANLQGIWNDKLVPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFRLIKEV 414
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE Y YT
Sbjct: 415 SDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRHLWERYLYTG 473
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D +FL + YP+L+ F + ++ E +L PS SPE+ +GK A + T
Sbjct: 474 DVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK-ATTAAGCT 531
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MD ++ ++++AIISA+++L+ + + + + L + P ++ G + EW D+ DP+
Sbjct: 532 MDNQLVFDLWTAIISASQILDTDRE-FASHLTQRLKEMAPMQVGHWGQLQEWMFDWDDPK 590
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +
Sbjct: 591 DVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGD 650
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEML+QS +
Sbjct: 651 HAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQSYDGFI 707
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
YLLPALP W G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 708 YLLPALP-AVWKEGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|237718536|ref|ZP_04549017.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229452243|gb|EEO58034.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 1100
Score = 490 bits (1262), Expect = e-135, Method: Compositional matrix adjust.
Identities = 291/764 (38%), Positives = 411/764 (53%), Gaps = 52/764 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA+H+ +A+PIGN RLGAMV+GG E L++NE+T W G P +P A L
Sbjct: 288 LKLWYNRPAQHWEEALPIGNSRLGAMVYGGAGCEELQINEETFWAGGPHHNNSPKAKTVL 347
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ R L+ + EA + F P + L L H K Y RELD+
Sbjct: 348 DEARRLIFEDKTMEAQKLINPNFFSGPHGMSYLNMGSLLILQPGHEK--ATNYYRELDIE 405
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS--------LLDN 183
ATA Y V V +TR FSS DQVI+ ++ + G+L F++ D+ LL
Sbjct: 406 DATATTCYEVDGVTYTRTAFSSMTDQVIIVRLEANRKGALQFSLGYDTPKEADGSALLHP 465
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
V GN + +C G A+A ++++ D ++ + +L
Sbjct: 466 VVKVRGNKLTM---QCIGMEQEGVASA--------IKGEWQVQVVHDGKQVN--QPDRLG 512
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+G+ A + L A+++F +N D + + + + L++ Y H YQ
Sbjct: 513 VQGATTATVYLSAATNF----VNYKDVSGNASRRAAAYLKTALKQPYPKALEAHSKAYQT 568
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RV + L P I + P+ +RV F +D +L+ LL+Q+GRYLLI
Sbjct: 569 QFNRVKLDL---PATIAS---------LAPTNQRVADFNRVDDRNLMALLYQYGRYLLIC 616
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIW L WDS +NIN EMNYW + NLSEC EPLF L LS
Sbjct: 617 SSQPGGQPANLQGIWCRSLHAPWDSKYTININTEMNYWPAEVTNLSECHEPLFSMLEDLS 676
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ G +TA+ Y A GWV HH TD+W + G W +WP GGAWLC HLW+HY YT D
Sbjct: 677 VTGHETARTLYGAKGWVAHHNTDLWRIAGPVDG-ATWGMWPNGGAWLCQHLWQHYLYTGD 735
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+ FL K YP+++G A F++ L++ G+L T PS SPEH + A C TM
Sbjct: 736 QAFLRKY-YPVMKGAADFMMSHLVKHPKYGWLVTVPSVSPEHGYTASTLTAGC-----TM 789
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D I ++ + AA +L + A + + + +L P +I + I EW D DP+
Sbjct: 790 DNQIAFDILNNTRLAATILGE-PTAYQDSLQATCTQLPPMQIGKYNQIQEWMVDADDPKN 848
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH+SHL+GL+P + I+ P L AA+ TL +RG++ GWSI WK WAR+ D H
Sbjct: 849 EHRHISHLYGLYPSNQISPHLQPTLFAAAKNTLLQRGDQATGWSIGWKINFWARMLDGNH 908
Query: 663 AYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
AYR+++ + L+ D + ++H +G Y NLF AHPPFQID NFG+TA V+EML+QS
Sbjct: 909 AYRIIRNMLRLLPSDGKQKEHPDGRTYPNLFDAHPPFQIDGNFGYTAGVSEMLLQSHDGA 968
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++LLPALP ++W G + GL ARGG V + W L I S
Sbjct: 969 VHLLPALP-EEWREGRISGLVARGGFVVDMEWSGAQLFRAEICS 1011
>gi|302867165|ref|YP_003835802.1| cellulose-binding family II protein [Micromonospora aurantiaca ATCC
27029]
gi|302570024|gb|ADL46226.1| cellulose-binding family II [Micromonospora aurantiaca ATCC 27029]
Length = 936
Score = 490 bits (1261), Expect = e-135, Method: Compositional matrix adjust.
Identities = 293/760 (38%), Positives = 416/760 (54%), Gaps = 53/760 (6%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N L + ++ PA + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N
Sbjct: 44 NDLALWYDEPAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGA 103
Query: 70 KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
L+++R V + Q+ A + + G P YQ +GD+ L F + Y R
Sbjct: 104 ANLAEIRRRVFADQWTLAQDLINQTMMGSPGGQLAYQTVGDLRLAFGSAS---GATQYNR 160
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TAT Y G V + RE F+S PDQV+V +++ + +++F+ + DS
Sbjct: 161 TLDLTTATVTTTYVQGGVRYQREVFASAPDQVMVLRLTADRANAITFSAAFDSPQRTTVS 220
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ ++G + ++F A+ ++ GT+S+ L+V G
Sbjct: 221 SPDGATVALDG--------VSGSMEGVTGSVRFLALANAAVTG--GTVSS-SGGTLRVSG 269
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +L+ SS+ +N D + + L + ++++ L TRH DYQ LF
Sbjct: 270 ATSVTVLVSIGSSY----VNYRTVNGDYQGIARNRLNAAKSVAVDQLRTRHRADYQALFD 325
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV+I L R T + + P+ R+ + DP LLFQFGRYLLISSSR
Sbjct: 326 RVTIDLGR--------TAAADQ----PTDVRIAQHASTNDPQFAALLFQFGRYLLISSSR 373
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIW++ L+P+WDS VN NL MNYW + NLSEC P+FD + L++ G
Sbjct: 374 PGTQPANLQGIWSDSLTPSWDSKYTVNANLPMNYWPADTTNLSECFLPVFDMVKDLTVTG 433
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++ AQ Y A GWV HH TD W +S G W +W GGAWL T +W+HY +T D F
Sbjct: 434 ARVAQAQYGAGGWVTHHNTDAWRGASVVDG-AFWGMWQTGGAWLSTLIWDHYLFTGDSGF 492
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L+ YP L+G A F LD L+ GYL TNPS SPE A A V TMD
Sbjct: 493 LQAN-YPALKGAAQFFLDTLVAHPTLGYLVTNPSNSPELAHHAN----ASVCAGPTMDNQ 547
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I+R++F A A+EVL + +V + RL P+++ G++ EW D+ + E HR
Sbjct: 548 ILRDLFDAAARASEVLGV-DTTFRSQVRTARDRLPPSRVGSRGNVQEWLADWVETERTHR 606
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GL PG+ IT P L +AA +TL+ RG++G GW + WK WARL D A++
Sbjct: 607 HVSHLYGLHPGNQITRRGTPALYEAARRTLELRGDDGTGWYLAWKINFWARLEDGARAHK 666
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
+++ +LV + L N+F HPPFQID NFG T+ +AEML+ S +L+LLP
Sbjct: 667 LLR---DLVRTDR-------LAPNMFDLHPPFQIDGNFGATSGIAEMLLHSHTGELHLLP 716
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
ALP W +G V GL+ RGG TVS+ W G E+ + ++
Sbjct: 717 ALP-TAWPAGQVAGLRGRGGYTVSLTWSSGQADEITVRAD 755
>gi|302548581|ref|ZP_07300923.1| alpha-L-fucosidase 2 [Streptomyces hygroscopicus ATCC 53653]
gi|302466199|gb|EFL29292.1| alpha-L-fucosidase 2 [Streptomyces himastatinicus ATCC 53653]
Length = 809
Score = 490 bits (1261), Expect = e-135, Method: Compositional matrix adjust.
Identities = 302/766 (39%), Positives = 429/766 (56%), Gaps = 58/766 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + A + +A+PIGNGRLGAMV+GG SE L+LNEDT+W G P + +P A +L
Sbjct: 49 LALWYPRAASTWLEALPIGNGRLGAMVFGGAESELLQLNEDTVWAGGPYEPASPKALASL 108
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R V +G++ A + G P +YQ +G++ L FD + YRR LD
Sbjct: 109 PEIRRRVFAGEWEAAQSLIDSDFLGTPKGELMYQPVGNLRLAFDAAG---EVGDYRRTLD 165
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L++A A V+Y+ G V + RE F+S+PDQVIV +++ G++SF + DS
Sbjct: 166 LDSAVASVRYAQGGVTYDRECFASHPDQVIVMRLTADRPGAVSFTAAFDS---------- 215
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGI--QFSAILEIKISDDRGTISALEDKKLKVEGS 247
Q ++ P + ++ +G+ Q + D GT+S+ E+ L V G+
Sbjct: 216 -PQTVIAS-SPDRITVAIDGTSETREGVTGQVRFRALARARADGGTVSS-ENGTLTVTGA 272
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D LL+ +S+ + NP+ D + + + L + ++ Y+ L RH+ DY+ LF R
Sbjct: 273 DSVTLLVSVGTSYTD-YRNPT---GDHAARATAPLNAASDVPYARLRKRHVADYRGLFRR 328
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V + L TD + +P+ ERV +F + DP LV L FQ+GRYLLISSSRP
Sbjct: 329 VGLDLG------TTDAAA------LPTDERVANFASATDPQLVALHFQYGRYLLISSSRP 376
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ ANLQGIWN+ LSP+WDS +NIN EMNYW + NL EC EP+FD L LS+ G+
Sbjct: 377 GTQPANLQGIWNDSLSPSWDSKYTININTEMNYWPAPVTNLLECWEPVFDLLADLSVAGA 436
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y A GWV HH TD W + +A + +W GGAWL T +W+HY +T D+ L
Sbjct: 437 TTAKRQYGAGGWVTHHNTDAW-RGTAPVDRAFPGMWQTGGAWLSTGIWDHYLFTGDKKAL 495
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+R YP+L G F LD L+ + G+ T P+ SPE+ V TMD I
Sbjct: 496 RRR-YPVLRGSVRFFLDTLVTDPATGHFVTCPANSPENAHHTN----VSVCAGPTMDNQI 550
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFK--DPEVH 603
+R++F + A+E+L ++ DA + ++ + R L P KI G + EW +D+ PE
Sbjct: 551 LRDLFDGFVKASELLGEDADAGMRAEVRRVRRKLPPMKIGAQGQLREWQEDWDAIAPEQK 610
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH+SHL+GL P + IT P+L AA KTL++RG+ G GWS+ WK WARL D +
Sbjct: 611 HRHVSHLYGLHPSNQITKRDTPELFAAARKTLERRGDAGTGWSLAWKINFWARLEDGARS 670
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
+++ L +L+ PE NLF HPPFQID NFG TA V+E L+QS +L L
Sbjct: 671 FKL---LTDLLTPERTA-------PNLFDLHPPFQIDGNFGATAGVSEWLLQSHAGELRL 720
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
LPALP G V+GL ARGG V + W+ G L + S N
Sbjct: 721 LPALP-PTLLDGRVRGLLARGGFEVDLTWRQGALLTGKLRSRSGNQ 765
>gi|218262384|ref|ZP_03476870.1| hypothetical protein PRABACTJOHN_02544 [Parabacteroides johnsonii
DSM 18315]
gi|218223418|gb|EEC96068.1| hypothetical protein PRABACTJOHN_02544 [Parabacteroides johnsonii
DSM 18315]
Length = 809
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 287/777 (36%), Positives = 425/777 (54%), Gaps = 52/777 (6%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T L F+ PA+ + + +P+GNGRLG M GGV +E + LNE ++W+G D
Sbjct: 17 NMPEVKTGKSLSYHFDAPAEIWEETLPLGNGRLGLMPDGGVDTEKIVLNEISMWSGSKQD 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L +R L+ G+ EA F P YQLLG++
Sbjct: 77 TDNPQAYYSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSALGQGANAPYGSYQLLGNLV 136
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L +D + YRREL+L+ A A + G V++ RE F+S D + V ++
Sbjct: 137 LNYDYQGSSDSISGYRRELNLDNAIATASFRRGKVKYDREVFTSFADDLGVIHLTADADK 196
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F+ ++ +++ N ++M+G+ P + KG+++++ + + +
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYASRVRVVLPK 249
Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
I D + + + A+LL+ +A+ FD KD + S L +
Sbjct: 250 GGNVIPG--DSTVTIRNASEAILLVSMATDYFD----------KDLDEKVASLLANAEKK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
++ L H+ Y+ LF RV + L S ++ +P ER+ +F D +DP
Sbjct: 298 DFASLKKGHIAAYRSLFGRVDLDLGHSSRE------------DLPIDERLATFNADPDDP 345
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
SL L FQFGRYLLISS+R G NLQG+W ++ W+ H+NINL+MN+W + N
Sbjct: 346 SLGALYFQFGRYLLISSTRVGLLPPNLQGLWCNTVNTPWNGDYHLNINLQMNHWPAEVAN 405
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 LSELHLPLVEWTKQQVASGEQTAKAYYNAGGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWLC HL+ HY YT+D+++L K YP+L+G + F +D L+E + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASRFFVDMLVEDPRNKYLVTAPTTSPENGY 523
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
P+GK A + STMD I+RE+F+ I AA +L + A +++ RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAANILGI-DSAFAGELVAKRARLMPTTIGK 582
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
DG IMEW + F++ E HHRH+SHL+GL+PG+ I+I+ P+L +AA K+L RG++ GWS
Sbjct: 583 DGRIMEWLEPFEEVEPHHRHVSHLYGLYPGNEISIKHTPELAEAARKSLVARGDKSTGWS 642
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ WK WARLHD +HAY+++ L VD + GG Y NLF AHPPFQID NFG
Sbjct: 643 MAWKINFWARLHDGDHAYKLLVDLLRPCVDRKTNMTNGGGTYPNLFCAHPPFQIDGNFGG 702
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
A +AEMLVQS ++ LLPALP W +G KGLK RGG VS WK+G L E G+
Sbjct: 703 CAGIAEMLVQSQTGEIELLPALP-SAWKNGSFKGLKVRGGGEVSAKWKEGRLTEAGL 758
>gi|299145505|ref|ZP_07038573.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298515996|gb|EFI39877.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 822
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 297/768 (38%), Positives = 440/768 (57%), Gaps = 57/768 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P + NP+A + +
Sbjct: 32 KLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGHPNNNANPNALEYIP 91
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +Y+ Y RE
Sbjct: 92 KVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTRYS--NYYRE 145
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L S
Sbjct: 146 LSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLTS-------- 197
Query: 188 NGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKL 242
+Q +M EG C + ++ ++ KG ++F L K ++G A D L
Sbjct: 198 --PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGGEIACADGIL 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VE +D A++ + +++F+ N D + + + L + + H+D Y+
Sbjct: 251 SVEKADEAIIYVSIATNFN----NYQDITGNQIERAKNYLAKAMVHPFIESKRNHVDFYR 306
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+ RVS+ L E+ V + +RV++F+ D LV FQFGRYLLI
Sbjct: 307 QYLTRVSLDLG------------EDQYANVTTDKRVENFKNTNDAHLVATYFQFGRYLLI 354
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE EPLF + +
Sbjct: 355 CSSQPGGQPANLQGIWNDKLVPSWDSKYTCNINLEMNYWPSEVTNLSELNEPLFRLIKEV 414
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE Y YT
Sbjct: 415 SDTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRHLWERYLYTG 473
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D +FL + YP+L+ F + ++ E +L PS SPE+ +GK A + T
Sbjct: 474 DVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK-ATTAAGCT 531
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MD ++ ++++AIISA+++L+ + + + + L + P ++ G + EW D+ DP+
Sbjct: 532 MDNQLVFDLWTAIISASQILDTDRE-FASHLTQRLKEMAPMQVGHWGQLQEWMFDWDDPK 590
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +
Sbjct: 591 DVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGD 650
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++ LV E +K GG Y NLF AHPPFQID NFG A +AEML+QS +
Sbjct: 651 HAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIAEMLMQSYDGFI 707
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
YLLPALP W G +KG+ ARGG + + WK+G + + + S+ N
Sbjct: 708 YLLPALP-AVWKEGSIKGIIARGGFELDLSWKNGKVSRLVVKSHKGGN 754
>gi|302549607|ref|ZP_07301949.1| large secreted protein [Streptomyces viridochromogenes DSM 40736]
gi|302467225|gb|EFL30318.1| large secreted protein [Streptomyces viridochromogenes DSM 40736]
Length = 953
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 294/751 (39%), Positives = 411/751 (54%), Gaps = 55/751 (7%)
Query: 11 NPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N L + ++ PA + A+PIGNGRLGAMV+G +E L+LNEDT+W G P D NP
Sbjct: 23 NDLALWYDKPAGADWLRALPIGNGRLGAMVFGNADTERLQLNEDTVWAGGPYDSANPRGA 82
Query: 70 KALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRR 126
++++R V + Q+ A + + G PA YQ +G++ L F + Y R
Sbjct: 83 ANIAEIRRRVFADQWGPAQDLINQTMLGSPAGQLAYQPVGNLLLSFGSA---TGVSQYNR 139
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TATA Y + V + RE F+S PDQVIV +++ + S++FN + DS
Sbjct: 140 TLDLTTATAVTTYVLNGVRYQREVFASAPDQVIVVRLTADRANSIAFNATFDSPQRTTVS 199
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
I ++G ++F A+ ++ GT+S+ L+V G
Sbjct: 200 SPDGATIALDGVS--------GTMEGITGRVRFLALANAAVTG--GTVSS-SGGTLRVSG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ +L+ SS+ ++ D + L + R++ L RHL DYQ LF+
Sbjct: 249 ATSVTVLVAIGSSY----VDFRRVDGDYQGIARRHLNAARDIGIDQLRRRHLADYQALFN 304
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L R+ T +++ P+ R+ DP LLFQFGRYLLISSSR
Sbjct: 305 RVSVDLGRT-------TAADQ-----PTDVRIAQHAQANDPQFSALLFQFGRYLLISSSR 352
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN+ ++P+WDS VN NL MNYW + NLSEC P+FD + L++ G
Sbjct: 353 PGTQPANLQGIWNDQMAPSWDSKFTVNANLPMNYWPADTTNLSECFLPVFDMIDDLTVTG 412
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
++ AQ Y A GWV HH TD W +S D + W +W GGAWL T +W+HY +T D D
Sbjct: 413 ARVAQAQYGAGGWVTHHNTDAWRGASVVDEAR--WGMWQTGGAWLATLIWDHYLFTGDID 470
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL YP L+G A F LD L+ G+L TNPS SPE A A V TMD
Sbjct: 471 FLRSN-YPALKGAAQFFLDTLVAHPSLGHLVTNPSNSPELAHHAD----ATVCAGPTMDN 525
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I+R++F ++ A E+L+ + + RL PTK+ G++ EW D+ + E H
Sbjct: 526 QILRDLFHSVARAGEILDVDAAFRAQAKAAR-ERLAPTKVGSRGNVQEWLADWVETERTH 584
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL P + IT P L +AA +TL+ RG++G GWS+ WK WARL D A+
Sbjct: 585 RHVSHLYGLHPSNQITKRGTPQLHEAARRTLELRGDDGTGWSLAWKINFWARLEDGARAH 644
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
++++ +LV + L N+F HPPFQID NFG TA +AEML+QS +L++L
Sbjct: 645 KLIR---DLVRTDR-------LAPNMFDLHPPFQIDGNFGATAGIAEMLLQSHNGELHVL 694
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PALP W +G V GL+ RGG TV W G
Sbjct: 695 PALP-AAWPTGRVSGLRGRGGYTVGAEWSSG 724
>gi|281422553|ref|ZP_06253552.1| putative large secreted protein [Prevotella copri DSM 18205]
gi|281403377|gb|EFB34057.1| putative large secreted protein [Prevotella copri DSM 18205]
Length = 807
Score = 489 bits (1260), Expect = e-135, Method: Compositional matrix adjust.
Identities = 280/772 (36%), Positives = 431/772 (55%), Gaps = 66/772 (8%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
M A+++ T N + +N PA+ + +A+PIGN LG MV+GG E ++LNE+T W+G P
Sbjct: 21 MMAKTSCTDNSTLLWYNAPAQQWLEALPIGNSHLGGMVYGGTTDENIQLNEETFWSGGPH 80
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE 121
+ + + + L VR L+ +G+ EA A + F + L L + AE
Sbjct: 81 NNNSKKSLENLPKVRELIFNGREEEAAALINQTFIPGPHGMRFLPMANLHITMKNQGKAE 140
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+ + R LDL A A + + V +TR F+S D VIV I S G+L+ +V+LDS
Sbjct: 141 Q-FVRNLDLKRAIATTSFVMDGVRYTRTTFASLADGVIVCHIKASRKGALNIDVTLDSPF 199
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK- 240
++ + P G+ +L++K D G +AL +
Sbjct: 200 EHQT-------------------------QKMPSGV----MLKVKGQDQEGIKAALTAEC 230
Query: 241 --KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
++ +G++ +++ A++ F+N D + + + ++ +SY+ L RH+
Sbjct: 231 VADVRKDGTEATIIVSAATN-----FVNYHDVSGNAAQRNADYINKVKLMSYAQLEKRHV 285
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+ YQK F S+ L P DI ++P+ +R++ F +D ++V L++ +GR
Sbjct: 286 EAYQKQFATSSLIL---PTDINA---------SLPTNQRLEKFAGSKDMAMVALMYNYGR 333
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSS+PG Q ANLQG+WN+ + WDS +NIN EMNYW + NL EPL+
Sbjct: 334 YLLISSSQPGGQAANLQGVWNDSKNAPWDSKYTININTEMNYWPAEVTNLGNTTEPLYSL 393
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ LS+ G++TA+ Y GW+ HH TDIW + G W ++P GGAWL THLW+HY
Sbjct: 394 IKDLSVTGAQTAREMYGCRGWMAHHNTDIWRIAGPVDG-AQWGMFPNGGAWLTTHLWQHY 452
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
YT D+ FL K+ YP+++G A F LD++ + G + + PS SPE P GK V
Sbjct: 453 LYTGDKAFL-KQWYPVIKGAAEFYLDYMQKLPGTEWKVSV-PSVSPEQ---GPKGKRTAV 507
Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
+ TMD I + ++ + A+E+L ++ E +++++ +P P +I + G + EW
Sbjct: 508 TAGCTMDNQIAFDALTSAVKASEILGVDEAERKDMQQLVSQIP---PMQIGKYGQLQEWL 564
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
D DP+ HRH+SHL+GL+P + I+ +P+L AA TL+ RG++ GWS+ WKT W
Sbjct: 565 VDADDPKNEHRHISHLYGLYPSNQISPFSHPELFHAAATTLKHRGDQATGWSLGWKTNFW 624
Query: 655 ARLHDQEHAYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
AR+ D HA+R++ + L+ D + +++ +G Y NLF AHPPFQID NFG TA +AEM
Sbjct: 625 ARMLDGNHAFRIISNMLRLLPSDAQAKEYPDGRTYPNLFDAHPPFQIDGNFGVTAGIAEM 684
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L+QS ++LLPALP D W G VKGL+ARGG V + WKDG L + I S
Sbjct: 685 LLQSHDGAVHLLPALP-DAWKEGSVKGLRARGGFVVDMDWKDGKLKQAKIRS 735
>gi|319642679|ref|ZP_07997325.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
gi|345520274|ref|ZP_08799672.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|254836101|gb|EET16410.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|317385767|gb|EFV66700.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
Length = 814
Score = 489 bits (1259), Expect = e-135, Method: Compositional matrix adjust.
Identities = 285/764 (37%), Positives = 432/764 (56%), Gaps = 50/764 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NPDA + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G + D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ D+ D + D RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+I ++++ II+ A +L + + ++ + L + P +I G + EW D+ +P+ HR
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWMMDWDNPQDVHR 586
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFPG+ I+ + P+L AA +L RG+ GWS+ WK LWARL D +HAY+
Sbjct: 587 HVSHLYGLFPGNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGDHAYK 646
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+QS +YLLP
Sbjct: 647 LITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLMQSHDGFIYLLP 703
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ALP +W G V G+ ARGG + + WK+G + + + S + N
Sbjct: 704 ALP-AQWKEGSVNGIIARGGFELDLSWKNGKVSRLVVKSRHGGN 746
>gi|29346420|ref|NP_809923.1| hypothetical protein BT_1010 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29338316|gb|AAO76117.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 824
Score = 489 bits (1258), Expect = e-135, Method: Compositional matrix adjust.
Identities = 298/773 (38%), Positives = 445/773 (57%), Gaps = 49/773 (6%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E + K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+T+W G P +
Sbjct: 25 EKKVSAQEYKLWYDRPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEETIWAGRPNNNA 84
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NP+A + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 85 NPNALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 140
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y++ Y R+L L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 141 YSD--YYRDLSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISAL 237
S H V +++ EG C + ++ ++ KG ++F L + ++G A
Sbjct: 199 S---PHQDVMIHSE---EGNC--VTLSGVSSLHEGLKGKVEFQGRLTAR---NQGGKIAC 247
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
D L VEG+D A + + +++F+ N D + T + S L +++ H
Sbjct: 248 TDGVLSVEGADEATIYVSIATNFN----NYLDITGNQTERAKSYLSEALVRPFAEAKKNH 303
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
++ Y++ RVS+ L E+ V + +RV++F+ D LV FQFG
Sbjct: 304 VEFYRRYLTRVSLDLG------------EDQYKNVTTDKRVENFKDTHDAHLVATYFQFG 351
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLS+ EPLF
Sbjct: 352 RYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSDLNEPLFR 411
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +S +G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC HLWE
Sbjct: 412 LIKEVSESGKETAKIMYGANGWVLHHNTDIWRITGA-LDKAPSGMWPSGGAWLCRHLWER 470
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ DGK A
Sbjct: 471 YLYTGDTEFL-RSVYPILKESGLFFDEIMVKEPVHNWLVVCPSNSPENVHSGSDGK-ATT 528
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ TMD +I ++++AIISA+ +L+ +++ + + L + P ++ G + EW D
Sbjct: 529 AAGCTMDNQLIFDLWTAIISASRILDTDKE-FAAHLEQRLKEMAPMQVGHWGQLQEWMFD 587
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ DP HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWAR
Sbjct: 588 WDDPNDVHRHVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWAR 647
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D +HAY+++ LV E +K GG Y NLF AHPPFQID NFG A + EML+QS
Sbjct: 648 LLDGDHAYKLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCAAGIVEMLMQS 704
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+YLLPALP W G V G+ ARGG + + WK+G ++ + + S+ N
Sbjct: 705 YDGFIYLLPALP-TLWKDGSVTGIIARGGFELDLNWKNGKVNRLVVKSHKGGN 756
>gi|291545123|emb|CBL18232.1| hypothetical protein RUM_22260 [Ruminococcus champanellensis 18P13]
Length = 776
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 285/756 (37%), Positives = 407/756 (53%), Gaps = 61/756 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA++F A+P+GNGR+GAMV+GGV +E LKLNED++W+G + NPDA + + +R
Sbjct: 9 YTKPAENFDQALPVGNGRMGAMVFGGVETEHLKLNEDSIWSGGLRNRNNPDAYQGMQQIR 68
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ + +EA + + + G P + Y LGD+++ F H + YRR LDL++
Sbjct: 69 MLLQQEKISEAEELAFQTMQGCPENSRHYMPLGDLDVVF---HKESHSTAYRRTLDLSSG 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A +Y++ V++ R F S PD V+V +S + G +SF S G +
Sbjct: 126 IALTEYTLDGVQYQRSVFVSEPDNVLVLHVSADQPGQVSFAASF----------GGRDDY 175
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
E R G+ +GIQF+ ++ + R +L VEG+D A LL
Sbjct: 176 YDENRPDGEASICVTGGQGGQQGIQFAVVMTAAVQGGRAFTRG---NQLCVEGADEATLL 232
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
L +SF K + E+ + + S+ +L RH+DDY+ LF RV ++L
Sbjct: 233 LAVQTSF---------YKGEGYLEAAQLDAEYAADCSFHELMVRHVDDYRALFDRVKLEL 283
Query: 313 -------SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
++ P D + D +A + D L EL F +GRYL+IS S
Sbjct: 284 EDNSGEGAQLPTDARLSRLRGNDFDGKDAAGLIL------DNKLTELYFNYGRYLMISGS 337
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPG+Q NLQGIWN+D+ P W S VNIN EMNYW + CNLSEC PLFD + + N
Sbjct: 338 RPGSQPLNLQGIWNQDMWPAWGSRFTVNINTEMNYWCAESCNLSECHLPLFDLIRRMRPN 397
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+ Y G+V HH TD+W + + +WPMG AWLC H++EHY YT+DRD
Sbjct: 398 GEQTARDMYHCGGFVCHHNTDLWGDCAPQDRWMPATIWPMGAAWLCLHIFEHYQYTLDRD 457
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FL ++ + L G A F +++ E G L T PS SPE+ ++ G + +MD
Sbjct: 458 FLAQQ-FDTLCGAAQFFTEYMFENSAGQLVTGPSVSPENTYLTASGAKGSLCIGPSMDSQ 516
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
II +F+ ++ AA +LE+ E L+EK+ + LPRL +I + G I EWA D+ + E+ HR
Sbjct: 517 IITLLFTDVLEAARILER-ESPLLEKIRQMLPRLPMPEIGKYGQIKEWAVDYDEVEIGHR 575
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEH 662
H+S LF L P IT E P L AA TL +R G GWS W +WARLHD E
Sbjct: 576 HISQLFALHPADLITPEDTPKLADAARATLVRRLVHGGGHTGWSRAWIMNMWARLHDGEM 635
Query: 663 AYRMVKRLFNL-VDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
+ +++L +P NL +HPPFQID NFG TAAV E L+QS +
Sbjct: 636 VFENMQKLLAYSTNP------------NLLDSHPPFQIDGNFGGTAAVCEALLQSHGGVM 683
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LPALP +W+ G V GL+A+G TV + W+D L
Sbjct: 684 QFLPALP-PQWAKGSVMGLRAKGAYTVDLFWQDARL 718
>gi|384098831|ref|ZP_09999943.1| hypothetical protein W5A_09224 [Imtechella halotolerans K1]
gi|383834974|gb|EID74405.1| hypothetical protein W5A_09224 [Imtechella halotolerans K1]
Length = 786
Score = 488 bits (1257), Expect = e-135, Method: Compositional matrix adjust.
Identities = 301/814 (36%), Positives = 444/814 (54%), Gaps = 64/814 (7%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A + ++ + ++ + PA + +A+P+GNGRLGAM++G +E ++LNED++W G P
Sbjct: 17 ANAQNSQSKERLWYKEPATKWMEALPVGNGRLGAMIFGQPINERIQLNEDSMWPGGPDWG 76
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAE 121
+ P+ L +R L+ GQY +A V F + V +Q +GD+ ++F +
Sbjct: 77 DSKGTPEDLVYIRQLLKEGQYHKADEEIVTRFSNKGVVRSHQTMGDLYIDFSTKKVA--- 133
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y RELD+ TA A Y+ +T+E F+S P V++ + + + + + ++
Sbjct: 134 -NYYRELDIETAVATTSYNSEGYNYTQEVFASAPHNVLIIRYTTTNPKGMDATLRMNRPK 192
Query: 182 D---NHSYVN--GNNQIIMEGRCP--GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D N V+ NQI M+G G R+ +A D G++F L +K + G I
Sbjct: 193 DEGFNTVQVSSPAPNQIQMKGMVTQNGGRLNSEAKPLD--YGVKFDTRLVVK---NNGGI 247
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+D L+++ + AVLLLV S+SF + S + L ++ LSY+++
Sbjct: 248 VVSKDGILELKNVNEAVLLLVGSTSFY--------HGNNYESYNEQLLGQVQELSYNEML 299
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ-TDEDPSLVELL 353
+ H+ DYQ L+ RV++ L + + +P+ ER+K + D +L LL
Sbjct: 300 SAHVADYQSLYKRVTLDLGGN------------EFNKIPTDERLKKIKDGGTDKALSALL 347
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQ+GRYLLISSSRPGT ANLQGIWNE + W++ H+N+NL+MNYW + NLSEC
Sbjct: 348 FQYGRYLLISSSRPGTNPANLQGIWNEHIRAPWNADYHLNVNLQMNYWPAEVTNLSECHS 407
Query: 414 PLFDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PLFD+ L G TA+ Y + G VIHH +DIWA + + W W GG WL
Sbjct: 408 PLFDYTDRLINRGRITAKDQYGIHRGAVIHHTSDIWAPAWMHAERAYWGAWIHGGGWLAQ 467
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYL-ETNPSTSPEHEFIAPDG 531
H WEHY+YT D DFL+ RA+P ++ A F LDWLI D ++P TSPE+ ++APDG
Sbjct: 468 HYWEHYSYTNDIDFLKNRAWPAMKALAEFYLDWLIYDQDSKTWVSSPETSPENSYMAPDG 527
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSI 590
A VS+ + M II EVF+ + AA +L+ N+D V++V L ++ P + DG I
Sbjct: 528 TPAAVSHGAAMGHQIIGEVFNNTLKAASILKINDD-FVQEVKSKLKKIHPGVVLGPDGRI 586
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSI 647
+EW + ++PE HRH+S L+ L PG +IT +K +AA+KT+ R G G GWS
Sbjct: 587 LEWTKPVEEPEKGHRHMSQLYALHPGISIT-QKTSAHFEAAKKTIDYRLQHGGAGTGWSR 645
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
W ARL D A +++ + + NLF HPPFQID NFGFTA
Sbjct: 646 AWMINFNARLQDAVAAQTNIQKFLEISTAD-----------NLFDMHPPFQIDGNFGFTA 694
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
VAEML+QS + LLPALP + W SG V GLKARG VSI WK+ + + + S
Sbjct: 695 GVAEMLMQSHEGFIRLLPALP-ESWDSGEVTGLKARGNIQVSIKWKEHTIERIELVSK-- 751
Query: 768 NNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLK 801
D+ TL Y+ ++LS+ + N+ LK
Sbjct: 752 ---EDTKATLVYKDRKKTISLSSNETIILNQYLK 782
>gi|238060476|ref|ZP_04605185.1| large secreted protein [Micromonospora sp. ATCC 39149]
gi|237882287|gb|EEP71115.1| large secreted protein [Micromonospora sp. ATCC 39149]
Length = 826
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 299/787 (37%), Positives = 421/787 (53%), Gaps = 60/787 (7%)
Query: 19 GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSL 78
G + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N L+++R
Sbjct: 53 GAGTDWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANLAEIRRR 112
Query: 79 VDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
V + Q++ A + + G P YQ +G++ L F + Y R LDL TAT
Sbjct: 113 VFADQWSSAQDLINQTMMGTPGGQLAYQTVGNLRLAFGSAS---GASQYNRTLDLTTATV 169
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
Y + V + RE F+S PDQVIV +++ + S++F+ + DS N I
Sbjct: 170 TTTYVLNGVRYQREVFASAPDQVIVLRLTADRASSITFSATFDSPQRTTMSSPDANTIAA 229
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
+G + ++F A+ + GT+S+ L+V G+ +L+
Sbjct: 230 DG--------ISGSMEGINGSVRFLALAHAVATG--GTVSS-SGGTLRVSGATSVTVLIS 278
Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
+SS+ +N D + + L + R +S L +RH+ DYQ LF+RV+I L R
Sbjct: 279 IASSY----VNYRTVNGDYQGIARTRLNAARTVSIDQLRSRHIADYQALFNRVTINLGR- 333
Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQ 375
T + + P+ R+ + DP LLFQFGRYLLISSSRPGTQ ANLQ
Sbjct: 334 -------TAAADQ----PTDVRIAQHASSNDPQFSALLFQFGRYLLISSSRPGTQPANLQ 382
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
GIWN+ L+P+WDS +N NL MNYW + NLSEC P+FD + L++ G++ AQ Y
Sbjct: 383 GIWNDSLAPSWDSKYTINANLPMNYWPADTTNLSECFLPVFDMIKDLTVTGARVAQAQYG 442
Query: 436 ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
A GWV HH TD W +S G +W +W GGAWL T +WEHY +T D FL+ YP L
Sbjct: 443 AGGWVTHHNTDAWRGASVVDG-ALWGMWQTGGAWLATLIWEHYLFTGDVGFLQAN-YPAL 500
Query: 496 EGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAI 554
+G A F LD L+ YL TNPS SPE P V TMD I+R++F A
Sbjct: 501 KGAAQFFLDTLVVHPTLNYLVTNPSNSPE----LPHHSNVSVCAGPTMDNQILRDLFDAA 556
Query: 555 ISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A+E L + +V + RL P+++ G+I EW D+ + E HRH+SHL+GL
Sbjct: 557 ARASETLGV-DTTFRSQVRTAKDRLPPSRVGSRGNIQEWLADWIETERTHRHVSHLYGLH 615
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
P + IT P L +AA +TL+ RG++G GWS+ WK WARL D A++++K +LV
Sbjct: 616 PSNQITKRGTPQLYEAARRTLELRGDDGTGWSLAWKINFWARLEDAARAHKLLK---DLV 672
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ L N+F HPPFQID NFG T+ +AEML+ S +L++LPALP W +
Sbjct: 673 RTDR-------LAPNMFDLHPPFQIDGNFGATSGIAEMLLHSHTGELHVLPALP-TAWPT 724
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR---GTSVKVNLSAG 791
G V GL+ RGG TV + W G E+ + + D D + R G+ V+++ G
Sbjct: 725 GQVAGLRGRGGYTVGVAWTSGQADEISVRA-----DRDGTLKMRARLLTGSFTLVDVTDG 779
Query: 792 KIYTFNR 798
T R
Sbjct: 780 STPTVTR 786
>gi|410096023|ref|ZP_11291014.1| hypothetical protein HMPREF1076_00192 [Parabacteroides goldsteinii
CL02T12C30]
gi|409227429|gb|EKN20327.1| hypothetical protein HMPREF1076_00192 [Parabacteroides goldsteinii
CL02T12C30]
Length = 821
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 285/773 (36%), Positives = 431/773 (55%), Gaps = 56/773 (7%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
T N + F+ PA+ + + +P+GNGRLG M GG+ E + LNE ++W+G D NP A
Sbjct: 35 TANKIAYHFDEPARIWEETLPLGNGRLGMMPDGGINKENILLNEISMWSGSKQDTDNPQA 94
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDS 115
+L+++R L+ G+ EA + F P YQLLG++ L++
Sbjct: 95 VWSLANIRRLLFEGKNDEAQDLMYRTFVCKGAGSGQGQGANVPYGSYQLLGNLVLDYVYV 154
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
+ YRREL+LN A A + G V ++RE F+S + V + +L+F V
Sbjct: 155 DGSDSVAAYRRELNLNDAIASTSFRKGKVNYSRESFTSFSGDLGVVHLMADADKALNFTV 214
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
++ V+G + ++M+G+ P + KGI++ A + + + IS
Sbjct: 215 GMNRPEHYALSVDGKD-LLMKGQLP------DGVDTLEMKGIKYGARVRVLLPKGGSLIS 267
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
D L V+ + A+LL+ ++++ ++ +D + S L YS L
Sbjct: 268 G--DSSLTVQNASEAILLVSMATNYK------NEGFED---QLFSLLAESERKDYSTLRK 316
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
H++ Y+ LF RV + L RS +D +P ER+ +FQ D+ DPSL L F
Sbjct: 317 EHVNAYRSLFDRVDLDLGRSARD------------EMPINERLHAFQEDQNDPSLGALYF 364
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISS+R G+ NLQG+W ++ W+ H+NIN +MN+W + NLSE P
Sbjct: 365 QFGRYLLISSTRTGSLPPNLQGLWCNTINTPWNGDYHLNINFQMNHWPAEVTNLSELHLP 424
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ ++ +G +TA+V Y A G V H ++W + +A W AWLC HL
Sbjct: 425 MIEWTKQQVESGERTAKVFYNARGLVTHILGNVW-EFTAPGEHPSWGATNTSAAWLCEHL 483
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
+ HY YT+D+++L K YP+++G A F D L+ + + YL T P+TSPE+ + P+GK+
Sbjct: 484 FTHYQYTLDKEYL-KEVYPVMKGAALFFTDMLVRDPRNNYLVTAPTTSPENAYRMPNGKV 542
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ STMD I+RE+F+ I+AA +L + A +++ RL PT I +DG I+EW
Sbjct: 543 VHICAGSTMDNQIVRELFTNTIAAANILGI-DSAFCQELADKRSRLMPTTIGKDGRILEW 601
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ +++ E HHRH+SHL+GL+PG+ I++E P+L +AA KTL+ RG++ GWS+ WK
Sbjct: 602 LEPYEEVEPHHRHVSHLYGLYPGNEISMEHTPELAEAARKTLEARGDKSTGWSMAWKINF 661
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFE----GGLYSNLFAAHPPFQIDANFGFTAAV 709
WARLHD +HAY++ L +L+ P EK GG Y NLF AHPPFQID N+G A +
Sbjct: 662 WARLHDGDHAYKL---LVDLLRPCVEKTTNMVNGGGSYPNLFCAHPPFQIDGNYGGCAGI 718
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
AEMLVQS ++ LLPALP W +G KGLK +GG VS W +G + E G+
Sbjct: 719 AEMLVQSQTGNIELLPALP-TAWKTGSFKGLKVQGGGEVSAKWAEGKMTEAGL 770
>gi|313204128|ref|YP_004042785.1| alpha-L-fucosidase [Paludibacter propionicigenes WB4]
gi|312443444|gb|ADQ79800.1| Alpha-L-fucosidase [Paludibacter propionicigenes WB4]
Length = 826
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 284/769 (36%), Positives = 424/769 (55%), Gaps = 54/769 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA ++ +A+PI NGR+ AMV G E L+LNE + W+G P NPD K L
Sbjct: 29 LKLWYDKPAANWNEALPIANGRIAAMVHGNPSKELLQLNESSFWSGGPSRNDNPDGLKGL 88
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+R+ + G Y A S + +Q +G++ + F ++ K+ + Y R+LD
Sbjct: 89 DSIRTYIFQGNYTRANTLSNQFLTAKQLHGSKFQSIGNLNISFPNAE-KFTD--YYRDLD 145
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
+ A + V Y V +V + RE +S PDQVIV +++ S+ G L+F + DS L S
Sbjct: 146 IENALSSVSYKVDDVIYKREILASIPDQVIVVRLTASKPGKLTFTTNFDSQLKKTSVALD 205
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N+ + M G + ++ G ++F A K+ ++ GT+S + D LKV+ ++
Sbjct: 206 NHTLEMTGL---------SGTHEGVIGQVKFDA--RAKVINNGGTVSFVSDS-LKVKNAN 253
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
++++ +++F ++ + + T + + L ++ + H+ YQK F RV
Sbjct: 254 EVIIMVSIATNF----VDYQNLTANETQKCIQYLSVAEKKPFNTILKNHISTYQKYFKRV 309
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L S T + +R+K+F DP LV L +QFGRYLLI SS+P
Sbjct: 310 NFDLGTSEAAKAT------------TKDRIKNFSKSYDPELVSLYYQFGRYLLICSSQPN 357
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
Q +NLQGIWN +P WDS +NIN EMNYW + NL+E EPL + LS +G +
Sbjct: 358 GQPSNLQGIWNGSNNPMWDSKYTININTEMNYWPAEKTNLTEMHEPLIKMIKELSQSGKE 417
Query: 429 TAQVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
TA+V Y ++GWV HH TDIW + AD G+ WPMGGAWL HLWE Y Y +
Sbjct: 418 TAKVMYGSNGWVAHHNTDIWRITGVVDFADAGQ-----WPMGGAWLSQHLWEKYLYNGNL 472
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
+LE YP+L+ F D+LIE +L +PS SPE+ P G + + T+D
Sbjct: 473 KYLES-VYPVLKSACEFYKDFLIEEPTHKWLVVSPSVSPEN---TPQGHKSALVAGCTID 528
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
++ ++F+ I AA++L+K+ +V+ K L RL P +I G + EW +D+ + +
Sbjct: 529 NQLLFDLFTKTIKAAKLLKKDASLMVD-FQKILDRLPPMQIGRLGQLQEWLEDWDNAKDQ 587
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
+RH+SHL+GLFP + IT P L AA+ +L RG+ GWS+ WK WARL D HA
Sbjct: 588 NRHVSHLYGLFPSNQITPYTTPQLFDAAKTSLLYRGDVSTGWSMGWKVNFWARLLDGNHA 647
Query: 664 YRMVKRLFNLVDPEHEKHFE---GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
+++ LV+P ++ GG Y N+F AHPPFQID NFG T+ + EML+QS
Sbjct: 648 KKLISDQLTLVEPGQGRNSTMGGGGTYPNMFDAHPPFQIDGNFGCTSGITEMLLQSHDGS 707
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+ +LPALP D W +G + GLKA GG VSI WKD +V I SN+ N
Sbjct: 708 VDILPALP-DDWKNGSITGLKAYGGFEVSIIWKDNKAQKVIIKSNFGGN 755
>gi|160882310|ref|ZP_02063313.1| hypothetical protein BACOVA_00258 [Bacteroides ovatus ATCC 8483]
gi|156112318|gb|EDO14063.1| hypothetical protein BACOVA_00258 [Bacteroides ovatus ATCC 8483]
Length = 1100
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 291/764 (38%), Positives = 410/764 (53%), Gaps = 52/764 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA+H+ +A+PIGN RLGAMV+GG E L++NE+T W G P +P A L
Sbjct: 288 LKLWYNRPAQHWEEALPIGNSRLGAMVYGGAGREELQINEETFWAGGPHHNNSPKAKTVL 347
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ R L+ + EA + F P + L L H K Y RELD+
Sbjct: 348 DEARRLIFEDKTMEAQKLINPNFFSGPHGMSYLNMGSLLILQPGHEK--ATNYYRELDIE 405
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS--------LLDN 183
ATA Y V V +TR FSS DQVI+ ++ + G+L F++ D+ LL
Sbjct: 406 DATATTCYEVDGVTYTRTAFSSMTDQVIIVRLEANRKGALLFSLGYDTPKEADGSALLHP 465
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
V GN + +C G A+A ++++ D ++ + +L
Sbjct: 466 VVKVRGNKLTM---QCIGMEQEGVASA--------IKGEWQVQVVHDGKQVN--QPDRLG 512
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+G+ A + L A+++F +N D + + + + L++ Y H YQ
Sbjct: 513 VQGATTATVYLSAATNF----VNYKDVSGNASRRAAAYLKTALKQPYPKALEAHSKAYQT 568
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RV + L P I + P+ +RV F +D +L+ LL+Q+GRYLLI
Sbjct: 569 QFNRVKLDL---PATIAS---------LAPTNQRVADFNRVDDRNLMALLYQYGRYLLIC 616
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIW L WDS +NIN EMNYW + NLSEC EPLF L LS
Sbjct: 617 SSQPGGQPANLQGIWCRSLHAPWDSKYTININTEMNYWPAEVTNLSECHEPLFSMLEDLS 676
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ G +TA+ Y A GWV HH TD+W + G W +WP GGAWLC HLW+HY YT D
Sbjct: 677 VTGHETARTLYGAKGWVAHHNTDLWRIAGPVDG-ATWGMWPNGGAWLCQHLWQHYLYTGD 735
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+ FL K YP+++G A F++ L++ G+L T PS SPEH + A C TM
Sbjct: 736 QAFLRKY-YPVMKGAADFMMSHLVKHPKYGWLVTVPSVSPEHGYTASTLTAGC-----TM 789
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D I ++ + AA +L + A + + + +L P +I + I EW D DP+
Sbjct: 790 DNQIAFDILNNTRLAATILGE-PTAYQDSLQATCTQLPPMQIGKYNQIQEWMVDADDPKN 848
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH+SHL+GL+P + I+ P L AA+ TL +RG++ GWSI WK WAR+ D H
Sbjct: 849 EHRHISHLYGLYPSNQISPHLQPTLFAAAKNTLLQRGDQATGWSIGWKINFWARMLDGNH 908
Query: 663 AYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
AYR+++ + L+ D + ++H +G Y NLF AHPPFQID NFG+TA V+EML+QS
Sbjct: 909 AYRIIRNMLRLLPGDGKQKEHPDGRTYPNLFDAHPPFQIDGNFGYTAGVSEMLLQSHDGA 968
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++LLPALP +W G + GL ARGG V + W L I S
Sbjct: 969 VHLLPALP-KEWREGRISGLVARGGFVVDMEWSGAQLFRAEICS 1011
>gi|150005172|ref|YP_001299916.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149933596|gb|ABR40294.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
Length = 814
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 285/764 (37%), Positives = 430/764 (56%), Gaps = 50/764 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NPDA + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G + D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L VT + RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSLDLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+I ++++ II+ A +L + + ++ + L + P +I G + EW D+ +P+ HR
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWMMDWDNPQDVHR 586
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFPG+ I+ + P+L AA +L RG+ GWS+ WK LWARL D +HAY+
Sbjct: 587 HVSHLYGLFPGNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGDHAYK 646
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+QS +YLLP
Sbjct: 647 LITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLMQSHDGFIYLLP 703
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ALP +W G V G+ ARGG + + WK+G + + + S N
Sbjct: 704 ALP-AQWKEGSVSGIIARGGFELDLSWKNGKVSRLVVKSRNGGN 746
>gi|456392980|gb|EMF58323.1| hypothetical protein SBD_0995 [Streptomyces bottropensis ATCC
25435]
Length = 974
Score = 488 bits (1255), Expect = e-135, Method: Compositional matrix adjust.
Identities = 291/738 (39%), Positives = 408/738 (55%), Gaps = 52/738 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N ++++R V + Q+
Sbjct: 61 ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANIAEIRRRVFADQWGP 120
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + G PA YQ +G++ L F + Y R LDL TATA Y +
Sbjct: 121 AQDLIDQTMLGSPAGQLAYQPVGNLLLSFGGA---TGVSQYNRTLDLTTATALTTYVLNG 177
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V + RE F+S PD+VIV +++ + SL+FN + DS I ++G
Sbjct: 178 VRYQREVFASAPDRVIVVRLTADRANSLTFNATFDSPQRTTVSSPDGATIALDGTS---- 233
Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
A ++F A+ ++ GT+S+ L+V G+ +L+ SS+
Sbjct: 234 ----ATMEGIAGRVRFLALANAAVTG--GTVSS-SGGTLRVSGATSVTVLVSIGSSY--- 283
Query: 264 FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
+N + D + S L + R++ L +RHL DYQ LF+RVS+ L R+ T
Sbjct: 284 -VNFRNVAGDYQGTARSRLNAARDVGIDALRSRHLADYQALFNRVSVDLGRT-------T 335
Query: 324 CSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
+++ P+ R+ DP LLFQFGRYLLISSSRPGTQ ANLQGIWN+ ++
Sbjct: 336 AADQ-----PTDVRIAQHAQVNDPQFSALLFQFGRYLLISSSRPGTQPANLQGIWNDQMA 390
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
P+WDS +N NL MNYW + NLSEC P+FD + L++ G++ AQ Y A GWV HH
Sbjct: 391 PSWDSKFTINANLPMNYWPADTTNLSECFLPVFDMINDLTVTGARVAQAQYGAGGWVTHH 450
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
TD W +S G W +W GGAWL T +W+HY +T D DFL YP L+G A F L
Sbjct: 451 NTDAWRGASVVDG-AQWGMWQTGGAWLATLIWDHYLFTGDIDFLRSN-YPALKGAAQFFL 508
Query: 504 DWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
D L+ GYL TNPS SPE P A V TMD I+R++F+++ A E+L
Sbjct: 509 DTLVAHPTLGYLVTNPSNSPE----LPHHANATVCAGPTMDNQILRDLFNSVARAGELLG 564
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIE 622
+ + V RL P ++ G++ EW D+ + E +HRH+SHL+GL P + IT
Sbjct: 565 VDAAFRAQAVAAR-DRLAPMRVGSRGNVQEWLADWVETERNHRHVSHLYGLHPSNQITKR 623
Query: 623 KNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHF 682
P L +AA +TL+ RG++G GWS+ WK WAR+ D A+++++ +LV +
Sbjct: 624 GTPQLYEAARRTLELRGDDGTGWSLAWKINFWARMEDGARAHKLIR---DLVRTDR---- 676
Query: 683 EGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKA 742
L N+F HPPFQID NFG T+ +AEML+QS +L++LPALP W +G V GL+
Sbjct: 677 ---LAPNMFDLHPPFQIDGNFGATSGIAEMLLQSHNGELHVLPALP-AAWPTGRVSGLRG 732
Query: 743 RGGETVSICWKDGDLHEV 760
RGG TV W G + V
Sbjct: 733 RGGYTVGAEWSSGRIEFV 750
>gi|153812246|ref|ZP_01964914.1| hypothetical protein RUMOBE_02645 [Ruminococcus obeum ATCC 29174]
gi|149831653|gb|EDM86740.1| hypothetical protein RUMOBE_02645 [Ruminococcus obeum ATCC 29174]
Length = 754
Score = 487 bits (1254), Expect = e-135, Method: Compositional matrix adjust.
Identities = 282/793 (35%), Positives = 416/793 (52%), Gaps = 63/793 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + F+ PA+ + +A+P+GNG +GAM +G + E ++LN DTLW+G N +
Sbjct: 9 LTLAFDRPAEAWNEALPLGNGSMGAMSYGRLREEKIELNLDTLWSGTGRSKENKNTDVDW 68
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R + G+Y EA A + G + Y G++ ++ + LK +Y+R+L +
Sbjct: 69 DFLRQKIFDGEYEEAEAYCKENILGDWTESYLPAGNLHIDANIPELK-EHGSYQRQLSIK 127
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A +V Y + RE F S + V+ SL +SLDS + + G +
Sbjct: 128 DALEQVVYRQDGQGYLREFFVSMSEPVMALHYRADAGSSLELRISLDSQIRHVCSGYGTS 187
Query: 192 QIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
++++EG+ P P + ++ KG +F+ + I + +G I +D L V
Sbjct: 188 ELVLEGQAPVYAAPLYYSCEQPIVYEEGKGTRFA--IGISVQAPKGCIRQ-KDNTLLVTA 244
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + F ++ S L+ I +LSY L H Y F
Sbjct: 245 DGDVYIYLSGITDFQ--------AQDSYLSRKKQMLEQICDLSYPQLKEAHKKAYAAYFD 296
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R+ + L Q D L+ +F + RYL+ISSS+
Sbjct: 297 RMDLTLD-------------------------PGIQND----LITKMFHYARYLMISSSK 327
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGTQ ANLQGIWN +L W S VNIN EMNYW + NLS+C E LFD + + +G
Sbjct: 328 PGTQCANLQGIWNHNLRAPWSSNYTVNINTEMNYWMAEKANLSDCHESLFDLIERTASHG 387
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS------ADRGKVVWALWPMGGAWLCTHLWEHYNY 480
KTA+ Y +GWV HH DIW SS D +++WPM WLC+HLWEHY Y
Sbjct: 388 KKTAKEVYHLNGWVSHHNVDIWGHSSPVGYFGQDENPCTYSMWPMSSGWLCSHLWEHYRY 447
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
T+DR+FL K+A+PL+ G F L +L+ +DGYL T PSTSPE+ F A D + V++ S
Sbjct: 448 TLDREFLRKKAFPLIRGAVEFYLGYLVP-YDGYLVTAPSTSPENTFTASDHSVHSVTFGS 506
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD +I++E+F + A E+L+ + L+++V +L +L P KI ++G + EW D+ +
Sbjct: 507 TMDCSILKELFGNYLKACEILDITD--LMDEVKAALKKLLPFKIGKEGQLQEWYLDYPEV 564
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
++HHRH+S L+GL+PG+ I E + +L A L +RG EG GW + WK LWARL D
Sbjct: 565 DMHHRHVSQLYGLYPGNLIHRE-DKELLAACRVALDRRGNEGTGWCMAWKACLWARLGDG 623
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
E A +++K ++ E+ GG Y N+ AHPPFQID NFGF AAV EMLVQ +
Sbjct: 624 ERALKLLKNQLHVTKEENCSLVGGGTYPNMLCAHPPFQIDGNFGFAAAVLEMLVQYQDDR 683
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
++ LPALP ++W G + GL+A GG T+ WKD + E + S D + L Y
Sbjct: 684 IFFLPALP-EEWKDGKISGLRAPGGITIDFAWKDRCITECSLQSQ-----TDMVRILLYN 737
Query: 781 GTSVKVNLSAGKI 793
G K+ L A I
Sbjct: 738 GIEKKIMLKADTI 750
>gi|198275212|ref|ZP_03207743.1| hypothetical protein BACPLE_01371 [Bacteroides plebeius DSM 17135]
gi|198271795|gb|EDY96065.1| hypothetical protein BACPLE_01371 [Bacteroides plebeius DSM 17135]
Length = 800
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 299/794 (37%), Positives = 423/794 (53%), Gaps = 73/794 (9%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
+P+GNG LGA+V+G V E ++LNE+T+W+G P + NPDAP+ L +R L+ G+Y E
Sbjct: 56 GLPLGNGSLGAVVFGDVAMERIQLNEETMWSGSPQECDNPDAPQYLDKIRQLLLEGKYKE 115
Query: 87 ATAASVKL-------------FGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
AT + + P +Q +GD+ ++F + K A YRREL+L A
Sbjct: 116 ATELTNRTQVCTGKGSGGGNGSTVPFGCFQTMGDLWIDFAN---KEAYSDYRRELNLEDA 172
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
TA V Y+ G+V F RE F S+PDQV+V ++S + +SF + ++ + Q+
Sbjct: 173 TATVTYTQGDVHFKREIFISHPDQVMVIRLSADKQQQMSFTCRMTRPEYFFTHTE-DGQL 231
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
IM G + G+Q+ A L+ + +G D L V G+D +LL
Sbjct: 232 IMSGALSDGK---------GGDGLQYMARLK---AVTKGGEVICTDSTLTVSGADEVMLL 279
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L AS+ + P +D S + ++ ++ LY H +Y F R S QL+
Sbjct: 280 LAASTDYQ--LTYPHYKGRDYLSLTRESIAKAEKKTFESLYQAHQKEYAAYFDRASFQLA 337
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
SP + TD E A ++ +P L EL+FQ+GRYLLISSSRPGT AN
Sbjct: 338 ESPDTLATDVLVAE-----AKAGKI-------NPHLYELMFQYGRYLLISSSRPGTMPAN 385
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIW L W+ H ++N+EMNYW + NLSE P+FD + L G+KTAQ
Sbjct: 386 LQGIWANKLQTPWNGDYHTDVNIEMNYWPAEVTNLSEMHLPMFDLIASLVAPGTKTAQTQ 445
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
Y GWV+H T++W +S W + AW+C H+ EHY +T D+DFL K+ YP
Sbjct: 446 YQKKGWVVHPITNVWGYTSPGE-SASWGMHTGAPAWICQHIGEHYRFTGDKDFL-KKMYP 503
Query: 494 LLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFS 552
+L+G F +DWL+ + G L + P+ SPE+ F+APDG +S T D I ++F
Sbjct: 504 VLKGAVEFYMDWLVTDPKTGKLVSGPAVSPENTFVAPDGSQCQISMGPTHDQQTIWQLFD 563
Query: 553 AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFG 612
A+E L+ N DA + V + +L T+I DG IMEWAQ+F + E HRH+SHLF
Sbjct: 564 DFEMASEALQIN-DAFTQAVGDAKGKLLETRIGSDGRIMEWAQEFPEAEPGHRHISHLFA 622
Query: 613 LFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMVKR 669
+ PG I + + P+L +AA K++ R G GWS W + +ARLH E A +
Sbjct: 623 VHPGSQINLLQTPELAEAASKSMDYRISHGGGHTGWSSAWLISQYARLHRSEKAKESL-- 680
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL--NDLY---LL 724
+K E L NLF PPFQIDANFG TA +AEML+QS + D Y LL
Sbjct: 681 ---------DKVLEKSLNPNLFTQCPPFQIDANFGTTAGIAEMLLQSHVYEQDAYTIQLL 731
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
P+LP W +G GLKARGG VS+ WKDG + I S N F+ + Y+G +
Sbjct: 732 PSLP-AGWKNGKFSGLKARGGFEVSVEWKDGVMVHAEIKSLLGN----PFR-VWYQGQYI 785
Query: 785 KV-NLSAGKIYTFN 797
+ NL GK + +N
Sbjct: 786 ETGNLEKGKTWKWN 799
>gi|312131915|ref|YP_003999255.1| alpha-l-fucosidase [Leadbetterella byssophila DSM 17132]
gi|311908461|gb|ADQ18902.1| Alpha-L-fucosidase [Leadbetterella byssophila DSM 17132]
Length = 793
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 290/774 (37%), Positives = 423/774 (54%), Gaps = 70/774 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ F PA+HFT+++P+GNGRLGAMV+G E + LNE +LW+G P D +A K+L
Sbjct: 23 LLFYAPARHFTESLPLGNGRLGAMVFGQTAKERIALNEISLWSGGPQDADREEAYKSLKP 82
Query: 75 VRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKYAE 121
++ L+ G+ EA K F P YQ LGD+ LE+ D +
Sbjct: 83 IQQLLLEGKNKEAQTLLEKEFIAKGRGSGFGRGAKDPYGSYQTLGDLFLEWKDGEVS--- 139
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y+R LDL+ A A +++ ++ T E F+ + +I ++ S++ L V L S
Sbjct: 140 -NYKRWLDLDNALATTQFTRNGIQITEEVFTDFKNDLIWVRLRSSKAKGLYLKVGL-SRE 197
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+N + +I + G+ P A +P G++F+AIL+ A D K
Sbjct: 198 ENAQVQADSKEIKLWGQLP---------AGSEP-GMKFAAILQ----------EAHVDGK 237
Query: 242 LKVEGSDW-------AVLLLVASSSF-DGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++VEG+ W +L + A++++ +G I ++D T ++ Q + L+YS
Sbjct: 238 VEVEGNTWNIVGASEVILQISAATNYHEGKLI-----EEDVTQKARKYFQ--KGLTYSAA 290
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVEL 352
+ L+ +Q FHR +QL ++ + + + +R+K + D L L
Sbjct: 291 FKSSLEKFQSYFHRSELQLK-----------GQDKLAHLSTPDRLKRLAEGKSDLDLYAL 339
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+ +GRYLLI SSRPG ANLQG+W + W+ H+NIN++MNYW + L E
Sbjct: 340 YYHYGRYLLICSSRPGLLPANLQGLWAVEYQAPWNGDYHLNINVQMNYWPAELTGLGELA 399
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
EPL F L NG KTA+ Y A GWV H ++ W +S G W GGAWLC
Sbjct: 400 EPLHRFTANLVKNGEKTAKAYYQAEGWVAHVISNPWFFTSPGEG-ADWGSTLTGGAWLCE 458
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDG 531
H+WEHY +T D +FL K YP+L+G A FL LIE +G+L T PS SPEH ++ PDG
Sbjct: 459 HIWEHYRFTKDIEFLRKY-YPVLKGSAQFLSSILIEEPKNGWLVTAPSNSPEHAYVLPDG 517
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ TMDM I RE+F+A+I +AE+L +++ +++ + L P ++ ++G +
Sbjct: 518 TKVNTAMGPTMDMQICRELFNAVIQSAEILGVDKE-FRDELSAKVRNLAPNRVGKNGDLN 576
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D++D EVHHRH+SHL+GL P I + P+L +AA KTL+ RG+ G GWS+ WK
Sbjct: 577 EWLEDYEDEEVHHRHVSHLYGLHPYDEINVYDTPELAEAARKTLEIRGDAGTGWSMAWKI 636
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WARL D +H+ ++ +L E GG Y NLF AHPPFQID NFG TA +AE
Sbjct: 637 NFWARLRDGDHSLSLLNQLLKPAFEEKIVMSGGGSYPNLFCAHPPFQIDGNFGGTAGIAE 696
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
ML+QS + L LLPALP W G V GL+ARGG V I WK+G + I S
Sbjct: 697 MLLQSGDHFLVLLPALP-KAWKVGKVTGLQARGGFKVDIEWKNGQISTANIKSQ 749
>gi|443288639|ref|ZP_21027733.1| Secreted Ricin B-related lectin [Micromonospora lupini str. Lupac
08]
gi|385888040|emb|CCH15807.1| Secreted Ricin B-related lectin [Micromonospora lupini str. Lupac
08]
Length = 952
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 291/740 (39%), Positives = 404/740 (54%), Gaps = 56/740 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N L+++R V + Q+ +
Sbjct: 61 ALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAANLAEIRRRVFADQWTQ 120
Query: 87 AT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + + G P YQ +GD+ L F + Y+R LDL TAT Y +
Sbjct: 121 AQDLINQTMLGSPVGQLAYQPVGDLRLAFGSAS---GASQYQRTLDLTTATTTTSYVLNG 177
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V F RE F+S PDQVIV +++ + +++F + S I ++G
Sbjct: 178 VRFQREMFASAPDQVIVIRLTADRANAITFTATFSSPQRTTVSSPDAATIGLDG------ 231
Query: 204 IPPKANANDDPKGIQFSA-ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
+ +GI L + + G + L+V G+ LL+ SS+
Sbjct: 232 ------VSGSMEGITGQVRFLALANASVSGGTVSSSGGTLRVSGATSVTLLVSIGSSY-- 283
Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
+N D + L + R + + L RH+ DYQ LF+RVSI L R+
Sbjct: 284 --VNYRTVNGDYQGIARRHLDAARAIGFDQLRGRHVADYQALFNRVSIDLGRT------- 334
Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
T +++ D R+ + DP LLFQ+GRYLLISSSRPG+Q ANLQGIWN+ +
Sbjct: 335 TAADQTTDV-----RIAQHASVNDPQFSALLFQYGRYLLISSSRPGSQPANLQGIWNDQM 389
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIH 442
+P+WDS +N NL MNYW + NL+EC P+FD + L++ G++TAQV Y A GWV H
Sbjct: 390 APSWDSKFTINANLPMNYWPADTTNLAECYLPVFDMIKDLTVTGARTAQVQYGAGGWVTH 449
Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
H TD W SS + +W +W GGAWL T +W+HY +T D +FL YP ++G A F
Sbjct: 450 HNTDAWRGSSV-VDEALWGMWQTGGAWLATMIWDHYQFTGDIEFLRAN-YPAMKGAAQFF 507
Query: 503 LDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
LD L+ GYL TNPS SPE A V TMD I+R++F+ + A+EVL
Sbjct: 508 LDTLVSHPTLGYLVTNPSNSPELRHHTN----ASVCAGPTMDNQILRDLFNGVARASEVL 563
Query: 562 EKNEDALVE-KVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTIT 620
N DA +VL + RL PT++ G++ EW D+ + E HRH+SHL+GL P + IT
Sbjct: 564 --NVDATYRAQVLTARDRLPPTRVGSRGNVQEWLADWVETERTHRHVSHLYGLHPSNQIT 621
Query: 621 IEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEK 680
P L +AA +TL+ RG++G GWS+ WK WARL D A+++ L +LV +
Sbjct: 622 KRGTPQLHQAARQTLELRGDDGTGWSLAWKINYWARLEDGTRAHKL---LGDLVRTDR-- 676
Query: 681 HFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGL 740
L N+F HPPFQID NFG T+ +AEML+QS +L+LLPALP W +G V GL
Sbjct: 677 -----LAPNMFDLHPPFQIDGNFGATSGIAEMLLQSHAGELHLLPALP-SAWPTGQVTGL 730
Query: 741 KARGGETVSICWKDGDLHEV 760
+ RGG TV W + V
Sbjct: 731 RGRGGYTVGAAWSSSRIELV 750
>gi|222106243|ref|YP_002547034.1| hypothetical protein Avi_5141 [Agrobacterium vitis S4]
gi|221737422|gb|ACM38318.1| conserved hypothetical protein [Agrobacterium vitis S4]
Length = 741
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 296/787 (37%), Positives = 426/787 (54%), Gaps = 59/787 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ ++ A +T+A+P+GNGRLGAMV+G +E L++NE T W+G P NPDA AL
Sbjct: 5 ELWYDRAASVWTEALPVGNGRLGAMVFGDAWNERLQINESTFWSGGPYQPINPDARAALP 64
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+VR+L+ + +Y EA + + D YQ +GD+ L D H YRR LDL
Sbjct: 65 EVRNLILAERYQEADRKAYEGAMAKPDRQTSYQPIGDVWL---DLHHDMTVTNYRRSLDL 121
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TA A +Y V F R+ F+S VIV KIS + G+LS V L S + +
Sbjct: 122 ETAVAVTQYDCHGVHFRRDVFASAIQDVIVCKISVDQPGALSMTVMLSSPQNGDPIDIAD 181
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ +GR N ++F+ +++ + G + + ++ ++V +
Sbjct: 182 ATLGYDGR--------NRRQNGIDSALRFA--FRVRVLAEGGFVD-IGEETIRVREASSV 230
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+LL+ A +SF N DP ++ + L + LSY L H+ ++++LF+R+ I
Sbjct: 231 MLLIDAGTSFQ----NYRTVDGDPQAQIKARLDAAAMLSYEALLEAHVTEHRRLFNRMQI 286
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L P + T+P+ +RV ++ +DPSL L Q+GRYL IS SRPGTQ
Sbjct: 287 ALGDKP------------VPTLPTDKRVAAYAEGDDPSLAALYLQYGRYLAISCSRPGTQ 334
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWNED+ P W S VNINLEMNYW + NLSE PL + + ++ G + A
Sbjct: 335 AANLQGIWNEDILPAWGSKYTVNINLEMNYWLADVANLSETFLPLVELVEDVAETGREMA 394
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ +Y A GWV+HH TDIW + G W LWPMGGAWLC L++HY + DR LE R
Sbjct: 395 KAHYGARGWVLHHNTDIWRATGPIDGP-HWGLWPMGGAWLCAQLYDHYRFNPDRAVLE-R 452
Query: 491 AYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YPL++G F LD L+ D YL T PS SPE+ P G C + MD I+R+
Sbjct: 453 IYPLIKGAVEFALDTLVALPDSNYLGTCPSLSPENSH--PFGSSLCA--APAMDNQILRD 508
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHHRHL 607
+F A A+ L ++ + E + RL +I + G + EW D+ PE HRH+
Sbjct: 509 LFEAFADASATLGRDGELRTEAA-ATRARLPEDRIGKGGQLQEWMDDWDLDAPEQQHRHV 567
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHL+GL+P I + P++ KAA+ L++RG++ GW I W+ LWARL + R
Sbjct: 568 SHLYGLYPSLQIDPLETPEMAKAAQVVLERRGDDATGWGIGWRLNLWARLGN---GNRAA 624
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
+ L L+ PE Y NL AHPPFQID NFG A + EMLVQS +L LLPAL
Sbjct: 625 EVLVKLLTPERT-------YPNLMDAHPPFQIDGNFGGAAGIVEMLVQSRPGELRLLPAL 677
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVN 787
P ++WSSG +KG++ RGG TV + W+ G L + I + H T+ ++V
Sbjct: 678 P-EQWSSGSLKGVRIRGGHTVDLSWQAGKLTSLRITAG-----HSGPLTIRQPAGVLEVQ 731
Query: 788 LSAGKIY 794
L G+++
Sbjct: 732 LREGEVW 738
>gi|255692382|ref|ZP_05416057.1| fibronectin type III domain protein [Bacteroides finegoldii DSM
17565]
gi|260621848|gb|EEX44719.1| hypothetical protein BACFIN_07502 [Bacteroides finegoldii DSM
17565]
Length = 826
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 287/769 (37%), Positives = 428/769 (55%), Gaps = 49/769 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 25 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKA 84
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 85 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 141
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 142 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 201
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R P + + A L++K G + D L V+G
Sbjct: 202 IYGKKGLRLEGITYGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 251
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 252 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 306
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + S+ N P R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 307 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 354
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 355 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 414
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 415 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 472
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 473 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 528
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFKDPE 601
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +D+ P
Sbjct: 529 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFEDWDHPN 586
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GL+PG+ I+ ++P L +AA+ TL +RG+ GWS+ WK WAR+ D +
Sbjct: 587 DRHRHISHLWGLYPGYQISPYRSPVLFEAAKNTLIQRGDPSTGWSMGWKVCFWARMLDGD 646
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++K V PE +K GG Y NLF AHPPFQID NFG TA +AEMLVQS +
Sbjct: 647 HAYQLIKNQLTYVSPEIQKGQGGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLVQSHDGAI 706
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSNYSNN 769
+LLP+LP +W SG VKGL+ARGG + + WKDG L + + S N
Sbjct: 707 HLLPSLP-SEWKSGTVKGLRARGGFLIDELIWKDGKLVKAVLRSETGGN 754
>gi|423311885|ref|ZP_17289822.1| hypothetical protein HMPREF1058_00434 [Bacteroides vulgatus
CL09T03C04]
gi|392689264|gb|EIY82542.1| hypothetical protein HMPREF1058_00434 [Bacteroides vulgatus
CL09T03C04]
Length = 814
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 284/764 (37%), Positives = 431/764 (56%), Gaps = 50/764 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NPDA + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G + D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ D+ D + D RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSL-------DLGIDKYAGVTTDM-----RVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+I ++++ II+ A +L + + ++ + L + P +I G + EW D+ +P+ HR
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWMMDWDNPQDVHR 586
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +HAY+
Sbjct: 587 HVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGDHAYK 646
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+QS +YLLP
Sbjct: 647 LITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLMQSHDGFIYLLP 703
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ALP +W G V G+ ARGG + + WK+G + + + S + N
Sbjct: 704 ALP-AQWKEGSVNGIIARGGFELDLSWKNGKVSRLVVKSRHGGN 746
>gi|423303028|ref|ZP_17281049.1| hypothetical protein HMPREF1057_04190 [Bacteroides finegoldii
CL09T03C10]
gi|408470357|gb|EKJ88892.1| hypothetical protein HMPREF1057_04190 [Bacteroides finegoldii
CL09T03C10]
Length = 1100
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 288/764 (37%), Positives = 410/764 (53%), Gaps = 52/764 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA+ + +A+PIGN RLGAMV+GG E L++NE+T W G P +P A L
Sbjct: 288 LKLWYNRPAQRWEEALPIGNSRLGAMVYGGAGHEELQINEETFWAGGPHHNNSPKAKAVL 347
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ R L+ + EA + F P + L L H K Y RELD+
Sbjct: 348 DEARRLIFEDKTMEAQKLINPNFFSGPHGMSYLNMGSLLILQPGHEK--ATNYYRELDIE 405
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY----- 186
ATA Y V V +TR FSS DQVI+ ++ + G+L F++ D+ + +
Sbjct: 406 DATATTCYEVDGVTYTRTAFSSMTDQVIIVRLEANRKGALQFSLGYDTPKEADGFAPLHP 465
Query: 187 ---VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
V GN + +C G A+A ++++ D ++ + +L
Sbjct: 466 IVKVRGNRLTM---QCTGMEQEGVASA--------IKGEWQVQVVHDGKQVN--QPDRLG 512
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+G+ A + L A+++F +N D + + + + L++ Y H YQ
Sbjct: 513 VQGATTATVYLSAATNF----VNYKDVSGNASRRAAAYLKTALKQPYPKALEAHSKAYQT 568
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RV + L P I + P+ +RV F +D +L+ LL+Q+GRYLLI
Sbjct: 569 QFNRVKLDL---PATIAS---------LAPTNQRVADFNRVDDRNLMALLYQYGRYLLIC 616
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIW L WDS +NIN EMNYW + NLSEC EPLF L LS
Sbjct: 617 SSQPGGQPANLQGIWCRSLHAPWDSKYTININTEMNYWPAEVTNLSECHEPLFSMLEDLS 676
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ G +TA+ Y A GWV HH TD+W + G W +WP GGAWLC HLW+HY YT D
Sbjct: 677 VTGHETARTLYGAKGWVAHHNTDLWRIAGPVDG-ATWGMWPNGGAWLCQHLWQHYLYTGD 735
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+ FL K YP+++G A F++ L++ G+L T PS SPEH + A C TM
Sbjct: 736 QAFLRKY-YPVMKGAADFMMSHLVKHPKYGWLVTVPSVSPEHGYTASTLTAGC-----TM 789
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D I ++ + AA +L + A + + + +L P +I + I EW D DP+
Sbjct: 790 DNQIAFDILNNTRLAATILGE-PTAYQDSLQATCTQLPPMQIGKYNQIQEWMVDADDPKN 848
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH+SHL+GL+P + I+ P L AA+ TL +RG++ GWSI WK WAR+ D H
Sbjct: 849 EHRHISHLYGLYPSNQISPHLQPTLFAAAKNTLLQRGDQATGWSIGWKINFWARMLDGNH 908
Query: 663 AYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
AYR+++ + L+ D + ++H +G Y NLF AHPPFQID NFG+TA V+EML+QS
Sbjct: 909 AYRIIRNMLRLLPGDGKQKEHPDGRTYPNLFDAHPPFQIDGNFGYTAGVSEMLLQSHDGA 968
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++LLPALP ++W G + GL ARGG V + W L I S
Sbjct: 969 VHLLPALP-EEWREGRISGLVARGGFVVDMEWSGAQLFRAEICS 1011
>gi|189468049|ref|ZP_03016834.1| hypothetical protein BACINT_04443 [Bacteroides intestinalis DSM
17393]
gi|189436313|gb|EDV05298.1| hypothetical protein BACINT_04443 [Bacteroides intestinalis DSM
17393]
Length = 830
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 286/774 (36%), Positives = 425/774 (54%), Gaps = 52/774 (6%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N ++ LK+ ++ PA + +A+P+GNGR+G MV+G E +LNE+T+W G P +
Sbjct: 18 NLQAQQEDQTLKLWYDKPATQWVEALPLGNGRIGTMVFGDPVHEQFQLNEETVWGGSPHN 77
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSH 116
TNP A AL +R L+ G+ EA T S G P YQ +G + L+FD +
Sbjct: 78 NTNPKAKDALPRIRQLIFEGKNKEAQELCGPTICSQSANGMP---YQTVGSLHLDFDGIN 134
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+Y + Y R+LD+ A A +++ V +TRE ++S PDQV+V +++ S+ S+SF
Sbjct: 135 -EYND--YYRDLDIEKAIATTRFTANGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAK 191
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK---GIQFSAILEIKISDDRGT 233
Y ++ P K + AND ++F+A+ +I ++ G
Sbjct: 192 ---------YSTPYKSSVIRCISPRKELQLNGKANDHEGIEGKVEFTAL--TRIENNGGK 240
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ L D L+V+ ++ +V+L V S F+N D D + + L+ + N +Y
Sbjct: 241 LEILSDSTLQVKDAN-SVILYV---SIGTNFVNYKDVSGDALNSAQQYLKLV-NKNYPKS 295
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H++ YQK F+RVS+ L S I+ P+ RVK F + DP + L
Sbjct: 296 KASHINAYQKYFNRVSLNLG-----------SNAQINK-PTDVRVKEFSSSFDPQMAVLY 343
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN L WD +IN+EMNYW + +L E E
Sbjct: 344 FQFGRYLLICSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHE 403
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
P + ++I G ++A + Y GW +HH TDIW + A G + +WP AW C H
Sbjct: 404 PFLQLVKEVAIQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGS-SYGVWPTCNAWFCQH 461
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LW+ Y ++ D+++L + AYPL+ G F LD+L+ E + +L PS SPE+ +
Sbjct: 462 LWDRYLFSGDKNYLSE-AYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENSPAVNGQR 520
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
V +TMD ++ ++F ISAA+++ + A + + + L P ++ G + E
Sbjct: 521 TFVVVAGTTMDNQMVYDLFYNTISAAKLMNETT-AFTDSLQTVVNNLAPMQVGRWGQLQE 579
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ +P+ HRH+SHL+GL+PG I+ +P L +AA+K+L RG+ GWS+ WK
Sbjct: 580 WMHDWDNPKDRHRHISHLWGLYPGRQISAYHSPVLFEAAKKSLIGRGDHSTGWSMGWKVC 639
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL D HAY+++ L EK GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 640 LWARLLDGNHAYKLITD--QLHPTTDEKGQNGGTYPNLFDAHPPFQIDGNFGCAAGIAEM 697
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS-ICWKDGDLHEVGIYSN 765
LVQS ++LLPALP D W G +KG++ RGG TV+ + W++G L I SN
Sbjct: 698 LVQSHDGAIHLLPALP-DVWKEGTLKGIRCRGGFTVNEMKWENGKLQTAVIASN 750
>gi|333380444|ref|ZP_08472135.1| hypothetical protein HMPREF9455_00301 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826439|gb|EGJ99268.1| hypothetical protein HMPREF9455_00301 [Dysgonomonas gadei ATCC
BAA-286]
Length = 786
Score = 486 bits (1251), Expect = e-134, Method: Compositional matrix adjust.
Identities = 280/773 (36%), Positives = 429/773 (55%), Gaps = 61/773 (7%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
+ N + FN PA + ++IP+GNGR+G M WGGV E + LNE +LW G D NPDA
Sbjct: 20 SQNKWQYYFNEPASAWEESIPLGNGRIGMMPWGGVDKERIVLNEISLWAGNKQDADNPDA 79
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLF------GHPADV--YQLLGDIELEFDDSHLKYA 120
K L ++R L+ + EA K F G AD ++ G++ ++ A
Sbjct: 80 YKHLGEIRKLLFEKKNREAQELMYKTFTCKGEGGSGADYGKFENFGNLYIDITYPDASAA 139
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRR LD+N A + V Y+ G +++TRE+F+S D + + + + +S +L+ +SLD
Sbjct: 140 VSDYRRTLDMNNALSDVTYTKGGIKYTREYFTSFTDDIGIARYTADKSKALNMCISLDRD 199
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ +Y +G I G+ P A + +G+++ +++ ++ +G +
Sbjct: 200 ENYETYASGPVLYIF-GQLP---------AGEGKEGMKYLGMVK---AEHKGGQLFTNAR 246
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR---H 297
++++ +D L + +++++G E + N D TR H
Sbjct: 247 DIEIKNADEVTLFISLATNYNG-------------VEHEKLAGYLLNKLKGDYKTRKQKH 293
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
++ YQ LF+RV + L ++ +N D +P +R+++F D D L L Q+
Sbjct: 294 IEKYQNLFNRVDLTLGKN-----------KNSD-LPINKRLEAFVNDRSDYDLAALYMQY 341
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISS+R G NLQG+W + W+ H+NINL+MN W + CNLSE P
Sbjct: 342 GRYLLISSTREGGLPPNLQGLWAPQIHTPWNGDYHLNINLQMNLWPAEVCNLSELHLPTI 401
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+++ L+ G KTA+V Y + GWV H ++W +S W GAW+C HLWE
Sbjct: 402 EYVKSLTEPGHKTAKVYYNSDGWVTHILGNVWGFTSPGESPS-WGATNTSGAWMCQHLWE 460
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLAC 535
HY Y+ D ++L K YP ++G A F + L+E ++GYL T P+TSPE+ +I G +
Sbjct: 461 HYLYSQDVEYL-KSVYPTMKGAALFFENMLVEDPNNGYLVTAPTTSPENTYITESGDVLS 519
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
V STMD I+RE+F+ + AA++L +E + + RL PT I + G IMEW +
Sbjct: 520 VCAGSTMDNQIVRELFTNVSEAAKILNTDEQ-WIRTIETKKQRLAPTTIGKYGQIMEWLE 578
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D+++ E+HHRH+S L+GL PG+ +T EK P+L +AA+KTL++RG+E GWS+ WK WA
Sbjct: 579 DYEEAEIHHRHVSQLYGLHPGYELTYEKTPELMEAAKKTLERRGDESTGWSMAWKINFWA 638
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D + Y+++ +L+ P + H G Y NLF+AHPP QID NFG A +AEMLVQ
Sbjct: 639 RLKDGDRTYKLIG---DLLKPAGKGH---GTYPNLFSAHPPMQIDGNFGGCAGIAEMLVQ 692
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
S + LLP++P D W G VKGLK RGG VS WK+G + +V + +N
Sbjct: 693 SHAGYIELLPSVP-DAWKDGSVKGLKVRGGGEVSFAWKNGKVTDVDFIARTAN 744
>gi|160885575|ref|ZP_02066578.1| hypothetical protein BACOVA_03577 [Bacteroides ovatus ATCC 8483]
gi|156109197|gb|EDO10942.1| hypothetical protein BACOVA_03577 [Bacteroides ovatus ATCC 8483]
Length = 826
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 287/769 (37%), Positives = 428/769 (55%), Gaps = 49/769 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 25 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 84
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 85 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 141
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 142 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 201
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R P + + A L++K G + D L V+G
Sbjct: 202 IYGKKGLRLEGITHGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 251
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 252 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 306
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + S+ N P R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 307 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 354
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 355 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 414
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 415 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 472
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 473 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 528
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFKDPE 601
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +D+ P
Sbjct: 529 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFEDWDHPN 586
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GL+PG+ I+ ++P L +AA+ TL +RG+ GWS+ WK WAR+ D +
Sbjct: 587 DRHRHISHLWGLYPGYQISPYRSPVLFEAAKNTLIQRGDPSTGWSMGWKVCFWARMLDGD 646
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++K V PE +K GG Y NLF AHPPFQID NFG TA +AEMLVQS +
Sbjct: 647 HAYQLIKNQLTYVSPEIQKGQGGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLVQSHDGAI 706
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSNYSNN 769
+LLP+LP +W SG VKGL+ARGG + + WKDG L + + S N
Sbjct: 707 HLLPSLP-SEWKSGTVKGLRARGGFLIDELTWKDGKLVKAVLRSETGGN 754
>gi|335437953|ref|ZP_08560710.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
tiamatea SARL4B]
gi|334893557|gb|EGM31768.1| alpha/beta hydrolase domain-containing protein [Halorhabdus
tiamatea SARL4B]
Length = 784
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 287/769 (37%), Positives = 416/769 (54%), Gaps = 66/769 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + +A+PIGNGRLGAM++G +E ++ N DTLW G D TNPDA + + +VR
Sbjct: 13 YDAPASAWLEAVPIGNGRLGAMLFGRPGTERVQFNADTLWAGGHEDSTNPDAREHVEEVR 72
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ G+ A A A L G P + YQ GD+ ++ A YRRELDL+
Sbjct: 73 RLLFDGEVERAQALADEHLMGDPFRLRPYQSFGDLSIDVGHD----AVTDYRRELDLSAG 128
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
RV+Y + RE+F+S PD IV +++ GS++ V LD D + G+ +
Sbjct: 129 VTRVRYDHDGTTYVREYFASAPDDAIVIRLATDSPGSVTATVGLDRERDARADARGDT-L 187
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK--------LKVE 245
+ G P + +G+ F A +++ D G + + L+ E
Sbjct: 188 TLRGTVVDD---PDDDRGAGGEGMAFEA--RARVTADGGDVQRVTGADAPAGSSVGLRTE 242
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+D + L ++ + DP + L ++ + Y DL H+ D+++LF
Sbjct: 243 AADAVTIALTGFTTHE---------TDDPGEACEAVLDALADRPYHDLRETHVADHRELF 293
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L P D TD E +D V + E EDP L L QFGRYLLI+SS
Sbjct: 294 DRVELDLG-DPVDRPTD----ERLDRVAAGE--------EDPHLAALYAQFGRYLLIASS 340
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGT+ ANLQG+WN++ P W+S +N+NLEMNYW +L NL+EC PL+DF+ L
Sbjct: 341 RPGTEPANLQGVWNQEFDPPWNSGYTLNVNLEMNYWPALQTNLAECAAPLYDFVDDLREP 400
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G + A+ +Y G+ +HH +D+W +++A W LWPMG AWL +++HY +T D
Sbjct: 401 GRRVAEAHYDCDGFAVHHNSDLW-RNAAPVDGARWGLWPMGAAWLSRLVFDHYAFTKDET 459
Query: 486 FLEKRAYPLLEGCASFLLDWLIE--GHDG----YLETNPSTSPEHEFIAPDGKLACVSYS 539
FL + AYP+L A+F+LD+L+E +G +L T PS SPE+ ++ DG+ A V+Y+
Sbjct: 460 FLRETAYPILREAAAFVLDFLVEHPAEEGEAEDWLVTAPSISPENAYVTDDGEEATVTYA 519
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
TMD+ + R++F I AAE+L+ E A +++ +L RL P ++ G + EW +D+++
Sbjct: 520 PTMDVQLTRDLFEHTIDAAEILDV-ESAFHDELRAALDRLPPMQVGAHGQLQEWIEDYEE 578
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWAR 656
+ HRH+SHL+G P IT + PDL A TL +R E G GWS W +AR
Sbjct: 579 ADPGHRHISHLYGAHPSDLITPRETPDLADAVRTTLDRRLEHGGGHTGWSAAWLVNQFAR 638
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D E A+ VK L L D NLF HPPFQID NFG TA + EML+ S
Sbjct: 639 LEDGERAHEWVKTL--LAD---------STAPNLFDLHPPFQIDGNFGATAGITEMLLGS 687
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
++ LLPALP + W+ G V GL+ARG V I W G L I S
Sbjct: 688 HGGEIRLLPALP-EAWTEGSVSGLRARGDFEVDIEWSGGSLDSATIRSG 735
>gi|423290259|ref|ZP_17269108.1| hypothetical protein HMPREF1069_04151 [Bacteroides ovatus
CL02T12C04]
gi|423294445|ref|ZP_17272572.1| hypothetical protein HMPREF1070_01237 [Bacteroides ovatus
CL03T12C18]
gi|392665646|gb|EIY59169.1| hypothetical protein HMPREF1069_04151 [Bacteroides ovatus
CL02T12C04]
gi|392675636|gb|EIY69077.1| hypothetical protein HMPREF1070_01237 [Bacteroides ovatus
CL03T12C18]
Length = 816
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 287/769 (37%), Positives = 428/769 (55%), Gaps = 49/769 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 15 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 74
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 75 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 131
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 132 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 191
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R P + + A L++K G + D L V+G
Sbjct: 192 IYGKKGLRLEGITHGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 241
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 242 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 296
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + S+ N P R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 297 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 344
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 345 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 404
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 405 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 462
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 463 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 518
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFKDPE 601
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +D+ P
Sbjct: 519 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFEDWDHPN 576
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GL+PG+ I+ ++P L +AA+ TL +RG+ GWS+ WK WAR+ D +
Sbjct: 577 DRHRHISHLWGLYPGYQISPYRSPVLFEAAKNTLIQRGDPSTGWSMGWKVCFWARMLDGD 636
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++K V PE +K GG Y NLF AHPPFQID NFG TA +AEMLVQS +
Sbjct: 637 HAYQLIKNQLTYVSPEIQKGQGGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLVQSHDGAI 696
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSNYSNN 769
+LLP+LP +W SG VKGL+ARGG + + WKDG L + + S N
Sbjct: 697 HLLPSLP-SEWKSGTVKGLRARGGFLIDELTWKDGKLVKAVLRSETGGN 744
>gi|329962425|ref|ZP_08300425.1| hypothetical protein HMPREF9446_02012 [Bacteroides fluxus YIT
12057]
gi|328529981|gb|EGF56869.1| hypothetical protein HMPREF9446_02012 [Bacteroides fluxus YIT
12057]
Length = 827
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 287/775 (37%), Positives = 425/775 (54%), Gaps = 58/775 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
++ N LK+ ++ PA + +A+P+GNGRLGAMV+G +E +LNE+T+W G P + T
Sbjct: 20 QAQQQENNLKLWYDKPATQWVEALPLGNGRLGAMVFGDPANEQFQLNEETVWGGSPYNNT 79
Query: 65 NPDAPKALSDVRSLVDSGQYAEATA------ASVKLFGHPADVYQLLGDIELEFDDSHLK 118
NP A AL +R L+ G+ AEA A S G P YQ +G + L+F+ +
Sbjct: 80 NPKAKDALPRIRQLIFEGRNAEAQALCGPGICSQSANGMP---YQTVGSLHLDFEGTS-- 134
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y RELDL A +++ G + +TRE ++S P+Q++V +++ S+ S+SF
Sbjct: 135 -GYTNYYRELDLEKAVTTTRFTAGGITYTREAYTSFPEQLLVIRLTASQKKSISFTAR-- 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ----FSAILEIKISDDRGTI 234
Y + + P K + AND +GI+ F+A+ +I + G++
Sbjct: 192 -------YTTPYKKNVERSISPDKELQLDGKANDH-EGIEGKVRFTAL--TRIENSGGSL 241
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL-QSIRNLSYSDL 293
L D L+V+ ++ +V L V S F+N D D + + + Q+ +N + L
Sbjct: 242 EVLSDSTLQVKNAN-SVTLYV---SIGTNFVNYKDVSGDALATARKYMKQAGKNYTKGKL 297
Query: 294 YTRHLDDYQKLFHRVSIQL-SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
H++ Y+K F RVS+ L S + D TD RVK F DP + L
Sbjct: 298 --AHINAYRKYFDRVSLNLGSNAQADKPTDV-------------RVKEFSGSFDPQMAAL 342
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
FQFGRYLLI SS+PG Q ANLQGIWN L WD +IN+EMNYW + +L E
Sbjct: 343 YFQFGRYLLICSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMH 402
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
EP + +++ G ++A + Y GW +HH TDIW + A G + +WP AW C
Sbjct: 403 EPFLQLVKEVALTGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGPG-YGIWPTCNAWFCQ 460
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLW+ Y ++ D+ +L + YPL+ G F LD+L+ E + +L PS SPE+ +
Sbjct: 461 HLWDRYLFSGDKAYLAE-IYPLMRGACEFYLDFLVREPKNNWLVVAPSYSPENRPVVNGK 519
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ V +TMD ++ ++F I AA+++ +N A + + L P ++ G +
Sbjct: 520 RDFVVVAGTTMDNQMVYDLFYNTIQAAKLMNEN-IAFTDSLQAVSDHLAPMQVGRWGQLQ 578
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+ +P+ HHRH+SHL+GL+PG I+ +P L +AA+K+L RG+ GWS+ WK
Sbjct: 579 EWMEDWDNPKDHHRHVSHLWGLYPGRQISAYNSPVLFEAAKKSLIARGDHSTGWSMGWKV 638
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
LWARL D HAY+++ L EK GG Y NLF AHPPFQID NFG A +AE
Sbjct: 639 CLWARLLDGNHAYKLITE--QLHPTTDEKGQNGGTYPNLFDAHPPFQIDGNFGCAAGIAE 696
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSN 765
MLVQS ++LLPALP D W G +KG++ RGG T+ + W++G L V I SN
Sbjct: 697 MLVQSHDGAIHLLPALP-DVWQQGTLKGIRCRGGFTIDELNWENGQLQTVSITSN 750
>gi|423343039|ref|ZP_17320753.1| hypothetical protein HMPREF1077_02183 [Parabacteroides johnsonii
CL02T12C29]
gi|409216715|gb|EKN09698.1| hypothetical protein HMPREF1077_02183 [Parabacteroides johnsonii
CL02T12C29]
Length = 809
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 285/777 (36%), Positives = 423/777 (54%), Gaps = 52/777 (6%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T L F+ PA+ + + +P+GNGR G M GGV +E + LNE ++W+G D
Sbjct: 17 NMPEVKTGKSLSYHFDAPAEIWEETLPLGNGRFGLMPDGGVDTEKIVLNEISMWSGSKQD 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L +R L+ G+ EA F P YQLLG++
Sbjct: 77 TDNPQAYYSLGTIRKLLFEGRNDEAQELMYNTFVCKGEGSALGQGANAPYGSYQLLGNLV 136
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L +D + YRREL+L+ A A + G V++ RE F+S D + V ++
Sbjct: 137 LNYDYQGSSDSISGYRRELNLDNAIATASFRRGKVKYDREVFTSFADDLGVIHLTADADK 196
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F+ ++ +++ N ++M+G+ P + KG+++++ + + +
Sbjct: 197 ALNFSFGMNRP-EHYKVTADGNDLLMQGQLP------DGVDTLEMKGLRYASRVRVVLPK 249
Query: 230 DRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
I D + + + A+LL+ +A+ FD KD + S L +
Sbjct: 250 GGNVIPG--DSTVTIRNASEAILLVSMATDYFD----------KDLDEKVASLLANAEKK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDP 347
++ L H+ Y+ LF RV + L S ++ +P ER+ +F D +DP
Sbjct: 298 DFASLKKGHIVAYRSLFGRVDLDLGHSSRE------------DLPIDERLAAFNADPDDP 345
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
SL L FQFGRYLLISS+R G NLQG+W ++ W+ H+NINL+MN+W + N
Sbjct: 346 SLGALYFQFGRYLLISSTRVGLLPPNLQGLWCNTVNTPWNGDYHLNINLQMNHWPAEVAN 405
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 LSELHLPLVEWTKQQVASGEQTAKAYYNAGGWVTHILGNVW-EFTAPGEHPSWGATNTSA 464
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWLC HL+ HY YT+D+++L K YP+L+G + F +D L+E + YL T P+TSPE+ +
Sbjct: 465 AWLCEHLYMHYLYTLDKEYL-KDVYPVLKGASRFFVDMLVEDPRNKYLVTAPTTSPENGY 523
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
P+GK A + STMD I+RE+F+ I AA +L + A +++ RL PT I +
Sbjct: 524 KLPNGKTAHICAGSTMDNQIVRELFTNTIEAANILGI-DSAFAGELVAKRARLMPTTIGK 582
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
DG IMEW + F++ E HHRH+SHL+GL+PG+ I+I+ P+L +AA K+L RG++ GWS
Sbjct: 583 DGRIMEWLEPFEEVEPHHRHVSHLYGLYPGNEISIKHTPELAEAARKSLVARGDKSTGWS 642
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ WK WARLHD +HAY+++ L VD + GG Y NLF AHPPFQID NFG
Sbjct: 643 MAWKINFWARLHDGDHAYKLLVDLLRPCVDRKTNMTNGGGTYPNLFCAHPPFQIDGNFGG 702
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
A +AEMLVQS ++ LLPALP W +G KGL RGG VS WK+G L E G+
Sbjct: 703 CAGIAEMLVQSQTGEIELLPALP-SAWKNGSFKGLIVRGGGEVSAKWKEGRLTEAGL 758
>gi|336415223|ref|ZP_08595564.1| hypothetical protein HMPREF1017_02672 [Bacteroides ovatus
3_8_47FAA]
gi|335941256|gb|EGN03114.1| hypothetical protein HMPREF1017_02672 [Bacteroides ovatus
3_8_47FAA]
Length = 816
Score = 486 bits (1250), Expect = e-134, Method: Compositional matrix adjust.
Identities = 287/769 (37%), Positives = 428/769 (55%), Gaps = 49/769 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 15 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 74
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 75 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 131
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 132 LDISNAVAVARYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 191
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R P + + A L++K G + D L V+G
Sbjct: 192 IYGKKGLRLEGITHGSRYFPGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 241
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 242 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 296
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + S+ N P R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 297 RVTLDLGET---------SQAN---KPMDVRIKEFSSSYDPALIALYFQYGRYLLISSSQ 344
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 345 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 404
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 405 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 462
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 463 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 518
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFKDPE 601
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +D+ P
Sbjct: 519 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFEDWDHPN 576
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GL+PG+ I+ ++P L +AA+ TL +RG+ GWS+ WK WAR+ D +
Sbjct: 577 DRHRHISHLWGLYPGYQISPYRSPVLFEAAKNTLIQRGDPSTGWSMGWKVCFWARMLDGD 636
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++K V PE +K GG Y NLF AHPPFQID NFG TA +AEMLVQS +
Sbjct: 637 HAYQLIKNQLTYVSPEIQKGQGGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLVQSHDGAI 696
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSNYSNN 769
+LLP+LP +W SG VKGL+ARGG + + WKDG L + + S N
Sbjct: 697 HLLPSLP-SEWKSGTVKGLRARGGFLIDELIWKDGKLVKAVLRSETGGN 744
>gi|410096950|ref|ZP_11291934.1| hypothetical protein HMPREF1076_01112 [Parabacteroides goldsteinii
CL02T12C30]
gi|409224744|gb|EKN17668.1| hypothetical protein HMPREF1076_01112 [Parabacteroides goldsteinii
CL02T12C30]
Length = 804
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 302/820 (36%), Positives = 444/820 (54%), Gaps = 79/820 (9%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A++T+T NP K ++ A+ + A+P+GNG LGAMV+G V E ++LNE+T+W+G D
Sbjct: 39 ADATATDNPNK-GYDDDAE-WLKALPLGNGSLGAMVFGDVHKERIQLNEETMWSGSIQDS 96
Query: 64 TNPDAPKALSDVRSLVDSGQYAEAT-------AASVKLFGH------PADVYQLLGDIEL 110
NP+A K + +++ L+ G+Y EAT + K GH P YQ +GD+ +
Sbjct: 97 DNPEAAKHIEEIKQLLFDGKYKEATDLTNRTQICTGKGSGHGQGSNAPFGCYQTMGDLWI 156
Query: 111 EFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS 170
+FD+ K YRREL+L+ ATAR+ Y G+V F RE F S+PDQ +V +IS +
Sbjct: 157 DFDN---KSPYTDYRRELNLDDATARISYKQGDVNFKREIFISHPDQSMVMRISADKKQQ 213
Query: 171 LSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
LSF ++ + +S N Q+IM G +D G + +K
Sbjct: 214 LSFTCRMNRP-ERYSTYTENEQLIMAGAL-----------SDGKGGDGLQYMTRLKAVPM 261
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
G+++ D L V+ +D +L L AS+ + + P +D +S + ++L N SY
Sbjct: 262 NGSVT-YSDSTLTVKDADEVLLFLTASTDYKLEY--PIYKGRDFSSITEASLNKAINKSY 318
Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSL 349
+ LY H+ +Y F R ++QL+ +P DT+P+ +V + + DP L
Sbjct: 319 NQLYETHVKEYTDYFQRANLQLTNTP-------------DTIPTDIKVMNARKGMIDPHL 365
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
E +FQ+GRYLLISSSRPGT ANLQGIW L W+ H ++N+EMNYW + NLS
Sbjct: 366 YEQMFQYGRYLLISSSRPGTMPANLQGIWANKLQTAWNGDYHTDVNIEMNYWPAEVTNLS 425
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
E P+FD + L GSKTAQ+ Y GWV+H T++W +S W + AW
Sbjct: 426 EMHLPMFDLIASLVEPGSKTAQIQYNKKGWVVHPITNVWGYTSPGEA-ASWGMHTGAPAW 484
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIA 528
+C H+ EHY +T D+DFL ++ YP+L+G F +DWL E L + P+ SPE+ F+A
Sbjct: 485 ICQHIGEHYRFTGDKDFL-RKTYPVLKGAIEFYMDWLTENPKTKELVSGPAVSPENTFVA 543
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
PDG + +S D I ++F + L ++D +V + RL TKI DG
Sbjct: 544 PDGSHSQISMGPAHDQQTIWQLFDDFAMISSELSIDDD-FTRQVADAKDRLADTKIGSDG 602
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GW 645
IMEWA +F + E HRH+SHLF + PG I + + PDL +AA K+L R + GW
Sbjct: 603 RIMEWADEFPEVEPGHRHISHLFAIHPGSQINMLQTPDLIEAANKSLDYRIQHRRGYVGW 662
Query: 646 SITWKTALWARLHDQEHAYRMVKRLF-NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFG 704
S W + +ARLH E A + + ++P NLF PPFQIDANFG
Sbjct: 663 SSAWAISQYARLHQAEKAKENLDDVMKKCINP------------NLFTICPPFQIDANFG 710
Query: 705 FTAAVAEMLVQSTLND-----LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
TA +AEML+QS + D + LLP+LP D W G GLKARGG V++ W++G + +
Sbjct: 711 TTAGIAEMLLQSHVYDQGGYIIQLLPSLPAD-WKKGEFSGLKARGGFEVAVKWENGQIVD 769
Query: 760 VGIYSNYSNNDHDSFKTLHYRGTSVKVN-LSAGKIYTFNR 798
+ S N F+ + Y G ++ N L G+I+ +N+
Sbjct: 770 ASVKSLQGN----KFR-IWYNGNYLQANGLKKGEIWKWNK 804
>gi|256426140|ref|YP_003126793.1| alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
gi|256041048|gb|ACU64592.1| Alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
Length = 811
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 288/768 (37%), Positives = 422/768 (54%), Gaps = 57/768 (7%)
Query: 13 LKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
LK+ + PA + +T A+P+GNGR+ MV+G E L+LNE T+WTG P NP+A A
Sbjct: 22 LKLWYKQPAGNVWTAALPVGNGRIAGMVFGNPAEELLQLNEATVWTGSPNRNENPEALAA 81
Query: 72 LSDVRSLVDSGQYAEAT-----AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
L +R L+ G+ EA KL G +YQ +G + L F H Y + Y R
Sbjct: 82 LPQIRQLIFDGKQKEAQDLAGEKIQTKLSG--GQMYQPVGTLHLAFP-GHEHY--DNYYR 136
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
ELD+ A A Y V V++TRE F+S P Q I+ ++S S+ G+L F+ L + N
Sbjct: 137 ELDIEKAVATTTYMVDGVKYTREVFASVPAQTIIVRLSSSKPGTLGFSAYLTTPQKNAVV 196
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVE 245
+ + G +++ +G ++F+ I + S G A D + ++
Sbjct: 197 KASGKDLTVNGIT---------GSHEGVEGKVKFNGITRVIAS---GGSVATSDTAVTIK 244
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
++ A+L + ++++ +N D D ++ + L + Y+ L H+ YQ+ F
Sbjct: 245 NANSALLFISMATNY----VNYQDLSADEVKKASAYLNAAVKQPYATLLKEHIAAYQRYF 300
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RV I L S D+ D P+ R+ +F DP + L FQFGRYLLIS S
Sbjct: 301 NRVKIDLGTS--DVAKD----------PTDVRLVNFSKTYDPQFISLYFQFGRYLLISCS 348
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PG Q A LQG+WN ++SP WDS +NIN EMNYW + NL E EPL + LS+
Sbjct: 349 QPGGQPATLQGLWNSEMSPPWDSKYTININTEMNYWPAEKDNLPEMHEPLVQMVKELSVT 408
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TA++ Y A GWV HH TD+W + + ++ + +W MGGAWL HLW+ Y Y DR
Sbjct: 409 GQGTARILYGARGWVAHHNTDLW-RITGPVDRIFYGIWSMGGAWLAQHLWDRYLYNGDRR 467
Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS--TM 542
+L YP ++G A F +D L+E YL NP TSPE+ AP + VS+ + TM
Sbjct: 468 YLAD-VYPAIKGAALFFVDDLVEDPKRKYLVVNPGTSPEN---APSTR-PNVSFDAGCTM 522
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D I+ + SA I+AAE+L K+ ALV+ RL P ++ + G + EW D +P+
Sbjct: 523 DNQIVFDALSAAINAAEILGKDA-ALVDTFKTVRRRLPPMQVGQYGQLQEWIDDLDNPKD 581
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
+HRH+SHL+GL+P I+ ++ P L AA TL +RG+ GWS+ WK WARL + EH
Sbjct: 582 NHRHISHLYGLYPSAQISPDRTPLLASAANTTLLQRGDVSTGWSMGWKVNWWARLQNGEH 641
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
A +++ + V GG Y+NLF AH PFQID NFG T+ + EML+QS +Y
Sbjct: 642 ALKLITNQLSPVG-----QHGGGTYTNLFDAHAPFQIDGNFGCTSGITEMLMQSHDGVIY 696
Query: 723 LLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSNYSNN 769
+LPALP +W +G +KGL+ARGG + + W+DG + ++ I S N
Sbjct: 697 VLPALP-PQWKNGNIKGLRARGGFVIDDLVWQDGKITKLVITSTLGGN 743
>gi|294777781|ref|ZP_06743227.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|294448369|gb|EFG16923.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 814
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 284/764 (37%), Positives = 429/764 (56%), Gaps = 50/764 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NPDA + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPDALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRRLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVA 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G + D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTART---QGGTRSCRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYMTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L VT + RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSLNLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+I ++++ II+ A +L + + ++ + L + P +I G + EW D+ +P+ HR
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATRLEQRLKEMAPMQIGRWGQLQEWMMDWDNPQDVHR 586
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +HAY+
Sbjct: 587 HVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGDHAYK 646
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+QS +YLLP
Sbjct: 647 LITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLMQSHDGFIYLLP 703
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ALP +W G V G+ ARGG + + WK+G + + + S N
Sbjct: 704 ALP-AQWKEGSVNGIIARGGFELDLSWKNGKVSRLVVKSRNGGN 746
>gi|349572636|gb|AEP84398.1| glycoside hydrolase family protein [bacterium enrichment culture
clone g13]
Length = 824
Score = 485 bits (1249), Expect = e-134, Method: Compositional matrix adjust.
Identities = 294/775 (37%), Positives = 428/775 (55%), Gaps = 55/775 (7%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
ST K+ + PAK + +++P+GNGRLGAMV+G V S+ ++LNE+T W G P + NP
Sbjct: 21 STAVEQKLWYEQPAKQWEESLPLGNGRLGAMVYGDVLSDNIQLNENTFWAGGPHNNLNPA 80
Query: 68 APKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETY 124
A AL ++R L+ G Y A + K G YQ G++ LEF + H Y Y
Sbjct: 81 ALNALPEIRRLITVGDYLAAEKLAAKTIASQGSNGMPYQTAGNLRLEFSE-HKNYNH--Y 137
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R+LD+ +A A +Y V +V +TRE FSS DQVIV K++ S+ G LSF+ +
Sbjct: 138 YRDLDIGSAVATTRYRVNDVVYTREVFSSFVDQVIVVKLTASKRGQLSFDAYMSHPSAMV 197
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKL 242
N ++M+G+ D +GI+ L + IS G+I+ D ++
Sbjct: 198 FSREDANTLLMQGQSM------------DHEGIKGQVRLASLVNISTIGGSINQ-RDNRI 244
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY----TRHL 298
V+ +D A++L+ +++F +N D + + + + +N +D Y H
Sbjct: 245 TVKNADSALILVSMATNF----VNYKDVSANALARARHYMAQAKNNFANDHYELRKQAHS 300
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+ Y+ F RV + L +S S+E+ D +R+ F DP L L FQFGR
Sbjct: 301 NFYKNYFDRVILNLGKS-------EFSKESTD-----QRIALFSGRHDPELASLYFQFGR 348
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSS+PG Q ANLQG+WN P WDS +NIN EMNYW + NLSE EPL
Sbjct: 349 YLLISSSQPGGQPANLQGLWNHRQDPPWDSKYTLNINAEMNYWPAEITNLSELHEPLITM 408
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
LSI G ++A+ Y A GW+ HH TDIW + W WP AWL HLWE Y
Sbjct: 409 TKELSITGQESAKTMYGARGWMAHHNTDIWRITGGV--DYTWGSWPTSSAWLSQHLWERY 466
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+ D+ +L + YP+++ F D+LI + +L +PS SPE+ A K+A
Sbjct: 467 LYSGDKQYLAE-IYPVMKSAVVFFDDFLISSPNKKWLIVSPSMSPENVPKATGTKIAA-- 523
Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
TMD ++ ++FS I+AA++L +K L EK L LP P +I + + EW +
Sbjct: 524 -GVTMDNQLLFDLFSNTIAAAKILGEDKQHIPLWEKTLSRLP---PMQIGKYHQLQEWLE 579
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D+ DPE HRH+SHL+GL+P + I+ +P+L AA T+++RG+ GWS+ WK +WA
Sbjct: 580 DWDDPEDKHRHISHLYGLYPSNQISPLHSPELFSAARVTMEQRGDPSTGWSMNWKINIWA 639
Query: 656 RLHDQEHAYRMVK-RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
RL D + A+++++ ++ + + + GG Y N+F AHPPFQID NFGFT+ +AEML
Sbjct: 640 RLLDGDRAFKLMRDQIKPAMTLDGTVNESGGTYPNMFDAHPPFQIDGNFGFTSGMAEMLA 699
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
QS ++LLPALP W +G VKGL RGG V + W DG + E+ I+S N
Sbjct: 700 QSHDGAVHLLPALP-HAWPAGEVKGLVMRGGFVVDMRWADGQISELKIHSRLGGN 753
>gi|399078665|ref|ZP_10752953.1| hypothetical protein PMI01_04050 [Caulobacter sp. AP07]
gi|398033293|gb|EJL26598.1| hypothetical protein PMI01_04050 [Caulobacter sp. AP07]
Length = 786
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 285/753 (37%), Positives = 415/753 (55%), Gaps = 57/753 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PAK + +A+PIGNGRLGAM++G V +E L+LNE+TLW+G P D NP A + L VR
Sbjct: 39 YDQPAKEWVEALPIGNGRLGAMIFGDVWAERLQLNENTLWSGGPYDPVNPRAREGLEPVR 98
Query: 77 SLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+L+ +G++AEA A+ L P YQ GD+ L + + + A YRR LD++ A
Sbjct: 99 ALIAAGRFAEAEQRANETLVATPPREMAYQPFGDLGLRW--AGARGAVSGYRRSLDIDNA 156
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A + + V + R +S DQVI +++ S G+L F+++L + +I
Sbjct: 157 VAETTFEIDGVRYRRRAVASPVDQVIALELTASRPGALDFDLTL-------APAQTVREI 209
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
++E R +I + N + + ++ G++ D ++ V G+ A +
Sbjct: 210 VVE-RPDTLKISGRNNDGEGGVSGALTYCGRARVVTQGGSVKG-ADGQIAVRGASRATIY 267
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L ++S+ D DP + + + S+ L ++ LF RVS+ L
Sbjct: 268 LAMATSYR----RYDDVGGDPDAITRGQIDKAAAKSFDQLARAATAAHRALFDRVSLDLG 323
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
+++I P+ R+ +T +DP LVEL FQ+ RYLLI+ SRPG Q AN
Sbjct: 324 -----------GKDDIG-APTDIRIARNETTDDPGLVELYFQYARYLLIACSRPGGQPAN 371
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQG+WN+ + P W S +NIN +MNYW + L+EC EPLFDF+ L+ G+ TA+
Sbjct: 372 LQGLWNDQVKPPWGSNYTININTQMNYWPAEAGGLAECAEPLFDFIAELAERGAVTAREM 431
Query: 434 YLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
Y A GWV HH +D+W ++ D K LWP GGAWLC HLW+HY+Y D+ FL RAY
Sbjct: 432 YGARGWVAHHNSDLWRGTAPFDHAKA--GLWPTGGAWLCVHLWDHYDYGRDKRFL-ARAY 488
Query: 493 PLLEGCASFLLDWL-IEGHDGYLETNPSTSPE--HEFIAPDGKLACVSYSSTMDMAIIRE 549
PL++G + F LD L + G+L T+PS SPE H F G C TMDM I+R+
Sbjct: 489 PLMKGASQFFLDTLQTDAATGWLVTSPSVSPENRHGF----GSTLCA--GPTMDMQILRD 542
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV--HHRHL 607
+F A +L + D E + ++ RL PT+I G +MEW D+ V HRH+
Sbjct: 543 LFDHTREAGRILGLDPD-FGEDLARARDRLAPTRIGAGGQLMEWKDDWDAVAVDPKHRHV 601
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHL+GL+P + +PDL AA +TL+ RG++ GW+I W+ LWARL D +HA+ ++
Sbjct: 602 SHLYGLYPSWQLDPATHPDLAAAARRTLETRGDKTTGWAIAWRINLWARLKDGDHAHEVL 661
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
+ L E+ Y NLF AHPPFQID NFG AA+ EMLVQS + LLPAL
Sbjct: 662 RLLL-----ARER-----TYPNLFDAHPPFQIDGNFGGAAAILEMLVQSKGEIIDLLPAL 711
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
P W G ++G++ R V + W+DG L V
Sbjct: 712 P-AAWPQGSIRGVRVRNAGEVDLFWRDGKLERV 743
>gi|254445766|ref|ZP_05059242.1| hypothetical protein VDG1235_4013 [Verrucomicrobiae bacterium
DG1235]
gi|198260074|gb|EDY84382.1| hypothetical protein VDG1235_4013 [Verrucomicrobiae bacterium
DG1235]
Length = 784
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 299/769 (38%), Positives = 416/769 (54%), Gaps = 58/769 (7%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S+ + LK G + ++ +PIGNG LGA+V G E + LN DTLW G P D + P+
Sbjct: 24 SSASILKYDEPGQFEPLSEGLPIGNGSLGALVMGRTAEERIVLNHDTLWAGGPYDPSYPE 83
Query: 68 APKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETY 124
A + L ++RSL+ ++ EA A P YQ + D+ L H + + Y
Sbjct: 84 AAEVLPEIRSLIFQDKHREAQALVQSSFMSKPMRQMSYQAMADLLL-LVPGHERV--DDY 140
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R LDL+ A A V Y V V +TREH +S D V+ +I + GS+ + LDSL
Sbjct: 141 ERSLDLDKAIATVSYEVDGVRYTREHIASAVDGVVAIRIRADKPGSVDLTLQLDSL---- 196
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ Q E G RI + A++ G +E+ + D G S D LKV
Sbjct: 197 -----HEQTRSEYWPEGMRISGRNGASEGIAG-ALDWSVEVAVQLD-GGWSMPGDGYLKV 249
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+D LL+ A +S+ +N +D +P ++ + + +S+L RHL+D+Q L
Sbjct: 250 READSVTLLVAADTSY----VNWNDVSGNPRQKNAKTIVAASEFDFSELNERHLEDFQSL 305
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ RV ++L+ S ++ E N D R+ SF D+DP + EL F F RYL+IS
Sbjct: 306 YGRVDLELNTSRPEL-----GERNTDA-----RIASFSKDQDPKMAELYFNFARYLIISC 355
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+Q ANLQG+WN+ L W S +NIN EMNYW + L EC EPL L LSI
Sbjct: 356 SRPGSQSANLQGLWNDKLFAPWGSKYTININTEMNYWPTQVVQLGECMEPLAAMLQDLSI 415
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +TA+ Y ASGWV HH TD+W + G W +WPMGGAWL LWE Y +T D
Sbjct: 416 SGQRTAKNFYGASGWVTHHNTDLWRATGPIDG-AFWGMWPMGGAWLSLFLWERYEFTGDV 474
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
D LE Y +L+G A F LD L+E GYL T PS SPE+ A A TMD
Sbjct: 475 DQLETD-YAILKGSAQFFLDTLVEDPRTGYLVTAPSNSPENAHHAGVSNAA----GPTMD 529
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA--QDFKDPE 601
AI+R++F+A A+ +L + A E VL++ +L P K+ + G + EW D + PE
Sbjct: 530 NAILRDLFAATAEASRIL-GVDSAFRESVLQTSNQLPPFKVGKAGQLQEWQFDWDLEAPE 588
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
+ HRH+SHL+ L P + I+ P L +AA K+L+ RG+EG GWS+ WK WARL + E
Sbjct: 589 MGHRHVSHLYALHPSNQISPITTPALSQAARKSLELRGDEGTGWSLAWKVNFWARLLEGE 648
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND- 720
A+ ++++L + G Y+NLF AHPPFQID NFG V EML+QS L D
Sbjct: 649 RAHDLLEQLIS----------PGFCYTNLFDAHPPFQIDGNFGGANGVIEMLLQSHLKDE 698
Query: 721 -----LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ LLPALP W +G ++G + RGG TV + W G+L + S
Sbjct: 699 EGDPIVQLLPALP-SNWQAGSLRGFRTRGGFTVDMEWAGGNLKSARVVS 746
>gi|395213355|ref|ZP_10400162.1| alpha-L-fucosidase [Pontibacter sp. BAB1700]
gi|394456724|gb|EJF10981.1| alpha-L-fucosidase [Pontibacter sp. BAB1700]
Length = 827
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 294/765 (38%), Positives = 428/765 (55%), Gaps = 49/765 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA ++ +A+P+GNGRLGAMV+ E L+LNE+T+W G PG+ P AL
Sbjct: 32 KLWYKQPAANWNEALPLGNGRLGAMVFSQPAREQLQLNEETVWAGEPGNNVLPALNSALP 91
Query: 74 DVRSLVDSGQYAEAT-AASVKLFGHPAD------VYQLLGDIELEFDDSHLKYAEETYRR 126
++R L+ +G++ EA A KL PA YQ +G++ + F H + + Y R
Sbjct: 92 EIRQLIAAGKHKEAQDLAMEKLPRQPAADNNYGMPYQPVGNLFISFP-GHEQATD--YYR 148
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
+LD+ A + V Y V V F RE FSS D V++ ++S + S++F +S DS N++
Sbjct: 149 DLDIQQAISTVYYKVNGVTFKREMFSSFTDDVVIVRLSADKPKSINFTLSADSPHKNYTV 208
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVE 245
NQ+I+ G + D+ KG ++F ++E + + G I++ + ++V
Sbjct: 209 RTRGNQLILSG---------VSGDVDNKKGKVKFQTLVEPET--EGGKITSTPEG-VQVS 256
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G++ A L + ++F + D D +++ L S Y H Y+ +
Sbjct: 257 GANAATLYISIGTNFK----SYRDLSGDGEAKAAKLLSSAVKKKYKKAKAEHTAFYRNYY 312
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R S+ L + D+ P+ ER+ +F DP L L FQFGRYLLISSS
Sbjct: 313 DRASLNLGTT-ADLQK-----------PTDERLAAFARSNDPHLAALYFQFGRYLLISSS 360
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PGTQ ANLQGIWN+ ++P WDS VNIN EMNYW + NLSE PLF L LS +
Sbjct: 361 QPGTQPANLQGIWNDKIAPPWDSKYTVNINTEMNYWPAEVTNLSEMHGPLFSMLKDLSES 420
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G ++A Y A GW++HH TDIW + G + +WPMGGAWL HLW+HY YT D+
Sbjct: 421 GRESASKMYGARGWMMHHNTDIWRITGPIDG-AFYGMWPMGGAWLTQHLWQHYLYTGDQK 479
Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL K YP+L+G A F D L E + +L +PS SPE++ + +S +TMD
Sbjct: 480 FL-KVVYPVLKGSAMFYADVLQEEPTNKWLVVSPSMSPENKHQSG----VSISAGTTMDN 534
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
+I ++FS +I AEVL ++ A + + RL P +I + + EW +D + H
Sbjct: 535 QLIFDLFSNVIRTAEVLNTDQ-AFADSLRTMRDRLPPMQIGQHNQLQEWLRDLDRKDDKH 593
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GLFP + ++ ++P L +AA+ +L RG++ GWS+ WK LWARL D AY
Sbjct: 594 RHVSHLYGLFPSNQVSPYRHPLLFEAAKNSLVYRGDKSTGWSMGWKVNLWARLLDGNRAY 653
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
++++ E K GG Y NLF AHPPFQID NFG TA +AEML+QS L++L
Sbjct: 654 KLIQDQLTPAGTEG-KGESGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLLQSHDGALHML 712
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP D W G VKGL ARGG + + W+ G + + I+S N
Sbjct: 713 PALP-DVWQIGEVKGLVARGGFVIDMAWEGGKIKTLKIHSKLGGN 756
>gi|338209373|ref|YP_004646344.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
gi|336308836|gb|AEI51937.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
Length = 849
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 301/774 (38%), Positives = 443/774 (57%), Gaps = 52/774 (6%)
Query: 6 STSTTNPLKITFNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
S+ LK+ + P+ + + +A+PIGNG+LGAMV+G V ET++LNE T+W+G P
Sbjct: 47 SSQEVKSLKLWYTKPSGNTWENALPIGNGQLGAMVYGNVEKETIQLNEHTVWSGSPNRND 106
Query: 65 NPDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAE 121
NP+A AL ++R L+ G+ +A + K+ ++Q +G++ L FD H Y +
Sbjct: 107 NPEALAALPEIRQLIFDGKQKDAERLANKVIITKKSHGQMFQPVGNLHLTFD-GHGNYTD 165
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y RELDL A A+ Y+V V++TRE +S PD+VIV ++ + SLSF S +
Sbjct: 166 --YYRELDLERAVAKTAYTVNGVKYTREILASFPDRVIVMHLTADKPNSLSFVASYATQH 223
Query: 182 DNHSYVN--GNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALE 238
+ +N +N++ + G + ++ KG + F + IK + GT++A
Sbjct: 224 KKRA-INPTASNELSLSGTT---------SDHEGVKGMVNFKGVTRIKT--EGGTVAA-N 270
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D + V+G+ A L + +++F+ + D D + + + L SY+ + T H+
Sbjct: 271 DSSIAVKGATTATLYVSIATNFN----SYKDISGDENARATAYLNKAYPKSYAAILTPHM 326
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
YQK F+RV D+ T ++ +P+ ER+K+F+T DP +V L +QFGR
Sbjct: 327 AAYQKYFNRVQF-------DLGTTEAAK-----LPTDERLKNFRTVNDPHMVTLYYQFGR 374
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSS+PG+Q ANLQGIWN ++P WDS +NIN +MNYW + NLSE P
Sbjct: 375 YLLISSSQPGSQPANLQGIWNHRMNPPWDSKYTININAQMNYWPAEKTNLSELHAPFLKM 434
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ LS G +TA+V Y A GW+ HH TDIW + A G +W GG W HLWEHY
Sbjct: 435 VKELSETGQETARVMYGAKGWMAHHNTDIWRATGAIDGAFW-GMWTGGGGWTAQHLWEHY 493
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACV 536
Y+ D+ FL + YP+L+G A+F D+L+E H Y L NP +SPE+ A G + +
Sbjct: 494 LYSGDKAFLTE-IYPILKGAAAFYADFLVE-HPKYHWLVINPGSSPENAPKAHAG--SSL 549
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+TMD I+ + FS I AAE+L+K + A V+ + + +L P + + G + EW D
Sbjct: 550 DAGTTMDNQIVFDAFSTAIRAAELLKK-DAAFVDTLRQLRNKLAPMHVGQHGQLQEWLDD 608
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
DP+ HHRH+SHL+GLFP I+ + P+L A+ TL RG+ GWS+ WK WAR
Sbjct: 609 VDDPDDHHRHVSHLYGLFPAVQISAYRTPELFNASRTTLMHRGDVSTGWSMGWKVNWWAR 668
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY +++ N + P GG Y+NLF AHPPFQID NFG T+ + EML+QS
Sbjct: 669 LQDGNHAYSLIQ---NQLTPLGVTKEGGGTYNNLFDAHPPFQIDGNFGCTSGITEMLMQS 725
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
++LLPALP D W SG + GL+A GG E ++ WK+G L +V + S N
Sbjct: 726 ADGAVHLLPALP-DVWPSGRIGGLRAIGGFEVANMEWKNGKLTKVTVKSTLGGN 778
>gi|427387089|ref|ZP_18883145.1| hypothetical protein HMPREF9447_04178 [Bacteroides oleiciplenus YIT
12058]
gi|425725694|gb|EKU88563.1| hypothetical protein HMPREF9447_04178 [Bacteroides oleiciplenus YIT
12058]
Length = 826
Score = 484 bits (1247), Expect = e-134, Method: Compositional matrix adjust.
Identities = 281/774 (36%), Positives = 421/774 (54%), Gaps = 52/774 (6%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N ++ LK+ ++ PA + +A+P+GNGR+GAMV+G E +LNE+T+W G P +
Sbjct: 17 NVQAQQADETLKLWYDTPATQWVEALPLGNGRIGAMVFGDPVHEQFQLNEETVWGGSPHN 76
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATA------ASVKLFGHPADVYQLLGDIELEFDDSH 116
TNP A +AL +R L+ G+ AEA A S G P YQ +G + L+FD
Sbjct: 77 NTNPKAKEALPRIRQLIFEGKNAEAQALCGPAICSQSANGMP---YQTVGTLHLDFDGIS 133
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
Y + Y R+LD+ A + +++ V +TRE ++S PDQV+V +++ S+ S+SF
Sbjct: 134 -NYTD--YYRDLDIEKAISTTRFTANGVTYTREAYTSFPDQVLVIRLTASQKKSISFTAK 190
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK---GIQFSAILEIKISDDRGT 233
Y + I+ P K + AND ++F+ + +I + G
Sbjct: 191 ---------YTTPYKENIVRCISPRKELQLNGKANDHEGIEGKVEFTTL--TRIENSGGN 239
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ L D L+V+ ++ +V L V S F+N D + + + L ++ N +Y+
Sbjct: 240 LEVLSDSTLQVKNAN-SVTLYV---SIGTNFVNYKDVSGNAQTTAQKYLANV-NKNYTKS 294
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H YQK F+RVS+ L R+ + P+ RVK F + DP + L
Sbjct: 295 KATHTSTYQKFFNRVSLDLGRNAQA------------DKPTDVRVKEFSSSFDPQMAALY 342
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+P Q ANLQGIWN L WD +IN+EMNYW + +L E E
Sbjct: 343 FQFGRYLLICSSQPDGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTSLPEMHE 402
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
P + ++I G K+A + Y GW +HH TDIW + A G + +WP AW C H
Sbjct: 403 PFLQLVKEVAIQGRKSAAM-YGCRGWTLHHNTDIWRSTGAVDGP-GYGIWPTCNAWFCQH 460
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LW+ Y ++ D+++L + YPL+ G F LD+L+ E + +L PS SPE+ + +
Sbjct: 461 LWDRYLFSGDKNYLAE-VYPLMRGACEFYLDFLVREPENNWLVVAPSYSPENRPVVNGKR 519
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
V +TMD ++ ++F I+AA+++ +N + + + L P ++ G + E
Sbjct: 520 DFVVVAGATMDNQMVYDLFYNTIAAAQLMNENT-TFTDSLQTVVNHLAPMQVGRWGQLQE 578
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ +P+ HRH+SHL+GL+PG I+ +P L +AA+K+L RG+ GWS+ WK
Sbjct: 579 WMHDWDNPKDRHRHVSHLWGLYPGRQISAYNSPILFEAAKKSLIGRGDHSTGWSMGWKVC 638
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
LWARL D HAY+++ L EK GG Y NLF AHPPFQID NFG A +AEM
Sbjct: 639 LWARLLDGNHAYQLITE--QLHPTTDEKGQNGGTYPNLFDAHPPFQIDGNFGCAAGIAEM 696
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS-ICWKDGDLHEVGIYSN 765
L+QS ++LLPALP + W G +KG++ RGG TV + W +G+L I SN
Sbjct: 697 LIQSHDGAVHLLPALP-EVWKQGTLKGIRCRGGFTVKEMTWANGELQTAIITSN 749
>gi|402307321|ref|ZP_10826347.1| putative lipoprotein [Prevotella sp. MSX73]
gi|400378835|gb|EJP31686.1| putative lipoprotein [Prevotella sp. MSX73]
Length = 796
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 289/765 (37%), Positives = 420/765 (54%), Gaps = 47/765 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PLK+ +N PA F +A+PIGNGRLGA+V+GG ++++ +N+ TLWTG P + DA +
Sbjct: 26 PLKLWYNRPATAFEEALPIGNGRLGALVYGGADTDSIYINDLTLWTGKPVELNEGGDAHR 85
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEE--TYR 125
+ +R + +G Y A + GH ++ YQ LL +L + + E+ +
Sbjct: 86 WIPVIRKELIAGNYKNADILQHLVQGHNSEYYQSLALLTLTDLNASPAEGRSGEDFGGLK 145
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LD+++A R Y G V + RE+F+S PD +I +I + SG+++ ++L S++ +
Sbjct: 146 RSLDIDSAVCRDTYHRGGVTYIREYFASAPDSLIAIRIRANRSGAINCRLALTSVVPHQV 205
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
G Q+ M G G D + I F AIL++K D G ++A D L V
Sbjct: 206 KATGR-QLTMTGHAIG----------DPLQSIHFCAILKVKTDD--GQVAA-SDSSLTVN 251
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ + V +SF+G +P + +++ + N++Y++ RH+ DY++LF
Sbjct: 252 GASEVTVYFVNRTSFNGFDKHPVTAGAPYLAKATDDIWHTENITYNECRQRHVTDYKRLF 311
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R LS + D + T E+ + + ER +P L L Q+GRYLLIS S
Sbjct: 312 DRFRFTLSGAKPD-YSRTTEEQLMAYSDNGER--------NPYLEMLYMQYGRYLLISCS 362
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R ANLQG+W W +NINLE NYW + +L E P+ + ++
Sbjct: 363 RTPGVPANLQGLWAPQKYSPWRGNYTININLEENYWPAEVTDLGELVMPVDGLVRAMAAT 422
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G TA Y + GW H +DIWA ++ + W+ W MGGAWL LW+HY++T
Sbjct: 423 GRHTAAHYYGIDEGWCAGHNSDIWAMTNPVGTGKESPKWSNWNMGGAWLVQTLWDHYDFT 482
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D +L AYPL++G A F+L WL+E G L T P TSPE E+I G C Y
Sbjct: 483 RDTHYLRNTAYPLMKGAADFMLRWLVESPVKRGELITAPCTSPEAEYINDKGYQGCTFYG 542
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFK 598
T D+AI+RE+F+ + AAE+L N DA + L+S L L P KI + G++ EW D+
Sbjct: 543 GTSDLAIVRELFTNTLRAAEIL--NIDAGYRQQLRSSLDHLAPYKIGKRGNLQEWYYDWD 600
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
D + HHRH SHL G++P I++ P L AA KTL+ +G+ GWS W+ +LWARLH
Sbjct: 601 DQDWHHRHQSHLLGVYPFKQISVYHTPLLANAAIKTLEIKGDNSTGWSTGWRISLWARLH 660
Query: 659 DQEHAYRMVKRLFNLV------DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
++ AY+M+++L V DP+H GG Y NLF AHPPFQID NFG TA V EM
Sbjct: 661 RRDKAYQMLRKLLTYVRPANYNDPKHRP--AGGTYPNLFDAHPPFQIDGNFGGTAGVCEM 718
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LVQS + LLPALP + W +G V GLKARG V + WK+G +
Sbjct: 719 LVQSDGALMELLPALP-EAWPAGSVSGLKARGNYRVDMTWKNGKV 762
>gi|390958737|ref|YP_006422494.1| hypothetical protein Terro_2924 [Terriglobus roseus DSM 18391]
gi|390413655|gb|AFL89159.1| hypothetical protein Terro_2924 [Terriglobus roseus DSM 18391]
Length = 824
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 291/770 (37%), Positives = 414/770 (53%), Gaps = 55/770 (7%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
P ++ F PA + DA+PIGNGRLG MV+GG + + LNEDTLW+G P D NP A
Sbjct: 38 PYQLWFRTPAAEWIDALPIGNGRLGGMVFGGALEDHIALNEDTLWSGYPQDGNNPAAKSK 97
Query: 72 LSDVR-SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
L VR +++ + Y A ++ G + YQ LG + + H + YRR+L+L
Sbjct: 98 LPLVRQAVLKNKDYHLADTLCKEMQGPYSAAYQPLGGLHVTL---HQEGELADYRRDLNL 154
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
+TA A+ Y +G+V +++ F S PD V+V I ++ ++ + LDS L + V G+
Sbjct: 155 DTAIAKTTYRLGDVSVSKKAFVSFPDDVLVMLIETTKP--VTMEIRLDSKLRHEVSVAGH 212
Query: 191 NQIIMEGRCPGKRIP-------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ ++G+ P P P ++ KG+ F+A I SD ++ +D L+
Sbjct: 213 -ALQLKGKAPVVSRPNYVKSQDPIQYSDTPGKGMFFAAGASIH-SDG---VTNAKDGALQ 267
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ + V+LL A + F G + P + L + + + L H+ ++
Sbjct: 268 IANAKSVVILLAAGTGFRGHGLLPDKPMAEIMGRVQQTLANASRKTAAQLERVHIAAHRA 327
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
+F R + L + +D+ T AER+ F DPSL+ L FQFGRYLLIS
Sbjct: 328 VFRRTLLDLGK--QDLTRST-----------AERLSDFAAHPDPSLLALYFQFGRYLLIS 374
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPGTQ ANLQGIWN+DL W NIN++MNYW + CNLS+ P FD L LS
Sbjct: 375 SSRPGTQPANLQGIWNDDLRAPWSCNWTSNINIQMNYWLAETCNLSDFHAPFFDLLQSLS 434
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNY 480
G++TA+ NY GWV HH DIW+ SS G WA + M WLC HLW+HY +
Sbjct: 435 ETGARTAKTNYGLPGWVSHHNIDIWSLSSPVGEGEGDPSWANFAMSAPWLCAHLWDHYCF 494
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
T D++FL RAYPL++G A F WLI G L T PS S E++F APDGK A VS
Sbjct: 495 TQDQNFLRTRAYPLMKGAAQFCSSWLIPDDQGNLTTCPSVSTENQFTAPDGKRASVSAGC 554
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
TMD+A+IRE+FS AA+VL + D ++ + +L P + + G + EW+ DF +P
Sbjct: 555 TMDIALIREIFSNCAEAAKVLNVDHD-WANQLQQQSAKLVPYAVGQYGQLQEWSVDFPEP 613
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARL 657
E RH+SHL+ ++PG E+ P A +L++R G GWS W + LWAR+
Sbjct: 614 EPGQRHMSHLYPIYPGSEFDSERTPQWMAAGRVSLERRLSHGGAYTGWSRAWASNLWARM 673
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP-----FQIDANFGFTAAVAEM 712
D + +L+N + + H +N HP FQID NFG T+A+AEM
Sbjct: 674 GDGD-------QLWNSL----QMHLMHSSAANFLDTHPAGKGSIFQIDGNFGTTSAIAEM 722
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
L+QS + +LPALP +G V GLKARG TV I W+ G L ++
Sbjct: 723 LLQSHNGTIRILPALP-KAIHTGSVAGLKARGDVTVDIAWEQGRLSKLAF 771
>gi|262405238|ref|ZP_06081788.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294646990|ref|ZP_06724607.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|345508052|ref|ZP_08787692.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|229444703|gb|EEO50494.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|262356113|gb|EEZ05203.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292637661|gb|EFF56062.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
Length = 811
Score = 483 bits (1244), Expect = e-133, Method: Compositional matrix adjust.
Identities = 287/763 (37%), Positives = 421/763 (55%), Gaps = 60/763 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK++++A+PIGN RLGAMV+GG E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAKNWSEALPIGNSRLGAMVYGGTEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATA---ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
VR L+ G+ EA A+ H Y LG++ LEF K A++ YR +L+
Sbjct: 82 PIVRKLIFEGRNKEAQRLIDANFLTRQHGMS-YLTLGNLYLEFPGH--KDADDFYR-DLN 137
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L AT +Y V + +TR F+S D VI+ I S+ +L+FNVS + L N V
Sbjct: 138 LENATTTTRYQVNGINYTRTTFASFTDNVIIMHIKASQPNALNFNVSYNCPLKNEVNVQN 197
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ II C GK + +G++ + E ++ I L++ G
Sbjct: 198 DKLIIT---CQGK----------EQEGMKAALRAECQVQVKTDGIIHPAGNILQINGGTE 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N + D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQNVSADESRRTTDYLEEAILIPYEKALKEHIAFYKKQFDRVQ 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L S + + R+++F D ++ LLFQ+GRYLLISSS+PG
Sbjct: 301 LHLPSS------------EASQIETPRRIENFGQGNDMAMAALLFQYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCWGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPTYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDIDNPKDEH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA+
Sbjct: 573 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAF 632
Query: 665 RMVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+++K + L+ +H +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++
Sbjct: 633 QIIKNMIQLLPNDHLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVH 692
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL ARG TV + WK+ L++ I SN
Sbjct: 693 LLPALP-DAWEEGSVKGLVARGNFTVDMDWKNNVLNKAIIRSN 734
>gi|317474862|ref|ZP_07934132.1| glycoside hydrolase [Bacteroides eggerthii 1_2_48FAA]
gi|316909000|gb|EFV30684.1| glycoside hydrolase [Bacteroides eggerthii 1_2_48FAA]
Length = 801
Score = 483 bits (1244), Expect = e-133, Method: Compositional matrix adjust.
Identities = 298/765 (38%), Positives = 426/765 (55%), Gaps = 52/765 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+TLW G P + NP+A + +
Sbjct: 12 KLWYDEPAQVWTEALPLGNGRLGAMVYGTPAMENIQLNEETLWAGRPNNNANPNALEYIP 71
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ G + + F H +Y + Y RE
Sbjct: 72 KVRQLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGHLRIAFP-GHTRYTD--YYRE 125
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V Y+V V + RE +S DQV++ ++S S G ++ N L S +
Sbjct: 126 LSLDSARTVVCYTVDGVRYRRETITSLADQVVMVRLSASRPGMITCNAHLTSPHQDVMIA 185
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ ++I + G ++ ++ KG + F + ++ +G S+ D L VE
Sbjct: 186 SEGDEITLSG---------VSSWHEGLKGKVLFQGRMAVRT---QGGHSSCADGVLAVEK 233
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D A L +++F +N D + S + L + SY HL Y+
Sbjct: 234 ADEATFYLSIATNF----VNYKDITGNEVERSKNYLHAALKHSYRQSLLEHLAIYKSYMD 289
Query: 307 RVSIQLSRSP-KDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L D+ TD RV++F+ +D LV F+FGRYLLI SS
Sbjct: 290 RVDLDLGHDRYADVTTDM-------------RVQNFRETQDDFLVATYFRFGRYLLICSS 336
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PG Q ANLQGIWN+ L P+WDS NINLEMNYW + NLSE +PL ++ +S
Sbjct: 337 QPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPAEVTNLSELHQPLMQLISEVSET 396
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D
Sbjct: 397 GRETAKTMYGAEGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDVG 455
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL + AYP+++ A F ++ E +L PS SPE+ GK + + TMD
Sbjct: 456 FL-RTAYPIMKEAAKFFDQIMVKEPVHNWLVVCPSNSPENVHAGSKGK-STTAPGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
+I ++++ +I+ A +L +E L + L + P ++ G + EW D+ DP+ H
Sbjct: 514 QLIFDLWNQVITTARLLNTDE-TLAVHYEQRLREMAPMQVGRWGQLQEWMFDWDDPKDVH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +HAY
Sbjct: 573 RHISHLYGLFPSNQISPFRTPELWDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGDHAY 632
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+++ LV E +K GG Y NLF AHPPFQID NFG TA +AEML+QS +YLL
Sbjct: 633 KLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIAEMLMQSHDGFVYLL 689
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP W G ++G+KARGG + CWK+G L ++ IYS+ N
Sbjct: 690 PALP-ANWKEGRIRGIKARGGFELDFCWKNGKLDKLTIYSSKGGN 733
>gi|354584080|ref|ZP_09002977.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
gi|353197342|gb|EHB62835.1| Alpha-L-fucosidase [Paenibacillus lactis 154]
Length = 844
Score = 483 bits (1242), Expect = e-133, Method: Compositional matrix adjust.
Identities = 287/792 (36%), Positives = 429/792 (54%), Gaps = 70/792 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S + PL++ + PA + +A+PIGNGRLG MV+G E ++LNED+LW G PG N
Sbjct: 31 SGAVERPLRLWYTSPAAEWNEALPIGNGRLGGMVFGRTGLERVQLNEDSLWYGGPGRGGN 90
Query: 66 PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEE 122
P+A L D+R L+ G+ AEA A + + P YQ LGD+ L+F ++
Sbjct: 91 PNAIPYLGDIRQLLQDGRQAEAEHLARMAMTSSPKYEQPYQPLGDLLLKFLNAEAPATH- 149
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLL 181
Y RELDL + A V Y+ G + + R++F+S PD V+V +++ GSL+F +L
Sbjct: 150 -YERELDLQRSMAAVTYTSGGITYRRQYFASAPDGVLVIRLTADRPGSLTFAANLMRRPF 208
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
D + GN+ + M+G +A A+ G+ F A L + + + G I + D
Sbjct: 209 DCGTRSIGNDTLTMKG---------EAGAD----GVSFCASL--RGAAEGGNIRIIGDF- 252
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ VEG+D LLL A ++F + P + L ++ Y L++RH+++Y
Sbjct: 253 MSVEGADAVTLLLSAQTTF---------RCRKPEEMCLQQLDHASSIPYERLFSRHVEEY 303
Query: 302 QKLFHRVSIQL---------SRSPKD----------IVTDTCSEENIDTVPSAERVKSFQ 342
++ F R S++L + P D V+++ + ++ E
Sbjct: 304 REKFGRFSLKLEVDAGARDYASLPTDQRLNLLKERVRVSNSGANPEGNSGADPEGNSGAY 363
Query: 343 TDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
D+DP L+EL Q+GRYLL+SSSRPG+ ANLQGIWN+ +P W+S +N N++MNYW
Sbjct: 364 PDDDPGLIELYVQYGRYLLLSSSRPGSLAANLQGIWNDSFTPPWESKYTINANIQMNYWP 423
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
+ L EC EPLFD + + NG KTA Y G+ HH T++W ++ + + +
Sbjct: 424 AELLGLPECHEPLFDLIHRMLPNGRKTAGEMYGCRGFAAHHNTNVWGETRPEGILMTCTV 483
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WPMG AWLC HLWEH + D DFL RAYP+++ A FLLD++ +G T PS SP
Sbjct: 484 WPMGAAWLCLHLWEHVRFGGDADFLRDRAYPVMKEAAIFLLDYMTIDGEGRRITGPSVSP 543
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLR 580
E+ F+ PDG + + +MD I + A + A +L ++ L +E ++++P
Sbjct: 544 ENRFVLPDGAVGSLCMGPSMDSQIAHALLQACLEAGRLLGEDTRFLDELEAAIRNIP--- 600
Query: 581 PTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE 640
+I G IMEW +D+++ + HRH+S LF L+PG I P+L +AA++TL++R
Sbjct: 601 APQIGRHGGIMEWLEDYEEADPGHRHISQLFALYPGEQIDPFHTPELAEAAKRTLERRLA 660
Query: 641 EG---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
G GWS W +ARL + AY + +L + N+ HPPF
Sbjct: 661 HGGGHTGWSRAWIINYYARLLNGTEAYGHLLQL-----------LASSTFPNMLDCHPPF 709
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID NFG A V EML+QS +L LLPALP WSSG VKGL+ARGG V I W+DG+L
Sbjct: 710 QIDGNFGGIAGVGEMLLQSHAGELRLLPALP-SGWSSGDVKGLRARGGWVVDIRWEDGEL 768
Query: 758 HEVGIYSNYSNN 769
E +Y++ +
Sbjct: 769 SEAKVYASRAGR 780
>gi|431798012|ref|YP_007224916.1| hypothetical protein Echvi_2666 [Echinicola vietnamensis DSM 17526]
gi|430788777|gb|AGA78906.1| hypothetical protein Echvi_2666 [Echinicola vietnamensis DSM 17526]
Length = 819
Score = 483 bits (1242), Expect = e-133, Method: Compositional matrix adjust.
Identities = 285/756 (37%), Positives = 413/756 (54%), Gaps = 43/756 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA + +A+PIGNGRLGAMV+G E ++LNE+TL+ G P NPDA +AL
Sbjct: 30 LKLWYDDPAASWVEALPIGNGRLGAMVFGDPYEEVIQLNENTLYAGRPHRNDNPDAKEAL 89
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++V+S++ GQY A + F G YQ +G ++L FDD + YRRELDL
Sbjct: 90 AEVQSMIFDGQYGAAQHRINETFFSGINGMPYQTMGQLKLYFDDER---EVKEYRRELDL 146
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A Y G+ FT + +S+PDQV+V ++ + G++ F +D N
Sbjct: 147 KKALVTTHYKKGDTHFTTQVLASHPDQVMVIHLTADKPGAIHFTALVDRPGPFQLQHAAN 206
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+++M G + G++F+ + +K S + + + V ++ A
Sbjct: 207 GELLMTGTS--------GDHEGIKGGVEFATRVRVKHSKGEMVKTG---EGIAVNNANSA 255
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ + +++F D + S L+ S+ + H +D+++ F RVS+
Sbjct: 256 TIYISMATNFK----QYDDISGNAVELSKQHLEKALGKSFDQIRKSHEEDHRRYFDRVSL 311
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L E + P+ +RV++F +DP L L FQFGRYLLI++SR G Q
Sbjct: 312 DLG------------ESEAEKDPTDKRVENFSKRDDPGLAALYFQFGRYLLIAASRAGGQ 359
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ L+P WDS VNIN EMNYW S +LSE EPL + + LS G KTA
Sbjct: 360 PANLQGIWNDQLNPAWDSKYTVNINTEMNYWPSEITHLSEMNEPLVEMVRELSQTGRKTA 419
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y A GW +HH TD+W + G W +WPMGGAWL HL + ++++ D +L K
Sbjct: 420 KDMYGARGWAMHHNTDLWRITGPVDG-AFWGMWPMGGAWLTQHLLDKFDFSGDTTYL-KS 477
Query: 491 AYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHE-FIAPDGKLACVSYSSTMDMAIIR 548
YP+L+ F LD L + G+ PS SPE+ ++ D A V TMD ++
Sbjct: 478 IYPILKEACLFYLDILKVAPETGWKVVVPSISPENAPYLDHD---ASVGAGHTMDNQLLS 534
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
++F AA +L+ + A E++ S L P +I G + EW D+ +PE HHRH+S
Sbjct: 535 DLFQRTSRAASILD--DKAFAEQLKDSWALLAPMQIGRWGQLQEWMYDWDNPEDHHRHVS 592
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVK 668
HL+GL+P + I+ P L +AA+ +L RG+E GWS+ WK LWARL D HA +++K
Sbjct: 593 HLYGLYPSNQISPYHTPKLFQAAKTSLMARGDESTGWSMGWKVNLWARLLDGNHALKLIK 652
Query: 669 RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALP 728
+ K +GG Y NLF AHPPFQID NFG A +AEMLVQS ++LLPALP
Sbjct: 653 DQLSPSIQADGKQ-KGGTYPNLFDAHPPFQIDGNFGCAAGIAEMLVQSHDGAIHLLPALP 711
Query: 729 WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
D W +G V GL+ RGG V + WK+G +V I S
Sbjct: 712 -DAWETGKVSGLRTRGGFEVEMAWKNGKPQKVTISS 746
>gi|299147445|ref|ZP_07040510.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298514723|gb|EFI38607.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 826
Score = 483 bits (1242), Expect = e-133, Method: Compositional matrix adjust.
Identities = 288/786 (36%), Positives = 436/786 (55%), Gaps = 50/786 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 25 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGNPQLEQIQLNEETVSAGSPYQNYNEEAKT 84
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 85 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYPD-HKKV--NNYYRD 141
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 142 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 201
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R + + A L++K G + D L V+G
Sbjct: 202 IYGKKGLRLEGITHGSRYFSGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 251
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 252 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 306
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + + + +++D R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 307 RVTLDLGETSQ-------ANKSMDV-----RIKEFSSSYDPALIALYFQYGRYLLISSSQ 354
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 355 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 414
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 415 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 472
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 473 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 528
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFKDPE 601
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +D+ P
Sbjct: 529 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFEDWDHPN 586
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GL+PG+ I+ ++P L +AA+ TL +RG+ GWS+ WK W+R+ D +
Sbjct: 587 DRHRHISHLWGLYPGYQISPYRSPVLFEAAKNTLIQRGDPSTGWSMGWKVCFWSRMLDGD 646
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++K V PE +K GG Y NLF AHPPFQID NFG TA +AEMLVQS +
Sbjct: 647 HAYQLIKNQLTYVSPEIQKGQGGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLVQSHDGAI 706
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSNYSNNDH-DSFKTLHY 779
+LLP+LP +W SG VKGL+ARGG + + WKDG L + + S N S+ L
Sbjct: 707 HLLPSLP-SEWKSGTVKGLRARGGFLIDELTWKDGKLVKAVLRSEIGGNLRLRSYWKLAA 765
Query: 780 RGTSVK 785
G S+K
Sbjct: 766 EGASLK 771
>gi|423215145|ref|ZP_17201673.1| hypothetical protein HMPREF1074_03205 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692408|gb|EIY85646.1| hypothetical protein HMPREF1074_03205 [Bacteroides xylanisolvens
CL03T12C04]
Length = 816
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 284/769 (36%), Positives = 429/769 (55%), Gaps = 49/769 (6%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N +I ++ PA ++ +A+P+GNGR+ AMV+G E ++LNE+T+ G P N +A
Sbjct: 15 NIYRIWYDKPASYWEEALPVGNGRIAAMVFGDPQLEQIQLNEETVSAGSPYQNYNEEAKT 74
Query: 71 ALSDVRSLVDSGQYAEA-TAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRE 127
AL ++R L+ G+Y EA A+ K+ + YQ +G + + + D H K Y R+
Sbjct: 75 ALPEMRRLIFEGKYEEAQNMAATKILSQVGNEMPYQTVGRLNIRYQD-HKKV--NNYYRD 131
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSY 186
LD++ A A +Y V VEFT E F+S DQ+++ I S+ G+++ + ++ + D
Sbjct: 132 LDISNAVAVTRYEVDGVEFTEETFASFTDQLVIRHIKASKPGTINCELFFNTPMRDPKRS 191
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ G + +EG G R + + A L++K G + D L V+G
Sbjct: 192 IYGKKGLRLEGITHGSRYFSGK--------VHYCADLDVK--HKGGKVITANDTLLSVQG 241
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L + +++F +N D DP + + L++ YS H+ YQK F+
Sbjct: 242 ASELTLYISMATNF----VNYKDISGDPYQRNKAYLKNAAK-DYSKAKAAHIAAYQKQFN 296
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV++ L + + + +++D R+K F + DP+L+ L FQ+GRYLLISSS+
Sbjct: 297 RVTLDLGETSQ-------ANKSMDV-----RIKEFSSSYDPALIALYFQYGRYLLISSSQ 344
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQG WN + P W NIN EMNYW + NL+E +P + LS NG
Sbjct: 345 PGCQPANLQGKWNHNPGPPWSCNYTTNINAEMNYWPAEITNLAELHKPFIQMVRELSENG 404
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
+ A Y GWV+HH TD+W + A DR WP+ AWLC HLW+ Y ++ D+
Sbjct: 405 REAASRMYGCRGWVLHHNTDLWRMTGAVDRPYC--GTWPVANAWLCQHLWDRYLFSGDKK 462
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH--EFIAPDGKLACVSYSSTM 542
+LE+ YP+++ + F +D+L+ + + GYL PS SPE+ +I L TM
Sbjct: 463 YLEE-VYPMMKSASEFFVDFLVRDPNTGYLVVTPSNSPENSPRWIKKKSNLFA---GITM 518
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFKDPE 601
D ++ ++FS AA+VL N D LK++ R L P ++ + G + EW +D+ P
Sbjct: 519 DNQLVFDLFSNTCEAAKVL--NADTDFCDTLKNMRRQLPPMQVGQYGQLQEWFEDWDHPN 576
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL+GL+PG+ I+ ++P L +AA+ TL +RG+ GWS+ WK WAR+ D +
Sbjct: 577 DRHRHISHLWGLYPGYQISPYRSPVLFEAAKNTLIQRGDPSTGWSMGWKVCFWARMLDGD 636
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++K V PE +K GG Y NLF AHPPFQID NFG TA +AEMLVQS +
Sbjct: 637 HAYQLIKNQLTYVSPEIQKGQGGGTYPNLFDAHPPFQIDGNFGCTAGIAEMLVQSHDGAI 696
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSNYSNN 769
+LLP+LP +W SG VKGL+ARGG + + WKDG L + + S N
Sbjct: 697 HLLPSLP-SEWKSGTVKGLRARGGFLIDELTWKDGKLVKAVLRSETGGN 744
>gi|218129080|ref|ZP_03457884.1| hypothetical protein BACEGG_00654 [Bacteroides eggerthii DSM 20697]
gi|217988715|gb|EEC55034.1| hypothetical protein BACEGG_00654 [Bacteroides eggerthii DSM 20697]
Length = 828
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 298/765 (38%), Positives = 426/765 (55%), Gaps = 52/765 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+TLW G P + NP+A + +
Sbjct: 39 KLWYDEPAQVWTEALPLGNGRLGAMVYGTPAMENIQLNEETLWAGRPNNNANPNALEYIP 98
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ G + + F H +Y + Y RE
Sbjct: 99 KVRQLVFEGKYLEAQTLATEKVMAKTNSGMP---YQSFGHLRIAFP-GHTRYTD--YYRE 152
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V Y+V V + RE +S DQV++ ++S S G ++ N L S +
Sbjct: 153 LSLDSARTVVCYTVDGVRYRRETITSLADQVVMVRLSASRPGMITCNAHLTSPHQDVMIA 212
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ ++I + G ++ ++ KG + F + ++ +G S+ D L VE
Sbjct: 213 SEGDEITLSG---------VSSWHEGLKGKVLFQGRMAVRT---QGGHSSCADGVLAVEK 260
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D A L +++F +N D + S + L + SY HL Y+
Sbjct: 261 ADEATFYLSIATNF----VNYKDITGNEVERSKNYLHAALKHSYRQSLLEHLAIYKSYMD 316
Query: 307 RVSIQLSRSP-KDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L D+ TD RV++F+ +D LV F+FGRYLLI SS
Sbjct: 317 RVDLDLGPDRYADVTTDM-------------RVQNFRETQDDFLVATYFRFGRYLLICSS 363
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+PG Q ANLQGIWN+ L P+WDS NINLEMNYW + NLSE +PL ++ +S
Sbjct: 364 QPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPAEVTNLSELHQPLMQLISEVSET 423
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G +TA+ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D
Sbjct: 424 GRETAKTMYGAEGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDVG 482
Query: 486 FLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL + AYP+++ A F ++ E +L PS SPE+ GK + + TMD
Sbjct: 483 FL-RTAYPIMKEAAKFFDQIMVKEPVHNWLVVCPSNSPENVHAGSKGK-STTAPGCTMDN 540
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
+I ++++ +I+ A +L +E L + L + P ++ G + EW D+ DP+ H
Sbjct: 541 QLIFDLWNQVITTARLLNTDE-TLAVHYEQRLREMAPMQVGRWGQLQEWMFDWDDPKDVH 599
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +HAY
Sbjct: 600 RHISHLYGLFPSNQISPFRTPELWDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGDHAY 659
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+++ LV E +K GG Y NLF AHPPFQID NFG TA +AEML+QS +YLL
Sbjct: 660 KLITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIAEMLMQSHDGFVYLL 716
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
PALP W G ++G+KARGG + CWK+G L ++ IYS+ N
Sbjct: 717 PALP-ANWKEGRIRGIKARGGFELDFCWKNGKLDKLTIYSSKGGN 760
>gi|406660853|ref|ZP_11068981.1| hypothetical protein B879_00989 [Cecembia lonarensis LW9]
gi|405555406|gb|EKB50440.1| hypothetical protein B879_00989 [Cecembia lonarensis LW9]
Length = 778
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 288/796 (36%), Positives = 423/796 (53%), Gaps = 58/796 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKALSDV 75
+ PA + +A+P+GNGRLGAMV+G +E ++LNED+LW G P D+ + P+ L +
Sbjct: 28 YEKPASIWEEALPLGNGRLGAMVFGQTSTERIQLNEDSLWPGGPDDWGPAEGKPEDLEFI 87
Query: 76 RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
R L+ G+ +A + V F + +Q LGD+ L+ + YRRELDL+ A
Sbjct: 88 RQLLLHGENKKADSLLVAKFSRKSITRSHQTLGDLWLDLGHEEVS----NYRRELDLDRA 143
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-----SYVN 188
+ Y+V F ++ FSS PDQ IV ++ ++ + L D+
Sbjct: 144 LVTISYTVEGYVFLQKVFSSAPDQAIVIRLESKHPKGINGKIRLSRPEDDGYPTVTVQAT 203
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N + MEG +R + + G++F I + I ++ G D +++EG +
Sbjct: 204 SNQTLQMEGEITQRRGQIDSKPSPILHGVKFQTI--VFIENESGKTFQKGDH-IELEGVE 260
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ LV ++S+ +D ++ LQ+I+ ++ +L RH+ DYQ LF RV
Sbjct: 261 ALNIKLVTNTSY---------YHQDFQRKNQEQLQNIKAKTFEELEQRHITDYQSLFQRV 311
Query: 309 SIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
L +P DI TD ERVK + + D L LLF FGRYLLISSSRP
Sbjct: 312 KFSLEEPNPLDIPTDQ----------RIERVK--EGNSDLYLESLLFDFGRYLLISSSRP 359
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQG+WN + W++ H+NINL+MNYW + NLSE EP FD++ L ++G
Sbjct: 360 GTLPANLQGLWNRHIEAPWNADYHLNINLQMNYWPAEVTNLSELHEPFFDYMDQLILSGK 419
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y G + H +D+W + + W W G W+ H WE Y +T D++FL
Sbjct: 420 KTARETYGMRGSALAHGSDLWHMTFLQAAQAYWGAWLGAGGWMMQHFWERYLFTQDKNFL 479
Query: 488 EKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+R P +E A+F LDWL+ DG ++PSTSPE+ FI G+ + + MD I
Sbjct: 480 RQRFLPAMEEIAAFYLDWLVPYPEDGTWVSSPSTSPENSFINAKGESVASTMGAAMDQQI 539
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
I EVF + A+++L L E K + DG ++EW Q++++PE HRH
Sbjct: 540 IAEVFDHFMQASKILGYQSPVLDEVKSKRQNLRSGLRTGNDGRLLEWDQEYEEPEKGHRH 599
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEHA 663
+SHL+ PG+ IT K P+L +A +KTL R G G GWS W ARLHD E A
Sbjct: 600 MSHLYAFHPGNAITKNKTPNLFEAVKKTLDYRLAHGGAGTGWSRAWLINFSARLHDGEMA 659
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
+ +++L + LY NLF AHPPFQID NFG+TA VAEML+QS ++L
Sbjct: 660 HEHIQKL-----------IQQSLYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDGFIHL 708
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTS 783
LPALP W +G + GLKARG TV++ WK+G+L I + L Y+G
Sbjct: 709 LPALP-KAWKNGKITGLKARGNFTVNMEWKEGELKTASISAPIGGK-----AFLKYKGNL 762
Query: 784 VKVNLSAGKIYTFNRQ 799
++++L G+ + F+ Q
Sbjct: 763 LEIDLEKGETFEFSLQ 778
>gi|325103196|ref|YP_004272850.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324972044|gb|ADY51028.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 821
Score = 482 bits (1241), Expect = e-133, Method: Compositional matrix adjust.
Identities = 282/779 (36%), Positives = 432/779 (55%), Gaps = 51/779 (6%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ NA + LK+ ++ P++++ +A+PIGNGRLGAMV+G E ++LNE+T+W+G P
Sbjct: 15 VANANAQQHDKTLKLWYDAPSRNWNEALPIGNGRLGAMVFGNPDREKIQLNEETVWSGGP 74
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLF--GHPADVYQLLGDIELEFDDSHL 117
++ A+ +R L+ ++ EA A A V +F + +YQ +GD+ + F H
Sbjct: 75 NTNITAESGAAIPKLRQLIFEEKFLEAQALADVDMFPKKNSGMIYQPVGDLLINFP-GHA 133
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ E Y R+L++ A V Y + V + RE F+S PDQVI+ +++ + ++FN SL
Sbjct: 134 QV--EKYYRDLNIEKAVTTVSYRLNGVNYKRETFASFPDQVIIVRLTADKPNKITFNASL 191
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
S ++ + N ++I+ G A+ + I+F ++ K+ +G + L
Sbjct: 192 TSPQNSAQKIE-NGKLILTGLT--------ADHEGEKGQIKFETQVKTKV---KGGKAEL 239
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
KV ++ A++ + +++F + +D + ++ + L +Y D +H
Sbjct: 240 TGSLWKVTNANEAIIYISMATNF----VKYNDISGNQHVKASNYLDKAFVKNYDDALKQH 295
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
+ YQ+ F+RV D+ + + P+ R+ F DP L L FQFG
Sbjct: 296 IAFYQQYFNRVKF-------DVGVNASVNK-----PTDRRIYEFAKSFDPHLAALYFQFG 343
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI SS+PG Q LQGIWN+ + WDS +NIN EMNYW + NLSE +PLF+
Sbjct: 344 RYLLICSSQPGNQPPTLQGIWNDRMDAPWDSKYTININTEMNYWPAEVTNLSELHQPLFN 403
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWE 476
L L++ G TAQ Y A GWV HH TD+W + DR LWPMGG WL HLW+
Sbjct: 404 MLEDLAVTGQATAQSMYGAKGWVTHHNTDLWRITGPVDRPYA--GLWPMGGNWLSQHLWD 461
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLAC 535
HY +T ++DFL K+ YP+L+G + F LD L E +L +PS SPE+ ++ +GK
Sbjct: 462 HYQFTGNKDFL-KKYYPVLKGASDFYLDILQEEPKHKWLVVSPSNSPENTYV--EGKRVS 518
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWA 594
++ +TMD ++ ++FS AAE+L ++D +LK + RL P +I + + EW
Sbjct: 519 IAAGTTMDNQLLFDLFSKTAKAAEILGIDKD--YSTLLKQKINRLAPMQIGKYSQLQEWM 576
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
D+ P+ HRH+SHL+GL+P + I+ P+L AA +L RG+ GWS+ WK LW
Sbjct: 577 YDWDRPDDKHRHVSHLYGLYPSNQISPYSTPELFDAARTSLIYRGDPATGWSMGWKVNLW 636
Query: 655 ARLHDQEHAYRMVKRLFNLV----DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
AR D HAY+++ LV D + K GG Y N+F AHPPFQID NFG TA +A
Sbjct: 637 ARFLDGNHAYKLITDQLKLVGGSIDSVNVKG--GGTYPNMFDAHPPFQIDGNFGCTAGIA 694
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
EM++QS +++LPALP D W +G + GL ARGG V + W+ L E+ + S N
Sbjct: 695 EMILQSHDGAIHILPALP-DIWPTGKMTGLVARGGFVVDVVWEKSKLKELKVTSRLGGN 752
>gi|431797172|ref|YP_007224076.1| hypothetical protein Echvi_1807 [Echinicola vietnamensis DSM 17526]
gi|430787937|gb|AGA78066.1| hypothetical protein Echvi_1807 [Echinicola vietnamensis DSM 17526]
Length = 792
Score = 482 bits (1240), Expect = e-133, Method: Compositional matrix adjust.
Identities = 288/801 (35%), Positives = 435/801 (54%), Gaps = 60/801 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + +A+P+GNGRLGAMV+G +E ++LNED++W G + +P L+ +R
Sbjct: 37 YEQPAGSWEEALPVGNGRLGAMVFGQTSTERIQLNEDSMWPGAADWGDSKGSPADLASLR 96
Query: 77 SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
+LV SG+ EA + F + V +Q +GD+ ++F D + YRR+L L+ A
Sbjct: 97 ALVKSGRVHEADKEIIDKFSYRGIVRSHQTMGDLFIDFGDER---EIQHYRRQLSLDDAL 153
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN-HSYVNGN--- 190
V+Y G ++T E F+S D +V +++ ++ ++F + L D+ H VN N
Sbjct: 154 VSVRYQSGGEQYTEEVFASAVDDALVIRLTTTDEAGMNFKLRLGRPKDDGHPTVNVNAPA 213
Query: 191 -NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++++M+G + + G++F L++ S G S+ E+ +L++EG
Sbjct: 214 ADELVMDGEVTQYKAAKEGQPTPLDYGVKFQTKLKVVTS---GGASSAENGELRLEGVKE 270
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
AV+ LV ++S+ + D S++ LQ + + +L H +D+ + + RVS
Sbjct: 271 AVIYLVCNTSY---------YEDDYASKNEKTLQKLGTKGFDELLLAHQEDFDEYYSRVS 321
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
+ L +DT+P+ +R+K Q +D L LFQ+GRYLLISSSRPG
Sbjct: 322 LDLGG------------HALDTLPTDKRLKRVQDGRKDEGLAAALFQYGRYLLISSSRPG 369
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T ANLQGIWN+D+ W++ H+NINL+MNYW + P +L E PLFD++ L G
Sbjct: 370 TNPANLQGIWNKDIEAPWNADYHLNINLQMNYWPAGPTHLPEMHLPLFDYVDQLIQRGKI 429
Query: 429 TAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TA+ Y + G V+HH +D+WA + W W GG W+ H WE++ +T D FL
Sbjct: 430 TAKEQYGVERGSVVHHASDLWAAPWMRANRAYWGAWIHGGGWISRHYWEYFQFTGDTTFL 489
Query: 488 EKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
++R YP L+ A+F +DWL + G + P TSPE+ ++A DG+ A +SY + M I
Sbjct: 490 KERGYPALKEFAAFYMDWLQKDDQTGLYVSYPETSPENSYLAADGQPAAISYGAAMGHQI 549
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQDFKDPEVHHR 605
I +VF +SAA+VL ED E+V L +L P I DG I+EW + +++PE HR
Sbjct: 550 ISDVFQNTLSAAKVLSI-EDDFTEEVSGKLAKLYPGVGIGPDGRILEWNEPYEEPEKGHR 608
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEH 662
H+SHL+ L PG IT E P+ A+KT+ R G G GWS W ARL D +
Sbjct: 609 HMSHLYALHPGDDIT-EDIPEAFAGAQKTIDYRLQHGGAGTGWSRAWMINFNARLLDSKS 667
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
A + +L + + NLF HPPFQID NFGFTA VAE+L+QS L
Sbjct: 668 AEENLYKLLQVSTAK-----------NLFNEHPPFQIDGNFGFTAGVAELLLQSHEGFLR 716
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
+LPALP + W SG VKGL ARG V + W+ G L ++G+ S + K + Y G
Sbjct: 717 ILPALP-ESWQSGSVKGLVARGNIEVDMIWEGGQLLKLGLKSATNQT-----KPILYNGK 770
Query: 783 SVKVNLSAGKIYTFNRQLKCT 803
+ V LSA + ++ L
Sbjct: 771 KMSVTLSADEKVWLDKDLNVV 791
>gi|410029118|ref|ZP_11278954.1| hypothetical protein MaAK2_07959 [Marinilabilia sp. AK2]
Length = 754
Score = 481 bits (1237), Expect = e-133, Method: Compositional matrix adjust.
Identities = 291/799 (36%), Positives = 428/799 (53%), Gaps = 64/799 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKALSDV 75
+ PA + +A+P+GNGRLGAMV+G +E ++LNED+LW G P D+ + P+ L +
Sbjct: 4 YEKPASIWEEALPLGNGRLGAMVFGQTSTERIQLNEDSLWPGGPDDWGPAEGKPEDLEFI 63
Query: 76 RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
R L+ G+ +A + V F + +Q LGD+ L+ + YRRELDL+ A
Sbjct: 64 RQLLLHGENKKADSLLVAKFSRKSITRSHQTLGDLWLDLGHEEVS----NYRRELDLDRA 119
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-----SYVN 188
+ Y+V F ++ FSS PDQ IV ++ ++ + L D+
Sbjct: 120 LVTISYTVEGYVFLQKVFSSAPDQAIVIRLESKHPKGINGKIKLSRPEDDGYPTVTVQAT 179
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N + MEG +R + + G++F I + I ++ G D +++EG +
Sbjct: 180 SNQTLHMEGEITQRRGQIDSKPSPILHGVKFQTI--VFIENESGKTFQKGDH-IELEGVE 236
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ LV ++S+ +D ++ LQ+I+ ++ +L RH+ DYQ LFHRV
Sbjct: 237 ALNIKLVTNTSY---------YHQDFQRKNQEQLQNIKAKTFEELEQRHITDYQSLFHRV 287
Query: 309 SIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
L +P D TD ERVK +TD L LLF FGRYLLISSSRP
Sbjct: 288 KFSLDDPNPLDSPTDQ----------RIERVKGGKTD--LYLESLLFDFGRYLLISSSRP 335
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQG+WN + W++ H+NINL+MNYW + NLSE EP FD++ L ++G
Sbjct: 336 GTLPANLQGLWNRHIEAPWNADYHLNINLQMNYWPAEVTNLSELHEPFFDYMDQLILSGK 395
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y G + H +D+W + + W W G W+ H WE Y +T D++FL
Sbjct: 396 KTARETYGMRGAALAHGSDLWNMTFLQAAEAYWGAWLGAGGWMMQHFWERYLFTQDKNFL 455
Query: 488 EKRAYPLLEGCASFLLDWLI---EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+R P +E A+F LDWL+ EG G ++PSTSPE+ FI G+ + + MD
Sbjct: 456 RQRFLPAMEEIAAFYLDWLVPYPEG--GKWVSSPSTSPENSFINAKGESVASTMGAAMDQ 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQDFKDPEVH 603
+I EVF + A+++L + ++++V LR +I DG ++EW Q++++PE
Sbjct: 514 QVIAEVFDNFMQASKIL-GYQSPILDEVKSKRQNLRSGLRIGSDGRLLEWDQEYEEPEKG 572
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQ 660
HRH+SHL+ PG+ IT K PDL A KTL R G G GWS W ARLHD
Sbjct: 573 HRHMSHLYAFHPGNAITKNKTPDLFDAVRKTLDYRLAHGGAGTGWSRAWLINFSARLHDG 632
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
E A+ +++L + LY NLF AHPPFQID NFG+TA VAEML+QS
Sbjct: 633 EMAHVHIQKL-----------IQQSLYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDGF 681
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
++LLPALP W +G + GLKARG TV++ WK+G+L I + L Y+
Sbjct: 682 IHLLPALP-KAWKNGKITGLKARGNFTVNMEWKEGELKTASISAPIGGK-----AFLKYK 735
Query: 781 GTSVKVNLSAGKIYTFNRQ 799
G ++++L G+ + F+ Q
Sbjct: 736 GNLLEIDLEKGETFEFSLQ 754
>gi|383110853|ref|ZP_09931671.1| hypothetical protein BSGG_1961 [Bacteroides sp. D2]
gi|382949363|gb|EFS31261.2| hypothetical protein BSGG_1961 [Bacteroides sp. D2]
Length = 810
Score = 480 bits (1236), Expect = e-132, Method: Compositional matrix adjust.
Identities = 291/763 (38%), Positives = 424/763 (55%), Gaps = 62/763 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG E L+LNE+T W G P N +A L
Sbjct: 22 LKLWYSQPARNWSEALPIGNSRLGAMVYGGTEREELQLNEETFWAGGPYSNNNSNAKYVL 81
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR+L+ G+ EA + F Y LG++ ++F K A YR +L+L
Sbjct: 82 PVVRNLIFDGKNREAQSLVDANFLTKQHGMSYLTLGNLYIDFPGH--KDASGFYR-DLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V V +TR F+S D VI+ I ++ +L+FN++ + L+ + +
Sbjct: 139 ENATTTTRYEVNGVTYTRTTFASFTDNVIIVHIQADKTQALNFNMTYNCPLEYNVNAQDD 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
II C GK IQ ++++K + G IS K L+VE + A
Sbjct: 199 KLIIT---CQGKE------QEGIKAAIQAECVVQVKTN---GAISP-AGKVLQVEKATEA 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L + A++++ +N + + + + L+ Y+ H+ Y+K F RV +
Sbjct: 246 TLYIAAATNY----VNYQNVSANASERANKFLEKAIQTPYNKALKDHIAFYKKQFDRVRL 301
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L SE + P R+++F ED ++ LLFQFGRYLLISSS+PG Q
Sbjct: 302 NLP----------SSEASKAETP--RRIENFNKGEDMAMAALLFQFGRYLLISSSQPGGQ 349
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++TA
Sbjct: 350 PANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVANLSETHSPLFSMLKDLSVTGAETA 409
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
Q Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T D++FL
Sbjct: 410 QSMYNCRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGDKEFL 465
Query: 488 EKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
K YP+L+G A F +D+L+E D +L PS SPEH ++ TMD I
Sbjct: 466 -KEYYPILKGTAQFYMDFLVEHPDYKWLVVAPSVSPEH---------GPITAGCTMDNQI 515
Query: 547 IREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
+ + A+ + + +D+L +++L LP P +I + + EW +D +P+
Sbjct: 516 AFDALHNTLLASRITGETSSFQDSL-QQILDKLP---PMQIGKHHQLQEWLEDVDNPKDE 571
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA
Sbjct: 572 HRHISHLYGLYPSNQISPYANPELFQAARNTLLQRGDKATGWSIGWKVNFWARMQDGNHA 631
Query: 664 YRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
++++K + L+ D +++ EG Y N+F AHPPFQID NFG+TA VAEML+QS +
Sbjct: 632 FQIIKNMIQLLPSDNLAKEYPEGRTYPNMFDAHPPFQIDGNFGYTAGVAEMLLQSHDGAV 691
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+LLPALP D W G VKGL ARG TV + WK+ L++ I+S
Sbjct: 692 HLLPALP-DAWKEGNVKGLVARGNFTVDMDWKNSQLNKAVIHS 733
>gi|315606675|ref|ZP_07881686.1| alpha-L-fucosidase [Prevotella buccae ATCC 33574]
gi|315251685|gb|EFU31663.1| alpha-L-fucosidase [Prevotella buccae ATCC 33574]
Length = 807
Score = 480 bits (1235), Expect = e-132, Method: Compositional matrix adjust.
Identities = 287/765 (37%), Positives = 417/765 (54%), Gaps = 47/765 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PLK+ +N PA F +A+PIGNGRLGA+V+GG ++++ +N+ TLWTG P + DA +
Sbjct: 37 PLKLWYNRPATAFEEALPIGNGRLGALVYGGADTDSIYINDLTLWTGKPVELNEGGDAHR 96
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEE--TYR 125
+ +R + +G Y A + GH ++ YQ LL +L + + E+ +
Sbjct: 97 WIPVIRKELFAGNYKNADILQHLVQGHNSEYYQSLALLTLTDLNASPAEGRSGEDFGELK 156
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LD+++A Y G V + RE+F+S PD +I + + SG+++ ++L S++ +
Sbjct: 157 RSLDIDSAVCSDTYHRGGVTYIREYFASAPDSLIAIRFRANRSGAINCRLALTSVVPHQV 216
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
G Q+ M G G D + I F AIL++K D G ++A D L V
Sbjct: 217 KATGR-QLTMTGHAIG----------DPLQSIHFCAILKVKTDD--GQVAA-SDSSLTVN 262
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ + V +SF+G +P + +++ + N++Y++ RH+ DY++LF
Sbjct: 263 GASEVTVYFVNRTSFNGFDKHPVTAGAPYLAKATDDIWHTENITYNECRQRHVADYKRLF 322
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R LS + + T EE + S Q + +P L L Q+GRYLLIS S
Sbjct: 323 DRFKFTLSGAKPNYSRTT--EEQL-------MAYSDQGERNPYLEMLYMQYGRYLLISCS 373
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R ANLQG+W W +NINLE NYW + +L E P+ + ++
Sbjct: 374 RTPGVPANLQGLWAPQKYSPWRGNYTININLEENYWPAEMTDLGELVMPVDGLVRAMAAT 433
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G TA Y + GW H +DIWA ++ + W+ W MGGAWL LW+HY++T
Sbjct: 434 GRHTAAHYYGIDEGWCAGHNSDIWAMTNPVGTGKESPKWSNWNMGGAWLVQTLWDHYDFT 493
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D +L AYPL++G A F+L WL+E G L T P TSPE E+I G C Y
Sbjct: 494 RDTHYLRNTAYPLMKGAADFMLRWLVESPVKRGELITAPCTSPEAEYINDKGYQGCTFYG 553
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFK 598
T D+AI+RE+F+ + AAE+L N DA + L+S L L P KI + G++ EW D+
Sbjct: 554 GTSDLAIVRELFTNTLRAAEIL--NIDAGYRQQLRSSLDHLAPYKIGKRGNLQEWYYDWD 611
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
D + HHRH SHL G++P I++ P L AA KTL+ +G+ GWS W+ +LWARLH
Sbjct: 612 DQDWHHRHQSHLLGVYPFKQISVYHTPQLANAAIKTLEIKGDNSTGWSTGWRISLWARLH 671
Query: 659 DQEHAYRMVKRLFNLV------DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
++ AY+M+++L V DP+H GG Y NLF AHPPFQID NFG TA V EM
Sbjct: 672 RRDKAYQMLRKLLTYVRPANYNDPKHRP--AGGTYPNLFDAHPPFQIDGNFGGTAGVCEM 729
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LVQS + LLPALP + W +G V GLKARG V + WK+G +
Sbjct: 730 LVQSDGTLMELLPALP-EAWPAGSVSGLKARGNYRVDMTWKNGKV 773
>gi|288925542|ref|ZP_06419475.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
gi|288337758|gb|EFC76111.1| alpha-L-fucosidase 2 (Alpha-L-fucosidefucohydrolase 2)
(Alpha-1,2-fucosidase 2) [Prevotella buccae D17]
Length = 796
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 288/765 (37%), Positives = 416/765 (54%), Gaps = 47/765 (6%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPK 70
PLK+ +N PA F +A+PIGNGRLGA+V+GG ++++ +N+ TLWTG P + DA +
Sbjct: 26 PLKLWYNRPATAFEEALPIGNGRLGALVYGGADTDSIYINDLTLWTGKPVELNEGGDAHR 85
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEE--TYR 125
+ +R + +G Y A + GH ++ YQ LL +L + + E+ +
Sbjct: 86 WIPVIRKELFAGNYKNADILQHLVQGHNSEYYQSLALLTLTDLNASPAEGRSGEDFGELK 145
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LD+++A R Y G V + RE+F+S PD +I I G+++ ++L S++ +
Sbjct: 146 RSLDIDSAVCRDTYHRGGVTYIREYFASAPDSLIAMHIRADRPGAINCCLALTSIVPHQV 205
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
G Q+ M G G D + I F AIL++K SD G ++A D L V
Sbjct: 206 KATGR-QLTMTGHAIG----------DPLQSIHFCAILKVKTSD--GQVAA-SDSSLTVS 251
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ + V +SF+G +P + +++ + N++Y++ RH+ DY++LF
Sbjct: 252 GASEVTVYFVNRTSFNGFDKHPVTAGAPYLAKATDDIWHTENITYNECRQRHVADYKRLF 311
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R L + + T EE + S Q + +P L L Q+GRYLLIS S
Sbjct: 312 DRFKFTLGGAKPNYSRTT--EEQL-------MAYSDQGERNPYLEMLYMQYGRYLLISCS 362
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R ANLQG+W W +NINLE NYW + +L E P+ + ++
Sbjct: 363 RTPGVPANLQGLWAPQKYSPWRGNYTININLEENYWPAEVTDLGELVMPVDGLVRAMAAT 422
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G TA Y + GW H +DIWA ++ + W+ W MGGAWL LW+HY++T
Sbjct: 423 GRHTAAHYYGIDEGWCAGHNSDIWAMTNPVGTGKESPKWSNWNMGGAWLVQTLWDHYDFT 482
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D +L AYPL++G A F+L WL+E G L T P TSPE E+I G C Y
Sbjct: 483 RDTHYLRNTAYPLMKGAADFMLRWLVESPVKRGELITAPCTSPEAEYINDKGYQGCTFYG 542
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFK 598
T D+AI+RE+F+ + AAE+L N DA + L+S L L P KI + G++ EW D+
Sbjct: 543 GTSDLAIVRELFTNTLRAAEIL--NIDAGYRQQLRSSLDHLAPYKIGKRGNLQEWYYDWD 600
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
D + HHRH SHL G++P I++ P L AA KTL+ +G+ GWS W+ +LWARLH
Sbjct: 601 DQDWHHRHQSHLLGVYPFKQISVYHTPLLANAAIKTLEIKGDNSTGWSTGWRISLWARLH 660
Query: 659 DQEHAYRMVKRLFNLV------DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
++ AY+M+++L V DP+H GG Y NLF AHPPFQID NFG TA V EM
Sbjct: 661 RRDKAYQMLRKLLTYVRPANYNDPKHRP--AGGTYPNLFDAHPPFQIDGNFGGTAGVCEM 718
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LVQS + LLPALP + W +G V GLKARG V + WK+G +
Sbjct: 719 LVQSDGALMELLPALP-EAWPAGSVSGLKARGNYRVDMTWKNGKV 762
>gi|300777551|ref|ZP_07087409.1| alpha-L-fucosidase [Chryseobacterium gleum ATCC 35910]
gi|300503061|gb|EFK34201.1| alpha-L-fucosidase [Chryseobacterium gleum ATCC 35910]
Length = 836
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 284/767 (37%), Positives = 418/767 (54%), Gaps = 55/767 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PAK + +A+P+GNG + AMV+G E L+LNE T W+G P NPDAPK L
Sbjct: 26 KLWYDKPAKQWVEALPVGNGNMAAMVYGDPYQEKLQLNEGTFWSGGPSRNDNPDAPKVLD 85
Query: 74 DVRSLVDSGQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R + G Y A + K +Q +GD L+ ++ LK Y RELD+
Sbjct: 86 SIRYYLFHGNYKRAQILADKGLTAKTVHGSAFQNIGDFTLDLNN--LKEIR-NYYRELDI 142
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A ++ G + F RE F+S PD VIV K+S +L+F +S L +
Sbjct: 143 EKAIATTTFTSGGIYFKREVFASIPDHVIVIKLSSDHKNALNFTAKFNSELKKNVKAIDA 202
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N + M+G + + P ++F+A+ + +G + ++ + V +
Sbjct: 203 NTLQMDGIS--------STLDGIPGQVKFNALAKFIT---KGGKTQTSEEGISVSNAHEV 251
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
++L+ +++F + + D +++ +++ N S+ L HL+ YQ F RV +
Sbjct: 252 MILISIATNF----TDYKNLNTDEVAKARKYIEAAANKSFKTLVQNHLNAYQNYFKRVDL 307
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L S + +N P+ R+K+F T DP L+ L +QFGRYLLISSS+PG Q
Sbjct: 308 NLGTSE--------AAKN----PTDVRIKNFATGYDPELISLYYQFGRYLLISSSQPGGQ 355
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN P WDS +NIN EMNYW + NLSE EPL + LS G +TA
Sbjct: 356 PANLQGIWNNSNKPAWDSKYTININTEMNYWPAEKTNLSEMHEPLIQMIKDLSETGKETA 415
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ Y + GWV HH TDIW + G V +A +WPMGGAWL HLWE Y Y+ D +L
Sbjct: 416 KTMYNSRGWVAHHNTDIWRIT----GVVDFANAGMWPMGGAWLSQHLWEKYLYSGDEHYL 471
Query: 488 EKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDG-KLACVSYSSTMDM 544
+ YP+L+ A F D+LIE H +L +PS SPE+ P G + + ++ +TMD
Sbjct: 472 -RTIYPVLKSAAQFYEDFLIEEPAHH-WLVASPSMSPEN---IPQGHQGSALAAGNTMDN 526
Query: 545 AIIREVFSAIISAAEVLEKNEDALV--EKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
++ ++F+ AA++L + D + ++ LP P KI G + EW +D DP+
Sbjct: 527 QLMFDLFTKTKKAAQILNTDSDKIQVWNTIISKLP---PMKIGSYGQLQEWMEDLDDPKD 583
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
+HRH+SHL+GLFP + I+ P+L A+ L RG+ GWS+ WK LWA+L D H
Sbjct: 584 NHRHVSHLYGLFPSNQISPFTTPELLDASRTVLIHRGDVSTGWSMGWKVNLWAKLLDGNH 643
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
A +++K LV+ + +GG Y NLF AHPPFQID NFG T+ + EML+Q+ +
Sbjct: 644 ANKLIKDQLTLVEKDGWGS-KGGTYPNLFDAHPPFQIDGNFGCTSGITEMLLQTQNGFID 702
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+LP LP D+W SG + GLKA GG VS+ W++ E+ I S N
Sbjct: 703 ILPTLP-DEWKSGSISGLKAYGGFEVSVSWENNQAKEMTIKSGLGGN 748
>gi|218129730|ref|ZP_03458534.1| hypothetical protein BACEGG_01309 [Bacteroides eggerthii DSM 20697]
gi|217988142|gb|EEC54466.1| hypothetical protein BACEGG_01309 [Bacteroides eggerthii DSM 20697]
Length = 1063
Score = 479 bits (1233), Expect = e-132, Method: Compositional matrix adjust.
Identities = 274/757 (36%), Positives = 416/757 (54%), Gaps = 46/757 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ ++ PA + +A+P+GN RLGAMV+GG E ++LNE+T W G P NP AL
Sbjct: 271 MKLWYSAPAHRWVEALPVGNSRLGAMVYGGTDKEEIQLNEETFWAGGPYSNDNPKGKGAL 330
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ VR LV + + +EA + F G + +G + F + E Y RELD+
Sbjct: 331 AKVRELVFANRLSEAQKMIDENFFTGQHGMRFLTMGSL---FINQPEHKNVENYYRELDI 387
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A +Y V V +TR FSS D VIV ++ + +L+F++S +S L + GN
Sbjct: 388 ENAVAVTRYMVDGVTYTRTVFSSFADDVIVVRMEADKPKALNFDLSYNSPLKHAVTAKGN 447
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
I+ +C G + +GI + E ++ S ++ + V + A
Sbjct: 448 ELIV---KCEGA----------EQEGIPAALNAECRVLVKHNGKSGKSNESVVVNQATVA 494
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L + A+++F +N D + + ++L+ + Y H+ Y+K F RV
Sbjct: 495 TLYISAATNF----VNYHDVSGNASKLVSTSLKRAVKIPYEQALANHIAAYKKQFDRVKF 550
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ + T+ + +RV +F +D +L+ L+FQ+GRYLLISSS+PG Q
Sbjct: 551 SIPST------------ETSTLETDKRVAAFGEGKDQNLMALMFQYGRYLLISSSQPGGQ 598
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQG+W + WDS +NIN EMNYW + NLSE +PLFD ++ LS++G KTA
Sbjct: 599 PANLQGLWCNSVYAPWDSKYTININTEMNYWPAEVTNLSENHQPLFDMVSDLSVSGKKTA 658
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y A GWV HH TD+W ++ + +WP GGAWL HLW+HY +T D++FL +R
Sbjct: 659 ETVYGARGWVAHHNTDLW-RACGPIDAAYFGMWPNGGAWLTQHLWQHYLFTGDKEFL-RR 716
Query: 491 AYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YP+++G A F L L++ +G+L T PS SPEH + C TMD I +
Sbjct: 717 YYPVMKGAADFYLSHLVKHPQNGWLVTAPSVSPEHGYAGSSITAGC-----TMDNQIAFD 771
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
+ AA +L +++ A + + + +L P +I + EW D +P HRH+SH
Sbjct: 772 ALYNTMLAARILGESQ-AYQDSLAVAFKQLPPMQIGRHNQLQEWLIDADNPRDDHRHISH 830
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
L+GL+P + I+ +P+L +AA+ TL +RG+ GWSI WK WAR+ D HAY+++K
Sbjct: 831 LYGLYPSNQISPRLHPELFQAAKNTLLQRGDAATGWSIGWKINFWARMLDGNHAYKIIKN 890
Query: 670 LFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
+ ++ D + + EG Y NLF AHPPFQID NFG+TA VAEML+QS + LLPAL
Sbjct: 891 MLRILPGDDKMREFPEGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVQLLPAL 950
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
P ++W+ G + GL ARGG V + W+ L + ++S
Sbjct: 951 P-EEWNEGSISGLVARGGFVVDMQWEGAQLLKAKVHS 986
>gi|255035637|ref|YP_003086258.1| glycoside hydrolase [Dyadobacter fermentans DSM 18053]
gi|254948393|gb|ACT93093.1| glycoside hydrolase family protein [Dyadobacter fermentans DSM
18053]
Length = 781
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 286/790 (36%), Positives = 432/790 (54%), Gaps = 68/790 (8%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+N + + PL++ + PA + + IP+GNGRLG M GGV ET+ LN+ TLW+G P
Sbjct: 13 FLNLAALAQQAPLRLWYTKPASQWEETIPLGNGRLGMMGDGGVTKETVVLNDITLWSGAP 72
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------GH------PADVYQLLGD 107
D DA ++L ++R L+ +G+ EA A K F GH P YQ+LG+
Sbjct: 73 QDANRYDAHESLPEIRRLILAGKNDEAQALVNKNFVAKGAGSGHGDGANVPFGCYQVLGN 132
Query: 108 IELEFDDSHLKYAE---ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS 164
+ LEF + A Y+REL L+ A + V Y V V +TRE+F+S D + + KI+
Sbjct: 133 LHLEFGYKGVDTARVQVRDYKRELSLDEAVSSVIYQVNGVTYTREYFTSFGDDLGIIKIT 192
Query: 165 GSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
+ G L+ ++LD + V NN + M G+ N D KG+++ ++
Sbjct: 193 ADKPGQLNLRIALDRP-ERFQTVIKNNTLEMSGQL---------NNGTDGKGMRYLTKIK 242
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
+ + ++S K++ + +D ++ A + F K+ +E+ + +
Sbjct: 243 PLVKGGKTSVSG---KQIVISDADEIIVYFSAGTDF---------KNKNFETETQRLIDA 290
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT- 343
SYS H +YQKLF+R I L S D VP+ +R+ +FQ
Sbjct: 291 AVKKSYSVQKNLHTTNYQKLFNRTKIHLGGSKGD------------GVPTDQRLSAFQKN 338
Query: 344 -DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
++D L L FQFGRYL ISS+R G NLQG+W + W+ H+++N++MN+W
Sbjct: 339 PEKDNELAVLYFQFGRYLSISSTRVGLLPPNLQGLWANQIRTPWNGDYHLDVNVQMNHWP 398
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL 462
NLSE PL D + + G KTA+ Y A+GWV H T++W + + W
Sbjct: 399 VEVANLSELNLPLADLVKGMVKQGEKTAKAYYNANGWVAHVITNVWGYTEPGE-EASWGA 457
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTS 521
G W+C +LWEHY +T D+++L K YP+L+G A F + LI+ G+L T PS S
Sbjct: 458 SNAGSGWICNNLWEHYAFTHDKNYL-KDIYPVLKGSAEFYISALIKDPKTGWLVTAPSVS 516
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSLPRL 579
PE+ F P+GK A + T+D I RE+F+ +I+A EVL + D ++ LK LP
Sbjct: 517 PENSFYLPNGKTAAICMGPTIDNQITRELFTNVITACEVLGVDADFAKSLQNKLKELPP- 575
Query: 580 RPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
P + DG +MEW +++K+ + HRH+SHL+GL+P IT +K P+L A+ KTL+ RG
Sbjct: 576 -PGVVGSDGRLMEWLEEYKETDPKHRHISHLYGLYPAPLITPDKTPELAAASAKTLEVRG 634
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE----GGLYSNLFAAHP 695
++ PGWS +K WARLHD A ++++ +L+ P + + GG+Y NL +A P
Sbjct: 635 DDSPGWSKAYKLLFWARLHDGNRAGKLLR---DLLTPTLQTNMNYGGGGGVYPNLLSAGP 691
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKW-SSGCVKGLKARGGETVSICWKD 754
PFQID NFG A +AEML+QS ++ +LPA+P D+W SG VKGLKARG TV W++
Sbjct: 692 PFQIDGNFGGAAGIAEMLIQSHDGNIDILPAIP-DEWKGSGEVKGLKARGNFTVDFKWEN 750
Query: 755 GDLHEVGIYS 764
G + + I S
Sbjct: 751 GKVTDYKITS 760
>gi|333377780|ref|ZP_08469513.1| hypothetical protein HMPREF9456_01108 [Dysgonomonas mossii DSM
22836]
gi|332883800|gb|EGK04080.1| hypothetical protein HMPREF9456_01108 [Dysgonomonas mossii DSM
22836]
Length = 788
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 272/768 (35%), Positives = 433/768 (56%), Gaps = 56/768 (7%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
N + F+ P+ + ++IP+GNGR+G M WGGV E + LNE +LW+G D NP+A K
Sbjct: 25 NEWQYYFDKPSSIWEESIPLGNGRIGMMPWGGVERERVVLNEISLWSGNKQDADNPEAYK 84
Query: 71 ALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEFDDSHLKYAEE 122
L ++R L+ + EA K F G +Q+ ++ ++F A +
Sbjct: 85 YLGEIRRLLFEKKNKEAQELMYKTFTCKGKGSAGLEYGKFQIFANLYVDFLYPDKSEATQ 144
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y+R LD+N A + V +S +VE+ RE+F+S + + + K + S+S +LS +SL +
Sbjct: 145 -YKRVLDMNNALSTVSFSKNDVEYKREYFTSFSNDIGLVKYTASKSEALSLKISLQRDEN 203
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+Y +GN I + A ++ G+++ + +K+ + G +SA DK +
Sbjct: 204 FKTYASGNTLYIF----------GQLEAGENHSGMKYLGM--VKVINKGGKLSA-TDKVI 250
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
++ ++ L + +++++G + S L + ++Y L +H+ YQ
Sbjct: 251 DIKNANEVTLYVSLATNYNGT----------NHEKVASDLLNNAGVNYEKLKKKHIAKYQ 300
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLL 361
LF+RV + L ++ + ID +R+++F TD+ D +L L Q+GRYLL
Sbjct: 301 ALFNRVDLTLEKNKNSSLA-------ID-----KRLEAFATDKTDYNLAALYMQYGRYLL 348
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
ISS+R G NLQG+W ++ W++ H+NINL+MN W + NLSE +P +F+
Sbjct: 349 ISSTREGGLPPNLQGLWAPQINTPWNADYHLNINLQMNLWGAEMFNLSELHKPTIEFVKS 408
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L G KTA++ Y + GWV+H +++W +S W GAW+C HLWEHY YT
Sbjct: 409 LVEPGEKTAKIYYNSRGWVVHILSNVWGFTSPGE-HPSWGATNTAGAWMCQHLWEHYLYT 467
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
D+++L K YP ++ A F D LIE ++GYL T P+TSPE+ +I P G + + S
Sbjct: 468 QDKEYL-KSVYPTMKSAALFFEDMLIEDPNNGYLVTAPTTSPENAYITPSGDVVSICAGS 526
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
MD IIRE+F+ + +AA++LE + + ++ + RL PT I + G +MEW +D+++
Sbjct: 527 AMDNQIIRELFTNVENAAKILEVDNE-WIKDISAKKERLAPTSIGKYGQVMEWLEDYEES 585
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
E+HHRH+S L+GL PG+ +T EK P+L +AA+ TL +RG++ GWS+ WK WARL D
Sbjct: 586 EIHHRHVSQLYGLHPGNELTYEKTPELMEAAKVTLTRRGDQSTGWSMAWKINFWARLKDG 645
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
AY+++ +L+ P G Y NLF+AHPP QID NFG +A + EML+QS
Sbjct: 646 NKAYKLIG---DLLKPAENNW---GTYPNLFSAHPPMQIDGNFGGSAGIGEMLLQSHEGF 699
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
+ LLPA+P D W G V+G+K RGG +S WKD + + I + +N
Sbjct: 700 IELLPAIP-DGWKDGEVRGMKVRGGAEISFKWKDNKIQNIHITATTNN 746
>gi|408787527|ref|ZP_11199255.1| hypothetical protein C241_15883 [Rhizobium lupini HPC(L)]
gi|408486464|gb|EKJ94790.1| hypothetical protein C241_15883 [Rhizobium lupini HPC(L)]
Length = 739
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 289/786 (36%), Positives = 433/786 (55%), Gaps = 66/786 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ ++ A +T+A+PIGNGRLGAMV+GG E +++NE T + G P NPDA L
Sbjct: 5 RLWYDTAASAWTEALPIGNGRLGAMVFGGAWDERIQINESTFYNGGPYQPINPDAKDHLP 64
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR + G+Y EA + D+ YQ +GD+++ F YRRELDL
Sbjct: 65 AVRQRILDGKYMEAERLAYDHVMARPDLQTSYQPIGDLKIAFQHDMTTI---NYRRELDL 121
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
T A +Y V + R+ F+S VIV K++ + GSLS ++ L S + + +
Sbjct: 122 ETGIAVTRYDCDGVHYHRQIFASAIADVIVCKVTVDKPGSLSLSLLLSSPQNGEAEDRRD 181
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD---DRGTISALEDKKLKVEGS 247
+ + GR N P ++F+ ++ + DRG + ++V +
Sbjct: 182 HVLGYLGR--------NRKQNGIPGALRFAFRTQVVATGGFVDRGP------ESIRVREA 227
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D ++ + A +SF D DP + L ++ DL H++D+++LF R
Sbjct: 228 DSVIIFIDAGTSFR----RYDDVSGDPEKTTEMRLARASTRAFEDLLEEHVEDHRRLFGR 283
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
++I + ++ VP+ +RV+ DP L L Q+GRYL I+SSRP
Sbjct: 284 MAIDIG-------------PDLSHVPTDKRVRDNVAKPDPQLAALYTQYGRYLAIASSRP 330
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GTQ +NLQGIWNE++ P W+S +NIN +MNYW + P NL+E PL + + L+ G
Sbjct: 331 GTQPSNLQGIWNEEILPPWNSKFTLNINTQMNYWLADPANLAETFIPLIEMVEDLAETGQ 390
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ A+ +Y A GWV+HH TDIW S G W LWP GGAWLC L++HY+++ D L
Sbjct: 391 EMARAHYGARGWVVHHNTDIWRASGPIDGP-KWGLWPTGGAWLCAQLYDHYSFSGDEAIL 449
Query: 488 EKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+R YPL++G A F+LD L++ Y T PS SPE+ P G C MD I
Sbjct: 450 -RRIYPLMKGSAEFILDILVDLPGTSYRVTCPSLSPENRH--PGGTSLCA--GPAMDNQI 504
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF--KDPEVHH 604
IR+VF+A+ISA+E L +E AL +++ + RL K+ + G + EW +D+ + PE H
Sbjct: 505 IRDVFAAVISASEALAIDE-ALRAELVAARARLPEDKVGKVGQLQEWIEDWDVEAPEQGH 563
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P H I + + P L AA+ L++RG++ GW I W+ LWARL + E A
Sbjct: 564 RHVSHLYGLYPSHQIDLYETPALANAAKVALERRGDDATGWGIGWRINLWARLGEAERAA 623
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+V++L + PE+ Y NLF AHPPFQID NFG A + EMLVQS ++ LL
Sbjct: 624 EVVQKLLS---PEYT-------YPNLFDAHPPFQIDGNFGGAAGIIEMLVQSKPGEVRLL 673
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
PALP WS G V+G++ RGG T+ + W+DG + +V + + D D+ T+ Y S
Sbjct: 674 PALP-KSWSEGYVRGVRLRGGVTLDMTWQDGQVQDVTLAA-----DRDTSMTVIYNDNSP 727
Query: 785 KVNLSA 790
+V+++
Sbjct: 728 RVSVTG 733
>gi|290769720|gb|ADD61497.1| putative multimodular carbohydrate-active enzyme [uncultured
organism]
Length = 1083
Score = 478 bits (1231), Expect = e-132, Method: Compositional matrix adjust.
Identities = 277/767 (36%), Positives = 422/767 (55%), Gaps = 45/767 (5%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M+N + + +K+ ++ PA+ + +A+P+GN RLGAMV+GG E ++LNE+T W G P
Sbjct: 282 MINKQEATR---MKLWYSAPARRWVEALPVGNSRLGAMVYGGTDKEEIQLNEETFWAGGP 338
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
NP + L+ R LV + + +EA + F + L L + K
Sbjct: 339 YRNDNPKGKEVLAKTRELVFANRLSEAQKLIDENFFTGQHGMRFLTMGSLLINQPEHKNV 398
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
E Y RELD+ A A +Y V V +TR FSS D VIV ++ + +L+F++S +S
Sbjct: 399 E-NYYRELDIENAVAVTRYMVDGVTYTRTVFSSFADNVIVVRMEADKPKALNFDLSYNSP 457
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
L + GN ++ +C G + +GI + E ++ S +K
Sbjct: 458 LKHVVMAKGNELVV---KCEGM----------EQEGIPAALNAECRVLVRHNGKSGKSNK 504
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ V+ + A L + A+++F +N D + + + S L+ + Y H+
Sbjct: 505 SVVVDQATVATLYISAATNF----VNYHDVGGNASKLASSILKRAVKVPYEQALANHIAA 560
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y++ F RV+ + T+T T+ + +RV +F +D +L+ L+FQ+GRYL
Sbjct: 561 YKEQFDRVTFSIPS------TET------STLETDKRVVAFGEGKDLNLIALMFQYGRYL 608
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSS+PG Q ANLQG+W + WDS +NIN EMNYW + NLSE +PLFD ++
Sbjct: 609 LISSSQPGGQPANLQGLWCNSVYAPWDSKYTININTEMNYWPAEVTNLSENHQPLFDMVS 668
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
LS+NG KTA+ Y A GWV HH TD+W ++ + +WP GGAWL HLW+HY +
Sbjct: 669 DLSVNGKKTAETVYGARGWVAHHNTDLW-RACGPIDAAYFGMWPNGGAWLTQHLWQHYLF 727
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
T D++FL +R YP+++G A F L L++ +G+L T PS SPEH + C
Sbjct: 728 TGDKEFL-RRYYPVMKGAADFYLSHLVKHPQNGWLVTAPSVSPEHGYAGSSITAGC---- 782
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
TMD I + + AA +L +++ A + + + +L P +I I EW D +
Sbjct: 783 -TMDNQIAFDALYNTMLAARILGESQ-AYQDSLAVAFKQLPPMQIGRHNQIQEWLIDADN 840
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
P HRH+SHL+GL+P + I+ +P+L +AA+ TL +RG+ GWSI WK WAR+ D
Sbjct: 841 PRDDHRHISHLYGLYPSNQISPRLHPELFQAAKNTLLQRGDAATGWSIGWKINFWARMLD 900
Query: 660 QEHAYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
HAY+++K + ++ D + + EG Y NLF AHPPFQID NFG+TA VAEML+QS
Sbjct: 901 GNHAYKIIKNMLRILPGDDKMREFPEGRTYPNLFDAHPPFQIDGNFGYTAGVAEMLLQSH 960
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ LLPALP ++W+ G + L ARGG V + W+ L + ++S
Sbjct: 961 DGAVQLLPALP-EEWNEGSISALVARGGFVVDMQWEGAQLLKAKVHS 1006
>gi|393719778|ref|ZP_10339705.1| alpha/beta hydrolase domain-containing protein [Sphingomonas
echinoides ATCC 14820]
Length = 811
Score = 477 bits (1227), Expect = e-131, Method: Compositional matrix adjust.
Identities = 292/793 (36%), Positives = 434/793 (54%), Gaps = 86/793 (10%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
++ A+++ ++ L++ + PA +T+A+P+GNGRLGAMV+G V E L+LNEDTLW G P
Sbjct: 28 LLAAKASDASSDLRLWYRQPAGAWTEALPVGNGRLGAMVFGRVAQERLQLNEDTLWAGAP 87
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHL 117
D NP+A AL +VR+L+ +G+Y +AT AS K+ G P Y LGD+ L F +H+
Sbjct: 88 YDPDNPEALAALPEVRALLAAGRYKDATDLASAKMMGKPPAQMPYGTLGDVLLTFASAHV 147
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
YRRELDL + A ++ + + RE +S PDQVIV ++ +E+G+L F+++
Sbjct: 148 P---TVYRRELDLASGIATTEFETADGRYRREVLASAPDQVIVMRLE-AEAGTLDFDLAY 203
Query: 178 DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD------------------------ 213
+ ++ EG P P + +D
Sbjct: 204 RA----PRAISTPRAQFSEGATPQTTRPTEWMQREDAERPGPDVTIAADGAHALLVTGSN 259
Query: 214 ------PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP 267
P G++++ L ++ D G I A K + V G+ +L+ A++S+ +
Sbjct: 260 EAALGVPAGLRYA--LRVQAVGD-GVIIA-NQKGITVSGARSVTVLITAATSYR----SY 311
Query: 268 SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEE 327
SD+ DP +A ++ Y L H+ D+ LF V I L SP
Sbjct: 312 SDTGGDPVGAVRAAGRAAERKGYPALRRSHVADHAALFGGVKIDLGTSPAA--------- 362
Query: 328 NIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWD 387
+P+ R+ + T DP+L L Q+GRYLLI+SSRPG+Q + LQGIWNE +P W
Sbjct: 363 ---ALPTDARIAAGATAVDPALAALYLQYGRYLLIASSRPGSQPSTLQGIWNEGTTPPWG 419
Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
S +NIN EMNYW + P L C EPL + LS+ G++TA+ Y A GWV HH TD+
Sbjct: 420 SKYTININTEMNYWAADPGGLGLCVEPLVRMVEDLSVTGARTARTMYGARGWVAHHNTDL 479
Query: 448 WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
W +++A +W LWP GGAWLC L+ H+++ D L R YPLL+G A F +D LI
Sbjct: 480 W-RATAPIDGPLWGLWPCGGAWLCNTLFTHWDFARDPALL-ARLYPLLKGAAHFFVDTLI 537
Query: 508 EGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED 566
E G L T+PS SPE+E P G CV MD I+R++F+ + A L ++ +
Sbjct: 538 EDPKGRGLVTSPSLSPENEH--PFGSSLCV--GPAMDRQIVRDLFTNTVVAGRTLGRDGE 593
Query: 567 --ALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK--DPEVHHRHLSHLFGLFPGHTITIE 622
A++E+V R+ P +I G + EW +D+ P+ +HRH+SHL+ ++P I +
Sbjct: 594 WLAMLEQVGA---RIAPDRIGAGGQLQEWLEDWDAHAPDPYHRHVSHLYAVYPSAQINVR 650
Query: 623 KNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHF 682
P L +AA+ +L++RG+ GW+ W+ LWAR+ + +HAY ++K L+ P+
Sbjct: 651 DTPALIEAAKVSLRQRGDLSTGWATAWRMCLWARMGEGDHAYAVLK---GLLGPQRT--- 704
Query: 683 EGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKA 742
Y N+F AHPPFQID NFG A + EMLVQS +L LLPALP W G + G++A
Sbjct: 705 ----YPNMFDAHPPFQIDGNFGGAAGILEMLVQSWGGELLLLPALP-TAWPDGSIAGVRA 759
Query: 743 RGGETVSICWKDG 755
RGG V + W+ G
Sbjct: 760 RGGVRVDLTWRQG 772
>gi|375256587|ref|YP_005015754.1| putative lipoprotein [Tannerella forsythia ATCC 43037]
gi|363407344|gb|AEW21030.1| putative lipoprotein [Tannerella forsythia ATCC 43037]
Length = 850
Score = 476 bits (1226), Expect = e-131, Method: Compositional matrix adjust.
Identities = 286/797 (35%), Positives = 426/797 (53%), Gaps = 84/797 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
F+ PA + ++ P+GNGR+G M GG+ E + LNE ++W+G NP A K+L +R
Sbjct: 32 FDEPATLWEESFPLGNGRIGLMPDGGIEKENIVLNEISMWSGSKQQTDNPAAQKSLGRIR 91
Query: 77 SLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEF-----DDSHLK 118
L+ +G+ EA F P YQLLG++ L+F DD+ +
Sbjct: 92 ELLFAGRNDEAQELMYDTFVCYGDGSGRGSGANKPYGSYQLLGNLMLDFTYDAADDAQVS 151
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
YRRELDL A + + G E++RE F+S D V V ++ + L + ++
Sbjct: 152 ----DYRRELDLEQALTTLSFRKGKTEYSREVFTSFADDVAVIRLKVNNGRKLQCQIGMN 207
Query: 179 SLLDNHSYVNGNNQIIMEGRC-----------------------PGKRIPPKANAN---- 211
+ ++ N+++ M GR IP
Sbjct: 208 RP-ERYAVRAENSELEMRGRLYEGDAYKTKEQLEREEAMRNRTNNSDSIPAAEQKTMPGA 266
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSK 271
+D +G+++++ +++ + + G + A D L VE + +LL+ ++ + G + D++
Sbjct: 267 EDGQGVRYASRVQVVLPNG-GEVKAFNDTTLIVEEASEIILLVGMATDYFGKAV---DAQ 322
Query: 272 KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT 331
D S L + + SY L H+ YQ+L+HRV++ R+ + +
Sbjct: 323 ID------SLLTAAASKSYETLKEEHIRAYQELYHRVAVHFGRNAQK-----------EA 365
Query: 332 VPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAP 390
+P +R+++FQ D+ DPSL+ L +QFGRYLLISS+RPG NLQG+W + W+
Sbjct: 366 LPMNKRLEAFQNDKNDPSLLALYYQFGRYLLISSTRPGLLPPNLQGLWCNTIHTPWNGDY 425
Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAK 450
H+NINL+MN W + NLSE PL ++ +G +TA+ Y A GWV H ++W +
Sbjct: 426 HLNINLQMNLWPAETGNLSELHLPLIEWTKQQVESGRQTAKAFYNARGWVTHILGNVW-E 484
Query: 451 SSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG- 509
+A W AWLC HL+ HY +T+D +L + YP++ A F +D L+E
Sbjct: 485 FTAPGEHPSWGATNTSAAWLCEHLYTHYLFTLDTAYL-RDVYPVMRESALFFVDMLVEDP 543
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
YL T P+TSPE+ ++ P+GK V STMD I+RE+FS I AA +L+ +E+ LV
Sbjct: 544 RSHYLVTAPTTSPENAYVMPNGKKVSVCAGSTMDNQILRELFSNTIQAARLLKTDEE-LV 602
Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCK 629
+ + RL PT I DG IMEW + +++ E HHRH+SHL+GL+P + I+ E+ PDL
Sbjct: 603 QTLAAYQARLMPTTIGPDGRIMEWLEPYEEAEPHHRHVSHLYGLYPANEISPERTPDLAA 662
Query: 630 AAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE----GG 685
AA KTL+ RG+E GWS+ WK WARLHD EHAY++ L +L+ P K + GG
Sbjct: 663 AARKTLEARGDESTGWSMGWKVNFWARLHDGEHAYKL---LADLLRPSLRKDMDMKHGGG 719
Query: 686 LYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG 745
Y NLF AHPPFQID NFG A +AEMLVQS + LPALP W +G KGL +G
Sbjct: 720 TYPNLFCAHPPFQIDGNFGGCAGIAEMLVQSHNGYIEFLPALP-TAWKNGEFKGLCVQGA 778
Query: 746 ETVSICWKDGDLHEVGI 762
V W DG+L G+
Sbjct: 779 GEVHAQWSDGELLHAGL 795
>gi|160883519|ref|ZP_02064522.1| hypothetical protein BACOVA_01491 [Bacteroides ovatus ATCC 8483]
gi|156110932|gb|EDO12677.1| hypothetical protein BACOVA_01491 [Bacteroides ovatus ATCC 8483]
Length = 793
Score = 476 bits (1226), Expect = e-131, Method: Compositional matrix adjust.
Identities = 285/767 (37%), Positives = 416/767 (54%), Gaps = 62/767 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK + +A+P+GN RLGAMV+G E L+LNE+T+W G P NP A +AL
Sbjct: 10 LKLWYDRPAKVWEEALPLGNSRLGAMVYGIPQREELQLNEETIWGGSPYRNDNPKAVQAL 69
Query: 73 SDVRSLVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
+ R L+ +G+ EA + F G P +Q G I L F H Y + + RE
Sbjct: 70 PEARKLIFAGKNTEADKLINETFFTRAHGMP---FQTAGSIILNFP-GHENY--QNFYRE 123
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
LDL A + +Y+V VE+ RE ++S D VIV +I+ S +++F + ++ + V
Sbjct: 124 LDLGRAVSTTRYTVDGVEYAREAYASFADDVIVMRITASRKRAINFVLEYSRPVNFNVSV 183
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
G+ I + IP + N + ++ + G L ++ + V+ +
Sbjct: 184 KGSTLIFHSKGTDHEGIPGEINYQ-----------IHTRVVTNDGEAEVLNNR-IVVKNA 231
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
A L + S+F D ++ + +I+N +Y +H++ + + F+R
Sbjct: 232 TVATLYISIGSNFIDYKTLGGDEYVAKVTQKLDC--AIKN-NYKAALKKHIEIFSQQFNR 288
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
+ L + +T +R+ FQ D+DPSLV LL QFGRYLLI SS+P
Sbjct: 289 FKLNLGNRSDGVKKNTL-----------QRIADFQIDQDPSLVTLLTQFGRYLLICSSQP 337
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G Q ANLQGIW ++P+WDS +NIN EMNYW + NLSE P + LS NG
Sbjct: 338 GGQPANLQGIWCHQMNPSWDSKYTLNINAEMNYWPAEVTNLSETHLPFLQMVKDLSENGR 397
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDR 484
+TA + Y A GW +HH TDIW + G + +A +WP GGAW+C HLWEHY YT D+
Sbjct: 398 RTAAMMYNAEGWTVHHNTDIWRVT----GPIDFARSGMWPTGGAWVCQHLWEHYLYTGDK 453
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL YP ++G A + L +++ H Y + PS SPE V TM
Sbjct: 454 KFLAD-VYPAMKGAADYFLSSMVK-HPKYDWMVVCPSVSPEQ---------GGVVAGCTM 502
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D +I E+ + A E+L ++ +K+ + L +L P I + + EW +D DP+
Sbjct: 503 DNQLIIELLTKTAKANEILGESP-VYRQKLYELLEKLPPMHIGKHTQLQEWLEDIDDPKN 561
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH+SHL+GL+PG+ I+ + P+L +AA +L RG+ GWSI WK LWARL D H
Sbjct: 562 KHRHVSHLYGLYPGNQISPYRTPELFEAARNSLIYRGDMATGWSIGWKVNLWARLLDGNH 621
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY++VK + L + G Y N+F AHPPFQID NFG TA VAEML+QS ++
Sbjct: 622 AYKIVKNMLTLAGGSSQ---SGRTYPNMFTAHPPFQIDGNFGLTAGVAEMLLQSHDGAVH 678
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
LLPALP + W+ G V G+KARGG VS+ W G++ EV + S+ +N
Sbjct: 679 LLPALP-EVWNKGSVSGIKARGGFEVSMQWDKGEVTEVTVLSSLGDN 724
>gi|300726087|ref|ZP_07059544.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
gi|299776557|gb|EFI73110.1| alpha-L-fucosidase 2 [Prevotella bryantii B14]
Length = 824
Score = 476 bits (1225), Expect = e-131, Method: Compositional matrix adjust.
Identities = 285/758 (37%), Positives = 418/758 (55%), Gaps = 51/758 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ +N PA + +A+PIGNGR+ M++GGV SE ++LNE+T+W G P L
Sbjct: 22 LKLWYNHPASIWQEALPIGNGRIAGMIYGGVQSEEIQLNEETVWGGGPHSNVRAIPVDTL 81
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ GQ A A + F G Y+ +G ++++F+ + YRRELDL
Sbjct: 82 RQVRQLIFDGQEKAAHAMINRNFMTGQHGMPYESVGSLKIDFN--YRAGDTRNYRRELDL 139
Query: 131 NTATARVKYSVGNVEFTREHFS--SNPDQ---VIVTKISGSESGSLSFNVSLDSLLDNHS 185
N A + + VG V + RE F+ S+P+ V+V +++ S+ GS+SF + S L +
Sbjct: 140 NRAVSTTTFQVGKVTYKREVFTTFSSPEHHANVMVIRLTASKRGSISFKLHYTSPLRHAI 199
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKKLK 243
+N + M G D +GI+ A ++ + G I + ++
Sbjct: 200 TLNQQGDLCMLGYGA------------DHEGIKGVIQASTVTRVLNIGGKIKR-NGESIE 246
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V ++ + L ++F + ++ D +++ LQ+ +Y L +H YQ
Sbjct: 247 VTNANQVEIRLAMGTNFK----SYNEVSLDAKAQTFGELQTASPYTYEALLQQHEQVYQN 302
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F RVS+ L + N ++P+ ER++ FQ DP+L L+FQ+GRYLLIS
Sbjct: 303 QFGRVSLDLGEN-----------TNETSLPTDERLRRFQQSNDPALATLVFQYGRYLLIS 351
Query: 364 SSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SS+ ++ ANLQGIWN+D++ WD +NIN EMNYW + NLS+ + PL+ + L
Sbjct: 352 SSQIDSRTPANLQGIWNKDMNAPWDGKYTININTEMNYWPAQTTNLSDNEWPLYRLVQNL 411
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S G + A Y A G++ HH TDIWA + G W +WP G WL THLW+ Y +T
Sbjct: 412 SKTGVEAASKMYGAKGYMAHHNTDIWATTGMVDG-ATWGIWPNGAGWLSTHLWQRYLFTG 470
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+ FL + YP L+G A F L ++ GY+ T PS SPEH P GK V+ T
Sbjct: 471 DQQFL-RTFYPQLKGAADFYLTAMVRHPKYGYMVTVPSISPEH---GPHGK-PSVTAGCT 525
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
MD I +V + A EVL ++E A + + + + +L P ++ + EW +D DP+
Sbjct: 526 MDNQIAFDVLQDALQATEVLGESE-AYADSLRQHIRQLAPMQVGRYCQLQEWLEDADDPK 584
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SH +GLFP + I+ + P+L +A TL +RG+E GWSI WK LWARL D
Sbjct: 585 DGHRHVSHAYGLFPSNQISATRTPELFEAIRNTLVQRGDEATGWSIGWKINLWARLLDGN 644
Query: 662 HAYRMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
HAY++V+ L +++ D + + +G +Y NLF AHPPFQID NFGFTA VAEML+QS
Sbjct: 645 HAYQLVRNLLSVLPSDADAANYPKGRMYPNLFDAHPPFQIDGNFGFTAGVAEMLLQSQDG 704
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
+ LLPALP D W G V GLKARG V++ WK G L
Sbjct: 705 MVQLLPALP-DVWQQGQVSGLKARGNFEVAMNWKQGKL 741
>gi|357043574|ref|ZP_09105265.1| hypothetical protein HMPREF9138_01737 [Prevotella histicola F0411]
gi|355368238|gb|EHG15659.1| hypothetical protein HMPREF9138_01737 [Prevotella histicola F0411]
Length = 808
Score = 476 bits (1225), Expect = e-131, Method: Compositional matrix adjust.
Identities = 303/812 (37%), Positives = 427/812 (52%), Gaps = 81/812 (9%)
Query: 7 TSTTNP----LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
TST N + + ++ PA+ F +++P+GNG+LGA+++GG ++T+ LN+ T WTG P
Sbjct: 14 TSTINAQQQSMLLWYDHPAQFFEESLPMGNGKLGALIYGGTKNDTIYLNDITYWTGKP-- 71
Query: 63 YTNPDAPKALS----DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLK 118
NP+ S +R + + Y A + + G + YQ LG L +
Sbjct: 72 -VNPNEGIGKSVWIPRIREALFAENYRLADSLQHYVQGEQSASYQPLGTFNL---INLTP 127
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
A + YRREL++++A A V Y V + +E+F S D +I +I+ ++ G ++F +SL
Sbjct: 128 GAIQNYRRELNIDSAMAHVSYQQDGVTYKKEYFVSQSDSLIAIRITANKPGKVNFKISLT 187
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ + H + Q+ M G GK + A ++++ G S
Sbjct: 188 AQVP-HKTKASDEQLTMIGHATGK------------ENETIHACTIVRLTHKEGQDSH-T 233
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D L VE +D A L +V ++SF+G +P D D + ++ A +N +Y++ RH+
Sbjct: 234 DSTLTVENADEATLYIVNATSFNGFNKHPVDDGADYMNNAIDAAWHTKNFTYNEFKQRHI 293
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-------SLVE 351
+ YQ+L+ R+++QL D + +P+ E +K + T P L
Sbjct: 294 NAYQRLYQRLNLQLGHDKYD-----------NNIPTDELLKKYSTPHTPLSVAAQRYLET 342
Query: 352 LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
L FQFGRYLL+S SR ANLQG+W L W +NINLE NYW + N+SE
Sbjct: 343 LYFQFGRYLLLSCSRTPGVPANLQGLWTPYLFSPWRGNYTMNINLEENYWPANSTNISET 402
Query: 412 QEPLFDFLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVV--WALWPMGG 467
+PLF FL L+ NG TA Y + GW H +DIW K++ GK WA W +GG
Sbjct: 403 IQPLFSFLKGLAANGKYTAHNFYGVNEGWCASHNSDIWCKTAPVGEGKESPEWANWNLGG 462
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHE 525
AWL LW++Y YT D L+ YPL+EG + F WLIE H G L T PST+PE+E
Sbjct: 463 AWLVNTLWDYYLYTQDFQMLKSTIYPLMEGASRFCKQWLIENPKHPGELITAPSTTPENE 522
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
++ G Y T D+AIIRE+F A +L D + LK RL P I
Sbjct: 523 YLTDKGYHGTTCYGGTADLAIIRELFENTQQARRILNIKPDKQLNNTLK---RLHPYTIG 579
Query: 586 EDGSIMEWAQDFKDPEVHHRHLSHLFGLFPG-----HTITIEKNPDLCKAAEKTLQKRGE 640
+G + EW D+KD + HRH SHL GL+PG H I K+ L KAA++TL ++G+
Sbjct: 580 AEGDLNEWYYDWKDYDPQHRHQSHLIGLYPGMHLQRHAIQT-KDSSLLKAAKQTLIQKGD 638
Query: 641 EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH-----FEGGLYSNLFAAHP 695
E GWS W+ LWARL + +HAY + RL + V PE E H GG Y NLF AHP
Sbjct: 639 ESTGWSTGWRINLWARLGEGKHAYEIYHRLLSYVSPE-EYHGPDAVHRGGTYPNLFDAHP 697
Query: 696 PFQIDANFGFTAAVAEMLVQSTLN--------DLYLLPALPWDKWSSGCVKGLKARGGET 747
PFQID NFG TA V EMLVQSTL ++LLPALP W G +KGLK RGG T
Sbjct: 698 PFQIDGNFGGTAGVCEMLVQSTLEIVNNKPVYYIHLLPALP-HVWKDGEIKGLKTRGGLT 756
Query: 748 VSICWKDGDLHEVGIYSNYSNNDHDSFKTLHY 779
+ + W D H+V Y+ + D D LHY
Sbjct: 757 IDMQWYD---HQV--YALHIKADADVTINLHY 783
>gi|298481665|ref|ZP_06999856.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
gi|298272206|gb|EFI13776.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
Length = 812
Score = 475 bits (1223), Expect = e-131, Method: Compositional matrix adjust.
Identities = 282/763 (36%), Positives = 425/763 (55%), Gaps = 59/763 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAM++GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMIYGGIEREELQLNEETFWAGSPYNNNNPNAIHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKKAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + K +T +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLPAAGKASQLET-----------PKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 349
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 350 QSANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 409
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 410 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 465
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 466 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 514
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 515 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDIDNPKDEH 573
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA+
Sbjct: 574 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAF 633
Query: 665 RMVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+++K + L+ +H +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++
Sbjct: 634 QIIKNMIQLLPSDHLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVH 693
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL ARG TV I WK+ L++ I SN
Sbjct: 694 LLPALP-DAWEEGSVKGLVARGNFTVDIDWKNNMLNKAIIRSN 735
>gi|393773725|ref|ZP_10362119.1| twin-arginine translocation pathway signal protein [Novosphingobium
sp. Rr 2-17]
gi|392720900|gb|EIZ78371.1| twin-arginine translocation pathway signal protein [Novosphingobium
sp. Rr 2-17]
Length = 852
Score = 475 bits (1223), Expect = e-131, Method: Compositional matrix adjust.
Identities = 292/760 (38%), Positives = 404/760 (53%), Gaps = 69/760 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
+I N PA + P+GNGRLGAM+ G V + + LN DTLWTG P + + D L+
Sbjct: 56 RIADNSPATEWLLGHPVGNGRLGAMMGGSVRRDVISLNHDTLWTGQPSPHPDHDGRATLA 115
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VR V +G YA A S L G + + + D+ LE D + A YRRELDL+ A
Sbjct: 116 AVRKAVFAGDYAAADLLSRPLQGTFSQSFAPMADMTLELDHTQ---AVTAYRRELDLDRA 172
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V Y G+V F RE F+S PD VIV ++S S + ++S + L + L + GN
Sbjct: 173 IASVAYHCGDVAFRRELFASYPDNVIVLRLSASRAAAISGRIGLATSLLGSTRAAGNTLR 232
Query: 194 IMEGRCPGKRIP-------PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+M G+ P + P P A + +G+ F+ +L +++ G + A D L V G
Sbjct: 233 LM-GKAPTRCEPNYREVPDPVAYSEQPGQGMAFATVLGVEVQG--GEVVASGDA-LSVRG 288
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D V+ + A++ F + P + ++ + + L SY L RHL D+Q L+
Sbjct: 289 ADVVVIRIAAATGFRRFDLLPDIAAEEVAAVAERNLAIAHQNSYGSLLKRHLADHQALYR 348
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R SI+L + D VT P AER LF GRYLLI+SSR
Sbjct: 349 RASIELQGAGDDQVT-----------PKAER---------------LFNLGRYLLIASSR 382
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
P T ANLQG+WN + P W + NINL+MNYW + CNL+EC PL D + L++NG
Sbjct: 383 PDTMPANLQGLWNAQVRPPWSANYTTNINLQMNYWSAETCNLAECHLPLMDHIERLALNG 442
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+K A+ Y GW +HH +D+WA ++ A G WA WPM G WL H+WEHY ++ D
Sbjct: 443 AKVARDLYGMPGWSVHHNSDVWAMANPVGAGDGDPNWANWPMAGPWLAQHVWEHYRFSGD 502
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTM 542
FL KR + L+ CA F WL+ + L T PS SPE+ F+ P GK + +S TM
Sbjct: 503 IAFLAKRGFALMRDCAEFCAAWLVRDPSSHRLTTAPSISPENLFLGPHGKPSAISSGCTM 562
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D+A+ RE+F I+AA ++ + L + L L P +I G + EW+ DF + +
Sbjct: 563 DLALTRELFENCIAAANLV-GDRSGLAVHLKGLLQELEPYRIGRYGQLQEWSSDFDEQDA 621
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHD 659
HRH+SHL+ L+PG + + PDL +AA +L +R G GWS W TA WARL D
Sbjct: 622 GHRHISHLYPLYPGGAVDPTRTPDLARAARASLVRREAHGGASTGWSRAWATAAWARLGD 681
Query: 660 QEHAYRMVKRLF--NLVDPEHEKHFEGGLYSNLFAAHPP-----FQIDANFGFTAAVAEM 712
A R + N+ D NL HP FQID NFG TAA+AEM
Sbjct: 682 GAEAGRSLSAFITHNVAD-------------NLLDTHPAQPRPVFQIDGNFGITAAMAEM 728
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
L+QS N + LLPALP +W+SG +GL+ARGG V+I W
Sbjct: 729 LLQSHGNAIALLPALP-PQWTSGRARGLRARGGHEVAIEW 767
>gi|423290387|ref|ZP_17269236.1| hypothetical protein HMPREF1069_04279 [Bacteroides ovatus
CL02T12C04]
gi|392665774|gb|EIY59297.1| hypothetical protein HMPREF1069_04279 [Bacteroides ovatus
CL02T12C04]
Length = 811
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 282/763 (36%), Positives = 423/763 (55%), Gaps = 60/763 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + REL+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NGSGFYRELNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKRAMQIPYEKALKNHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLPAG------------KASQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDIDNPKDEH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA+
Sbjct: 573 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAF 632
Query: 665 RMVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+++K + L+ +H +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++
Sbjct: 633 QIIKNMIQLLPNDHLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVH 692
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL ARG TV + WK+ L++ I SN
Sbjct: 693 LLPALP-DAWEEGSVKGLVARGNFTVDMDWKNNVLNKAIIRSN 734
>gi|329928902|ref|ZP_08282716.1| hypothetical protein HMPREF9412_5464 [Paenibacillus sp. HGF5]
gi|328937273|gb|EGG33698.1| hypothetical protein HMPREF9412_5464 [Paenibacillus sp. HGF5]
Length = 874
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 285/803 (35%), Positives = 422/803 (52%), Gaps = 82/803 (10%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S L++ ++ PA + +A+PIGNGRLG MV+G E ++LNED+LW G PG NP+
Sbjct: 52 SANRRLRLWYDSPAAEWNEALPIGNGRLGGMVFGKPSLERVQLNEDSLWYGGPGRGGNPN 111
Query: 68 APKALSDVRSLVDSGQYAEAT-AASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAEETY 124
A + LS++R ++ G+ AEA A + + P YQ LGD+ L+F D + E Y
Sbjct: 112 ASRYLSEIRQMLFDGRQAEAEHLARMAMTSSPKYEQPYQPLGDLLLKFLDG--EETVEHY 169
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-DSLLDN 183
RELDL + V YS + F R++F++ PD V+V ++S G+L+F +L D
Sbjct: 170 ERELDLERSMVTVSYSSRGIRFRRQYFATAPDGVLVIRLSADRPGALTFAANLMRRPFDG 229
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ ++ ++MEG C GI F + ++ + G + + D L
Sbjct: 230 GTASLRHDTLLMEGEC-------------GADGISFG--MALRAAAVGGIVQTIGDF-LS 273
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VEG+D LLL A +SF + P + L +SY L RH +Y++
Sbjct: 274 VEGADSVTLLLSAQTSF---------RCRQPVQVCLEQLDRAAGMSYEQLVNRHQAEYRE 324
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENI------DTVPSAERVK----------SFQTDE-- 345
F R S+ L C + + + +++RV+ S TD
Sbjct: 325 KFERFSLTLGTGKNGAGRTECVDSGTSFSNGTEVIRASDRVEYPNGIEDDQPSLPTDRRL 384
Query: 346 -----------------DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
DP L+ L Q+GRYLLIS SRP + ANLQGIWN+ +P W+S
Sbjct: 385 NLLKDRVKTEGASAENSDPELIALYVQYGRYLLISCSRPESLAANLQGIWNDSFTPPWES 444
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIW 448
+N+N++MNYW + L+EC EPLFD + + NG TA+ Y G+ HH T++W
Sbjct: 445 KYTINVNIQMNYWPAELLGLAECHEPLFDLIDRMLPNGRDTAREMYGCRGFAAHHNTNLW 504
Query: 449 AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE 508
++ + + +WPMG AWLC HLWEHY + D DFL +RAYP+++ A FLLD++
Sbjct: 505 GETRPEGILMTCTVWPMGAAWLCLHLWEHYRFGGDADFLRERAYPVMKEAAEFLLDYMTV 564
Query: 509 GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL 568
+G T PS SPE+ F+ +G + + MD I +F A + A ++ +E A
Sbjct: 565 DEEGRRMTGPSVSPENRFVLSNGAVGSLCMGPAMDGQIATALFRACLEAGHLV-GDEPAF 623
Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLC 628
+ ++ +L + +I G IMEW D+++ + HRH+S LF L+PG I + P+L
Sbjct: 624 LGELQTALEEIPAPQIGRHGGIMEWLNDYEEADPGHRHISQLFALYPGEQIDPARTPELA 683
Query: 629 KAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGG 685
+AA KTL++R G GWS W +ARL A+ + L NL+
Sbjct: 684 EAACKTLERRLAHGGGHTGWSRAWIINYYARLQRGAEAH---EHLVNLL--------ASS 732
Query: 686 LYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG 745
Y NL HPPFQID NFG A VAEML+QS + +L LLPALP +W+SG VKGL+ARGG
Sbjct: 733 TYPNLLDCHPPFQIDGNFGGIAGVAEMLLQSHMGELRLLPALP-PQWNSGEVKGLRARGG 791
Query: 746 ETVSICWKDGDLHEVGIYSNYSN 768
V + W++G+L EV I ++ +
Sbjct: 792 YVVDMRWEEGELTEVKIRADRAG 814
>gi|315499511|ref|YP_004088314.1| alpha-l-fucosidase [Asticcacaulis excentricus CB 48]
gi|315417523|gb|ADU14163.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
Length = 789
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 291/776 (37%), Positives = 424/776 (54%), Gaps = 65/776 (8%)
Query: 2 MNAESTSTTNP---LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
+ A+S P L + + PA + A+P+GNGRLG MV+GGV E ++LNEDT + G
Sbjct: 24 VKAQSAPPEQPSPDLSLWYERPADEWVKALPVGNGRLGGMVFGGVAFERIQLNEDTFFAG 83
Query: 59 VPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPAD--VYQLLGDIELEFD-- 113
P TNP + L V+SL+ G+YAEA A+ L PA YQ +GD+ L F
Sbjct: 84 SPYTPTNPRSRDGLPQVQSLIFEGKYAEAERLANETLISQPAKQMAYQPVGDLILLFPGL 143
Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
D+ KY R LDL+ A +++ G+ RE F S DQV+V ++S + +++
Sbjct: 144 DNTSKYV-----RRLDLSEGVAVTEFNAGSNRHRREVFVSAVDQVMVVRLSSEKGKAITV 198
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEI--KISDDR 231
++SL + + +I++G P + +GI+ E+ K+
Sbjct: 199 DLSLSTPQKAEIDTIDGDTLIIKGVSPTQ------------QGIEGKLPFELRAKVIAPT 246
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
GT+++ E + + G+ AV+L+ A++ + + D DP+ + + Y+
Sbjct: 247 GTLTSREGG-VYISGAQDAVVLISAATGY----VRYDDISGDPSVLNAGRIAIAAAKGYA 301
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE 351
L HL DY+ LF RVS+ L P +P+ +R+ + +DP L
Sbjct: 302 ALKADHLKDYKALFDRVSLSLGEGPNA------------RLPTDQRIARYGEGKDPGLAA 349
Query: 352 LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
L Q+GRYLL+SSSR Q ANLQGIWN+ L+P+W S +NIN +MNYW + CNL+E
Sbjct: 350 LYLQYGRYLLVSSSRGSRQPANLQGIWNDKLNPSWQSKWTLNINTQMNYWPAEMCNLTET 409
Query: 412 QEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
+PL + L+ G+K A+ Y A GWV + TD+W +S G VWALWPMGGAWL
Sbjct: 410 IDPLVCLVEDLAETGAKLAKDMYGAPGWVAFNNTDVWRVASPPDG-AVWALWPMGGAWLL 468
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPD 530
+LWE + Y D +L +R YPL++G + F L++ Y+ TNPS SPE+ P
Sbjct: 469 QNLWEPWLYNGDEAYL-RRIYPLMKGASEFYQATLLKDPRSDYMVTNPSNSPENRH--PF 525
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
G C MD ++R++F+ AA+VL K + A L +L P KI + G +
Sbjct: 526 GSSVCA--GPAMDNQLLRDLFAHTAEAAKVL-KTDAAFARACLAMRSKLPPEKIGKAGQL 582
Query: 591 MEWAQDF--KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
EW +D+ + P++HHRH+SHL+ L P IT+E P+L +AA K+L+ RG++ GW I
Sbjct: 583 QEWQEDWDMQAPDIHHRHVSHLYALHPSDQITVEDTPELAQAARKSLEIRGDDATGWGIG 642
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
W+ LWARL D +HA+ ++K L + P Y NLF AHPPFQID NFG A
Sbjct: 643 WRINLWARLKDGDHAHDVIKLLLH---PRRS-------YPNLFDAHPPFQIDGNFGGAAG 692
Query: 709 VAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+AEML+QS + LLPALP W +G KGLKARGG + I W+D L +V + S
Sbjct: 693 IAEMLIQSHRGRIELLPALP-SVWPTGAFKGLKARGGFELDIEWQDRRLTQVVVRS 747
>gi|393781489|ref|ZP_10369684.1| hypothetical protein HMPREF1071_00552 [Bacteroides salyersiae
CL02T12C01]
gi|392676552|gb|EIY69984.1| hypothetical protein HMPREF1071_00552 [Bacteroides salyersiae
CL02T12C01]
Length = 821
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 278/764 (36%), Positives = 418/764 (54%), Gaps = 53/764 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ ++ PA + +++P+GNGRLGAMV+G E +LNE+T+W G P + TNP A +AL
Sbjct: 24 MKLWYDRPATQWVESLPLGNGRLGAMVYGDPIHEEFQLNEETIWGGSPYNNTNPKAKEAL 83
Query: 73 SDVRSLVDSGQYAEATA------ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA A S G P YQ +G + L+F+ + Y R
Sbjct: 84 PQIRQLIFEGRNKEAQALCGPNICSQTANGMP---YQTVGSLHLDFEGIS---SYSNYYR 137
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-- 184
ELD+ A +++ G V +TRE F+S PDQ+++ +++ SE G LSF + +
Sbjct: 138 ELDIEKAVTTTRFTAGGVTYTREAFTSFPDQLLIIRLTASEKGKLSFTARYSTPYQENIT 197
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
++ ++ M+G KAN ++ +G +QF+A+ +I + G + ++ D L+
Sbjct: 198 KSISSRKELQMDG---------KANDHEGIEGKVQFTAL--TRIERNGGHMESVSDTLLR 246
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V ++ +V + V S FIN D + + + L++ +Y H Y K
Sbjct: 247 VRNAN-SVTIYV---SIGTNFINYKDISGNARKTAQTYLKNAGK-NYLKAKEAHCATYGK 301
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L + + P+ RV F + DP L L FQFGRYLLI
Sbjct: 302 WFNRVSLDLGSNAQA------------AKPTDVRVHEFASAFDPQLAALYFQFGRYLLIC 349
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN L WD +IN+EMNYW + P NL+E EP + ++
Sbjct: 350 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAEPTNLTEMHEPFLQLVKEVA 409
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G ++A + Y GW +HH TDIW + + G + +WP AW C HLW+ Y ++ +
Sbjct: 410 EQGRQSAAM-YGCRGWTLHHNTDIWRSTGSVDGP-GYGIWPTCNAWFCQHLWDRYLFSGN 467
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
RD+L + YPL+ F LD+LI E + +L +PS SPE+ + V +TM
Sbjct: 468 RDYLAE-VYPLMRSACEFYLDFLIREPQNNWLVVSPSYSPENRPSVNGKRDFVVVAGATM 526
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D ++ ++F + AA ++ ++ ++ + + L P ++ G + EW +D+ +P+
Sbjct: 527 DNQMVSDLFHNTLEAASLMGES-STFMDSLQTVVQNLAPMQVGRWGQLQEWMEDWDNPKD 585
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH SHL+GL+PG IT + P L +AA++TL+ RG+ GWS+ WK WARL D H
Sbjct: 586 RHRHTSHLWGLYPGRQIT-QNTPILFEAAKRTLEGRGDHSTGWSMGWKVCFWARLLDGNH 644
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY+++ L EK GG Y NLF AHPPFQID NFG TA ++EMLVQS ++
Sbjct: 645 AYKLITE--QLHPTTDEKGQNGGTYPNLFDAHPPFQIDGNFGCTAGISEMLVQSHAGSVH 702
Query: 723 LLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL+ RGG TV + W+D L I S+
Sbjct: 703 LLPALP-DVWKKGSVKGLRCRGGFTVEELNWEDNQLQTARITSS 745
>gi|294674990|ref|YP_003575606.1| hypothetical protein PRU_2351 [Prevotella ruminicola 23]
gi|294471732|gb|ADE81121.1| conserved hypothetical protein [Prevotella ruminicola 23]
Length = 769
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 292/805 (36%), Positives = 429/805 (53%), Gaps = 68/805 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKA 71
+ + +N PA F +++PIGNG++GA+++GG + LN+ TLWTG P D + DA K
Sbjct: 1 MVLEYNKPATFFEESLPIGNGKMGALIYGGTDDNVIYLNDITLWTGKPVDRNLDADAHKW 60
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL-EFDDSHLKYAEETYRRELDL 130
+ ++R + + YA A + + + G + YQ LG + + + +KY YRR LD+
Sbjct: 61 IPEIRKALFNENYALADSLQLHVQGPNSQHYQPLGTLHIKDLGLGEIKY----YRRTLDI 116
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
++A R Y TRE+F+SNPD++I ++ G + ++ + H +G
Sbjct: 117 DSAIVRDSYERDGRHITREYFASNPDKLIAIRLRGDINCQIALTAQVP-----HQVKSGL 171
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
Q+ M G G D + F IL +K + A D L + + A
Sbjct: 172 GQLTMTGHATG----------DAQESTHFCTILSVKTDGEM----AASDSSLTITKAKEA 217
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
++ +V +SF+G +P + + L +N+++ + Y RHL DY+ ++ RV I
Sbjct: 218 IIYIVNETSFNGFDKHPVREGANYLEAVTNDLWHTQNMTFDEFYARHLADYKTIYDRVKI 277
Query: 311 QLS---RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSS 365
L+ R+PKD+ D + E + + D+ P L EL FQFGRYLLIS+S
Sbjct: 278 CLNKGGRNPKDLPGAK------DRRMTDEMLLDYTNGNDQTPYLEELYFQFGRYLLISAS 331
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R ANLQG+W L W VNINLE NYW + N++E EPL F+ L+ N
Sbjct: 332 RTKNVPANLQGLWAPQLWSPWRGNYTVNINLEENYWPAFVANMAEMAEPLDGFIAGLAAN 391
Query: 426 GSKTAQVNY-LASGWVIHHKTDIWAKSSADRGK---VVWALWPMGGAWLCTHLWEHYNYT 481
G TA+ Y + GW H +DIWA ++ K W+ W +GGAWL LWE Y +T
Sbjct: 392 GKFTAKNYYNIHEGWCSSHNSDIWAMTNPVGEKNESPEWSNWNLGGAWLVNTLWERYQFT 451
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEG--HDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D+ +L+ AYPL++G A F L WLI+ G L T PSTSPE+E+ G Y
Sbjct: 452 QDKTYLKNIAYPLMKGAAQFCLRWLIDNPKQPGELITAPSTSPENEYKTDKGYHGTTCYG 511
Query: 540 STMDMAIIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
T D+AIIRE+F I+A +VL KN++ + ++L +L P I G + EW D+
Sbjct: 512 GTADLAIIRELFINTIAAGKVLGLKNKE-----MEQALAKLHPYTIGHMGDLNEWYYDWD 566
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
D + HRH SHL GL+PG+ +T + L KAAE++L+ +G++ GWS W+ LWARLH
Sbjct: 567 DWDFQHRHQSHLIGLYPGNHLT---DATLQKAAERSLEIKGDKTTGWSTGWRINLWARLH 623
Query: 659 DQEHAYRMVKRLFNLVDPEHEK-------HFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
+ + AY + ++L + P + H GG Y NLF AHPPFQID NFG TA V E
Sbjct: 624 NAKQAYHIYQKLLTPIAPRGVRKEDWKAWHKGGGTYPNLFDAHPPFQIDGNFGGTAGVCE 683
Query: 712 MLVQSTLND----LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
ML+QS++ + + LLPA P ++W G + GL ARGG VS WK+G + I + +
Sbjct: 684 MLMQSSIVNGQCSIELLPACP-EQWQDGAISGLCARGGYEVSFEWKNGKVRGCSIKAKKA 742
Query: 768 NNDHDSFKTLHYRGTSVKVNLSAGK 792
TL Y G KV L AG+
Sbjct: 743 GT-----LTLIYNGQQKKVKLKAGE 762
>gi|160885438|ref|ZP_02066441.1| hypothetical protein BACOVA_03438 [Bacteroides ovatus ATCC 8483]
gi|423294310|ref|ZP_17272437.1| hypothetical protein HMPREF1070_01102 [Bacteroides ovatus
CL03T12C18]
gi|156109060|gb|EDO10805.1| hypothetical protein BACOVA_03438 [Bacteroides ovatus ATCC 8483]
gi|392675501|gb|EIY68942.1| hypothetical protein HMPREF1070_01102 [Bacteroides ovatus
CL03T12C18]
Length = 811
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 281/763 (36%), Positives = 423/763 (55%), Gaps = 60/763 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NGSGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKRAMQIPYEKALKNHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLPAG------------KASQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDIDNPKDEH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA+
Sbjct: 573 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAF 632
Query: 665 RMVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+++K + L+ +H +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++
Sbjct: 633 QIIKNMIQLLPNDHLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVH 692
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL ARG TV + WK+ L++ I SN
Sbjct: 693 LLPALP-DAWEEGSVKGLVARGNFTVDMDWKNNVLNKAIIRSN 734
>gi|302873491|ref|YP_003842124.1| alpha-L-fucosidase [Clostridium cellulovorans 743B]
gi|307688330|ref|ZP_07630776.1| Alpha-L-fucosidase [Clostridium cellulovorans 743B]
gi|302576348|gb|ADL50360.1| Alpha-L-fucosidase [Clostridium cellulovorans 743B]
Length = 769
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 280/755 (37%), Positives = 407/755 (53%), Gaps = 64/755 (8%)
Query: 13 LKITFNGPAK--HFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
K+ ++ PA+ ++ A+P+GNG+LGAMV+G V E ++LNE++LW+G D NPDA
Sbjct: 13 FKLWYDEPAEVWNWDQALPVGNGKLGAMVFGHVHKEQIQLNEESLWSGGYLDRNNPDALA 72
Query: 71 ALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEF--DDSHLKYAEETYR 125
L VR L+ G+ EA ++ + G P Y+ LGD+ ++F D +K YR
Sbjct: 73 QLPKVRQLLFDGKLKEAERLCAIAMMGTPEHQRHYETLGDLFIDFYHDSDEVK----NYR 128
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN--VSLDSLLDN 183
RELD+N A V+Y + V F RE SS D IV +I+ + ++SF V + +D
Sbjct: 129 RELDINKAMVTVQYEIDGVNFKREILSSAVDDAIVIRITADKKEAISFRGFVGRELFMDT 188
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +N ++ + + G C G P I +S IL K + + G + + +
Sbjct: 189 RTALN-DSTVALRGGCGG------------PDSINYSIIL--KGTSEGGNLYTM-GGNIV 232
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
VE +D L L + +S+ D + ++S +++ +Y + H+ +YQ
Sbjct: 233 VENADAVTLYLTSKTSY---------LSNDFDAVAISTAEAVSKRTYESILQDHIAEYQS 283
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLI 362
F R+++QL + + + +P+ ER++ + + D L+ L F FGRYLLI
Sbjct: 284 YFSRMTLQLGNKQEAL--------ELSKIPTDERLERVKEGKLDDGLISLYFHFGRYLLI 335
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
S SRPGT ANLQGIWN+ + W +NIN EMNYW + CNLS+C PLFD + +
Sbjct: 336 SCSRPGTLPANLQGIWNKHHTSPWGCKFTININTEMNYWPAETCNLSDCHTPLFDLIEKM 395
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G TA+V Y G+V HH D+W ++ + +WPMG AWLC HLWEHY +T
Sbjct: 396 REPGRHTAKVMYDCGGFVAHHNVDLWGDTAPQDHWMPATVWPMGAAWLCLHLWEHYEFTC 455
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D FL K+AY L+ A F +D+LIE +GYL T PS SPE+ + G+ + +M
Sbjct: 456 DLKFL-KKAYETLKESAEFFVDYLIEDRNGYLVTCPSVSPENTYRLESGETGSLCIGPSM 514
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D II +FS+ I A+E+L +++ E ++ RL I + G IMEWA+D+ + E
Sbjct: 515 DSQIIYALFSSCIEASELLNTDKE-FAETLISLRERLPKPSIGKYGQIMEWAEDYDEVEP 573
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHD 659
HRH+S LF L P + IT++ P L KAA TL++R G GWS W WARL +
Sbjct: 574 GHRHISQLFALHPSNQITVKDTPQLAKAARNTLERRLAHGGGHTGWSRAWIINFWARLEE 633
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
E AY + L NL HPPFQID NFG A VAEMLVQS N
Sbjct: 634 GEKAYENINAL-----------LAKSTLINLLDNHPPFQIDGNFGGAAGVAEMLVQSHSN 682
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
++ + PA+P +WS G V GL ARGG +SI W +
Sbjct: 683 EINIFPAMP-KQWSEGEVTGLCARGGFELSIKWTE 716
>gi|423215045|ref|ZP_17201573.1| hypothetical protein HMPREF1074_03105 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692308|gb|EIY85546.1| hypothetical protein HMPREF1074_03105 [Bacteroides xylanisolvens
CL03T12C04]
Length = 811
Score = 473 bits (1218), Expect = e-130, Method: Compositional matrix adjust.
Identities = 282/763 (36%), Positives = 426/763 (55%), Gaps = 60/763 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PIVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESRRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TGKTSQ-----LETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDIDNPKDEH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA+
Sbjct: 573 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAF 632
Query: 665 RMVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+++K + L+ +H +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++
Sbjct: 633 QIIKNMIQLLPNDHLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVH 692
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL ARG TV + WK+ L++ I SN
Sbjct: 693 LLPALP-DAWEEGSVKGLVARGNFTVDMDWKNNVLNKAIIRSN 734
>gi|293370624|ref|ZP_06617176.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292634358|gb|EFF52895.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 811
Score = 473 bits (1217), Expect = e-130, Method: Compositional matrix adjust.
Identities = 283/763 (37%), Positives = 422/763 (55%), Gaps = 60/763 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIKREELQLNEETFWAGSPYNNNNPNAIHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PIVRKLIFEGRNKEAQRLIDTNFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQND 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+ C GK + +G++ + E +I GT+ + EG++
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D + + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSANESRRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TGKASQ-----LETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G+KT
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGTKT 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y + GWV HH TD+W G V +A +WP GGAWL H+W+HY +T D++F
Sbjct: 409 ARNMYNSRGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGDQEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L PS SPEH V+ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPTYKWLVVAPSVSPEH---------GPVTAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDIDNPKDEH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA+
Sbjct: 573 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAF 632
Query: 665 RMVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+++K + L+ D +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++
Sbjct: 633 QIIKNMIQLLPNDNLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVH 692
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL ARG TV + WK+ L++ I SN
Sbjct: 693 LLPALP-DAWEEGSVKGLVARGNFTVDMDWKNNVLNKAIIRSN 734
>gi|423223626|ref|ZP_17210095.1| hypothetical protein HMPREF1062_02281 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638251|gb|EIY32098.1| hypothetical protein HMPREF1062_02281 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 814
Score = 473 bits (1216), Expect = e-130, Method: Compositional matrix adjust.
Identities = 299/824 (36%), Positives = 438/824 (53%), Gaps = 81/824 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALS 73
I ++ PA+ + +A+PIGNGRLGAM +GG+ E L+LN+ T+W+G P ++ DA K L
Sbjct: 34 IWYDKPAEIWEEALPIGNGRLGAMCFGGIHEEKLQLNDVTIWSGEPQPNSDRTDAYKKLP 93
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPA----DVY--------QLLGDIELEFDDSHLKYAE 121
++R + + Y A + + + D+Y Q LGD+ L+F +
Sbjct: 94 EIREALRNRDYKLAEVLTHQYMTCNSVSNEDIYNTIYSSSYQTLGDLSLKFKLPEGEMG- 152
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+YRR LD+ A + V + +G F+RE FSS PD VIV K+ G LSF++ LD
Sbjct: 153 -SYRRWLDITRAISGVDFKIGEYSFSREIFSSAPDSVIVMKLGTDMKGGLSFSMLLDRKF 211
Query: 182 ------DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D+H V N ME R N + + + +K+ D G +S
Sbjct: 212 SAVTTSDSHGLVMKGNTDYMEHR---------GNCDYEAR---------VKVVADGGRVS 253
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
K+ V+G+D A + + +S+ + D + +++ L + Y D+ +
Sbjct: 254 N-SKGKISVQGADSAYVYITCQTSYILDYKKNYRRAID-SKDAVRKLNIVSRKKYDDVKS 311
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLF 354
H+ DYQ +F+R+S+ L + ++ID +P+ +R+ F + +D V+L +
Sbjct: 312 IHVADYQGIFNRLSLNLG-----------NNKSID-IPTDQRLTRFNEKSDDLGFVDLFY 359
Query: 355 QFGRYLLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
QFGRYL+ISSSR + N QGIW + W S NIN +MNYW NLSEC
Sbjct: 360 QFGRYLMISSSRENNPLPGNCQGIWGDGYKLPWHSDYKANINYQMNYWMVEASNLSECHI 419
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
P+ L G KTAQ + ASGW+ T+ W +S + +W + G W C
Sbjct: 420 PMLRLTASLVEPGRKTAQSYFNASGWMYAMMTNAWGWTSPGQ-YTIWGSFFGGSGWACQD 478
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT D+++L K YP+L+ F L LIE DGYL T+PSTSPE+ +IAPDG
Sbjct: 479 FWEHYAYTQDKEYLRK-VYPILKEACEFYLSVLIENKDGYLVTSPSTSPENRYIAPDGSR 537
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSIME 592
V+ ST++++IIR +FS I A +L NED +++L KSL RLRP +I G +ME
Sbjct: 538 VAVTEGSTIELSIIRNLFSNTIYATGIL--NEDNSFKEILEKSLARLRPLQIGRAGQLME 595
Query: 593 WAQDF--KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
W DF ++ HRH+SHLF L PG I ++ +L +AA+++LQ RG+EG GWS+ WK
Sbjct: 596 WNDDFDLNAEDIRHRHVSHLFALHPGREIIPFEHKELAEAAKRSLQIRGDEGTGWSLAWK 655
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHF-EGGLYSNLFAAHPPFQIDANFGFTAAV 709
WARL + ++AY+++ R LV + +GG Y NLF AHPPFQID N+GF + V
Sbjct: 656 INFWARLLEGDYAYKLLCRQLKLVRSNDTNYSNQGGTYPNLFDAHPPFQIDGNYGFVSGV 715
Query: 710 AEMLVQSTL---------NDLY---LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
EML+QS DLY +LPALP K G + G++ARGG +S WKDG L
Sbjct: 716 NEMLLQSHEMYIDPSSPNEDLYVIRILPALP-QKIREGKISGIRARGGFELSFEWKDGRL 774
Query: 758 HEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLK 801
I S D + Y+ + +N++ G+ N K
Sbjct: 775 VNAVITSL-----ADKQARVFYQEKEISLNIAKGETKELNELCK 813
>gi|224536491|ref|ZP_03677030.1| hypothetical protein BACCELL_01366 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521893|gb|EEF90998.1| hypothetical protein BACCELL_01366 [Bacteroides cellulosilyticus
DSM 14838]
Length = 815
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 298/824 (36%), Positives = 438/824 (53%), Gaps = 81/824 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALS 73
I ++ PA+ + +A+PIGNGRLGAM +GG+ E L+LN+ T+W+G P ++ DA K L
Sbjct: 35 IWYDKPAEIWEEALPIGNGRLGAMCFGGIHEEKLQLNDVTIWSGEPQPNSDRTDAYKKLP 94
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPA----DVY--------QLLGDIELEFDDSHLKYAE 121
++R + + Y A + + + D+Y Q LGD+ L+F+ +
Sbjct: 95 EIREALRNRDYKLAEVLTHQYMTCNSVSNEDIYNTIYSSSYQTLGDLSLKFELPEGEMG- 153
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+YRR LD+ A + V + +G F+RE FSS PD VIV K+ G LSF++ LD
Sbjct: 154 -SYRRWLDITRAISGVDFKIGEYSFSREIFSSAPDSVIVMKLGTDMKGGLSFSMLLDRKF 212
Query: 182 ------DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D+H V N ME R N + + + +K+ D G +S
Sbjct: 213 SAVTTSDSHGLVMKGNTDYMEHR---------GNCDYEAR---------VKVVADGGRVS 254
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
K+ V+G+D A + + +S+ + D + +++ L + Y D+ +
Sbjct: 255 N-SKGKISVQGADSAYVYITCQTSYILDYKKNYRRAID-SKDAVRKLNIVSRKKYDDVKS 312
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLF 354
H+ DYQ +F+R+S+ L + ++ID +P+ +R+ F + +D V+L +
Sbjct: 313 IHVADYQGIFNRLSLNLG-----------NNKSID-IPTDQRLTRFNEKSDDLGFVDLFY 360
Query: 355 QFGRYLLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
QFGRYL+ISSSR + N QGIW + W S NIN +MNYW NLSEC
Sbjct: 361 QFGRYLMISSSRENNPLPGNCQGIWGDGYKLPWHSDYKANINYQMNYWMVEASNLSECHI 420
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
P+ L G KTAQ + ASGW+ T+ W +S + +W + G W C
Sbjct: 421 PMLRLTASLVEPGRKTAQSYFNASGWMYAMMTNAWGWTSPGQ-YTIWGSFFGGSGWACQD 479
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT D+++L K YP+L+ F L LIE DGYL T+PSTSPE+ +IAPDG
Sbjct: 480 FWEHYAYTQDKEYLRK-VYPILKEACEFYLSVLIENKDGYLVTSPSTSPENRYIAPDGSR 538
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSIME 592
V+ ST++++IIR +FS I A +L NED +++L KSL RLRP +I G +ME
Sbjct: 539 VAVTEGSTIELSIIRNLFSNTIYATGIL--NEDNSFKEILEKSLARLRPLQIGRAGQLME 596
Query: 593 WAQDF--KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
W DF ++ HRH+SHLF L PG I ++ +L +AA+++LQ RG+EG GWS+ WK
Sbjct: 597 WNDDFDLNAEDIRHRHVSHLFALHPGREIIPFEHKELAEAAKRSLQIRGDEGTGWSLAWK 656
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHF-EGGLYSNLFAAHPPFQIDANFGFTAAV 709
WARL + ++AY+++ R LV + +GG Y NLF AHPPFQID N+GF + V
Sbjct: 657 INFWARLLEGDYAYKLLCRQLKLVRSNDTNYSNQGGTYPNLFDAHPPFQIDGNYGFVSGV 716
Query: 710 AEMLVQSTL---------NDLY---LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
EML+QS DLY +LPALP K G + G++ARGG +S WKDG L
Sbjct: 717 NEMLLQSHEMYIDPSSPNEDLYVIRILPALP-QKIREGKISGIRARGGFELSFEWKDGRL 775
Query: 758 HEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLK 801
I S + Y+ + +N++ G+ N K
Sbjct: 776 VNAVITSLAGKQAR-----VFYQEKEISLNIAKGETKELNELCK 814
>gi|253580291|ref|ZP_04857557.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39B_FAA]
gi|251848384|gb|EES76348.1| conserved hypothetical protein [Ruminococcus sp. 5_1_39BFAA]
Length = 751
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 273/796 (34%), Positives = 421/796 (52%), Gaps = 67/796 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + F+ PA+ + +A+P+GNG +GAM +G +E ++LN D+LW+G + NP+
Sbjct: 4 LALIFDKPAEAWNEALPLGNGTMGAMSYGRFQNERIELNLDSLWSGNGRNKENPNKNVDW 63
Query: 73 SDVRSLVDSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
R + +G Y A + G + Y G + + + ++ YRREL L
Sbjct: 64 DLFRKHIFAGDYQGAENYCKENVLGDWTESYLPAGTLSINVKEP-IQNGNSFYRRELCLT 122
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
AT ++++ ++ + RE F S + V+ S + +L +++L+S + + S N
Sbjct: 123 NATEKIEFCQDDLIYQREFFVSMSEPVMAIHYHTSPNCNLEMSITLESEIKHKSAFFAEN 182
Query: 192 QIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
II+EG+ P PP + ++ +GI+F+ + + + + G + DK
Sbjct: 183 GIILEGQAPIYVAPPYYSCEVPVVYEEGQGIRFA--IGLYVQTNGGNVYQQADKLFINTP 240
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D V + V+ + K+ S+ +++I+++ Y H+D Y F
Sbjct: 241 ND--VYIYVSG-------VTDFKQKELFFSKRNCMMENIQHIQYEKQKKAHMDVYANYFD 291
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R+ + ++ +P D L +F + RYL+I SS
Sbjct: 292 RMHLDINYTP-----------------------------DNELALKMFHYARYLMICSSV 322
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG+Q NLQGIWN + W S VNIN EMNYW + NLS+C PL + + S G
Sbjct: 323 PGSQCTNLQGIWNHHMRAPWSSNYTVNINTEMNYWMAEKANLSDCHMPLLELIERTSKKG 382
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSS------ADRGKVVWALWPMGGAWLCTHLWEHYNY 480
KTAQ Y +GWV HH DIW SS D +++WPM WLC HLWEHY Y
Sbjct: 383 EKTAQDVYHLAGWVSHHNLDIWGHSSPVGQFGQDENPCTYSMWPMSSGWLCCHLWEHYCY 442
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
T+D FL+K+A+P+++G F L +L+ + GY T PSTSPE+ F+APD V+++S
Sbjct: 443 TLDEAFLKKKAFPIIQGAVEFYLGYLVP-YKGYYVTAPSTSPENTFLAPDMTTHGVTFAS 501
Query: 541 TMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
TMD++I+RE+F + A E+L E +A V+ VL+ LP P KI ++G + EW D+
Sbjct: 502 TMDISILRELFGLYLKACEILGVEDFTNA-VKNVLQKLP---PYKIGKEGQLQEWFYDYP 557
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
+ +++HRH+SHLFGL+PG+ I E P L +A +L++RG++G GW + WK LWA+L
Sbjct: 558 EADINHRHISHLFGLYPGNQIHKENEP-LIEACRTSLERRGDKGTGWCMAWKACLWAKLG 616
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D HA ++K L E GG+Y N+ AHPPFQID NFGF AAV EMLVQ
Sbjct: 617 DGNHALTLLKNQLRLTREEACSLVGGGIYPNMLCAHPPFQIDGNFGFAAAVLEMLVQYEE 676
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
+ LPALP D+W G +G+KA G T++ WK+ + E+ + S D+ +
Sbjct: 677 QKIVFLPALP-DEWKDGMAEGVKAPGNITLNFKWKEKRVTEINLKSPI-----DAKLVIL 730
Query: 779 YRGTSVKVNLSAGKIY 794
Y G ++ L+AG Y
Sbjct: 731 YNGMEEEIVLNAGSSY 746
>gi|340619504|ref|YP_004737957.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339734301|emb|CAZ97678.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 817
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 287/814 (35%), Positives = 433/814 (53%), Gaps = 77/814 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + +A+P+GNGR+GAMV+G E ++ NE+T W+G P K L +++
Sbjct: 42 YDKPASMWEEALPVGNGRIGAMVYGKSGEEKIQFNEETYWSGGPYSQVVKGGYKKLPEIQ 101
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ +G+ +A + L G+P + YQ L ++ L F + + YRR LDL T
Sbjct: 102 KYIFNGEPIKAHKLFGRALMGYPVEQQKYQSLANLHLFFGQDSV----DNYRRSLDLKTG 157
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
V+Y+ G V +T+E F+S DQ I +I+ + GS++F+ L + ++ +
Sbjct: 158 VVTVEYTYGGVNYTKEVFASAVDQTIAIRITADKPGSINFDAELRGVRNSAHSNYATDYF 217
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRGTISALEDKKLKVEGSDWAV 251
M+G GK + D G++ E IK + GT+S ++ L ++ +D A
Sbjct: 218 RMDGL--GKDQLKLTGKSADYMGVEGKLRYEARIKAVPEGGTMS-IDGTMLSIKNADAAT 274
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
L VA+++F +N D D L ++ S+ + L DY++ F RVS+
Sbjct: 275 LYFVAATNF----VNYKDVSADENKRVEDMLAKVQQSSFDAIKKSALADYKEYFDRVSLT 330
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
L + + P+ +R+ Q+ DP L L + FGRYLLISSSRPGTQ
Sbjct: 331 LPTTDNSFL------------PTDKRMVEIQSSPDPQLSTLCYNFGRYLLISSSRPGTQP 378
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN D++P WDS NIN EMNYW NLSE EPL + L+ G+K A+
Sbjct: 379 ANLQGIWNNDMNPAWDSKYTTNINTEMNYWAVESANLSELSEPLTTMVKELTDQGAKVAK 438
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
+Y A GWV H TD+W + +A W + +GGAWL THLWEHY +T D+++L K
Sbjct: 439 EHYGADGWVFHQNTDLW-RVAAPMDGPTWGTFTVGGAWLTTHLWEHYLFTQDKEYL-KDI 496
Query: 492 YPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPDGK--------------LAC 535
YP+++G F +D+L+E G D +L TNPS SPE+ P+GK
Sbjct: 497 YPVMKGSVEFFMDFLVEYPGTD-WLVTNPSNSPEN---PPEGKGYKYFYDEITGMYYFTT 552
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+ ST+DM I++++FS SA+E+L+ + + L ++V + RL P++I +DG++ EW +
Sbjct: 553 IVAGSTIDMQILKDLFSYYDSASEILDVDPE-LRKQVSIARSRLVPSQIGKDGTLQEWTE 611
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D+ E +HRH SHL+GLFPG+ I++ + P+L + +KTL+ RG+ GWS WKT LWA
Sbjct: 612 DYGQMEKNHRHASHLYGLFPGNVISVTRTPELIEPVKKTLELRGDGASGWSRAWKTCLWA 671
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFA-AHPPFQIDANFGFTAAVAEMLV 714
RL D + A + K + + YS+LFA FQ+D G TA ++EML+
Sbjct: 672 RLRDGDRANSIFK-----------GYLKEQAYSSLFAICARQFQVDGTLGMTAGISEMLI 720
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN------ 768
QS L LLPALP +W+ G G+ ARGG + WKD + + I S
Sbjct: 721 QSQEGYLDLLPALP-SEWADGQFSGVCARGGFELDFSWKDKQITSLEILSKAGTTCSLKA 779
Query: 769 -------NDHDSFKTLHYRGTSVKVNLSAGKIYT 795
+D KT + V+ N GK Y+
Sbjct: 780 GSKVKVFSDGKQIKTKKRKNQIVEFNTEQGKTYS 813
>gi|376260262|ref|YP_005146982.1| trehalose/maltose hydrolase or phosphorylase [Clostridium sp.
BNL1100]
gi|373944256|gb|AEY65177.1| trehalose/maltose hydrolase or phosphorylase [Clostridium sp.
BNL1100]
Length = 1159
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 285/763 (37%), Positives = 407/763 (53%), Gaps = 65/763 (8%)
Query: 22 KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDS 81
+ F A+P+GNGR+GAMV+G P E + LNE T W+ PG+ A +L + + +
Sbjct: 74 ESFYKALPLGNGRIGAMVYGNYPDERIDLNEATFWSSGPGNNNRAGAANSLKTAQDQLFA 133
Query: 82 GQYAE-ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
GQY +T + + G YQ +GD++L F S + Y R+LD+NT Y+
Sbjct: 134 GQYKTGSTTIANSMIGGGEAKYQSIGDLKLLFGHSSV----SNYSRQLDMNTGVVSSDYT 189
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN--GNNQIIMEGR 198
++ RE F S PDQ++VTKI+ S GS+S +S L V+ GN+ ++M G
Sbjct: 190 YNGKQYHRESFVSYPDQIMVTKITCSSPGSISLTAGYESSLTGQYTVSTSGNDTLVMNGH 249
Query: 199 CPGKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
D GI ++ KI + G++SA + ++ V +D V+L
Sbjct: 250 ------------GDSDNGISYAVWFSTRSKIINSNGSVSA-NNNQISVSNADSVVIL--- 293
Query: 257 SSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSP 316
+S F+N D ++ + + + SY LY H+ DYQ LF RV + L S
Sbjct: 294 -TSIRTNFVNYKTCNGDEKGKATTDITNASAKSYDTLYNNHVADYQNLFKRVDVDLGGS- 351
Query: 317 KDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQG 376
SE N P +R+ F T DP L ++LFQ+GRYL+IS+SR +Q NLQG
Sbjct: 352 -------GSENN---KPMGQRISEFGTTNDPKLAKVLFQYGRYLMISASRD-SQSMNLQG 400
Query: 377 IWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-L 435
IWN+ +P W NIN EMNYW + NL+EC EP L G++TA+ +Y +
Sbjct: 401 IWNKFRNPAWGCKMTTNINYEMNYWPAFTTNLAECFEPFVKKAKELQAPGNETARAHYNI 460
Query: 436 ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
++GWV+HH TD+W +++ G+ W LWP G W+ L++ YN+ D +L + YP++
Sbjct: 461 SNGWVLHHNTDLWNRTAPIDGE--WGLWPTGAGWVSNMLYDAYNFNQDTAYLNE-IYPVI 517
Query: 496 EGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAP----DGKLACVSYSSTMDMAII 547
+G A FL + I G + Y PSTSPE + P G+ A SY TMD I
Sbjct: 518 KGAADFLQTLMQSKSINGQN-YQVICPSTSPE---LTPPGTSGGQGAYNSYGVTMDNGIS 573
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
RE+F +I AA +L N D L+S + +++P I G + EWA D+ +RH
Sbjct: 574 RELFKDVIQAAGIL--NVDPAFRSTLQSKVSQIKPNTIGSWGQLQEWAYDWDSQSERNRH 631
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+S + LFPG I P + A K+L RG+ G GWS WK WARL D HAY +
Sbjct: 632 ISFAYDLFPGLEINKRNTPSIANAVIKSLNTRGDAGTGWSEAWKLNCWARLEDGAHAYNL 691
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
VK L + V+ +G LY NL+ AHPPFQID NFGFT+ +AEML+QS N++ LLPA
Sbjct: 692 VKLLISPVNK------DGRLYDNLWDAHPPFQIDGNFGFTSGIAEMLLQSHNNEIQLLPA 745
Query: 727 LPWDKWSSGCVKGLKARGGETVS-ICWKDGDLHEVGIYSNYSN 768
LP +WS+G GL ARG T++ + W +G L I SN N
Sbjct: 746 LP-SQWSTGHADGLCARGNFTITKMNWANGVLTGATIKSNSGN 787
>gi|198275795|ref|ZP_03208326.1| hypothetical protein BACPLE_01970 [Bacteroides plebeius DSM 17135]
gi|198271424|gb|EDY95694.1| hypothetical protein BACPLE_01970 [Bacteroides plebeius DSM 17135]
Length = 816
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 286/761 (37%), Positives = 423/761 (55%), Gaps = 53/761 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++ +A+P+GN LG MV+GG+ E ++LNE+T W G P A L
Sbjct: 26 LKLWYSAPARNWWEALPVGNSHLGGMVFGGINHEEIQLNEETFWAGGPYSNNRTGASGYL 85
Query: 73 SDVRSLVDSGQYAEATAASVKLF--GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+VR L+ + EA + F H Y LG + ++F+ + ++Y R+L+L
Sbjct: 86 DEVRRLIFENKNLEARTLLDEKFMTSHHGMRYLTLGSLLMDFN---CEGKVDSYYRDLNL 142
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
ATA V++ VE+TR F+S D V+V +++ ++ G+ +V L S V
Sbjct: 143 EDATASVRFRCDGVEYTRRVFTSFSDNVMVVEMA-TDKGNKKLDVDLRYTCPLTSEVKSE 201
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
++ +C G A P + A++ +++ D G I +D +L V G+ A
Sbjct: 202 GDYLIM-KCNG------AEHEGIPAALH--AVVMMRVKSD-GKIEC-KDGRLSVRGASSA 250
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ L A+++F +N D D +++ A++ + LY H Y F RV++
Sbjct: 251 TVFLSAATNF----VNYQDVSGDAYAKARCAIEGAWDKQNKKLYDEHKAIYSAQFGRVAL 306
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L S + E N+ R+ F +D SL L+FQ+GRYLLISSS+PG+Q
Sbjct: 307 HLPSS-----EFSKKETNV-------RINEFNKVKDCSLAALMFQYGRYLLISSSQPGSQ 354
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+DL WDS +NIN EMNYW + NLSE P F LS+ G + A
Sbjct: 355 PANLQGIWNKDLYAPWDSKYTININAEMNYWPAEVTNLSETHVPFFQMAHELSVTGKEAA 414
Query: 431 QVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+V Y A GWV HH TDIW + AD G +WP GGAW+ HLW+HY Y+ D++F
Sbjct: 415 RVLYGAKGWVAHHNTDIWRAAGPVDFADAG-----MWPNGGAWVAQHLWQHYLYSGDKNF 469
Query: 487 LEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + YP+L+G A FLL ++ + G+ T PS SPEH P+G + TMD
Sbjct: 470 L-REYYPVLKGTADFLLSFMTKHPRYGWRVTAPSVSPEH---GPNG--VSIVAGCTMDNQ 523
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I +V S + AA ++ + A + + + +L P +I + + EW +D DP+ HR
Sbjct: 524 IAFDVLSNTLRAARII-GDSKAYCDSLQSLISQLPPMQIGQYNQLQEWLEDVDDPKDQHR 582
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GL+P + I+ ++P+L +AA+ TL +RG+ GWSI WK WAR+ D HAY
Sbjct: 583 HISHLYGLYPSNQISPYRHPELFQAAKNTLLQRGDMATGWSIGWKINFWARMLDGNHAYN 642
Query: 666 MVKRLFNLV--DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
+++ + +L+ D K+ G Y N+F AHPPFQID NFGFTA VAEML+QS ++L
Sbjct: 643 IIRNMLSLLPCDSLAGKYPLGRTYPNMFDAHPPFQIDGNFGFTAGVAEMLLQSHDGAVHL 702
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
LPA+P D+W G VKGL ARGG V + WK+ L + IYS
Sbjct: 703 LPAVP-DEWQDGNVKGLVARGGFVVDMDWKNVHLTKAVIYS 742
>gi|373956599|ref|ZP_09616559.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
gi|373893199|gb|EHQ29096.1| alpha-L-fucosidase [Mucilaginibacter paludis DSM 18603]
Length = 783
Score = 471 bits (1212), Expect = e-130, Method: Compositional matrix adjust.
Identities = 286/781 (36%), Positives = 426/781 (54%), Gaps = 63/781 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S ++PL++ +N PA+ + + +P+GNGRLG M GGV ET+ LN+ TLW+G P D N
Sbjct: 20 SFGQSHPLRLWYNKPAQMWEETLPLGNGRLGMMPDGGVSQETIVLNDITLWSGAPQDANN 79
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEFD---- 113
A K+L +R L+ G+ EA A + F G YQ+LG++ L F
Sbjct: 80 YQAYKSLPQIRKLLMEGKNDEAQALVDQAFICTGKGSGGVNYGCYQVLGNLSLNFQYPDH 139
Query: 114 ---DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS 170
+S + Y + Y REL L+ A A+ Y V V + RE+ +S D V + K++ + G
Sbjct: 140 NTANSPVNY--QNYERELTLDNAIAKCTYQVNGVTYKREYITSFGDDVDIIKLTADKPGQ 197
Query: 171 LSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD 230
L+ ++ + + + V N + MEG+ + D KG+Q+ AI++ ++
Sbjct: 198 LNLSIGISRPERSATSV-ANGALQMEGQL---------DNGIDGKGMQYQAIVK---AEQ 244
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSY 290
+G ++ ++ + ++ + A + F P K+ S A+Q Y
Sbjct: 245 QGGSVNYSSSQINIKDATSVIIYISAGTDFRNPHF-----KQSIQSVLTKAIQK----PY 295
Query: 291 SDLYTRHLDDYQKLFHRVSIQLSRSP-KDIVTDTCSEENIDTVPSAERVKSFQTDE--DP 347
S +H+ YQKLF+RV + L P K++ TD +R+ +F D D
Sbjct: 296 SLQKQQHIARYQKLFNRVHVNLGAEPAKELTTD-------------QRLIAFHADRKADN 342
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L L FQFGRYL I S+R G NLQG+W +S W H+++N++MN+W N
Sbjct: 343 GLPALFFQFGRYLSICSTRVGLLPPNLQGLWANQISTPWTGDYHLDVNVQMNHWPLEVAN 402
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL D + + +G KTA+ Y A GWV H T++W + W G
Sbjct: 403 LSELNLPLADLVKRMVPHGEKTAKAYYNAKGWVAHVITNVWQFTEPGE-SASWGATKAGS 461
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
WLC +LWEHY +T D ++L + YP+L+G A F D LI+ G+L T+PS+SPE+ F
Sbjct: 462 GWLCDNLWEHYAFTNDVNYL-RDIYPVLKGAAQFYNDMLIKDPKSGWLVTSPSSSPENSF 520
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRPTKI 584
P+GK A + T+D IIRE+F+ +I+A+ L + A +++ + LP P +I
Sbjct: 521 YLPNGKHASICLGPTIDNQIIRELFNNVITASGKLGVDAALSAELQQRVTQLPP--PGRI 578
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
A DG IMEW +++K+ E HRH+SHL+GL+P IT P L +AA+KTL+ RG++GPG
Sbjct: 579 ASDGRIMEWMEEYKETEPQHRHISHLYGLYPASLITSNHTPALAEAAKKTLEVRGDDGPG 638
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANF 703
WSI +K WARLHD + AY++ L + + GG+Y NL A PPFQID NF
Sbjct: 639 WSIAYKALFWARLHDGDRAYKLFCGLMKPTIKTDMNYGAGGGIYPNLLDAGPPFQIDGNF 698
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
G AAVAEML+QS + LLPA+P + ++G V+GLKARG TV + WK+G + I
Sbjct: 699 GGAAAVAEMLLQSNAGFIELLPAIPSEWKATGKVQGLKARGNFTVDMEWKNGKVISYKIA 758
Query: 764 S 764
S
Sbjct: 759 S 759
>gi|336404644|ref|ZP_08585337.1| hypothetical protein HMPREF0127_02650 [Bacteroides sp. 1_1_30]
gi|335941548|gb|EGN03401.1| hypothetical protein HMPREF0127_02650 [Bacteroides sp. 1_1_30]
Length = 811
Score = 470 bits (1210), Expect = e-129, Method: Compositional matrix adjust.
Identities = 277/763 (36%), Positives = 420/763 (55%), Gaps = 60/763 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAM++GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMIYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDTNFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQND 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+ C GK + +G++ + E +I GT+ + EG++
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESHRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLPAG------------KASQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDVDNPKDEH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA+
Sbjct: 573 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAF 632
Query: 665 RMVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+++K + L+ +H +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++
Sbjct: 633 QIIKNMIQLLPNDHLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVH 692
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL ARG TV + WK+ L++ I SN
Sbjct: 693 LLPALP-DAWEEGSVKGLVARGNFTVDMDWKNNVLNKAIIRSN 734
>gi|329957629|ref|ZP_08298104.1| hypothetical protein HMPREF9445_02986 [Bacteroides clarus YIT
12056]
gi|328522506|gb|EGF49615.1| hypothetical protein HMPREF9445_02986 [Bacteroides clarus YIT
12056]
Length = 827
Score = 470 bits (1210), Expect = e-129, Method: Compositional matrix adjust.
Identities = 269/774 (34%), Positives = 425/774 (54%), Gaps = 52/774 (6%)
Query: 2 MNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
++ + T+ N LK+ ++ PAK + +A+P+GNGR+GAMV+G E +LNE+T+W G P
Sbjct: 15 ISGKITAHDNSLKLWYDKPAKQWVEALPLGNGRIGAMVFGDPAHERFQLNEETVWGGSPH 74
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDS 115
+ TNP+A +AL +R L+ G+ EA S G P YQ +G + L+F+
Sbjct: 75 NNTNPNAKEALPRIRRLIFEGKNKEAQELCGPAICSQSANGMP---YQTVGTLHLDFEGI 131
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
+ + + R+LD+ A A +++ + + RE F+S PD++++ K++ S+ S+SF
Sbjct: 132 N---QYDDFYRDLDIEKAIATTRFTANGITYIREAFTSFPDRLLIIKLTASKKKSISFTA 188
Query: 176 SLDS-LLDNHSY-VNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRG 232
+ +N + ++ ++ + G KAN ++ +G I+F+A+ +I ++ G
Sbjct: 189 HYTTPYTENTEFCISPRKELQLNG---------KANDHEGIEGKIRFTAL--TRIDNNGG 237
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
T+ D L+V+ +D L + ++F IN D D + ++ +Y+
Sbjct: 238 TLKVTSDSTLQVKNADSVTLYVSIGTNF----INYKDVSGDALKAARQYMKQAGK-NYTK 292
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
H+ YQ+ F+RVS+ L S + I P+ RV+ F + DP + L
Sbjct: 293 RKEAHIAAYQQYFNRVSLDLG-----------SNDQIKK-PTDRRVREFSSVTDPQMAAL 340
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
FQFGRYLLI SS+PG Q ANLQGIWN L WD +IN+EMNYW + LSE
Sbjct: 341 YFQFGRYLLICSSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAETTALSEMH 400
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
EP + ++I G ++A + Y GW +HH TDIW + A G + +WP AW C
Sbjct: 401 EPFLQLVKEVAIQGRESASM-YSCRGWTLHHNTDIWRTTGAVDG-AKYGVWPTCNAWFCQ 458
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
HLW+ Y ++ D+++L + YP++ G F LD+L+ E + +L PS SPE+
Sbjct: 459 HLWDRYLFSGDKNYLAE-VYPIMRGACEFYLDFLVREPKNNWLVVAPSYSPENSPSVNGK 517
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ + +TMD ++ ++F I AA ++ +N A + + L P ++ G +
Sbjct: 518 RGFVIVAGTTMDNQMVYDLFYNTIQAANLMNENT-AFTDSLQTVANHLAPMQVGRWGQLQ 576
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+ +P+ HHRH+SHL+GL+PG I+ +P L +AA+ +L RG+ GWS+ WK
Sbjct: 577 EWMEDWDNPQDHHRHVSHLWGLYPGRQISAYHSPVLFEAAKTSLTARGDHSTGWSMGWKV 636
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
LWARL D HAY+++ + E ++ GG Y NLF AHPPFQID NFG TA + E
Sbjct: 637 CLWARLLDGNHAYKLITEQLHPTTDERGQN--GGTYPNLFDAHPPFQIDGNFGCTAGITE 694
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYS 764
M VQS ++LLPALP D W G +KG++ RGG + + W+ G + I S
Sbjct: 695 MFVQSHDGAVHLLPALP-DVWERGVIKGIRCRGGFLLEEMKWEKGQMQTATICS 747
>gi|295086436|emb|CBK67959.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 811
Score = 470 bits (1210), Expect = e-129, Method: Compositional matrix adjust.
Identities = 280/763 (36%), Positives = 426/763 (55%), Gaps = 60/763 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAIHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PAVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y + +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQMDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D + + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSANESHRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TGKASQ-----LETPKRIENFGYGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDVDNPKDEH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA+
Sbjct: 573 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAF 632
Query: 665 RMVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+++K + L+ +H +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++
Sbjct: 633 QIIKNMIQLLPSDHLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVH 692
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL ARG TV + WK+ L++ I SN
Sbjct: 693 LLPALP-DAWEEGSVKGLVARGNFTVDMDWKNNVLNKAIIRSN 734
>gi|423228044|ref|ZP_17214450.1| hypothetical protein HMPREF1063_00270 [Bacteroides dorei
CL02T00C15]
gi|423243307|ref|ZP_17224383.1| hypothetical protein HMPREF1064_00589 [Bacteroides dorei
CL02T12C06]
gi|392637080|gb|EIY30955.1| hypothetical protein HMPREF1063_00270 [Bacteroides dorei
CL02T00C15]
gi|392645314|gb|EIY39042.1| hypothetical protein HMPREF1064_00589 [Bacteroides dorei
CL02T12C06]
Length = 814
Score = 470 bits (1209), Expect = e-129, Method: Compositional matrix adjust.
Identities = 291/797 (36%), Positives = 439/797 (55%), Gaps = 51/797 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NP+A + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPNALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRQLVFEGKYLEAQTLATEKIMTKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVS 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G A D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTAR---SQGGTQACRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYVTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L VT + RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSLDLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+I ++++ II+ A +L + + + + L + P +I G + EW D+ +P+ HR
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATHLEQRLKEMAPMQIGRWGQLQEWMTDWDNPQDVHR 586
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +HAY+
Sbjct: 587 HVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGDHAYK 646
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+QS +YLLP
Sbjct: 647 LITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLMQSHDGFIYLLP 703
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS-NYSNNDHDSFKTLHYRGTSV 784
ALP +W G V G+ ARGG + + WK+G + + + S N N S L +G
Sbjct: 704 ALP-AQWKEGSVNGIIARGGFELDLSWKNGKVSRLVVKSRNGGNCRLRSLNPLAGKGLRT 762
Query: 785 KVNLSAGKIYTFNRQLK 801
+ K+Y L+
Sbjct: 763 AKGENPNKLYAIPEILQ 779
>gi|389793150|ref|ZP_10196324.1| hypothetical protein UU9_03133 [Rhodanobacter fulvus Jip2]
gi|388434883|gb|EIL91810.1| hypothetical protein UU9_03133 [Rhodanobacter fulvus Jip2]
Length = 802
Score = 470 bits (1209), Expect = e-129, Method: Compositional matrix adjust.
Identities = 288/808 (35%), Positives = 420/808 (51%), Gaps = 49/808 (6%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
+ + S + N L++ ++ PA F +A+P+GNGR+G MV+GGV L+E ++++G
Sbjct: 28 LFSGASLAAQN-LQLHYDAPANTFNEALPLGNGRMGVMVYGGVQQARYSLSEISMFSGSR 86
Query: 61 GDYTN-PDAPKALSDVRSLVDSGQYAEATAASVKLF---GHPADV----YQLLGDIELEF 112
D + +A L +R L+ G+ EA + + F G A+ YQ LG + L+F
Sbjct: 87 YDGADRKEAVNYLPKIRQLLLQGRNVEAEQLTNQHFTWSGEGANAHYGTYQGLGTLTLDF 146
Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
+ ++ YRR LD+ +AT+ V+Y+ V + RE F S PDQV+V +S +G+L+
Sbjct: 147 AANAAPVSD--YRRRLDIPSATSDVRYAQDGVRYRREMFVSAPDQVMVLHLSADRAGALN 204
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
F LD +G N ++M G ++ KG+ F+A + + G
Sbjct: 205 FVARLDRAERASVEGDGANGLLMRGEL---------DSGGSGKGLAFAARVRVIAP---G 252
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
+ ++VE +L+ ++ +DG DP + S + LQ + + S +
Sbjct: 253 ASMHADAHGIRVEHGTDVTVLISEATDYDG---FAGRHTTDPVAASATDLQRVASRSVAQ 309
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
L+ H+ D+ F R S+QL + +T+ R+ ++ DP L
Sbjct: 310 LHAAHVADFSSWFDRFSLQLG----------SVDNTRETMSMRARLDTYGASGDPGFAAL 359
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
FQ+ RYLLISSSRPG ANLQG+W E S W+ H N+N+EMNYW + P L E
Sbjct: 360 YFQYARYLLISSSRPGGLPANLQGLWAEGTSTPWNGDYHTNVNIEMNYWPAEPTGLGELV 419
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
+PLF L G+KTAQ Y A GWV+H T++W +A + W +W AWL
Sbjct: 420 QPLFALTASLQQPGAKTAQRYYGARGWVVHTLTNLWG-FTAPGAEASWGVWQGAPAWLSF 478
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDGYLETNPSTSPEHEFIAPD 530
H+W+HY YT DRDFL +R YP+L G A F D LIE H +L T PS+SPE+ +
Sbjct: 479 HIWDHYRYTGDRDFL-RRYYPVLRGAAQFYADVLIEEPSHH-WLVTAPSSSPENTVYMEN 536
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
G A + TMD +IR +F A+I A++ L + D E K RL P +I DG I
Sbjct: 537 GGKAAIVMGPTMDEELIRFLFGAVIEASQTLHVDADFRRELEAKR-ARLAPIQIGPDGRI 595
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
E+ + +++ EVHHRH+SHL+ LFPG+ I + K P L AA ++L RG++ GWS +K
Sbjct: 596 QEYLKPYREVEVHHRHVSHLWALFPGNQIDLAKTPKLAAAAARSLDVRGDDSTGWSEAYK 655
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHE-KHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
LWA L D A ++ LF + H G Y NLF A PPFQID NFG T+ +
Sbjct: 656 VNLWAHLGDGNRALHLLNVLFKPASRDTRLGHEWAGTYPNLFNAGPPFQIDGNFGATSGM 715
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
EML+QS L LLPALP D W G V+GL ARGG + + W G L E + S +
Sbjct: 716 VEMLMQSEPGQLDLLPALP-DAWPQGEVRGLHARGGFVIDMRWAKGKLVEASVRSLRGGD 774
Query: 770 DHDSFKTLHYRGTSVKVNLSAGKIYTFN 797
+ Y V ++ AG+ Y
Sbjct: 775 -----CKVRYGKRQVLLSTKAGQTYKLQ 797
>gi|167764888|ref|ZP_02437009.1| hypothetical protein BACSTE_03280 [Bacteroides stercoris ATCC
43183]
gi|167697557|gb|EDS14136.1| hypothetical protein BACSTE_03280 [Bacteroides stercoris ATCC
43183]
Length = 825
Score = 470 bits (1209), Expect = e-129, Method: Compositional matrix adjust.
Identities = 275/764 (35%), Positives = 422/764 (55%), Gaps = 54/764 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+ + +A+P+GNG LGAMV+G E +LNE+T+W G P + TNP A +AL
Sbjct: 27 LKLWYDSPARQWVEALPLGNGSLGAMVFGDPIHERFQLNEETVWGGSPHNNTNPKAKEAL 86
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA S G P YQ +G + L+F+ KY + Y R
Sbjct: 87 PRIRQLIFEGKNKEAQELCGPAICSQSANGMP---YQTVGTLHLDFEGIS-KY--DDYYR 140
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNH 184
+LD+ A A +++ + + RE F+S PD+++V +++ S+ S+SF + +
Sbjct: 141 DLDIEKAIATTRFTANGITYVRETFTSFPDRLLVIRLTASKKRSISFTAHYTTPYTENTE 200
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
++ N++ + G KAN ++ +G ++F+A+ +I ++ GT+ A D L+
Sbjct: 201 RRISSLNELQLNG---------KANDHEGIEGKVRFTAL--TRIENNGGTLKATSDSTLQ 249
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V+ ++ VL + S FIN D D + ++ +Y+ H+ YQK
Sbjct: 250 VKNANSVVLYV----SIGTNFINYKDISGDALKTAQQYMKQAGK-NYTKRKEAHIAAYQK 304
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L S I P+ RVK F + DP + L FQFGRYLLI
Sbjct: 305 YFNRVSLDLG-----------SNSQIKK-PTDRRVKEFSSTADPQMAALYFQFGRYLLIC 352
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN L WD +IN+EMNYW + L E EP + ++
Sbjct: 353 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAETTALPEMHEPFLQLVKEVA 412
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
I G ++A + Y GW +HH TDIW + A G + +WP AW C HLW+ Y ++ D
Sbjct: 413 IQGRESAAM-YGCRGWTLHHNTDIWRSTGAVDGPK-YGIWPTCNAWFCQHLWDRYLFSGD 470
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+++L + YP++ G F LD+L+ E + +L PS SPE+ + + +TM
Sbjct: 471 KNYLAE-VYPIMRGACEFYLDFLVREPQNNWLVVAPSYSPENSPSVNGKRDFVIVAGATM 529
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR-LRPTKIAEDGSIMEWAQDFKDPE 601
D ++ ++F I AA ++ NE L+++ + L P ++ G + EW +D+ +P+
Sbjct: 530 DNQMVYDLFHNTIQAATLM--NEHKSFTDSLQTVAKHLAPMQVGRWGQLQEWMEDWDNPQ 587
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HHRH+SHL+GL+PG I+ +P L +AA+K+L RG+ GWS+ WK LWARL D
Sbjct: 588 DHHRHVSHLWGLYPGRQISAYNSPVLFEAAKKSLIARGDHSTGWSMGWKVCLWARLLDGN 647
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
HAY+++ + E ++ GG Y NLF AHPPFQID NFG TA +AEMLVQS +
Sbjct: 648 HAYKLITEQLHPTTDERGQN--GGTYPNLFDAHPPFQIDGNFGCTAGIAEMLVQSHDGAI 705
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYS 764
+LLPALP + W G +KG++ RGG + + W+ G + V I S
Sbjct: 706 HLLPALP-NVWEHGTIKGIRCRGGFLLEEMKWEKGKVQTVTIAS 748
>gi|237719758|ref|ZP_04550239.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229451027|gb|EEO56818.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 811
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 279/763 (36%), Positives = 426/763 (55%), Gaps = 60/763 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAM++GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMIYGGIEREELQLNEETFWAGSPYNNNNPNAIHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PAVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y + +V +TR F+S D VI+ I S++ +L+F ++ + L + V N
Sbjct: 139 ENATTTTRYQMDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNCPLVHKVNVQ-N 197
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+Q+ + C GK + +G++ + E +I GT+ + EG++
Sbjct: 198 DQLTVT--CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D + + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSANESHRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L T S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TGKASQ-----LETPKRIENFGYGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDVDNPKDEH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA+
Sbjct: 573 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAF 632
Query: 665 RMVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+++K + L+ +H +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++
Sbjct: 633 QIIKNMIQLLPSDHLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVH 692
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPALP D W G VKGL ARG TV + WK+ L++ I SN
Sbjct: 693 LLPALP-DAWEEGSVKGLVARGNFTVDMDWKNNVLNKAIIRSN 734
>gi|212694638|ref|ZP_03302766.1| hypothetical protein BACDOR_04169 [Bacteroides dorei DSM 17855]
gi|237711097|ref|ZP_04541578.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|265750683|ref|ZP_06086746.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|423239195|ref|ZP_17220311.1| hypothetical protein HMPREF1065_00934 [Bacteroides dorei
CL03T12C01]
gi|212663139|gb|EEB23713.1| hypothetical protein BACDOR_04169 [Bacteroides dorei DSM 17855]
gi|229454941|gb|EEO60662.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|263237579|gb|EEZ23029.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|392646982|gb|EIY40688.1| hypothetical protein HMPREF1065_00934 [Bacteroides dorei
CL03T12C01]
Length = 814
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 291/797 (36%), Positives = 439/797 (55%), Gaps = 51/797 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NP+A + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPNALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRQLVFEGKYLEAQTLATEKIMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVS 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G A D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTAR---SQGGTQACRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYVTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L VT + RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSLDLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+I ++++ II+ A +L + + + + L + P +I G + EW D+ +P+ HR
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATHLEQRLKEMAPMQIGRWGQLQEWMTDWDNPQDVHR 586
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +HAY+
Sbjct: 587 HVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGDHAYK 646
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+QS +YLLP
Sbjct: 647 LITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLMQSHDGFIYLLP 703
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS-NYSNNDHDSFKTLHYRGTSV 784
ALP +W G V G+ ARGG + + WK+G + + + S N N S L +G
Sbjct: 704 ALP-AQWKEGSVNGIIARGGFELDLSWKNGKVSRLVVKSRNGGNCRLRSLNPLAGKGLRT 762
Query: 785 KVNLSAGKIYTFNRQLK 801
+ K+Y L+
Sbjct: 763 AKGENPNKLYAIPEILQ 779
>gi|345515268|ref|ZP_08794774.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|229434306|gb|EEO44383.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
Length = 814
Score = 469 bits (1208), Expect = e-129, Method: Compositional matrix adjust.
Identities = 284/764 (37%), Positives = 428/764 (56%), Gaps = 50/764 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ ++ PA+ +T+A+P+GNGRLGAMV+G E ++LNE+T+W G P + NP+A + +
Sbjct: 25 KLWYDEPAQVWTEALPLGNGRLGAMVFGNPGVEHIQLNEETIWAGRPNNNANPNALEYIP 84
Query: 74 DVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRE 127
VR LV G+Y EA T A+ K+ G P YQ GD+ + F H +Y++ Y RE
Sbjct: 85 KVRQLVFEGKYLEAQTLATEKIMAKTNSGMP---YQSFGDLHISFP-GHTRYSD--YYRE 138
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV 187
L L++A V+Y V V + RE +S DQV++ +++ S+ G ++ N +L + +
Sbjct: 139 LSLDSARTIVRYKVDGVTYQRETLTSFADQVVMVRLTASQPGKITCNANLTTPHQDVMVS 198
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLKVEG 246
++ + G ++ ++ KG ++F + + +G A D L +EG
Sbjct: 199 TEGEEVTLSG---------VSSWHEGLKGKVEFQGRMTAR---SQGGTQACRDGVLSIEG 246
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+D AV+ + +++F N D + + + L+ + Y H+D +++
Sbjct: 247 ADEAVIYISIATNF----TNYKDITGNQVERAKNYLRRAVSKDYVTSRKAHVDFFKQYMD 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RVS+ L VT + RV++F+ +D LV F+FGRYLLI SS+
Sbjct: 303 RVSLDLGIDKYAGVT------------TDMRVQNFKETKDDFLVATYFRFGRYLLICSSQ 350
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG Q ANLQGIWN+ L P+WDS NIN+EMNYW + NLSE EPL + +S G
Sbjct: 351 PGGQPANLQGIWNDKLFPSWDSKYTCNINVEMNYWPAEVTNLSELHEPLIQLIREVSETG 410
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
++A++ Y A GWV+HH TDIW + A K LWP GGAWLC HLWE Y YT D +F
Sbjct: 411 RESAKIMYGADGWVLHHNTDIWRVTGAI-DKAPSGLWPTGGAWLCRHLWERYLYTGDMEF 469
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYP+++ F + ++ E +L PS SPE+ +GK A + T+D
Sbjct: 470 L-RSAYPIMKEAGKFFDEIMVKEPLHNWLVVCPSNSPENTHAGSNGK-ATTAAGCTLDNQ 527
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+I ++++ II+ A +L + + + + L + P +I G + EW D+ +P+ HR
Sbjct: 528 LIFDLWNQIITTARLLGTDAE-FATHLEQRLKEMAPMQIGRWGQLQEWMTDWDNPQDVHR 586
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK LWARL D +HAY+
Sbjct: 587 HVSHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVCLWARLLDGDHAYK 646
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ LV E +K GG Y NLF AHPPFQID NFG TA + EML+QS +YLLP
Sbjct: 647 LITDQLTLVRNEKKK---GGTYPNLFDAHPPFQIDGNFGCTAGIVEMLMQSHDGFIYLLP 703
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
ALP +W G V G+ ARGG + + WK+G + + + S N
Sbjct: 704 ALP-AQWKEGSVNGIIARGGFELDLSWKNGKVSRLVVKSRNGGN 746
>gi|336251922|ref|YP_004585890.1| alpha-L-fucosidase [Halopiger xanaduensis SH-6]
gi|335339846|gb|AEH39084.1| Alpha-L-fucosidase [Halopiger xanaduensis SH-6]
Length = 786
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 281/760 (36%), Positives = 416/760 (54%), Gaps = 62/760 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + DA+P+GNGRLGAM +GG+ E ++ NE+TLW G + A + ++R
Sbjct: 11 YDEPADEWIDALPLGNGRLGAMAYGGLERERIQCNEETLWAGGHEEKVVEGASEHGEEIR 70
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
L G+Y EA + L G P + L +L + A YRRELDL
Sbjct: 71 QLCFEGEYEEAQRRCNEHLQGEPPGIRPYLPFCDLLIEQPGHDEAT-AYRRELDLADGCY 129
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
RV+Y + +TRE+F S PD V+V ++ S+ ++ LD + V+ N++++
Sbjct: 130 RVEYDLEGTTYTREYFVSAPDDVLVVRLECDGPRSIDASIRLDRDRCARAGVDEENRLLL 189
Query: 196 EGRCPGKRIPPKANANDDPKG--IQF---------SAILEIKISDDRGTISALEDKKLKV 244
G+ +P A+ G ++F A +E + DD G + + V
Sbjct: 190 RGQV--IDVPNTADMYQGSGGWGLRFEGRAAVRTSGASVEPNVDDDWGQSPS----AVTV 243
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+D ++ A++ FDG DP+ + + L++ + Y +L RH+DD++ L
Sbjct: 244 TGADAVTVVFAAATDFDG---------DDPSDATTATLEAAADRRYEELKRRHVDDHRAL 294
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F RVS++L P D D E + V + R DP LV+L FQ+GRYLL++S
Sbjct: 295 FDRVSLELG-DPVDAPID----ERLAAVRNGSR--------DPHLVQLYFQYGRYLLLAS 341
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPGT ANLQGIWNE+ P W S +++NLEMNYW + NL+EC EPL F+ +
Sbjct: 342 SRPGTLPANLQGIWNEEYDPPWHSCYTLDVNLEMNYWHAEVANLAECAEPLVAFVDSMRE 401
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +TA+ Y G+ H TD+W +++ W WPM AWLC +LW+HY ++ DR
Sbjct: 402 SGRRTAREYYDCDGFAAHVDTDLW-RTTVQTVDARWGHWPMAPAWLCRNLWDHYAFSGDR 460
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
LE YP+L+ A FLLD+L+E D G+L T PS SPE++F PDG+ A V TMD
Sbjct: 461 TDLET-IYPILKDAARFLLDFLVEHPDRGWLVTAPSASPENQFRTPDGQEATVCEGPTMD 519
Query: 544 MAIIREVFSAIISAAE---VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
+ + ++F+ I AA V + +++ V + +L RL P +I E G + EW +D++
Sbjct: 520 VQLATDLFTHCIEAATELGVADGADESFVADLSDALERLPPMQIGEHGQLQEWLEDYEAV 579
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARL 657
+ HRH+SHLFG +P IT +P L A +L++R E G GWS W AL+ARL
Sbjct: 580 DPGHRHVSHLFGFYPADVITRRDDPALADAVRTSLERRLEHGGGHTGWSCAWTIALFARL 639
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D + A V++L + Y +L +HPPFQID NFG A +AE+L+QS
Sbjct: 640 EDGDRALEAVRKLLS-----------ESTYDSLLDSHPPFQIDGNFGGAAGIAELLLQSH 688
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
++L LLPALP + W+ G V+GL+ARGG V + W DG L
Sbjct: 689 GDELRLLPALP-EAWTDGSVEGLRARGGLEVDLRWTDGRL 727
>gi|159127378|gb|EDP52493.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 745
Score = 469 bits (1207), Expect = e-129, Method: Compositional matrix adjust.
Identities = 282/763 (36%), Positives = 414/763 (54%), Gaps = 64/763 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA ++ +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA + L +R
Sbjct: 7 YQQPAGNWEEALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQNRVPHDAFECLPRLR 66
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G +AEA + F HP Y+ LG + L+F HL + YRR LD+ A
Sbjct: 67 SLIREGNHAEAEKLVRLAFFSHPISQRHYEPLGTLFLDF--GHLPECTQNYRRSLDIERA 124
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNHSYVNGNN 191
T RV+Y V+ RE +SNPD VI ++ S+ + ++ S L + + Y++
Sbjct: 125 TTRVEYEHKGVKVRREVIASNPDSVIAIRVQASQKTDFTLRLTRMSELQYETNEYLD--- 181
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ E R I P + K + +++++ ++D+ +++ + +K L V D A+
Sbjct: 182 DVTTEDRTITMHITPGGH-----KSNRACCMVKVRTAEDQDSVTQIGNKLL-VNAQD-AL 234
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+L+ A +++ D K +S+ +AL S +++ RH++DY+ L+ R+ +
Sbjct: 235 ILISAQTTY-----RCDDIDKKASSDLETALLH----STDEIWERHVNDYRSLYGRMELH 285
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS S D+ TD K + DP L+ L + RYLLIS SR G +V
Sbjct: 286 LSPSNCDMPTD----------------KRIKNSRDPGLIALYHNYCRYLLISCSRNGDKV 329
Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
A LQGIWN P W +NINL+MNYW + CNLS+C+ PLF L ++ +G +T
Sbjct: 330 LPATLQGIWNPSFHPAWGCKYTININLQMNYWPANICNLSDCEMPLFSLLERVAKSGEET 389
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
AQ Y GWV HH TDIWA +S + LWP+GGAWLC H+W+H+ +T D++FLE
Sbjct: 390 AQKMYGCRGWVAHHCTDIWADTSPGDTWMPATLWPLGGAWLCVHIWDHFRFTRDKEFLE- 448
Query: 490 RAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
R +P+L+GC FLLD+L+E G YL TNPS SPE+ F +G+ + ST+D+ I+
Sbjct: 449 RMFPILQGCVQFLLDFLVEDASGEYLVTNPSLSPENTFYEKNGERGVLCEGSTIDIQIVN 508
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
V SA + + E LE D L L +L RL P +I G + EWA D+ + E HRH+S
Sbjct: 509 AVLSAYLKSVEELEI-VDKLAPAALDALHRLPPLRIGSFGQLQEWASDYAEVEPGHRHVS 567
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWARLHDQEHAYR 665
HL+ L+PG TI+ E P + A TL +R G GWS W L ARL E +
Sbjct: 568 HLWALYPGDTISPETTPKIADACSVTLHRREAHGSGHTGWSRAWLINLHARLLAAEECAK 627
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY-LL 724
+ L NL HPPFQID NFG A + EML+QS + LL
Sbjct: 628 HIDLL-----------LAQSTLPNLLDTHPPFQIDGNFGAGAGILEMLLQSHEEGIIRLL 676
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE-VGIYSNY 766
PA P WSSG ++ + ARGG + W++G + + V +YS +
Sbjct: 677 PACP-RAWSSGSLRNICARGGFKLDFSWENGKIKDAVTVYSEF 718
>gi|338213645|ref|YP_004657700.1| alpha-L-fucosidase [Runella slithyformis DSM 19594]
gi|336307466|gb|AEI50568.1| Alpha-L-fucosidase [Runella slithyformis DSM 19594]
Length = 829
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 286/785 (36%), Positives = 419/785 (53%), Gaps = 79/785 (10%)
Query: 11 NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
NP ++ +N PAK + DA+P+GNGRLGAMV+G E ++LNE+T W+G P
Sbjct: 47 NPSTVSWYNAPAKKWEDALPVGNGRLGAMVFGRSGEERIQLNEETYWSGGPYSTVVKGGY 106
Query: 70 KALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
K L +++ LV +Y A L G+P + YQ L ++ L F + + Y+R
Sbjct: 107 KVLPEIQKLVFEEKYLAAHNLFGRHLMGYPVEQQKYQSLANLHLFFQNQD---STTEYKR 163
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
L+L + V Y + + R+ F+S PDQVIV +++ +SGS+SF +L + N ++
Sbjct: 164 WLNLESGITSVSYKSNGITYQRDVFASAPDQVIVIRLTADKSGSISFKANLRGV-RNQAH 222
Query: 187 VN-----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTI 234
N G++ +I+ G+ D G+ E +I + G
Sbjct: 223 SNYATDYFRMDPYGSDGLILTGKSA------------DYMGVAGKLKYEARIKAIPEGGR 270
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ L +E ++ L A+++F +N D + +P I++ SY+ +
Sbjct: 271 MKTDGVDLIIENANTVTLYFAAATNF----VNYKDVRANPHQRVEDYFARIKSKSYTSIL 326
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
L DY+ F RVS+QL + + P ER++ Q+ DPSL L +
Sbjct: 327 EAALADYKHFFDRVSLQLPTTENSFL------------PLPERIQKIQSSPDPSLSALSY 374
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
FGRYL+I+SSRPGT+ ANLQGIWN++++P WDS NIN +MNYW NLSEC EP
Sbjct: 375 NFGRYLMIASSRPGTEPANLQGIWNDNMNPDWDSKYTTNINTQMNYWPVESSNLSECAEP 434
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
L F+ L+ G++ A+ +Y A GWV H TD+W + +A W + +GGAWLCTHL
Sbjct: 435 LVRFIKELTDQGTQVAREHYGAKGWVFHQNTDLW-RVAAPMDGPTWGTFTVGGAWLCTHL 493
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEH--------- 524
WEHY YTMD FL K YPL++G F +D+L +G +L TNPSTSPE+
Sbjct: 494 WEHYQYTMDAAFL-KETYPLMKGSVQFFMDFLKPHPNGKWLVTNPSTSPENFPDGGGNKP 552
Query: 525 ---EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
E A + + S++DM I+ ++F I A+ +L N A V++V + +L P
Sbjct: 553 YFDEVTAGFREGTTICAGSSIDMQILFDLFGYFIEASAILGDN-SAFVQQVKVAREKLVP 611
Query: 582 TKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
+I DGS+ EW+ D+K E +HRH SH++GL+PG + ++ P L +A +K L++RG+
Sbjct: 612 PQIGRDGSLQEWSDDWKSLEKNHRHFSHMYGLYPGKVLYEKRTPALTEAYKKVLEERGDA 671
Query: 642 GPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFA--AHPPFQI 699
GWS WK ALWARL D A ++ K E S LFA P Q+
Sbjct: 672 STGWSRAWKMALWARLGDGNRANKIYKGFIK----------EQSCLS-LFALCGRAP-QV 719
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D FG TAA+ EML+QS + LLPALP D WSSG KG+ ARG + W++ L +
Sbjct: 720 DGTFGATAAITEMLLQSHDGFIKLLPALP-DDWSSGAFKGVCARGAFELDYVWENKQLKQ 778
Query: 760 VGIYS 764
V I S
Sbjct: 779 VKITS 783
>gi|299147305|ref|ZP_07040370.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298514583|gb|EFI38467.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 811
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 274/762 (35%), Positives = 416/762 (54%), Gaps = 58/762 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NGSGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANTLNFTIAYNFPLVHKVNVQND 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+ C GK + +G++ + E +I + L++ A
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNSTLRPGGNTLQINEGTEA 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L + A++++ +N + D + + L+ + Y H+ Y+K F RV +
Sbjct: 246 TLYISAATNY----VNYQNVSADESHRTSEYLKRATQIPYEKALKSHIAYYKKQFDRVRL 301
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L I + + +R+++F ED ++ LLF +GRYLLISSS+PG Q
Sbjct: 302 TLPTG------------KISQLETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGGQ 349
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++TA
Sbjct: 350 PANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAETA 409
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++FL
Sbjct: 410 RTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEFL 465
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 466 -KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDNQ 514
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
I + + A+ + + + + + ++L +L P +I + + EW +D + + HR
Sbjct: 515 IAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDIDNSKDEHR 573
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK WAR+ D HA++
Sbjct: 574 HISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWKVNFWARMLDGNHAFQ 633
Query: 666 MVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
++K + L+ +H +++ G Y N+ AHPPFQID NFG+TA VAEML+QS ++L
Sbjct: 634 IIKNMIQLLPNDHLAKEYPNGRTYPNMLDAHPPFQIDGNFGYTAGVAEMLLQSHDGAVHL 693
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LPALP D W G VKGL ARG TV + WK+ L++ I SN
Sbjct: 694 LPALP-DAWEEGSVKGLVARGNFTVDMDWKNNVLNKAIIRSN 734
>gi|70999286|ref|XP_754362.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66851999|gb|EAL92324.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 745
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 281/763 (36%), Positives = 413/763 (54%), Gaps = 64/763 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA ++ +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA + L +R
Sbjct: 7 YQQPAGNWEEALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQNRVPHDAFECLPRLR 66
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G +AEA + F HP Y+ LG + L+F HL + YRR LD+ A
Sbjct: 67 SLIREGNHAEAEKLVRLAFFSHPISQRHYEPLGTLFLDF--GHLPECTQNYRRSLDIERA 124
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNHSYVNGNN 191
T RV+Y V+ RE +SNPD VI ++ S+ + ++ S L + + Y++
Sbjct: 125 TTRVEYEHKGVKVRREVIASNPDSVIAIRVQASQKTDFTLRLTRMSELQYETNEYLD--- 181
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ E R I P + K + +++++ ++D+ +++ + +K L V D A+
Sbjct: 182 DVTTEDRTITMHITPGGH-----KSNRACCMVKVRTAEDQDSVTQIGNKLL-VNAQD-AL 234
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+L+ A +++ D K +S+ +AL S +++ RH++DY+ L+ R+ +
Sbjct: 235 ILISAQTTY-----RCDDIDKKASSDLETALLH----STDEIWERHVNDYRSLYGRMELH 285
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS S D+ TD K + DP L+ L + RYLLIS SR G +
Sbjct: 286 LSPSNCDMPTD----------------KRIKNSRDPGLIALYHNYCRYLLISCSRNGDKA 329
Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
A LQGIWN P W +NINL+MNYW + CNLS+C+ PLF L ++ +G +T
Sbjct: 330 LPATLQGIWNPSFHPAWGCKYTININLQMNYWPANICNLSDCEMPLFSLLERVAKSGEET 389
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
AQ Y GWV HH TDIWA +S + LWP+GGAWLC H+W+H+ +T D++FLE
Sbjct: 390 AQKMYGCRGWVAHHCTDIWADTSPGDTWMPATLWPLGGAWLCVHIWDHFRFTRDKEFLE- 448
Query: 490 RAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
R +P+L+GC FLLD+L+E G YL TNPS SPE+ F +G+ + ST+D+ I+
Sbjct: 449 RMFPILQGCVQFLLDFLVEDASGEYLVTNPSLSPENTFYEKNGERGVLCEGSTIDIQIVN 508
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
V SA + + E LE D L L +L RL P +I G + EWA D+ + E HRH+S
Sbjct: 509 AVLSAYLKSVEELEI-VDKLAPAALDALHRLPPLRIGSFGQLQEWASDYAEVEPGHRHVS 567
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWARLHDQEHAYR 665
HL+ L+PG TI+ E P + A TL +R G GWS W L ARL E +
Sbjct: 568 HLWALYPGDTISPETTPKIADACSVTLHRREAHGSGHTGWSRAWLINLHARLLAAEECAK 627
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY-LL 724
+ L NL HPPFQID NFG A + EML+QS + LL
Sbjct: 628 HIDLL-----------LAQSTLPNLLDTHPPFQIDGNFGAGAGILEMLLQSHEEGIIRLL 676
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE-VGIYSNY 766
PA P WSSG ++ + ARGG + W++G + + V +YS +
Sbjct: 677 PACP-RAWSSGSLRNICARGGFKLDFSWENGKIKDAVTVYSEF 718
>gi|326201460|ref|ZP_08191331.1| coagulation factor 5/8 type domain protein [Clostridium
papyrosolvens DSM 2782]
gi|325988060|gb|EGD48885.1| coagulation factor 5/8 type domain protein [Clostridium
papyrosolvens DSM 2782]
Length = 1026
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 284/794 (35%), Positives = 414/794 (52%), Gaps = 70/794 (8%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
F A+P+GNGR+GAMV+G P E + LNE T W+ PG+ A +L + + +GQ
Sbjct: 76 FYKALPLGNGRIGAMVYGNYPDERIDLNEATFWSSGPGNNNRAGAANSLKTAQDQLFAGQ 135
Query: 84 YAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
Y + K + G YQ +GD++L F S + Y R+LD+NT Y+
Sbjct: 136 YTNGSTTIAKSMIGGGEAKYQSIGDLKLSFGHSSV----SNYSRQLDMNTGVVSSDYTYN 191
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN--GNNQIIMEGRCP 200
++ RE F S PDQ++VTKI+ S GS+S +S L V+ GN+ ++M G
Sbjct: 192 GKKYHRESFVSYPDQIMVTKITCSSPGSISLTAGYESSLSGQYTVSTSGNDTLVMNGH-- 249
Query: 201 GKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
D GI ++ K+ + G++SA + ++ V +D V+L +
Sbjct: 250 ----------GDSDNGISYAVWFSTRSKLINTNGSVSA-NNNQISVSNADSVVIL----T 294
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
S +IN D ++ + + + SY L H+ DYQ LF RV + L S +
Sbjct: 295 SIRTNYINYKTCNGDEKGKATTDITNASAKSYDTLLNNHVADYQSLFKRVDVDLGGSGSE 354
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
++ P ++R+ F + DP L ++LFQ+GRYL+IS+SR +Q NLQGIW
Sbjct: 355 -----------NSKPMSQRISEFGSTNDPKLAKVLFQYGRYLMISASRD-SQPMNLQGIW 402
Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LAS 437
N+ +P W NIN EMNYW + NL+EC EP + L G++TA+ +Y +++
Sbjct: 403 NKFRNPAWGCKMTTNINYEMNYWPAFTTNLAECFEPFVEKAKALQAPGNETARAHYNISN 462
Query: 438 GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
GWV+HH TD+W +++ G+ W WP G W+ L++ YN+ D +L + YP+++G
Sbjct: 463 GWVLHHNTDLWNRTAPIDGE--WGFWPTGAGWVSNMLYDAYNFNQDTAYLNE-IYPVIKG 519
Query: 498 CASFLLDWL----IEGHDGYLETNPSTSPEHEFIAP----DGKLACVSYSSTMDMAIIRE 549
A FL + I G + Y P TSPE + P G+ A SY TMD I RE
Sbjct: 520 AADFLQTLMQSKSINGQN-YQVICPGTSPE---LTPPGNSGGQGAYNSYGVTMDNGISRE 575
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
+F A+I AA +L N D+ L+S + +++P I G + EWA D+ +RH+S
Sbjct: 576 LFKAVIQAAGIL--NIDSSFRSTLQSKVSQIKPNTIGSWGQLQEWAYDWDSQSEKNRHIS 633
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVK 668
+ LFPG I P + A K+L RG+ G GWS WK WARL D HAY +VK
Sbjct: 634 FAYDLFPGLEINKRNTPSIANAVIKSLNTRGDVGTGWSEAWKLNCWARLEDGTHAYNLVK 693
Query: 669 RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALP 728
L V+ +G LY NL+ AHPPFQID NFGFT+ +AEML+QS N++ LLPALP
Sbjct: 694 LLITPVNK------DGRLYDNLWDAHPPFQIDGNFGFTSGIAEMLLQSHNNEIQLLPALP 747
Query: 729 WDKWSSGCVKGLKARGGETVS-ICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVN 787
+WS+G GL ARG TV+ + W +G L I SN N + Y ++
Sbjct: 748 -SQWSTGHADGLCARGNFTVTKMNWANGVLTGATIKSNSGN-----VCNVRYGNKTISFP 801
Query: 788 LSAGKIYTFNRQLK 801
G Y N L+
Sbjct: 802 TKKGYTYQVNGSLQ 815
>gi|375145023|ref|YP_005007464.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361059069|gb|AEV98060.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 834
Score = 467 bits (1202), Expect = e-129, Method: Compositional matrix adjust.
Identities = 285/772 (36%), Positives = 430/772 (55%), Gaps = 69/772 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ + PA+ + +A+P+GNG+LG MV+GG E + ++EDTLWTG P AP+ L
Sbjct: 46 LELWYQKPAEKWLEALPVGNGKLGGMVFGGPVQERISISEDTLWTGGPYQPAVEVAPETL 105
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEET--YRREL 128
+ +R L G++AEA +L G P YQ +G+++L F D ET YRR L
Sbjct: 106 ASIRKLSFEGKFAEAQELVKQLQGKPHRQAAYQTVGEVQLNFSD-----ITETSDYRRSL 160
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYV 187
+L A V+++ + + F+S PD VIVT+I+ + + ++ SL D +
Sbjct: 161 NLQNGVAGVQFTANGTFYKHKTFASYPDHVIVTRITAGKP--IHLTITCTSLHPDKKLTI 218
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
GNN +IM+G+ + P + + + ++I RG + D ++V G+
Sbjct: 219 AGNNTLIMDGKNGDLVVEGDGTI---PAALTWQCRVLVQI---RGGVQTAVDNGIQVIGA 272
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D ++L A++S+ + +D P + ++ SY L+ HL DYQ LF++
Sbjct: 273 DEVLILTTAATSY----VRYNDVSGKPDQLCAAVIKKCIAKSYDILFEAHLKDYQPLFNK 328
Query: 308 VSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
V ++L+ +P ++ P+ ER+K+F T DPSL L FQ+GRYLL++SSR
Sbjct: 329 VKLKLTNLAPSNL-------------PTTERIKNFATGNDPSLAALYFQYGRYLLLTSSR 375
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG+Q ANLQG WN+ LS +W VNIN EMNYW + NL+ C+ PL + + L+I G
Sbjct: 376 PGSQPANLQGRWNDSLSASWGGKYTVNINTEMNYWPAQKTNLASCELPLLELVKDLAITG 435
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TAQ Y A GWV HH TD+W +S+A + WP GGAWLC HL++HY Y+ D +
Sbjct: 436 QITAQKTYHARGWVCHHNTDLW-RSTAPIDSAFFGQWPTGGAWLCNHLYQHYLYSGDTAY 494
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS--STMD 543
L++ YPL++G A F D L+ E G+ T+PS SPE +G+ VS S TMD
Sbjct: 495 LQE-LYPLMKGSARFFFDTLVQEPKHGWYVTSPSMSPE------NGRAKGVSNSPGPTMD 547
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWAQ--DFKDP 600
M I+RE+F+ +AA VL+K+ D +K + +L P +I + G + EW D +
Sbjct: 548 MQILRELFTHCATAAAVLKKDAD--FQKACNDMVFKLAPDQIGKGGQLQEWLDDVDMESD 605
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG--EEGPGWSITWKTALWARLH 658
+ HRH+S L+GLFPG+ IT ++ L AA K + RG EG GW++ W+ LWARL
Sbjct: 606 KYEHRHMSPLYGLFPGYEITSDRTA-LFAAAHKLTEMRGFFGEGMGWALAWRLNLWARLQ 664
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D + +++V +L+ + E+ NLF P Q+D NFG T+ + EML+QS
Sbjct: 665 DAGNCWKLVN---SLISTKTEQ--------NLF-DKPHIQLDGNFGGTSGITEMLLQSHA 712
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
++LLPALP +KWS G + GL A+GG E + WK+ + + I S N
Sbjct: 713 GAVHLLPALP-EKWSEGALSGLCAQGGFEITGLEWKNSRITTLKIRSTLGGN 763
>gi|149197929|ref|ZP_01874977.1| hypothetical protein LNTAR_21070 [Lentisphaera araneosa HTCC2155]
gi|149138841|gb|EDM27246.1| hypothetical protein LNTAR_21070 [Lentisphaera araneosa HTCC2155]
Length = 765
Score = 467 bits (1201), Expect = e-128, Method: Compositional matrix adjust.
Identities = 289/806 (35%), Positives = 423/806 (52%), Gaps = 95/806 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A P GNGRLGAMV+G + E + LN+DTL+ G D NPD L +R L+ G+ +E
Sbjct: 19 AFPAGNGRLGAMVFGDIDEERIALNDDTLYNGGQRDRFNPDCLPNLDCIRQLIFDGKLSE 78
Query: 87 ATAASVK-LFGHPADV--YQLLGDIEL---------------EFDDSHLKYAE------E 122
A A + + + G P + Y+ L D+ + FD L Y +
Sbjct: 79 AEALTQEAVTGLPPIMRNYEPLADLLISQKYSKEAYKQVDPNNFDPMDLAYGKIYQAAFS 138
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---- 178
YR+ LDL + ++ V +++ RE SS PD +I ++S SE S++ + ++
Sbjct: 139 DYRKSLDLENSIITTEFEVAGIKYKRELISSFPDDLIYLRLSASEKKSINVKLRIERGDA 198
Query: 179 SLLDNHSYVNGN----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
++ Y + N + +EGR +GI F A L ++ +G
Sbjct: 199 AMYSTRHYDKVSSPVENSLFIEGRTGSN------------EGIDFVAGLRTQV---QGGS 243
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ L ++ +D V+ + +S + P + +L+ +N + ++Y
Sbjct: 244 CEKIGESLIIKDADEVVIAICGHTSV---------RQNSPMTSLKKSLE--KNFDWQEVY 292
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELL 353
RH +DYQKL+ RV ++++ +EN+ P+ ER++ Q ++ D L +L
Sbjct: 293 LRHREDYQKLYKRVKLEIAHQ---------DDENL---PTDERLRKAQNNQSDVVLDQLY 340
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
F FGRYLLIS SRPG+ ANLQGIWN+ SP+W S +NIN++MNYW + CNLSEC E
Sbjct: 341 FNFGRYLLISCSRPGSMTANLQGIWNDSFSPSWGSKYTININIQMNYWPAEVCNLSECHE 400
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLFD L L ING +TA+ Y G+V HH TD + V + WPMGGAWL H
Sbjct: 401 PLFDMLEKLHINGQETAKKMYNCRGFVCHHNTDNTYDTYPTDRNVTASYWPMGGAWLALH 460
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T DRDFL K Y ++ A F +D+L E G L T+PS SPE+ ++ P+G+
Sbjct: 461 LWEHYKFTQDRDFLSK-YYQIIHDAALFFVDFLCENPKGQLVTSPSVSPENTYLLPNGEY 519
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ TMD +IIRE+ A A+ +L K D + +L LP P +I + G IMEW
Sbjct: 520 GTICAGPTMDNSIIREIILATQEASRLLNKTLDQDYDGILAKLP---PLEIGKHGQIMEW 576
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWK 650
++D+ + E HRH+S LF L PG+ I ++KNPD +AA+ TL +R +G GWS W
Sbjct: 577 SEDWDEIEQGHRHISQLFALHPGNEIDVDKNPDFAQAAKITLDRRLADGGGHTGWSRAWI 636
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
+ARL + + AY+ L + H NLF HPPFQID NFG TAAVA
Sbjct: 637 INFFARLRNPQKAYKNFHAL--------QSH---STLPNLFDDHPPFQIDGNFGGTAAVA 685
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNND 770
EML+QS + LLP LP +W++G V GL+ARG V I W++ + + S D
Sbjct: 686 EMLLQSHQGRIDLLPCLP-KQWATGRVSGLRARGSVQVDIEWQNEKVTSFQLLS-----D 739
Query: 771 HDSFKTLHYRGTSVKVNLSAGKIYTF 796
D T+ + + L A + Y +
Sbjct: 740 FDQEVTVTFNSQKQVIKLQAKEPYQY 765
>gi|220928668|ref|YP_002505577.1| family 6 carbohydrate binding protein [Clostridium cellulolyticum
H10]
gi|219998996|gb|ACL75597.1| Carbohydrate binding family 6 [Clostridium cellulolyticum H10]
Length = 1164
Score = 466 bits (1199), Expect = e-128, Method: Compositional matrix adjust.
Identities = 285/796 (35%), Positives = 413/796 (51%), Gaps = 70/796 (8%)
Query: 22 KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDS 81
+ F A+P+GNGR+GAMV+G P E + LNE T W+ PG+ A L + + +
Sbjct: 74 ESFYKALPLGNGRIGAMVYGNYPDERIDLNEATFWSSGPGNNNRAGAANFLKTAQDQLFA 133
Query: 82 GQYAEATAA-SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
GQY +A + + G YQ +GD++L F S + Y R+LD+NT Y+
Sbjct: 134 GQYKTGSATIANNMIGGGEAKYQSIGDLKLSFGHSSV----SNYSRQLDMNTGVVSSDYT 189
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN--GNNQIIMEGR 198
++ RE F S PDQV+VTKI+ S GS+S +S L V+ GN+ ++M G
Sbjct: 190 YNGKKYHRESFVSYPDQVMVTKITCSSPGSISLTAGYESSLTGQYTVSTSGNDTLVMNGH 249
Query: 199 CPGKRIPPKANANDDPKGIQFSAILEI--KISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
D GI ++ KI + G++SA + ++ V +D V+L
Sbjct: 250 ------------GDSDNGISYAVWFSTRSKIINSNGSVSA-NNNQISVSNADSVVIL--- 293
Query: 257 SSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSP 316
+S F+N D ++ + + + SY LY H+ DYQ LF RV + L S
Sbjct: 294 -TSIRTNFVNYKTCNGDEKGKATTDIANASAKSYDTLYNNHVTDYQNLFKRVDVDLGGSG 352
Query: 317 KDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQG 376
+ + P +R+ F T DP L ++LFQ+GRYL+IS+SR +Q NLQG
Sbjct: 353 SE-----------NGKPMGQRISEFGTTNDPKLAKVLFQYGRYLMISASRD-SQPMNLQG 400
Query: 377 IWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-L 435
IWN+ +P W NIN EMNYW + NL+EC EP L G++TA+V+Y +
Sbjct: 401 IWNKFRNPAWGCKMTTNINYEMNYWPAFTTNLAECFEPFVKKAKELQAPGNETARVHYNI 460
Query: 436 ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
++GWV+HH TD+W +++ G W WP G W+ L++ Y++ D +L + YP++
Sbjct: 461 SNGWVLHHNTDLWNRTAPIDGD--WGFWPTGAGWVSNMLFDAYSFNQDTVYLNE-IYPVI 517
Query: 496 EGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAP----DGKLACVSYSSTMDMAII 547
+G A FL + I G + Y PSTSPE + P G+ A SY TMD I
Sbjct: 518 KGAADFLQTLMQSKSINGQN-YQVICPSTSPE---LTPPGTSGGQGAYNSYGVTMDNGIS 573
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
RE+F +I A+++L N D+ L S + +++P + G + EWA D+ +RH
Sbjct: 574 RELFKDVIQASKIL--NIDSSFRSTLASKVSQIKPNTVGSWGQLQEWAYDWDSQSEKNRH 631
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
+S + LFPG I P + A K+L RG+ G GWS WK WARL D H+Y +
Sbjct: 632 ISFAYDLFPGLEINKRNTPAIASAVSKSLNTRGDVGTGWSEAWKLNCWARLEDGAHSYNL 691
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
VK L V +G LY NL+ AHPPFQID NFGFT+ +AEML+QS N++ LLPA
Sbjct: 692 VKLLITPVSK------DGRLYDNLWDAHPPFQIDGNFGFTSGIAEMLLQSHNNEIQLLPA 745
Query: 727 LPWDKWSSGCVKGLKARGGETVS-ICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVK 785
LP +WS+G GL ARG TV+ + W +G L + I SN N + Y ++
Sbjct: 746 LP-SQWSTGHANGLCARGNFTVTKMNWANGVLTDATIKSNSGN-----VCNVRYGNKTIS 799
Query: 786 VNLSAGKIYTFNRQLK 801
G Y N L+
Sbjct: 800 FPTKKGYTYQLNGSLQ 815
>gi|404448807|ref|ZP_11013799.1| hypothetical protein A33Q_05728 [Indibacter alkaliphilus LW1]
gi|403765531|gb|EJZ26409.1| hypothetical protein A33Q_05728 [Indibacter alkaliphilus LW1]
Length = 778
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 280/786 (35%), Positives = 417/786 (53%), Gaps = 60/786 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKALSDV 75
+ PA + +A+P+GNGRLGAMV+G E ++LNED+LW G P D+ P L+ +
Sbjct: 29 YEQPADKWEEALPLGNGRLGAMVFGRTDVERIQLNEDSLWPGGPNDWGLAQGKPDDLACI 88
Query: 76 RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
R L+ G+ +A + V LF + +Q +GD+ LE + Y+R LDL+ A
Sbjct: 89 RELLVKGENKKADSLMVALFSRKSITRSHQTMGDLWLELGHQDIS----NYQRSLDLDKA 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN-----HSYVN 188
A V Y EF ++ +S DQ I+ +I+ + L+ + LD D+
Sbjct: 145 LATVTYQYEGYEFEQKAIASAKDQGIIIQITTTHPKGLNGKIRLDRPEDDGYPTVKISTP 204
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
NN + M+G ++ + G++F TI+ LE++ K+EG
Sbjct: 205 ANNSLQMDGEVTQRKGQIDSKPAPILHGVRFQ------------TIALLENEGGKLEGKG 252
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A+ + + N S D ++ + L +++ L++++L RH D+Q LF RV
Sbjct: 253 DAIWIENVKTLSIKLVANTSFYHTDFRGKNQADLMALKELNFAELQKRHQKDHQGLFRRV 312
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
+ QL E++IDT+P+ R+++ + D L +LLF +GRYLLI SSRP
Sbjct: 313 NFQLG------------EKSIDTIPTDRRIENIKAGATDLHLEKLLFDYGRYLLIGSSRP 360
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQGIWN+ ++ W++ H+NIN++MNYW + NLSE +P F+F L +G
Sbjct: 361 GTLPANLQGIWNQHIAAPWNADYHMNINMQMNYWPAEVTNLSELHDPFFEFTDALIPSGQ 420
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y G H TD+W + + W W G W+ H WE Y +T D +FL
Sbjct: 421 KTAKETYGMRGAAFAHGTDLWKMTFLQAAQAYWGSWLGAGGWMMQHYWERYLFTQDVEFL 480
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
++R P+ E +F DW++ DG L ++PSTSPE+ FI +G A + + MD I
Sbjct: 481 KERFIPVAEEIVAFYADWIVPHPLDGKLASSPSTSPENSFINSNGDHAASTIGAAMDQQI 540
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQDFKDPEVHHR 605
I EVF I+A E+L D L++++ + RLR ++ DG +MEW Q++K+ E HR
Sbjct: 541 IAEVFDNYINAVELLGIQSD-LLQEIKEKRSRLRSGLQVGSDGRLMEWDQEYKETEKGHR 599
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEH 662
H+SHL+ PG+ +T + P+L A +TL R G G GWS W ARL D E
Sbjct: 600 HMSHLYAFHPGNAVTKTQTPELFDAVRRTLDYRLEHGGAGTGWSRAWLINFSARLMDGEM 659
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
A+ V++L + LY NLF AHPPFQID NFG+TA +AEML+QS +
Sbjct: 660 AHEHVRKLIEI-----------SLYPNLFDAHPPFQIDGNFGYTAGIAEMLLQSHDGFIE 708
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLPALP WS G ++GLKARG + I W +G L + I S N + Y+G
Sbjct: 709 LLPALP-SIWSEGKIEGLKARGNFNIDIEWSNGTLTKASIMSPLGGN-----ALIRYKGK 762
Query: 783 SVKVNL 788
++V L
Sbjct: 763 EIEVVL 768
>gi|375144807|ref|YP_005007248.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361058853|gb|AEV97844.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 780
Score = 466 bits (1198), Expect = e-128, Method: Compositional matrix adjust.
Identities = 283/777 (36%), Positives = 425/777 (54%), Gaps = 64/777 (8%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
T PL++ ++ PA + + +P+GNGRLG M GGV E + LN+ TLW+G P D N A
Sbjct: 27 TNKPLRLWYDKPAAQWEETLPLGNGRLGMMPDGGVLQENIVLNDITLWSGAPQDANNYKA 86
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEFD-DSHLKY 119
+ L +++ L+ G+ EA A K F P +Q LG + + F+ D
Sbjct: 87 NQKLPEIQKLLLEGKNDEAQALINKDFICTGKGSGAEPFGCFQTLGRLGIAFNYDGPANA 146
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
A Y R+L LN A A Y VG+V + RE+F+S + V + K++ S +G L+F VSL S
Sbjct: 147 AFTNYSRQLSLNDAAAACTYKVGDVTYNREYFTSFGNDVGIIKLTASAAGKLNFEVSL-S 205
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+ + N++ M G+ D KG+Q+ A++ K++ G++SA +
Sbjct: 206 RPEKATVTVAGNKLEMAGQLEN---------GTDGKGMQYVALVSAKLTG--GSLSAAGN 254
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
K L V+ + A+L A +S+ D + L ++Y +HL+
Sbjct: 255 K-LVVKNATKAILFFSAKTSY---------KDADYRQHAQQLLDKAMLVAYDAEKKKHLN 304
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELLFQFG 357
+Y KLF+R+ + L S D +P+ +R+ F T D L L +Q+
Sbjct: 305 NYGKLFNRLQVDLGSS------------GADELPTDQRLDKFYNATTPDNRLTVLFYQYS 352
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYL ISS+R G NLQG+W ++ W+ H+++N++MN+W P NLSE PL D
Sbjct: 353 RYLSISSTRVGLLPPNLQGLWAHEVHTPWNGDYHLDVNVQMNHWGVEPANLSELNLPLAD 412
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ + +G KTA+ Y A GWV H T+ W + W + G WLC +LW+H
Sbjct: 413 LVKEMGPHGEKTAKAYYNARGWVAHVITNPWLFTEPGE-SASWGVTKAGSGWLCNNLWDH 471
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDG-KLAC 535
Y ++ D ++L K+ YP+L+G A F D LI+ + G+L T PS+SPE+ F PDG K +
Sbjct: 472 YTFSNDLNYL-KKIYPVLKGSALFYSDILIKDPETGWLVTAPSSSPENWFYMPDGSKQSS 530
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPTKIAEDGSIME 592
+ +T+D IIRE+F+ +I+A+E L +E L EK LK +P +I+ DG +ME
Sbjct: 531 ICMGATIDNQIIRELFNNVITASEQLHIDEPFRKELKEK-LKQIPP--AAQISADGRVME 587
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W +D+K+ + HRH+SHL+GL+P IT + P +A +K+L RG++GP WSI +K
Sbjct: 588 WLKDYKEADPQHRHISHLYGLYPASLITPSQTPAFAEACKKSLNVRGDDGPSWSIAYKQL 647
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFE----GGLYSNLFAAHPPFQIDANFGFTAA 708
WARLHD AY++ + ++ P H+ GG+Y NL +A PPFQID NFG A
Sbjct: 648 FWARLHDGNRAYKLFRE---IMKPTHKTGINYGAGGGVYPNLLSAGPPFQIDGNFGAGAG 704
Query: 709 VAEMLVQSTLNDLYLLPALPWDKWSS-GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+AEML+QS + LPA+P D W + G VKG+KARG TV WKDG + +YS
Sbjct: 705 IAEMLLQSHEGYINFLPAIP-DVWKAEGSVKGMKARGNITVDFSWKDGVVTGYKLYS 760
>gi|423330223|ref|ZP_17308007.1| hypothetical protein HMPREF1075_00020 [Parabacteroides distasonis
CL03T12C09]
gi|409231839|gb|EKN24687.1| hypothetical protein HMPREF1075_00020 [Parabacteroides distasonis
CL03T12C09]
Length = 809
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 279/777 (35%), Positives = 416/777 (53%), Gaps = 55/777 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L++ + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADTDR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G A D L V + A++L+ + + FD KD +S+ L
Sbjct: 246 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGAGQSLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L R +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGRGERD------------HLPINERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 404 TNLSELHLPLIEWTKLQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + STMD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 522 AYKMPNGSVVSICAGSTMDNQIVRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 580
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
+DG IMEW + +++ E HRH+SHL+GL+PG+ I+IE P+L +AA K+L+ RG++ G
Sbjct: 581 GKDGRIMEWLEPYEEVEPTHRHVSHLYGLYPGNEISIEHTPELAEAARKSLEVRGDQSTG 640
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANF 703
WS+ WK WARL D +HAY+++ L EH K + GG Y NLF AHPPFQID NF
Sbjct: 641 WSMAWKINFWARLQDGDHAYKLLGDLLRPCVDEHTKEVKGGGSYPNLFCAHPPFQIDGNF 700
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
G TA +AEML+QS + LPALP W +G GLK R G VS W +G L E
Sbjct: 701 GGTAGIAEMLIQSQTGLIEFLPALP-SAWKNGSFSGLKVRNGGEVSAKWTEGLLTEA 756
>gi|256840971|ref|ZP_05546478.1| glycoside hydrolase, family 95 [Parabacteroides sp. D13]
gi|298375740|ref|ZP_06985696.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_19]
gi|256736814|gb|EEU50141.1| glycoside hydrolase, family 95 [Parabacteroides sp. D13]
gi|298266777|gb|EFI08434.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_19]
Length = 811
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 279/777 (35%), Positives = 416/777 (53%), Gaps = 55/777 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 18 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 77
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 78 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 137
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L++ + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 138 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADTDR 197
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 198 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 247
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G A D L V + A++L+ + + FD KD +S+ L
Sbjct: 248 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGAGQSLEKYLSQAE 297
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L R +D +P ER+ +F D+
Sbjct: 298 SKDFSTLRREHTLAYRSLFDRVSLDLGRGERD------------HLPINERLAAFAQDKN 345
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 346 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 405
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 406 TNLSELHLPLIEWTKLQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 464
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 465 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 523
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + STMD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 524 AYKMPNGSVVSICAGSTMDNQIVRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 582
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
+DG IMEW + +++ E HRH+SHL+GL+PG+ I+IE P+L +AA K+L+ RG++ G
Sbjct: 583 GKDGRIMEWLEPYEEVEPTHRHVSHLYGLYPGNEISIEHTPELAEAARKSLEVRGDQSTG 642
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANF 703
WS+ WK WARL D +HAY+++ L EH K + GG Y NLF AHPPFQID NF
Sbjct: 643 WSMAWKINFWARLQDGDHAYKLLGDLLRPCVDEHTKEVKGGGSYPNLFCAHPPFQIDGNF 702
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
G TA +AEML+QS + LPALP W +G GLK R G VS W +G L E
Sbjct: 703 GGTAGIAEMLIQSQTGLIEFLPALP-SAWKNGSFSGLKVRNGGEVSAKWTEGLLTEA 758
>gi|290962265|ref|YP_003493447.1| hypothetical protein SCAB_79571 [Streptomyces scabiei 87.22]
gi|260651791|emb|CBG74917.1| putative secreted protein [Streptomyces scabiei 87.22]
Length = 945
Score = 465 bits (1196), Expect = e-128, Method: Compositional matrix adjust.
Identities = 284/753 (37%), Positives = 410/753 (54%), Gaps = 53/753 (7%)
Query: 13 LKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
L + ++ PA + A+PIGNGRLGAMV+G V +E L+LNEDT+W G P D N
Sbjct: 42 LALWYDKPAGADWLRALPIGNGRLGAMVFGNVDTERLQLNEDTVWAGGPYDSANTRGAAN 101
Query: 72 LSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRREL 128
++++R V + Q+ A + + G PA YQ +G++ L F + Y+R L
Sbjct: 102 IAEIRRRVFADQWGPAQDLINQTMLGSPAGQLAYQPVGNLLLSFGSA---TGASQYKRTL 158
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL TATA Y++ V + RE F DQVIV +++ + +++ + + DS
Sbjct: 159 DLTTATALTTYALNGVRYQREVFVGARDQVIVVRLTADRANAITCSATFDSPQRTTLSSP 218
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
I ++G ++F A+ + GT+S+ L+V G+
Sbjct: 219 DGATIALDG--------TSGTMEGITGRVRFLALAHAAATG--GTVSS-SGGTLRVSGAT 267
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+L+ SS+ ++ ++ D + L + R++ L +RH D+Q LF RV
Sbjct: 268 SVTVLVSIGSSY----VDFRNTDGDHRGIARRHLDAARDIDIDALRSRHRTDHQALFDRV 323
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
SI L R+ T +++ P+ R+ DP LLFQFGRYLLISSSRPG
Sbjct: 324 SIDLGRT-------TAADQ-----PTDVRIAQHAQVSDPQFAALLFQFGRYLLISSSRPG 371
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
TQ ANLQGIWN+ ++P+WDS +N NL MNYW + NLSEC P+FD + L++ G++
Sbjct: 372 TQPANLQGIWNDQMAPSWDSKFTINANLPMNYWPADTTNLSECLLPVFDMIDDLTVTGAR 431
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ Y A GWV HH TD W +S G W +W GGAWL T +W+HY +T D DFL
Sbjct: 432 VARAQYGAGGWVTHHNTDAWRGASVVDG-AQWGMWQTGGAWLATLIWDHYLFTGDTDFLR 490
Query: 489 KRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
YP L+G A F LD L+ G+L TNPS SPE P A V TMD I+
Sbjct: 491 SN-YPALKGAAQFFLDTLVAHPTLGHLVTNPSNSPE----LPHHTNATVCAGPTMDNQIL 545
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
R++F+++ A E L + + L + RL PT++ G++ EW D+ + E +HRH+
Sbjct: 546 RDLFTSVARAGETLGVDA-GFRAQALAARDRLAPTRVGSRGNVQEWLADWVETERNHRHV 604
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHL+GL P + IT P L +AA +TL+ RG++G GWS+ WK WARL D A++++
Sbjct: 605 SHLYGLHPSNQITKRGTPQLHEAARRTLELRGDDGTGWSLAWKINFWARLEDGARAHKLL 664
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
+ +LV + L N+F HPPFQID NFG T+ +AEML+ S +L++LPAL
Sbjct: 665 R---DLVRTDR-------LAPNMFDLHPPFQIDGNFGATSGIAEMLLHSHNGELHVLPAL 714
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
P W +G V GL+ RGG TV W G + V
Sbjct: 715 P-AAWPTGRVSGLRGRGGYTVGAEWSGGRIECV 746
>gi|325281855|ref|YP_004254397.1| Alpha-L-fucosidase [Odoribacter splanchnicus DSM 20712]
gi|324313664|gb|ADY34217.1| Alpha-L-fucosidase [Odoribacter splanchnicus DSM 20712]
Length = 807
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 281/773 (36%), Positives = 413/773 (53%), Gaps = 61/773 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PA+ + + +P+GNGRLG M GGV ET+ LN+ T+W+G D NP+A K L
Sbjct: 28 LKLWYTRPAERWEETLPLGNGRLGMMPDGGVVQETIVLNDITMWSGSFQDTRNPEALKYL 87
Query: 73 SDVRSLVDSGQYAEATAASVKLFG-------------HPADVYQLLGDIELEF---DDSH 116
++R L+ G+ EA K F P +QLLG++ L++ D S
Sbjct: 88 PEIRRLLLEGKNDEAQELMYKHFACGGQGSAFGQGANAPYGAFQLLGNLHLQYHFPDSSD 147
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+ Y+ Y R L L+ A A + G V++ RE+F S + V++ K++ G L F+V+
Sbjct: 148 VGYS--AYERGLSLDKALAWTCFRKGKVKYRREYFVSQTEDVMIMKLTADRKGMLDFDVA 205
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
+D + Y N + + MEG+ + G ++ L++ +D R
Sbjct: 206 IDRPENYTCYAN-DGVVYMEGQL---------DNGKGKAGTKYMVQLKVWTADGR---QV 252
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+ + V+ + A +L+ A +S D +Q N+ Y L R
Sbjct: 253 ADSACIHVKEATTAYVLVSAGTSL---------WAADYPERVEKLMQIAGNMDYGYLLER 303
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H ++ ++RV + L +P+DI+ P+ +R+ FQ EDP LV L FQ+
Sbjct: 304 HDSAWRYKYNRVELDLG-TPQDIL------------PTDQRLARFQEQEDPGLVALYFQY 350
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLIS +R + NLQG+W + W+ H+NINL+MNYW NLSE PL
Sbjct: 351 GRYLLISGTRENSFPLNLQGLWANSVQTPWNGDYHLNINLQMNYWPVEIVNLSELHTPLK 410
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ + L +G TA Y A GWV H T+ W + +A W GGAWLC HLWE
Sbjct: 411 NLVKDLVTSGEVTAHSFYGAQGWVAHMMTNPW-RFTAPGEHASWGATNTGGAWLCEHLWE 469
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDG-KLA 534
HY +T+D+++L + YP+L G + F L +IE G+L T PS+SPE+ F P K
Sbjct: 470 HYAFTLDQEYL-REVYPVLSGASRFFLSSMIEEPTQGWLVTAPSSSPENAFYMPGTRKEV 528
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA-EDGSIMEW 593
V MD IIRE+FS I AA +LE + A + + K+L +L P +I+ + G + EW
Sbjct: 529 SVCMGPAMDTQIIRELFSNTIQAARLLEIDA-AFADSLEKALDKLPPMQISPKGGYLQEW 587
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+D+++ + HRH+SHLFGL+P + I++ K P+L +AA KTLQ+RG+ G GWS+ WK
Sbjct: 588 LEDYEEVDPRHRHVSHLFGLYPSNQISLAKTPELAEAARKTLQRRGDGGTGWSMAWKINF 647
Query: 654 WARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
WARL + + A ++K L +V + GG Y NLF AHPPFQID N G A +AEM
Sbjct: 648 WARLQEGDKALELLKNLLKPVVTGGKVDYTGGGTYPNLFCAHPPFQIDGNLGGCAGIAEM 707
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
L+QS + +LPALP W G KGL RGG V WK G L ++ ++S
Sbjct: 708 LIQSQQGFIEVLPALP-AVWKEGSFKGLCVRGGGVVDASWKAGRLEKLTLHSR 759
>gi|197302771|ref|ZP_03167824.1| hypothetical protein RUMLAC_01500 [Ruminococcus lactaris ATCC
29176]
gi|197298169|gb|EDY32716.1| hypothetical protein RUMLAC_01500 [Ruminococcus lactaris ATCC
29176]
Length = 773
Score = 464 bits (1194), Expect = e-128, Method: Compositional matrix adjust.
Identities = 283/760 (37%), Positives = 415/760 (54%), Gaps = 42/760 (5%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+K+ ++ PA+++ +++P+GNGR+GAMV+GG E L LNEDTLW+G P + T P+
Sbjct: 1 MKLYYDHPAENWHESLPLGNGRIGAMVYGGTKKEILALNEDTLWSGYP-EKTQKKLPEGY 59
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
L VR L + +Y +A + F DV Y G++ +E D + ++ Y REL
Sbjct: 60 LEKVRELTEKREYQKAMEYLEECFSSSEDVQMYVPFGNVYMEMLDGTEEISD--YHRELC 117
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+TA R+ Y + S P QV+V KI ++ SL V ++
Sbjct: 118 LDTAEVRITYKNQGALVEKSCIVSQPAQVLVYKIRSEKAFSLKLYVEGGYARES---CCT 174
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQ-FSAILEIKISDDRGTISALEDKKLK----- 243
+ + +G+CPG R+P K + F E + G + D K+
Sbjct: 175 DGILKTKGQCPG-RVPFTVGEGGSEKAVPVFPEEPEKQGMCYEGWGKIVTDGKVNEAGNA 233
Query: 244 --VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
VE ++ L SSF G +P + P E + A SY L T HL +Y
Sbjct: 234 VIVENAEEVTLYYGIRSSFAGFDRHPVIEGRCP-EELLKADFDCTGKSYEALRTEHLKEY 292
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYL 360
QK + RVS L D +E+++ +R+ FQ ED L LLFQ+GRYL
Sbjct: 293 QKYYKRVSFSLGEK------DEYAEKDLR-----QRLTDFQDHPEDVGLNALLFQYGRYL 341
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LI++SRPGTQ ANLQGIWN +L P W S +NIN EMNYWQ+ PCNL E EPL
Sbjct: 342 LIAASRPGTQAANLQGIWNAELVPPWFSDYTININTEMNYWQTGPCNLEEMGEPLVRLCE 401
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
++ +G +TA + G H TD+W K++ G+ W WPMG AWLC +L++ Y +
Sbjct: 402 EMAADGKETAMHYFGKEGVCSFHNTDLWRKTTPADGRAEWNFWPMGYAWLCRNLYDQYLF 461
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---GKLACVS 537
T DR +LE R YP+L+ F ++ ++ GY +P+TSPE++F+ + KL
Sbjct: 462 TEDRAYLE-RIYPVLKENVRFCVESVVGTAQGYA-MSPATSPENDFLFGEEKKEKLTVAQ 519
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
Y+ + AI+R + + A +L D L + K + + +G I+EW +DF
Sbjct: 520 YTEN-ENAIVRNLLRDYLEAGRIL-GIRDELTGQAEKIFEEMAAPAVGSNGQILEWNEDF 577
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
++ + HHRHLS L+ L PG IT EK P+L +AA +L +RG+ G GWS+ WK +WAR+
Sbjct: 578 EEADPHHRHLSQLYELHPGRGIT-EKTPELYEAARTSLLRRGDAGTGWSLAWKILMWARM 636
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFE--GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
D H +++ + +LV+P+ + GG+Y+NLF AHPP+QID NFG+TA VAE L+Q
Sbjct: 637 KDGVHTGKLMNEILHLVEPKESMNMANGGGVYANLFCAHPPYQIDGNFGYTAGVAEALLQ 696
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
S + +LPALP +KW+ G + GLKARG TVSI W++G
Sbjct: 697 SHDGVITILPALP-EKWTKGEISGLKARGNITVSIRWENG 735
>gi|393786769|ref|ZP_10374901.1| hypothetical protein HMPREF1068_01181 [Bacteroides nordii
CL02T12C05]
gi|392658004|gb|EIY51634.1| hypothetical protein HMPREF1068_01181 [Bacteroides nordii
CL02T12C05]
Length = 821
Score = 463 bits (1191), Expect = e-127, Method: Compositional matrix adjust.
Identities = 270/763 (35%), Positives = 418/763 (54%), Gaps = 53/763 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA + +A+P+GNGR+GAMV+G V E +LNE+++W G P + NP A +AL
Sbjct: 24 LKLWYDRPATQWVEALPLGNGRIGAMVYGDVLHEEFQLNEESIWGGSPYNNVNPKAKEAL 83
Query: 73 SDVRSLVDSGQYAEA------TAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+R L+ G+ EA S G P YQ +G + L+F+ + Y++ Y R
Sbjct: 84 PRIRQLIFEGRNKEAQEMCGHAICSQTANGMP---YQTVGSLHLDFEGVN-NYSD--YYR 137
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL--DNH 184
ELD+ A K++ V +TRE F+S PDQ+++ +++ S+ +SF ++ D
Sbjct: 138 ELDIEKAIVTTKFTSEGVTYTREAFTSFPDQLLIIRLTASQKRKISFTARYNTPYGKDII 197
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDKKLK 243
V+ ++ + G KAN ++ +G ++FS + ++ + G A+ D L+
Sbjct: 198 RNVSSRKELQLHG---------KANDHEGIEGKVRFSTL--TRVEHNGGYTEAIADTLLR 246
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ ++ +V L V S FIN +D + + + L++ +Y H Y+K
Sbjct: 247 ISNAN-SVTLYV---SIGTNFINYNDVSGNALKTAQNYLKNAGK-NYQKAKETHCSTYRK 301
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RVS+ L + + P+ RV+ F + DP L L FQFGRYLLI
Sbjct: 302 WFNRVSLDLGSNAQSFK------------PTDVRVREFTSTFDPQLAALYFQFGRYLLIC 349
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+PG Q ANLQGIWN L WD +IN+EMNYW + NL E EP + ++
Sbjct: 350 SSQPGGQAANLQGIWNYQLRAPWDGKYTTDINVEMNYWPAESTNLPEMHEPFLQLIKEVA 409
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G ++A + Y GW +HH TDIW + + G + +WP +W C HLW+HY ++ +
Sbjct: 410 EKGKQSAAM-YGCRGWTLHHNTDIWRSTGSVDGP-GYGIWPTCNSWFCQHLWDHYLFSGN 467
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
RD+L + YPL+ F LD+LI + + +L +PS SPE+ + + + +TM
Sbjct: 468 RDYLTE-IYPLMRSACEFYLDFLIRDPKNNWLVVSPSYSPENRPVVNGKRDFTIVAGATM 526
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D ++ ++F + AA ++ ++ A ++ + + L P ++ G + EW +D+ +P+
Sbjct: 527 DNQMVNDLFRNTLEAASLIGES-SAFIDSLQTVIQNLAPMQVGRWGQLQEWMEDWDNPQD 585
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEH 662
HRH SHL+GL+PG IT + P L +AA++TL+ RG+ GWS+ WK WARL D H
Sbjct: 586 RHRHTSHLWGLYPGRQIT-PRTPILFEAAKRTLEGRGDHSTGWSMGWKVCFWARLLDGNH 644
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY+++ L EK GG Y NLF AHPPFQID NFG TA ++EM VQS ++
Sbjct: 645 AYKLITE--QLHPTTDEKGQNGGTYPNLFDAHPPFQIDGNFGCTAGISEMFVQSHAGSVH 702
Query: 723 LLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYS 764
LLPALP D W G + GL+ RGG T+ + W+D L V I S
Sbjct: 703 LLPALP-DVWKKGSITGLRCRGGFTIDELNWEDNQLQSVRITS 744
>gi|408370425|ref|ZP_11168202.1| hypothetical protein I215_05947 [Galbibacter sp. ck-I2-15]
gi|407744183|gb|EKF55753.1| hypothetical protein I215_05947 [Galbibacter sp. ck-I2-15]
Length = 792
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 278/762 (36%), Positives = 409/762 (53%), Gaps = 55/762 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ A+ + A+P+GNGRLGAM++G E L+LNED++W G P + + L +R
Sbjct: 35 YEQAAEDWMQALPVGNGRLGAMIFGNPDIEHLQLNEDSMWPGGPTLGDSKGTVEDLVALR 94
Query: 77 SLVDSGQYAEATAASVKLFGH--PADVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
+L+D G+ +A V F H +Q GD+ L+F + E T Y R LDL+ A
Sbjct: 95 ALIDQGKVHQADKFIVDKFSHLEVTRSHQTAGDLFLDFK----RKGEVTDYYRGLDLDKA 150
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS-----YVN 188
A V Y V +FT + +SN D ++ + + L F++ L +D + +
Sbjct: 151 VATVSYKVDGDQFTEKIIASNVDDALIISLETTAEKGLDFDIQLSRPMDKSAPTVLVTTH 210
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
++++IM+G + + +G++F ++ + + GTI D L++ G
Sbjct: 211 NSDELIMDGMVTQRGGVVENKPYPMQEGVEFQT--RLRATTEGGTIEP-SDGILELRGVR 267
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
AV+ LV +SF +D +++ L + + S+ +L RH D+ + + RV
Sbjct: 268 KAVIYLVTKTSF---------YHQDFKAKAQENLNEVASKSFDELLRRHSQDFGEFYDRV 318
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
+ L S ++D++P+ +R++ ++ + D L LF +GRYLLISSSR
Sbjct: 319 NFSLGSS------------DLDSLPTDKRLQRYKDGQVDLDLQTKLFDYGRYLLISSSRE 366
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQGIWN +S W++ H+NINL+MNYW S+ NLSE Q+PLFDF L G
Sbjct: 367 GTNPANLQGIWNNHISAPWNADYHLNINLQMNYWPSMVANLSELQQPLFDFSDRLLQRGK 426
Query: 428 KTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
KTA+ Y + G V+HH TD+WA + + W W GG WL H W+HY +T D DF
Sbjct: 427 KTAKEQYGIQRGAVMHHTTDLWAPAFMFSSQPYWGSWIHGGGWLAQHYWDHYRFTQDADF 486
Query: 487 LEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
LE RAYP ++ A F +DWL + G + P TSPE+ ++A DGK A VS + M
Sbjct: 487 LENRAYPFMKEIALFYMDWLQKDATTGKWVSYPETSPENSYLAADGKPAAVSKGAAMGHQ 546
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
II EVF +SAA+VL N++ E K + EDG I+EW + +K+PE HR
Sbjct: 547 IIAEVFDNALSAAKVLNINDEFTQELKAKRADLTPGIVLGEDGRILEWDKPYKEPEKGHR 606
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEH 662
HLSHL+ L PG IT E P+ KAA+KT+ R G G GWS W + ARL D+
Sbjct: 607 HLSHLYALHPGDAIT-EATPEQFKAAKKTIDYRLEHGGAGTGWSRAWMISFNARLFDKAS 665
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
A + + F + + NLF HPPFQID NFG+TA V E+L+QS + L
Sbjct: 666 AEENINKFFQI-----------SIADNLFDEHPPFQIDGNFGYTAGVIELLLQSHEDFLR 714
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+LP+LP + WS G + G+KARG V I W L ++ + S
Sbjct: 715 ILPSLP-ENWSEGSISGIKARGNIEVGITWDQNKLTQLSLVS 755
>gi|254475685|ref|ZP_05089071.1| conserved hypothetical protein [Ruegeria sp. R11]
gi|214029928|gb|EEB70763.1| conserved hypothetical protein [Ruegeria sp. R11]
Length = 792
Score = 462 bits (1188), Expect = e-127, Method: Compositional matrix adjust.
Identities = 271/768 (35%), Positives = 401/768 (52%), Gaps = 64/768 (8%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA--LSDVRSLVDSGQYAEATAASVKLF 95
MV+GG + LNEDTL++G P + P P A + V L++ G+Y EA + F
Sbjct: 1 MVYGGADIFKMHLNEDTLYSGEPSEVFKP-TPVADQVPKVSKLLEQGEYEEAQELVRRSF 59
Query: 96 -GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
G YQ +G +E + + + Y R LD+ V + + R+ + S+
Sbjct: 60 LGKQGASYQPVGYFLVEPRN---RVSASAYERRLDIGNGVHSETIEVDDAKILRDCYISH 116
Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG------------K 202
Q IV + S L+ + + + N + + + G+ P +
Sbjct: 117 EHQAIVITMETSADEGLNLDARIVTQHPNGKATHRGRRYVFSGQAPSFTQHAKSLLQMHQ 176
Query: 203 RI---------------------PPKANA------NDDPKGIQFSAILEIKISDDRGTIS 235
R+ P + ++ N D +G+ + + D GT+
Sbjct: 177 RLGDTWKQPALYDRNGDIHPYLTPAEMSSEHTVLYNQDGRGLGMFFEAAVDVRHDGGTVE 236
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+ D + + L+ ++S++G +PS DP + + L ++ ++ + +
Sbjct: 237 -VSDAGISLTNVQSVTFLISLATSYNGFDKSPSREGADPVRRNNNVLDALVGVAEPKIRS 295
Query: 296 RHLDDYQKLFHRVSIQL-SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H DD Q L RVS+ L SP ++ TD +R+K Q DP L L F
Sbjct: 296 SHTDDIQALMSRVSLHLDGESPANLTTD-------------QRLKQAQDRPDPELAALAF 342
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYLLISSSRPG+Q NLQGIWN W S +NINL+MNYW + P L+E EP
Sbjct: 343 QYGRYLLISSSRPGSQPPNLQGIWNNSTCAMWSSNYTMNINLQMNYWPAEPTGLAELTEP 402
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LF+ + LS+ G++ A+ + A GW+ H T +W + + A WP+G WL HL
Sbjct: 403 LFNLIDELSVTGARQAKHMFDAPGWMAFHNTTLWREVTPSHATPQSAFWPVGAGWLVAHL 462
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WE Y Y+ D +FL RA+P +EG FLLDW++EG DG+L T STSPE++F+ +G
Sbjct: 463 WERYEYSGDLEFLRDRAWPRMEGALEFLLDWMVEGSDGFLTTPISTSPENKFLDENGVEC 522
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
V STMD+AIIR + ++ AAE L+K + + + +L +L P + G ++EWA
Sbjct: 523 TVHQGSTMDIAIIRGLLEQMLQAAEALDKPAE-ISARYQTALDKLPPYRTGAKGELLEWA 581
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
+D + + HHRH+SHL+G+FPG+ IT E P+L A K+L RG+E GWS+ WK AL
Sbjct: 582 EDLPEWDPHHRHVSHLYGVFPGNQITHE-TPELQDAVRKSLAIRGDEATGWSMGWKLALH 640
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL D + AY +++ +F V+ + K +GGLY NL +HPPFQID NFG+TA VAEML+
Sbjct: 641 ARLGDGDRAYDILRNVFEFVECDRPKGQKGGLYPNLLGSHPPFQIDGNFGYTAGVAEMLM 700
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
QS + LLPALP W G V GL+AR G V I W G+L E +
Sbjct: 701 QSHAGRVELLPALP-SVWPGGEVSGLRARQGFIVDIKWAKGELVEAEV 747
>gi|402814854|ref|ZP_10864447.1| hypothetical protein PAV_3c01950 [Paenibacillus alvei DSM 29]
gi|402507225|gb|EJW17747.1| hypothetical protein PAV_3c01950 [Paenibacillus alvei DSM 29]
Length = 810
Score = 461 bits (1187), Expect = e-127, Method: Compositional matrix adjust.
Identities = 277/784 (35%), Positives = 424/784 (54%), Gaps = 78/784 (9%)
Query: 21 AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVD 80
A +T+A PIGNGRLG +V+GG+ E ++LNED++W G D N A AL D+++L+
Sbjct: 15 ASKWTEAFPIGNGRLGGVVYGGIQREQIQLNEDSIWYGGARDNDNRAAQAALPDIKNLLL 74
Query: 81 SGQYAEATAASVKLFGHPADV------YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
G +A +K H +V YQ LG++ L+F+ + +A Y R+LDL+ A
Sbjct: 75 QGNVRKAEKLVLK---HMTNVPQYFNPYQTLGNLFLDFEPNIEVHAINQYCRKLDLDHAL 131
Query: 135 ARVKYSVGN-------------------VEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
+V Y VG ++++RE FSS DQV+V +++ ++ L+F
Sbjct: 132 VQVNYEVGRQDKEGRTATQATGEAQKEAIQYSREIFSSAADQVLVIRMTTTDEAGLTFAA 191
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D V ++ G+ I + D G++++ +L+ + G
Sbjct: 192 KFDRRPFTGEMVQTDD---------GQGIAMQGQLGAD--GVRYAVVLQAVVE---GGQC 237
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
L + + L++ A +SF +D+ +++ A + + Y L
Sbjct: 238 QTAGNYLDIRQARAVTLIVAAQTSF-----RCADAYAVACQQAIQAAK----VPYEKLKQ 288
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLF 354
RHLDDY+ LF+RV++ L + + +++R++ + Q D L L +
Sbjct: 289 RHLDDYKPLFNRVTLDLEAEEGERTEPQQQVPGQQCLSTSQRLERYRQGATDNGLEALFY 348
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYLL++SSRPGT ANLQGIWN+ +P W+S H+NINL+MNYW + NL+EC P
Sbjct: 349 QYGRYLLLASSRPGTLPANLQGIWNDSFTPPWESDYHLNINLQMNYWLAETGNLAECHMP 408
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
LFDF+ L ING +TA+ Y A G+V H +++WA + V +WPMGGAW+ H+
Sbjct: 409 LFDFIERLVINGRQTARNIYGARGFVAHTSSNLWADTGIYGEYVSANMWPMGGAWIALHM 468
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WEHY Y FL +RAYP+L+ A F LD+L+E G L T PS SPE+ + + G++
Sbjct: 469 WEHYCYNGSLSFLRERAYPVLKEAALFFLDFLLELPSGQLVTVPSLSPENSYRSEQGEVG 528
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA-----------LVEKVLKSLPRLRPTK 583
+ Y +MD I+ +F+A I A E+L+ +E+ L+ + + +L +
Sbjct: 529 ALCYGPSMDSQILYALFTACIRAGELLQLDEEGHLKQGFHEDKDLLAQWQQVRSKLPQPQ 588
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG- 642
I G IMEWA D+++ E+ HRH+SHLF L PG I ++P+L +AA+ TLQ+R G
Sbjct: 589 IGRHGQIMEWAVDYEEVELGHRHISHLFALHPGEQIIPHRSPELGQAAKFTLQRRLAHGG 648
Query: 643 --PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
GWS W W+RL + + A+ ++ L + ++ NLF HPPFQID
Sbjct: 649 GHTGWSQAWIANFWSRLEEGDQAHLSLRNLLS-----------KAVHPNLFGDHPPFQID 697
Query: 701 ANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
ANFG AA+ EML+QS +++ LLPALP W G V GL+ARGG T+ + W+ G L +
Sbjct: 698 ANFGGAAAMQEMLLQSHGDEIRLLPALPL-AWRQGHVTGLRARGGFTIDMAWQAGKLQQA 756
Query: 761 GIYS 764
I S
Sbjct: 757 QITS 760
>gi|150009027|ref|YP_001303770.1| glycoside hydrolase [Parabacteroides distasonis ATCC 8503]
gi|149937451|gb|ABR44148.1| glycoside hydrolase family 95 [Parabacteroides distasonis ATCC
8503]
Length = 809
Score = 461 bits (1186), Expect = e-127, Method: Compositional matrix adjust.
Identities = 279/777 (35%), Positives = 414/777 (53%), Gaps = 55/777 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L++ + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADTDR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G A D L V + A++L+ + + FD KD +S+ L
Sbjct: 246 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGAGQSLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L R +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGRGERD------------HLPINERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 404 TNLSELHLPLIEWTKLQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+ + + STMD I+RE+F+ I AA +L + E K RL PT I
Sbjct: 522 AYKMPNESVVSICAGSTMDNQILRELFTNTIEAAGILGVDSTFAAELAAKR-DRLMPTTI 580
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
+DG IMEW + +++ E HRH+SHL+GL+PG+ I+IE P+L +AA K+L+ RG++ G
Sbjct: 581 GKDGRIMEWLEPYEEVEPTHRHVSHLYGLYPGNEISIEHTPELAEAARKSLEVRGDQSTG 640
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANF 703
WS+ WK WARL D +HAY+++ L EH K + GG Y NLF AHPPFQID NF
Sbjct: 641 WSMAWKINFWARLQDGDHAYKLLGDLLRPCVDEHTKEVKGGGSYPNLFCAHPPFQIDGNF 700
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
G TA +AEML+QS + LPALP W +G GLK R G VS W +G L E
Sbjct: 701 GGTAGIAEMLIQSQTGLIEFLPALP-SAWKNGSFSGLKVRNGGEVSAKWTEGLLTEA 756
>gi|384419108|ref|YP_005628468.1| hypothetical protein XOC_2159 [Xanthomonas oryzae pv. oryzicola
BLS256]
gi|353462021|gb|AEQ96300.1| hypothetical protein XOC_2159 [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 776
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 285/748 (38%), Positives = 400/748 (53%), Gaps = 60/748 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ L++ + PA + +A+P+GNGRLGAMVWGG L+LNEDTL+ G P D T+P
Sbjct: 41 VAAAEALQLWYPQPANEWVEALPVGNGRLGAMVWGGSAHAHLQLNEDTLYAGGPYDATSP 100
Query: 67 DAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET 123
DA AL VR+L+ +G YAE A KL P YQ LGD+ L+FD +
Sbjct: 101 DALAALPQVRALIFAGGYAEVEQLADAKLLSRPLKQMPYQPLGDLLLDFDRAD---GMSD 157
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YRR+LDL+TA A + G RE F S Q +V ++S G +S V +DS N
Sbjct: 158 YRRQLDLDTAVATTTFRSGGAVHRREVFVSAHAQCVVVRLSCDHPGGISLRVGIDSP-QN 216
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKK 241
++ GR N GI+ L + G S + D+
Sbjct: 217 GEVTAEQGGLLFSGR------------NGSCAGIEGKLRFALPVLPQVTGGKRSQVRDR- 263
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L+++ +D VLLL A++S ++ D DP + + ++L+ L ++ L HL D+
Sbjct: 264 LRIDAADEVVLLLSAATSDQ--RVDTVDG--DPLALTAASLRKAAKLEFAALLRAHLADH 319
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q+LF RV+I L S D V + + ERV+ F +DP+L L Q+GRYLL
Sbjct: 320 QRLFRRVAINLGSS--DAVQ----------LSTNERVQRFAEGDDPALAALYHQYGRYLL 367
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I SSRP TQ ANLQGIWN+ + P W+S +NIN EMNYW S L EC EPL
Sbjct: 368 ICSSRPCTQPANLQGIWNDLMQPPWESKYTININAEMNYWPSEANALHECVEPLEAMWFD 427
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L+ G+ TA+ Y A WV+H+ TD+W ++ G W LWPMGG W LW ++Y
Sbjct: 428 LAKTGAHTAKAMYDAPAWVVHNNTDLWRQAGPIDG-AKWRLWPMGGVWQ-QQLWHRWDYG 485
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
DR L YPL +G A F + L+ + G + TNPS SPE+++ P G C
Sbjct: 486 RDRADLST-IYPLFKGAAEFFVATLLRDPQTGAMVTNPSMSPENQY--PFGAALCA--VP 540
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ--DFK 598
TMD ++R++F+ I+ ++L + D L +++ RL P +I + G + EW Q D +
Sbjct: 541 TMDAQLLRDLFAQCIAMRKLLCIDAD-LAQQLAALRERLPPNRIGKAGQLQEWQQDGDMQ 599
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
PE+HH H+SHL+ L P I P+L AA ++L+ RG+ GW + W+ LWAR
Sbjct: 600 APEIHHLHVSHLYALHPSSQIKPRDPPELAAAARRSLEIRGDNATGWGLGWRLNLWARPA 659
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D EHAYR+++ L+ P+ NL AHPPFQID NFG TA + EML+Q +
Sbjct: 660 DGEHAYRILQL---LISPDRT-------CPNLLDAHPPFQIDGNFGGTAGITEMLLQRWV 709
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGE 746
+ LLPALP W G V+ ++ RGG
Sbjct: 710 GSVLLLPALP-KAWPRGSVRDVRVRGGR 736
>gi|260591756|ref|ZP_05857214.1| putative alpha-L-fucosidase 2 [Prevotella veroralis F0319]
gi|260536040|gb|EEX18657.1| putative alpha-L-fucosidase 2 [Prevotella veroralis F0319]
Length = 804
Score = 460 bits (1184), Expect = e-126, Method: Compositional matrix adjust.
Identities = 278/815 (34%), Positives = 429/815 (52%), Gaps = 69/815 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+++ +N PA F ++IP+GNG+LGA+V+GG +T+ LN+ T WTG P D N KA
Sbjct: 24 MRLWYNQPAHFFEESIPLGNGKLGALVYGGTQKDTIYLNDITYWTGKPVD-PNEGLGKAK 82
Query: 72 -LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ ++R + + Y A + + G + YQ LG + + ++ A Y REL+L
Sbjct: 83 WIPEIRKALFAENYRTADSLQHFVQGEQSASYQPLGTLNIINLNTG---AVSNYYRELNL 139
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
++A A + Y ++FTRE+F+++ D +I I +++G+++ ++ L + H N
Sbjct: 140 DSALAHISYQQNGIQFTREYFATHRDSLIAIHIKANQAGAINLHIQLTAQTP-HKVKATN 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
NQ+ M G G A +++ G + A D L + +D A
Sbjct: 199 NQLTMTGHTTGSETE------------SVHACTIVRLLPQGGKVIA-SDSTLTLTNADNA 245
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ +V ++SF+G +P +++A +N +YS+ RH+ +YQ++++R+ +
Sbjct: 246 TIYIVNATSFNGFDKHPVKDGASYIDNAVNAAWHTQNFTYSEFKDRHIKEYQQIYNRIKL 305
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-------SLVELLFQFGRYLLIS 363
QL ++E + +P+ + ++ + + P L L FQFGRYLL+S
Sbjct: 306 QLG-----------NKEYTNNLPTDQLLRRYSSSTAPLPEAAQRYLETLYFQFGRYLLLS 354
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SR ANLQG+W L W +NINLE NYW + P N+SE +PL F+ LS
Sbjct: 355 CSRTPNIPANLQGLWTPHLFSPWRGNYTMNINLEENYWPADPANMSETIQPLIGFVKGLS 414
Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVV--WALWPMGGAWLCTHLWEHYN 479
G TA+ Y + GW H +D W K+S GK WA W +GGAWL LW+HY
Sbjct: 415 ATGKHTARNFYGINEGWCAAHNSDPWCKTSPVGEGKESPEWANWNLGGAWLVNALWDHYL 474
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVS 537
Y+ D+ L+ YPL+EG + F WL+ + L T PSTSPE+E++ G
Sbjct: 475 YSQDKQLLQNTIYPLMEGSSKFFQQWLVTNPNKPNELITAPSTSPENEYVTDKGYHGTTC 534
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
Y T D+AIIRE+F + A + L D ++ L RL P + G + EW D+
Sbjct: 535 YGGTADLAIIRELFMNMQQARKSLGLKPDKEMD---DKLHRLHPYTVGSQGDLNEWYYDW 591
Query: 598 KDPEVHHRHLSHLFGLFPGHTITI----EKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
KD ++HHRH SHL GL+PG + K+ + AA +TL ++G+E GWS W+ L
Sbjct: 592 KDYDIHHRHQSHLIGLYPGMHLQALAKQTKDSTILAAAHQTLIQKGDESTGWSTGWRINL 651
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKH----FEGGLYSNLFAAHPPFQIDANFGFTAAV 709
WARL D HAY++ + L + V PE + GG Y NLF AHPPFQID NFG TA V
Sbjct: 652 WARLGDGNHAYKIYQNLLSYVSPEGYRGKDAVHHGGTYPNLFDAHPPFQIDGNFGGTAGV 711
Query: 710 AEMLVQSTLN--------DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVG 761
EMLVQS+++ +++LLPALP D W++G +KG++ RGG T+ + W++ + +
Sbjct: 712 CEMLVQSSVDMTAKKPVYNIHLLPALP-DAWANGEIKGIRTRGGLTIDMKWENKLVTSLQ 770
Query: 762 IYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTF 796
I + D + Y S ++ L G I F
Sbjct: 771 IKA-----VTDVDVNITYNNKSSRMKLRQGGIIKF 800
>gi|301312083|ref|ZP_07218005.1| alpha-L-fucosidase 2 [Bacteroides sp. 20_3]
gi|423339363|ref|ZP_17317104.1| hypothetical protein HMPREF1059_03029 [Parabacteroides distasonis
CL09T03C24]
gi|300830185|gb|EFK60833.1| alpha-L-fucosidase 2 [Bacteroides sp. 20_3]
gi|409230744|gb|EKN23605.1| hypothetical protein HMPREF1059_03029 [Parabacteroides distasonis
CL09T03C24]
Length = 809
Score = 459 bits (1182), Expect = e-126, Method: Compositional matrix adjust.
Identities = 276/777 (35%), Positives = 413/777 (53%), Gaps = 55/777 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L + + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LRYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G D L V + A++L+ + + FD KD +S+ L
Sbjct: 246 LPKGGDLTATDSSLSVRSASEAIILVSLGTDYFD----------KDGVGQSLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L + +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTFAYRSLFDRVSLDLGKGERD------------HLPIHERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GW H ++W + +A W
Sbjct: 404 TNLSELHLPLIEWTKQQVPSGERTAKAFYNARGWGTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAAQFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + STMD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 522 AYKMPNGSVVSICAGSTMDNQILRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 580
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
+DG IMEW + +++ E HRH+SHL+GL+PG+ I+IE P+L +AA K+L+ RG++ G
Sbjct: 581 GKDGRIMEWLEPYEEVEPTHRHVSHLYGLYPGNEISIEHTPELAEAARKSLEVRGDQSTG 640
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANF 703
WS+ WK WARL D +HAY+++ L EH K + GG Y NLF AHPPFQID NF
Sbjct: 641 WSMAWKINFWARLQDGDHAYKLLGDLLRPCVDEHTKEVKGGGSYPNLFCAHPPFQIDGNF 700
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
G TA +AEML+QS + LPALP W +G GLK R G VS W +G L E
Sbjct: 701 GGTAGIAEMLIQSQSGLIEFLPALP-SAWKNGSFSGLKVRNGGEVSAKWTEGLLTEA 756
>gi|115391619|ref|XP_001213314.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114194238|gb|EAU35938.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 749
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 287/758 (37%), Positives = 404/758 (53%), Gaps = 67/758 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+P+GNGRLGAMV G +E L+LNED++W G PGD T A + L
Sbjct: 3 ELWYRSPAATWDEALPVGNGRLGAMVHGRTTTELLQLNEDSVWYGGPGDRTPVGASRYLQ 62
Query: 74 DVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R + G +AEA ++F HP Y+ LG + L+F HL+ YRR LDL
Sbjct: 63 QLRQYIRKGAHAEAEELVRRVFFAHPISQRHYEPLGTLFLDF--GHLESEVTEYRRSLDL 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
RV+Y V F RE +S+PD VI ++ SE + F V L + D N
Sbjct: 121 QRGITRVQYMHTGVHFEREVLASHPDAVIAIRVRASEP--VEFVVRLTRMSDLEYETNEY 178
Query: 191 -NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD-DRGTISALEDKKLKVEGSD 248
+ + ++ C + P ++ + + I+ D D TI+ + +KL V +
Sbjct: 179 LDDVAVDDNCVTMHVTPGGRNSN-----RACCKVAIRCDDPDGATIARVGGRKLMVRARE 233
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS--DLYTRHLDDYQKLFH 306
LLLVA+ + + + + +AL L +S ++++RH++DYQ+L+
Sbjct: 234 --TLLLVAAQT----------TYRYQDIDGRAALDVADALRWSTEEIWSRHIEDYQQLYA 281
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
R+++ +S I TD ER+K DP LV L FGRYLLI+SSR
Sbjct: 282 RMTLAMSPDASHIPTD-------------ERIKH---SRDPGLVSLYHNFGRYLLIASSR 325
Query: 367 PG----TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
G ANLQGIWN P W S +NINL+MNYW + CNL+EC+ PLFD L +
Sbjct: 326 EGNGNKVLPANLQGIWNPSFHPAWGSKYTLNINLQMNYWPANVCNLAECEMPLFDLLERI 385
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ G KTA Y GW +HH TDIWA ++ + LWP+GGAWLC H+WE + ++
Sbjct: 386 ASAGQKTAHEVYGCRGWAVHHCTDIWADTAPVDQWMPATLWPLGGAWLCFHVWERFLFSK 445
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSST 541
D FL +R +P+L GC FLLD+L+E G YL T+PS SPE+ F +G+ + ST
Sbjct: 446 DEMFL-RRMFPVLRGCVEFLLDFLVEDATGQYLVTSPSLSPENLFYDAEGRQGVLCEGST 504
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
+DM ++ VF A I + +L N+D LV +V + RL P +I G + EW D+ + E
Sbjct: 505 IDMQLVDAVFHAFIQSVNILNLNDD-LVSRVNHASERLPPARIGSFGQLQEWTADYAEVE 563
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLH 658
HRH+SHL+ L+PGHTI + DL A TL +R G GWS W L ARL
Sbjct: 564 PGHRHVSHLWALYPGHTILPGRTKDLAAACAATLARRQAHGGGHTGWSRAWLINLHARLR 623
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
+ R V++L NL HPPFQID NFG TA + EMLVQS
Sbjct: 624 AADECGRHVEQL-----------LAQSTLPNLLDTHPPFQIDGNFGATAGIVEMLVQSHE 672
Query: 719 NDLY-LLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
+ LLPA P D W +G ++G+KARGG + W+DG
Sbjct: 673 EGIIRLLPACP-DSWKAGSIRGVKARGGFELDFRWEDG 709
>gi|423304137|ref|ZP_17282136.1| hypothetical protein HMPREF1072_01076 [Bacteroides uniformis
CL03T00C23]
gi|423310748|ref|ZP_17288732.1| hypothetical protein HMPREF1073_03482 [Bacteroides uniformis
CL03T12C37]
gi|392681018|gb|EIY74381.1| hypothetical protein HMPREF1073_03482 [Bacteroides uniformis
CL03T12C37]
gi|392685663|gb|EIY78977.1| hypothetical protein HMPREF1072_01076 [Bacteroides uniformis
CL03T00C23]
Length = 820
Score = 459 bits (1181), Expect = e-126, Method: Compositional matrix adjust.
Identities = 272/765 (35%), Positives = 417/765 (54%), Gaps = 53/765 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEF----DDSHLKYAEE 122
+R L+ G+ EA F YQ+LGD++++F S L
Sbjct: 89 AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR L+L A A + + +V++ RE+F S V++ + G+L+F+ L
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGREGALNFSARLSRAEH 208
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V GN ++M+G + P + G+++ +++ + ++S +L
Sbjct: 209 SSVTVQGNT-LLMDGMLESGK--PGLD------GMKYRVAMQLVQNGGESSVSPENGIRL 259
Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K W +L A + F G + DS P + ++ SI + S+S
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTAPANSPCSILHSSFSS---- 315
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ ++ L+ RVS+ L +P D T+P+ ER+ F E P+L L + +
Sbjct: 316 HVTAHRFLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISS+RPG+ NLQG+W +S W+ H NIN++MN+W LSE +PL
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWANGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423
Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L +G +A+ Y A GWV+H T++W +A W GGAWLC HL
Sbjct: 424 TLMERLIPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT D+D+L +R YP+L+G A F + E G+L T P++SPE+ F P +
Sbjct: 483 WEHYLYTQDKDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541
Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
VS TMD+ ++ E++ +I+AA +L+ + D V K+ L R P +I+++G +
Sbjct: 542 TPVSICMGPTMDVQLLTELYINVIAAARLLDCDAD-YVAKLEADLKRFPPMQISKEGYLQ 600
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+K+ +VHHRH+SHL+GL PG+ I+ E P+L +A TL +RG+EG GWS WK
Sbjct: 601 EWLEDYKEVDVHHRHVSHLYGLHPGNLISPESTPELAEACRMTLNRRGDEGTGWSRAWKI 660
Query: 652 ALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
WARL D A+++ K L + VD H G + NLF +HPPFQID N+G A V
Sbjct: 661 NFWARLGDGNRAWKLFKSLLHPAVDAATGGH-GSGTFPNLFCSHPPFQIDGNYGGAAGVG 719
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
EML+QS ++LLPALP D W++G +G++ RGG ++ + WKDG
Sbjct: 720 EMLLQSHEGFIHLLPALP-DSWTTGNFRGMRVRGGASIDLDWKDG 763
>gi|262383921|ref|ZP_06077057.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_33B]
gi|262294819|gb|EEY82751.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_33B]
Length = 809
Score = 459 bits (1180), Expect = e-126, Method: Compositional matrix adjust.
Identities = 278/777 (35%), Positives = 414/777 (53%), Gaps = 55/777 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L++ + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LKYTYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G A D L V + A++L+ + + FD KD +S+ L
Sbjct: 246 LPKGGDLATTDSCLSVRSASEAIILVSLGTDYFD----------KDGVGQSLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L + +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGKGERD------------HLPINERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDFHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL ++ +G +TA+ Y A GWV H ++W + +A W
Sbjct: 404 TNLSELHLPLIEWTKQQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+ + + STMD I+RE+F+ I AA +L + E K RL PT I
Sbjct: 522 AYKMPNESVVSICAGSTMDNQILRELFTNTIEAAGILGVDSTFAAELAAKR-DRLMPTTI 580
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
+DG IMEW + +++ E HRH+SHL+GL+PG+ I+IE P+L +AA K+L+ RG++ G
Sbjct: 581 GKDGRIMEWLEPYEEVEPTHRHVSHLYGLYPGNEISIEHTPELAEAARKSLEVRGDQSTG 640
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANF 703
WS+ WK WARL D +HAY+++ L EH K + GG Y NLF AHPPFQID NF
Sbjct: 641 WSMAWKINFWARLQDGDHAYKLLGDLLRPCVDEHTKEVKGGGSYPNLFCAHPPFQIDGNF 700
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
G TA +AEML+QS + LPALP W +G GLK R G VS W +G L E
Sbjct: 701 GGTAGIAEMLIQSQTGLIEFLPALP-SAWKNGSFSGLKVRNGGEVSAKWTEGLLTEA 756
>gi|317477822|ref|ZP_07937009.1| glycoside hydrolase [Bacteroides sp. 4_1_36]
gi|316906021|gb|EFV27788.1| glycoside hydrolase [Bacteroides sp. 4_1_36]
Length = 820
Score = 457 bits (1177), Expect = e-126, Method: Compositional matrix adjust.
Identities = 271/765 (35%), Positives = 415/765 (54%), Gaps = 53/765 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEF----DDSHLKYAEE 122
+R L+ G+ EA F YQ+LGD++++F S L
Sbjct: 89 AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR L+L A A + + +V++ RE+F S V++ + G+L+F+ L
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGREGTLNFSARLSRAEH 208
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V GN ++M+G + G+++ +++ + ++S L
Sbjct: 209 SSVTVQGNT-LLMDGML--------ESGKPGLDGMKYRVAMQLVQNGGESSVSPGNGICL 259
Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K W +L A + F G + DS P + ++ SI + S S+
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTTPANSPCSILHSSLSN---- 315
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ ++ L+ RVS+ L +P D T+P+ ER+ F E P+L L + +
Sbjct: 316 HVTAHRFLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISS+RPG+ NLQG+W +S W+ H NIN++MN+W LSE +PL
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWANGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423
Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L +G +A+ Y A GWV+H T++W +A W GGAWLC HL
Sbjct: 424 TLMERLVPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT DRD+L +R YP+L+G A F + E G+L T P++SPE+ F P +
Sbjct: 483 WEHYLYTQDRDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541
Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
VS TMD+ ++ E+++ +I+AA +L+ + D V K+ L + P +I+++G +
Sbjct: 542 TPVSICMGPTMDVQLLTELYTNVIAAARLLDCDAD-YVAKLEADLKKFPPMQISKEGYLQ 600
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+K+ +VHHRH+SHL+GL PG+ I+ E P+L +A TL +RG+EG GWS WK
Sbjct: 601 EWLEDYKEVDVHHRHVSHLYGLHPGNLISPESTPELAEACRMTLNRRGDEGTGWSRAWKI 660
Query: 652 ALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
WARL D A+++ K L + VD H G + NLF +HPPFQID N+G A V
Sbjct: 661 NFWARLGDGNRAWKLFKSLLHPAVDAATGGH-GSGTFPNLFCSHPPFQIDGNYGGAAGVG 719
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
EML+QS ++LLPALP D W++G +G++ RGG ++ + WKDG
Sbjct: 720 EMLLQSHEGFIHLLPALP-DSWTTGNFRGMRVRGGASIDLDWKDG 763
>gi|270294825|ref|ZP_06201026.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274072|gb|EFA19933.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 820
Score = 457 bits (1176), Expect = e-125, Method: Compositional matrix adjust.
Identities = 271/765 (35%), Positives = 415/765 (54%), Gaps = 53/765 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEF----DDSHLKYAEE 122
+R L+ G+ EA F YQ+LGD++++F S L
Sbjct: 89 AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR L+L A A + + +V++ RE+F S V++ + G+L+F+ L
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGREGTLNFSARLSRAEH 208
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V GN ++M+G + G+++ +++ + ++S L
Sbjct: 209 SLVTVQGNT-LLMDGML--------ESGKPGLDGMKYRVAMQLVQNGGESSVSPENGICL 259
Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K W +L A + F G + DS P + ++ SI + S S+
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTTPANSPCSILHSSLSN---- 315
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ ++ L+ RVS+ L +P D T+P+ ER+ F E P+L L + +
Sbjct: 316 HVTAHRFLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISS+RPG+ NLQG+W +S W+ H NIN++MN+W LSE +PL
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWTNGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423
Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L +G +A+ Y A GWV+H T++W +A W GGAWLC HL
Sbjct: 424 TLMERLVPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT DRD+L +R YP+L+G A F + E G+L T P++SPE+ F P +
Sbjct: 483 WEHYLYTQDRDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541
Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
VS TMD+ ++ E+++ +I+AA +L+ + D V K+ L + P +I+++G +
Sbjct: 542 TPVSICMGPTMDVQLLTELYTNVIAAARLLDCDAD-YVAKLEADLKKFPPMQISKEGYLQ 600
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+K+ +VHHRH+SHL+GL PG+ I+ E P+L +A TL +RG+EG GWS WK
Sbjct: 601 EWLEDYKEVDVHHRHVSHLYGLHPGNLISPESTPELAEACRMTLNRRGDEGTGWSRAWKI 660
Query: 652 ALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
WARL D A+++ K L + VD H G + NLF +HPPFQID N+G A V
Sbjct: 661 NFWARLGDGNRAWKLFKSLLHPAVDAATGGH-GSGTFPNLFCSHPPFQIDGNYGGAAGVG 719
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
EML+QS ++LLPALP D W++G +G++ RGG ++ + WKDG
Sbjct: 720 EMLLQSHEGFIHLLPALP-DSWTTGNFRGMRVRGGASIDLDWKDG 763
>gi|406867099|gb|EKD20138.1| hypothetical protein MBM_02090 [Marssonina brunnea f. sp.
'multigermtubi' MB_m1]
Length = 743
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 278/764 (36%), Positives = 403/764 (52%), Gaps = 77/764 (10%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA ++ ++PIGNGRLGAMV+G +E L+LNED++W G P D DA K L
Sbjct: 4 RLHYTTPATEWSQSLPIGNGRLGAMVYGRTTTELLQLNEDSVWYGGPQDRIPRDALKNLP 63
Query: 74 DVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R L+ + Q++EA K F H Y+ LG LEF H Y+RELDL
Sbjct: 64 RLRELIRAEQHSEAEDLVRKAFFATPHSKRHYEPLGTFTLEF--GHEDSEVTDYKRELDL 121
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLLD 182
TA A V+Y V++ R+ F+S PD VIV ++ SE + ++ + LD
Sbjct: 122 ETAIASVQYRYRGVDYKRKVFASGPDNVIVLQLKSSERVRATLRLTRVSEREYETNEYLD 181
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ + N + I+M PG R +P ++++K +D GT+ A+ L
Sbjct: 182 SVTASN-DGSIVMRA-TPGGR-------GSNP----LCCVVKVKC-EDGGTLEAV-GGCL 226
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+E S ++++ A + F P DP S ++ + R L+ L RH+++Y+
Sbjct: 227 VIE-SKATMIVISAQTKFRSP---------DPESAALE--DATRALTRGGLRGRHVENYR 274
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
L+ R+ +QL ++ TD K DP LV L +GRYLL+
Sbjct: 275 SLYARMKLQLGSPASELSTD----------------KRLLRSVDPGLVALYHNYGRYLLV 318
Query: 363 SSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
+SSRPG + A LQGIWN P W S +NIN +MNYW + CNL+EC+ PLFD L
Sbjct: 319 ASSRPGPRALPATLQGIWNPSFQPAWGSRYTININTQMNYWPANLCNLAECEMPLFDLLE 378
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
++I G +TAQ Y GW HH TDIWA + V +WP+ GAWLC H+WE+Y +
Sbjct: 379 RMAIRGKQTAQEMYGCRGWCAHHNTDIWADTDPQDRWVPATVWPLAGAWLCFHIWENYLF 438
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG--YLETNPSTSPEHEFIAPDGKLACVSY 538
LE R +P+L+G F+LD+L+E YL TNPS SPE+ F++ + + +
Sbjct: 439 NGSTTLLE-RMFPILKGSVQFILDFLVEDATSGQYLVTNPSLSPENTFLSANNREGVLCE 497
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK 598
ST+D+ II +F A I A L++ +D L+ V+ + RL P + G + EW +D+
Sbjct: 498 GSTIDIQIINALFGAFIDALGELDRTDD-LLPAVIHARDRLPPMAVGSLGQLQEWQKDYG 556
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWA 655
+ E HRH SHL+ L+PG I+ P L A+ L++R E G GWS W L A
Sbjct: 557 EHEPGHRHTSHLWALYPGSAISPNTTPGLAAASAVVLKRRAEHGGGHTGWSRAWLINLHA 616
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D E ++ VKRL N+ +HPPFQID NFG A + EML+Q
Sbjct: 617 RLGDAEGSWDHVKRLLG-----------DSTLPNMLDSHPPFQIDGNFGGCAGIVEMLIQ 665
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
S ++LLPA P +W SG +KG++ARGG + W DG + E
Sbjct: 666 SHDGFIHLLPACP-KEWKSGLLKGVRARGGFELDFAWDDGVVKE 708
>gi|255014859|ref|ZP_05286985.1| glycoside hydrolase family protein [Bacteroides sp. 2_1_7]
Length = 850
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 275/777 (35%), Positives = 411/777 (52%), Gaps = 55/777 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 57 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 116
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 117 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 176
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L + + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 177 LRYMYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 236
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 237 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 286
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G D L V + A++L+ + + FD KD + + L
Sbjct: 287 LPKGGDLTATDSSLSVRSASEAIILVSLGTDYFD----------KDGVGQFLEKYLSQAE 336
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L + +D +P ER+ +F D+
Sbjct: 337 SKDFSTLRREHTLAYRSLFDRVSLDLGKGERD------------HLPIHERLAAFAQDKN 384
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 385 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 444
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL + +G +TA+ Y A GWV H ++W + +A W
Sbjct: 445 TNLSELHLPLIELTKQQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 503
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 504 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 562
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + S MD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 563 AYKMPNGSVVSICAGSMMDNQILRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 621
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
+DG IMEW + +++ E HRH+SHL+GL+PG+ I+IE P+L +AA K+L+ RG++ G
Sbjct: 622 GKDGRIMEWLEPYEEVEPTHRHVSHLYGLYPGNEISIEHTPELAEAARKSLEVRGDQSTG 681
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANF 703
WS+ WK WARL D +HAY+++ L EH K + GG Y NLF AHPPFQID NF
Sbjct: 682 WSMAWKINFWARLQDGDHAYKLLGDLLRPCVDEHTKEVKGGGSYPNLFCAHPPFQIDGNF 741
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
G TA +AEML+QS + LPALP W +G GLK R G VS W +G L E
Sbjct: 742 GGTAGIAEMLIQSQSGLIEFLPALP-SAWKNGSFSGLKVRNGGEVSAKWTEGLLTEA 797
>gi|410102732|ref|ZP_11297657.1| hypothetical protein HMPREF0999_01429 [Parabacteroides sp. D25]
gi|409237859|gb|EKN30654.1| hypothetical protein HMPREF0999_01429 [Parabacteroides sp. D25]
Length = 809
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 275/777 (35%), Positives = 411/777 (52%), Gaps = 55/777 (7%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
N T + F+ PA+ + + +P+GNGR+G M GG+ E + LNE +LW+G D
Sbjct: 16 NLPDMQTGKEVSYYFDQPAQIWEETLPLGNGRIGMMPDGGIERENVVLNEISLWSGSKQD 75
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIE 109
NP A +L+++R L+ G+ EA K F P YQL G++
Sbjct: 76 TDNPYAYYSLANIRRLLFEGRNDEAQDLMYKTFVCKGTGSNLGDGANAPYGSYQLFGNLV 135
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L + + + YRR L+L+ A A V + GNV + RE F+S + V +
Sbjct: 136 LRYMYPNESDSIAEYRRRLNLSEAIASVSFKRGNVNYQREMFTSFSGDLGVIHLVADADR 195
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
+L+F++ ++ H+ ++ + + ++M G+ P + KG++F++ ++I
Sbjct: 196 ALNFSLGMNR--PEHATISLDGKDLLMRGQLP------DGVDTLEMKGMRFAS--RVRIV 245
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDSKKDPTSESMSA-LQSIR 286
+G D L V + A++L+ + + FD KD + + L
Sbjct: 246 LPKGGDLTATDSSLSVRSASEAIILVSLGTDYFD----------KDGVGQFLEKYLSQAE 295
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE- 345
+ +S L H Y+ LF RVS+ L + +D +P ER+ +F D+
Sbjct: 296 SKDFSTLRREHTLAYRSLFDRVSLDLGKGERD------------HLPIHERLAAFAQDKN 343
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DP L L FQFGRYLLISS+R G NLQG+W + W+ H+NINL+MN+W +
Sbjct: 344 DPGLAALYFQFGRYLLISSTRQGLLPPNLQGLWCNTIHTPWNGDYHLNINLQMNHWPAEV 403
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSE PL + +G +TA+ Y A GWV H ++W + +A W
Sbjct: 404 TNLSELHLPLIELTKQQVPSGERTAKAFYNARGWVTHILGNVW-EFTAPGEHPSWGATNT 462
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEH 524
AWLC HL+ HY YT+D+ +L + YP ++G A F +D L++ YL T P+TSPE+
Sbjct: 463 SAAWLCEHLYTHYQYTLDKAYL-RDVYPTMKGAALFFVDMLVQDPRTKYLVTAPTTSPEN 521
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
+ P+G + + S MD I+RE+F+ I AA +L + A ++ RL PT I
Sbjct: 522 AYKMPNGSVVSICAGSMMDNQILRELFTNTIEAAGILGV-DSAFAAELAAKRDRLMPTTI 580
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
+DG IMEW + +++ E HRH+SHL+GL+PG+ I+IE P+L +AA K+L+ RG++ G
Sbjct: 581 GKDGRIMEWLEPYEEVEPTHRHVSHLYGLYPGNEISIEHTPELAEAARKSLEVRGDQSTG 640
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANF 703
WS+ WK WARL D +HAY+++ L EH K + GG Y NLF AHPPFQID NF
Sbjct: 641 WSMAWKINFWARLQDGDHAYKLLGDLLRPCVDEHTKEVKGGGSYPNLFCAHPPFQIDGNF 700
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
G TA +AEML+QS + LPALP W +G GLK R G VS W +G L E
Sbjct: 701 GGTAGIAEMLIQSQSGLIEFLPALP-SAWKNGSFSGLKVRNGGEVSAKWTEGLLTEA 756
>gi|305665057|ref|YP_003861344.1| hypothetical protein FB2170_02120 [Maribacter sp. HTCC2170]
gi|88709809|gb|EAR02041.1| hypothetical protein FB2170_02120 [Maribacter sp. HTCC2170]
Length = 787
Score = 457 bits (1175), Expect = e-125, Method: Compositional matrix adjust.
Identities = 271/772 (35%), Positives = 424/772 (54%), Gaps = 72/772 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD---APK 70
K+ + PAK + A+P+GNGRLGAMV+G E ++LNED++W PG+ PD
Sbjct: 30 KLWYGKPAKEWMQALPVGNGRLGAMVFGDPNHERIQLNEDSMW---PGEADWPDYRGNSD 86
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRREL 128
L ++R+L++ G+ E + V+ F + V +Q +GD+ ++F++ + E Y R L
Sbjct: 87 DLEEIRNLLNEGKTGEVDSLIVEKFSYKTIVRSHQTMGDLYIDFENER---SVENYTRSL 143
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
+LN A Y G ++++ FSS PD V+V ++S + + F + ++ D+
Sbjct: 144 NLNDALITAAYQSGGNSYSQKVFSSKPDDVMVIELSTDATDGMDFTLRMNRPTDD----- 198
Query: 189 GNNQIIM----EGRCPGKRIPPKANANDDPK------GIQFSAILEIKISDDRGTISALE 238
GN + E K + + + D K G++F L ++ ++ GT++A +
Sbjct: 199 GNATVTTRNPSESEISMKGVVTQYSGKRDSKSFPLDYGVKFETRL--RVHNEGGTVTA-D 255
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+L ++G ++ LV ++SF ++ T +++ L+ + N S+ L H
Sbjct: 256 KGQLTLKGVKTVLIHLVGNTSFY--------HGENYTKKNLETLEKVNNSSFKTLLKNHT 307
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFG 357
DY++L++RV + L +D++P R++ + ++DP L LF++G
Sbjct: 308 KDYEELYNRVGLDLGG------------RELDSLPIDARLQRIKEGNDDPDLAAKLFKYG 355
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSR GT ANLQGIWNE ++ W++ H+NINL+MNYW + NLSE +P F+
Sbjct: 356 RYLLIASSRQGTNPANLQGIWNEHITAPWNADYHLNINLQMNYWPAEVANLSELHQPFFE 415
Query: 418 FLTYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+L + G TA+ Y + G + HH +D+WA + W W GG W H WE
Sbjct: 416 YLDRVLERGKNTAKKQYGINRGTMAHHASDLWATPFMRAERAYWGSWVHGGGWCAQHYWE 475
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLA 534
HY YT D++FL+ RAYP+L+G + F LDWL+ E ++ ++P TSPE+ + DG A
Sbjct: 476 HYRYTEDKEFLKNRAYPVLKGISEFYLDWLVWDETSKAWV-SSPETSPENSYFNADGNSA 534
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEW 593
VS+ S M II EVF ++ AA+VL +D ++V +L P + +DG ++EW
Sbjct: 535 AVSFGSAMGHQIIAEVFDNVLEAAKVL-GIQDEFTKEVKAKREKLFPGIVVGDDGRLLEW 593
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWK 650
+ + +PE HRH+SHL+ L PG IT + N + AA+KT+ R G G GWS W
Sbjct: 594 NEPYDEPEKGHRHMSHLYALHPGDEITAD-NSEAFAAAKKTIDYRLEHGGAGTGWSRAWM 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
L ARL D A +++ + + N+F HPPFQID NFGFTAAV
Sbjct: 653 INLNARLLDGNAAEENIRKFLEI-----------SIADNMFDEHPPFQIDGNFGFTAAVP 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
E+L QS L +LPALP + W +G + G+KARG V I WKDG+L ++G+
Sbjct: 702 ELLFQSHEGFLRILPALPAN-WKNGKINGIKARGDIEVDIEWKDGELVKLGL 752
>gi|256425749|ref|YP_003126402.1| alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
gi|256040657|gb|ACU64201.1| Alpha-L-fucosidase [Chitinophaga pinensis DSM 2588]
Length = 778
Score = 456 bits (1173), Expect = e-125, Method: Compositional matrix adjust.
Identities = 272/778 (34%), Positives = 420/778 (53%), Gaps = 51/778 (6%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M A + NPL + ++ PA + + +P+GNGRLG M GG+ +E + LN+ TLW+G P
Sbjct: 16 MPAALCKAQQNPLTLKYDKPAAVWEETLPLGNGRLGMMPDGGIQTEKVVLNDITLWSGAP 75
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEF 112
+ N +A K L ++ L+ G+ EA + K F P YQ LG+++++F
Sbjct: 76 QNANNYEAYKQLPKIQELLKEGRNDEAQSLMDKDFICTGKGSGDVPFGCYQTLGELQIQF 135
Query: 113 D-DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
D K Y R+L L A A Y V NV + RE+F+S D + +++ S++G L
Sbjct: 136 AYDKADKVEPTAYERKLSLQQAIASCSYKVNNVTYNREYFTSFGDDLSFIRLTASQAGKL 195
Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
+ +++ S + + N ++++ G+ ++ +D KG+Q+ A +K
Sbjct: 196 NLRITM-SRPEKAATRTENGELLLYGQL---------DSGNDTKGMQYQA--NVKAQLKG 243
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
GTI+ E+ L ++ + +L + A + F + +D KK ++ +A++ Y
Sbjct: 244 GTITT-EEHALVIKNATEVILYVAAGTDF-----HKNDFKKQISTVLATAVKK----PYE 293
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSL 349
H+ +Y KLF+RV + L + T+ + +R+ +F + D L
Sbjct: 294 AQKQAHMRNYTKLFNRVQVDLGKG------------TAGTLTTDKRLAAFYNNAAADNEL 341
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
L +QFGRYL I S+R G NLQG+W + W+ H+++N++MN+W NLS
Sbjct: 342 PVLFYQFGRYLTICSTRKGLLPPNLQGLWANQVHTPWNGDYHLDVNVQMNHWPVEVSNLS 401
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
E PL D + L G +TA+ Y A GWV H T++W + W G W
Sbjct: 402 ELNLPLADLVKGLVAPGQRTAKAYYNAPGWVAHVITNVWGFTEPGE-SASWGATKSGSGW 460
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIA 528
LC +LWEHY +T D+ +L YP+L+G A F LI+ G+L +PS+SPE+ F
Sbjct: 461 LCNNLWEHYAFTNDKKYLAD-IYPVLKGSAEFYNSLLIKDEKTGWLVMSPSSSPENAFYL 519
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
P+GK A + +T+D I+R++F+ II+A+ L + D E K P IA DG
Sbjct: 520 PNGKHASICIGATIDNQIVRDLFNNIITASTELGIDADFKKELQQKVALLPPPGVIAPDG 579
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
IMEW +D+K+ E HRH+SHL+GL+P IT E PDL AA+KTL+ RG++GP W+I
Sbjct: 580 RIMEWLEDYKETEPQHRHISHLWGLYPASLITAENTPDLAAAAKKTLEVRGDDGPSWTIA 639
Query: 649 WKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
+K WARL D +++++K L + GG+Y N+ +A PPFQID NFG TA
Sbjct: 640 YKLLFWARLQDGNRSFKLLKELLKPTARTDINYGAGGGVYQNMLSAGPPFQIDGNFGATA 699
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKW-SSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+AEML+QS + +LP++P D+W ++G VKGLKARG TV WKDG + I S
Sbjct: 700 GIAEMLIQSHAGFINILPSIP-DQWKATGSVKGLKARGNFTVDFAWKDGKVTSYRILS 756
>gi|325298040|ref|YP_004257957.1| alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
gi|324317593|gb|ADY35484.1| Alpha-L-fucosidase [Bacteroides salanitronis DSM 18170]
Length = 1004
Score = 456 bits (1172), Expect = e-125, Method: Compositional matrix adjust.
Identities = 264/767 (34%), Positives = 419/767 (54%), Gaps = 47/767 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PAK + + +P+GNGRLG M GG+ E + LNE ++W+G DY NP+A ++L +R
Sbjct: 232 YDKPAKQWEETLPLGNGRLGMMPDGGITKEHIVLNEISMWSGSEADYRNPEAAESLPRIR 291
Query: 77 SLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
L+ G+ EA F G +Q+L D+ + + + Y R L+
Sbjct: 292 QLLFEGKNKEAQELMYTSFVPKKPEKGGTFGCFQMLADMYINYTFPDTISQAKDYLRWLN 351
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+ A ++ + RE+F S V++ + +L F+++L H
Sbjct: 352 LDEGVAYTTFTKNATRYIREYFVSRNKDVMLIHLQADRPDALGFHLTLSRPERGHVRKLS 411
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ + G + N+ +GI+++AI +K+S + + D ++V +D
Sbjct: 412 EGKLEITGTL--------DSGNERQEGIRYAAIAGVKLSGKKSRMHTHADG-IEVSDADE 462
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A +++ A++S+ I +++++ S L + + +YQ+LFHR
Sbjct: 463 AWIIVSANTSYMKGEIYQTETQRLLDQALASDLTQAKQEA--------TGEYQQLFHRAG 514
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
I+L + T S+ + D +R+++FQT +DPSL L + +GRYLLISS+RPG+
Sbjct: 515 IELPEN------KTVSQLSTD-----KRLEAFQTQDDPSLAALYYNYGRYLLISSTRPGS 563
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
NLQG+W + W+ H NIN++MN+W PCNLSE +PL D + L +G +T
Sbjct: 564 LPPNLQGLWANGVMTPWNGDYHTNINVQMNHWPVEPCNLSELYQPLVDLIKRLVPSGEET 623
Query: 430 AQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
A+ Y A GWV+H T++W +S W GGAWLC HLWEHY YT ++ +L
Sbjct: 624 AKAFYGSEAKGWVLHMMTNVWNYTSPGE-HPSWGATNTGGAWLCAHLWEHYLYTGNKQYL 682
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIA--PDGKLACVSYSSTMDM 544
YPLL+G + F ++ E G+L T P++SPE+EF D V TMD+
Sbjct: 683 AD-IYPLLKGASEFFYSTMVREPEHGWLVTAPTSSPENEFYVSKKDRTPISVCMGPTMDI 741
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
++RE+++ +I AA +L + D+L LK + +L P +I++ G +MEW +D+++ +VH
Sbjct: 742 QLVRELYTHVIEAASIL--HTDSLYANQLKEASAQLPPHQISKKGYLMEWLKDYEETDVH 799
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH+SHL+GL PG+ I++ P+L +A + TL++RG+ G GWS WK WARL D A
Sbjct: 800 HRHVSHLYGLHPGNQISLYYTPELAEACKVTLERRGDGGTGWSRAWKINFWARLGDGNRA 859
Query: 664 YRMVKRLFNLVDPEHEKHFEG-GLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
Y + + L + H G G + NLF +HPPFQID N+G T+ ++EML+QS +
Sbjct: 860 YTLFRNLLYPAYTQENPHEHGSGTFPNLFCSHPPFQIDGNWGGTSGISEMLIQSQDGFIN 919
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
LLPALP D W G + G K RGG VS+ WK+G EV + ++ N
Sbjct: 920 LLPALP-DSWKEGNLYGFKVRGGAMVSMKWKEGKPVEVILTGGWNPN 965
>gi|189460419|ref|ZP_03009204.1| hypothetical protein BACCOP_01058 [Bacteroides coprocola DSM 17136]
gi|189432851|gb|EDV01836.1| GDSL-like protein [Bacteroides coprocola DSM 17136]
Length = 1006
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 260/767 (33%), Positives = 419/767 (54%), Gaps = 47/767 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + + +P+GNGRLG M GG+ E + LNE ++W+G +Y NPDA K+L ++R
Sbjct: 233 YDEPAAQWEETLPLGNGRLGMMPDGGIVKEHIVLNEISMWSGSEANYLNPDASKSLPEIR 292
Query: 77 SLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFD-DSHLKYAEETYRREL 128
L+ G+ EA F G +Q+LG++ LE H K Y R L
Sbjct: 293 RLLFEGKNKEAQELMYTSFVPKKPEKGGTYGTFQMLGNLFLEHQYGVHEKDVPADYHRWL 352
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL+ A +S GNV + RE+ S V++ + + GS++F ++L
Sbjct: 353 DLSKGIAYTTFSRGNVNYVREYVVSRDKDVMLIHLKANVPGSINFKMNLSRP------ER 406
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G+ + + EG+ + ++ G++++AI I R T + +++ + V+ +D
Sbjct: 407 GSVRKLAEGKL---ELYGSLDSGSSQTGVRYAAIAGI-TCKGRQTNQSTDEQSITVQNAD 462
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A +++ A +SF I +++ + L + + + + YQ LF+R
Sbjct: 463 EAWIVVSAKTSFLAGEIYETEADR--------ILNDALKSNLCETVSEAILSYQALFNRA 514
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
I+L + E + + + +R++ FQ +DPSL L + +GRYLLISS+RPG
Sbjct: 515 GIRLPEN-----------EAVSHLTTDQRIERFQQQDDPSLAALYYNYGRYLLISSTRPG 563
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ NLQG+W + W+ H NIN++MN+W NLSE PL D + L +G +
Sbjct: 564 SLPPNLQGLWANEPGTPWNGDYHTNINVQMNHWPVEQANLSELYLPLVDLVKRLVPSGEE 623
Query: 429 TAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+A+ Y A GWV+H T++W +A W GGAWLC HLWEHY ++ DR++
Sbjct: 624 SAKAFYGPQAKGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHLWEHYLFSGDRNY 682
Query: 487 LEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMD 543
L YP+++G + F ++ E G+L T P++SPE+ F P D V TMD
Sbjct: 683 LAD-IYPIMKGASEFFYSTMVREPKHGWLVTAPTSSPENAFYLPGKDRTPISVCMGPTMD 741
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
+ ++RE+++ +I A+ +L + A E + +++ L P +I++ G +MEW +D+++ ++H
Sbjct: 742 IQLVRELYTNVIEASHILH-TDTAYAEALQEAIGLLPPHQISKKGYLMEWLEDYEETDIH 800
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH+SHL+GL PG+ I++ K P+L +A KTL +RG+EG GWS WK WARL D A
Sbjct: 801 HRHVSHLYGLHPGNQISVLKTPELAEACRKTLNRRGDEGTGWSRAWKINFWARLGDGNRA 860
Query: 664 YRMVKR-LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
Y++ + L+ ++ G + NLF +HPPFQ+D N+G T+ ++EML+QS ++
Sbjct: 861 YKLFRSLLYPAYTAQNPTQHGSGTFPNLFCSHPPFQMDGNWGGTSGISEMLLQSQDGFIH 920
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
LLPALP + W G GLK RGG TV + WKDG + I + NN
Sbjct: 921 LLPALP-ESWKDGSFYGLKVRGGATVDLVWKDGKPVQATITGGWQNN 966
>gi|160887922|ref|ZP_02068925.1| hypothetical protein BACUNI_00326 [Bacteroides uniformis ATCC 8492]
gi|156862608|gb|EDO56039.1| hypothetical protein BACUNI_00326 [Bacteroides uniformis ATCC 8492]
Length = 820
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 271/765 (35%), Positives = 417/765 (54%), Gaps = 53/765 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 QLYYTSPAAIWEETLPLGNGRLGMMPDGGILREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPAD-------VYQLLGDIELEF----DDSHLKYAEE 122
+R L+ G+ EA F YQ+LGD++++F S L
Sbjct: 89 AIRQLLFEGKNREAQELMYSSFVPKKQETDGRYGTYQVLGDLDIDFTYNSSLSILNSPLN 148
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRR L+L A A + + +V++ RE+F S V++ + G+L+F+ L
Sbjct: 149 NYRRWLNLRDAVAYTAFRLEDVDYRREYFVSRDRDVMLIHLVAGHEGTLNFSARLSRAEH 208
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V GN ++M+G + P + G+++ +++ + ++S L
Sbjct: 209 SLVTVQGNT-LLMDGMLESGK--PGLD------GMKYRVAMQLVQNGGESSVSPENGICL 259
Query: 243 KVEGSDWAVL-----LLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K W +L A + F G + DS P + ++ +I + S S+
Sbjct: 260 KNGQEAWLILSAATSYAAAGTDFPGERYAEVCDSLLRPFTAPANSPCAILHSSLSN---- 315
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ ++ L+ RVS+ L +P D T+P+ ER+ F E P+L L + +
Sbjct: 316 HVTAHRSLYDRVSLTLPATPDD------------TLPTNERILRFTQQESPALAALYYNY 363
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISS+RPG+ NLQG+W +S W+ H NIN++MN+W LSE +PL
Sbjct: 364 GRYLLISSTRPGSLPPNLQGLWANGVSTPWNGDYHTNINIQMNHWPLEQAGLSELYQPLT 423
Query: 417 DFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L +G +A+ Y A GWV+H T++W +A W GGAWLC HL
Sbjct: 424 TLMERLIPSGEASARTFYGDEADGWVLHMMTNVW-NYTAPGEHPSWGATNTGGAWLCAHL 482
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKL 533
WEHY YT D+D+L +R YP+L+G A F + E G+L T P++SPE+ F P +
Sbjct: 483 WEHYLYTQDKDYL-RRIYPVLKGAARFFSSTTVQEPSHGWLVTAPTSSPENSFYVPGDSV 541
Query: 534 ACVS--YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
VS TMD+ ++ E+++ +I+AA +L+ + D V K+ L R P +I+++G +
Sbjct: 542 TPVSICMGPTMDVQLLTELYTNVIAAARLLDCDAD-YVAKLEVDLKRFPPMQISKEGYLQ 600
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+K+ +VHHRH+SHL+GL PG+ I+ E P+L +A TL +RG+EG GWS WK
Sbjct: 601 EWLEDYKEVDVHHRHVSHLYGLHPGNLISPESTPELAEACRMTLNRRGDEGTGWSRAWKI 660
Query: 652 ALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
WARL D A+++ K L + VD H G + NLF +HPPFQID N+G A V
Sbjct: 661 NFWARLGDGNRAWKLFKSLLHPAVDAATGGH-GSGTFPNLFCSHPPFQIDGNYGGAAGVG 719
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
EML+QS ++LLPALP D W++G +G++ RGG ++ + WKDG
Sbjct: 720 EMLLQSHEGFIHLLPALP-DSWTAGNFRGMRVRGGASIDLDWKDG 763
>gi|119491166|ref|XP_001263205.1| hypothetical protein NFIA_064720 [Neosartorya fischeri NRRL 181]
gi|119411365|gb|EAW21308.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 744
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 279/763 (36%), Positives = 408/763 (53%), Gaps = 64/763 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA ++ +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA + L +R
Sbjct: 6 YQQPAGNWEEALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQNRVPRDAFECLPRLR 65
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G +AEA + F HP Y+ LG + L+F H + YRR LD+ A
Sbjct: 66 SLIREGNHAEAEKLVRLAFFSHPISQRHYEPLGTLFLDF--GHAPEYMQNYRRSLDIERA 123
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVNGNN 191
T+RV+Y V+ RE +SNPD VI +I S+ + ++ S L+ + Y++
Sbjct: 124 TSRVEYEHKGVKVRREVIASNPDGVIAIRIQASQKTEFALRLTRMSELEYETNEYLD--- 180
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ E R I P + K + + +++ +DD+ +++ + +K L V D A+
Sbjct: 181 DVTAEDRTITMHITPGGH-----KSNRACCMAKVRTADDQDSVTQIGNKLL-VNAQD-AL 233
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+L+ A +++ D K+ +S+ +AL S +++ RH++DY+ L+ R+ +
Sbjct: 234 VLISAQTTY-----RCDDIDKEASSDLETALLH----STDEIWERHVNDYRSLYGRMELH 284
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
LS + D+ TD K + DP L+ L + RYLLIS SR +
Sbjct: 285 LSPNNCDMPTD----------------KRIKNSRDPGLIALYHNYCRYLLISCSRNEDKA 328
Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
A LQGIWN P W +NINL+MNYW + CNLS+C+ PLF L ++ +G +
Sbjct: 329 LPATLQGIWNPSFHPAWGCKYTININLQMNYWPANICNLSDCEMPLFSLLERVAKSGEEA 388
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
AQ Y GWV HH TDIWA +S + LWP+GGAWLC H+W+H+ +T D+ FL+
Sbjct: 389 AQTMYGCRGWVAHHCTDIWADTSPVDTWMPATLWPLGGAWLCVHIWDHFRFTRDKGFLQ- 447
Query: 490 RAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
R +P+L+GC FLLD+L+E G YL TNPS SPE+ F +G+ + ST+D+ I+
Sbjct: 448 RMFPILQGCVQFLLDFLVEDASGEYLVTNPSLSPENTFYDKNGERGVLCEGSTIDIQIVN 507
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
V SA + + E LE E L L +L RL P +I G + EWA D+ + E HRH+S
Sbjct: 508 AVLSAYLKSVEELEI-EAKLAPAALDALHRLPPLRIGSYGQLQEWASDYAEVEPGHRHVS 566
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYR 665
HL+ L PG TI+ E P + A L +R G GWS W L ARL E +
Sbjct: 567 HLWALHPGDTISPETTPKIADACSVALHRRETHGGGHTGWSRAWLINLHARLLAAEECAK 626
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY-LL 724
V L NL HPPFQID NFG A + EMLVQS + LL
Sbjct: 627 HVDLL-----------LAHSTLPNLLDTHPPFQIDGNFGAGAGILEMLVQSYEEGIIRLL 675
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE-VGIYSNY 766
PA P WSSG ++ + ARGG + W++G + + V +YS +
Sbjct: 676 PACP-KAWSSGSLRNICARGGFKLDFSWENGQIKDAVTVYSEF 717
>gi|383812006|ref|ZP_09967453.1| hypothetical protein HMPREF9969_0999 [Prevotella sp. oral taxon 306
str. F0472]
gi|383355392|gb|EID32929.1| hypothetical protein HMPREF9969_0999 [Prevotella sp. oral taxon 306
str. F0472]
Length = 781
Score = 455 bits (1171), Expect = e-125, Method: Compositional matrix adjust.
Identities = 267/773 (34%), Positives = 414/773 (53%), Gaps = 64/773 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+++ +N PA F +++P+GNG+LGA+V+GG +T+ LN+ T WTG P D N KA
Sbjct: 1 MRLWYNQPAHFFEESLPLGNGKLGALVYGGTQKDTIYLNDITYWTGNPVD-PNEGLGKAK 59
Query: 72 -LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ ++R + + Y A + + G + YQ LG + + ++ A Y REL+L
Sbjct: 60 WIPEIRKALFAENYRTADSLQHFVQGEQSASYQPLGTLNIINLNTG---AVSNYYRELNL 116
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
++A + Y ++FTRE+F+++ D +I I +++G+++ + L + H N
Sbjct: 117 DSALVHISYQQNGIQFTREYFATHRDSLIAIHIKANQAGAINLRIQLTAQTP-HKVKATN 175
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
NQ+ M G G A +++ G + A D L + +D A
Sbjct: 176 NQLTMTGHTTGSETE------------SVHACTIVRLLPQGGKVIA-SDSTLTLTNADNA 222
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ +V ++SF+G +P +++A +N +Y++ RH+ +YQ++++RV +
Sbjct: 223 TIYIVNATSFNGFDKHPVKDGASYIDNAVNAAWHTQNFTYNEFKDRHIKEYQQIYNRVKL 282
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-------SLVELLFQFGRYLLIS 363
+L ++E + +P+ + ++ + + P L L FQFGRYLL+S
Sbjct: 283 KLG-----------NKEYTNNLPTDQLLRRYSSSTAPLPEAAQRYLETLYFQFGRYLLLS 331
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SR ANLQG+W L W +NINLE NYW + P N+SE +PL F+ LS
Sbjct: 332 CSRTPNIPANLQGLWTPHLFSPWRGNYTMNINLEENYWPADPANMSETIQPLIGFVKGLS 391
Query: 424 INGSKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVV--WALWPMGGAWLCTHLWEHYN 479
G TA+ Y + GW H +D W K+S GK WA W +GGAWL LW+HY
Sbjct: 392 ATGKHTARNFYGINEGWCAAHNSDPWCKTSPVGEGKESPEWANWNLGGAWLVNALWDHYL 451
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD--GYLETNPSTSPEHEFIAPDGKLACVS 537
Y+ D+ L+ YPL+EG + F WL+ + L T PSTSPE+E++ G
Sbjct: 452 YSQDKQLLQNTIYPLMEGSSKFFQQWLVTNPNKPNELITAPSTSPENEYVTDKGYHGTTC 511
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
Y T D+AIIRE+F + A + L D +++ L RL P + G + EW D+
Sbjct: 512 YGGTADLAIIRELFMNMQQARKSLGLKPD---KEIDDKLHRLHPYTVGSQGDLNEWYYDW 568
Query: 598 KDPEVHHRHLSHLFGLFPGHTITI----EKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
KD ++HHRH SHL GL+PG + K+ + AA +TL ++G+E GWS W+ L
Sbjct: 569 KDYDIHHRHQSHLIGLYPGMHLQALAKQTKDSTILAAARQTLIQKGDESTGWSTGWRINL 628
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKH----FEGGLYSNLFAAHPPFQIDANFGFTAAV 709
WARL D HAY++ + L + V PE + GG Y NLF AHPPFQID NFG TA V
Sbjct: 629 WARLGDGNHAYKIYQNLLSYVSPEGYRGKDAVHHGGTYPNLFDAHPPFQIDGNFGGTAGV 688
Query: 710 AEMLVQSTLN--------DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
EMLVQS+++ +++LLPALP D W++G +KG++ RGG T+ + W++
Sbjct: 689 CEMLVQSSVDMTAKKPIYNIHLLPALP-DAWANGEIKGIRTRGGLTIDMKWEN 740
>gi|386820649|ref|ZP_10107865.1| hypothetical protein JoomaDRAFT_2613 [Joostella marina DSM 19592]
gi|386425755|gb|EIJ39585.1| hypothetical protein JoomaDRAFT_2613 [Joostella marina DSM 19592]
Length = 780
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 273/760 (35%), Positives = 404/760 (53%), Gaps = 64/760 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA + +A+P+GNGR+GAM++GG+ +E +LNED++W G P + L+ +R
Sbjct: 27 YSQPADTWMEALPVGNGRMGAMIYGGIETEHFQLNEDSMWPGSPNLSNAKGTAEDLALIR 86
Query: 77 SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
L+D G+ EA + + F V +Q GD+ L F + + Y+R LD AT
Sbjct: 87 KLIDEGKVHEADSLIIDKFSRQDIVRSHQTAGDLFLHFKN---RGEVTNYKRSLDFEKAT 143
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH------SYVN 188
+ V YSV F FSS PD V+V K+ S + F++ + D +
Sbjct: 144 SYVSYSVDGNTFKETAFSSQPDNVLVIKLETSNRNGMDFDIEMSRPKDEGVETVKVATFP 203
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
++M G ++ G++F L++K G I++ +L V +
Sbjct: 204 EKQLMLMNGEVTQMGGVVESVPTPIKNGVKFQTRLKVK--SKSGIITS-NGNRLTVRNAK 260
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+LL+ +S+ P D ++ +++ + Y L H+ D++ L++RV
Sbjct: 261 EVLLLIATETSYYHP---------DYIEKAELVIENAESKGYKALVNNHIQDFKNLYNRV 311
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
S+ I TD ++E P+ +R++ ++ D L E LF +GRYLLISSSR
Sbjct: 312 SLH-------IETDNSNKE----FPTDKRLERYKAGVVDVGLQETLFNYGRYLLISSSRK 360
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
GT ANLQGIWN ++ W++ H+NINL+MNYW + NL+EC+ PLFDF L I G
Sbjct: 361 GTNPANLQGIWNNHITAPWNADYHLNINLQMNYWLAPITNLAECELPLFDFGNRLIIRGK 420
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+TA+ + G + HH TD+W + W W G WL H W +Y +T D FL
Sbjct: 421 ETAKQYGINRGSMSHHATDLWGPAFMRARTPYWGAWIHGAGWLAQHYWGYYLFTEDEVFL 480
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETN------PSTSPEHEFIAPDGKLACVSYSST 541
+++ YP L+ A+F LDWL Y E+ P TSPE+ +IA DGK A VS +
Sbjct: 481 KEQGYPYLKEVATFYLDWL-----QYDESTKEWFSYPETSPENSYIANDGKPAAVSRGTA 535
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQDFKDP 600
M II EVF IISA+E+L +D L+++V K LRP +I DG ++EW +++++
Sbjct: 536 MGQQIIGEVFRNIISASEILAI-DDELIKEVKKKAENLRPGVQIGADGRVLEWDKNYEEA 594
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARL 657
E HRH+SH++ L+PG+ IT E PD KAA+K+++ R G EG GWS W ARL
Sbjct: 595 EKGHRHISHMYALYPGNKITPE-TPDAFKAAQKSIEYRLEHGGEGTGWSRVWMINFNARL 653
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D A + K FE + NLF HPPFQID NFG+TA +AE+L+QS
Sbjct: 654 LDAMSAEENIN-----------KFFEKSIAPNLFDEHPPFQIDGNFGYTAGIAELLLQSH 702
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
+ +LP LP +W SG + GLKARG V I W +G L
Sbjct: 703 EGFIRILPTLP-KQWKSGTISGLKARGNIEVDITWNNGKL 741
>gi|405378422|ref|ZP_11032344.1| hypothetical protein PMI11_02312 [Rhizobium sp. CF142]
gi|397325094|gb|EJJ29437.1| hypothetical protein PMI11_02312 [Rhizobium sp. CF142]
Length = 750
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 292/789 (37%), Positives = 419/789 (53%), Gaps = 60/789 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ PA+ +TDA+P+GNGRLGAMV+G SE L++N+ T W G P NPD+ L +R
Sbjct: 10 YDAPARLWTDALPLGNGRLGAMVFGDPVSERLQINDSTFWAGGPYRPVNPDSYGHLEKIR 69
Query: 77 SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ +G YAEA A + + L P YQ +GD+ ++F S +YRR LDL+TA
Sbjct: 70 ELIFAGHYAEAEAMAEEHLMARPIKQMSYQPIGDLHIDFLHSQ---TIGSYRRTLDLDTA 126
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y + F RE F S D V+V ++S G++ +SLDS + +
Sbjct: 127 IATTSYVADGITFFREAFISTVDGVLVLRLSADRPGAIRCRISLDSPQQGQLFDQDAAGL 186
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
G GK A A ++F+ + + + G + + V+ +D V+L
Sbjct: 187 TFSGT--GKAEWGIAAA------LRFAFGIRVI---NTGGSLSSSSGIISVDSTDELVIL 235
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A++SF D DP + L S + H+ ++Q+LF +I L
Sbjct: 236 LDAATSFR----RFDDVSGDPDGAITARLSKATGHSIEAMRRDHIIEHQRLFRAFAIDLG 291
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
T S P+ R+ F EDP+L L QFGRYL+I+SSRPGTQ AN
Sbjct: 292 ------TTQAASH------PTDRRIAGFADGEDPALAALYVQFGRYLMIASSRPGTQPAN 339
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIWNE++ P W S NINL+MNYW P NL +C PL + L+ G +TAQV+
Sbjct: 340 LQGIWNEEVDPPWGSKYTANINLQMNYWLPAPANLPQCIVPLVEMAEELAEAGRETAQVH 399
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
Y A GWV+HH TD+W + G W LWP GGAWL T L + +Y D D L +R +P
Sbjct: 400 YRARGWVMHHNTDLWRATGPIDG-AKWGLWPTGGAWLMTQLLDLSDYLDDADRLRRRLFP 458
Query: 494 LLEGCASFLLDWL--IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
+ + A F+ D L + G + YL T PS SPE+ + P G C MD IIR+
Sbjct: 459 VAKAAAEFVFDALASLPGTN-YLVTTPSLSPEN--VHPHGASICA--GPAMDNQIIRDFL 513
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ--DFKDPEVHHRHLSH 609
+ + A + ED V ++ + LPRL P +I G + EW + D + PE+HHRH+SH
Sbjct: 514 NLLRPIATSI-GGEDEFVSEIDRVLPRLPPDRIGSAGQLQEWLEDWDLQAPEMHHRHVSH 572
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKR 669
L+GL+P I ++ P L AA ++L+ RG++ GW I W+ LWARL D +HA +VK
Sbjct: 573 LYGLYPSWQIDMDNTPALAAAARRSLEIRGDDATGWGIGWRINLWARLRDGDHALEVVKL 632
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPW 729
L+ PE Y+NLF AHPPFQID NFG A + EMLVQS +++LLPALP
Sbjct: 633 ---LISPERT-------YANLFDAHPPFQIDGNFGGAAGILEMLVQSRPGEIHLLPALP- 681
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLS 789
W G ++GL+ RGG + + W++G ++ I + D + + + L+
Sbjct: 682 KAWPRGSLRGLRVRGGMLLDLDWENGRPVKIAISAA-----RDIQTAIRFADGRFTITLT 736
Query: 790 AGKIYTFNR 798
AG+ + ++
Sbjct: 737 AGQTFMASK 745
>gi|300771448|ref|ZP_07081323.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33861]
gi|300761437|gb|EFK58258.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33861]
Length = 778
Score = 453 bits (1166), Expect = e-124, Method: Compositional matrix adjust.
Identities = 275/781 (35%), Positives = 417/781 (53%), Gaps = 70/781 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ +N LK+ ++ AK + + +P+GNG +G M GGV E + LNE ++W+G D N
Sbjct: 22 VAQSNSLKLWYDKAAKRWEETLPLGNGLIGMMPDGGVQKEKIVLNEISMWSGSEEDPNNY 81
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLF-------GH------PADVYQLLGDIELEFD 113
A K++ +++ L+ G+ EA K F GH P YQ LG + L+F
Sbjct: 82 TAYKSVGEIQKLLFEGKNDEAERLVNKNFVTSGKGSGHGNGANVPFGCYQNLGFLNLQFT 141
Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
++ Y R LDL A AR +++ V++TRE+F+S V V +++ S+ G+L+F
Sbjct: 142 GTN---QPTGYERSLDLKDAVARTHFTINGVKYTREYFTSYDQNVGVVRLTSSKKGALNF 198
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
+ SL S + Y + N+ M G + P D GI FS+ + I RG
Sbjct: 199 SASL-SREERARYTSKGNEFSMSG------VLPDGKGGD---GISFSSKIRIF---HRGG 245
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L V + ++ A++S+ P DP L+ + Y L
Sbjct: 246 KVAASDTALTVSKASEVLIFFAAATSYFHP---------DPQQYVNEQLKLAYDTPYPQL 296
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT--VPSAERVKSFQTD--EDPSL 349
+ +HL Y+ +F+RV +QL E++ID + + +R+++F + +D L
Sbjct: 297 FKQHLSRYESVFNRVDLQL-------------EDDIDKSDITTDKRLRAFYDNPAQDNGL 343
Query: 350 VELLFQFGRYLLISSSRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
L +QFGRYL ISS+ P + A NLQG+W + W+ H+NIN +MN+W
Sbjct: 344 AALYYQFGRYLNISSTAPDVKGALPPNLQGLWAHQIQTPWNGDYHLNINAQMNHWGVEVN 403
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
NLSE P + + ++ G KTA+ Y A GWV++ T++W S+ + W
Sbjct: 404 NLSEYHTPFIELIKKIAKTGEKTARAYYNAPGWVVYMMTNVWGYSAPGE-QASWGASTAS 462
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE 525
G WLC HLWEHY +T D +L K YP+++G A F ++ + G+L T+PS SPE+
Sbjct: 463 G-WLCNHLWEHYQFTKDSVYL-KEVYPVMQGAARFYAHTMVTDPKTGWLVTSPSVSPENA 520
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPT 582
F +GK A V +D I+RE++ +I A +L ++ D L ++ + P P
Sbjct: 521 FRMKNGKTAAVVMGPAIDNQIVRELYKNLIDADSILGQHNTFTDTLRTQIQQLAP---PV 577
Query: 583 KIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG 642
I++ G + EW +D+++ E HRH+SHL+GL+P + I+ + P AA+KTL RG+EG
Sbjct: 578 LISKSGRVQEWLEDYEEVEPQHRHVSHLYGLYPANFISPQITPQYVDAAKKTLTVRGDEG 637
Query: 643 PGWSITWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDA 701
GWS WK WARL D H+ ++++L + + GG Y NLF AHPPFQID
Sbjct: 638 TGWSRAWKILFWARLQDGNHSLEILRQLLKPAYRDDTDYRAGGGTYPNLFCAHPPFQIDG 697
Query: 702 NFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVG 761
NFG +A +AEML+QS ++LLPALP W SG VKGLKARGG T+ + WKDG + E
Sbjct: 698 NFGGSAGIAEMLIQSHSGFIHLLPALP-SAWKSGQVKGLKARGGHTIDMIWKDGRVLEYK 756
Query: 762 I 762
I
Sbjct: 757 I 757
>gi|332663343|ref|YP_004446131.1| alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
gi|332332157|gb|AEE49258.1| Alpha-L-fucosidase [Haliscomenobacter hydrossis DSM 1100]
Length = 818
Score = 452 bits (1164), Expect = e-124, Method: Compositional matrix adjust.
Identities = 274/815 (33%), Positives = 426/815 (52%), Gaps = 73/815 (8%)
Query: 1 MMNAES--TSTTNPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
+NA+S NP + + PA+ + +A+P+GNGRLGAMV+G E ++LNE+T WT
Sbjct: 18 FVNAQSFDQPNFNPSTVLWYKEPAQKWEEALPVGNGRLGAMVFGKSGEERIQLNEETYWT 77
Query: 58 GVPGDYTNPDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDD 114
G P + L +++ V G+ +A + G+P + YQ L ++ L F +
Sbjct: 78 GGPYSTVVKGGHEVLPEIQKYVFEGKMLKAHNLFGRRTMGYPVEQQKYQSLANLHLFFAE 137
Query: 115 SHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
+ Y+R LDL T V+Y V V + R+ F S PDQV+V +++ SE+ +SF
Sbjct: 138 AE---PATVYKRWLDLETGITSVEYRVQEVRYRRDVFVSAPDQVVVLRLTASEAQKISFK 194
Query: 175 VSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE--IKISDDRG 232
+L + + G + M+ G+ + D G++ E +K+ + G
Sbjct: 195 ANLRGVRNPAHSNYGTDYFTMDPY--GQDGLMLKGKSSDYLGVEGKLRFEGQVKVVAEGG 252
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
T+ +D L VE +D + A+++F +N D DP + + +++ SY
Sbjct: 253 TVRT-DDVDLWVEKADAVTVYFTAATNF----VNYHDVSADPHARVEAVWKNMAGKSYPQ 307
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
+ + D+QK F R ++QL + + P+ ER+ + Q DPSL L
Sbjct: 308 IRDAAVKDHQKYFQRTTLQLEIAASSYL------------PTNERMLNIQKTADPSLAAL 355
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+ FGRYLLI SSRPGTQ ANLQGIWN D++P WDS NIN EMNYW + NL EC
Sbjct: 356 CYNFGRYLLIGSSRPGTQPANLQGIWNNDMNPAWDSKYTTNINTEMNYWPAETGNLPECV 415
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
EPL + L GS+ A+ +Y GWV H TD+W + +A W + GGAWLCT
Sbjct: 416 EPLIQMVKELMDQGSQVAKEHYGCRGWVFHQNTDLW-RVAAPMDGPSWGTFTTGGAWLCT 474
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDG 531
LWEHY ++MD+++L K YP+++G F +D+L+E D +L TNPSTSPE+ +P
Sbjct: 475 QLWEHYLFSMDKEYL-KEIYPVMQGSVQFFMDFLVETPDKKWLVTNPSTSPENFPASPGN 533
Query: 532 KL------------ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
+ + Y S++DM I+ ++F + A+ +L+ +++ KV + R
Sbjct: 534 QPYFDEVTGMNLPGTTICYGSSIDMQILSDLFGYYVQASALLQVDQE-FAAKVAAARKRF 592
Query: 580 RPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
P +I +DG++ EWA+D+ E HRH SHL+GL+PG+ ++ + P ++ L++RG
Sbjct: 593 PPPQIGKDGALQEWAEDWGQLEKAHRHYSHLYGLYPGNVLSTWRTPQWIAGVKQVLEQRG 652
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFA-AHPPFQ 698
+E GWS WK LWARL+D + +D + + + Y LFA + P Q
Sbjct: 653 DEASGWSRAWKMCLWARLYDGDR-----------LDKIFKGYLKDQAYPQLFAKCYTPMQ 701
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
+D +FG A V E LVQS ++LLPALP W +G + G + RGG + WK G +
Sbjct: 702 VDGSFGVAAGVMEALVQSHEGRIHLLPALP-SAWHTGSLNGTRVRGGFLLDFSWKAGKVQ 760
Query: 759 EVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKI 793
+ + SN G S ++ ++ GK+
Sbjct: 761 QAKLVSN--------------AGQSCRLKIAEGKL 781
>gi|414868294|tpg|DAA46851.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
Length = 353
Score = 452 bits (1163), Expect = e-124, Method: Compositional matrix adjust.
Identities = 209/317 (65%), Positives = 256/317 (80%), Gaps = 3/317 (0%)
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
FLEK AYPLLEG A FLLDWLIEGH GYLETNPSTSPEH FIAPDGK ACVSYS+TMD++
Sbjct: 34 FLEKTAYPLLEGSARFLLDWLIEGHRGYLETNPSTSPEHYFIAPDGKEACVSYSTTMDIS 93
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
IIREVFSA+I +A++L K++ +V+++ K+LP L P K+A DG+IMEWAQDF+DPE+HHR
Sbjct: 94 IIREVFSALILSADILGKSDTNVVQRIKKALPNLPPMKVARDGTIMEWAQDFQDPEIHHR 153
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H+SHLFGL+PGHT+++E+ PDLC+A +L KRG+EGPGWS +WK LWARLH+ +HAY+
Sbjct: 154 HVSHLFGLYPGHTMSLEETPDLCRAVANSLYKRGDEGPGWSTSWKMVLWARLHNSDHAYK 213
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
M+ +L LVDPEHE EGGLYSNLF AHPPFQIDANFGF AA++EMLVQST DLYLLP
Sbjct: 214 MILQLITLVDPEHEVSREGGLYSNLFTAHPPFQIDANFGFPAALSEMLVQSTGTDLYLLP 273
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVK 785
ALP +KW G VKGLKARGG TV+I WK+G LHE ++S+ N + LHY
Sbjct: 274 ALPRNKWPQGYVKGLKARGGVTVNISWKEGSLHEALLWSSGGQN---TLSRLHYGDQIAT 330
Query: 786 VNLSAGKIYTFNRQLKC 802
V+LS+G++Y F+ LKC
Sbjct: 331 VSLSSGQVYRFSMDLKC 347
>gi|395804734|ref|ZP_10483969.1| glycoside hydrolase [Flavobacterium sp. F52]
gi|395433122|gb|EJF99080.1| glycoside hydrolase [Flavobacterium sp. F52]
Length = 778
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 278/779 (35%), Positives = 427/779 (54%), Gaps = 60/779 (7%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S S N L++ + PA + + +P+GNGRLG M GG+ +E L LN+ TLW+G P D N
Sbjct: 18 SFSQNNQLELWYTKPASQWEETLPLGNGRLGIMPDGGIETEKLVLNDITLWSGSPQDANN 77
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEF 112
A L +R L+ + + +EA + F G A+V YQ+LGD+ L+F
Sbjct: 78 YKAYTFLPQIRELLLANKNSEAEQLINQNFVCTGPGSGSGDGANVQFGCYQVLGDMTLKF 137
Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
D K Y R L++ TA A ++++ V + RE+F+ D V+ K++ S+ G L+
Sbjct: 138 D-YKTKSKAINYSRNLNIQTALASTQFTIDGVIYKREYFAGFGDDVLFVKLTSSKKGKLN 196
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
F V LD ++ VN +N ++M G+ N D KG+++ A ++ K +D G
Sbjct: 197 FTVKLDRS-EHFKTVNSDNSLVMTGQL---------NNGIDGKGMKYKAKVKAKTAD--G 244
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
++ + ++V+ + VL + A + F ++ D T E ALQ Y +
Sbjct: 245 SV-LYTNNTIEVKNATEVVLYVSAGTDFKNQNF---ETAVDKTLEI--ALQK----KYDE 294
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLV 350
H+ +YQKLF+RV++ ++ ++ T+P+ ER+ +F D D L
Sbjct: 295 QKKTHIQNYQKLFNRVALNFGKTARN------------TLPTNERLDAFMKNPDSDTGLP 342
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
L +Q+GRYL ISS+R G NLQG+W + W+ H+++N++MN+W NLSE
Sbjct: 343 VLFYQYGRYLSISSTRVGLLPPNLQGLWAHQIQTPWNGDYHLDVNVQMNHWALETGNLSE 402
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
PL D + + G KTA+ Y A GWV H T+IW + W + G WL
Sbjct: 403 LNLPLKDLVKEMVPYGEKTAKAYYNADGWVAHVITNIWGFTEPGE-SASWGIAKAGSGWL 461
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAP 529
C +LW HY YT D+ +L YP+++G A F L++ + G+L T+PS SPE+ F P
Sbjct: 462 CNNLWNHYLYTNDQAYLAD-IYPIIKGAAQFYNSMLVKDPETGWLVTSPSVSPENSFFLP 520
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAED 587
+G+ A V T+D I+RE+F+ +I+A+ L + A +EK LK LP P ++ D
Sbjct: 521 NGQDAHVCMGPTIDNQIVRELFNNVIAASSKLGLDNTLKAELEKRLKLLPP--PGVVSPD 578
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
G I EW + +K+P+ HRH+SHL+GL+P IT E P+L +AA+K L+ RG++GP WSI
Sbjct: 579 GRIQEWLKPYKEPDPQHRHVSHLYGLYPAPLITPESTPELAEAAKKILEVRGDDGPSWSI 638
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDANFGFT 706
+K W+RL + AY+++K + + + GG+Y NL +A PPFQID NFG
Sbjct: 639 AYKMLFWSRLKEGNRAYKLLKTILRPTLATNINYGAGGGVYPNLLSAGPPFQIDGNFGAA 698
Query: 707 AAVAEMLVQSTLNDLYLLPALPWDKW-SSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
A + EML+QS + LLPA+P D W G VKGLKA G T+++ W+ G + + I S
Sbjct: 699 AGIGEMLIQSHAGFIELLPAMP-DVWLKEGEVKGLKAEGNFTINMKWEKGKVTKYEILS 756
>gi|427388255|ref|ZP_18884138.1| hypothetical protein HMPREF9447_05171 [Bacteroides oleiciplenus YIT
12058]
gi|425724838|gb|EKU87712.1| hypothetical protein HMPREF9447_05171 [Bacteroides oleiciplenus YIT
12058]
Length = 829
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 269/772 (34%), Positives = 424/772 (54%), Gaps = 62/772 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY NPDA ++L
Sbjct: 33 QLYYTTPATIWEETLPLGNGRLGMMPDGGIDKEHIVLNEISLWSGMEADYGNPDASRSLP 92
Query: 74 DVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLK--YAEET- 123
++ L+ G+ EA F G YQ+L D+ L F K ++ +T
Sbjct: 93 AIQQLLFEGKNREAQELMYNSFVPKKPESGGTYGNYQMLADLTLNFSIPVKKEFFSGDTV 152
Query: 124 ----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
YRR LDL A A ++ G +++ RE+++S V++ ++ S SL F SL
Sbjct: 153 PVTGYRRWLDLRDAVAYTTFTKGGIDYQREYYTSRDKDVMIIHLTASRRRSLFFTASLSR 212
Query: 180 LLDNH-SYVNGNNQ----IIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKISDD 230
S+V GN + +++EG PG+ G+++ + + D
Sbjct: 213 PQQGTVSFVPGNGKESGTLLLEGVLDSGKPGQ------------DGMKYRVAMRVVSKDG 260
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSIRNL 288
+ ISA E+ + +G++ A L++ A++S+ + S S+ +S+ +A QS L
Sbjct: 261 KQHISA-ENGVMLTQGTE-AWLVISATTSYAAAGTDFSGSRYKEVCDSLLNAATQSHSQL 318
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
S + ++ +++L+ RVS+ L + D +P+ ER+ F E P+
Sbjct: 319 SILNSQLKNAS-HRELYDRVSLTLPATEDD------------ALPTNERIVRFTERESPA 365
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L L + +GRYLLISS+RPG+ NLQG+W + W+ H NIN++MN+W L
Sbjct: 366 LATLYYNYGRYLLISSTRPGSLPPNLQGLWANGIQTPWNGDYHTNINIQMNHWPLEQAGL 425
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
SE +PL + L +G +TA Y A GWV+H T++W +A W G
Sbjct: 426 SELYQPLTTLIERLVPSGKETACTFYGNRAQGWVLHMMTNVW-NYTAPGEHPSWGATNTG 484
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE 525
GAWLCTHLWEHY YT D ++L K+ YP+L+G + F ++ E G+L T P++SPE+
Sbjct: 485 GAWLCTHLWEHYQYTQDLEYL-KKIYPILKGASEFFYSTMVQEPKHGWLVTAPTSSPENA 543
Query: 526 F-IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
F + D + TMD+ ++ E+++ ++ AA +L K +D K+ +L + P +I
Sbjct: 544 FFVGDDPTPVSICMGPTMDVQLLTELYTNVVQAASIL-KCDDGYAAKLRAALEKFPPMQI 602
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
+++G + EW +D+K+ +VHHRH+SHL+GL PG+ I+ + P+L A TL +RG+ G G
Sbjct: 603 SKEGYLQEWLEDYKEQDVHHRHVSHLYGLHPGNLISPDATPELANACRVTLNRRGDGGTG 662
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANF 703
WS WK WARL D + A+ + K L + VDP+ ++H G + NLF +HPPFQID N+
Sbjct: 663 WSRAWKINFWARLGDGDRAWTLFKSLLHPAVDPQTKRH-GSGTFPNLFCSHPPFQIDGNY 721
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
G A + EML+QS ++LLP LP W +G G+KARGG +V + WKDG
Sbjct: 722 GGAAGIGEMLMQSHEGFIHLLPTLP-KSWHTGNFHGMKARGGISVDLEWKDG 772
>gi|326798066|ref|YP_004315885.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326548830|gb|ADZ77215.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 794
Score = 451 bits (1159), Expect = e-124, Method: Compositional matrix adjust.
Identities = 272/819 (33%), Positives = 427/819 (52%), Gaps = 82/819 (10%)
Query: 8 STTNPLKITFNGPAKHFT-DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN- 65
S L++ ++ PA + +A+PIGNG +GAM +GG+ E ++ +E +LW+G PG N
Sbjct: 25 SQQKALQLWYDRPATDWMREALPIGNGYIGAMFFGGIGEEQIQFSEGSLWSGGPGANPNY 84
Query: 66 -----PDAPKALSDVRSLVDSGQYAEAT---------AASVKLFGHPAD-----VYQLLG 106
P+A K L +VR+L+ G+ EA A VKL G D Q +G
Sbjct: 85 NFGNRPNAWKYLGEVRALIKQGKLKEANELVEKQMTGMAPVKLAGDSTDWGDYGAQQTMG 144
Query: 107 DIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS 166
D+ ++ H + YRR LD+ A +V YSV ++ R F S P V+V K +
Sbjct: 145 DLFIKV--GHGSIPVQDYRRTLDIQRAIGKVSYSVAGNKYQRSFFGSYPQGVMVYKFTSD 202
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
+S S + + S + S+ + G P ++ + + + + +
Sbjct: 203 KSESYTLHFSTPQYKEKESFEGLRYSCV--GYVPNNKLAFET---------AYQLVTDGR 251
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
+ GT+S + K L +++ A++++ + P + D S L + +
Sbjct: 252 VKYTNGTVSVEKAKSL--------LIIHTAATAYTMQY--PHYNGNDFRSIIKKRLDAAK 301
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS-FQTDE 345
SY L+ H +DYQ LF RVS QL ++ D +P+ +R ++ F+ E
Sbjct: 302 GKSYKQLFQIHQEDYQPLFDRVSFQLQ------------GKSADHLPTDKRQQALFEGAE 349
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
D L +L FQ+GRYL+I++SRPGT +LQG WN ++P W + H NIN +M YW +
Sbjct: 350 DVGLEQLYFQYGRYLMIAASRPGTMPMHLQGKWNNSVNPPWAADYHTNINEQMLYWPAEV 409
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
NLSEC EPL D++ L G K+A + GW+++ + + ++ + G + W +P
Sbjct: 410 TNLSECHEPLIDYIESLVEPGKKSAHDFFHTRGWIVNTMNNAFGYTAVNWG-LPWGFYPA 468
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
G AWL H+WEHY YT D+ +L RAYP+++ A F +D+L +G+L ++PS SPEH
Sbjct: 469 GAAWLTQHVWEHYAYTQDKAYLRNRAYPIMKEAARFWIDYLTLDENGHLVSSPSYSPEH- 527
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S ++MD I ++ + + AA VL+ + A + R+ P ++
Sbjct: 528 --------GGISGGASMDHQIAWDILNNSLEAAMVLD--DKAFADTAQHVRDRILPPQVG 577
Query: 586 EDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGW 645
G + EW +D DP HRH+SHLF L PG I+ K P+L +AA+ +L+ RG+E GW
Sbjct: 578 RWGQLQEWKEDVDDPHNKHRHVSHLFALHPGRQISPLKTPELAEAAKVSLEARGDEATGW 637
Query: 646 SITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEK----HFEG---GLYSNLFAAHPPFQ 698
S+ WK WARL + + A ++ K + ++EG G Y+NL AHPPFQ
Sbjct: 638 SLGWKVNFWARLKNGDRALKLYKMVIKPAGATKSSSGAINYEGEGSGSYANLLDAHPPFQ 697
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
+D N G TA VAEML+QS ++ LLPALP W +G + GL+ARGG TV++ W+ G L
Sbjct: 698 LDGNMGATAGVAEMLLQSQTGEIELLPALP-KNWPTGRISGLRARGGFTVNLNWEAGQLK 756
Query: 759 EVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFN 797
I ++ S KTL Y+G + ++ +GK Y +
Sbjct: 757 SAEIIADRSGQ-----KTLTYKGKTKAIDFVSGKKYQLS 790
>gi|325680593|ref|ZP_08160136.1| hypothetical protein CUS_5001 [Ruminococcus albus 8]
gi|324107730|gb|EGC02003.1| hypothetical protein CUS_5001 [Ruminococcus albus 8]
Length = 759
Score = 451 bits (1159), Expect = e-123, Method: Compositional matrix adjust.
Identities = 281/759 (37%), Positives = 402/759 (52%), Gaps = 64/759 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + A+P+GNGR+GAMV+ E ++LNED++W+G + N A L VR
Sbjct: 9 YKTPADDWNKALPLGNGRIGAMVFSQPLEERIQLNEDSVWSGGFRERNNKSALPNLEKVR 68
Query: 77 SLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYR-RELDLNT 132
L+ + EA F G P + Y LGD+ + H K +E ++ R LDLNT
Sbjct: 69 KLLFEEKINEAEKIIYDAFCGTPVNQRHYMPLGDMNV----IHYKESECDFKSRSLDLNT 124
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---LLDNHSYVNG 189
A +Y++ V++TRE F S PDQV+V I+ SE ++S V +D D++S V+
Sbjct: 125 AVCTTEYAINGVDYTREVFISQPDQVLVMHITASEKKAISVRVRIDGRDDYFDDNSPVHD 184
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N+ + G + ++D GI F+A IK+ G + + E D
Sbjct: 185 NDILFYGG-----------SGSED--GINFAAY--IKVLHKGGKVYPY-GSFITCEDCDE 228
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+LL A +S+ +D +++ ++ +Y+ L H+ DY+ + R +
Sbjct: 229 VTILLGAQTSY---------RCEDYKGQAVFDVERAEEKTYAQLKADHIADYKSYYDRAN 279
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPG 368
I L D S + T+P+ +R+ + + D L+E+ FGRYLLI+ SR
Sbjct: 280 ISLC--------DNSSGNS--TLPTDKRLALVKEGNPDNKLIEMYHNFGRYLLIAGSREK 329
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T NLQGIWN+D+ P W +NIN EMNYW + CNLSE PL D + L NG K
Sbjct: 330 TLPTNLQGIWNKDMWPAWGCKFTININTEMNYWCAENCNLSELHMPLIDHIEKLRPNGRK 389
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G+V HH TDIW ++ + WPMG AWLC H+WEHY Y DR+FL
Sbjct: 390 TARNMYGCRGFVCHHNTDIWGDTAPQDLWIPGTQWPMGAAWLCLHIWEHYLYVQDREFLS 449
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
++ Y L+ A F LD+LIE G L T PS SPE+ ++ G + +MD II
Sbjct: 450 EK-YDTLKEAAEFFLDFLIEDKKGRLVTCPSVSPENTYLTASGSKGSICIGPSMDSQIIY 508
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
E+F+A+ A+++LE + +KVL++ RL +I + G IMEWA+D+ + E HRH+S
Sbjct: 509 ELFTAVAEASKILE-TDGGFRKKVLEARDRLPAPEIGKYGQIMEWAEDYDEVEPGHRHIS 567
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYR 665
LF L+P IT+ K P+L KAA TL++R G GWS W WARL D E Y
Sbjct: 568 QLFALYPADIITMRKTPELAKAARATLERRLSHGGGHTGWSRAWIINHWARLFDGEKVYE 627
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
V L + E N+F HPPFQID NFG TA + E L+QS ++ LLP
Sbjct: 628 NVIALLSNSTSE-----------NMFDMHPPFQIDGNFGGTAGITEALLQSENGEIILLP 676
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP +WS G KGL ARGG + + WK+ + I+S
Sbjct: 677 ALP-KEWSEGSFKGLCARGGFVIDLEWKNSKITACHIHS 714
>gi|326790118|ref|YP_004307939.1| alpha-L-fucosidase [Clostridium lentocellum DSM 5427]
gi|326540882|gb|ADZ82741.1| Alpha-L-fucosidase [Clostridium lentocellum DSM 5427]
Length = 756
Score = 450 bits (1158), Expect = e-123, Method: Compositional matrix adjust.
Identities = 271/754 (35%), Positives = 406/754 (53%), Gaps = 69/754 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ A+++ +A+PIGNG LG M++GG+ E +++NE++LW G D N DA K L +R
Sbjct: 8 YKQAARNWNEALPIGNGALGGMIFGGIKKELIQMNEESLWYGTFRDRNNKDARKYLPVIR 67
Query: 77 SLVDSGQYAEATAA-SVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEET----YRRELD 129
L+ G+ EA S+ +FG P Y +LGD+ ++ + +E YRR LD
Sbjct: 68 DLLWQGKIGEAEKLLSMSMFGTPDGQRQYSVLGDLVIQC------FGQEEPVSHYRRTLD 121
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V Y +F RE+F S PD ++ ++ + + +D N
Sbjct: 122 LETACATVGYVSPKGKFEREYFCSKPDNLLAVRLRCDQEEQIELMAYIDRWKYNDEIEMS 181
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ + + G ++ +GI + ++ K+ + GT + ++L +G +
Sbjct: 182 KDGMSLYG----------SSGPCSSEGIGYHFMM--KLIPNGGTAQNI-GQRLYAKGCNE 228
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
++L+ A++ + DS +P S L+ Y +L RH+ DY+ L+ R+S
Sbjct: 229 VIILVTATTDY-------KDS--NPRSICEERLKKATQKGYEELKARHVADYKSLYKRLS 279
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG 368
+ L E+++ +P+ ER++ + ED L+ + FQ+GRYLLIS SR G
Sbjct: 280 LDLKG------------ESLNHLPTDERLERIKKGGEDLDLIAMYFQYGRYLLISCSREG 327
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
A LQGIWN + P WDS +NIN EMNYW + C+LSEC PL + L + I+G K
Sbjct: 328 GLPATLQGIWNGEWLPPWDSKYTININTEMNYWLAEKCHLSECHLPLVEHLEKVRIHGEK 387
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G++ HH TDIW ++ + +WPMG AWL H+WEHY YT+D+ FL
Sbjct: 388 TAEQMYGCRGFMAHHNTDIWGDAAPQDMWMPATIWPMGAAWLVLHIWEHYEYTLDQAFL- 446
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
K Y LL+G F D+L+ +GYL T PSTSPE+ + G+ V +MD I+
Sbjct: 447 KEKYHLLKGAGDFFKDYLMMDENGYLVTGPSTSPENTYRLSSGEQGTVCIGPSMDSQILF 506
Query: 549 EVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
E+F+AII A +++ + E+ + +++ K LP P +I + G IMEW +D ++ E HRH
Sbjct: 507 ELFTAIIEAGQLVGEAEEEIQCFKEMRKKLP---PIQIGKYGQIMEWREDHEEVEPGHRH 563
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEHA 663
+S LF L+PGH IT E P+ KAA+KTL++R G GWS W LWARL + + A
Sbjct: 564 ISQLFALYPGHQITKEDTPEWAKAAKKTLERRLSYGGGHTGWSRAWIINLWARLKEGDLA 623
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
Y +K L NL HPPFQID NFG A ++E+L+Q + + L
Sbjct: 624 YSNIKELLKC-----------STLINLLDNHPPFQIDGNFGAAAGISELLLQGEKDYIEL 672
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LPALP +G V GL A+G TV I W+DG L
Sbjct: 673 LPALP-KGIPNGKVTGLCAKGKVTVDIDWEDGHL 705
>gi|374384834|ref|ZP_09642351.1| hypothetical protein HMPREF9449_00737 [Odoribacter laneus YIT
12061]
gi|373227638|gb|EHP49951.1| hypothetical protein HMPREF9449_00737 [Odoribacter laneus YIT
12061]
Length = 780
Score = 450 bits (1157), Expect = e-123, Method: Compositional matrix adjust.
Identities = 270/771 (35%), Positives = 400/771 (51%), Gaps = 66/771 (8%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG---DYTNPDAP---KALSDVRSLV 79
+A+P+GNG +GAM +GG + ++L E++ W G PG Y + K L +VR L+
Sbjct: 36 EALPVGNGYMGAMWFGGPVRDEIQLAEESFWAGGPGASKSYKGGNKEGSWKYLKEVRELL 95
Query: 80 DSGQYAEATAASVKLFGH---PADVYQLLGDIELEFDDSHLKYAEET-------YRRELD 129
+SG+ +A + + F P + GD L E YRR LD
Sbjct: 96 ESGEKEKAAELAGRYFVGEITPTEAGDQFGDFGGNQPFGSLGVTVEAADTSWTDYRRSLD 155
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L A +V+Y +G F +F+S P ++ V K + + G + V+ ++
Sbjct: 156 LERAMGKVEYDMGGTHFRNTYFASYPARMFVFKYTNNAPGGKDYRVTFETPHQGTKITVR 215
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ I++G+ +P + IK+ D G I + ++EG+
Sbjct: 216 KDLWIIQGKLASNGLPFEGR---------------IKVKTD-GKIR-FQKGVFRIEGAKN 258
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ +S++ + P D + A++ ++ DL H DY+ LF RV
Sbjct: 259 TEFYVSIASAYANTY--PLYRGNDYEEVNRKAIERAERGTWEDLQAEHETDYRSLFERVK 316
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPG 368
++L S ++ +P+ +R + DP L L FQ+GRYLLISSSRPG
Sbjct: 317 LELGHS------------GLEKLPTDKRQLRYSLGAYDPGLEALYFQYGRYLLISSSRPG 364
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T A+LQG WN L+ W H+NINL+M YW + NLSEC PL +++ L G
Sbjct: 365 TLPAHLQGRWNHQLNAPWACDYHMNINLQMIYWPAEVANLSECHLPLLEYIDKLREPGRV 424
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ + A GWV+H + + +A W P AWLC HLWEH+NYT DR+FL
Sbjct: 425 TAREYFNARGWVVHTMNNAFG-YTAPGWDFYWGYAPNSAAWLCAHLWEHFNYTRDREFLG 483
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
++AYP+++ A F +D+L+ DG+L ++PS SPEH IA +TMD I
Sbjct: 484 RKAYPIMKEVARFWMDYLVADEDGFLVSSPSYSPEHGDIA---------IGATMDQEIAW 534
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
++F+ ++ A + + K + A + V RL P +I + G + EW +D DP HRH+S
Sbjct: 535 DLFTNVLQAMDYV-KEDPAFADSVSDFRKRLLPLRIGKFGQLQEWKEDLDDPGNTHRHIS 593
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVK 668
HL+ LFPGH I++E+ P+ KAA+++L RGEEG GWS+ WK WARL D +Y+M++
Sbjct: 594 HLYALFPGHQISLEETPEWAKAAKRSLTYRGEEGTGWSLAWKINFWARLQDGNQSYKMLR 653
Query: 669 RLFNLVDPEHEKHFE----GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
L L + +++F G Y NL AHPPFQID N G A +AEML+QS L LL
Sbjct: 654 NL--LRSAKGQENFSNPSGSGSYCNLLCAHPPFQIDGNMGAVAGIAEMLLQSHAGMLDLL 711
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFK 775
PALP W SG VKGLKARGG TV + W+DG L E I ++ + +K
Sbjct: 712 PALP-AAWPSGYVKGLKARGGYTVDLVWQDGLLKEAVIRADEAGKGKIRYK 761
>gi|281419724|ref|ZP_06250723.1| putative alpha-L-fucosidase 2 [Prevotella copri DSM 18205]
gi|281406253|gb|EFB36933.1| putative alpha-L-fucosidase 2 [Prevotella copri DSM 18205]
Length = 1246
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 283/817 (34%), Positives = 427/817 (52%), Gaps = 78/817 (9%)
Query: 7 TSTTNPL---------KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
T+ TNP+ + +N PA ++ +A+P+GNGRLG M G V +TL+LNEDT W
Sbjct: 333 TADTNPIPAPTIESKNHLWYNKPAGYWEEALPLGNGRLGVMHSGSVACDTLQLNEDTFWD 392
Query: 58 GVPGDYTNPDAPKALSDVRSLVDSGQYAEAT------------------AASVKLFGHPA 99
P N +A L +V+ + + YA AA V L G P
Sbjct: 393 QGPNTNYNANAFGVLREVQQGIFNKDYASVQNLAVTNWMSQGSHGASYRAAGVVLLGFPG 452
Query: 100 DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
+ ++E + + Y R LD+NTAT+ V+Y V V + R F+S D V
Sbjct: 453 QRFD-----DMESAQTSDAVDAQGYVRYLDMNTATSNVEYHVKGVGYKRTVFTSFKDNVT 507
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
V ++ + G L FNV+ ++ +N + E P + + +
Sbjct: 508 VVRLEADQKGKLDFNVAYAGCNKSNIEKLTSNVLYDEHTVKATMGPARDKCENVENKLNL 567
Query: 220 SAILEI-----KISDD------RGTISALEDK-KLKVEGSDWAVLLLVASSSFDGPFINP 267
L I I++D +GT+ A + +L V G+ +A +++ +++F
Sbjct: 568 CTYLRIVDTDGTITNDNVNIYAQGTVGAATNAPRLNVTGATYATIIISQATNFK----KY 623
Query: 268 SDSKKDPTSESMSALQSIRNLS--YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCS 325
D D ++ +++ L++ N Y + H Y+ F RV + L+ + +
Sbjct: 624 DDVSGDASASALAYLEAYENSKKDYVTTLSDHESVYRAQFDRVDLTLAGN--------AT 675
Query: 326 EENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLS-- 383
+E+ +T +R+K F DP L FQFGRYLLISSS+PGTQ ANLQGIWN D
Sbjct: 676 QESKNT---EQRIKEFHKTSDPQLAANYFQFGRYLLISSSQPGTQPANLQGIWNPDARQY 732
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
P WDS NIN+EMNYW + NL+EC EP + + +S+ G++TA+ Y A GW +HH
Sbjct: 733 PAWDSKYTSNINVEMNYWPAEVTNLAECHEPFVEMVKDVSVTGAETAKKMYGARGWALHH 792
Query: 444 KTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
TDIW + A D G V +WP AW C+HLWE Y ++ D+ +L + YP+++G A F
Sbjct: 793 NTDIWRTTGAVDNGTV--GVWPTCNAWFCSHLWERYLFSGDKTYLAE-VYPIMKGAAEFF 849
Query: 503 LDWLIEG-HDGYLETNPSTSPEH-----EFIAPDGKLACVSY--SSTMDMAIIREVFSAI 554
D+L++ + GY+ PS SPE+ + PDGK A ++ MD ++ ++
Sbjct: 850 QDFLVKDPNTGYMVVCPSNSPENHPGIGSYTKPDGKTANIALFGGVAMDNEMVYDLLKNT 909
Query: 555 ISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
AA L+K+ D ++ P KI + G + EW +D+ HRHLSHL+G +
Sbjct: 910 ALAARALDKDADFADALDALK-AQITPWKIGQYGQVQEWQEDWDKENSSHRHLSHLWGAY 968
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PG+ ++ +N L +A K+L RG+ GWS+ WK A+WAR+ D +HA +++K L+
Sbjct: 969 PGNQVSPYENATLYQAVHKSLVGRGDAARGWSMGWKEAMWARMLDGDHAMKILKNQLVLL 1028
Query: 675 DPEHE-KHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWS 733
DP +GG Y+N+F AHPPFQID NFG TAA+AEMLVQS L++LPALP + +
Sbjct: 1029 DPNVTIASSDGGSYANMFDAHPPFQIDGNFGATAAIAEMLVQSHAGFLHVLPALPTEWKA 1088
Query: 734 SGCVKGLKARGGETVS-ICWKDGDLHEVGIYSNYSNN 769
G VKGL ARGG V+ + W DG + ++ + S N
Sbjct: 1089 GGEVKGLCARGGFVVTDMKWVDGKIEKLAVKSTVGGN 1125
>gi|149196081|ref|ZP_01873137.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
gi|149140928|gb|EDM29325.1| putative large secreted protein [Lentisphaera araneosa HTCC2155]
Length = 790
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 286/827 (34%), Positives = 434/827 (52%), Gaps = 109/827 (13%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + A+ F ++PIGNGRLGAMV+G V E + +NE+++W+G + P K L+
Sbjct: 28 KLWYKQAAQGFEQSLPIGNGRLGAMVFGDVDEERIVINEESVWSGSKVENNIPVGYKHLA 87
Query: 74 DVRSLVDSGQYAEAT---------------AASVKLFGHPADVYQLLGDIELEFDDSHLK 118
+R L+ ++ EA A + FG YQ+LG+I L+F + K
Sbjct: 88 KIRQLLGEEKFTEANKLMKQAFKVKNAPKYAKGISAFGR----YQVLGNIHLKFLGNKAK 143
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ Y+RELDLN+A A V Y G +FTREHF S PD+V V++ SG +SF++S+D
Sbjct: 144 VSQ--YKRELDLNSALATVNYQAGKQQFTREHFVSAPDEVFVSRFSGP----ISFSISMD 197
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ V ++++M G ND + + + +++ I A +
Sbjct: 198 RPERFKTSVVNKHELLMTGAL-----------NDGFEKDGLTYVARLRVIAPNAKIKA-D 245
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
KL VE + +LLL A++ + G DP + L S+++L
Sbjct: 246 GNKLIVESQEEVMLLLAAATDYRGI---AGRQLSDPFKATSEDLDKAEKKSFTELRQAQK 302
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
D++K + RV + L+ E + +P+ +R+ +++ + DP+L L F G
Sbjct: 303 ADHEKYYRRVKLNLA------------ESHNSALPTDQRLAAYRKGKADPALAALFFNVG 350
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RY LISSSRPG ANLQGIW E++ W+ H NIN +MNYW +L CN+ E QEP+ +
Sbjct: 351 RYFLISSSRPGGLPANLQGIWAEEVHTMWNGDYHFNINTQMNYWPALSCNMVEMQEPMNN 410
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIW---AKSSADRGKVVWALWPMGGAWLCTHL 474
F+ L GSKTA+ Y + GW+ H T+IW A + D G G AWLC HL
Sbjct: 411 FIASLVEPGSKTAKAYYDSPGWIAHRLTNIWGYTAPAGMDIG---------GPAWLCEHL 461
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKL 533
WE Y YT+DR+FL K YP+++ F L L E + +L T PS SPE+ F P K
Sbjct: 462 WEQYAYTLDREFL-KSVYPIMKSSIDFYLHNLWEEPENKWLVTGPSASPENGFKLPGNKR 520
Query: 534 --ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRLRPTKIAEDGSI 590
+ + T+DM +RE+F + AA++L DA ++K L + PRL P +IA DG +
Sbjct: 521 GGSGICAGPTIDMQQLRELFGNTLRAAKIL--GIDAELQKELAEKRPRLAPNQIAPDGVL 578
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG-EEGPGWSITW 649
EW + + + E HRH+S L+GL+P + IT E P++ +A+ K L++RG + GW+ W
Sbjct: 579 QEWLKPYVEREPTHRHVSPLYGLYPYYEITPEGTPEMAEASRKLLERRGVGQSTGWANAW 638
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP---------FQID 700
K +LWARLHD + AY V+++ N + N+ + P FQI+
Sbjct: 639 KVSLWARLHDSKMAYTFVQQMLN-----------DNCFDNMMSLFRPLKNGKGKKLFQIE 687
Query: 701 ANFGFTAAVAEMLVQSTLND--------LYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
ANFG TA +AEML+QS + + +LPALP +WS+G V GL ARG V + W
Sbjct: 688 ANFGLTAGIAEMLMQSHPDSPAVDSRPLIQILPALP-KEWSTGSVSGLLARGAFEVDLKW 746
Query: 753 KDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAG--KIYTFN 797
++G L E + S + Y + + L+AG K++T +
Sbjct: 747 QEGKLVEARVRS-----LKGQAAKIRYGSVTKDLKLAAGESKVFTLS 788
>gi|227536429|ref|ZP_03966478.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33300]
gi|227243805|gb|EEI93820.1| possible alpha-L-fucosidase [Sphingobacterium spiritivorum ATCC
33300]
Length = 798
Score = 449 bits (1154), Expect = e-123, Method: Compositional matrix adjust.
Identities = 275/778 (35%), Positives = 418/778 (53%), Gaps = 64/778 (8%)
Query: 7 TSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
+ + L++ ++ PAK + + +P+GNG +G M GGV E + LNE ++W+G D N
Sbjct: 42 VAQSGSLRLWYDKPAKRWEETLPLGNGLIGMMPDGGVQKEKIVLNEISMWSGSEEDPNNY 101
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLF-------GH------PADVYQLLGDIELEFD 113
A K++ +++ L+ G+ EA K F GH P YQ LG + L+F
Sbjct: 102 AAYKSVGEIQKLLVEGKNDEAEQLVNKNFVTSGKGSGHGNGANVPFGCYQNLGFLNLQFK 161
Query: 114 DSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
++ A+ T Y R LDL A AR +++ V++TRE+F+S V V ++ S+ G+L+
Sbjct: 162 EA----AQSTDYERSLDLKDAVARTNFTINGVKYTREYFTSFDQNVGVVRLKSSKKGALN 217
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
F+ SL S + Y + N+ M G I P D GI FS+ +IK+ G
Sbjct: 218 FSASL-SREEGVQYSSKGNEFSMSG------ILPDGKGGD---GISFSS--KIKVFHRGG 265
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
+ A D L V + ++ A++S+ DP L+ + Y
Sbjct: 266 KVVA-SDTALTVSKASEVLIFFAAATSY---------FHADPLQYVDEQLKQANDTPYPQ 315
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLV 350
L+ +HL Y+ +F+RV +QL D + I T +R+++F + +D L
Sbjct: 316 LFKQHLSRYESVFNRVDLQLE--------DDADKSGITT---DKRLRAFYDNPAQDNGLA 364
Query: 351 ELLFQFGRYLLISSSRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L +QFGRYL ISS+ P + A NLQG+W + W+ H+NIN +MN+W N
Sbjct: 365 ALYYQFGRYLNISSTAPDVKGALPPNLQGLWAHQIQTPWNGDYHLNINAQMNHWGVEVNN 424
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE P + + ++ G KTA+ Y A GWV++ T++W S+ + W G
Sbjct: 425 LSEYHIPFIELIKKIAKTGEKTARAYYNAPGWVVYMMTNVWGYSAPGE-QASWGASTASG 483
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
WLC HLWEHY +T D +L K YP+++G A F ++ + G+L T+PS SPE+ F
Sbjct: 484 -WLCNHLWEHYQFTKDSVYL-KEVYPVMQGAARFYAHTMVTDPKTGWLVTSPSVSPENAF 541
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIA 585
+GK A V +D I+RE++ +I A +L ++ +A + + + +L P I+
Sbjct: 542 RMKNGKTAAVVMGPAIDNQIVRELYRNLIDADSILGQH-NAFTDTLRIQIQQLAPPVLIS 600
Query: 586 EDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGW 645
+ G + EW +D+++ E HRH+SHL+GL+P + I+ + P AA+KTL RG+EG GW
Sbjct: 601 KSGRVQEWLEDYEEVEPQHRHVSHLYGLYPANFISPQITPQYVDAAKKTLTVRGDEGTGW 660
Query: 646 SITWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFG 704
S WK WARL D H+ ++++L + + GG Y NLF AHPPFQID NFG
Sbjct: 661 SRAWKILFWARLQDGNHSLEILRQLLKPAYRDDTDYRAGGGTYPNLFCAHPPFQIDGNFG 720
Query: 705 FTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
+A +AEML+QS ++LLPALP W SG VKGLKARGG T+ + WKDG + E I
Sbjct: 721 GSAGIAEMLIQSHSGFIHLLPALP-SAWKSGQVKGLKARGGHTIDMIWKDGRVLEYKI 777
>gi|423575217|ref|ZP_17551336.1| hypothetical protein II9_02438 [Bacillus cereus MSX-D12]
gi|401209825|gb|EJR16582.1| hypothetical protein II9_02438 [Bacillus cereus MSX-D12]
Length = 940
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 281/779 (36%), Positives = 414/779 (53%), Gaps = 88/779 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 84 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 199
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 200 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 255
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 256 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 299
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 300 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 356
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 357 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 403
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 404 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 463
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 464 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 522
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 523 LWEHYKFTNDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 573
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 574 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 630
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW D DP HRH+S L L+PG I P+ +AA+ TL RG+EG GWS K
Sbjct: 631 QEWKDDIDDPGETHRHISQLVALYPGSMIN-HNTPEWLEAAKVTLNHRGDEGTGWSKANK 689
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D +HAY+++ + G SNLF HPPFQID NFG T+ +A
Sbjct: 690 INLWARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIA 738
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
EML+QS + + LLPALP W G KGL+ARG T+ WK+G + + S++ N+
Sbjct: 739 EMLIQSHTDSIQLLPALP-KAWKDGSYKGLRARGAFTIDADWKNGTPTVIQVTSDHGND 796
>gi|423373036|ref|ZP_17350376.1| hypothetical protein IC5_02092 [Bacillus cereus AND1407]
gi|401097368|gb|EJQ05391.1| hypothetical protein IC5_02092 [Bacillus cereus AND1407]
Length = 1193
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 281/779 (36%), Positives = 414/779 (53%), Gaps = 88/779 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 84 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 199
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 200 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 255
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 256 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 299
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 300 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 356
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 357 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 403
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 404 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 463
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 464 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 522
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 523 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 573
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 574 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 630
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW D DP HRH+S L L+PG I P+ +AA+ TL RG+EG GWS K
Sbjct: 631 QEWKDDIDDPGETHRHISQLVALYPGSMIN-HNTPEWLEAAKVTLNHRGDEGTGWSKANK 689
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D +HAY+++ + G SNLF HPPFQID NFG T+ +A
Sbjct: 690 INLWARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIA 738
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
EML+QS + + LLPALP W G KGL+ARG T+ WK+G + + S++ N+
Sbjct: 739 EMLIQSHTDSIQLLPALP-KAWKDGSYKGLRARGAFTIDADWKNGTPTVIQVTSDHGND 796
>gi|217960596|ref|YP_002339160.1| alpha-fucosidase [Bacillus cereus AH187]
gi|375285103|ref|YP_005105542.1| hypothetical protein BCN_3009 [Bacillus cereus NC7401]
gi|423352889|ref|ZP_17330516.1| hypothetical protein IAU_00965 [Bacillus cereus IS075]
gi|423567917|ref|ZP_17544164.1| hypothetical protein II7_01140 [Bacillus cereus MSX-A12]
gi|217068135|gb|ACJ82385.1| alpha-fucosidase [Bacillus cereus AH187]
gi|358353630|dbj|BAL18802.1| conserved hypothetical protein [Bacillus cereus NC7401]
gi|401090895|gb|EJP99046.1| hypothetical protein IAU_00965 [Bacillus cereus IS075]
gi|401211256|gb|EJR18004.1| hypothetical protein II7_01140 [Bacillus cereus MSX-A12]
Length = 1193
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 281/779 (36%), Positives = 414/779 (53%), Gaps = 88/779 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 84 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 199
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 200 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 255
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 256 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 299
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 300 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 356
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 357 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 403
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 404 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 463
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 464 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 522
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 523 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 573
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 574 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 630
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW D DP HRH+S L L+PG I P+ +AA+ TL RG+EG GWS K
Sbjct: 631 QEWKDDIDDPGETHRHISQLVALYPGSMIN-HNTPEWLEAAKVTLNHRGDEGTGWSKANK 689
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D +HAY+++ + G SNLF HPPFQID NFG T+ +A
Sbjct: 690 INLWARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIA 738
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
EML+QS + + LLPALP W G KGL+ARG T+ WK+G + + S++ N+
Sbjct: 739 EMLIQSHTDSIQLLPALP-KAWKDGSYKGLRARGAFTIDADWKNGTPTVIQVTSDHGND 796
>gi|222096655|ref|YP_002530712.1| alpha-fucosidase [Bacillus cereus Q1]
gi|221240713|gb|ACM13423.1| alpha-fucosidase [Bacillus cereus Q1]
Length = 1172
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 281/779 (36%), Positives = 414/779 (53%), Gaps = 88/779 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 63 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 122
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 123 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 178
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 179 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 234
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 235 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 278
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 279 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 335
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 336 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 382
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 383 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 442
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 443 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 501
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 502 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 552
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 553 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 609
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW D DP HRH+S L L+PG I P+ +AA+ TL RG+EG GWS K
Sbjct: 610 QEWKDDIDDPGETHRHISQLVALYPGSMIN-HNTPEWLEAAKVTLNHRGDEGTGWSKANK 668
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D +HAY+++ + G SNLF HPPFQID NFG T+ +A
Sbjct: 669 INLWARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIA 717
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
EML+QS + + LLPALP W G KGL+ARG T+ WK+G + + S++ N+
Sbjct: 718 EMLIQSHTDSIQLLPALP-KAWKDGSYKGLRARGAFTIDADWKNGTPTVIQVTSDHGND 775
>gi|229139796|ref|ZP_04268363.1| hypothetical protein bcere0013_29050 [Bacillus cereus BDRD-ST26]
gi|228643676|gb|EEK99940.1| hypothetical protein bcere0013_29050 [Bacillus cereus BDRD-ST26]
Length = 1172
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 281/779 (36%), Positives = 414/779 (53%), Gaps = 88/779 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 63 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 122
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 123 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 178
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 179 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 234
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 235 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 278
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 279 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 335
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 336 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 382
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 383 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 442
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 443 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 501
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 502 LWEHYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------L 552
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+EVL+ + D L K K P P +I G +
Sbjct: 553 GGISNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQV 609
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW D DP HRH+S L L+PG I P+ +AA+ TL RG+EG GWS K
Sbjct: 610 QEWKDDIDDPGETHRHISQLVALYPGSMIN-HNTPEWLEAAKVTLNHRGDEGTGWSKANK 668
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D +HAY+++ + G SNLF HPPFQID NFG T+ +A
Sbjct: 669 INLWARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIA 717
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
EML+QS + + LLPALP W G KGL+ARG T+ WK+G + + S++ N+
Sbjct: 718 EMLIQSHTDSIQLLPALP-KAWKDGSYKGLRARGAFTIDADWKNGTPTVIQVTSDHGND 775
>gi|329962213|ref|ZP_08300219.1| hypothetical protein HMPREF9446_01800 [Bacteroides fluxus YIT
12057]
gi|328530321|gb|EGF57198.1| hypothetical protein HMPREF9446_01800 [Bacteroides fluxus YIT
12057]
Length = 834
Score = 447 bits (1150), Expect = e-122, Method: Compositional matrix adjust.
Identities = 270/787 (34%), Positives = 419/787 (53%), Gaps = 71/787 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GGV E + LNE +LW+G+ DY+NPDA K+L
Sbjct: 29 RLYYTKPASVWEETLPLGNGRLGMMPDGGVLREHIVLNEISLWSGMEADYSNPDASKSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLF---GHPAD----VYQLLGDIELEF-----------DDS 115
+R L+ G+ EA F AD YQ LG ++++F +
Sbjct: 89 AIRKLLFEGKNREAQELMYSSFVPKKQEADGRYGTYQTLGTLDIDFAYQSQTSVSKSESL 148
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
L YRR LDL A A +++ V++ RE+F S V++ ++ G+L+F+
Sbjct: 149 ALDGGTSRYRRCLDLRDAVAYTAFALEGVDYRREYFVSRDRDVMLVHLTAGSKGALNFSA 208
Query: 176 SLDSLLDNHSYVNGNNQII---MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
L V GN ++ +E PG+ +G+++ + +++ D G
Sbjct: 209 RLGRAEHGTVTVKGNALLMDGTLESGSPGR------------EGMKYR--VAMQLVSDGG 254
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSIRN--- 287
++A + + ++ A L+L A++S+ + S+ +S+ +A I+N
Sbjct: 255 EVAADPENGISLKHGQEAWLVLSATTSYAAEGTDFPGSRYAEVCDSLLKNAGVQIKNEMR 314
Query: 288 ----LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ + H ++ L+ RVS+ L +P D T+P+ ER+ F
Sbjct: 315 MRGMAAEATALKSHAAAHRSLYDRVSLTLPSTPDD------------TLPTDERILRFTR 362
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
E P+L L + +GRYLLISS+RPG+ NLQG+W L W+ H NIN++MN+W
Sbjct: 363 QESPALAALYYNYGRYLLISSTRPGSLPPNLQGLWANSLLTPWNGDYHTNINVQMNHWPL 422
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWA 461
LSE +PL + L +G TA+ Y A GWV+H T++W +A W
Sbjct: 423 EQAGLSELYQPLTTLMERLVPSGEATARTFYGKEAEGWVLHMMTNVW-NYTAPGEHPSWG 481
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE-GHDGYLETNPST 520
GGAWLC HLWEHY YT D+D+L +R YP+L+G A F +E G+L T P++
Sbjct: 482 ATNTGGAWLCAHLWEHYLYTQDKDYL-RRIYPVLKGAARFFSSTTVEEPSHGWLVTAPTS 540
Query: 521 SPEHEFIAPDGKLACVS--YSSTMDMAIIREVFSAIISAAEVLEKNED--ALVEKVLKSL 576
SPE+ F P + VS TMD+ ++ E+++ +I+AA +L + + A +E LK
Sbjct: 541 SPENSFYVPGDSVTPVSICMGPTMDVQLLTELYTNVITAARLLGCDAEYAAKLEADLKKF 600
Query: 577 PRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQ 636
P P +I+++G + EW +D+K+ EVHHRH+SHL+GL PG+ I+ P L A TL
Sbjct: 601 P---PMQISKEGYLQEWLEDYKEAEVHHRHVSHLYGLHPGNLISPTATPALADACRMTLN 657
Query: 637 KRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAAHP 695
+RG+ G GWS WK WARL D A+++ K L + +D + +H G + NLF +HP
Sbjct: 658 RRGDGGTGWSRAWKVNFWARLGDGNRAWKLFKSLLHPAIDLQTGRH-GSGTFPNLFCSHP 716
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID N+G A + EML+QS + LLPALP D W+ G +G++ RGG ++ + WK+G
Sbjct: 717 PFQIDGNYGGAAGIGEMLLQSHEGFVNLLPALP-DSWNCGNFRGMRVRGGASIDLHWKNG 775
Query: 756 DLHEVGI 762
E +
Sbjct: 776 KATEAAV 782
>gi|375310399|ref|ZP_09775670.1| alpha-L-fucosidase, partial [Paenibacillus sp. Aloe-11]
gi|375077548|gb|EHS55785.1| alpha-L-fucosidase, partial [Paenibacillus sp. Aloe-11]
Length = 643
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 244/650 (37%), Positives = 368/650 (56%), Gaps = 48/650 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ F PA+ + +A+P+GNGRLGAMV+GG+ E L+LNEDTLW+G P D DA + L
Sbjct: 10 LRLWFRQPAEVWEEALPVGNGRLGAMVFGGIRKERLQLNEDTLWSGFPRDGVQYDALRYL 69
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET-YRRELDL 130
VR L+ +G+Y +A + + G + YQ LGD+ + + + E T Y RELDL
Sbjct: 70 KPVRELIAAGKYKDAEHLINTHMLGCDTEAYQPLGDLWI----TQKGFGEITHYERELDL 125
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--------DSLLD 182
T TA V + + +TRE +S+PD +I+ ++ +G ++ +V + +S D
Sbjct: 126 PTGTAAVAFHSDGIRYTREVIASSPDGIIMVSLTADRAGQINASVRITTPHPCEDESGED 185
Query: 183 NHSYV---------------NGNNQIIMEGRCPGKRIP------PKANANDDPKGIQFSA 221
H V N I + GR P P++ + G+ F+
Sbjct: 186 EHFAVLSQWDSDVAEGLSDEATRNCITLNGRAPSHVESNDHGDHPQSVVYEHDLGMAFA- 244
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
+++++ + G ++A +D + V G+D + L A++ F G + P +
Sbjct: 245 -VQVRMVSEGGIVTAKDDGTVIVSGADTLTVYLAAATGFRGFDVMPDSDPAESAEACQIT 303
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
L +L + RH D++ LF RV+++L +DT +EE I +P+ R++ +
Sbjct: 304 LDKAISLGSEQVRQRHEQDHRTLFERVALELG-------SDTRTEELI--LPTDLRLERY 354
Query: 342 -QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
Q + DP L LLFQ+GRYLL+ SSRPG+Q ANLQGIWN+ + P W+S NIN +MNY
Sbjct: 355 KQGEADPGLEVLLFQYGRYLLMGSSRPGSQPANLQGIWNDRVQPPWNSNYTTNINTQMNY 414
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + CNL+EC EPL + +S G + A VNY A GW HH D+W + G W
Sbjct: 415 WPAEICNLAECHEPLLHMVGEISRTGRRVASVNYGAQGWAAHHNVDLWRYAGPSGGHASW 474
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
A WP+GG WL HLWE Y +T D +L ++AYPL++G A+F +DWLIEG DG+L T+PST
Sbjct: 475 AFWPLGGVWLTAHLWERYLFTQDTAYLAEQAYPLMKGAAAFCMDWLIEGPDGWLVTSPST 534
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPE++FI G+ +S STMDM +IRE+ I AA++LE +E+ + ++ RL
Sbjct: 535 SPENKFITSSGEECSISMGSTMDMTLIRELLGNCIQAADLLELDEE-FRNRCEETQQRLL 593
Query: 581 PTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKA 630
P ++ G + EW D+++ E HRH+SHL+GL+PG I I P+L +A
Sbjct: 594 PYQMGRHGQLQEWFVDWEEAEPGHRHVSHLYGLYPGRQIHIRDTPELAEA 643
>gi|225157647|ref|ZP_03725037.1| alpha/beta hydrolase fold-3 domain protein [Diplosphaera
colitermitum TAV2]
gi|224802714|gb|EEG20967.1| alpha/beta hydrolase fold-3 domain protein [Diplosphaera
colitermitum TAV2]
Length = 852
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 276/819 (33%), Positives = 418/819 (51%), Gaps = 111/819 (13%)
Query: 17 FNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
++ PA + + A+P+GNGRLGAM++G + SE L+LNED+LW G P D NPD + L +
Sbjct: 14 YSQPAGQDWNRALPVGNGRLGAMIFGDIVSERLQLNEDSLWNGGPRDRRNPDTREHLPVL 73
Query: 76 RSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEF-----------DDSHLKYAE 121
R L+ G+ A A + D Y+ L D+ L F D+ L
Sbjct: 74 RQLLADGRLAAAHELVHDVMAGIPDSQRCYEPLADLFLNFEHPGAPVSVSADEMALAAGY 133
Query: 122 ET----------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
T YRR LDL TA A V Y++ ++ ++R +S DQVI ++ GSL
Sbjct: 134 TTPRFDPSLLSHYRRALDLRTAVASVDYTLNSIAYSRRMVASAVDQVIAIQLRAGRPGSL 193
Query: 172 SFNVSLDS---------LLDNHSYVN----GNNQIIMEGRCPGKRIPPKANANDDPKGIQ 218
+ V ++ D +V+ + +++ GR G+ +G++
Sbjct: 194 TLRVRMERGPRNSYSTRYADTVGFVSDACSSSPTLLLRGRAGGE------------EGVR 241
Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSES 278
F+ L +IS G + + + L ++G+D L+L A++SF + DP +
Sbjct: 242 FATGLRAQISG--GALRHI-GETLYIDGADSVTLVLAAATSF---------READPAASV 289
Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS-PKDIVTDTCSEENIDTVPSAER 337
+ ++ + + H +Y+ F R S+ L + T T T+P+ ER
Sbjct: 290 IERTRAALARGWEKILADHEREYRSFFDRASLTLGAGFASEAPTATA------TLPTDER 343
Query: 338 VK-SFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINL 396
++ + +T DP+L L F + RYLLISSSRPG+ +NLQG+WN D P+W S +NIN
Sbjct: 344 LRHAHETSGDPALASLYFNYARYLLISSSRPGSLPSNLQGLWNGDFWPSWGSKYTININT 403
Query: 397 EMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG 456
EMNYW + P NL++C +PLFD L + +G +TA+V Y G+V+HH TDIWA +
Sbjct: 404 EMNYWIAEPANLADCHKPLFDHLERMVESGRETARVMYGCRGFVVHHNTDIWADTCPTDR 463
Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLET 516
+ W +GGAW H W+ +++ D L AY L+ A F LD+L+E G L
Sbjct: 464 NAGASYWLLGGAWFVLHAWDRFDFDRDPASLAA-AYERLKEAALFFLDFLVEDARGRLVI 522
Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK----------NED 566
+PS SPE+ + P+G+ + STMD ++ +F + AA +LE+ +E
Sbjct: 523 SPSCSPENTYRLPNGEAGVLCVGSTMDSQMLAILFRRTLQAARLLEQRNATAGGGGGDER 582
Query: 567 ALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPD 626
+ +V + RL I G ++EW +D+++ + HRH+SH FGL PG I+ + P+
Sbjct: 583 EFLAQVAAAAERLPKMTIGRHGQLLEWLEDYEELDPEHRHVSHAFGLHPGDLISPRRTPE 642
Query: 627 LCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVD--PEHEK---H 681
L +A TL +RG+ G GW + WK +WARL D E A+R++ L N V+ P K +
Sbjct: 643 LAEAIRVTLNRRGDAGTGWCMAWKACMWARLGDGERAHRLLGNLLNPVETVPPSSKDTAY 702
Query: 682 FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS------------------------- 716
GG Y NL AHPPFQID NFG AA+ EML+QS
Sbjct: 703 LHGGSYPNLLCAHPPFQIDGNFGGAAAIIEMLLQSHETEPDDGDGDGDCNGNVTTDGEAL 762
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
L ++LLPALP ++G +GL+ RGG V + W DG
Sbjct: 763 GLPVIHLLPALPSAWAAAGEFRGLRTRGGGEVDLRWVDG 801
>gi|423482848|ref|ZP_17459538.1| hypothetical protein IEQ_02626 [Bacillus cereus BAG6X1-2]
gi|401143214|gb|EJQ50752.1| hypothetical protein IEQ_02626 [Bacillus cereus BAG6X1-2]
Length = 1156
Score = 447 bits (1149), Expect = e-122, Method: Compositional matrix adjust.
Identities = 280/776 (36%), Positives = 417/776 (53%), Gaps = 82/776 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG---DYT--NP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P DYT N
Sbjct: 47 LTLWYNQPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPNSTSDYTYGNR 106
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
D A L +R V G + A S + FG YQ GDI L+F+ +
Sbjct: 107 DGAASHLDSIREKVSKGDKSGAEEESSQFLTGLQNGFGS----YQNFGDIYLDFNMPD-Q 161
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ YRREL+LN A V Y+ +V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 162 ASFSNYRRELNLNEGIATVSYNYKDVQYNREYFTSYPDRVMVMRLTASESKQLSLDVRPT 221
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++A E
Sbjct: 222 SA-QGGEITSIDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLTA-E 264
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I N SY L H+
Sbjct: 265 NGKIKVANADSLTIIMTAATDYENKY--PTYKGEDPHQKVEKIMAAISNKSYEVLKYTHI 322
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ L EL FQ+GR
Sbjct: 323 KDYHSLFNRVSLDLGGEKP-------------SVPTNELLASYNKQNSKYLEELFFQYGR 369
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL D+
Sbjct: 370 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 429
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ +LWE
Sbjct: 430 VDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNLWE 488
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE + +
Sbjct: 489 HYKFTDDKQYLQEKIYPILKEAAEFHSKFLVEDQNKKLVVSPCWSPE---------IGGI 539
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ ++ D L K + P P +I G + EW
Sbjct: 540 SNGCAFDQQLVYELFSNVIEASEVLQTDKVFRDELKAKRDRLFP---PIQIGRYGQVQEW 596
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
D DP HRH+S L L+PG I P+ AA+ TL RG+EG GWS K L
Sbjct: 597 KDDIDDPAETHRHISQLVALYPGSMIN-HNTPEWLNAAKVTLNHRGDEGTGWSKANKINL 655
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D +HAY+++ + G SNLF HPPFQID NFG T+ +AEML
Sbjct: 656 WARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIAEML 704
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+QS + + LLPALP W G KGL+ARG T+ WK+G + + S++ N+
Sbjct: 705 IQSHTDSIQLLPALP-KAWKDGSYKGLRARGAFTIDANWKNGIPTVIHLTSDHGND 759
>gi|312131012|ref|YP_003998352.1| alpha-l-fucosidase [Leadbetterella byssophila DSM 17132]
gi|311907558|gb|ADQ17999.1| Alpha-L-fucosidase [Leadbetterella byssophila DSM 17132]
Length = 805
Score = 446 bits (1148), Expect = e-122, Method: Compositional matrix adjust.
Identities = 292/779 (37%), Positives = 421/779 (54%), Gaps = 66/779 (8%)
Query: 6 STSTTNPLKITFNGPA-KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
S S K+ + PA K + A+P+GNG +G MV+G E + LNE + W+G P +
Sbjct: 14 SLSFAQEYKMWYQNPAGKVWEKALPVGNGFIGGMVYGNTEEERIDLNETSFWSGGPYATS 73
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPA--DVYQLLGDIELEFDDSHLKYAE 121
+L +RSLV S +Y EA A+ LF H + ++ +G + L+F +
Sbjct: 74 PTLNRDSLEKLRSLVFSEKYKEAENMANRVLFSHGSHGQMFLPIGSLILKFPG---QKEA 130
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+Y RELDL+ A A ++SVG + RE F+ ++V+V K+S +E+ ++
Sbjct: 131 TSYYRELDLSKAVASTRFSVGKQNYEREVFTPLQEKVLVMKLSSTEAMNVEVLYRTPLPE 190
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGTISALEDK 240
V GN E + G+ I A++ +G ++F I+ +K S G S+ D
Sbjct: 191 GRVVQVQGN-----ELQIGGRNI-----AHEGSEGALRFHGIIHVKQS---GGNSSRTDS 237
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L + + VL + ++++ D K + SAL+S Y++L +H++
Sbjct: 238 SLIISNAKELVLYVSLATNYQSYQDVSGDEKALARARLTSALKS----PYTELKRKHIEK 293
Query: 301 YQKLFHRVSIQLS---RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
YQ L++RV + L R P DI R++ F+ DP L FQFG
Sbjct: 294 YQSLYNRVELTLGSDRREPTDI-----------------RLEKFREGNDPGFAALYFQFG 336
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSS+PG Q ANLQGIWN + P WDS +NIN EMNYW + NLSE +PLF+
Sbjct: 337 RYLLISSSQPGGQPANLQGIWNASIRPPWDSKYTININTEMNYWPAERTNLSEMHKPLFE 396
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ L+ G+ TA+ Y A GWV HH TD+W + + + LWP GGAWL H+WEH
Sbjct: 397 MVKDLTKTGAVTAKRLYGAGGWVAHHNTDLW-RLTWPVDAAFYGLWPSGGAWLSQHIWEH 455
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG--YLETNPSTSPEHEFIAPDG-KLA 534
Y YT + FL K +L G A F +D +++ H YL NPSTSPE+ AP+ + +
Sbjct: 456 YQYTGNLHFL-KENQEVLFGAARFYVD-ILQKHPKYPYLVINPSTSPEN---APEAHQRS 510
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+S TMD + +VF I A+++L + D+L +++LK LP P I + G +
Sbjct: 511 SLSAGVTMDNQLAFDVFQNAIWASKILGVKTQFSDSL-KQLLKQLP---PMHIGKHGQLQ 566
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW D P+ HRH+SHL+GLFP I+ ++P L AA TL+ RG+ GWS+ WK
Sbjct: 567 EWLDDVDSPQDKHRHVSHLYGLFPSSQISPYRHPALFSAARTTLEHRGDVSTGWSMGWKV 626
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WARL D +HAY +++ N + P + GG Y NLF AHPPFQID NFG TA +AE
Sbjct: 627 NWWARLKDGDHAYLLIE---NQLTPLGKNKDGGGTYPNLFDAHPPFQIDGNFGCTAGIAE 683
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWKDGDLHEVGIYSNYSNN 769
MLVQS + +LPALP +W+ G VKGLK GG E + W+ G L + + S+ N
Sbjct: 684 MLVQSADGAVEVLPALP-SRWAEGKVKGLKCLGGFEIEELVWEKGQLKRLVVKSHLGGN 741
>gi|429751943|ref|ZP_19284832.1| hypothetical protein HMPREF9073_00790 [Capnocytophaga sp. oral
taxon 326 str. F0382]
gi|429178378|gb|EKY19657.1| hypothetical protein HMPREF9073_00790 [Capnocytophaga sp. oral
taxon 326 str. F0382]
Length = 806
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 296/812 (36%), Positives = 428/812 (52%), Gaps = 76/812 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PAKHFT+++PIGNGRLGAM++G + + LNE +LW+G D +PDA L
Sbjct: 23 VSVVFHEPAKHFTESLPIGNGRLGAMLFGKTDIDRIVLNEISLWSGGTQDADDPDAHIHL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKY 119
++ L+ G+ EA + K F YQ+LG+++L++ +
Sbjct: 83 KTIQQLLLDGKNLEAQSLLQKHFIAKGKGSCNGNGANGNYGCYQILGELQLDWKTN---L 139
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + G+ + F+ + +I KI+ S+ L ++SL+
Sbjct: 140 PIKNYQRILRLDQATAFTSFKRGDNHIQQIAFADFKNDLIWIKITASQP--LDMDISLNR 197
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALE 238
+N + +N+II+ G P N+D +G+QF+++++I+ + + T SA
Sbjct: 198 K-ENATTSYKSNKIILSGALP----------NNDIQGMQFASVIDIQTDGNLQNTASATS 246
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+K K VL + A++++D F ++ D ++ + LQ + + +
Sbjct: 247 VQKAKE-----IVLKISAATNYD--FTKGRLTQDDVLQKANNYLQKT-TIPFDNAIIESQ 298
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-QFG 357
YQ LF+R +R D TDT S + ER++ F + +L+ +L+ FG
Sbjct: 299 KAYQVLFNR-----NRWYSDANTDTSS------FSTFERLQRFYKGKKDALLPILYYNFG 347
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSR G ANLQG+W E+ W+ H+NINL+MNYW + NLSE PL
Sbjct: 348 RYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINLQMNYWLAESTNLSELTTPLHQ 407
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F L NG KTA+ Y A GWV H ++ W +S W GGAWLC H+W+H
Sbjct: 408 FTKNLVANGRKTAKAYYNAKGWVAHVISNPWFYTSPGES-AEWGSTLTGGAWLCEHIWQH 466
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DGK- 532
Y YT++ DFL K YP+L+ A F LI+ GY T PS SPE+ +I P DGK
Sbjct: 467 YLYTLNTDFL-KEYYPVLKEAADFFQSLLIKDPKTGYWVTAPSNSPENAYIMPQLKDGKK 525
Query: 533 -LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ + TMDM I+RE+FS + AA++L + D L + + + P +I G +
Sbjct: 526 QIGNTCIAPTMDMQIVRELFSNTLQAAKILGVDSD-LYSQWQEIITHTVPNRIGRKGDLN 584
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW D+KD E +HRH+SHL+GL+P IT P L KAA+KTL+ RG+ G GWS WK
Sbjct: 585 EWLDDWKDAEPNHRHVSHLYGLYPYDEITPWDTPALAKAAKKTLKIRGDGGTGWSRAWKI 644
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WARL D HA ++++L + VDP GG Y NLF AHPPFQID N G A +AE
Sbjct: 645 NFWARLQDGNHALVLLRQLLHPVDPNSTSGQNGGTYPNLFCAHPPFQIDGNLGGAAGIAE 704
Query: 712 MLVQSTLND--LYLLPALP-WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
ML+QS + + LPALP W G V+G+KAR G VS WK L I S Y
Sbjct: 705 MLLQSHGKNYTIRFLPALPSHPDWEKGTVEGMKARNGFEVSFNWKKHRLKTATITSLY-- 762
Query: 769 NDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQL 800
G V L AGK + + L
Sbjct: 763 ------------GADCSVLLPAGKSIYYKQTL 782
>gi|347840685|emb|CCD55257.1| glycoside hydrolase family 95 protein [Botryotinia fuckeliana]
Length = 747
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 274/769 (35%), Positives = 410/769 (53%), Gaps = 76/769 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ + PAK +++++PIGNGRLGAMV+GG+ ETL+LNE+++W G P D T DA + L
Sbjct: 10 LHYTSPAKEWSESLPIGNGRLGAMVYGGISRETLQLNENSIWYGGPQDRTPKDAFRNLDR 69
Query: 75 VRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R + G + EA + + F H Y+ LG + L+ K ++ Y R L+L+
Sbjct: 70 LRHFIRIGDHTEAEKLAEQAFFATPHSQRHYEPLGTLTLDLGHDPAKVSK--YWRGLELS 127
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLLDN 183
TA +Y V R F+S PD V+V ++ SE + +S D +D+
Sbjct: 128 TANVTTEYEHLGVRHKRTVFASYPDDVLVVQLESSEKAQFTIRLSRYSDREFATDEFVDS 187
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+G I+M G PG R N+N+ F ++ ++ G + + +
Sbjct: 188 IEAQDGT--IVMHG-TPGGR-----NSNN------FCCVVSVQELAGDGNVETVGN--CV 231
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ S A++++ A ++F +D + ++ +AL S ++DL RH+ DY
Sbjct: 232 IVNSSKAIIIISAQTTF-----RYTDVEAKTLIQARNALHS-----HADLSKRHVQDYSS 281
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L+ R ++L I P+ ER+ T DP LV L +GRYLLIS
Sbjct: 282 LYGRFKLRLFPDAAHI-------------PTNERL---LTSPDPGLVALYANYGRYLLIS 325
Query: 364 SSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SRPG + A LQG+WN P W S +NIN +MNYW + CNL EC++PLFD L
Sbjct: 326 CSRPGDKALPATLQGLWNPSFQPAWGSKYTININTQMNYWPANVCNLEECEDPLFDMLER 385
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
++ G KTA+V Y GW H TDIWA + + LWPM GAWLCTH+W+ + +
Sbjct: 386 MANRGEKTARVMYGCRGWASHSCTDIWADTDPQDRWMPGTLWPMSGAWLCTHIWQRHLFG 445
Query: 482 MDRDF-LEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYS 539
D++ +R +P+L G F+LD+L++ G YL TNPS SPE+ +I G+ +
Sbjct: 446 GDQNLKFLQRMFPVLRGSVQFILDFLVKDSSGDYLITNPSLSPENSYIDLKGQKGVLCEG 505
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
S +D+ II+ +F A + + + L+ +D L E + + +L P++I E G + EW QDFK+
Sbjct: 506 SAIDIQIIKSLFKAFLLSVDSLQM-KDELTEPLKLARDKLPPSEIGEFGQLQEWLQDFKE 564
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWAR 656
E HRH SHL+ L+PG++I + PD AAE TL++R E G GWS W L AR
Sbjct: 565 HEPGHRHTSHLWSLYPGNSIHPHETPDFASAAEVTLRRRAENGGGHTGWSRAWLICLHAR 624
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
LHD + + + RL + NL HPPFQID NFG A + EML+QS
Sbjct: 625 LHDADGSLGHIFRL-----------LKDSTMPNLLDVHPPFQIDGNFGGCAGIVEMLIQS 673
Query: 717 -TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+N + +LPA P +W SG + G+KAR G + I W +G L +V ++S
Sbjct: 674 HQINTIQVLPACP-KEWRSGELSGVKARTGFDLDIAWNEGVLTKVLVHS 721
>gi|224537768|ref|ZP_03678307.1| hypothetical protein BACCELL_02651 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520588|gb|EEF89693.1| hypothetical protein BACCELL_02651 [Bacteroides cellulosilyticus
DSM 14838]
Length = 833
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 265/775 (34%), Positives = 416/775 (53%), Gaps = 72/775 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GGV E + LNE +LW+G+ DY NPDA ++L
Sbjct: 41 QLYYTAPATIWEETLPLGNGRLGMMPDGGVDREHIVLNEISLWSGMEADYGNPDASRSLP 100
Query: 74 DVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFDDSHLKYAEET--- 123
++ L+ G+ EA F G YQ+L D+ ++F H +
Sbjct: 101 AIQQLLFEGKNKEAQELMYSSFVPKKPESGGTYGNYQMLADLNIDFSFPHRRKTISENDA 160
Query: 124 -----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
YRR LDL A A ++ +++ RE+F+S V++ ++ S +LSF+ L
Sbjct: 161 APVTDYRRWLDLRDAVAYTSFTKEGIDYQREYFTSRDKDVMIIHLTTSRRRALSFSAQLS 220
Query: 179 -------SLLDNHSYVNGNNQIIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKI 227
S+L G +++EG PG+ +G+++ + +
Sbjct: 221 RPKQGAVSMLPGIGKEEGT--LLLEGTLDSGKPGR------------EGMKYRVAMRLIS 266
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM--SALQSI 285
+ ISA ++ + + A L+L A++S+ + S ++ +S+ +A Q +
Sbjct: 267 KGGKQNISA--ERGITLTQGREAWLVLSATTSYAASGTDFSGNRYKEVCDSLLNAATQHV 324
Query: 286 RNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE 345
+ + H+ ++ + RVS+ L + D++ P+ ER+ F E
Sbjct: 325 Q------IKESHIASHRTFYDRVSLTLPFTEDDVL------------PTNERITRFTERE 366
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
P+L L + +GRYL ISS+RPG+ NLQG+W + W+ H NIN++MN+W
Sbjct: 367 SPALAALYYNYGRYLFISSTRPGSLPPNLQGLWANGVETPWNGDYHTNINIQMNHWPLEQ 426
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALW 463
LSE +PL + L +G +TA+ Y A GWV+H T+IW +A W
Sbjct: 427 AGLSELYQPLTALVERLIPSGEETARTFYGTHAQGWVLHMMTNIW-NYTAPGEHPSWGAT 485
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSP 522
GGAWLC HLWEHY YT D +FL KR YP+L+G + F ++ E G+L T P++SP
Sbjct: 486 NTGGAWLCAHLWEHYQYTQDIEFL-KRIYPVLKGASEFFYSTMVREPKHGWLVTAPTSSP 544
Query: 523 EHEF-IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
E+ F + D V TMD+ ++ E+++ +I A +LE + D K+ ++L + P
Sbjct: 545 ENAFFVGNDPTPVSVCMGPTMDVQLLTELYTNVIEATSILECDAD-YAAKLREALDKFPP 603
Query: 582 TKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
+I++ G + EW +D+K+ +VHHRH+SHL+GL PG+ I+ + P+L A +TL +RG+
Sbjct: 604 MQISKGGYLQEWLEDYKEQDVHHRHVSHLYGLHPGNLISPDATPELANACRETLNRRGDG 663
Query: 642 GPGWSITWKTALWARLHDQEHAYRMVKR-LFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
G GWS WK WARL D + A+ + K L+ VDP+ ++H G + NLF +HPPFQID
Sbjct: 664 GTGWSRAWKVNFWARLGDGDRAWTLFKSLLYPAVDPQTKRH-GSGTFPNLFCSHPPFQID 722
Query: 701 ANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
N+G TA V EML+QS ++LLPALP W +G G+KARGG +V + WKDG
Sbjct: 723 GNYGGTAGVGEMLLQSHEGFIHLLPALP-KSWHTGNFHGMKARGGISVDLEWKDG 776
>gi|229197298|ref|ZP_04324028.1| hypothetical protein bcere0001_28460 [Bacillus cereus m1293]
gi|228586175|gb|EEK44263.1| hypothetical protein bcere0001_28460 [Bacillus cereus m1293]
Length = 1172
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 279/779 (35%), Positives = 414/779 (53%), Gaps = 88/779 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 63 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 122
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEF---DDS 115
D A L +R + G + A S + FG YQ GDI L+F D S
Sbjct: 123 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDGS 178
Query: 116 HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
YRREL+LN + V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 179 SF----SNYRRELNLNEGISTVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDV 234
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S + +N+I ++G+ AN+ G+++ + E K+ ++ GT++
Sbjct: 235 RPTSAQGGQ-VTSKDNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLT 278
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A E+ K+KV +D +++ A++ ++ + PS +DP + + +I N SY L
Sbjct: 279 A-ENGKIKVANADSLTIIMTAATDYENKY--PSYKGEDPHQKVEKIMSAISNKSYEVLKY 335
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H+ DY LF+RVS+ L +VP+ E + S+ + L EL FQ
Sbjct: 336 THIKDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQ 382
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
+GRYLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL
Sbjct: 383 YGRYLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPL 442
Query: 416 FDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
D++ L G +A+ ++ GW ++ + + ++ G + W P A++ +
Sbjct: 443 MDYVDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQN 501
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L
Sbjct: 502 LWEHYKFTDDKQYLQEKIYPILKEAAEFHSKFLVEDQNKKLVVSPCWSPE---------L 552
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNE---DALVEKVLKSLPRLRPTKIAEDGSI 590
+S D ++ E+FS +I A+ +L+ ++ D L K K P P +I G +
Sbjct: 553 GGISNGCAFDQQLVYELFSNVIEASNLLQIDKGFRDELKAKRDKLFP---PIQIGRYGQV 609
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW D DP HRH+S L L+PG I P+ +AA+ TL RG+EG GWS K
Sbjct: 610 QEWKDDIDDPGETHRHISQLVALYPGSMIN-HNTPEWLEAAKVTLNHRGDEGTGWSKANK 668
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D +HAY+++ + G SNLF HPPFQID NFG T+ +A
Sbjct: 669 INLWARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIA 717
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
EML+QS + + LLPALP W G KGL+ARG T+ WK+G + + S++ N+
Sbjct: 718 EMLIQSHTDSIQLLPALP-KAWKDGSYKGLRARGAFTIDADWKNGTPTVIQVTSDHGND 775
>gi|87200424|ref|YP_497681.1| twin-arginine translocation pathway signal protein [Novosphingobium
aromaticivorans DSM 12444]
gi|87136105|gb|ABD26847.1| Twin-arginine translocation pathway signal [Novosphingobium
aromaticivorans DSM 12444]
Length = 824
Score = 445 bits (1145), Expect = e-122, Method: Compositional matrix adjust.
Identities = 278/756 (36%), Positives = 396/756 (52%), Gaps = 46/756 (6%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ F+ PA+ + +A+P+GNGRLGAM+ G + E L LNEDTLW+G P A L
Sbjct: 45 RLVFDSPAREWIEALPVGNGRLGAMMHGLLDGERLSLNEDTLWSGQP-SVGGAAADGLLE 103
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+R L+ +G Y A + ++ GH ++ Y L D+ ++ D + A RR LDL A
Sbjct: 104 QMRDLIFAGDYPGADRLARRMQGHFSEAYLPLADLHVDLDQAGPARA---IRRTLDLREA 160
Query: 134 TARVKYSV-GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
TA V+ G +E R F S P Q++V +I + +V LD L + +
Sbjct: 161 TAGVEIDRDGGIE-RRTLFVSAPAQLVVFRIEREGAARFGASVRLDCQLRSSIRAVSPRR 219
Query: 193 IIMEGRCPGKRIPPKANANDDPK-------GIQFSAILEIKISDDRGTISALEDKKLKVE 245
+++ G+ P P N D + G+ F+AI EI D G++ E L+VE
Sbjct: 220 LVLAGKAPTVCEPDYRNVPDPVRYSDRAGYGMAFAAIAEI---DTDGSVRKGE-GALRVE 275
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ W + L A++ + GP + P + + + L+ R ++ L H D++ L+
Sbjct: 276 NAGWLEIRLAAATGYRGPHVLPDLDPGAVEALAAAPLRRARGKPHTRLLADHRRDHRALY 335
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
R ++ L DT D +P+ R + DP+L LL+ +GRYLLI+SS
Sbjct: 336 ERSALALGGG------DTARRH--DGLPTDARRAA--DPGDPALAALLYNYGRYLLIASS 385
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
RPGT+ ANLQGIWN L W NIN+ MNYW + NL++C PL DF L+ N
Sbjct: 386 RPGTRPANLQGIWNAQLRAPWSCNYTTNINVPMNYWMAETANLADCHRPLVDFAEALARN 445
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G TA+ Y GW +HH TD+WA S+ A G WA WPMG W+ HLWEHY ++
Sbjct: 446 GGDTARDYYRMPGWCLHHNTDLWAMSNPVGAGEGDPNWANWPMGAPWIAQHLWEHYRFSG 505
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D FL RA+P++ G A F + WL+ + G L T PS SPE+ F+ DG+ A +S T
Sbjct: 506 DLAFLRDRAWPVMRGAADFCVGWLVRDPASGQLTTAPSISPENLFVTADGRTAAISAGCT 565
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWAQDFKDP 600
MD+A+IRE+F I+AA VL EDA KVL++L L P +I G + EW+ DF +
Sbjct: 566 MDIAMIRELFGNCIAAAAVL--GEDAAFAKVLRNLSEELPPYRIGRHGQLQEWSVDFAEQ 623
Query: 601 EVHHRHLSHLFGLFPGHTIT---IEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
+ HR +SHL+ +FPG IT + + + G GWS W TA+ ARL
Sbjct: 624 DPGHRTVSHLYPIFPGGDITPRRSPRLAAAAARSLDRREAHGGSSTGWSRAWATAIRARL 683
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLY-SNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
D + ++R H L ++ F HP FQIDAN G AA+AE LVQS
Sbjct: 684 GDGKACGEALERFL-------ADHVARSLLGTHPFHPHPVFQIDANLGIAAAIAECLVQS 736
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
+ + L PALP +W G VKGL+ R G TV + W
Sbjct: 737 HEDRIELFPALP-PRWREGAVKGLRTRHGATVDLEW 771
>gi|384181040|ref|YP_005566802.1| alpha-fucosidase [Bacillus thuringiensis serovar finitimus YBT-020]
gi|324327124|gb|ADY22384.1| alpha-fucosidase [Bacillus thuringiensis serovar finitimus YBT-020]
Length = 1172
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 276/776 (35%), Positives = 410/776 (52%), Gaps = 82/776 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 63 LTLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPNSTSEYTYGNR 122
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
D A L +R + G + A S + FG YQ GDI L+F+
Sbjct: 123 DGAASHLGSIREKLAKGDKSGAERESSQFLTGLQKGFGS----YQNFGDIYLDFNMPDAS 178
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
A YRREL+LN A V Y+ +V++ RE+F+S PD+V+V +++ SE+ +S +V
Sbjct: 179 -AFSNYRRELNLNEGIATVSYNYKDVQYNREYFTSYPDRVMVMRLTASEAKKISLDVRPT 237
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + +N+I M+G+ G+++ A K+ ++ GT++A E
Sbjct: 238 SAQGGQ-VTSVDNKITMKGQITNN-------------GMKYEAAF--KVLNEGGTLTA-E 280
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I SY L H+
Sbjct: 281 NGKIKVANADSLTIIMTAATDYENKY--PTYKGQDPHEKVEKVMSAISKKSYEVLKYTHI 338
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ + L EL FQ+GR
Sbjct: 339 KDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 385
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL D+
Sbjct: 386 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 445
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ +LWE
Sbjct: 446 VDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNLWE 504
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L +
Sbjct: 505 HYKFTDDKQYLQEKIYPILKEAAEFHSKFLVEDQNKKLVVSPCWSPE---------LGGI 555
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ + D L K K P P +I G + EW
Sbjct: 556 SNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQVQEW 612
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
D DP HRH+S L L+PG I K P+ +AA+ TL RG+EG GWS K L
Sbjct: 613 KDDIDDPAETHRHISQLVALYPGSMIN-HKTPEWLEAAKVTLNHRGDEGTGWSKANKINL 671
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D +HAY+++ + G SNLF HPPFQID NFG T+ +AEML
Sbjct: 672 WARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIAEML 720
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+QS + + LLPALP W G KGL+ARG T+ WK+ + + S++ N+
Sbjct: 721 IQSHTDSIQLLPALP-KAWKDGSYKGLRARGAFTIDADWKNSTPTVIQVTSDHGND 775
>gi|423605155|ref|ZP_17581048.1| hypothetical protein IIK_01736 [Bacillus cereus VD102]
gi|401244303|gb|EJR50667.1| hypothetical protein IIK_01736 [Bacillus cereus VD102]
Length = 1193
Score = 445 bits (1144), Expect = e-122, Method: Compositional matrix adjust.
Identities = 278/776 (35%), Positives = 413/776 (53%), Gaps = 82/776 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 84 LSLWYNEPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 143
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
D A L +R + G + A S + FG YQ GDI L+F+
Sbjct: 144 DGAASHLGSIREKLSKGDKSGAERESTQFLTGLQKGFGS----YQNFGDIYLDFNMPDAS 199
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ YRREL+LN + V YS V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 200 -SFSNYRRELNLNEGISTVSYSYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDVRPT 258
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + + +I ++G+ AN+ G+++ + E K+ ++ GT++A E
Sbjct: 259 SAQGGQ-VTSKDKKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLTA-E 301
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I N SY L H+
Sbjct: 302 NGKIKVANADSLTIIMTAATDYENKY--PNYKGEDPHQKVEKIMSAISNKSYEVLKYTHI 359
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ + L EL FQ+GR
Sbjct: 360 KDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 406
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL D+
Sbjct: 407 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 466
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ +LWE
Sbjct: 467 VDSLREPGRVSAEKHFGVTGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNLWE 525
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP+L+ A F +L+E + L +P SPE L +
Sbjct: 526 HYKFTDDKQYLQEKIYPILKEAAVFHSKFLVEDQNKKLVVSPCWSPE---------LGGI 576
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ + D L K K P P +I G + EW
Sbjct: 577 SNGCAFDQQLVYELFSNVIEASEVLQVDNVFRDELKAKRDKLFP---PIQIGRYGQVQEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
D DP HRH+S L L+PG I P+ +AA+ TL RG+EG GWS K L
Sbjct: 634 KDDIDDPGETHRHISQLVALYPGSMIN-HNTPEWLEAAKVTLNHRGDEGTGWSKANKINL 692
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D +HAY+++ + G SNLF HPPFQID NFG T+ +AEML
Sbjct: 693 WARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIAEML 741
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+QS + + LLPALP W G KGL+ARG T+ WK+G + + S++ N+
Sbjct: 742 IQSHTDSIQLLPALP-KVWKDGSYKGLRARGAFTIDADWKNGTPTVIQVTSDHGND 796
>gi|423227144|ref|ZP_17213608.1| hypothetical protein HMPREF1062_05794 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392624284|gb|EIY18376.1| hypothetical protein HMPREF1062_05794 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 825
Score = 444 bits (1143), Expect = e-122, Method: Compositional matrix adjust.
Identities = 280/831 (33%), Positives = 434/831 (52%), Gaps = 95/831 (11%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + + +P+GNGRLG M GG+ E + LNE +LW+G+ DY NPDA ++L
Sbjct: 29 QLYYTAPATIWEETLPLGNGRLGMMPDGGIDKEHIVLNEISLWSGMEADYGNPDASRSLP 88
Query: 74 DVRSLVDSGQYAEATAASVKLF-------GHPADVYQLLGDIELEFD-DSHLKYAEE--- 122
++ L+ G+ EA F G YQ+L D+ L F K+A +
Sbjct: 89 AIQQLLFEGKNKEAQELMYNSFVPKKPESGGTYGNYQMLADLTLNFSIPVKKKFASDEVV 148
Query: 123 ---TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
YRR LDL A A ++ G +++ RE+++S V++ ++ S SL F SL
Sbjct: 149 PVTNYRRWLDLRDAVAYTTFTKGGIDYQREYYTSRDKDVMIIHLTVSRRRSLFFTASLSR 208
Query: 180 LLDNH-SYVNGNNQ----IIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKISDD 230
S V G+ + +++EG PG+ G+++ + +
Sbjct: 209 PQQGTVSLVPGSGKEAGTLLLEGALDSGKPGQ------------DGMKYRVAMRVVSKGG 256
Query: 231 RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN-PSDSKKD----------PTSESM 279
+ ISA ED + +G++ A L++ A++S+ + P K+ P S +
Sbjct: 257 KQFISA-EDGIMLTQGTE-AWLIISATTSYAAAGTDFPGSRYKEVCDSLLNAATPPSSQL 314
Query: 280 SALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
S L S + N S+ +LY R V+ T D +P+ ER+
Sbjct: 315 SILNSPLTNASHRELYDR-----------------------VSLTLPATEDDALPTNERI 351
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
F E P+L L + +GRYLLISS+RPG+ NLQG+W + W+ H NIN++M
Sbjct: 352 VRFAERESPALAALYYNYGRYLLISSTRPGSLPPNLQGLWANGVQTPWNGDYHTNINIQM 411
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRG 456
N+W LSE +PL + L +G TA+ Y A GWV+H T++W +A
Sbjct: 412 NHWPLEQAGLSELYQPLTGLVERLVPSGKGTARTFYGNHAQGWVLHMMTNVW-NYTAPGE 470
Query: 457 KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLE 515
W GGAWLC HLWEHY YT D ++L K+ YP+L+G + F ++ E G+L
Sbjct: 471 HPSWGATNTGGAWLCAHLWEHYQYTQDIEYL-KKIYPILKGASEFFYSTMVREPKHGWLV 529
Query: 516 TNPSTSPEHEF-IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK 574
T P++SPE+ F + D V TMD+ ++ E+++ +I AA +LE ++D K+ +
Sbjct: 530 TAPTSSPENAFFVGDDPTPVSVCMGPTMDVQLLTELYTNVIEAASILECDDD-YAAKLRE 588
Query: 575 SLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKT 634
+L + P +I++ G + EW +D+K+ +VHHRH+SHL+GL PG+ I+ + P+L A T
Sbjct: 589 ALGKFPPMQISKGGYLQEWLEDYKEQDVHHRHVSHLYGLHPGNLISPDATPELANACRAT 648
Query: 635 LQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFN-LVDPEHEKHFEGGLYSNLFAA 693
L +RG+ G GWS WK WARL D + A+ + K L VDP+ ++H G + NLF +
Sbjct: 649 LNRRGDGGTGWSRAWKINFWARLGDGDRAWTLFKSLLQPAVDPQTKRH-GSGTFPNLFCS 707
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
HPPFQID N+G A + EML+QS ++LLPALP W +G +G+KARGG +V + WK
Sbjct: 708 HPPFQIDGNYGGAAGIGEMLMQSHEGFIHLLPALP-KSWHAGNFRGMKARGGLSVDLEWK 766
Query: 754 DGDLHEVGIYSNYSNNDH--------DSFKTLH-----YRGTSVKVNLSAG 791
DG + + + N H + TL+ Y G ++ + L+AG
Sbjct: 767 DGKAVKAILTATVPGNFHIKMPEGVKQAKTTLNGQGNTYTGKTISLKLAAG 817
>gi|340621763|ref|YP_004740215.1| alpha-1,2-fucosidase 2 [Capnocytophaga canimorsus Cc5]
gi|339902029|gb|AEK23108.1| Alpha-1,2-fucosidase 2 [Capnocytophaga canimorsus Cc5]
Length = 806
Score = 444 bits (1141), Expect = e-121, Method: Compositional matrix adjust.
Identities = 285/775 (36%), Positives = 415/775 (53%), Gaps = 64/775 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA FT+++P+GNGRLGAMV+G ET+ LNE +LW+G + + +A K L
Sbjct: 23 VSVVFDQPATFFTESLPLGNGRLGAMVFGKTDVETIVLNEISLWSGGKQEADDENAHKYL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEF-DDSHLK 118
++++L+ G+ EA + +K F G+ A+ YQ LG +++++ D+ +
Sbjct: 83 KEIQNLLLQGKNLEAQSLLMKHFVAKGKGTCHGNGANCHYGCYQTLGQLKIDWKSDASVT 142
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ Y+R LDL A A +Y + + F+ + VI KI ++ L ++
Sbjct: 143 H----YKRVLDLEKAVATTQYVRNGNQIEQIVFTDFNNDVIWVKIKSAQKTDLGLSLFRK 198
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+N + N++IM+G P N++ KG++F+ I E+ + T A
Sbjct: 199 ---ENAHFSYDKNKLIMQGTLP----------NENQKGMEFATIAEVTTDGELTTSLA-- 243
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
L+V + ++ + AS+++ + N D ++++ L++I +LS+ + +
Sbjct: 244 --GLEVRSASEVIVKISASTNYS--YENGELENTDVVKQTLAYLKAINSLSFQNALLENQ 299
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
Y K+F+R ++ S D EN+ T +R ++ TD L L + FGR
Sbjct: 300 VTYGKIFNRNRWEMPTSLTD--------ENLTTWQRLQRYQAGNTD--AQLPVLYYNFGR 349
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + NLS+ EPL F
Sbjct: 350 YLLISSSRKGLLPANLQGLWAEEYQTPWNGDYHLNINVQMNYWLAEVTNLSDLAEPLLRF 409
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L NG KTA+ Y A GWV H ++ W +S G W GGAWLC H+WEHY
Sbjct: 410 TKNLVPNGKKTAKAYYNAEGWVAHVVSNPWFFTSPGEG-ASWGSTLTGGAWLCQHIWEHY 468
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAP---DGK-- 532
+T + DFL K Y +L+ A F D LI E GY T PS SPE+ + P DGK
Sbjct: 469 QFTQNIDFL-KEYYFVLKEAAHFFEDMLIKEPKSGYWVTAPSNSPENAYYLPELKDGKKQ 527
Query: 533 --LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
C+ TMDM I+RE+FS ++ A+E+L K+ D K + P I E G +
Sbjct: 528 HGFTCM--GPTMDMQIVRELFSNVLKASEILNKDTDKH-PKWKDIIKNTVPNTIGEQGDL 584
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW D++D E HRH+SHL+GL P IT P L +AA KTL+ RG+ G GWS WK
Sbjct: 585 NEWFHDWEDAEPTHRHVSHLYGLHPYDEITPWDTPKLAQAARKTLEIRGDGGTGWSKAWK 644
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
WARL D HA ++K+L V ++ GG Y+NLF AHPPFQID NFG TA +A
Sbjct: 645 INFWARLGDGNHALTLLKQLLTPVAMGRQQS-AGGTYANLFCAHPPFQIDGNFGGTAGIA 703
Query: 711 EMLVQS--TLNDLYLLPALPWD-KWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
EML+QS N + LPALP W G + G+KAR G VS W+ G L E I
Sbjct: 704 EMLLQSHGKTNTIRFLPALPSHPDWQKGKITGMKARNGFEVSFSWEKGMLKEAEI 758
>gi|448410558|ref|ZP_21575263.1| alpha-L-fucosidase [Halosimplex carlsbadense 2-9-1]
gi|445671594|gb|ELZ24181.1| alpha-L-fucosidase [Halosimplex carlsbadense 2-9-1]
Length = 822
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 286/797 (35%), Positives = 409/797 (51%), Gaps = 85/797 (10%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
T+ ++ ++ PA + +A+P+GNGRLG MV G E + LN+D LW G D T P
Sbjct: 20 THDDRLWYDAPATEWVEALPVGNGRLGGMVHGRPARERVALNDDRLWVGDHADRTADGGP 79
Query: 70 KALSDVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
L VR + G++ A +LF G V YQ LGD+ + D + YRR
Sbjct: 80 DDLDAVRECLWDGEFERAQRLCNELFVGDLTGVAPYQPLGDLLI---DCPAHDDPDEYRR 136
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL +RV+Y+VG F RE F+S PD V+ +I ESG++ V LD +
Sbjct: 137 SLDLRAGVSRVEYTVGGTRFERECFASEPDGVLAMRIEADESGAVDARVRLDRDRSARTT 196
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKG--IQFSAILEIK----------------IS 228
V ++ +++ G+ P + + DP G +F A ++ I
Sbjct: 197 VV-DDTVVLRGQVIDL---PGDDESVDPGGWGQRFEARARVRAEGGIVAAAADEAAPSIG 252
Query: 229 DDRGTI--SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
D G +A + V G+D ++L A + PSD DP E AL +
Sbjct: 253 DGDGEREGAAYGTDGIVVAGADAVTVVLTAG-------VAPSDG--DPRDECREALAGVA 303
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED 346
+ Y+ + RH+ D+++ RV + L P D D E +D V ER D
Sbjct: 304 DDDYAAIRERHVADHREHMDRVDLDLG-EPVDAPVD----ERLDRVRDGER--------D 350
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
P L +L Q+GRYLL+ SSRPGT ANLQGIWNE+ P WDS ++NLEMNYW +
Sbjct: 351 PHLAQLYVQYGRYLLLGSSRPGTLPANLQGIWNEEFHPPWDSDYTQDVNLEMNYWHAEVA 410
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
NL EC +PL +F+ G +TA+ Y G+ H +D W ++A W WPMG
Sbjct: 411 NLRECADPLVEFVDESREPGRETARERYGCEGFTTHLHSDRW-HTTAQTADAHWGHWPMG 469
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHE 525
AWLC +LWE Y ++ DR+ LE R YP+L A FLLD+L+E + +L T PS SPE++
Sbjct: 470 AAWLCQNLWERYAFSGDREDLE-RIYPILREAAEFLLDYLVEHPEEEWLVTAPSASPENQ 528
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
F DG+ A MD+ + R++F + AAE L+++ D E + ++L RL P +
Sbjct: 529 FRTADGQEATTCVMPAMDIQLTRDLFGHCVEAAETLDRDADFAAE-LAEALERLPPMGVD 587
Query: 586 EDGSIMEWAQDFKDPEVHHRHLSHLFGLFP-------------GHTITIEKNPDLCKAAE 632
+ G++ EW +D+++ HRH+SHLFG +P G + +PD AA
Sbjct: 588 DRGALREWLRDYEEVNPGHRHVSHLFGYYPADVLHEAESSGDRGGARDLALSPDEVDAAV 647
Query: 633 K-TLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYS 688
+ +L++R + G GWS W AL+ARL D + V++L L D Y
Sbjct: 648 RASLERRLDNGGGHTGWSCAWTIALFARLGDGDRVGAHVRKL--LAD---------STYD 696
Query: 689 NLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV 748
+L AHPPFQID NFG TA +AE LV S + LLPALP D+W+ G V GL+ARGG V
Sbjct: 697 SLLDAHPPFQIDGNFGGTAGIAEALVGSHGGTIRLLPALP-DEWAEGSVSGLRARGGFEV 755
Query: 749 SICWKDGDLHEVGIYSN 765
+ W G L I++
Sbjct: 756 DLAWSGGTLDAATIHAG 772
>gi|212540772|ref|XP_002150541.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
gi|210067840|gb|EEA21932.1| conserved hypothetical protein [Talaromyces marneffei ATCC 18224]
Length = 755
Score = 441 bits (1135), Expect = e-121, Method: Compositional matrix adjust.
Identities = 270/762 (35%), Positives = 403/762 (52%), Gaps = 66/762 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA+ + +A+P+GNGRLG MV+G +E L LNED++W G P T + L+
Sbjct: 4 KLWYQQPAQCWNEALPVGNGRLGVMVYGRTSTELLALNEDSVWYGGPQSRTPQPSIGELA 63
Query: 74 DVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFD-DSHLKYAEETYRRELD 129
+R L+ ++ +A + K F PA Y+ LG + ++F+ D+ K + Y+R LD
Sbjct: 64 LLRDLIRKEKHTDAEKLARKSFFASPASQRHYEPLGTVFIDFNHDNEQKLLD--YQRSLD 121
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---- 185
+ + V+Y + R+ +S PD V+ I S + ++ + LD +
Sbjct: 122 IEKSLCHVEYEYDGICIARDLIASYPDSVLAMHIQSSAPIEFTVRLTRVNELDYETNEFL 181
Query: 186 --YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
N ++M GKR + +L + DD G ++A + L
Sbjct: 182 DDVAAKGNSLVMSVTPGGKR------------SNRACCVLSARCIDDEGIVTARPNNSLH 229
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ G + +LL++A+ + +D K ++ +ALQ S+ +L TRH+ DY
Sbjct: 230 IRGQN--ILLVIAAQTE----YRCNDIDKVTVTDCNNALQK----SWDELLTRHIQDYSA 279
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L+ R+S+++ D+ + + +P+ R++ D L+ L + RYLLIS
Sbjct: 280 LYTRMSLRIG--------DSANLHELQKIPTDVRLRE---SRDLGLISLYHNYSRYLLIS 328
Query: 364 SSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SSR G + A LQGIWN +P W S +NINL+MNYW CNLSEC +PLF L
Sbjct: 329 SSRNGYKALPATLQGIWNPSFTPAWGSKYTININLQMNYWPVNVCNLSECSQPLFALLRR 388
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
++ NG KTA+ Y GW HH TDIWA + + LWP+GGAWLC H+WEH++YT
Sbjct: 389 MAENGVKTAKSMYNCGGWAAHHNTDIWADTDPQDRWMPATLWPLGGAWLCFHIWEHFDYT 448
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACV-SYS 539
D++FL + +P+L+GC FLLD+LIE DG YL TNPS SPE+ F + + V
Sbjct: 449 QDKEFLSE-MFPVLQGCVEFLLDFLIESVDGKYLVTNPSLSPENTFYTHNRENQGVFCEG 507
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
ST+D+ II VF+A +S+ +VL ++ L +V + RL P +I G + EW D+ +
Sbjct: 508 STIDIQIIEAVFTAFLSSVDVLNLTDNELGGRVQDAKKRLPPMQIGSFGQLQEWMHDYDE 567
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWAR 656
E HRH SHL+GL PG +I + P+L KAA L++R G GWS W L AR
Sbjct: 568 VEPGHRHTSHLWGLHPGASIKPVQTPELAKAASIVLRRRAAHGGGHTGWSRAWLINLHAR 627
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L + + + L + NL HPPFQID NFG A + EMLVQS
Sbjct: 628 LFESDECENHIDLL-----------LKNSTLPNLLDTHPPFQIDGNFGAGAGIVEMLVQS 676
Query: 717 -TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
++ + LLPA P + W G V G++ARGG + WKDG++
Sbjct: 677 HEVSAIRLLPACP-ESWKEGAVSGVRARGGFELDFEWKDGEI 717
>gi|345881344|ref|ZP_08832866.1| hypothetical protein HMPREF9431_01530 [Prevotella oulorum F0390]
gi|343920009|gb|EGV30749.1| hypothetical protein HMPREF9431_01530 [Prevotella oulorum F0390]
Length = 834
Score = 441 bits (1133), Expect = e-120, Method: Compositional matrix adjust.
Identities = 279/804 (34%), Positives = 410/804 (50%), Gaps = 88/804 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT--------NPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE +LW G PG + N A L +R+
Sbjct: 82 SLPIGNGSLGANILGSIAAERITLNEKSLWRGGPGVSSDASYYWNVNKHAAPVLKAIRAA 141
Query: 79 VDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETYRR 126
+G A+A + + K F A + +G++ +E + ++++ YRR
Sbjct: 142 FLAGDKAKADSLTRKNFNGLAAYESYAEKPFRFGNFTTMGELTIETGLNDAQFSD--YRR 199
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNH 184
EL L++A V++ V + R F S PD V+V + + G +L F+ + + +
Sbjct: 200 ELSLDSARTLVQFVHDGVRYARTAFISYPDNVMVLRFKANAKGMQNLCFHYAPNPVSTGK 259
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+G N ++ G D G+Q+ ++ I+ GT+ + L +
Sbjct: 260 MQADGANGLVYRGAL-------------DSNGMQY--VVRIQAVTHSGTLEN-SGQTLTI 303
Query: 245 EGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+G+D V L+ A + +FD F NP P + +Q Y+ L+ RH
Sbjct: 304 KGADEVVFLITADTDYRINFDPDFHNPKTYVGVQPEVTTEKWMQQAAERGYAQLFQRHFK 363
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF RV +QL+ ++ N VP+A+R+ +++ D L EL +QFGR
Sbjct: 364 DYSPLFQRVKLQLN----------AAQTNDKDVPTAQRLAAYRNGATDNYLEELYYQFGR 413
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ ++ W H NIN++MNYW NL+EC PL DF
Sbjct: 414 YLLIASSRPGNLPANLQGLWHNNVDGPWRVDYHNNINVQMNYWPVHTTNLNECALPLVDF 473
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G+ TA+ Y A GW ++I+ ++ + + W L PMGG WL THLWE+
Sbjct: 474 VRTLVKPGAVTAKAYYGARGWTTSVSSNIFGFTAPLASEDMSWNLCPMGGPWLATHLWEY 533
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y++T D+ FL Y +++ A+F +D+L DG PSTSPEH +
Sbjct: 534 YDFTRDKRFLRSTLYDIIKQSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPID 584
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVE--KVLKSLPRLRPTKIAEDGSIMEWAQ 595
T A+IRE+ I+A++VL+ +E A + VL LP P +I G + EW++
Sbjct: 585 EGVTFVHAVIREILLDAIAASKVLQVDETARKQWQMVLLHLP---PYRIGRYGQLQEWSE 641
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D DP HHRH++HLFGL PGHTIT P L KAA L+ RG+ GWS+ WK WA
Sbjct: 642 DIDDPNDHHRHVNHLFGLHPGHTITPSTTPALAKAARVVLEHRGDGATGWSMGWKINQWA 701
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RLHD HAY +V+ L + G +NL+ HPPFQID NFG TA + EML+Q
Sbjct: 702 RLHDGNHAYLLVRNL-----------LKDGTLNNLWDTHPPFQIDGNFGGTAGITEMLLQ 750
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFK 775
S + +LPALP D W G V+GL ARGG V + W+ G L V + S
Sbjct: 751 SHAGFIDVLPALP-DSWKQGEVRGLCARGGFEVGLKWQQGMLQSVVVKSLAGEP-----C 804
Query: 776 TLHYRGTSVKVNLSAGKIYTFNRQ 799
TL Y G ++ G+ Y + Q
Sbjct: 805 TLSYHGKALHFGTKKGQTYRLSWQ 828
>gi|386724573|ref|YP_006190899.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
gi|384091698|gb|AFH63134.1| alpha-L-fucosidase [Paenibacillus mucilaginosus K02]
Length = 714
Score = 440 bits (1132), Expect = e-120, Method: Compositional matrix adjust.
Identities = 250/617 (40%), Positives = 349/617 (56%), Gaps = 42/617 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
F PAK + +A+P+GNGRLGAMV+G E ++LNEDT+W G P D NPDA + L ++R
Sbjct: 8 FKQPAKDWNEALPLGNGRLGAMVFGLPRKERIQLNEDTVWYGGPVDRHNPDALRYLPEIR 67
Query: 77 SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ SG+ AEA A++ L G P Y LGD+ + D H E YRRELDL+
Sbjct: 68 EKLLSGRLAEAHKLAAMALSGIPESQRHYMPLGDLWITMD--HPPGVAEEYRRELDLSKG 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGN 190
A + Y +G+ F RE F S+PDQ +V +I G++ F LD S + G
Sbjct: 126 VAGLHYRIGDTAFIRETFISHPDQALVLRIRADRPGAVGFTARLDRGKSRYLDEIEAAGP 185
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
N ++M G C GK G F A L +D G + + L VEG+D
Sbjct: 186 NMLVMRGNCGGK------------GGSDFRAALR---ADAEGGSVRIIGEHLIVEGADAV 230
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L L A+++F ++DP + ++ L S Y+ L RH +DY+ L+ RV +
Sbjct: 231 TLYLSAATTF---------RQEDPEAYCLNTLSSAAARGYASLLERHTEDYRGLYDRVQL 281
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
L ++ TD + + +P+ ER++ + EDP L+ L FQ+GRYLLISSSRPG+
Sbjct: 282 SL-----ELQTDEAAAAAV--LPTDERLELVKKGGEDPGLIPLYFQYGRYLLISSSRPGS 334
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWNE + P WDS +NIN +MNYW + C+LSEC EPLFD + +S GS+T
Sbjct: 335 LPANLQGIWNEQMRPPWDSKYTININTQMNYWPAESCHLSECHEPLFDLIQRMSERGSRT 394
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+V Y GW HH TD+W ++ + WP+GGAWLC HLWEHY + L +
Sbjct: 395 AEVMYGCRGWTAHHNTDLWGDTAPQDIYLPATHWPLGGAWLCLHLWEHYRFGGGTARLAE 454
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YP+++G A FLLD++IE DG+L T PS SPE+ +I P+G+ + MD I RE
Sbjct: 455 -FYPVMKGAARFLLDYMIEAKDGHLITCPSVSPENTYILPNGESGTLCAGPAMDSQIARE 513
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
+F A AA L +ED E L +L R+ ++AE G + EW +D+K+ + HRH+SH
Sbjct: 514 LFQACREAARELGTDEDFRSELEL-ALQRIPLPQVAEGGYLQEWLEDYKEKDPGHRHISH 572
Query: 610 LFGLFPGHTITIEKNPD 626
LF L PG IT + P+
Sbjct: 573 LFALHPGTQITPARTPE 589
>gi|326799708|ref|YP_004317527.1| alpha-L-fucosidase [Sphingobacterium sp. 21]
gi|326550472|gb|ADZ78857.1| Alpha-L-fucosidase [Sphingobacterium sp. 21]
Length = 943
Score = 439 bits (1130), Expect = e-120, Method: Compositional matrix adjust.
Identities = 255/708 (36%), Positives = 383/708 (54%), Gaps = 57/708 (8%)
Query: 96 GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNP 155
G + YQ GD+ L+F + Y+R LD+ A + Y V F R +FSS P
Sbjct: 287 GKYQESYQPFGDLLLDF---RAQAPFSNYKRTLDVEQAICKTSYVQNGVSFERTYFSSAP 343
Query: 156 DQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK 215
D + ++ +SF+ SL S ++ ++ I RI + +
Sbjct: 344 DACLAIHLTADRPRQISFDASLASPHKTYNVEKVDDSTI--------RISVQVKQGV-LR 394
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
G+ F + + + G + + D K+K+ G++ A L L A++++ + +D D
Sbjct: 395 GVGF-----LHVRHEGGELH-VGDGKIKILGANQATLFLTAATNYK----SYNDVSGDAE 444
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ S L ++N Y + H+ DYQ+ F + S++ ++E +++P+
Sbjct: 445 EIAKSQLNKVKNKPYDVIRLAHIQDYQQYFTKFSLKFE-----------ADEASNSLPTD 493
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
+R+ F DP+L+ L Q+GRYLLISSSR G NLQGIWN+ L+P W S NIN
Sbjct: 494 QRIAQFVKSRDPNLLALFVQYGRYLLISSSRSGGLAPNLQGIWNDLLTPPWGSKYTTNIN 553
Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADR 455
EMNYW + NLSE QEPLF + LS+ G +TA+ Y A GWV+HH TD+W + +A
Sbjct: 554 AEMNYWLAENTNLSELQEPLFQMIKELSVVGQETAKTYYDAPGWVLHHNTDLW-RGTAPI 612
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYL 514
+W GGAWLC HLWEH+ YT D FL ++AYP+++ A F +L+ + G+L
Sbjct: 613 NNPNHGIWVTGGAWLCQHLWEHFLYTQDESFLREQAYPIMKASALFFDHFLVSDPKTGWL 672
Query: 515 ETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK 574
+ PS SPE G L TMD +IR++F + +AA +L+ +++ + +L
Sbjct: 673 ISTPSNSPEQ------GGLVA---GPTMDHQLIRQLFRNVAAAATILKLDKE-FAQHILD 722
Query: 575 SLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKT 634
++ P +I + G + EW +D DP+ HRH+SHL+ ++PG I + +P L AA+K+
Sbjct: 723 KGAKIAPNQIGKYGQLQEWLEDLDDPDNKHRHVSHLWAVYPGSEINWQDSPKLMNAAKKS 782
Query: 635 LQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH 694
L RG+ G GWS+ WK LWAR D EHAY+MV RL + PE GG+Y NLF AH
Sbjct: 783 LIFRGDGGTGWSLAWKINLWARFKDAEHAYKMVSRLLS---PEEAG---GGVYPNLFDAH 836
Query: 695 PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
PPFQID NFG A VAEML+QS L + +LPALP +G VKG++ARGG +S W++
Sbjct: 837 PPFQIDGNFGGAAGVAEMLLQSHLGSIDILPALP-KALYAGAVKGIRARGGFELSYQWQN 895
Query: 755 GDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKC 802
G L + ++S+ +L YR ++ G+ Y + LK
Sbjct: 896 GLLTHLEVFSHAGGK-----CSLRYRDKEIQFQTEKGQTYYLDSSLKL 938
Score = 76.3 bits (186), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 53/83 (63%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PA +T+A+PIGNG+LGAMV+GGV ++ ++ NE +LWTG P +Y P A L
Sbjct: 28 LTLWYQHPANTWTEALPIGNGKLGAMVFGGVQADRIQFNESSLWTGGPRNYNQPGAKNYL 87
Query: 73 SDVRSLVDSGQYAEATAASVKLF 95
++R L+ G+ A + + F
Sbjct: 88 GEIRKLLSEGKQQAAEELAGRHF 110
>gi|255532706|ref|YP_003093078.1| alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
gi|255345690|gb|ACU05016.1| Alpha-L-fucosidase [Pedobacter heparinus DSM 2366]
Length = 940
Score = 439 bits (1129), Expect = e-120, Method: Compositional matrix adjust.
Identities = 268/702 (38%), Positives = 373/702 (53%), Gaps = 60/702 (8%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
YQ GD+ L F + A Y+R+LDLNTA A Y++ + + RE+ +S PDQ IV
Sbjct: 295 YQPFGDLYLNFKTEN--EAVTNYKRKLDLNTAVASTTYTLKGINYLREYLASQPDQAIVI 352
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+++ + GS+SF D+LL + +G +I ++ ++ +
Sbjct: 353 RLTADKKGSISF----DALLGSPHKYSGVKKINANTIALSLKVRDGV--------LKGES 400
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
L+ I+ + ++A K+ + +D L L A +SF +N D +P S ++ A
Sbjct: 401 RLQAIITKGKLLVTA---NKISIVAADAVTLYLTAGTSF----VNDKDVSGNPASAAVKA 453
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
L + SY+ + H+ +YQK + S+ K ++P+ ER++ F
Sbjct: 454 LTGLNGKSYAQVKAAHIKEYQKYYTAFSVSFGPDSKA------------SLPTDERIEQF 501
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
DP+ L Q+GRYLLISSSRPGTQ ANLQGIWNE L+P W S NINLEMNYW
Sbjct: 502 SDGNDPAFAALFMQYGRYLLISSSRPGTQPANLQGIWNELLTPPWGSKYTTNINLEMNYW 561
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+ NLS EPL + L+ NG TA+V+Y A GWV+HH TD+W +A
Sbjct: 562 PTGVLNLSAMAEPLIRKINALAKNGEVTAKVHYNAKGWVLHHNTDLW-NGTAPINASNHG 620
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPST 520
+W G WL HLWEHY +T D +FL+ AYP+++ A F D+LI+ G+L + PS
Sbjct: 621 IWVSGAGWLSQHLWEHYLFTQDLNFLKNEAYPVMKQAAVFFNDFLIKDPKTGWLISTPSN 680
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL-KSLPRL 579
SPE +G L TMD IIR +F I+A +L DA +K L + + +
Sbjct: 681 SPE------NGGLVA---GPTMDHQIIRTLFRNCIAATALL--GVDADFKKTLEQKITLI 729
Query: 580 RPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
P +I + G + EW +D D HRH+SHL+G+ PG+ IT + PD+ KAA ++L RG
Sbjct: 730 APNQIGKYGQLQEWLEDKDDTTNKHRHVSHLWGVHPGNDITWD-TPDMMKAARQSLIYRG 788
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+EG GWS+ WK WAR D HA +MVK L+ P + GG Y NLF AHPPFQI
Sbjct: 789 DEGTGWSLAWKINFWARFKDGNHAMKMVKM---LISPAAKG---GGAYINLFDAHPPFQI 842
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG A +AEML+QS + LLPALP D G VKG+ ARGG ++ WKDG L
Sbjct: 843 DGNFGGAAGIAEMLLQSHTQFVELLPALPAD-LPEGEVKGICARGGFVLNFKWKDGALSA 901
Query: 760 VGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLK 801
V +YS L Y + G Y FN L+
Sbjct: 902 VEVYSKTG-----GVCLLRYGNKITSIATQRGASYKFNGDLE 938
Score = 75.1 bits (183), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 52/82 (63%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA+ +TDA+PIGNGRLGAM++ GV + ++ NE+TLWTG P DY + A L
Sbjct: 32 QLWYTKPAEKWTDALPIGNGRLGAMIFAGVEKDHIQFNEETLWTGGPRDYNHKGAAAYLP 91
Query: 74 DVRSLVDSGQYAEATAASVKLF 95
+R L+ G EA + + F
Sbjct: 92 QIRQLLFEGNQQEAEKLAAEKF 113
>gi|259485946|tpe|CBF83399.1| TPA: conserved hypothetical protein [Aspergillus nidulans FGSC A4]
Length = 757
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 269/759 (35%), Positives = 397/759 (52%), Gaps = 62/759 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA + +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA K L
Sbjct: 3 ELWYQRPAAGWDEALPVGNGRLGAMVYGRTDTELLQLNEDSVWYGGPQNRLPEDALKCLP 62
Query: 74 DVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R L+ G + EA A F P Y+ LG + LEF H YRR LDL
Sbjct: 63 RLRELIREGAHKEAERLARRAFFASPNSQRHYEPLGTLFLEF--GHPCEEVTGYRRSLDL 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
N V Y V++ R+ +S PD V+ ++ S +S S L+ +
Sbjct: 121 NEGITHVHYEHNGVQYHRQVIASYPDNVLAMRVQASRCSEFLVRLSRLSELEYETN-EFL 179
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTISA-LEDKKLKVEGSD 248
+ ++++G+ + P ++ + ++ I+ SDD+ I K L + D
Sbjct: 180 DDLVVDGQSIKMHVTPGGKDSN-----RACCMVAIRCGSDDQEPIKVDCVGKNLIINARD 234
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A++++VA S++ D +++ L+++ S D++ RH+ DYQ L+ R+
Sbjct: 235 -ALIVIVAQSTY-------RCDDADLDRATVADLEAVLASSVEDIWARHITDYQSLYGRL 286
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L DI TD +R+ + P LV + ++ RYLLIS SRPG
Sbjct: 287 ELNLGPDATDIPTD-------------QRILHVR---GPELVAIYLRYSRYLLISCSRPG 330
Query: 369 TQ-------VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
+ A LQGIWN P W +NINL+MNYW + NL EC+EPLF L
Sbjct: 331 RKGSSDRVLPATLQGIWNASFHPPWGCRYTININLQMNYWPANVGNLLECEEPLFALLER 390
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
L++ G++TA+ Y GW +HH TD+WA ++ + LWP+GGAWLCTH+WE + +
Sbjct: 391 LAVTGTETARKMYGCRGWTVHHNTDLWADTAPVDRWMPATLWPLGGAWLCTHVWERFLFN 450
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSS 540
++ FL KR +P+L GC FL D+L++ G Y TNPS SPE+ F G+ + S
Sbjct: 451 GNKAFL-KRMFPVLRGCVEFLQDFLVDDVSGQYKVTNPSLSPENTFRDEKGQEGVLCEGS 509
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDP 600
T+D+ ++R V A + + EVL ++D L+ V +L RL P +I G + EW D+ +
Sbjct: 510 TIDIQLVRAVLKAFVESLEVLGYSQDELLPSVHDTLRRLPPARIGSKGQLQEWMFDYDEN 569
Query: 601 EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARL 657
E HRH+SHL+ L+PG+ I +E P+L KA TLQ+R G GWS W L ARL
Sbjct: 570 EPGHRHVSHLWALYPGNDINLETTPELAKACAVTLQRRQAAGGGHTGWSRAWLLNLHARL 629
Query: 658 HDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST 717
D + ++RL NL HPPFQID NFG A + EMLVQS
Sbjct: 630 RDADECAEHLERL-----------LAQSTLPNLLDTHPPFQIDGNFGGGAGILEMLVQSH 678
Query: 718 LNDLY-LLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
+ + LLPA P W SG ++G++ARGG + WKDG
Sbjct: 679 EDGIIRLLPACPL-AWRSGRLRGVRARGGFELEFEWKDG 716
>gi|67525297|ref|XP_660710.1| hypothetical protein AN3106.2 [Aspergillus nidulans FGSC A4]
gi|40744501|gb|EAA63677.1| hypothetical protein AN3106.2 [Aspergillus nidulans FGSC A4]
Length = 1679
Score = 439 bits (1128), Expect = e-120, Method: Compositional matrix adjust.
Identities = 269/756 (35%), Positives = 395/756 (52%), Gaps = 62/756 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + +A+P+GNGRLGAMV+G +E L+LNED++W G P + DA K L +R
Sbjct: 6 YQRPAAGWDEALPVGNGRLGAMVYGRTDTELLQLNEDSVWYGGPQNRLPEDALKCLPRLR 65
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ G + EA A F P Y+ LG + LEF H YRR LDLN
Sbjct: 66 ELIREGAHKEAERLARRAFFASPNSQRHYEPLGTLFLEF--GHPCEEVTGYRRSLDLNEG 123
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
V Y V++ R+ +S PD V+ ++ S +S S L+ + + +
Sbjct: 124 ITHVHYEHNGVQYHRQVIASYPDNVLAMRVQASRCSEFLVRLSRLSELEYETN-EFLDDL 182
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI-SDDRGTISA-LEDKKLKVEGSDWAV 251
+++G+ + P ++ + ++ I+ SDD+ I K L + D A+
Sbjct: 183 VVDGQSIKMHVTPGGKDSN-----RACCMVAIRCGSDDQEPIKVDCVGKNLIINARD-AL 236
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+++VA S++ D +++ L+++ S D++ RH+ DYQ L+ R+ +
Sbjct: 237 IVIVAQSTYRC-------DDADLDRATVADLEAVLASSVEDIWARHITDYQSLYGRLELN 289
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ- 370
L DI TD +R+ + P LV + ++ RYLLIS SRPG +
Sbjct: 290 LGPDATDIPTD-------------QRILHVR---GPELVAIYLRYSRYLLISCSRPGRKG 333
Query: 371 ------VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
A LQGIWN P W +NINL+MNYW + NL EC+EPLF L L++
Sbjct: 334 SSDRVLPATLQGIWNASFHPPWGCRYTININLQMNYWPANVGNLLECEEPLFALLERLAV 393
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TA+ Y GW +HH TD+WA ++ + LWP+GGAWLCTH+WE + + ++
Sbjct: 394 TGTETARKMYGCRGWTVHHNTDLWADTAPVDRWMPATLWPLGGAWLCTHVWERFLFNGNK 453
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
FL KR +P+L GC FL D+L++ G Y TNPS SPE+ F G+ + ST+D
Sbjct: 454 AFL-KRMFPVLRGCVEFLQDFLVDDVSGQYKVTNPSLSPENTFRDEKGQEGVLCEGSTID 512
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
+ ++R V A + + EVL ++D L+ V +L RL P +I G + EW D+ + E
Sbjct: 513 IQLVRAVLKAFVESLEVLGYSQDELLPSVHDTLRRLPPARIGSKGQLQEWMFDYDENEPG 572
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQ 660
HRH+SHL+ L+PG+ I +E P+L KA TLQ+R G GWS W L ARL D
Sbjct: 573 HRHVSHLWALYPGNDINLETTPELAKACAVTLQRRQAAGGGHTGWSRAWLLNLHARLRDA 632
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
+ ++RL NL HPPFQID NFG A + EMLVQS +
Sbjct: 633 DECAEHLERL-----------LAQSTLPNLLDTHPPFQIDGNFGGGAGILEMLVQSHEDG 681
Query: 721 LY-LLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
+ LLPA P W SG ++G++ARGG + WKDG
Sbjct: 682 IIRLLPACPL-AWRSGRLRGVRARGGFELEFEWKDG 716
>gi|229173820|ref|ZP_04301360.1| hypothetical protein bcere0006_29180 [Bacillus cereus MM3]
gi|228609670|gb|EEK66952.1| hypothetical protein bcere0006_29180 [Bacillus cereus MM3]
Length = 1156
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 275/776 (35%), Positives = 415/776 (53%), Gaps = 82/776 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 47 LSLWYNQPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPNSTSEYTYGNR 106
Query: 67 D-APKALSDVRSLV--DSGQYAEATAASV-----KLFGHPADVYQLLGDIELEFDDSHLK 118
D A L +R + D AE ++ K FG YQ GDI L+F+
Sbjct: 107 DGAASHLGSIREKLAKDDKSGAERESSQFLTGLQKGFGS----YQNFGDIYLDFNMPDAS 162
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ YRREL++N A V Y+ V++ RE+F+S PD+V+V +++ SES LS +V
Sbjct: 163 -SFSNYRRELNVNEGIATVSYNYKGVQYNREYFASYPDRVMVMRLTASESKQLSLDVRPT 221
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S +N+I ++G+ AN+ G+++ + E K+ ++ GT++A E
Sbjct: 222 SAQGGQVSAT-DNKITIKGQI----------ANN---GMKYES--EFKVLNEGGTLTA-E 264
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I SY L H+
Sbjct: 265 NGKIKVANADSLTIIMTAATDYENKY--PTYKGEDPHQKIEKIMSAISKKSYEVLKYTHM 322
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ + L EL FQ+GR
Sbjct: 323 KDYYSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 369
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE EPL D+
Sbjct: 370 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETAEPLMDY 429
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ ++WE
Sbjct: 430 VDSLREPGRVSAEKHFGVKGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNVWE 488
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP+++ A F ++L+E + L +P SPE L +
Sbjct: 489 HYKFTDDKQYLQEKIYPIIKEAAEFHSNFLVEDQNKKLVVSPCWSPE---------LGGI 539
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ + D L K + P P +I G + EW
Sbjct: 540 SNGCAFDQQLVYELFSNVIEASEVLQIDNVFRDELKAKRERLFP---PIQIGRYGQVQEW 596
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
D DP HRH+S L L+PG I P+ +AA+ TL RG+EG GWS K L
Sbjct: 597 KDDIDDPAETHRHISQLVALYPGSMIN-HNTPEWLQAAKVTLNHRGDEGTGWSKANKINL 655
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D +HAY+++ + G SNLF HPPFQID NFG T+ +AEML
Sbjct: 656 WARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIAEML 704
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+QS + + LLPALP W G KGL+ARG T++ WK+G + + S++ N+
Sbjct: 705 IQSHTDSIQLLPALP-KAWKDGSYKGLRARGAFTINADWKNGVPTVIQVTSDHGND 759
>gi|315500597|ref|YP_004089399.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
gi|315418609|gb|ADU15248.1| Alpha-L-fucosidase [Asticcacaulis excentricus CB 48]
Length = 788
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 275/763 (36%), Positives = 387/763 (50%), Gaps = 82/763 (10%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK---- 70
++FN PA + +A+P+GNGRLGAMV+GGV SE L+LN LW+G T D PK
Sbjct: 38 LSFNAPAARWMEALPVGNGRLGAMVYGGVRSERLQLNHIELWSG----RTVEDNPKTTRA 93
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPAD-----VYQLLGDIELEFDDSHLKYAEETYR 125
AL VR L+ + + AEA + P + YQ+LGD+ LE A Y
Sbjct: 94 ALPKVRELLFADKRAEANRLAQDDMMAPMNEVDYGSYQMLGDLRLEMGHEE---AVSDYS 150
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
RELD+ T V+Y +G ++R +S PDQ + +I S LS +L D
Sbjct: 151 RELDMATGQVTVRYRIGKATYSRTVLASAPDQCLAVRIETSAPEGLSLKATLKR--DRDV 208
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ Q++ K + P G+ + A L + G A + +V
Sbjct: 209 AFDWQGQVL------------KMSGQPQPFGVHYCAYLACR---SEGGSVAPDGHGFRVS 253
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G+ VL L ++ P +P + +A + S+ L D++ LF
Sbjct: 254 GARAVVLNLTGATDLLAP---------EPEKVAQAAQAKLVARSWQALARDQERDHRALF 304
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVP--SAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
RV + L+ + VP ++ER+ + + +L+E F FGRYLLI
Sbjct: 305 ERVELTLASA---------------GVPRLASERLAAASDAAEMALIETYFNFGRYLLIG 349
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
S+RPG+ NLQG+W + +P W + H+NIN++MNYW + C LSE E LFD++ L
Sbjct: 350 SNRPGSLPPNLQGLWADGFAPPWSADYHININIQMNYWPAEVCGLSELHESLFDYVDRLM 409
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+TAQ+ Y G V H+ T+ W ++ D GKV W LWP G AWL H WEHY YT D
Sbjct: 410 PYARQTAQIAYGCRGAVAHYTTNPWGHTALD-GKVQWGLWPEGLAWLTLHYWEHYLYTGD 468
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+FL+ RA P+ CA F LD+L+E G L + P++SPE+ ++ +G++ V M
Sbjct: 469 LEFLKTRALPVFRACAEFTLDYLVEDPRTGKLVSGPASSPENSYVMDNGEVGYVDMGCAM 528
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
++ V + A E L E L E +L RL KI DG + EW++ K+ E
Sbjct: 529 SQSMAFTVLTLTQKATEALSV-EPELREACAAALARLDRLKIGPDGRVQEWSEPLKEAEP 587
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHD 659
HRH+SHLFGL+PG I PDL AA +TL +R G GWS W T ARL +
Sbjct: 588 GHRHISHLFGLYPGIEIDAHDTPDLADAARRTLGERLRHGGGHTGWSAAWLTMFRARLGE 647
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH-----PPFQIDANFGFTAAVAEMLV 714
+ A M+++LF + G +N F H P FQID N G TAA+AEMLV
Sbjct: 648 GDEALAMLRKLF--------RQSTG---ANFFDTHPYTPEPIFQIDGNLGATAAIAEMLV 696
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QS L LLPALP W++G V+GL+ARGG V + W +G L
Sbjct: 697 QSHSGILRLLPALP-KSWANGRVRGLRARGGLIVDLEWANGQL 738
>gi|289773991|ref|ZP_06533369.1| large secreted protein [Streptomyces lividans TK24]
gi|289704190|gb|EFD71619.1| large secreted protein [Streptomyces lividans TK24]
Length = 693
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 254/686 (37%), Positives = 367/686 (53%), Gaps = 58/686 (8%)
Query: 92 VKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTRE 149
+ G P++ YQ+LGD+EL + Y RELDL TA AR Y+ G V RE
Sbjct: 15 AEFLGSPSEQAAYQVLGDLELTLAG---EGEAADYERELDLETAVARTTYTRGGVRHVRE 71
Query: 150 HFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKAN 209
F+S PDQV+V ++S G++ F S + + I ++G +
Sbjct: 72 VFASAPDQVLVVRLSADTPGAVGFTARFTSPQRSGGSAVDAHTIALDG--------VGGD 123
Query: 210 ANDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
P ++F + ++S D GT L VEG+D A L++ ++S+
Sbjct: 124 WYGRPGSVRFRGLARAESEGGRVSTDGGT--------LTVEGADAATLVISLATSYR--- 172
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
N D DP S + + L Y+ L RH+ D+++LF RV++ L S +
Sbjct: 173 -NYLDVGADPASRARNHLAPAARKPYAHLRARHVADHRRLFGRVALDLGPSERA------ 225
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSP 384
+P+ +R+ F +DP L L FQ+GRYLL S SR Q ANLQG+WN+ L+P
Sbjct: 226 ------ELPTDQRIPLFADGKDPQLAALYFQYGRYLLASCSRSPGQPANLQGLWNDSLNP 279
Query: 385 TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHK 444
W+S VNIN EMNYW + P NL+EC +P + L+ +G++TA+ Y A GWV+HH
Sbjct: 280 AWESKYTVNINFEMNYWPAGPGNLAECWDPAVRMVHELAESGTRTAKALYDAPGWVLHHN 339
Query: 445 TDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD 504
TD W + +A + +WP GGAWLC LW+HY +T D L R YP+++G F LD
Sbjct: 340 TDGW-RGTAPVDAAQYGMWPTGGAWLCVMLWDHYRFTGDTGAL-SRNYPVMKGAVEFFLD 397
Query: 505 WL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK 563
L ++ G+L TNPS SPE +G+ + TMDM ++R++F A AAEVL++
Sbjct: 398 TLQVDAETGWLVTNPSQSPEVTHHQDEGESVSICAGPTMDMQLLRDLFDAYRQAAEVLDR 457
Query: 564 NEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE-VHHRHLSHLFGLFPGHTITIE 622
+ LV +V + RL PT++ G I EW D+++ V RH+SHL+G+FP IT
Sbjct: 458 DSR-LVGRVTEVRDRLAPTRVGHLGQIQEWLVDWEEAALVRSRHVSHLYGVFPSAQITPR 516
Query: 623 KNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHF 682
P+L AA+K+L+ RG G GWS+ WK +WARL + AY + L +L+ P
Sbjct: 517 GTPELAAAAKKSLELRGTAGQGWSLAWKINMWARLLEPARAY---QHLADLLTPARTA-- 571
Query: 683 EGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKA 742
NLF HPPFQID NFG + + EML+QS ++ LLPALP + W +G +GL+A
Sbjct: 572 -----PNLFDLHPPFQIDGNFGGVSGITEMLLQSHAGEIELLPALP-EAWPTGSFRGLRA 625
Query: 743 RGGETVSICWKDGDLHEVGIYSNYSN 768
RGG V + W + + S N
Sbjct: 626 RGGFEVDLEWTGAGITRAEVRSLLGN 651
>gi|325103050|ref|YP_004272704.1| alpha-L-fucosidase [Pedobacter saltans DSM 12145]
gi|324971898|gb|ADY50882.1| Alpha-L-fucosidase [Pedobacter saltans DSM 12145]
Length = 938
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 272/702 (38%), Positives = 385/702 (54%), Gaps = 61/702 (8%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
YQ GDI L F H +Y Y+RELDLN+A A+ YS +TR +F + P +V
Sbjct: 294 YQPFGDIYLNF--KHQEYT--NYKRELDLNSALAKTSYSHKGTNYTRTYFVNAPQNTLVI 349
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+ ++ +++F S DS S ++ R + K A + +
Sbjct: 350 HLEANQPKNVTFTASFDSPHSQKSIRK------IDDRTIALDVKVKYGA------LFGES 397
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
IL +K + G IS +++ +L VEG+D A L+L A+++F +N D P+ ++
Sbjct: 398 ILHLK--NKNGKIS-VKNNQLVVEGADEATLMLFAATNF----VNFHDVSGKPSVKNQQT 450
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
L S +NL Y L HL DY L++R S+ + ++ +P+ ER++ F
Sbjct: 451 LASAKNLDYQTLKQNHLQDYTSLYNRFSLSFGGNSRE------------DLPTDERIREF 498
Query: 342 -QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
+T DP+L+ L Q+GRYLLISSSR TQ ANLQGIWN L+P+W S NIN+EMNY
Sbjct: 499 SKTANDPALLALYAQYGRYLLISSSRANTQPANLQGIWNHLLAPSWGSKYTTNINVEMNY 558
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W S NLS+ +PLF + LS +G++TA+ Y GWV+HH TDIW + +A
Sbjct: 559 WLSEMLNLSDLHQPLFGMIEDLSKSGAETAKNYYNLPGWVLHHNTDIW-RGAAPINNSNH 617
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPS 519
+WP GGAWL THL EHY +T D+ FL K+ YP+++ F D+L ++ G L + PS
Sbjct: 618 GIWPTGGAWLTTHLLEHYAFTKDQAFL-KKYYPIIKNSVLFYKDFLVVDPISGCLISTPS 676
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH G L TMD IIR +F ++ + L +ED L +++ ++
Sbjct: 677 NSPEH------GGLVA---GPTMDHQIIRALFDGFVNVSAALGLDED-LRKEIQTKKQQI 726
Query: 580 RPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
P KI + G + EW D D HRH+SHL+ L PG+ I E PDL +A ++TL+ RG
Sbjct: 727 LPNKIGKYGQLQEWMVDVDDRNDKHRHVSHLWALHPGNEINWETTPDLLEATKQTLKFRG 786
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
++G GWS+ WK WARL D EH Y+M++ L+ P + GG Y NLF AHPPFQI
Sbjct: 787 DDGTGWSLAWKINFWARLRDGEHTYKMMQM---LLAPAGK---SGGSYPNLFDAHPPFQI 840
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG A +AEMLVQS + + +LPALP +G VKGLKARGG + W G L +
Sbjct: 841 DGNFGGAAGIAEMLVQSHTSFIEILPALP-RALQTGEVKGLKARGGFELDFSWSKGKLQK 899
Query: 760 VGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLK 801
+ + S N TL + K GK+YTF+ L+
Sbjct: 900 LTVKSLAGGNCRLKVGTLEKDFKTEK-----GKVYTFDGGLQ 936
Score = 87.0 bits (214), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 57/79 (72%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PAK +T+A+PIGNG++GAM++GGV + ++ NE+TLWTG P +Y PDA K L +R
Sbjct: 32 YKQPAKEWTEALPIGNGKIGAMIFGGVAQDRIQFNEETLWTGSPRNYNKPDAYKYLPQIR 91
Query: 77 SLVDSGQYAEATAASVKLF 95
+L+ G+ EA A +++ F
Sbjct: 92 TLLQQGKQREAEALAMQEF 110
>gi|423668781|ref|ZP_17643810.1| hypothetical protein IKO_02478 [Bacillus cereus VDM034]
gi|423675093|ref|ZP_17650032.1| hypothetical protein IKS_02636 [Bacillus cereus VDM062]
gi|401300760|gb|EJS06350.1| hypothetical protein IKO_02478 [Bacillus cereus VDM034]
gi|401309028|gb|EJS14402.1| hypothetical protein IKS_02636 [Bacillus cereus VDM062]
Length = 1156
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 271/776 (34%), Positives = 411/776 (52%), Gaps = 82/776 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----YTNP 66
L + +N PAK + A+PIGNG +G MV+GGV E ++ NE TLWTG P Y N
Sbjct: 47 LTLWYNQPAKDWEKQALPIGNGYMGGMVFGGVQQERIQFNEKTLWTGGPSSTSEYTYGNR 106
Query: 67 D-APKALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK 118
D A L +R + G + A S + FG YQ GDI L+F+
Sbjct: 107 DGAASHLGSIREKLAKGDKSGAEKESSQFLTGLEKGFGS----YQNFGDIYLDFNMPDAS 162
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
+ YRREL++N A V Y+ +V++ RE+F+S PD+V+V +++ SE+ +S +V
Sbjct: 163 -SFSNYRRELNINEGIATVSYNYKDVQYNREYFTSYPDRVMVMRLTASEAKKISLDVRPT 221
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
S + +N+I M+G+ G+++ A K+ ++ GT++A E
Sbjct: 222 SAQGGQ-VTSVDNKITMKGQITNN-------------GMKYEAAF--KVLNEGGTLTA-E 264
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ K+KV +D +++ A++ ++ + P+ +DP + + +I SY L H+
Sbjct: 265 NGKIKVANADSLTIIMTAATDYENKY--PAYKGEDPHEKVEKTMAAISKKSYEVLKYTHI 322
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DY LF+RVS+ L +VP+ E + S+ + L EL FQ+GR
Sbjct: 323 KDYHSLFNRVSLNLGGEKP-------------SVPTNELLASYSKENSKYLEELFFQYGR 369
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSRPGT ANLQG+WN +P W+S H NINL+MNYW + NLSE PL D+
Sbjct: 370 YLLISSSRPGTLPANLQGVWNNSNTPPWESDYHFNINLQMNYWPAEVTNLSETALPLMDY 429
Query: 419 LTYLSINGSKTAQVNY--LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ L G +A+ ++ GW ++ + + ++ G + W P A++ ++WE
Sbjct: 430 VDSLREPGRVSAEKHFGVKGGGWTVNTMNNPFGFTAPGWG-LGWGWAPSANAFIGQNVWE 488
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
HY +T D+ +L+++ YP++ A F +L+E + L +P SPE L +
Sbjct: 489 HYKFTDDKQYLKEKIYPIINEAAEFHSKFLVEDQNKKLVVSPCWSPE---------LGGI 539
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKN---EDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
S D ++ E+FS +I A+EVL+ + D L K + P P +I G + EW
Sbjct: 540 SNGCAFDQQLVYELFSNVIEASEVLQIDNVFRDELKAKRDRLFP---PIQIGRYGQVQEW 596
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
D DP HRH+S L L+PG I K P+ +AA+ TL RG+EG GWS K L
Sbjct: 597 KDDIDDPGETHRHISQLVALYPGSMINY-KTPEWLQAAKVTLNHRGDEGTGWSKANKINL 655
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D +HAY+++ + G SNLF HPPFQID NFG T+ +AEML
Sbjct: 656 WARLLDGDHAYKIL-----------QGQLTGSTLSNLFDTHPPFQIDGNFGATSGIAEML 704
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
+QS + + LLPALP W +G KGL+ARG T++ WK+G + + S++ N+
Sbjct: 705 IQSHTDSIQLLPALP-KAWKNGSYKGLRARGAFTINADWKNGVPTVIQVTSDHGND 759
>gi|325263746|ref|ZP_08130479.1| fibronectin type III domain protein [Clostridium sp. D5]
gi|324030784|gb|EGB92066.1| fibronectin type III domain protein [Clostridium sp. D5]
Length = 769
Score = 437 bits (1125), Expect = e-120, Method: Compositional matrix adjust.
Identities = 278/775 (35%), Positives = 404/775 (52%), Gaps = 72/775 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
+I F A+ +T+A+PIGNG LGAMV+G E +++NED++W+G + NPDA L
Sbjct: 3 EIWFRKEAEEWTEALPIGNGFLGAMVFGRTSVERIQVNEDSVWSGGYMERLNPDAKGHLD 62
Query: 74 DVRSLVDSGQYAEATA-ASVKLFG-HP-ADVYQLLGDIELEFDD--------------SH 116
+VR L+ G+ EA AS ++ +P YQ LGD+ ++F + S
Sbjct: 63 EVRQLLMQGRVQEAELLASRSMYAVYPHMRHYQTLGDVWIDFFNTRGRQTVKKKENGTSF 122
Query: 117 LKYAE---ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
++Y E YRR L+L A + Y+ RE F+S+P V+V ++ E +L F
Sbjct: 123 VEYESPVFEEYRRSLNLEDAVGNIVYTAEKGAVKREFFASSPAGVLVYRMCAEEDEALDF 182
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGK----RIPPKANANDDPKGIQFSAILEIKISD 229
VSL + DN S G +G R+ K ND GI F + ++I+
Sbjct: 183 EVSL-TRKDNRS---GRGSSFCDGTMAVGDDTIRLYGKNGGND---GIAFE--MAVRIAS 233
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
G + + VEG+ AVL + +++ KDP + M L+ L
Sbjct: 234 VGGRQYRM-GSHIIVEGAKEAVLYITGRTTY---------RSKDPAAWCMETLEKAAGLP 283
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPS 348
Y +L +HL+DY L++ V + EE ++ + + ER+ +T ED
Sbjct: 284 YEELKMQHLEDYHSLYN-----------SCVLELDEEEELEQLSTPERLARMRTGKEDVG 332
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
LV L + FGRYLLISSSR + ANLQGIWNED P W S +NIN++MNYW + L
Sbjct: 333 LVNLHYNFGRYLLISSSRENSLPANLQGIWNEDFEPAWGSKYTININIQMNYWMAEKTGL 392
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGA 468
S PL + L + +G +TA+ Y A G+ HH TDIW + V +WPMGGA
Sbjct: 393 SRLHMPLLEHLKTMRPHGQETAEKMYGARGFCCHHNTDIWGDCAPQDSHVSATIWPMGGA 452
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
WLC H+ EHY YT DR F+E+ Y +L F D++++ G+ T PS+SPE+ ++
Sbjct: 453 WLCLHIIEHYLYTKDRVFMEE-FYGILRDSVQFFADYMVQDEQGHWITGPSSSPENIYMN 511
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
G+ C+ MD I+RE+FS + E L++ D L +V L L P KI + G
Sbjct: 512 EQGECGCLCMGPAMDSEILRELFSGYLRITEELDRG-DGLEAEVKMRLEGLPPVKIGKYG 570
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGW 645
I EW +D+++ E+ HRH+S LF L+P I +K P+L +AA TL++R G GW
Sbjct: 571 QIQEWRKDYEEMEIGHRHISQLFALYPAAQIRPDKTPELARAARHTLERRLSHGGGHTGW 630
Query: 646 SITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
S W +ARL D E A++ + L LVD NLF HPPFQID NFG
Sbjct: 631 SKAWIILFYARLGDGEKAWKNQREL--LVD---------ATLDNLFNTHPPFQIDGNFGG 679
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
+ EMLVQ + +YLLPALP SG V+G++ + G + + W+D + E+
Sbjct: 680 ACGLLEMLVQDFEDTVYLLPALP-QALKSGKVRGIRLKCGCILDLEWRDAKITEI 733
>gi|345517561|ref|ZP_08797030.1| hypothetical protein BSFG_03806 [Bacteroides sp. 4_3_47FAA]
gi|254837350|gb|EET17659.1| hypothetical protein BSFG_03806 [Bacteroides sp. 4_3_47FAA]
Length = 828
Score = 436 bits (1121), Expect = e-119, Method: Compositional matrix adjust.
Identities = 273/807 (33%), Positives = 412/807 (51%), Gaps = 91/807 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P N + L ++R
Sbjct: 72 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTSAGAAAYWNVNKQSAHILDEIR 131
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
+G A + K F + +G+ +E S + ++ Y
Sbjct: 132 QAFINGDEKRAMLLTQKNFNSEVPYESWKEKPFRFGNFTTMGEFYIETGLSTIGMSD--Y 189
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V+++ V + R +F S P+ V+ + ++ G +L F+ + +
Sbjct: 190 KRILSLDSALAIVQFNKDGVAYERNYFISYPNNVMTIRFKANKPGKQNLVFSYEPNPVST 249
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
NGNN ++ R Q ++ I + GT+S + KL
Sbjct: 250 GKMETNGNNGLVYTARLDNN---------------QMEYVIRIHATAKGGTLSN-QSGKL 293
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP--SDSKK----DPTSESMSALQSIRNLSYSDLYTR 296
V G+D + L+ A + + F NP +D K +P+ + + ++ L Y L+
Sbjct: 294 SVNGADEVIFLVTADTDYQINF-NPDFNDPKAYVGVNPSETTATWMKDAAALGYDALFDA 352
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQ 355
H DY LF+RVS+ L+ S K D +P+ +R+K+++ + D L EL +Q
Sbjct: 353 HYKDYASLFNRVSLSLNGSGK-----------TDNIPTPQRLKNYRKGKPDFYLEELYYQ 401
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 402 FGRYLLIASSRPGNLPANLQGIWHNNVDGPWRVDYHNNINVQMNYWPAGSTNLAECTLPL 461
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHL 474
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+
Sbjct: 462 IDFIKTLVKPGEKTAQAYFGARGWTASISGNIFGFTAPLESENMSWNFNPMAGPWLATHV 521
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
W++Y+YT D+ FL+K Y L++ A F +D+L + DG PSTSPEH
Sbjct: 522 WDYYDYTRDKQFLKKTGYGLIKSSAQFAVDYLWKKPDGTYTAAPSTSPEH---------G 572
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
+ +T A++RE+ I A+++L +K E E+VL+ +L P +I G +ME
Sbjct: 573 PIDQGATFIHAVVREILLNAIDASKILGVDKKERKQWEEVLE---KLAPYQIGRYGQLME 629
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W++D DP+ HRH++HLFGL PGHT++ P+L KA++ L+ RG+ GWS+ WK
Sbjct: 630 WSKDIDDPKDEHRHVNHLFGLHPGHTLSPITTPELAKASKVVLEHRGDGATGWSMGWKLN 689
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
WARLHD HAY++ L + G NL+ H PFQID NFG TA V EM
Sbjct: 690 QWARLHDGNHAYKLYGNL-----------LKNGTLDNLWDTHSPFQIDGNFGGTAGVTEM 738
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHD 772
L+QS + ++LLPALP D W G VKG+ A+G V+I WK+ L EV I S +
Sbjct: 739 LMQSHMGFIHLLPALP-DAWKDGEVKGICAKGNFEVNIRWKNRKLEEVVILSK-----NG 792
Query: 773 SFKTLHYRGTSVKVNLSAGKIYTFNRQ 799
+ YR S+K+ + GK Y +
Sbjct: 793 GTCEIKYRHASIKLKTAKGKTYCLTNE 819
>gi|375101342|ref|ZP_09747605.1| alpha-galactosidase family protein [Saccharomonospora cyanea
NA-134]
gi|374662074|gb|EHR61952.1| alpha-galactosidase family protein [Saccharomonospora cyanea
NA-134]
Length = 1130
Score = 436 bits (1120), Expect = e-119, Method: Compositional matrix adjust.
Identities = 285/821 (34%), Positives = 427/821 (52%), Gaps = 84/821 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG----DYTNPD 67
L + ++ PA + ++ +PIG+G LGA V+GGV +E L+ NE TLWTG PG D+ N
Sbjct: 52 LTLWYDEPASDWESEILPIGSGALGAGVFGGVATERLQFNEKTLWTGGPGSAGYDFGNWK 111
Query: 68 APK--ALSDVRSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEE 122
P+ A+ +V+ +D+ Q + + KL G P YQ G++ + S + E
Sbjct: 112 EPRPGAIEEVQERIDAEQRVDPEWVASKL-GQPKQGYGAYQTFGEVRV----SGAEPQEV 166
Query: 123 T-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
T YRR LD+ A A V Y V TRE+F++ D VIV + SG E+G++ V + +
Sbjct: 167 TDYRRYLDIADAVAGVSYEADGVRHTREYFATAADDVIVARFSGDETGAVDVTVGV-TAP 225
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
DN S N +GR A A DD G+++ A L++ + G+ + D
Sbjct: 226 DNRS----KNVTAKDGRIT------FAGALDD-NGLRYEAQLQVLT--EGGSRTDNPDGS 272
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ V +D L+L A + + + P+ DP + + + Y L H+ D+
Sbjct: 273 VTVADADTMTLVLAAGTDYSDEY--PAYRGDDPHAAVTERVDAAVAEGYDALRAAHVADH 330
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
++LF RVS+ L + D+ TD D +AE ++ + L FQ+GRYLL
Sbjct: 331 RELFDRVSLDLGQRMPDLPTDELLARYRDGGLAAEERRALEA--------LYFQYGRYLL 382
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
I+SSRPG+ ANLQG+WN+ SP W + HVNINL+MNYW + NLSE +PLFD++
Sbjct: 383 IASSRPGSLPANLQGVWNDSTSPPWSADYHVNINLQMNYWPAEVTNLSETTDPLFDYVDS 442
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNY 480
L G TA+ + GWV+H++T + + D W +P GAWL WEHY +
Sbjct: 443 LVAPGEVTAREMFDNRGWVVHNETTPFGYTGVHDWATAFW--FPEAGAWLAQSYWEHYLF 500
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
T D FL +RAYP+L+ + F +D L+ + DG L NPS SPE S
Sbjct: 501 TRDETFLRERAYPMLKSLSQFWIDELVTDPRDGKLVVNPSYSPEQ---------GDFSAG 551
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEWAQDFK 598
++M I+ ++ ++ AAE++ E+A ++ +L L P ++ G + EW +D+
Sbjct: 552 ASMSQQIVWDLLTSTAEAAELV-GGEEAFRSELAGTLAELDPGLRVGSWGQLQEWKEDWD 610
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
DP HRH+SHLF L PG I P+ +AAE++L RG+ G GWS WK WARL
Sbjct: 611 DPNNQHRHVSHLFALHPGRQIDPYSEPEYVEAAERSLIARGDGGTGWSKAWKINFWARLL 670
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D +HA++M+ L + H NL+ HPPFQID NFG TA VAEMLVQS
Sbjct: 671 DGDHAHKMLSELLS-----HST------LPNLWDTHPPFQIDGNFGATAGVAEMLVQSHR 719
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN--------- 769
+ +LPALP +WS+G V GL+ARG TV + W +G V + +
Sbjct: 720 GVVDVLPALP-GEWSTGSVSGLRARGDVTVDVDWANGVATRVALEAGRDGQLKVRSGLFA 778
Query: 770 ------DHDSFKTLHYR--GTSVKVNLSAGKIYTFNRQLKC 802
D ++ +T+ + G + ++ AG+ Y +++
Sbjct: 779 GRFRVVDAETGRTVDVKRDGQEITIDAKAGRTYVATTRVEV 819
>gi|160879031|ref|YP_001557999.1| hypothetical protein Cphy_0874 [Clostridium phytofermentans ISDg]
gi|160427697|gb|ABX41260.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
Length = 760
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 288/798 (36%), Positives = 414/798 (51%), Gaps = 80/798 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PAK + +A+P+GNGRLGAM++G E +++NED++W+G D NPDA K L +R
Sbjct: 8 YQDPAKDWDEALPLGNGRLGAMIYGKPEHEIIQVNEDSIWSGYAMDRNNPDAKKNLPIIR 67
Query: 77 SLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G EA A++ L G P ++ YQ G+I + S + Y+R+L+L+ A
Sbjct: 68 SLIADGNLEEAQNATLHSLSGTPDNMRCYQTAGEIHITTGHSEVT----NYKRQLNLSEA 123
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKIS--GSESGSLSFNVSLDSLLDNHSYVNGNN 191
T V Y F REH S P V V + + G +LS +S +D Y +
Sbjct: 124 TVTVSYDFEGTTFIREHLISTPADVFVMRFTSKGPRKLNLSILLSRPHFMD-RLYCENGD 182
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
I++ R GI F L +A D K+K G+ V
Sbjct: 183 SIVLTYR----------------GGIPFCNRL----------TAASCDGKIKTIGAHLVV 216
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ F I + ++ T++ S L +++L + +L H DYQ F R +
Sbjct: 217 SEATTVTLFFD--IRTAYRSENYTNDVKSHLMDVKSLQFDELKRSHKKDYQSFFKRNDLI 274
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
L+ S ++ E ++ T+ +A+R++ + D L+E F FGRYLLIS SRPGT
Sbjct: 275 LTPSAEE-------EADVATLDTAKRLERMRMGHSDLKLLEDYFHFGRYLLISCSRPGTL 327
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN ++P W +NIN EMNYW + NL E PLFD L + NG TA
Sbjct: 328 PANLQGIWNNSMTPPWGGKFTININTEMNYWFAEKLNLPELHLPLFDLLKRMHQNGKVTA 387
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ Y G+V HH TD+W + + W +GGAWLC H+WEHY YT D +FL
Sbjct: 388 EKMYGCHGFVAHHNTDLWGDCAPQDYWLPGTYWVLGGAWLCLHIWEHYEYTKDINFL-IN 446
Query: 491 AYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
+P+L FL ++L E +G L +P+ SPE+++ P+G++ + TMD I+RE+
Sbjct: 447 MFPVLSDACLFLTEFLTEDENGKLILSPTASPENKYRHPNGRIGYLCAGCTMDHQIMREL 506
Query: 551 FSAIISAAEVL--EKNED-------ALVEKVLKS----LPRLRPTKIAEDGSIMEWAQDF 597
F I A L KN AL EK+ KS L RL T++ +G+I EW +++
Sbjct: 507 FHHYIDAYHTLLDAKNSTENKEVPIALNEKLTKSVKDCLSRLPETRVHSNGTIKEWNEEY 566
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALW 654
++ E+ HRH+SHLFGLFPG+ IT E+ P L +AA+KTL++R E G GWS W W
Sbjct: 567 EELELGHRHISHLFGLFPGNQITPEQTPKLSEAAKKTLERRLEHGGGHTGWSRAWIINFW 626
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL + + AY+ VK L G NLF HPPFQID NFG + + EM+
Sbjct: 627 ARLGNGDLAYQNVKALLT-----------GSTLPNLFDNHPPFQIDGNFGSISGLCEMIF 675
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSF 774
Q N L+LLPA P D+ G KA G T + + +G+L V + S +
Sbjct: 676 QYRNNTLFLLPAFP-DEIKDVTFLGYKATYGLTADLSYTNGELKSVVLTSKEPRS----- 729
Query: 775 KTLHYRGTSVKVNLSAGK 792
L+YR VK+NL+ G+
Sbjct: 730 ILLNYRNKLVKINLTKGE 747
>gi|255035225|ref|YP_003085846.1| hypothetical protein Dfer_1435 [Dyadobacter fermentans DSM 18053]
gi|254947981|gb|ACT92681.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 790
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 273/814 (33%), Positives = 414/814 (50%), Gaps = 74/814 (9%)
Query: 4 AESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG- 61
A+ TS T PL + ++ PAK + T A+PIGNG +GAM +GG E ++ +E +LW G G
Sbjct: 24 AQPTSKTAPLSLWYDQPAKEWMTQALPIGNGHVGAMFFGGTDEERIQFSEGSLWAGGKGA 83
Query: 62 --DYT---NPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGH--------PADVY---QL 104
DY +A K L +VR L+ +G+ EA A A+ +L G P+ + Q
Sbjct: 84 NADYNFGIKKEAHKHLPEVRELLAAGKLKEAHALANKELTGAIHEKKENTPSSDFGAQQT 143
Query: 105 LGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS 164
+GD+ ++ K A + YRREL+++ A +V+Y G F R +F + P +V+V + +
Sbjct: 144 VGDLFIKMPS---KGAAQNYRRELNISDALGKVQYEAGGTRFERSYFGNYPAKVMVYRFT 200
Query: 165 GSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
S + S D R GK+ + D+ + +F +
Sbjct: 201 SSTPETYSIRFETPHAKDYE-------------RFEGKQYTFGGHLKDNHQ--EFETVYR 245
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
I D +A D L V G+ VL+ ++ + F P D + + +
Sbjct: 246 I----DTDGKTAFSDGVLTVTGARSIVLIHTVATDYVMKF--PDYKGNDYKKANAATMAG 299
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
+ +Y+ L DY LF RV++ L + + +P+ +R K++
Sbjct: 300 VAGKNYASLVAAQQKDYHSLFDRVALTLGNA------------DAPAIPTDQRQKAYSAG 347
Query: 345 E-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
+ D L EL FQ+GRYL+ISS+RPGT +LQG WN+ +P W + H NIN++M YW +
Sbjct: 348 QADGRLEELYFQYGRYLMISSTRPGTMPMSLQGKWNDSTNPPWANDYHTNINIQMLYWPA 407
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
NLSEC PL DF + G A+ + A GW+++ + + +S W +
Sbjct: 408 EVTNLSECHVPLMDFTQSIVAPGRLAAKEFFNAKGWIVNTMLNAYGYTSPGW-DFPWGFF 466
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
P G AWL HLWEHY +T D+ FL+ AYP+++ + F +D+L + G L ++PS SPE
Sbjct: 467 PGGAAWLSQHLWEHYAFTNDKAFLKNTAYPIMKEASEFWMDYLTDDGRGRLVSSPSYSPE 526
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
H +S +TMD + +V + AA +L ++D +K + ++ P +
Sbjct: 527 H---------GGISTGATMDHEMAWDVLNNTAEAAAILGVDQD-FAQKARSTRDKILPLQ 576
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP 643
I + EW +D D HHRH+SHLF L PG I+ + P +AA +L RG++G
Sbjct: 577 IGRWKQLQEWREDVDDSTNHHRHVSHLFALHPGKQISNAQTPAEAEAARVSLNARGDDGT 636
Query: 644 GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYSNLFAAHPPFQIDAN 702
GWS+ WK WARL D A+++ K + V + + GG Y+NL AHPPFQ+D N
Sbjct: 637 GWSLAWKVNFWARLQDGNRAHKLFKSVLRPVASQGTNMADGGGSYANLLCAHPPFQLDGN 696
Query: 703 FGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
G TA VAEML+QS + LLPALP D W +G VKGLKARG TV W++G L V +
Sbjct: 697 MGSTAGVAEMLLQSQTGVIELLPALP-DAWPTGSVKGLKARGNVTVDEVWENGKLKTVTL 755
Query: 763 YSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTF 796
S + + L Y ++ L+AGK T+
Sbjct: 756 TSATAQK-----RVLKYGSKTIDAALAAGKAKTW 784
>gi|383778158|ref|YP_005462724.1| hypothetical protein AMIS_29880 [Actinoplanes missouriensis 431]
gi|381371390|dbj|BAL88208.1| hypothetical protein AMIS_29880 [Actinoplanes missouriensis 431]
Length = 746
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 278/788 (35%), Positives = 408/788 (51%), Gaps = 67/788 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY-TNPDAPKAL 72
++ ++ PA + +A+PIGNGRLG MV GGV +E ++L+E T W+G P D+ NP A +++
Sbjct: 3 RLLYDRPASRWFEALPIGNGRLGGMVHGGVGTEIIRLSESTAWSGAPSDHDVNPAAAQSI 62
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R L+ G++AEA A+ L G P L L D + L A+ YRRELDL+
Sbjct: 63 PVIRRLLFEGEHAEAQRLAAEHLTGRPTSFGTNLPLPRLRLDFA-LDQAD-GYRRELDLD 120
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
T A V++ F RE F+S+P VI ++S S + ++SF +LD + ++ G +
Sbjct: 121 TGLASVEFDQNQTHFVRETFASHPHGVIAMRLSASRAAAISFTAALDDTVLPGTFTGGAD 180
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ GR + +D +G+ + ++ D GT+ A +D + V G+D
Sbjct: 181 GLAFRGRAV------ETLHSDGEQGVDVE--IRVRFVIDGGTLLAADDT-VTVTGADVVD 231
Query: 252 LLLVASSSFDGP-FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ + S+SF P + P+ Y + H++D+Q+L RVS+
Sbjct: 232 VFVTVSTSFCAPSLVEPA--------------------PYEVMRAAHVEDHQRLMRRVSL 271
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
L +P D+ TD ER+ + D+D L+ L FQ+GRYL I+ SR +
Sbjct: 272 DLG-TPIDLPTDV----------RRERLARGERDDD--LIALYFQYGRYLTIAGSRADSP 318
Query: 371 VA-NLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
+ LQG+WN+ + + W + H++IN + NYW + NL+EC PLF FLT L+ +G
Sbjct: 319 LPLALQGVWNDGFASSMGWSNDFHLDINTQQNYWAAESTNLAECHTPLFRFLTGLASSGR 378
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TAQ Y A GWV H T+ W S+ RG + W L GGAWL LWEHY Y D FL
Sbjct: 379 STAQQMYGADGWVAHTVTNAWGYSAPGRG-IGWGLNVTGGAWLALQLWEHYEYRPDVRFL 437
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+AYP+L CA FLLD+L E G+L PS SPE+ ++A DG ++ +T D
Sbjct: 438 RDQAYPVLRSCALFLLDYLTPEPSHGWLVAGPSESPENSYLAADGTPCSIAMGTTADRVF 497
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+ AA +L+ + + L +V + RL P +I G + EW D + + HRH
Sbjct: 498 AEAILRICGQAAAILDVDPE-LRSRVAAARDRLSPFRIGRHGQLQEWLDDVDEADPAHRH 556
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT-WK----TALWARLHDQE 661
SHL +FP IT P L AA TL++R + PGW T W A ARL D +
Sbjct: 557 TSHLCAVFPERQITPRGTPSLAAAAAVTLERR-QAAPGWEQTEWAEANFAAFHARLLDGD 615
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
+A V RL + + G + A + D N G T A+AEML+QS ++
Sbjct: 616 NALEHVTRLIADASEANLLSYSAGGIAG--AQQNIYSFDGNAGGTGAIAEMLLQSDGEEI 673
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRG 781
LLPALP W G V+GL+ARGG TV I W DG LHE +Y+ D + L YR
Sbjct: 674 ELLPALP-STWRDGAVRGLRARGGFTVDISWSDGRLHEARVYA-----DRPTRTRLRYRD 727
Query: 782 TSVKVNLS 789
T ++V ++
Sbjct: 728 TVIEVTVT 735
>gi|374311601|ref|YP_005058031.1| alpha-L-fucosidase [Granulicella mallensis MP5ACTX8]
gi|358753611|gb|AEU37001.1| Alpha-L-fucosidase [Granulicella mallensis MP5ACTX8]
Length = 790
Score = 435 bits (1119), Expect = e-119, Method: Compositional matrix adjust.
Identities = 281/809 (34%), Positives = 433/809 (53%), Gaps = 61/809 (7%)
Query: 1 MMNAESTST-TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+M+AE S+ ++ ++ ++ PA + +A+PIGNGR+G M++GG E+ L E T W+G
Sbjct: 14 LMHAEGQSSPSHKTELWYSRPATRWMEAVPIGNGRIGGMIYGGTSIESFALTESTTWSGA 73
Query: 60 PGDY-TNPDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEF-DD 114
P D P A L +R L+ +G+YAE + L G+P + + +EL F +D
Sbjct: 74 PNDKNVKPTALANLGKIRELMFAGKYAEGGELCKEHLLGNPGSFGTHLPMATLELAFPED 133
Query: 115 SHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
H + YRR L+L+ A V YS G + F RE F+SNPD ++ IS ++ S+S +
Sbjct: 134 EH----PQNYRRSLNLDEGIAYVDYSRGGLSFHREVFASNPDNALIAHISCNQPKSVSCS 189
Query: 175 VSLDSL-LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
+S L L GN+ ++++G + ++ +G+ F +++S G
Sbjct: 190 ISFPKLTLPGEVTTEGNDTLVLKGNAF------EHLHSNGKQGVAFET--RVRVSAKGGE 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
++A E L ++G+D L +V +++F G + ++ ++ LQ +R +++ L
Sbjct: 242 VTAHEGA-LHLKGADAVTLHVVIATNFRG---------ANASTRNVQTLQVLRPKTFAQL 291
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVEL 352
H+ D+Q LF RV+I D+ T++ +E P+ ER K+ + +DP L L
Sbjct: 292 RAAHVADHQSLFRRVAI-------DLGTNSSAESK----PTDERRKAVEAGADDPGLASL 340
Query: 353 LFQFGRYLLISSSRPGTQVA-NLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLS 409
FQ+GRYL I+ SR + + LQGIWN+ L+ + W H++IN E NYW + CNLS
Sbjct: 341 FFQYGRYLTIAGSRVNSPLPLALQGIWNDGLASSMGWTDDFHLDINTEQNYWAAEVCNLS 400
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
ECQ PLFDF+ LSI G TA+ Y A GWV H T+ W ++A G + W ++ GG W
Sbjct: 401 ECQSPLFDFVEGLSIAGRSTARDMYGAPGWVAHVVTNPWGFTAAGWG-LGWGIFSTGGVW 459
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIA 528
L LWEHY +T D+ FL++R YP+ +G A F L ++++ G+L T PS SPE+ FIA
Sbjct: 460 LALQLWEHYRFTGDKQFLQQRLYPVYKGAAEFFLAYMVKHPQHGWLVTGPSVSPENWFIA 519
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
PDGK S T+D + + S I A+ L +E+ K ++L +L P +I + G
Sbjct: 520 PDGKQCSESMGPTVDRVFVHSLLSGCIEASTTLGIDEE-FRAKATEALKQLPPFQIGKHG 578
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPG 644
+ EW +DF + HRH+SHL GL+P H I+ P L AA T+++R E
Sbjct: 579 QLQEWLEDFDEAVPGHRHMSHLMGLYPEHQISPAATPALATAARITIERRISQTNWEDSE 638
Query: 645 WSITWKTALWARLHDQEHAYR-MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANF 703
W+ +ARL D E A++ V L + + + GG+ A F +D N
Sbjct: 639 WTRANLVNFYARLLDGESAHKHFVGLLSSAAEDSLLAYSRGGVAG---AESNIFSLDGNT 695
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
A VAEML+QS ++++LLPALP W G +KGL ARGG VS+ W DG L +
Sbjct: 696 AGAAGVAEMLLQSQADEIHLLPALP-SAWPQGSIKGLCARGGIEVSMAWTDGKLISASLK 754
Query: 764 SNYSNNDHDSFKTLHYRGTSVKVNLSAGK 792
S ++ Y + VKV L G+
Sbjct: 755 SKRGGT-----HSVRYGASVVKVALPIGR 778
>gi|388259769|ref|ZP_10136938.1| alpha-L-fucosidase, putative, afc95A [Cellvibrio sp. BR]
gi|387936495|gb|EIK43057.1| alpha-L-fucosidase, putative, afc95A [Cellvibrio sp. BR]
Length = 806
Score = 434 bits (1116), Expect = e-118, Method: Compositional matrix adjust.
Identities = 278/783 (35%), Positives = 399/783 (50%), Gaps = 72/783 (9%)
Query: 4 AESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG- 61
A S I F+ PA + + +PIGNG LGA++ G V + ++ NE TLWTG PG
Sbjct: 28 ASSVQAAGGESIWFDAPAADWEREGLPIGNGALGAVIAGDVTRDRIQFNEKTLWTGGPGA 87
Query: 62 ---DYTNPDAPK--ALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGD--IELE 111
D+ P + A++ VR+ ++ Q + + KL GH Y Q GD I+
Sbjct: 88 QGYDFGWPQQAQGDAVAQVRTTINE-QGSITPEDAAKLLGHKITAYGDYQTFGDLIIDSN 146
Query: 112 FDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
+DS +K YRREL L+ A V Y G V + RE+ +S PD VI K S + S+
Sbjct: 147 KNDSDVKSVFTNYRRELSLSDAQINVSYEQGGVRYRREYLASYPDGVIAIKYSADQPASI 206
Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
SF S+ + DN S I +GR A+ G+QF +I++ +
Sbjct: 207 SFTASVQ-VPDNRSLAVA----IDQGRI-------TASGKLHSNGLQFET--QIQLLNQG 252
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
G ++ ++ KL+V +D V+LL A + + + P P L S+
Sbjct: 253 GELAVIDGNKLQVTAADSVVILLAAGTDYAQSY--PKYRGAHPHKRLHKQLNKASKKSFE 310
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE 351
L H DYQ LF+RV++ + + P+ + T K D +L
Sbjct: 311 QLQATHRADYQTLFNRVALDIGQKPQSLTTPKL----------LAGYKKGDAVLDRTLEA 360
Query: 352 LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
FQFGRYLLISSSRPG+ ANLQG+WN ++P W++ HVNINL+MNYW + NL E
Sbjct: 361 TYFQFGRYLLISSSRPGSLPANLQGVWNNSITPPWNADYHVNINLQMNYWLAETTNLPEL 420
Query: 412 QEPLFDFLTYLSINGSKTAQ-VNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-PMGG 467
PLFDF+ L + G+ AQ V + GW + T+IW + G + W A W P
Sbjct: 421 TAPLFDFVDSLVVPGTIAAQKVAGVDKGWTLFLNTNIWGFT----GVIDWPTAFWQPEAA 476
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF 526
AWL H +EHY ++ D+ FL RAYPL++ + F L++L++ DG +PS SPEH
Sbjct: 477 AWLAQHYYEHYLFSGDKKFLRNRAYPLMKSASEFWLEFLVKDPRDGQWIVSPSFSPEH-- 534
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIA 585
P + A +S D+ +R A L + + V + L L R +I
Sbjct: 535 -GPFTRAAAMSQQIVFDL--LRNTHEA------ALLTGDKKFAQAVQEKLANLDRGMRIG 585
Query: 586 EDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGW 645
+ G + EW +D DP+ HRH+SHL+ L PG I P+L AA TL RG+ G GW
Sbjct: 586 KWGQLQEWKEDIDDPKNEHRHISHLYALHPGRDINPRNTPELLAAARTTLNARGDGGTGW 645
Query: 646 SITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
S WK +WARL D A++++ + + SNL+ HPPFQID NFG
Sbjct: 646 SQAWKVNMWARLLDGNRAHKVLG-----------EQLQRSTLSNLWDNHPPFQIDGNFGA 694
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
+A +AEML+QS ++L+ LPALP W SG V GL+ARGG TV + W G+L + I++
Sbjct: 695 SAGIAEMLLQSHGDELHFLPALP-ASWPSGSVTGLRARGGITVDLQWHKGELTQARIHTQ 753
Query: 766 YSN 768
++
Sbjct: 754 HAQ 756
>gi|409099481|ref|ZP_11219505.1| alpha-L-fucosidase [Pedobacter agri PB92]
Length = 937
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 254/703 (36%), Positives = 373/703 (53%), Gaps = 62/703 (8%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
YQ GD+ L F L Y+R LDL TA AR Y++ V +TRE+F+S P+Q IV
Sbjct: 293 YQPFGDLNLAFQHKGLI---TKYKRSLDLTTAIARTNYTIAGVNYTREYFASQPNQSIVI 349
Query: 162 KISGSESGSLSFNVSLDSLLDNHSY-VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFS 220
+S + S+S +L SL G N I + + + ++ + +
Sbjct: 350 HLSADKKASISLTAALSSLHQQSGIKALGKNTISLSVQVKDGALKGES---------RLT 400
Query: 221 AILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
A+++ G + L +K + + +D L L A ++F IN D DP + ++
Sbjct: 401 AVIK------NGAVKVLNNK-ISISKADEVTLYLTAGTNF----INAQDVSGDPAAANIK 449
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
AL ++ + + +++ RH+ +YQ +++ + +S K+ +P+ ER+
Sbjct: 450 ALNTVTDKTSAEIKNRHIKEYQSYYNKFHVDFGQSGKE------------NLPTNERLNK 497
Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
F T DP L Q+GRYLLISSSRPGTQ ANLQGIWN+ L+P W S NIN+EMNY
Sbjct: 498 FATSNDPGFAALYMQYGRYLLISSSRPGTQPANLQGIWNDLLTPPWGSKYTTNINMEMNY 557
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + NLS EPLF+ + L+ G++TA+ Y GWV+HH TD+W +A
Sbjct: 558 WPAEVLNLSALNEPLFNKINGLAKTGTETAKEYYNTPGWVLHHNTDLW-NGTAPINASNH 616
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPS 519
+W G AWL HLWEHY +T D+ FL AYPL++ A F +LI+ G+L + PS
Sbjct: 617 GIWVTGAAWLSQHLWEHYAFTGDQTFLRNEAYPLMKQAALFFDAFLIKDPKTGWLISTPS 676
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPR 578
SPE +G L TMD IIR +F I+A E+L N DA +L++ + +
Sbjct: 677 NSPE------NGGLVA---GPTMDHQIIRSLFKNCIAATEIL--NVDADFRTILQAKMKQ 725
Query: 579 LRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
+ P +I + G + EW +D D HRH+SHL+G++PG IT + +P + AA+++L R
Sbjct: 726 IAPNQIGKYGQLQEWREDKDDTTNKHRHVSHLWGVYPGDDITWKSDPKMMDAAKQSLLYR 785
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G+E GWS+ WK WAR D +HA +++K L + G Y NLF AHPPFQ
Sbjct: 786 GDEATGWSLAWKINFWARFKDGDHAMKLIKMLMKPANS------GAGSYVNLFDAHPPFQ 839
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG A +AE+++QS + +LPALP + +G V GL ARGG V + W G L
Sbjct: 840 IDGNFGGAAGIAELILQSHQGYIDILPALP-TEIPNGNVSGLMARGGFEVGLIWGGGKLK 898
Query: 759 EVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLK 801
+ + S + Y ++ N AG Y N +LK
Sbjct: 899 SILLKSLRGEKCK-----MKYLDKEIEFNTEAGGSYKLNGELK 936
Score = 81.3 bits (199), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 57/82 (69%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ +N PA+ +TDA+PIGNGRLGAMV+ GV ++ ++ NE+TLWTG P +Y A K L+
Sbjct: 29 QLWYNQPAEKWTDALPIGNGRLGAMVFAGVENDHIQFNEETLWTGKPRNYNRKGAYKYLA 88
Query: 74 DVRSLVDSGQYAEATAASVKLF 95
++R L+ G+ EA + K F
Sbjct: 89 EIRKLLFEGKQKEAEVLAQKEF 110
>gi|336428235|ref|ZP_08608219.1| hypothetical protein HMPREF0994_04225 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336006471|gb|EGN36505.1| hypothetical protein HMPREF0994_04225 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 721
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 282/764 (36%), Positives = 402/764 (52%), Gaps = 78/764 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + + A+ + +++PIGNG LGAM+ GG E L LNE+++W+G D N A L
Sbjct: 4 MMLWYEKSAERWEESLPIGNGSLGAMILGGAEEEILGLNEESVWSGYYKDKNNAKAADCL 63
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVYQLLGDIELEFDDSHLKYAE-ETYRRELDL 130
+VRSLV SG+ EA + G + Y LG+++L+F K + E YRR+LDL
Sbjct: 64 EEVRSLVFSGKNKEAERLIQNNMLGEYNESYLPLGNLKLKFAYGIGKEGKAEGYRRQLDL 123
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS-LLDNHSYVNG 189
A A+V Y+ V + RE+F+S P + I ++ ++ + F VS S L S +G
Sbjct: 124 ENAVAQVSYTCNEVHYQREYFASYPAKAIFVLLT-ADKPVMDFTVSFISQLCLAVSAEDG 182
Query: 190 NNQIIMEGRCPGKRIPP-----KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
Q+ GRCP P + + KG+Q +A E ++ G + E++ L V
Sbjct: 183 ALQVT--GRCPEHVDPSYLPEREGSVVQGTKGMQVNA--EFRVVSCDGQVRE-EEEMLHV 237
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
G+ +L+L A P + P N+ Y L H+ DY+ +
Sbjct: 238 SGASRCLLMLSAMR----PPVLPD------------------NMDYEALKAAHIQDYRSI 275
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ +V + L KD+ T EE ++ + E ED L L FQ+GRYLLI+S
Sbjct: 276 YDKVELYLGEQ-KDLPT----EERLELLKKGE--------EDNGLYGLFFQYGRYLLIAS 322
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SR G+ ANLQGIW+ +L W S +NIN +MNYW +L CNL EC EP F+ +S
Sbjct: 323 SREGSLPANLQGIWSWELRAPWSSNWTININTQMNYWHALSCNLEECLEPYIRFVERVSE 382
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSS----------ADRGKVVWALWPMGGAWLCTHL 474
G KTA VNY G V HH D W +S + G V WA WPMGGAWL +
Sbjct: 383 EGKKTAAVNYRCRGSVAHHNVDYWGNTSPVGVPQGEKAGEDGCVNWAFWPMGGAWLTQEI 442
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
+ Y Y+ D ++L+ A P++ A FL DWL+E + G T PSTSPE++F PDG++
Sbjct: 443 FRAYEYSGDEEYLKNTAAPIIREAALFLNDWLVE-YQGEWVTCPSTSPENQFRLPDGQIT 501
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
++Y+S MDMAI++EVF+ E+L +D L ++ + +P L P + G ++EW
Sbjct: 502 GLTYASAMDMAIVKEVFTHYCRICEIL-GAQDELYREICEKMPCLAPFRTGSFGQLLEWH 560
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKT 651
+++++PE HRH SHL+GLFP + L +A +L R E G GWS W
Sbjct: 561 EEYEEPEPGHRHASHLYGLFPAEVFA--GDAKLTEACRVSLMHRLENGGGHTGWSCAWII 618
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
L+A L D E AY ++ L Y NL+ AHPPFQID NFG TA +A
Sbjct: 619 NLFAVLKDGEKAYEYLRTLLTR-----------STYPNLWDAHPPFQIDGNFGGTAGIAN 667
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
MLVQ + LLPALP ++ G VKGL +G + V I WKDG
Sbjct: 668 MLVQDRGGSVTLLPALP-AQFKEGYVKGLCIKGRKCVDISWKDG 710
>gi|256394373|ref|YP_003115937.1| alpha-L-fucosidase [Catenulispora acidiphila DSM 44928]
gi|256360599|gb|ACU74096.1| alpha-L-fucosidase, putative, afc95A [Catenulispora acidiphila DSM
44928]
Length = 742
Score = 434 bits (1115), Expect = e-118, Method: Compositional matrix adjust.
Identities = 278/806 (34%), Positives = 413/806 (51%), Gaps = 91/806 (11%)
Query: 17 FNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNPDA 68
++ PA + +A+PIGNGR+GAMV+GGV +E ++ E+TLWTG PG D+ P
Sbjct: 7 YDAPASDWEREALPIGNGRIGAMVFGGVAAERVQFTEETLWTGGPGHPGYDHGDWREP-R 65
Query: 69 PKALSDVRSLVDSGQYAEATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEETYR 125
P AL +VR +D + T +L G P +Q GD+ +EF L + YR
Sbjct: 66 PGALEEVRRRIDE-HGSLPTQTVTELLGQPKTGFGAFQNYGDLIIEF--PGLSEEAQDYR 122
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLD 182
R LD++ A A V + V TRE+F S+P V++ +++ + G+L + + D
Sbjct: 123 RTLDISDALAGVAFEADGVHHTREYFVSHPAGVLLGRLTADQPGALHCVLRYEPGTDATD 182
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ +++ G P G++ +A IK+ + G + ED+ L
Sbjct: 183 ATRVTTEDATLVIIGALPDN-------------GLRHAA--RIKVIPEGGRLIEGEDR-L 226
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+EG+D V++L A++ + + + DP A+ +Y DL H+ D+
Sbjct: 227 TIEGADRVVIILAAATDYADTYPAYRNGI-DPAGPVAEAVAKAAASTYDDLRAAHIADHS 285
Query: 303 KLFHRVSIQLSRS-PKDIVTDTC-SEENID-TVPSAERVKSFQTDEDPSLVELLFQFGRY 359
LF RV + L S P D+ TD + D + P+A+R +L +L F GRY
Sbjct: 286 ALFDRVVLDLGGSLPGDVPTDRLLTAYGTDASTPAADR----------ALEQLFFDHGRY 335
Query: 360 LLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
LLI+SSRP +Q+ ANLQG+WN +P W HVNINL+MNYW + PC L EC EPLF +
Sbjct: 336 LLIASSRPASQLPANLQGVWNASPTPPWAGDYHVNINLQMNYWLAEPCALGECAEPLFAY 395
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEH 477
+ L G +A+ + GWV+H++T + + D W +P AWLC HLWEH
Sbjct: 396 IEALRAPGRVSARTLFGTEGWVVHNETTPFGFTGVHDWPDAFW--FPEAAAWLCRHLWEH 453
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLAC 535
Y +T+D +FL++RAYP+++ A F L L + DG L NPS SPE E+ A
Sbjct: 454 YAFTLDEEFLKERAYPVMKEAAQFWLANLRRDPRDGKLVANPSFSPEQGEYTA------- 506
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
S M IIR++F + A +E + L +I G + EW +
Sbjct: 507 ---GSAMAQQIIRDLFKNTVGLAAEVEDLDTGL--------------RIGSWGQLQEWKE 549
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D DP+ HRH+S L+ L PG I ++ DL AA L RG+ G GWS WK WA
Sbjct: 550 DLDDPQNQHRHVSQLYALHPGSDIDPLRDEDLAAAARTILNARGDGGTGWSKAWKINFWA 609
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D +HA+R++ + G NLF HPPFQID NFG TA +AEMLVQ
Sbjct: 610 RLWDGDHAHRLLA-----------EQLTGSTLPNLFDTHPPFQIDGNFGATAGIAEMLVQ 658
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFK 775
S L ++ +LP+LP W +G V GL+ARG V + W +G + E+ + + + + D
Sbjct: 659 SHLGEIRILPSLP-AAWPTGSVTGLRARGAVRVDVAWAEGKVTEISVTPD-RDGELDLRS 716
Query: 776 TLHYRGTSVKVNLSAGKIYTFNRQLK 801
L ++ + AG+ Y + ++K
Sbjct: 717 PLFGTAARMRFSAEAGRTYVWKEEIK 742
>gi|296130834|ref|YP_003638084.1| alpha-L-fucosidase [Cellulomonas flavigena DSM 20109]
gi|296022649|gb|ADG75885.1| Alpha-L-fucosidase [Cellulomonas flavigena DSM 20109]
Length = 809
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 269/782 (34%), Positives = 399/782 (51%), Gaps = 51/782 (6%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M++ + +T + L + + PA+ +TDA P+GNGRLGAMV GG +E L++N+DT W+G P
Sbjct: 1 MIDDGAVTTASGLVLRLDEPARWWTDAFPVGNGRLGAMVHGGTGAERLQVNDDTCWSGAP 60
Query: 61 GDYT-------NPD-APKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF 112
D T PD AP + R L+ G A KL YQ L D+ +E
Sbjct: 61 HDGTVEPVGPLGPDGAPGVVRRARHLLAEGDPLAAQDELAKLQSGWVQAYQPLVDVLVEQ 120
Query: 113 DDSHLKYAEETYRRELDLNTATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
+ + YRR LDL + S + +E S+PD ++ + +G+ G
Sbjct: 121 PGA---AGRDDYRRVLDLARGVVTTTWRSAAGEPWRQEVLVSHPDGALLLERAGA-PGET 176
Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA----ILEIKI 227
++ + G+ ++ P +P + D P +Q+
Sbjct: 177 RVRLASPHPWASTPAAAGDGILVATLDMPSHVLP---DWVDGPDPVQYGGRSVHAAVALA 233
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
A+ D +++V G+ ++L +++ D + D + AL +R
Sbjct: 234 VLADDAPVAVVDGEVRVTGARRVRVVLTSATDHD---VATGTLHGDRERVAADALAGLRG 290
Query: 288 L--SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE 345
+ RH+ D+ L RVS+ L +P D+ D A + +
Sbjct: 291 ALADVDGIPARHVADHAALLGRVSLDLVAAPPDLPLD------------ARLARHAAGEP 338
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
D L L FQ GRYL ++ SRPGT NLQGIWNE + P W S +NIN EMNYW +L
Sbjct: 339 DAHLAVLAFQLGRYLTVAGSRPGTLPLNLQGIWNERVRPPWSSNYTININTEMNYWPALV 398
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA-KSSADRG--KVVWAL 462
+L+EC EPL +L L+ G +TA+ Y A GWV HH +D W RG W+
Sbjct: 399 GDLAECHEPLLSWLDRLAAAGRQTARTLYGARGWVAHHNSDPWCFTGPTGRGHDSASWSA 458
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSP 522
WP+GGAWL H+ +H+++T D D L +R +P++ A +LD L+E DG L T+P TSP
Sbjct: 459 WPLGGAWLARHVVDHHDWTGDDDAL-RRHWPVVRDAARAVLDLLVELPDGTLGTSPGTSP 517
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
E+ ++ PDG+ A V+ S+T D+AI+R++ + A V+ ++ L V +L RL
Sbjct: 518 ENHYLLPDGRPAAVAVSTTADLAIVRDLLEQVRRLAPVVRDRDEDLRAAVDGALERLPTE 577
Query: 583 KIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG 642
++A DG + EW +D D E HRH SHL+ +FPG +I + P+L AA +TL RG E
Sbjct: 578 RVAPDGRLAEWHEDVPDAEPEHRHQSHLYRVFPGTSIDPDTTPELAAAARRTLDARGPES 637
Query: 643 PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHF--EGGLYSNLFAAHPPFQID 700
GWS+ W+ AL ARL D E +V + V E + GG+Y +L AHPPFQ+D
Sbjct: 638 TGWSLAWRLALRARLRDPEGVAALVSAFLHPVPGEEPASWPAPGGVYRSLLCAHPPFQVD 697
Query: 701 ANFGFTAAVAEMLVQS------TLNDLYLLPALPWDKWSSGCVKGLKARGG-ETVSICWK 753
N GFTA V E LVQ+ + +++LLPALP W G V+GL+ RGG + V + W
Sbjct: 698 GNLGFTAGVVEALVQAHHRGPDGVREVHLLPALP-ASWPEGRVQGLRLRGGVDLVDLRWA 756
Query: 754 DG 755
+G
Sbjct: 757 EG 758
>gi|225011898|ref|ZP_03702336.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
gi|225004401|gb|EEG42373.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
Length = 792
Score = 433 bits (1113), Expect = e-118, Method: Compositional matrix adjust.
Identities = 275/757 (36%), Positives = 398/757 (52%), Gaps = 58/757 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDAP-KALS 73
+ A + +A+P+GNGRLG MV+G E ++LN+D+LW P D + NP+ + L
Sbjct: 40 YEQAASEWEEALPLGNGRLGVMVFGNPTKEHIQLNDDSLW---PKDIEWGNPEGTFEDLK 96
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+R+L+ G + ++ F V +Q LGD+ + D + Y+R L+LN
Sbjct: 97 QIRNLLIDGDIEKTDHLLIEKFSRKTVVRSHQTLGDLHIRLDHDSIS----DYKRSLNLN 152
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSE----SGSLSFNVSLDSLLDNHSYV 187
ATA V Y F S+P Q IV I +GS+ + +D S +
Sbjct: 153 KATAYVNYKTEGYPVKESVFVSHPHQAIVVIIESEHPKGINGSIQLSRPMDEGFPTVSVL 212
Query: 188 NGNN-QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+ NN +IIM G + + +G+ F IL K S + G+I++ E+K L+++G
Sbjct: 213 SRNNSEIIMTGEVTQRGGKFDSKTLPILEGVSFETIL--KTSHEGGSIASNENK-LELKG 269
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
AVL +V++SSF ++ TS++ I S SD+ +H+ D+Q +
Sbjct: 270 VRKAVLYIVSNSSF---------YHENYTSQNQKNFAVIEKTSLSDIEEQHIRDHQNYYE 320
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSS 365
R+ +I T S+ +P+ +R+++ + + D L ELLF FGRYLLI+SS
Sbjct: 321 RIDF-------NIETKNISQ----LIPTDKRIEAVKKGNVDLELQELLFHFGRYLLIASS 369
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R GT ANLQG+WN+ +S W++ H+NINL+MNYW + L E PLFD++ L IN
Sbjct: 370 REGTLPANLQGLWNQHISAPWNADYHLNINLQMNYWLANVTQLDELNNPLFDYVDRLLIN 429
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G KTAQ N+ A G + H TDIWA + W G W+ H W H+ YT D +
Sbjct: 430 GKKTAQENFGARGSFLPHATDIWAPTWLRAPTAYWGASFGAGGWMVQHYWNHFEYTQDYN 489
Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
FL RA+P +E A F DWLIE DG L + PSTSPE+ +I G S MD
Sbjct: 490 FLRNRAFPAIEEVAKFYSDWLIEDPRDGSLISAPSTSPENRYINDQGVAVSSCLGSAMDQ 549
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI-AEDGSIMEWAQDFKDPEVH 603
+I+EVF+ + A +L + + ++K+ K L +LRP + DG I+EW +++K+ E
Sbjct: 550 QVIKEVFTNYLKAVRLLNIDNE-WIQKIEKQLKQLRPGFVLGSDGRILEWDREYKELEPG 608
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQ 660
HRH+SHL+G PG+ I+ P L A KTL R G G GWS W ARL D
Sbjct: 609 HRHMSHLYGFHPGNQISSLTTPKLFDAVRKTLDFRLANGGAGTGWSRAWLINCAARLLDG 668
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
+ A ++ + FE ++SNLF AHPPFQID NFG+TA VAE+L+QS +
Sbjct: 669 DMAQEHIQLM-----------FEKSIFSNLFDAHPPFQIDGNFGYTAGVAELLLQSYEEN 717
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L W G V GLKAR VS+ W +G L
Sbjct: 718 TLRLLPALPPLWKKGNVNGLKARNNILVSMQWDEGKL 754
>gi|425767412|gb|EKV05986.1| hypothetical protein PDIG_81830 [Penicillium digitatum PHI26]
gi|425779681|gb|EKV17720.1| hypothetical protein PDIP_30190 [Penicillium digitatum Pd1]
Length = 740
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 283/781 (36%), Positives = 403/781 (51%), Gaps = 66/781 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA+ + A+P+GNGRLGAMV+G +E L+LNED++W G P D DA + L +R
Sbjct: 6 YQQPAEDWNSALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQDRNPQDALEYLPRLR 65
Query: 77 SLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+ + +AEA A + F +P Y+ LG++ L D H YRR LDL A
Sbjct: 66 EAIRAENHAEAEKIAKLAFFANPISQRNYEPLGNLFL--DLGHNPSQVTGYRRSLDLARA 123
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG-NNQ 192
TA V+Y + F RE +SNPD V+ ++ S F V L + D N +
Sbjct: 124 TAHVRYEYQGICFEREVLASNPDDVLAIRLHSSSKAE--FVVRLTRMSDVEFETNEWLDD 181
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
I G + P + + ++ ++ GTI+ + K L V +D +L
Sbjct: 182 ISASGNSITMHVTPGGKNSS-----RVCCVVSVRCDGADGTITKI-GKNLVVNSTD-TLL 234
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
++ A ++F +D + + LS DL TRH DYQ L+ R+ +QL
Sbjct: 235 VIAAQTTF---------RHEDIDQRTKQDAEIALGLSLKDLRTRHTADYQSLYDRMELQL 285
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV- 371
+I TD +R+KS DP L+ L + RYLLIS SR G +
Sbjct: 286 GPGSPEIPTD-------------QRLKS---SRDPGLIALYHNYSRYLLISCSRDGHKSL 329
Query: 372 -ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN P W S NINL+MNYW + CNLSEC+ PLFD L + G TA
Sbjct: 330 PANLQGIWNPSFHPAWGSRFTTNINLQMNYWSANVCNLSECEFPLFDLLERMVEPGKTTA 389
Query: 431 QVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
Q+ Y GW H TDIWA ++ + ++WP+GGAWLC H+W+H+ YT D FL +R
Sbjct: 390 QIMYGCRGWTAHSNTDIWADTAPVDRWMPASIWPLGGAWLCYHIWDHFQYTCDEVFL-RR 448
Query: 491 AYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
+P L GC FLLD+LI +G YL T+PS SPE+ F G+ + ST+D+ II
Sbjct: 449 MFPTLRGCVEFLLDFLIVDANGAYLITSPSASPENSFYDHKGQKGVLCEGSTIDIQIIDA 508
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
+ A S + L+ +DAL+ V + RL P KI+ G + EWA D+ + E HRH SH
Sbjct: 509 ILGAFQSCTKKLDL-QDALLPAVYATKSRLPPLKISPAGYLQEWAIDYAEVEPGHRHTSH 567
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRM 666
L+ L PG+ IT K P L A + L++R E G GWS W L ARL + E +
Sbjct: 568 LWALHPGNAITPAKTPQLAGACGEVLRRRAEHGGGHTGWSRAWLLNLHARLLEAEECSKH 627
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS-TLNDLYLLP 725
+ L + SNL +HPPFQID NFG A + EMLVQS + +LP
Sbjct: 628 LDSLLSR-----------STLSNLLDSHPPFQIDGNFGGGAGIIEMLVQSHEPGVIRILP 676
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVK 785
A P D W +G ++G++ARGG + +++G + VG + +S + +H+ + V+
Sbjct: 677 ACPRD-W-TGSIRGVRARGGFELEFDFENGRV--VGGVTIFSERGETT--VVHFNESHVE 730
Query: 786 V 786
+
Sbjct: 731 I 731
>gi|406858935|gb|EKD12015.1| alpha-l-fucosidase [Marssonina brunnea f. sp. 'multigermtubi'
MB_m1]
Length = 835
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 290/801 (36%), Positives = 412/801 (51%), Gaps = 93/801 (11%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S + PL++ + PA F+D+ IGNGR+GA + G E L LNED+LW+G P D NPD
Sbjct: 33 SASVPLRLWDSAPAGGFSDSYLIGNGRIGAALSGSAQKEYLGLNEDSLWSGGPIDRVNPD 92
Query: 68 APKALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETY 124
A + +++S V G++ E T AS G+P Y LG+++L + Y
Sbjct: 93 ASAYMGNIQSSVSKGRFQEGQTTASFAYVGNPVSARHYDYLGELQLVMNHGT---KVTGY 149
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV------SLD 178
R LDL +TA ++YSV V F RE+ +SNP V+ KIS ++G++ FN+ +L+
Sbjct: 150 ERWLDLQDSTAGLQYSVDGVTFQREYLASNPAGVMAIKISADKAGAVDFNILLRRGGTLN 209
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+D +S GN+ I+M G G K + F+A + S R + +
Sbjct: 210 RWVD-YSVKVGNDTIVMGGGSGGV------------KPVVFAAGASVVASGGR--VYTIG 254
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D +KVEG+D A + A + F K+DP + S L+S+++ SY + H+
Sbjct: 255 DY-VKVEGADEAWIYFSAWTDF---------RKEDPRAAVESDLKSVKSQSYKSIREAHV 304
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ L RVSI L S D S RV DP +V L FQFGR
Sbjct: 305 EDYQSLASRVSIDLGTSSAKQKKDATSA----------RVAGLGAAFDPEIVALAFQFGR 354
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
Y+LISS+R GT LQGIWN+D +P W S +NIN +MN+W +L NL+E EPLF
Sbjct: 355 YMLISSARQGTLAPTLQGIWNKDPNPQWGSRYTININTQMNHWLALVTNLAELNEPLFSL 414
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ + G +TAQ Y A+G V HH TDIW S+ + WP G WL TH+ + Y
Sbjct: 415 IENVRQTGLQTAQKMYGAAGAVCHHNTDIWGDSAPVDNWALSTWWPTGLVWLVTHIHDTY 474
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD--GKLACV 536
+T + LEK+ Y L A+F LD I + G++ TNPS SPE+ + P+ G A +
Sbjct: 475 LFTGNATLLEKK-YDTLVDAAAFFLD-FITPYKGWMVTNPSVSPENVYRIPNGGGGTAAM 532
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED-GSIMEWAQ 595
+ TMD +++R +FS ++ A VL K + AL +++ + L P +++ G I EW +
Sbjct: 533 TAGPTMDNSLLRALFSIVLEAQSVLGKKDTALADRLEAARASLPPLMVSKRYGGIQEWIE 592
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTA 652
DF++ HRHLSHL+GL+PGH IT N +AA K+L +R + GWS W A
Sbjct: 593 DFEETAPGHRHLSHLWGLYPGHEIT-SANATFFEAARKSLNRRLSFDTDPAGWSQAWAIA 651
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
+ ARL + RM+ L L H K G L + PFQID+ FG TA +AE
Sbjct: 652 ISARLFNATGVARMLDVL--LTTSTHAKSLLGDL------SPAPFQIDSTFGLTAGIAEA 703
Query: 713 LVQS--------------------TLND------LYLLPALP--WDKWSSGCVKGLKARG 744
L+QS T+ + + LLPALP W + G + GL RG
Sbjct: 704 LLQSHELVSPSSSKAPDAASMKATTVGNPSGVPLVRLLPALPKTWAQTGGGSITGLLGRG 763
Query: 745 GETVSICWKD-GDLHEVGIYS 764
G V I W + G L I S
Sbjct: 764 GFVVDISWDEKGQLVNATIVS 784
>gi|311746349|ref|ZP_07720134.1| fibronectin type III domain protein [Algoriphagus sp. PR1]
gi|126575233|gb|EAZ79565.1| fibronectin type III domain protein [Algoriphagus sp. PR1]
Length = 778
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 275/796 (34%), Positives = 419/796 (52%), Gaps = 63/796 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA-LSDV 75
+ PA+ + +A+P+GNGRLGAMV+G E ++LNED+LW G GD+ ++ L +
Sbjct: 27 YTSPAEIWEEALPVGNGRLGAMVFGKPSMERIQLNEDSLWPGEQGDWGIAKGRRSDLDQI 86
Query: 76 RSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
R+ + +G+ ++ + V F A +Q LGD+ L+FD + Y+R LDL TA
Sbjct: 87 RAYLRAGENEKSDSLLVAAFSRKAITRSHQTLGDLWLDFDFQEIS----DYKRSLDLTTA 142
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN--- 190
A + T+E SS PD IV ++ + + L S ++ +
Sbjct: 143 VASSTFKSQGYTVTQEVLSSAPDDAIVIRLKTNHPDGFVGKIRL-SRPEDEGFATAETKS 201
Query: 191 ---NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N + M G ++ +N G++F ++ ++ D G ++ D L++ GS
Sbjct: 202 LSENTLSMAGMITQRKGQLDSNPYPLLTGVKFKTLVYVETED--GNLNNGVDY-LELSGS 258
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
++ LV +SF +D + L++++ ++ + H+ DY + F R
Sbjct: 259 KEVLIKLVTETSF---------YNQDFDHAAELELENVKTKNWEGILEPHIQDYSQWFER 309
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
+ ++L ++ + VP+ R+++ Q D L +LLF +GRYLLISSSR
Sbjct: 310 MELKLGKAA------------MSEVPTDVRIENVQAGGVDLHLEKLLFDYGRYLLISSSR 357
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PG ANLQGIWN+D++ W++ H+NINL+MNYW + NLS+ +PLFDF+ + G
Sbjct: 358 PGNNPANLQGIWNKDINAPWNADYHLNINLQMNYWPADVTNLSKLNQPLFDFVDGVIHRG 417
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+ AQ N+ +G + H TD+W W W G W+ H W+HY +T D F
Sbjct: 418 QEVAQTNFGMAGTFLPHATDLWQVPFMRAATAYWGGWVGAGGWMARHYWDHYLFTKDERF 477
Query: 487 LEKRAYPLLEGCASFLLDWLIE-GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L +RA+P + +F DWL+E + L + PSTSPE+ F G+ + + MD
Sbjct: 478 LRERAFPAISQVTAFYSDWLVEYPGENTLVSAPSTSPENRFFNEAGRPVATTMGAAMDQQ 537
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQDFKDPEVHH 604
II +VFS+ ++A+E+L +E L ++V + L RLRP +IAEDG I+EW Q +++ E H
Sbjct: 538 IIADVFSSFLAASEIL-NSESRLRDRVKEQLARLRPGVQIAEDGRILEWDQPYEETEKGH 596
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQE 661
RH+SHL+ PG IT + P+ A KTL+ R G G GWS W ARL D E
Sbjct: 597 RHMSHLYAFHPGDAITESETPEAFAAVRKTLEYRLEHGGAGTGWSRAWLINFSARLLDGE 656
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
A+ + L + LY NLF HPPFQID NFG+TA VAEML+QS D+
Sbjct: 657 MAHDNILEL-----------IKKSLYPNLFDGHPPFQIDGNFGYTAGVAEMLIQSHEKDI 705
Query: 722 Y-LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
LLPALP W G VKG+KARG TV + W+DG++ + + N TL Y
Sbjct: 706 VRLLPALP-KAWKDGEVKGIKARGDITVEMKWEDGEITALSLVPGEDQN-----ITLFYN 759
Query: 781 GTSVKVNLSAGKIYTF 796
G+ + + L G+ + F
Sbjct: 760 GSEMNLMLKKGEKFGF 775
>gi|149199940|ref|ZP_01876968.1| hypothetical protein LNTAR_09881 [Lentisphaera araneosa HTCC2155]
gi|149137009|gb|EDM25434.1| hypothetical protein LNTAR_09881 [Lentisphaera araneosa HTCC2155]
Length = 793
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 275/764 (35%), Positives = 396/764 (51%), Gaps = 83/764 (10%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDVRSLVDSGQYAE 86
+PIGNG++GAMV+GGV E + D+LW+G V G + K + +R ++ +Y
Sbjct: 55 LPIGNGKIGAMVYGGVEQEKINFTIDSLWSGKVDGTQNLAGSYKGMEQLRGMLMKDEYDA 114
Query: 87 ATAASVKLFGHP--AD----VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKY 139
A + L G AD +Q GD+ D+ +K+ + Y+R+LD+N A + V++
Sbjct: 115 AHKLAKDLIGSSPSADGNFGTFQTFGDLVF---DTGIKFESVSDYQRKLDINNALSVVEF 171
Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV---NGNNQIIME 196
++G ++TR F S+PDQ +V + S GS N+ L N +V NGN+ I++
Sbjct: 172 TMGKHKYTRTAFVSHPDQCLVLRFEVSAGGSQ--NIKLGFETPNKDWVPRINGND-IVIS 228
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G+ +P A +G +FSA +GT+S VEG+ L A
Sbjct: 229 GKAAQNHMPVNARIRVKHEGGKFSA--------SKGTLS--------VEGARVVEFYLSA 272
Query: 257 SSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSP 316
++FD + P+ + P E + L SY++L RHL+DY+ LF R++I + S
Sbjct: 273 DTAFD--YKAPNRIGEAPDQEVLKTLNQASEKSYAELLERHLEDYKDLFDRLTIDIGDSS 330
Query: 317 KDIVTDTCSEENIDTVPSAERVKSF------QTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
++ +P R+K++ + DP L+E ++Q+GRYLLI+SSRPGT
Sbjct: 331 LEL----------RNMPMEARLKNYGDSLASNANPDPDLIETIYQYGRYLLIASSRPGTL 380
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQG+WN L+P W + H+NINL+MNYW + P NL EC+EPL F+ L G TA
Sbjct: 381 PANLQGVWNNSLTPPWAADYHININLQMNYWLAGPTNLIECEEPLLKFIESLVEPGRITA 440
Query: 431 QVNYLASGWVIHHKTDIWAKSS----ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+ + + GW+ +H T+IW ++ +GK+ W WL HL+EH+ Y D+
Sbjct: 441 KEYFNSEGWMSYHATNIWGHTAPRVGRGKGKLTWKALTTCSLWLSHHLYEHFAYRQDKSQ 500
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L+ +P+L A F +L + DG + PS S EH I S + D+A
Sbjct: 501 LKNEIWPVLAEAADFAAGYLTQLPDGAYTSMPSWSSEHGLI---------SKGAITDIAT 551
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
REV + AE+L N + K L KI + G + EW +D DP HRH
Sbjct: 552 TREVLQCALECAEILGINNER-TAKWKNRKDNLLAYKIGQHGQLQEWLEDRDDPNNKHRH 610
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRM 666
++HL+GL PG I+ K P L AA TL RG+ GWS+ WK W R+ + E A +
Sbjct: 611 INHLWGLHPGTQISPLKTPKLADAALVTLAHRGDGATGWSLGWKLNFWTRMRNGEKAMIL 670
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND------ 720
L NLV + LY NLF HPPFQID NFG TA V EML+QS D
Sbjct: 671 ---LNNLVKEK--------LYPNLFDVHPPFQIDGNFGATAGVTEMLLQSQERDSEGRYV 719
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ +LPALP W SG VKGLKARGG V I W+ + E+ I S
Sbjct: 720 IDVLPALP-KSWLSGSVKGLKARGGFEVDITWEQDKIKELSITS 762
>gi|373958328|ref|ZP_09618288.1| glycoside hydrolase family 2 sugar binding [Mucilaginibacter
paludis DSM 18603]
gi|373894928|gb|EHQ30825.1| glycoside hydrolase family 2 sugar binding [Mucilaginibacter
paludis DSM 18603]
Length = 960
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 254/703 (36%), Positives = 377/703 (53%), Gaps = 56/703 (7%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
Y GD+ L F S Y+R+LD+ A A Y+ V FTRE+ +S+P + I+
Sbjct: 311 YLPFGDLILNFKTSS---QVMDYKRDLDIGKAVATTTYNSNGVNFTREYLASDPAKAIII 367
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+ S+ G +++ +LL ++ +Q+ ++ KG+ A
Sbjct: 368 HLKASKPG----QINMVALLQTSHKISSVHQVDANTIALDVKVQ---------KGV-LKA 413
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
+ + I GT+ + ++ + + +D + L A++SF N D P A
Sbjct: 414 VSYLYIKALSGTVKVINNQ-ISISKADDVTIYLTAATSFK----NYKDVSGKPDEICKQA 468
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
LQ+ + +++ L + + DYQ+ F+ S+ L D+ TD ER+K++
Sbjct: 469 LQAAKTKTFAQLKAQSITDYQQYFNTFSVNLGPGKVDVPTD-------------ERIKTY 515
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
DP L+ L Q+GRYLLIS SRP +++ ANLQGIWN+ + P+W S NINL+MNY
Sbjct: 516 SVAFDPGLLALYMQYGRYLLISCSRPNSKLPANLQGIWNDQMVPSWGSKFTTNINLQMNY 575
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + NL+ C++PLF ++ L++ G++TA+++Y A GW++HH TDIW +A
Sbjct: 576 WPAEELNLTPCEKPLFKMISQLAVTGAQTAKIHYDAPGWILHHNTDIWL-GTAPINASNH 634
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPS 519
+W G AWLC LWEHY YT D DFL+K Y ++G A F + L++ G+L + PS
Sbjct: 635 GIWQGGAAWLCHQLWEHYLYTGDIDFLKKH-YAEMKGAAEFFVSTLVKDPVTGFLISTPS 693
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH G L TMD IIR++F ISA+E+L K +DA + + + ++
Sbjct: 694 NSPEH------GGLVA---GPTMDRQIIRDLFKNCISASEIL-KTDDAFRKTLQEKYAQI 743
Query: 580 RPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
P K+ + G + EW +D D HRH+SHL+G++PG IT + P + KAAEK+ Q RG
Sbjct: 744 APNKVGKFGQLQEWMEDKDDTADTHRHVSHLWGVYPGTDITWDSTPQMMKAAEKSFQYRG 803
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+EG GWS+ WK L AR +HA +V +L ++ + K GG+Y NLF AHPPFQI
Sbjct: 804 DEGTGWSLAWKVNLMARFKQGDHAMLLVNKLLSVAENGSAKE-RGGVYHNLFDAHPPFQI 862
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG A +AEML+QS + LLPALP G +KG+ ARGG +++ WK G L +
Sbjct: 863 DGNFGGAAGIAEMLLQSQQGYIDLLPALP-SSLPDGELKGICARGGFVLNMLWKGGKLQQ 921
Query: 760 VGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKC 802
V + S L Y AGK YT N LK
Sbjct: 922 VQVTSKIGRE-----CVLKYGDMQTSFKTEAGKTYTVNGLLKT 959
Score = 80.9 bits (198), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 39/95 (41%), Positives = 60/95 (63%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
++ A + LK+ + PA+ +TDA+PIGNG LGAM +GG+ S+ ++ NE TLW+G P
Sbjct: 14 LLAAAQNVFSQDLKLWYKKPAEKWTDALPIGNGTLGAMFYGGISSDRIQFNEQTLWSGSP 73
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF 95
Y A L ++R+L+ +G+ AEA A + K F
Sbjct: 74 RKYQRDGAATYLPEIRNLLFAGKQAEAEALAEKHF 108
>gi|302669281|ref|YP_003832431.1| glycoside hydrolase family 95 Gh95A [Butyrivibrio proteoclasticus
B316]
gi|302396945|gb|ADL35849.1| glycoside hydrolase family 95 Gh95A [Butyrivibrio proteoclasticus
B316]
Length = 714
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 266/776 (34%), Positives = 393/776 (50%), Gaps = 102/776 (13%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + AK + A+P+GNG +GAM +GG + +LN D++W P D NPDA +++
Sbjct: 3 RLWYKEAAKDWNSALPLGNGFMGAMCFGGTLIDRFQLNNDSIWWSGPRDRINPDAKESIP 62
Query: 74 DVRSLVDSGQYAEAT-AASVKLFGHP--ADVYQLLGDIEL--------------EFDDSH 116
+R L+ G+ ++A A+ + G P Y+ LGD+ + E
Sbjct: 63 VIRRLIREGRISDAEDLANEAMAGIPEYQSHYEPLGDLFIIPEGKERIQILGIREHWSGQ 122
Query: 117 LKYAEET--YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS------ES 168
L EE Y+RELD+ V Y+ V+F RE F SN D+V+ K GS E
Sbjct: 123 LNRIEEIPDYKRELDIEKGIHTVSYTKDGVKFCRESFISNVDKVMAIKCLGSRLRIFAER 182
Query: 169 GSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
G V Y N + MEGR G++F ++ +
Sbjct: 183 GDQCEKV----------YKLSENTLCMEGRTGAD-------------GVRFCMVIRVVNG 219
Query: 229 DD--RGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR 286
+ RG + + D A +L+ + + F +DP ++++ L + +
Sbjct: 220 NPYIRGRM---------LHADDDAEILIASQTDF---------YNEDPVADAVRTLDAAQ 261
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDE 345
L Y +L RH+ D Q+L R ++++ +N D +P+ +R+++ +
Sbjct: 262 KLGYDELKKRHVCDVQELMDRCTLEID------------SDNRDNIPTDKRLQAVAEGGT 309
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
D L+ LLF +GRYLLISSSRPG+ ANLQGIWN+ SP WDS +NIN +MNYW +
Sbjct: 310 DNGLINLLFAYGRYLLISSSRPGSLPANLQGIWNDSFSPAWDSKFTININAQMNYWPAEV 369
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
LSE EPLFD + + NG + A Y A GW+ HH TDIW + + W M
Sbjct: 370 TGLSELHEPLFDLMKRMLPNGRRAAAEMYCARGWMAHHNTDIWGDCAPQDTWQAASYWQM 429
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
G AWLC H+ EHY YT D +F+ + P+++ A F D LIE G L +PS SPE+
Sbjct: 430 GAAWLCLHILEHYRYTQDENFM-REYLPMVKEAALFFEDSLIENEAGQLVVSPSVSPENT 488
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
++ P G+ + ++MD I+ E+FS +I ++L E +L LP+ +I+
Sbjct: 489 YVLPSGERGMMCEGASMDAQILYELFSGLI-GTDMLSSEEKERYTTILCKLPK---PQIS 544
Query: 586 EDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPD-LCKAAEKTLQKRGEEG-- 642
E G++ EWA+++ + E+ HRH+SHLF L+PG ++ D L KAA T+++R G
Sbjct: 545 EIGTVQEWAENYDEVEIGHRHISHLFALYPGKQFFDSEDKDALLKAARATIERRVSHGGG 604
Query: 643 -PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDA 701
GWS W +WARL D E Y + L + NLF HPPFQID
Sbjct: 605 HTGWSRAWIINMWARLCDGEQCYENIMAL-----------VRKSMLPNLFDNHPPFQIDG 653
Query: 702 NFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
NFG + +AEML+QS + LLPALP +W SG V GL R G+ V I WKDG +
Sbjct: 654 NFGLVSGIAEMLIQSHEGEDKLLPALP-KEWPSGKVTGLHTRSGKIVDIEWKDGKV 708
>gi|336417082|ref|ZP_08597411.1| hypothetical protein HMPREF1017_04519 [Bacteroides ovatus
3_8_47FAA]
gi|335936707|gb|EGM98625.1| hypothetical protein HMPREF1017_04519 [Bacteroides ovatus
3_8_47FAA]
Length = 859
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 291/834 (34%), Positives = 430/834 (51%), Gaps = 85/834 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--------Y 63
LK T+N PAK + ++A+PIGNG +GAM++GGV + ++ NE TLW+G PG+
Sbjct: 32 LKATYNKPAKVWESEALPIGNGYMGAMIFGGVEVDVIQTNEHTLWSGGPGEDPSYNGGHL 91
Query: 64 TNPDAPKA-LSDVRSL---------VDSGQYAEATAASV------------------KLF 95
P+ K+ L R L V+ Y +A + KL
Sbjct: 92 GTPETNKSYLHKTRVLLQQKMNDFTVNHSAYIDADGKLITHNYEGDGNGTELRNLIDKLA 151
Query: 96 GHPADV--YQLLGDIELEFDDSHLKY-AEETYRRELDLNTATARVKYSVGNVEFTREHFS 152
G +Q L +I +E +S A Y R LD++ A RV Y G + F RE+F
Sbjct: 152 GTKEHFGSFQTLSNIIVEVVNSATSEPAYSDYTRTLDIDNAIHRVTYKEGGITFKREYFM 211
Query: 153 SNPDQVIVTKI-SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
S PD ++V ++ S S+ G +S +SL+SL + +N I + G P K +
Sbjct: 212 SYPDNIMVMRLTSDSKKGKISRMISLESLHTDKVIRASDNTITLTGY-PTPTSGDKRVGD 270
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD-- 269
G++++ L +K + G I+ ++ KKLK+E + ++L+ A++++ +
Sbjct: 271 HWKNGLKYAQQLLVKHTG--GKITVVDGKKLKIEEAKEIIVLMSAATNYVQCMDDSYHYF 328
Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
S ++P + + L+ N Y+ L H DY L+ R+ + L P+ V T
Sbjct: 329 SGEEPLDKVKATLKKAANKKYTALLATHEKDYHSLYDRMKLNLGNLPEMPVATT------ 382
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
D++ + E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W+S
Sbjct: 383 DSLLKGMDAHANSESENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNSD 442
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHH 443
H NIN++MNYW + P NLS C P+ +++ L G TAQ Y GWV HH
Sbjct: 443 YHTNINVQMNYWPTQPTNLSRCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHH 502
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
+ +IW ++ + K +P G W+C +WE+Y + +D+DFLE Y ++ A F +
Sbjct: 503 ENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNLDKDFLEAY-YDVMLQAALFWV 560
Query: 504 DWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
D L + DG L NPS SPEH EF L C + A+I E+F +I A++VL
Sbjct: 561 DNLWTDERDGTLVANPSHSPEHGEF-----SLGC-----STSQAMIAEMFDMMIKASKVL 610
Query: 562 EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF-KD--PEVHHRHLSHLFGLFPGHT 618
K+++ + ++ ++ +L KI G +MEW + KD + HRH +HLF L PG
Sbjct: 611 GKDKEPEIAEIKTAMNKLSGPKIGLGGQLMEWKDEVTKDVTGDGGHRHTNHLFWLHPGSQ 670
Query: 619 ITI---EKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVD 675
I I E++ A + TL RG+EG GWS WK WARLHD ++ +++ L
Sbjct: 671 IVIGRSEEDDKYANAMKVTLNTRGDEGTGWSKAWKLNFWARLHDGNRSHALLRSAMKLTV 730
Query: 676 PEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSG 735
P+ GG+Y+NLF AHPPFQID NFG TA +AEML+QS + LLPALP D W G
Sbjct: 731 PQGR---FGGVYTNLFDAHPPFQIDGNFGCTAGIAEMLMQSQGGYIELLPALP-DAWKDG 786
Query: 736 CVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN---DHDSFKTLHYRGTSVKV 786
KG+KARG V + WK+G + + I SN + K+L G V+V
Sbjct: 787 AFKGMKARGNFEVDVTWKEGQITSIEILSNAGAECMLKYPDAKSLKVSGAKVRV 840
>gi|429860996|gb|ELA35710.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 776
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 277/774 (35%), Positives = 401/774 (51%), Gaps = 68/774 (8%)
Query: 3 NAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
+ +S PL + + PA +++A+PIGNGRLGAMV G +E L+LNED++W G P D
Sbjct: 12 SGQSQQQPRPLLLHYESPASEWSEALPIGNGRLGAMVHGRTQTELLQLNEDSVWYGGPQD 71
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKY 119
T DA + L +R L+ ++AEA + F PA + Y+ LG +EF H+
Sbjct: 72 RTPKDALRHLPKLRQLIRDEEHAEAESLVREAFFATPASMRHYEPLGTCTIEF--GHVVE 129
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
YRR L L TA V+Y V + R+ +S PD V+ ++ SE+ F V L+
Sbjct: 130 DVTDYRRYLCLETAQTTVEYRCRGVSYRRDAIASFPDNVLAFRVVASEA--TRFVVRLNR 187
Query: 180 LLDNHSYVNGNNQII--MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
L + N I GR K P N+N + + L + D G++ A+
Sbjct: 188 LSEIEYETNEFLDSIDATNGRIVLKATPGGHNSN------RLAIALGVSCDDAEGSVEAI 241
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
+ + S +++ A ++F +DP + ++ + + +SDL RH
Sbjct: 242 GNAL--IVNSTSCTIVIGAQTTF---------RTEDPEAAAVDDVLKALSHQWSDLVERH 290
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFG 357
DY LF+R S+++S D C +P+ ER+K+ DP LV L +G
Sbjct: 291 QQDYAGLFNRTSLRMS-------PDACH------LPTDERIKN---SRDPGLVALYHNYG 334
Query: 358 RYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
RYLLIS SR + A LQGIWN +P W S +NINL+MNYW + PC+L EC P+
Sbjct: 335 RYLLISCSRNSKKALPATLQGIWNPSFAPPWGSKYTININLQMNYWPAGPCSLIECAIPV 394
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
L ++ G KTA+V Y GW H TDIWA + + +WP+GG W+C ++
Sbjct: 395 LGLLEKMAERGKKTARVMYGCEGWCARHNTDIWADTDPHDRWMPSTIWPLGGVWVCIDIF 454
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLA 534
E Y D + L KRA +LEG FLL++LI G YL TNPS SPE+ F++ G+
Sbjct: 455 EMLQYQYDEN-LHKRAAVVLEGAIMFLLEYLIPSACGRYLVTNPSLSPENTFLSVSGEPG 513
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
+ S +DM II F + + +L E+ L KV ++L RL P I DG I EW
Sbjct: 514 ILCEGSVIDMTIIHIAFEKFLWSTNIL-GGENPLRAKVEEALERLPPLVINSDGLIQEWG 572
Query: 595 -QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWK 650
+D+K+ E HRH+SHLFGL+PG I+ ++P+L AA+ L++R G GWS W
Sbjct: 573 LKDYKEQEPGHRHVSHLFGLYPGERISPSRSPELAAAAKNVLERRAAHGGGHTGWSRAWL 632
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
L ARL D E + + L +G N+ +HPPFQID NFG A +
Sbjct: 633 LNLHARLLDAEGCGQHMDLL-----------LKGSTLPNMLDSHPPFQIDGNFGGCAGIL 681
Query: 711 EMLVQSTLND-----LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
E LVQS++ D + LLP+ P D W+ G + G++ +GG VS W+DG + E
Sbjct: 682 ECLVQSSIIDANTVEIRLLPSCPKD-WAQGQLTGVRTKGGWLVSFSWQDGVIEE 734
>gi|291550959|emb|CBL27221.1| hypothetical protein RTO_27700 [Ruminococcus torques L2-14]
Length = 775
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 263/777 (33%), Positives = 409/777 (52%), Gaps = 72/777 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
KI F AK + +A+PIGNG LGAMV+G +E L++NED++WTG + NPDA +
Sbjct: 3 KICFREEAKDWNEALPIGNGFLGAMVFGKTGTERLQINEDSVWTGSFMERVNPDARENYP 62
Query: 74 DVRSLVDSGQY--AEATAASVKLFGHP-ADVYQLLGDIELEFDDS--------------- 115
VR L+ +G+ AE A +P YQ LGD+ ++F
Sbjct: 63 KVRELLLNGEIEQAELLAERSMYATYPHMRHYQTLGDVWIDFYKQRGKTIFKKDQGGLLS 122
Query: 116 --HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
H +TY RELD++ A +++Y ++ RE F+SNPD +IV ++ + L+F
Sbjct: 123 VQHESVEVQTYNRELDISRAVGKIQYESEKGKYEREFFASNPDHIIVYQMKSIDGELLNF 182
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGR--CPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
++SL + DN S G +G G +I D GI F +++++ +
Sbjct: 183 DLSL-TRKDNRS---GRGSSFCDGTEVLDGNKIRLYGKQGGD-HGIAFELLVQVRTKN-- 235
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
G IS + L VE + A L + A +SF + P M L + SY
Sbjct: 236 GKISRM-GSHLLVEDAKEATLFITARTSF---------RSEQPLQWCMDVLSNAEKESYG 285
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLV 350
L RH+ DY + + +++L+ +++ + + + ER++ + ED L+
Sbjct: 286 TLQERHIKDYLSYYEKSNLKLN-----------YKDSYEHLTTPERLEQMRNGIEDIELI 334
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
+ F RYLLISSSR G+ +NLQGIWNE+ P W S +NIN+EMNYW + LS+
Sbjct: 335 NTYYNFARYLLISSSREGSLPSNLQGIWNEEFEPMWGSKYTININIEMNYWIAEKTGLSK 394
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
PL + L + +G A+ Y G+ HH TDIW + V LWPMGGAW
Sbjct: 395 LHMPLLEHLQRMYPHGKDVAEKMYGIDGFCCHHNTDIWGDCAPQDNHVSSTLWPMGGAWF 454
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
C HL EHY YT DR+FL K Y +L+ F L ++++ G + PS+SPE+ ++
Sbjct: 455 CLHLIEHYKYTKDREFL-KEYYGILKDAVKFFLQYMVKDAHGKWISGPSSSPENIYLNQK 513
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRPTKIAEDG 588
G+ C+ ++MD IIRE+F+ + E+ E+N+ + L E + + L + +I + G
Sbjct: 514 GEAGCLCMGASMDTEIIRELFNGYL---EITEENQLPNDLNEAINERLNHMPELQIGKYG 570
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGW 645
I EW++D+ + E HRH+S LF L+P I ++K P+L +AA++T+++R + G GW
Sbjct: 571 QIQEWSEDYDEVEPGHRHISQLFALYPAGQIRMDKTPELAQAAKQTIERRLKYGGGHTGW 630
Query: 646 SITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
S W +ARL ++E A++ +K L E +NLF HPPFQID NFG
Sbjct: 631 SKAWIILFYARLWEKEEAWKNLKEL-----------LEYATLNNLFDNHPPFQIDGNFGG 679
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
+ EML+Q + ++LLPALP + +G V G+ + G + + WK+G++ E+ I
Sbjct: 680 ACGLLEMLIQDYSDKVFLLPALP-NSLLNGEVNGICLKSGAVLDMKWKEGNIDEIRI 735
>gi|373850041|ref|ZP_09592842.1| Alpha-L-fucosidase [Opitutaceae bacterium TAV5]
gi|372476206|gb|EHP36215.1| Alpha-L-fucosidase [Opitutaceae bacterium TAV5]
Length = 839
Score = 431 bits (1107), Expect = e-117, Method: Compositional matrix adjust.
Identities = 271/801 (33%), Positives = 403/801 (50%), Gaps = 88/801 (10%)
Query: 17 FNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
F+ PA+ + A+PIGNGR GAM++G + +E L+LNED+LW G P D NPDA + L +
Sbjct: 14 FDQPAQQDWNRALPIGNGRFGAMIYGNIVAERLQLNEDSLWNGGPRDRRNPDAREHLPVL 73
Query: 76 RSLVDSGQYAEA-TAASVKLFGHP--ADVYQLLGDIELEF-----------DDSHLKYAE 121
R L+ G+ A A L G P Y+ L D+ L F D+ L
Sbjct: 74 RQLLADGRLAAAHDLVHDALAGIPDSQRCYEPLADLFLHFEHPGAPVAVSADEMALASGY 133
Query: 122 ET----------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
T YRR LDL TA V Y++ N + R H +S DQVI + G L
Sbjct: 134 ATPRFDPALLSHYRRALDLRTAVMSVDYTLNNTSYARRHLASTVDQVIALHLRAGRPGGL 193
Query: 172 SFNVSLDS---------LLDNHSYVNGNNQIIMEGRC-PGKRIPPKANANDDPKGIQFSA 221
+ + L+ D +V + + R P + +A D G++F+
Sbjct: 194 TLRLRLERGPRESYSTRYADTVGFVADAAREPADARTSPALLLRGRAGGED---GVRFAV 250
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
L +I+ G + + + L ++ +D L+L A+++F + DP + +
Sbjct: 251 GLRARIAG--GALRRI-GETLCIDAADSVTLVLAAATTF---------REDDPAAFVIGR 298
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK-S 340
+ + + H +Y+ F R S+ L +E ++P R+K +
Sbjct: 299 TGAALARGWDKIRADHEREYRSRFDRASLTLG-------APAAAEAGAGSIPVDLRLKRA 351
Query: 341 FQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
++ DP L L F + RYLLISSSRPG+ ANLQG+WN D P+W S +NIN EMNY
Sbjct: 352 RESGGDPVLASLYFNYARYLLISSSRPGSLPANLQGLWNADFWPSWGSKYTININTEMNY 411
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW 460
W + P NL++C +PLFD L + +G +TA+V Y G+V HH TD+WA +
Sbjct: 412 WIAEPANLADCHQPLFDHLGRVVESGRETARVMYGCRGFVAHHNTDLWADTCPTDRNAGA 471
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
+ W +GGAWL H W+ ++Y D L AY LL + F LD+LIE G L +P+
Sbjct: 472 SYWTIGGAWLVLHAWDRFDYDRDPGSLAA-AYALLREASLFFLDFLIEDARGRLVLSPTC 530
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA---------LVEK 571
SPE+ + P+G+ + TMD ++ +F AA++L + A + +
Sbjct: 531 SPENTYRLPNGEAGVLCAGCTMDSQLLSILFRRTAQAAQLLGRRPLAAAAIAGDHDFLAR 590
Query: 572 VLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAA 631
V + RL + G ++EW +D+++ + HRH+SH FGL PG I+ + PDL +A
Sbjct: 591 VAAAAARLPQPAVGRHGQLLEWLEDYEELDPQHRHVSHAFGLHPGDLISPRRTPDLARAI 650
Query: 632 EKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP-----EHEKHFEGGL 686
TL++RG+ G GW + WK +WARL D E A+R++ L V+ + +GG
Sbjct: 651 RVTLERRGDAGTGWCMAWKACMWARLGDGERAHRLLGNLLAPVETVSLANRDTAYEDGGT 710
Query: 687 YSNLFAAHPPFQIDANFGFTAAVAEMLVQS--------------TLNDLYLLPALPWDKW 732
Y NLF AHPPFQID NFG AA+ EML+QS L ++LLPALP W
Sbjct: 711 YPNLFCAHPPFQIDGNFGGAAAILEMLLQSHETEPDADAPDAPLGLPVIHLLPALP-SAW 769
Query: 733 SSGCVKGLKARGGETVSICWK 753
+G +G +ARGG V + W+
Sbjct: 770 PAGSFRGFRARGGCEVDLQWE 790
>gi|299149390|ref|ZP_07042447.1| fibronectin type III domain protein [Bacteroides sp. 3_1_23]
gi|298512577|gb|EFI36469.1| fibronectin type III domain protein [Bacteroides sp. 3_1_23]
Length = 859
Score = 430 bits (1106), Expect = e-117, Method: Compositional matrix adjust.
Identities = 290/834 (34%), Positives = 432/834 (51%), Gaps = 85/834 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--------Y 63
LK T+N PAK + ++A+PIGNG +GAM++GGV + ++ NE TLW+G PG+
Sbjct: 32 LKATYNKPAKVWESEALPIGNGYMGAMIFGGVEVDVIQTNEHTLWSGGPGEDPSYNGGHL 91
Query: 64 TNPDAPKA-LSDVRSL---------VDSGQYAEATAASV------------------KLF 95
P+ K+ L R L V+ Y +A + KL
Sbjct: 92 GTPETNKSYLHKTRVLLQQKMNDFTVNHSAYIDADGKLITHNYEGDGNGTELRNLIDKLA 151
Query: 96 GHPADV--YQLLGDIELEF-DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFS 152
G +Q L +I +E + + + A Y R LD++ A RV Y G + F RE+F
Sbjct: 152 GTKEHFGSFQTLSNIIVEVVNPATSEPAYSDYTRTLDIDNAIHRVTYKEGGITFKREYFM 211
Query: 153 SNPDQVIVTKI-SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
S PD ++V ++ S S+ G +S +SL+SL + +N I + G P K +
Sbjct: 212 SYPDNIMVMRLTSDSKKGKISRMISLESLHTDKVIRASDNTITLTGY-PTPTSGDKRVGD 270
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD-- 269
G++++ L +K + G I+ ++ KKLK+E + ++L+ A++++ +
Sbjct: 271 HWKNGLKYAQQLLVKHTG--GKITVVDGKKLKIEEAKEIIVLMSAATNYVQCMDDSYHYF 328
Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
S ++P + + L+ N Y+ L H DY L+ R+ + L P+ V T
Sbjct: 329 SGEEPLDKVKATLKKAANKKYTALLATHEKDYHSLYDRMKLNLGNLPEMPVVTT------ 382
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
D++ + E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W+S
Sbjct: 383 DSLLKGMDAHTNSESENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNSD 442
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHH 443
H NIN++MNYW + P NLS C P+ +++ L G TAQ Y GWV HH
Sbjct: 443 YHTNINVQMNYWPTQPTNLSRCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHH 502
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
+ +IW ++ + K +P G W+C +WE+Y + +D+DFLE Y ++ A F +
Sbjct: 503 ENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNLDKDFLEAY-YDVMLQAALFWV 560
Query: 504 DWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
D L + DG L NPS SPEH EF L C + A+I E+F +I A++VL
Sbjct: 561 DNLWTDERDGTLVANPSHSPEHGEF-----SLGC-----STSQAMIAEMFDMMIKASKVL 610
Query: 562 EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF-KD--PEVHHRHLSHLFGLFPGHT 618
K+++ + ++ ++ +L KI G +MEW + KD + HRH +HLF L PG
Sbjct: 611 GKDKEPEIAEIETAMNKLSGPKIGLGGQLMEWKDEVTKDVTGDGGHRHTNHLFWLHPGSQ 670
Query: 619 ITI---EKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVD 675
I I E++ A + TL RG+EG GWS WK WARLHD ++ +++ L
Sbjct: 671 IVIGRSEEDDKYANAMKVTLNTRGDEGTGWSKAWKLNFWARLHDGNRSHALLRSAMKLTV 730
Query: 676 PEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSG 735
P+ GG+Y+NLF AHPPFQID NFG TA +AEML+QS + LLPALP D W +G
Sbjct: 731 PQGR---FGGVYTNLFDAHPPFQIDGNFGCTAGIAEMLMQSQGGYIELLPALP-DAWKNG 786
Query: 736 CVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN---DHDSFKTLHYRGTSVKV 786
KG+KARG V + WK+G + + I SN + K+L G V+V
Sbjct: 787 AFKGMKARGNFEVDVIWKEGQITSIEILSNAGAECMLKYPDAKSLKVSGAKVRV 840
>gi|374376430|ref|ZP_09634088.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
gi|373233270|gb|EHP53065.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
Length = 946
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 254/701 (36%), Positives = 373/701 (53%), Gaps = 52/701 (7%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
YQ GD+ + K + YRR LDL TA Y+ V+F R + +S P QV+
Sbjct: 289 YQPFGDVVFHVNADETKVKD--YRRVLDLETAVLTTAYNYNGVDFKRTYIASQPQQVLAV 346
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+ S GS+SF L S H V +Q + + K D ++ +
Sbjct: 347 NFTASRPGSVSFETELTSP-HQHFIVEAVDQ---------QTLVLKIQVKDG--ALRGES 394
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
++++++ +G++ A++D KL V +D A + + A+++F N D DP++ +A
Sbjct: 395 YVQVRVT--KGSV-AVKDNKLIVSKADEATVFIAAATNFK----NFKDVSADPSARCRAA 447
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
++ I+ S++ + H+ +YQ+ F+ +S+ + +++P+ R++ F
Sbjct: 448 IKGIQQQSFASVLKAHVKEYQQYFNTLSVNFYGQKNQPSAN-------ESLPTDLRLEKF 500
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
DP V L Q+GRYLLISSSRPGT ANLQGIWNE LSP W S NIN EMNYW
Sbjct: 501 ARSGDPEFVALYMQYGRYLLISSSRPGTYPANLQGIWNELLSPPWGSKYTTNINAEMNYW 560
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+ LS + LF + L+++G +TA+ Y A GWV+HH TD+W ++A
Sbjct: 561 PAELLGLSPLHDALFKMVEELAVSGKETAKEYYNAPGWVLHHNTDLWRGTAAINASNH-G 619
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH-DGYLETNPST 520
+W GGAWLC+HLWE Y +T D FL+ AYP++ A F +LI+ GYL + PS
Sbjct: 620 IWVTGGAWLCSHLWERYLFTKDERFLKDTAYPIMREAALFFNHFLIKDPVTGYLISTPSN 679
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPEH G L TMD IIR +F + I A+++L K + AL +++ + PR+
Sbjct: 680 SPEH------GGLVA---GPTMDHQIIRALFKSTIEASQIL-KTDAALRKELEEKYPRIA 729
Query: 581 PTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE 640
P KI G + EW QD D HRH+SHL+G++PG+ I E P+L KAA ++L RG+
Sbjct: 730 PNKIGRFGQLQEWMQDVDDTTDKHRHVSHLWGVYPGNEINWETAPELMKAARQSLIYRGD 789
Query: 641 EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
GWS+ WK LWAR D H Y++++ L P G Y NLF AHPPFQID
Sbjct: 790 AATGWSLGWKINLWARFKDGNHTYKLIQMLLT---PAGR---SAGSYPNLFDAHPPFQID 843
Query: 701 ANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
NFG A + EML+QS + +LPALP D +G + G+ ARGG + I W+ L ++
Sbjct: 844 GNFGGAAGIGEMLLQSHTAFVDILPALP-DALPNGRINGIHARGGLILDIAWEQKHLTQL 902
Query: 761 GIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLK 801
I + D L Y G + N G+ Y+ + K
Sbjct: 903 NIKA-----IADGSAQLRYMGKVLPFNFKKGRQYSVSADFK 938
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 36/75 (48%), Positives = 53/75 (70%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ + PAK + +A+PIGNGRLGAMV+GGV ++ ++ NE+TLW+G P DY A + L
Sbjct: 24 LKLWYQHPAKEWVEALPIGNGRLGAMVFGGVQTDRVQFNEETLWSGYPRDYNKKGAYRYL 83
Query: 73 SDVRSLVDSGQYAEA 87
+R L+ +G+ EA
Sbjct: 84 DSIRGLLFAGKQKEA 98
>gi|288801450|ref|ZP_06406903.1| fibronectin type III domain protein [Prevotella sp. oral taxon 299
str. F0039]
gi|288331661|gb|EFC70146.1| fibronectin type III domain protein [Prevotella sp. oral taxon 299
str. F0039]
Length = 827
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 275/804 (34%), Positives = 408/804 (50%), Gaps = 89/804 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++P+GNG LGA V G + +E + NE TLW G P DA L ++R
Sbjct: 72 SQSLPLGNGSLGANVMGSIEAERITFNEKTLWRGGPNTSAGADAYWNVNKQSAHYLKEIR 131
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + +E Y
Sbjct: 132 QAFIEGNEKKAALLTRKNFNSTVPYESWKDKPFRFGNFTTMGEFYIETGLSSIGMSE--Y 189
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R +F S P+ ++V + + G +L F+ + +
Sbjct: 190 KRALSLDSALAVVQFKKDGVRYERNYFISYPNNIMVVRFKADQPGKQNLVFSYETNPVST 249
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G+N ++ KA+ +++ Q ++ IK + GTI+ + KL
Sbjct: 250 GKMEADGSNGLVF-----------KAHLDNN----QMEYVVRIKALNQGGTINN-DKGKL 293
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRH 297
+ G++ V L+ A + +F+ + NP SE+ +A ++ Y+ L H
Sbjct: 294 TINGANEVVFLITADTEYKVNFNPDYKNPRTYVGVNPSETTAAWMKKAVAQGYNALLEAH 353
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQF 356
DY LF+RVS+ L+ SE+ +P+ +R+ +++ ED L EL +QF
Sbjct: 354 YKDYSSLFNRVSLTLN-----------SEQRTSDIPTPQRLINYRKGKEDFYLEELYYQF 402
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NLSEC PL
Sbjct: 403 GRYLLIASSRPGNLPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPAGSTNLSECTLPLI 462
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + W PM G WL TH+W
Sbjct: 463 DFIRTLVKPGEKTAQAYFDARGWTASISGNIFGFTAPLGSEDMSWNFNPMAGPWLATHVW 522
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
++Y+YT D+ FL++ Y L++ A F +D+L + DG PSTSPEH
Sbjct: 523 DYYDYTRDKKFLKEVGYDLIKSSAIFAVDFLWKKPDGTYTAAPSTSPEH---------GP 573
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL +K E E+VLK R+ P K+ G ++EW
Sbjct: 574 IDEGTTFVHAVIREILMNAIDASKVLDVDKKERKQWEEVLK---RIAPYKVGRYGQLLEW 630
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP HRH++HLFGL PGHTI+ P L +A++ L RG+ GWS+ WK
Sbjct: 631 SKDIDDPNDQHRHVNHLFGLHPGHTISPITTPALAEASKVVLNHRGDGATGWSMGWKLNQ 690
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARLHD HAY++ L + G NL+ HPPFQID NFG TA V EML
Sbjct: 691 WARLHDGNHAYKLYGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGVTEML 739
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + ++LLPALP D W G VKGL A+G + ICWK+G L V I S N
Sbjct: 740 MQSHMGFIHLLPALP-DVWKDGEVKGLCAKGNFELDICWKNGILKSVTILSKNGGNCE-- 796
Query: 774 FKTLHYRGTSVKVNLSAGKIYTFN 797
L Y+ + + K YT N
Sbjct: 797 ---LRYKEDKLVLKTIKNKSYTLN 817
>gi|317505420|ref|ZP_07963340.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
gi|315663460|gb|EFV03207.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
Length = 861
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 276/817 (33%), Positives = 407/817 (49%), Gaps = 102/817 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG------------------------- 61
++PIGNG +GA ++G + +E + LNE +LW G PG
Sbjct: 79 SLPIGNGSVGANIFGSISAERITLNEKSLWRGGPGVSHDASYYWNVNDNNVFPVNIDDGH 138
Query: 62 --DY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQL------LGDI-- 108
Y N + L D+R+ +G A+A + + K F A Q G+
Sbjct: 139 DASYYWNVNKRSVSVLKDIRAAFLAGDKAKADSLTRKNFNGWASYEQRDEKPFRFGNFTT 198
Query: 109 --ELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS 166
EL + + YRREL L++A V+++ V + R F S PD V+V + +
Sbjct: 199 MGELFIETGLTEEGISHYRRELSLDSARTLVQFNQNGVCYQRTAFVSYPDNVLVLRFKAN 258
Query: 167 ESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
G +L+F+ + + + +G N ++ G D G+Q+ ++
Sbjct: 259 AEGRQNLNFSYAPNPVSTGQMQADGANGLVYRGAL-------------DDNGMQY--VVR 303
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESM 279
I+ G+++ D LK+ +D + L+ A + +F+ F NP P +
Sbjct: 304 IQAVTKGGSVTNEHDT-LKIRHADEVMFLITADTDYRINFNPDFTNPKTYVGVQPEVTTQ 362
Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
+ +Q Y+ L++RH DY LF RV ++L+ S D P+A+R++
Sbjct: 363 AWMQQAEKKDYNQLFSRHYRDYSALFQRVKLRLN----------PSNHAADDKPTAQRLE 412
Query: 340 SFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
+++ D +L EL +QFGRYLLI+SSRPGT ANLQG+W+ ++ W H NINL+M
Sbjct: 413 AYRNGTTDNALEELYYQFGRYLLIASSRPGTLPANLQGLWHNNVDGPWHVDYHNNINLQM 472
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK- 457
NYW +L EC PL DF+ L G++TA+ Y A GW ++I+ ++ +
Sbjct: 473 NYWPVHTTHLDECALPLIDFVRSLVKPGAETAKAYYGARGWTTSVSSNIFGFTAPLSSED 532
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
+ W L PMGG WL THLWE+Y++T D+ L Y L++ A F +D+L DG
Sbjct: 533 MSWNLCPMGGPWLATHLWEYYDFTRDKQLLRSTLYDLIKQSADFAVDYLWRKPDGTYTAA 592
Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
PSTSPEH + T A+IRE+ I+A++VL + +A ++ + L
Sbjct: 593 PSTSPEH---------GPIDEGVTFVHAVIREILLDAIAASKVLGVDVEAR-KQWQQVLN 642
Query: 578 RLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQK 637
L P +I G + EW++D DP HHRH++HLFGL PGHTIT PDL KA+ L+
Sbjct: 643 HLAPYRIGRYGQLQEWSEDIDDPNDHHRHVNHLFGLHPGHTITPSATPDLAKASRVVLEH 702
Query: 638 RGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
RG+ GWS+ WK WARL D HAY +V+ L + G +NL+ HPPF
Sbjct: 703 RGDGATGWSMGWKINQWARLQDGNHAYLLVRNL-----------LKNGTLNNLWDTHPPF 751
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID NFG TA + EML+QS + LPALP D W G V GL+ARGG VS+ W +G L
Sbjct: 752 QIDGNFGGTAGITEMLLQSHAGFIQFLPALP-DSWKQGEVSGLRARGGFEVSLKWNEGTL 810
Query: 758 HEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIY 794
I S L+YRG S+ G+ Y
Sbjct: 811 QSATIKSLAGEP-----CKLNYRGNSIHFATQKGRNY 842
>gi|213963750|ref|ZP_03392000.1| alpha-L-fucosidase 2 [Capnocytophaga sputigena Capno]
gi|213953630|gb|EEB64962.1| alpha-L-fucosidase 2 [Capnocytophaga sputigena Capno]
Length = 806
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 285/812 (35%), Positives = 428/812 (52%), Gaps = 76/812 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA+HFT+++PIGNGRLGAM +G + + LNE +LW+G D +P+A L
Sbjct: 23 VSVVFHKPAEHFTESLPIGNGRLGAMFFGKTDVDRIVLNEISLWSGGTQDADDPNAHIHL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
++ L+ G+ EA A K F G+ A+ YQ+LG++ L++ +
Sbjct: 83 KTIQQLLLEGKNLEAQALLQKHFIAKGEGSCKGNGANCSYGCYQILGELLLDWKST---L 139
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
E Y+R L L+ ATA + GN + F+ + +I +I+ S+ L ++SL
Sbjct: 140 PTENYQRILRLDQATAFTSFKRGNNYIQQIAFADFKNDLIWIRITASQP--LDIDISLHR 197
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDD-RGTISALE 238
+N + +N+I + G P N++ +G+QF++ ++++ + + T +A
Sbjct: 198 R-ENATTSYKSNKITLSGVLP----------NENTEGMQFASEIDVQTDGNLQNTTNATS 246
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+K K VL + A+++++ F ++ D ++ LQ + + +
Sbjct: 247 IQKAKE-----IVLKISAATNYN--FTKGGLTQNDVLQKANDYLQKA-TIPFENAIIESQ 298
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-QFG 357
YQ F+R +R + TDT S + + ER++ F + +L+ +L+ FG
Sbjct: 299 KAYQVFFNR-----NRWYSEANTDTSS------LSTFERLQRFYKGKKDALLPVLYYNFG 347
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSR G ANLQG+W E+ W+ H+NINL+MNYW + NLSE PL
Sbjct: 348 RYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINLQMNYWLAESTNLSELTTPLHK 407
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
F L NG KTA+ Y A+GW+ H ++ W +S W GGAWLC H+W+H
Sbjct: 408 FTKNLVANGRKTARAYYNANGWMAHVISNPWFYTSPGE-SAEWGSTLTGGAWLCEHIWQH 466
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DGK- 532
Y YT++ DFL + YP+L+ A F LI+ GY T PS SPE+ +I P DGK
Sbjct: 467 YLYTLNTDFL-REYYPVLKEAADFFQSLLIKDPKTGYWVTAPSNSPENAYIMPQLKDGKK 525
Query: 533 -LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ + TMDM I+RE+FS + AA++L + + L + + + P +I + G +
Sbjct: 526 QIGNTCIAPTMDMQIVRELFSNTLQAAKILGVDNE-LYSQWQEIITHTVPNRIGKKGDLN 584
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW D+KD E +HRH+SHL+GL+P IT P L AA+KTL+ RG+ G GWS WK
Sbjct: 585 EWLDDWKDAEPNHRHISHLYGLYPYDEITPWDTPALATAAKKTLKMRGDGGTGWSRAWKI 644
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WARLHD HA ++++L + VDP GG Y NLF AHPPFQID N G A +AE
Sbjct: 645 NFWARLHDGNHALVLLRQLLHPVDPNSTSGQNGGTYPNLFCAHPPFQIDGNLGGAAGIAE 704
Query: 712 MLVQSTLND--LYLLPALP-WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
ML+QS + + LPALP W +G ++G+K R G VS W+ L I S
Sbjct: 705 MLLQSHGKNYTIRFLPALPSHPDWKNGTMQGMKVRNGFEVSFDWEKHRLKTATITS---- 760
Query: 769 NDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQL 800
GT V L AGK + + L
Sbjct: 761 ----------LNGTDCSVLLPAGKSIYYKKTL 782
>gi|383113365|ref|ZP_09934137.1| hypothetical protein BSGG_3069 [Bacteroides sp. D2]
gi|313695534|gb|EFS32369.1| hypothetical protein BSGG_3069 [Bacteroides sp. D2]
Length = 859
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 287/834 (34%), Positives = 431/834 (51%), Gaps = 85/834 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--------Y 63
LK T+N PAK + ++A+PIGNG +GAM++GGV + ++ NE TLW+G PG+
Sbjct: 32 LKATYNKPAKVWESEALPIGNGYMGAMIFGGVEVDVIQTNEHTLWSGGPGEDPSYNGGHL 91
Query: 64 TNPDAPKA-LSDVRSLVD------SGQYAEATAASVKLFGHPAD---------------- 100
P+ K+ L R L+ + ++ A KL H +
Sbjct: 92 GTPETNKSYLHKTRVLLQQKMNDFTANHSAYIDADGKLITHNYEGDGNGTELRNLIDKLA 151
Query: 101 -------VYQLLGDIELEF-DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFS 152
+Q L +I +E + + + A Y R LD++ A RV Y G + F RE+F
Sbjct: 152 GTKEHFGSFQTLSNIIVEVVNPATSEPAYSDYTRTLDIDNAIHRVTYKEGGITFKREYFM 211
Query: 153 SNPDQVIVTKI-SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN 211
S PD ++V ++ S S+ G +S +SL+SL + +N I + G P K +
Sbjct: 212 SYPDNIMVMRLTSDSKKGKISRMISLESLHTDKVIRASDNTITLTGY-PTPTSGDKRVGD 270
Query: 212 DDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD-- 269
G++++ L +K + G I+ ++ KKLK+E + ++L+ A++++ +
Sbjct: 271 HWKNGLKYAQQLLVKHTG--GKITVVDGKKLKIEEAKEIIVLMSAATNYVQCMDDSYHYF 328
Query: 270 SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
S ++P + + L+ N Y+ L H DY L+ R+ + L + V T
Sbjct: 329 SGEEPLDKVKATLKKAANKKYTALLAAHEKDYHSLYDRMKLNLGNLTEMPVVTT------ 382
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
D++ ++ E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W+S
Sbjct: 383 DSLLKGMDARTNSESENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNSD 442
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHH 443
H NIN++MNYW + P NLS C P+ +++ L G TAQ Y GWV HH
Sbjct: 443 YHTNINVQMNYWPTQPTNLSRCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHH 502
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
+ +IW ++ + K +P G W+C +WE+Y + +D+DFLE Y ++ A F +
Sbjct: 503 ENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNLDKDFLEAY-YDVMLQAALFWV 560
Query: 504 DWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL 561
D L + DG L NPS SPEH EF L C + A+I E+F +I A++VL
Sbjct: 561 DNLWTDERDGTLVANPSHSPEHGEF-----SLGC-----STSQAMIAEMFDMMIKASKVL 610
Query: 562 EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF-KD--PEVHHRHLSHLFGLFPGHT 618
K+++ + ++ ++ +L KI G +MEW + KD + HRH +HLF L PG
Sbjct: 611 GKDKEPEIAEIETAMNKLSGPKIGLGGQLMEWKDEVTKDVTGDGGHRHTNHLFWLHPGSQ 670
Query: 619 ITI---EKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVD 675
I I E++ A + TL RG+EG GWS WK WARLHD ++ +++ L
Sbjct: 671 IVIGRSEEDDKYANAMKVTLNTRGDEGTGWSKAWKLNFWARLHDGNRSHALLRSAMKLTV 730
Query: 676 PEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSG 735
P+ GG+Y+NLF AHPPFQID NFG TA +AEML+QS + LLPALP D W G
Sbjct: 731 PQGR---FGGVYTNLFDAHPPFQIDGNFGCTAGIAEMLMQSQGGYIELLPALP-DAWKDG 786
Query: 736 CVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN---DHDSFKTLHYRGTSVKV 786
KG+KARG V + WK+G + + I SN + K+L G V+V
Sbjct: 787 AFKGMKARGNFEVDVTWKEGQITSIEILSNAGAECMLKYPDAKSLKVSGARVRV 840
>gi|302413419|ref|XP_003004542.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
gi|261357118|gb|EEY19546.1| alpha-L-fucosidase [Verticillium albo-atrum VaMs.102]
Length = 765
Score = 429 bits (1102), Expect = e-117, Method: Compositional matrix adjust.
Identities = 275/776 (35%), Positives = 395/776 (50%), Gaps = 86/776 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + ++ PA +++A+P+GNGRLG MV+G +E L+LNED++W G P D T DA + L
Sbjct: 8 LTLHYDAPAASWSEALPVGNGRLGGMVYGRTSTELLQLNEDSVWYGGPQDRTPRDARRHL 67
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVY--QLLGDIELEFDDSHLKYAEETYRRELD 129
+R L+ ++A A A F PA + + LG+ LEF H YRR LD
Sbjct: 68 DTLRQLIRDEEHAAAEALVREAFFATPASMRHSEPLGNCTLEF--GHEAQDVTGYRRSLD 125
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLL 181
L TA A V+Y V + RE +S PD V+ + S SE ++ + L
Sbjct: 126 LATAQATVEYQCRGVSYRRETIASFPDNVVALRFSASEPTRFVVRLNRVSEIEWETNEFL 185
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALED 239
D+ NG +I++ GK N +P S +L I SDD G+I A+ +
Sbjct: 186 DSIQAANG--RIVLNATPGGK--------NSNP----LSLVLGISCDASDDGGSIEAIGN 231
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ S L++ A ++F DP + + + + S+ +L R
Sbjct: 232 ALVVKAFS--CTLVIAAHTAF---------RNADPEAAARQDVDNALKRSWHELVLRQRT 280
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DY LF R S+++ + D+ P+ ER+ + + DP LV L + +GRY
Sbjct: 281 DYASLFQRSSLRMWPAAHDL-------------PTNERI---EKNRDPGLVALYYNYGRY 324
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR + A LQGIWN +P W +NINL+MNYW + P NL EC P+
Sbjct: 325 LLISSSRDSDKALPATLQGIWNPSFAPPWGCKYTININLQMNYWLAAPGNLVECALPMLG 384
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +++ G+KTA++ Y GW HH TDIWA + + +WP+GG WLC + E
Sbjct: 385 LVERMAVRGAKTARIMYDCGGWCAHHNTDIWADTDPQDRWMPSTIWPLGGVWLCIDVLEM 444
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y DR L +RA LLEGC FLLD+LI +L TNPS SPE+ F++ G +
Sbjct: 445 LLYHYDRK-LHERAAVLLEGCIVFLLDFLIPSACRTFLVTNPSLSPENTFVSKSGDTGIL 503
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA-Q 595
S +D I+R F + + +LEK + LV KV ++ RL I DG I EW +
Sbjct: 504 CEGSAIDTTIVRIAFEKFLWSTAILEKG-NPLVPKVRDAMARLPDLTINNDGLIQEWGLK 562
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTA 652
D+K+ E HRH+SHLFGL+PG +I+ +P L AA+ L +R G GWS W
Sbjct: 563 DYKEHEPGHRHVSHLFGLYPGESISPVTSPKLAAAAKNVLDRRAAHGGGHTGWSRAWLLN 622
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L ARLHD + + L + N+ HPPFQID NFG A + E
Sbjct: 623 LHARLHDADGCGIHMDNL-----------LKSSTLPNMLDNHPPFQIDGNFGGAAGILEC 671
Query: 713 LVQSTLN---------DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
+VQS + ++ LLPA P D WS+G ++G++ +GG VS+ WKDG + E
Sbjct: 672 IVQSRIVWGASRPDCIEIRLLPACP-DAWSAGELRGVRVKGGWLVSLAWKDGRIEE 726
>gi|294808085|ref|ZP_06766858.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
gi|294444726|gb|EFG13420.1| putative lipoprotein [Bacteroides xylanisolvens SD CC 1b]
Length = 698
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 258/688 (37%), Positives = 389/688 (56%), Gaps = 53/688 (7%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
E S+ K+ ++ PA+ +T+A+P+GNGRLGAMV+G +E ++LNE+++W G P +
Sbjct: 23 EQKSSVPEYKLWYDCPAQVWTEALPLGNGRLGAMVYGTPGTEQIQLNEESIWAGRPNNNA 82
Query: 65 NPDAPKALSDVRSLVDSGQYAEA-TAASVKLF-----GHPADVYQLLGDIELEFDDSHLK 118
NPDA + + VR LV +G+Y EA T A+ K+ G P YQ GD+ + F H +
Sbjct: 83 NPDALEYIPKVRELVFAGKYLEAQTLATEKVMAKTNSGMP---YQSFGDLRIAFP-GHTR 138
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y+ Y REL L++A A V+Y V V++ RE +S DQV++ +++ + G ++FN L
Sbjct: 139 YS--NYYRELSLDSARAIVRYEVDGVQYQRETITSFTDQVVMVRLTANRPGQITFNAQLT 196
Query: 179 SLLDNHSYVNGNNQIIM----EGRCPGKRIPPKANANDDPKG-IQFSAILEIKISDDRGT 233
S +Q +M EG C + ++ ++ KG ++F L K ++G
Sbjct: 197 S----------PHQDVMIASEEGNC--VTLSGVSSLHEGLKGKVEFQGRLTAK---NKGG 241
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
A D L VE +D AV+ + +++F+ N D + T + + L + +
Sbjct: 242 KIACADGILSVEKADEAVIYVSIATNFN----NYQDITGNQTERAKNYLAKAMVHPFIES 297
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H+D Y++ RVS+ L R + V + +RV++F+ D LV
Sbjct: 298 KKNHVDFYRQYLTRVSLDLGR------------DQYANVTTDKRVENFKNTNDTHLVATY 345
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQFGRYLLI SS+PG Q ANLQGIWN+ L P+WDS NINLEMNYW S NLSE E
Sbjct: 346 FQFGRYLLICSSQPGGQPANLQGIWNDKLFPSWDSKYTCNINLEMNYWPSEVTNLSELNE 405
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLF + +S G +TA++ Y A+GWV+HH TDIW + A K +WP GGAWLC H
Sbjct: 406 PLFRLIKEVSNTGKETAKIMYGANGWVLHHNTDIWRITGA-VDKAPSGMWPSGGAWLCRH 464
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGK 532
LWE Y YT D +FL + YP+L+ F + ++ E +L PS SPE+ +GK
Sbjct: 465 LWERYLYTGDVEFL-RSVYPILKESGRFFDEIMVKEPVHNWLVVCPSNSPENVHSGSNGK 523
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
A + TMD +I ++++AIISA+++L+ +++ + + L + P ++ G + E
Sbjct: 524 -ATTAAGCTMDNQLIFDLWTAIISASQILDTDQE-FASHLTQRLKEMAPMQVGHWGQLQE 581
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D+ DP+ HRH+SHL+GLFP + I+ + P+L AA +L RG+ GWS+ WK
Sbjct: 582 WMFDWDDPKDVHRHISHLYGLFPSNQISPYRTPELFDAARTSLIHRGDPSTGWSMGWKVC 641
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEK 680
LWARL D +HAY+++ LV E +K
Sbjct: 642 LWARLLDGDHAYKLITDQLTLVRNEKKK 669
>gi|391227681|ref|ZP_10263888.1| hypothetical protein OpiT1DRAFT_00166 [Opitutaceae bacterium TAV1]
gi|391223174|gb|EIQ01594.1| hypothetical protein OpiT1DRAFT_00166 [Opitutaceae bacterium TAV1]
Length = 839
Score = 428 bits (1100), Expect = e-117, Method: Compositional matrix adjust.
Identities = 271/810 (33%), Positives = 404/810 (49%), Gaps = 106/810 (13%)
Query: 17 FNGPAKH-FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
F+ PA+ + A+PIGNGR GAM++G + +E L+LNED+LW G P D NPDA + L +
Sbjct: 14 FDQPAQQDWNRALPIGNGRFGAMIYGNIVAERLQLNEDSLWNGGPRDRRNPDAREHLPVL 73
Query: 76 RSLVDSGQYAEA-TAASVKLFGHP--ADVYQLLGDIELEF-----------DDSHLKYAE 121
R L+ G+ A A L G P Y+ L D+ L F D+ L
Sbjct: 74 RKLLADGRLAAAHDLVHDALAGIPDSQRCYEPLADLFLHFEHPGAPVAVSADEMALASGY 133
Query: 122 ET----------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
T YRR LDL TA V Y++ N + R H +S DQVI + G L
Sbjct: 134 ATPRFDPALLSHYRRALDLRTAVMSVDYTLNNTSYARRHLASAADQVIALHLRAGRPGGL 193
Query: 172 SFNVSLDS---------LLDNHSYVN----------GNNQIIMEGRCPGKRIPPKANAND 212
+ + L+ D +V + +++ GR G+
Sbjct: 194 TLRLRLERGPRKSYSTRYADTVGFVADAAREPSDACASPALLLRGRAGGE---------- 243
Query: 213 DPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK 272
G++F+ L +I+ G + + + L ++ +D L+L A+++F +
Sbjct: 244 --DGVRFAVGLRARIAG--GALRRI-GETLCIDAADSVTLVLAAATTF---------RED 289
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP + + + + + H +Y+ F R S+ L +E ++V
Sbjct: 290 DPAAFVIGRTGAALARGWDKIRADHEREYRSRFDRASLTLG-------APAAAEAGAESV 342
Query: 333 PSAERVK-SFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPH 391
P R+K + ++ DP L L F + RYLLISSSRPG+ ANLQG+WN D P+W S
Sbjct: 343 PVDLRLKRARESGGDPVLASLYFNYARYLLISSSRPGSLPANLQGLWNADFWPSWGSKYT 402
Query: 392 VNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS 451
+NIN EMNYW + P NL++C +PLFD L + +G +TA+V Y G+V HH TD+WA +
Sbjct: 403 ININTEMNYWIAEPANLADCHQPLFDHLGRVVESGRETARVMYGCRGFVAHHNTDLWADT 462
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
+ W +GGAWL H W+ ++Y D L AY LL + F LD+LIE
Sbjct: 463 CPTDRNAGASYWTIGGAWLVLHAWDRFDYDRDPGSLAA-AYALLREASLFFLDFLIEDAR 521
Query: 512 GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA---- 567
G L +P+ SPE+ + P+G+ + TMD ++ +F AA++L + A
Sbjct: 522 GRLVLSPTCSPENTYRLPNGEAGVLCAGCTMDSQLLSILFRRTAQAAQLLGRRPLAAAAI 581
Query: 568 -----LVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIE 622
+ +V + RL + G ++EW +D+++ + HRH+SH FGL PG I+
Sbjct: 582 AGDHDFLARVAAAAARLPQPAVGRHGQLLEWLEDYEELDPQHRHVSHAFGLHPGDLISPR 641
Query: 623 KNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP-----E 677
+ PDL +A TL++RG+ G GW + WK +WARL D E A+R++ L V+
Sbjct: 642 RTPDLARAIRVTLERRGDAGTGWCMAWKACMWARLGDGERAHRLLGNLLAPVETVSLANR 701
Query: 678 HEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS--------------TLNDLYL 723
+ +GG Y NLF AHPPFQID NFG AA+ EML+QS L ++L
Sbjct: 702 DTAYEDGGTYPNLFCAHPPFQIDGNFGGAAAILEMLLQSHETEPDADAPDAPLGLPVIHL 761
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWK 753
LPALP W +G +G +ARGG V + W+
Sbjct: 762 LPALP-SVWPAGSFRGFRARGGCEVDLQWE 790
>gi|269793879|ref|YP_003313334.1| hypothetical protein Sked_05400 [Sanguibacter keddieii DSM 10542]
gi|269096064|gb|ACZ20500.1| hypothetical protein Sked_05400 [Sanguibacter keddieii DSM 10542]
Length = 856
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 295/839 (35%), Positives = 415/839 (49%), Gaps = 85/839 (10%)
Query: 5 ESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-- 62
++ + PL + ++ PA +T+A+P+GNGRLGAM +GG + +++N+DT W+G P
Sbjct: 16 DNEAAARPLVLAYDAPAGRWTEALPVGNGRLGAMCFGGTTDDRVQVNDDTCWSGSPATTA 75
Query: 63 ----YTNPDAPKALSDVRSLVDSGQYAEATAASVKL-FGHPADVYQLLGDIEL-EFDDSH 116
+ + P + D R+ + +G A A +L GH + YQ L D+ L E D +
Sbjct: 76 GRRHFETGEGPGIVDDARAALAAGDVRAAERAVQRLQHGH-SQAYQPLVDLLLVEVDPAG 134
Query: 117 LKYAEET---YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
E Y R LDL TA AR ++ +E +SS P V+V ++ +
Sbjct: 135 GAVDPEPRTGYARSLDLRTAVARHTWTGAGGTVVQETWSSAPRGVLVVDRRATDGTLPAL 194
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKIS 228
VSL S + + R P +P A+ D G +A + + +
Sbjct: 195 RVSLTSPHPTLDVQGTPTGLAVTVRMPSDVVPDHEPADVHVRYDPAPGAAVTAAVHVAVH 254
Query: 229 DDR----GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE--SMSAL 282
D G SA D ++V G+ + L+L + F D++ P + S+ A
Sbjct: 255 TDGIVGDGGPSATADA-VEVVGATYVTLVLGTETDF-------VDAETAPHGDVDSLRAA 306
Query: 283 QSIRNLSYSD---------LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
++R D L H+ D+ LF RV I L +P +T VP
Sbjct: 307 VALRTSGVVDAITASGLPALRAEHVADHDALFGRVEIDLGPAPDSGLT----------VP 356
Query: 334 SAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
ER+ DP+L L Q+GRYL+I+ SRPGT+ NLQGIWNE + P W S
Sbjct: 357 --ERLARHAAGAPDPALAALQAQYGRYLMIAGSRPGTRPMNLQGIWNESVVPPWSSNYTT 414
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS- 451
NIN EMNYW + P NL EC EPL +L L+ G TA+ Y GW HH +D+W S
Sbjct: 415 NINTEMNYWPAGPANLDECHEPLTSWLADLARTGGDTAREVYGLPGWAAHHNSDVWGFSL 474
Query: 452 SADRGKV--VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
A G W WP+GG WL THLW+ Y+++ D FL A+PLL G A F L WL+E
Sbjct: 475 PAGDGDSDPSWTAWPLGGVWLATHLWDRYDWSRDLGFLAD-AWPLLRGAADFALAWLVEQ 533
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK------ 563
DG L T+P+TSPE+ ++APDG A V+ S+T D+A++RE+ + AA+VL +
Sbjct: 534 PDGTLGTSPATSPENRYVAPDGLPAAVTVSTTSDLAMVRELLGRCLDAAQVLVEADAPLP 593
Query: 564 ------NEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGH 617
++A +L RL ++ DG + EW+ D D E HRH SHL G++PG
Sbjct: 594 AGAPAPADEAWQAAARAALDRLPLERVLPDGRLAEWSTDLVDAEPEHRHQSHLVGVYPGS 653
Query: 618 TITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPE 677
+ + P L AA TL RG + GWS+ W+ AL ARL D + A L + P
Sbjct: 654 RVDPQTEPGLAAAALATLDARGPDSTGWSLAWRLALRARLRDVDGAE---AALGAFLRPT 710
Query: 678 HEKHFEG-------GLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND-----LYLLP 725
+ G G+Y NLF AHPPFQ+D N GFTA VAEML+QS + LLP
Sbjct: 711 ADGAPAGAPPGTGAGVYPNLFCAHPPFQVDGNLGFTAGVAEMLLQSHRTTAETTVVELLP 770
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
ALP W G GL+ARGG TV + W+ G + EV + + T R T V
Sbjct: 771 ALP-SGWQDGRATGLRARGGVTVDLVWQSGLVVEVVLAGPAGRRVELTLPTADGRHTVV 828
>gi|149277534|ref|ZP_01883675.1| hypothetical protein PBAL39_05083 [Pedobacter sp. BAL39]
gi|149231767|gb|EDM37145.1| hypothetical protein PBAL39_05083 [Pedobacter sp. BAL39]
Length = 780
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 260/784 (33%), Positives = 423/784 (53%), Gaps = 55/784 (7%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M+++ + LK+ + PA+ + + + +GNGRLG M GG+ ET+ LN+ TLW+G P
Sbjct: 15 MLSSNGVFSQAKLKLWYEHPAQKWEETLALGNGRLGMMPDGGITRETVVLNDITLWSGAP 74
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLF--------GHPADVYQLLGDIELEF 112
D N +A K+L +R L+ G+ EA + F G +Q+LG +++ F
Sbjct: 75 QDANNYEASKSLPQIRKLLAEGKNDEAQELVNRDFICTGKGSGGVNYGCFQVLGTLQMNF 134
Query: 113 D---DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
+ + + Y REL + A A Y + V++ +E+ +S D + + +I+ + G
Sbjct: 135 SYPGATADQLKDNGYHRELSIGEAIASSSYQINGVKYKKEYLTSFDDDICLIRITADKPG 194
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+L+F VS+ + + G ++ ++G+ + D KG+Q+ + + +
Sbjct: 195 ALNFKVSISRPERGEASIAGQ-ELQLQGQL---------DNGIDGKGMQYLSRVRAVLKG 244
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
+ T +K+ V V+L VAS G SD + T + M+A R
Sbjct: 245 GKLTT----EKEALVISKATEVILFVAS----GTDFRASDFRMK-TEQVMAAAMKKR--- 292
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDP 347
Y+ + H+ ++Q LF+RVS+ + + +D+VP+ R++ F + D
Sbjct: 293 YALQRSNHIRNFQHLFNRVSV------------SIGHQLMDSVPTDLRLERFHKNPAADL 340
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L +QFGRYL ISS+R G NLQG+W + W H+++N++MN+W N
Sbjct: 341 GFPALFYQFGRYLSISSTRVGLLPPNLQGLWANQIQTPWTGDYHLDVNVQMNHWPVEVSN 400
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE PL + + L G +TA+ Y A GW+ H T++W + W G
Sbjct: 401 LSELNLPLAELVRGLVKPGQRTAKAYYNADGWIAHVITNVWGFTEPGE-SASWGSSNAGS 459
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEF 526
WLC +LW+HY ++ D+++L + YP+L+G A F L+ + G+L T PS SPE+ F
Sbjct: 460 GWLCNNLWDHYAFSNDKEYL-RSIYPILKGSAEFYNSVLVRDEETGWLVTAPSVSPENSF 518
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKI 584
P+GK A +S T+D I+RE+F +I+A+E+L + A++++ LKS+P I
Sbjct: 519 YLPNGKTASISMGPTIDNQIVRELFGNVIAASEMLGLDAGFRAILQEKLKSIPP--AGNI 576
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
++DG IMEW +D+K+ + HRH+SHL+GL+P IT P+L +AA+KTL+ RG++GP
Sbjct: 577 SKDGRIMEWLRDYKETDPQHRHISHLYGLYPATLITPAGTPELAEAAKKTLEVRGDDGPS 636
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLF-NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANF 703
W+I +K WARL D E AY+++ L + + GG+Y NL +A PPFQID NF
Sbjct: 637 WTIAYKLLFWARLQDGERAYKLLTELLKSTTRTDMNYGAGGGIYPNLLSAGPPFQIDGNF 696
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
G A +AEML+QS + LLPA P ++G GLKARG TV+ WK+G + + +
Sbjct: 697 GGAAGIAEMLIQSHEGYIELLPAAPAAWKAAGSFSGLKARGNYTVNASWKEGRVTDFKVM 756
Query: 764 SNYS 767
+ ++
Sbjct: 757 APFA 760
>gi|319936285|ref|ZP_08010703.1| hypothetical protein HMPREF9488_01536 [Coprobacillus sp. 29_1]
gi|319808661|gb|EFW05205.1| hypothetical protein HMPREF9488_01536 [Coprobacillus sp. 29_1]
Length = 749
Score = 427 bits (1099), Expect = e-117, Method: Compositional matrix adjust.
Identities = 257/759 (33%), Positives = 399/759 (52%), Gaps = 58/759 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ FN PA + +A+P+GNG LGAMV+G E + +NED+L++G P + NP+ L
Sbjct: 6 KLIFNKPALQWEEAMPLGNGYLGAMVFGQTQKELICMNEDSLYSGGPIERGNPNTLDHLD 65
Query: 74 DVRSLVDSGQYAEATAASVKLF----GHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
++R+L+ G+ EA + F HP YQ LG + +EF ++ + Y++ LD
Sbjct: 66 EMRTLLLDGKVEEAQKKAPNYFYATTPHPRH-YQPLGQVWMEFHHQNV----QDYQKVLD 120
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L + ++Y NVE+ RE F S P+QV V KI S++ L+F D L G
Sbjct: 121 LKNSIGSIQYRYNNVEYQRECFISYPNQVFVYKIKASQNQQLNF----DLYLTRRDIRPG 176
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
++ ++ K + N + K GI ++ +++ D G + +L +E +
Sbjct: 177 RSESYVDDIHIEKDYLYLSGYNGNQKNGISYTMATTVQLKD--GCLKKY-GSRLVIENAT 233
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
A++ +V +S+ +P L SY +L H+ DYQ F ++
Sbjct: 234 EAIVYVVGRTSY---------RSHNPFQWCQKQLDKTLLKSYRNLKQDHIRDYQNYFDQL 284
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSA-ERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
+ L EN+ ++P +++K Q D D L+E F FGRYLLISSSR
Sbjct: 285 ELTLGDH---------KNENMMSIPERLQKMKEGQIDLD--LIETYFHFGRYLLISSSRE 333
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G+ ANLQGIWN + P W S +NIN++MNYW + LS PL + G
Sbjct: 334 GSLAANLQGIWNGEFEPPWGSRYTININIQMNYWLAEKTGLSRLHLPLMQLQKIMLPRGQ 393
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
K A+ Y G HH TDIW + V LWPMG WL H++EHY YT +++F+
Sbjct: 394 KIAKEMYGCRGTCAHHNTDIWGDCAPADYYVPSTLWPMGSLWLSLHIFEHYQYTHNQEFI 453
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ +P+L+ A F LD++ + +G+ T PS SPE+ ++ DG+ A V S +MD+ ++
Sbjct: 454 LE-YFPILKENALFFLDYMFKDANGFYATGPSVSPENAYMTQDGQAATVCLSPSMDIQLL 512
Query: 548 REVFSAIISAAEVLEKNE-DALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
RE F++ + + L +++ +A + + L+ LP P +I + G IMEW +D+ + E+ HRH
Sbjct: 513 REFFTSYLQLLKELNRHDLEAEINEYLEKLP---PIQIGKYGQIMEWHEDYDEIEIGHRH 569
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHA 663
+S LF L+PG I + P+L +AA +TLQ+R G GWS W +ARLH E A
Sbjct: 570 ISQLFALYPGRHIQYSETPELIEAAYQTLQRRLSHGGGHTGWSCAWIIHFFARLHKGEEA 629
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
+ + +L + NLF HPPFQID NFG + A+ EML+Q N +Y+
Sbjct: 630 FDTLLKL-----------LKNSTLDNLFDNHPPFQIDGNFGGSNAILEMLIQDYENKVYV 678
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
LPAL + G +KGL+ + G +++ WKD + + I
Sbjct: 679 LPALS-REMPEGILKGLRLKSGAVLNMSWKDCQVSNIEI 716
>gi|375146879|ref|YP_005009320.1| alpha-L-fucosidase [Niastella koreensis GR20-10]
gi|361060925|gb|AEV99916.1| Alpha-L-fucosidase [Niastella koreensis GR20-10]
Length = 943
Score = 427 bits (1097), Expect = e-116, Method: Compositional matrix adjust.
Identities = 258/704 (36%), Positives = 375/704 (53%), Gaps = 72/704 (10%)
Query: 110 LEFDDSHLKYAE----ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG 165
L F D + ++A Y+R LDL+ A + V Y+ V + RE+F S P Q +V ++
Sbjct: 296 LPFGDLYFRFAHGNNSSDYQRSLDLDNAISTVSYTANGVSYNREYFISAPHQCVVMHVTA 355
Query: 166 SESGSLSFNVSLDS--------LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
S+ G+LS L++ +D+H+ + +E +N K +
Sbjct: 356 SKPGALSLQAVLNTPHKKYVVKKIDDHTL-----SLSLE------------VSNGVLKAV 398
Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSE 277
+ L + R T++ D + ++ + LVA++SF N D DP +
Sbjct: 399 GY---LYATATGGRLTVN---DTAINLQQATEVNFYLVAATSFK----NYKDVSGDPVAA 448
Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
+AL ++ + Y+ + T HL++Y KLF S T +P+ ER
Sbjct: 449 CKAALARVKGVPYASIKTAHLNEYHKLFETFSF------------TVPAGKNSGLPTNER 496
Query: 338 VKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLE 397
++ F +D +LV L + RYLLISSSRPGTQ ANLQGIWN+ L+P W S NINLE
Sbjct: 497 IRQFNMKDDAALVPLFLMYSRYLLISSSRPGTQPANLQGIWNDLLTPPWGSKYTTNINLE 556
Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK 457
MNYW + NLS C +PLF+ + L++ G +TA+ +Y A GWV+HH TD+W + +A
Sbjct: 557 MNYWTAEVLNLSTCTQPLFNMINELAVAGHQTAKDHYNAPGWVLHHNTDLW-RGTAPINA 615
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLET 516
+W G AWL H+WEH+ YT D FL + YP L+G A F +L++ GYL +
Sbjct: 616 SNHGIWVTGAAWLTLHIWEHFLYTQDTAFLRAQ-YPNLQGAAQFFEHFLVKDPKTGYLIS 674
Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
PS SPEH G L TMD IIRE+F +AA VL K + A E++ +
Sbjct: 675 TPSNSPEH------GGLVA---GPTMDHQIIRELFRNCSAAAAVL-KTDAAFAERLKTLI 724
Query: 577 PRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQ 636
P++ P KI + + EW +D D HRH+SHL+G+FPG IT K+ + KAA ++L
Sbjct: 725 PQIAPNKIGKHNQLQEWMEDIDDVNDQHRHISHLWGVFPGTDITW-KDSAMMKAARQSLI 783
Query: 637 KRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP 696
RG+ G GWS++WK +WAR + +HA MV+ LF ++ + GGLY+NLF AHPP
Sbjct: 784 YRGDGGTGWSLSWKVNVWARFKEGDHALLMVRNLFTPAMDDNGRE-RGGLYNNLFDAHPP 842
Query: 697 FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
FQID NFG ++ +AEM++QS + LLPALP + G VK + ARGG + I WK G
Sbjct: 843 FQIDGNFGASSGIAEMIMQSHTGVIELLPALP-GELPDGEVKCMCARGGFVLDISWKQGR 901
Query: 757 LHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQL 800
L+ + + S N H L Y +++ Y FN L
Sbjct: 902 LNHLKVVSKNGNTCH-----LKYGAKEIELATKKNGSYIFNGSL 940
Score = 93.6 bits (231), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 67/199 (33%), Positives = 99/199 (49%), Gaps = 25/199 (12%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
++S + PL++ + PA +TDA+P+GNGRLGAMV+GGV E L+LNE+TLW+G P Y
Sbjct: 20 SQSYAQKQPLRLWYQQPAATWTDALPLGNGRLGAMVFGGVGEEHLQLNEETLWSGRPRSY 79
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVKLF-GHPADVYQLLGDIELEFDDSHLKYAEE 122
++P A + L +R L+ G+ AE+ A K F G A DDS + ++
Sbjct: 80 SHPGAAQYLQPMRQLLAEGKQAESEAMGEKYFMGLKAP------------DDSAYELQKD 127
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
T+ R + A V Y+ N + ++V + GS SFNV L
Sbjct: 128 TWFRSVRAQIEPAGVTYNDNNWPAMQLPTPEGWERVGLEGTDGSLWFRTSFNVPAKWLGK 187
Query: 183 N------------HSYVNG 189
N ++YVNG
Sbjct: 188 NLVLDLGRIRDLDYTYVNG 206
>gi|423280895|ref|ZP_17259807.1| hypothetical protein HMPREF1203_04024 [Bacteroides fragilis HMW
610]
gi|404583536|gb|EKA88214.1| hypothetical protein HMPREF1203_04024 [Bacteroides fragilis HMW
610]
Length = 829
Score = 427 bits (1097), Expect = e-116, Method: Compositional matrix adjust.
Identities = 265/801 (33%), Positives = 404/801 (50%), Gaps = 89/801 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P N + L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSKGAEAYWNVNKQSAHVLDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + + Y
Sbjct: 135 QAFSEGDQKKAAMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYVETGLSAVNMS--GY 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R +F S P V+ + G +L+F+ + + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERSYFISYPANVMAIRFKADRPGKQNLTFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G+N + A+ D G+Q+ ++ I + GT+S D K+
Sbjct: 253 GSMTTDGSNGLTY-------------TAHLDNNGMQY--VVRIHATTKGGTLSN-ADGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D AV L+ A + +FD F +P +P + + + ++ Y L+ +H
Sbjct: 297 TVKDADEAVFLITADTDYKINFDPDFKDPKTYVGVNPAETTRQWMDNAVSMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ ++ +P+A+R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLN-----------PDQQSANLPTAKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + P NL+EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACPTNLNECTLPLV 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTAPLESEDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P KI G +MEW
Sbjct: 577 IDEGTTFVHAVIREILLDAIEASKVLGVDSKERKQWQEVLA---HLAPYKIGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP+ HRH++HLFGL PGHT++ PDL KAA L+ RG+ GWS+ WK
Sbjct: 634 SKDIDDPKNEHRHVNHLFGLHPGHTLSPITTPDLAKAARVVLEHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY++ L + G NL+ HPPFQID NFG TA + EML
Sbjct: 694 WARLQDGNHAYKLFGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + + LLPALP D W G + G+ A+G V + WK+G L E ++S
Sbjct: 743 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDMSWKNGQLAEATVFSKAGEP---- 797
Query: 774 FKTLHYRGTSVKVNLSAGKIY 794
T+ Y ++ S GK+Y
Sbjct: 798 -CTVRYGDKTLSFKTSKGKVY 817
>gi|192360052|ref|YP_001983169.1| alpha-L-fucosidase [Cellvibrio japonicus Ueda107]
gi|190686217|gb|ACE83895.1| alpha-L-fucosidase, putative, afc95A [Cellvibrio japonicus Ueda107]
Length = 782
Score = 426 bits (1096), Expect = e-116, Method: Compositional matrix adjust.
Identities = 276/784 (35%), Positives = 411/784 (52%), Gaps = 72/784 (9%)
Query: 2 MNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M+AE + + PL I F+ PA + + +PIGNG +GA++ GGV + ++ NE TLWTG P
Sbjct: 1 MSAEVSRESVPLAIAFDRPATDWEREGLPIGNGAMGAVISGGVEQDIIQFNEKTLWTGGP 60
Query: 61 G-----DYTNPDAPKA--LSDVR-SLVDSGQYAEATAASV---KLFGHPADVYQLLGDIE 109
G D+ P +A L+ VR S+ G + AA + K+ G+ YQ GD+
Sbjct: 61 GSVRGYDFGIPAESQASALAKVRDSIRKDGSISPEKAAELMGRKILGYGD--YQTFGDLI 118
Query: 110 LEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
L F ++ + Y R L L+ + Y V +TRE+F+S PD VIV ++S + G
Sbjct: 119 LSFPENDSGVIK--YNRRLSLDEGRVILGYQQEGVTYTREYFASYPDGVIVVRLSADKPG 176
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
+ V L + N Q+ R G ++ D+ G F+A I +
Sbjct: 177 QIHLRVGLRT--------PDNRQVTT--RIEGNQLDIVGELQDNKLG--FAA--RIAVVA 222
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMS-ALQSIRNL 288
+ G + + L+V+ +D ++ A++++ + + + + +S L +
Sbjct: 223 EGGNLDNSGQQSLQVKRADAVTIVFAAATNYAQRYPHYRQADASYAQQKISNTLAAALQK 282
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
+Y+ L RH DYQ L+ RV++ + + + T + K+ D S
Sbjct: 283 NYAQLLARHTQDYQSLYKRVALDIGQGVHSLATPALLAQ----------YKTGNAALDRS 332
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L + FQFGRYLLI+SSRPG+ ANLQG+WN ++P W++ HVNINL+MNYW + NL
Sbjct: 333 LEAIYFQFGRYLLIASSRPGSLPANLQGVWNNSITPPWNADYHVNINLQMNYWLAETANL 392
Query: 409 SECQEPLFDFLTYLSINGSKTAQ-VNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-P 464
E +P FDF+ L G+ +AQ + ++ GW + T+IW + G + W A W P
Sbjct: 393 PELMQPYFDFVDSLVEPGNISAQRIADVSKGWALFLNTNIWGFT----GVIDWPTAFWQP 448
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPE 523
GAWL H +EH+ ++ D+ FL RAYPL++G A F LD+L++ DG PS SPE
Sbjct: 449 EAGAWLAQHYYEHFLFSGDQAFLRNRAYPLMKGAAEFWLDFLVKDPRDGLWVVTPSFSPE 508
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
H P A +S D+ +R A AA V +K LV++ LK++ R +
Sbjct: 509 H---GPFTTGAAMSQQIVFDL--LRNTSEA---AALVGDKKFKRLVDQTLKNMD--RGIR 558
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP 643
I G + EW +D DP+ HRH+SHLF L PG I K P+L +AA TL RG+ G
Sbjct: 559 IGSWGQLQEWKEDIDDPKNDHRHISHLFALHPGRYIDPRKTPELLQAARTTLNARGDGGT 618
Query: 644 GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANF 703
GWS WK WARL D A++++ + + NL+ HPPFQID NF
Sbjct: 619 GWSQAWKVNFWARLLDGNRAHKVLG-----------EQLQRSTLPNLWDNHPPFQIDGNF 667
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
G TA VAEMLVQS + LPALP D W++G V+GL+ARGG T+ + W + L + +
Sbjct: 668 GATAGVAEMLVQSHNGVIEFLPALP-DAWATGNVRGLRARGGITLDMQWTNKSLTTLYLR 726
Query: 764 SNYS 767
SN++
Sbjct: 727 SNHT 730
>gi|254785612|ref|YP_003073041.1| alpha-L-fucosidase [Teredinibacter turnerae T7901]
gi|237683920|gb|ACR11184.1| alpha-L-fucosidase [Teredinibacter turnerae T7901]
Length = 814
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 269/774 (34%), Positives = 406/774 (52%), Gaps = 83/774 (10%)
Query: 15 ITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG------DYTNPD 67
+ F PA + + +PIGNG +GA++ G + E ++ NE +LW G PG P+
Sbjct: 44 LLFFSPASDWENQGLPIGNGAMGAVITGEINKELVQFNEKSLWEGGPGAQGYNFGLAAPN 103
Query: 68 APKALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAE-ETY 124
P L V+ + G A + +L P + YQ GD+ +E HL E + Y
Sbjct: 104 FPAKLKAVQQQLAKGAVLSAETVATQLGQDPTEYGNYQTFGDLIIE----HLHSTEVQDY 159
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
RR L++ A A V+Y++ V + RE+F+S PD+VIV +I+ + G+L+ NV L + +
Sbjct: 160 RRNLNIENALASVEYTITGVGYRREYFASFPDKVIVLQIASDKPGALNLNVGLHTSDNRS 219
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+N R+ N++ G++++A++E++ GT++ DK L++
Sbjct: 220 QLLNATTH----------RMSLSGALNNN--GLRYAAMVEVRTQS--GTVARTSDK-LQI 264
Query: 245 EGSDWAVLLLVASSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+D L+L ++ + P + P + + L S+ Y L +RH+ DY+
Sbjct: 265 RSADKVTLVLATATDYAPVYPTYRVASGAPSPLAVVETRLNSLTKKGYPLLKSRHITDYR 324
Query: 303 KLFHRVSIQLS--RSPKDIVTDTCSEENIDTVPSAERVKSFQTD---EDPSLVELLFQFG 357
LF RV++ L+ SP + DT P R++++ D +L L F +G
Sbjct: 325 SLFQRVTLNLTPNSSPNSVA---------DTKPLPARLEAYHKDTPENKRALETLYFNYG 375
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSR G+ ANLQG+WN +P W++ HVNINL+MNYW +L NLSE PL+D
Sbjct: 376 RYLLIASSRAGSLPANLQGVWNHSNTPPWNADYHVNINLQMNYWPALVTNLSETTPPLYD 435
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-PMGGAWLCTHL 474
F+ L G K+AQ +GW + T+I+ S G + W A W P AWL
Sbjct: 436 FVDALRAPGEKSAQTLGADAGWAVLLNTNIFGFS----GLISWPTAFWQPEANAWLMRLY 491
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
++ Y +T D+ FL +RAYP ++ + F + +L + DG NPS SPEH
Sbjct: 492 FDFYQFTGDKKFLRERAYPAMKSTSQFWMTFLTQ-RDGTYWVNPSYSPEH---------G 541
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT----KIAEDGSI 590
S ++M I+ E+F +AAE+L+ + A + LK P L+ T +I + G +
Sbjct: 542 PFSEGASMSQQIVSELFRNTHAAAEMLKDRQFA---RSLK--PFLQNTDDGLRIGKWGQL 596
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
EW QD DP HRH+SHL+ L+PG+ I+ P+ KAA+ TL RG+ G GWS WK
Sbjct: 597 QEWQQDLDDPTSQHRHISHLYALYPGNQISNADTPEYFKAAKTTLNARGDSGTGWSKAWK 656
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL + + A +++ + E NL+ HPPFQID NFG TA +A
Sbjct: 657 INLWARLREGDRALKLL-----------SEQLEHSTLQNLWDNHPPFQIDGNFGATAGIA 705
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS + LLPALP W++G V GL+AR G TV I WK L + + S
Sbjct: 706 EMLIQSHRGKIELLPALP-QAWANGSVTGLRARTGITVDIYWKQHQLEKAELSS 758
>gi|189464509|ref|ZP_03013294.1| hypothetical protein BACINT_00851 [Bacteroides intestinalis DSM
17393]
gi|189438299|gb|EDV07284.1| hypothetical protein BACINT_00851 [Bacteroides intestinalis DSM
17393]
Length = 817
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 284/805 (35%), Positives = 408/805 (50%), Gaps = 93/805 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G V +E + LNE TLW G P DY N + L ++R
Sbjct: 64 SLPIGNGSLGANILGSVAAERITLNEKTLWRGGPNTSGGADYYWNVNKQSAPILKEIRQA 123
Query: 79 VDSGQYAEATAASVKLFG----------HPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
G +A + K F HP + +G++ +E D S L+ + YRR
Sbjct: 124 FTEGNGEKAAQLTRKNFNGLAAYEEKDEHPFRFGSFTTMGELYIETDLSELRM--KNYRR 181
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
L L++A A V++ V++ R++F S PD V+ + S ++G + +S + S
Sbjct: 182 ILSLDSAMAVVQFDKEGVQYRRKYFISYPDSVMAMEFSADKAGKQNLVLSYAPNPEAQSN 241
Query: 187 V--NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+ +G + ++ G + G++F+ IK GT+ A D+ L V
Sbjct: 242 IRTDGTDGLVYTGVL-------------NNNGMKFA--FRIKAIAKGGTVIAQNDR-LIV 285
Query: 245 EGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+G+D V LL A + +F+ F NP DP + S + Y L H
Sbjct: 286 KGADRVVFLLTADTDYKMNFNPDFKNPKTYVGDDPELTTQSMMNQALLKGYETLANNHKA 345
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF+RV + L+ P +D +P+ +R+ +++ + D L EL +QFGR
Sbjct: 346 DYTALFNRVKLTLN--PDVTGSD---------LPTYQRLANYRKGQPDFRLEELYYQFGR 394
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ +L W H NIN++MNYW + P NLSEC PL DF
Sbjct: 395 YLLIASSRPGNLPANLQGMWHNNLDGPWRVDYHNNINIQMNYWPAGPTNLSECTWPLIDF 454
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-WALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ +S +++ W PM G WL TH+WE+
Sbjct: 455 IRGLVKPGEKTAQAYFAARGWTASISANIFGFTSPLSSEIMAWNFNPMAGPWLATHIWEY 514
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT DR+FL++ Y L++ A F +D+L DG PSTSPEH V
Sbjct: 515 YDYTRDRNFLKEVGYDLIKSSAQFTVDYLWHKPDGTYTAAPSTSPEH---------GPVD 565
Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+T A++RE+ I A++VL + E ++VL L P KI G ++EW++
Sbjct: 566 EGATFVHAVVREILLDAIEASKVLGVDSRERKHWQEVLA---HLVPYKIGRYGQLLEWSK 622
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D DP HRH++HLFGL PG T++ P+L KAA L+ RG+ GWS+ WK WA
Sbjct: 623 DIDDPNDKHRHVNHLFGLHPGRTLSPVTTPELAKAARIVLEHRGDGATGWSMGWKLNQWA 682
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D HAY + L + G NL+ H PFQID NFG TA V EML+Q
Sbjct: 683 RLQDGNHAYTLFGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGVTEMLLQ 731
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS-------N 768
S + + LLPALP D W G V GL A+G VSI WK+ L E + S
Sbjct: 732 SHMGFIQLLPALP-DAWKDGVVSGLCAKGNFEVSISWKNNRLDEAILVSKAGAPCTVRYE 790
Query: 769 NDHDSFKTLHYRGTSVKVNLSAGKI 793
+ SFKT+ +G + KV + K+
Sbjct: 791 DKTLSFKTV--KGKTYKVKVDGDKL 813
>gi|313149260|ref|ZP_07811453.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
gi|313138027|gb|EFR55387.1| glycoside hydrolase family 95 [Bacteroides fragilis 3_1_12]
Length = 829
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 265/801 (33%), Positives = 404/801 (50%), Gaps = 89/801 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P N + L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSKGAEAYWNVNKQSAHVLDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + + Y
Sbjct: 135 QAFSEGDQKKAAMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYVETGLSAVNMS--GY 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R +F S P V+ + G +L+F+ + + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERSYFISYPANVMAIRFKADRPGKQNLTFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G+N + A+ D G+Q+ ++ I + GT+S D K+
Sbjct: 253 GSMTTDGSNGLTY-------------TAHLDNNGMQY--VVRIYATTKGGTLSN-ADGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D AV L+ A + +FD F +P +P + + + ++ Y L+ +H
Sbjct: 297 TVKDADEAVFLITADTDYKINFDPDFKDPKTYVGVNPAETTRQWMDNAVSMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ ++ +P+A+R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLN-----------PDQQSTNLPTAKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + P NL+EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACPTNLNECTLPLV 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTAPLESEDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P KI G +MEW
Sbjct: 577 IDEGTTFVHAVIREILLDAIEASKVLGVDSKERKQWQEVLA---HLAPYKIGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP+ HRH++HLFGL PGHT++ PDL KAA L+ RG+ GWS+ WK
Sbjct: 634 SKDIDDPKDEHRHVNHLFGLHPGHTLSPITTPDLAKAARVVLEHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY++ L + G NL+ HPPFQID NFG TA + EML
Sbjct: 694 WARLQDGNHAYKLFGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + + LLPALP D W G + G+ A+G V + WK+G L E ++S
Sbjct: 743 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDMSWKNGQLAEATVFSKAGEP---- 797
Query: 774 FKTLHYRGTSVKVNLSAGKIY 794
T+ Y ++ S GK+Y
Sbjct: 798 -CTVRYGDKTLSFKTSKGKVY 817
>gi|393784536|ref|ZP_10372699.1| hypothetical protein HMPREF1071_03567 [Bacteroides salyersiae
CL02T12C01]
gi|392665517|gb|EIY59041.1| hypothetical protein HMPREF1071_03567 [Bacteroides salyersiae
CL02T12C01]
Length = 818
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 269/775 (34%), Positives = 389/775 (50%), Gaps = 96/775 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G V +E + LNE TLW G P DY N + + ++R
Sbjct: 63 SLPIGNGSLGANILGSVAAERITLNEKTLWKGGPNTAGGADYYWKVNKQSASVMEEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETYRR 126
G Y +A + K F A + +G+I +E S + ++ Y R
Sbjct: 123 FTDGDYEKAELLTRKNFNGLAHYEEGDETPFRFGSFTTMGEIYVETGLSEIGMSD--YYR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
L L++A A V + N + R++F S PD V+ K + +++G
Sbjct: 181 ALSLDSAMAVVSFKKDNTRYMRKYFISYPDSVMAMKFTANKTGK---------------- 224
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE-------IKI-SDDRGTISALE 238
Q ++ CP A DD G+ ++ +LE I+I + +G + +E
Sbjct: 225 -----QNLVLRYCPNSEAKSSLCA-DDTDGLLYTGVLENNGMKFAIRIKAITKGGTTTVE 278
Query: 239 DKKLKVEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDL 293
+L V+ +D V LL A + +F F +P DP + ++ Y +L
Sbjct: 279 QDRLIVKDADEVVFLLTADTDYKMNFQPDFKDPKTYVGSDPEQTTRKTMEGAIRKGYDEL 338
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVEL 352
Y H DY LF+RV +QL+ E +P+ R+ +++ + D L EL
Sbjct: 339 YRAHEADYTSLFNRVKLQLN-----------PEVTARNLPTNLRLANYRKGQADYRLEEL 387
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+Q+GRYLLI+ SR G ANLQG+W+ +L+ W H NIN++MNYW + NL EC
Sbjct: 388 YYQYGRYLLIACSRSGNMPANLQGMWHNNLNGPWRVDYHNNINIQMNYWPACSTNLGECT 447
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLC 471
PL DF+ L G++TA+ + A GW +I+ +S + + W PM G WL
Sbjct: 448 RPLVDFIRSLVKPGAETAKAYFNARGWTASISANIFGFTSPLSSEDMSWNFNPMAGPWLA 507
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
TH+WE+Y+YT D++FL+ Y LL+ A F +D+L DG PSTSPEH
Sbjct: 508 THIWEYYDYTRDKEFLKSTGYDLLKSSAQFTVDYLWHKPDGTYTAAPSTSPEH------- 560
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGS 589
V +T A++RE+ I A++VL +K E E VL L P KI G
Sbjct: 561 --GPVDEGTTFVHAVVREILLNAIEASKVLGVDKKERKEWEYVLAHLA---PYKIGRYGQ 615
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+MEW++D DPE HRH++HLFGL PGHT++ P+L +AA L+ RG+ GWS+ W
Sbjct: 616 LMEWSRDIDDPEDEHRHVNHLFGLHPGHTLSPVTTPELAQAARVVLEHRGDGATGWSMGW 675
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K WARL D HAY++ L + G NL+ H PFQID NFG TA +
Sbjct: 676 KLNQWARLQDGNHAYKLYGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGI 724
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS + + LLPALP D W G V G+ ARGG V++ WKDG L E + S
Sbjct: 725 TEMLLQSHMGFIQLLPALP-DAWQDGSVSGICARGGFEVNLSWKDGKLAEAVVTS 778
>gi|325678667|ref|ZP_08158277.1| hypothetical protein CUS_6446 [Ruminococcus albus 8]
gi|324109717|gb|EGC03923.1| hypothetical protein CUS_6446 [Ruminococcus albus 8]
Length = 761
Score = 426 bits (1094), Expect = e-116, Method: Compositional matrix adjust.
Identities = 271/752 (36%), Positives = 387/752 (51%), Gaps = 64/752 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
KI F PA+ + A+P+GNGR+G M +G E ++LNED++++G NP A + L
Sbjct: 10 KIWFKAPAEDWNVALPVGNGRIGGMCFGQPLYEKIQLNEDSIFSGGQRKRNNPSARENLE 69
Query: 74 DVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ + AEA ++ F G P + Y LGD+ ++ HL+ E R LDL
Sbjct: 70 KVRQLLKEEKIAEAEKIVLEAFCGTPVNQRHYMPLGDLVIQ---HHLESECEYKCRSLDL 126
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---LLDNHSYV 187
A +YS+ V + R S P QV+ I+ +S S+S ++LD D++S +
Sbjct: 127 ENAVCTAEYSIKGVNYVRRVICSEPAQVMAINITADKSASISLKLTLDGRDDYFDDNSPM 186
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N + I+ G C G+ GI F+A L ++ G++ + E
Sbjct: 187 N-DTDILYYGGCGGE------------DGINFAAYL--RVIGVGGSVHRW-GSSIVTEDC 230
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +L+ +S+ SD KK + ++A + + +L H++DY+ F R
Sbjct: 231 DSVTILIGVQTSY-----RVSDYKKSAELDVITAAEK----DFEELLKEHIEDYRSYFDR 281
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
+IV D E D++P+ ER+K + D LV L F FGRYL+IS SR
Sbjct: 282 T---------EIVFD---EGGNDSLPTDERLKLVKEGGVDNGLVSLYFDFGRYLMISGSR 329
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
GT NLQGIWN+D+ P W VNIN EMNYW + ++ + PLFD + + NG
Sbjct: 330 EGTLPLNLQGIWNKDMWPAWGCRFTVNINTEMNYWLAEVADMGDLHMPLFDHIERMRPNG 389
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TA+ Y G+V HH TDIW ++ + W G AWLCTH+WEH+ Y+ DR+F
Sbjct: 390 RATAREMYGCGGFVCHHNTDIWGDTAPQDLWMPGTQWVTGAAWLCTHIWEHWLYSRDREF 449
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L ++ Y L+ + F +D+LI+ G L T PS SPE+ +I G V +MD I
Sbjct: 450 LAEK-YDTLKEASLFFVDFLIDNGKGQLVTCPSVSPENTYITASGAKGSVCMGPSMDSQI 508
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
I E+F+A+I A EVL + D EK+ +L +I + G IMEWA+D+ + E HRH
Sbjct: 509 IYELFTAVIEAGEVLGIDAD-YREKLKGMREKLPKPQIGKYGQIMEWAEDYDEAEPGHRH 567
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHA 663
+S LF L+P I+ K P+L AA T+++R G GWS W WARLHD
Sbjct: 568 ISQLFALYPADIISYRKTPELAAAARATIERRLAHGGGHTGWSRAWIINHWARLHDGVKV 627
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
+ L E NLF HPPFQID NFG A +AE L+QS ++ L
Sbjct: 628 KENIAAL-----------LENSTSDNLFDMHPPFQIDGNFGAAAGIAESLLQSECGEIEL 676
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
LPA D W +G +GL+ARGG V W DG
Sbjct: 677 LPAASPD-WKNGHFRGLRARGGFAVDCDWADG 707
>gi|154503234|ref|ZP_02040294.1| hypothetical protein RUMGNA_01058 [Ruminococcus gnavus ATCC 29149]
gi|153796228|gb|EDN78648.1| hypothetical protein RUMGNA_01058 [Ruminococcus gnavus ATCC 29149]
Length = 784
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 269/818 (32%), Positives = 405/818 (49%), Gaps = 90/818 (11%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
I F A+ + A PIGNG LGAMV+G V E +++NED++W+G + NPDA + L
Sbjct: 20 IYFRKEAEEWNQAFPIGNGFLGAMVFGNVAKERIQVNEDSVWSGGFSNRVNPDAGRYLEK 79
Query: 75 VRSLVDSG--QYAEATAASVKLFGHP-ADVYQLLGDIELEF-----------DDSHLKYA 120
+R + G Q AE A P VYQ LGDI + F D+S L Y
Sbjct: 80 IRECLFEGNVQEAEKLAEQSMYATSPNMRVYQPLGDIWIRFMDQEAERKLARDESGLPYL 139
Query: 121 EET------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
+E+ Y+R L+L A +++Y VG ++ RE F+SNP +V + I ++
Sbjct: 140 KESAAEVEAYQRILNLEQAVGKIEYCVGRTKWNREFFASNPAKVAMYSICAESGEDINLE 199
Query: 175 VSLDSLLDNHS----------YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
+S + DN S N I +EG G+ +GI F+ +
Sbjct: 200 ISA-TRKDNRSGRGVSFCDRILAEENQYIWLEGSSGGR------------EGIGFA--MG 244
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
+++ G + ++ VE + ++ ++F +P L S
Sbjct: 245 VRVCSCGGRQYQM-GSRIIVEKARKVLICFTGRTTF---------RSAEPKQWCREHLAS 294
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
+ +Y++ H+ DYQ F+ + + E N+D + + ER+K +
Sbjct: 295 LSLDTYAERKREHIQDYQTYFNASRLTFRQ-----------EMNLDNLTTPERLKRIREG 343
Query: 345 E-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
D LV L + F RYLLISSSR G+ ANLQGIWNE+ P W S +NIN++MNYW +
Sbjct: 344 HHDIGLVNLYYDFARYLLISSSREGSLPANLQGIWNEEFEPMWGSKYTININIQMNYWMA 403
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
L PL + L + G + A Y G+ HH TDIW + +W
Sbjct: 404 EKTGLQALHLPLLEHLKRMHPRGKEVAASMYHVEGFCCHHNTDIWGDCAPQDYHTSSTIW 463
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
PMGGAWLC H++EHY YT D+ FLE+ +P+L+ F ++++++ DG T PS+SPE
Sbjct: 464 PMGGAWLCLHIYEHYQYTKDKGFLEE-YFPILKDSVQFFMNYMVQNSDGKWVTGPSSSPE 522
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRP 581
+ +I + C+ TMD+ I+RE+FS + E+LEK E LV+ +++LP+L
Sbjct: 523 NIYITAKNQYGCLCMGPTMDIEIVRELFSNYLKTVEILEKEEPLTGLVKDRIENLPKL-- 580
Query: 582 TKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
K+ + G I EW QD+++ EV HRH+S LF L+P I ++ P L +AAEKTL +R E
Sbjct: 581 -KVGKYGQIQEWDQDYEELEVGHRHISQLFALYPAQQIRKDQTPKLAQAAEKTLDRRLEN 639
Query: 642 G---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G GWS W +ARL +E AY+ ++ L E L NL HPPFQ
Sbjct: 640 GGGHTGWSKAWIILFFARLWKKEKAYQNLQELLA----------EATL-DNLLDNHPPFQ 688
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG + EM+VQ + +YLLPALP + G V G++ + G +++ W +
Sbjct: 689 IDGNFGGACGILEMIVQDYQDVVYLLPALP-QEMPDGNVSGIRTKSGFILNMEWSGCRVK 747
Query: 759 EVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTF 796
V + S + +TL R ++ K+ F
Sbjct: 748 SVEVESVHGTQITIVNETLESR--KIRCEKGEKKVIVF 783
>gi|346972979|gb|EGY16431.1| alpha-L-fucosidase [Verticillium dahliae VdLs.17]
Length = 765
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 273/776 (35%), Positives = 394/776 (50%), Gaps = 86/776 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + ++ PA +++A+P+GNGRLG MV+G +E L+LNED++W G P D T DA + L
Sbjct: 8 LTLHYDAPAASWSEALPVGNGRLGGMVYGRTSTELLQLNEDSVWYGGPQDRTPRDARRHL 67
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADVY--QLLGDIELEFDDSHLKYAEETYRRELD 129
+R L+ ++A A A F PA + + LG+ LEF H YRR LD
Sbjct: 68 DTLRQLIRDEKHAAAEALVREAFFATPASMRHSEPLGNCTLEF--GHEAQDVTGYRRSLD 125
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--------LDSLL 181
L TA A V+Y V + RE +S PD V+ + S SE ++ + L
Sbjct: 126 LATAQATVEYQCTGVSYRRETIASFPDNVVALRFSASEPTRFVVRLNRVSEIEWETNEFL 185
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI--SDDRGTISALED 239
D+ NG +I++ GK N +P S +L I +D+ G+I A+ +
Sbjct: 186 DSIQAANG--RIVLNATPGGK--------NSNP----LSLVLGISCDANDEGGSIEAVGN 231
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
L++ A S + + K DP + + + S+ +L R
Sbjct: 232 -----------ALVVKAFSCTIAIAAHTTYRKADPEAAARQDVDKALKRSWHELVLRQRT 280
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DY LF R S+++ + D+ P+ ER+ + + DP LV L + +GRY
Sbjct: 281 DYASLFQRSSLRMWPAAHDL-------------PTNERI---EKNRDPGLVALYYNYGRY 324
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR + A LQGIWN +P W +NINL+MNYW + PCNL +C P+
Sbjct: 325 LLISSSRDSDKALPATLQGIWNPSFAPPWGCKYTININLQMNYWLAAPCNLVDCALPMLG 384
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ +++ G+KTA+ Y GW HH TDIWA + + +WP+GG WLC + E
Sbjct: 385 LVERMAVRGAKTARTMYDCGGWCAHHNTDIWADTDPQDRWMPSTIWPLGGVWLCIDVLEM 444
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACV 536
Y DR L +RA LLEGC FLLD+LI G +L TNPS SPE+ F++ G +
Sbjct: 445 LLYQYDRK-LHERAAVLLEGCIVFLLDFLIPSACGKFLVTNPSLSPENTFVSKSGDTGIL 503
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA-Q 595
S +D IIR F + + +L+K + LV +V ++ RL I DG I EW +
Sbjct: 504 CEGSAIDTTIIRIAFEKFLWSTAILDKG-NPLVPEVRDAMARLPNLTINNDGLIQEWGLK 562
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTA 652
D+K+ E HRH+SHLFGL+PG +I+ +P+L AA+K L +R G GWS W
Sbjct: 563 DYKEHEPGHRHVSHLFGLYPGESISPVTSPELAAAAKKVLDRRAAHGGGHTGWSRAWLLN 622
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L ARLHD + + L + N+ HPPFQID NFG A + E
Sbjct: 623 LHARLHDADGCGVHMDSL-----------LKSSTLPNMLDNHPPFQIDGNFGGAAGILEC 671
Query: 713 LVQSTLN---------DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
+VQS + ++ LLPA P D WS G ++G++ +GG VS+ W DG + E
Sbjct: 672 IVQSRIVWGASRPDCIEIRLLPACP-DAWSIGELRGVRVKGGWLVSLAWIDGRIEE 726
>gi|336432957|ref|ZP_08612787.1| hypothetical protein HMPREF0991_01906 [Lachnospiraceae bacterium
2_1_58FAA]
gi|336017627|gb|EGN47385.1| hypothetical protein HMPREF0991_01906 [Lachnospiraceae bacterium
2_1_58FAA]
Length = 768
Score = 425 bits (1093), Expect = e-116, Method: Compositional matrix adjust.
Identities = 269/818 (32%), Positives = 405/818 (49%), Gaps = 90/818 (11%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
I F A+ + A PIGNG LGAMV+G V E +++NED++W+G + NPDA + L
Sbjct: 4 IYFRKEAEEWNQAFPIGNGFLGAMVFGNVAKERIQVNEDSVWSGGFSNRVNPDAGRYLEK 63
Query: 75 VRSLVDSG--QYAEATAASVKLFGHP-ADVYQLLGDIELEF-----------DDSHLKYA 120
+R + G Q AE A P VYQ LGDI + F D+S L Y
Sbjct: 64 IRECLFEGNVQEAEKLAEQSMYATSPNMRVYQPLGDIWIRFMDQEAERKLARDESGLPYL 123
Query: 121 EET------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFN 174
+E+ Y+R L+L A +++Y VG ++ RE F+SNP +V + I ++
Sbjct: 124 KESAAEVEAYQRILNLEQAVGKIEYCVGRTKWNREFFASNPAKVAMYSICAESGEDINLE 183
Query: 175 VSLDSLLDNHS----------YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
+S + DN S N I +EG G+ +GI F+ +
Sbjct: 184 ISA-TRKDNRSGRGVSFCDRILAEENQYIWLEGSSGGR------------EGIGFA--MG 228
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
+++ G + ++ VE + ++ ++F +P L S
Sbjct: 229 VRVCSCGGRQYQM-GSRIIVEKARKVLICFTGRTTF---------RSAEPKQWCREHLAS 278
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
+ +Y++ H+ DYQ F+ + + E N+D + + ER+K +
Sbjct: 279 LSLDTYAERKREHIQDYQTYFNASRLTFRQ-----------EMNLDNLTTPERLKRIREG 327
Query: 345 E-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
D LV L + F RYLLISSSR G+ ANLQGIWNE+ P W S +NIN++MNYW +
Sbjct: 328 HHDIGLVNLYYDFARYLLISSSREGSLPANLQGIWNEEFEPMWGSKYTININIQMNYWMA 387
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
L PL + L + G + A Y G+ HH TDIW + +W
Sbjct: 388 EKTGLQALHLPLLEHLKRMHPRGKEVAASMYHVEGFCCHHNTDIWGDCAPQDYHTSSTIW 447
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
PMGGAWLC H++EHY YT D+ FLE+ +P+L+ F ++++++ DG T PS+SPE
Sbjct: 448 PMGGAWLCLHIYEHYQYTKDKGFLEE-YFPILKDSVQFFMNYMVQNSDGKWVTGPSSSPE 506
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE--DALVEKVLKSLPRLRP 581
+ +I + C+ TMD+ I+RE+FS + E+LEK E LV+ +++LP+L
Sbjct: 507 NIYITAKNQYGCLCMGPTMDIEIVRELFSNYLKTVEILEKEEPLTGLVKDRIENLPKL-- 564
Query: 582 TKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
K+ + G I EW QD+++ EV HRH+S LF L+P I ++ P L +AAEKTL +R E
Sbjct: 565 -KVGKYGQIQEWDQDYEELEVGHRHISQLFALYPAQQIRKDQTPKLAQAAEKTLDRRLEN 623
Query: 642 G---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G GWS W +ARL +E AY+ ++ L E L NL HPPFQ
Sbjct: 624 GGGHTGWSKAWIILFFARLWKKEKAYQNLQELLA----------EATL-DNLLDNHPPFQ 672
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG + EM+VQ + +YLLPALP + G V G++ + G +++ W +
Sbjct: 673 IDGNFGGACGILEMIVQDYQDVVYLLPALP-QEMPDGNVSGIRTKSGFILNMEWSGCRVK 731
Query: 759 EVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTF 796
V + S + +TL R ++ K+ F
Sbjct: 732 SVEVESVHGTQITIVNETLESR--KIRCEKGEKKVIVF 767
>gi|255936621|ref|XP_002559337.1| Pc13g09120 [Penicillium chrysogenum Wisconsin 54-1255]
gi|211583957|emb|CAP91981.1| Pc13g09120 [Penicillium chrysogenum Wisconsin 54-1255]
Length = 740
Score = 425 bits (1092), Expect = e-116, Method: Compositional matrix adjust.
Identities = 273/758 (36%), Positives = 400/758 (52%), Gaps = 72/758 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + PA+ + A+P+GNGRLGAMV+G +E L+LNED++W G P D DA + L
Sbjct: 3 ELWYQQPAEDWNSALPVGNGRLGAMVYGRTDTEMLQLNEDSVWYGGPQDRNPQDALEYLP 62
Query: 74 DVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+R + +G +AEA A + F +P+ Y+ LG++ L D H YRR LDL
Sbjct: 63 RLREAIRAGNHAEAEKIAKLAFFANPSSQRNYEPLGNLFL--DLGHDPSQVTGYRRSLDL 120
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD--NHSYVN 188
+ATA V Y V + R+ +S PD VI K+ S ++ S L+ H +++
Sbjct: 121 TSATAHVSYEYQGVRYERQVLASYPDDVIAIKMYSSSRAEFVVRLTRMSELEFETHEWLD 180
Query: 189 G----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N I M GK N+N + ++ I+ TI+ + + L V
Sbjct: 181 DVSATGNSITMHVTPGGK------NSN------RACCMVSIRCDGAESTITRVGNN-LVV 227
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
SD A+L++ A ++F +D +M ++ D+ RH+ DYQ L
Sbjct: 228 NSSD-ALLVVAAQTTF---------RHEDNDQRTMQDAENALGFPLEDIRARHVADYQSL 277
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
++R+ +QL +I TD +R+KS + DP L+ L + RYLLIS
Sbjct: 278 YNRMELQLGPDSPEIPTD-------------QRLKSLR---DPGLIALYHNYNRYLLISC 321
Query: 365 SRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SR + ANLQGIWN P W S +N+NL+MNYW + NLSEC+ PLFD L +
Sbjct: 322 SRDRHKSLPANLQGIWNPSFHPAWGSRFTINVNLQMNYWSANMGNLSECELPLFDLLERM 381
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G TA++ Y GW H TDIWA ++ + ++WP+GGAWLC H+W+H+ YT
Sbjct: 382 VEPGKVTARIMYGCRGWTAHPNTDIWADTAPFDRWMPASIWPLGGAWLCYHIWDHFRYTG 441
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSST 541
D++FL +R +P L GC FLLD+LIE +G YL T+PSTSPE+ F G+ + ST
Sbjct: 442 DQNFL-RRMFPTLRGCVEFLLDFLIEDANGEYLVTSPSTSPENSFYDGKGQKGVLCEGST 500
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
+D+ II + A S A+ L EDA++ V + R+ P +++ G + EWA D+ + E
Sbjct: 501 IDIQIIDAILDAFQSCAKSLGL-EDAILPAVQATRSRIPPMRVSPAGYLQEWASDYAEVE 559
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLH 658
HRH SHL+ L PG+ IT + P L +A L++R E G GWS W L ARL
Sbjct: 560 PGHRHTSHLWALHPGNAITPAQTPQLAEACGVVLRRRAEHGGGHTGWSRAWLLNLHARLL 619
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS-T 717
+ E + L + NL +HPPFQID NFG A + EMLVQS
Sbjct: 620 EAEECSGHLDLLLSR-----------STLPNLLDSHPPFQIDGNFGGGAGIIEMLVQSHE 668
Query: 718 LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
+ +LPA P D W +G ++G++ARGG + +++G
Sbjct: 669 PGVIRILPACPKD-W-TGSIRGVRARGGFELQFNFENG 704
>gi|312621676|ref|YP_004023289.1| alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
gi|312202143|gb|ADQ45470.1| Alpha-L-fucosidase [Caldicellulosiruptor kronotskyensis 2002]
Length = 786
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 275/791 (34%), Positives = 401/791 (50%), Gaps = 87/791 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
+P ++F PA + +A+P+GNGRLGAMV+G E ++LN+D+LW+G D NP +
Sbjct: 3 HPYHLSFYKPASTWYEALPLGNGRLGAMVYGHTAVERIQLNDDSLWSGTFIDRNNPSLKE 62
Query: 71 ALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAE------ 121
L ++R LV G A ++ + G PA + Y LG++++ + HL +A
Sbjct: 63 KLPEIRRLVLVGDLYHAEELIMQYMVGTPASMRHYTTLGELDIALN-QHLPFATGWIPNS 121
Query: 122 ---ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
E Y +LDL + + V + RE F S P QV+ + + G+++ ++ LD
Sbjct: 122 NGCEDYYCDLDLMNGILSITHRQAGVRYCREMFVSYPAQVMCIRFVSEKPGTINMDIMLD 181
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIP---PKANANDDPKGIQFSAILEIKISDDRGTIS 235
+ + ++ + + R PG+R+ P N + F ++ + RG S
Sbjct: 182 RTVIS-------DETVPDERRPGQRVRRGWPTVN-------VDFIRTMDERTILMRGNES 227
Query: 236 ALE---------DKKLKVEGSDW------AVLLLVASSSFDGPFINPSDSKKDPTSESMS 280
+E D KL+ S V+L +ASS+ ++ +DP SE
Sbjct: 228 GVEFATAVRVVCDGKLQNPVSQLLARNCGEVILYLASST--------TNRSEDPVSEVFR 279
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
L + Y L H++D+ L R + L SP P+ ER+ +
Sbjct: 280 LLDAAEKKGYVALREEHINDFSNLMWRCVLDLGPSPDK--------------PTDERIAA 325
Query: 341 FQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
+ D DP+L L FQ GRYL++S SR G+ NLQGIWN D P WDS +NINL+MN
Sbjct: 326 LRAGDNDPALAALYFQLGRYLIVSGSREGSAPLNLQGIWNADFMPIWDSKYTLNINLQMN 385
Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV 459
YW CNLSE PL + L + G +TA+V Y G V HH TD + + +
Sbjct: 386 YWPVEICNLSELHMPLMELLGKMHEKGRETARVMYGMRGMVCHHNTDFYGDCAPQDRYMA 445
Query: 460 WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPS 519
W +GGAWL H+WEHY +T D +FL + YP+L A F D+LIE DG L T PS
Sbjct: 446 ATPWVIGGAWLGLHVWEHYLFTKDLNFL-REMYPILRDIAMFYEDFLIE-VDGKLVTCPS 503
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPE+ +I PDG + S MD I+RE+F+A I AA +L +++ L EK L+ RL
Sbjct: 504 VSPENRYILPDGYDTPMCVSPAMDNQILRELFAACIEAANLLGVDQE-LTEKWLEISQRL 562
Query: 580 RPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
KI G ++EW Q++ + H+SHLF +PG I P+L A K+L+ R
Sbjct: 563 PKDKIGSKGQLLEWDQEYPELTPGMGHVSHLFACYPGKGINWRDTPELMNAVRKSLELRM 622
Query: 640 EEGP---GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP 696
E G GW + W ++ARL D E ++++R+ L+D NL A P
Sbjct: 623 EHGAGKKGWPLAWYINIFARLLDGEMTDKLIRRM--LIDSTAR---------NLLNATPI 671
Query: 697 FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
FQID N G TA +AE L+QS + ++ LPALP W G VKGL+ARGG V I WK G
Sbjct: 672 FQIDGNLGATAGIAECLLQSHIA-VHFLPALP-VSWQEGSVKGLRARGGHEVDIKWKGGK 729
Query: 757 LHEVGIYSNYS 767
L E + ++
Sbjct: 730 LVEAVVTPQFT 740
>gi|345569032|gb|EGX51901.1| hypothetical protein AOL_s00043g635 [Arthrobotrys oligospora ATCC
24927]
Length = 723
Score = 423 bits (1087), Expect = e-115, Method: Compositional matrix adjust.
Identities = 266/758 (35%), Positives = 392/758 (51%), Gaps = 82/758 (10%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
MV+G +E L+LNED++W G P D A + L ++R L+ G+ EA A F
Sbjct: 1 MVYGQTTTEVLQLNEDSVWYGGPQDRLPKAALQNLPELRRLIREGRQKEAEALVRAAFFA 60
Query: 97 HPADVY--QLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
+P+ + LG + L+FD + YRRELD++ A +RV+YS +++ RE +S
Sbjct: 61 YPSSQRHSEPLGTLHLDFDYGYQGIDIRDYRRELDISQAISRVQYSCNGIQYQREAIASY 120
Query: 155 PDQVIVTKISGSESGSLSFNVS--------LDSLLDNHSYVNGNNQIIMEGRCPGKRIPP 206
PDQVI +S S+S + ++ + LD + +G +IIM
Sbjct: 121 PDQVIGINLSSSQSSKYTIRLNRVSEREYETNEFLDTLTTRDG--KIIM----------- 167
Query: 207 KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN 266
+A G + ++ + +D G + L + L V G + +LL + ++F
Sbjct: 168 --HATPGGGGSRLCCVVSARSNDPDGRVQVLGNT-LVVTGKS-STILLASQTTF------ 217
Query: 267 PSDSKKDPTSESMSALQSIRNL-SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCS 325
+DP ++AL I S++ + RHL DY+ L+ RV ++LS I TD
Sbjct: 218 ---RVEDP---ELAALGDIEKCGSWTQILDRHLKDYKNLYGRVCLKLSSDDSHIPTDL-- 269
Query: 326 EENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLS 383
Q DP LV L +GRYLLIS SRPG + A LQGIWN
Sbjct: 270 --------------RLQRKPDPGLVGLYHNYGRYLLISCSRPGDKALPATLQGIWNPSFQ 315
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHH 443
P W S +NIN +MNYW + NL EC+ PLF+ L + +NG++TA+ Y GW HH
Sbjct: 316 PPWGSKYTININTQMNYWPANISNLPECETPLFELLERVQVNGARTAKEMYGCRGWCAHH 375
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
TDIWA ++ + LWP+GGAWLCTH+WE Y + D+ FL+ R +P+LEGC FLL
Sbjct: 376 NTDIWADTNPQDKWMPATLWPLGGAWLCTHIWERYLFFEDKSFLQ-RLFPVLEGCVRFLL 434
Query: 504 DWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEK 563
D+LI+ G+ TNPS SPE+ F G+ +STMD+ I+ VF A I++ +LE
Sbjct: 435 DFLIKDDHGFYVTNPSLSPENTFKNQRGEEGVFCEASTMDIQILTAVFKAYITSCHILEG 494
Query: 564 NEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ-DFKDPEVHHRHLSHLFGLFPGHTITIE 622
+ +V K+L L P ++ G + EW + D+++ E HRH SHL+GL PG +IT
Sbjct: 495 LGTVDMAEVNKALAGLPPVIVSSTGLLQEWGRNDYEEVEPGHRHTSHLWGLHPGDSITPA 554
Query: 623 KNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHE 679
P+ +AA L +R G GWS W L ARL E + ++ L
Sbjct: 555 STPEFAEAASAVLTRRAAHGGGHTGWSRAWLINLHARLGQAEKSKEHIELL--------- 605
Query: 680 KHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS--TLND---LYLLPALPWDKWSS 734
NL HPPFQID NFG +A + EM+VQS +N + LLPA P + W +
Sbjct: 606 --LRKSTLPNLLDDHPPFQIDGNFGGSAGIIEMIVQSHEIVNGERVVRLLPAWPLE-WGN 662
Query: 735 GCVKGLKARGGETVSICWKDGDLH-EVGIYSNYSNNDH 771
G V+G++ RG ++ W+DG + V + S +++N +
Sbjct: 663 GRVEGIRVRGAAAITFEWRDGRIEGPVLVESEFASNKY 700
>gi|317057786|ref|YP_004106253.1| alpha-L-fucosidase [Ruminococcus albus 7]
gi|315450055|gb|ADU23619.1| Alpha-L-fucosidase [Ruminococcus albus 7]
Length = 756
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 265/751 (35%), Positives = 384/751 (51%), Gaps = 64/751 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
+I F PA+ + A+P+GNGR+G M +G +E ++LNED++W+G P N A L
Sbjct: 5 RIWFRRPAEDWNVALPVGNGRIGGMCFGQALNEKIQLNEDSVWSGGPRKRNNASARANLE 64
Query: 74 DVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ + AEA ++ F G P + Y LGD+ ++ H + E R LDL
Sbjct: 65 KVRQLLREEKIAEAEKIVMEAFCGTPVNERHYMPLGDLSIQ---HHKEDTFEYTERSLDL 121
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---LLDNHSYV 187
A +YS+ V +TR S P QV+ I + S+S VS+D D++S V
Sbjct: 122 ENAVCETRYSINGVNYTRRVICSEPAQVMAVCIDADKPASVSVKVSIDGRDDYFDDNSPV 181
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N + I+ G C + GI F+A I++ GT+ + +
Sbjct: 182 N-DTDILYYGGCGSE------------DGICFAAY--IRVLGYGGTVGRW-GSSIVTDCC 225
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
D +++L A + F +D KK + ++A ++ +L H +DY+ F R
Sbjct: 226 DRVMIILGAQTDF-----RVTDYKKGAELDVITAAGK----TFEELLAEHTEDYRSYFDR 276
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSR 366
I + D S ++P+ ER+K + D LV L F FGRYL+I+ SR
Sbjct: 277 AEI--------VFEDGGSY----SLPTDERLKLVKDGGVDNGLVSLYFDFGRYLMIAGSR 324
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
GT NLQGIWN+D+ P W VNIN EMNYW + PC L + PLFD + + +G
Sbjct: 325 EGTLPLNLQGIWNKDMWPAWGCRFTVNINTEMNYWCAEPCGLGDLHIPLFDHIERMRPHG 384
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
TA+ Y SG+V HH TDIW ++ + W G AWLCTH+WEH+ +T D++F
Sbjct: 385 RDTAREMYGCSGFVCHHNTDIWGDTAPQDLWIPGTQWVTGAAWLCTHIWEHWLFTQDKEF 444
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
L ++ Y ++ A F +D+LI+ G L T PS SPE+ +I G V +MD I
Sbjct: 445 LAQK-YDTMKEAAKFFVDFLIDDGSGRLVTAPSVSPENTYITESGARGSVCIGPSMDSQI 503
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
I ++F+A+I A ++L ++ + EK+ RL +I + G I EWA D+ + E HRH
Sbjct: 504 IYQLFTAVIEAGKILGIDK-SFGEKLSAMRERLPKPEIGKYGQIKEWAVDYDEAEPGHRH 562
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHA 663
+S L+ L+P I+I P+L KAA T+ +R G GWS W WARLHD E
Sbjct: 563 ISQLYALYPADMISIRHTPELAKAARATIDRRLAHGGGHTGWSRAWIINHWARLHDGEKV 622
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
+ L F NLF HPPFQID NFG A +AE L+QS ++ L
Sbjct: 623 KENIAAL-----------FANSTSDNLFDMHPPFQIDGNFGAAAGIAEALLQSQNGEIQL 671
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKD 754
LPA+ D W +G +GL+ARGG + W D
Sbjct: 672 LPAVSPD-WKNGSFRGLRARGGYEIDCKWAD 701
>gi|160879541|ref|YP_001558509.1| hypothetical protein Cphy_1395 [Clostridium phytofermentans ISDg]
gi|160428207|gb|ABX41770.1| conserved hypothetical protein [Clostridium phytofermentans ISDg]
Length = 758
Score = 422 bits (1086), Expect = e-115, Method: Compositional matrix adjust.
Identities = 269/766 (35%), Positives = 403/766 (52%), Gaps = 83/766 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
+A+P+GNG GAM++G V E +KLN++++W G + NPD+ K L VR L+ GQ
Sbjct: 20 EALPLGNGSFGAMLYGNVEEEVIKLNQESVWYGGFRNRINPDSRKVLPKVRELIFDGQLK 79
Query: 86 EATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE---------TYRRELDLNTA 133
A +FG P Y+ L D+ + F+ L ++E+ Y+R LDL TA
Sbjct: 80 AAEELVYTSMFGTPISQGHYEPLADLRIAFNKRILHHSEQWQERQINHSNYKRFLDLQTA 139
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN-NQ 192
Y+ ++ RE S PDQV+ +++ + + LD +N+ V N N
Sbjct: 140 CYNSSYTWRETDYKREALISYPDQVMAIRLTAD--NPMGVRIELDRG-ENYEKVEANENT 196
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
I + G C G G +F A +++ ISD GTI L+VE + VL
Sbjct: 197 ITLSGSCGGN-------------GSKFIAKVQV-ISD--GTI-VRAGAFLEVENASEIVL 239
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+ + F ++DP L Y ++ H+ DY L+ RV + L
Sbjct: 240 YVAGRTDF---------YEEDPMDWCNEKLALAAQKGYEEIKKDHIADYASLYQRVDLDL 290
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQV 371
+ ++N +P+ ER++ F+ ++ D L+EL + +GRYLLISSSR G
Sbjct: 291 N-----------GDKNYLNLPTDERLRLFKENKLDDGLLELYYNYGRYLLISSSREGALP 339
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN+D+ P W S +NIN +MNYW + NLSEC PLF+ + + +G + A+
Sbjct: 340 ANLQGIWNKDMMPAWGSKYTININTQMNYWPAEVTNLSECHTPLFEHIKRMVPHGREVAE 399
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
Y G V HH TDI+ + +WPMG AWL TH+ EHY YT D F+ K
Sbjct: 400 KMYGCRGIVAHHNTDIYGDCVPQGKWMPATMWPMGFAWLATHVIEHYRYTKDVSFV-KDF 458
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
Y +L+ + F +D+L+ + L T PSTSPE+ +I +G+ + + Y +MD II+E++
Sbjct: 459 YSILKDASLFYVDYLVRDKENQLVTCPSTSPENTYILENGEKSTLCYGPSMDSQIIKELW 518
Query: 552 SAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSH 609
+ I + LE + D + VE +LK LP+ K+ G ++EW +++K+ E HRH+SH
Sbjct: 519 TGFIEVSSDLEVSNDVVSAVENMLKELPK---AKVGSRGQLLEWTKEYKEWEAGHRHISH 575
Query: 610 LFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRM 666
L+GL+PG TIT EK+ + +A++ T+ +R G GWS W +WARL D E A
Sbjct: 576 LYGLYPGSTITFEKDKEFFEASKVTINERLSAGGGHTGWSRGWIINMWARLLDGEKA--- 632
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPP--------FQIDANFGFTAAVAEMLVQSTL 718
L+NL ++ NLF HP FQID NFG TA ++EML+QS
Sbjct: 633 ---LYNL-----QELLCHSTAHNLFDLHPSNTTGMSSIFQIDGNFGGTAGLSEMLLQSHE 684
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + LLPALP +W +G V GLK RG V++ W++G L+ S
Sbjct: 685 DVICLLPALP-QRWENGYVTGLKVRGNIEVNLWWENGKLNRAEFLS 729
>gi|182626122|ref|ZP_02953882.1| fibronectin type III domain protein [Clostridium perfringens D str.
JGS1721]
gi|177908559|gb|EDT71084.1| fibronectin type III domain protein [Clostridium perfringens D str.
JGS1721]
Length = 1479
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 264/780 (33%), Positives = 409/780 (52%), Gaps = 82/780 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG---DYTNPD- 67
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG DY +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEDYNGGNK 107
Query: 68 --APKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 YNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHY +T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYKFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D DP +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 601 VQEWKDDIDDPNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKAN 660
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K LWARL D + A+R++ E NLF HPPFQID N G + +
Sbjct: 661 KINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSGM 709
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
AEMLVQS L + LPALP W G GLKARG +S W + L+ + I S N+
Sbjct: 710 AEMLVQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEISANWNNNSLNLIKIKSGSGND 768
>gi|424665546|ref|ZP_18102582.1| hypothetical protein HMPREF1205_01421 [Bacteroides fragilis HMW
616]
gi|404574619|gb|EKA79368.1| hypothetical protein HMPREF1205_01421 [Bacteroides fragilis HMW
616]
Length = 829
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 258/771 (33%), Positives = 394/771 (51%), Gaps = 84/771 (10%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESSREKPFRFGNFTTMGEFYIETGLSAVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ + + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I + GT+S D K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHATAKGGTLSN-ADGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRH 297
++ +D V L+ A + +FD F +P +P + + + + Y L+ +H
Sbjct: 297 TIKDADEVVFLVTADTDYKINFDPDFKDPKTYVGVNPAETTRQWMDNAVTMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ ++ ++P+A+R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLN-----------PDQQSPSLPTAKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 406 GRYLLITSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECTLPLV 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTAPLESEDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILLDAIEASKVLGVDSKERKQWQEVLA---HLAPYKVGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP+ HRH++HLFGL PGHT++ PDL KAA L+ RG+ GWS+ WK
Sbjct: 634 SKDIDDPKDEHRHVNHLFGLHPGHTLSPITTPDLAKAARVVLEHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY++ L + G NL+ HPPFQID NFG TA + EML
Sbjct: 694 WARLQDGNHAYKLFGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+QS + + LLPALP D W +G + G+ A+G V + WKDG L E I+S
Sbjct: 743 LQSHMGFIQLLPALP-DAWKNGSISGICAKGNFEVDLSWKDGQLAEATIFS 792
>gi|150003836|ref|YP_001298580.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149932260|gb|ABR38958.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
Length = 818
Score = 422 bits (1084), Expect = e-115, Method: Compositional matrix adjust.
Identities = 277/810 (34%), Positives = 402/810 (49%), Gaps = 94/810 (11%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ LS++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLSEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ GR
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ L + +Y++L RH DY +LF RV +QL+ +P
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A++ L + + + VLK L P +I G +MEW+ D DP+ HRH++HLFGL
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEWSTDIDDPKDEHRHVNHLFGLH 641
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PGHT++ P+L AA+ L+ RG+ GWS+ WK WARL D HAY++ L
Sbjct: 642 PGHTLSPITTPELTNAAKVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNL---- 697
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W
Sbjct: 698 -------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKE 749
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
G VKGL A+G + I W+DG L E I S
Sbjct: 750 GSVKGLCAKGNFEIDIIWQDGKLKEAVILS 779
>gi|384566468|ref|ZP_10013572.1| hypothetical protein SacglDRAFT_02625 [Saccharomonospora glauca
K62]
gi|384522322|gb|EIE99517.1| hypothetical protein SacglDRAFT_02625 [Saccharomonospora glauca
K62]
Length = 924
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 280/776 (36%), Positives = 404/776 (52%), Gaps = 70/776 (9%)
Query: 3 NAESTSTTNP----LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
A TS P L + ++ PA + ++ +P+GNG LG V+GGV +E L+ NE TLWT
Sbjct: 39 GAAETSDLRPSPEGLTLWYDEPASDWESEVLPVGNGALGVGVFGGVATERLQFNEKTLWT 98
Query: 58 GVPG-----DYTNPDAPK--ALSDVRSLVDSGQYAEATAASVKLFGHPA---DVYQLLGD 107
G PG D+ N P+ A+ +VR +D+ A+ KL G P YQ G+
Sbjct: 99 GGPGAADGYDFGNWREPRPGAIEEVRQRLDTELRADPEWVVSKL-GQPKRGYGAYQTFGE 157
Query: 108 IELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSE 167
I + + L+ + YRR L+L A A V Y V TRE+F+S D V+V + SG
Sbjct: 158 IRVS--GAELEEVAD-YRRYLNLADAVAGVSYEADGVHHTREYFASAADDVVVARFSGEV 214
Query: 168 SGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKI 227
G++ V + + DN S N GR + A DD G+++ A +I++
Sbjct: 215 PGAVDVTVGV-TAPDNRS----KNLTARGGRIT------FSGALDD-NGLRYEA--QIQV 260
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN 287
D G+ D + V +D L+L A + + + P +DP + + +
Sbjct: 261 LTDGGSRVDNPDGSVTVTDADTMTLVLAAGTDYSAEY--PVYRGEDPHAAVTERVDAAVA 318
Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP 347
Y L H+ D++ LF RVS+ L + D+ TD D +AE ++ +
Sbjct: 319 KGYDALRAAHVADHRGLFDRVSLDLGQRMPDLPTDELLARYRDGGLAAEERRALEV---- 374
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L FQ+GRYLLI+SSR G+ ANLQG+WN+ SP W + HVNINL+MNYW + N
Sbjct: 375 ----LYFQYGRYLLIASSRSGSLPANLQGVWNDSTSPPWSADYHVNINLQMNYWPAEVTN 430
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMG 466
LSE EPLFD++ L G+ TA+ + GWV+H++T + + D W +P
Sbjct: 431 LSETTEPLFDYVDSLVAPGTVTAKEMFGNRGWVVHNETTPFGYTGVHDWATSFW--FPEA 488
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHE 525
GAWL WEHY +T D FL +RAYP+L+ + F +D L+ + DG L +PS SPE
Sbjct: 489 GAWLAQSYWEHYLFTRDETFLAERAYPMLKSLSRFWIDELVTDSRDGRLVVSPSYSPEQ- 547
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KI 584
S ++M I+ ++ + AAE++ ++E+ E + +L L P +I
Sbjct: 548 --------GDFSAGASMSQQIVWDLLTNTAEAAELVGEDEEFRAE-LAATLADLDPGLRI 598
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
G + EW +D+ DP HRH+SHLF L PG I P+ AAEK+L RG+ G G
Sbjct: 599 GSWGQLQEWKEDWDDPNNQHRHVSHLFALHPGRQIDPYSEPEYTAAAEKSLLARGDGGTG 658
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFG 704
WS WK WARL D +HA+ M+ L + H NL+ HPPFQID NFG
Sbjct: 659 WSKAWKINFWARLLDGDHAHTMLSELLS-----HST------LPNLWDTHPPFQIDGNFG 707
Query: 705 FTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
TA +AEMLVQS + +LPALP +WS+G V GL+ARG TV + W +G + +
Sbjct: 708 ATAGIAEMLVQSHRGVVDVLPALP-TEWSTGSVSGLRARGDVTVDVEWANGTANRI 762
>gi|307565695|ref|ZP_07628164.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A]
gi|307345521|gb|EFN90889.1| conserved hypothetical protein [Prevotella amnii CRIS 21A-A]
Length = 771
Score = 421 bits (1082), Expect = e-115, Method: Compositional matrix adjust.
Identities = 263/757 (34%), Positives = 384/757 (50%), Gaps = 80/757 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVRSL 78
++PIGNG LGA + GG+ + LNE +LW G PG N + L +R
Sbjct: 64 SLPIGNGSLGANIMGGIACDRFTLNEKSLWRGGPGVKGGAAYYWDQNKQSAHFLKAIRKA 123
Query: 79 VDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD-----------SHLKYAEETYRRE 127
G A + F A Y + + F + H + Y+R
Sbjct: 124 FLQGNTKLAAKLTQDNFNGKA-AYSIATEPHFRFGNFTTMGEVTIQTGHKEQDISGYKRC 182
Query: 128 LDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKIS--GSESGSLSFNVSLDSLLDNHS 185
L L++A A V Y + R +F S PD V+V K + G++ +L+ + +
Sbjct: 183 LSLDSAIASVSYHTNTTYYKRSYFISYPDNVMVIKYTAKGADLLNLTLTYTPSPIAQGQV 242
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ + I +G+ ND+ ++F+ + IK + D GT S + D KL +
Sbjct: 243 VNDSTDGITYKGKL-----------NDN--NMRFT--IRIKANIDSGT-SKVIDGKLHIL 286
Query: 246 GSDWAVLLLVASSSFDGPFINPS--DSKK----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ L A + + NPS D K +P + ++ Y++L HL
Sbjct: 287 KAKTVTFFLTADTDYKQN-TNPSFTDPKTYIGVNPDKTTKKWIKHALQKGYNNLLNNHLA 345
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF RV + ++ KD C +P+ +R++ ++T + D L L FQ+GR
Sbjct: 346 DYTPLFKRVKLIINPDDKDTKEALC-------LPTNKRLQRYRTGKADYDLEALYFQYGR 398
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPGT ANLQG+W+ ++ W H NINL+MNYW +L NL+EC PL +F
Sbjct: 399 YLLIASSRPGTLPANLQGLWHNNVDGPWRVDYHNNINLQMNYWHALTTNLAECALPLNNF 458
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G +TA+ Y A GW ++I+ ++ K + W L P+ G WL THLWE+
Sbjct: 459 ICMLEKPGRRTAKAYYNARGWTTSISSNIFGFTAPLIDKDMTWNLSPISGPWLSTHLWEY 518
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y++T ++ +L AYP+L+G A F +D+L DG PSTSPEH +
Sbjct: 519 YDFTRNKTYLRNTAYPILKGSAQFAVDFLWHKPDGTYTAAPSTSPEH---------GSID 569
Query: 538 YSSTMDMAIIREVFSAIISAAEVLE--KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+T A++RE+ + I+A++VL+ + E EKVL +L P +I G +MEW++
Sbjct: 570 QGATFVHAVVREILTDAIAASKVLDIDRKERKQWEKVLL---KLSPYRIGRYGQLMEWSE 626
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D DP +HRH++HLFGLFPGHTI+ P L +AA L+ RG+ GWS+ WK LWA
Sbjct: 627 DIDDPNDNHRHVNHLFGLFPGHTISTSTTPTLARAARIVLEHRGDGATGWSMAWKICLWA 686
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RLHD +HAY++ + L NL H PFQID NFG TA +AEMLVQ
Sbjct: 687 RLHDGDHAYKLFQNL-----------LRNSTLDNLLDTHTPFQIDGNFGATAGIAEMLVQ 735
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
S + LLPALP W G VKGL RGG+ + + W
Sbjct: 736 SQMGKTELLPALP-KAWKHGYVKGLVVRGGKEIELKW 771
>gi|332882772|ref|ZP_08450383.1| hypothetical protein HMPREF9074_06194 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|332679274|gb|EGJ52260.1| hypothetical protein HMPREF9074_06194 [Capnocytophaga sp. oral
taxon 329 str. F0087]
Length = 805
Score = 421 bits (1081), Expect = e-114, Method: Compositional matrix adjust.
Identities = 276/774 (35%), Positives = 404/774 (52%), Gaps = 59/774 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F PAKHFT+++PIGNGRLGA+++G ++ + LNE +LW+G + +P+A L
Sbjct: 23 VSVVFKQPAKHFTESLPIGNGRLGAILFGKTDTDRIVLNEISLWSGGYQEADDPEAHTYL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A K F G A+ YQ+ D+ L++ + +
Sbjct: 83 KEIQQLLLEGKNLEAQALLQKHFIARGKGSCHGQGANCSYGCYQVFADLLLDWKN---QT 139
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA Y+ + F+ + ++ KI+G++ N+SL
Sbjct: 140 PVKDYKRVLRLDEATAITTYTRDENSIEQVAFADFKNDLLWIKITGTKP--FDLNISLFR 197
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN I + G P +D +G+ F++ ++++ T E+
Sbjct: 198 K-ENATISYQNNHITLTGVLP----------DDKKEGMHFASAIDVQ------TDGKAEN 240
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
K+ +E L+L S + + + N S ++ S LQ + S+
Sbjct: 241 KEKAIEIQAAKELILKISMATNYQYKNGGLSNVSVKEKAESYLQRCTS-SFEAALAESKT 299
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGR 358
YQ LF++ +R + + N + + ER++ F + D+D L L + FGR
Sbjct: 300 IYQGLFNK-----NRWYGN------ANSNTSHLSTYERLEGFYKGDKDALLPILYYNFGR 348
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + NLSE EPL F
Sbjct: 349 YLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEATNLSELTEPLNRF 408
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L NG KTA+ Y A GWV H ++ W +S VW GGAWLC H+W+HY
Sbjct: 409 TKNLVPNGYKTAKAYYNADGWVAHVISNPWFYTSPGES-AVWGSTLTGGAWLCEHIWQHY 467
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG----KL 533
+T D DFL K YP+L+ F LI E GY T PS SPE+ ++ P ++
Sbjct: 468 LFTHDIDFL-KEYYPVLKQATDFFKSLLIKEPKKGYWITAPSNSPENAYLLPSKDNKKQV 526
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ TMDM I+RE+FS + AA +L + D + + P +I + G + EW
Sbjct: 527 GNTCIAPTMDMQIVRELFSNTMQAATILGVDSDKFSQWT-DIIKHTAPNRIGKKGDLNEW 585
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
D++D + HHRH+SHL+GL+P IT P L KAAEKTLQ RG+ G GWS WK
Sbjct: 586 LDDWEDADPHHRHVSHLYGLYPYDEITPWDTPKLAKAAEKTLQMRGDGGTGWSRAWKINF 645
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HA ++++L V E GG Y+NLF AHPPFQID NFG A +AEML
Sbjct: 646 WARLQDGNHALVLLRQLLRPVSSEITTGQVGGSYANLFCAHPPFQIDGNFGGAAGIAEML 705
Query: 714 VQS--TLNDLYLLPALPWD-KWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+QS N + LPALP W +G +KG+KAR VS W+ L + I S
Sbjct: 706 LQSHGKQNVIRFLPALPSHPDWENGVMKGMKARNNFEVSFSWQQHQLQKATITS 759
>gi|345562260|gb|EGX45329.1| hypothetical protein AOL_s00170g36 [Arthrobotrys oligospora ATCC
24927]
Length = 826
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 278/800 (34%), Positives = 412/800 (51%), Gaps = 95/800 (11%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S ++PL+I +F D+ IGNGR+GA + GG SE +++NED+LW+G NPD
Sbjct: 30 SASHPLRIWTTSAGSYFNDSYLIGNGRIGAALPGGAASEVIRVNEDSLWSGGKLSRVNPD 89
Query: 68 APKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETY 124
A + D++SL+ + EA A G P Y+ LGD++L + S + Y
Sbjct: 90 ANGKMRDIQSLLTQQRNPEAARLAGFAYAGTPVSARHYEPLGDLQLVMNHSS---STTGY 146
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-----S 179
R LDL ++ V Y+VG V + RE+ +SNPD +I I+ S+ S+SFN+ L +
Sbjct: 147 ERWLDLFDSSVGVYYTVGGVSYRREYIASNPDNIIAIHITASKPASVSFNIHLRKGQSLN 206
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
++++Y G++ +M G GK G++FSA K+ G + L D
Sbjct: 207 RWEDYTYKVGSDTTVMGGESQGK------------DGVKFSA--GTKVVASGGKVYTLGD 252
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ + +D A + A +++ ++DP ++ +S L SI SYSD+ H+
Sbjct: 253 YVI-CDNADEATIFFTAWTAY---------RQQDPINKVLSDLSSISVKSYSDIRATHVA 302
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQK F RVS+ L S + + + +R+ + + DP LV L FQFGRY
Sbjct: 303 DYQKYFGRVSLSLG----------SSSDTQKALSTPKRLAAIASTFDPELVALYFQFGRY 352
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
L ISSSR T NLQGIWN+++ P W S VNINL+MNYW SL N+ E PL+D +
Sbjct: 353 LFISSSRVNTLPPNLQGIWNQEMDPQWGSKYTVNINLQMNYWPSLVTNMIELTTPLYDLI 412
Query: 420 TYLSINGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L +G KTAQ Y S GWV HH TDIWA ++ WP G AWL H+ E Y
Sbjct: 413 ARLHSSGKKTAQSMYGNSQGWVCHHNTDIWADTAPQDNYASSTWWPAGSAWLVHHIIEEY 472
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK-LACVS 537
+T D++FL+K Y ++ A F ++L + G+ TNP+ SPE+ F K ++
Sbjct: 473 RFTRDKEFLQKY-YNTIKDAALFFTEFLTN-YKGWKVTNPTLSPENTFYLLGTKTTTAIT 530
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
ST+D ++I E+F +++ ++L K+++++ + +L P +I + G IMEW +D+
Sbjct: 531 LGSTLDNSLIWELFGSLLEIMDILGKHDNSMKSTLHDLRAKLPPLRINKWGGIMEWIEDY 590
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALW 654
+ + HRH+SHLFG++PG IT N + AA ++ +R G GWS W A+
Sbjct: 591 DETDPGHRHISHLFGVYPGSEIT-STNMTVFNAARSSVSRRLSYGSGSTGWSRAWFIAVG 649
Query: 655 ARLH--DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP--FQIDANFGFTAAVA 710
RL+ DQ H V L+N HF +++ PP FQID NFG TA +
Sbjct: 650 GRLYLPDQVHQ-STVTLLYNYT------HF-----NSMLDTGPPSAFQIDGNFGGTAGIV 697
Query: 711 EMLVQS----------TLND-------------LYLLPALP--WDKWSSGCVKGLKARGG 745
E L+ S T N + LP LP W G V GL+ARGG
Sbjct: 698 EALLHSHETVTATSITTANMKASGTGDATGIPVIRFLPTLPHQWASNGGGFVTGLRARGG 757
Query: 746 ETVSICW-KDGDLHEVGIYS 764
V I W ++G+L I S
Sbjct: 758 AQVDIFWTENGNLDNATITS 777
>gi|294775002|ref|ZP_06740531.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
gi|294451046|gb|EFG19517.1| conserved hypothetical protein [Bacteroides vulgatus PC510]
Length = 818
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 277/810 (34%), Positives = 400/810 (49%), Gaps = 94/810 (11%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWKVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ GR
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ L + +Y++L RH DY +LF RV +QL+ +P
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A+IRE+ I
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVIREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A++ L + + + VLK L P +I G +MEW+ D DP HRH++HLFGL
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEWSTDIDDPTDEHRHVNHLFGLH 641
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PGHT++ P+L AA+ L+ RG+ GWS+ WK WARL D HAY++ L
Sbjct: 642 PGHTLSPITTPELTNAAKVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNL---- 697
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W
Sbjct: 698 -------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKE 749
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
G VKGL A+G + I W+DG L E I S
Sbjct: 750 GSVKGLCAKGNFEIDIIWQDGKLKEAVILS 779
>gi|393788805|ref|ZP_10376931.1| hypothetical protein HMPREF1068_03211 [Bacteroides nordii
CL02T12C05]
gi|392653911|gb|EIY47561.1| hypothetical protein HMPREF1068_03211 [Bacteroides nordii
CL02T12C05]
Length = 814
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 276/806 (34%), Positives = 411/806 (50%), Gaps = 91/806 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
T ++P+GNG LGA + G + +E + LNE TLW G P DY N + L ++R
Sbjct: 60 TSSLPLGNGSLGANIMGSIAAERITLNEKTLWKGGPNTSGGADYYWNVNKQSAPILKEIR 119
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
+G A + K F A + +G++ +E S + ++ Y
Sbjct: 120 QAFTAGDQKRAETLTRKNFNGLAAYEEKDETPFRFGSFTTMGEVYVETGLSEIGMSD--Y 177
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +++ R +F S PD V+V + + + G +L+F+ S ++
Sbjct: 178 KRILSLDSAMATVRFLKDGIKYQRNYFISYPDSVMVMRFTADKPGMQNLTFSYSPNTEAQ 237
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + G K N N ++F AI ++G +E+ KL
Sbjct: 238 GKIEADGTNGLYYAG---------KLNNNQMKFALRFRAI-------NKGGTVRVENGKL 281
Query: 243 KVEGSDWAVLLLVASSSFD---GPFINPSDS--KKDPTSESMSALQSIRNLSYSDLYTRH 297
++ ++ V LL A + + P N ++ +P+ + + ++ +Y LY RH
Sbjct: 282 VIKDANEVVFLLTADTDYKMNYNPDFNSPETYVGNNPSETTRNMMKQAEAKTYEVLYLRH 341
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQF 356
+DY LF+RV +LS +P+ + D +P+ +R+K + Q D L +L +Q+
Sbjct: 342 QNDYTALFNRV--KLSLNPQVPIAD---------LPTDQRLKHYRQGTPDYYLEQLYYQY 390
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ +L W H NIN++MNYW + NL EC PL
Sbjct: 391 GRYLLIASSRPGNMPANLQGIWHNNLDGPWRVDYHNNINIQMNYWPACSTNLDECMIPLI 450
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTA+ + A GW +I+ ++ ++ W PM G WL TH+W
Sbjct: 451 DFIRGLVKPGEKTAKAYFNARGWTASISANIFGFTAPLSSEQMEWNFNPMAGPWLATHIW 510
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL + YPL++ A F +D+L DG PSTSPEH
Sbjct: 511 EYYDYTRDKKFLSEIGYPLIKSSAQFTVDYLWHKPDGTYTAAPSTSPEH---------GP 561
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWA 594
V +T A++RE+ S ISA+++L DA K K L L P +I G +MEW+
Sbjct: 562 VDQGATFVHAVVREILSDAISASKIL--GVDAKERKQWKDILKNLVPYQIGRYGQLMEWS 619
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
D DP+ HRH++HLFGL PGHT++ P+L +AA+ LQ RG+ GWS+ WK W
Sbjct: 620 VDIDDPDDKHRHVNHLFGLHPGHTLSPITTPELAQAAKIVLQHRGDGATGWSMGWKLNQW 679
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL D HAY + L + G NL+ H PFQID NFG TA + EML+
Sbjct: 680 ARLQDGNHAYMLFGNL-----------LKNGTLDNLWDTHTPFQIDGNFGGTAGITEMLL 728
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS------- 767
QS + + LLPALP D W G + G+ A+G VSI W++ L E + S
Sbjct: 729 QSHMGFIQLLPALP-DAWKEGSINGICAKGNFEVSIAWENNQLKEAILTSKAGTPCTIKY 787
Query: 768 NNDHDSFKTLHYRGTSVKVNLSAGKI 793
+ SFKT +G S K+ GKI
Sbjct: 788 GDQTLSFKT--QKGQSYKIVGERGKI 811
>gi|319902716|ref|YP_004162444.1| alpha-L-fucosidase [Bacteroides helcogenes P 36-108]
gi|319417747|gb|ADV44858.1| Alpha-L-fucosidase [Bacteroides helcogenes P 36-108]
Length = 832
Score = 420 bits (1079), Expect = e-114, Method: Compositional matrix adjust.
Identities = 279/811 (34%), Positives = 416/811 (51%), Gaps = 89/811 (10%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG +GA + G V +E + NE TLW G P DY N + L +R
Sbjct: 75 SQSLPIGNGSIGASIMGSVEAERITFNEKTLWRGGPNTSKGADYYWNVNKQSAHVLEQIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-YQLLGDIELEFDD--SHLKYAEET---------Y 124
G A+A + + F +DV Y+ + F + + ++ ET Y
Sbjct: 135 KAFVEGDQAKAEKLTRENFN--SDVPYEAARENPFRFGNFTTMGEFYVETGLNIIGMSGY 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V+++ V++ R +F S P V+V + + S +G +L F+ + + +
Sbjct: 193 KRALSLDSAMAVVQFTKDKVDYQRTYFISYPANVMVMRYTASRAGMQNLVFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G + ++ +A D G+++ ++ I + G +S D KL
Sbjct: 253 GSISADGMDGLVY-------------SAVLDNNGMKY--VVRIHAVVNGGKLSN-ADGKL 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+G+D V + A + +FD F NP+ +P + + S Y L H
Sbjct: 297 TVKGADEVVFYVTADTDYQINFDPDFANPATYVGVNPAETTRKWMDSAVAKGYDLLRKEH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ P TD +P+++R+K++++ + D L EL +QF
Sbjct: 357 YEDYATLFNRVKLVLN--PDAKATD---------LPTSQRLKNYRSGKPDYYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC EPL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINVQMNYWPACSTNLDECMEPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G +TAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGKRTAQAYFGARGWTASISGNIFGFTAPLESQDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSADFAVDYLWHKPDGTFTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
V +T A+IRE+ I A+ VL +K E E+VL RL P +I G +MEW
Sbjct: 577 VDQGTTFVHAVIREILLDAIEASRVLGVDKAERRQWEQVLA---RLLPYRIGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L +AA L+ RG+ GWS+ WK
Sbjct: 634 SVDIDDPKDEHRHVNHLFGLHPGHTLSPVTTPELAQAARVVLEHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY++ L + G NL+ HPPFQID NFG TA V EML
Sbjct: 694 WARLQDGNHAYKLFGNL-----------LKNGTMDNLWDTHPPFQIDGNFGGTAGVTEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + + LLPALP D W +G V G+ A+G V + WK G L + I S
Sbjct: 743 LQSHMGFIQLLPALP-DAWHTGSVSGICAKGNFEVELVWKTGVLQKAVILSKSGGECIVK 801
Query: 774 F--KTLHY---RGTSVKVNLSAGKIYTFNRQ 799
+ KTL + +G S ++ S K + NR+
Sbjct: 802 YAGKTLSFNTVKGRSYQLKYSVEKGLSVNRE 832
>gi|423313025|ref|ZP_17290961.1| hypothetical protein HMPREF1058_01573 [Bacteroides vulgatus
CL09T03C04]
gi|392686239|gb|EIY79545.1| hypothetical protein HMPREF1058_01573 [Bacteroides vulgatus
CL09T03C04]
Length = 818
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 276/810 (34%), Positives = 401/810 (49%), Gaps = 94/810 (11%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ GR
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ L + +Y++L RH DY +LF RV +QL+ +P
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A++ L + + + VLK L P +I G +MEW+ D DP+ HRH++HLFGL
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEWSTDIDDPKDEHRHVNHLFGLH 641
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PGHT++ P+L AA+ L+ RG+ GWS+ WK WARL D HAY++ L
Sbjct: 642 PGHTLSPITTPELTNAAKVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNL---- 697
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W
Sbjct: 698 -------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKE 749
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
G VKGL A+G + I W+DG L E I S
Sbjct: 750 GSVKGLCAKGNFEIDIIWQDGKLKEAVILS 779
>gi|383638758|ref|ZP_09951164.1| alpha-L-fucosidase [Streptomyces chartreusis NRRL 12338]
Length = 740
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 264/745 (35%), Positives = 379/745 (50%), Gaps = 75/745 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG--------DYTNPDAPKALSDVRSL 78
A+P+GNG LGAMV+G + SE ++ NE TLWTG PG D+ P P A+ V+
Sbjct: 15 ALPVGNGALGAMVFGSIASERVQFNEKTLWTGGPGSVQGYDHGDWREPR-PTAIDAVQDD 73
Query: 79 VDSGQYAEATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
+D+ + + +L G P YQ GD+ L+F + E YRREL L+T A
Sbjct: 74 LDTRRRLAPEDVAGRL-GQPRVGFGAYQTFGDLYLDFPGTP---TPEAYRRELALDTGVA 129
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
V Y+ RE F+S PD VIV +I ++F + S + + ++ +
Sbjct: 130 SVAYTHRQTRHRREFFASFPDGVIVGRIGADRPAGITFTLRYTSPRGDFTTTATGGRLTV 189
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
G K N G++F A ++++ D G +++ D + V G+D A +L
Sbjct: 190 RGAL-------KDN------GLRFEA--QVQVRSDGGAVTSGADGTITVTGADSAWFVLA 234
Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
A + + +P DP A+ + Y L RH+ D++ LF RV++ + +S
Sbjct: 235 AGTDYAD--THPDYRGADPHPAVTRAVDRASSRGYDSLRARHIADHRTLFARVTLDIGQS 292
Query: 316 -PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANL 374
P ++ TD +A+R +L L FQ+GRYLLI+SSR G+ ANL
Sbjct: 293 APAEVPTDRLLASYTGGTSAADR----------ALEALFFQYGRYLLIASSRAGSLPANL 342
Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
QG+WN SP W + HVNINL+MNYW + NL E P F+ L G TA+ +
Sbjct: 343 QGVWNHSTSPPWSADYHVNINLQMNYWLAEAANLPETTVPYDRFVQALRAPGRHTARQMF 402
Query: 435 LASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
+ GWV+H++T+ + + D W +P AWL L+EHY + D+L AYP
Sbjct: 403 GSRGWVVHNETNPYGFTGVHDWATAFW--FPEAAAWLTQQLYEHYRFGGSTDYLRTTAYP 460
Query: 494 LLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVF 551
+++ A F LD L + DG L PS SPEH +F A + M I+ ++F
Sbjct: 461 VMKEAAEFWLDNLRTDPRDGRLVVTPSYSPEHGDFTA----------GAAMSQQIVHDLF 510
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEWAQDFKDPEVHHRHLSHL 610
+ + AA VL + D ++V ++L L P +I G + EW +D DP HRH+SHL
Sbjct: 511 TNTLEAARVLGDSRD-FRQRVEQALAHLDPGLRIGSWGQLQEWKEDLDDPADDHRHVSHL 569
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRL 670
F L PG IE + +AA+ +L RG+ G GWS WK WARLHD +HA++M+
Sbjct: 570 FALHPGR--QIEPDSRWAEAAKVSLTARGDGGTGWSKAWKINFWARLHDGDHAHKMLG-- 625
Query: 671 FNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWD 730
+ NLF HPPFQID NFG T+ V EML+QS + +LPALP
Sbjct: 626 ---------EQLRSSTLPNLFDTHPPFQIDGNFGATSGVVEMLLQSQHGVIEILPALP-S 675
Query: 731 KWSSGCVKGLKARGGETVSICWKDG 755
W SG V+GL+ARGG V I W DG
Sbjct: 676 AWPSGSVRGLRARGGAVVDIDWTDG 700
>gi|319639947|ref|ZP_07994674.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
gi|345516953|ref|ZP_08796433.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|254833732|gb|EET14041.1| glycoside hydrolase family 95 [Bacteroides sp. 4_3_47FAA]
gi|317388225|gb|EFV69077.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_40A]
Length = 818
Score = 419 bits (1078), Expect = e-114, Method: Compositional matrix adjust.
Identities = 276/810 (34%), Positives = 401/810 (49%), Gaps = 94/810 (11%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGMTD--YKRILSLDSAMAIVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ GR
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKTDGPNRLLYTGRL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ L + +Y++L RH DY +LF RV +QL+ +P
Sbjct: 299 NPDFKDPKTYVGPDPDQTTLAMLDAAAAKNYNELCERHKTDYTQLFGRVKLQLNPHAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTHQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A++ L + + + VLK L P +I G +MEW+ D DP+ HRH++HLFGL
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEWSTDIDDPKDEHRHVNHLFGLH 641
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PGHT++ P+L AA+ L+ RG+ GWS+ WK WARL D HAY++ L
Sbjct: 642 PGHTLSPITTPELTNAAKVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNL---- 697
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W
Sbjct: 698 -------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKE 749
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
G VKGL A+G + I W+DG L E I S
Sbjct: 750 GSVKGLCAKGNFEIDIIWQDGKLKEAVILS 779
>gi|345514340|ref|ZP_08793853.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|229437320|gb|EEO47397.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
Length = 818
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 276/810 (34%), Positives = 401/810 (49%), Gaps = 94/810 (11%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + V+G N+++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKVDGPNRLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAIDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A++ L + + + VL L P +I G +MEW+ D DP+ HRH++HLFGL
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEWSTDIDDPKDEHRHVNHLFGLH 641
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PGHT++ P+L AA+ L+ RG+ GWS+ WK WARL D HAY++ L
Sbjct: 642 PGHTLSPITTPELTHAAKVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNL---- 697
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W
Sbjct: 698 -------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKE 749
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
G VKGL A+G ++I W+DG L E I S
Sbjct: 750 GSVKGLCAKGNFEINITWQDGKLKEAVILS 779
>gi|336412946|ref|ZP_08593299.1| hypothetical protein HMPREF1017_00407 [Bacteroides ovatus
3_8_47FAA]
gi|335942992|gb|EGN04834.1| hypothetical protein HMPREF1017_00407 [Bacteroides ovatus
3_8_47FAA]
Length = 799
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 257/768 (33%), Positives = 413/768 (53%), Gaps = 52/768 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKA 71
+++ + PAK + ++PIGNGR+GAMV+GG+ ET+ LNE ++W+G + P +
Sbjct: 29 VELWYEQPAKEWMSSVPIGNGRIGAMVFGGIEEETIALNESSMWSGQYDENQEIPFGKER 88
Query: 72 LSDVRSLVDSGQYAEATAASVKLF---GHPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
++++R L G+ E + + GH + +GD++L F S+ + YRR L
Sbjct: 89 MNELRKLFFEGKIQEGNQIAGEFLHGNGHSFGTHLPIGDLKLTF--SYPENTVSNYRRSL 146
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL TA + Y++G+V + RE F++NPD V+V ++S S+ +++ +SL L ++ +
Sbjct: 147 DLGTAISTTNYTIGDVNYVRECFATNPDDVLVLRMSASKKKAINAKLSLSMLRESEISTD 206
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
GN Q+I EG P + P G+ F I IS GT+ A ED + V +D
Sbjct: 207 GN-QLIFEGTV---NFPKQG-----PGGVSFQG--RIAISAPNGTLQA-EDSSISVNDAD 254
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+++ +++ +D+ K E++ + +Y L HL+DY LF RV
Sbjct: 255 MLTIVIDVRTNYK------NDAYKSLCKETVVKAEK---KTYEKLKKTHLNDYTPLFDRV 305
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
S+QL T + T E+VK + DP L LLFQ+GRYLL++SSR
Sbjct: 306 SLQLG---------TGEYAGLPTDKRWEQVK--KGGYDPGLDVLLFQYGRYLLLASSREN 354
Query: 369 TQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+ + A LQG +N++L+ W + H++IN + NYW + NL+EC PLF ++ LS++
Sbjct: 355 SPLPAALQGFFNDNLACNMGWTNDYHLDINTQQNYWIANVGNLAECHLPLFKYIEDLSVH 414
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G+KTAQ Y GW H +IW +A G ++W L+P +W+ +HLW Y YT D+D
Sbjct: 415 GAKTAQKIYGCKGWTAHTTANIWG-YTAPSGSILWGLFPTASSWIASHLWTQYEYTRDKD 473
Query: 486 FLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
+L K AYPLL+G A FLLD+++E + GY+ T PS SPE+ F+ L C S T D
Sbjct: 474 YLTKTAYPLLKGNAEFLLDYMVEDPNTGYMVTGPSISPENSFLYQGNNL-CASMMPTCDR 532
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
+ E+F+A I +A++L +++ + + +++ + P ++ +G + EW +D+ + +H
Sbjct: 533 VLAYEIFNACIQSAQILNIDKE-FSDSLQQAIKKFPPIRLRANGGVREWLEDYDEAHPNH 591
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR----GEEGPGWSITWKTALWARLHDQ 660
RH SHL L+P IT++K P+L A KT++ R G E WS +ARL D
Sbjct: 592 RHTSHLLALYPYEQITLDKTPELAAGARKTIEDRLAAEGWEDTEWSRANMICFYARLKDT 651
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
+ AY+ V L ++ E+ + A + F +D N A +AEMLVQ
Sbjct: 652 KQAYQSVLTLESIFTRENLLSISPAGIAG--APYDIFILDGNTAGAAGIAEMLVQGHEGY 709
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
+ LP LP ++W+ G KGL +GG VS W ++E + + N
Sbjct: 710 IEFLPCLP-EQWNVGTYKGLCVKGGAEVSAAWNQSLINEATLKATADN 756
>gi|265767320|ref|ZP_06094986.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|263252625|gb|EEZ24137.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
Length = 829
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 263/801 (32%), Positives = 400/801 (49%), Gaps = 89/801 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P + +P + + + + Y L+ +H
Sbjct: 297 TVKNADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDSKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP+ HRH++HLFGL PGHT++ PDL KAA L+ RG+ GWS+ WK
Sbjct: 634 SKDIDDPKDEHRHVNHLFGLHPGHTLSPITTPDLAKAARVVLEHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY++ L + G NL+ HPPFQID NFG TA + EML
Sbjct: 694 WARLQDGNHAYKLFGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + + LLPALP D W G + G+ A+G V + WK+G L E I+S
Sbjct: 743 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDLSWKNGQLAEATIFSKAGEP---- 797
Query: 774 FKTLHYRGTSVKVNLSAGKIY 794
T+ Y ++ S GK+Y
Sbjct: 798 -CTVRYGDKTLSFKTSKGKVY 817
>gi|116624427|ref|YP_826583.1| hypothetical protein Acid_5349 [Candidatus Solibacter usitatus
Ellin6076]
gi|116227589|gb|ABJ86298.1| conserved hypothetical protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 718
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/787 (34%), Positives = 399/787 (50%), Gaps = 104/787 (13%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
L + + PA+ + + A+PIGNGRLGAM++G E L+LNE +LWTG
Sbjct: 23 LALWYQQPAEDWQSQALPIGNGRLGAMIFGDARREHLQLNEISLWTG------------- 69
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
D+G+ YQ LGD+ L+ + YRR LD++
Sbjct: 70 -----DEKDTGR------------------YQNLGDLFLDLTHG----PPQNYRRSLDID 102
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TA V YS G + RE+F+S P QVIV + + + G+ + + L D H +
Sbjct: 103 TAIHTVDYSAGGAAWRREYFASAPRQVIVLRCTADKRGAYTGTLRLT---DAHG-----S 154
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+ E G R+ ++A G++F +++ + R T S L +E +D A+
Sbjct: 155 PVSAE----GTRL---SSAGKLENGLEFETQIQVMATGGRITASG---DALHIENAD-AL 203
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+ +A+ + P + P + L + + Y+ + H+ DYQ+LF RV++
Sbjct: 204 TIFIAAGTNYVPDRARAWRGDSPHARITRQLAAAAAMDYAGMRAAHIADYQQLFRRVTLN 263
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQ 370
L +P ++ TD ER+ ++ DP L L FQ+GRYLLISSSRPG+
Sbjct: 264 LGSTPGEMPTD-------------ERLLRYRDGSPDPELEALFFQYGRYLLISSSRPGSL 310
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQG+WN +P W S H NIN++MNYW + NL+EC P FD++ S+ G +T
Sbjct: 311 PANLQGLWNNSNNPPWRSDYHSNINIQMNYWPAEVTNLAECALPFFDYVN--SLRGVRTE 368
Query: 431 QVNYL---ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ GW + + +I+ G W P G AW H WEHY +T DRDFL
Sbjct: 369 ATHKYYPNVRGWTVQTENNIFGA-----GSFKWN--PPGSAWYAQHFWEHYAFTHDRDFL 421
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
K AYP+L+ F D L+ DG L T SPEH P T D ++
Sbjct: 422 SKMAYPVLKEITQFWEDHLVARPDGALVTPDGWSPEHGPEEP---------GVTYDQELV 472
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
++F+ + AA VL + + KV + RL K+ G + EW +D D HRH+
Sbjct: 473 WDLFTNYLEAAAVLNVDAGYRI-KVTQLRQRLLKPKVGAWGQLQEWPEDRDDIRDEHRHV 531
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHLF L PG I+ P+L AA+ +L RG++ GW++ W+ WARL D +HA+ ++
Sbjct: 532 SHLFALHPGRQISPVGTPELAAAAKVSLTARGDQSTGWAMAWRINFWARLLDGDHAHLLL 591
Query: 668 KRLFNLVDPEHEKHF--EGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
+ L ++ + + GG+YSNLF HPPFQID NFG TA +AEML+QS +++LLP
Sbjct: 592 RNLLHITGKGNNIDYGKGGGVYSNLFDTHPPFQIDGNFGATAGIAEMLLQSQAGEIHLLP 651
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVK 785
ALP D W+ G V GL+ARG TV I WK G L + S S + T+ + G +
Sbjct: 652 ALPKD-WAEGSVTGLRARGNITVDISWKQGLLTSATLRSPVSTS-----ATVRFNGHAQH 705
Query: 786 VNLSAGK 792
V L+AGK
Sbjct: 706 VELAAGK 712
>gi|46118818|ref|XP_384910.1| hypothetical protein FG04734.1 [Gibberella zeae PH-1]
Length = 768
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 276/814 (33%), Positives = 410/814 (50%), Gaps = 81/814 (9%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
+E ST L + + PA +++A+PIGNGRLGAMV+G +E L+LNED++W G P D
Sbjct: 5 SEKASTDKSLLLHYAAPASSWSEALPIGNGRLGAMVYGRTSTELLQLNEDSVWYGGPQDR 64
Query: 64 TNPDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYA 120
T DA L+ +R L+ ++ +A T A F PA + Y+ LG +EF H +
Sbjct: 65 TPRDACSNLATLRQLIRDEKHKDAETLAREAFFATPASMRHYEPLGQCTIEF--GHDEKN 122
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y+R LDL T+ + KY V + R+ +S P+ V+ + S ++ S
Sbjct: 123 VSDYKRHLDLATSQSTTKYDYEGVSYRRDVIASFPNNVLAFRFQASAPTRFVVRLNRQSE 182
Query: 181 LDNHS--YVNG----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
++ + Y++ +N II++ GK N+N + + L + GT+
Sbjct: 183 VEGETNEYLDSIRAQDNHIILQATPGGK------NSN------RLALALGVSCKSINGTV 230
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
KV G+ L++ A + + +P + ++ + S + L
Sbjct: 231 --------KVVGN---CLIVNAEECIIAIGAHTTYRSYNPDASALRDVNSALREPWETLV 279
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH DY +LF + ++++ + VP+ ER+ Q++ DP +V L
Sbjct: 280 SRHRRDYGRLFGKTALRM-------------WPDASHVPTEERI---QSNRDPGVVALYH 323
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+GRYLLISSSR + A LQGIWN +P W S +NINL+MNYW + PCNL EC
Sbjct: 324 NYGRYLLISSSRKSAKALPATLQGIWNPSFAPPWGSKFTININLQMNYWPAAPCNLIECA 383
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PL D + ++ G +TA++ Y GW HH TDIWA + + LWP+GG WLC
Sbjct: 384 IPLIDHIERMAEKGKRTAKMMYNCRGWCAHHNTDIWADTDPQDRWMPATLWPLGGVWLCI 443
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDG 531
+ + Y D L R PLLEGC FLLD+LI G YL T+PS SPE+ FI+ G
Sbjct: 444 DVVKMLIYQYDH-MLHIRIAPLLEGCIQFLLDFLIPSACGKYLVTSPSLSPENSFISESG 502
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ S MDM I+R + I + +L K E L + V+ +L +L P +I + G I
Sbjct: 503 ETGTFCEGSVMDMTIVRIALESFIWSTSILNK-EHPLQKDVMATLGKLPPFRINKSGLIQ 561
Query: 592 EWA-QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSI 647
EW +D K+ E HRH+SHLFGL+P I+++ +P L +AA KTL +R E G GWS
Sbjct: 562 EWGLKDHKEAEPGHRHVSHLFGLYPDDFISLDSSPALVEAARKTLARRAEHGGGHTGWSR 621
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
W L+ARL + D + + N+ HPPFQID NFG A
Sbjct: 622 AWLLNLYARLREPLKC-----------DEHMDLLLKTSTLPNMLDNHPPFQIDGNFGGCA 670
Query: 708 AVAEMLVQSTLND---------LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
V E L+QS L +YLLP+LP WS+G + ++ GG VS+ W++G L
Sbjct: 671 GVTECLIQSNLRPDELSSQVVMIYLLPSLP-SSWSNGKLSNIRVMGGWLVSLEWREGQLT 729
Query: 759 EVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGK 792
E + + N+ ++ + G V V S G+
Sbjct: 730 EPLLLESTVNHAPNAL-VVFPNGKRVSVIKSKGQ 762
>gi|255693982|ref|ZP_05417657.1| fibronectin type III domain protein [Bacteroides finegoldii DSM
17565]
gi|260620198|gb|EEX43069.1| hypothetical protein BACFIN_09249 [Bacteroides finegoldii DSM
17565]
Length = 820
Score = 419 bits (1076), Expect = e-114, Method: Compositional matrix adjust.
Identities = 274/809 (33%), Positives = 408/809 (50%), Gaps = 90/809 (11%)
Query: 18 NGPAKHFTDA-IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP------GDYTNPDAPK 70
N P K + ++ +PIGNG LGA + G + +E + LNE TLW G P G Y N +
Sbjct: 53 NNPDKAWENSSLPIGNGSLGANILGSISAERITLNEKTLWKGGPNTAKGAGYYWNVNKQS 112
Query: 71 A--LSDVRSLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSH 116
A L D+R G +A + + F A+ + +G++ +E S
Sbjct: 113 ANILKDIRQAFLDGNKEKAARLTQENFNGLAEYEERDETPFRFGSFTTMGELYIETGLSE 172
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS 176
+ + Y R L L++A A V++ E+ R++F S PD V+V K + ++ G + +S
Sbjct: 173 INM--KNYHRILSLDSAMAVVQFDKDGTEYQRKYFISYPDSVMVMKFTANKKGKQNLVLS 230
Query: 177 LDSLLDNHSYV--NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
+ SY+ +GNN + G N N + A+ +G I
Sbjct: 231 YCPNSEAESYLSADGNNGLGYTGVL---------NNNKMKFAFRIKAL-------HKGGI 274
Query: 235 SALEDKKLKVEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLS 289
E+ ++ V+ +D V LL A + +F+ F +P KDP +++ + +
Sbjct: 275 LKTENSRIIVKDADEVVFLLTADTDYKINFNPDFNDPQTYVGKDPEQTTLAMMNNALEKG 334
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPS 348
Y L H DY LF+RV +Q++ E +P+ +R+ +++ D
Sbjct: 335 YDKLIRNHKTDYTALFNRVQLQIN-----------PEAGTPDLPTYKRLDNYRKGVPDYQ 383
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L +L +QFGRYLLI+SSRPG ANLQG+W+ +L W H NIN++MNYW + NL
Sbjct: 384 LEQLYYQFGRYLLIASSRPGNMPANLQGLWHNNLDGPWRVDYHNNINIQMNYWPACSANL 443
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-WALWPMGG 467
SEC PL DF+ L G KTAQ + A GW +I+ ++ K + W L P+ G
Sbjct: 444 SECTWPLIDFIRSLVKPGEKTAQSYFNARGWTASISANIFGFTAPLSSKSMEWNLNPIVG 503
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
WL TH+WE+Y+YT D+ FL + Y L++ A F +D L DG PSTSPEH
Sbjct: 504 PWLATHIWEYYDYTRDKRFLSEIGYELIKSSAQFTVDHLWHKPDGTYTAAPSTSPEH--- 560
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIA 585
V T A++RE+ I A++VL ++ E E +L +L P +I
Sbjct: 561 ------GPVDEGVTFAHAVVREILLDAIQASKVLGVDRKERRQWENIL---AKLVPYRIG 611
Query: 586 EDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGW 645
G ++EW+ D DP+ HRH++HLFGL PGHTI+ P+L KAA+ L+ RG+ G GW
Sbjct: 612 RYGQLLEWSTDIDDPKDEHRHVNHLFGLHPGHTISPLTTPELAKAAKVVLEHRGDGGTGW 671
Query: 646 SITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
S+ WK WARL D HAY++ L + G NL+ +H PFQID NFG
Sbjct: 672 SMGWKLNQWARLQDGNHAYKLYNNLLS-----------NGTLDNLWDSHAPFQIDGNFGG 720
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
TA + EML+QS + LLPALP D W++G + G+ A+G +SI WK G L + I S
Sbjct: 721 TAGITEMLLQSHTGTIQLLPALP-DAWTNGSISGICAKGNYEISILWKKGRLEKACILSK 779
Query: 766 YSNNDHDSFKTLHYRGTSVKVNLSAGKIY 794
TL Y+ +++ + G+ Y
Sbjct: 780 SGGP-----CTLRYKDSTLTLKTVKGRKY 803
>gi|53715738|ref|YP_101730.1| hypothetical protein BF4459 [Bacteroides fragilis YCH46]
gi|60683673|ref|YP_213817.1| hypothetical protein BF4255 [Bacteroides fragilis NCTC 9343]
gi|336411650|ref|ZP_08592113.1| hypothetical protein HMPREF1018_04131 [Bacteroides sp. 2_1_56FAA]
gi|375360504|ref|YP_005113276.1| hypothetical protein BF638R_4337 [Bacteroides fragilis 638R]
gi|383119758|ref|ZP_09940496.1| hypothetical protein BSHG_3421 [Bacteroides sp. 3_2_5]
gi|423252289|ref|ZP_17233283.1| hypothetical protein HMPREF1066_04293 [Bacteroides fragilis
CL03T00C08]
gi|423252862|ref|ZP_17233793.1| hypothetical protein HMPREF1067_00437 [Bacteroides fragilis
CL03T12C07]
gi|52218603|dbj|BAD51196.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|60495107|emb|CAH09926.1| conserved hypothetical exported protein [Bacteroides fragilis NCTC
9343]
gi|251944620|gb|EES85095.1| hypothetical protein BSHG_3421 [Bacteroides sp. 3_2_5]
gi|301165185|emb|CBW24755.1| conserved hypothetical exported protein [Bacteroides fragilis 638R]
gi|335941084|gb|EGN02944.1| hypothetical protein HMPREF1018_04131 [Bacteroides sp. 2_1_56FAA]
gi|392647562|gb|EIY41261.1| hypothetical protein HMPREF1066_04293 [Bacteroides fragilis
CL03T00C08]
gi|392659231|gb|EIY52857.1| hypothetical protein HMPREF1067_00437 [Bacteroides fragilis
CL03T12C07]
Length = 829
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 263/801 (32%), Positives = 400/801 (49%), Gaps = 89/801 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P + +P + + + + Y L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDSKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP+ HRH++HLFGL PGHT++ PDL KAA L+ RG+ GWS+ WK
Sbjct: 634 SKDIDDPKDEHRHVNHLFGLHPGHTLSPITTPDLAKAARVVLEHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY++ L + G NL+ HPPFQID NFG TA + EML
Sbjct: 694 WARLQDGNHAYKLFGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + + LLPALP D W G + G+ A+G V + WK+G L E I+S
Sbjct: 743 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDLSWKNGQLAEATIFSKAGEP---- 797
Query: 774 FKTLHYRGTSVKVNLSAGKIY 794
T+ Y ++ S GK+Y
Sbjct: 798 -CTVRYGDKTLSFKTSKGKVY 817
>gi|423214184|ref|ZP_17200712.1| hypothetical protein HMPREF1074_02244 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693129|gb|EIY86364.1| hypothetical protein HMPREF1074_02244 [Bacteroides xylanisolvens
CL03T12C04]
Length = 850
Score = 418 bits (1075), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/806 (33%), Positives = 409/806 (50%), Gaps = 89/806 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 95 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 154
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 155 QAFMEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 213
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 214 RILSLDSAMAVVQFKKDYVAYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 273
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + N ++ +A+ D G+++ ++ I+ GT+S D KL
Sbjct: 274 NMASDSNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLSN-ADGKLT 317
Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHL 298
V+G+D V + A + +FD F +P P + + + + Y+ L+++H
Sbjct: 318 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVKPEETTKEWMNNAVSQGYTALFSQHY 377
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
+DY LF+RV + L+ + K +P+ +R+K+++ + D L EL FQFG
Sbjct: 378 NDYAALFNRVKLNLNPAIKG-----------KNMPTPQRLKNYRAGQPDYDLEELYFQFG 426
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL D
Sbjct: 427 RYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECMLPLVD 486
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWE 476
F+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH+WE
Sbjct: 487 FIHTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIWE 546
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH +
Sbjct: 547 YYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GPI 597
Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
+T A++RE+ I A++VL +K E E VL +L P KI G +MEW+
Sbjct: 598 DQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLANL---VPYKIGRYGQLMEWS 654
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK W
Sbjct: 655 VDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKLNQW 714
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARLHD HAY + L + G NL+ H PFQID NFG TA + EML+
Sbjct: 715 ARLHDGNHAYTLFGNL-----------LKNGTLDNLWDTHSPFQIDGNFGGTAGITEMLL 763
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN----- 769
QS + + LLPALP D W G V G+ A+G V++ W++ L E ++SN N
Sbjct: 764 QSHMGFIQLLPALP-DAWKEGSVSGICAKGNFEVAMVWENNQLKEAVVHSNAGGNCVIKY 822
Query: 770 --DHDSFKTLHYRGTSVKVNLSAGKI 793
SFKT+ R V+ +++ G I
Sbjct: 823 ADKTLSFKTVKGRSYRVEYDVTKGLI 848
>gi|336404392|ref|ZP_08585089.1| hypothetical protein HMPREF0127_02402 [Bacteroides sp. 1_1_30]
gi|335943224|gb|EGN05065.1| hypothetical protein HMPREF0127_02402 [Bacteroides sp. 1_1_30]
Length = 850
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 273/813 (33%), Positives = 413/813 (50%), Gaps = 103/813 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 95 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 154
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 155 QAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 213
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 214 RILSLDSAMAVVQFKKDHVAYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 273
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + N ++ +A+ D GI++ ++ I+ GT+S D KL
Sbjct: 274 NMASDSNKGLVY-------------SASLDNNGIKY--VVRIQAETKGGTLSN-ADGKLT 317
Query: 244 VEGSDWAVLLLVASS----SFDGPF--------INPSDSKKDPTSESMSALQSIRNLSYS 291
V+G+D V + A + +FD F +NP ++ K+ + ++S Y+
Sbjct: 318 VKGADEVVFYITADTDYKPNFDPDFKEPKTYVGVNPEETTKEWMNNAVSQ-------GYT 370
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLV 350
L+++H +DY LF+RV + L+ + K +P+ +R+K+++ + D L
Sbjct: 371 ALFSQHYNDYAALFNRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYDLE 419
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
EL FQFGRYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+E
Sbjct: 420 ELYFQFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNE 479
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAW 469
C PL DF+ L G KTA+ + A GW +I+ ++ + + W PM G W
Sbjct: 480 CMLPLVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPW 539
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
L TH+WE+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 540 LATHIWEYYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH----- 594
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAED 587
+ +T A++RE+ I A++VL +K E E VL + L P KI
Sbjct: 595 ----GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRY 647
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
G +MEW+ D DP+ HRH++HLFG+ PGHT++ P+L KAA+ L RG+ GW++
Sbjct: 648 GQLMEWSVDIDDPKDEHRHVNHLFGVHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWNM 707
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
WK WARLHD HAY + L + G NL+ H PFQID NFG TA
Sbjct: 708 GWKLNQWARLHDGNHAYTLFGNL-----------LKNGTLDNLWDTHSPFQIDGNFGGTA 756
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
+ EML+QS + + LLPALP D W G V G+ A+G V + W++ L E ++SN
Sbjct: 757 GITEMLLQSHMGFIQLLPALP-DAWKEGSVSGICAKGNFEVDMVWENNQLKEAVVHSNAG 815
Query: 768 NN-------DHDSFKTLHYRGTSVKVNLSAGKI 793
N SFKT+ R ++ +++ G I
Sbjct: 816 GNCVIKYADKTLSFKTVKGRSYRIEYDVTKGLI 848
>gi|423271952|ref|ZP_17250921.1| hypothetical protein HMPREF1079_04003 [Bacteroides fragilis
CL05T00C42]
gi|423276043|ref|ZP_17254986.1| hypothetical protein HMPREF1080_03639 [Bacteroides fragilis
CL05T12C13]
gi|392696307|gb|EIY89503.1| hypothetical protein HMPREF1079_04003 [Bacteroides fragilis
CL05T00C42]
gi|392699548|gb|EIY92724.1| hypothetical protein HMPREF1080_03639 [Bacteroides fragilis
CL05T12C13]
Length = 829
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 263/801 (32%), Positives = 400/801 (49%), Gaps = 89/801 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P + +P + + + + Y L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDGKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP+ HRH++HLFGL PGHT++ PDL KAA L+ RG+ GWS+ WK
Sbjct: 634 SKDIDDPKDEHRHVNHLFGLHPGHTLSPITTPDLAKAARVVLEHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY++ L + G NL+ HPPFQID NFG TA + EML
Sbjct: 694 WARLQDGNHAYKLFGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + + LLPALP D W G + G+ A+G V + WK+G L E I+S
Sbjct: 743 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDLSWKNGQLAEATIFSKAGEP---- 797
Query: 774 FKTLHYRGTSVKVNLSAGKIY 794
T+ Y ++ S GK+Y
Sbjct: 798 -CTVRYGDKTLSFKTSKGKVY 817
>gi|293373575|ref|ZP_06619926.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292631473|gb|EFF50100.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 815
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 262/768 (34%), Positives = 392/768 (51%), Gaps = 82/768 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQD 596
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW+ D
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEWSTD 622
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
DP+ HRH++HLFGL PGHTI+ P+L +AA L+ RG+ GWS+ WK WAR
Sbjct: 623 IDDPKDEHRHVNHLFGLHPGHTISPVTTPELAQAARVVLEHRGDGATGWSMGWKLNQWAR 682
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY++ L + G NL+ H PFQID NFG TA + EML+QS
Sbjct: 683 LQDGNHAYKLYGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGITEMLLQS 731
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + LLPALP D W++G + G+ A+G VSI WK+G L +V I+S
Sbjct: 732 HMGFIQLLPALP-DAWANGSISGICAKGNFEVSISWKEGQLEKVIIHS 778
>gi|383115161|ref|ZP_09935919.1| hypothetical protein BSGG_2959 [Bacteroides sp. D2]
gi|313695424|gb|EFS32259.1| hypothetical protein BSGG_2959 [Bacteroides sp. D2]
Length = 829
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 271/806 (33%), Positives = 409/806 (50%), Gaps = 93/806 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+KS++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKSYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK
Sbjct: 633 SVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKLNQ 692
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 693 WARLQDGNHAYTLFGNL-----------LKNGTMDNLWDTHPPFQIDGNFGGTAGITEML 741
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN----- 768
+QS + + LLPALP D W G + G+ A+G V + W++ L E + SN
Sbjct: 742 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDVIWENHQLKEAVVRSNAGGDCVIK 800
Query: 769 --NDHDSFKTLHYRGTSVKVNLSAGK 792
+ SFKT+ +G S ++ A K
Sbjct: 801 YADQTISFKTV--KGRSYQIGYDAAK 824
>gi|423282784|ref|ZP_17261669.1| hypothetical protein HMPREF1204_01207 [Bacteroides fragilis HMW
615]
gi|404581655|gb|EKA86351.1| hypothetical protein HMPREF1204_01207 [Bacteroides fragilis HMW
615]
Length = 829
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 263/801 (32%), Positives = 400/801 (49%), Gaps = 89/801 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P + +P + + + + Y L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDGKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP+ HRH++HLFGL PGHT++ PDL KAA L+ RG+ GWS+ WK
Sbjct: 634 SKDIDDPKDEHRHVNHLFGLHPGHTLSPITTPDLAKAARVVLEHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY++ L + G NL+ HPPFQID NFG TA + EML
Sbjct: 694 WARLQDGNHAYKLFGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + + LLPALP D W G + G+ A+G V + WK+G L E I+S
Sbjct: 743 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDLSWKNGQLAEATIFSKAGEP---- 797
Query: 774 FKTLHYRGTSVKVNLSAGKIY 794
T+ Y ++ S GK+Y
Sbjct: 798 -CTVRYGDKTLSFKTSKGKVY 817
>gi|423259841|ref|ZP_17240764.1| hypothetical protein HMPREF1055_03041 [Bacteroides fragilis
CL07T00C01]
gi|423267496|ref|ZP_17246477.1| hypothetical protein HMPREF1056_04164 [Bacteroides fragilis
CL07T12C05]
gi|387775879|gb|EIK37983.1| hypothetical protein HMPREF1055_03041 [Bacteroides fragilis
CL07T00C01]
gi|392696970|gb|EIY90157.1| hypothetical protein HMPREF1056_04164 [Bacteroides fragilis
CL07T12C05]
Length = 829
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 263/801 (32%), Positives = 400/801 (49%), Gaps = 89/801 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++PIGNG +GA + G + +E + NE TLW G P DA L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSIEAERITFNEKTLWRGGPNTSRGADAYWNVNKQSAHVLKEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + ++ Y
Sbjct: 135 QAFTDGDQKKAEMLTRKNFNSEVPYESNREKPFRFGNFTTMGEFYIETGLSTVNMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R++F S P V+ + G +L+F+ S + +
Sbjct: 193 KRILSLDSALAVVQFKKDDVAYERDYFISYPANVMAIRFKADRPGKQNLTFSYSPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G N + A+ D G+Q+ ++ I GT+S + K+
Sbjct: 253 GSMSADGANGLAY-------------TAHLDNNGMQY--VVRIHAIAKGGTLSN-ANGKI 296
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINP-SDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V L+ A + +FD F +P + +P + + + + Y L+ +H
Sbjct: 297 TVKDADEVVFLVTADTDYKINFDPDFKDPKAYVGVNPAETTRQWMDNAVAMGYDVLFKQH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DDY LF+RV +QL+ + +P+ +R+++++ + D L EL +QF
Sbjct: 357 YDDYAALFNRVKLQLNPDAQSA-----------NLPTGKRLQNYRKGQPDFYLEELYYQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLYECTLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + GW +I+ ++ + ++ W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGQKTAQAYFGTRGWTASISANIFGFTTPLESEEMAWNFNPMAGPWLATHVW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D+ FL++ Y L++ A F D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDKKFLKETGYDLIKSSAQFATDFLWRKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL + E ++VL L P K+ G +MEW
Sbjct: 577 IDEGTTFVHAVIREILQDAIEASKVLGVDSKERKQWQEVLT---HLAPYKVGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP+ HRH++HLFGL PGHT++ PDL KAA L+ RG+ GWS+ WK
Sbjct: 634 SKDIDDPKDKHRHVNHLFGLHPGHTLSPITTPDLAKAARVVLEHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY++ L + G NL+ HPPFQID NFG TA + EML
Sbjct: 694 WARLQDGNHAYKLFGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + + LLPALP D W G + G+ A+G V + WK+G L E I+S
Sbjct: 743 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDLSWKNGQLAEAIIFSKAGEP---- 797
Query: 774 FKTLHYRGTSVKVNLSAGKIY 794
T+ Y ++ S GK+Y
Sbjct: 798 -CTVRYGDKTLSFKTSKGKVY 817
>gi|342884136|gb|EGU84463.1| hypothetical protein FOXB_05018 [Fusarium oxysporum Fo5176]
Length = 767
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 267/779 (34%), Positives = 399/779 (51%), Gaps = 71/779 (9%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP 60
M ES+ T + + + PA +++A+PIGNGRLGAMV+G +E L+LNED++W G P
Sbjct: 1 MDEGESSDTDKGMLLHYAAPASSWSEALPIGNGRLGAMVYGRTSTELLQLNEDSVWYGGP 60
Query: 61 GDYTNPDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHL 117
D T DA L+ +R L+ ++ +A F P+ + Y+ LG ++EFD H
Sbjct: 61 QDRTPRDAHSHLATLRQLIRDEKHKDAEDLVKEAFFATPSSMRHYEPLGQCKIEFD--HD 118
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
+ Y R LDLNT+ +Y + R+ +S PD V+ ++ SE F V L
Sbjct: 119 ESEVTDYTRYLDLNTSQVTTRYKCDGRSYRRDIIASFPDSVLAVQVQASEKSR--FVVRL 176
Query: 178 DSLLDNHSYVNG--NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
+ +N N ++ + R IP AN+N + S +L + GT+
Sbjct: 177 NRQSENEGETNEYLDSIFAQDSRIILNAIPGGANSN------RLSLVLGVSCGPGDGTVK 230
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
A+ + + + V+ + A ++F K+DP ++ + + L
Sbjct: 231 AVGN--CLIVNATKCVIAIGAHTTF---------RKEDPERSALLNVDDALRRPWDVLVR 279
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
RH DY LF R+S++L + + +P+ +R+ S + DP LV L
Sbjct: 280 RHRSDYTNLFGRMSLRLF-------------PDANHLPTNKRIVS---NRDPGLVALYHN 323
Query: 356 FGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
+GRYLLISSSR + A LQGIWN SP W S +NINL+MNYW ++PC+L +C
Sbjct: 324 YGRYLLISSSRNSDKALPATLQGIWNPSFSPPWGSKFTININLQMNYWPAIPCSLIQCAI 383
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PL + L ++ G +TA++ Y GW HH TDIWA + + +WP+GGAWLCT
Sbjct: 384 PLINLLERMAERGKRTAKMMYNCKGWCAHHNTDIWADTDPQDRWMPATIWPLGGAWLCTD 443
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGK 532
+ Y + L R P+LEGC FLLD+LI G YL TNPS SPE+ F++ G+
Sbjct: 444 VVRMLIYQYE-PTLHCRIAPILEGCVQFLLDFLIPSACGRYLVTNPSLSPENSFVSQSGE 502
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
S +DM I+R + + + +L+ + + + +L +L P + +DG I E
Sbjct: 503 TGIFCEGSVIDMTIVRIALESFLWSISILDPDHPRRNDAI-AALDKLPPMSLNKDGLIQE 561
Query: 593 WA-QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSIT 648
W ++ K+ E HRH+SHLFGL+P +I+++ +P L KAA+K L +R E G GWS
Sbjct: 562 WGLKNHKEAEPGHRHVSHLFGLYPDDSISMDSSPLLIKAAKKVLARRAEHGGGHTGWSRA 621
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
W L ARL D E + L + N+ HPPFQID NFG A
Sbjct: 622 WLLNLHARLRDSEGCENHMDLL-----------LKTSTLPNMLDNHPPFQIDGNFGGCAG 670
Query: 709 VAEMLVQSTLND--------LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
+ E LVQSTL ++LLP+LP W+ G + ++A GG VS+ WK+G + E
Sbjct: 671 ILECLVQSTLRSEPSRQVVVIHLLPSLP-SSWAGGKLTHVRAMGGWLVSLEWKEGKVIE 728
>gi|212692624|ref|ZP_03300752.1| hypothetical protein BACDOR_02121 [Bacteroides dorei DSM 17855]
gi|212664909|gb|EEB25481.1| hypothetical protein BACDOR_02121 [Bacteroides dorei DSM 17855]
Length = 818
Score = 417 bits (1073), Expect = e-113, Method: Compositional matrix adjust.
Identities = 274/810 (33%), Positives = 401/810 (49%), Gaps = 94/810 (11%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFT---------DAIPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ +++PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPVTDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNRLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAIDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A++ L + + + VL L P +I G +MEW+ D DP+ HRH++HLFGL
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEWSTDIDDPKDEHRHVNHLFGLH 641
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PGHT++ P+L AA+ L+ RG+ GWS+ WK WARL D HAY++ L
Sbjct: 642 PGHTLSPIMTPELTHAAKVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNL---- 697
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W
Sbjct: 698 -------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKE 749
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
G VKGL A+G ++I W+DG L E I S
Sbjct: 750 GSVKGLCAKGNFEINITWQDGKLKEAVILS 779
>gi|408387708|gb|EKJ67420.1| hypothetical protein FPSE_12405 [Fusarium pseudograminearum CS3096]
Length = 768
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 271/818 (33%), Positives = 415/818 (50%), Gaps = 89/818 (10%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
+E +T L + + PA +++A+PIGNGRLGAMV+G +E L+LNED++W G P D
Sbjct: 5 SEKANTDKSLLLHYAAPASSWSEALPIGNGRLGAMVYGRASTELLQLNEDSVWYGGPQDR 64
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYA 120
T DA L+ +R L+ ++ +A A A F PA + Y+ LG +EF H +
Sbjct: 65 TPRDAYSNLATLRQLIRDEKHKDAEALAREAFFATPASMRHYEPLGQCTIEF--GHDERI 122
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
Y+R LDL T+ + KY V + R+ +S P+ V+ + S ++ S
Sbjct: 123 VSDYKRHLDLATSQSTTKYDYEGVTYRRDVIASFPNNVLAIRFQASAPTRFVVRLNRQSE 182
Query: 181 LDNHS--YVNG----NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
++ + Y++ +N II++ GK N+N + + L + + G +
Sbjct: 183 VEGETNEYLDSIRAQDNHIILQATPGGK------NSN------RLALALGVSCKSNNGNV 230
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ + + ++ ++ + A +++ +P + ++ + S + +L
Sbjct: 231 KVVGN--CLIVNTEECIIAIGAHTTY---------RSYNPDASALRDVNSALREPWENLV 279
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH DY +LF + ++++ + VP+ ER+ Q++ DP L+ L
Sbjct: 280 SRHRQDYGRLFSKTALRM-------------WPDASHVPTDERI---QSNRDPGLIALYH 323
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+ RYLLISSSR + A LQGIWN +P W S +NINL+MNYW + CNL EC
Sbjct: 324 NYSRYLLISSSRKSAKALPATLQGIWNPSFAPPWGSKFTININLQMNYWPAASCNLIECA 383
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PL D + ++ G +TA+V Y GW HH TDIWA + + LWP+GG WLC
Sbjct: 384 VPLIDHIERMAQKGKRTAKVMYNCRGWCAHHNTDIWADTDPQDRWMPATLWPLGGVWLCI 443
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDG 531
+ + Y D L R PLLEGC FLLD+LI G YL TNPS SPE+ FI+ G
Sbjct: 444 DVVKMLIYQYDH-MLHIRIAPLLEGCIQFLLDFLIPSACGKYLVTNPSLSPENSFISESG 502
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ S MDM I+R + I + +L K E L + V+ +L +L P +I + G I
Sbjct: 503 ETGTFCEGSVMDMTIVRIALESFIWSTSILNK-EHPLQKDVMATLGKLPPFRINKSGLIQ 561
Query: 592 EWA-QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSI 647
EW +D K+ E HRH+SHLFGL+P I+++ +P L +AA KTL +R E G GWS
Sbjct: 562 EWGLKDHKEAEPGHRHVSHLFGLYPDDFISLDSSPALVEAARKTLARRAEHGGGHTGWSR 621
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYS----NLFAAHPPFQIDANF 703
W L+ARL + P+ ++H + L + N+ HPPFQID NF
Sbjct: 622 AWLLNLYARLRE---------------PPKCDEHMDMLLKTSALPNMLDNHPPFQIDGNF 666
Query: 704 GFTAAVAEMLVQSTLND---------LYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
G A V E L+QS L ++LLP+LP WS+G + ++ GG VS+ W++
Sbjct: 667 GGCAGVTECLIQSNLRPDELSSQVVMIHLLPSLP-SSWSNGKLTNIRVMGGWLVSLEWRE 725
Query: 755 GDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGK 792
G L E + + N+ ++ G V V S G+
Sbjct: 726 GQLTEPLLLESTVNHAPNALAVFP-NGKRVSVIKSKGQ 762
>gi|265752589|ref|ZP_06088158.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|263235775|gb|EEZ21270.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
Length = 818
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 274/810 (33%), Positives = 401/810 (49%), Gaps = 94/810 (11%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFT---------DAIPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ +++PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPVTDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEIGIT--NYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNRLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L +G PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAIDYLWHKPEGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A++ L + + + VLK L P +I G +MEW+ D DP+ HRH++HLFGL
Sbjct: 585 ASKALGVDSKDRKQWQYVLK---HLVPYQIGRYGQLMEWSTDIDDPKDEHRHVNHLFGLH 641
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PGHT++ P+L AA+ L+ RG+ GWS+ WK WARL D HAY++ L
Sbjct: 642 PGHTLSPITTPELTNAAKVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNL---- 697
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W
Sbjct: 698 -------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKE 749
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
G VKGL A+G + I W+DG L E I S
Sbjct: 750 GSVKGLCAKGNFEIDITWQDGKLKEAVILS 779
>gi|299144684|ref|ZP_07037752.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298515175|gb|EFI39056.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 829
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 270/806 (33%), Positives = 410/806 (50%), Gaps = 93/806 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F++D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFVVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK
Sbjct: 633 SVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKLNQ 692
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 693 WARLQDGNHAYTLFGNL-----------LKNGTMDNLWDTHPPFQIDGNFGGTAGIIEML 741
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN----- 768
+QS + + LLPALP D W G + G+ A+G V + W++ L E + SN
Sbjct: 742 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDVIWENHQLKEAVVRSNAGGDCVIK 800
Query: 769 --NDHDSFKTLHYRGTSVKVNLSAGK 792
+ SFKT+ +G S ++ A K
Sbjct: 801 YADQTISFKTV--KGRSYQIGYDAAK 824
>gi|423230473|ref|ZP_17216877.1| hypothetical protein HMPREF1063_02697 [Bacteroides dorei
CL02T00C15]
gi|423240882|ref|ZP_17221996.1| hypothetical protein HMPREF1065_02619 [Bacteroides dorei
CL03T12C01]
gi|423244182|ref|ZP_17225257.1| hypothetical protein HMPREF1064_01463 [Bacteroides dorei
CL02T12C06]
gi|392630838|gb|EIY24820.1| hypothetical protein HMPREF1063_02697 [Bacteroides dorei
CL02T00C15]
gi|392642736|gb|EIY36499.1| hypothetical protein HMPREF1064_01463 [Bacteroides dorei
CL02T12C06]
gi|392643844|gb|EIY37593.1| hypothetical protein HMPREF1065_02619 [Bacteroides dorei
CL03T12C01]
Length = 818
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 275/810 (33%), Positives = 400/810 (49%), Gaps = 94/810 (11%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFTDA---------IPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ A +PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPATDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N+++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNRLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAIDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A++ L + + + VL L P +I G +MEW+ D DP+ HRH++HLFGL
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEWSTDIDDPKDEHRHVNHLFGLH 641
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PGHT++ P+L AA+ L+ RG+ GWS+ WK WARL D HAY++ L
Sbjct: 642 PGHTLSPIMTPELTHAAKVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNL---- 697
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W
Sbjct: 698 -------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKE 749
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
G VKGL A+G ++I W+DG L E I S
Sbjct: 750 GSVKGLCAKGNFEINITWQDGKLKEAVILS 779
>gi|317505590|ref|ZP_07963500.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
gi|315663302|gb|EFV03059.1| alpha-L-fucosidase [Prevotella salivae DSM 15606]
Length = 828
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 267/804 (33%), Positives = 406/804 (50%), Gaps = 89/804 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA--------PKALSDVR 76
+ ++P+GNG LGA + G + +E + NE TLW G P DA L+++R
Sbjct: 72 SQSLPLGNGSLGANIMGSIEAERITFNEKTLWRGGPNTSAGADAYWNVNKQSAHYLNEIR 131
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + K F + +G+ +E S + +E Y
Sbjct: 132 QAFIEGDEKKAALLTRKNFNSTVPYESWKENPFRFGNFTTMGEFYIETGLSSIGMSE--Y 189
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R +F S P+ V+V + + G +L F+ + +
Sbjct: 190 KRALSLDSALATVQFKKDGVRYERNYFISYPNNVMVVRFKADQPGKQNLVFSYESNPVST 249
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G+N ++ KA+ +++ Q ++ I+ + GTIS ++ KL
Sbjct: 250 GKMEADGSNGLVF-----------KAHLDNN----QMEYVVRIQALNQGGTISN-DNGKL 293
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRH 297
+ G++ V L+ A + +F+ F NP SE+ +A ++ Y L H
Sbjct: 294 SINGANEVVFLITADTDYKVNFNPDFKNPRAYVGVNPSETTAAWMKKAVAQGYDALLQVH 353
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQF 356
DY LF+RVS+ L+ K +P+ +R+ +++ ED L EL +QF
Sbjct: 354 YKDYASLFNRVSLTLNDGQK-----------TQDIPTPQRLINYRKGKEDYYLEELYYQF 402
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NLSEC PL
Sbjct: 403 GRYLLIASSRPGNLPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPAGSTNLSECTLPLI 462
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 463 DFIRTLVKPGEKTAKAYFGARGWTASISGNIFGFTAPLESEDMSWNFNPMAGPWLATHVW 522
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
++Y+YT D+ FL++ Y L++ A F +D+L + DG PSTSPEH
Sbjct: 523 DYYDYTRDKKFLKEVGYDLIKSSAIFAVDYLWKKPDGTYTAAPSTSPEH---------GP 573
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++VL +K E E+VL+ ++ P K+ G ++EW
Sbjct: 574 IDEGTTFVHAVIREILMNAIDASKVLNVDKKERKQWEEVLR---KIAPYKVGRYGQLLEW 630
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
++D DP HRH++HLFGL PGHT++ P L +A++ L RG+ GWS+ WK
Sbjct: 631 SKDIDDPNDQHRHVNHLFGLHPGHTVSPITTPALAEASKVVLNHRGDGATGWSMGWKLNQ 690
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARLHD AY++ L + G NL+ HPPFQID NFG TA V EML
Sbjct: 691 WARLHDGNRAYKLFGNL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGVTEML 739
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+QS + ++LLPALP D W G V+GL A+G + I WK+G L V + S N
Sbjct: 740 MQSHMGFIHLLPALP-DAWKDGEVRGLCAKGNFELDIRWKNGSLSSVTVLSKDGGNCE-- 796
Query: 774 FKTLHYRGTSVKVNLSAGKIYTFN 797
L Y+ + + K YT N
Sbjct: 797 ---LRYKDDKFVLKTNKRKTYTLN 817
>gi|271969414|ref|YP_003343610.1| alpha-L-fucosidase [Streptosporangium roseum DSM 43021]
gi|270512589|gb|ACZ90867.1| Alpha-L-fucosidase [Streptosporangium roseum DSM 43021]
Length = 991
Score = 417 bits (1072), Expect = e-113, Method: Compositional matrix adjust.
Identities = 264/772 (34%), Positives = 407/772 (52%), Gaps = 72/772 (9%)
Query: 3 NAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
+A + T + L + ++ PA ++ T A+PIGNG LGAMV+GGV SE ++ NE TLWTG PG
Sbjct: 9 SAAAVQTPDDLTLWYDKPATNWETQALPIGNGALGAMVFGGVASEQIQFNEKTLWTGGPG 68
Query: 62 -------DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELE 111
++T+P P A+++V++ +D +A + KL G P YQ GD+ L+
Sbjct: 69 SGGYNAGNWTSPR-PNAIAEVQAQIDRDGRMSPSAVTAKL-GQPKSGFGAYQTFGDLWLD 126
Query: 112 FDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
D+ + YRREL L A ARV Y+ G V ++RE+F+S+P VIV +IS S++G +
Sbjct: 127 VPDA--PASPTGYRRELSLREAVARVGYTAGGVTYSREYFASHPGGVIVGRISASQAGKV 184
Query: 172 SFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
SF + S + N ++ + G G++F + +I++
Sbjct: 185 SFTLRTSSPRSDKQVSVANGRLTVRGTLA-------------DNGMRFES--QIQVVTQG 229
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
G+ + D+ + V G+D A+ +L A + + G +P+ DP ++ +A+ + ++
Sbjct: 230 GSRTDGTDR-VTVTGADSAMFVLSAGTDYAG--THPAYRGPDPHAKVTAAVDAAAARTFD 286
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE 351
L T H +DY+KLF RV + L + I TD + +D +L
Sbjct: 287 QLRTAHQNDYRKLFDRVRLDLGQRVPAIPTDRLRAAYTGRASA----------DDRALEA 336
Query: 352 LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
+ F +GRYLLISSSR ANLQG+WN SP W + HVNINL+MNYW + NL+E
Sbjct: 337 MFFAYGRYLLISSSRDEALPANLQGVWNNSTSPPWSADYHVNINLQMNYWLAEQTNLAET 396
Query: 412 QEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWL 470
++ + G KTAQ + + GWV+H++T+ + + D W +P AW+
Sbjct: 397 TVAYDRYIKAMVAPGRKTAQEMFGSRGWVVHNETNPFGFTGVHDWATAFW--FPEAAAWV 454
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAP 529
+++HY + D +L AYP+++G A F LD L + DG L +PS SPE
Sbjct: 455 TQQMYDHYRFNGDTAYLRDTAYPVMKGAAEFWLDNLHADPRDGKLVVSPSYSPEQ----- 509
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDG 588
S ++M I+ +V + + AA L + A +V +L +L R ++ G
Sbjct: 510 ----GDFSAGASMSQQIVFDVLTNSLEAARKLNVDP-AFQAEVTAALAKLDRGIRVGSWG 564
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
+ EW D+ D HRH+SHLF L PG I + P+ AA+ +L RG+ G GWS
Sbjct: 565 QLQEWKSDWDDRANTHRHVSHLFALHPGRQI-VAGTPE-ATAAKVSLTARGDGGTGWSKA 622
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
WK WARL D +H+++M+ + + NL+ HPPFQID NFG T+
Sbjct: 623 WKVNFWARLLDGDHSHKML-----------SEQLKTSTLDNLWDTHPPFQIDGNFGATSG 671
Query: 709 VAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
VAEML+QS + +++LPALP W +G V GL+ARG TV + W++G +
Sbjct: 672 VAEMLLQSQHDTIHVLPALP-SAWPTGSVTGLRARGDVTVDVSWRNGSGERI 722
>gi|298483252|ref|ZP_07001431.1| fibronectin type III domain protein [Bacteroides sp. D22]
gi|298270569|gb|EFI12151.1| fibronectin type III domain protein [Bacteroides sp. D22]
Length = 815
Score = 417 bits (1071), Expect = e-113, Method: Compositional matrix adjust.
Identities = 261/768 (33%), Positives = 392/768 (51%), Gaps = 82/768 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
+G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLNGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFDKDGIRYQRKYFISYPDSVMVMKFAADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQD 596
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW+ D
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLAKLVPYRIGRYGQLLEWSTD 622
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
DP+ HRH++HLFGL PGHTI+ P+L +AA L+ RG+ GWS+ WK WAR
Sbjct: 623 IDDPKDEHRHVNHLFGLHPGHTISPVTTPELAQAARVVLEHRGDGATGWSMGWKLNQWAR 682
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY++ L + G NL+ H PFQID NFG TA + EML+QS
Sbjct: 683 LQDGNHAYKLYGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGITEMLLQS 731
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + LLPALP D W++G + G+ A+G VSI WK+G L + I+S
Sbjct: 732 HMGFIQLLPALP-DAWANGSISGICAKGNFEVSISWKEGQLEKAIIHS 778
>gi|298480149|ref|ZP_06998348.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
gi|298273958|gb|EFI15520.1| alpha-L-fucosidase 2 [Bacteroides sp. D22]
Length = 837
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 271/806 (33%), Positives = 409/806 (50%), Gaps = 89/806 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 82 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 141
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 142 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 200
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 201 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 260
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +GN ++ +A+ D G+++ ++ I+ GT+ D KL
Sbjct: 261 NMASDGNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLFN-ADGKLT 304
Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHL 298
V+G+D V + A + +FD F +P +P + + + + Y+ L+++H
Sbjct: 305 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVNPEETTKEWMNNAVSQGYTALFSQHY 364
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
+DY LF+RV + L+ + K +P+ +R+K+++ + D L EL FQFG
Sbjct: 365 NDYAALFNRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYDLEELYFQFG 413
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL D
Sbjct: 414 RYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECMLPLVD 473
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWE 476
F+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH+WE
Sbjct: 474 FIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIWE 533
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH +
Sbjct: 534 YYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GPI 584
Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
+T A++RE+ I A++VL +K E E VL + L P KI G +MEW+
Sbjct: 585 DQGATFVHAVVREILLDAIEASKVLGIDKKERKQWEHVLAN---LVPYKIGRYGQLMEWS 641
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK W
Sbjct: 642 VDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKLNQW 701
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL D HAY + L + G NL+ H PFQID NFG TA + EML+
Sbjct: 702 ARLQDGNHAYTLFGNL-----------LKNGTLDNLWDTHSPFQIDGNFGGTAGITEMLL 750
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN----- 769
QS + + LLPALP D W G V G+ A+G V + W++ L E ++SN N
Sbjct: 751 QSHMGFIQLLPALP-DAWKEGSVSGICAKGNFEVDMVWENNQLKEAVVHSNAGGNCVIKY 809
Query: 770 --DHDSFKTLHYRGTSVKVNLSAGKI 793
SFKT+ R ++ +++ G I
Sbjct: 810 ADKTLSFKTVKGRSYRIEYDVTKGLI 835
>gi|423293334|ref|ZP_17271461.1| hypothetical protein HMPREF1070_00126 [Bacteroides ovatus
CL03T12C18]
gi|392678277|gb|EIY71685.1| hypothetical protein HMPREF1070_00126 [Bacteroides ovatus
CL03T12C18]
Length = 829
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 269/807 (33%), Positives = 408/807 (50%), Gaps = 91/807 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK
Sbjct: 633 SVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKLNQ 692
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 693 WARLQDGNHAYTLFGNL-----------LKNGTMDNLWDTHPPFQIDGNFGGTAGITEML 741
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN----- 768
+QS + + LLPALP D W G + G+ A+G V + W++ L E + SN
Sbjct: 742 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDVIWENHQLKEAVVRSNAGGDCVIK 800
Query: 769 --NDHDSFKTLHYRGTSVKVNLSAGKI 793
+ SFKT+ R + + + G I
Sbjct: 801 YADQTISFKTVKGRSYQIGYDAAKGLI 827
>gi|293371889|ref|ZP_06618293.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
gi|292633135|gb|EFF51712.1| conserved hypothetical protein [Bacteroides ovatus SD CMC 3f]
Length = 829
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 270/806 (33%), Positives = 409/806 (50%), Gaps = 93/806 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFSSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK
Sbjct: 633 SVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKLNQ 692
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 693 WARLQDGNHAYTLFGNL-----------LKNGTMDNLWDTHPPFQIDGNFGGTAGITEML 741
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN----- 768
+QS + + LLPALP D W G + G+ A+G V + W++ L E + SN
Sbjct: 742 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDVIWENHQLKEAVVRSNAGGDCVIK 800
Query: 769 --NDHDSFKTLHYRGTSVKVNLSAGK 792
+ SFKT+ +G S ++ A K
Sbjct: 801 YADQTISFKTV--KGRSYQIGYDAAK 824
>gi|237718842|ref|ZP_04549323.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229451974|gb|EEO57765.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 829
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 269/807 (33%), Positives = 408/807 (50%), Gaps = 91/807 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK
Sbjct: 633 SVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKLNQ 692
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 693 WARLQDGNHAYTLFGNL-----------LKNGTMDNLWDTHPPFQIDGNFGGTAGITEML 741
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN----- 768
+QS + + LLPALP D W G + G+ A+G V + W++ L E + SN
Sbjct: 742 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDVIWENHQLKEAVVRSNAGGDCVIK 800
Query: 769 --NDHDSFKTLHYRGTSVKVNLSAGKI 793
+ SFKT+ R + + + G I
Sbjct: 801 YADQTISFKTVKGRSYQIGYDAAKGLI 827
>gi|333382100|ref|ZP_08473777.1| hypothetical protein HMPREF9455_01943 [Dysgonomonas gadei ATCC
BAA-286]
gi|332829131|gb|EGK01795.1| hypothetical protein HMPREF9455_01943 [Dysgonomonas gadei ATCC
BAA-286]
Length = 820
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 270/787 (34%), Positives = 421/787 (53%), Gaps = 74/787 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKALSDV 75
+ PA + ++P+GNGR+GAMV+GG+ E + LNE T+W+G P + P L+D+
Sbjct: 47 YENPADEWMKSLPLGNGRIGAMVFGGIEKEVIALNEVTMWSGQPDKFQERPLGKTMLNDI 106
Query: 76 RSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L G+YA+ + H + GD++L+F + A Y+REL+L
Sbjct: 107 RQLFFEGKYAKGNRVVSEFMSGTPHSFGSHVPAGDLKLDF--KYPAGAVSGYKRELNLEN 164
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A V + VGN+ +TRE+F SNPD + +++ +++ SL+ +VSLD L ++ N+
Sbjct: 165 AINTVSFKVGNILYTREYFCSNPDNAFIVRLTANKAKSLTLDVSLDMLRESVIKAVDNSL 224
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
GK PK P G+ F + + D G +SA + K+ + + +
Sbjct: 225 -----EFSGKVSFPK----QGPGGVDFMGKVGVTAKD--GNVSA-SNNKISIADATSVTI 272
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+L + + N K+D + AL Y+ L +H+ DY LF RV + L
Sbjct: 273 ILDLRTDY-----NNKHYKEDCFATVNKALSQ----DYNRLKNKHVSDYSNLFKRVDLFL 323
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV- 371
+S D + T ERVK+ + ED L L FQ+ RYLLI++SR + +
Sbjct: 324 GKSEAD---------KLPTDKRWERVKAGK--EDVGLDALFFQYARYLLIAASREDSPLP 372
Query: 372 ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWN++L+ W + H++IN + NYW S NL EC PLFD++ LS+ G KT
Sbjct: 373 ANLQGIWNDNLACNMGWTNDYHLDINTQQNYWLSNIGNLHECNTPLFDYIKDLSVYGQKT 432
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y A GWV + ++W +++ +G V W L+P+ G W+ +HLW HY YTMD ++L
Sbjct: 433 AKNVYGARGWVANTVANVWGYTASGQG-VNWGLFPLAGTWIASHLWTHYIYTMDENYLRN 491
Query: 490 RAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
+AYP+L+ A FLLD++++ +GYL T PSTSPE+ F +L+ VS D +
Sbjct: 492 KAYPILKSNAEFLLDYMVQDPKNGYLMTGPSTSPENSFRYKGNELS-VSLMPACDRQLAY 550
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
E F++ I A+++L +D + + +L +L P I ++G+I EW +DF++ + +HRH +
Sbjct: 551 EAFASCIQASKILNV-DDKFRDSLSIALKKLPPIIIGKNGAIQEWFEDFEEAQPNHRHTT 609
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS-ITWKTA----LWARLHDQEHA 663
HL L+P I+ K P L AA KT++ R P W + W A L+ARL D + A
Sbjct: 610 HLLALYPFAQISPVKTPGLANAARKTIEYR-LAAPNWEDVEWSRANMICLYARLFDAKKA 668
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP------PFQI---DANFGFTAAVAEMLV 714
Y V +L ++ F NL P P+ I D N A +AEML+
Sbjct: 669 YESVVQL--------QREFT---RENLLTISPEGIAGAPYDIFIFDGNEAGGAGIAEMLI 717
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSF 774
QS + LLPALP +W++G KGL RGG V + WKDG + ++ I + + ++ +F
Sbjct: 718 QSHEGYIELLPALP-QQWNTGYFKGLCIRGGGEVDLKWKDGQVQDIVIKA--ATDNKFTF 774
Query: 775 KTLHYRG 781
K ++ +G
Sbjct: 775 KLVNTKG 781
>gi|345510592|ref|ZP_08790159.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|345454467|gb|EEO49096.2| glycoside hydrolase family 95 [Bacteroides sp. D1]
Length = 850
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 276/809 (34%), Positives = 410/809 (50%), Gaps = 95/809 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 95 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 154
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 155 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 213
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 214 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 273
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +GN ++ +A+ D G+++ ++ I+ GT+ D KL
Sbjct: 274 NMASDGNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLFN-ADGKLT 317
Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYT 295
V+G+D V + A + +FD F +P + ++ T E M+ S R Y+ L++
Sbjct: 318 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVNPEETTKEWMNNAVSQR---YTALFS 374
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
+H +DY LF RV + L+ + K +P+ +R+K+++ + D L EL F
Sbjct: 375 QHYNDYAALFDRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYYLEELYF 423
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW NL+EC P
Sbjct: 424 QFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWSVCSTNLNECMLP 483
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTH 473
L DF+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH
Sbjct: 484 LVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATH 543
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
+WE+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 544 IWEYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH--------- 594
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ +T A++RE+ I A++VL +K E E VL + L P KI G +M
Sbjct: 595 GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRYGQLM 651
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK
Sbjct: 652 EWSVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKL 711
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WARL D HAY + L + G NL+ H PFQID NFG TA + E
Sbjct: 712 NQWARLQDGNHAYTLFGNL-----------LKNGTLDNLWDTHSPFQIDGNFGGTAGITE 760
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN-- 769
ML+QS + + LLPALP D W G V G+ A+G V + W++ L E ++SN N
Sbjct: 761 MLLQSHMGFIQLLPALP-DAWKEGSVSGICAKGNFEVDMVWENNQLKEAVVHSNAGGNCV 819
Query: 770 -----DHDSFKTLHYRGTSVKVNLSAGKI 793
SFKT+ R V+ +++ G I
Sbjct: 820 IKYADKTLSFKTVKGRSYRVEYDVTKGLI 848
>gi|262406087|ref|ZP_06082637.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|294648155|ref|ZP_06725698.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294809712|ref|ZP_06768400.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|262356962|gb|EEZ06052.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|292636539|gb|EFF55014.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294443087|gb|EFG11866.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 830
Score = 416 bits (1069), Expect = e-113, Method: Compositional matrix adjust.
Identities = 276/809 (34%), Positives = 410/809 (50%), Gaps = 95/809 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 75 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 135 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 193
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 194 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 253
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +GN ++ +A+ D G+++ ++ I+ GT+ D KL
Sbjct: 254 NMASDGNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLFN-ADGKLT 297
Query: 244 VEGSDWAVLLLVASS----SFDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYT 295
V+G+D V + A + +FD F +P + ++ T E M+ S R Y+ L++
Sbjct: 298 VKGADEVVFYITADTDYKPNFDPDFKDPKTYVGVNPEETTKEWMNNAVSQR---YTALFS 354
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLF 354
+H +DY LF RV + L+ + K +P+ +R+K+++ + D L EL F
Sbjct: 355 QHYNDYAALFDRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYYLEELYF 403
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW NL+EC P
Sbjct: 404 QFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWSVCSTNLNECMLP 463
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTH 473
L DF+ L G KTA+ + A GW +I+ ++ + + W PM G WL TH
Sbjct: 464 LVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATH 523
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
+WE+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 524 IWEYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH--------- 574
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ +T A++RE+ I A++VL +K E E VL + L P KI G +M
Sbjct: 575 GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRYGQLM 631
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK
Sbjct: 632 EWSVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKL 691
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WARL D HAY + L + G NL+ H PFQID NFG TA + E
Sbjct: 692 NQWARLQDGNHAYTLFGNL-----------LKNGTLDNLWDTHSPFQIDGNFGGTAGITE 740
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN-- 769
ML+QS + + LLPALP D W G V G+ A+G V + W++ L E ++SN N
Sbjct: 741 MLLQSHMGFIQLLPALP-DAWKEGSVSGICAKGNFEVDMVWENNQLKEAVVHSNAGGNCV 799
Query: 770 -----DHDSFKTLHYRGTSVKVNLSAGKI 793
SFKT+ R V+ +++ G I
Sbjct: 800 IKYADKTLSFKTVKGRSYRVEYDVTKGLI 828
>gi|299145135|ref|ZP_07038203.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
gi|298515626|gb|EFI39507.1| alpha-L-fucosidase 2 [Bacteroides sp. 3_1_23]
Length = 815
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 258/768 (33%), Positives = 394/768 (51%), Gaps = 82/768 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSAGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + +YRR
Sbjct: 123 FLDGDSQKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--SYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFDKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + ++ Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMNNVLKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF+RV ++++ E +P+ +R+ +++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEINQ-----------EIGSPNLPTYKRLANYKKGTPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NL EC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLPECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQD 596
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW+ D
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEWSTD 622
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
DP+ HRH++HLFGL PGHTI+ P+L +AA+ L+ RG+ GWS+ WK WAR
Sbjct: 623 IDDPKDEHRHVNHLFGLHPGHTISPVTTPELAQAAKVVLEHRGDGATGWSMGWKLNQWAR 682
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY++ L + G NL+ H PFQID NFG TA + EML+QS
Sbjct: 683 LQDGNHAYKLYGNL-----------LKNGTLDNLWDTHTPFQIDGNFGGTAGITEMLLQS 731
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + LLPALP D W++G + G+ A+G VS+ WK+G L + I+S
Sbjct: 732 HMGFIQLLPALP-DAWANGSISGICAKGNFEVSVSWKEGQLEKAIIHS 778
>gi|168211677|ref|ZP_02637302.1| fibronectin type III domain protein [Clostridium perfringens B str.
ATCC 3626]
gi|170710364|gb|EDT22546.1| fibronectin type III domain protein [Clostridium perfringens B str.
ATCC 3626]
Length = 1479
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 262/761 (34%), Positives = 401/761 (52%), Gaps = 84/761 (11%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSGQYAEATAASVKLF----GHPAD--VYQLLGDIELEFDDSHLKY 119
A +A+ ++R ++ AE S L+ G D YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKIL-----AEGGTPSNDLYQRVCGDQRDYGAYQNFGDIFLDFK-SHEES 161
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 162 KVTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLNVDVRNEG 221
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+ + NN +I+ G + G+++ + +IK+ + G+I ED
Sbjct: 222 AHNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKED 266
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ + VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++
Sbjct: 267 R-ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIE 323
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DY+ LF RV++ L D TD E + ++T++ SL L FQ+GRY
Sbjct: 324 DYKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRY 370
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 371 LLISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYI 430
Query: 420 TYLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
L G KTA+++ +GW ++ + + +A + W P AW+
Sbjct: 431 ESLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQ 489
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIA 528
+LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 490 NLWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH---- 545
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 -----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHG 599
Query: 589 SIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
+ EW D DP +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 600 QVQEWKDDIDDPNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKA 659
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
K LWARL D + A+R++ E NLF HPPFQID N G +
Sbjct: 660 NKINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSG 708
Query: 709 VAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
+AEMLVQS L + LPALP W G GLKARG +S
Sbjct: 709 MAEMLVQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEIS 748
>gi|423214546|ref|ZP_17201074.1| hypothetical protein HMPREF1074_02606 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692961|gb|EIY86197.1| hypothetical protein HMPREF1074_02606 [Bacteroides xylanisolvens
CL03T12C04]
Length = 815
Score = 416 bits (1068), Expect = e-113, Method: Compositional matrix adjust.
Identities = 261/768 (33%), Positives = 391/768 (50%), Gaps = 82/768 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQD 596
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW+ D
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEWSTD 622
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
DP+ HRH++HLFGL PGHTI+ P+L +AA L+ RG+ GWS+ WK WAR
Sbjct: 623 IDDPKDEHRHVNHLFGLHPGHTISPVTTPELAQAARVVLEHRGDGATGWSMGWKLNQWAR 682
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY++ L + G NL+ H PFQID NFG TA + EML+QS
Sbjct: 683 LQDGNHAYKLYGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGITEMLLQS 731
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + LLPALP D W++G + G+ A+G VSI WK+G L + I+S
Sbjct: 732 HMGFIQLLPALP-DAWANGSISGICAKGNFEVSISWKEGQLEKAIIHS 778
>gi|237722074|ref|ZP_04552555.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
gi|229448943|gb|EEO54734.1| glycoside hydrolase family 95 protein [Bacteroides sp. 2_2_4]
Length = 815
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 258/768 (33%), Positives = 393/768 (51%), Gaps = 82/768 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFT--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + ++ Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMNNVLKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF+RV ++++ E +P+ +R+ +++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEINQ-----------EIGSPNLPTYKRLANYKKGTPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NL EC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLPECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTAGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQD 596
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW+ D
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEWSTD 622
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
DP+ HRH++HLFGL PGHTI+ P+L +AA+ L+ RG+ GWS+ WK WAR
Sbjct: 623 IDDPKDEHRHVNHLFGLHPGHTISPVTTPELAQAAKVVLEHRGDGATGWSMGWKLNQWAR 682
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY++ L + G NL+ H PFQID NFG TA + EML+QS
Sbjct: 683 LQDGNHAYKLYGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGITEMLLQS 731
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + LLPALP D W++G + G+ A+G VS+ WK+G L + I+S
Sbjct: 732 HMGFIQLLPALP-DAWANGSISGICAKGNFEVSVSWKEGQLEKAIIHS 778
>gi|422346543|ref|ZP_16427457.1| hypothetical protein HMPREF9476_01530 [Clostridium perfringens
WAL-14572]
gi|373226088|gb|EHP48415.1| hypothetical protein HMPREF9476_01530 [Clostridium perfringens
WAL-14572]
Length = 1479
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 258/760 (33%), Positives = 399/760 (52%), Gaps = 82/760 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 ITNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVLVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEIHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELEDKRERLLKP-QIGKHGQ 600
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D DP +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 601 VQEWKDDIDDPNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKAN 660
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K LWARL D + A+R++ E NLF HPPFQID N G + +
Sbjct: 661 KINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSGM 709
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
AEMLVQS L + LPALP W G GLKARG +S
Sbjct: 710 AEMLVQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEIS 748
>gi|302917285|ref|XP_003052415.1| hypothetical protein NECHADRAFT_105964 [Nectria haematococca mpVI
77-13-4]
gi|256733355|gb|EEU46702.1| hypothetical protein NECHADRAFT_105964 [Nectria haematococca mpVI
77-13-4]
Length = 765
Score = 415 bits (1066), Expect = e-113, Method: Compositional matrix adjust.
Identities = 258/758 (34%), Positives = 390/758 (51%), Gaps = 64/758 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PA +++A+P+GNGRLGAM++G +E L+LNED++W G P D T DA + L
Sbjct: 8 LALHYTSPASSWSEALPVGNGRLGAMIYGRTTTELLQLNEDSVWYGGPQDRTPRDAKRNL 67
Query: 73 SDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+ +R L+ + ++ EA T F P + Y+ LG+ +EF+ H +RR LD
Sbjct: 68 AKLRELIRAERHQEAETLVREAFFATPTSMRHYEPLGNCTIEFN--HGVEDVTDFRRRLD 125
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L+T+ +Y+ V + R+ +S PD V+ + SE ++ S ++ +
Sbjct: 126 LSTSQNTTEYTCRGVSYRRDVIASFPDNVLAIRFEASEKTRFVVRLTRRSDVEWETNEFL 185
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ +GR P N+N Q + +L + + G + A+ + + +
Sbjct: 186 DSIRAEDGRIILHATPGGRNSN------QLALVLGVSCDANDGEVEAIGN--CLIVNTTR 237
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
V+ + A +++ DP + ++ + +S+L H DY LF R+S
Sbjct: 238 CVIAIGAQTTY---------RVADPEASALHDVDEALKRPWSELAEHHRQDYTNLFGRMS 288
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+++ N +P+ ER+K+ + DP LV L +GRYLLISSSR
Sbjct: 289 LRMG-------------PNAGHIPTDERIKN---NRDPGLVALYHNYGRYLLISSSRNSH 332
Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
+ A LQGIWN +P W S +NINL+MNYW + CNL EC P+ D L ++ G
Sbjct: 333 KALPATLQGIWNPFFAPPWGSKYTININLQMNYWPAAQCNLLECALPVMDLLEKMAERGR 392
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTA+ Y GW HH TDIW + + +LWP+GG W+C ++ Y D L
Sbjct: 393 KTAETMYGCRGWCAHHNTDIWGDTDPQDTWMPASLWPLGGVWVCIDVFNMLKYEYD-SAL 451
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
R P+LEGC FLLD+LI G YL TNPS SPE+ F++ GK + S +DM I
Sbjct: 452 HSRVAPVLEGCIEFLLDFLIPSACGKYLVTNPSLSPENTFLSESGKPGILCEGSVIDMTI 511
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA-QDFKDPEVHHR 605
+R F + + + ++L ++ L +V ++L +L P I DG I EW +D+++ E HR
Sbjct: 512 VRIAFESFLLSVDILNQDH-PLRSQVQEALEKLPPLTINNDGLIQEWGLKDYQEHEPGHR 570
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEH 662
H+SHLFGL+PG I +P+L AA+K L++R G GWS W L ARL D E
Sbjct: 571 HVSHLFGLYPGEYIDPIMSPELATAAKKVLERRAANGGGHTGWSRAWLLNLHARLFDAEG 630
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN--- 719
+ + + L G +NL HPPFQID NFG A + E LVQS +
Sbjct: 631 SRQHMDLLLG-----------GSTLANLLDNHPPFQIDGNFGGCAGILECLVQSRIRSEG 679
Query: 720 --DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
++ L PA P WSSG V + + G VS+ WK+G
Sbjct: 680 VVEIRLFPAWP-AAWSSGKVTKARVKAGWRVSMDWKEG 716
>gi|429848646|gb|ELA24104.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 791
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 289/844 (34%), Positives = 417/844 (49%), Gaps = 124/844 (14%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+N PA + DA PIGNGRLGAMV G E L +NED++W G P + NP A AL VR
Sbjct: 8 YNKPANLWDDATPIGNGRLGAMVRGTTDVERLWINEDSVWYGGPQNRLNPAARDALPKVR 67
Query: 77 SLVDSGQYAEA--------TAASVKL------------FGH----PADVYQLLGDIELEF 112
L+D + EA TA L FGH P D ++ G + E
Sbjct: 68 ELIDQNRIREAEQLIKKTQTARPRSLRHYEPLGDVFLTFGHGQDPPGDEVRVSGIVNFEN 127
Query: 113 DDSH-LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL 171
S L + + YRRELDL T + V Y G + R+ FSS D+VI IS G
Sbjct: 128 SFSRDLNRSPQNYRRELDLRTGISSVSYDFGGAHYERQVFSSTVDEVIA--ISVRSEGEY 185
Query: 172 SFNV------------SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
SF + L+ D+ ++G + I G ++F
Sbjct: 186 SFQIDLNRGDHPEWDRRLNQRYDSLEEIDGGHMITGSMGLKG--------------AVEF 231
Query: 220 SAILEIKISDDRGTISALEDK---KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
+ + +++ D G D + V D ++L+ ++F P + + T+
Sbjct: 232 A--MGVRVIADPGDGEVQVDNTGYNVVVNAKDRVIVLVSGETTFRNPNAGEAVQNRLATA 289
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
SM S++DL + H++ + L+ RV +QL S VP +
Sbjct: 290 -SMK--------SWNDLKSAHVERFSALYDRVELQLPGSGDKT-----------AVPIDQ 329
Query: 337 RVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
R+++ Q D L +LLF FGRYLLIS S G ANLQGIWN D P W S +NIN
Sbjct: 330 RIQAVKQGAVDNGLAQLLFHFGRYLLISCSLSGLP-ANLQGIWNRDHMPVWGSKYTININ 388
Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADR 455
++MNYW + NL+E + LF FL + G++TA+ Y GWV+HH TDIWA ++
Sbjct: 389 IQMNYWPAEVANLAETHDVLFRFLERTAERGAETAKAMYGCRGWVMHHNTDIWADTAPQD 448
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
V W + GAW HLWEHY + D+DFL +R YPL+ G A F D+L+E DG L
Sbjct: 449 DGVQCTYWTLSGAWFMIHLWEHYRFGRDKDFL-RRVYPLMAGSALFFQDFLVE-RDGKLI 506
Query: 516 TNPSTSPEHE-FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLK 574
T+PS+S E+ +I +A ++ D I+ E+F A++ A ++L ++ EKVL
Sbjct: 507 TSPSSSAENSYYILGTKTVASIAAGPAWDGQILTELFRAVVEAGKLLGEDTSEF-EKVLA 565
Query: 575 SLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKT 634
LP ++ + G +MEW D ++ E HRH+SHL+GLFPG+T+ P+L AA+ T
Sbjct: 566 KLP---TPQMGKHGQVMEWKDDVEEAEPGHRHISHLWGLFPGNTLN---TPELHDAAKVT 619
Query: 635 LQKRGEEGPG---WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLF 691
LQ+R G G WS+ W +ARL D E + ++++ + L +++
Sbjct: 620 LQRRLAGGGGHTSWSLAWILCQYARLRDIEGTHAGIQKMIGDL-----------LLNSML 668
Query: 692 AAHPPFQIDANFGFTAAVAEMLVQSTLND--------LYLLPAL--PWDKWSSGCVKGLK 741
+HPPFQID NFGF AAVAEML+QS ++D + L+P L W++ G V+GL+
Sbjct: 669 TSHPPFQIDGNFGFAAAVAEMLLQSQVDDGTGSGNTIIDLIPTLLPAWEQ--RGGVRGLR 726
Query: 742 ARGG-ETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR-------GTSVKVNLSAGKI 793
ARG E I W+DG L E S + F+ R ++ V+L GK
Sbjct: 727 ARGAVEIQKIRWEDGKLVEAVAVSKATEPQTRVFRVAQNRLKQGSKSDGTISVDLVPGKA 786
Query: 794 YTFN 797
T +
Sbjct: 787 VTLS 790
>gi|256818918|ref|YP_003140197.1| glycoside hydrolase [Capnocytophaga ochracea DSM 7271]
gi|256580501|gb|ACU91636.1| glycoside hydrolase family protein [Capnocytophaga ochracea DSM
7271]
Length = 835
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 275/778 (35%), Positives = 416/778 (53%), Gaps = 66/778 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G D +P+A L
Sbjct: 52 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQDADDPNAHNYL 111
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 112 KEIQKLLLEGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 168
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + N + F+ + VI KI + L+ ++SL
Sbjct: 169 PIQDYKRVLRLDEATAVTSFKRDNNSIEQTAFADFKNDVIWVKIKATSP--LNLDISLFR 226
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P ND +G+ F+++++++ G I +
Sbjct: 227 K-ENATITYQNNKISLNGVLP----------NDGKEGMHFASVVDVQTD---GKIESTH- 271
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 272 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 325
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
+Q+LF+R + N + + + ER++ F E +L+ +L+
Sbjct: 326 SSIVFQRLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYN 374
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 375 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 434
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 435 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGES-ATWGSTLTGGAWLCEHIW 493
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T D +FL + YP+L+ +F LI+ GY T PS SPE+ ++ P DG
Sbjct: 494 QHYLFTKDINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 552
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I ++G
Sbjct: 553 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKEGD 611
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D++D E HRH+SHL+GL+P IT PDL KAA+KTL+ RG+ G GWS W
Sbjct: 612 LNEWLDDWEDAEPQHRHVSHLYGLYPYDEITPWDTPDLAKAAKKTLEIRGDAGTGWSRAW 671
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K WARL D HA ++++L + V+P GG Y NLF AHPPFQID NFG TA +
Sbjct: 672 KINFWARLQDGNHALLLLRQLLHPVNPNITDGQGGGTYPNLFCAHPPFQIDGNFGGTAGI 731
Query: 710 AEMLVQS--TLNDLYLLPALP-WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
AEML+QS N + LPALP W +G +KG++AR G V+ W+ L + I S
Sbjct: 732 AEMLLQSHGKGNVIRFLPALPSHPNWENGVMKGMRARNGFEVNFEWQQFKLGKAEITS 789
>gi|213693185|ref|YP_002323771.1| hypothetical protein Blon_2335 [Bifidobacterium longum subsp.
infantis ATCC 15697 = JCM 1222]
gi|384200416|ref|YP_005586159.1| glycosyl hydrolase [Bifidobacterium longum subsp. infantis ATCC
15697 = JCM 1222]
gi|213524646|gb|ACJ53393.1| conserved hypothetical protein [Bifidobacterium longum subsp.
infantis ATCC 15697 = JCM 1222]
gi|320459368|dbj|BAJ69989.1| glycosyl hydrolase [Bifidobacterium longum subsp. infantis ATCC
15697 = JCM 1222]
Length = 782
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 267/771 (34%), Positives = 398/771 (51%), Gaps = 59/771 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + H+ + IP GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSHWEEGIPFGNGRMGAVLCSEPDADVLYLNDDTLWSGYPHAETSPLTPEIV 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIEL--EFDDSHLKYAEET-----YR 125
+ R G Y AT D Q D ++ F + ++Y+ E +
Sbjct: 61 AKARQASSRGDYVSATRII-------QDATQREKDEQIYEPFGTACIRYSSEAGERKHVK 113
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL A A + +G + + + S PD ++V ++S S S +V+ + L
Sbjct: 114 RSLDLARALAGESFRLGAADVHVDAWCSAPDDLLVYEMSSSAPVDASVSVT-GTFLKQTR 172
Query: 186 YVNGNNQ------IIMEGRCPGKRIPPKANANDDP-----KGIQFSAILEIKISDDRGTI 234
+G++ +++ G+ PG + A+ D+P GI + ++ G I
Sbjct: 173 ISSGSDSDARQATLVVMGQMPGLNVGSLAHVTDNPWEDERDGIGMAYAGAFSLTVTGGEI 232
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSK---KDPTSESMSALQSIRNLSYS 291
+ ++D L+ G L + S F G P D E+++A S
Sbjct: 233 TVIDDV-LQCSGVTGLSLRFRSLSGFKGSAEQPERDMTVLADRLGETIAAWPS----DSR 287
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP---- 347
+ RH+ DY++ F RV ++L + D EE VP AE ++S ++ P
Sbjct: 288 AMLDRHVADYRRFFDRVGVRLGPAHDD------DEE----VPFAEILRS--KEDTPHRLE 335
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
+L E +F FGRYLLISSSRP TQ +NLQGIWN P W SA NIN+EMNYW + PC
Sbjct: 336 TLSEAMFDFGRYLLISSSRPHTQPSNLQGIWNHKDFPNWYSAYTTNINIEMNYWMTGPCA 395
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
L E EPL L G A G + H DIW ++ G+ WA WP G
Sbjct: 396 LKELIEPLVAMNRELLEPGHDAAGAILGCGGSAVFHNVDIWRRALPANGEPTWAFWPFGQ 455
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
AW+C +L++ Y + D +L +P++ A F +D+L + G L P+TSPE+ F+
Sbjct: 456 AWMCRNLFDEYLFNQDESYLAS-IWPIMRDSARFCMDFLSDTEHG-LAPAPATSPENYFV 513
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEV---LEKNEDALVEKVLKSLPRLRPTKI 584
DG+ V+++S AI+R + +I AA+ L+ + ALV + + +L ++
Sbjct: 514 V-DGETIAVAHTSENTTAIVRNLLDDLIHAAQTMPDLDDGDKALVREAESTRAKLAAVRV 572
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
DG I+EW + + + HHRHLSHL+ L PG IT P L +AA K+L+ RG++G G
Sbjct: 573 GSDGRILEWNDELVEADPHHRHLSHLYELHPGAGIT-ANTPRLEEAARKSLEVRGDDGSG 631
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH-FEGGLYSNLFAAHPPFQIDANF 703
WSI W+ +WARL D EHA R++ V+ + E GG+Y++ AHPPFQID N
Sbjct: 632 WSIVWRMIMWARLRDAEHAERIIGMFLRPVEADAETDLLGGGVYASGMCAHPPFQIDGNL 691
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
GF AA+AEMLVQS + +LPALP D W G GL+ARGG +V W D
Sbjct: 692 GFPAALAEMLVQSHDGMVRILPALPED-WHEGSFHGLRARGGLSVDASWTD 741
>gi|110798918|ref|YP_696557.1| fibronectin type III [Clostridium perfringens ATCC 13124]
gi|110673565|gb|ABG82552.1| fibronectin type III domain protein [Clostridium perfringens ATCC
13124]
Length = 1479
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 258/760 (33%), Positives = 399/760 (52%), Gaps = 82/760 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLNVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYIE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D DP +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 601 VQEWKDDIDDPNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKAN 660
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K LWARL D + A+R++ E NLF HPPFQID N G + +
Sbjct: 661 KINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSGM 709
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
AEMLVQS L + LPALP W G GLKARG +S
Sbjct: 710 AEMLVQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEIS 748
>gi|262405728|ref|ZP_06082278.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|294644470|ref|ZP_06722231.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294806820|ref|ZP_06765646.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|345510919|ref|ZP_08790478.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|229442942|gb|EEO48733.1| glycoside hydrolase family 95 [Bacteroides sp. D1]
gi|262356603|gb|EEZ05693.1| glycoside hydrolase family 95 [Bacteroides sp. 2_1_22]
gi|292640192|gb|EFF58449.1| conserved hypothetical protein [Bacteroides ovatus SD CC 2a]
gi|294445990|gb|EFG14631.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 815
Score = 414 bits (1065), Expect = e-113, Method: Compositional matrix adjust.
Identities = 261/768 (33%), Positives = 391/768 (50%), Gaps = 82/768 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTVGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQD 596
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW+ D
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEWSTD 622
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
DP+ HRH++HLFGL PGHTI+ P+L +AA L+ RG+ GWS+ WK WAR
Sbjct: 623 IDDPKDEHRHVNHLFGLHPGHTISPVTTPELAQAARVVLEHRGDGATGWSMGWKLNQWAR 682
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY++ L + G NL+ H PFQID NFG TA + EML+QS
Sbjct: 683 LQDGNHAYKLYGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGITEMLLQS 731
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + LLPALP D W++G + G+ A+G VSI WK+G L + I+S
Sbjct: 732 HMGFIQLLPALP-DAWANGSISGICAKGNFEVSISWKEGQLEKAIIHS 778
>gi|18310857|ref|NP_562791.1| hypothetical protein CPE1875 [Clostridium perfringens str. 13]
gi|18145539|dbj|BAB81581.1| conserved hypothetical protein [Clostridium perfringens str. 13]
Length = 1479
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 256/760 (33%), Positives = 399/760 (52%), Gaps = 82/760 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ ++ G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINNGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV + L D P+ E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVDLNLGELKLD-------------KPTDEMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D DP +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 601 VQEWKDDIDDPNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKAN 660
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K LWARL D + A+R++ E NLF HPPFQID N G + +
Sbjct: 661 KINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSGM 709
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
AEML+QS L + LPALP W G GLKARG +S
Sbjct: 710 AEMLIQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEIS 748
>gi|390958734|ref|YP_006422491.1| hypothetical protein Terro_2921 [Terriglobus roseus DSM 18391]
gi|390413652|gb|AFL89156.1| hypothetical protein Terro_2921 [Terriglobus roseus DSM 18391]
Length = 837
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 265/798 (33%), Positives = 389/798 (48%), Gaps = 95/798 (11%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPS-------------------------- 45
P ++ + PA +T+A+PIGNGR+GAMV+GG +
Sbjct: 37 PARLWYRAPAPVWTEALPIGNGRIGAMVFGGANTGPNNGDLEDAAKNADILSGDKTRGQD 96
Query: 46 ETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV------DSGQYAEATA-ASVKLFGHP 98
E L+LNE T+W G D NP A + VR+L+ D + AEA A + +P
Sbjct: 97 EHLQLNESTVWAGSRADRLNPRAAEGFRRVRALLLESKGTDGKKIAEAEKLAQETMIANP 156
Query: 99 ADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPD 156
+ Y +GD+ L S A Y R+LDL T R+ Y G V FTRE F+S PD
Sbjct: 157 KAMPPYSTVGDLYLRSSSSE---AIADYHRQLDLKTGVVRITYRQGPVHFTREIFASAPD 213
Query: 157 QVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG 216
VIV ++ ++S S+D D +G +++ K
Sbjct: 214 HVIVMHLTADRPNAISLTASMDRPGDFAIRASGQRDLVLTQSATTK------------NA 261
Query: 217 IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
F A + + + G + A D+ + + + VL+ AS GP + DP +
Sbjct: 262 THFQA--QARFATHGGAVHADGDRIVVEKAQELTVLIAAASDFKGGPILG-----GDPAT 314
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
L S + +++ L D + R+S+ L P D + +P+ E
Sbjct: 315 LCGDILASAQKKNFAALSAAATKDQFRYIDRMSLSLG--PVDAA--------LAAMPTDE 364
Query: 337 RVKSFQTDEDP-SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNIN 395
R+K +D L L FQ+ RYLL+ SSRPG ANLQG+W LS W S +N+N
Sbjct: 365 RLKRVAAGQDDFGLQALYFQYARYLLLGSSRPGGLAANLQGLWASGLSNPWGSKWTINVN 424
Query: 396 LEMNYWQSLPCNLSECQEPLFDFLTYL----SINGSKTAQVNYLASGWVIHHKTDIWAKS 451
EMNYW + NLSE +PLFD + + S G K A+ Y A G+VIHH TDIW +
Sbjct: 425 TEMNYWLAEAANLSEMHQPLFDLVGMVRDPASGTGVKVAKEYYGAKGFVIHHNTDIWGDA 484
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
G + +WP GGAWL H W+HY +T ++ FL +A+PLL + F LD+L +
Sbjct: 485 EPIDG-YQYGIWPDGGAWLTLHAWDHYAFTGNKQFLRSQAWPLLHDASLFFLDYLTDDGS 543
Query: 512 GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEK 571
G+L T PS SPE+++ DG ++ TMD+ I+RE+F + A +L ++ A +++
Sbjct: 544 GHLVTGPSLSPENKYKLADGTSHSLTMGPTMDIEIVRELFQRTMQAGTILGEDA-AFLQQ 602
Query: 572 VLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAA 631
V ++ RL P + G + EW QD+++ HRH+SHL+ LFPG I + PDL +AA
Sbjct: 603 VRQASDRLPPFHVGSLGQLQEWQQDYQEDAPGHRHISHLWALFPGTQIDLRHTPDLARAA 662
Query: 632 EKTLQKRGEEG---PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYS 688
+ +L++R G GWS W W LH+ + AY ++ LF +
Sbjct: 663 QVSLERRLANGGGQTGWSRAWVVNYWDHLHNGQQAYDSLQVLFRQ-----------STFP 711
Query: 689 NLFAAHPP--FQIDANFGFTAAVAEMLVQSTL----NDLYLLPALPWDKWSSGCVKGLKA 742
NL HPP FQID N G + E LVQS ++ L+PALP W G + GL+
Sbjct: 712 NLMDTHPPGVFQIDGNLGGANGMLEALVQSRWYADHGEVDLMPALP-TAWQQGHITGLRV 770
Query: 743 RGGETVSICWKDGDLHEV 760
RG + +S+ W +G L V
Sbjct: 771 RGNQELSLRWSNGKLDAV 788
>gi|336403471|ref|ZP_08584186.1| hypothetical protein HMPREF0127_01499 [Bacteroides sp. 1_1_30]
gi|335945801|gb|EGN07608.1| hypothetical protein HMPREF0127_01499 [Bacteroides sp. 1_1_30]
Length = 815
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 261/768 (33%), Positives = 391/768 (50%), Gaps = 82/768 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSL 78
++PIGNG LGA + G + +E + LNE TLW G P +Y N + L ++R
Sbjct: 63 SLPIGNGSLGANIMGSISAERITLNEKTLWKGGPNTAKGAEYYWNVNKQSSGVLKEIRQA 122
Query: 79 VDSGQYAEATAASVKLFGHPA------------DVYQLLGDIELEFDDSHLKYAEETYRR 126
G +A + + F + +G++ +E S + + YRR
Sbjct: 123 FLDGDSKKAGYLTQENFNGLGAYEEKDETPFRFGAFTTMGELYVETGLSEINMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A A V++ + + R++F S PD V+V K + + G + +S ++ +H
Sbjct: 181 ILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMVMKFTADKGGKQNLVLSYCPNNEAKSH 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+GN+ ++ G + G++F+ IK GT+ A E+ ++ V
Sbjct: 241 LEADGNDGLVYTGVL-------------NNNGMKFA--FRIKAIHKGGTLKA-ENDRIIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRHLD 299
+ +D V LL A + + F K DP+ +++ + + Y +LY H
Sbjct: 285 KDADEVVFLLTADTDYKMNFAPDFKDPKAYVGNDPSQTTLAMMDNALKKGYDELYRNHEA 344
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGR 358
DY LF+RV +++ E +P+ +R+ S++ D L +L +QFGR
Sbjct: 345 DYTALFNRVRFEIN-----------PEIGTPNLPTYKRLASYKKGVPDYQLEQLYYQFGR 393
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ + W H NIN++MNYW + P NLSEC PL DF
Sbjct: 394 YLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHNNINIQMNYWPACPTNLSECTWPLIDF 453
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ L G KTAQ + A GW +I+ ++ K + W L P G WL TH+WE+
Sbjct: 454 IRSLVKPGEKTAQAYFNARGWTASISANIFGFTAPLSSKSMAWNLNPTVGPWLATHIWEY 513
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D FL++ Y L++ A F +D L DG PSTSPEH V
Sbjct: 514 YDYTRDTRFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPVD 564
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSIMEWAQD 596
T A++RE+ I A++VL DA K ++ L +L P +I G ++EW+ D
Sbjct: 565 EGVTFAHAVVREILLDAIQASKVL--GTDAKERKQWENVLTKLVPYRIGRYGQLLEWSTD 622
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
DP+ HRH++HLFGL PGHTI+ P+L +AA L+ RG+ GWS+ WK WAR
Sbjct: 623 IDDPKDEHRHVNHLFGLHPGHTISPVTTPELAQAARVVLEHRGDGATGWSMGWKLNQWAR 682
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D HAY++ L + G NL+ H PFQID NFG TA + EML+QS
Sbjct: 683 LQDGNHAYKLYGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGITEMLLQS 731
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + LLPALP D W++G + G+ A+G VSI WK+G L + I+S
Sbjct: 732 HMGFIQLLPALP-DAWANGSISGICAKGNFEVSISWKEGQLEKAIIHS 778
>gi|237709067|ref|ZP_04539548.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|229456763|gb|EEO62484.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
Length = 818
Score = 414 bits (1064), Expect = e-113, Method: Compositional matrix adjust.
Identities = 274/810 (33%), Positives = 399/810 (49%), Gaps = 94/810 (11%)
Query: 2 MNAESTSTTNPLKITFNGP------AKHFT---------DAIPIGNGRLGAMVWGGVPSE 46
+ A+ST T L I F+ P A ++ +++PIGNG LG V G + +E
Sbjct: 17 LQAQSTDYTKGLSIWFDTPNNLDGRASWYSPVTDKAWENNSLPIGNGSLGGNVMGSIAAE 76
Query: 47 TLKLNEDTLWTGVPGD--------YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHP 98
+ LNE TLW G P N ++ L ++R G +A + K F
Sbjct: 77 RITLNEKTLWRGGPNTEKGAAYYWNVNKESAHLLPEIRQAFTDGNQKKAEELTCKNFNGL 136
Query: 99 ADV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
AD + LG+ +E S + Y+R L L++A A V + V +
Sbjct: 137 ADYEPSRETPFRFGSFTTLGEAYIETGLSEI--GMTNYKRILSLDSAMAVVSFRKDEVNY 194
Query: 147 TREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
R++F S PD V+V K + G +L F+ + +G N ++ G
Sbjct: 195 ERKYFVSYPDSVMVLKFTADRPGMQNLIFSYGSNPEAIGDIKADGPNCLLYTGCL----- 249
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS----SF 260
K Q L I+ + G+++ D K V +D + LL A + +F
Sbjct: 250 ----------KNNQMKFALRIQAINKGGSLNT-TDGKFIVRNADEVIFLLTADTDYKLNF 298
Query: 261 DGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS-RSPKD 318
+ F +P DP +++ + + SY++L RH DY +LF RV +QL+ R+P
Sbjct: 299 NPDFKDPKTYVGPDPEQTTLAMMDAAAAKSYNELCERHKTDYTQLFGRVQLQLNPRAPM- 357
Query: 319 IVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
T + +P+ +R+ ++ + D L E+ +QFGRYLLI+SSRPG ANLQG+
Sbjct: 358 ----TLQYPAVTDLPTYQRLARYRKGNPDYRLEEIYYQFGRYLLIASSRPGNLPANLQGM 413
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS 437
W + W H NIN++MNYW + NL+EC PL DF+ L G KTAQ + A
Sbjct: 414 WANGVDGPWHVDYHNNINIQMNYWPACSTNLNECVWPLIDFIRTLVKPGEKTAQAYFGAR 473
Query: 438 GWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
GW +I+ +S + W PM G WL TH+WE+Y+YT D+ FL++ Y L++
Sbjct: 474 GWTASISGNIFGFTSPLTDENMSWNFNPMAGPWLATHIWEYYDYTRDKKFLKEVGYDLIK 533
Query: 497 GCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
A+F +D+L DG PSTSPEH V +T A++RE+ I
Sbjct: 534 SSANFAVDYLWYKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLNAID 584
Query: 557 AAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLF 614
A++ L + + + VL L P +I G +MEW+ D DP+ HRH++HLFGL
Sbjct: 585 ASKALGVDSKDRKQWQYVLN---HLVPYQIGRYGQLMEWSTDIDDPKDEHRHVNHLFGLH 641
Query: 615 PGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLV 674
PGHT++ P+L AA+ L+ RG+ GWS+ WK WARL D HAY++ L
Sbjct: 642 PGHTLSPIMTPELTHAAKVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLFGNL---- 697
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSS 734
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W
Sbjct: 698 -------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKE 749
Query: 735 GCVKGLKARGGETVSICWKDGDLHEVGIYS 764
G VKGL A+G + I W+DG L E I S
Sbjct: 750 GSVKGLCAKGNFEIDITWQDGKLKEAVILS 779
>gi|29348582|ref|NP_812085.1| hypothetical protein BT_3173 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340487|gb|AAO78279.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 815
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 267/770 (34%), Positives = 395/770 (51%), Gaps = 86/770 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR-S 77
++PIGNG LGA + G V +E + LNE TLW G P +Y N + L ++R +
Sbjct: 63 SLPIGNGSLGANILGSVAAERITLNEKTLWKGGPNTSKGAEYYWDVNKQSAGVLKEIRQA 122
Query: 78 LVDSGQYAEATAASVKLFGHPA-----------DVYQLLGDIELEFDDSHLKYAEETYRR 126
+D + A G A + +G++ +E + L+ + YRR
Sbjct: 123 FLDEDKEKAAQLTRNNFNGLAAYEEKDETPFRFGSFTTMGELYVETGLNELRMS--NYRR 180
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
L L++A V++ V++ R++F S PD V+V K + ++SG + +S +S ++
Sbjct: 181 ILSLDSAMVVVQFDKDGVQYQRKYFISYPDSVMVMKFTANQSGKQNLILSYCPNSEAKSN 240
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
+G + ++ G D G++F+ IK GT+ A E+ +L V
Sbjct: 241 LRADGKDGLVYTGVL-------------DNNGMKFA--FRIKAIHKGGTLEA-ENDRLIV 284
Query: 245 EGSDWAVLLLVASSSFDGPFINP--SDSK----KDPTSESMSALQSIRNLSYSDLYTRHL 298
+G+D V LL A + + F NP D K DP + + Y +LY H
Sbjct: 285 KGADEVVFLLTADTDYKMNF-NPDFKDPKTYVGNDPEQTTRIMMDQAVQKGYDELYRNHE 343
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
D+ LF+RV +QL+ DI + +P+ +R+ +++ D L +L +QFG
Sbjct: 344 ADHTALFNRVRLQLN---PDISSPN--------LPTYQRLANYKKGTPDYQLEQLYYQFG 392
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSRPG ANLQG+W+ +L W H NIN++MNYW + NLSEC PL D
Sbjct: 393 RYLLIASSRPGNMPANLQGMWHNNLDGPWRVDYHNNINIQMNYWPACSANLSECTWPLID 452
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-WALWPMGGAWLCTHLWE 476
F+ L G +TAQ + A GW +I+ ++ ++ W L P G WL TH+WE
Sbjct: 453 FIRSLVKPGEQTAQAYFNARGWTASISANIFGFTAPLSSNMMSWNLNPTAGPWLATHIWE 512
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
+Y+YT D+ FL++ Y L++ A F +D L DG PSTSPEH +
Sbjct: 513 YYDYTRDKKFLKEIGYDLIKSSAQFAVDHLWHKPDGTYTAAPSTSPEH---------GPI 563
Query: 537 SYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
T A++RE+ I A++ L + E EK+L +L P +I G +MEW+
Sbjct: 564 DEGVTFAHAVVREILLDAIQASKELGIDSKERKQWEKILD---KLVPYRIGRYGQLMEWS 620
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
D DPE HRH++HLFGL PGHTI+ P L +AA+ L+ RG+ GWS+ WK W
Sbjct: 621 TDIDDPEDEHRHVNHLFGLHPGHTISPITTPKLAEAAKVVLEHRGDGATGWSMGWKLNQW 680
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL D HAY++ L + G NL+ H PFQID NFG TA + EML+
Sbjct: 681 ARLQDGNHAYKLYGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGITEMLL 729
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
QS + + LLPALP D W +G + G+ A+G +SI WK+G L + I S
Sbjct: 730 QSHMGFIQLLPALP-DAWKNGSITGICAKGNFEISISWKEGQLDKATILS 778
>gi|29348564|ref|NP_812067.1| hypothetical protein BT_3155 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340469|gb|AAO78261.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 808
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 264/773 (34%), Positives = 400/773 (51%), Gaps = 68/773 (8%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-P 66
+T N +K+ ++ PA + ++P+GNGRLG M++GG+ +ETL LNE T+W+G ++ P
Sbjct: 24 ATENKMKLWYDKPADEWMKSLPLGNGRLGVMIYGGIETETLALNESTMWSGEYDEHQQRP 83
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEET 123
+ L+ VR L +E + + H + +GD+++ F S+ +
Sbjct: 84 FGREKLNQVRKLFFENNLSEGNHVAGNMMAGSPHSVGTHLPIGDLKINF--SYPQGEISD 141
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YR ELDL+TA V Y VGN E+ R+ +SNPD V+ I S +++ + L LL
Sbjct: 142 YRHELDLHTAINTVSYKVGNTEYIRQCIASNPDDVVAMHIKASRPKAITMELEL-KLLRQ 200
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ V NQ+I G ++ G+ F + ++I GTI A E KKL
Sbjct: 201 ANVVASGNQLIYTGNAEFEK--------HGKGGVHFEGRIAVQIKG--GTIKA-EGKKLY 249
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+E + LL S F N + S + + ++ + L +H++DY
Sbjct: 250 IEKATEVTLL----SDVRTNFKNNTFSGYNYKIKCEKTIELASKKDFKTLKKKHIEDYSP 305
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLI 362
LF RV + K D +P+ ER + E DP L L FQ+ RYLLI
Sbjct: 306 LFSRVGLSFEHHAK-----------FDHLPNDERWARVKKGESDPGLDALFFQYARYLLI 354
Query: 363 SSSRPGTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
+SSRP + + LQG +N++L+ W + H++IN E NYW + NL+EC PLFD++
Sbjct: 355 ASSRPNSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLAECHLPLFDYI 414
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LSI+G+KTA+ Y GW H + W ++ G ++W L+P +WL +HLW Y+
Sbjct: 415 KDLSIHGAKTAKDLYGCKGWTAHTTANPWGYTAVS-GSILWGLFPTASSWLASHLWTQYD 473
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
YT D+DFL+ AYPLL+ A FLLD++ I+ + YL T PS SPE+ F G+ C S
Sbjct: 474 YTQDKDFLKNTAYPLLKSNAEFLLDYMVIDPRNNYLVTGPSISPENSF-RHQGQEFCASM 532
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
T D + E+FSA + + E+L N DA + + ++ +L P +I+ +G + EW +D+
Sbjct: 533 MPTCDRVLAYEIFSACLQSTEIL--NVDASFADSLRTAISKLPPFRISTNGGVQEWFEDY 590
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKTAL 653
++ +HRH +HL L+P IT+ K P+L KAA KT+++R E WS
Sbjct: 591 EEAHPNHRHTTHLLSLYPYSQITLNKTPELAKAARKTIERRLAAKDWEDTEWSRANMICF 650
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP---------FQIDANFG 704
+ARL D E+AY VK+L + E N+F P F D N
Sbjct: 651 YARLKDSENAYNSVKQLLGKLSRE-----------NMFTVSPAGIAGAGEDIFAFDGNTA 699
Query: 705 FTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
A +AEML+QS N + LLP LP +W +G KGL ARGG + WK+ +
Sbjct: 700 GAAGIAEMLLQSHDNCIELLPCLP-KEWKNGNFKGLCARGGIEIDASWKNSQI 751
>gi|168214908|ref|ZP_02640533.1| fibronectin type III domain protein [Clostridium perfringens CPE
str. F4969]
gi|170713641|gb|EDT25823.1| fibronectin type III domain protein [Clostridium perfringens CPE
str. F4969]
Length = 1479
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 256/760 (33%), Positives = 398/760 (52%), Gaps = 82/760 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA + +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATSWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGEI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D P+ E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLD-------------KPTDEMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSRAGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P ++ + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGVDEEFRAELEDKRERLLKP-QVGKHGQ 600
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D DP +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 601 VQEWKDDIDDPNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKAN 660
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K LWARL D + A+R++ E NLF HPPFQID N G + +
Sbjct: 661 KINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSGM 709
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
AEMLVQS L + LPALP W G GLKARG +S
Sbjct: 710 AEMLVQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEIS 748
>gi|383123942|ref|ZP_09944612.1| hypothetical protein BSIG_4038 [Bacteroides sp. 1_1_6]
gi|251838825|gb|EES66910.1| hypothetical protein BSIG_4038 [Bacteroides sp. 1_1_6]
Length = 812
Score = 413 bits (1062), Expect = e-112, Method: Compositional matrix adjust.
Identities = 270/838 (32%), Positives = 417/838 (49%), Gaps = 104/838 (12%)
Query: 3 NAESTSTTNPLKITFNGP---------------AKHFTDAIPIGNGRLGAMVWGGVPSET 47
+AEST T L I F+ P A + ++PIGNG +GA + G V +E
Sbjct: 19 HAESTDYTKGLSIWFDSPNTLQGKEVWHSAQQDASWESQSLPIGNGSIGANILGSVEAER 78
Query: 48 LKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA 99
+ NE TLW G P DY N + L ++R G +A + + F
Sbjct: 79 ITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIRKAFVEGDQKKAEKLTRENFNSEV 138
Query: 100 DV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFT 147
+ +G+ +E S + ++ Y+R L L++A A V++ +V +
Sbjct: 139 PYEFSREKPFRFGNFTTMGEFYVETGLSTIGMSD--YKRILSLDSAMAVVQFKKDDVAYQ 196
Query: 148 REHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
R +F S P V+V + S + +L+F + + + +GNN ++
Sbjct: 197 RNYFISYPANVMVMRFSADQPSKQNLTFRYAPNPVSTGQFSTDGNNGLVY---------- 246
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
A+ D G++++ + I+ + + GT++ D ++ V+ +D + + A + + F
Sbjct: 247 ---TASLDNNGMKYA--VRIQATVNGGTLNN-ADGRITVKEADEVIFYVTADTDYKMNFA 300
Query: 266 -NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
+ +D K +P + ++ Y++L H DY LF+RV ++L+ + K
Sbjct: 301 PDFTDPKTYVGVNPLETTQQWMKDAVAKGYANLLNEHYKDYASLFNRVKLELNPTVK--- 357
Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
I +P+A+R+K+++ + D L +L +QFGRYLLI+SSRPG ANLQGIW+
Sbjct: 358 --------IANLPTAQRLKNYRKGQPDYYLEKLYYQFGRYLLIASSRPGNMPANLQGIWH 409
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++ W H NIN++MNYW + NL EC PL DF+ L G KTAQ + A GW
Sbjct: 410 NNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLIDFIRTLVKPGEKTAQSYFGARGW 469
Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
+I+ ++ + + W PM G WL TH+WE+Y+YT D FL++ Y L++
Sbjct: 470 TASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVWEYYDYTKDLKFLKETGYELIKSS 529
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
A+F +D+L DG PSTSPEH V +T A++RE+ I A+
Sbjct: 530 ANFTVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLDAIQAS 580
Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPG 616
+ L +K E E VL + L P KI G ++EW+ D DP+ HRH++HLFGL PG
Sbjct: 581 KELGIDKKERKQWEHVLAN---LVPYKIGRYGQLLEWSTDIDDPKDEHRHVNHLFGLHPG 637
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
HT++ P+L +AA+ L RG+ GWS+ WK WARL D HAY + L
Sbjct: 638 HTVSPITTPELAEAAKVVLVHRGDGATGWSMGWKLNQWARLQDGNHAYTLFGNL------ 691
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W G
Sbjct: 692 -----LKNGTVDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKDGS 745
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIY 794
+ G+ A+G + + WKDG L E + S N T+ Y G ++ + G+ Y
Sbjct: 746 IHGVCAKGNFEIDMIWKDGLLQEATLLSKAGEN-----CTVKYAGKTISFKTTKGRSY 798
>gi|429755750|ref|ZP_19288384.1| hypothetical protein HMPREF9072_01113 [Capnocytophaga sp. oral
taxon 324 str. F0483]
gi|429173108|gb|EKY14643.1| hypothetical protein HMPREF9072_01113 [Capnocytophaga sp. oral
taxon 324 str. F0483]
Length = 799
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 282/815 (34%), Positives = 427/815 (52%), Gaps = 73/815 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
D++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KDIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ A A + N + F+ + VI +I + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEAIAVTSFKRDNNSIEQTAFADFKNDVIWVRIKATSP--LNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P ND +G+ F++I++++ G I +
Sbjct: 191 K-ENATITYQNNKITLNGVLP----------NDGKEGMHFASIVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
+Q+LF+R + N + + + ER+ F E +L+ +L+
Sbjct: 290 SSIVFQRLFNRNRWYGK-----------ANANTEGLTTFERLGRFYKGEQDALLPILYYN 338
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T D +FL + YP+L+ +F LI+ GY T PS SPE+ ++ P DG
Sbjct: 458 QHYLFTKDINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I + G
Sbjct: 517 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D++D E HRH+SHL+GL+P IT PDL KAA+KTL+ RG+ G GWS W
Sbjct: 576 LNEWLDDWEDAEPQHRHVSHLYGLYPYDEITPWDTPDLAKAAKKTLEIRGDAGTGWSRAW 635
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K WARL D HA ++++L + V+P GG Y NLF AHPPFQID NFG TA +
Sbjct: 636 KINFWARLQDGNHALLLLRQLLHPVNPNITDGQGGGTYPNLFCAHPPFQIDGNFGGTAGI 695
Query: 710 AEMLVQS--TLNDLYLLPALP-WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
AEML+QS N + LPALP W +G +KG++AR G V+ W+ L + I S
Sbjct: 696 AEMLLQSHGKGNVIRFLPALPSHPDWENGVMKGMRARNGFEVNFEWQRFKLEKAEITS-- 753
Query: 767 SNNDHDSF-----KTLHYRGTSVKVNLSAGKIYTF 796
N S K ++ RG ++ + K+ TF
Sbjct: 754 LNGGECSVLLPANKNVYSRGKAIVKGSNKDKVITF 788
>gi|29350090|ref|NP_813593.1| hypothetical protein BT_4682 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29342002|gb|AAO79787.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 812
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 274/844 (32%), Positives = 417/844 (49%), Gaps = 106/844 (12%)
Query: 3 NAESTSTTNPLKITFNGP---------------AKHFTDAIPIGNGRLGAMVWGGVPSET 47
+AE T T L I F+ P A + ++PIGNG +GA + G + +E
Sbjct: 19 HAEDTDYTKGLSIWFDSPNTLQGKEVWHSSKQDASWESQSLPIGNGSIGANILGSIEAER 78
Query: 48 LKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA 99
+ NE TLW G P DY N + L ++R G +A + + F
Sbjct: 79 ITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIRKAFVEGDQKKAEKLTRENFNSEV 138
Query: 100 DV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFT 147
+ +G+ +E S + ++ Y+R L L++A A V++ +V +
Sbjct: 139 PYEFSREKPFRFGNFTTMGEFYVETGLSTIGMSD--YKRILSLDSAMAVVQFKKDDVAYQ 196
Query: 148 REHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
R +F S P V+V + S + G +L+F + + + +GNN ++
Sbjct: 197 RNYFISYPANVMVMRFSADQPGKQNLTFRYAPNPVSTGQFSADGNNGLVY---------- 246
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
A+ D G++++ + I+ + GT++ D ++ V+ +D V + A + + F
Sbjct: 247 ---TASLDNNGMKYA--VRIQATVKGGTLNN-TDGRITVKEADEVVFYVTADTDYKMNFA 300
Query: 266 -NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
+ +D K +P + ++ + YS+L H DY LF+RV ++L+ + K
Sbjct: 301 PDFTDPKTYVGVNPLETTQQWMKDAVSKGYSNLLDEHYKDYASLFNRVKLELNPTVK--- 357
Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+P+A+R+K+++ + D L +L +QFGRYLLI+SSRPG ANLQGIW+
Sbjct: 358 --------TSNLPTAQRLKNYRNGQPDYYLEKLYYQFGRYLLIASSRPGNMPANLQGIWH 409
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++ W H NIN++MNYW + NL EC PL DF+ L G KTAQ + A GW
Sbjct: 410 NNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLIDFIRTLVKPGEKTAQSYFGARGW 469
Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
+I+ ++ + + W PM G WL TH+WE+Y+YT D FL++ Y L++
Sbjct: 470 TASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVWEYYDYTKDLKFLKETGYELIKSS 529
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
A+F +D+L DG PSTSPEH + +T A++RE+ I A+
Sbjct: 530 ANFTVDYLWHKPDGTYTAAPSTSPEH---------GPIDQGATFVHAVVREILLDAIQAS 580
Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPG 616
+ L +K E E VL + L P KI G ++EW+ D DP+ HRH++HLFGL PG
Sbjct: 581 KELGIDKKERKQWEHVLAN---LVPYKIGRYGQLLEWSTDIDDPKDEHRHVNHLFGLHPG 637
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
HT++ P+L +AA+ L RG+ GWS+ WK WARL D HAY + L
Sbjct: 638 HTVSPVTTPELAEAAKVVLVHRGDGATGWSMGWKLNQWARLQDGNHAYTLFGNL------ 691
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W G
Sbjct: 692 -----LKNGTVDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKDGS 745
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYSNYSNN-------DHDSFKTLHYRGTSVKVNLS 789
+ G+ A+G + I WKDG L E I S N SFKT+ R +K +
Sbjct: 746 IYGICAKGNFEIDIAWKDGLLKEATILSKAGQNCIVKYAGQTISFKTVKGRSYQLKYDKE 805
Query: 790 AGKI 793
G I
Sbjct: 806 NGLI 809
>gi|298384410|ref|ZP_06993970.1| fibronectin type III domain protein [Bacteroides sp. 1_1_14]
gi|298262689|gb|EFI05553.1| fibronectin type III domain protein [Bacteroides sp. 1_1_14]
Length = 812
Score = 413 bits (1061), Expect = e-112, Method: Compositional matrix adjust.
Identities = 274/844 (32%), Positives = 417/844 (49%), Gaps = 106/844 (12%)
Query: 3 NAESTSTTNPLKITFNGP---------------AKHFTDAIPIGNGRLGAMVWGGVPSET 47
+AE T T L I F+ P A + ++PIGNG +GA + G + +E
Sbjct: 19 HAEDTDYTKGLSIWFDSPNTLQGKEVWHSSKQDASWESQSLPIGNGSIGANILGSIEAER 78
Query: 48 LKLNEDTLWTGVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA 99
+ NE TLW G P DY N + L ++R G +A + + F
Sbjct: 79 ITFNEKTLWRGGPNTTKGADYYWNVNKQSAHILDEIRKAFVEGDQKKAEKLTRENFNSEV 138
Query: 100 DV------------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFT 147
+ +G+ +E S + ++ Y+R L L++A A V++ +V +
Sbjct: 139 PYEFSREKPFRFGNFTTMGEFYVETGLSTIGMSD--YKRILSLDSAMAVVQFKKDDVAYQ 196
Query: 148 REHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
R +F S P V+V + S + G +L+F + + + +GNN ++
Sbjct: 197 RNYFISYPANVMVMRFSADQPGKQNLTFRYAPNPVSTGQFSADGNNGLVY---------- 246
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
A+ D G++++ + I+ + GT++ D ++ V+ +D V + A + + F
Sbjct: 247 ---TASLDNNGMKYA--VRIQATVKGGTLNN-TDGRITVKEADEVVFYVTADTDYKMNFA 300
Query: 266 -NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
+ +D K +P + ++ + YS+L H DY LF+RV ++L+ + K
Sbjct: 301 PDFTDPKTYVGVNPLETTQQWMKDAVSKGYSNLLDEHYKDYASLFNRVKLELNPTVK--- 357
Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+P+A+R+K+++ + D L +L +QFGRYLLI+SSRPG ANLQGIW+
Sbjct: 358 --------TSNLPTAQRLKNYRNGQPDYYLEKLYYQFGRYLLIASSRPGNMPANLQGIWH 409
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++ W H NIN++MNYW + NL EC PL DF+ L G KTAQ + A GW
Sbjct: 410 NNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLIDFIRTLVKPGEKTAQSYFGARGW 469
Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
+I+ ++ + + W PM G WL TH+WE+Y+YT D FL++ Y L++
Sbjct: 470 TASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVWEYYDYTKDLKFLKETGYELIKSS 529
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
A+F +D+L DG PSTSPEH + +T A++RE+ I A+
Sbjct: 530 ANFTVDYLWHKPDGTYTAAPSTSPEH---------GPIDQGATFVHAVVREILLDAIQAS 580
Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPG 616
+ L +K E E VL + L P KI G ++EW+ D DP+ HRH++HLFGL PG
Sbjct: 581 KELGIDKKERKQWEHVLAN---LVPYKIGRYGQLLEWSTDIDDPKDEHRHVNHLFGLHPG 637
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
HT++ P+L +AA+ L RG+ GWS+ WK WARL D HAY + L
Sbjct: 638 HTVSPVTTPELAEAAKVVLVHRGDGATGWSMGWKLNQWARLQDGNHAYTLFGNL------ 691
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ G NL+ HPPFQID NFG TA + EML+QS + + LLPALP D W G
Sbjct: 692 -----LKNGTVDNLWDTHPPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWKDGS 745
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYSNYSNN-------DHDSFKTLHYRGTSVKVNLS 789
+ G+ A+G + I WKDG L E I S N SFKT+ R +K +
Sbjct: 746 IYGICAKGNFEIDIAWKDGLLKEATILSKAGQNCIVKYAGQTISFKTVKGRSYQLKYDKE 805
Query: 790 AGKI 793
G I
Sbjct: 806 NGLI 809
>gi|429749280|ref|ZP_19282413.1| hypothetical protein HMPREF9075_01080 [Capnocytophaga sp. oral
taxon 332 str. F0381]
gi|429168711|gb|EKY10529.1| hypothetical protein HMPREF9075_01080 [Capnocytophaga sp. oral
taxon 332 str. F0381]
Length = 805
Score = 412 bits (1060), Expect = e-112, Method: Compositional matrix adjust.
Identities = 274/781 (35%), Positives = 409/781 (52%), Gaps = 73/781 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++ PIGNGR+GAM++GG ++ + LNE +LW+G + P A + L
Sbjct: 23 VSVVFHNPATHFTESAPIGNGRIGAMLYGGTSTDRIVLNEISLWSGGAQESDEPQAYEYL 82
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
++ L+ + EA A + F G+ A+ YQ+ GD+ +++ D+
Sbjct: 83 PHIQQLLLERKNIEAEALLQQHFIAKGEGSCRGNGANCSYGCYQIFGDLLIKWKDTS--- 139
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y R L L+ ATA Y T+ F+ + +I KIS + F V++
Sbjct: 140 PVQNYSRILRLDEATAVTTYQRNGNTITQTAFADFKNDIIWVKISAQKP----FEVAVSL 195
Query: 180 LLDNHSYVNG-NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK----ISDDRGTI 234
++ V+ ++II+ G P N + +G+ F+ I+ ++ + D I
Sbjct: 196 TRKENAIVSYLPDRIILTGVLP----------NKEQQGMHFAGIVALESDGNMQKDEAAI 245
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ ++L LL S S + + N + P + + LQ+ N +
Sbjct: 246 TVQNAREL----------LLKVSMSTNYNYTNSGLTAVSPLETTKAYLQTA-NSDFESAL 294
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
T+ YQ+LF+R +R DT S + + +R+++F + +L+ +L+
Sbjct: 295 TKSKSAYQELFNR-----NRWYAKANADTQS------LSTLQRLENFSKGKKDALLPILY 343
Query: 355 -QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FGRYLLI SSR G ANLQG+W E+ W+ H+NINL+MNYW + NLS E
Sbjct: 344 YNFGRYLLICSSREGLLPANLQGLWAEEYQTPWNGDYHLNINLQMNYWLAEISNLSNLTE 403
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PL F L NG KTA+ Y A GWV H ++ W +S VW GGAWLC H
Sbjct: 404 PLHRFTKNLMPNGRKTAKSYYKAEGWVAHVISNPWFFTSPGES-AVWGSTLTGGAWLCQH 462
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP--D 530
+W+HY +T D DFL K YP+++ +F +LI+ Y T PS SPE+ ++ P
Sbjct: 463 IWQHYLFTHDLDFL-KNYYPVMKEATAFFQSFLIKDPTTDYWVTAPSNSPENAYLFPIDS 521
Query: 531 GK--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE--KVLKSLPRLRPTKIAE 586
GK A + TMDM I+RE+ + I AA +L+ +++ + E K++++ P P +I +
Sbjct: 522 GKKVAAHTCIAPTMDMQIVRELLNNTIKAATILKVDDEKITEWKKIVENTP---PNRIGK 578
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
G + EW D++D E HRH+SHL+GL+P IT P L KAA+KTL+ RG EG GWS
Sbjct: 579 KGDLNEWLDDWQDAEPTHRHVSHLYGLYPYDEITPWDTPKLAKAAKKTLKIRGNEGTGWS 638
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFT 706
WK WARL + + A ++ +L V P+ GG Y NLF AHPPFQID N G
Sbjct: 639 SAWKINFWARLQNGKQALLLLHQLLKPVSPQMLNGEAGGSYPNLFCAHPPFQIDGNLGGA 698
Query: 707 AAVAEMLVQS--TLNDLYLLPALPWD-KWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
A +AEML+QS T N + LPALP W +G + G+KAR G VS WK L + I
Sbjct: 699 AGIAEMLLQSHGTDNTIRFLPALPHHPDWENGTISGMKARNGFQVSFSWKKHQLQQATIT 758
Query: 764 S 764
S
Sbjct: 759 S 759
>gi|168215503|ref|ZP_02641128.1| fibronectin type III domain protein [Clostridium perfringens NCTC
8239]
gi|182382428|gb|EDT79907.1| fibronectin type III domain protein [Clostridium perfringens NCTC
8239]
Length = 1479
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 257/760 (33%), Positives = 397/760 (52%), Gaps = 82/760 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA + +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATKWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHY +T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYQFTEDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGVDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D DP +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 601 VQEWKDDIDDPNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKAN 660
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K LWARL D + A+R++ E NLF HPPFQID N G + +
Sbjct: 661 KINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSGM 709
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
AEMLVQS L + LPALP W G GLKARG +S
Sbjct: 710 AEMLVQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEIS 748
>gi|168206072|ref|ZP_02632077.1| fibronectin type III domain protein [Clostridium perfringens E str.
JGS1987]
gi|170662403|gb|EDT15086.1| fibronectin type III domain protein [Clostridium perfringens E str.
JGS1987]
Length = 1479
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 258/760 (33%), Positives = 397/760 (52%), Gaps = 82/760 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA + +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATSWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + K +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQKAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 ITNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIKDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHY +T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYQFTEDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGVDEEFRAELENKRERLLKP-QIGKHGQ 600
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D DP +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 601 VQEWKDDIDDPNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKAN 660
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K LWARL D + A+R++ E NLF HPPFQID N G + +
Sbjct: 661 KINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSGM 709
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
AEMLVQS L + LPALP W G GLKARG +S
Sbjct: 710 AEMLVQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEIS 748
>gi|336412577|ref|ZP_08592930.1| hypothetical protein HMPREF1017_00038 [Bacteroides ovatus
3_8_47FAA]
gi|335942623|gb|EGN04465.1| hypothetical protein HMPREF1017_00038 [Bacteroides ovatus
3_8_47FAA]
Length = 799
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 261/771 (33%), Positives = 395/771 (51%), Gaps = 84/771 (10%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTEKGADYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADRENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVRIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYAALFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLEELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKERKQWEHVLAN---LVPYQIGRYGQLMEW 632
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK
Sbjct: 633 SVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKLNQ 692
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 693 WARLQDGNHAYTLFGNL-----------LKNGTMDNLWDTHPPFQIDGNFGGTAGITEML 741
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+QS + + LLPALP D W G + G+ A+G V + W++ L E + S
Sbjct: 742 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDVIWENHQLKEAVVRS 791
>gi|160884032|ref|ZP_02065035.1| hypothetical protein BACOVA_02006 [Bacteroides ovatus ATCC 8483]
gi|423291498|ref|ZP_17270346.1| hypothetical protein HMPREF1069_05389 [Bacteroides ovatus
CL02T12C04]
gi|156110374|gb|EDO12119.1| hypothetical protein BACOVA_02006 [Bacteroides ovatus ATCC 8483]
gi|392663498|gb|EIY57048.1| hypothetical protein HMPREF1069_05389 [Bacteroides ovatus
CL02T12C04]
Length = 829
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 268/807 (33%), Positives = 407/807 (50%), Gaps = 91/807 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-----DY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 74 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGVDYYWNVNKQSAHLLDEIR 133
Query: 77 SLVDSGQYAEATAASVKLFG----HPAD--------VYQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + AD + +G+ +E + + ++ Y
Sbjct: 134 KAFTEGDQKKAEMLTRQNFNSEVSYEADGENPFRFGSFTTMGEFYVETGLNMIGMSD--Y 191
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ V + R F S P V+V + S +SG +L F+ + + L
Sbjct: 192 KRILSLDSAMAVVQFKKDRVAYQRNFFISYPANVMVVRFSADQSGKQNLVFSYAPNPLST 251
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+GN ++ A+ D G+++ ++ I+ GT+S D KL
Sbjct: 252 GSMVSDGNKGLVY-------------TASLDNNGMKY--VVCIQAETKGGTLSN-ADGKL 295
Query: 243 KVEGSDWAVLLLVASS----SFDGPFINPSDS-KKDPTSESMSALQSIRNLSYSDLYTRH 297
V+ +D V + A + +FD F +P +P + + + Y+ L+ +H
Sbjct: 296 TVKDADEVVFYITADTDYKINFDPDFKDPKTYIGVNPEETTKQWMNNAVAQGYTALFNQH 355
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
+DY LF+RV + L+ + K + +P+++R+K+++ + D L EL +QF
Sbjct: 356 YNDYATLFNRVRLNLNPAVKGV-----------NLPTSQRLKNYRKGQPDYYLGELYYQF 404
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+EC PL
Sbjct: 405 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNECVLPLI 464
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 465 DFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESRDMSWNFNPMAGPWLATHIW 524
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 525 EYYDYTRDLKFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH---------GP 575
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A++VL +K E VL + L P +I G +MEW
Sbjct: 576 IDQGATFVHAVVREILMDAIEASKVLGVDKKGRKQWEHVLAN---LVPYQIGRYGQLMEW 632
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+ WK
Sbjct: 633 SVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSMGWKLNQ 692
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 693 WARLQDGNHAYTLFGNL-----------LKNGTMDNLWDTHPPFQIDGNFGGTAGITEML 741
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN----- 768
+QS + + LLPALP D W G + G+ A+G V + W++ L E + SN
Sbjct: 742 LQSHMGFIQLLPALP-DAWKDGSISGICAKGNFEVDVIWENHQLKEAVVRSNAGGDCVIK 800
Query: 769 --NDHDSFKTLHYRGTSVKVNLSAGKI 793
+ SFKT+ R + + + G I
Sbjct: 801 YADQTISFKTVKGRSYQIGYDATKGLI 827
>gi|224026224|ref|ZP_03644590.1| hypothetical protein BACCOPRO_02980 [Bacteroides coprophilus DSM
18228]
gi|224019460|gb|EEF77458.1| hypothetical protein BACCOPRO_02980 [Bacteroides coprophilus DSM
18228]
Length = 825
Score = 412 bits (1058), Expect = e-112, Method: Compositional matrix adjust.
Identities = 265/801 (33%), Positives = 393/801 (49%), Gaps = 87/801 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVRSL 78
++P+GNG +GA + G V E NE TLW G P N ++ L D+R
Sbjct: 70 SLPVGNGSIGANIMGSVSVERFTFNEKTLWRGGPRTVKNAASYWNVNKESAHVLKDIRQA 129
Query: 79 VDSGQYAEATAASVKLFG----HPADV--------YQLLGDIELEFDDSHLKYAEETYRR 126
G +AT + F + AD + G+ ++ KY+ Y R
Sbjct: 130 FADGNVEKATQLTQDNFNSEVPYEADAEEPFRFGSFTSCGEFRIQTGLDEQKYS--GYSR 187
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDNH 184
L L++A V++ V + R+ F+S P V+V + + + +L N + + L +H
Sbjct: 188 SLLLDSALVTVRFEQEGVHYRRDFFTSYPHNVMVVRFTADQEKRQNLVLNYTPNPL--SH 245
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N+ +G C R+ Q ++ K + G + + V
Sbjct: 246 GKFKAENR---DGFCFDARL----------DNNQMHYVVRAKAVAEGGKVWTDRQGNIHV 292
Query: 245 EGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
EG+D L+ A + +FD F +P DP + ++ +LSY++L H
Sbjct: 293 EGADEVYFLITADTDYQINFDPDFKDPKTYVGVDPLRTTREWMKQAASLSYAELLGEHYT 352
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGR 358
DY LF R ++L+ K +T +P+ R++ ++T D SL L +QFGR
Sbjct: 353 DYAALFGRTQLELNPDQKGGMT----------LPTPRRLERYRTGAPDYSLESLYYQFGR 402
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLI+SSRPG ANLQG+W+ ++ W H NIN++MNYW + P NLSEC++PL DF
Sbjct: 403 YLLIASSRPGNLPANLQGMWHNNVDGPWRVDYHNNINVQMNYWPACPTNLSECEQPLIDF 462
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHLWEH 477
+ G +TA+ + A GW ++I+ ++ R K + W P+ G WL TH+W +
Sbjct: 463 IRMQVKPGKETARAYFGARGWTTSISSNIFGFTTPLRDKDMSWNFSPVAGPWLATHVWNY 522
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y+YT D +FL Y L++G A F +D+L DG PSTSPEH +
Sbjct: 523 YDYTRDLEFLRTVGYDLIKGAADFSVDYLWHKPDGTYTAAPSTSPEH---------GPID 573
Query: 538 YSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+T A+IRE+ I A+ L ++ E A E+VL+ +P P +I G +MEW++
Sbjct: 574 QGATFSHAVIREILLDAIEASRTLNVDEQERARWEEVLQGMP---PYQIGRYGQLMEWSK 630
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D DP HRH++HLF L PGHTI+ P L KAA L+ RG+ GWS+ WK WA
Sbjct: 631 DIDDPFDEHRHVNHLFALHPGHTISPVTTPKLAKAARVVLEHRGDGATGWSMGWKLNQWA 690
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D AY + L + G NL+ +HPPFQID NFG TA V EML+Q
Sbjct: 691 RLQDGNRAYTLYGNL-----------LKNGTNDNLWDSHPPFQIDGNFGGTAGVTEMLLQ 739
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFK 775
S + LLPALP D W G + G++ARG + + W+D +L ++S H
Sbjct: 740 SHAGFIQLLPALP-DVWHDGKLTGVRARGNFVLDLYWEDNNLKRAVVHSGSGLPCH---- 794
Query: 776 TLHYRGTSVKVNLSAGKIYTF 796
+ Y+G +K AGK YT
Sbjct: 795 -ILYKGKELKFQTEAGKAYTL 814
>gi|429746943|ref|ZP_19280255.1| hypothetical protein HMPREF9078_01397 [Capnocytophaga sp. oral
taxon 380 str. F0488]
gi|429164651|gb|EKY06768.1| hypothetical protein HMPREF9078_01397 [Capnocytophaga sp. oral
taxon 380 str. F0488]
Length = 799
Score = 411 bits (1057), Expect = e-112, Method: Compositional matrix adjust.
Identities = 272/778 (34%), Positives = 416/778 (53%), Gaps = 66/778 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPAD----VYQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + N + F+ + VI +I + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEATAVTSFKRDNNSIEQTAFADFKNDVIWVRIKATS--PLNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P ND +G+ F+++++++ G I +
Sbjct: 191 -KENATITYQNNKITLNGVLP----------NDGKEGMHFASVVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQ 355
+Q LF+R + N + + + ER++ F E +L+ +L +
Sbjct: 290 SSIVFQGLFNRNRWY-----------GKANANTEGLTTFERLERFYKGEQDALLPILYYN 338
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T + +FL + YP+L+ +F + LI+ GY T PS SPE+ ++ P DG
Sbjct: 458 QHYLFTKNINFL-REYYPVLKEATTFFENLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I + G
Sbjct: 517 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D++D E HRH+SHL+GL+P IT PDL KAA+KTL+ RG+ G GWS W
Sbjct: 576 LNEWLDDWEDAEPQHRHVSHLYGLYPYDEITPWDTPDLAKAAKKTLEIRGDAGTGWSRAW 635
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K WARL D HA ++++L + V+P GG Y NLF AHPPFQID NFG TA +
Sbjct: 636 KINFWARLQDGNHALLLLRQLLHPVNPNITDGQGGGTYPNLFCAHPPFQIDGNFGGTAGI 695
Query: 710 AEMLVQS--TLNDLYLLPALP-WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
AEML+QS N + LPALP W +G +KG++AR G V+ W+ +L + I S
Sbjct: 696 AEMLLQSHGKGNVIRFLPALPSHPDWENGVMKGMRARNGFEVNFEWQQFELEKAEITS 753
>gi|422874794|ref|ZP_16921279.1| fibronectin type III domain-containing protein [Clostridium
perfringens F262]
gi|380304435|gb|EIA16724.1| fibronectin type III domain-containing protein [Clostridium
perfringens F262]
Length = 1479
Score = 411 bits (1056), Expect = e-112, Method: Compositional matrix adjust.
Identities = 256/760 (33%), Positives = 398/760 (52%), Gaps = 82/760 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA + +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATKWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL+++ + + VKY+ V + RE+F S PD V+V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIDESLSTVKYNYKGVNYEREYFCSYPDNVMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE ++ +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENANEITIIMSAGTDYVNEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D TD E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKSDKPTD-------------EMLNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPE
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEQ----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELEDKRERLLKP-QIGKHGQ 600
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D DP +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 601 VQEWKDDIDDPNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKAN 660
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K LWARL D + A+R++ E NLF HPPFQID N G + +
Sbjct: 661 KINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSGM 709
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
AEMLVQS L + LPALP W G GLKARG +S
Sbjct: 710 AEMLVQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEIS 748
>gi|423301304|ref|ZP_17279328.1| hypothetical protein HMPREF1057_02469 [Bacteroides finegoldii
CL09T03C10]
gi|408471905|gb|EKJ90434.1| hypothetical protein HMPREF1057_02469 [Bacteroides finegoldii
CL09T03C10]
Length = 802
Score = 410 bits (1055), Expect = e-111, Method: Compositional matrix adjust.
Identities = 280/842 (33%), Positives = 414/842 (49%), Gaps = 117/842 (13%)
Query: 4 AESTSTTNPLKITFNGP---AKHF---TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
A T T L I F+ P +H + ++PIGNG LGA + G V +E + NE TLW
Sbjct: 20 AGETEYTKGLSIWFDTPNVMEEHTAWESRSLPIGNGSLGANIIGSVDTERITFNEKTLWR 79
Query: 58 GVP-----GDY---TNPDAPKALSDVRSLVDSGQYAEATAASVKLFGH--PADV------ 101
G P +Y N + L ++R G +A + + F P +
Sbjct: 80 GGPNTAKGAEYYWNVNKQSAHVLDEIRKAFTEGDQQKAEMLTRQNFNSEVPYEANREKPF 139
Query: 102 ----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
+ ++G+ +E L ++ Y+R L L++A A V++ NV + R +F S P
Sbjct: 140 RFGNFTIMGEFYVETGLDTLGISD--YKRILSLDSALAVVQFKKNNVAYQRSYFISYPAN 197
Query: 158 VIVTKISGSESG--SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK 215
V+V + S +G +L F+ + +S I +G G D K
Sbjct: 198 VMVMRFSADRAGMQNLVFSYAPNS--------------ISQGSLSG----------DGDK 233
Query: 216 GIQFSA---------ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN 266
G+ FSA ++ I+ GT+S +L V+G+D V + A + + F N
Sbjct: 234 GLVFSASLNNNGMKYVVRIQAETKGGTLSN-AGCRLTVKGADEVVFYVTADTDYKMNF-N 291
Query: 267 P--SDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
P D K DP + + + Y+ L+ +H DY LF+R+ + L+ + K
Sbjct: 292 PDFKDPKTYVGVDPAETTCQWINNAVMQGYTALFQQHYSDYAALFNRLRLNLNPTVK--- 348
Query: 321 TDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+P+ +R+K+++ + D L EL +QFGRYLLI+SSR G ANLQGIW+
Sbjct: 349 --------TSDIPTPQRLKNYRNGQPDYYLEELYYQFGRYLLIASSRAGNMPANLQGIWH 400
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
D+ W H NIN++MNYW + P NLSEC PL DF+ L G KTAQ + A GW
Sbjct: 401 NDVDGPWRVDYHNNINVQMNYWPACPTNLSECMLPLVDFIRTLVKPGEKTAQSYFGARGW 460
Query: 440 VIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
++I+ ++ + + W PM G WL TH+WE+Y+YT D +FL++ Y L++
Sbjct: 461 TASISSNIFGFTTPLESQDMSWNFNPMAGPWLATHIWEYYDYTRDLNFLKETGYELIKSS 520
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
A F +D+L DG PSTSPEH V +T A++RE+ I A+
Sbjct: 521 ADFAVDYLWHKPDGTYTAAPSTSPEH---------GPVDQGATFVHAVVREILLDAIEAS 571
Query: 559 EVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPG 616
+VL +K + VL +L P KI G +MEW+ D DP+ HRH++HLFGL PG
Sbjct: 572 KVLGVDKKKRKQWNDVLS---KLVPYKIGRYGQLMEWSTDIDDPKDEHRHVNHLFGLHPG 628
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
HT++ P+L AA+ L RG+ GWS+ WK WARL D HAY + L
Sbjct: 629 HTVSPVTTPELATAAKVVLLHRGDGATGWSMGWKLNQWARLQDGNHAYTLFGNL------ 682
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ G NL+ HPPFQID NFG TA V EML+QS + + LLPALP + W G
Sbjct: 683 -----LKNGTVDNLWDTHPPFQIDGNFGGTAGVTEMLLQSHMGFIQLLPALP-NAWKDGS 736
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYSNYSNN-------DHDSFKTLHYRGTSVKVNLS 789
+ G+ A+G V + W++ L E + S N SFKT+ + +K +++
Sbjct: 737 ISGICAKGNFEVDMIWENNQLKEATVRSGAGGNCVIRYGDKMLSFKTIKGQSYQIKYDVA 796
Query: 790 AG 791
G
Sbjct: 797 KG 798
>gi|315224299|ref|ZP_07866133.1| possible alpha-L-fucosidase [Capnocytophaga ochracea F0287]
gi|420159534|ref|ZP_14666333.1| hypothetical protein HMPREF1319_1323 [Capnocytophaga ochracea str.
Holt 25]
gi|314945689|gb|EFS97704.1| possible alpha-L-fucosidase [Capnocytophaga ochracea F0287]
gi|394761875|gb|EJF44190.1| hypothetical protein HMPREF1319_1323 [Capnocytophaga ochracea str.
Holt 25]
Length = 799
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 271/775 (34%), Positives = 412/775 (53%), Gaps = 60/775 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + N + F+ + VI KI + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEATAVTSFKRDNNSIEQTAFADFKNDVIWVKIKATSP--LNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P N +G+ F+++++++ G I +
Sbjct: 191 K-ENATITYQNNKITLNGVLP----------NGGKEGMHFASVVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
K + ++ + L + A ++++ F S T ++ LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYN--FDKGGLSDISVTKKANEYLQKAP-MSFDKAKAESSI 292
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-QFGR 358
+Q+LF+R + N + + + ER++ F E +L+ +L+ FGR
Sbjct: 293 VFQRLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYNFGR 341
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL F
Sbjct: 342 YLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPLQRF 401
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W+HY
Sbjct: 402 TKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIWQHY 460
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DGK-- 532
+T + +FL + YP+L+ +F + LI+ GY T PS SPE+ ++ P DGK
Sbjct: 461 LFTKNINFL-REYYPVLKEATTFFENLLIKDPKTGYWVTAPSNSPENAYVLPELKDGKKQ 519
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
+ + TMDM I+RE+F+ AA++L + E S + P +I + G + E
Sbjct: 520 IGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGDLNE 578
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W D++D E HRH+SHL+GL+P IT PDL KAA+KTL+ RG+ G GWS WK
Sbjct: 579 WLDDWEDAEPQHRHVSHLYGLYPYDEITPWDTPDLAKAAKKTLEVRGDAGTGWSRAWKIN 638
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
WARL D HA ++++L + V+P GG Y NLF AHPPFQID NFG TA +AEM
Sbjct: 639 FWARLQDGNHALLLLRQLLHPVNPNITDGQGGGTYPNLFCAHPPFQIDGNFGGTAGIAEM 698
Query: 713 LVQS--TLNDLYLLPALP-WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L+QS N + LPALP W +G +KG++AR G V+ W+ L + I S
Sbjct: 699 LLQSHGKGNIIRFLPALPSHPDWENGVMKGMRARNGFEVNFEWQQFKLEKAEITS 753
>gi|343083763|ref|YP_004773058.1| glycoside hydrolase [Cyclobacterium marinum DSM 745]
gi|342352297|gb|AEL24827.1| glycoside hydrolase family 65 central catalytic [Cyclobacterium
marinum DSM 745]
Length = 806
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 263/781 (33%), Positives = 420/781 (53%), Gaps = 54/781 (6%)
Query: 4 AESTSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
A T+ + +++ + PA + +A+PIGNGRLGAM++GGV E ++LNE++LW G+P D
Sbjct: 32 ARKTNNSKKMQLWYTSPANEWLEALPIGNGRLGAMIFGGVKEEQIQLNEESLWAGMPEDP 91
Query: 64 TNPDAPKALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYA 120
D K + + L G+Y EA ++ L P + Y+ LG++ + FD H K +
Sbjct: 92 YPEDVQKHYAAFQQLNMEGKYEEALKYGMEHLAVSPTSIRSYEPLGELHITFD--HQK-S 148
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
E YRR LDL T Y++ + RE FSS+ VI + + ++ + D
Sbjct: 149 PENYRRTLDLETGVVISTYTIDGKRYLREAFSSDKYDVIFYRFQSLDGEPVNSTIRFDRE 208
Query: 181 LDNHSYVNGNNQIIMEGRC---PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
D + +I++G+ P + + + ++F++ +I + D G++S
Sbjct: 209 KDIVQSIGEGELLIVDGQVFDDPDGYEDNPGGSGETGRHMKFAS--QITATLDEGSMSGN 266
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
E+ L +E S +++ A++ ++ +N D D +++ +L+ +Y H
Sbjct: 267 ENT-LNIENSTGYTVIVSAATDYNLAKLN-FDRNIDAKDKALKSLKGALETAYQTAKDAH 324
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQF 356
+ K+F+RV++ L SP DT+P+ +R+ + D + EL FQ+
Sbjct: 325 TAAHSKMFNRVALSLG-SPLQ-----------DTIPTDKRLDQVREGTNDNHITELFFQY 372
Query: 357 GRYLLISSS-RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
GRYLL+ SS ANLQGIWN+++ W+S H+NINL+MNYW + NLSE PL
Sbjct: 373 GRYLLMGSSVNRAILPANLQGIWNKEMWAPWESDFHLNINLQMNYWPADQTNLSESFVPL 432
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAK-----SSADRGKVVWALWPMGGAWL 470
+F+ L+ NG TA+ +SGW+ HH ++ + + S+ D P+ GAW+
Sbjct: 433 SNFMEKLAKNGEITAEKFIGSSGWMAHHVSNPFGRTTPSGSTKDSQMTNGYSNPLAGAWM 492
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
LW HY +T D+++L++ AYP+L G A F+LD+L E G L T+PS SPE+ +I P
Sbjct: 493 SLSLWRHYEFTQDQEYLKETAYPVLAGTAQFILDFLKENEKGELVTSPSYSPENAYIDPK 552
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
GK + +++MD+ II ++F+A + A E++ + L + K+ +L P KI ++G+
Sbjct: 553 TGKATRNTTAASMDIQIINDIFNACLKAEEII--GDKQLTAAIKKASSKLPPIKIGKNGT 610
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR----GEEGPGW 645
+ EW +D ++ E HRH+SHL+ L+P + IT + P+L KAAEKT+++R G GW
Sbjct: 611 LQEWYEDHEEVEPGHRHMSHLYALYPSNQIT-KATPELFKAAEKTIERRLTYGGAGQTGW 669
Query: 646 SITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLF-AAHPPFQIDANFG 704
S W +ARL E + + L N+F FQI+ NFG
Sbjct: 670 SRAWIINFFARLQKGEEGLEHIHEMMATQ-----------LSPNMFDLLGKIFQIEGNFG 718
Query: 705 FTAAVAEMLVQSTLNDLY-LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
TA +AEMLVQS + LLPALP W++G VKGLKARG +S+ W+DG L + I
Sbjct: 719 ATAGIAEMLVQSHEEGIIRLLPALP-QAWNTGEVKGLKARGNFEISMEWEDGKLKKAEIL 777
Query: 764 S 764
S
Sbjct: 778 S 778
>gi|169343800|ref|ZP_02864799.1| fibronectin type III domain protein [Clostridium perfringens C str.
JGS1495]
gi|169298360|gb|EDS80450.1| fibronectin type III domain protein [Clostridium perfringens C str.
JGS1495]
Length = 1479
Score = 410 bits (1054), Expect = e-111, Method: Compositional matrix adjust.
Identities = 256/760 (33%), Positives = 398/760 (52%), Gaps = 82/760 (10%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY------TN 65
L + ++ PA ++ +A+PIGNG +G M++G V SE ++ NE TLW+G PG +
Sbjct: 48 LALWYDEPATNWENEALPIGNGYMGGMIFGSVASERIQYNEKTLWSGGPGAWEGYNGGNK 107
Query: 66 PDAPKALSDVRSLVDSG-----QYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYA 120
A +A+ ++R ++ G + + +G YQ GDI L+F SH +
Sbjct: 108 EGAWEAVQEIRKILAEGGTPSNDLYQRVCGDQRAYG----AYQNFGDIFLDFK-SHEESK 162
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
YRREL++ + + VKY+ V + RE+F S PD ++V K+ ++ SL+ +V +
Sbjct: 163 VTNYRRELNIEESLSTVKYNYKGVNYEREYFCSYPDNIMVIKLKADKASSLTVDVRNEGA 222
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+ + NN +I+ G + G+++ + +IK+ + G+I ED+
Sbjct: 223 HNGKNLSVENNTLILSGAI-------------EDNGMKYES--QIKVINTGGSIQDKEDR 267
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE +D +++ A + + + P+ +DP S + + NL Y +L +RH++D
Sbjct: 268 -ISVENADEITIIMSAGTDYINEY--PTYKGEDPHSAVTERINNAVNLGYDELKSRHIED 324
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ LF RV++ L D P+ E + ++T++ SL L FQ+GRYL
Sbjct: 325 YKNLFDRVNLNLGELKLD-------------KPTDEILNEYKTNQSNSLETLFFQYGRYL 371
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LISSSR G+ ANLQG+WN +P W S H N+N++MNYW + NLSE PL +++
Sbjct: 372 LISSSREGSLPANLQGVWNNSNNPPWSSDYHFNVNIQMNYWPAEVANLSETAIPLVEYVE 431
Query: 421 YLSINGSKTAQVN-------YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G KTA+++ +GW ++ + + +A + W P AW+ +
Sbjct: 432 SLREPGRKTAEMHCGIEGAMENKNGWTVNTMNNPFG-FTAMGWEFDWGWAPTSNAWISQN 490
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE--GHDG--YLETNPSTSPEHEFIAP 529
LWEHYN+T D+D+L + YP+++ A F +L+E DG YL ++PS SPEH
Sbjct: 491 LWEHYNFTDDKDYLRENIYPIMKEAAQFWTQFLVEYTHSDGKTYLVSSPSYSPEH----- 545
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ +T D +I ++F+ I A+E L +E+ E K L+P +I + G
Sbjct: 546 ----GPRTVGTTFDQELIWQLFTDTIKASETLGIDEEFRAELEDKRERLLKP-QIGKHGQ 600
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D D +HRH+SHL GL+PG I + P+L +AA+ T+ RG+ G GWS
Sbjct: 601 VQEWKDDIDDTNNNHRHISHLVGLYPGTQINQKDTPELYEAAKVTMNHRGDGGTGWSKAN 660
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K LWARL D + A+R++ E NLF HPPFQID N G + +
Sbjct: 661 KINLWARLLDGDRAHRLL-----------ENQLTTSTLENLFDTHPPFQIDGNMGAVSGM 709
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
AEMLVQS L + LPALP W G GLKARG VS
Sbjct: 710 AEMLVQSHLGTINPLPALP-TAWEDGSFDGLKARGNFEVS 748
>gi|393778744|ref|ZP_10367005.1| hypothetical protein HMPREF1321_0421 [Capnocytophaga sp. oral taxon
412 str. F0487]
gi|392611313|gb|EIW94052.1| hypothetical protein HMPREF1321_0421 [Capnocytophaga sp. oral taxon
412 str. F0487]
Length = 799
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 280/815 (34%), Positives = 428/815 (52%), Gaps = 73/815 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ ATA + N + F+ + VI KI + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEATAVTSFKRDNNSIGQTAFADFKNDVIWVKIKATSP--LNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P N +G+ F+++++++ G I +
Sbjct: 191 K-ENATITYQNNKITLNGVLP----------NGGKEGMHFASVVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
+Q+LF+R + N + + + ER++ F E +L+ +L+
Sbjct: 290 SSIVFQRLFNRNRWYGK-----------ANANTEGLTTFERLERFYKGEQDALLPILYYN 338
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T + +FL + YP+L+ +F LI+ GY T PS SPE+ ++ P DG
Sbjct: 458 QHYLFTKNINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I + G
Sbjct: 517 KKQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D++D E HRH+SHL+GL+P IT PDL KAA+KTL+ RG+ G GWS W
Sbjct: 576 LNEWLDDWEDAEPQHRHVSHLYGLYPYDEITPWDTPDLAKAAKKTLEIRGDAGTGWSRAW 635
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K WARL D HA ++++L + V+P GG Y NLF AHPPFQID NFG TA +
Sbjct: 636 KINFWARLQDGNHALLLLRQLLHPVNPNITDGQGGGTYPNLFCAHPPFQIDGNFGGTAGI 695
Query: 710 AEMLVQS--TLNDLYLLPALP-WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
AEML+QS N + LPALP W +G +KG++AR G V+ W+ L + I S
Sbjct: 696 AEMLLQSHGKGNVIRFLPALPSHPDWENGVMKGMRARNGFEVNFEWQQFKLEKAEITS-- 753
Query: 767 SNNDHDSF-----KTLHYRGTSVKVNLSAGKIYTF 796
N S K ++ RG ++ + K+ TF
Sbjct: 754 LNGGECSVLLPANKNVYSRGKAIVKGSNKDKVITF 788
>gi|380694581|ref|ZP_09859440.1| hypothetical protein BfaeM_11488 [Bacteroides faecis MAJ27]
Length = 812
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 265/807 (32%), Positives = 407/807 (50%), Gaps = 91/807 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG +GA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 56 SQSLPIGNGSIGANILGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIR 115
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + +G+ +E + +K +E Y
Sbjct: 116 KAFIEGDQQKAEKLTRENFNSEVPYEYSGEKPFRFGNFTTMGEFYIETGLNTVKMSE--Y 173
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ NV + R +F S P V+V + S + G +L F+ + + +
Sbjct: 174 KRILSLDSAMAVVQFKKDNVAYQRNYFISYPANVMVMRFSADQPGKQNLIFSYAPNPMST 233
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
++G+N ++ +A + G++++ + I+ + GT++ D KL
Sbjct: 234 GQIAIDGSNGLVY-------------SAFLENNGMKYA--VRIQATVKGGTLNN-SDGKL 277
Query: 243 KVEGSDWAVLLLVASSSFDGPFI-NPSDSKK----DPTSESMSALQSIRNLSYSDLYTRH 297
++ +D AV + A + + F + +D K +P + ++ Y++L H
Sbjct: 278 TIKDADEAVFYVTADTDYKMNFAPDFTDPKTYVGVNPLETTQQWMEDAVAKGYTNLLDEH 337
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DY LF+RV ++L+ + K +P+ +R+K+++ + D L +L +QF
Sbjct: 338 YKDYAALFNRVKLELNPTVKTA-----------NLPTEQRLKNYRKGQPDYYLEKLYYQF 386
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 387 GRYLLIASSRPGNMPANLQGIWHNNIDGPWRVDYHNNINIQMNYWPACSTNLDECMLPLI 446
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 447 DFIRTLVKPGEKTAQSYFGARGWTASISANIFGFTTPLESQDMSWNFNPMAGPWLATHVW 506
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT + FL++ Y L++ A+F +D+L DG PSTSPEH
Sbjct: 507 EYYDYTQNLKFLKETGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------GP 557
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A+IRE+ I A++ L +K E E VL + L P KI G +MEW
Sbjct: 558 IDQGATFVHAVIREILLDAIKASKELGIDKKERKQWEHVLAN---LTPYKIGRYGQLMEW 614
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L +AA+ L RG+ GWS+ WK
Sbjct: 615 SVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAEAAKVVLVHRGDGATGWSMGWKLNQ 674
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 675 WARLQDGNHAYTLFGNL-----------LKNGTMDNLWDTHPPFQIDGNFGGTAGITEML 723
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN---- 769
+QS + + LLPALP D W G ++G+ A+G + I WKDG L E + S N
Sbjct: 724 LQSHMGFIQLLPALP-DAWKDGSIQGVCAKGNFEIGIIWKDGLLKEATLLSKAGQNCTVK 782
Query: 770 ---DHDSFKTLHYRGTSVKVNLSAGKI 793
SFKT+ +K + G I
Sbjct: 783 YADKTISFKTVKGHSYQLKYDKENGLI 809
>gi|383124735|ref|ZP_09945397.1| hypothetical protein BSIG_1516 [Bacteroides sp. 1_1_6]
gi|251841110|gb|EES69191.1| hypothetical protein BSIG_1516 [Bacteroides sp. 1_1_6]
Length = 808
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 262/773 (33%), Positives = 400/773 (51%), Gaps = 68/773 (8%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-P 66
+T N +K+ ++ PA + ++P+GNGRLG +++GG+ +ETL LNE T+W+G ++ P
Sbjct: 24 ATENKMKLWYDKPADEWMKSLPLGNGRLGVIIYGGIETETLALNESTMWSGEYDEHQQRP 83
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFG---HPADVYQLLGDIELEFDDSHLKYAEET 123
+ L+ VR L +E + + H + +GD+++ F S+ +
Sbjct: 84 FGREKLNQVRKLFFENNLSEGNHVAGNMMAGSPHSVGTHLPIGDLKINF--SYPQGEISD 141
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
YR ELDL+TA V Y VGN E+ R+ +SNPD V+ I S +++ + L LL
Sbjct: 142 YRHELDLHTAINTVSYKVGNTEYIRQCIASNPDDVVAMHIKASRPKAITMELEL-KLLRQ 200
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ V NQ+I G ++ G+ F + ++I GTI A E KKL
Sbjct: 201 ANVVASGNQLIYTGNAEFEK--------HGKGGVHFEGRIAVQIKG--GTIKA-EGKKLY 249
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+E + LL S F N + S + + ++ + L +H++DY
Sbjct: 250 IEKATEVTLL----SDVRTNFKNNTFSGYNYKIKCEKTIELASKKDFKTLKKKHIEDYSP 305
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLI 362
LF RV + K D +P+ ER + E DP L L FQ+ RYLLI
Sbjct: 306 LFSRVGLSFEHHAK-----------FDHLPNDERWARVKKGESDPGLDALFFQYARYLLI 354
Query: 363 SSSRPGTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
+SSRP + + LQG +N++L+ W + H++IN E NYW + NL+EC PLFD++
Sbjct: 355 ASSRPNSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLAECHLPLFDYI 414
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LSI+G+KTA+ Y GW H + W ++ G ++W L+P +WL +HLW Y+
Sbjct: 415 KDLSIHGAKTAKDLYGCKGWTAHTTANPWGYTAVS-GSILWGLFPTASSWLASHLWTQYD 473
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
YT D+DFL+ AYPLL+ A FLLD++ I+ + YL T PS SPE+ F G+ C S
Sbjct: 474 YTQDKDFLKNTAYPLLKSNAEFLLDYMVIDPRNNYLVTGPSISPENSF-RHQGQEFCASM 532
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
T D + E+FSA + + E+L N DA + + ++ +L P +I+ +G + EW +D+
Sbjct: 533 MPTCDRVLAYEIFSACLQSTEIL--NVDASFADSLRTAISQLPPFRISTNGGVQEWFEDY 590
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKTAL 653
++ +HRH +HL L+P IT++K P+L +AA KT++KR E WS
Sbjct: 591 EEAHPNHRHTTHLLSLYPYSQITLDKTPELAQAAAKTIEKRLAAKDWEDTEWSRANMICF 650
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP---------FQIDANFG 704
+ARL D E AY VK+L + E N+F P F D N
Sbjct: 651 YARLKDSEKAYSSVKQLLGKLSRE-----------NMFTVSPAGIAGAGEDIFAFDGNTA 699
Query: 705 FTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
A +AEML+QS N + LL LP ++W +G KGL ARGG + WK+ +
Sbjct: 700 GAAGMAEMLLQSHDNCIELLSCLP-EEWKNGSFKGLCARGGIEIDASWKNARI 751
>gi|153805874|ref|ZP_01958542.1| hypothetical protein BACCAC_00113 [Bacteroides caccae ATCC 43185]
gi|149130551|gb|EDM21757.1| hypothetical protein BACCAC_00113 [Bacteroides caccae ATCC 43185]
Length = 833
Score = 408 bits (1049), Expect = e-111, Method: Compositional matrix adjust.
Identities = 259/771 (33%), Positives = 387/771 (50%), Gaps = 84/771 (10%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG +GA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 77 SQSLPIGNGSIGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIR 136
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G +A + + F + +G+ +E + + ++ Y
Sbjct: 137 KAFTEGDQVKAERLTRQNFNSEVPYEGSREKPFRFGSFTTMGEFYVETGLNIIGMSD--Y 194
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R +F S P V+V + S G +L F+ + + +
Sbjct: 195 KRILSLDSAMAVVQFKKDDVAYQRNYFISYPANVLVMRFSADRPGKQNLIFSYAPNPVST 254
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
G+N ++ +A D G+++ ++ I+ GT+ + KL
Sbjct: 255 GSMVAQGDNGLVY-------------SAALDNNGMKY--VVRIQAETKGGTLVN-RNGKL 298
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRH 297
V+G+D V + A + + F + K +P + L + YS L H
Sbjct: 299 TVKGADEVVFYVTADTDYKANFAPDFKNPKTYVGVNPVETTGQWLANAVAKGYSALLNEH 358
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DY LF+RV + L+ + K +P+ +R+K+++ + D L EL FQF
Sbjct: 359 YQDYAALFNRVKLNLNPTVK-----------TGNLPTGQRLKNYRKGQPDYYLEELYFQF 407
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 408 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLEECMLPLI 467
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 468 DFIRTLVKPGEKTAQSYFGARGWTASISANIFGFTAPLESQDMSWNFNPMAGPWLATHIW 527
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A+F +D+L DG PSTSPEH
Sbjct: 528 EYYDYTRDLKFLKETGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------GP 578
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A+E L +K E E+VL + L P KI G +MEW
Sbjct: 579 IDQGATFVHAVVREILLDAIKASEELGVDKKERKEWEQVLAN---LVPYKIGRYGQLMEW 635
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLFGL PGHT++ P+L +AA+ L RG+ GWS+ WK
Sbjct: 636 SVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAEAAKVVLVHRGDGATGWSMGWKLNQ 695
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 696 WARLQDGNHAYTLFANL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 744
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+QS + + LLPALP D W G V+G+ A+G V + W++G L E I S
Sbjct: 745 LQSHMGFIQLLPALP-DAWKEGSVRGICAKGNFEVDMIWENGLLKEATILS 794
>gi|420150260|ref|ZP_14657420.1| hypothetical protein HMPREF1320_0993 [Capnocytophaga sp. oral taxon
335 str. F0486]
gi|394752319|gb|EJF36021.1| hypothetical protein HMPREF1320_0993 [Capnocytophaga sp. oral taxon
335 str. F0486]
Length = 799
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 280/815 (34%), Positives = 426/815 (52%), Gaps = 73/815 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+ + F+ PA HFT++IPIGNGRLGAM++G + + LNE +LW+G + +P+A L
Sbjct: 16 VSVVFHKPANHFTESIPIGNGRLGAMLFGKTDVDRIVLNEISLWSGGTQEADDPNAHNYL 75
Query: 73 SDVRSLVDSGQYAEATAASVKLF---------GHPADV----YQLLGDIELEFDDSHLKY 119
+++ L+ G+ EA A + F G A+ YQ+L ++ L++ +
Sbjct: 76 KEIQKLLLQGKNLEAQALLQQHFVAKGKGSCLGQGANCSYGCYQVLAELLLDWKTTS--- 132
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ Y+R L L+ A A + N + F+ + VI KI + L+ ++SL
Sbjct: 133 PIQDYKRVLRLDEAIAVTLFKRDNNSIEQTAFADFKNDVIWVKIKATSP--LNLDISLFR 190
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N + NN+I + G P ND +G+ F+++++++ G I +
Sbjct: 191 K-ENATITYQNNKITLNGALP----------NDGKEGMHFASVVDVQTD---GKIESTH- 235
Query: 240 KKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
K + ++ + L + A ++++ G ++ S +KK + LQ +S+
Sbjct: 236 KAIAIQSAKEITLRISAVTNYNFNKGGLLDISVTKK-----ANEYLQKAP-MSFDKAKAE 289
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF-Q 355
+Q LF+R + N + + + ER+ F E +L+ +L+
Sbjct: 290 SSIVFQGLFNRNRWYGK-----------ANANTEGLTTFERLGRFYKGEQDALLPILYYN 338
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR G ANLQG+W E+ W+ H+NIN++MNYW + P NLS+ EPL
Sbjct: 339 FGRYLLISSSREGLLPANLQGLWAEEYQTPWNGDYHLNINIQMNYWLAEPTNLSQLTEPL 398
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
F L NGSKTA+ Y A+GWV H ++ W +S W GGAWLC H+W
Sbjct: 399 QRFTKNLVPNGSKTAKAYYNANGWVAHVISNPWFYTSPGE-SATWGSTLTGGAWLCEHIW 457
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAP---DG 531
+HY +T + +FL + YP+L+ +F LI+ GY T PS SPE+ ++ P DG
Sbjct: 458 QHYLFTKNINFL-REYYPVLKEATTFFESLLIKDPKTGYWVTAPSNSPENAYVLPELKDG 516
Query: 532 K--LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
K + + TMDM I+RE+F+ AA++L + E S + P +I + G
Sbjct: 517 KRQIGTTCVAPTMDMQIVRELFTNTSDAAKILGLDSKKRTEWERISRNTV-PNRIGKKGD 575
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ EW D++D E HRH+SHL+GL+P IT PDL KAA+KTL+ RG+ G GWS W
Sbjct: 576 LNEWLDDWEDAEPQHRHVSHLYGLYPYDEITPWDTPDLAKAAKKTLEIRGDAGTGWSRAW 635
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
K WARL D HA ++++L + V+P GG Y NLF AHPPFQID NFG TA +
Sbjct: 636 KINFWARLQDGNHALLLLRQLLHPVNPNITDGQGGGTYPNLFCAHPPFQIDGNFGGTAGI 695
Query: 710 AEMLVQS--TLNDLYLLPALP-WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
AEML+QS N + LPALP W +G +KG++AR G V+ W+ L + I S
Sbjct: 696 AEMLLQSHGKGNVIRFLPALPSHPDWENGVMKGMRARNGFEVNFEWQQFKLEKAEITS-- 753
Query: 767 SNNDHDSF-----KTLHYRGTSVKVNLSAGKIYTF 796
N S K ++ RG ++ + K+ TF
Sbjct: 754 LNGGECSVLLPANKNVYSRGKAIVKGSNKDKVITF 788
>gi|423219674|ref|ZP_17206170.1| hypothetical protein HMPREF1061_02943 [Bacteroides caccae
CL03T12C61]
gi|392624879|gb|EIY18957.1| hypothetical protein HMPREF1061_02943 [Bacteroides caccae
CL03T12C61]
Length = 831
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 259/771 (33%), Positives = 387/771 (50%), Gaps = 84/771 (10%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG +GA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 75 SQSLPIGNGSIGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHILDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADV------------YQLLGDIELEFDDSHLKYAEETY 124
G A+A + + F + +G+ +E + + ++ Y
Sbjct: 135 KAFTEGDQAKAERLTRQNFNSEVPYEGSREKPFRFGSFTTMGEFYVETGLNIIGMSD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
+R L L++A A V++ +V + R +F S P V+V + S G +L F+ + + +
Sbjct: 193 KRILSLDSAMAVVQFKKDDVAYQRNYFISYPANVLVMRFSADRPGKQNLIFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
G+N ++ +A D G+++ ++ I+ GT+ + KL
Sbjct: 253 GSMVAQGDNGLVY-------------SAALDNNGMKY--VVRIQAETKGGTLVN-RNGKL 296
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK-----DPTSESMSALQSIRNLSYSDLYTRH 297
V+G+D V + A + + F + K +P + L + YS L H
Sbjct: 297 TVKGADEVVFYVTADTDYKANFAPDFKNPKTYVGVNPVETTGQWLANAVAKGYSALLNEH 356
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQF 356
DY LF+RV + L+ + K +P+ +R+K+++ + D L EL FQF
Sbjct: 357 YQDYAALFNRVKLNLNPTVK-----------TGNLPTGQRLKNYRKGQPDYYLEELYFQF 405
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLI+SSRPG ANLQGIW+ ++ W H NIN++MNYW + NL EC PL
Sbjct: 406 GRYLLIASSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLEECMLPLI 465
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLW 475
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+W
Sbjct: 466 DFIRTLVKPGEKTAQSYFGARGWTASISANIFGFTAPLESQDMSWNFNPMAGPWLATHIW 525
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
E+Y+YT D FL++ Y L++ A+F +D+L DG PSTSPEH
Sbjct: 526 EYYDYTRDLKFLKETGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------GP 576
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ +T A++RE+ I A+E L +K E E+VL + L P KI G +MEW
Sbjct: 577 IDQGATFVHAVVREILLDAIKASEELGVDKKERKEWEQVLAN---LVPYKIGRYGQLMEW 633
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL 653
+ D DP+ HRH++HLF L PGHT++ P+L +AA+ L RG+ GWS+ WK
Sbjct: 634 SVDIDDPKDEHRHVNHLFSLHPGHTVSPVTTPELAEAAKVVLVHRGDGATGWSMGWKLNQ 693
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL D HAY + L + G NL+ HPPFQID NFG TA + EML
Sbjct: 694 WARLQDGNHAYTLFANL-----------LKNGTLDNLWDTHPPFQIDGNFGGTAGITEML 742
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+QS + + LLPALP D W G V+G+ A+G V + W++G L E I S
Sbjct: 743 LQSHMGFIQLLPALP-DAWKEGSVRGICAKGNFEVDMIWENGLLKEATILS 792
>gi|90022148|ref|YP_527975.1| hypothetical protein Sde_2503 [Saccharophagus degradans 2-40]
gi|89951748|gb|ABD81763.1| a-L-fucosidase-like protein [Saccharophagus degradans 2-40]
Length = 803
Score = 408 bits (1048), Expect = e-111, Method: Compositional matrix adjust.
Identities = 265/776 (34%), Positives = 403/776 (51%), Gaps = 84/776 (10%)
Query: 6 STSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG--- 61
ST L I F PA + ++ +P+GNG +G +V G V ETL+LNE TLWTG PG
Sbjct: 26 STVAAKSLPIWFGAPALDWESEGLPMGNGAMGIVVTGEVARETLQLNEKTLWTGGPGAKG 85
Query: 62 -------DYTNPDAPKALSDV--RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF 112
D D + + +D A+ ++ +GH YQ G++++++
Sbjct: 86 YNFGLPTDSIKQDVAHVRQQITLHNGIDPQTAADKLGQNMHGYGH----YQSFGELDIQY 141
Query: 113 DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
+D A Y R LDL A V Y+ N + RE+F S P Q + K+S S S+S
Sbjct: 142 NDQ--TGAVSNYVRSLDLTQGVATVAYTRNNTHYKREYFVSYPQQAAIVKLSASNKQSIS 199
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
F++ + V+ N I + + K N+ +Q+ I +++I D G
Sbjct: 200 FDLGVR--------VHPNRTIETQVKRGVLTFSGKLFDNN----LQY--IGKVQIVVDGG 245
Query: 233 TISALEDK-KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYS 291
++ E +++V ++ AV+ +VA +++ + P + P L+ I+ YS
Sbjct: 246 ELTENEKTGRIQVSRANSAVISIVAGTNYAQAY--PHYRGRLPVKTLDKNLEKIKASEYS 303
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD---EDPS 348
L HL DY LF RV + L + +E + P+ E +K ++ + + +
Sbjct: 304 ALLAEHLTDYTALFGRVELSLIEN---------AESYLLAKPTPELLKQYKGEGSAPERA 354
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L +L FQFGRYLLI+SSR G+ ANLQG+WN +P W++ HVNINL+MNYW + NL
Sbjct: 355 LEQLYFQFGRYLLIASSRNGSLPANLQGVWNNSATPPWNADYHVNINLQMNYWPAQVTNL 414
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVW--ALW-PM 465
E P FDF+ L G ++AQ + A GW + T+I+ + G + W A W P
Sbjct: 415 GETALPFFDFIDSLVEPGKQSAQKVFGARGWTLFLNTNIFGYT----GLIEWPTAFWQPE 470
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH 524
AWL H +EHY + D FL++RAYP+++ A F +D L+ + + G L +PS SPE
Sbjct: 471 AAAWLAQHYFEHYQFYQDNTFLKERAYPVMKEAALFWVDVLVADPNTGLLVVSPSFSPEQ 530
Query: 525 -EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRP- 581
F++ + M I+ ++F+ ++ AA ++ DA +K++++ L +L P
Sbjct: 531 GPFVS----------GAAMSQQIVFDLFTNVVEAANLV---GDAEFKKLIQAKLAKLDPG 577
Query: 582 TKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
T+I G + EW QD D HRH+SHLF L PG I+++ P +AA+ +L RG+E
Sbjct: 578 TRIGSWGQLQEWQQDIDDKTNKHRHISHLFALHPGDQISVQATPAFAEAAKVSLNARGDE 637
Query: 642 GPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDA 701
G GWS WK WARL D + A++++ G NL+ HPPFQID
Sbjct: 638 GTGWSRAWKVNFWARLLDGDRAHKLLA-----------GQLMGSTLPNLWDTHPPFQIDG 686
Query: 702 NFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
NFG TA +AEML+QS + LLPALP +W +G V GL+ARG VS+ W + L
Sbjct: 687 NFGATAGMAEMLIQSHTGQITLLPALP-KQWQTGAVTGLRARGDVQVSMRWANSKL 741
>gi|340619499|ref|YP_004737952.1| alpha-L-fucosidase [Zobellia galactanivorans]
gi|339734296|emb|CAZ97673.1| Alpha-L-fucosidase, family GH95 [Zobellia galactanivorans]
Length = 809
Score = 407 bits (1047), Expect = e-111, Method: Compositional matrix adjust.
Identities = 263/774 (33%), Positives = 414/774 (53%), Gaps = 64/774 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L + + PAK +TDA P+GNGRL AM +GGV E +LNE++LW GVP + D L
Sbjct: 36 LTLWYTSPAKKWTDAFPLGNGRLAAMTFGGVAQERFQLNEESLWAGVPSNPFAEDYRAKL 95
Query: 73 SDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDS-HLKYAEETYRREL 128
+ ++ L+ G+ EA A ++ + PA Y+ LGDI L+F D+ H+ Y+R L
Sbjct: 96 TKLQKLILEGKTLEANAFGLENMTAAPASFRSYEPLGDIVLDFKDTTHIS----NYKRAL 151
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL T ++V Y + E RE F S D + ++S S ++ +SL D
Sbjct: 152 DLETGISKVTYRTEDSEMVRESFISAEDDALFIRLSAKGSKKINCTISLARPKDVRITAT 211
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKG-----IQFSAILEIKISDDRGTISALEDKKLK 243
++ M G+ P + N G + F+A L+ K+S G + L
Sbjct: 212 PEGKLYMLGQIVDIEAPEAHDENAGGSGEGGEHMSFAAGLQTKVS---GGKLCHTEHNLV 268
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+E +D ++ A++++D +N D+ DP+ + L+ + S+ +L H ++++
Sbjct: 269 IENADEVLIAYTAATNYDLSKLN-FDASVDPSLKVRGILEKLDQKSWKELEYTHREEHRN 327
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLI 362
+F RV L SP D ++P+ ER+ +F+ +D L LFQFGRYLL+
Sbjct: 328 MFDRVQFDLGTSPND------------SLPTDERLLAFKNGAKDTGLPVQLFQFGRYLLM 375
Query: 363 SSSR-PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SSR P ANLQG W+E + W++ H+N+NL+MNYW + N+SE +PL ++
Sbjct: 376 GSSRGPAVLPANLQGKWSERMWAPWEADYHLNVNLQMNYWPADVTNISETIDPLVNWFEL 435
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV-----WALWPMGGAWLCTHLWE 476
+ A+ Y + GW HH ++ + + + + L P+ GAW+ +LW+
Sbjct: 436 IVETSKPLAKEMYGSDGWFSHHASNPFGRVTPSASTLPSQFNNAVLDPLPGAWMAMNLWD 495
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP-DGKLAC 535
HY +T D+ FL++R YPLL+G + F+LD L+E +G L PSTSPE+++ P G++
Sbjct: 496 HYEFTQDKVFLKERLYPLLKGASEFILDVLVEDSEGVLHFVPSTSPENQYKDPATGQMMR 555
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL---KSLPRLRPTKIAEDGSIME 592
++ +ST ++IIR +F A + AA +L + + ++++ K+LP K +G +ME
Sbjct: 556 ITSTSTYHLSIIRAMFKATLEAATILGEGNNERCKRIVEAGKALPDFPIDKT--NGRMME 613
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITW 649
W Q ++ E HRHLSHL GL P ++ E+ P L +A K+L+ R G+ G GW+
Sbjct: 614 WRQPLEEKEPGHRHLSHLLGLHP-FSLIDEETPGLFEAVRKSLEWREVNGQGGMGWAYAH 672
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
+ ARL + E AY K LF L+ G S+L PFQID N G TA +
Sbjct: 673 GLLMHARLKEGEKAY---KNLFTLLSR--------GRKSSLMNTIGPFQIDGNLGATAGI 721
Query: 710 AEMLVQSTLND------LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
+EML+QS D L LLPA+P +WS+G + GLKARGG +++ WK+ +L
Sbjct: 722 SEMLLQSHRKDAQGDFILDLLPAIP-SEWSTGNISGLKARGGFELAMKWKENEL 774
>gi|339479496|gb|ABE95964.1| Conserved hypothetical protein (Glycosyl hydrolases family 95)
[Bifidobacterium breve UCC2003]
Length = 783
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 261/768 (33%), Positives = 393/768 (51%), Gaps = 52/768 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + + IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEEGIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVR--SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R SL D A L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQASLQDDYNAATRIIKDATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S S ++ +VS ++++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDASIDVNISVSGTFLKQSRASMETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
D H +++ GR PG I P N +D + G+ ++ + ++ G
Sbjct: 179 FDGHRAT-----LVVMGRMPGLNIGLLPHPSENPWEDEQDGTGMAYAGAFSLTVT---GG 230
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ D L+ L + S F G P S ++ + + +
Sbjct: 231 DVNVGDNSLQCSNITGLSLRFRSMSGFRGSDQQPERSMT-VIADHLEKTIDEWSTDLRTM 289
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED---PSLV 350
+ RH+ DY++ F RV+I L + D DT +P + ++S + E L
Sbjct: 290 FDRHIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDENKEPHRLEMLA 339
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC L E
Sbjct: 340 EAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCALQE 399
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
EPL L + G A G + H D+W ++ G +W+ WP G AW+
Sbjct: 400 LIEPLVSMNEELLVPGHDAADRILGCRGSAVFHNVDLWRRALPANGDPMWSFWPFGQAWM 459
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+ +
Sbjct: 460 CRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETKHG-LAPSPATSPENCFLV-N 516
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKIAED 587
G+L V+ SS AI+R + +I A+ E L++ + LV + L T++ D
Sbjct: 517 GELVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDRDLVHEAESVRSPLAETRLGAD 576
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
G I+EW +F + + HRHLSHL+ L PG IT K P L +AA K+L+ RG++G GWSI
Sbjct: 577 GRILEWNDEFIESDPQHRHLSHLYELHPGAGIT-SKTPHLEEAARKSLEVRGDDGSGWSI 635
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH-FEGGLYSNLFAAHPPFQIDANFGFT 706
W+ +WARL D EHA R++ VD E + GG+Y + AHPPFQID N GF
Sbjct: 636 VWRMIMWARLRDAEHAKRIIGMFLRPVDANAETNLLGGGVYGSGLCAHPPFQIDGNLGFP 695
Query: 707 AAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
AA++EMLVQS + +LPALP D W G L+ARGG V W D
Sbjct: 696 AALSEMLVQSHDGWIRILPALPED-WHEGTFHALRARGGIQVDAIWTD 742
>gi|298386944|ref|ZP_06996498.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
gi|298260094|gb|EFI02964.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
Length = 809
Score = 406 bits (1044), Expect = e-110, Method: Compositional matrix adjust.
Identities = 261/775 (33%), Positives = 407/775 (52%), Gaps = 68/775 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S +TT+ +K+ ++ PA + ++P+GNGRLGAMV+GGV +ET+ LNE T+W+G ++
Sbjct: 24 SEATTDNMKLWYDKPADEWMKSLPLGNGRLGAMVYGGVETETIGLNESTMWSGEYDEHQQ 83
Query: 66 -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFG--HPADVYQLLGDIELEFDDSHLKYAE 121
P + L ++R L G AE A + G H A + +GD++L F + ++
Sbjct: 84 RPLGREKLDEIRKLFFEGNLAEGNHIAGNTMAGSPHSAGTHLPIGDLKLNFTYPEGELSD 143
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y ELDL+TA V Y +G+ E+TR+ +SNPD VI I+ S +++ + L+ LL
Sbjct: 144 --YHHELDLSTAVNTVTYKIGDTEYTRQSIASNPDDVIAMYITASRPEAITMELELN-LL 200
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
N + NQ+I G ++ G+ F + ++I GTI A + KK
Sbjct: 201 RNAEVIASGNQLIYTGNAEFEK--------HGRGGVLFEGRIAVEIKG--GTIKA-DGKK 249
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
L ++ + LL S + N + + D + +++ S+ L H++DY
Sbjct: 250 LLIDKATEVTLL----SDVRTNYKNTTFAGYDYKQKCKETIEAASKKSFKTLRNIHVEDY 305
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
LF RV++ + K + +P+ +R + E DP L L FQ+ RYL
Sbjct: 306 APLFSRVALSFGDNGK-----------LSHLPNDQRWARVKAGESDPGLDALFFQYARYL 354
Query: 361 LISSSRPGTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LI+SSRP + + LQG +N++L+ W + H++IN E NYW + NL EC PLFD
Sbjct: 355 LIASSRPNSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLPECHLPLFD 414
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
++ LS++GSK AQ Y GW H ++ W ++ G ++W L+P +WL +H+W
Sbjct: 415 YIKDLSVHGSKIAQDLYGCKGWTAHTTSNPWGYTAVS-GSILWGLFPTASSWLTSHVWTQ 473
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
Y YT D+ FL++ AYPLL+ A FLLD++ I+ + YL T PS SPE+ F G+ C
Sbjct: 474 YEYTQDKKFLQETAYPLLKSNAEFLLDYMVIDPRNNYLVTGPSISPENSF-HYQGQEFCA 532
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
S T D + E+FSA + + E+L N DA + + ++ +L P +I+ +G + EW +
Sbjct: 533 SMMPTCDRVLAYEIFSACLQSTEIL--NVDASFADSLRTAISQLPPFRISANGGVQEWFE 590
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKT 651
D+++ +HRH +HL L+P IT+ K P+L KAA T+++R E WS
Sbjct: 591 DYEEAHPNHRHTTHLLSLYPYSQITLNKTPELAKAAYTTIERRLAAKDWEDTEWSRANMI 650
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP---------FQIDAN 702
+ARL + + AY VK+L + E N+F P F D N
Sbjct: 651 CFYARLKEPKKAYDSVKQLLGPLSRE-----------NMFTVSPAGIAGANDDIFAFDGN 699
Query: 703 FGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
A +AEML+QS N + LLP LP ++W G KGL ARGG + WK+ +
Sbjct: 700 TAGAAGIAEMLLQSYDNRIELLPCLP-EEWKDGSFKGLCARGGIELDANWKNARI 753
>gi|167763307|ref|ZP_02435434.1| hypothetical protein BACSTE_01680 [Bacteroides stercoris ATCC
43183]
gi|167698601|gb|EDS15180.1| hypothetical protein BACSTE_01680 [Bacteroides stercoris ATCC
43183]
Length = 657
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 250/695 (35%), Positives = 360/695 (51%), Gaps = 67/695 (9%)
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLL 181
YRREL L++A A V++ V++ R F S P V+V + S +L F+ + + +
Sbjct: 18 YRRELSLDSARAIVQFCKDGVKYKRTSFISYPANVLVMRYSADRPAKQNLRFSYAPNPVS 77
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
G N ++ R D +++ ++ +++ GT++ D+
Sbjct: 78 AGSLQPEGKNGLVFRARL-------------DNNSMEY--VVRMRVLTQGGTVTNTHDQL 122
Query: 242 LKVEGSDWAVLLLVASS----SFDGPFINPSD-SKKDPTSESMSALQSIRNLSYSDLYTR 296
L +EG+D V L+ A + +F+ F NP +P + + Y LY
Sbjct: 123 L-IEGADEVVFLITADTDYLINFNPDFTNPKTYVGVNPEETTAYWINEAEKQGYEALYQA 181
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQ 355
H DY LF+RV + L+ S + +P +R+ ++ + D L +L +Q
Sbjct: 182 HYADYTALFNRVKLNLTNS-----------SDFRDMPITQRLSRYREGQKDFYLEQLYYQ 230
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLI+SSRPG ANLQGIW+ ++ W H NINL+MNYW + NLSEC +PL
Sbjct: 231 FGRYLLIASSRPGNFPANLQGIWHNNVDGPWRVDYHNNINLQMNYWPACSTNLSECMKPL 290
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHL 474
DF+ L G KTAQ + A GW +I+ ++ + + W PM G WL TH+
Sbjct: 291 IDFIRTLVKPGEKTAQAYFGARGWTASISGNIFGFTTPLESENMSWNFNPMAGPWLATHI 350
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
WE+Y+YT D FL++ Y L++ A+F +D+L DG PSTSPEH
Sbjct: 351 WEYYDYTRDVKFLKEIGYELIKSSANFAVDYLWHKPDGTYTAAPSTSPEH---------G 401
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
V +T A++RE+ I A++VL + E E+VL+ +L P KI G +ME
Sbjct: 402 PVDQGATFVHAVVREILLDAIDASKVLRVDAKERKYWEQVLE---KLVPYKIGRYGQLME 458
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W+ D DP+ HRH++HLFGL PGHT++ P+L A+ L+ RG+ GWS+ WK
Sbjct: 459 WSGDMDDPKDQHRHVNHLFGLHPGHTVSPITTPELSDASRVVLEHRGDGATGWSMGWKLN 518
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
WARLHD HAY++ L KH G +NL+ HPPFQID NFG TA V EM
Sbjct: 519 QWARLHDGNHAYKLFGNLL--------KH---GTLNNLWDMHPPFQIDGNFGGTAGVTEM 567
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHD 772
L+QS + ++LLPALP D WS G V GL ARG ++ +CWKDG L +V I S Y+
Sbjct: 568 LLQSHMGFIHLLPALP-DAWSDGSVSGLCARGNFSLDVCWKDGKLRQVDIIS-YAGTP-- 623
Query: 773 SFKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQ 807
L YR + GK Y Q C L++
Sbjct: 624 --CILRYRDAVLIFKTQKGKSYRVTYQNGCLILNK 656
>gi|296453497|ref|YP_003660640.1| hypothetical protein BLJ_0327 [Bifidobacterium longum subsp. longum
JDM301]
gi|296182928|gb|ADG99809.1| hypothetical protein BLJ_0327 [Bifidobacterium longum subsp. longum
JDM301]
Length = 783
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 265/769 (34%), Positives = 393/769 (51%), Gaps = 54/769 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + + IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEEGIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVRSLVDSGQYAEAT--AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R Y AT L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQAASGDDYTAATRIIKEATLQEKDEQIYEPFGTARIQY--STPADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S ++ +VS L+++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASLETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRI-----PPKANANDDPKGIQFSAILEIKISDDRGTIS 235
D H +I+ GR PG + P + D+ G + ++ G I+
Sbjct: 179 SDGHRAT-----LIVMGRMPGLNVGLLPHPSEHPWEDEQDGTGMAYAGAFSLTATGGDIN 233
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
++D L+ L + S F G P S + L+ + +DL T
Sbjct: 234 -VDDNSLQCSHITGLSLRFRSMSGFKGSDQQPERS----MTVIADHLEKTIDEWSTDLQT 288
Query: 296 ---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED---PSL 349
RH+ DY++ F RV+I L + D DT +P + ++S + E L
Sbjct: 289 MLDRHIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDENKEPHRLEML 338
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC L
Sbjct: 339 AEAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCALK 398
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
E EPL L G A G + H D+W ++ G+ +WA WP G AW
Sbjct: 399 ELIEPLVSMNEELLAPGHDAADKILGCRGSAVFHNVDLWRRALPANGEPMWAFWPFGQAW 458
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
+C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+
Sbjct: 459 MCRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETEHG-LAPSPATSPENCFLV- 515
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKIAE 586
+G+ V+ SS AI+R + +I A+ E L++ + ALV + +L T++
Sbjct: 516 NGEPVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDSALVREAESVRSQLAETRLGA 575
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
DG I+EW +F + + HRHLSHL+ L PG IT K P L +AA K+L+ RG++G GWS
Sbjct: 576 DGRILEWNDEFIESDPQHRHLSHLYELHPGAGIT-SKTPHLEEAARKSLEVRGDDGSGWS 634
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH-FEGGLYSNLFAAHPPFQIDANFGF 705
I W+ +WARL D EHA R++ VD E + GG+Y + AHPPFQID N GF
Sbjct: 635 IVWRMIMWARLRDAEHAKRIIGMFLRPVDANAETNLLGGGVYGSGLCAHPPFQIDGNLGF 694
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
AA++EMLVQS + +LPALP D W G L+ARGG V W D
Sbjct: 695 PAALSEMLVQSHDGWIRVLPALPED-WHEGSFHALRARGGIQVDATWTD 742
>gi|393789783|ref|ZP_10377902.1| hypothetical protein HMPREF1068_04182 [Bacteroides nordii
CL02T12C05]
gi|392650186|gb|EIY43857.1| hypothetical protein HMPREF1068_04182 [Bacteroides nordii
CL02T12C05]
Length = 800
Score = 404 bits (1039), Expect = e-110, Method: Compositional matrix adjust.
Identities = 251/764 (32%), Positives = 404/764 (52%), Gaps = 57/764 (7%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP---KALSDVR 76
PAK + +++PIGNGRLGAM +GG+ ETL LNE ++W+G + N D P L ++R
Sbjct: 35 PAKEWMESLPIGNGRLGAMTYGGIEEETLALNESSMWSGQFNE--NQDKPFGRAKLDNLR 92
Query: 77 SLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L G+ E A L G + +GD++++F ++ K YRR L+LN A
Sbjct: 93 KLFFEGKLWEGNQTAGDNLNGMQTSFGTHLPIGDLKMKF--TYPKGDITGYRRSLNLNEA 150
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+ V ++ G V + RE+F++NPD V+V ++S + S++ +++LD L+ ++ NNQ+
Sbjct: 151 ISSVSFNAGGVNYKREYFATNPDNVLVLRLSADKPKSVTMDMALD-LMRQSAFTVENNQL 209
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
I G+ P P G+ F I + D G + +++ + V +D ++
Sbjct: 210 IFTGKV---DFPLHG-----PGGVNFEG--RIAVLADNGEVK-MDEAGISVSNADAVTMI 258
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
+ + + P D + + ++ Y L H+ DY LF+RV + L
Sbjct: 259 VDVRTDYKSP---------DYKALCATTVEEAGMKPYEALKLMHIKDYSNLFNRVELSLG 309
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQV- 371
+ D T+P+ R K ++ + D S L FQ+GRYL I+SSR + +
Sbjct: 310 KDSND------------TIPTDIRWKQIRSGKTDTSFDALYFQYGRYLTIASSRENSPLP 357
Query: 372 ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
LQG +N++ + W + H++IN + NYW S NL+EC PLF+++ LS++G+KT
Sbjct: 358 IALQGFFNDNQACNMGWTNDYHLDINTQQNYWVSNVGNLAECNTPLFNYIKDLSVHGAKT 417
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+V Y GW + +IW + A G ++W L+P+ G+W+ THLW Y YT D+ +L +
Sbjct: 418 AEVVYGCKGWTANTTANIWGYTPAS-GSIIWGLFPLAGSWIATHLWTQYEYTQDKKYLAE 476
Query: 490 RAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
AYPLL+G A F+LD++ E +GYL T PS SPE+ F +G+ S T D ++
Sbjct: 477 VAYPLLKGNAEFILDYMTENPANGYLMTGPSISPENWFKTANGQEMVASMMPTCDRELVY 536
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
E+F++ I AA++L ++ A + +L +L P ++ +G+I EW +D+++ +HRH S
Sbjct: 537 EIFTSCIQAADILGIDK-AFSNNLQTALAKLPPIQLRANGAIREWFEDYEEAHPNHRHTS 595
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKR----GEEGPGWSITWKTALWARLHDQEHAY 664
HL L+P IT+EK P+L AA KT++ R E WS +ARL D E AY
Sbjct: 596 HLLALYPFSQITLEKTPELAAAARKTIEARLAAENWEDTEWSRANMICFYARLKDAEEAY 655
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+ VK L ++ E+ G + A + + D N A +AEML+Q+ + L
Sbjct: 656 KSVKTLQGMLSRENLLTVSPGGIAG--APNNIYSFDGNPAGAAGMAEMLIQNHEGYVEFL 713
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
P LP W +G KGL RGG VS W++ + + + N
Sbjct: 714 PCLP-VAWKNGQFKGLCIRGGAEVSAQWENAVIQHASLKATADN 756
>gi|365122414|ref|ZP_09339317.1| hypothetical protein HMPREF1033_02663 [Tannerella sp.
6_1_58FAA_CT1]
gi|363642654|gb|EHL82000.1| hypothetical protein HMPREF1033_02663 [Tannerella sp.
6_1_58FAA_CT1]
Length = 837
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 272/805 (33%), Positives = 404/805 (50%), Gaps = 93/805 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR-S 77
+ PIGNG G + G V +E + LNE +LW G P Y N + K L +R S
Sbjct: 79 SFPIGNGSFGGNILGSVKTERITLNEKSLWKGGPNVSGGARYYWDANKEGYKVLDQIRHS 138
Query: 78 LVD-SGQYAEATAASVKLF----GHPADV--------YQLLGDIELEFDDSHLKYAE-ET 123
+ SG + AT + F G+ D + +G+ + D+ + +E
Sbjct: 139 FIQFSGINSVATELTRNNFNGKCGYEPDSEKSFRFGSFTTMGEFHI---DTGIAESEISD 195
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLL 181
YRR L L++A V+++ G F R+ FSS PD +++ + + G +L+F +
Sbjct: 196 YRRILSLDSALVVVQFNAGGDCFYRKFFSSYPDSIMIYRFECTRPGRQNLTFRYVANPQA 255
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+G I+ GR D G+QF ++ ++ + GT++ +E+
Sbjct: 256 SGSVEADGTAGIVYNGRL-------------DSNGMQF--VIRVRAVAESGTVT-VENGA 299
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINP--SDSKK----DPTSESMSALQSIRNLSYSDLYT 295
+KV G+D + + + + NP +D + DP + + L Y +Y
Sbjct: 300 IKVIGADNVTFYVAGDTDYKMNY-NPDFNDDRAYVGVDPVMTTQNNLDFALAKGYDAVYN 358
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLF 354
H DY LF RV I L+ S + V+D +P+ R+ +++ D L EL F
Sbjct: 359 AHRADYSALFDRVKIDLNES--NPVSD---------IPTDMRLSNYRNGISDHYLEELYF 407
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
QFGRYLLI+SSR G ANLQG+W+ ++ W H NINL+MNYW + P NLSECQ P
Sbjct: 408 QFGRYLLIASSRAGNLPANLQGLWHNNVEGPWRVDYHNNINLQMNYWPACPANLSECQTP 467
Query: 415 LFDFLTYLSINGSKTAQVNYL--ASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLC 471
L +++ L G +TA+ Y GW ++I+ +S + + W + G WL
Sbjct: 468 LIEYIRTLVKPGERTAKAYYGPDTRGWTTSVSSNIFGFTSPLSSRDMSWNFSFVAGPWLA 527
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG 531
TH+WE+Y+YT D DFL Y L++G A F +D L DG PSTSPEH
Sbjct: 528 THVWEYYDYTRDEDFLRTTGYELIKGSAEFAVDHLWHKPDGSYAAAPSTSPEH------- 580
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
V +T A++RE+ I +++L+ + E+ + L +L P +I G +M
Sbjct: 581 --GPVDQGATFAHAVVREILLDAIETSKILDVDASER-EEWQEVLNKLMPYEIGRYGQLM 637
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW+ D DP+ HRH++HLFGL PG TI+ P+L A+ L+KRG+ GWS+ WK
Sbjct: 638 EWSADIDDPKDKHRHVNHLFGLHPGRTISPITTPELSTASRIVLEKRGDGATGWSMGWKL 697
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WARLHD HAY + + L + G NL+ HPPFQID NFG TA + E
Sbjct: 698 NQWARLHDGNHAYLLFQNL-----------LKNGTADNLWDMHPPFQIDGNFGGTAGIIE 746
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDH 771
ML+QS + ++LLPALP DKW+SG V GL ARG V I W+ G+L + I S
Sbjct: 747 MLMQSHMGFIHLLPALP-DKWASGDVIGLCARGNFEVDIHWEKGELVKAVIRSG-----S 800
Query: 772 DSFKTLHYRGTSVKVNLSAGKIYTF 796
++ Y+ + V + AGK Y+
Sbjct: 801 GGMCSIRYKDSMVNFDTKAGKSYSL 825
>gi|359404666|ref|ZP_09197493.1| hypothetical protein HMPREF0673_00698 [Prevotella stercorea DSM
18206]
gi|357560094|gb|EHJ41501.1| hypothetical protein HMPREF0673_00698 [Prevotella stercorea DSM
18206]
Length = 838
Score = 402 bits (1033), Expect = e-109, Method: Compositional matrix adjust.
Identities = 265/772 (34%), Positives = 380/772 (49%), Gaps = 86/772 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY--------TNPDAPKALSDVR 76
+ ++PIGNG +G V G V +E + NE TLW G P N + + ++R
Sbjct: 75 SQSLPIGNGNIGGNVLGSVEAERITFNEKTLWRGGPNTARGAAYYWDVNKQSAHVVGEIR 134
Query: 77 SLVDSGQYAEATAASVKLFG----HPADV--------YQLLGDIELEFDDSHLKYAEETY 124
G + +A + K F + AD + G+ +E S + + Y
Sbjct: 135 EAFTKGDWQKAELLTRKNFNSVVPYEADAEEPFRFGSFTTAGEFYIETGLSSVGMTD--Y 192
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLD 182
RREL L++A A+V + V++ RE+F S+P V+ + + S+ G +L F+ + + +
Sbjct: 193 RRELSLDSALAKVSFCKDGVQYEREYFVSHPANVMAVRFAASQRGKQNLVFSYAPNPVST 252
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+G + + R D ++++ + IK G +S E KL
Sbjct: 253 GEMKADGTDALCWLARL-------------DNNSMEYA--VRIKAVAKGGAVSN-EGGKL 296
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK------DPTSESMSALQSIRNLSYSDLYTR 296
V+ +D V L+ A + + P +P S DP + L Y+ L
Sbjct: 297 TVKDADEVVFLITADTDYK-PNYDPDFSAPKAYVGVDPAQTTADWLAKAATKGYAYLLNE 355
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQ 355
H DY +LF+RV + ++ + D D +P R++++ Q D L +L +Q
Sbjct: 356 HYADYSELFNRVRLNINNATADA----------DDLPVNRRLEAYRQGKPDYYLEQLYYQ 405
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRYLLISSSR ANLQG+W+ ++ W H NINL+MNYW + P LSEC+ PL
Sbjct: 406 FGRYLLISSSRADNLPANLQGLWHNNVDGPWRIDYHNNINLQMNYWLACPTGLSECELPL 465
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-VVWALWPMGGAWLCTHL 474
F+F+ L G TA+ + GW +I+ +S + + W P G WL THL
Sbjct: 466 FNFIRTLVKPGRVTAKSYFGTRGWTTSVSGNIFGFTSPLSSEDMSWNFSPFAGPWLATHL 525
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
W +Y++T DR FL Y +L+ A F D+L DG PSTSPEH
Sbjct: 526 WNYYDFTRDRKFLADN-YEILKESADFASDYLWHRADGVYTAAPSTSPEH---------G 575
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAEDGSIME 592
V +T A+IREV + A VL K+ E E LK L P KI G +ME
Sbjct: 576 PVDEGATFAHAVIREVLLDAVEANRVLGKSAKERRQWEDALK---HLAPYKIGRYGQLME 632
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W+ D DP+ HRH++HLFGL PG T++ P+L KA+ L+ RG+ GWS+ WK
Sbjct: 633 WSTDIDDPKDEHRHVNHLFGLHPGRTVSPVTTPELAKASRVVLEHRGDGATGWSMGWKLN 692
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
WARLHD HAY + L + G NL+ H PFQID NFG TA V EM
Sbjct: 693 QWARLHDGNHAYTLYGNL-----------LKNGTLDNLWDTHAPFQIDGNFGGTAGVTEM 741
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L+QS + ++LLPALP D W+ G V GL+A+G TVSI WK+G L E I S
Sbjct: 742 LMQSHMGFVHLLPALP-DAWAEGSVSGLRAKGNFTVSISWKNGKLAEATILS 792
>gi|225351622|ref|ZP_03742645.1| hypothetical protein BIFPSEUDO_03219 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
gi|225157966|gb|EEG71249.1| hypothetical protein BIFPSEUDO_03219 [Bifidobacterium
pseudocatenulatum DSM 20438 = JCM 1200]
Length = 783
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 265/768 (34%), Positives = 393/768 (51%), Gaps = 52/768 (6%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + + IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEEGIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVRSLVDSGQYAEAT--AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R YA AT L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQASLHDDYATATRIIKEATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S ++ +VS L+++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASLETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
D H +I+ GR PG I P N +D + G+ ++ + ++ G
Sbjct: 179 SDGHRAT-----LIVMGRMPGLNIGLLPHPSENPWEDEQDGTGMAYAGAFSLTVTG--GD 231
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
I+ + D L+ L + S F G P S + L+ + +DL
Sbjct: 232 IN-VGDNSLQCSNITGLSLRFRSMSGFKGSDQQPERS----MTVIADHLEKTIDEWSTDL 286
Query: 294 YT---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV 350
T RH+ DY++ F RV+I L + D S + S E +S + + L
Sbjct: 287 QTMLDRHIADYRRYFDRVAIHLGSAHADDAELLFSA----ILRSDENKESHRLE---MLA 339
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC L E
Sbjct: 340 EAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCALQE 399
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWL 470
EPL L G A G + H D+W ++ G +W+ WP G AW+
Sbjct: 400 LIEPLVSMNEELLAPGHDAADRILGCRGSAVFHNVDLWRRALPANGDPMWSFWPFGQAWM 459
Query: 471 CTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD 530
C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+ +
Sbjct: 460 CRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETEHG-LAPSPATSPENCFLV-N 516
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKIAED 587
G+ V+ SS AI+R + +I A+ E L++ + LV + +L T++ D
Sbjct: 517 GEPVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDRDLVREAEAVRSQLAETRLGAD 576
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
G I+EW +F + + HRHLSHL+ L PG IT K P L +AA K+L+ RG++G GWSI
Sbjct: 577 GRILEWNDEFIESDPQHRHLSHLYELHPGAGIT-SKTPHLEEAARKSLEVRGDDGSGWSI 635
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH-FEGGLYSNLFAAHPPFQIDANFGFT 706
W+ +WARL D EHA R++ VD E + GG+Y + AHPPFQID N GF
Sbjct: 636 VWRMIMWARLRDAEHAKRIIGMFLRPVDANAETNLLGGGVYDSGLCAHPPFQIDGNLGFP 695
Query: 707 AAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
AA++EMLVQS + +LPALP D W G L+ARGG V W D
Sbjct: 696 AALSEMLVQSHDGWIRILPALPED-WHEGTFHALRARGGIQVDATWTD 742
>gi|453085568|gb|EMF13611.1| glycoside hydrolase family 95 protein [Mycosphaerella populorum
SO2202]
Length = 811
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 285/804 (35%), Positives = 406/804 (50%), Gaps = 99/804 (12%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + D +PIGNGRLGAM+ G E L LNED++W G P + NP A K L VR
Sbjct: 9 YESPANLWEDGLPIGNGRLGAMIRGTTNVERLWLNEDSVWYGGPQNRVNPAAHKNLELVR 68
Query: 77 SLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLK----YAEETYRRELD 129
L+D + AEA + F G P + Y+ LGD+ + F A ++YRR LD
Sbjct: 69 ELIDQNKIAEAENIMSRTFTGMPESMRHYEPLGDVFMHFGHGRFSGRGGAAVQSYRRALD 128
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L T A V Y+ F RE FSS +VI +IS + LSF ++L+ DN ++
Sbjct: 129 LQTGLATVSYACQGGNFQREVFSSTVAEVICMRISSDQC--LSFLLTLNRGDDNDAH--- 183
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE----------IKISDDRGTISALED 239
R + N +D G+ +A++ +KI D G
Sbjct: 184 --------RQFDRAFDTLTNTDD---GLVLTAVMGGRNAVELAIGVKIVCDDGVKVDSCG 232
Query: 240 KKLKVEGSDWAVLLLVAS-SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
++V +VL+L+A ++F N D+ + E+ + ++ L + H+
Sbjct: 233 IDVEVSMQKGSVLILIAGETTFRN--TNAVDAVQQRLEEAAKS-------TWDQLLSAHV 283
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQF 356
+ +L++RV + L + E N+D V + +R++ + +D L LLF +
Sbjct: 284 AHFGRLYNRVELHLDQ-----------ELNVDHVSTDQRLEQARQHPGQDNELTALLFHY 332
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYLLISSS ANLQGIWN D P W S NINLEMNYW + NL EC + LF
Sbjct: 333 GRYLLISSSLS-GLPANLQGIWNCDAKPVWGSKYTANINLEMNYWPAEVTNLPECHQVLF 391
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+FL L+ G++TAQ Y GW HH TDIWA ++ + W + GAWL TH+WE
Sbjct: 392 NFLERLAERGTQTAQQMYGCRGWTCHHNTDIWADTAPQDRSICATYWNLTGAWLSTHIWE 451
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK---- 532
HY +T+D DFL+ R +P++ G A F D+LIE DG+L T+PS S E+ + P+
Sbjct: 452 HYLFTLDLDFLQ-RYFPIMRGSAQFFQDFLIE-RDGHLVTSPSISAENSYFLPNSNSNNN 509
Query: 533 ---LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
+ + T D I+RE+F A I A +L + A E VL LP PT+I + G
Sbjct: 510 KPVVGSICAGPTWDSQILRELFHACIQAGNLLHE-PVAEYEHVLNKLP---PTQIGKHGQ 565
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPG-----------------HTITIEKNPDLCKAAE 632
IMEW D + E+ HRH+SHL+GL+PG EK L AA+
Sbjct: 566 IMEWLHDVDEVEIGHRHISHLWGLYPGTSLSSSSSSFSSGGEKEKENEKEKESQLHLAAK 625
Query: 633 KTLQKRGEEGPG---WSITWKTALWARLHDQEHAYRMVKRLFNL--------VDPEHEKH 681
+TL++R G G WS+ W L+ARL ++E + ++ + + + +
Sbjct: 626 RTLERRLSGGSGHTSWSLAWILCLYARLGNEEEDEKEKEKQKTMDGGGGGGDMAQKMLRK 685
Query: 682 FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY-LLPALPWDKWSSGCVKGL 740
+ N A HPPFQID NFGFTAAVAEML+QS + LLP L D G V+GL
Sbjct: 686 MSHAVLQNCLANHPPFQIDGNFGFTAAVAEMLLQSHRTTIINLLPCLLADWERGGSVRGL 745
Query: 741 KARGGETVSICWKDGDLHEVGIYS 764
+ARG V + W++G L + S
Sbjct: 746 RARGDVLVDLEWREGKLERAVLLS 769
>gi|380692991|ref|ZP_09857850.1| hypothetical protein BfaeM_03308 [Bacteroides faecis MAJ27]
Length = 779
Score = 401 bits (1030), Expect = e-109, Method: Compositional matrix adjust.
Identities = 257/770 (33%), Positives = 397/770 (51%), Gaps = 66/770 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKA 71
+K+ ++ PA + ++P+GNGRLGAMV+GGV +ET+ LNE T+W+G ++ P +
Sbjct: 1 MKLWYDKPADKWMKSLPLGNGRLGAMVYGGVETETIGLNESTMWSGEYDEHQQRPLGREK 60
Query: 72 LSDVRSLVDSGQYAEAT-AASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRREL 128
L +R L AE A + G H A + +GD++L F + ++ Y EL
Sbjct: 61 LDQIRKLFFEDNLAEGNHIAGNTMAGSPHSAGTHLPIGDLKLNFTYPEGELSD--YHHEL 118
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL TAT V Y VG+ E+TR+ +SNPD VI I S S++ + L LL N V
Sbjct: 119 DLTTATNTVTYKVGDTEYTRQCIASNPDDVIAMHIKASRPESITVELELQ-LLRNAEVVA 177
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
NQ+I G ++ G+ F + +I GTI A + KKL ++ +
Sbjct: 178 SGNQLIYTGNAEFEK--------HGRGGVLFEGRIAAEIKG--GTIKA-DGKKLLIDKAT 226
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+LL S + N + + D + +++ S+ L H++DY LF RV
Sbjct: 227 EVLLL----SDVRTNYKNTTFAGYDYQQKCKETIEAASKKSFKTLRNTHVEDYTPLFSRV 282
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRP 367
++ + K +P+ +R + E DP L L FQ+ RYLLISSSRP
Sbjct: 283 ALSFGENGK-----------FSHLPNDQRWARVKAGESDPGLDALFFQYARYLLISSSRP 331
Query: 368 GTQV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
+ + LQG +N++L+ W + H++IN E NYW + NL EC PLFD++ LS+
Sbjct: 332 NSPLPVALQGFFNDNLACHMGWTNDYHLDINTEQNYWIANVGNLPECHLPLFDYIKDLSV 391
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+GSK AQ Y GW H ++ W ++ G ++W L+P +W+ +H+W Y YT D+
Sbjct: 392 HGSKIAQDLYGCKGWTAHTTSNPWGYAAVS-GSILWGLFPTASSWITSHVWTQYEYTQDK 450
Query: 485 DFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
+FL++ AYPLL+ A FLLD+++ + + YL T PS SPE+ F G+ C S T D
Sbjct: 451 NFLKETAYPLLKSNAEFLLDYMVTDPRNNYLVTGPSISPENSF-RYQGQEFCASMMPTCD 509
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
++ E+FSA + + E+L + A + + ++ +L P +I+ +G + EW +D+++ +
Sbjct: 510 RVLVYEIFSACLKSTEILNVDA-AFADSLRTAISKLPPFRISANGGVQEWFEDYEEAHPN 568
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKTALWARLHD 659
HRH +HL L+P IT+ K P+L AA T+++R E WS +ARL D
Sbjct: 569 HRHTTHLLSLYPYSQITLNKTPELANAARITIERRLAAKDWEDTEWSRANMICFYARLKD 628
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP---------FQIDANFGFTAAVA 710
AY VK+L + E N+F P F D N A +A
Sbjct: 629 PIKAYNSVKQLLGPLSRE-----------NMFTVSPAGIAGAGEDIFAFDGNTAGAAGIA 677
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
EML+Q N + LLP LP ++W +G KGL ARGG + WK+ + +
Sbjct: 678 EMLLQGYDNRIELLPCLP-EEWKNGSFKGLCARGGIELDASWKNAQIEQT 726
>gi|384196720|ref|YP_005582464.1| hypothetical protein HMPREF9228_0580 [Bifidobacterium breve
ACS-071-V-Sch8b]
gi|333110104|gb|AEF27120.1| conserved hypothetical protein [Bifidobacterium breve
ACS-071-V-Sch8b]
Length = 783
Score = 400 bits (1029), Expect = e-108, Method: Compositional matrix adjust.
Identities = 262/771 (33%), Positives = 395/771 (51%), Gaps = 58/771 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + ++IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEESIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVR--SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R SL D A L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQASLQDDYNAATRIIKDATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S ++ +VS ++++
Sbjct: 119 ARALAGETFQMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASMETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
D H +++ GR PG I P N +D + G+ ++ + ++ G
Sbjct: 179 SDGHRAT-----LVVMGRMPGLNIGLLPHPSENPWEDEQDGTGMTYAGAFSLTVT---GG 230
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ D L+ L + S F G P S + L+ + +DL
Sbjct: 231 DVNVGDNSLQCSNITGLSLRFRSMSGFRGSDQQPERS----MTVIADHLEKTIDEWSTDL 286
Query: 294 YT---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED---P 347
T RH+ DY++ F RV+I L + D DT +P + ++S + E
Sbjct: 287 RTMLDRHIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDEKKEPHRLE 336
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC
Sbjct: 337 MLAEAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCA 396
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
L E EPL L + G A G + H D+W ++ G +W+ WP G
Sbjct: 397 LQELIEPLVSMNEELLVPGHDAADRILGCRGSAVFHNVDLWRRALPANGDPMWSFWPFGQ 456
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
AW+C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+
Sbjct: 457 AWMCRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETKHG-LAPSPATSPENCFL 514
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKI 584
+G+ V+ SS AI+R + +I A+ E L++ + LV + +L T++
Sbjct: 515 V-NGEPVSVAQSSENATAIVRNLLDDLIQASHDLENLDEEDRDLVHEAESVRSQLAETRL 573
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
DG I+EW +F + + HRHLSHL+ L PG IT + P L +AA K+L+ RG++G G
Sbjct: 574 GADGRILEWNDEFIESDPQHRHLSHLYELHPGAGIT-SQTPHLEEAARKSLEVRGDDGSG 632
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH-FEGGLYSNLFAAHPPFQIDANF 703
WSI W+ +WARL D EHA R++ VD E + GG+Y + AHPPFQID N
Sbjct: 633 WSIVWRMIMWARLRDAEHAKRIIGMFLRPVDANAETNLLGGGVYGSGLCAHPPFQIDGNL 692
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
GF AA++EMLVQS + +LPALP D W G L+ARGG V W D
Sbjct: 693 GFPAALSEMLVQSHDGWIRILPALPED-WHEGTFHALRARGGIQVDATWTD 742
>gi|255693316|ref|ZP_05416991.1| fibronectin type III domain protein [Bacteroides finegoldii DSM
17565]
gi|260620891|gb|EEX43762.1| hypothetical protein BACFIN_08516 [Bacteroides finegoldii DSM
17565]
Length = 861
Score = 400 bits (1027), Expect = e-108, Method: Compositional matrix adjust.
Identities = 278/850 (32%), Positives = 427/850 (50%), Gaps = 93/850 (10%)
Query: 1 MMNAESTSTT--NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
++NA++T PL+ T++ PAK + ++A+PIGNG +GAM++GGV + ++ NE TLW+
Sbjct: 21 VVNAKTTDRNFPPPLRATYDTPAKIWESEALPIGNGYMGAMIFGGVYVDVIQTNEHTLWS 80
Query: 58 GVP--------GDYTNPDAPK-ALSDVRSLV----------------------------- 79
G P G P+ K L R+L+
Sbjct: 81 GGPSENPGYNGGHLRTPEINKDNLQKARNLLQQKMIDFMADKAAHFDANGKLITYDYEGD 140
Query: 80 ----DSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHL-KYAEETYRRELDLNTAT 134
D +Y + A + + FG YQ L +I + +++ A Y R LD++ +
Sbjct: 141 GEETDLRRYIDNIAGTKEHFGS----YQTLSNIVITSNNTKCPDCAYSDYNRTLDIDNSI 196
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQII 194
V Y + + RE+F S PD V+V +++ +S ++L+SL + ++ N I
Sbjct: 197 HTVSYKESGITYKREYFMSYPDNVMVIRLTSDSKDGISRTIALESLHKTKNIISEGNTIT 256
Query: 195 MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLL 254
M G P K + G++++ ++ + +D G ISA+ D +KV G+ V+L+
Sbjct: 257 MTGY-PTPVGGDKRVGDHWKNGLRYAQ--QVMVRNDGGKISAV-DGMIKVAGAKEIVILM 312
Query: 255 VASSSFDGPFINPSD--SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
A++++ + + SK+DP + + L+ SY L H DY+ L+ R+ I L
Sbjct: 313 SAATNYVQCMDDSYNFFSKEDPLDKVKAILKKASAKSYKKLLIAHQKDYRSLYDRMKINL 372
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVA 372
+ V T D + ++ ++ L L +QFGRYLLISSSR G+ A
Sbjct: 373 GNVKEAPVMTT------DKLLKGMDERTNLQADNLYLEMLYYQFGRYLLISSSREGSLPA 426
Query: 373 NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV 432
NLQG+W + L W+S H NIN++MNYW + P NLS C P+ +++ L G TAQ
Sbjct: 427 NLQGVWADRLQNAWNSDYHTNINVQMNYWPAQPTNLSPCHLPMVEYVKSLVPRGRYTAQH 486
Query: 433 NYL------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
Y GWV HH+ +IW ++ + K +P G W+C +WE+Y + DR F
Sbjct: 487 YYCRPDGKPVRGWVTHHENNIWGNTAPAK-KDTPHHFPAGAIWMCQDIWEYYQFNQDRKF 545
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
LE+ +L+ ++ + + DG L NPS SPEH + L C + A+
Sbjct: 546 LEEYYDTMLQAALFWVDNLWTDKRDGMLVANPSHSPEHG----EYSLGC-----STSQAM 596
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK---DPEVH 603
I E+F+ +I A++ L + D ++++ SL +L KI G MEW + + +
Sbjct: 597 IWEIFNIMIKASKELGRENDPEIKEISASLAKLSGPKIGLGGQFMEWKDEVTKDINGDGG 656
Query: 604 HRHLSHLFGLFPGHTITI---EKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
HRH +HLF L PG I E + +A + TL RG+ G GWS WK WARLHD
Sbjct: 657 HRHTNHLFWLHPGSAIVAGRSEWDNKYAEAMKVTLNTRGDAGTGWSKAWKLNFWARLHDG 716
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
++++++ L P +F GG+Y+NLF AHPPFQID NFG TA VAEML+QS
Sbjct: 717 NRSHKLLESALKLTKP--GANF-GGVYTNLFDAHPPFQIDGNFGVTAGVAEMLMQSHGGY 773
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNND----HDSFKT 776
+ LLP+LP D W G KG+KARG V W +G + V I ++YS + K
Sbjct: 774 IELLPSLP-DVWKEGSFKGMKARGNFEVDAEWSNGKITSV-IITSYSGKECIVKCPDAKN 831
Query: 777 LHYRGTSVKV 786
L GTS KV
Sbjct: 832 LKVSGTSAKV 841
>gi|332880351|ref|ZP_08448029.1| hypothetical protein HMPREF9074_03804 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357047449|ref|ZP_09109054.1| hypothetical protein HMPREF9441_03090 [Paraprevotella clara YIT
11840]
gi|332681796|gb|EGJ54715.1| hypothetical protein HMPREF9074_03804 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355529520|gb|EHG98947.1| hypothetical protein HMPREF9441_03090 [Paraprevotella clara YIT
11840]
Length = 746
Score = 399 bits (1026), Expect = e-108, Method: Compositional matrix adjust.
Identities = 267/768 (34%), Positives = 379/768 (49%), Gaps = 112/768 (14%)
Query: 12 PLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
P+K+ ++ PAK + T A+P+GNG +GAM +GGV E L+ N+ TLW G
Sbjct: 25 PMKLWYDEPAKVWMTSALPVGNGGIGAMFFGGVAKEQLQFNDKTLWAG------------ 72
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
S R YQ +GD+ EFD YRREL L
Sbjct: 73 --STTRR----------------------GAYQNMGDLFFEFDTPE---TCTNYRRELSL 105
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG-SESGSLSFNVSLDSLLDNHSYVNG 189
+ A RV Y++ V++ RE+F+SNPD VIV +++ G L+F++ + + V+G
Sbjct: 106 DDAIGRVSYTIDGVDYLREYFASNPDSVIVVRLTTPGHKGKLNFSLRMQDGRQGMTRVDG 165
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ I D + A+L+ D G + D+ L+V+G+D
Sbjct: 166 HTMTI--------------KGTLDLLSYEAQALLQA----DGGMVETKSDR-LEVKGADA 206
Query: 250 AVLLLVASSSFD--GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
++L +++FD P D+ + S ++ R SY L HL DYQ LF R
Sbjct: 207 VTVVLTGATNFDLASPTYTRGDAYEIHRRVSARMDKATRK-SYKKLKAAHLADYQPLFAR 265
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
V + L D TD E+ D + L L FQ+GRYL++ SSR
Sbjct: 266 VELDLDAEQPDYTTDVLVREHKD---------------NAYLDMLYFQYGRYLMLGSSRG 310
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI--- 424
G +NLQG+WN +P W+ H NIN++MNYW + NLSEC P F+TY+S
Sbjct: 311 GQLPSNLQGLWNNVNNPAWECDYHSNINVQMNYWPAEVTNLSECYAP---FITYVSTEAL 367
Query: 425 -NGSKTAQVNYL--ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+G QV GW +H + +I+ G W + AW CTHLW+HY YT
Sbjct: 368 KDGGAWQQVARKENCRGWAVHTQNNIF-------GYTDWLINRPANAWYCTHLWQHYAYT 420
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYS 539
+D+++L A+P+++ + D L E +G L SPEH P DG V+Y+
Sbjct: 421 LDKEYLRDTAWPVMKVTCQYWFDRLKENAEGRLVAPNEWSPEH---GPWEDG----VAYA 473
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEWAQDFK 598
+ A+ E ++AA+VL +DA V ++ + RL I G I EW
Sbjct: 474 QQLVYALFEET----LAAADVLAV-DDAFVSELKEKFSRLDNGLHIGSWGQIKEWTIQED 528
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
H RHLSHL L+P I+ K+ +AA+ L RG+ GWS WK A WARL
Sbjct: 529 KQGDHQRHLSHLMALYPCDQISYLKDKRYAEAAKVALDSRGDGATGWSRAWKVACWARLW 588
Query: 659 DQEHAYRMVKRLFNLVDPE--HEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
D E AYR++K+ N+ D GG+Y NLF AHP FQID NFG TA +AEM++Q+
Sbjct: 589 DGERAYRLLKQAQNITDVTVVSMDDNAGGVYENLFCAHPSFQIDGNFGATAGIAEMMLQN 648
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
T+ ++LLPALP W G KGLKA+GG T + WKDG + E +YS
Sbjct: 649 TVKGVHLLPALP-SAWDDGHFKGLKAKGGFTFDVTWKDGKMVEGRVYS 695
>gi|318059330|ref|ZP_07978053.1| alpha-L-fucosidase [Streptomyces sp. SA3_actG]
Length = 783
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 264/797 (33%), Positives = 398/797 (49%), Gaps = 75/797 (9%)
Query: 15 ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
+ + PA + T+++P+GNG LGA V+G +P+E ++ E TLWTG PG ++ NP
Sbjct: 46 LRYGAPATDWETESLPVGNGALGASVFGTLPTEHIQFAEKTLWTGGPGTPGYRYGNWENP 105
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
P AL+ VR+ +++ A+ +L G P Y Q GD+ ++ D + + E
Sbjct: 106 R-PDALASVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGDLLIDVDGA--PGSAEG 161
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R LDL A A V Y F R F+S PD+V+V + GS+ N+ S +
Sbjct: 162 YTRTLDLAQALATVSYPHDGTTFHRTVFASCPDKVLVGHFTADRGGSVGLNLRYTSPRQD 221
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +++ + G G++F A +I++ + GT++A D+ L
Sbjct: 222 FTATTNGDRLTVRGAL-------------QDNGMRFEA--QIRLLSEGGTVTANGDR-LT 265
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+D A +L A + + + P DP +A+ Y +L RH D+
Sbjct: 266 VSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVTTAVDQAAARPYRELLDRHTSDHAA 323
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF RV + L + D+ + D + A + +D +L L FQ+GRYLLI+
Sbjct: 324 LFSRVVLDLGQ-------DSAPDRTTDALLKA--YTGGNSADDRALEALFFQYGRYLLIA 374
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR G+ ANLQG WN +P W + HVNINL+MNYW + NL+E P F+ L
Sbjct: 375 SSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFVEALR 434
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G TA+ + A GWV+H +T + + D W +P AWL + L+EHY +
Sbjct: 435 APGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHYRFDG 492
Query: 483 DRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSS 540
D+L AYP ++ A F +D L + D L PS SPEH +F A +
Sbjct: 493 STDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA----------GA 542
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEWAQDFKD 599
M I+RE+F + AA+ L ++ A + ++L R+ P +I G +MEW D
Sbjct: 543 AMSQQIVRELFLNTLEAAQTL-GDDPAFRATLKETLDRIDPGLRIGSWGQLMEWKTDLDG 601
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
HRH+SHL+ L PG IE D +AA+ +L RG+ G GWS WK WARL D
Sbjct: 602 RTDDHRHVSHLYALHPGR--QIEPGSDFAEAAKVSLTARGDGGTGWSKAWKINFWARLRD 659
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
+HA+ M+ + +G +NL+ HPPFQID NFG T+ + EML+QS +
Sbjct: 660 GDHAHTMLA-----------EQLKGSTLANLWDTHPPFQIDGNFGATSGITEMLLQSQHD 708
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHY 779
+ +LPALP WSSG V+GL+ARGG T+ W++G + + + S + +
Sbjct: 709 VIEVLPALP-AAWSSGTVRGLRARGGATLEFSWENGRATRIALTA--SRTRELTVRNALV 765
Query: 780 RGTSVKVNLSAGKIYTF 796
G + AG+ YT+
Sbjct: 766 PGGTTTFKAVAGETYTW 782
>gi|380472541|emb|CCF46724.1| alpha-L-fucosidase [Colletotrichum higginsianum]
Length = 780
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 261/780 (33%), Positives = 386/780 (49%), Gaps = 76/780 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + +A+PIGNGRLGAMV+G +E ++LNED++W G P D T DA + L +R
Sbjct: 23 YQSPASEWAEALPIGNGRLGAMVYGRTGTELVQLNEDSVWYGGPQDRTPKDALRHLPKLR 82
Query: 77 SLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
L+ ++AEA + F PA + Y+ LG +E H YRR L L+TA
Sbjct: 83 QLIRDEKHAEAESLVREAFFATPASMRHYEPLGTCTIEL--GHAVEDVTGYRRHLCLDTA 140
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
V+Y V + R+ +S P+ V+ +++ SE ++ S ++ + ++
Sbjct: 141 QTTVEYLSRGVSYRRDAIASFPNNVLAFRVTASEPTRFVVRLNRVSEIEWETNEFLDSIE 200
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+GR P N+N + S +L + D +G++ A+ + L+
Sbjct: 201 ADDGRIVLNATPGGRNSN------RLSIVLGVSCHDAQGSVEAIGNS-----------LV 243
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL- 312
+ +SS + P + + ++ +L + DL H DYQ LF R ++++
Sbjct: 244 VKSSSCTIAIGAQTTYRTLHPETVATEDVRKALDLPWDDLIRHHRSDYQTLFGRTALRMW 303
Query: 313 ---SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
S +P D+ + D LV L +GRYLLISSSR
Sbjct: 304 PDASHNPTDM--------------------RIEKGRDAGLVALYHNYGRYLLISSSRHAE 343
Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
+ A LQGIWN +P W S +NINL+MNYW + PCNL EC P+ D L ++ G
Sbjct: 344 KALPATLQGIWNPSFAPPWGSKYTININLQMNYWPAGPCNLVECAIPVLDLLERMAERGR 403
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
KTAQ Y GW HH TDIWA + + +WP+GG WLC ++E Y D D L
Sbjct: 404 KTAQAMYGCRGWCAHHNTDIWADTDPQDRWMPSTIWPLGGVWLCIDVFEMLQYHHD-DGL 462
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+RA +LEGC FLLD+LI G YL TNPS SPE+ FI+ GK + S +D I
Sbjct: 463 HRRAAAVLEGCILFLLDFLIPSSCGKYLVTNPSLSPENTFISNSGKAGILCEGSAIDTTI 522
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA-QDFKDPEVHHR 605
IR F + + +L NE L KV ++L +L G I EW +++++ E HR
Sbjct: 523 IRIAFEKFLWSNSMLGTNE-PLCSKVREALGKLPELMTNAHGLIQEWGLKNYEELEPGHR 581
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEH 662
H+SHLFGL+PG +I+ + PDL AA++ L++R G GWS W L ARL D +
Sbjct: 582 HVSHLFGLYPGESISPRRTPDLAAAAKRVLERRAAHGGGHTGWSRAWLLNLHARLLDADG 641
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST----- 717
+ + L +N+ HPPFQID NFG A + E LVQS+
Sbjct: 642 CGQHMDMLLG-----------SSTLANMLDNHPPFQIDGNFGGCAGILECLVQSSVLPSA 690
Query: 718 ----LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+ ++ LLP+ P WS G + +GG VS W+DG + E + + + D ++
Sbjct: 691 SKPAVVEIRLLPSCPL-SWSEGELTRGCTKGGWLVSFIWRDGSIVEPVLVESPATKDAEA 749
>gi|443630249|ref|ZP_21114539.1| putative Fibronectin type III domain-containing protein
[Streptomyces viridochromogenes Tue57]
gi|443336258|gb|ELS50610.1| putative Fibronectin type III domain-containing protein
[Streptomyces viridochromogenes Tue57]
Length = 744
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 262/791 (33%), Positives = 399/791 (50%), Gaps = 80/791 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-----DYTN-----PDAPKALSD- 74
+A+PIGNG LGAMV+G + SE L+ NE TLWTG PG D+ N PDA A+ D
Sbjct: 14 EALPIGNGALGAMVFGTLASERLQFNEKTLWTGGPGSAQGYDHGNWRTPRPDAITAVQDD 73
Query: 75 --VRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R+ +D + A+ +G +Q GD+ L+ + + YRRELDL+
Sbjct: 74 LDARTTLDPEEVADRLGQPRIGYG----AHQTFGDLHLDIPGAPTTPPAD-YRRELDLDK 128
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A V Y+ V R+ +S PD VI ++ GS++F + S + + +
Sbjct: 129 AVASVGYTYQGVRHQRDFLASYPDGVIAGRLHADRPGSVTFTLRYTSPRADFTATAADGT 188
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
+ + G A A++ G++F A ++++ GT+++ + + V G+D A
Sbjct: 189 LTVRG----------ALADN---GLRFEA--QVRVRSRGGTVTSDANGTITVTGADSAWF 233
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+L A + + + P DP + A++ + Y L RH+ D++ LF RV++ +
Sbjct: 234 VLAAGTDYADTY--PDYRGPDPHAAVGRAVRQAGD-RYEALLARHVRDHRALFRRVALDI 290
Query: 313 SRS-PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
+S P D+ TD +A+R L F++GRYLLI+SSRPG+
Sbjct: 291 GQSLPADVPTDRLLAAYAGGAGAADRALE----------ALYFEYGRYLLIASSRPGSLP 340
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQG+WN +P W + H NIN++MNYW + NL+E P F+ L G +TAQ
Sbjct: 341 ANLQGVWNNSTTPPWSADYHTNINIQMNYWPAEAANLAETTPPYDRFVEALRAPGRRTAQ 400
Query: 432 VNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+ + GWV+H++T+ + + D W +P AWL L+EHY + D+L
Sbjct: 401 EMFGSRGWVVHNETNPYGFTGVHDWATAFW--FPEAAAWLTQQLYEHYRFAGSTDYLRTT 458
Query: 491 AYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAIIR 548
AYP ++ F LD L + DG L PS SPEH +F A + M I+
Sbjct: 459 AYPAMKEATEFWLDNLRTDPRDGTLVVTPSYSPEHGDFTA----------GAAMSQQIVH 508
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEWAQDFKDPEVHHRHL 607
++F++ + AA +L D +V +L RL P +I G + EW D DP HRH+
Sbjct: 509 DLFTSTLEAARILGDAPD-FRRRVEAALNRLDPGLRIGSWGQLQEWKADLDDPTDTHRHV 567
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV 667
SHLF L PG IE +AA+ +L RG+ G GWS WK WARL D +HA++M+
Sbjct: 568 SHLFALHPGR--QIEPGSKWAEAAKVSLTARGDGGTGWSKAWKINFWARLRDGDHAHKML 625
Query: 668 KRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
+ + NL+ HPPFQID NFG T+ + EML+QS + + +LPAL
Sbjct: 626 G-----------EQLKYSTLPNLWDTHPPFQIDGNFGATSGIVEMLLQSQHDVIEVLPAL 674
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVN 787
P W +G V+GL+ARGG T+ I W DG + + + S + ++ + +
Sbjct: 675 P-AAWPTGSVRGLRARGGATLDIEWADGRATRIALKA--SRTRELTVRSDLFEEGELTFK 731
Query: 788 LSAGKIYTFNR 798
AG+ YT+ +
Sbjct: 732 AVAGRRYTWQK 742
>gi|330933451|ref|XP_003304180.1| hypothetical protein PTT_16648 [Pyrenophora teres f. teres 0-1]
gi|311319408|gb|EFQ87743.1| hypothetical protein PTT_16648 [Pyrenophora teres f. teres 0-1]
Length = 792
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 273/821 (33%), Positives = 411/821 (50%), Gaps = 73/821 (8%)
Query: 4 AESTSTTNPLKITFNGPAKH--FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
A S N ++ + PA+ +TDA+PIGNGRLGAM +G E + LNE+T+W+G
Sbjct: 14 ASLASAGNNTRLWYTTPAQSSAWTDALPIGNGRLGAMAFGIPVQERIALNEETIWSGGQQ 73
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLK 118
D ++P+ +S+VR L+ G +A A++ + G P YQ LGD+++ FD +
Sbjct: 74 DRIGQNSPQTVSEVRDLLAQGHAGDAEKLANMGMMGTPQSCRNYQPLGDMDIFFDGT-TG 132
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y TY+R LD++TA A V++ V + RE F S PD V+V + + SG LSF + +
Sbjct: 133 YDNATYKRWLDVDTALAGVQFQVNGTLYEREMFVSAPDDVLVHHLKATGSGKLSFQIRV- 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ GN E G DP + F+ L ++ SD G + L
Sbjct: 192 ----HRPEKGGNEASDHEWNADGLAYMTGGAGGIDP--VVFTTALAVQ-SD--GHVKNL- 241
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ +E + A + AS+S+ D + S +Q R +Y +L RH+
Sbjct: 242 GPFIVIENATEATAIFAASTSY---------RHNDTRAAVESTIQQARQHTYEELRQRHI 292
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFG 357
DY L++ + LS S DI ++P+ R+ + + DP+L L + +G
Sbjct: 293 ADYAPLYNASVLDLSGS--DI--------EASSLPTDARINATREGASDPALAALSYNYG 342
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSR G +NLQGIWN++ +P W S VNINL+MNYW + +LS EPLFD
Sbjct: 343 RYLLIASSRAGNLPSNLQGIWNKEFAPQWGSKYTVNINLQMNYWPAEVTSLSSLHEPLFD 402
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
L + +G+KTA+ Y ASGWV HH TD+W ++ + W + WL TH+ EH
Sbjct: 403 LLDLMRKDGTKTARQMYNASGWVTHHNTDLWGDTAPVDRWLPATYWTLSSGWLVTHILEH 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKL 533
Y YT D+ FL + + E A F LD L I G YL TNPS SPE+ ++ D
Sbjct: 463 YWYTGDKKFLASKLDVVSEAIA-FYLDILQPYSINGTQ-YLVTNPSVSPENSYLDADNNT 520
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GS 589
+ T D+ I+ E+F+ ++A L + + + + + +L P + ++ G+
Sbjct: 521 YHFDIAPTCDIEILNELFTNYLNAVATLPNSTVDSTFLTHIRDTQAKLPPYRYSKRYPGT 580
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNP----DLCKAAEKTLQKR---GEEG 642
+ EW QD++ E+ HRH+SHL+ L+PG I P L AA TL+ R G
Sbjct: 581 LQEWMQDYEQAELGHRHVSHLYALYPGTQILPPGAPGYDAKLFNAAAGTLEGRLSHNGAG 640
Query: 643 PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP-FQIDA 701
GWS W +ARL + V + FN +Y NL + FQID
Sbjct: 641 TGWSRAWTINWYARLQNSTAVAENVYQFFNT-----------SVYDNLMDVNEGVFQIDG 689
Query: 702 NFGFTAAVAEMLVQS------TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
N GF + VAE L+QS + +++LLP LP +W++G V GL ARGG I W DG
Sbjct: 690 NLGFVSGVAEALIQSHIVVEEGVREVWLLPVLP-KQWNTGSVNGLAARGGFVFDITWADG 748
Query: 756 DLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTF 796
+ ++ + S +K T+ ++ AG++ F
Sbjct: 749 AITKMKMESRVGGTVVLRYKGGSGNSTTTRLETKAGEVKEF 789
>gi|318078709|ref|ZP_07986041.1| alpha-L-fucosidase [Streptomyces sp. SA3_actF]
Length = 769
Score = 399 bits (1024), Expect = e-108, Method: Compositional matrix adjust.
Identities = 264/797 (33%), Positives = 398/797 (49%), Gaps = 75/797 (9%)
Query: 15 ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
+ + PA + T+++P+GNG LGA V+G +P+E ++ E TLWTG PG ++ NP
Sbjct: 32 LRYGAPATDWETESLPVGNGALGASVFGTLPTEHIQFAEKTLWTGGPGTPGYRYGNWENP 91
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
P AL+ VR+ +++ A+ +L G P Y Q GD+ ++ D + + E
Sbjct: 92 R-PDALASVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGDLLIDVDGA--PGSAEG 147
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R LDL A A V Y F R F+S PD+V+V + GS+ N+ S +
Sbjct: 148 YTRTLDLAQALATVSYPHDGTTFHRTVFASCPDKVLVGHFTADRGGSVGLNLRYTSPRQD 207
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +++ + G G++F A +I++ + GT++A D+ L
Sbjct: 208 FTATTNGDRLTVRGAL-------------QDNGMRFEA--QIRLLSEGGTVTANGDR-LT 251
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+D A +L A + + + P DP +A+ Y +L RH D+
Sbjct: 252 VSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVTTAVDQAAARPYRELLDRHTSDHAA 309
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
LF RV + L + D+ + D + A + +D +L L FQ+GRYLLI+
Sbjct: 310 LFSRVVLDLGQ-------DSAPDRTTDALLKA--YTGGNSADDRALEALFFQYGRYLLIA 360
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR G+ ANLQG WN +P W + HVNINL+MNYW + NL+E P F+ L
Sbjct: 361 SSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFVEALR 420
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G TA+ + A GWV+H +T + + D W +P AWL + L+EHY +
Sbjct: 421 APGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHYRFDG 478
Query: 483 DRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSS 540
D+L AYP ++ A F +D L + D L PS SPEH +F A +
Sbjct: 479 STDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA----------GA 528
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEWAQDFKD 599
M I+RE+F + AA+ L ++ A + ++L R+ P +I G +MEW D
Sbjct: 529 AMSQQIVRELFLNTLEAAQTL-GDDPAFRATLKETLDRIDPGLRIGSWGQLMEWKTDLDG 587
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
HRH+SHL+ L PG IE D +AA+ +L RG+ G GWS WK WARL D
Sbjct: 588 RTDDHRHVSHLYALHPGR--QIEPGSDFAEAAKVSLTARGDGGTGWSKAWKINFWARLRD 645
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
+HA+ M+ + +G +NL+ HPPFQID NFG T+ + EML+QS +
Sbjct: 646 GDHAHTMLA-----------EQLKGSTLANLWDTHPPFQIDGNFGATSGITEMLLQSQHD 694
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHY 779
+ +LPALP WSSG V+GL+ARGG T+ W++G + + + S + +
Sbjct: 695 VIEVLPALP-AAWSSGTVRGLRARGGATLEFSWENGRATRIALTA--SRTRELTVRNALV 751
Query: 780 RGTSVKVNLSAGKIYTF 796
G + AG+ YT+
Sbjct: 752 PGGTTTFKAVAGETYTW 768
>gi|291457532|ref|ZP_06596922.1| putative alpha-L-fucosidase 2 [Bifidobacterium breve DSM 20213 =
JCM 1192]
gi|291380585|gb|EFE88103.1| putative alpha-L-fucosidase 2 [Bifidobacterium breve DSM 20213 =
JCM 1192]
Length = 783
Score = 398 bits (1023), Expect = e-108, Method: Compositional matrix adjust.
Identities = 261/771 (33%), Positives = 395/771 (51%), Gaps = 58/771 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+TF+G + + ++IP+GNGR+GA++ ++ L LN+DTLW+G P T+P P+ +
Sbjct: 1 MKLTFDGISSCWEESIPLGNGRMGAVLCSEPETDVLYLNDDTLWSGYPHAETSPVTPEIV 60
Query: 73 SDVR--SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ R SL D A L +Y+ G +++ S E+ +R+LDL
Sbjct: 61 AKARQASLQDDYNAATRIIKDATLQEKDEQIYEPFGTARIQYSTS--ADGRESMKRQLDL 118
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS----------LDSL 180
A A + +G+ + + S PD ++V ++S ++ +VS ++++
Sbjct: 119 ARALAGETFRMGDANVHVDAWCSEPDDLLVYRMSSDAPIDVNISVSGTFLKQSRASMETV 178
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIP----PKANANDDPK---GIQFSAILEIKISDDRGT 233
D H +++ GR PG I P N +D + G+ ++ + ++ G
Sbjct: 179 SDGHRAT-----LVVMGRMPGLNIGLLPHPSENPWEDEQDGTGMAYAGAFSLTVT---GG 230
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ D L+ L + S F G P S + L+ + +DL
Sbjct: 231 DVNVGDNSLQCSNITGLSLRFRSMSGFRGSDQQPERS----MTVIADHLEKTIDEWSTDL 286
Query: 294 YT---RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED---P 347
T R + DY++ F RV+I L + D DT +P + ++S + E
Sbjct: 287 RTMLDRRIADYRRYFDRVAIHLGSAHDD---DT-------ELPFSAILRSDEKKEPHRLE 336
Query: 348 SLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L E +F FGRYLLISSSRP TQ ANLQGIWN P W SA NIN+EMNYW + PC
Sbjct: 337 MLAEAMFDFGRYLLISSSRPHTQPANLQGIWNHKDFPNWYSAYTTNINVEMNYWMTGPCA 396
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
L E EPL L + G A G + H D+W ++ G+ +W+ WP G
Sbjct: 397 LQELIEPLVSMNEELLVPGHDAADRILGCRGSAVFHNVDLWRRALPANGEPMWSFWPFGQ 456
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
AW+C +L++ Y + D +L R +P++ A F +D+L E G L +P+TSPE+ F+
Sbjct: 457 AWMCRNLFDEYLFNQDASYL-ARIWPIMRDNARFCMDFLSETKHG-LAPSPATSPENCFL 514
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAA---EVLEKNEDALVEKVLKSLPRLRPTKI 584
+G+ V+ SS AI+R + +I A+ E L++ + LV + +L T++
Sbjct: 515 V-NGEPVSVAQSSENATAIVRNLLDDLIQASHDLEDLDEEDRDLVHEAESVRSQLAETRL 573
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
DG I+EW +F + + HRHLSHL+ L PG IT + P L +AA K+L+ RG++G G
Sbjct: 574 GADGRILEWNDEFIESDPQHRHLSHLYELHPGAGIT-SQTPHLEEAARKSLEVRGDDGSG 632
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH-FEGGLYSNLFAAHPPFQIDANF 703
WSI W+ +WARL D EHA R++ VD E + GG+Y + AHPPFQID N
Sbjct: 633 WSIVWRMIMWARLRDAEHAKRIIGMFLRPVDANAETNLLGGGVYGSGLCAHPPFQIDGNL 692
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
GF AA++EMLVQS + +LPALP D W G L+ARGG V W D
Sbjct: 693 GFPAALSEMLVQSHDGWIRILPALPED-WHEGTFHALRARGGIQVDATWTD 742
>gi|402847334|ref|ZP_10895629.1| hypothetical protein HMPREF1323_1685 [Porphyromonas sp. oral taxon
279 str. F0450]
gi|402266647|gb|EJU16068.1| hypothetical protein HMPREF1323_1685 [Porphyromonas sp. oral taxon
279 str. F0450]
Length = 838
Score = 397 bits (1021), Expect = e-107, Method: Compositional matrix adjust.
Identities = 258/798 (32%), Positives = 399/798 (50%), Gaps = 60/798 (7%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKA 71
L F+ PA +A+P+GNGRLG + GGV + + LNE ++W+G V N +A K
Sbjct: 46 LTYFFDRPATSMMEALPLGNGRLGMLSDGGVQHQRITLNESSMWSGSVDSTAWNAEAYKQ 105
Query: 72 LSDVRSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLK 118
L +R L+ +G+ EA + F P YQ+ G + L +D +
Sbjct: 106 LPAIRKLLLAGRAKEAEDLIYRTFVCGGVGSGRGQGANTPYGSYQVGGFLHLNWDKAP-- 163
Query: 119 YAEETYRRELDLNTATARVKYSV-GNVEFTREHFSSNPDQVIVTKIS--GSESGSLSFNV 175
Y R L L+ +R + V G T+ +S +V V ++ E+ + +
Sbjct: 164 -ELSGYYRGLSLSEGVSRESFVVDGQAYRTKRLYSVLGREVQVVHLTNHSEEARRDTLRL 222
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
SL + H + + G+ P + +G+ + AI+ + GT+
Sbjct: 223 SLSRPENGHPAAEAGF-LTLSGQLPDGK---------GGRGMSY-AIVVRPVLPQGGTLI 271
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
D+ L V V L +A ++ N D + + S+ + + ++L+
Sbjct: 272 TRGDELLIVNAP--TVELYIAHNT------NYYDKRLPVMARSIEQTLQAKAVGEANLFA 323
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF--QTDEDPSLVELL 353
H+ + RV + S+ + ++P R+ ++ + DP+L L
Sbjct: 324 EHVQRFTAQMDRVQARF----------LGSDPALSSLPIQRRLIAYYEHPERDPALAALY 373
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
Q GRYLLISS+RPG NLQGIW E + W+ H+NINL+MNYW + L E
Sbjct: 374 MQLGRYLLISSTRPGALPPNLQGIWTETIQAPWNGDYHLNINLQMNYWPAEKGALPETVG 433
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L D++ + +G +TA+ Y A GWV H ++W + +A W AWLC H
Sbjct: 434 ALTDWVESIVPSGERTARTFYRAKGWVTHVLGNVW-QFTAPGEHPSWGATNTSAAWLCEH 492
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGK 532
L+ HY Y+ DR +LE R YP+++G A F L L++ GYL P+TSPE+ + P GK
Sbjct: 493 LYNHYRYSQDRAYLE-RIYPVMQGAARFFLTTLVKDPKSGYLVNVPTTSPENSYYTPQGK 551
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
V+ STMD I+RE+FS AA L ++ V+ + +L +L+PT + DG IME
Sbjct: 552 AVAVAAGSTMDNQILRELFSTTREAAMTLGRDR-TFVDSLSTALRQLKPTTLGPDGRIME 610
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTA 652
W +D+K+ E HHRH+SHL+GLFPG IT P+L + A+KTL RG WS+ WK
Sbjct: 611 WMEDYKEVEPHHRHVSHLYGLFPGSEITPHGTPELAEGAKKTLIARGSSSTSWSMGWKVN 670
Query: 653 LWARLHDQEHAYR---MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
ARL D E AY M+ R + +DP+ K + G NLF++HPPFQID NFG ++ +
Sbjct: 671 FHARLGDAEGAYEVLNMLLRPVDAIDPKTNKPYGSGTEPNLFSSHPPFQIDGNFGGSSGI 730
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNN 769
EML+ S + LPALP W +G ++GL+ G T S+ W G+L + + ++++
Sbjct: 731 MEMLLSSETGCIIPLPALP-KAWKAGSIQGLRVIGNATCSLSWSAGELDRLVLEAHHAYR 789
Query: 770 DHDSFKTLHYRGTSVKVN 787
H RG ++++N
Sbjct: 790 -HTLLLPGEGRGYALRLN 806
>gi|429740665|ref|ZP_19274345.1| hypothetical protein HMPREF9134_00217 [Porphyromonas catoniae
F0037]
gi|429160458|gb|EKY02921.1| hypothetical protein HMPREF9134_00217 [Porphyromonas catoniae
F0037]
Length = 837
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 254/758 (33%), Positives = 390/758 (51%), Gaps = 58/758 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDV 75
F+ PA+ + +P+GNGRLG + G + + + LNE ++W+G + N DA K L +
Sbjct: 48 FDRPAESMMEELPLGNGRLGMLSDGALRHQRVTLNESSMWSGSIDSLALNRDAAKHLPKI 107
Query: 76 RSLVDSGQYAEATAASVKLF-------------GHPADVYQLLGDIELEFDDSHLKYAEE 122
R L+ +G++ +A K F P Y++ G + L++
Sbjct: 108 RELLFAGRHKDAEELIYKTFVCGGKGSGQGAGAKVPYGSYEVGGFLHLDWGRD---IPSP 164
Query: 123 TYRRELDLNTATARVKYSV-GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-SL 180
+Y+R LDL + G + +++S V V I + + + L S
Sbjct: 165 SYKRSLDLTYGISTETIETWGQPYRMKTYYTSYTHDVNVITIYNQAISARTDTLRLSLSR 224
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
+N + + + + G P + +G+ ++ + + + G + + ++
Sbjct: 225 PENGTSTVSDGLLTLSGDLPNGK---------GGEGLHYAIVAKPYLLHG-GKVISRGNE 274
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
L V S + +L+A ++ + NP S P + + + ++ + L H
Sbjct: 275 LLIVNAS--VIQILIAHNTN---YYNPQLS---PIAHGVEQIVKAAGITSAILERDHRAA 326
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGR 358
+ RVS+++ + EN+ P +R++++ D DP+L L QFGR
Sbjct: 327 FSSQMGRVSMRIGKG-------NAKAENL---PIDKRLEAYHKDPQSDPNLASLYMQFGR 376
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
YLL+SS+R G NLQGIW + W+S H+NINL+MNYW S NLSE PL +
Sbjct: 377 YLLLSSTRKGALPPNLQGIWTNLIQAPWNSDYHLNINLQMNYWPSEKGNLSETVLPLTSW 436
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L +G +TA+ Y GWV H ++W ++ W G AWLC HL+ HY
Sbjct: 437 VEGLLPSGRETARAFYGGKGWVTHILGNVWGFTAPGE-HPSWGATNTGAAWLCQHLFNHY 495
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
YT DR++L +R YP+L+G + F L L+ + ++GYL T P+TSPE+ ++APD + VS
Sbjct: 496 LYTQDREYL-RRIYPILKGASQFFLSTLVRDPNNGYLVTAPTTSPENHYLAPDSSVVAVS 554
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
STMD IIRE+F+ ++A L E + ++++L L PT IA DG IMEW ++
Sbjct: 555 AGSTMDNQIIRELFTNTRTSALAL--GERVFADTLVRTLSELMPTTIAPDGRIMEWLSNY 612
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
K+ E HHRH+SHL+GLFPG+ IT E+ PDL AA K+L RG WS+ WK L ARL
Sbjct: 613 KETEPHHRHVSHLYGLFPGNEITREQTPDLIAAARKSLDARGASSTSWSMAWKVNLRARL 672
Query: 658 HDQEHAYRMVKRLFNLV---DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
D E AY ++ L V DP+ K + G +NLF++HPPFQID NFG A + EML+
Sbjct: 673 GDAEEAYNVLNMLLRPVAALDPQSHKPYGSGTNNNLFSSHPPFQIDGNFGGAAGIMEMLL 732
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
QS + LPALP W G + GLK G T S+ W
Sbjct: 733 QSETGSITPLPALP-KAWGEGAITGLKVIGNATCSLEW 769
>gi|421235008|ref|ZP_15691623.1| large secreted protein [Streptococcus pneumoniae 2061617]
gi|395599385|gb|EJG59558.1| large secreted protein [Streptococcus pneumoniae 2061617]
Length = 764
Score = 396 bits (1018), Expect = e-107, Method: Compositional matrix adjust.
Identities = 261/780 (33%), Positives = 391/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|410477499|ref|YP_006744258.1| hypothetical protein HMPREF1038_02170 [Streptococcus pneumoniae
gamPNI0373]
gi|421269340|ref|ZP_15720202.1| hypothetical protein SPAR95_2166 [Streptococcus pneumoniae SPAR95]
gi|444387345|ref|ZP_21185368.1| hypothetical protein PCS125219_00757 [Streptococcus pneumoniae
PCS125219]
gi|444391139|ref|ZP_21189052.1| hypothetical protein PCS70012_02198 [Streptococcus pneumoniae
PCS70012]
gi|444391645|ref|ZP_21189459.1| hypothetical protein PCS81218_00254 [Streptococcus pneumoniae
PCS81218]
gi|444395928|ref|ZP_21193466.1| hypothetical protein PNI0002_01943 [Streptococcus pneumoniae
PNI0002]
gi|444398446|ref|ZP_21195928.1| hypothetical protein PNI0006_02051 [Streptococcus pneumoniae
PNI0006]
gi|444399000|ref|ZP_21196473.1| hypothetical protein PNI0007_00277 [Streptococcus pneumoniae
PNI0007]
gi|444402193|ref|ZP_21199365.1| hypothetical protein PNI0008_00811 [Streptococcus pneumoniae
PNI0008]
gi|444404331|ref|ZP_21201289.1| hypothetical protein PNI0009_00393 [Streptococcus pneumoniae
PNI0009]
gi|444408063|ref|ZP_21204730.1| hypothetical protein PNI0010_01508 [Streptococcus pneumoniae
PNI0010]
gi|444415928|ref|ZP_21212144.1| hypothetical protein PNI0199_01872 [Streptococcus pneumoniae
PNI0199]
gi|444417791|ref|ZP_21213797.1| hypothetical protein PNI0360_01205 [Streptococcus pneumoniae
PNI0360]
gi|444419629|ref|ZP_21215476.1| hypothetical protein PNI0427_00501 [Streptococcus pneumoniae
PNI0427]
gi|395866259|gb|EJG77390.1| hypothetical protein SPAR95_2166 [Streptococcus pneumoniae SPAR95]
gi|406370444|gb|AFS44134.1| conserved hypothetical membrane protein [Streptococcus pneumoniae
gamPNI0373]
gi|444253440|gb|ELU59896.1| hypothetical protein PCS125219_00757 [Streptococcus pneumoniae
PCS125219]
gi|444255297|gb|ELU61653.1| hypothetical protein PCS70012_02198 [Streptococcus pneumoniae
PCS70012]
gi|444255745|gb|ELU62088.1| hypothetical protein PNI0002_01943 [Streptococcus pneumoniae
PNI0002]
gi|444259175|gb|ELU65491.1| hypothetical protein PNI0006_02051 [Streptococcus pneumoniae
PNI0006]
gi|444265102|gb|ELU71130.1| hypothetical protein PCS81218_00254 [Streptococcus pneumoniae
PCS81218]
gi|444266940|gb|ELU72867.1| hypothetical protein PNI0008_00811 [Streptococcus pneumoniae
PNI0008]
gi|444269354|gb|ELU75162.1| hypothetical protein PNI0007_00277 [Streptococcus pneumoniae
PNI0007]
gi|444271659|gb|ELU77410.1| hypothetical protein PNI0010_01508 [Streptococcus pneumoniae
PNI0010]
gi|444277109|gb|ELU82631.1| hypothetical protein PNI0009_00393 [Streptococcus pneumoniae
PNI0009]
gi|444278655|gb|ELU84090.1| hypothetical protein PNI0199_01872 [Streptococcus pneumoniae
PNI0199]
gi|444282561|gb|ELU87815.1| hypothetical protein PNI0360_01205 [Streptococcus pneumoniae
PNI0360]
gi|444286393|gb|ELU91377.1| hypothetical protein PNI0427_00501 [Streptococcus pneumoniae
PNI0427]
Length = 764
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 261/780 (33%), Positives = 391/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|387760237|ref|YP_006067215.1| hypothetical protein SPNINV200_19710 [Streptococcus pneumoniae
INV200]
gi|419515658|ref|ZP_14055280.1| putative alpha-L-fucosidase [Streptococcus pneumoniae England14-9]
gi|301802826|emb|CBW35604.1| conserved hypothetical protein [Streptococcus pneumoniae INV200]
gi|379633974|gb|EHZ98540.1| putative alpha-L-fucosidase [Streptococcus pneumoniae England14-9]
Length = 764
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 261/780 (33%), Positives = 391/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|149012024|ref|ZP_01833172.1| hypothetical protein CGSSp19BS75_03168 [Streptococcus pneumoniae
SP19-BS75]
gi|418077389|ref|ZP_12714618.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47502]
gi|147763979|gb|EDK70912.1| hypothetical protein CGSSp19BS75_03168 [Streptococcus pneumoniae
SP19-BS75]
gi|353745563|gb|EHD26232.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47502]
Length = 764
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 261/780 (33%), Positives = 391/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|225861978|ref|YP_002743487.1| large secreted protein [Streptococcus pneumoniae Taiwan19F-14]
gi|298229408|ref|ZP_06963089.1| large secreted protein [Streptococcus pneumoniae str. Canada
MDR_19F]
gi|298255588|ref|ZP_06979174.1| large secreted protein [Streptococcus pneumoniae str. Canada
MDR_19A]
gi|298501665|ref|YP_003723605.1| alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
gi|387789197|ref|YP_006254265.1| large secreted protein [Streptococcus pneumoniae ST556]
gi|417313623|ref|ZP_12100332.1| alpha-L-fucosidase [Streptococcus pneumoniae GA04375]
gi|418083982|ref|ZP_12721174.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44288]
gi|418086144|ref|ZP_12723319.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47281]
gi|418094961|ref|ZP_12732084.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49138]
gi|418119732|ref|ZP_12756683.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18523]
gi|418142694|ref|ZP_12779502.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13455]
gi|418151670|ref|ZP_12788412.1| alpha-L-fucosidase [Streptococcus pneumoniae GA14798]
gi|418153939|ref|ZP_12790673.1| alpha-L-fucosidase [Streptococcus pneumoniae GA16121]
gi|418199016|ref|ZP_12835468.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47778]
gi|418224372|ref|ZP_12851007.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5185-06]
gi|418228657|ref|ZP_12855270.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 3063-00]
gi|419430394|ref|ZP_13970551.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11856]
gi|419439146|ref|ZP_13979210.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13499]
gi|419502823|ref|ZP_14042501.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47628]
gi|419529128|ref|ZP_14068665.1| hypothetical protein SPAR51_2171 [Streptococcus pneumoniae GA17719]
gi|225728210|gb|ACO24061.1| large secreted protein [Streptococcus pneumoniae Taiwan19F-14]
gi|298237260|gb|ADI68391.1| possible alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
gi|327388899|gb|EGE87247.1| alpha-L-fucosidase [Streptococcus pneumoniae GA04375]
gi|353753506|gb|EHD34129.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44288]
gi|353754984|gb|EHD35594.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47281]
gi|353762498|gb|EHD43057.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49138]
gi|353788845|gb|EHD69241.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18523]
gi|353803816|gb|EHD84107.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13455]
gi|353811993|gb|EHD92229.1| alpha-L-fucosidase [Streptococcus pneumoniae GA14798]
gi|353815265|gb|EHD95485.1| alpha-L-fucosidase [Streptococcus pneumoniae GA16121]
gi|353859431|gb|EHE39382.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47778]
gi|353876904|gb|EHE56749.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5185-06]
gi|353878966|gb|EHE58794.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 3063-00]
gi|379138939|gb|AFC95730.1| large secreted protein [Streptococcus pneumoniae ST556]
gi|379535583|gb|EHZ00782.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13499]
gi|379548700|gb|EHZ13818.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11856]
gi|379562772|gb|EHZ27781.1| hypothetical protein SPAR51_2171 [Streptococcus pneumoniae GA17719]
gi|379598038|gb|EHZ62833.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47628]
Length = 764
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 260/780 (33%), Positives = 392/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFDPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + +++ G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTNYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|295085494|emb|CBK67017.1| Trehalose and maltose hydrolases (possible phosphorylases)
[Bacteroides xylanisolvens XB1A]
Length = 782
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 256/756 (33%), Positives = 383/756 (50%), Gaps = 96/756 (12%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDY---TNPDAPKALSDVR 76
+ ++PIGNG LGA + G V +E + NE TLW G P DY N + L ++R
Sbjct: 75 SQSLPIGNGSLGANIMGSVEAERITFNEKTLWRGGPNTAKGADYYWNVNKQSAHLLDEIR 134
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDD--SHLKYAEET---------YR 125
G +A + + F Y G+ F + ++ ET Y+
Sbjct: 135 KAFTEGNQEKAEMLTRQNFNSEVS-YDADGETPFRFGSFTTMGEFYVETGLNIIGMSDYK 193
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG--SLSFNVSLDSLLDN 183
R L L++A A V++ +V + R +F S P V+V + S + G +L F+ + + +
Sbjct: 194 RILSLDSAMAVVQFKKDHVVYQRNYFISYPANVMVMRFSADQPGKQNLVFSYAPNPVSTG 253
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ + N ++ +A+ D G+++ ++ I+ GT+S D KL
Sbjct: 254 NMASDSNKGLVY-------------SASLDNNGMKY--VVRIQAETKGGTLSN-ADGKLM 297
Query: 244 VEGSDWAVLLLVASSSFDGPF------------INPSDSKKDPTSESMSALQSIRNLSYS 291
V+G+D V + A + + F +NP ++ K+ + ++S Y+
Sbjct: 298 VKGADEVVFYITADTDYKPDFDPDFKDPKTYVGVNPEETTKEWMNNAVSQ-------GYT 350
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLV 350
L+++H +DY LF RV + L+ + K +P+ +R+K+++ + D L
Sbjct: 351 ALFSQHYNDYAALFDRVKLNLNPAIKG-----------RNLPTPQRLKNYRAGQPDYDLE 399
Query: 351 ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
EL FQFGRYLLISSSRPG ANLQGIW+ ++ W H NIN++MNYW + NL+E
Sbjct: 400 ELYFQFGRYLLISSSRPGNMPANLQGIWHNNVDGPWRVDYHNNINIQMNYWPACSTNLNE 459
Query: 411 CQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAW 469
C PL DF+ L G KTA+ + A GW +I+ ++ + + W PM G W
Sbjct: 460 CMLPLVDFIRTLVKPGEKTAKSYFGARGWTASISGNIFGFTTPLESQDMSWNFNPMAGPW 519
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
L TH+WE+Y+YT D FL++ Y L++ A F +D+L DG PSTSPEH
Sbjct: 520 LATHIWEYYDYTRDLTFLKETGYELIKSSADFAVDYLWHKPDGTYTAAPSTSPEH----- 574
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAED 587
+ +T A++RE+ I A++VL +K E E VL + L P KI
Sbjct: 575 ----GPIDQGATFVHAVVREILLDAIEASKVLGVDKKERKQWEHVLAN---LVPYKIGRY 627
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
G +MEW+ D DP+ HRH++HLFGL PGHT++ P+L KAA+ L RG+ GWS+
Sbjct: 628 GQLMEWSVDIDDPKDEHRHVNHLFGLHPGHTVSPVTTPELAKAAKVVLVHRGDGATGWSM 687
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
WK WARL D HAY + L + G NL+ H PFQID NFG TA
Sbjct: 688 GWKLNQWARLQDGNHAYTLFGNL-----------LKNGTLDNLWDTHSPFQIDGNFGGTA 736
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKAR 743
+ EML+QS + + LLPALP D W G V G+ A+
Sbjct: 737 GITEMLLQSHIGFIQLLPALP-DAWKGGAVSGICAK 771
>gi|307705834|ref|ZP_07642675.1| alpha-fucosidase [Streptococcus mitis SK597]
gi|307620620|gb|EFN99715.1| alpha-fucosidase [Streptococcus mitis SK597]
Length = 764
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 261/780 (33%), Positives = 391/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEVQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SSALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGDI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTATKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RVLTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AAE T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIYKTPELAEAAEITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKGL+ RGG VS W++GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGLRVRGGYKVSFAWENGDI 722
>gi|418101640|ref|ZP_12738719.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7286-06]
gi|353768739|gb|EHD49262.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7286-06]
Length = 764
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 261/779 (33%), Positives = 390/779 (50%), Gaps = 91/779 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFDPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ L L + +++ G PS SI + D H+ YQ+ F
Sbjct: 225 NATEVFLYLKSMTNYWGNIDIPS---------LQGEFSSIDYFTEKD---EHVKKYQEQF 272
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
+RV +L S KD ++ I T E K + L LLF +GRYLLISSS
Sbjct: 273 NRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSS 320
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 321 QPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREP 380
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 381 GRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDER 440
Query: 486 FLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 441 ILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQ 498
Query: 546 IIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH 603
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 499 ILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPG 555
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------- 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 556 HRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGLH 615
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
GWS W +ARL+ E AY + L N NLF HPPFQ
Sbjct: 616 ASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPFQ 664
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
ID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 665 IDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|15904007|ref|NP_359557.1| hypothetical protein spr1966 [Streptococcus pneumoniae R6]
gi|116517212|ref|YP_817374.1| hypothetical protein SPD_1988 [Streptococcus pneumoniae D39]
gi|148988800|ref|ZP_01820215.1| hypothetical protein CGSSp6BS73_06978 [Streptococcus pneumoniae
SP6-BS73]
gi|148991988|ref|ZP_01821762.1| hypothetical protein CGSSp9BS68_10895 [Streptococcus pneumoniae
SP9-BS68]
gi|149020072|ref|ZP_01835046.1| hypothetical protein CGSSp23BS72_08554 [Streptococcus pneumoniae
SP23-BS72]
gi|168494084|ref|ZP_02718227.1| large secreted protein [Streptococcus pneumoniae CDC3059-06]
gi|387627290|ref|YP_006063466.1| hypothetical protein INV104_18640 [Streptococcus pneumoniae INV104]
gi|417687620|ref|ZP_12336887.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41301]
gi|418075010|ref|ZP_12712256.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11184]
gi|418081811|ref|ZP_12719017.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6735-05]
gi|418090533|ref|ZP_12727683.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43265]
gi|418099496|ref|ZP_12736589.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6901-05]
gi|418103895|ref|ZP_12740963.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP070]
gi|418106297|ref|ZP_12743347.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44500]
gi|418115675|ref|ZP_12752658.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5787-06]
gi|418117845|ref|ZP_12754811.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6963-05]
gi|418135939|ref|ZP_12772788.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11426]
gi|418160899|ref|ZP_12797595.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17227]
gi|418174588|ref|ZP_12811195.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41277]
gi|418183706|ref|ZP_12820260.1| alpha-L-fucosidase [Streptococcus pneumoniae GA43380]
gi|418203403|ref|ZP_12839826.1| alpha-L-fucosidase [Streptococcus pneumoniae GA52306]
gi|418217614|ref|ZP_12844290.1| alpha-L-fucosidase [Streptococcus pneumoniae Netherlands15B-37]
gi|419432556|ref|ZP_13972681.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP05]
gi|419434785|ref|ZP_13974899.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40183]
gi|419456417|ref|ZP_13996371.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP04]
gi|419465661|ref|ZP_14005549.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA04175]
gi|419467835|ref|ZP_14007713.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA05248]
gi|419469963|ref|ZP_14009827.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA06083]
gi|419476555|ref|ZP_14016386.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA14688]
gi|419480975|ref|ZP_14020776.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19101]
gi|419487705|ref|ZP_14027464.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44128]
gi|419498536|ref|ZP_14038238.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47522]
gi|419500675|ref|ZP_14040366.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47597]
gi|419513550|ref|ZP_14053180.1| hypothetical protein SPAR149_2151 [Streptococcus pneumoniae
GA05578]
gi|419517761|ref|ZP_14057373.1| hypothetical protein SPAR154_2086 [Streptococcus pneumoniae
GA02506]
gi|419522113|ref|ZP_14061704.1| hypothetical protein SPAR7_2189 [Streptococcus pneumoniae GA05245]
gi|421207672|ref|ZP_15664716.1| large secreted protein [Streptococcus pneumoniae 2090008]
gi|421209866|ref|ZP_15666875.1| large secreted protein [Streptococcus pneumoniae 2070005]
gi|421221343|ref|ZP_15678174.1| large secreted protein [Streptococcus pneumoniae 2070425]
gi|421223600|ref|ZP_15680377.1| large secreted protein [Streptococcus pneumoniae 2070531]
gi|421226019|ref|ZP_15682753.1| large secreted protein [Streptococcus pneumoniae 2070768]
gi|421230717|ref|ZP_15687375.1| large secreted protein [Streptococcus pneumoniae 2061376]
gi|421241635|ref|ZP_15698176.1| large secreted protein [Streptococcus pneumoniae 2080913]
gi|421267146|ref|ZP_15718023.1| hypothetical protein SPAR27_2104 [Streptococcus pneumoniae SPAR27]
gi|421284302|ref|ZP_15735084.1| hypothetical protein SPAR151_2120 [Streptococcus pneumoniae
GA04216]
gi|421292975|ref|ZP_15743706.1| hypothetical protein SPAR159_2221 [Streptococcus pneumoniae
GA56348]
gi|444381684|ref|ZP_21179890.1| hypothetical protein PCS8106_00075 [Streptococcus pneumoniae
PCS8106]
gi|444384154|ref|ZP_21182250.1| hypothetical protein PCS8203_00020 [Streptococcus pneumoniae
PCS8203]
gi|15459667|gb|AAL00768.1| Conserved hypothetical protein [Streptococcus pneumoniae R6]
gi|116077788|gb|ABJ55508.1| conserved hypothetical protein [Streptococcus pneumoniae D39]
gi|147925611|gb|EDK76687.1| hypothetical protein CGSSp6BS73_06978 [Streptococcus pneumoniae
SP6-BS73]
gi|147929037|gb|EDK80048.1| hypothetical protein CGSSp9BS68_10895 [Streptococcus pneumoniae
SP9-BS68]
gi|147930750|gb|EDK81731.1| hypothetical protein CGSSp23BS72_08554 [Streptococcus pneumoniae
SP23-BS72]
gi|183575953|gb|EDT96481.1| large secreted protein [Streptococcus pneumoniae CDC3059-06]
gi|301795076|emb|CBW37545.1| conserved hypothetical protein [Streptococcus pneumoniae INV104]
gi|332071430|gb|EGI81924.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41301]
gi|353745184|gb|EHD25855.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11184]
gi|353750133|gb|EHD30775.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6735-05]
gi|353759533|gb|EHD40117.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43265]
gi|353767716|gb|EHD48248.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6901-05]
gi|353773458|gb|EHD53955.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP070]
gi|353774259|gb|EHD54752.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44500]
gi|353783638|gb|EHD64065.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5787-06]
gi|353787046|gb|EHD67455.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 6963-05]
gi|353820164|gb|EHE00352.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17227]
gi|353835112|gb|EHE15207.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41277]
gi|353846724|gb|EHE26752.1| alpha-L-fucosidase [Streptococcus pneumoniae GA43380]
gi|353864851|gb|EHE44761.1| alpha-L-fucosidase [Streptococcus pneumoniae GA52306]
gi|353868852|gb|EHE48736.1| alpha-L-fucosidase [Streptococcus pneumoniae Netherlands15B-37]
gi|353899786|gb|EHE75353.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA11426]
gi|379535787|gb|EHZ00985.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA04175]
gi|379536100|gb|EHZ01291.1| hypothetical protein SPAR7_2189 [Streptococcus pneumoniae GA05245]
gi|379542257|gb|EHZ07415.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA05248]
gi|379542673|gb|EHZ07828.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA06083]
gi|379557271|gb|EHZ22317.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA14688]
gi|379569141|gb|EHZ34115.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19101]
gi|379575027|gb|EHZ39964.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40183]
gi|379584597|gb|EHZ49463.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44128]
gi|379597600|gb|EHZ62398.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47522]
gi|379597787|gb|EHZ62584.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47597]
gi|379626380|gb|EHZ90998.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP04]
gi|379626589|gb|EHZ91206.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP05]
gi|379632837|gb|EHZ97407.1| hypothetical protein SPAR149_2151 [Streptococcus pneumoniae
GA05578]
gi|379637411|gb|EIA01967.1| hypothetical protein SPAR154_2086 [Streptococcus pneumoniae
GA02506]
gi|395571912|gb|EJG32514.1| large secreted protein [Streptococcus pneumoniae 2090008]
gi|395572036|gb|EJG32637.1| large secreted protein [Streptococcus pneumoniae 2070005]
gi|395584331|gb|EJG44724.1| large secreted protein [Streptococcus pneumoniae 2070425]
gi|395586059|gb|EJG46437.1| large secreted protein [Streptococcus pneumoniae 2070531]
gi|395588107|gb|EJG48442.1| large secreted protein [Streptococcus pneumoniae 2070768]
gi|395592519|gb|EJG52784.1| large secreted protein [Streptococcus pneumoniae 2061376]
gi|395605911|gb|EJG66022.1| large secreted protein [Streptococcus pneumoniae 2080913]
gi|395865531|gb|EJG76670.1| hypothetical protein SPAR27_2104 [Streptococcus pneumoniae SPAR27]
gi|395879316|gb|EJG90376.1| hypothetical protein SPAR151_2120 [Streptococcus pneumoniae
GA04216]
gi|395891223|gb|EJH02225.1| hypothetical protein SPAR159_2221 [Streptococcus pneumoniae
GA56348]
gi|429316926|emb|CCP36654.1| putative alpha-L-fucosidase [Streptococcus pneumoniae SPN034156]
gi|444252808|gb|ELU59268.1| hypothetical protein PCS8203_00020 [Streptococcus pneumoniae
PCS8203]
gi|444253936|gb|ELU60383.1| hypothetical protein PCS8106_00075 [Streptococcus pneumoniae
PCS8106]
Length = 764
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 260/780 (33%), Positives = 391/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|168484015|ref|ZP_02708967.1| large secreted protein [Streptococcus pneumoniae CDC1873-00]
gi|417697350|ref|ZP_12346525.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47368]
gi|418108816|ref|ZP_12745849.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41410]
gi|418111150|ref|ZP_12748165.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49447]
gi|418163224|ref|ZP_12799902.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17328]
gi|418168087|ref|ZP_12804735.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19077]
gi|418176974|ref|ZP_12813561.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41437]
gi|418219924|ref|ZP_12846585.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP127]
gi|419423904|ref|ZP_13964112.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43264]
gi|419461001|ref|ZP_14000923.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02270]
gi|419463323|ref|ZP_14003222.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02714]
gi|421273944|ref|ZP_15724780.1| alpha-L-fucosidase [Streptococcus pneumoniae SPAR55]
gi|172042696|gb|EDT50742.1| large secreted protein [Streptococcus pneumoniae CDC1873-00]
gi|332198777|gb|EGJ12859.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47368]
gi|353775273|gb|EHD55754.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA41410]
gi|353780261|gb|EHD60720.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49447]
gi|353825359|gb|EHE05524.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17328]
gi|353837695|gb|EHE17777.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19077]
gi|353838933|gb|EHE19009.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41437]
gi|353871990|gb|EHE51859.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP127]
gi|379528874|gb|EHY94127.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02270]
gi|379529046|gb|EHY94298.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02714]
gi|379584326|gb|EHZ49194.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43264]
gi|395872020|gb|EJG83121.1| alpha-L-fucosidase [Streptococcus pneumoniae SPAR55]
Length = 764
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 260/780 (33%), Positives = 392/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD ++ ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFSDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|148998038|ref|ZP_01825551.1| hypothetical protein CGSSp11BS70_05545 [Streptococcus pneumoniae
SP11-BS70]
gi|168576031|ref|ZP_02721936.1| large secreted protein [Streptococcus pneumoniae MLV-016]
gi|307068776|ref|YP_003877742.1| hypothetical protein SPAP_2209 [Streptococcus pneumoniae AP200]
gi|419472044|ref|ZP_14011900.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07914]
gi|419504884|ref|ZP_14044547.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47760]
gi|421315019|ref|ZP_15765603.1| hypothetical protein SPAR100_2088 [Streptococcus pneumoniae
GA47562]
gi|147756048|gb|EDK63091.1| hypothetical protein CGSSp11BS70_05545 [Streptococcus pneumoniae
SP11-BS70]
gi|183578125|gb|EDT98653.1| large secreted protein [Streptococcus pneumoniae MLV-016]
gi|306410313|gb|ADM85740.1| hypothetical protein SPAP_2209 [Streptococcus pneumoniae AP200]
gi|379543433|gb|EHZ08583.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07914]
gi|379604070|gb|EHZ68832.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47760]
gi|395911603|gb|EJH22468.1| hypothetical protein SPAR100_2088 [Streptococcus pneumoniae
GA47562]
Length = 764
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 261/780 (33%), Positives = 390/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHTSPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|333022556|ref|ZP_08450620.1| putative fibronectin type III domain-containing protein
[Streptomyces sp. Tu6071]
gi|332742408|gb|EGJ72849.1| putative fibronectin type III domain-containing protein
[Streptomyces sp. Tu6071]
Length = 783
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 264/801 (32%), Positives = 400/801 (49%), Gaps = 83/801 (10%)
Query: 15 ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
+ + PA + T+++P+GNG LGA V+G +P+E ++ E TLWTG PG ++ NP
Sbjct: 46 LRYGAPATDWETESLPVGNGALGASVFGTLPTEHIQFAEKTLWTGGPGTSGYRYGNWENP 105
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
P AL+ VR+ +++ A+ +L G P Y Q GD+ ++ D + + +
Sbjct: 106 R-PDALASVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGDLLIDVDGA--PGSADG 161
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R LDL A A V Y F R F+S PD+V+V + GS+ N+ S +
Sbjct: 162 YTRTLDLAQALATVSYPHDGTTFRRTVFASCPDKVLVGHFTADRGGSVGLNLRYTSPRQD 221
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +++ + G G++F A +I++ + G+++A D+ L
Sbjct: 222 FTATTDGDRLTVRGAL-------------QDNGMRFEA--QIRLLSEGGSVTANGDR-LT 265
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+D A +L A + + + P DP +A+ Y +L RH D+
Sbjct: 266 VSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVTTAVDQAAARPYRELLDRHTSDHAA 323
Query: 304 LFHRVSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSF---QTDEDPSLVELLFQFGRY 359
LF RV + L + S D TD +K++ + +D +L L FQ+GRY
Sbjct: 324 LFSRVVLDLGQGSAPDRTTDAL-------------LKAYTGGNSADDRALEALFFQYGRY 370
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLI+SSR G+ ANLQG WN +P W + HVNINL+MNYW + NL+E P F+
Sbjct: 371 LLIASSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFV 430
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHY 478
L G TA+ + A GWV+H +T + + D W +P AWL + L+EHY
Sbjct: 431 EALRAPGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHY 488
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACV 536
+ D+L AYP ++ A F +D L + D L PS SPEH +F A
Sbjct: 489 RFDGSTDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA-------- 540
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEWAQ 595
+ M I+RE+F + AA+ L ++ A + ++L R+ P +I G +MEW
Sbjct: 541 --GAAMSQQIVRELFLNTLEAAQTL-GDDPAFRTTLKETLDRIDPGLRIGSWGQLMEWKT 597
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA 655
D HRH+SHL+ L PG IE D +AA+ +L RG+ G GWS WK WA
Sbjct: 598 DLDGRTDDHRHVSHLYALHPGR--QIEPGSDFAEAAKVSLTARGDGGTGWSKAWKINFWA 655
Query: 656 RLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
RL D +HA+ M+ + +G +NL+ HPPFQID NFG T+ + EML+Q
Sbjct: 656 RLRDGDHAHTMLA-----------EQLKGSTLANLWDTHPPFQIDGNFGATSGITEMLLQ 704
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFK 775
S + + +LPALP WSSG V+GL+ARGG T+ W++G + + + S + +
Sbjct: 705 SQHDVIEVLPALP-AAWSSGTVRGLRARGGATLEFSWENGRATRIALTA--SRTRELTVR 761
Query: 776 TLHYRGTSVKVNLSAGKIYTF 796
G + AG+ YT+
Sbjct: 762 NALVPGGTTTFKAVAGETYTW 782
>gi|115443166|ref|XP_001218390.1| predicted protein [Aspergillus terreus NIH2624]
gi|114188259|gb|EAU29959.1| predicted protein [Aspergillus terreus NIH2624]
Length = 796
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 267/806 (33%), Positives = 415/806 (51%), Gaps = 74/806 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + ++PIGNGRLGA VWG E + LNE+++W+G D NP+A + R
Sbjct: 31 YESPASDYAGSLPIGNGRLGATVWG-TAVEKITLNENSIWSGPFQDRVNPNAYDGFTQAR 89
Query: 77 SLVDSGQYAEATAASVKLFGH----PADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
SL++ G A +++ P + Y LG + L+F+ H YRR LDL +
Sbjct: 90 SLLEKGDMTGAGEVTLRDMASIPTSPRE-YHPLGVLHLDFN--HDVNLMTNYRRSLDLYS 146
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGN 190
A V+Y V ++RE+ +S P VI +++ SE G+L+ SL D + ++S + N
Sbjct: 147 GNAVVEYDYNGVRYSREYIASAPAGVIAIRVTASEPGNLTVACSLARDRYVIDNSASSPN 206
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
I+ R+ AN D IQF I E +I G + + + + +
Sbjct: 207 ETGIL-------RL--MANTGDMEDPIQF--ISEARIIGHGGRVVSNSTTVVVRDATSVE 255
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
+ +S + P + K++ +E L + Y+ + T + D+ L RV+I
Sbjct: 256 IFFDAETS-----YRYPDEDKRE--AEMDRKLSTAMGRGYNAVKTAAVADHLSLARRVNI 308
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ--TDEDPSLVELLFQFGRYLLISSSR-- 366
+L S + +P+ R+K+++ D DP L L+F FGR+ LI+SSR
Sbjct: 309 KLG-----------SSGSAGQLPTDTRLKNYKDNPDSDPELATLMFNFGRHSLIASSRQS 357
Query: 367 --PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
PG ANLQGIWN+D SP W V++NLEMNYW + NL++ +P D + +
Sbjct: 358 GSPGLP-ANLQGIWNQDYSPAWGGKYTVDVNLEMNYWPAEVTNLADTFDPFMDLMDTVVP 416
Query: 425 NGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+G A+ Y G+V+HH TD+W ++ W +WPMG AWL +L +HY +T
Sbjct: 417 HGIDVAKRMYQCDNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGSAWLSENLMQHYRFTQ 476
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVS 537
+++ L +R +PLL+ A F +L E DGY + PS SPE+ FI P GK +
Sbjct: 477 NKEVLRERIWPLLKSAAQFYYCYLFE-FDGYFSSGPSISPENAFIVPSDMSVAGKSEGID 535
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
S TMD A++ E+F+++I A++LE + V+K + L +++P +I DG I+EW +++
Sbjct: 536 ISPTMDNALLYELFNSVIETADILEITGEE-VDKAKEYLAKIKPPQIGSDGQILEWRREY 594
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALW 654
++ E HRH+S + GL+PG +T N L AA+ L +R + G GWS TW +L+
Sbjct: 595 QETEPGHRHMSPIVGLYPGSQLTPLVNQTLADAAKVLLDRRIDHGSGSTGWSRTWTMSLY 654
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL D + ++ K + + L++ FQID NFGFTA +AEML+
Sbjct: 655 ARLLDGDAVWKHAKVFL-------QTYPSVNLWNTDSGPGSAFQIDGNFGFTAGIAEMLL 707
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN------ 768
QS ++LLPALP +G V GL ARG V I W +G L + + S
Sbjct: 708 QSH-QVVHLLPALP-SAVPTGHVSGLVARGNFVVDIQWVEGSLTQATVKSRSGGQLSLRV 765
Query: 769 NDHDSFKTLHYRGTSVKVNLSAGKIY 794
D +F T++ + ++ SAGK Y
Sbjct: 766 QDGKAF-TVNGEEYTEPISTSAGKSY 790
>gi|354604085|ref|ZP_09022078.1| hypothetical protein HMPREF9450_00993 [Alistipes indistinctus YIT
12060]
gi|353348517|gb|EHB92789.1| hypothetical protein HMPREF9450_00993 [Alistipes indistinctus YIT
12060]
Length = 777
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 256/758 (33%), Positives = 378/758 (49%), Gaps = 107/758 (14%)
Query: 17 FNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
+ PA ++ T+A+P+GNGR+GAM++GG+P E ++ N+ TLWTG
Sbjct: 42 YTRPATNWMTEALPVGNGRIGAMIFGGLPVERIQFNDKTLWTG----------------- 84
Query: 76 RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDS--HLKYAEETYRRELDLNTA 133
S + G YQ GDI ++F + + YRRELDL+ A
Sbjct: 85 -STTERG------------------AYQNFGDIFIDFGAAGGNNPRGPVDYRRELDLDDA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A+V Y V +TRE+ +S PD VI + + ++ G + F V +D N I
Sbjct: 126 LAKVVYKADGVTYTREYLASYPDDVIAMRFTANKKGKIGFTVRMDDAHTGGQRTVTGNSI 185
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+ G+ S ++ + ++ GT+ A D L + G+D A LL
Sbjct: 186 TISGKL-----------------TLLSYKAQLTVLNEGGTLQA-GDSTLTLTGADAATLL 227
Query: 254 LVASSSFDGP---FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L A + +D ++ SD K ++ + A Y+ L HLDDY L++R+S+
Sbjct: 228 LSAGTDYDPQSPDYLTRSDWKGKVSTVAARAGSK----GYAALRKAHLDDYHALYNRLSL 283
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ + ++ TD V+ + + DP+ L FQ+GRYL I+SSRPG
Sbjct: 284 NVGNTTPELPTDELF------------VRYSKGEYDPAADVLYFQYGRYLTIASSRPGLD 331
Query: 371 V-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS-INGSK 428
+ +NLQG+WN+ +P W S H NIN++MNYW + P NL+EC EP ++ S ++ S
Sbjct: 332 LPSNLQGLWNDSNTPPWQSDIHSNINVQMNYWPAEPTNLAECHEPFTRYIYNESQLHDSW 391
Query: 429 TAQVNYL-ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
L GW + + +I+ S W AW C H+W+ Y + RD+L
Sbjct: 392 KKMAGELDCGGWALKTQNNIFGYSD-------WNWNRPANAWYCMHVWDKYLFDPQRDYL 444
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA-- 545
E+ AYP+++ F LD LI DG L SPEH + S + A
Sbjct: 445 EQEAYPVMKSACRFWLDRLIVDDDGKLVAPNEWSPEHG-----------PWESGIPYAQQ 493
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEWAQDFKDPEVHH 604
+I ++F+ + A +L ++ A V+++ L RL + G + EW DP H
Sbjct: 494 LIWDLFNNTVRAGRILGTDQ-AFVDQLESKLERLDNGLTVGSWGQLREWKHLEDDPANQH 552
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL GL+PG I+ + AA +TL RG+ G GWS WK A WARL D +HA+
Sbjct: 553 RHVSHLIGLYPGRAISPALDTLYANAARRTLAARGDFGTGWSRAWKIAFWARLLDGDHAH 612
Query: 665 RMVKRLFNLVDP-----EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
++K L D + ++ G+Y+NLF AHPPFQID NFG TA VAEML+QS L
Sbjct: 613 LLLKNAMTLTDNTGLTYQTHQNSGSGIYANLFDAHPPFQIDGNFGATAGVAEMLLQSQLG 672
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
+L+LLPALP W +G VKGL+ RGG V + W G L
Sbjct: 673 ELHLLPALP-SVWGTGEVKGLRGRGGYVVDMDWSGGRL 709
>gi|168491689|ref|ZP_02715832.1| large secreted protein [Streptococcus pneumoniae CDC0288-04]
gi|183574053|gb|EDT94581.1| large secreted protein [Streptococcus pneumoniae CDC0288-04]
Length = 764
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 259/780 (33%), Positives = 389/780 (49%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFINRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|419436976|ref|ZP_13977057.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 8190-05]
gi|379611263|gb|EHZ75990.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 8190-05]
Length = 764
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 260/780 (33%), Positives = 391/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|421212007|ref|ZP_15668985.1| large secreted protein [Streptococcus pneumoniae 2070035]
gi|421232851|ref|ZP_15689488.1| large secreted protein [Streptococcus pneumoniae 2080076]
gi|395571698|gb|EJG32309.1| large secreted protein [Streptococcus pneumoniae 2070035]
gi|395593380|gb|EJG53629.1| large secreted protein [Streptococcus pneumoniae 2080076]
Length = 764
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 259/780 (33%), Positives = 389/780 (49%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|452000004|gb|EMD92466.1| glycoside hydrolase family 95 protein [Cochliobolus heterostrophus
C5]
Length = 806
Score = 394 bits (1012), Expect = e-106, Method: Compositional matrix adjust.
Identities = 264/780 (33%), Positives = 396/780 (50%), Gaps = 83/780 (10%)
Query: 11 NPLKITFNGP--AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
N ++ + P + +TDA+PIGNGRLGAM++G E ++LNE+T+W+G D N +
Sbjct: 21 NSTRLWYTAPVASSTWTDALPIGNGRLGAMIYGIPVQELIQLNEETIWSGGRRDRVNQNG 80
Query: 69 PKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYR 125
+ +S+VR L+ G A A++ + G P YQ LGD+E+ FD + +Y TY
Sbjct: 81 AQTVSEVRDLLARGDAGGAQKLANLGMMGTPQTCRNYQTLGDMEISFDGTS-EYDNTTYE 139
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS 185
R LDL+TA A V++ V + + RE F S PD V V + + +G LSF + + D +
Sbjct: 140 RWLDLDTALAGVRFKVNDTLYEREMFVSVPDDVFVHHLKATGNGKLSFQIRVHRPKDGLN 199
Query: 186 YV-----NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
N N M G G DP + F+ L ++ T+
Sbjct: 200 EASDQNWNENGWTYMTGGTGGI----------DP--VVFTTALAVESDGHVRTLGEF--- 244
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE + A L A++S+ D + S +Q R +Y +L RH++D
Sbjct: 245 -IVVENATEATAFLAAATSY---------RHNDTRAAVDSTIQKARQHTYEELRRRHIED 294
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRY 359
Y L++ + L+ D+ T + +P+ R+ + + DP LV L + +GRY
Sbjct: 295 YSPLYNASVLNLN--GPDLGTSS--------LPTNARINATRRGANDPGLVALAYNYGRY 344
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSR G +NLQGIWN++ P W S VNINL+MNYW + +LS EP FD L
Sbjct: 345 LLISSSRAGNLPSNLQGIWNKEFDPLWGSKYTVNINLQMNYWPAEVTSLSSLHEPFFDLL 404
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ +G+ TA+ Y ASGW+ HH TD+W ++ + W + WL TH+ EHY
Sbjct: 405 ELMRKDGTHTAKAMYNASGWMSHHNTDLWGDTAPVDTYLPATYWTLSSGWLVTHILEHYW 464
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKLAC 535
YT D+ FL + + E F LD L G + YL TNPS SPE+ ++ PDGK
Sbjct: 465 YTGDKSFLASNLHIVSEAI-EFYLDTLQPYKTNGTE-YLVTNPSVSPENTYVGPDGKSYN 522
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GSIM 591
+ T D+ I+ E+F+ ++A L + + A + ++ + +L P + + G++
Sbjct: 523 FDIAPTCDVEILNELFTNYLNAVATLSNSTVDSAFLTRIRDTQAKLPPYRYSTRYPGTLQ 582
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNP----DLCKAAEKTLQKR---GEEGPG 644
EW QD++ E HRH+SHL+ L+PG I P L AA TL+ R G G
Sbjct: 583 EWMQDYEQAEPGHRHVSHLYALYPGTQIPPPGAPGYDAKLFNAAAATLEDRLSHNGAGTG 642
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP-FQIDANF 703
WS W +ARL ++A + + F + F +++NL + FQID N
Sbjct: 643 WSRAWTINWYARL---QNATALAENTF--------QFFNTSVFNNLMDVNEGIFQIDGNL 691
Query: 704 GFTAAVAEMLVQSTLND------LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
GF + VAE L+QS + D ++LLP LP ++WS G V G+ ARGG + W DG L
Sbjct: 692 GFVSGVAEGLLQSHVVDDKGVREVWLLPVLP-EEWSDGSVNGIAARGGFVFDLEWADGKL 750
>gi|302540737|ref|ZP_07293079.1| alpha-L-fucosidase 2 [Streptomyces hygroscopicus ATCC 53653]
gi|302458355|gb|EFL21448.1| alpha-L-fucosidase 2 [Streptomyces himastatinicus ATCC 53653]
Length = 775
Score = 394 bits (1012), Expect = e-106, Method: Compositional matrix adjust.
Identities = 265/762 (34%), Positives = 386/762 (50%), Gaps = 81/762 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV----------PGDYTNPDAPKALSDV 75
+A+PIGNG LGAMV+GGV E ++ NE +LWTG G++ P P AL+ V
Sbjct: 18 EALPIGNGTLGAMVFGGVARERIQFNEKSLWTGGPGGPGSAPYDSGNWREPR-PGALAAV 76
Query: 76 RSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+ L+D A + +L G P YQ GD+ LE + + ++YRR L++
Sbjct: 77 QRLIDEHGAAAPEDVAARL-GQPRSRYGAYQPFGDLWLEIPGA--PESPDSYRRLLEIRK 133
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A VKY+ V RE F+S PD+VIV + + G++ F + S +V ++
Sbjct: 134 GVALVKYTAQGVRHRREFFASYPDRVIVGRFDAA-PGTVGFTLRHTSPRPGDHHVTAHD- 191
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
R+ + D+ G++F A ++++ D GT+++ ED L V G+ A
Sbjct: 192 ---------GRLTIRGALEDN--GLRFEA--QVRVMADGGTVTSGEDGTLTVTGAHSAWF 238
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+L A + + +P +DP + + + Y L +RH+ D++ LF R ++ L
Sbjct: 239 VLAAGTDYAD--THPHYRGEDPHRTVTGTVDAAADRGYLTLLSRHVRDHRALFDRTALDL 296
Query: 313 S-RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
R+P TD A+R +L EL F +GRYLLI+SSRPG +
Sbjct: 297 GGRTPPRTPTDRQRAAYTGGESPADR----------ALEELFFDYGRYLLIASSRPGAPL 346
Query: 372 -ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQGIWN+ + P W + H NINL+M YW + +L+E EPL F+T L G TA
Sbjct: 347 PANLQGIWNDSVRPAWSADYHTNINLQMAYWPAHALHLAETAEPLHRFITALRAPGRITA 406
Query: 431 QVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
+ + A GWV+H++T+ + + D W +P AWL HL+EHY +T+D FL
Sbjct: 407 REMFGARGWVVHNETNAYGFTGVHDWSTAFW--FPEAAAWLVHHLYEHYRFTLDTGFLRD 464
Query: 490 RAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH-EFIAPDGKLACVSYSSTMDMAII 547
AYP + A+F LD L + DG L +P SPEH +F A M I+
Sbjct: 465 TAYPAMREAAAFWLDTLRPDPRDGTLVVSPGYSPEHGDFTA----------GPAMSQQIV 514
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEWAQDFKDPEVHHRH 606
++ +A + AA L ++ AL + ++L L P +I G + EW D DP HRH
Sbjct: 515 HDLLTATLEAARTL-GDDPALQAGLRRALDALDPGLRIGSWGQLQEWKADLDDPADTHRH 573
Query: 607 LSHLFGLFPGHTITIEKNPD--LCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
SHLF L PG I PD AA +L RG+ G GWS WK WARL D + A+
Sbjct: 574 ASHLFALHPGRQIA----PDGPWAGAAAVSLDARGDGGTGWSRAWKVNFWARLRDGDRAH 629
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
R++ L D NL+ HPPFQID NFG A +A+ML+QS L +L
Sbjct: 630 RLLA--GQLTD---------STLPNLWDTHPPFQIDGNFGAAAGIAQMLLQSHRAVLDVL 678
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
PALP +W G V+GL+A G TV I W++G + + + +
Sbjct: 679 PALP-RRWPDGAVRGLRAHGDLTVDITWREGRARTLTVAAGH 719
>gi|418147412|ref|ZP_12784184.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13637]
gi|353810492|gb|EHD90743.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13637]
Length = 764
Score = 394 bits (1011), Expect = e-106, Method: Compositional matrix adjust.
Identities = 259/780 (33%), Positives = 389/780 (49%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|149003007|ref|ZP_01827918.1| hypothetical protein CGSSp14BS69_00740 [Streptococcus pneumoniae
SP14-BS69]
gi|168489226|ref|ZP_02713425.1| large secreted protein [Streptococcus pneumoniae SP195]
gi|221232865|ref|YP_002512019.1| hypothetical protein SPN23F_21920 [Streptococcus pneumoniae ATCC
700669]
gi|225855653|ref|YP_002737165.1| large secreted protein [Streptococcus pneumoniae JJA]
gi|237650653|ref|ZP_04524905.1| large secreted protein [Streptococcus pneumoniae CCRI 1974]
gi|237822208|ref|ZP_04598053.1| large secreted protein [Streptococcus pneumoniae CCRI 1974M2]
gi|415701401|ref|ZP_11458355.1| large secreted protein [Streptococcus pneumoniae 459-5]
gi|415750467|ref|ZP_11478309.1| large secreted protein [Streptococcus pneumoniae SV35]
gi|415753360|ref|ZP_11480342.1| large secreted protein [Streptococcus pneumoniae SV36]
gi|417680132|ref|ZP_12329525.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17570]
gi|418124532|ref|ZP_12761459.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44378]
gi|418126808|ref|ZP_12763710.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44511]
gi|418129072|ref|ZP_12765961.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP170]
gi|418138272|ref|ZP_12775106.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11663]
gi|418144761|ref|ZP_12781556.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13494]
gi|418179304|ref|ZP_12815881.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41565]
gi|418192602|ref|ZP_12829101.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47388]
gi|418215362|ref|ZP_12842093.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA54644]
gi|418235345|ref|ZP_12861918.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA08780]
gi|419458700|ref|ZP_13998639.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02254]
gi|419474246|ref|ZP_14014091.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13430]
gi|419485376|ref|ZP_14025147.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43257]
gi|419494282|ref|ZP_14034004.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47210]
gi|419509243|ref|ZP_14048891.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49542]
gi|421279922|ref|ZP_15730725.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17301]
gi|421300242|ref|ZP_15750913.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19998]
gi|147759010|gb|EDK66005.1| hypothetical protein CGSSp14BS69_00740 [Streptococcus pneumoniae
SP14-BS69]
gi|183572159|gb|EDT92687.1| large secreted protein [Streptococcus pneumoniae SP195]
gi|220675327|emb|CAR69925.1| conserved hypothetical protein [Streptococcus pneumoniae ATCC
700669]
gi|225723250|gb|ACO19103.1| large secreted protein [Streptococcus pneumoniae JJA]
gi|332071597|gb|EGI82090.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17570]
gi|353794144|gb|EHD74502.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44378]
gi|353794344|gb|EHD74701.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44511]
gi|353797122|gb|EHD77459.1| putative alpha-L-fucosidase [Streptococcus pneumoniae NP170]
gi|353807227|gb|EHD87499.1| alpha-L-fucosidase [Streptococcus pneumoniae GA13494]
gi|353840818|gb|EHE20880.1| alpha-L-fucosidase [Streptococcus pneumoniae GA41565]
gi|353854436|gb|EHE34414.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47388]
gi|353867652|gb|EHE47543.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA54644]
gi|353885068|gb|EHE64858.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA08780]
gi|353899629|gb|EHE75198.1| alpha-L-fucosidase [Streptococcus pneumoniae GA11663]
gi|379528696|gb|EHY93950.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA02254]
gi|379549315|gb|EHZ14425.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13430]
gi|379580149|gb|EHZ45044.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA43257]
gi|379591544|gb|EHZ56368.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47210]
gi|379609534|gb|EHZ74272.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA49542]
gi|381309007|gb|EIC49850.1| large secreted protein [Streptococcus pneumoniae SV36]
gi|381313067|gb|EIC53859.1| large secreted protein [Streptococcus pneumoniae 459-5]
gi|381316317|gb|EIC57067.1| large secreted protein [Streptococcus pneumoniae SV35]
gi|395877150|gb|EJG88220.1| alpha-L-fucosidase [Streptococcus pneumoniae GA17301]
gi|395899666|gb|EJH10605.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19998]
Length = 764
Score = 394 bits (1011), Expect = e-106, Method: Compositional matrix adjust.
Identities = 259/780 (33%), Positives = 389/780 (49%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELCIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|330998117|ref|ZP_08321945.1| hypothetical protein HMPREF9442_03053 [Paraprevotella xylaniphila
YIT 11841]
gi|329569206|gb|EGG50997.1| hypothetical protein HMPREF9442_03053 [Paraprevotella xylaniphila
YIT 11841]
Length = 746
Score = 394 bits (1011), Expect = e-106, Method: Compositional matrix adjust.
Identities = 263/769 (34%), Positives = 375/769 (48%), Gaps = 114/769 (14%)
Query: 12 PLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
P+K+ ++ PAK + T A+P+GNG +GAM +GGV E L+ N+ TLW G
Sbjct: 25 PMKLWYDEPAKVWMTSALPVGNGGIGAMFFGGVAKERLQFNDKTLWAG------------ 72
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
S G YQ +GD+ EFD YRREL L
Sbjct: 73 ------STTRRG------------------AYQNMGDLFFEFDTPE---TCTNYRRELSL 105
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG-SESGSLSFNVSLDSLLDNHSYVNG 189
+ A RV Y++ V++ RE+F+SNPD VIV +++ G L+F++ + + V+G
Sbjct: 106 DDAIGRVSYTIDGVDYLREYFASNPDSVIVVRLTTPRHKGKLNFSLRMQDGRQGMTRVDG 165
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQ--FSAILEIKISDDRGTISALEDKKLKVEGS 247
+ I KG S + ++ D G + D+ L+V+G+
Sbjct: 166 HTMTI--------------------KGTLDLLSYEAQARLQADGGMVETKSDR-LEVKGA 204
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA-LQSIRNLSYSDLYTRHLDDYQKLFH 306
D ++L +++FD + D +SA + SY L HL DYQ LF
Sbjct: 205 DAVTVVLTGATNFDLASPTYTRGDADEIHRRVSARMDKAARKSYKKLKAVHLADYQPLFA 264
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV + L D TD E+ D + L L FQ+GRYL++ SSR
Sbjct: 265 RVELDLDAEQPDYTTDVLVREHKD---------------NAYLDMLYFQYGRYLMLGSSR 309
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-- 424
G +NLQG+WN +P W+ H NIN++MNYW + NLSEC P F+TY+S
Sbjct: 310 GGQLPSNLQGLWNNVNNPAWECDYHSNINVQMNYWPAEVANLSECYAP---FITYVSTEA 366
Query: 425 --NGSKTAQVNYL--ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+G QV GW +H + +I+ G W + AW CTHLW+HY Y
Sbjct: 367 LKDGGSWQQVARKENCRGWAVHTQNNIF-------GYTDWLINRPANAWYCTHLWQHYAY 419
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSY 538
T+D+++L A+P+++ + D L E +G L SPEH P DG V+Y
Sbjct: 420 TLDKEYLRDTAWPVMKVTCQYWFDRLKENTEGRLVAPNEWSPEH---GPWEDG----VAY 472
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEWAQDF 597
+ + A+ E ++AA VL +DA V ++ + RL + G I EW
Sbjct: 473 AQQLVYALFEET----LAAAGVLAV-DDAFVSELKEKFSRLDNGLHVGSWGQIKEWTIQE 527
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARL 657
H RHLSHL L+P I+ K+ +AA+ L RG+ GWS WK A WARL
Sbjct: 528 DKQGDHQRHLSHLMALYPCDQISYLKDKRYAEAAKVALDSRGDGATGWSRAWKVACWARL 587
Query: 658 HDQEHAYRMVKRLFNLVDPE--HEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
D E AYR++K+ N+ D GG+Y NLF AHP FQID NFG TA +AEM++Q
Sbjct: 588 WDGERAYRLLKQAQNITDVTVVSMDDNAGGVYENLFCAHPSFQIDGNFGATAGIAEMMLQ 647
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+T+ ++LLPALP W G KGLKA+GG + WKDG + E ++S
Sbjct: 648 NTVKGVHLLPALP-SAWDDGHFKGLKAKGGFVFDVAWKDGKMVEGRVHS 695
>gi|417695030|ref|ZP_12344214.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47901]
gi|332198979|gb|EGJ13060.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47901]
Length = 764
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 259/780 (33%), Positives = 390/780 (50%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P NLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPVNLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|419443562|ref|ZP_13983582.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13224]
gi|379549113|gb|EHZ14224.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA13224]
Length = 764
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 258/780 (33%), Positives = 389/780 (49%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L + + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPKVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LPR TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|225857727|ref|YP_002739238.1| large secreted protein [Streptococcus pneumoniae P1031]
gi|444410728|ref|ZP_21207248.1| hypothetical protein PNI0076_01706 [Streptococcus pneumoniae
PNI0076]
gi|444412459|ref|ZP_21208780.1| hypothetical protein PNI0153_00823 [Streptococcus pneumoniae
PNI0153]
gi|444422182|ref|ZP_21217843.1| hypothetical protein PNI0446_00530 [Streptococcus pneumoniae
PNI0446]
gi|225724930|gb|ACO20782.1| large secreted protein [Streptococcus pneumoniae P1031]
gi|444274421|gb|ELU80068.1| hypothetical protein PNI0153_00823 [Streptococcus pneumoniae
PNI0153]
gi|444276759|gb|ELU82299.1| hypothetical protein PNI0076_01706 [Streptococcus pneumoniae
PNI0076]
gi|444288455|gb|ELU93349.1| hypothetical protein PNI0446_00530 [Streptococcus pneumoniae
PNI0446]
Length = 764
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 258/780 (33%), Positives = 389/780 (49%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|220911208|ref|YP_002486517.1| twin-arginine translocation pathway signal [Arthrobacter
chlorophenolicus A6]
gi|219858086|gb|ACL38428.1| twin-arginine translocation pathway signal [Arthrobacter
chlorophenolicus A6]
Length = 781
Score = 393 bits (1009), Expect = e-106, Method: Compositional matrix adjust.
Identities = 271/803 (33%), Positives = 393/803 (48%), Gaps = 64/803 (7%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP-----DAPKA 71
+ GPA+ F +++P+GNG GA + G E +++NE + W+G P D + P +
Sbjct: 4 YRGPAEKFVESLPVGNGLAGATLRGLAGGERIQINEGSAWSG-PTDRSAPPLDPAEGTAR 62
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
L VR VD+G A + G + Y L L D + R LDL
Sbjct: 63 LHAVREAVDAGDVRRAEELLLAFQGTHSQAY--LPFAVLSVDAEGTAAPADGPARWLDLR 120
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
T A +Y + E F+S+PD VIV I+ S L ++ D + G +
Sbjct: 121 TGVAGHRYLLDGAEARHRTFASHPDAVIVHDIAFSAPADLRIGIAPDKI-----TATGMD 175
Query: 192 QIIME-------GRCPGKRIPPKANANDDP----KGIQFSAI-LEIKISDDRGTISALED 239
+ + G + P D P G + A+ + D G +
Sbjct: 176 AVTRDWGTELRLGLLLPADVAPAHEQADHPVVYGHGSRAGAVHAGVATDGDAGFARGV-- 233
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIR------NLSYSDL 293
L + G+ + +++ + + PF +++ D +++++ L S R +
Sbjct: 234 --LAIRGATFVRIVVATGTVLNHPFARHANTADD--ADALAGLLSARIAGVLEEEAVEPA 289
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVEL 352
RHL D+ +L+ RV+++L P P+ ER+++F+TD+ D +L+ L
Sbjct: 290 LQRHLADHARLYSRVTLELGGGPAAAAGK----------PTDERIRAFETDKSDSALMAL 339
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
LF +GRYLLI+SSR G ANLQGIWNE+L W S +NIN +MNYW +L +L+EC
Sbjct: 340 LFHYGRYLLIASSREGGFPANLQGIWNEELQAPWSSNYTININTQMNYWPALTTSLAECH 399
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAW 469
EPL + L+ A Y A GWV HH TD W A +G +WA W MGG W
Sbjct: 400 EPLLRLVDTLART-GAAAAGLYGARGWVAHHNTDPWGHPFAVGAGKGNAMWASWAMGGTW 458
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
L +W HY +T D LEK ++P LEG F LDW+ T+PSTSPE+ F+A
Sbjct: 459 LAEAVWRHYAFTGDLARLEK-SWPALEGACLFALDWITGEPGSGTHTSPSTSPENRFVAD 517
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
DG A V S+TMD++++R + + AA VL L E K +P I G
Sbjct: 518 DGGPAAVGRSATMDVSLLRALCGSARQAAAVLGAPVPWLDEFTRKVAALPQPA-IGSRGE 576
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
++EW+ + E HRH SHL GLFP + E P+L AA +TL+ RG E GW++ W
Sbjct: 577 VLEWSFPATEHEPEHRHTSHLAGLFPLRDWSPEATPELAAAAARTLELRGPESTGWAMAW 636
Query: 650 KTALWARLHDQEHAYRMVKRLFNLV-DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
+ LWA L + A + + D E+ GG+Y NLF AHPPFQIDANFG TA
Sbjct: 637 RLGLWASLGNAGKAEESLHLALRVAGDGLAER---GGVYPNLFTAHPPFQIDANFGTTAG 693
Query: 709 VAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
+AEMLVQS + LLPALP W G V+GL+ GG V + W G L + S+ +
Sbjct: 694 IAEMLVQSDAAAIRLLPALP-AAWGDGSVRGLRTVGGIGVDLRWSGGVLRSAVLRSSAAV 752
Query: 769 NDHDSFKTLHYRGTSVKVNLSAG 791
+ + + G + V L+ G
Sbjct: 753 R-----RDIVWNGRRISVELAGG 770
>gi|15901970|ref|NP_346574.1| hypothetical protein SP_2160 [Streptococcus pneumoniae TIGR4]
gi|418131327|ref|ZP_12768207.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07643]
gi|418230992|ref|ZP_12857587.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP01]
gi|419478817|ref|ZP_14018636.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18068]
gi|421243935|ref|ZP_15700445.1| large secreted protein [Streptococcus pneumoniae 2081074]
gi|421248340|ref|ZP_15704814.1| large secreted protein [Streptococcus pneumoniae 2082170]
gi|14973671|gb|AAK76214.1| conserved hypothetical protein [Streptococcus pneumoniae TIGR4]
gi|353800742|gb|EHD81051.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA07643]
gi|353884503|gb|EHE64302.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP01]
gi|379563089|gb|EHZ28094.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA18068]
gi|395605861|gb|EJG65975.1| large secreted protein [Streptococcus pneumoniae 2081074]
gi|395612201|gb|EJG72246.1| large secreted protein [Streptococcus pneumoniae 2082170]
Length = 764
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 258/780 (33%), Positives = 389/780 (49%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTVFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|333029856|ref|ZP_08457917.1| Alpha-L-fucosidase [Bacteroides coprosuis DSM 18011]
gi|332740453|gb|EGJ70935.1| Alpha-L-fucosidase [Bacteroides coprosuis DSM 18011]
Length = 816
Score = 392 bits (1008), Expect = e-106, Method: Compositional matrix adjust.
Identities = 267/786 (33%), Positives = 391/786 (49%), Gaps = 118/786 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD------YTNPDAPKA--------- 71
++PIGNG GA + G V + + LNE TLW G P Y N + A
Sbjct: 62 SLPIGNGSFGANIMGSVSVDRVTLNEKTLWRGGPNTANGASYYWNVNKLSAKYLPIIRQA 121
Query: 72 -----LSDVRSLVDS---GQYA-EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
L VR+L ++ G A E T S FG + LG++ LE + L+ E
Sbjct: 122 FMDKDLDKVRTLTENNFNGLAAYEETDESPFRFGS----FTTLGELYLE---TGLEEKEI 174
Query: 123 T-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGS--------------- 166
+ Y+R L L++A V + N ++R +F+S PD VIV + +
Sbjct: 175 SDYKRALSLDSAVVNVSFKEKNTMYSRSYFASYPDSVIVIRYTSEQKAKQNIKLFYAPNP 234
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
ES + D +L +N N Q +E +C IP + GI
Sbjct: 235 ESRGVCIKKGSDRILFKRELLNNNQQFALEIKC----IPIGGYYENIENGI--------S 282
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP--SDSKK----DPTSESMS 280
I D +D V +L A++ + F NP SD K P ++
Sbjct: 283 ICD-----------------ADEVVFVLSAATDYQMNF-NPDFSDPKTYVGLPPEIKTSQ 324
Query: 281 ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKS 340
L + Y+ + HL DYQ LF+RV I L+ S + ++P+ R+
Sbjct: 325 RLLRLNGQDYNQMLNEHLQDYQSLFNRVHIDLN-----------SIHSFSSLPTDLRLAQ 373
Query: 341 FQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMN 399
++ + D + EL +Q+GRYLLI+SSR G+ ANLQG+W+ ++ W H NIN++MN
Sbjct: 374 YKEGKLDKAFEELYYQYGRYLLIASSRIGSMPANLQGLWHNNIDGPWRVDYHNNINIQMN 433
Query: 400 YWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK-V 458
YW + NLSEC PL DF+ L G TAQ Y A GW ++I+ ++ K +
Sbjct: 434 YWPASTANLSECIPPLIDFIKTLVKPGKVTAQSYYNARGWTASISSNIFGFTAPLSSKDM 493
Query: 459 VWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNP 518
W PM G WL TH+W++++YT D DFL++ Y L++ A+F +D+L + +G P
Sbjct: 494 SWNFNPMAGPWLATHVWDYFDYTQDLDFLKETGYELIKESANFAVDYLWKMPNGVYSAAP 553
Query: 519 STSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
STSPEH + +T A+IR+V S I A+++L +++D E + L
Sbjct: 554 STSPEH---------GPIDQGATFVHAVIRQVLSNAIEASKLLREDDDNRQEWI-AVLNN 603
Query: 579 LRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
L P ++ G +MEW++D DP +HRH++HLFGL PG++I+ P L AA+ L+ R
Sbjct: 604 LAPYQVGRYGQLMEWSEDIDDPNDNHRHVNHLFGLHPGNSISPITTPQLADAAKVVLEHR 663
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G+ GWS+ WK WARL D HAY++ + L + G NL+ HPPFQ
Sbjct: 664 GDFATGWSMGWKLNQWARLLDGNHAYKLFQNL-----------LQCGTLPNLWDTHPPFQ 712
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG A V EML+QS + ++LLPALP D W +G + GL ARG VS+ WK +L
Sbjct: 713 IDGNFGGIAGVMEMLLQSHMGFIHLLPALP-DAWDTGSISGLVARGNFEVSMVWKKCELI 771
Query: 759 EVGIYS 764
E I+S
Sbjct: 772 ETQIFS 777
>gi|169834518|ref|YP_001695515.1| large secreted protein [Streptococcus pneumoniae Hungary19A-6]
gi|168997020|gb|ACA37632.1| large secreted protein [Streptococcus pneumoniae Hungary19A-6]
Length = 764
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 258/780 (33%), Positives = 389/780 (49%), Gaps = 93/780 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAAANWNEALPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVVFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S + +I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYSKGCL--------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWIVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L N NLF HPPF
Sbjct: 615 HASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 722
>gi|29347187|ref|NP_810690.1| hypothetical protein BT_1777 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29339086|gb|AAO76884.1| glycoside hydrolase family 95 [Bacteroides thetaiotaomicron
VPI-5482]
Length = 1019
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 253/699 (36%), Positives = 373/699 (53%), Gaps = 48/699 (6%)
Query: 109 ELEFDDS-HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGS 166
EL D S L Y++ Y R LD++ A V Y + F RE+F S PD V+V ++ S S
Sbjct: 324 ELSIDASTELPYSD--YTRTLDIDNAIHTVMYKENGITFKREYFMSYPDNVMVMRLTSDS 381
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
+ G LS +SL+SL + + + I M G P K + G++++ L +K
Sbjct: 382 KKGKLSRIISLESLHTDKTITADGHTITMTGY-PTPVSGDKRVGDAWKNGLKYAQQLVVK 440
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSALQS 284
+ G IS ++ KLKVE +D ++L+ A++++ + + S++DP + + L
Sbjct: 441 --NKGGKISVVDGTKLKVEDADEIIVLMSAATNYVQCMDDSYNYFSQEDPLEKVQATLHK 498
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE-ENIDTVPSAERVKSFQT 343
+ + Y+ L H DY L+ R+ + L P+ V T S + +D ++E+
Sbjct: 499 VADKKYTALLATHQKDYHSLYDRMRLNLGNLPEAPVAPTDSLLKGMDENTNSEQ------ 552
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W++ H NIN++MNYW +
Sbjct: 553 -ENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNADYHTNINIQMNYWPT 611
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGK 457
P NLS C P+ +++ L G TAQ Y GWV HH+ +IW ++A K
Sbjct: 612 QPTNLSPCHLPMVEYVRSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIW-DNTAPAKK 670
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
+P G W+C +WE+Y + +D+DFL+K +L+ ++ + + DG L N
Sbjct: 671 STPHHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAALFWVDNLWTDERDGTLVAN 730
Query: 518 PSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
PS SPEH EF L C + A+I E+F +I A++ L +++D + ++ ++
Sbjct: 731 PSHSPEHGEF-----SLGC-----STSQAMICEMFDMMIKASKELGRDKDPEIIEIATAM 780
Query: 577 PRLRPTKIAEDGSIMEWAQDF-KD--PEVHHRHLSHLFGLFPGHTITI---EKNPDLCKA 630
+L KI G MEW + KD + HRH +HLF L PG I I E++ A
Sbjct: 781 SKLSGPKIGLGGQFMEWKDEVTKDVTGDGGHRHTNHLFWLHPGSQIVIGRSEQDDKYADA 840
Query: 631 AEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNL 690
+ TL RG+EG GWS WK WARLHD ++++++ L P GG+Y+NL
Sbjct: 841 MKVTLNTRGDEGTGWSKAWKLNFWARLHDGNRSHKLLRSAMKLTVPGSHV---GGVYTNL 897
Query: 691 FAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSI 750
F AHPPFQID NFG TA +AEML+QS + LLPALP D W +G KG+KARG V
Sbjct: 898 FDAHPPFQIDGNFGCTAGIAEMLMQSQGGYIELLPALP-DAWKNGSFKGMKARGNFEVDA 956
Query: 751 CWKDGDLHEVGIYSNYSNN---DHDSFKTLHYRGTSVKV 786
W DG + + I SN + + K L+ G VKV
Sbjct: 957 AWTDGKITAIEILSNSGAECVIKYPNAKELNVSGAKVKV 995
Score = 58.2 bits (139), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 42/63 (66%), Gaps = 1/63 (1%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
MM LK T+N PAK++ ++A+PIGNG +GAM++G V + ++ NE TLW+G
Sbjct: 23 MMACSEQPHQPTLKATYNKPAKNWESEALPIGNGYMGAMIFGDVYVDVIQTNEHTLWSGG 82
Query: 60 PGD 62
PG+
Sbjct: 83 PGE 85
>gi|427404601|ref|ZP_18895341.1| hypothetical protein HMPREF9710_04937 [Massilia timonae CCUG 45783]
gi|425716772|gb|EKU79741.1| hypothetical protein HMPREF9710_04937 [Massilia timonae CCUG 45783]
Length = 764
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 261/792 (32%), Positives = 393/792 (49%), Gaps = 108/792 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
+A+PIGNGR+GAMV+G E L+ N+ TLWTG D +++
Sbjct: 46 EALPIGNGRIGAMVFGQPGREHLQFNDITLWTG---------------DDKTM------- 83
Query: 86 EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE 145
+Q GD+ +E + YRR LDL V Y+ G V
Sbjct: 84 --------------GAFQPFGDLLVELPGHESGVTD--YRRTLDLGRGVHTVTYTHGGVR 127
Query: 146 FTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
+ RE ++S P QVIV +++ G S VSL H V N ++ G G +P
Sbjct: 128 YRREAWASFPAQVIVLRLTADRPGRYSGAVSLTDRHGAHLAV-ANGRLHATGTLAGFALP 186
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
+A P G S + ++ D G ++A + +++ G+D L+L A +S+ +
Sbjct: 187 DQA-----PSGNVMSYASQAQVISDGGKLTA-DGQRIAFAGADGLTLILGAGTSY---VL 237
Query: 266 NPSDSKKD--PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDT 323
+ + + P + + + + + L H++D+++L RV+I L +P
Sbjct: 238 DAARRFEGGHPLARVTAQVDQAAARAPAALLEEHVEDFRRLMQRVAIDLGETPA------ 291
Query: 324 CSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDL 382
+P+ R+ ++ + DP L FQ+GRYLL SSSR G+ ANLQG+WN L
Sbjct: 292 ----ARRALPTDARLLAYTKAGGDPELEAQYFQYGRYLLASSSR-GSLPANLQGLWNNSL 346
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS----- 437
+P W++ H NIN++MNYW + NL E P FDF+ ++ + + +
Sbjct: 347 TPPWNADYHTNINVQMNYWPAEVTNLGESALPFFDFVNGMAPVWRRATTEEFRRADGQPV 406
Query: 438 -GWVIHHKTDIWAKSSADRGKVVWALW-PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
GW + +++ + LW G AW H WEHY + D FL + AYP++
Sbjct: 407 RGWTLRTESNPFGAMDY--------LWNKTGNAWYAQHFWEHYAFNRDERFLREVAYPVM 458
Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
+ ++F D+L DG L SPEH + DG V+Y D I+ ++F+ +
Sbjct: 459 KEASAFWQDYLKALPDGRLVAPQGWSPEHGPVE-DG----VAY----DQQIVWDLFNNTV 509
Query: 556 SAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVH-----HRHLSHL 610
AA +L + D L ++ RL +I G ++EW ++ KDP + HRH+SHL
Sbjct: 510 EAAGILRVDPD-LRAQLAAMRDRLAGPRIGSWGQLLEWLEEKKDPVLDTPRDTHRHVSHL 568
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRL 670
F LFPG I + P+L +AA +TL+ RG+ G GWS+ WK A WARLH+ E A+RM++ L
Sbjct: 569 FALFPGRQIDPVRTPELARAARRTLEARGDAGTGWSMAWKMAFWARLHEGERAHRMLRGL 628
Query: 671 FNL----------VDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
V EH GG Y NL AHPPFQID NFG TAA+AEML+QS +
Sbjct: 629 LAAPGARAAEQAGVFSEHNN--AGGTYPNLLDAHPPFQIDGNFGATAAIAEMLLQSQGGE 686
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYR 780
L+LLPALP W+ G VKGL+ARGG V + W DG L V + + N D + Y
Sbjct: 687 LHLLPALP-SAWARGAVKGLRARGGYEVDLRWADGRLQGVTVRAVAGN---DGPVKIRYG 742
Query: 781 GTSVKVNLSAGK 792
++++L+ G+
Sbjct: 743 AKRIEIDLATGQ 754
>gi|383125191|ref|ZP_09945845.1| hypothetical protein BSIG_4345 [Bacteroides sp. 1_1_6]
gi|382983436|gb|EES66608.2| hypothetical protein BSIG_4345 [Bacteroides sp. 1_1_6]
Length = 1019
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 252/699 (36%), Positives = 372/699 (53%), Gaps = 48/699 (6%)
Query: 109 ELEFDDS-HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGS 166
EL D S L Y++ Y R LD++ A V Y + F RE+F S PD V+V ++ S S
Sbjct: 324 ELSIDASTELPYSD--YTRTLDIDNAIHTVMYKENGITFKREYFMSYPDNVMVMRLTSDS 381
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
+ G LS +SL+SL + + + I M G P K + G+ ++ L +K
Sbjct: 382 KKGKLSRIISLESLHTDKTITADGHTITMTGY-PTPVSGDKRVGDAWKNGLIYAQQLVVK 440
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSALQS 284
+ G IS ++ KLKVE +D ++L+ A++++ + + S++DP + + L
Sbjct: 441 --NKGGKISVVDGTKLKVEDADEIIVLMSAATNYVQCMDDSYNYFSQEDPLEKVQATLHK 498
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE-ENIDTVPSAERVKSFQT 343
+ + Y+ L H DY L+ R+ + L P+ V T S + +D ++E+
Sbjct: 499 VADKKYTALLATHQKDYHSLYDRMRLNLGNLPEAPVAPTDSLLKGMDENTNSEQ------ 552
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W++ H NIN++MNYW +
Sbjct: 553 -ENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNADYHTNINIQMNYWPT 611
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGK 457
P NLS C P+ +++ L G TAQ Y GWV HH+ +IW ++ + K
Sbjct: 612 QPTNLSPCHLPMVEYVRSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIWGNTAPAK-K 670
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
+P G W+C +WE+Y + +D+DFL+K +L+ ++ + + DG L N
Sbjct: 671 STPHHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAALFWVDNLWTDERDGTLVAN 730
Query: 518 PSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
PS SPEH EF L C + A+I E+F +I A++ L +++D + ++ ++
Sbjct: 731 PSHSPEHGEF-----SLGC-----STSQAMICEMFDMMIKASKELGRDKDPEIIEIATAM 780
Query: 577 PRLRPTKIAEDGSIMEWAQDF-KD--PEVHHRHLSHLFGLFPGHTITI---EKNPDLCKA 630
+L KI G MEW + KD + HRH +HLF L PG I I E++ A
Sbjct: 781 SKLSGPKIGLGGQFMEWKDEVTKDVTGDGGHRHTNHLFWLHPGSQIVIGRSEQDDKYADA 840
Query: 631 AEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNL 690
+ TL RG+EG GWS WK WARLHD ++++++ L P GG+Y+NL
Sbjct: 841 MKVTLNTRGDEGTGWSKAWKLNFWARLHDGNRSHKLLRSAMKLTVPGSH---VGGVYTNL 897
Query: 691 FAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSI 750
F AHPPFQID NFG TA +AEML+QS + LLPALP D W +G KG+KARG V
Sbjct: 898 FDAHPPFQIDGNFGCTAGIAEMLMQSQGGYIELLPALP-DAWKNGSFKGMKARGNFEVDA 956
Query: 751 CWKDGDLHEVGIYSNYSNN---DHDSFKTLHYRGTSVKV 786
W DG + + I SN + + K L+ G VKV
Sbjct: 957 AWTDGKITAIEILSNSGAECVIKYPNAKELNVSGAKVKV 995
Score = 58.5 bits (140), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/63 (47%), Positives = 44/63 (69%), Gaps = 2/63 (3%)
Query: 2 MNAESTSTTNP-LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
M A S P LK T+N PAK++ ++A+PIGNG +GAM++G V + ++ NE TLW+G
Sbjct: 23 MTACSEQPYQPTLKATYNKPAKNWESEALPIGNGYMGAMIFGDVYVDVIQTNEHTLWSGG 82
Query: 60 PGD 62
PG+
Sbjct: 83 PGE 85
>gi|414868292|tpg|DAA46849.1| TPA: hypothetical protein ZEAMMB73_390456 [Zea mays]
Length = 457
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 214/404 (52%), Positives = 269/404 (66%), Gaps = 30/404 (7%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
PLK+ F PAK+FTDA PIGNGRLGAMVWG V SE L+LN DTLWTG PG+YTNP+AP
Sbjct: 41 PLKVVFGSPAKYFTDAAPIGNGRLGAMVWGCVESERLQLNHDTLWTGGPGNYTNPNAPAV 100
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
LS VRSLV++G+Y EAT+A+ L G V+Q LGDI+L F + +KY YRRELDL+
Sbjct: 101 LSKVRSLVENGKYPEATSAAYDLSGDQTQVFQPLGDIDLVFGED-IKYTN--YRRELDLH 157
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
TAT V Y+VG++ +TREHFSSNP QVIVTKIS ++ G++SF VSL S LD+ V N
Sbjct: 158 TATVTVTYTVGDIVYTREHFSSNPHQVIVTKISANKPGNVSFTVSLTSPLDHKIRVTHAN 217
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
+IIMEG CPG+R A D P GI+FSAIL ++I+ T+ L D LK++ +D V
Sbjct: 218 EIIMEGSCPGQRPEEIKTAADQPIGIKFSAILYLQINGANSTVEVLNDNMLKLDCADSVV 277
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
LLL A++SF FI PS+SK DPT + + L R SYS L H+DDYQ LF RVS+Q
Sbjct: 278 LLLAATTSFQSAFIKPSESKLDPTVSAFTTLSIARRTSYSQLKAYHIDDYQTLFQRVSLQ 337
Query: 312 LS-------RSPKDIVTDTCSEENIDTV--------------------PSAERVKSFQTD 344
LS R + + + S + + P+ ER+ +F+ +
Sbjct: 338 LSQGSNYDLRRSRLVQSAETSSQGANVSDYGFQISGCTRLTSLNSFVKPTVERIVTFKDN 397
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
EDPSLVELLFQFGRYLLIS SRPGTQ++NLQGIW+ D SP WD+
Sbjct: 398 EDPSLVELLFQFGRYLLISCSRPGTQISNLQGIWSNDTSPPWDT 441
>gi|332671290|ref|YP_004454298.1| alpha-L-fucosidase [Cellulomonas fimi ATCC 484]
gi|332340328|gb|AEE46911.1| Alpha-L-fucosidase [Cellulomonas fimi ATCC 484]
Length = 820
Score = 390 bits (1003), Expect = e-105, Method: Compositional matrix adjust.
Identities = 272/819 (33%), Positives = 405/819 (49%), Gaps = 66/819 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPG-------DYTNP 66
++F+GPA+ + +A P+GNGRLGAM+ GG +++N+ T W+G V G
Sbjct: 30 LSFDGPARRWVEAFPVGNGRLGAMLHGGTERALVQVNDATAWSGRVDGPARALAAVRAAG 89
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR- 125
P L+ R + +G++ EA G +Q D+ + S + A+ +R
Sbjct: 90 AGPDRLARARDALAAGRHDEAADLLAVFQGPWTQAFQPFVDLHVTVA-SAPRPAQVRHRD 148
Query: 126 ---RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
R LDL R + G VE E F+S D + + S +E + +S +
Sbjct: 149 DSPRTLDLRDGVVRERLPAG-VEV--EWFASAVDGALHGRWSAAEPFDVHVELSTPHHVR 205
Query: 183 NHSYVNGNNQIIMEGRCPGKRIP------PKANANDDPKGIQFSAILEIKISDDRGTISA 236
+ G +++E P P P DD + A+L ++ G +
Sbjct: 206 TDHHAPGGRVLVLE--LPDDVAPGHEPDAPAVTRTDDGASLTGVAVL---LACGDGEVGG 260
Query: 237 LEDKKLKVEGSDWAVLLLVASSSF----DGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
L+VE + W ++L ++ DGP + + D + + AL R +
Sbjct: 261 TPGGALRVERATWVEVVLATGTTSPWPQDGPLRDREEVVADVLACARRALPGDRGTGDA- 319
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVEL 352
RH+ D++++ + L P D+ D + I T P A +L +
Sbjct: 320 TRARHVADHRRIADATVLALV--PHDL--DLRLPDAIGTTPHA------------ALAQA 363
Query: 353 LFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
+F GRYLLI+SSRPG+ ANLQG+WN D P W S +N+NLEM YW + L EC
Sbjct: 364 VFDHGRYLLIASSRPGSPPANLQGVWNADPRPPWSSNYTLNVNLEMAYWGAEAVGLGECH 423
Query: 413 EPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAW 469
EPL + L+ +G+ A+ Y GWV HH +D+W + A G WA W MGG W
Sbjct: 424 EPLLAHVGLLARHGAHVARELYGCQGWVAHHNSDVWGWALPVGAGHGDPSWAQWWMGGVW 483
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP 529
LC HLW+H + D FL A+PLL G A F LDWL+E DG L T+PSTSPE++F P
Sbjct: 484 LCRHLWDHADVGGDDAFLRDEAWPLLRGAALFCLDWLVEAPDGSLTTSPSTSPENQFRLP 543
Query: 530 D------GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
G + ++ STMD+A++R++ + + L+ +D L ++ +L RL
Sbjct: 544 SSADGTGGGVGALATGSTMDLALVRDLLERCLDTIDRLDL-DDPLEGRLRSALARLARPV 602
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP 643
+ DG + EWA D + HHRHLSHL GL+P H + ++ PDL AA ++L RG
Sbjct: 603 VGPDGLLREWAHDAPAVDPHHRHLSHLVGLYPLHQVDVDATPDLAAAAARSLDARGPGST 662
Query: 644 GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEH--EKHFEGGLYSNLFAAHPPFQIDA 701
GWS+ WKTAL ARL D ++ D ++GGL NLF+ HPPFQ+D
Sbjct: 663 GWSLAWKTALRARLGDGVAVGDLLAEAMRPADASSTVSSPWQGGLLPNLFSTHPPFQVDG 722
Query: 702 NFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVG 761
N G AAVAE LVQS L +LPALP +W G V+G++ARGG V + W G L +V
Sbjct: 723 NLGVVAAVAEALVQSAPGRLRVLPALP-PQWPDGSVRGVRARGGLRVDVTWSGGRLTQVV 781
Query: 762 IYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQL 800
+++ + + +H +S ++L AG + + L
Sbjct: 782 LHAARGG----TLEVVHGP-SSRTLDLEAGDVRRLDGHL 815
>gi|451854086|gb|EMD67379.1| glycoside hydrolase family 95 protein [Cochliobolus sativus ND90Pr]
Length = 805
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 259/765 (33%), Positives = 384/765 (50%), Gaps = 81/765 (10%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
+TDA+PIGNGRLGAM++G E ++LNE+T+W+G D N + + +S+VR L+ G
Sbjct: 36 WTDALPIGNGRLGAMIYGIPVQERIQLNEETIWSGGRRDRVNQNGAQTVSEVRDLLARGD 95
Query: 84 YAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
A A A++ + G P YQ LGD+E+ FD + KY + TY R LDL+TA A V++
Sbjct: 96 AAGAQKLANLGMMGTPQTCRNYQTLGDMEISFDGTS-KYDKTTYERWLDLDTALAGVRFR 154
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYV-----NGNNQIIM 195
V + + RE F S PD V V ++ + + LSF + + D + N N M
Sbjct: 155 VNDTLYEREMFVSVPDDVFVHRLKATGNEKLSFQIRVHRPKDGLNEASDQNWNENGWTYM 214
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
G G DP + F+ L I+ T+ + VE + A L
Sbjct: 215 TGGTGGI----------DP--VVFTTALAIESDGHVRTLGEF----IVVENATEATAFLA 258
Query: 256 ASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
A++S+ D + S +Q R +Y +L RH++DY ++ + L+
Sbjct: 259 AATSY---------RHNDTRAAVESTIQKARQHTYEELRRRHIEDYAPFYNASVLNLN-G 308
Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANL 374
P +D +P+ R+ + + DP LV L + +GRYLLI+SSR G +NL
Sbjct: 309 PDLKTSD---------LPTNARINATRKGANDPGLVALAYNYGRYLLIASSRAGNLPSNL 359
Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
QGIWN++ P W S VNINL+MNYW + +LS P FD L + +G TA+ Y
Sbjct: 360 QGIWNKEFDPLWGSKYTVNINLQMNYWPAEVTSLSSLHAPFFDLLELMRKDGMHTAKAMY 419
Query: 435 LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
ASGW+ HH TD+W ++ + W + WL TH+ EHY YT D+ FL P+
Sbjct: 420 NASGWMSHHNTDLWGDTAPVDTYLPATYWTLSSGWLVTHILEHYWYTGDKGFLASN-LPI 478
Query: 495 LEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREV 550
+ F LD L G + YL TNPS SPE+ ++ PDGK + T D+ I+ E+
Sbjct: 479 VSEAIEFYLDTLQPYKANGTE-YLVTNPSVSPENTYVGPDGKSYNFDTAPTCDVQILNEL 537
Query: 551 FSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GSIMEWAQDFKDPEVHHRH 606
F+ ++A L + + A + ++ + +L P + + G++ EW QD++ E HRH
Sbjct: 538 FTNYLNAVATLSNSTVDSAFLTRIRDTQAKLPPYRYSTRYPGTLQEWMQDYEQAEPGHRH 597
Query: 607 LSHLFGLFPGHTITIEKNP----DLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHD 659
+SHL+ L+PG I P L AA TL+ R G GWS W +ARL +
Sbjct: 598 VSHLYALYPGTQIPPPGAPGYDAKLFNAAAATLEDRLSHNGAGTGWSRAWTINWYARLQN 657
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP-FQIDANFGFTAAVAEMLVQSTL 718
+ + FN +++NL + FQID N GF + VAE L+QS +
Sbjct: 658 RTALAENTFQFFNT-----------SVFNNLMDVNEGIFQIDGNLGFVSGVAEGLLQSHV 706
Query: 719 ND------LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
D ++LLP LP + W+ G V G+ ARGG + W DG L
Sbjct: 707 VDDKGVREVWLLPVLP-EAWNDGSVNGIAARGGFVFDLEWADGKL 750
>gi|295085851|emb|CBK67374.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 729
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 236/674 (35%), Positives = 353/674 (52%), Gaps = 62/674 (9%)
Query: 101 VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIV 160
+ +G++ +E S + + YRR L L++A A V++ + + R++F S PD V+V
Sbjct: 71 AFTTMGELYVETGLSEINMS--NYRRILSLDSAMAVVQFYKDGIRYQRKYFISYPDSVMV 128
Query: 161 TKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQ 218
K + + G + +S ++ +H +GN+ ++ G + G++
Sbjct: 129 MKFTADKGGKQNLVLSYCPNNEAKSHLEADGNDGLVYTGVL-------------NNNGMK 175
Query: 219 FSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK-----D 273
F+ IK GT+ A E+ ++ V+ +D V LL A + + F K D
Sbjct: 176 FA--FRIKAIHKGGTLKA-ENDRIIVKDADEVVFLLTADTDYKMNFAPDFKDPKAYVGND 232
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P+ +++ + + Y +LY H DY LF+RV +++ E +P
Sbjct: 233 PSQTTLAMMDNALKKGYDELYRNHEADYTALFNRVRFEIN-----------PEIGTPNLP 281
Query: 334 SAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHV 392
+ +R+ S++ D L +L +QFGRYLLI+SSRPG ANLQG+W+ + W H
Sbjct: 282 TYKRLASYKKGVPDYQLEQLYYQFGRYLLIASSRPGNMPANLQGLWHNNTDGPWRVDYHN 341
Query: 393 NINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSS 452
NIN++MNYW + P NLSEC PL DF+ L G KTAQ + A GW +I+ ++
Sbjct: 342 NINIQMNYWPACPTNLSECTWPLIDFIRSLVKPGEKTAQAYFNARGWTASISANIFGFTA 401
Query: 453 ADRGK-VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
K + W L P G WL TH+WE+Y+YT D FL++ Y L++ A F +D L D
Sbjct: 402 PLSSKSMAWNLNPTVGPWLATHIWEYYDYTRDTKFLKEIGYDLIKSSAQFAVDHLWHKPD 461
Query: 512 GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEK 571
G PSTSPEH V T A++RE+ I A++VL DA K
Sbjct: 462 GTYTAAPSTSPEH---------GPVDEGVTFAHAVVREILLDAIQASKVL--GTDAKERK 510
Query: 572 VLKS-LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKA 630
++ L +L P +I G ++EW+ D DP+ HRH++HLFGL PGHTI+ P+L +A
Sbjct: 511 QWENVLTKLVPYRIGRYGQLLEWSTDIDDPKDEHRHVNHLFGLHPGHTISPVTTPELAQA 570
Query: 631 AEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNL 690
A L+ RG+ GWS+ WK WARL D HAY++ L + G NL
Sbjct: 571 ARVVLEHRGDGATGWSMGWKLNQWARLQDGNHAYKLYGNL-----------LKNGTLDNL 619
Query: 691 FAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSI 750
+ H PFQID NFG TA + EML+QS + + LLPALP D W++G + G+ A+G VSI
Sbjct: 620 WDTHAPFQIDGNFGGTAGITEMLLQSHMGFIQLLPALP-DAWANGSISGICAKGNFEVSI 678
Query: 751 CWKDGDLHEVGIYS 764
WK+G L + I+S
Sbjct: 679 SWKEGQLEKAIIHS 692
>gi|419535657|ref|ZP_14075151.1| hypothetical protein SPAR46_2252 [Streptococcus pneumoniae GA17457]
gi|379561797|gb|EHZ26812.1| hypothetical protein SPAR46_2252 [Streptococcus pneumoniae GA17457]
Length = 746
Score = 389 bits (998), Expect = e-105, Method: Compositional matrix adjust.
Identities = 257/765 (33%), Positives = 382/765 (49%), Gaps = 93/765 (12%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEA 87
+PIGNG LG M++G E ++LN++T+W D NPD+ L +R + G+ +A
Sbjct: 1 MPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKA 60
Query: 88 TA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVG 142
+ +F P D Y+LLG++ +E D A Y RELDL+TA + V + +
Sbjct: 61 EELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSC 119
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCP 200
N++ RE+F+S ++ +I S +L+ N++L + ++ ++ I+M
Sbjct: 120 NLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAG 179
Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
G+ KG+QF + K++D G +S L + + + + L L + + +
Sbjct: 180 GR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDY 224
Query: 261 DGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
G +S+LQ ++ Y H+ YQ+ F+RV +L S KD
Sbjct: 225 WGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDC 270
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
++ I T E K + L LLF +GRYLLISSS+P ANLQGIW
Sbjct: 271 LS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWC 319
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++L+P W S +NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+
Sbjct: 320 DELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGF 379
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
HH TD + ++ + A+W + WLCTH+WEHY Y D L + + +++
Sbjct: 380 TAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAF 438
Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
F D+L E DGYL T PS SPE+++ +G SST+D I+R + I A+
Sbjct: 439 LFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAK 497
Query: 560 VLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGH 617
L N D + V+++ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P +
Sbjct: 498 QLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYN 554
Query: 618 TITIEKNPDLCKAAEKTLQKR-------------------------GEEGPGWSITWKTA 652
I I K P+L +AA+ T+ +R GWS W
Sbjct: 555 EIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIH 614
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
+ARL+ E AY + L N NLF HPPFQID N G + + E+
Sbjct: 615 FFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICEL 663
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 707
>gi|419441357|ref|ZP_13981397.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40410]
gi|421282151|ref|ZP_15732944.1| hypothetical protein SPAR155_2159 [Streptococcus pneumoniae
GA04672]
gi|421308367|ref|ZP_15759005.1| hypothetical protein SPAR166_2207 [Streptococcus pneumoniae
GA60132]
gi|421310565|ref|ZP_15761187.1| hypothetical protein SPAR168_2146 [Streptococcus pneumoniae
GA62681]
gi|421312927|ref|ZP_15763524.1| hypothetical protein SPAR167_2288 [Streptococcus pneumoniae
GA58981]
gi|379576014|gb|EHZ40943.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA40410]
gi|395878598|gb|EJG89661.1| hypothetical protein SPAR155_2159 [Streptococcus pneumoniae
GA04672]
gi|395905170|gb|EJH16076.1| hypothetical protein SPAR166_2207 [Streptococcus pneumoniae
GA60132]
gi|395907679|gb|EJH18569.1| hypothetical protein SPAR167_2288 [Streptococcus pneumoniae
GA58981]
gi|395908180|gb|EJH19063.1| hypothetical protein SPAR168_2146 [Streptococcus pneumoniae
GA62681]
Length = 749
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 257/765 (33%), Positives = 382/765 (49%), Gaps = 93/765 (12%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEA 87
+PIGNG LG M++G E ++LN++T+W D NPD+ L +R + G+ +A
Sbjct: 1 MPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKA 60
Query: 88 TA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVG 142
+ +F P D Y+LLG++ +E D A Y RELDL+TA + V + +
Sbjct: 61 EELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSC 119
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCP 200
N++ RE+F+S ++ +I S +L+ N++L + ++ ++ I+M
Sbjct: 120 NLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAG 179
Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
G+ KG+QF + K++D G +S L + + + + L L + + +
Sbjct: 180 GR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDY 224
Query: 261 DGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
G +S+LQ ++ Y H+ YQ+ F+RV +L S KD
Sbjct: 225 WGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDC 270
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
++ I T E K + L LLF +GRYLLISSS+P ANLQGIW
Sbjct: 271 LS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWC 319
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++L+P W S +NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+
Sbjct: 320 DELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGF 379
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
HH TD + ++ + A+W + WLCTH+WEHY Y D L + + +++
Sbjct: 380 TAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAF 438
Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
F D+L E DGYL T PS SPE+++ +G SST+D I+R + I A+
Sbjct: 439 LFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAK 497
Query: 560 VLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGH 617
L N D + V+++ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P +
Sbjct: 498 QLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYN 554
Query: 618 TITIEKNPDLCKAAEKTLQKR-------------------------GEEGPGWSITWKTA 652
I I K P+L +AA+ T+ +R GWS W
Sbjct: 555 EIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIH 614
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
+ARL+ E AY + L N NLF HPPFQID N G + + E+
Sbjct: 615 FFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICEL 663
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 707
>gi|298387491|ref|ZP_06997043.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
gi|298259698|gb|EFI02570.1| alpha-L-fucosidase 2 [Bacteroides sp. 1_1_14]
Length = 1036
Score = 387 bits (995), Expect = e-104, Method: Compositional matrix adjust.
Identities = 251/699 (35%), Positives = 371/699 (53%), Gaps = 48/699 (6%)
Query: 109 ELEFDDS-HLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGS 166
EL D S L Y++ Y R LD++ A V Y + F RE+F S PD V+V ++ S S
Sbjct: 341 ELSIDASTELPYSD--YTRTLDIDNAIHTVMYKENGITFKREYFMSYPDNVMVMRLTSDS 398
Query: 167 ESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIK 226
+ G LS +SL+SL + + ++ I M G P K + G++++ L +K
Sbjct: 399 KKGKLSRIISLESLHTDKTITADSHTITMTGY-PTPVSGDKRIGDAWKNGLKYAQQLVVK 457
Query: 227 ISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSALQS 284
+ G +S ++ KLKVE +D ++L+ A++++ + + S++DP + + L
Sbjct: 458 --NKGGKVSVVDGTKLKVEDADEIIVLMSAATNYVQCMDDSYNYFSQEDPLEKVQATLHK 515
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE-ENIDTVPSAERVKSFQT 343
+ + Y+ L H DY L+ R+ + L P+ V T S + +D ++E+
Sbjct: 516 VADKKYTALLATHQKDYHSLYDRMRLNLGNLPEAPVAPTDSLLKGMDENTNSEQ------ 569
Query: 344 DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQS 403
E+ L L FQFGRYLLISSSR G+ ANLQG+W E LS W++ H NIN++MNYW +
Sbjct: 570 -ENQYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLSNPWNADYHTNINIQMNYWPT 628
Query: 404 LPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGK 457
NLS C P+ +++ L G TAQ Y GWV HH+ +IW ++ + K
Sbjct: 629 QSTNLSPCHLPMVEYVRSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIWGNTAPAK-K 687
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
+P G W+C +WE+Y + +D+DFL+K +L+ ++ + + DG L N
Sbjct: 688 STPHHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAVLFWVDNLWTDERDGTLVAN 747
Query: 518 PSTSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
PS SPEH EF L C + A+I E+F +I A++ L +++D + ++ ++
Sbjct: 748 PSHSPEHGEF-----SLGC-----STSQAMICEMFDMMIKASKELGRDKDPEIIEIATAM 797
Query: 577 PRLRPTKIAEDGSIMEWAQDF-KD--PEVHHRHLSHLFGLFPGHTITI---EKNPDLCKA 630
+L KI G MEW + KD + HRH +HLF L PG I I E++ A
Sbjct: 798 SKLSGPKIGLGGQFMEWKDEVTKDVTGDGGHRHTNHLFWLHPGSQIVIGRSEQDDKYADA 857
Query: 631 AEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNL 690
+ TL RG+EG GWS WK WARLHD ++++++ L P GG+Y+NL
Sbjct: 858 MKVTLNTRGDEGTGWSKAWKLNFWARLHDGNRSHKLLRSAMKLTVPGSH---VGGVYTNL 914
Query: 691 FAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSI 750
F AHPPFQID NFG TA +AEML+QS + LLPALP D W G KG+KARG V
Sbjct: 915 FDAHPPFQIDGNFGCTAGIAEMLLQSQGGYIELLPALP-DAWKDGSFKGMKARGNFEVDA 973
Query: 751 CWKDGDLHEVGIYSNYSNN---DHDSFKTLHYRGTSVKV 786
W DG + V I SN + + K L G VKV
Sbjct: 974 AWTDGKITAVEILSNSGAECVIKYPNAKELKVSGAKVKV 1012
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/63 (44%), Positives = 42/63 (66%), Gaps = 1/63 (1%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
MM LK T+N PAK++ ++A+PIGNG +GAM++G V + ++ NE TLW+G
Sbjct: 40 MMACSEQPYQPTLKATYNKPAKNWESEALPIGNGYMGAMIFGDVYVDVIQTNEHTLWSGG 99
Query: 60 PGD 62
PG+
Sbjct: 100 PGE 102
>gi|421295152|ref|ZP_15745870.1| alpha-L-fucosidase [Streptococcus pneumoniae GA56113]
gi|395891509|gb|EJH02504.1| alpha-L-fucosidase [Streptococcus pneumoniae GA56113]
Length = 749
Score = 387 bits (995), Expect = e-104, Method: Compositional matrix adjust.
Identities = 256/765 (33%), Positives = 380/765 (49%), Gaps = 93/765 (12%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEA 87
+PIGNG LG M++G E ++LN++T+W D NPD+ L +R + G+ +A
Sbjct: 1 MPIGNGHLGGMIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKA 60
Query: 88 TA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVG 142
+ +F P D Y+LLG++ +E D A Y RELDL+TA + V + +
Sbjct: 61 EELIKLTMFATPRDQSHYELLGELCIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSC 119
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCP 200
N++ RE+F+S ++ +I S +L+ N++L + ++ ++ I+M
Sbjct: 120 NLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAG 179
Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
G+ KG+QF + K++D G +S L + + + + L L + + +
Sbjct: 180 GR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDY 224
Query: 261 DGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
G +S+LQ ++ Y H+ YQ+ F+RV +L S +
Sbjct: 225 WGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL 271
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN 379
+I T E K + L LLF +GRYLLISSS+P ANLQGIW
Sbjct: 272 --------SIPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWC 319
Query: 380 EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGW 439
++L+P W S +NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+
Sbjct: 320 DELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGF 379
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
HH TD + ++ + A+W + WLCTH+WEHY Y D L + + +++
Sbjct: 380 TAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAF 438
Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
F D+L E DGYL T PS SPE+++ +G SST+D I+R + I A+
Sbjct: 439 LFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAK 497
Query: 560 VLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGH 617
L N D + V+++ K LPR TKI +G I EW +D+++ E HRH+S LFGL+P +
Sbjct: 498 QLGDNSDFISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYN 554
Query: 618 TITIEKNPDLCKAAEKTLQKR-------------------------GEEGPGWSITWKTA 652
I I K P+L +AA+ T+ +R GWS W
Sbjct: 555 EIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGLHASTQTGWSAAWLIH 614
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
+ARL+ E AY + L N NLF HPPFQID N G + + E+
Sbjct: 615 FFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICEL 663
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LVQS N L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 707
>gi|295835067|ref|ZP_06822000.1| fibronectin type III domain-containing protein [Streptomyces sp.
SPB74]
gi|197698025|gb|EDY44958.1| fibronectin type III domain-containing protein [Streptomyces sp.
SPB74]
Length = 790
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 263/799 (32%), Positives = 389/799 (48%), Gaps = 77/799 (9%)
Query: 15 ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNP 66
+ + PA + T ++P+GNG LGA V+G +P+E ++ E TLWTG PG ++ NP
Sbjct: 53 LRYTAPATDWETQSLPVGNGALGASVFGTLPTEHVQFAEKTLWTGGPGTPGYRYGNWENP 112
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
P ALS VR+ +++ A+ +L G P Y Q GD L D + +
Sbjct: 113 R-PDALSSVRADIEARTKITPEDAAARL-GQPRIGYGGHQTFGD--LLIDVAGAPASANG 168
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y R LDL A V Y F R F+S PD+V+V + GS+ ++ S +
Sbjct: 169 YSRTLDLAQGLAGVTYPHDGTTFRRTVFASYPDKVLVGHFTADRGGSVELSLRYTSPRQD 228
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLK 243
+ +++ + G G++F A +I++ + GT+SA D+ L
Sbjct: 229 FTATASGDRLTLRGAL-------------QDNGMRFEA--QIRLLSEGGTVSANGDR-LT 272
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
V G+D A +L A + + + P DP A+ Y +L RH D+
Sbjct: 273 VSGADSAWFVLSAGTDYADTY--PGYRGADPHDRVTGAVNQAAARPYRELLDRHTSDHGG 330
Query: 304 LFHRVSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
LF RV + L + S D TD + +A+R +L L FQ+GRYLLI
Sbjct: 331 LFSRVVLDLGQQSAPDQSTDALLKAYTGGNSAADR----------ALEALFFQYGRYLLI 380
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
+SSR G+ ANLQG WN +P W + HVNINL+MNYW + NL+E P F+ L
Sbjct: 381 ASSRAGSLPANLQGAWNNSTTPPWSADYHVNINLQMNYWPAEATNLAETTAPYDRFVEAL 440
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+ G TAQ + A GWV+H +T + + D W +P AWL + L+EHY +
Sbjct: 441 RVPGRTTAQSMFGARGWVVHDETTPFGFTGVHDWPTSFW--FPEAAAWLTSQLYEHYRFD 498
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEH-EFIAPDGKLACVSYS 539
D+L AYP ++ A F +D L + D L PS SPEH +F A
Sbjct: 499 GSTDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPSFSPEHGDFTA----------G 548
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEWAQDFK 598
+ M I+ E+F+ + AA+ L ++ A ++ ++L R+ P ++ G +MEW D
Sbjct: 549 AAMSQQIVHELFTNTLEAAQTL-GDDPAFRGRLKETLDRIDPGLRVGSWGQLMEWKTDLD 607
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLH 658
HRH+SHL+ L PG IE L +AA+ +L RG+ G GWS WK WARL
Sbjct: 608 GRTDDHRHVSHLYALHPGR--AIEPGSALAEAAKVSLTARGDGGTGWSKAWKINFWARLR 665
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
D HA+ M+ + +NL+ HPPFQID NFG T+ + EML+QS
Sbjct: 666 DGNHAHTMLA-----------EQLRNSTLANLWDTHPPFQIDGNFGATSGITEMLLQSQH 714
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
+ + +LPALP WS G V+GL+ARGG T+ + W G + + + S + +
Sbjct: 715 DVIDVLPALP-AAWSDGTVRGLRARGGATLDVTWAGGKATRIALTA--SRTRELTVRNSL 771
Query: 779 YRGTSVKVNLSAGKIYTFN 797
G + AG+ YT+
Sbjct: 772 VPGGTTTFKAVAGETYTWQ 790
>gi|380696427|ref|ZP_09861286.1| hypothetical protein BfaeM_21066 [Bacteroides faecis MAJ27]
Length = 1014
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 238/672 (35%), Positives = 352/672 (52%), Gaps = 48/672 (7%)
Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI-SGSESGSLS 172
D+ L+ Y R LD++ A V Y G + F RE+F S PD V+V ++ S + G LS
Sbjct: 328 DASLELPYSDYARTLDIDNAIHTVTYKEGGITFKREYFMSYPDHVMVMRLTSDANEGKLS 387
Query: 173 FNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
+SL+SL + N I M G P K + G++++ L +K + G
Sbjct: 388 RIISLESLHTDKVIAADGNTITMTGY-PTPVSGDKRVGDAWKNGLRYAQQLVVK--NKGG 444
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD------SKKDPTSESMSALQSIR 286
IS ++ KLKVE +D ++L+ A++++ + D S++DP + + L +
Sbjct: 445 KISVVDGAKLKVEDADEIIVLMSAATNY----VQCMDDSYCYFSEEDPLDKVRATLHKVA 500
Query: 287 NLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED 346
+ Y+ L H DY L+ R+ + L + T D++ + ++
Sbjct: 501 DKKYTSLLAAHQKDYHSLYDRMQLNLGEQLEAPAATT------DSLLKGMDANTNSEQDN 554
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
L L FQFGRYLLISSSR G+ ANLQG+W E L+ W++ H NIN++MNYW + P
Sbjct: 555 QYLEMLYFQFGRYLLISSSREGSLPANLQGVWGERLANPWNADYHTNINVQMNYWPTQPT 614
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADRGKVVW 460
NLS C P+ +++ L G TAQ Y GWV HH+ +IW ++ + K
Sbjct: 615 NLSPCHLPMVEYVKSLVPRGKYTAQQYYCKPDGGNVRGWVTHHENNIWGNTAPAK-KSTP 673
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
+P G W+C +WE+Y + +D+DFL+K +L+ ++ + + DG L NPS
Sbjct: 674 HHFPAGAIWMCQDIWEYYQFNLDKDFLKKYYDTMLDAALFWVDNLWTDERDGTLVANPSH 733
Query: 521 SPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH EF L C + A+I E+F +I A++ L + +D + ++ ++ +L
Sbjct: 734 SPEHGEF-----SLGC-----STSQAMICEMFGMMIKASKELGREKDPEIAEIATAMSKL 783
Query: 580 RPTKIAEDGSIMEWAQDF-KD--PEVHHRHLSHLFGLFPGHTITI---EKNPDLCKAAEK 633
KI G MEW + KD + HRH +HLF L PG I I E++ A +
Sbjct: 784 SGPKIGLGGQFMEWKDEVTKDVTGDGGHRHTNHLFWLHPGSQIVIGRSEQDDKYADAMKV 843
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+EG GWS WK WARLHD ++++++ L P GG+Y+NLF A
Sbjct: 844 TLNTRGDEGTGWSKAWKLNFWARLHDGNRSHKLLRSAMKLTVPGSHV---GGVYTNLFDA 900
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
HPPFQID NFG TA +AEML+QS + LLPALP D W G KG+KARG V WK
Sbjct: 901 HPPFQIDGNFGCTAGIAEMLMQSQGGYIELLPALP-DAWKDGAFKGMKARGNFEVDAAWK 959
Query: 754 DGDLHEVGIYSN 765
+G + + I SN
Sbjct: 960 EGKITSIEILSN 971
Score = 61.2 bits (147), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 27/51 (52%), Positives = 41/51 (80%), Gaps = 1/51 (1%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
LK T+N PAK++ ++A+PIGNG +GAM++GGV + ++ NE TLW+G PG+
Sbjct: 35 LKATYNKPAKNWESEALPIGNGYMGAMIFGGVYVDVIQTNEHTLWSGGPGE 85
>gi|70985434|ref|XP_748223.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
gi|66845851|gb|EAL86185.1| conserved hypothetical protein [Aspergillus fumigatus Af293]
Length = 792
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 249/772 (32%), Positives = 396/772 (51%), Gaps = 59/772 (7%)
Query: 11 NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
NP T + PA F +PIGNGRL A +WGG + + +NE+++W+G D NP+A
Sbjct: 22 NPSTYTWYTSPAADFASTLPIGNGRLAAAIWGGA-VDNITVNENSIWSGPFQDRVNPNAY 80
Query: 70 KALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
+ +D R+++++G + A ++ + P+ Y LG ++L+F H + Y R
Sbjct: 81 EGFTDSRAMLEAGNLSSANDVVLREMVSIPSSPREYHPLGPLKLDF--GHEASSLHNYTR 138
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL T A V+Y VG+V ++RE+ +S+PD V+ ++ S+ +L+ VSL+ + Y
Sbjct: 139 FLDLGTGVAGVRYHVGDVVYSREYVASHPDGVLAVRLRASKDSALNVVVSLE----RNRY 194
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
V + +G + KAN+ + I+F++ + + R T + + V G
Sbjct: 195 VESLTAVSSKGMG---TLTLKANSGQNTDPIRFTSQARVVSREGRITTNG---TSVVVTG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ + +S+ P ++++D S L + L Y + DYQ L
Sbjct: 249 ASTVDIFFDTQTSYR----YPDETERD--SAVKKQLDAAVKLIYPAVKQAATSDYQSLSG 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLISS 364
RV + D S + P+ R+ +++T+ DP LV L+F FGR+ LI+S
Sbjct: 303 RVKL-----------DLGSSGSAGNQPTDIRLTNYKTNPNGDPELVTLMFNFGRHSLIAS 351
Query: 365 SRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SR G+ A NLQGIWN+D SP W V++NLEMNYW + NL++ EP+ D +
Sbjct: 352 SREGSSSALPANLQGIWNQDYSPAWGGKYTVDVNLEMNYWHAQVTNLADTFEPVIDLMDK 411
Query: 422 LSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ +G A+ Y +G+++HH TD+W ++ W +WPMG AWL +L + Y +
Sbjct: 412 VLPHGQAVARKMYHCDTGYILHHNTDLWGDAAPVDNGTKWTMWPMGSAWLSMNLMDQYRF 471
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D+ L +R +PLL+ A F +L E +GY + PS SPE+ F P+ GK
Sbjct: 472 TQDKTLLRERIWPLLKSAADFYYCYLFE-FEGYYTSGPSISPENAFRIPEDMTIAGKSTG 530
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+ + TMD ++ E+F A+I + L+ + L K + R+R +I G I+EW +
Sbjct: 531 IDLAPTMDNLLLHELFLAVIETCKALDITGEDLA-NAQKYISRIRQPQIGSYGQILEWRR 589
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTA 652
++++ E+ HRH+S + GL+PG +T N L AA+ L R G GWS W +
Sbjct: 590 EYQETELGHRHMSPILGLYPGSQMTPLVNQTLANAAKVLLDHRITSGSGSTGWSRAWTMS 649
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L+ARL D + + + + L++ + FQID NFGF A +AEM
Sbjct: 650 LYARLFDGNSVWHHAQYFL-------QNYPTDNLWNTDYGPGSAFQIDGNFGFAAGIAEM 702
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L+QS ++LLPALP D G V GL ARG V + W +G+L I S
Sbjct: 703 LLQSHAV-VHLLPALP-DAVPDGRVSGLVARGNFVVDMEWSNGELKSAKIES 752
>gi|223932290|ref|ZP_03624293.1| conserved hypothetical protein [Streptococcus suis 89/1591]
gi|386584235|ref|YP_006080638.1| hypothetical protein SSUD9_1198 [Streptococcus suis D9]
gi|223898971|gb|EEF65329.1| conserved hypothetical protein [Streptococcus suis 89/1591]
gi|353736381|gb|AER17390.1| conserved hypothetical protein [Streptococcus suis D9]
Length = 763
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 257/777 (33%), Positives = 390/777 (50%), Gaps = 93/777 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+K+ + A ++ +A+PIGNG LG M++G E ++LN++T+W D NPD+ L
Sbjct: 1 MKLWYKKAASNWNEALPIGNGHLGGMIYGSAVKECIQLNDETIWYRGKSDRNNPDSLLHL 60
Query: 73 SDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
VR + G+ +A + +F P D Y+LLG++ +E D A Y RELD
Sbjct: 61 KKVREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-PSALSLYERELD 119
Query: 130 LNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHS 185
L+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 120 LDTAISNVIFEPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEV 179
Query: 186 YVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
++ I+M G+ KG++F + K++D G ++ L + + +
Sbjct: 180 SKLDSSTILMSASAGGR------------KGVRFKVVCHSKVTD--GEVNVL-GETIVIR 224
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKL 304
+ L L + + + G +S+LQ ++ Y H+ YQ+
Sbjct: 225 NATEVFLYLKSMTDYWGNL-------------DISSLQGEFSSIDYFTEKDEHVKKYQEQ 271
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV +L S KD ++ I T E K + L LLF +GRYLLISS
Sbjct: 272 FNRVDFKLDYS-KDCLS-------IPTNLLLEDTKKYSN----YLTNLLFHYGRYLLISS 319
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
S+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 320 SQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMRE 379
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 380 PGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDE 439
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 440 RIL-REHFEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDN 497
Query: 545 AIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 498 QILRYFCDSCIGIAKQLVDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEP 554
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------------------------ 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 555 GHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGL 614
Query: 639 -GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
GWS W +ARL+ E AY + L + NLF HPPF
Sbjct: 615 HASTQTGWSAVWLIHFFARLYQGEPAYNQINGLLH-----------NATLGNLFLDHPPF 663
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
QID N G + + E+LVQS N L L+PALP WS+G VKGL+ RGG VS WK+
Sbjct: 664 QIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSAGEVKGLRVRGGYKVSFAWKN 719
>gi|317138010|ref|XP_001816599.2| alpha-fucosidase [Aspergillus oryzae RIB40]
gi|195972741|dbj|BAG68493.1| probable secreted protein [Aspergillus oryzae]
Length = 792
Score = 386 bits (992), Expect = e-104, Method: Compositional matrix adjust.
Identities = 266/765 (34%), Positives = 377/765 (49%), Gaps = 71/765 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA +FT +P+GNGRLGA VWG E + LNE+++W+G D NPD+ AL VR
Sbjct: 28 YTSPASNFTSTLPLGNGRLGAAVWGST-VENITLNENSIWSGQFMDRVNPDSYSALDPVR 86
Query: 77 SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
++ G A +++ + G P + Y LG + L+F H E Y R LDL
Sbjct: 87 YMLKEGNMTAAGQTTLEHMVGSPDEPRAYHPLGSLVLDF--GHEDSQVENYTRSLDLLKG 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V Y VEF RE+ +S+P VI +++ SE+G L+ SL YV N
Sbjct: 145 RAVVHYGYHGVEFRREYIASHPAGVIAARLTASEAGRLNVAASLS----RGRYVTENT-- 198
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG---SDWA 250
A A +D ++ A SDD IS ++ G S A
Sbjct: 199 --------------ATAGNDTGSLKLRA--STAESDD---ISFSAAARIVTHGGWVSRSA 239
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLF 305
+++ +++ FI+ S + T E+ A L + + + D++ L
Sbjct: 240 SSVVIQNATTVDIFIDAETSYRFETQEAWEAEIERKLDAAMRAGFPAIEQAATADHEALA 299
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L+ S + N+ T ER K+ D DP LV L+FQFGRY LI+SS
Sbjct: 300 GRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRYSLIASS 350
Query: 366 RP-GTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
R GT NLQG+WNED P W VNINLEMNYW + NL+E PL L +
Sbjct: 351 RKTGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLIFLLETV 410
Query: 423 SINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
G A+ Y G+V+HH TDIW + W +WPMGGAWL +L E+Y +
Sbjct: 411 KPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANLMEYYRF 470
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D + L++R +PLL A F ++ +GYL T PS+SPE+ F+ P+ G
Sbjct: 471 TQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSESGNEEG 529
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+ + TMD ++ E+F +II +VL N + K SLP ++ +I G I+EW
Sbjct: 530 IDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQILEWRH 588
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTA 652
++++ E HRH+S +FGL+PG +T N L AA L R G GWS W +
Sbjct: 589 EYQETEPGHRHMSPIFGLYPGSQMTPLVNSTLAAAATVLLDHRIAHGSGSTGWSRAWTIS 648
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L++RL D + A+ + + + L++ FQID NFGFTA +AEM
Sbjct: 649 LYSRLFDGDAAWNHTQVFL-------KTYPSANLWNTDSGPGSAFQIDGNFGFTAGIAEM 701
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+QS ++LLPALP G V GL ARG V + W DG L
Sbjct: 702 LLQSHAGVVHLLPALP-SAVPHGKVSGLVARGNFVVDMEWSDGKL 745
>gi|238504526|ref|XP_002383494.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
NRRL3357]
gi|220690965|gb|EED47314.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus flavus
NRRL3357]
Length = 792
Score = 386 bits (991), Expect = e-104, Method: Compositional matrix adjust.
Identities = 267/765 (34%), Positives = 377/765 (49%), Gaps = 71/765 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA +FT +P+GNGRLGA VWG E + LNE+++W+G D NPD+ AL VR
Sbjct: 28 YTSPASNFTSTLPLGNGRLGAAVWGST-VENITLNENSIWSGQFMDRVNPDSYSALDPVR 86
Query: 77 SLVDSGQYAEATAASVK-LFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
S++ G A +++ + G P + Y LG + L+F H E Y R LDL
Sbjct: 87 SMLKEGNMTAAGQTTLEHMVGSPDEPRAYHPLGSLVLDF--GHEDSQVENYTRSLDLLKG 144
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V Y VEF RE+ +S+P VI +++ SE+G L+ SL YV N
Sbjct: 145 RAVVHYGYHGVEFRREYIASHPAGVIAARLTASEAGRLNVAASLS----RGRYVTENT-- 198
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG---SDWA 250
A A +D ++ A SDD IS ++ G S A
Sbjct: 199 --------------ATAGNDTGSLKLRA--STAESDD---ISFSAAARIVTHGGWVSRSA 239
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLF 305
+++ +++ FI+ S + T E+ A L + + + D++ L
Sbjct: 240 SSVVIQNATTVDIFIDAETSYRFETQEAWEAEIERKLDAAMRAGFPAIEQAATADHEALA 299
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
RV + L+ S + N+ T ER K+ D DP LV L+FQFGRY LI+SS
Sbjct: 300 GRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRYSLIASS 350
Query: 366 R-PGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
R GT NLQG+WNED P W VNINLEMNYW + NL+E PL L +
Sbjct: 351 RETGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLIFLLETV 410
Query: 423 SINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
G A+ Y G+V+HH TDIW + W +WPMGGAWL +L E+Y +
Sbjct: 411 KPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANLMEYYRF 470
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D + L++R +PLL A F ++ +GYL T PS+SPE+ F+ P+ G
Sbjct: 471 TQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSESGNEEG 529
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+ + TMD ++ E+F +II +VL N + K SLP ++ +I G I+EW
Sbjct: 530 IDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQILEWRH 588
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTA 652
++++ E HRH+S +FGLFPG +T N L AA L R G GWS W +
Sbjct: 589 EYQETEPGHRHMSPIFGLFPGSQMTPLVNSTLAAAATVLLDHRIAHGSGSTGWSRAWIIS 648
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L++RL D + A+ + + + L++ FQID NFGFTA +AEM
Sbjct: 649 LYSRLFDGDAAWNHTQVFL-------KTYPSANLWNTDSGPGSAFQIDGNFGFTAGIAEM 701
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+QS ++LLPALP G V GL ARG V + W G L
Sbjct: 702 LLQSHAGVVHLLPALP-SAVPHGKVSGLVARGNFVVDMEWSGGKL 745
>gi|159125849|gb|EDP50965.1| conserved hypothetical protein [Aspergillus fumigatus A1163]
Length = 792
Score = 385 bits (990), Expect = e-104, Method: Compositional matrix adjust.
Identities = 249/772 (32%), Positives = 395/772 (51%), Gaps = 59/772 (7%)
Query: 11 NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
NP T + PA F +PIGNGRL +WGG + + LNE+++W+G D NP+A
Sbjct: 22 NPSTYTWYTSPAADFASTLPIGNGRLATAIWGGA-VDNITLNENSIWSGPFQDRVNPNAY 80
Query: 70 KALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
+ +D R+++++G + A ++ + P+ Y LG ++L+F H + Y R
Sbjct: 81 EGFTDSRAMLEAGNLSSANDVVLREMVSIPSSPREYHPLGSLKLDF--GHEASSLHNYTR 138
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL T A V+Y VG+V ++RE+ +S+PD V+ ++ S+ +L+ VSL+ + Y
Sbjct: 139 FLDLGTGVAGVRYHVGDVVYSREYVASHPDGVLAVRLRASKDSALNVVVSLE----RNRY 194
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
V + +G + KAN+ + I+F++ + + R T + + V G
Sbjct: 195 VESLTAVSSKGMG---TLTLKANSGQNTDPIRFTSQARVVSREGRITTNG---TSVVVTG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ + +S+ P ++++D S L + L+Y + DYQ L
Sbjct: 249 ASTVDIFFDTQTSYR----YPDETERD--SAVKKQLDAAVKLNYPAVKQAATSDYQSLSG 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLISS 364
RV + D S + P+ R+ +++T+ DP LV L+F FGR+ LI+S
Sbjct: 303 RVKL-----------DLGSSGSAGNQPTDIRLTNYKTNPNGDPELVTLMFNFGRHSLIAS 351
Query: 365 SRPGTQV---ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SR G+ ANLQGIWN+D SP W V++NLEMNYW + NL++ EP+ D +
Sbjct: 352 SREGSSSGLPANLQGIWNQDYSPAWGGKYTVDVNLEMNYWHAQVTNLADTFEPVIDLMDK 411
Query: 422 LSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ +G A+ Y +G+++HH TD+W ++ W +WPMG AWL +L + Y +
Sbjct: 412 VLPHGQDVARKMYHCDTGYILHHNTDLWGDAAPVDNGTKWTMWPMGSAWLSMNLMDQYRF 471
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D+ L +R +PLL+ A F +L E +GY + PS SPE+ F P+ GK
Sbjct: 472 TQDKTLLRERIWPLLKSAADFYYCYLFE-FEGYYTSGPSISPENAFRIPEDMTIAGKSTG 530
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+ + TMD ++ E+F A+I + L+ + L K + R+R +I G I+EW +
Sbjct: 531 IDLAPTMDNLLLHELFLAVIETCKALDITGEDLA-NAQKYISRIRQPQIGSYGQILEWRR 589
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTA 652
++++ E+ HRH+S + GL+PG +T N L AA+ L R G GWS W +
Sbjct: 590 EYQETELGHRHMSPILGLYPGSQMTPLVNQTLANAAKVLLDHRITSGSGSTGWSRAWTMS 649
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L+ARL D + + + + L++ FQID NFGF A +AEM
Sbjct: 650 LYARLFDGNSVWHHAQYFL-------QNYPTDNLWNTDHGPGSAFQIDGNFGFAAGIAEM 702
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L+QS ++LLPALP D G V GL ARG V + W +G+L I S
Sbjct: 703 LLQSHAV-VHLLPALP-DAVPDGRVSGLVARGNFVVDMEWSNGELKSAKIES 752
>gi|346311070|ref|ZP_08853080.1| hypothetical protein HMPREF9452_00949 [Collinsella tanakaei YIT
12063]
gi|345901764|gb|EGX71561.1| hypothetical protein HMPREF9452_00949 [Collinsella tanakaei YIT
12063]
Length = 770
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 254/791 (32%), Positives = 379/791 (47%), Gaps = 101/791 (12%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
+++ + PA + +A+PIGNG + MV+GGV +E LN++T+W P D NP + L
Sbjct: 1 MRLWYTSPASVWNEALPIGNGHIAGMVFGGVENEKFSLNDETIWYRGPADRNNPSSADNL 60
Query: 73 SDVRSLVDSGQYAEAT-AASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+R L+ G A ++ +F P D Y++LG++ LE L+ A E+Y RELD
Sbjct: 61 GKIRELLAVGDVEAAEDLVALTMFATPRDQSHYEVLGEMFLEQRGVALE-ACESYERELD 119
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L A RV +S G V++ RE+FSS VI+ +++ S+ GS+S +L
Sbjct: 120 LENALCRVSFSCGGVDYRREYFSSFARNVILARLTASKEGSISLRATL------------ 167
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQ-----------FSAILEIKISDDRGTISALE 238
GRC KR D I F L + D G++ L
Sbjct: 168 -------GRC--KRFNDSVRQYRDRGVIMAAHAGGAAGVGFEVGLRVVSCD--GSVRVLG 216
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ + E ++ VL LV+S+ + S +P + S+ + L + H+
Sbjct: 217 ETIVVDEATE-VVLALVSSTDY------WSAGAVEPDASSL--MDGFDGLDFDCALDDHV 267
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDED-PSLVELLFQFG 357
Y++ + RV++ D ++E ++P+ + + P L+ L F +G
Sbjct: 268 AAYREQYGRVAL-----------DIAADEEAPSIPTDGLIACAREGRHVPYLLNLAFDYG 316
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLL+SSS+PG ANLQGIW ED+ P W S +NIN EMNYW P +L E Q PLFD
Sbjct: 317 RYLLLSSSQPGGLPANLQGIWCEDIDPIWGSKYTININTEMNYWMCGPADLPEAQLPLFD 376
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
L + G +TA+ Y A G+ HH TD +A ++ + A+WP+ WL TH+WE
Sbjct: 377 LLERMREPGRRTARAMYGARGFTCHHNTDGFADTAPQSHAIGAAVWPLTVPWLLTHVWEQ 436
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
Y + D L + + + F D+L E + GYL T PS SPE+ + P+G V
Sbjct: 437 YRFFGDASVLAEH-LDMFKEALLFFEDYLFE-YQGYLVTGPSASPENRYRLPNGVEGNVC 494
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
S +D I+R F + A VL D ++ RL PT+I G I EW +D+
Sbjct: 495 LSPAIDNQILRFFFDCCVDVARVLGDQSD-FADRAKALAERLPPTRIGSHGQIQEWLEDY 553
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG--------------- 642
++ E HRH+S LFGL+PG+ + + P+L A +T+++R
Sbjct: 554 EEVEPGHRHISPLFGLYPGNEFDVRRTPELAAACLRTIERRTSNAGYLDLASRDVAIGNW 613
Query: 643 ----------PGWSITWKTALWARLHDQEHAY-RMVKRLFNLVDPEHEKHFEGGLYSNLF 691
GWS W ARL + + L + P NLF
Sbjct: 614 KGAGLHASTRTGWSSAWLVHFNARLGRGDACMDELTGMLAHCSLP------------NLF 661
Query: 692 AAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSIC 751
+ HPPFQID N G T+ V EML+QS +++ +LPALP D +G GL+ARGG VS
Sbjct: 662 SDHPPFQIDGNLGLTSGVCEMLLQSNADEVRILPALP-DALPNGSFTGLRARGGFKVSAS 720
Query: 752 WKDGDLHEVGI 762
W G L + +
Sbjct: 721 WTKGTLCSIEV 731
>gi|423220535|ref|ZP_17207030.1| hypothetical protein HMPREF1061_03803 [Bacteroides caccae
CL03T12C61]
gi|392623612|gb|EIY17715.1| hypothetical protein HMPREF1061_03803 [Bacteroides caccae
CL03T12C61]
Length = 775
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 270/795 (33%), Positives = 394/795 (49%), Gaps = 119/795 (14%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+ E + T P+++ ++ PA ++ T A+PIGNG LGA+ +GGV SE + NE TLWTG
Sbjct: 21 VAGVEQKTETVPMRLWYDRPATNWMTSALPIGNGELGALFFGGVESEQILFNEKTLWTG- 79
Query: 60 PGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
S G YQ GD+ + FD
Sbjct: 80 -----------------STTTRG------------------AYQKFGDVWIHFDGQE--- 101
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS-LSFNVSLD 178
YRREL L+ A +V Y+ + RE+F+S PD+VIV ++S ++G L+F+VSL
Sbjct: 102 DVREYRRELSLDEAIGKVSYTSAGTHYLREYFASRPDEVIVLRLSTPKAGKKLNFSVSL- 160
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL-------EIKISDDR 231
+GR PG R + GI F L ++K+ ++
Sbjct: 161 ----------------ADGR-PGTRQEVTKD------GILFRRKLDLLSYEAQLKVINEG 197
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFD---GPFINPSDSKKDPTSESMSALQSIRNL 288
GT+ A + KL V ++ ++LL A++++D ++ + + A S +
Sbjct: 198 GTLVA-DSNKLCVNAANSVLILLTAATNYDLSSATYVGETSGQLHKRLTDRLARASAK-- 254
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
Y L + HL+DYQ LF+RV L R+ + I +VP+ E V + E
Sbjct: 255 GYDQLKSTHLNDYQSLFNRVRFDL-RTAAKTGGKIGMKTEIPSVPTNELVHLHK--EALY 311
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
L L FQ+GRYL+I+SSR NLQGIWN D +P W+ H NIN++MNYW + CNL
Sbjct: 312 LDMLYFQYGRYLMIASSRGMNLSNNLQGIWNGDNAPPWECDIHSNINIQMNYWPAEVCNL 371
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLA-----SGWVIHHKTDIWAKSSADRGKVVWALW 463
SEC EP ++ ++ + Q LA GW ++ + +I+ G W +
Sbjct: 372 SECHEPFIRYIATEALRPGGSWQ--QLARSEGLRGWTVNTQNNIF-------GYTDWNIN 422
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
AW C HLW+HY YT D ++L AYP++ + D L DG L SPE
Sbjct: 423 RPANAWYCMHLWKHYAYTQDINYLRSVAYPVMRSTCEYWFDRLQLTADGVLLAPAEWSPE 482
Query: 524 HEFIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL----VEKVLKSLP 577
H P DG V+Y+ + + ++FS + A VL L V K+ + L
Sbjct: 483 H---GPWEDG----VAYAQQL----VWQLFSETMQAVRVLRGAGIPLDADFVRKLSEKLK 531
Query: 578 RL-RPTKIAEDGSIMEWAQDFKDPEVH---HRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
RL + G I EW +D + + HRHLS L L+PG+ I+ K+ AA++
Sbjct: 532 RLDNGVTLGAWGQIREWREDSQKLDTLGNPHRHLSQLIALYPGNQISYYKDAKYADAAKR 591
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRL--FNLVDPEHEKHFEGGLYSNLF 691
TL+ RG+ G GWS WK A WARL D EHAYR++K F+ + + +GG+Y NLF
Sbjct: 592 TLESRGDLGTGWSRAWKIAAWARLQDGEHAYRLLKSALDFSTLTVISMDNDQGGVYENLF 651
Query: 692 AAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSIC 751
+HPPFQID NFG TA +AEML+QS ++LLPALP W++G V GL+A G T ++
Sbjct: 652 DSHPPFQIDGNFGATAGIAEMLLQSHQGFIHLLPALP-SVWANGSVTGLRAEGDFTFTME 710
Query: 752 WKDGDLHEVGIYSNY 766
W G L + + S +
Sbjct: 711 WNAGRLTQCAVTSGH 725
>gi|302884741|ref|XP_003041265.1| hypothetical protein NECHADRAFT_88794 [Nectria haematococca mpVI
77-13-4]
gi|256722164|gb|EEU35552.1| hypothetical protein NECHADRAFT_88794 [Nectria haematococca mpVI
77-13-4]
Length = 765
Score = 384 bits (986), Expect = e-103, Method: Compositional matrix adjust.
Identities = 254/765 (33%), Positives = 389/765 (50%), Gaps = 68/765 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
L++ +N P+ +++++P+GNGRLGA+V G +E L+LNE+++W+G P + T PDA + L
Sbjct: 8 LRLQYNSPSSQWSESLPVGNGRLGAVVHGQPGAEVLQLNENSVWSGGPQERTPPDARRML 67
Query: 73 SDVRSLVDSGQYAEATAASVKLF-GHPADV--YQLLGDIELEFDDSHLKYAEETYRRELD 129
+RSL+ + ++AEA A + F +P Y+ +G EF + Y R LD
Sbjct: 68 PKLRSLIRADKHAEAEALAKLAFYANPKSQRHYEPMGTASFEFGHEQVS----NYHRHLD 123
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V+Y G + R+ +S PD V++ + + S+ F V LD + D+ N
Sbjct: 124 LATAQAVVEYEHGGASYRRDMIASFPDNVLLWRFTASQ--KTRFIVRLDRINDDPIETNT 181
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
I + G RI A G + ++L D+ G I A+ V S
Sbjct: 182 YADTI---KSEGSRIVLHATPR-GAGGNRLCSVLRAVCDDEEGAIEAV--GSCLVINSAS 235
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ + A ++F P DP + + + ++S+L RH DY+ LF R+S
Sbjct: 236 CTIAIGAQTTFRHP---------DPELVATTDVDCALMRTWSELVVRHRRDYEGLFGRMS 286
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+++ + TD R+++ Q+ DP LV L +GRYLLISSSR G
Sbjct: 287 LRMWPDASEKPTDA-------------RLETRQS-RDPGLVALYHNYGRYLLISSSRDGH 332
Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL-SECQEPLFDFLTYLSING 426
+ A LQGIWN +P W S +NINL+MNYW + PC+L EC P+ D L +SI G
Sbjct: 333 RALPATLQGIWNPSFTPPWGSKYTININLQMNYWLTAPCSLVDECTLPVIDLLERMSIRG 392
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+TA+ Y GW HH TDIWA +S + +WP+GG W+ + + Y +
Sbjct: 393 QETAKAMYGCRGWCAHHNTDIWADTSPQDHWISATVWPLGGLWVSVTVMDMLRYQYSEE- 451
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L +R + EG F++D+L+ DG YL NPS SPE+ F + G++ STMDM
Sbjct: 452 LHRRIFACHEGAVQFVIDFLVPSSDGLYLIANPSISPENTFYSTTGEVGVFCEGSTMDMT 511
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLK-SLPRLRPTKIAEDGSIMEWA-QDFKDPEVH 603
+IR + + + + LE ++ ++ V++ +L R+ P + + G I EW ++++ E
Sbjct: 512 LIRVALTQFLWSLDRLEGLQEHTLKTVVQDTLDRIPPILVNDAGRIQEWGLNNYEEAEPG 571
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQ 660
HRH+SHLFGL P I+ K P L +AA+ L++R G GWS W L+ARL D
Sbjct: 572 HRHVSHLFGLHPADLISPSKTPKLVEAAKAVLKRRLAHGGGHTGWSRAWLLNLYARLLDG 631
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST--- 717
E + L + NL HPPFQID NFG A + E L+QS
Sbjct: 632 EACGENMDLLLS-----------QSTLPNLLDTHPPFQIDGNFGACAGILECLMQSMEVN 680
Query: 718 -----LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
+ ++ LLPA P W G ++ ++ + G VS W+ G +
Sbjct: 681 KEGVDVVEVRLLPACP-RSWEKGALERVRTKQGWLVSFSWEMGQV 724
>gi|336425540|ref|ZP_08605561.1| hypothetical protein HMPREF0994_01567 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336012115|gb|EGN42041.1| hypothetical protein HMPREF0994_01567 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 835
Score = 384 bits (986), Expect = e-103, Method: Compositional matrix adjust.
Identities = 261/799 (32%), Positives = 389/799 (48%), Gaps = 98/799 (12%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA+ F DA +GNG LG V G E + +NEDTLW+G G Y NP + R L
Sbjct: 11 PAEQFWDAHYLGNGSLGMSVMGDPVLEEVYINEDTLWSGSEGFYLNPQHYDRFMEARRLA 70
Query: 80 DSGQYAEA-TAASVKLFGHPADVYQLLGDIELEFDDSH------LKYAEET-------YR 125
G+ EA T + + G + Y L + + + LK E YR
Sbjct: 71 LEGKGKEANTIINNDMEGRWLETYLPLASLHITMGQADNRRNMPLKMVIEPQPGDIEDYR 130
Query: 126 RELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISG------SESGSLSFNVSLDS 179
R L L+ A V + + + RE+F S PD+ + L F +DS
Sbjct: 131 RCLSLSDAAETVSWIRDGIRYRREYFVSYPDRTAYVYCTAEPKEGEGRDKVLDFAFGVDS 190
Query: 180 LLDNHSYVNG--NNQIIMEGRCPGKRIP------PKANAND--DPKGIQFSAILEIKISD 229
L Y+NG + + + G P P P+ D + ++F+ + +D
Sbjct: 191 SL---HYINGAEDGEAFLTGIAPDHAEPSYTAVAPRFIYKDPENSDALRFACCARVISTD 247
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM-SALQSIRNL 288
GT+++ + ++ V G+ +A+L + A +S+ G F P D E + L ++
Sbjct: 248 --GTVAS-DGARVYVNGASYALLAVRAGTSYAG-FRVPRDRDAGKVLEELRKGLDGLQKA 303
Query: 289 S--YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK-SFQTDE 345
Y H+ DYQ L++RV + L E +P+ +R+ + +
Sbjct: 304 GRDYEGARKDHVTDYQALYNRVDLDLG------------TELSGNLPTTQRLHFCGEGVD 351
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLP 405
DPSL L+ Q+ RYL I+ SRPG+Q NLQGIWN+ +P W S NIN+EMNYW
Sbjct: 352 DPSLAALMLQYSRYLTIAGSRPGSQALNLQGIWNDTPNPPWSSNYTNNINVEMNYWPCEV 411
Query: 406 CNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPM 465
L EC P+ D LT L+ G +TA+ Y +GWV HH D+W + W+ WP
Sbjct: 412 LGLPECHLPMMDLLTELADAGKQTAKEYYHMNGWVAHHNADLWRSTEPSCEDASWSWWPF 471
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHE 525
GGAW+C H+W HY YT DR+FL K YP+L A+F+LD+L+E +GYL T PS SPE++
Sbjct: 472 GGAWMCEHIWTHYEYTQDREFLRK-MYPVLREAAAFMLDFLVENKEGYLVTAPSLSPENK 530
Query: 526 F--------------IAPDGK-------LACVSYSSTMDMAIIREVFSAIISAAEVLEKN 564
F +A + + ++ V+ STMDM+I+RE+FS + AA++L+ +
Sbjct: 531 FLTSGEETVIELIDEVAKESRCSPNHPCISAVTIGSTMDMSILRELFSNVARAAQILDIS 590
Query: 565 EDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKN 624
+D + + L+S+ + P + G + EW +D+++ H SH++ ++PG IT
Sbjct: 591 DDPVPVQALESMKKFPPYRTGRFGQLQEWYEDYEECTPGMSHTSHMYPVYPGGLITETGT 650
Query: 625 PDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH 681
P+L +AA ++L++R + GW +WK +L AR +P H
Sbjct: 651 PELFEAARRSLERRLLHAKRQGGWPGSWKISLMARFK----------------NPLECGH 694
Query: 682 FEGGLYSNLFAA---HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVK 738
NL A QIDA FG A VAEML+QS + LLPA+P D W G +
Sbjct: 695 ILKSTGENLGAGMLTEGSQQIDAIFGLGAGVAEMLLQSHQGFIELLPAVPVD-WIDGSFR 753
Query: 739 GLKARGGETVSICWKDGDL 757
G+ ARGG VS WK G L
Sbjct: 754 GMCARGGFVVSASWKRGRL 772
>gi|325261844|ref|ZP_08128582.1| fibronectin type III domain protein [Clostridium sp. D5]
gi|324033298|gb|EGB94575.1| fibronectin type III domain protein [Clostridium sp. D5]
Length = 805
Score = 383 bits (984), Expect = e-103, Method: Compositional matrix adjust.
Identities = 257/773 (33%), Positives = 386/773 (49%), Gaps = 65/773 (8%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA-PKA-LSD 74
+ PAK FT A+P+GNG LGAMV+GG P E + LN DTLW+G PG + P+ +
Sbjct: 10 YTHPAKDFTQALPLGNGHLGAMVYGGFPRERISLNLDTLWSGHPGHWHGKQKIPQGTMER 69
Query: 75 VRSLVDSGQYAEATAASVK-LFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
VRSL+D+G Y EA K + G + Y G +EL+FD + Y E R L L A
Sbjct: 70 VRSLIDAGAYWEAQKQIQKHMLGCNNESYLSAGSLELQFD-TEADY--EGCERRLSLEEA 126
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
R + + + + F S + +I +E +S +SL + L + +
Sbjct: 127 ITRTDWELKGQKVREDVFVSAVQNGMYIRIF-TEGAPVSVAISLQTQLRVLQSAAEADGL 185
Query: 194 IMEGRCPG----KRIPPKA--NANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
++ + P +P + +++ G+ + L I D G I E+ + VE
Sbjct: 186 LLVAQAPSHVEPNYVPSREPIQYDEEKPGMIYGLFLGINECD--GGIKRTEEG-ICVENF 242
Query: 248 DWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNL-SYSDLYTRHLDDYQKLFH 306
+ L + ++G + P + + + + L S+ + + HL ++Q+L+
Sbjct: 243 TCLTMFLSGETEYEG-YGKPLNGQAESIIRYLRERGHRAKLKSWEENFRAHLREHQRLYL 301
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISSS 365
R V + E + P+ ER++ ++ EDP L LLF +GRYL+++SS
Sbjct: 302 RT-----------VLELEGGEEEEQRPTDERLEMVRSGKEDPGLSALLFHYGRYLILASS 350
Query: 366 RPG---TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
RP Q A LQGIW ED+ W S VNIN +MNYW P NL EC+ PL + L
Sbjct: 351 RPLDGLVQPATLQGIWCEDVRSVWSSNWTVNINTQMNYWICGPGNLPECEIPLIRMVKEL 410
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
S + + A N G+V+HH D+W + G+V WA WPMGG WL THL+ HY YT
Sbjct: 411 S-DAGREAAANLNCRGFVVHHNVDLWRQCIPALGEVKWAYWPMGGLWLTTHLYRHYLYTG 469
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+++LEK YP+ + C +F+LD+L HDG +T PSTSPE+ F + S T
Sbjct: 470 DKEYLEK-IYPVFQECTAFILDYLY--HDGSAYQTCPSTSPENTFYDEQERECAACVSPT 526
Query: 542 MDMAIIREVFSAIISAAEVLE--KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
MD+A+IREV ++ E++ + E + + L L + G ++EW +++++
Sbjct: 527 MDIALIREVLCNLLEIDEIIRGTRPESGQCREARRVLNELPAFQTGSRGQLLEWREEYRE 586
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE---EGPGWSITWKTALWAR 656
+ HRH +HL G P I E+ P+L +A +K+L R E + GW+ W AR
Sbjct: 587 ADPGHRHFAHLIGFHPFSQINGEETPELVEAVKKSLGIRLEGRKQYIGWNCAWLINFSAR 646
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP----------FQIDANFGFT 706
L D E A+ V+++ +Y NLF HPP FQID N G
Sbjct: 647 LGDTEQAWEYVQQMLKF-----------SVYDNLFDLHPPLGENEGEREIFQIDGNLGAA 695
Query: 707 AAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
A +AE L+Q ++LLPALP W SG +G+ A G +S+ WKDG L E
Sbjct: 696 AGMAEFLLQYLRGKIHLLPALP-KAWKSGRAEGIAAPGQMELSMSWKDGVLTE 747
>gi|150003335|ref|YP_001298079.1| glycoside hydrolase [Bacteroides vulgatus ATCC 8482]
gi|149931759|gb|ABR38457.1| glycoside hydrolase family 95 [Bacteroides vulgatus ATCC 8482]
Length = 803
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 249/805 (30%), Positives = 408/805 (50%), Gaps = 71/805 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PA+ + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 24 ATDSCETTELWYAQPAEVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 83
Query: 66 -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K
Sbjct: 84 IPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVT- 142
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 143 -GYRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 200
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
+NQ++ G+ P P G+ F I + D G + +E +
Sbjct: 201 RQADLSVEDNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSE 249
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 250 VGIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVKKAAAKSYDELKQAHIKDY 300
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 301 NTLYNRVSIHFGQD---------ANRALPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 349
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 350 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 409
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM +W+ +HLW Y
Sbjct: 410 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMASSWIASHLWTQY 468
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 469 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 528
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
D + E+ S + A+E+L + + + + ++ +L P ++ +G+I EW +DF
Sbjct: 529 MMPACDRELAYEILSNCVQASEILNTDRE-FADSLRTAIAQLPPIQLRANGAIREWFEDF 587
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKTAL 653
++ +HRH SHL L+P IT+EK P+L +AA KT++ R E WS +
Sbjct: 588 EEAHPNHRHTSHLLALYPFSQITLEKTPELAEAARKTIENRLSAENWEDTEWSRANMICM 647
Query: 654 WARLHDQEHAYRMVKRL--------FNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ARL D + AY+ V+ L V P EG +YS D N
Sbjct: 648 YARLKDAQEAYKSVQLLQGKLSRENLMTVSPGGIAGAEGDIYS----------FDGNPAG 697
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
TA +AEMLVQ+ + LP LP D+W G KGL RGG V+ W + ++ + +
Sbjct: 698 TAGMAEMLVQNHEGYVEFLPCLP-DEWKEGSFKGLCIRGGAEVAAEWTNAVINSASLKA- 755
Query: 766 YSNNDHDSFKTLHYRGTSVKVNLSA 790
+ +FK +G S KV L+
Sbjct: 756 ---TANQTFKVKLPQGKSYKVMLNG 777
>gi|251795324|ref|YP_003010055.1| alpha-L-fucosidase [Paenibacillus sp. JDR-2]
gi|247542950|gb|ACS99968.1| Alpha-L-fucosidase [Paenibacillus sp. JDR-2]
Length = 775
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 262/816 (32%), Positives = 427/816 (52%), Gaps = 78/816 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT-NPDAPKA 71
+K+ + PA + +P+GNG+LGA++ GG+ SET + E T W+G P + +PDA +
Sbjct: 4 MKMIYTQPAAGWKQGLPLGNGQLGAVLHGGINSETWNMTEITFWSGKPERFGGSPDAKEK 63
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYR--RELD 129
L +R +G Y KL G + + L D + Y +E + RELD
Sbjct: 64 LKTMREAFFNGNYVLGD----KLAGEQLEPVKGNFGTNLSLCDVLISYNDEGSQLVRELD 119
Query: 130 LNTATARVKYSVGN-VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-SYV 187
L A A V Y G+ RE F S+PD V+V++I G ++GS+S ++ ++ + +
Sbjct: 120 LEKAVAAVSYRSGSGAAMRRETFVSHPDGVLVSRIKGDQAGSVSLSLRIEGRTTTFDARL 179
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPK-GIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+G ++++ R N + D G+ L+ ++ R E + +E
Sbjct: 180 DGPDKLVF-------RTQATENIHSDGTCGVWSEGALKAVVTGGR---VFGEAGTVIIEQ 229
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPT--SESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+D VL L ++ + + D T ES L++ + L H+ DY+ L
Sbjct: 230 ADEVVLYLAVATDY---------GRMDDTWKVESTERLEAAEAKGFERLLRDHIADYRSL 280
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLI 362
+ RV + L S + D +P+ ER++ + E D L+ L +Q+GRYL I
Sbjct: 281 YGRVDLDLGGS-----------KAFDLLPTDERIRKLRAGEQTDNGLIALFYQYGRYLTI 329
Query: 363 SSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
+ +R +++ +LQG+WN E + W H+++N EMNY+ + NL+EC PL +++
Sbjct: 330 AGTRADSRLPLHLQGLWNDGEANAMAWSCDYHLDVNTEMNYYPTEISNLAECHIPLMNYI 389
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
LS G A+ Y GWV H ++ W +S G+ W L GG W+ THL EHY
Sbjct: 390 EQLSFAGRTAAEDFYGCEGWVAHVFSNAWGFASPGWGRS-WGLNVTGGLWIATHLKEHYE 448
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFI-APDGK-LACV 536
Y+ DR FL ++AYP+++ A F LD++ I G+L T PSTSPE+ F P+ + +
Sbjct: 449 YSRDRGFLTRQAYPVMKEAALFFLDYMTIHPKYGWLVTGPSTSPENSFYPGPEEQGEQQL 508
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S STMD ++R++F ++ AAE+L +E+ L ++ ++ L P +I + G + EW +D
Sbjct: 509 SMGSTMDQMLVRDLFGFVLEAAEMLAVDEE-LQHRLKDAMELLPPLQIGKRGQLQEWLED 567
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWA- 655
+++ + HRH SH++G++PG+ IT E+ P+L +A +TL R I + AL+A
Sbjct: 568 YEEAQPQHRHFSHMYGVYPGNQITPEETPELGQAMRQTLLGRMLVDELEDIEFTAALFAL 627
Query: 656 ---RLHDQEHAYRMVKRLF------NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFT 706
RLHD A + V+ L NL+ + K G +N+F ID NFG T
Sbjct: 628 GFSRLHDGNQAVKHVRHLIGELCFDNLLS--YSKPGVAGAETNIFV------IDGNFGGT 679
Query: 707 AAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
AA+A+ML+QS ++LLPA+P D WSSG +GL+A+G ++ W++G L E + + Y
Sbjct: 680 AAIADMLLQSHAGSIHLLPAVPAD-WSSGSYRGLRAKGNAETAVSWENGQLTEA-VITAY 737
Query: 767 SNNDHDSFKTLHYRGTS-VKVNLSAGKIYTFNRQLK 801
S+ +T G+S + + + AGK Y + QLK
Sbjct: 738 SD-----LETFVKCGSSQIHLRMEAGKRYLLDGQLK 768
>gi|350633298|gb|EHA21663.1| hypothetical protein ASPNIDRAFT_53702 [Aspergillus niger ATCC 1015]
Length = 833
Score = 381 bits (978), Expect = e-102, Method: Compositional matrix adjust.
Identities = 261/771 (33%), Positives = 384/771 (49%), Gaps = 70/771 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA +FT +PIGNGRLGA +WG +E + LNE+++W+G + NP + AL VR
Sbjct: 70 YTTPANNFTSTLPIGNGRLGAAIWG-TATENVTLNENSIWSGPFINRVNPRSYDALWPVR 128
Query: 77 SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G E A++ + G P Y LG + L+F H + Y R LDL +
Sbjct: 129 SLLAEGNMTEGNDATLANMVGIPDSPQSYSALGSLVLDF--GHDEAGISNYTRYLDLRSG 186
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V+Y+ V + RE+ +S+PD V+ ++S SE G L NV+ S L YV NN
Sbjct: 187 MAVVEYTYRAVRYRREYLASHPDNVVAVRLSSSEPGGL--NVA--SSLVRDRYVVSNNAT 242
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+ G + +A +N+ IQF+A + +SD R T S+ L+
Sbjct: 243 LSHD---GGLLTLRAYSNNVSNPIQFTAEARV-VSDGRAT-------------SNGTSLV 285
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ +S+ D FI+ S + E+ A L + + + + + DY L RV
Sbjct: 286 VRNASTID-IFIDTETSYRYSAQENWEAEIKSKLDTACSSGFVAVKKNAIADYSALAQRV 344
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR 366
+ L S + +P+ R+ +++ D DP LV L+F FGR+ LI+SSR
Sbjct: 345 DLNLG-----------SSGSAGNLPTDSRLVNYRIDPDSDPELVVLMFHFGRHSLIASSR 393
Query: 367 PGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
A NLQG+WN+D P W ++INLEMNYW + NL++ P D L +
Sbjct: 394 ATESPALPANLQGLWNQDFDPAWGGRFTIDINLEMNYWPAEVTNLADTFSPFIDLLDVVH 453
Query: 424 INGSKTAQVNYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
G A+ Y S G+V+HH TD+W ++ W +WPMGGAWL +L EHY ++
Sbjct: 454 DRGLDVAESMYHCSNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGGAWLSANLIEHYRFS 513
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACV 536
D L R +PLL+ A F +L +GY T PS SPE +I P+ GK +
Sbjct: 514 RDESILRNRIWPLLQSAARFYYCYLFP-FEGYYSTGPSLSPEASYIVPNDMTTAGKEEGI 572
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ TMD +++ E+F A+I +VL N L +++P +I G I+EW D
Sbjct: 573 DIAPTMDNSLLHELFQAVIETCDVLAINNTDCTTAA-SYLAKIKPPQIGSSGRILEWRLD 631
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTAL 653
+++ + HRH+S +FGLFPG + N L AA+ L R G GWS TW L
Sbjct: 632 YEESDPGHRHMSPVFGLFPGDQMAPLVNETLATAAKAFLDWRIAHGSGSTGWSRTWTMNL 691
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
+ARL D + + + ++ L++ FQID NFGFT+ +AE+L
Sbjct: 692 YARLFDGDQVWNHTQIYL-------QRFPSPNLWNTDSGPDTVFQIDGNFGFTSGIAEIL 744
Query: 714 VQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+QS ++LLPALP +G V GL ARG V + W G L E I S
Sbjct: 745 LQS-YKVVHLLPALP-AAVPTGHVSGLVARGNFVVDMEWSGGVLTEAKITS 793
>gi|265753143|ref|ZP_06088712.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
gi|263236329|gb|EEZ21824.1| glycoside hydrolase family 95 [Bacteroides sp. 3_1_33FAA]
Length = 803
Score = 380 bits (975), Expect = e-102, Method: Compositional matrix adjust.
Identities = 241/777 (31%), Positives = 397/777 (51%), Gaps = 67/777 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 24 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 83
Query: 66 -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 84 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 143
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 144 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 200
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 201 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 249
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 250 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 300
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 301 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 349
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 350 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 409
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW Y
Sbjct: 410 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 468
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 469 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 528
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +DF
Sbjct: 529 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFEDF 587
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKTAL 653
++ +HRH SHL L+P IT+EK P+L +AA KT++ R E WS +
Sbjct: 588 EEAHPNHRHTSHLLALYPFSQITLEKTPELAEAARKTIENRLSAENWEDTEWSRANMICM 647
Query: 654 WARLHDQEHAYRMVKRL--------FNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ARL D + AY+ V+ L V P EG +YS D N
Sbjct: 648 YARLKDAQEAYKSVQLLQGKLSRENLMTVSPGGIAGAEGDIYS----------FDGNPAG 697
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
TA +AEML+Q+ + LP LP + W G KGL +GG + W + +++ +
Sbjct: 698 TAGMAEMLIQNHEGYVEFLPCLPVE-WKDGSFKGLCLKGGAEATAEWTNAVINKASL 753
>gi|423231014|ref|ZP_17217418.1| hypothetical protein HMPREF1063_03238 [Bacteroides dorei
CL02T00C15]
gi|423244725|ref|ZP_17225800.1| hypothetical protein HMPREF1064_02006 [Bacteroides dorei
CL02T12C06]
gi|392630134|gb|EIY24136.1| hypothetical protein HMPREF1063_03238 [Bacteroides dorei
CL02T00C15]
gi|392641574|gb|EIY35350.1| hypothetical protein HMPREF1064_02006 [Bacteroides dorei
CL02T12C06]
Length = 800
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 241/777 (31%), Positives = 397/777 (51%), Gaps = 67/777 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 21 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 80
Query: 66 -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 81 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 140
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 141 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 197
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 198 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 246
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 247 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 297
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 298 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 346
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 347 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 406
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW Y
Sbjct: 407 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 465
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 466 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 525
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +DF
Sbjct: 526 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFEDF 584
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKTAL 653
++ +HRH SHL L+P IT+EK P+L +AA KT++ R E WS +
Sbjct: 585 EEAHPNHRHTSHLLALYPFSQITLEKTPELAEAARKTIENRLSAENWEDTEWSRANMICM 644
Query: 654 WARLHDQEHAYRMVKRL--------FNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ARL D + AY+ V+ L V P EG +YS D N
Sbjct: 645 YARLKDAQEAYKSVQLLQGKLSRENLMTVSPGGIAGAEGDIYS----------FDGNPAG 694
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
TA +AEML+Q+ + LP LP + W G KGL +GG + W + +++ +
Sbjct: 695 TAGMAEMLIQNHEGYVEFLPCLPVE-WKDGSFKGLCLKGGAEATAEWTNAVINKASL 750
>gi|423241353|ref|ZP_17222466.1| hypothetical protein HMPREF1065_03089 [Bacteroides dorei
CL03T12C01]
gi|392641729|gb|EIY35503.1| hypothetical protein HMPREF1065_03089 [Bacteroides dorei
CL03T12C01]
Length = 800
Score = 379 bits (974), Expect = e-102, Method: Compositional matrix adjust.
Identities = 241/777 (31%), Positives = 398/777 (51%), Gaps = 67/777 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 21 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 80
Query: 66 -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 81 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 140
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 141 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 197
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 198 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 246
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 247 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 297
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 298 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 346
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 347 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 406
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW Y
Sbjct: 407 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 465
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 466 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 525
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +DF
Sbjct: 526 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFEDF 584
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKTAL 653
++ +HRH SHL L+P IT+EK P+L +AA KT++ R E WS +
Sbjct: 585 EEAHPNHRHTSHLLALYPFSQITLEKTPELAEAARKTIENRLSAENWEDTEWSRANMICM 644
Query: 654 WARLHDQEHAYRMVKRL--------FNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ARL D + AY+ V+ L V P EG +YS D N
Sbjct: 645 YARLKDAQEAYKSVQLLQGKLSRENLMTVSPGGIAGAEGDIYS----------FDGNPAG 694
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
TA +AEML+Q+ + + LP LP + W G KGL +GG + W + +++ +
Sbjct: 695 TAGMAEMLIQNHESYVEFLPCLPVE-WKDGSFKGLCLKGGVEATAEWTNAVINKASL 750
>gi|269955992|ref|YP_003325781.1| hypothetical protein Xcel_1192 [Xylanimonas cellulosilytica DSM
15894]
gi|269304673|gb|ACZ30223.1| conserved hypothetical protein [Xylanimonas cellulosilytica DSM
15894]
Length = 837
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 274/853 (32%), Positives = 397/853 (46%), Gaps = 99/853 (11%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNPD 67
+ ++ PA + +A+P+GNG AM G E L LN+ W+G G D P
Sbjct: 4 LRYDSPATCWDEALPVGNGVRAAMCEGRAGGERLWLNDLRAWSGPVGAGPRGDVDAPVPA 63
Query: 68 A-----------------------PKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQL 104
A P+ L+ VR+ +D G A + Y
Sbjct: 64 AQDSASQDPAAEDPAAASRRAAAGPEHLAAVRAAIDDGDVRTAERLLQESQSPWVQAYLP 123
Query: 105 LGDIELEFD--DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTK 162
LG++E+ L + R LDL TA A Y++G E ++ +V
Sbjct: 124 LGELEVTVTAVGDELAAPGGAHARSLDLRTAVAAHSYALGAARVRHETWADAAGGALVHV 183
Query: 163 ISGSESGSLSFNVSLDSLLDNHSY-------------------VNGNNQIIMEGRCPGKR 203
++ + SLL S +++ P
Sbjct: 184 VTADRP--VRLTARFTSLLRAESDAGAVPVAAAAPDAAAPGVDAPAPRDVLLHRLVPPVD 241
Query: 204 IPPKANANDDPKGIQFSA-----ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
+ P + +P +++ ++ ++ + D + +ED +L+ G+ A LLL+ ++
Sbjct: 242 VAPGHESAPEP--VRYGPTTARLVVAVRAAGDPDAV--VEDGELRT-GAATAHLLLIGTA 296
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNL-SYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
+ P + ++ PT +AL + S H ++ L+ RV + L
Sbjct: 297 TTHDPA---AGTQATPTEAVAAALALVTGPEPASPRRAAHEAAHRALYDRVELTLP---- 349
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGI 377
S DT+P+ R+ + +DP L L F +GRYLL++SSRPG A LQGI
Sbjct: 350 -------SSSGADTLPTDARIAAAADVDDPGLTALAFHYGRYLLLASSRPGGLPATLQGI 402
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS-INGSKTAQVNYLA 436
WN L W SA NINL+M YW + L EC EPL F+ L+ G + A+ Y A
Sbjct: 403 WNPLLPGPWSSAYTTNINLQMAYWPAETTALPECHEPLLAFVERLATTTGPEAARRLYGA 462
Query: 437 SGWVIHHKTDIWAKS---SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
GWV HH +D W + A G WA W +GG WL HLWE + + D FL +RA+P
Sbjct: 463 RGWVAHHNSDAWGHADPVGAGHGDPAWASWALGGVWLAHHLWERWLFGGDATFLRERAWP 522
Query: 494 LLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSA 553
+L G F LDW ++ T+PSTSPE+ ++APDG+ V S+TMD ++R + +A
Sbjct: 523 VLRGAGLFALDW-VQSDGTRAWTSPSTSPENHYVAPDGRPTGVGTSATMDGELLRWLAAA 581
Query: 554 IISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLF 611
+AA+ L +ED L + KV LP ++ G ++EWA + E HRH+SHL
Sbjct: 582 CRAAADALGVSEDWLDDLAKVTALLPA---PEVGPRGELLEWAAPVAEAEPEHRHVSHLV 638
Query: 612 GLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLF 671
G FP ++T + P L A ++++ RG E GWS+ W+ ALWARL D E + ++R
Sbjct: 639 GAFPLASVTPWRTPGLAAATARSIELRGPESTGWSLAWRAALWARLGDGERVHATLRRAQ 698
Query: 672 N-LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWD 730
V P +H GGLY NLFAAHPPFQ+D N G TAAVAE L+QS L LLPALP
Sbjct: 699 RPAVAPGGAEH-RGGLYPNLFAAHPPFQVDGNLGLTAAVAEALLQSHDGVLRLLPALP-A 756
Query: 731 KWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSA 790
W G V+GL+ARGG V + W DG L S +ND S T R V +A
Sbjct: 757 AWPDGAVRGLRARGGLRVDLTWADGAL-----VSARVHNDTPSTTT---RAVVVGPQTAA 808
Query: 791 GKIYTFNRQLKCT 803
G L +
Sbjct: 809 GPTLPTASPLPAS 821
>gi|212695253|ref|ZP_03303381.1| hypothetical protein BACDOR_04793 [Bacteroides dorei DSM 17855]
gi|237711725|ref|ZP_04542206.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
gi|212662163|gb|EEB22737.1| hypothetical protein BACDOR_04793 [Bacteroides dorei DSM 17855]
gi|229454420|gb|EEO60141.1| glycoside hydrolase family 95 protein [Bacteroides sp. 9_1_42FAA]
Length = 818
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 240/778 (30%), Positives = 395/778 (50%), Gaps = 69/778 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 39 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 98
Query: 66 -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 99 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 158
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 159 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 215
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 216 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 264
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 265 VSIKEADAVTLIVDVRTDYKSP---------DYKTLCADGVEKAAAKSYDELKQAHIKDY 315
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYL 360
L++RVSI + +P+ R K + + D L L FQ+GRYL
Sbjct: 316 NTLYNRVSIHFGQDANR------------AMPTDVRWKQVKEGKTDTGLDALFFQYGRYL 363
Query: 361 LISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF
Sbjct: 364 TIASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFT 423
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
++ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW
Sbjct: 424 YIKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQ 482
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACV 536
Y +T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+
Sbjct: 483 YEFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVA 542
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +D
Sbjct: 543 SMMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFED 601
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKTA 652
F++ +HRH SHL L+P IT+EK P+L +AA KT++ R E WS
Sbjct: 602 FEEAHPNHRHTSHLLALYPFSQITLEKTPELAEAARKTIENRLSAENWEDTEWSRANMIC 661
Query: 653 LWARLHDQEHAYRMVKRL--------FNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFG 704
++ARL D + AY+ V+ L V P EG +YS D N
Sbjct: 662 MYARLKDAQEAYKSVQLLQGKLSRENLMTVSPGGIAGAEGDIYS----------FDGNPA 711
Query: 705 FTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
TA +AEML+Q+ + LP LP + W G KGL +GG + W + +++ +
Sbjct: 712 GTAGMAEMLIQNHEGYVEFLPCLPVE-WKDGSFKGLCLKGGAEATAEWTNAVINKASL 768
>gi|375088282|ref|ZP_09734622.1| hypothetical protein HMPREF9703_00704 [Dolosigranulum pigrum ATCC
51524]
gi|374562320|gb|EHR33650.1| hypothetical protein HMPREF9703_00704 [Dolosigranulum pigrum ATCC
51524]
Length = 820
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 258/784 (32%), Positives = 399/784 (50%), Gaps = 93/784 (11%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG--VPGD--YTNPDAP---KALSDVRSL 78
+A+P+GNG +G+ V+G V E ++ NE TLW+G PGD Y + L ++R
Sbjct: 22 EALPVGNGTMGSKVFGWVGRERIQFNEKTLWSGGPKPGDDSYNGGNLEGKHSVLPEIRQA 81
Query: 79 VDSGQYAEATAASVKLFGHPAD----VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ G +A + + P Y GDI L+F + + T Y+R LD++TA
Sbjct: 82 LEDGNTEKAKQLAEEHLVGPNSPEYGRYLSFGDIYLDFTNQSKELESVTDYKRVLDMDTA 141
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL---DSLLDNHS-YVN- 188
T V+Y F R+ F S+PD+V+VT +S L FN L L+D S +VN
Sbjct: 142 TTSVRYKEDGTTFKRDTFISHPDKVMVTHLSKEGDKPLEFNAGLYLTKELVDGGSNHVNH 201
Query: 189 ------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
Q +E G + K D+ G++F++ +EI D G I L D L
Sbjct: 202 YAEKESDYKQATVEYTEKGALL--KGTVRDN--GLEFASYMEI---DTDGVIEVL-DGYL 253
Query: 243 KVEGSDWAVLLLVASSSF-DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+V G+ +A L+ A +++ P N D+ D + S +Q + +Y + H++D+
Sbjct: 254 RVTGATYATLMTHAVTNYAQNPETNYRDTTMDVAEVAQSTVQQAIDKTYEQVKVDHINDH 313
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q LFHRV + L + TD + ++ + +L EL +Q+GRYLL
Sbjct: 314 QDLFHRVQLDLGAKTSALFTDDL-------------LATYDKQDGRALEELFYQYGRYLL 360
Query: 362 ISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
I+SSRPG ANLQG+WN +P W+S H+N+NL+MNYW + N++E PL +F+
Sbjct: 361 ITSSRPGKNALPANLQGVWNAVDNPAWNSDYHMNVNLQMNYWPAYSANMAETALPLINFV 420
Query: 420 TYLSINGSKTAQVNYL--------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
L G + A Y +GW+ H + + ++ W P AW+
Sbjct: 421 DDLRYYG-RVAASEYANITSKEGEENGWLAHTQVTPFGWTTPGW-NYYWGWSPAANAWIM 478
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTSPEHEFIAP 529
+++E+Y YT D++FL+++ YP+L+ A F +L E D ++ ++PS SPEH
Sbjct: 479 QNVYEYYRYTQDKEFLQEKIYPMLKETAKFWNQFLHYDEASDRWV-SSPSYSPEH----- 532
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLE-----KNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F A EVL + +D L+ ++ + +L+P I
Sbjct: 533 ----GTITIGNTFDQSLVWQLFHDFKEATEVLRDVEGFRPDDTLLAEISEKFAKLKPLHI 588
Query: 585 AEDGSIMEWAQDFKDP------EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
DG I EW ++ D E HHRH+S L GLFPG T+ + NPD +AA+ TL R
Sbjct: 589 NNDGHIKEWYEEDTDAFTGEKVEKHHRHVSELVGLFPG-TLFSKDNPDYMEAAKATLNHR 647
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G+ G GW+ K LWARL D A+ ++ + +NL+ HPPFQ
Sbjct: 648 GDGGTGWAKANKINLWARLLDGNRAHHLLS-----------EQLRQSTLNNLWDTHPPFQ 696
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG T+ + EML+QS + LPALP D W G VKGLKARG V++ WK+ L+
Sbjct: 697 IDGNFGATSGITEMLLQSHDGYIAPLPALP-DVWKDGSVKGLKARGNVEVAMNWKNSTLY 755
Query: 759 EVGI 762
E+ +
Sbjct: 756 ELQL 759
>gi|345513833|ref|ZP_08793348.1| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
gi|345456122|gb|EEO45721.2| glycoside hydrolase family 95 protein [Bacteroides dorei 5_1_36/D4]
Length = 800
Score = 378 bits (971), Expect = e-102, Method: Compositional matrix adjust.
Identities = 241/777 (31%), Positives = 397/777 (51%), Gaps = 67/777 (8%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+T + ++ + PAK + +++PIGNGRLGAM +GG+ E L LNE T+W+G + N
Sbjct: 21 ATDSCETTELWYAQPAKVWMESLPIGNGRLGAMTYGGIEEEKLALNESTMWSGQYNENQN 80
Query: 66 -PDAPKALSDVRSLVDSGQYAEAT-AASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE 121
P + ++ +R L G+ +E A L G+ + +GD++++F K +
Sbjct: 81 KPFGREKMNQLRKLFFEGKLSEGNRIAGDNLHGNQTSFGTHLPIGDLKMQFIYPEGKVTD 140
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
YRR L L+ A + V ++ G V + RE+F++NPD V+V +++ + S++ N+ LD L+
Sbjct: 141 --YRRSLSLDEAVSSVSFNSGGVNYKREYFATNPDNVLVLRLTADKQKSITMNMGLD-LM 197
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK 241
NNQ++ G+ P P G+ F I + D G + +E
Sbjct: 198 RQADLSVENNQLVFTGKVD---FPLHG-----PGGVCFEG--RIAVLADNGEVK-MEQSG 246
Query: 242 LKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+ ++ +D L++ + + P D + ++ SY +L H+ DY
Sbjct: 247 VSIKEADTVTLIVDVRTDYKSP---------DYKTLCADGVEKAAVKSYDELKQAHIKDY 297
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
L++RVSI + + + T ++VK +TD L L FQ+GRYL
Sbjct: 298 NTLYNRVSIHFGQD---------ANRAMPTDVRWKQVKEGKTD--TGLDALFFQYGRYLT 346
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
I+SSR + + LQG +N++ + W + H++IN E NYW + NL+EC PLF +
Sbjct: 347 IASSRENSPLPIALQGFFNDNKACNMGWTNDYHLDINTEQNYWAANVGNLAECNAPLFTY 406
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ L+ +G+KTA+V Y GW H ++W + A ++W L+PM G+W+ +HLW Y
Sbjct: 407 IKDLAHHGAKTAEVVYGCKGWTAHTTANVWGYTPAS-STIIWGLFPMAGSWIASHLWTQY 465
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
+T D+ +L + AYPLL+G A F+LD+L + GYL T PS SPE+ F G+ S
Sbjct: 466 EFTQDKQYLAETAYPLLKGNAQFILDFLAKDPKSGYLMTGPSISPENWFRTAGGEEMVAS 525
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
D + E+ S + A+E+L+ + + + + ++ +L P ++ +G+I EW +DF
Sbjct: 526 MMPACDRELAYEILSNCVRASEILDTDRE-FADSLRTAIAQLPPIQLRANGAIREWFEDF 584
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE----EGPGWSITWKTAL 653
++ +HRH SHL L+P IT+EK P+L +AA KT++ R E WS +
Sbjct: 585 EEAHPNHRHTSHLLALYPFSQITLEKTPELAEAARKTIENRLSAENWEDTEWSRANMICM 644
Query: 654 WARLHDQEHAYRMVKRL--------FNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ARL D + AY+ V+ L V P EG +YS D N
Sbjct: 645 YARLKDAQEAYKSVQLLQGKLSRENLMTVSPGGIAGAEGDIYS----------FDGNPAG 694
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
TA +AEML+Q+ + LP LP + W G KGL +GG + W + +++ +
Sbjct: 695 TAGMAEMLIQNHEGYVEFLPCLPVE-WKDGSFKGLCLKGGAEATAEWTNAVINKASL 750
>gi|402087340|gb|EJT82238.1| hypothetical protein GGTG_02212 [Gaeumannomyces graminis var.
tritici R3-111a-1]
Length = 833
Score = 377 bits (969), Expect = e-101, Method: Compositional matrix adjust.
Identities = 268/791 (33%), Positives = 387/791 (48%), Gaps = 90/791 (11%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
S + PL++ P +F D+ IGNGRLG + GG SE++ LNED+ W+G D NPD
Sbjct: 27 SASKPLRMWQTTPGVNFNDSFLIGNGRLGFSLPGGALSESIVLNEDSFWSGGEMDRVNPD 86
Query: 68 APKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETY 124
A + ++++L+ G+ EA+ AS+ G P V + +G + + S + + Y
Sbjct: 87 AAAHMPEIQALIARGEIREASRLASMSYVGTPVSVRHFDWVGKLGISMRGSAGQVRD--Y 144
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-----S 179
R LD+ A V Y+VG V + RE+ +S PD VI +IS ++SG++SF++ +
Sbjct: 145 ERWLDVGEGVAGVYYTVGGVAYKREYTASFPDDVIAVQISANKSGAVSFDLHQSRGIGLN 204
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
L + + +G + I+M G G K I F+A ++ I D G++ + D
Sbjct: 205 LFQDSAGGSGKDTILMGGGSFGA------------KAIVFAAGAKVTI--DGGSMKRIGD 250
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ V+G+D A + A +++ S + S M+ L Y L + H+
Sbjct: 251 T-IVVDGADSATIYWSAWTTY-------RKSAGELQSAVMADLSQASRKGYGALRSDHVK 302
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ L RV + L +S SE+ T +A+R++ +T DP + L F F RY
Sbjct: 303 DYQSLAGRVELSLGKS--------TSEQKAKT--TADRLRGLRTAFDPEIATLYFYFARY 352
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLI+S RPGT ANLQG+WN DL+P W S +NINLEMNYW SL N+ E E +F+ +
Sbjct: 353 LLIASGRPGTLPANLQGLWNNDLNPMWGSKYTININLEMNYWPSLLTNMPELHESMFEHI 412
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ G A+ Y ASG V HH TDIW + WP G AW+ TH++EHY
Sbjct: 413 MKMHEKGRDVAKRMYNASGSVCHHNTDIWGDCAPQDNYAASTFWPSGLAWMATHIYEHYQ 472
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG-KLACVSY 538
+T D D L K YP L A F LD++ E HDG+L TNPS SPE + P+ + ++
Sbjct: 473 FTGDVDVLRKY-YPALRDAAVFFLDFMTE-HDGHLVTNPSVSPEISYRLPNTTQSVALTL 530
Query: 539 SSTMDMAIIREVFSAIISAAEVL-EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
T D +II E+ ++ + ++L + + D + +++ RL P + + G I E+ DF
Sbjct: 531 GPTADNSIIWELVGMVLESQKILGDSDPDNIGQRLTGLRARLPPLRKDQYGGIAEFHADF 590
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR--GEEGPGWSITWKTALWA 655
+ E HRH S LFGLFPG IT A ++ G GWS W AL A
Sbjct: 591 TEDEPGHRHFSQLFGLFPGSQITASNGTTFAAARASLRRRLAFGGGDTGWSRAWAVALEA 650
Query: 656 RLHDQEH-AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP-PFQIDANFGFTAAVAEML 713
RL + A L L P S L P FQ+D N+G + E L
Sbjct: 651 RLLNATGVAASYAHLLTRLTYPN----------SMLDVNEPSAFQLDGNYG-GVTIVEAL 699
Query: 714 VQS-----------TLNDLY---------------LLPALP--WDKWSSGCVKGLKARGG 745
VQS ++ Y LLPALP W G KGL RGG
Sbjct: 700 VQSHELVAAAAASGSMTPAYVGESGGGKAAHHLIRLLPALPRQWAVNGGGFAKGLLVRGG 759
Query: 746 ETVSICWKDGD 756
+ + W DGD
Sbjct: 760 FELDVHW-DGD 769
>gi|119499317|ref|XP_001266416.1| hypothetical protein NFIA_040960 [Neosartorya fischeri NRRL 181]
gi|119414580|gb|EAW24519.1| conserved hypothetical protein [Neosartorya fischeri NRRL 181]
Length = 792
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 251/772 (32%), Positives = 397/772 (51%), Gaps = 59/772 (7%)
Query: 11 NPLKIT-FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
NP T + PA F +PIGNGRL A +WGG + + LNE+++W+G D NP+A
Sbjct: 22 NPSTYTWYTTPAADFASTLPIGNGRLAAAIWGGA-VDNITLNENSIWSGPFQDRVNPNAY 80
Query: 70 KALSDVRSLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRR 126
+ +D R+++++G + A ++ + P+ Y LG + L+F H + ++Y R
Sbjct: 81 EGFTDSRAMLEAGNLSSANDVVLQDMVSIPSSPREYHPLGSLRLDF--GHDATSLQSYTR 138
Query: 127 ELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL T A V+Y VG+V ++RE+ +S+PD V+ ++ S++G+L+ SL+ Y
Sbjct: 139 FLDLGTGVAGVRYQVGDVVYSREYVTSHPDGVLAVRLRASKNGALNVVTSLE----RSRY 194
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
V + G + KAN+ I+F+A + +RG + V G
Sbjct: 195 VESLTAVSSRGMG---TLTLKANSGQSTDPIRFTAQARVV---NRGGRITTNGTAVVVAG 248
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ + +S+ P ++++D + L + SY + DY+ L
Sbjct: 249 ASTVDIFFDTQTSYR----YPDETERDAVVKKQ--LDAAVKASYPAVKQAATSDYKSLSG 302
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISS 364
RV + L S + P+ R+K+++TD DP L+ L+F FGR+ LI+S
Sbjct: 303 RVKLDLG-----------SSGSAGNQPTDIRLKNYKTDPDRDPELMTLMFNFGRHSLIAS 351
Query: 365 SRPGTQV---ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SR G+ ANLQGIWN+D SP W V++NL+MNYW + NL++ EP+ D +
Sbjct: 352 SRAGSSSGLPANLQGIWNQDYSPAWGGKYTVDVNLQMNYWHAQVTNLADTFEPVIDLMDK 411
Query: 422 LSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ +G A+ Y +G+++HH TD+W ++ W +WPMG AWL +L + + +
Sbjct: 412 VVPHGQDVAKKMYHCDTGYILHHNTDLWGDAAPVDNGTKWTMWPMGSAWLSMNLMDQFRF 471
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLAC 535
T D+ L++R +PLL+ A F +L + +GY + PS SPE+ FI P+ GK
Sbjct: 472 TQDKTLLQERIWPLLKSAADFYYCYLFD-FEGYYTSGPSISPENAFIIPEDMTIAGKSTG 530
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+ S TMD ++ E+F+A+I + L+ + L K + R+R +I G I+EW +
Sbjct: 531 IDLSPTMDNLLLHELFTAVIETCKALDITGEDLT-NAHKYISRIRHPQIGSYGQILEWRR 589
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTA 652
+++ E HRH+S + GL+PG +T N L AA+ L R G GWS W T+
Sbjct: 590 EYEGTEPGHRHMSPILGLYPGSQMTPLVNQTLANAAKVLLDHRITSGSGSTGWSRAWTTS 649
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
L+ARL D + L+ L + + L++ FQID NFGF A +AEM
Sbjct: 650 LYARLFDGNSVWHHA--LYFL-----QNYPTDNLWNTDHGPGSAFQIDGNFGFAAGIAEM 702
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L+QS ++LLPALP G V GL ARG V + W +G+L I S
Sbjct: 703 LLQSHAV-VHLLPALP-GAVPDGRVSGLVARGNFVVDMQWSNGELKFAKIES 752
>gi|336427807|ref|ZP_08607799.1| hypothetical protein HMPREF0994_03805 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336008767|gb|EGN38776.1| hypothetical protein HMPREF0994_03805 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 784
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 260/820 (31%), Positives = 392/820 (47%), Gaps = 123/820 (15%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
DA P+GNG LGAMV+G + ++LNED+LW G D NP+A + L +V+ L+ ++
Sbjct: 37 DATPMGNGFLGAMVYGHTARDRIQLNEDSLWHGKFRDRINPNAKEHLKEVQELILDRKFE 96
Query: 86 EATAASVKLFGH----PADV--YQLLGDIELEFDDS---HLKYAEET----YRRELDLNT 132
EA +F H P ++ + LG++ L + + + + E+ Y +L++
Sbjct: 97 EAEEL---MFSHMVSAPGNMRNFSPLGELNLALNTALPFQMGWLPESDGENYVSDLNMEE 153
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
+ + V++TRE F SNPD+V+ ++ + + + LD LL+ + + Q
Sbjct: 154 GILCISHEDKGVQYTREMFVSNPDRVLCIRLKSDKEKA----IRLDMLLNRVPFTD---Q 206
Query: 193 IIMEGRCPGKRIPPKA-----------------NANDDPKGIQFSAILEIKISDDRGTIS 235
+ + R PGK + D G +F+ L + ++D R
Sbjct: 207 RLPDDRRPGKFVSAGVWPVTRCERIYTENGHTLMMEGDENGTRFACGLTV-VTDGR---- 261
Query: 236 ALED--KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
+ED KL + V+ L ASS + ++D S+L + R Y+D+
Sbjct: 262 -IEDCYAKLVAHEAGEVVIYLAASSD---------NREEDFVGNVKSSLAAARAKGYADI 311
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
T H+ D+ R ++ L P E+ +
Sbjct: 312 RTDHIADFTSYMKRCTLAL--------------------PEDEKAGMY------------ 339
Query: 354 FQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
FQ+ RY+++S+ R G NLQGIWN + P+W+S NINL+MNYW + CNLS E
Sbjct: 340 FQYARYMMVSAGREGATAMNLQGIWNHEFCPSWESKYTTNINLQMNYWPAEICNLSTLHE 399
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
PLFD + + G A+ Y G + HH TDI+ A W MGGAW+ H
Sbjct: 400 PLFDLIHTVQERGRDVAKRMYGCRGTMCHHNTDIYGDCGTQDMYAAAAFWQMGGAWMAMH 459
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LWEHY +T+D DFL K YP++E A F +D+LI+ +GYL T PS SPE+ F+ DG
Sbjct: 460 LWEHYLFTLDEDFLRKE-YPVMEEFALFFVDFLIKDKEGYLVTCPSVSPENRFVLEDGSD 518
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
+ TMD IIR + SA + AA++L E A E++++ LRP +I G +
Sbjct: 519 TPICAGPTMDNQIIRGLMSACLEAAKILGIESPYKADFERIIRE---LRPNQIDSIGRLK 575
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSIT 648
EWA + K+ + H SHL+ +FPG I+ K+ ++ +AA K+L R E G GW
Sbjct: 576 EWAWEEKELTPNMVHTSHLWAVFPGDEISWNKDKEIYEAARKSLDSRIEHGAKATGWGGA 635
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
W A +AR + E A + R+F+ L +L A FQID N G +
Sbjct: 636 WHIAFFARFLNGEGAQTAIDRMFH-----------KSLTESLLNAGNVFQIDGNLGLLSG 684
Query: 709 VAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
+AE L+QS ++ LPALP KW +G VKGL+ARGG V + WK+G L + I ++ S
Sbjct: 685 MAECLLQSHAG-VHFLPALP-PKWKNGEVKGLRARGGLEVDMEWKNGTLQKAEIRADKSR 742
Query: 769 ND------------HDSFKTLHYRGTSVKVNLSAGKIYTF 796
D + V L AGK Y F
Sbjct: 743 RTLFVGEVPERISCQDETLSWEKEEFGYSVELEAGKAYEF 782
>gi|334137826|ref|ZP_08511252.1| hypothetical protein HMPREF9413_0062 [Paenibacillus sp. HGF7]
gi|333604667|gb|EGL16055.1| hypothetical protein HMPREF9413_0062 [Paenibacillus sp. HGF7]
Length = 852
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 214/559 (38%), Positives = 313/559 (55%), Gaps = 42/559 (7%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
K+ + PA+ +T+A+P+GNGRLGAM++G V E + LNE++LW G P D TNP+A AL
Sbjct: 5 KLWYIKPAQAWTEALPVGNGRLGAMIFGRVEEELISLNEESLWYGGPKDRTNPEAAAALL 64
Query: 74 DVRSLVDSGQYAEATA-ASVKLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
++R L+ G+ EA A + L P A YQ LGD+ + F + TYRRELDL
Sbjct: 65 EIRRLLLEGRVTEAQELAHMGLTPIPKYAGPYQPLGDLRIWFAEHEPDAG--TYRRELDL 122
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL-LDNHSYVNG 189
T RV+Y+ TRE F+S P V+ +++ + L+F L D + +G
Sbjct: 123 ATGLCRVEYAWQGASCTRELFASAPAGVLACRLTTAHPEGLTFRFHLGRRPFDEGAAPDG 182
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ ++M+GRC P G++++A+ +S + GT+ + D + V G+
Sbjct: 183 PHAVLMQGRC-------------GPDGVRYAAL--ASVSPEGGTVRTIGDF-VHVAGAAE 226
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A + + A +SF +DP + ++ R Y + H DY LF R+S
Sbjct: 227 ATIYVAAQTSF---------RHEDPAAACRRQVEEARRKGYEAVKAEHGADYMPLFARMS 277
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
++L DI +P+ ER+ + + EDP L+ L FQ+GRYLL++SSRPG
Sbjct: 278 LELGTPGADI----------RLLPTDERLDRVREGGEDPELLALFFQYGRYLLLASSRPG 327
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
T ANLQGIWN D P W+ +NINL+MNYW + CNL EC EPLFDF+ L NG +
Sbjct: 328 TLPANLQGIWNADYQPPWECNYTLNINLQMNYWPAEVCNLRECHEPLFDFIDRLVANGRE 387
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA+ Y G+V HH +++WA+S + A+WPMGG WL HLWEHY + DR FL+
Sbjct: 388 TARKLYGCRGFVAHHNSNLWAESGINGMLPRAAVWPMGGVWLALHLWEHYRFGGDRHFLD 447
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
+RAYP+++ A FLLD++ E G L T PS SPE++++ P GK + + MD+ + R
Sbjct: 448 RRAYPVMKEAALFLLDYMTEDGKGGLLTGPSVSPENKYVLPGGKSGYLCMAPAMDIQLAR 507
Query: 549 EVFSAIISAAEVLEKNEDA 567
+F A+ AA VL A
Sbjct: 508 TLFGAVREAAAVLACERGA 526
Score = 142 bits (357), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 82/193 (42%), Positives = 107/193 (55%), Gaps = 17/193 (8%)
Query: 569 VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLC 628
+E++ + RL G ++EW D ++ + HRH+SHLFGLFPG I+ + P L
Sbjct: 614 LERLTAAESRLPQPAAGRHGQLLEWLGDEEEADPGHRHISHLFGLFPGELISPVRTPALA 673
Query: 629 KAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEHAYRMVKRLF-NLVDPEHEKHFEG 684
+AA TL++R G GWS W WARL + + A+R + L + DP
Sbjct: 674 EAARVTLERRLAGGSGHTGWSRVWIAHYWARLREGDEAHRHLTALLRHAADP-------- 725
Query: 685 GLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARG 744
NLF HPPFQID N G T+A AEML+QS L LLPALP W SG VKGL+ARG
Sbjct: 726 ----NLFTEHPPFQIDGNLGGTSAAAEMLLQSQEGMLDLLPALP-SAWPSGRVKGLRARG 780
Query: 745 GETVSICWKDGDL 757
G + W+ G L
Sbjct: 781 GYEAGLEWERGLL 793
>gi|261408195|ref|YP_003244436.1| alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
gi|261284658|gb|ACX66629.1| Alpha-L-fucosidase [Paenibacillus sp. Y412MC10]
Length = 779
Score = 374 bits (960), Expect = e-100, Method: Compositional matrix adjust.
Identities = 268/826 (32%), Positives = 408/826 (49%), Gaps = 88/826 (10%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+K+ + PA+ ++ +PIGNGR+G +V E + E T W+G P KA
Sbjct: 4 MKLWYTKPAQGWSQGLPIGNGRMGNVVISAPDREIWNITETTYWSGQPEPAQGRSNSKAD 63
Query: 72 LSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDS--------H 116
L +R G Y E + K FG + Q++ LEFD +
Sbjct: 64 LERMRQHFFQGDYREGDRLAKKHLEPEKLNFGTNLGLCQVV----LEFDHNVKPSEGGRQ 119
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS-LSFNV 175
AE + RELDL A AR + E TRE F+S+ DQVIV++I S S +SF +
Sbjct: 120 EAAAEPLFYRELDLQEAVARSFCEIDGAEMTREVFASHADQVIVSRIRSSHGSSGVSFRI 179
Query: 176 SLDSLLDN---HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRG 232
S+ +N H+ V G + I G+ ++N + S +++++ + G
Sbjct: 180 SIRG--ENGPFHANVTGKDTIEFRGQAL-----EDVHSNGE---CGVSCQGQLRVAAEGG 229
Query: 233 TISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSD 292
+S D + V G+D A + ++ + + +S L+ L Y
Sbjct: 230 KVSCTADT-ISVSGADEAAIYFAVNTDY-------RQEGESWREKSAFQLEQAVLLGYDA 281
Query: 293 LYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLV 350
L +HL DYQ L+ RV + L S ++P+ ER+ F+ +DP+L
Sbjct: 282 LRAKHLADYQPLYARVRLDLGSSEHA------------SLPTDERIGRFKQGKQDDPALF 329
Query: 351 ELLFQFGRYLLISSSRPGTQV-ANLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCN 407
L +Q+GRYL IS SRP + + +LQGIWN E W H++ N +MNY+ + N
Sbjct: 330 ALFYQYGRYLTISGSRPDSILPMHLQGIWNDGEANKMAWSCDYHLDTNTQMNYFPTEAAN 389
Query: 408 LSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
LSE EPL ++ LS+ G A+ Y A GWV H ++ W +S + W L GG
Sbjct: 390 LSESHEPLMRYIQQLSVAGRSAARHYYDAEGWVAHVFSNAWGFASPGW-ETSWGLNVTGG 448
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEF 526
W+ TH+ EHY Y D+ FLE+ AYP+L+ A+F +D++ + G+L T PS SPE+ F
Sbjct: 449 LWIATHMMEHYAYNQDQAFLEELAYPVLKEAAAFFMDYMTVHPKYGWLVTGPSNSPENSF 508
Query: 527 IA--PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
P+ +S TMD ++R++ + + AA+ L +E+ L +K +L +L P I
Sbjct: 509 YTGNPEDGHQQLSMGPTMDQVLVRDLLAFCVKAAQTLGVDEE-LRQKWQTALDQLPPLMI 567
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG 644
+ G + EW +D+++ + HRHLSHLF L+PG IT + P+L AA TL+ R
Sbjct: 568 GKKGQLQEWLEDYEEAQPEHRHLSHLFALYPGSQITPHRTPELAAAARVTLENRNSRADL 627
Query: 645 WSITWKTAL----WARLHDQEHAYRMVKRLF------NLVDPEHEKHFEGGLYSNLFAAH 694
I + AL +ARLHD + A + + L N++ + K G +N+F
Sbjct: 628 EDIEFTAALFGLFYARLHDGDQAVQHIAHLIGELCFDNMLT--YSKPGVAGAEANIFV-- 683
Query: 695 PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
ID NFG TAA+AEML+QS +++LLPALP W +G V GLKA+G V + W+D
Sbjct: 684 ----IDGNFGGTAAIAEMLLQSHEGEIHLLPALP-AIWPTGSVTGLKAKGNIEVDMSWED 738
Query: 755 GDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQL 800
G L E + N D + Y G ++V L GK+ +L
Sbjct: 739 GKLVEARVKGN-----EDKSVRVFYGGREMEVVLEKGKVQELKVEL 779
>gi|418092776|ref|ZP_12729912.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44452]
gi|353761446|gb|EHD42013.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44452]
Length = 739
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 250/755 (33%), Positives = 376/755 (49%), Gaps = 93/755 (12%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFDPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + +++ G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTNYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD ++
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFS 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL 627
V+++ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P + I I K P+L
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPEL 554
Query: 628 CKAAEKTLQKR-------------------------GEEGPGWSITWKTALWARLHDQEH 662
+AA+ T+ +R GWS W +ARL+ E
Sbjct: 555 AEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEP 614
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + L N NLF HPPFQID N G + + E+LVQS N L
Sbjct: 615 AYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLS 663
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 697
>gi|389642921|ref|XP_003719093.1| hypothetical protein MGG_00050 [Magnaporthe oryzae 70-15]
gi|351641646|gb|EHA49509.1| hypothetical protein MGG_00050 [Magnaporthe oryzae 70-15]
gi|440473491|gb|ELQ42283.1| alpha-L-fucosidase 2 [Magnaporthe oryzae Y34]
gi|440483559|gb|ELQ63936.1| alpha-L-fucosidase 2 [Magnaporthe oryzae P131]
Length = 827
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 262/786 (33%), Positives = 378/786 (48%), Gaps = 85/786 (10%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
+ S NPL++ + D+ IGNGRLG + G +E + LNED+ W+G D N
Sbjct: 26 ANSAANPLRLWQTTAGVTYNDSFLIGNGRLGFSLPGSALTEAITLNEDSFWSGGKMDRVN 85
Query: 66 PDAPKALSDVRSLVDSGQYAEA-TAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
PDA + ++ L+ G+ EA T A + G P V Y LG + L +
Sbjct: 86 PDAAANMPQIQQLITQGRIEEAATLAGMAYKGLPDSVRHYDWLGRLHLAMKGPAGQAGN- 144
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS------ 176
Y R LD+ A V Y++ F+RE+ +S PDQ+I ++ ++SGS+SF +S
Sbjct: 145 -YERWLDVGEGLAGVDYTLNGTAFSREYLASFPDQIIAVRMKSNQSGSISFTLSQSRGSG 203
Query: 177 LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
L+ D + ++G+ I+M G G I FS+ ++ +S G+I
Sbjct: 204 LNRFQDYTTSLDGDT-ILMGGGSMGS------------DAIVFSSGAKVTVSG--GSIKT 248
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTR 296
+ + + V +D AV+ A +++ P K+ + L++ Y + +
Sbjct: 249 I-GETIVVSDADSAVIYWTAWTTYRKP-------KEQLRESVLVDLRTAAAKGYDAIRSE 300
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ DYQKL RV + L S SE+ + +A+R++ DP + L F F
Sbjct: 301 HVKDYQKLAGRVDLNLGMS--------SSEQK--SKSTAQRLRGMSQAFDPEMATLYFYF 350
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
RYLLI+S RPGT ANLQGIWN D+SP W S VNINL+MNYW +L N+ E L
Sbjct: 351 ARYLLIASGRPGTLPANLQGIWNTDISPQWGSKYTVNINLQMNYWPALLTNMPELHHSLL 410
Query: 417 DFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
D L + NG A+ Y ASG V HH TD+W + WP G WL TH++E
Sbjct: 411 DHLKIMHENGKDVARRMYNASGSVCHHNTDLWGDCAPQDNYAASTFWPTGLGWLVTHVYE 470
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG---KL 533
HY +T D L + YP+L A F LD+L E + G+L TNPS SPE ++ P+ +
Sbjct: 471 HYLFTGDEQVL-RDYYPVLRDSALFFLDFLTE-YQGHLVTNPSVSPEIQYYLPNSTTRQG 528
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA-LVEKVLKSLPRLRPTKIAEDGSIME 592
++ T D +II EVF + A E+L E ++++ + RL P + + G + E
Sbjct: 529 VALTLGPTCDNSIIWEVFGLVFHATEILGNVEGKEFQDRLMSARARLPPLRRDQYGGLAE 588
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITW 649
+ D+ + E HRH S LFGLFPG IT + +AA ++L +R G GWS W
Sbjct: 589 FIHDYTEDEPGHRHFSQLFGLFPGSQITSSTSLPF-EAARRSLARRLGNGGGDTGWSRAW 647
Query: 650 KTALWARLHDQEHAYRMVKRLF-NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAA 708
AL ARL D + + L NL P A FQ+D N+G
Sbjct: 648 SIALAARLFDADGVAKSYNHLLVNLTYPNSMLDIN---------APSAFQLDGNYG-GVT 697
Query: 709 VAEMLVQS-----------TLND-------LYLLPALP--WDKWSSGCVKGLKARGGETV 748
+ E +VQS TL D + LLPALP W G KGL RGG +
Sbjct: 698 IVEAIVQSHELVTAEGTAATLGDDTSAHHLIRLLPALPRQWAANGGGHAKGLLTRGGFQL 757
Query: 749 SICWKD 754
+ W D
Sbjct: 758 DVLWDD 763
>gi|334337751|ref|YP_004542903.1| alpha-L-fucosidase [Isoptericola variabilis 225]
gi|334108119|gb|AEG45009.1| Alpha-L-fucosidase [Isoptericola variabilis 225]
Length = 879
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 275/848 (32%), Positives = 386/848 (45%), Gaps = 97/848 (11%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD----YTNPDAPK 70
+ ++ PA + +A+P+GNG AM G E L LN+ T W+G P D T P+
Sbjct: 49 LRYDRPASKWIEALPVGNGHRAAMCAGRPARERLWLNDVTAWSGPPPDDPLAGTRARGPE 108
Query: 71 ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEF--DDSHLKYAEETYR-RE 127
L VR VD G A L Y L ++E+ + + + T+ R
Sbjct: 109 HLDRVRRAVDEGDVRTAERLLQDLQTPWVQAYLPLAELEVSVVPGEGNGPTDDVTFAGRH 168
Query: 128 LDLNTATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY 186
LDL TA A + S G +E ++ V+V + + V + SLL
Sbjct: 169 LDLRTAVATHAWTSPGTGRVVQETWADARGGVLVHVVRAERP--VRAEVRVSSLLRRADE 226
Query: 187 VNGNNQIIMEGRCPGKRIPPKANANDDPK--GIQFSAILEIKISDDRG------------ 232
V P A+ P G + A+L++ + G
Sbjct: 227 VR-----------------PDADRGAGPADGGARLHAVLDLPVDVAPGHEPVDDPVRYAP 269
Query: 233 -------TISALEDKKLKVE------GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESM 279
++AL D + VE + +L VA+++ D P P+D +M
Sbjct: 270 DGRQGVVAVAALGDPEAVVEQDVLRTATARCHVLAVATATTDPPGDVPADRSAASRVAAM 329
Query: 280 -----------SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEEN 328
A R +L H+ +++L+ R + L P+ +
Sbjct: 330 LREAGSVAVPGPAGDGARTALARELRAAHVAAHRRLYDRCRLVLPTPPEAL--------- 380
Query: 329 IDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDS 388
+P+ RV + Q DP L L F GRYLL +SSR G A LQGIWN +L W S
Sbjct: 381 --GLPTDVRVAAAQHRPDPGLAALAFHHGRYLLAASSRDGGLPATLQGIWNAELPGPWSS 438
Query: 389 APHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN-GSKTAQVNYLASGWVIHHKTDI 447
A +NIN +M YW + L+EC EPL + ++ G A+ Y GW HH +D
Sbjct: 439 AYTLNINTQMAYWPAEVTGLAECHEPLLRLVARIAAGPGGVVARELYGTDGWTAHHNSDA 498
Query: 448 WAKSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD---FLEKRAYPLLEGCASF 501
WA ++ A G WA W MGG WL HL EH+ + D D FL A+P+LEG A F
Sbjct: 499 WAHAAPVGAGHGDASWAAWAMGGLWLAQHLVEHHRFAADTDGDAFLRDVAWPVLEGAARF 558
Query: 502 LLDWLIEGHDG------YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAII 555
L W+ D T+PSTSPE+ F A DG A V+ S TMD+A++R + A
Sbjct: 559 ALGWVRTETDADSGRVVRAWTSPSTSPENRFTADDGAPAAVTTSVTMDVALVRWLAEACR 618
Query: 556 SAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFP 615
AAEVL + DA V+++++ L + G ++EW ++ + E HRHLSHL GLFP
Sbjct: 619 EAAEVLGRR-DAWVDRLVEVAAALPHPRAGARGELLEWDRERPEAEPEHRHLSHLVGLFP 677
Query: 616 GHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMV-KRLFNLV 674
T+ PDL AAE+TL+ RG E GWS+ W+ ALWARL A+ V L
Sbjct: 678 LGTLDSATTPDLAAAAERTLELRGPESTGWSLAWRVALWARLGRAGRAHEQVLLALRPAA 737
Query: 675 DPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN-----DLYLLPALPW 729
D H GGLY NLF+AHPPFQ+D N G TA +AEML+QS + L +LPALP
Sbjct: 738 DGRHGGEHRGGLYPNLFSAHPPFQVDGNCGLTAGIAEMLLQSHRSVDGTPALDVLPALP- 796
Query: 730 DKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLS 789
D W G V GL+ARGG V + W+ G V ++ + + + +
Sbjct: 797 DAWPDGRVTGLRARGGLRVDLVWRAGRAERVRVHGPRERDAAVVVRVPGGPPAGTALRVP 856
Query: 790 AGKIYTFN 797
G TF
Sbjct: 857 RGATVTFE 864
>gi|418172315|ref|ZP_12808932.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19451]
gi|418196823|ref|ZP_12833294.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47688]
gi|419426112|ref|ZP_13966303.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7533-05]
gi|419445683|ref|ZP_13985694.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19923]
gi|419447843|ref|ZP_13987844.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7879-04]
gi|419449944|ref|ZP_13989937.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4075-00]
gi|419452089|ref|ZP_13992069.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP02]
gi|419519881|ref|ZP_14059484.1| alpha-L-fucosidase [Streptococcus pneumoniae GA08825]
gi|421288567|ref|ZP_15739325.1| alpha-L-fucosidase [Streptococcus pneumoniae GA58771]
gi|353833518|gb|EHE13628.1| alpha-L-fucosidase [Streptococcus pneumoniae GA19451]
gi|353858855|gb|EHE38814.1| alpha-L-fucosidase [Streptococcus pneumoniae GA47688]
gi|379569503|gb|EHZ34473.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA19923]
gi|379611583|gb|EHZ76306.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7879-04]
gi|379616518|gb|EHZ81213.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 7533-05]
gi|379620888|gb|EHZ85538.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4075-00]
gi|379621308|gb|EHZ85956.1| putative alpha-L-fucosidase [Streptococcus pneumoniae EU-NP02]
gi|379638035|gb|EIA02581.1| alpha-L-fucosidase [Streptococcus pneumoniae GA08825]
gi|395885199|gb|EJG96226.1| alpha-L-fucosidase [Streptococcus pneumoniae GA58771]
Length = 739
Score = 373 bits (958), Expect = e-100, Method: Compositional matrix adjust.
Identities = 250/755 (33%), Positives = 375/755 (49%), Gaps = 93/755 (12%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFDPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + +++ G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTNYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL 627
V+++ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P + I I K P+L
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPEL 554
Query: 628 CKAAEKTLQKR-------------------------GEEGPGWSITWKTALWARLHDQEH 662
+AA+ T+ +R GWS W +ARL+ E
Sbjct: 555 AEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEP 614
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + L N NLF HPPFQID N G + + E+LVQS N L
Sbjct: 615 AYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLS 663
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 697
>gi|418239710|ref|ZP_12866256.1| putative alpha-L-fucosidase [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|419489948|ref|ZP_14029693.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44386]
gi|419526922|ref|ZP_14066473.1| hypothetical protein SPAR35_2240 [Streptococcus pneumoniae GA14373]
gi|353890745|gb|EHE70505.1| putative alpha-L-fucosidase [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|379555528|gb|EHZ20595.1| hypothetical protein SPAR35_2240 [Streptococcus pneumoniae GA14373]
gi|379584934|gb|EHZ49797.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA44386]
Length = 739
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 250/755 (33%), Positives = 375/755 (49%), Gaps = 93/755 (12%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD ++
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFS 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL 627
V+++ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P + I I K P+L
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPEL 554
Query: 628 CKAAEKTLQKR-------------------------GEEGPGWSITWKTALWARLHDQEH 662
+AA+ T+ +R GWS W +ARL+ E
Sbjct: 555 AEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEP 614
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + L N NLF HPPFQID N G + + E+LVQS N L
Sbjct: 615 AYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLS 663
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 697
>gi|67541006|ref|XP_664277.1| hypothetical protein AN6673.2 [Aspergillus nidulans FGSC A4]
gi|40738426|gb|EAA57616.1| hypothetical protein AN6673.2 [Aspergillus nidulans FGSC A4]
gi|259480257|tpe|CBF71222.1| TPA: alpha-fucosidase (Eurofung) [Aspergillus nidulans FGSC A4]
Length = 831
Score = 372 bits (956), Expect = e-100, Method: Compositional matrix adjust.
Identities = 262/759 (34%), Positives = 364/759 (47%), Gaps = 58/759 (7%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRL A ++GGV +E + LNE+T+W+G + T +A AL R L+ +G E
Sbjct: 45 ALPIGNGRLAATIYGGVRAEVITLNENTIWSGPFQERTPENALAALPIARELLLNGSITE 104
Query: 87 ATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
A + H D Y G++EL F H + E YRR LD A V+Y V
Sbjct: 105 AGEFIQREMMHEIDSMRAYSYFGNLELGF--GHDEAKVEGYRRWLDTRKGDAGVEYVVEG 162
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKR 203
V++TRE+ +S P V+ + + SE G+L+ N + + D S Q + R P R
Sbjct: 163 VKYTREYIASFPAGVLAARFTASEKGALTLNATFCRVSDATSL-----QASVSDRAPWIR 217
Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGP 263
+ + + I FS G S + + L + L LV +++ D
Sbjct: 218 LSGTSGQPAEEYPIVFS-----------GQASFVAEGALFTSSN--GTLTLVNATTVD-I 263
Query: 264 FINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
F + + + P+ E++ A L N Y + L D L R SI S D
Sbjct: 264 FFDAETNYRYPSQEAIDAEIAHKLTDALNKGYDRIRDEALADSSSLLDRASIDFGIS-TD 322
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV----ANL 374
+D ++E I V SA + D D L L + +GR+LL++SSR T+ ANL
Sbjct: 323 ETSDLATDERIALVRSAGGL-----DGDLELATLAWNYGRHLLVASSRNTTEAIDLPANL 377
Query: 375 QGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY 434
QGIWN + W +NIN EMNYW + P NL E QEPLFD G K A+ Y
Sbjct: 378 QGIWNNQTTAAWGGKYTININTEMNYWPAGPTNLIETQEPLFDLFAVAYPRGQKLARDMY 437
Query: 435 LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
SG V HH D+W + ++WPMG AWL THL++ Y +T D+ L YP
Sbjct: 438 NCSGVVFHHNLDVWGDPAPVDNYTSSSMWPMGAAWLATHLYDQYRFTGDKALLADTIYPY 497
Query: 495 LEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDMAIIRE 549
L A F + E H+GY T PS SPE+ FI P+ G A + + MD II E
Sbjct: 498 LVDVAKFYQCYTFE-HEGYKVTGPSLSPENTFIIPENWTVAGNKAAMDVAIPMDDQIIWE 556
Query: 550 VFSAIISAA-EVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLS 608
V ++ AA E+ ++D V L ++ P +I G I EW D++ HRHLS
Sbjct: 557 VLHNLLDAASELGIADDDHTVSAAKSFLHKIHPPRIGFQGQIQEWRLDYESSAPGHRHLS 616
Query: 609 HLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEHAYR 665
LFGL PG + N L AAE L+ R G GWS W +ARL+ + A+
Sbjct: 617 PLFGLHPGGQFSPLVNSTLSAAAEVLLEDRLSHGSGSTGWSNAWFINQYARLYRGDDAWA 676
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
+++ F+L + + G FQID NFG + + EML+QS ++LLP
Sbjct: 677 QIEKWFSLYPTNTLWNTDDG---------ATFQIDGNFGVVSGITEMLLQSHAGVVHLLP 727
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP G +GL ARGG TV I W+DG L I S
Sbjct: 728 ALPAVAVPRGSARGLMARGGFTVDIDWEDGRLRTAVIRS 766
>gi|418079608|ref|ZP_12716827.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4027-06]
gi|418087855|ref|ZP_12725020.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47033]
gi|421286404|ref|ZP_15737176.1| hypothetical protein SPAR162_2099 [Streptococcus pneumoniae
GA60190]
gi|353745351|gb|EHD26021.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 4027-06]
gi|353755532|gb|EHD36135.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47033]
gi|395884860|gb|EJG95894.1| hypothetical protein SPAR162_2099 [Streptococcus pneumoniae
GA60190]
Length = 739
Score = 372 bits (955), Expect = e-100, Method: Compositional matrix adjust.
Identities = 250/755 (33%), Positives = 374/755 (49%), Gaps = 93/755 (12%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL 627
V+++ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P + I I K P+L
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPEL 554
Query: 628 CKAAEKTLQKR-------------------------GEEGPGWSITWKTALWARLHDQEH 662
+AA+ T+ +R GWS W +ARL+ E
Sbjct: 555 AEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEP 614
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + L N NLF HPPFQID N G + + E+LVQS N L
Sbjct: 615 AYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLS 663
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 697
>gi|210613381|ref|ZP_03289701.1| hypothetical protein CLONEX_01908 [Clostridium nexile DSM 1787]
gi|210151223|gb|EEA82231.1| hypothetical protein CLONEX_01908 [Clostridium nexile DSM 1787]
Length = 1549
Score = 371 bits (953), Expect = e-100, Method: Compositional matrix adjust.
Identities = 255/786 (32%), Positives = 397/786 (50%), Gaps = 108/786 (13%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG----DYTNPDAPK------ALSDVRS 77
+PIGNG +GA V+G + SE L NE TLWTG P DY ++ + +L +++
Sbjct: 73 LPIGNGDMGANVYGEIASEHLTFNEKTLWTGGPSESRKDYMGGNSTEKGQDGASLKNIQK 132
Query: 78 LVDSGQYAEATAASVKLF-----GHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
L G+ +EATAA L G+ A YQ GDI ++ D K A E Y+R+LDL T
Sbjct: 133 LFAEGKTSEATAACNNLLVGISNGYGA--YQPWGDIYFDYKDITEKNATE-YQRDLDLKT 189
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A + V + ++TRE F S+ D V+V ++ S L+ +V S + GN+
Sbjct: 190 AISTVSFKEDGTQYTREFFMSHDDDVLVARLEAKGSEKLNLDVRFPSKQGGKTVAEGNDT 249
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
+ + G ++ ++++ L +K D G+++ DK L V+ + +
Sbjct: 250 LKLCGALTDNQM-------------KYASYLTVKA--DNGSVTGSGDK-LTVKDASAVTV 293
Query: 253 LLVASSSFDGPFINPSDS-----KKDPTSESMS-----ALQSIRNLSYSDLYTRHLDDYQ 302
L A++ + F N + + T E+++ + Y ++ HL+DYQ
Sbjct: 294 YLSAATDYKNAFYNEDKTEDYYYRTGETDEALAKRVKETVDKAVEKGYKEVKATHLEDYQ 353
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+LF+RVS+ + + T SE+ D + + S E L +LFQ+GRYL I
Sbjct: 354 ELFNRVSLNIGQ--------TVSEKTTDDLLKTYKDGSASESEKRQLENMLFQYGRYLTI 405
Query: 363 SSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
+SSR +Q+ +NLQG+WN +P W S H+N+NL+MNYW + NLSEC PL D++
Sbjct: 406 ASSREDSQLPSNLQGVWNSLTNPPWSSDYHMNVNLQMNYWPTYSTNLSECALPLIDYVDS 465
Query: 422 LSINGSKTAQV-------NYLASGWVIHHKTD-------IWAKSSADRGKVVWALWPMGG 467
L G TA+V + A+G++ H + WA S W P
Sbjct: 466 LREPGRVTAKVYAGVESKDGEANGFMAHTQNTPFGWTCPGWAFS--------WGWSPAAV 517
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFI 527
W+ + WE+Y +T D +F+E+ YP+L+ A+F L E DG L ++PS SPEH
Sbjct: 518 PWILQNCWEYYEFTGDTEFMEENIYPMLKEEATFYNQILTEDKDGKLVSSPSYSPEH--- 574
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIAE 586
+ +T + +I +++ AAEVL ++ + L K ++ +L+ P +I +
Sbjct: 575 ------GPYTAGNTYEHTLIWQLYEDAAKAAEVLGQDTE-LAAKWKENQSKLKGPIEIGD 627
Query: 587 DGSIMEWAQ----DFKDPE----VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
DG I EW + D P+ HRHLSH+ GLFPG I + + +AA+ ++ R
Sbjct: 628 DGQIKEWYEETTLDSMKPQGADPAGHRHLSHMLGLFPGDLIA--QKEEWLQAAKVSMDYR 685
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
+ GW + + WARL + A+ +++ L F+GG+Y NL+ H PFQ
Sbjct: 686 TDNSTGWGMGQRINTWARLGEGNKAHELIQNL-----------FKGGIYPNLWDTHAPFQ 734
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG+T+ V+EML+QS + L LLPA+P D W+ G V GL ARG V + W L
Sbjct: 735 IDGNFGYTSGVSEMLLQSNMGYLNLLPAIP-DVWADGSVDGLIARGNFEVDMDWAKTSLT 793
Query: 759 EVGIYS 764
+ I S
Sbjct: 794 KAEILS 799
>gi|421290728|ref|ZP_15741475.1| alpha-L-fucosidase [Streptococcus pneumoniae GA54354]
gi|421306123|ref|ZP_15756774.1| alpha-L-fucosidase [Streptococcus pneumoniae GA62331]
gi|395885632|gb|EJG96654.1| alpha-L-fucosidase [Streptococcus pneumoniae GA54354]
gi|395903807|gb|EJH14730.1| alpha-L-fucosidase [Streptococcus pneumoniae GA62331]
Length = 739
Score = 371 bits (952), Expect = e-100, Method: Compositional matrix adjust.
Identities = 249/755 (32%), Positives = 373/755 (49%), Gaps = 93/755 (12%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P NLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPVNLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL 627
V+++ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P + I I K P+L
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPEL 554
Query: 628 CKAAEKTLQKR-------------------------GEEGPGWSITWKTALWARLHDQEH 662
+AA+ T+ +R GWS W +ARL+ E
Sbjct: 555 AEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEP 614
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + L N NLF HPPFQID N G + + E+LVQS N L
Sbjct: 615 AYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLS 663
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 697
>gi|418188158|ref|ZP_12824676.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47360]
gi|421271597|ref|ZP_15722447.1| hypothetical protein SPAR48_2212 [Streptococcus pneumoniae SPAR48]
gi|353847967|gb|EHE27986.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47360]
gi|395865736|gb|EJG76874.1| hypothetical protein SPAR48_2212 [Streptococcus pneumoniae SPAR48]
Length = 739
Score = 370 bits (951), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 248/755 (32%), Positives = 372/755 (49%), Gaps = 93/755 (12%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W D NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSDRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTVFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S + +I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL--------SI 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL 627
V+++ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P + I I K P+L
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPEL 554
Query: 628 CKAAEKTLQKR-------------------------GEEGPGWSITWKTALWARLHDQEH 662
+AA+ T+ +R GWS W +ARL+ E
Sbjct: 555 AEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEP 614
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + L N NLF HPPFQID N G + + E+LVQS N L
Sbjct: 615 AYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLS 663
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 697
>gi|358368086|dbj|GAA84703.1| alpha-L-fucosidase 2 precursor [Aspergillus kawachii IFO 4308]
Length = 790
Score = 370 bits (950), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 268/821 (32%), Positives = 392/821 (47%), Gaps = 87/821 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+N PA +FT +PIGNGRLGA +WG +E + LNE+++W G + NP + AL VR
Sbjct: 27 YNTPANNFTSTLPIGNGRLGAAIWG-TATENVTLNENSIWNGPFINRVNPRSYDALWPVR 85
Query: 77 SLVDSGQYAEATAASV-KLFGHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
SL+ G E ++ + G P + LG + L+F H + Y R LDL T
Sbjct: 86 SLLAQGNMTEGNDVTLANMVGIPDSPQSFSALGSLVLDF--GHDQAGISNYTRYLDLRTG 143
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNN 191
A V+Y+ V + RE+ +S PD V+ ++S S+ G L+ SL D + ++ ++
Sbjct: 144 VAVVEYTYREVHYRREYVASYPDGVVAVRLSSSQPGRLNVASSLARDRYVVSNQAAVSSD 203
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV 251
++ R K I DP IQF+ I +SD R T + V
Sbjct: 204 LGVLTLRAYSKNI-------SDP--IQFTTEARI-VSDGRATSNG--------------V 239
Query: 252 LLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFH 306
L+V ++S FI+ S + T E+ A L + + + + DY L
Sbjct: 240 SLVVRNASTVDIFIDTETSYRYTTRETREAEIKDKLDTASRSGFLTVKQNAIADYSTLAQ 299
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISS 364
RV + L S + +P+ R+ +++TD DP L L+F FGR+ LI+S
Sbjct: 300 RVDLNLG-----------SSGSAGNLPTDTRLVNYRTDPDSDPELAVLMFHFGRHSLIAS 348
Query: 365 SRPGTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
SR A NLQG+WN++ P W ++INLEMNYW + NL++ P D L
Sbjct: 349 SRATESPALPANLQGLWNQEFDPAWGGRFTIDINLEMNYWPAEVTNLADTFSPFIDLLDI 408
Query: 422 LSINGSKTAQVNYLAS--GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ G A+ Y S G+V+HH TD+W ++ W +WPMGGAWL +L EHY
Sbjct: 409 VHGRGLDVAESMYHCSNGGYVLHHNTDLWGDAAPVDNGTTWTMWPMGGAWLSANLIEHYR 468
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLA 534
+T D L R +PLL+ A F +L +GY T S SPE +I PD G +
Sbjct: 469 FTRDETILRDRIWPLLQSAARFYYCYLFP-FEGYYSTGLSLSPEASYIVPDDMTTAGNVE 527
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
+ + TMD +++ E+F A+ +VL N K L +++ +I G I+EW
Sbjct: 528 GIDIAPTMDNSLLHELFQAVTETCDVLGINNTDCTTAA-KYLSKIKQPQIGSSGRILEWR 586
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKT 651
D+++ + HRH+S + GLFPG + N L AA+ L R G GWS TW
Sbjct: 587 LDYEESDPGHRHMSPIVGLFPGDQLAPLVNETLATAAKAFLDWRIAHGSGSTGWSRTWTM 646
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
L+ARL D + + + ++ L++ FQID NFGFT+ +AE
Sbjct: 647 NLYARLFDGDQVWNHTQIYL-------QRFPSPNLWNTDSGPDTVFQIDGNFGFTSGIAE 699
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDH 771
ML+QS ++LLPALP SG V GL ARG V + W G L I S
Sbjct: 700 MLLQS-YQVVHLLPALP-AAVPSGHVSGLVARGNFVVDMAWSGGVLTGANITSQ------ 751
Query: 772 DSFKTLHYR---GTSVKVNLSAGKIYTFNRQLKCTNLHQSI 809
S TL R G + VN G+ YT Q N++ +
Sbjct: 752 -SGSTLDIRVQDGLNFTVN---GERYTGGIQTDAGNVYTVV 788
>gi|242815487|ref|XP_002486578.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
gi|218714917|gb|EED14340.1| conserved hypothetical protein [Talaromyces stipitatus ATCC 10500]
Length = 787
Score = 369 bits (948), Expect = 3e-99, Method: Compositional matrix adjust.
Identities = 259/819 (31%), Positives = 396/819 (48%), Gaps = 86/819 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ A F A+P+GNGRLG +++ P+E + LNE+++W+G + NP+A L++VR
Sbjct: 29 YTSAATDFNSALPVGNGRLGGLMYC-TPTERVSLNENSIWSGPFLNRLNPNAKSVLTEVR 87
Query: 77 SLVDSGQYAEATAASV-KLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
S+++SG A ++ + G+P Y LG + L+F S ++ + R LD
Sbjct: 88 SMLESGNITGAGQVALPNMAGNPNSPQHYTPLGQLNLDFGHS----SQGSLNRWLDTYQG 143
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD----NHSYVNG 189
+ Y V +TRE ++ P V+ ++ S++G L+ +SL L + S G
Sbjct: 144 NSGCSYIYNGVNYTREIIANYPTGVLAMRLQASQAGQLNIKISLSRLQNVISNTASTSGG 203
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
N I+M+G G +P F+A ++ S + L V G+
Sbjct: 204 ANSIVMKGNSGGS----------NPY---FAAEAQVIASGGS---VSASGSTLSVSGATT 247
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
+ A +S+ ++ +E L S + Y L T + D L RVS
Sbjct: 248 VDIFFDAEASYR------YSTEAAAETELTRKLSSATSQGYQALRTAAIADNTALVGRVS 301
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR- 366
+ L S P+ +R+ +++++ D LV L++ GR+LL++SSR
Sbjct: 302 LNLGSSSGSAANQ----------PTDKRLSNYKSNPGNDVQLVTLMYNMGRHLLVASSRD 351
Query: 367 --PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
P + ANLQGIWNED +P W S +NINLEMNYW + NL+E +P +D L
Sbjct: 352 TGPLSLPANLQGIWNEDFNPAWGSKYTININLEMNYWHAETTNLAETTKPFWDLLAVAKT 411
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G A Y SG+V+HH D W + + +WP+GG WL THL EHY +T ++
Sbjct: 412 RGELAASSMYGCSGFVLHHNIDCWGDPAPVDYGTPYTIWPLGGVWLSTHLMEHYRFTGNK 471
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYS 539
FL++ A+P+L+ A F + +GY T PS SPE+ FI P G + S
Sbjct: 472 TFLQETAWPILQSAADFCFCYTFL-WNGYYTTGPSLSPENSFIVPSNESKAGNAEGIDIS 530
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
TMD +++ ++FS +I A ++L L +++P + G I+EW Q++ +
Sbjct: 531 PTMDNSLLYQLFSDVIEACQILGLTSSE-CSNAKNYLSKIKPPQTGSYGQILEWRQEYGE 589
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWAR 656
E RHLS LFGL+PG +T + L AA L R G GWS W A +AR
Sbjct: 590 TEPGMRHLSPLFGLYPGSQMTPTVSSSLASAAGILLDHRIKYGSGDTGWSRAWVIACYAR 649
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH--PPFQIDANFGFTAAVAEMLV 714
L + A+ V + + + +NLF ++ PP QID NFGFTA V E+ +
Sbjct: 650 LFNGNSAWNSV-----------QTYLQTFPLTNLFNSNNGPPMQIDGNFGFTAGVTELFL 698
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSF 774
QS N +++LPALP +G V GL ARGG V I W +G L I SN
Sbjct: 699 QSHANLVHILPALP-SSVPTGSVTGLVARGGFKVDIHWSNGVLGSATITSNLG------- 750
Query: 775 KTLHYR---GTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
TL R G+S +VN G+ Y+ K ++ I+
Sbjct: 751 STLALRVANGSSFQVN---GQTYSGAIGTKAGGVYNVIL 786
>gi|393785852|ref|ZP_10373996.1| hypothetical protein HMPREF1068_00276 [Bacteroides nordii
CL02T12C05]
gi|392660966|gb|EIY54563.1| hypothetical protein HMPREF1068_00276 [Bacteroides nordii
CL02T12C05]
Length = 810
Score = 369 bits (947), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 245/788 (31%), Positives = 393/788 (49%), Gaps = 91/788 (11%)
Query: 15 ITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA----- 68
+ F PAK +++ A+ IGNG +GA +G V E L + E T W G P + PD
Sbjct: 35 VWFRYPAKSWSEQALHIGNGYMGASFYGEVEKERLDIAEKTFWAGGP--HAAPDFNYGII 92
Query: 69 ---PKALSDVRSLVDSGQYAEATAAS-VKLFGHPADV--YQLLGDIELEFDDSHLKYAEE 122
++ +R L+ ++AEA + S + + G + + ++G++ ++F + K +
Sbjct: 93 KGDKDKIATIRQLIVERRFAEADSLSRIYMTGDYTNYGYFSMVGNLWIDFGKN--KQPVQ 150
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y R +DL+T+ V+Y+ G V+F RE+F S PD+++ + ++G +SF++S +
Sbjct: 151 NYLRGIDLSTSRGFVEYTQGGVQFNREYFCSYPDKLMALHFTADKAGKISFSLSHSLVYP 210
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ N + G + N S + IKI G++ + +++
Sbjct: 211 PEEVIESENGLTFNGII-------RKNG--------LSYTIRIKIVQQGGSVK-VAHQRI 254
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
VE ++ A + + + P + P ++P + + Y + H+ DYQ
Sbjct: 255 VVEKANEATVFYAVDTEY-AP-VYPLYKGENPQQNTGKVITKAITKGYETVKNTHISDYQ 312
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYL 360
L++RV L+ DT SE+ +P+ RVK Q +D SL L F RYL
Sbjct: 313 TLYNRVRFTLT-------GDTASEQ----LPTNMRVKQLQKGFTDDASLKVLGFNLSRYL 361
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
LIS+SRPGT + LQG+WN W+ NINL+ YW P +L EC+E +++
Sbjct: 362 LISASRPGTLPSTLQGVWNTFEKAPWNGNFQSNINLQEMYWGCGPTHLPECEEAYLEWIE 421
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L G +TA+ Y GWV H +IW + ++W L+P G AW C HLWEHY +
Sbjct: 422 GLVEPGRQTAREYYGTKGWVSHSTGNIWGHTVPG-DDILWGLYPSGAAWHCRHLWEHYAF 480
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
D+++L + YP+++ A F L+ ++E + G+ PS S EH +G + V YS+
Sbjct: 481 NGDKEYLRTKGYPIMKEAAEFWLENMVE-YQGHFIIAPSVSAEHGIEMKNG--SPVEYST 537
Query: 541 T---------------MDMAIIREVFSAIISAAEVLEKNEDALV-EKVLKSLPRLRPTKI 584
T D+ ++ +++S +I AAE L N D++ +K+L + +L P KI
Sbjct: 538 TNGEQTEGRLFTVPAYQDIEMVYDLYSHVIKAAECL--NTDSVFRQKLLIAKNKLLPLKI 595
Query: 585 AEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE---- 640
G + EW D +P HHRHL+HL+ L+PG+ I+ + P L +A K+L+ RG+
Sbjct: 596 GRYGQLQEWIDDVDNPHDHHRHLAHLYALYPGNRISYTRTPALAQAVRKSLEMRGKGKFG 655
Query: 641 -----EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
G WS+ W+TALWARL+D A R+ E G Y N+ +
Sbjct: 656 DRWPHTGGNWSMAWRTALWARLYDGNQAIGTFNRMIK----------ESG-YENMMSNQS 704
Query: 696 P-FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
Q+DA + AEML+QS ++LLPALP +W G ++GL AR G V+I WK
Sbjct: 705 GNMQVDATMATSGLFAEMLLQSHEGFIHLLPALP-TEWPEGKIEGLMARNGYQVTIEWKY 763
Query: 755 GDLHEVGI 762
G L + I
Sbjct: 764 GRLTKAEI 771
>gi|405761776|ref|YP_006702372.1| hypothetical protein SPNA45_02013 [Streptococcus pneumoniae SPNA45]
gi|404278665|emb|CCM09296.1| conserved hypothetical protein [Streptococcus pneumoniae SPNA45]
Length = 739
Score = 369 bits (946), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 248/755 (32%), Positives = 373/755 (49%), Gaps = 93/755 (12%)
Query: 38 MVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFG 96
M++G E ++LN++T+W + NPD+ L +R + G+ +A + +F
Sbjct: 1 MIYGSATKECIQLNDETIWYRGKSNRNNPDSLLHLKKIREYLLDGEIQKAEELIKLTMFA 60
Query: 97 HPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTREHFS 152
P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE+F+
Sbjct: 61 TPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKREYFT 119
Query: 153 SNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANA 210
S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 120 SFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR-------- 171
Query: 211 NDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS 270
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 172 ----KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI------ 218
Query: 271 KKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENI 329
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++ I
Sbjct: 219 -------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS-------I 263
Query: 330 DTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S
Sbjct: 264 PTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSK 319
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWA 449
+NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD +
Sbjct: 320 YTININTQMNYWMVGPCDLPEVEYPLFDMLERIREPGRLTAKKMYGARGFTAHHNTDGFG 379
Query: 450 KSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L E
Sbjct: 380 DTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV 438
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL- 568
DGYL PS SPE+++ +G SST+D I+R + I A+ L N D +
Sbjct: 439 -DGYLMIGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFIS 497
Query: 569 -VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL 627
V+++ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P + I I K P+L
Sbjct: 498 RVKELKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPEL 554
Query: 628 CKAAEKTLQKR-------------------------GEEGPGWSITWKTALWARLHDQEH 662
+AA+ T+ +R GWS W +ARL+ E
Sbjct: 555 AEAAKITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEP 614
Query: 663 AYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
AY + L N NLF HPPFQID N G + + E+LVQS N L
Sbjct: 615 AYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLS 663
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 664 LIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 697
>gi|224537148|ref|ZP_03677687.1| hypothetical protein BACCELL_02025 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521203|gb|EEF90308.1| hypothetical protein BACCELL_02025 [Bacteroides cellulosilyticus
DSM 14838]
Length = 776
Score = 368 bits (944), Expect = 7e-99, Method: Compositional matrix adjust.
Identities = 258/792 (32%), Positives = 390/792 (49%), Gaps = 68/792 (8%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + A+PIGNGR+G M++G +E + +NE+T+W G P NP P+ ++ +R+L+
Sbjct: 32 PASDWKSALPIGNGRIGGMIFGDPSTEQIVINENTIWCGPPLPPNNPKGPELINKMRNLI 91
Query: 80 DSGQYAEATAASVKLFG----HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
+G+Y EA K F A YQ G + ++F D K A Y+R LD A
Sbjct: 92 FNGKYEEAVIVCEKEFADGVHENARSYQPFGFLNIDFKD---KGAISNYKRWLDYTKAIT 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
V Y+ V +TRE F S P++V+V +I+ + G +SF + N +
Sbjct: 149 YVSYTQNGVTYTREAFVSKPNEVMVVRITADKPGQVSFKSKYTRPFGATTKAENNRSQYV 208
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
+G+ + N + G++F I I ++ G I A E +++ ++ +++
Sbjct: 209 QGQAYAE--------NGEFVGVKFEGI--INYYNEGGKIKANETD-IEINNANSVTIMIA 257
Query: 256 ASSSFDGPFINPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
S+ + N D+K T L + L Y L H+D+Y L++R S
Sbjct: 258 ISTDY-----NIHDTKNVLTHNRKKICEKQLSQAQKLGYKKLKQTHIDEYSALYNRSSF- 311
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF---GRYLLISSSRPG 368
DI +T N P +R++ + + S ELLF++ RYL ISSSR G
Sbjct: 312 ------DITFNTPVNNN----PIDKRIQLAASGQIDS--ELLFEYYNYCRYLFISSSRKG 359
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
NLQGIWN + W S H+N+N++ YW + NLSEC EP+F L NG +
Sbjct: 360 GLPMNLQGIWNPLMLAPWRSNFHINVNIQEAYWFAEQANLSECHEPIFTLTENLIKNGKE 419
Query: 429 TAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TAQV + G V H+TD W + K W + AWLC H EHY YT+D++FL
Sbjct: 420 TAQVMFGTKRGSVAGHRTDAWFYAPPTFLKAHWGMSITNAAWLCLHHMEHYRYTLDKEFL 479
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+ RA P+L A F +DWL+ + G L + P+ SPE+ F +GK+A ++ T D I
Sbjct: 480 KTRALPILRETALFFVDWLVPDPRSGKLVSGPTASPENRFKV-NGKVASLTMGCTYDQEI 538
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
I F + A ++L N + VE V S+ +L IA DG +MEW ++ ++ E HRH
Sbjct: 539 IWNTFRDFLEACKILGINNEETVE-VEASMKKLSMPTIANDGRLMEWTEESEETEPGHRH 597
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEHA 663
+SHL+G+ PG+ IT +K P L A K+L R GWS+ W T++ ARL + + +
Sbjct: 598 ISHLWGMMPGNRITQDKTPHLVDAVRKSLDYRLNHNYHAQGWSLGWVTSMLARLKEGDKS 657
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFA-AHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
M+ + ++ Y N+F AH Q+ G A+ E+++QS + +
Sbjct: 658 LDMM-----------QHNYFTKAYPNMFVDAHGRPQVGDMMGVPLAMIELILQSHTDYID 706
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLP+LP W G V GL ARG + WK G L I S L Y G
Sbjct: 707 LLPSLP-TAWKDGKVTGLCARGAFVFDMEWKAGKLISTNIKSLKGEKC-----LLRYEGK 760
Query: 783 SVKVNLSAGKIY 794
+++ AGK Y
Sbjct: 761 VKELSTEAGKSY 772
>gi|325964568|ref|YP_004242474.1| hypothetical protein Asphe3_32320 [Arthrobacter phenanthrenivorans
Sphe3]
gi|323470655|gb|ADX74340.1| hypothetical protein Asphe3_32320 [Arthrobacter phenanthrenivorans
Sphe3]
Length = 863
Score = 367 bits (943), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 275/845 (32%), Positives = 399/845 (47%), Gaps = 95/845 (11%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVP-----SETLKLNEDTLWTGVPGD------ 62
++ ++ PA + +A+P+GNGR GAMV+GG P S +LN+ + W+G P
Sbjct: 6 RLAYDAPAAEWLEALPLGNGRHGAMVFGGSPANGGMSHRFQLNDSSAWSGSPHSQDREPV 65
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
++ +A + LS R L+ SG +A A L + Y L F D HL A
Sbjct: 66 FSREEADRILSGSRRLISSGDFAGAAETLKGLQHRHSQAY-------LPFVDLHLTAAPA 118
Query: 123 T-------------YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESG 169
Y R LDL TA + Y + E F S+ V+V +
Sbjct: 119 ATPTAGPAAGRPSDYHRGLDLATAVSTNTYCLEGHAVRVEAFISHDPSVLVISLLADAPE 178
Query: 170 SLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKAN-----ANDDPKGIQFSAILE 224
++ ++ LDS L +E + P P + D+ +Q +A +
Sbjct: 179 GVNLSLRLDSPLRVLRRTEDRGTCSLELKLPSDAAPAHDGGLVEYSEDESLSLQGAAAVS 238
Query: 225 IKISD---DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
D +A L G A + + A+++F G +P+ +E+
Sbjct: 239 WAHDGQDVDAPGGTAGHYGGLAATGVRRADVFVTAATTFAGLGRHPAGDAASAAAEARGV 298
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT---VPSAERV 338
L+ S S L RH + + +L+ I+L D + E DT + +A
Sbjct: 299 LELAHAASPSTLKERHQESHSRLYRAAQIEL---------DVPAWEGTDTGRRLLAANAH 349
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ-----------VANLQGIWNEDLSPTWD 387
D L LLF +GRYLLISSSRPG ANLQG+WN +L W
Sbjct: 350 PGGPLAADAGLAALLFNYGRYLLISSSRPGPAGSGKGSAWRGVPANLQGLWNAELPAPWS 409
Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
S NINL+MNYW + P L+EC PLF + + + G+ A+ Y A GW +HH +DI
Sbjct: 410 SNYTTNINLQMNYWGAEPTGLAECVVPLFALIEAMQVTGAAVAREYYGARGWTVHHNSDI 469
Query: 448 WAKSSA---DRGKVVWALWPMGGAWLCTHLWEHYNY---TMDRD---FLEKRAYPLLEGC 498
WA + W+ WPM G WL HLWEH + T+DRD F A+P + G
Sbjct: 470 WAYAKPVGHGAHSPEWSYWPMAGLWLVRHLWEHLQFGAATVDRDKAGFARDAAWPAIRGA 529
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---GKL--ACVSYSSTMDMAIIREVFSA 553
A F LD L E DG L T PSTSPE+ F A D G+ V+ SSTMD+ + +VF
Sbjct: 530 AEFALDLLAELPDGSLGTGPSTSPENTFAAVDPSSGRRIQGSVAQSSTMDLTLTGDVFRM 589
Query: 554 IISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGL 613
+ + L + D ++++ ++LPRL + DG + EW D ++ E HRH+SHL+
Sbjct: 590 LDALGRDLGMDADPVLDEARRALPRLPAPEPGRDGKLREWLADPEEWEPGHRHVSHLYLA 649
Query: 614 FPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLF-N 672
+PG T + +L A +L RG+E GWS+ WK L +RL E +++ F +
Sbjct: 650 YPGDT---PLSAELEAAVRASLDGRGDEATGWSLAWKILLRSRLRQPEKVSDLLRLFFRD 706
Query: 673 LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS-----TLNDLYLLPAL 727
+ P + GGLY NLF AHPPFQID N GF A +AE L+QS L+++ LLPAL
Sbjct: 707 MSTPRGGQ--SGGLYPNLFGAHPPFQIDGNLGFVAGLAECLLQSHRLVDGLHEIELLPAL 764
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVK-V 786
P + +G GL+AR G V + W+DG L + + + +H H GT+V+ V
Sbjct: 765 P-AELPAGRAAGLRARPGVEVDLGWQDGRL----VRARLATGEHRRVLVRH--GTAVQDV 817
Query: 787 NLSAG 791
L G
Sbjct: 818 RLRPG 822
>gi|429847882|gb|ELA23431.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 798
Score = 367 bits (943), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 252/766 (32%), Positives = 369/766 (48%), Gaps = 77/766 (10%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNGRLG VWGG +ETL +NEDT+W+G D T P+A L R L SG+ E
Sbjct: 42 ALPIGNGRLGGTVWGGA-NETLTINEDTIWSGPIQDRTPPNALATLPVARKLFLSGKITE 100
Query: 87 ATAASVKLFGHPAD----VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
++ PA+ + G+++L+F S E Y R LD + Y+
Sbjct: 101 GGQLVLREM-TPAEKSERQFGYFGNLDLDFGHSG---NLENYVRWLDTKQGNSGSSYAFD 156
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDNHSYVNGNNQIIMEGRC 199
V FTRE +S P V+ + + SE G+L+ S L ++L N + G +
Sbjct: 157 GVNFTREFVASYPAGVLAARFTSSEEGALNLKASFSRLANILVNVASTAGGVNSVTLMSS 216
Query: 200 PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSS 259
G+ + D I F+ K + +GS VL + +++
Sbjct: 217 SGQPL--------DENPILFTGQARF----------VAPGAKFENDGS---VLRITGATA 255
Query: 260 FDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
D F ++ S+ + +E L + YSDL L D L R SI L +S
Sbjct: 256 IDLFFDAETNYRFASQDEWEAEIDRKLNAALTKGYSDLRDEALKDSSSLLGRASIDLGKS 315
Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQV--- 371
P+ + +P+ ERV + + D L L + GR++L+ +SR T+
Sbjct: 316 PR----------GLSALPTDERVAIARNNSSDVELSTLTWNLGRHMLVGASR-NTEADID 364
Query: 372 --ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
ANLQGIWN + W +NIN EMNYW + P NL E QEPLFD + + G
Sbjct: 365 MPANLQGIWNNKTTAAWGGKYTININTEMNYWSAGPTNLIETQEPLFDLMKVANPRGKAM 424
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A+ Y G + HH D+W A +WPMG AWL H+ +HY++T D+ FL
Sbjct: 425 AKAMYGCDGTMFHHNLDVWGDPGATDNYTSSTMWPMGAAWLVQHMVDHYHFTGDKTFLAD 484
Query: 490 RAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDM 544
AYP L A+F + E H+GY T PS SPE+ F+ P G+ + MD
Sbjct: 485 VAYPFLIDVATFYECYTFE-HEGYRITGPSLSPENTFVVPSNFSVAGRSEPMDIDIPMDN 543
Query: 545 AIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
++ +VFSAII AA++L + N+D ++K LPR++P +I G I+EW ++K+
Sbjct: 544 QLMHDVFSAIIEAADILGIDDTNQD--LKKAKDFLPRIKPAQIGSKGQILEWRYEYKESA 601
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWARLH 658
HRHLS L+ L PG + N L +AA+ L +R + G GWS TW ++AR
Sbjct: 602 PSHRHLSPLYALHPGKEFSPLVNETLSEAAQVLLDRRRDAGSGSTGWSRTWMINMYARSF 661
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
A+ VK F + + + G FQID N+GFT+ + EML+QS
Sbjct: 662 RGADAWEQVKGWFATFPTANLWNTDKG---------STFQIDGNYGFTSGITEMLLQSHT 712
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+++LPALP + +G KGL ARG + + W++G GI S
Sbjct: 713 GTVHILPALPGEAVPTGSAKGLVARGNFIIDVEWENGAFKRAGITS 758
>gi|374992668|ref|YP_004968163.1| alpha-L-fucosidase [Streptomyces bingchenggensis BCW-1]
gi|297163320|gb|ADI13032.1| Alpha-L-fucosidase [Streptomyces bingchenggensis BCW-1]
Length = 789
Score = 367 bits (942), Expect = 1e-98, Method: Compositional matrix adjust.
Identities = 259/773 (33%), Positives = 363/773 (46%), Gaps = 65/773 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL-S 73
I PA F D+ IGNG LG + G V +E + LN D+LW+G P + +P L
Sbjct: 6 IQLTEPATAFHDSFLIGNGSLGGTLRGAVGTERIDLNLDSLWSGGPVTAEDTGSPAGLLP 65
Query: 74 DVRSLVDSGQYAEATAASVKLFGHP-ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
+R+ + + + + G + YQ LG +E + D+ Y+R L+L
Sbjct: 66 QLRAAIRAEDNVRVEKLAQAMMGPGWTESYQPLGWLEWHYADTSDATG---YQRRLNLAD 122
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A Y E F S PD V+V ++G G+ S V L + + H +
Sbjct: 123 AVATTGYGPAGAEVEMSSFVSAPDNVLVVTVTGP--GAASHPV-LPTFVSPHPVTTAAPR 179
Query: 193 ---IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
++ GR P + +P N D+ + + ++ G +
Sbjct: 180 PGLLVATGRVPARVLP---NYVDEEPAVVYGEDEPDGAGTVAAGAGFAVAVAVERTGPEA 236
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSI-RNLSYS--DLYTRHLDDYQKLFH 306
L+ A+S F G PS D + + SA +++ R L+ + L RH+ DY+ F
Sbjct: 237 LRLIAAAASGFRGYDRRPS---ADLAALARSAEETVTRALTRTAEQLVQRHVQDYRSYFD 293
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
RV + LS SP DP+ ELLF FGRYLLISSSR
Sbjct: 294 RVDLDLSASPA------------------------ADHGDPARAELLFHFGRYLLISSSR 329
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
PGT+ ANLQGIWN D+ P W + NIN+EMNYW + L + P+ L+ +G
Sbjct: 330 PGTEAANLQGIWNIDVRPGWSANYTTNINVEMNYWAAESTALEDVHGPMLTLADDLAESG 389
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+ TA Y A+G V+HH TDIW S+ +G WA WP G WL H+W+HY Y + DF
Sbjct: 390 TATAARYYGAAGAVVHHNTDIWRFSTPVKGDTQWATWPTGLYWLAAHVWDHYEYGGNDDF 449
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDG-KLACVSYSSTMDMA 545
A + A F LD L+ DG L T+PSTSPEH F+ P + A VS +TMD
Sbjct: 450 GAGPALRVHRSAALFALDMLVPDDDGLLVTSPSTSPEHRFVLPPAPRGAAVSEGTTMDQE 509
Query: 546 IIREVFSAIISAAEVLEK-NEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
++ EV S ++ AE + ++D L+ + +L LR I G ++EW + E H
Sbjct: 510 LVHEVLSRYVTLAERFGRGDDDVLLARARHALGALRLPGIGASGELLEWKDERPGSEPGH 569
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWARLHDQE 661
RHLSHL+G+ PG IT P++ AA K L R + G GWS W L ARL D
Sbjct: 570 RHLSHLYGIHPGTRITEGGTPEVFAAARKALATRLQHGSGYTGWSQAWILCLAARLRDTG 629
Query: 662 HAYRMVKRLFN------LVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
A R + L N L+D + GG FQID N G A + E+LVQ
Sbjct: 630 LAERSLDVLLNDLTSWSLLDLHPHSEWPGGYI---------FQIDGNLGAVAGMVELLVQ 680
Query: 716 STLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
S + LL LP W SG V G++ RGG TV + W G+L + + +S
Sbjct: 681 SHEGAVSLLKTLP-RGWRSGHVAGIRCRGGLTVDVDWDAGELTTATVRTGFSG 732
>gi|329923050|ref|ZP_08278566.1| hypothetical protein HMPREF9412_5028 [Paenibacillus sp. HGF5]
gi|328941823|gb|EGG38108.1| hypothetical protein HMPREF9412_5028 [Paenibacillus sp. HGF5]
Length = 767
Score = 367 bits (942), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 271/817 (33%), Positives = 407/817 (49%), Gaps = 94/817 (11%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA- 71
+K+ + PA+ ++ +PIGNGR+G +V E + E T W+G P KA
Sbjct: 4 MKLWYTKPAQGWSQGLPIGNGRMGNVVVSTPDREIWNITETTYWSGQPEPAQGRSNSKAD 63
Query: 72 LSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSHLK------ 118
L +R G Y E + K FG + Q++ LEFD H+K
Sbjct: 64 LERMRQHFFQGDYREGDRLAKKHLEPEKLNFGTNLGLCQVV----LEFD-HHVKPSEGGR 118
Query: 119 ---YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGS-LSFN 174
AE + RELDL A AR + E RE F+S+ DQVIV +I S S +SF
Sbjct: 119 QDAAAEPLFHRELDLQEAVARSLCEIDGAEMAREVFASHADQVIVARIRSSHGSSGVSFR 178
Query: 175 VSLDSLLDN---HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
+S+ +N H+ V G + I +G+ + I +G+ +++ +
Sbjct: 179 ISIRG--ENGPFHAVVTGKDTIDFQGQA-WEGIHSNGECGVSCQGL-------LRVVTEG 228
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRN--LS 289
G +S ++D + V G+D A + +N ++ + SALQ + L
Sbjct: 229 GQVSCMDDTII-VSGADEAAIYFA---------VNTDYRQEGESWREKSALQLEQAVLLG 278
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDP 347
Y +L +HL DYQ L+ RV + L S ++P+ ER+ F+ +D
Sbjct: 279 YDELKAKHLADYQPLYARVRLDLGSSEHA------------SLPTDERIGRFKQGKRDDQ 326
Query: 348 SLVELLFQFGRYLLISSSRPGTQV-ANLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSL 404
+L L +Q+GRYL IS SR + + +LQGIWN E W H+++N +MNY+ +
Sbjct: 327 ALFALFYQYGRYLTISGSRQDSILPMHLQGIWNDGEANKMAWSCDYHLDVNTQMNYFPTE 386
Query: 405 PCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWP 464
NLSE EPL ++ LS+ G A+ Y A GWV H ++ W +S G W L
Sbjct: 387 AANLSESHEPLMRYIQQLSVAGCSAARHYYDAEGWVAHVFSNAWGFASPGWG-TSWGLNV 445
Query: 465 MGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPE 523
GG W+ THL EHY Y D+ FLE+ AYP+L+ A+F +D++ + G+L T PS SPE
Sbjct: 446 TGGLWIATHLIEHYAYNRDQAFLEELAYPVLKEAAAFFMDYMTVHPQYGWLVTGPSNSPE 505
Query: 524 HEFIA--PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
+ F P+ +S TMD ++R++ + + AA+ L +E+ L +K +L +L P
Sbjct: 506 NSFYTSKPEDGHQQLSMGPTMDQVLVRDLLAFCVKAAQTLGVDEE-LQQKWQTALDQLPP 564
Query: 582 TKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
I + G + EW +D+++ + HRHLSHL+ L+PG IT P+L AA TL+ R
Sbjct: 565 LIIGKKGQLQEWLEDYEEAQPEHRHLSHLYALYPGSQITPHHTPELAAAARVTLENRNSR 624
Query: 642 GPGWSITWKTAL----WARLHDQEHAYRMVKRLF------NLVDPEHEKHFEGGLYSNLF 691
I + AL +ARLHD + A + + L N++ + K G +N+F
Sbjct: 625 ADLEDIEFTAALFGLFYARLHDGDQAVQHIAHLIGELCFDNMLT--YSKPGVAGAEANIF 682
Query: 692 AAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSIC 751
ID NFG TAA+AEML+QS +++LLPALP W +G VKGLKA+G V +
Sbjct: 683 V------IDGNFGGTAAIAEMLLQSHEGEIHLLPALP-AMWPTGSVKGLKAKGNIEVDMS 735
Query: 752 WKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNL 788
W+ G L E + N S S K L Y G ++V L
Sbjct: 736 WEHGKLVEARVKGNESG----SVKVL-YGGREMEVGL 767
>gi|423223092|ref|ZP_17209561.1| hypothetical protein HMPREF1062_01747 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392639998|gb|EIY33805.1| hypothetical protein HMPREF1062_01747 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 776
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 257/792 (32%), Positives = 390/792 (49%), Gaps = 68/792 (8%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + A+PIGNGR+G M++G +E + +NE+T+W G P NP P+ ++ +R+L+
Sbjct: 32 PASDWKSALPIGNGRIGGMIFGDPSTEQIVINENTIWCGPPLPPNNPKGPELINKMRNLI 91
Query: 80 DSGQYAEATAASVKLFG----HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
+G+Y EA K F A YQ G + ++F D K A Y+R LD A
Sbjct: 92 FNGKYEEAVIVCEKEFADGVHENARSYQPFGFLNIDFKD---KGAISNYKRWLDYTKAIT 148
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
V Y+ V +TRE F S P++V+V +I+ + G +SF + N +
Sbjct: 149 YVSYTQNGVTYTREAFVSKPNEVMVVRITADKPGQVSFKSKYTRPFGATTKAENNRSQYV 208
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV 255
+G+ + N + G++F I I ++ G I A +++ ++ +++
Sbjct: 209 QGQAYAE--------NGEFVGVKFEGI--INYYNEGGKIKA-NGTDIEINNANSVTIMIA 257
Query: 256 ASSSFDGPFINPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
S+ + N D+K T L + L Y L H+D+Y L++R S
Sbjct: 258 ISTDY-----NIHDTKNVLTHNRKKICEKQLSQAQKLGYKKLKQTHIDEYSALYNRSSF- 311
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF---GRYLLISSSRPG 368
DI +T N P +R++ + + S ELLF++ RYL ISSSR G
Sbjct: 312 ------DIAFNTPVNNN----PIDKRIQLAASGQIDS--ELLFEYYNYCRYLFISSSRKG 359
Query: 369 TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
NLQGIWN + W S H+N+N++ YW + NLSEC EP+F L NG +
Sbjct: 360 GLPMNLQGIWNPLMLAPWRSNFHINVNIQEAYWFAEQANLSECHEPMFTLTENLIKNGKE 419
Query: 429 TAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
TAQV + G V H+TD W + K W + AWLC H EHY YT+D++FL
Sbjct: 420 TAQVMFGTKRGSVAGHRTDAWFYAPPTFLKAHWGMSITNAAWLCLHHMEHYRYTLDKEFL 479
Query: 488 EKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+ RA P+L A F +DWL+ + G L + P+ SPE+ F +GK+A ++ S T D I
Sbjct: 480 KTRALPVLRETALFFVDWLVPDPRSGKLVSGPTASPENRFKV-NGKVASLTMSCTYDQEI 538
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
I F + A ++L + + VE V S+ +L IA DG +MEW ++ ++ E HRH
Sbjct: 539 IWNTFRDFLEACKILGISNEETVE-VEASMKKLSMPTIANDGRLMEWTEELEETEPGHRH 597
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWARLHDQEHA 663
+SHL+G+ PG+ IT +K P L A K+L R GWS+ W T++ ARL + + +
Sbjct: 598 ISHLWGMMPGNRITQDKTPHLVDAVRKSLDYRLNHNYHAQGWSLGWVTSMLARLKEGDKS 657
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFA-AHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
M+ + ++ Y N+F AH Q+ G A+ E+++QS + +
Sbjct: 658 LDMM-----------QHNYFTKAYPNMFVDAHGRPQVGDMMGVPLAMIELILQSHTDYID 706
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LLP+LP W G V GL ARG + WK G L I S L Y G
Sbjct: 707 LLPSLP-TAWKDGKVTGLCARGAFVFDMEWKAGKLISTNIKSLKGGKC-----LLRYEGK 760
Query: 783 SVKVNLSAGKIY 794
+++ AGK Y
Sbjct: 761 VKELSTEAGKSY 772
>gi|336415344|ref|ZP_08595684.1| hypothetical protein HMPREF1017_02792 [Bacteroides ovatus
3_8_47FAA]
gi|335940940|gb|EGN02802.1| hypothetical protein HMPREF1017_02792 [Bacteroides ovatus
3_8_47FAA]
Length = 648
Score = 366 bits (940), Expect = 2e-98, Method: Compositional matrix adjust.
Identities = 225/646 (34%), Positives = 347/646 (53%), Gaps = 57/646 (8%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PA+++++A+PIGN RLGAMV+GG+ E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAQNWSEALPIGNSRLGAMVYGGIEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATAA--SVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
VR L+ G+ EA + L Y LG + LEF + + R+L+L
Sbjct: 82 PVVRKLIFEGRNKEAQRLIDANFLTQQHGMSYLTLGSLYLEFPEHQ---NASGFYRDLNL 138
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
AT +Y V +V +TR F+S D VI+ I S++ +L+F ++ + L + V +
Sbjct: 139 ENATTTTRYQVDDVTYTRTTFASFTDNVIIMHIKASKANALNFTIAYNFPLVHKVNVQND 198
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS-DDRGTISALEDKKLKVEGSDW 249
+ C GK + +G++ + E +I GT+ + EG++
Sbjct: 199 KLTVT---CQGK----------EQEGLKAALRAECQIQVKTNGTLRPAGNTLQINEGTE- 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N D D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQDVSADESRRTSEYLKRAMQIPYEKALKSHIAYYKKQFDRVR 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L TD S+ + + +R+++F ED ++ LLF +GRYLLISSS+PG
Sbjct: 301 LTLP-------TDKTSQ-----LETPKRIENFGNGEDMAMAALLFHYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSATGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GW+ HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCRGWMAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPVYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHH 604
I + + A+ + + + + + ++L +L P +I + + EW +D +P+ H
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGKHNQLQEWLEDIDNPKDEH 572
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
RH+SHL+GL+P + I+ NP+L +AA TL +RG++ GWSI WK
Sbjct: 573 RHISHLYGLYPSNQISPYSNPELFQAARNTLLQRGDKATGWSIGWK 618
>gi|429852446|gb|ELA27582.1| alpha-l-fucosidase [Colletotrichum gloeosporioides Nara gc5]
Length = 796
Score = 366 bits (939), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 239/761 (31%), Positives = 372/761 (48%), Gaps = 52/761 (6%)
Query: 22 KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDS 81
+ F +A+PIGNGRLGAM+ G E ++LNE+++W G P D A AL +R +
Sbjct: 37 RDFYEALPIGNGRLGAMIHGYTDKELIRLNEESIWNGGPRDKIPTTALDALEPLREQILD 96
Query: 82 GQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVK 138
G+ EA V F D YQ G++ L+F+ H YR LD++ + +
Sbjct: 97 GRLTEADQNWVANFTPEYDDMRRYQPAGELRLDFN--HTLNETSGYRHSLDVSKGLSSLS 154
Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGR 198
Y G VE+TRE F + P V+ + S + SGSLS + SL N +
Sbjct: 155 YVFGGVEYTREAFGNAPKNVLAFRFSCNSSGSLSLDASLS---------RDRNVTELTAD 205
Query: 199 CPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
G+ + +D +F + ++ + D G I + L + + ++ A +
Sbjct: 206 AAGRILKLDGTGEEDDT-YRFVSQAQVLLPDGVGDIIS-NGTALHIRNATDVFIIYTAET 263
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
+F +P + + L++ + Y + + DY++ + R SI S
Sbjct: 264 AFR----HPDATMAQLETIVNGRLETAQEAGYETIQREAVKDYKQYYDRTSIDFGTS--- 316
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIW 378
+ S++ I + +R + TD P L+ L F G+YLLI SSRPG+ ANLQGIW
Sbjct: 317 --QEIGSKDTIARLEDWKRGSNITTD--PELMALQFNVGKYLLIQSSRPGSLPANLQGIW 372
Query: 379 NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASG 438
N D P WDS +N+NLEMNYW + P NL E P+ DFL L++ GS+ A+ Y A G
Sbjct: 373 NRDFGPPWDSKFTINVNLEMNYWPAQPLNLPEIAGPVVDFLDRLAVTGSEVAKGMYGADG 432
Query: 439 WVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
W HH TDI + + A +P+GGAWL E++ +T D + R P+L+G
Sbjct: 433 WCCHHNTDITGDCTPFHAITIAAPYPLGGAWLAFEAIEYFRFTGDTTYARDRILPILKGA 492
Query: 499 ASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDMAIIREVFSA 553
F+ W E DG+ TNPS SPE+ + P+ G+ + + D AI+ E+ S
Sbjct: 493 MDFIYSWATE-RDGWRITNPSCSPENSYYIPENMTVAGETTGIDAGAMNDRAIMWEIMSG 551
Query: 554 IISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGL 613
+ +E L +E A + + +++P G ++E+++++++ + HRH S L
Sbjct: 552 FLEISEALSSDEGADRARSFRD--KIQPPVAGSFGQLLEYSREYRENQPGHRHFSPLVCA 609
Query: 614 FPGHTITIEKNPDLCKAAEKTLQKRGEEGPG---WSITWKTALWARLHDQEHAYRMVKRL 670
PG +T P+ A K L+ R + G G W++TW + L ARL D +A + L
Sbjct: 610 HPGTWVTPLTTPEYADMAYKLLRHRMDNGGGVNSWAVTWASLLHARLFDATNALKNAMEL 669
Query: 671 FNLVDPEHEKHFEGGLYSNLFAAHPP-FQIDANFGFTAAVAEMLVQSTLNDLYLLPALP- 728
+ +++NLF+ + FQID N GFTAA+ EM +QS ++L PA+P
Sbjct: 670 LSRW-----------VHNNLFSRNGSYFQIDGNSGFTAAIVEMFLQSHAGVVHLGPAIPP 718
Query: 729 -WDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
SSG +G ARGG V + W +G + + I S N
Sbjct: 719 AGQGLSSGSFRGWIARGGFEVDMTWSNGVVVQAEIISLLGN 759
>gi|121719440|ref|XP_001276419.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
gi|119404617|gb|EAW14993.1| conserved hypothetical protein [Aspergillus clavatus NRRL 1]
Length = 781
Score = 365 bits (938), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 257/766 (33%), Positives = 383/766 (50%), Gaps = 75/766 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PAK F+ +PIGN RL A +WG + ++ + LNE+++W+G D NP + + + VR
Sbjct: 29 YTSPAKDFSSTLPIGNSRLAAAIWGSL-TDNITLNENSIWSGPFQDRVNPRSYEGFTQVR 87
Query: 77 SLVDSGQYAEATAAS-VKLFGHPAD--VYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
S++ G+ + A + V + G P Y LG ++L+F + Y R LDL
Sbjct: 88 SMLQDGKISAANQLTLVDMAGIPTSPRAYNPLGALKLDFGHDTVN----NYTRFLDLGMG 143
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V+Y NV ++RE+ +S+PD ++ ++ S GSL+ SL+ YV N
Sbjct: 144 VAGVEYEYDNVTYSREYVASHPDGILAVRLRASTPGSLNVACSLE----RSRYVKSNTAN 199
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+ R + KAN I F A E +I G +S+ + + + G+ +
Sbjct: 200 V---RKSWGTLTLKANTGQANDPISFVA--EAQIVSVGGHMSS-DGSSVVINGASTIDIF 253
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL- 312
A +S+ F DS+ S+ + A + TR DY L RV + L
Sbjct: 254 FDAQTSYR--FFE-EDSRAAQLSKQLDAAVKQGYPAVKKAATR---DYASLTSRVRLNLG 307
Query: 313 -SRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSRPGT 369
S + TD R+ +++ D DP L L+F FGR+LLI+SSR G
Sbjct: 308 SSGAAGGFSTDV-------------RLFNYKKDANSDPELATLMFNFGRHLLIASSRGGD 354
Query: 370 QV---ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
ANLQGIWNED P W V++NLEMNYW + NL+E P+ D + + +G
Sbjct: 355 TPGLPANLQGIWNEDYEPAWGGKYTVDVNLEMNYWPAQVTNLAETFGPVVDLMDTVVPHG 414
Query: 427 SKTAQVNY-LASGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
AQ Y +G+V+HH TD+W ++ D G AW+ +L E Y +T D+
Sbjct: 415 KDVAQRMYHCDAGYVLHHNTDLWGDAAPVDNGT----------AWMSMNLIEQYRFTQDK 464
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYS 539
L++R +PLL+ A+F +L E H+G+ + PS SPEH FI PD GK A + S
Sbjct: 465 SLLKERIWPLLKEAANFYYCYLFE-HEGHYISGPSISPEHAFIVPDEMSVPGKEAGIDLS 523
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
TMD ++++E+F+A+I A L D ++K K L +L P I G I+EW +++ +
Sbjct: 524 PTMDNSLLQELFAAVIEACTTLGITGDD-IDKAQKYLSKLPPPPIGSYGQILEWRREYNE 582
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWAR 656
E HRH+S + GL+PG +T N L AA+ L R E G GWS TW L+AR
Sbjct: 583 TEPGHRHMSPILGLYPGSQMTPAVNKTLADAAKVLLDHRIEHGSGSTGWSRTWTMNLYAR 642
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L D + + + + + L++ FQID NFG+TAA+AEML+QS
Sbjct: 643 LLDGDQVWHHAQNFL-------QTYPSDNLWNTDHGPGSAFQIDGNFGYTAAIAEMLLQS 695
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
++LLPALP G V GL ARG + + W G L + I
Sbjct: 696 HAV-VHLLPALP-PAVPDGSVTGLVARGNFVIDMTWAQGMLKQAKI 739
>gi|393782601|ref|ZP_10370784.1| hypothetical protein HMPREF1071_01652 [Bacteroides salyersiae
CL02T12C01]
gi|392672828|gb|EIY66294.1| hypothetical protein HMPREF1071_01652 [Bacteroides salyersiae
CL02T12C01]
Length = 804
Score = 364 bits (934), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 239/785 (30%), Positives = 390/785 (49%), Gaps = 89/785 (11%)
Query: 17 FNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD-------- 67
F PA+++++ A+ IGNG +GA +G V E + E T WTG P ++ PD
Sbjct: 35 FTYPARNWSEQALHIGNGYMGASFYGDVEKERFDIAEKTFWTGGP--HSVPDFNYGVVKG 92
Query: 68 APKALSDVRSLVDSGQYAEATAAS-VKLFGHPADV--YQLLGDIELEFDDSHLKYAEETY 124
++ +R + ++AEA + S + + G + + ++G++ ++F + + Y
Sbjct: 93 GKDKIAAIRRSITDRRFAEADSLSRLYMVGDYTNYGYFSMVGNLFVDFGKKNQPV--QNY 150
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
R +DL+T+ V+Y+ G+V F RE+F S PD+++ + + G +SF++S +
Sbjct: 151 LRGIDLSTSRGFVEYTQGDVRFNREYFCSYPDKLMALHFTADQKGKISFSLSHSLVYQPE 210
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
G +++I G G G+ ++ + +K+ G+I + +++ V
Sbjct: 211 KVTEGKDELIFNGIIQGN-------------GLGYT--IRMKVLHQGGSIK-VGHQQITV 254
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
EG+D A + + + + P + P + ++S Y + H+ DYQ L
Sbjct: 255 EGADEATVFYTVDTEYSP--VYPLYKGEKPRQTTEKIIKSAITKGYETVKHTHISDYQTL 312
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLI 362
++RV LS DT SE+ +P+ RVK Q +D SL L F RYLLI
Sbjct: 313 YNRVKFTLS-------GDTASEK----LPTDIRVKQLQQGFTDDASLKVLWFNLSRYLLI 361
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
S+SRPGT +NLQG+WN W+ NINL+ YW P L EC+E +++ L
Sbjct: 362 SASRPGTLPSNLQGVWNTFEKAPWNGNFQSNINLQEMYWGCGPTQLPECEEAYLEWIEGL 421
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G KTA Y GWV H +IW + ++W L+P G AW C HLWEHY +
Sbjct: 422 VEPGRKTAGEYYGTKGWVSHSTGNIWGHTVPGD-DILWGLYPSGAAWHCRHLWEHYAFGG 480
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST- 541
D+ +LE + YP+++ A F L+ ++E ++ PS S EH +G + V YS+
Sbjct: 481 DKSYLETKGYPIMKEAAEFWLENMVEYQKHFI-IAPSVSAEHGIEMKNG--SPVDYSTAN 537
Query: 542 --------------MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
D+ ++ ++++ +I A+E L + A EKV + +L P KI
Sbjct: 538 GEQTAGRIFTLPAYQDIEMVYDLYTHVIKASECL-GIDSAFREKVTIARNKLLPLKIGRY 596
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE------- 640
G + EW D +P HHRH++HL+ L+PG+ I+ + P L A +K+L+ RG+
Sbjct: 597 GQLQEWIDDVDNPRDHHRHIAHLYALYPGNMISYSQTPALALAVKKSLEMRGKGKFGERW 656
Query: 641 --EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP-F 697
G WS+ W+TALW RL++ + A ++ E G Y N+ +
Sbjct: 657 PHTGGNWSMAWRTALWTRLYEGDQAIGTFNQMIK----------ESG-YENMMSNQSGNM 705
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
Q+DA + AEML+QS ++LLPALP +W G ++GL AR G V++ WK G L
Sbjct: 706 QVDATMATSGLFAEMLLQSQEGFIHLLPALP-TEWPEGKIEGLMARNGYRVNMEWKYGKL 764
Query: 758 HEVGI 762
+ I
Sbjct: 765 MKAEI 769
>gi|253574361|ref|ZP_04851702.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
gi|251846066|gb|EES74073.1| conserved hypothetical protein [Paenibacillus sp. oral taxon 786
str. D14]
Length = 793
Score = 362 bits (930), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 265/824 (32%), Positives = 410/824 (49%), Gaps = 76/824 (9%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDAPKA 71
K+ ++ PA +++ +P+GNGR+GA+V E L E T W+G + A
Sbjct: 12 KLWYDKPAAGWSEGLPVGNGRIGAIVMAAPEREVWNLTESTYWSGQADETASAASGGKAA 71
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEET----- 123
L+ +R + +G YA + + P + + D+ +EF S ET
Sbjct: 72 LAAIRERLFAGDYAGGDRLAKQALQPPKRNFGTHLAMCDVVIEFAPSGEPSETETGAVNG 131
Query: 124 ----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+RRELDL+TA RE F+S+ D V+V++I +G +SF + L
Sbjct: 132 ACSPFRRELDLSTALLTTTSGQPGSTLVRELFASHADDVLVSRIWSEAAGGVSFTLGLAG 191
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
L V+ + +E R GK + +D G++ +E+ D RG +++
Sbjct: 192 LTPEFE-VSASGMAALEFR--GKAT--ETVHSDGACGVRCRGRIEL---DTRGGSLYVQN 243
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+L V G+D A + L ++ + +S+ + + A ++ Y L HL
Sbjct: 244 DRLVVRGADEACIYLTVATDYR------CESRSWELAPRLQASLALSK-GYDQLKADHLA 296
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGR 358
DY+ LF RVSI+L S E +P+ +R++ Q DP L L Q+GR
Sbjct: 297 DYEPLFRRVSIELGPS-----------EEAAKLPTDQRIRLLRQGYSDPQLFALFLQYGR 345
Query: 359 YLLISSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
YL ++ SR + + +LQGIWN E W H+++N EMNY+ + +L E Q+PL
Sbjct: 346 YLTLAGSREDSPLPLHLQGIWNDGEACRMGWSCDYHLDVNTEMNYYPTEVVHLGESQQPL 405
Query: 416 FDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTHL 474
+L L+ G KTA+ Y + GWV H +++W + D G W L GG WL +
Sbjct: 406 MRYLEDLARAGQKTARDVYGSPGWVAHVFSNVWGFT--DPGWDTSWGLNVTGGLWLAMQM 463
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKL 533
EHY + +DR FLEK+AYP+L A F LD++ + G+L T PS SPE+ F +
Sbjct: 464 IEHYRFGLDRVFLEKQAYPVLREAALFFLDYMTVHPKYGWLVTGPSNSPENHFYPGRPEE 523
Query: 534 AC--VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIM 591
C +S STMD A++RE+F+ + AAE+LE++ + L ++ ++P L P +I + G +
Sbjct: 524 GCWQLSMGSTMDQALVRELFTFCLEAAELLEEDVE-LRSRLSSAIPLLPPLQIGKKGQLQ 582
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKT 651
EW +D+++ + HRHLSHLF L+P H IT E+ P+L AA TL+ R ++ I +
Sbjct: 583 EWLEDYEEAQPEHRHLSHLFALYPAHQITPEETPELAAAARVTLENRMQQDELEDIEFTA 642
Query: 652 AL----WARLHDQEHAYRMVKRLF------NLVDPEHEKHFEGGLYSNLFAAHPPFQIDA 701
AL +ARL++ + A + + L NL+ + K G +N+F ID
Sbjct: 643 ALFGLFFARLYNGDRALKHISHLIGELCFDNLLS--YSKAGIAGAETNIFV------IDG 694
Query: 702 NFGFTAAVAEMLVQSTL-NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
NFG TAA+AEML+QS ++ LLPALP W +G V GL+A+G V + W+ G L
Sbjct: 695 NFGGTAAIAEMLLQSRPGGNIRLLPALP-AAWPTGRVTGLRAKGNAEVDLAWEAGRLSSA 753
Query: 761 GIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCTN 804
+ YS TL V AG Y F+ L N
Sbjct: 754 -VVRTYSPGTF----TLSLGDRRVTFEAKAGGEYRFDGALTLQN 792
>gi|417920435|ref|ZP_12563942.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
australis ATCC 700641]
gi|342829385|gb|EGU63741.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
australis ATCC 700641]
Length = 1209
Score = 361 bits (926), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 264/812 (32%), Positives = 403/812 (49%), Gaps = 126/812 (15%)
Query: 14 KITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------- 60
++T+N PA D A+P+GNG +GA V+G + E ++ NE TLW+G P
Sbjct: 123 QLTYNQPATPSYDGWEKQALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYN 182
Query: 61 -GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDS 115
G+Y D K L+++R +++G +A + + P + Y GDI + F++
Sbjct: 183 GGNY--EDRHKVLAEIRKALEAGDRQKAKQLAEENLVGPNNAQYGRYLSFGDIFMVFNNQ 240
Query: 116 HLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF- 173
T Y R LD+ A YS F RE FSS PD V VT +S +L F
Sbjct: 241 KKGLENVTDYHRALDITQAITTTAYSQDGTTFKRETFSSYPDDVTVTHLSKKGDKTLDFT 300
Query: 174 --NVSLDSLLDNHSY------------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
N + LL N Y +N I+++G K N G+QF
Sbjct: 301 LWNSLTEDLLANGDYSWEYSNYKQGAVTTDSNGILLKGTV-------KDN------GLQF 347
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSES 278
++ L IK G ++A +D L V G+ +A LLL A ++F NP ++ +KD E+
Sbjct: 348 ASYLGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDVEN 400
Query: 279 M--SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
S +++ + Y L H++DYQ LF+RV + L S T + E
Sbjct: 401 TVKSIVEAAKAKDYETLKHDHIEDYQSLFNRVQLNLGGSKS-------------TQTTKE 447
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNI 394
++++ ++ L EL FQ+GRYL+ISSSR T ANLQG+WN +P W+S H+N+
Sbjct: 448 ALQTYNPEKGQQLEELFFQYGRYLIISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNV 507
Query: 395 NLEMNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHH 443
NL+MNYW + NL+E P+ +++ L G SK Q N GW++H
Sbjct: 508 NLQMNYWPAYMSNLAETARPMVNYIDDLRYYGRIAAKEYAGIESKEGQEN----GWLVHT 563
Query: 444 KTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASF 501
+ W D W P AW+ +++++Y +T D +L+++ YP+L+ A F
Sbjct: 564 QATPFGWTTPGWD---YYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKF 620
Query: 502 LLDWL--IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
+L + D ++ ++PS SPEH ++ +T D +++ ++F + AA
Sbjct: 621 WNSFLHYDQTSDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAAN 670
Query: 560 VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGL 613
L ++D LV +V +L+P I ++G I EW ++ F + E HHRH+SHL GL
Sbjct: 671 HLNVDQD-LVTEVKTKFDKLKPLHINQEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGL 729
Query: 614 FPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNL 673
FPG T+ + P+ +AA TL RG+ G GWS K LWARL D A+R++
Sbjct: 730 FPG-TLFSKDQPEYLEAARATLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLA----- 783
Query: 674 VDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWS 733
+ + NL+ H PFQID NFG T+ +AEML+QS + LPALP D W
Sbjct: 784 ------EQLKSSTLENLWDTHAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWK 836
Query: 734 SGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
G + GL ARG VS+ WK+ +L + S+
Sbjct: 837 DGQISGLVARGNFEVSMKWKEKNLESLAFLSH 868
>gi|319946487|ref|ZP_08020723.1| alpha-L-fucosidase [Streptococcus australis ATCC 700641]
gi|319747318|gb|EFV99575.1| alpha-L-fucosidase [Streptococcus australis ATCC 700641]
Length = 1643
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 264/812 (32%), Positives = 403/812 (49%), Gaps = 126/812 (15%)
Query: 14 KITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------- 60
++T+N PA D A+P+GNG +GA V+G + E ++ NE TLW+G P
Sbjct: 148 QLTYNQPATPSYDGWEKQALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYN 207
Query: 61 -GDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDS 115
G+Y D K L+++R +++G +A + + P + Y GDI + F++
Sbjct: 208 GGNYE--DRHKVLAEIRKALEAGDRQKAKQLAEENLVGPNNAQYGRYLSFGDIFMVFNNQ 265
Query: 116 HLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF- 173
T Y R LD+ A YS F RE FSS PD V VT +S +L F
Sbjct: 266 KKGLENVTDYHRALDITQAITTTAYSQDGTTFKRETFSSYPDDVTVTHLSKKGDKTLDFT 325
Query: 174 --NVSLDSLLDNHSY------------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
N + LL N Y +N I+++G K N G+QF
Sbjct: 326 LWNSLTEDLLANGDYSWEYSNYKQGAVTTDSNGILLKGTV-------KDN------GLQF 372
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSES 278
++ L IK G ++A +D L V G+ +A LLL A ++F NP ++ +KD E+
Sbjct: 373 ASYLGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDVEN 425
Query: 279 M--SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
S +++ + Y L H++DYQ LF+RV + L S T + E
Sbjct: 426 TVKSIVEAAKAKDYETLKHDHIEDYQSLFNRVQLNLGGSKS-------------TQTTKE 472
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNI 394
++++ ++ L EL FQ+GRYL+ISSSR T ANLQG+WN +P W+S H+N+
Sbjct: 473 ALQTYNPEKGQQLEELFFQYGRYLIISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNV 532
Query: 395 NLEMNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHH 443
NL+MNYW + NL+E P+ +++ L G SK Q N GW++H
Sbjct: 533 NLQMNYWPAYMSNLAETARPMVNYIDDLRYYGRIAAKEYAGIESKEGQEN----GWLVHT 588
Query: 444 KTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASF 501
+ W D W P AW+ +++++Y +T D +L+++ YP+L+ A F
Sbjct: 589 QATPFGWTTPGWD---YYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKF 645
Query: 502 LLDWL--IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
+L + D ++ ++PS SPEH ++ +T D +++ ++F + AA
Sbjct: 646 WNSFLHYDQTSDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAAN 695
Query: 560 VLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGL 613
L ++D LV +V +L+P I ++G I EW ++ F + E HHRH+SHL GL
Sbjct: 696 HLNVDQD-LVTEVKTKFDKLKPLHINQEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGL 754
Query: 614 FPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNL 673
FPG T+ + P+ +AA TL RG+ G GWS K LWARL D A+R++
Sbjct: 755 FPG-TLFSKDQPEYLEAARATLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLA----- 808
Query: 674 VDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWS 733
+ + NL+ H PFQID NFG T+ +AEML+QS + LPALP D W
Sbjct: 809 ------EQLKSSTLENLWDTHAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWK 861
Query: 734 SGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
G + GL ARG VS+ WK+ +L + S+
Sbjct: 862 DGQISGLVARGNFEVSMKWKEKNLESLAFLSH 893
>gi|332881351|ref|ZP_08449001.1| hypothetical protein HMPREF9074_04789 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|357045233|ref|ZP_09106870.1| hypothetical protein HMPREF9441_00875 [Paraprevotella clara YIT
11840]
gi|332680727|gb|EGJ53674.1| hypothetical protein HMPREF9074_04789 [Capnocytophaga sp. oral
taxon 329 str. F0087]
gi|355531816|gb|EHH01212.1| hypothetical protein HMPREF9441_00875 [Paraprevotella clara YIT
11840]
Length = 798
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 241/776 (31%), Positives = 389/776 (50%), Gaps = 74/776 (9%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDV 75
++ PA + ++P+GNGR+GAMV+GGV ET+ LNE ++W G + P + L ++
Sbjct: 29 YDAPADEWMKSLPVGNGRVGAMVFGGVNEETVALNESSMWAGEYDPNQEKPFGREKLDEL 88
Query: 76 RSLVDSGQYAEATA-ASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L G+ E A +L G H + +GD++++FD + + E YRRELDL
Sbjct: 89 RKLFFEGKLIEGNGIAGRELVGTPHSFGTHLPIGDLKIKFDYTGKEGGVEDYRRELDLTN 148
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A V + G ++ RE SSNP +V + + S+SF++ + + GN
Sbjct: 149 AVVTVSFKKGGTKYKREFISSNPQDAVVMHFTADKKQSVSFDMRMKMITAAQVRTEGNLL 208
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
+ G+ + PK G+ F + +K+ DRG + A + ++V+ +D +
Sbjct: 209 VF-----DGQALFPKLGTG----GVHFQGRVVVKV--DRGEVEA-TGETVRVKHADAVTI 256
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS--YSDLYTRHLDDYQKLFHRVSI 310
+ + + K+ ES+ + ++ + + H+ DY LF RVS+
Sbjct: 257 VADVRTDY-----------KNGQYESLCEKTVEKAIARPFETMKEEHVADYAPLFARVSL 305
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGT 369
+L+ K ++P R K+ + ++D L L FQ+GRYL I+SSR +
Sbjct: 306 KLADDSKK------------SIPVDRRWKALCEGNKDAGLQALFFQYGRYLTIASSRENS 353
Query: 370 QV-ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
+ LQG +N++L+ W S H++IN E NYW + NL+EC PLF ++ L+ +G
Sbjct: 354 PLPIALQGFFNDNLACNMCWTSDYHLDINTEQNYWLTNVGNLAECNAPLFTYIADLAHHG 413
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+KT + Y GW H ++W ++ G + W L+P+ G+W+ THLW Y YT+D+D+
Sbjct: 414 AKTVRTVYGCKGWTAHTVANVWGFTAPSEG-MGWGLFPLAGSWMATHLWTQYEYTLDKDY 472
Query: 487 LEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
L + AYPLL+G A FLLD+++E + GY+ T P SPE+ F +L S +T D
Sbjct: 473 LRRTAYPLLKGNAEFLLDYMVEDPNTGYMVTGPCVSPENSFRYQGWELG-ASMMTTCDKV 531
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHR 605
+ E+ SA + A+++L ++ A + + +L + P +I G + EW +D+++ +HR
Sbjct: 532 LAHEIMSACVQASDILGVDK-AFADSLRLALAKFPPFRINSFGGLCEWYEDYEEAHPNHR 590
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR----GEEGPGWSITWKTALWARLHDQE 661
H SHL +P IT EK+P+L +A T++ R G E WS +ARL D
Sbjct: 591 HTSHLLSFYPYAQITKEKDPELTEAVRTTIEHRLAAEGWEDVEWSRANMVCFYARLKDAA 650
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP------PFQI---DANFGFTAAVAEM 712
A + L + D E NL P PF + D N A +AEM
Sbjct: 651 KAEESLNIL--MTDFARE---------NLLTISPEGIAGAPFDVFIFDGNAAGAAGMAEM 699
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
LVQ+ + LLP LP + W G GL +GG VS WKD + + + + N
Sbjct: 700 LVQAQEGYVELLPCLPVE-WKDGSFSGLCVKGGAEVSAEWKDSRVVKASLKATADN 754
>gi|256376305|ref|YP_003099965.1| hypothetical protein Amir_2174 [Actinosynnema mirum DSM 43827]
gi|255920608|gb|ACU36119.1| conserved hypothetical protein [Actinosynnema mirum DSM 43827]
Length = 646
Score = 360 bits (924), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 189/440 (42%), Positives = 261/440 (59%), Gaps = 23/440 (5%)
Query: 328 NIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWD 387
+D P+ + S E P+L LLFQ GR+LL++SSRPGT ANLQG+WN P W
Sbjct: 199 ELDLGPAPDGPPSTWPREHPALAALLFQHGRHLLVASSRPGTLPANLQGVWNPHAEPPWR 258
Query: 388 SAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDI 447
S +NIN EMNYW + P L+EC EPL +FL L+ +G++ A+ Y GW HH TD
Sbjct: 259 SNYTLNINTEMNYWPAEPTALAECHEPLLEFLHGLAESGTRVARELYGLPGWCAHHNTDR 318
Query: 448 WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI 507
W ++ +G WA WPM GAWL HLWE Y + D +L RA+PLL G A F L WL+
Sbjct: 319 WFLATPVQGDPAWANWPMAGAWLSLHLWERYEFGGDAVWLRGRAWPLLLGAAEFCLAWLV 378
Query: 508 EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDA 567
E G L T PSTSPE+ ++ DG+ V +TMD+A+ E+ ++ A VL ++
Sbjct: 379 EDR-GELTTAPSTSPENHYLTADGREVAVGVGATMDLALTWELLDRVVRAGAVLGED--- 434
Query: 568 LVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL 627
V + ++L R+ + DG ++EW ++ +PE HRHLSHL GL+PG + IE+ L
Sbjct: 435 -VGRFAEALARIPEPPVGSDGRVLEWRDEWAEPEPEHRHLSHLVGLYPG--VRIERGSAL 491
Query: 628 CKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLY 687
+AA ++L+ RG GPGWS WK ALWARL + E A + + LY
Sbjct: 492 AEAARRSLEARGPGGPGWSHAWKAALWARLGEGERAADSLAGMP--------------LY 537
Query: 688 SNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGET 747
NL A+ PFQ+D + G+ AAVAE+L+QS L LLPALP W +G V GL+ARGG
Sbjct: 538 PNLTCAN-PFQVDGSLGYPAAVAELLLQSHRGVLELLPALP-PSWPTGRVTGLRARGGIA 595
Query: 748 VSICWKDGDLHEVGIYSNYS 767
+ + W+DG+L V + ++ +
Sbjct: 596 IDLEWRDGELRSVALTADRA 615
Score = 46.6 bits (109), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 39/114 (34%), Positives = 51/114 (44%), Gaps = 12/114 (10%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN---PDAPKALSDVR 76
PA + +A PIG+GR GAM WG LN+D LWT + AP+ + R
Sbjct: 15 PAARWEEAHPIGDGRFGAMCWG---DGRFDLNDDRLWTDPSPPDPSQPAAGAPEVVRAAR 71
Query: 77 SLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
+ +G A + G YQ LG + L + AE YRRELDL
Sbjct: 72 AAALAGDPERADELLRSVQGPDTASYQPLGTLVLGY------RAEGGYRRELDL 119
>gi|418966542|ref|ZP_13518273.1| gram positive anchor [Streptococcus mitis SK616]
gi|383347120|gb|EID25122.1| gram positive anchor [Streptococcus mitis SK616]
Length = 1697
Score = 359 bits (921), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 259/787 (32%), Positives = 395/787 (50%), Gaps = 107/787 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSQDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 199 ALEDGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLENVTDYHRGLDISE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV--SLDSLL--------D 182
A Y+ F RE FSS PD V VT +S +L F + SL L D
Sbjct: 259 AITTTSYTQDGTSFKRETFSSFPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGQYSRD 318
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N +Y G + G I K D+ G++F++ L IK G ++A +D L
Sbjct: 319 NSNYKKGTISVDSNG------ILLKGTVKDN--GLKFASYLGIKTD---GQVTA-QDGYL 366
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
V+G+ +A LLL A ++F NP ++ +KD E S +++ + Y L H+
Sbjct: 367 TVKGASYATLLLSAKTNFAQ---NPETNYRKDIDVEKTVKSIVEAAKAKDYETLKNDHIK 423
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + L S + T E ++++ + L EL FQ+GRY
Sbjct: 424 DYQSLFNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFFQYGRY 470
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E +P+ +
Sbjct: 471 LLISSSRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETAKPMIN 530
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 531 YIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 585
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTSPEH 524
AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS SPEH
Sbjct: 586 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYSPEH 644
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F + AA L+ ++D LV +V +L+P I
Sbjct: 645 ---------GNITIGNTFDQSLVWQLFHDYMEAANHLKIDQD-LVTEVKAKFDKLKPLHI 694
Query: 585 AEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
+DG I EW ++ F + E HHRH+SHL GLFPG T+ + P+ +AA TL R
Sbjct: 695 NQDGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFGKDQPEYLEAARATLNHR 753
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G+ G GWS K LWARL D A+R++ + NL+ H PFQ
Sbjct: 754 GDGGTGWSKANKINLWARLLDGNRAHRLLA-----------EQLRSSTLENLWDTHAPFQ 802
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK+ +L
Sbjct: 803 IDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWKEKNLE 861
Query: 759 EVGIYSN 765
+ SN
Sbjct: 862 TLSFLSN 868
>gi|225016842|ref|ZP_03706034.1| hypothetical protein CLOSTMETH_00754 [Clostridium methylpentosum
DSM 5476]
gi|224950383|gb|EEG31592.1| hypothetical protein CLOSTMETH_00754 [Clostridium methylpentosum
DSM 5476]
Length = 1957
Score = 358 bits (918), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 245/810 (30%), Positives = 413/810 (50%), Gaps = 92/810 (11%)
Query: 4 AESTSTTNPLKITFNGPAKH-----FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
AE++ N L++ + PA T+++PIGNG +G+ V+GGV E L LNE TLW+G
Sbjct: 37 AEASVNDNDLRLWYTSPAPDTYNGWMTNSLPIGNGYMGSNVFGGVGRERLSLNEKTLWSG 96
Query: 59 VPG---DYTNPDAP------KALSDVRSLVDSGQYAEATAASVKLFGHPAD-------VY 102
P DY + + + ++ G + A + +L G D Y
Sbjct: 97 GPAEGRDYNGGNLESRGKNGETMKQIQQAFAEGNTSLANSLCNQLTGLSDDGGTQGYGYY 156
Query: 103 QLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTK 162
G++ LEF A+ Y R+LD+ TA A V Y V + RE+F+S PD ++V +
Sbjct: 157 LSYGNMYLEFPGMSDGNAQN-YVRDLDMKTAIASVNYDYDGVNYNREYFTSYPDNMMVAR 215
Query: 163 ISGSESGSLSFNVSLDSLLDNHS------YVNGNNQIIMEGRCPGKRIPPKANANDDPKG 216
++ SE+G L+FN+S++ DN S N Q G I + +D+
Sbjct: 216 LTASEAGKLTFNLSVNP--DNTSGKGQGPNTNNGYQRTWIQTADGGLITIQGQLSDNQ-- 271
Query: 217 IQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG--PFINPSDSKKDP 274
++F++ + K+ + GT+ ED + V G+D V+L+ + +D P + +
Sbjct: 272 LKFAS--QTKVLNTGGTLVDNEDGTVSVTGADEVVILMTMGTDYDDNYPVYRTGQTDAEL 329
Query: 275 TSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPS 334
++ + + L Y L HL DYQ +F RV + L + I +P+
Sbjct: 330 LADIQGRIDAATELGYEGLLKSHLADYQGIFDRVHLDLGQE-------------ISQIPT 376
Query: 335 AERVKSFQTDED-PSLVE----LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
+ + +++ + P+L + LL+Q+GRYL I+SSR G+ +NLQG+W + W S
Sbjct: 377 NQLLTNYKNGSNTPALNQALEVLLYQYGRYLTIASSREGSLPSNLQGVWTGANNSPWHSD 436
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWV 440
H+N+NL+MNYW + N++EC PL +++ L G TA++ Y +G++
Sbjct: 437 YHMNVNLQMNYWPTYSTNMAECAIPLIEYVDALRAPGRVTAKI-YAGIESTEENPENGFM 495
Query: 441 IHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCAS 500
H + + + + W P W+ + WE+Y YT D D++++ YP+L+ A
Sbjct: 496 AHTQNNPYGWTCPGW-SFDWGWSPAATPWIIQNCWEYYEYTGDLDYMKENIYPMLKEEAR 554
Query: 501 FLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
LIE + G L +P+ SPEH + +T + ++I ++F+ I A +
Sbjct: 555 LYEQMLIEDPETGKLVCSPAYSPEH---------GPRTNGNTYEQSLIWQLFTDAIIAGK 605
Query: 560 VLEKNEDALVEKVLKSLPRLR-PTKIAEDGSIMEWAQDFKDPEV---HHRHLSHLFGLFP 615
++++++ A ++K + + L+ P +I + G I EW ++ + HRH+SHL GLFP
Sbjct: 606 LVDEDQ-ATLDKWQEIIDNLKGPIEIGDSGQIKEWYEETTLGSMGAKGHRHMSHLLGLFP 664
Query: 616 GHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVD 675
G I++E P+L +AA+ ++ RG++ GW++ + AR + AY ++K
Sbjct: 665 GDLISVE-TPELLEAAKISMDDRGDDSTGWAMGQRINSRARSGEGNRAYNIIKNYL---- 719
Query: 676 PEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSG 735
F+ G+Y+NL+ +H PFQID NFG+T+ V EML+QS + + LLPALP D WS+G
Sbjct: 720 ------FQKGIYNNLWDSHAPFQIDGNFGYTSGVTEMLMQSNMGYINLLPALP-DAWSAG 772
Query: 736 CVKGLKARGGETVSICWKDGDLHEVGIYSN 765
+ G+ ARG +S+ W+ L I SN
Sbjct: 773 HIDGIVARGNFEISMDWEKKALTTATIKSN 802
>gi|225019389|ref|ZP_03708581.1| hypothetical protein CLOSTMETH_03342 [Clostridium methylpentosum
DSM 5476]
gi|224947845|gb|EEG29054.1| hypothetical protein CLOSTMETH_03342 [Clostridium methylpentosum
DSM 5476]
Length = 1708
Score = 358 bits (918), Expect = 9e-96, Method: Compositional matrix adjust.
Identities = 232/708 (32%), Positives = 355/708 (50%), Gaps = 73/708 (10%)
Query: 96 GHPADVYQLLGDIELEFD-DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSN 154
G+ D QL EL FD S + Y+R LDL+ ATA+V+Y++ +V FTRE+F SN
Sbjct: 320 GNTTDGVQL---SELSFDLKSSTGPSYTNYKRTLDLDNATAKVEYTLDDVNFTREYFVSN 376
Query: 155 PDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDP 214
PD + +++ + G++S +S+ + + + I M G+ +R
Sbjct: 377 PDNFMAIRLTADQPGAISKAISITTPQSKKTITAEGDTITMTGQPADQR----------E 426
Query: 215 KGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKK 272
G++F+ +IK+ G+++A + + VEG+D +LL+ A +++ + D + +
Sbjct: 427 DGLKFAQ--QIKVVPQGGSMTA-ANGTITVEGADSVLLLMTAGTNYQQCMDDTFDYFTDE 483
Query: 273 DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
DP + ++ Y DL H+ DYQ LF+ + + L +P E+ D +
Sbjct: 484 DPLDAVSQRIATVAAKDYDDLLAAHVADYQSLFNNMKLNLCDAP-------MPEKPTDEL 536
Query: 333 PSAERVKSFQTD---EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSA 389
+A ++ + ED L L +QFGRYLLI+SSR G+ ANLQGIW + L+P WD+
Sbjct: 537 LAAYGGRTSNPNTALEDRYLETLYYQFGRYLLIASSRDGSLPANLQGIWADGLNPPWDAD 596
Query: 390 PHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS------GWVIHH 443
H NIN++MNYW + NL+EC P+ D++ L G TAQ + GW +H
Sbjct: 597 YHTNINVQMNYWLAESTNLTECHLPIVDYINSLVPRGEITAQRYHCTEDGGDVRGWTTYH 656
Query: 444 KTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
+ +IW ++ + +P GGAW+ +WE Y + D++FL + + L G A F +
Sbjct: 657 ENNIWGNTAPATSSAFY--FPAGGAWMTQDIWEIYAFNQDKEFLAEN-FDTLLGAALFWV 713
Query: 504 DWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
D L+ + DG L ++PS SPEH S + D II + F I AAE L
Sbjct: 714 DNLVTDTRDGTLVSSPSYSPEH---------GPYSLGAACDQGIIWDTFQNTIEAAEALG 764
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK---DPEVHHRHLSHLFGLFPGHTI 619
+ + E + ++ +L +I G MEW + + HRH++ LF L PG +
Sbjct: 765 IDTPEIAE-IREAQSKLAGPQIGLAGQFMEWKDEITMDITGDGGHRHVNQLFALHPGRQV 823
Query: 620 TIEKNPD---LCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
++ + +A + TL RG+ G GWS WK WARL D +HA MV ++
Sbjct: 824 VANRSAEDDAFVEAMKVTLNTRGDGGTGWSKAWKINFWARLRDGDHAQTMVNQI------ 877
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ Y NLF HPPFQID NFG TA + EML+QS + + LL ALP W G
Sbjct: 878 -----LKESTYGNLFDTHPPFQIDGNFGATAGMTEMLLQSQGDSIDLLAALP-QAWDHGD 931
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSV 784
V GLKARG V + W L + SN + L RGT++
Sbjct: 932 VTGLKARGNVEVDMEWSHATLTGATLRPGTSN------EALKVRGTNI 973
Score = 54.7 bits (130), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 31/83 (37%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
Query: 6 STSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT 64
++ + L+ + PA + +A P+GNG LGAMV+GGV S+ +++NE +LW+G PG
Sbjct: 35 ASDSATKLQAFYTKPATDWEKEATPLGNGFLGAMVFGGVESDRIQINEHSLWSGGPGANE 94
Query: 65 NPDAPKALSDVRSLVDSGQYAEA 87
N D +SD + V+ EA
Sbjct: 95 NYDG--GMSDTPAEVNRQNLMEA 115
>gi|365119726|ref|ZP_09337619.1| hypothetical protein HMPREF1033_00965 [Tannerella sp.
6_1_58FAA_CT1]
gi|363648290|gb|EHL87470.1| hypothetical protein HMPREF1033_00965 [Tannerella sp.
6_1_58FAA_CT1]
Length = 1009
Score = 357 bits (917), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 237/679 (34%), Positives = 349/679 (51%), Gaps = 53/679 (7%)
Query: 105 LGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI 163
L DIELE++ + + Y R LD++ A V Y FTRE F S PD V+V ++
Sbjct: 317 LSDIELEYEQLYEPLEPYSDYVRMLDIDNAVHSVIYKENGTTFTRECFMSYPDNVMVMRL 376
Query: 164 SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
+ G +S + S N + M G+ P N G++F+
Sbjct: 377 KADKGGCISRTFGITSPQPKKRIFASGNTLTMTGQ------PALHKEN----GLKFAQ-- 424
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSD--SKKDPTSESMSA 281
++K+ + G + +++KK++V+ +D +LL+ A++++ D S +DP +
Sbjct: 425 QVKVLNKGGYLEVIDNKKIRVKDADEVILLMSAATNYQQSMDEKFDYFSDEDPLTTVKRT 484
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
L + + +Y DL + H DY+ L+ R+S+ L T + + K
Sbjct: 485 LMAAESKTYEDLLSSHKKDYKALYDRMSLNLGNI-------TGMSTKTTDILLKDFYKGN 537
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+E+ L +QFGRYLLI+SSR + ANLQG+W E LS W++ H NIN++MNYW
Sbjct: 538 TVEENLYTEMLYYQFGRYLLIASSRENSLPANLQGVWGERLSNPWNADYHTNINVQMNYW 597
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL------ASGWVIHHKTDIWAKSSADR 455
+ NLS C PL ++ L G TA+ Y GWV HH+ +IW ++
Sbjct: 598 PAQQTNLSPCHIPLISYINSLVPRGKITARHYYCKPDGGDVRGWVTHHENNIWGNTAP-- 655
Query: 456 GKVVWAL-WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGY 513
G A +P G AW+C +WE+Y + D+ FLE+ Y L G A F +D L + DG
Sbjct: 656 GTSYGAFHFPAGAAWMCQDIWEYYQFNCDKKFLEQN-YNTLLGAALFWVDNLWTDERDGT 714
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KV 572
L NPS SPEH + L C ST+ A+I E+F +I A+E L K+ + E K
Sbjct: 715 LVANPSHSPEH----GEYSLGC----STV-QAMIAEIFDIVIKASEDLGKDTKEVAEIKA 765
Query: 573 LKSLPRLRPTKIAEDGSIMEWAQDF-KD--PEVHHRHLSHLFGLFPGHTITIEKN---PD 626
KS +L +I G MEW + KD + HRH++HLF L PG I ++
Sbjct: 766 AKS--KLAGPQIGLGGQFMEWKDEVTKDITGDGQHRHVNHLFWLHPGSQIVAGRSVQEDK 823
Query: 627 LCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGL 686
+A +KTL+ RG+ G GWS WK WARL D A++++K L + + GG+
Sbjct: 824 YVEAMKKTLETRGDGGTGWSKAWKINFWARLRDGNRAHKLLKEALTLTYTGNPANI-GGV 882
Query: 687 YSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGE 746
Y NLF HPPFQID NFG T+ +AEML+QS + LLPA+P D W++G +GLKARG
Sbjct: 883 YQNLFDTHPPFQIDGNFGATSGIAEMLLQSQGGYIELLPAIP-DDWANGTFEGLKARGNF 941
Query: 747 TVSICWKDGDLHEVGIYSN 765
+ WK+G L + SN
Sbjct: 942 EIDAEWKNGVLVTAELTSN 960
Score = 56.6 bits (135), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 26/56 (46%), Positives = 42/56 (75%), Gaps = 3/56 (5%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
+K +N PAK + ++A+PIGNG +GAM++G V + +++NE +LW+G PG+ NPD
Sbjct: 40 MKAVYNKPAKVWESEALPIGNGYMGAMIFGDVYRDVIQVNEHSLWSGGPGE--NPD 93
>gi|419765946|ref|ZP_14292168.1| Gram-positive signal peptide protein, YSIRK family / gram positive
anchor multi-domain protein [Streptococcus mitis SK579]
gi|383354600|gb|EID32158.1| Gram-positive signal peptide protein, YSIRK family / gram positive
anchor multi-domain protein [Streptococcus mitis SK579]
Length = 1662
Score = 357 bits (916), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 257/787 (32%), Positives = 392/787 (49%), Gaps = 107/787 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSQDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 199 ALEDGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLESVTDYHRGLDISE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV--SLDSLL--------D 182
A Y+ F RE FSS PD V VT +S +L F + SL L D
Sbjct: 259 AITTTSYTQDGTSFKRETFSSFPDDVTVTHLSKKGDKNLDFTLWNSLTEDLIANGQYSRD 318
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N +Y G + G I K D+ G++F++ L IK G ++A +D L
Sbjct: 319 NSNYKKGTISVDSNG------ILLKGTVKDN--GLKFASYLGIKTD---GQVTA-QDGYL 366
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK---DPTSESMSALQSIRNLSYSDLYTRHLD 299
V+G+ +A LLL A ++F NP + + D S +++ + Y L H+
Sbjct: 367 TVKGASYATLLLSAKTNFAQ---NPETNYRKDIDVGKTVKSIVEAAKAKDYETLKNDHIK 423
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + L S + T E ++++ + L EL FQ+GRY
Sbjct: 424 DYQSLFNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFFQYGRY 470
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E +P+ +
Sbjct: 471 LLISSSRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETAKPMIN 530
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 531 YIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPG-WNYYWGWSPAA 585
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTSPEH 524
AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS SPEH
Sbjct: 586 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYSPEH 644
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F + AA L+ ++D LV +V +L+P I
Sbjct: 645 ---------GNITIGNTFDQSLVWQLFHDYMEAANHLKIDQD-LVTEVKAKFDKLKPLHI 694
Query: 585 AEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
+DG I EW ++ F + E HHRH+SHL GLFPG T+ + P+ +AA TL R
Sbjct: 695 NQDGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFGKDQPEYLEAARATLNHR 753
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G+ G GWS K LWARL D A+R++ + NL+ H PFQ
Sbjct: 754 GDGGTGWSKANKINLWARLLDGNRAHRLLA-----------EQLRSSTLENLWDTHAPFQ 802
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK+ +L
Sbjct: 803 IDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWKEKNLE 861
Query: 759 EVGIYSN 765
+ SN
Sbjct: 862 TLSFLSN 868
>gi|421276774|ref|ZP_15727594.1| alpha-L-fucosidase, putative, afc95A [Streptococcus mitis SPAR10]
gi|395876055|gb|EJG87131.1| alpha-L-fucosidase, putative, afc95A [Streptococcus mitis SPAR10]
Length = 922
Score = 357 bits (916), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 259/806 (32%), Positives = 402/806 (49%), Gaps = 120/806 (14%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GD 62
P +++G K A+P+GNG +GA V+G + E ++ NE TLW+G P G+
Sbjct: 125 PTAPSYDGWEKQ---ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGN 181
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLK 118
Y D K LS++R ++ G +A + + P + Y GDI + F++
Sbjct: 182 YQ--DRYKVLSEIRKALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKG 239
Query: 119 YAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---N 174
T Y R LD++ A + Y+ F RE FSS PD V VT +S +L F N
Sbjct: 240 LENVTDYHRGLDISEAISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWN 299
Query: 175 VSLDSLLDNHSY------------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAI 222
+ L+ N Y +N I+++G K N G++F++
Sbjct: 300 SLTEDLIANGDYSWEYSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASY 346
Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM-- 279
L IK G ++A +D L V G+ +A LLL A ++F NP ++ +KD E
Sbjct: 347 LGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLEKTVK 399
Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
S +++ + Y L H+ DYQ LF+RV + L S + T E +
Sbjct: 400 SIVEASKAKDYETLKNNHIKDYQSLFNRVQLNLGGSRSNQTT-------------KEALH 446
Query: 340 SFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLE 397
++ ++ L EL FQ+GRYLLISSSR T ANLQG+WN +PTW+S H+N+NL+
Sbjct: 447 TYNPEKGQKLEELFFQYGRYLLISSSRDRTDALPANLQGVWNAVDNPTWNSDYHLNVNLQ 506
Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTD 446
MNYW + NL+E +P+ +++ + G SK Q N GW++H +
Sbjct: 507 MNYWPAYMNNLAETAKPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQAT 562
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ W P AW+ +++++Y +T D +L+++ YP+L+ A F +L
Sbjct: 563 PFGWTTPG-WNYYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFL 621
Query: 507 --IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKN 564
+ D ++ ++PS SPEH ++ +T D +++ ++F + AA L +
Sbjct: 622 HYDKASDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLNVD 671
Query: 565 EDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHT 618
+D LV +V +L+P I +DG I EW ++ F + E +HRH+SHL GLFPG T
Sbjct: 672 QD-LVTEVKAKFDKLKPLHINQDGRIKEWYEEDSPQFTNEGIENYHRHVSHLVGLFPG-T 729
Query: 619 ITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEH 678
+ + +P+ +AA TL RG+ G GWS K LWARL D A+R++
Sbjct: 730 LFSKDHPEYLEAARATLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLA---------- 779
Query: 679 EKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVK 738
+ + NL+ H PFQID NFG T+ +AEML+QS + LPALP D W G +
Sbjct: 780 -EQLKSSTLENLWDTHAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQIS 837
Query: 739 GLKARGGETVSICWKDGDLHEVGIYS 764
GL ARG VS+ WK+ +L + S
Sbjct: 838 GLVARGNFEVSMKWKEKNLESLAFLS 863
>gi|322377414|ref|ZP_08051905.1| fibronectin type III domain protein [Streptococcus sp. M334]
gi|321281614|gb|EFX58623.1| fibronectin type III domain protein [Streptococcus sp. M334]
Length = 803
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 260/818 (31%), Positives = 397/818 (48%), Gaps = 86/818 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEF-DDSHLK 118
D L+++R ++ Y A + + P +Y GDI +EF +
Sbjct: 72 NLQDQYVFLAEIRQDLEKRDYNRAKELAEQHLVGPKTSQYGIYLSFGDIHIEFSNQGKTL 131
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y Y+R+L+++ A A Y F RE F+S PD ++V + + S +L F + L
Sbjct: 132 YQVTDYQRQLNISKALATTSYVYKGTRFEREVFASFPDDLLVQRFTKEGSETLDFTMDLS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRC----PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + + C I K D+ +QF++ L K G I
Sbjct: 192 LTRDLASDGKYEQEKLDYKECQLDISTSHILMKGRVKDND--LQFASCLAWKTD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
DK +++ G+ +A L LVA + F + K D + +++ + Y+ L
Sbjct: 247 RVWSDK-VQISGASYANLFLVAKTDFAQNPASNYRKKIDLEQQVKDLVETAKEEGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L N D + + +K++++ E L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDLG-------------ANGDISTTDDLLKNYKSQEGQDLEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW S NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPSYVTNLLETA 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A Y +GW++H + W D W
Sbjct: 413 FPVINYIDDLRVYG-RLAAAKYAGIISREGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F D+L + ++PS S
Sbjct: 469 SPASNAWMMQTVYEVYSFYRDQDYLREKIYPMLSETVRFWNDFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L + D L E V + L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDADLLTE-VKEKFDLLNP 578
Query: 582 TKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW ++ F++ +V HRH SHL GL+PG+ + K D +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFS-HKGQDYLEAARASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A++++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS L L ALP D WSSG V GL ARG VS+ W+D
Sbjct: 687 PFQIDGNFGATSGMAEMLLQSHTAYLVPLAALP-DAWSSGSVSGLMARGHFEVSMRWEDK 745
Query: 756 DLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKI 793
L ++ I S + S+ L + ++VN K+
Sbjct: 746 KLLQMTILSRSGGDLSVSY--LGIEKSVIEVNQEKAKV 781
>gi|331092304|ref|ZP_08341132.1| hypothetical protein HMPREF9477_01775 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401736|gb|EGG81315.1| hypothetical protein HMPREF9477_01775 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1730
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 239/771 (30%), Positives = 375/771 (48%), Gaps = 78/771 (10%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS- 77
+PIGN +GA V+G + E L N+ TLW G P G+ D K +SDV
Sbjct: 76 LPIGNSFMGANVYGEIGKERLTFNQKTLWNGGPSTSRPNYKGGNKDTADNGKKMSDVYKE 135
Query: 78 ---LVDSGQYAEATAASVKLFGHPAD--VYQLLGDI--ELEFDDSHLKYAEETYRRELDL 130
L G+ A+A + KL G A YQ GDI + +FD+S K Y R+L++
Sbjct: 136 IIELYKKGEDAKANELAKKLTGEVAGYGAYQSWGDIYVDFKFDESQAK----NYVRDLNM 191
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
A A V + N + RE+F S PD V+ K + + L+ ++S +DN V G
Sbjct: 192 ENAVASVDFDYKNTKMHREYFVSYPDNVLAMKFTADGNEKLNLDISFP--IDNAEGVTG- 248
Query: 191 NQIIMEGRCPGKRIPPKANAN-----DDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ GK + N + + Q ++K+ + GT+ A + KL V
Sbjct: 249 -------KKLGKNVQTTVKDNTITVAGEMQDNQLKLNGKLKVETENGTVEAKDGDKLHVA 301
Query: 246 GSDWAVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ + + A + + D P ++K+ + Y + H+ DY +
Sbjct: 302 NASEVTVYVSADTDYKNDYPKYRTGETKEQLNDSVQKTIDKASKKGYEKVKEDHIADYTE 361
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
+F RV + L +S + D + + + K ED +L +LFQ+GRYL I+
Sbjct: 362 IFDRVDLDLGQS--------VPTKTTDVLLNDYKAKKNTAAEDRALEVMLFQYGRYLTIA 413
Query: 364 SSRPGTQVANLQGIWNEDLSP----TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
SSR G +NLQG+W + W S H+N+NL+MNYW + N++EC PL D++
Sbjct: 414 SSRAGDLPSNLQGVWQNRVGDHNRVPWASDYHMNVNLQMNYWPTYSTNMAECATPLVDYI 473
Query: 420 TYLSINGSKTAQVNY-LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L G TA+ + + +G H + + W P W+ + WE+Y
Sbjct: 474 NSLVEPGKVTAKTYFGVENGGFTAHTQNTPFGWTCPGWNFSWGWSPAALPWILQNCWEYY 533
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVS 537
YT D ++E+ YP+L+ A LIE G L + P+ SPEH V+
Sbjct: 534 EYTGDVKYMEEHIYPMLKEAALLYDQILIEDTKTGRLVSAPAYSPEH---------GPVT 584
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
+T + ++I +++ +AAE+L ++D + + +L+P +I + G I EW +
Sbjct: 585 AGNTYEQSLIWQLYEDAATAAEILNVDKDKAAQ-WRERQAKLKPIEIGDSGQIKEWYTET 643
Query: 598 ---KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALW 654
+ HRH+SHL GLFPG I+++ NP+ AA +L++RGE+ GW + + W
Sbjct: 644 TLGSMGQKGHRHMSHLLGLFPGDLISVD-NPEFMDAAIVSLKERGEKSTGWGMGQRINAW 702
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
AR D A+++++ LFN G+Y NL+ H PFQID NFG T+ V+EML+
Sbjct: 703 ARTGDGNQAHKLIQNLFN-----------DGIYPNLWDTHTPFQIDGNFGMTSGVSEMLL 751
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
QS + + +LP+LP D W++G VKGL ARG VS+ W D ++ E I SN
Sbjct: 752 QSNMGYINMLPSLP-DVWANGSVKGLVARGNFEVSMKWADKNVTEATILSN 801
>gi|417935794|ref|ZP_12579111.1| gram positive anchor [Streptococcus infantis X]
gi|343402703|gb|EGV15208.1| gram positive anchor [Streptococcus infantis X]
Length = 1764
Score = 356 bits (913), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 257/792 (32%), Positives = 397/792 (50%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 153 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--DRYKVLAEIRK 210
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 211 ALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKGLESVTDYHRGLDISE 270
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSY--- 186
A + Y+ F RE FSS PD V VT +S +L F N + L+ N Y
Sbjct: 271 AISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGDYSWE 330
Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+N I+++G K N G++F++ L IK G ++A
Sbjct: 331 YSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASYLGIKTD---GQVTA- 373
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D L V G+ +A LLL A ++F NP ++ +KD E+ S +++ + Y L
Sbjct: 374 QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLENTVKSIVEAAKAKDYETLK 430
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S + T E ++++ + L EL F
Sbjct: 431 NDHIKDYQSLFNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFF 477
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E
Sbjct: 478 QYGRYLLISSSRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETA 537
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 538 KPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWG 592
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 593 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 651
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + AA L+ ++D LV +V +L
Sbjct: 652 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLKIDQD-LVTEVKAKFNKL 701
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I +DG I EW ++ F + E HHRH+SHL GLFPG T+ + P+ +AA
Sbjct: 702 KPLHINQDGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFGKDQPEYLEAARA 760
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ + + NL+
Sbjct: 761 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLA-----------EQLKSSTLENLWDT 809
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 810 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 868
Query: 754 DGDLHEVGIYSN 765
+ +L + SN
Sbjct: 869 EKNLETLSFISN 880
>gi|330996466|ref|ZP_08320348.1| hypothetical protein HMPREF9442_01433 [Paraprevotella xylaniphila
YIT 11841]
gi|329573022|gb|EGG54641.1| hypothetical protein HMPREF9442_01433 [Paraprevotella xylaniphila
YIT 11841]
Length = 798
Score = 356 bits (913), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 238/765 (31%), Positives = 384/765 (50%), Gaps = 52/765 (6%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTG-VPGDYTNPDAPKALSDV 75
++ PA + ++P+GNGR+GAMV+GGV ET+ LNE ++W G + P L +
Sbjct: 29 YDAPADEWMKSLPVGNGRVGAMVFGGVDEETVALNESSMWAGEYDPNQEKPFGRARLDSL 88
Query: 76 RSLVDSGQYAEATA-ASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R L +G+ E A +L G H + +GD++++FD + + E YRRELDL
Sbjct: 89 RELFFAGKLIEGNGIAGRELVGTPHSFGTHLPIGDLKIKFDYAGKEGGVEDYRRELDLTN 148
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A V + G ++ RE+ SSNP +V + + S+SF++ + + GN
Sbjct: 149 AVATVSFKKGGTKYKREYISSNPQDAVVMHFTADKKQSVSFDMRMKMITAAQVRTEGNLL 208
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
+ G+ + PK G++F + +K+ D G + A + ++V+ +D +
Sbjct: 209 VF-----DGQALFPKLGTG----GVKFQGRVVVKV--DNGEVEA-AGETVRVKHAD--AV 254
Query: 253 LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+VA D + + E+++ + + H+ DY LF RVS++L
Sbjct: 255 TIVADVRTDYKNGQYASLCEKTVGEAIAR-------PFETMKEEHVADYAPLFARVSLKL 307
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
+ K +VP R K+ + ++D L L FQ+GRYL I+SSR + +
Sbjct: 308 ADDSKK------------SVPVDRRWKALCEGNKDAGLQALFFQYGRYLTIASSRENSPL 355
Query: 372 -ANLQGIWNEDLSP--TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
LQG +N++L+ W S H++IN E NYW + NL+EC PLF ++ L+ +G+K
Sbjct: 356 PIALQGFFNDNLACNMCWTSDYHLDINTEQNYWLANVGNLAECNAPLFTYIADLARHGAK 415
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
T + Y GW H ++W ++ G + W L+P+ G+W+ THLW Y YT+D+D+L
Sbjct: 416 TVRTVYGCKGWTAHTVANVWGFTAPSEG-MGWGLFPLAGSWMATHLWTQYEYTLDKDYLR 474
Query: 489 KRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ AYPLL+G A FLLD+++E + GY+ T P SPE+ F +L S +T D +
Sbjct: 475 RTAYPLLKGNAEFLLDYMVEDPNTGYMVTGPCVSPENSFRYQGWELG-ASMMTTCDRVLA 533
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
E+ SA + A+++L ++D + + +L + P ++ G + EW +D+++ +HRH
Sbjct: 534 HEIMSACVQASDILGVDKD-FADSLRLALAKFPPFRVNSYGGLCEWYEDYEEAHPNHRHT 592
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKR----GEEGPGWSITWKTALWARLHDQEHA 663
SHL +P IT K+P+L +A T++ R G E WS +ARL D A
Sbjct: 593 SHLLAYYPYSQITNGKDPELTEAVRTTIEHRLAAEGWEDTEWSRANMVCFYARLKDAAKA 652
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
+ L L D E A F D N A +AEMLVQ+ + +
Sbjct: 653 EESLNIL--LTDFARENLLTISPEGIAGAPFDVFIFDGNAAGAAGLAEMLVQAHEGYVEI 710
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSN 768
LP LP +W G GL +GG VS WKD + + + + N
Sbjct: 711 LPCLP-TEWKDGSFSGLCVKGGAEVSAEWKDSRVVKASLKATADN 754
>gi|302523529|ref|ZP_07275871.1| fibronectin type III domain-containing protein [Streptomyces sp.
SPB78]
gi|302432424|gb|EFL04240.1| fibronectin type III domain-containing protein [Streptomyces sp.
SPB78]
Length = 661
Score = 355 bits (911), Expect = 6e-95, Method: Compositional matrix adjust.
Identities = 231/699 (33%), Positives = 345/699 (49%), Gaps = 62/699 (8%)
Query: 102 YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVT 161
+Q GD+ ++ D + + E Y R LDL A A V Y F R F+S PD+V+V
Sbjct: 20 HQTFGDLLIDVDGA--PGSAEGYTRTLDLAQALATVSYPHDGATFHRTVFTSCPDKVLVG 77
Query: 162 KISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSA 221
+ GS+ N+ S + + +++ + G G++F A
Sbjct: 78 HFTADRGGSVGLNLRYTSPRQDFTATTDGDRLTVRGAL-------------QDNGMRFEA 124
Query: 222 ILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA 281
+I++ + GT++A D+ L V G+D A +L A + + + P DP +A
Sbjct: 125 --QIRLLSEGGTVTANGDR-LAVSGADSAWFVLSAGTDYADTY--PDYRGADPHDRVATA 179
Query: 282 LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSF 341
+ Y +L RH D+ LF RV + L + D+ + D + A S
Sbjct: 180 VDQAAARPYRELLDRHTSDHAALFSRVVLDLGQ-------DSAPDRTTDALLKAYTGGS- 231
Query: 342 QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+ +D +L L FQ+GRYLLI+SSR G+ ANLQG WN +P W + HVNINL+MNYW
Sbjct: 232 -SADDRALEALFFQYGRYLLIASSRAGSLPANLQGAWNNSTAPPWSADYHVNINLQMNYW 290
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA-DRGKVVW 460
+ NL+E P F+ L G TA+ + A GWV+H +T + + D W
Sbjct: 291 PAEATNLAETTAPYDRFVEALRAPGRTTARSMFDARGWVVHDETTPFGFTGVHDWPTSFW 350
Query: 461 ALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPS 519
+P AWL + L+EHY + D+L AYP ++ A F +D L + D L PS
Sbjct: 351 --FPEAAAWLTSQLYEHYRFDGSTDYLRATAYPAMKEAAEFWIDVLRTDPRDNTLVVTPS 408
Query: 520 TSPEH-EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPR 578
SPEH +F A + M I+RE+F + AA+ L ++ A + ++L R
Sbjct: 409 FSPEHGDFTA----------GAAMSQQIVRELFLNTLEAAQTL-GDDPAFRATLKETLDR 457
Query: 579 LRPT-KIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQK 637
+ P +I G +MEW D HRH+SHL+ L PG IE D +AA+ +L
Sbjct: 458 IDPGLRIGSWGQLMEWKTDLDGRTDDHRHVSHLYALHPGR--QIEPGSDFAEAAKVSLTA 515
Query: 638 RGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
RG+ G GWS WK WARL D +HA+ M+ + +G +NL+ HPPF
Sbjct: 516 RGDGGTGWSKAWKINFWARLRDGDHAHTMLA-----------EQLKGSTLANLWDTHPPF 564
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID NFG T+ + EML+QS + + +LPALP WSSG V+GL+ARGG T+ W++G
Sbjct: 565 QIDGNFGATSGITEMLLQSQHDVIEVLPALP-AAWSSGTVRGLRARGGATLEFSWENGRA 623
Query: 758 HEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTF 796
+ + + S + + G + AG+ YT+
Sbjct: 624 TRIALTA--SRTRELTVRNALVPGGTTTFKAVAGETYTW 660
>gi|257070006|ref|YP_003156261.1| hypothetical protein Bfae_29100 [Brachybacterium faecium DSM 4810]
gi|256560824|gb|ACU86671.1| hypothetical protein Bfae_29100 [Brachybacterium faecium DSM 4810]
Length = 762
Score = 355 bits (910), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 249/761 (32%), Positives = 363/761 (47%), Gaps = 67/761 (8%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDA 68
T P + +GPA+ + +A+P+GNGRLGAM WG LNE TLW+G PG
Sbjct: 14 VTPPPALLRHGPAERWLEALPLGNGRLGAMAWGDPGRARFSLNESTLWSGAPGVDLPHRT 73
Query: 69 PK-----ALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
P+ AL R+L SG EA +L + Y +GD+ + D +
Sbjct: 74 PRAEAAAALERSRALFTSGAVQEAQEEIERLGASWSQAYLPVGDLTVRLDGDAGPEGGDG 133
Query: 124 YRRELDLNTATARVKYSVGNVEFTREH--FSSNPDQVIVTKISGSESGS--LSFNVSLDS 179
RRELDL RV + G EH F S D+V+V + E L + L
Sbjct: 134 -RRELDLQHGEHRVLAADG------EHLSFVSAADEVLVHCLPCPEGARAVLELDSPLVE 186
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+G+ + + R P +D P G QF +I + + +A+
Sbjct: 187 EQREEQPADGDAALTIVLRAP----------SDVPGG-QFRQQEQIAWESEGASRAAVVV 235
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ + G V +V +++ G P + + E+ + ++ +L+ RH D
Sbjct: 236 RTRREAGRLLVVCAIV--TTWQGLGRTPDRAVAEAVQEATAQAETALARGAEELHRRHRD 293
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
+ V +QL+ S + + TC F +GRY
Sbjct: 294 RPRPGADAVGLQLTGSEEAELLATC-----------------------------FAYGRY 324
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LL S+SRPG ANLQG+WN L W S VNINLEMN+W + + E L ++
Sbjct: 325 LLASASRPGLPPANLQGLWNAKLEAPWSSNYTVNINLEMNHWGAAIAQVPEAAGALEQYV 384
Query: 420 TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
L G TA+ Y A GW +HH +D W + RG+ WA WPMGG WL L + +
Sbjct: 385 EMLREQGRDTARRLYGADGWTVHHNSDPWGYTDPVRGEPSWATWPMGGLWL-EQLLDTFA 443
Query: 480 YTMDRDFLE--KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVS 537
D E + +P L +F L L E DG+L T PSTSPE+ + DG + C+S
Sbjct: 444 ACSGSDPAEVARDRFPALREAVAFALGLLHESADGHLATFPSTSPENRWRTADGTVVCLS 503
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD- 596
+ MD ++RE ++ AA VL + +D +V++ +L + ++ DG I+EW +D
Sbjct: 504 EGTGMDRWLLRETAQHLVEAAAVLGREDDPVVQQAASALDLVPGPRVGADGRILEWHRDG 563
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
+ E HRH+SHL L+P + P +AA ++L+ RG+E GWS+ WK LWAR
Sbjct: 564 LTEAEPDHRHVSHLGFLYPS---GLPAEPRHEQAAARSLEARGDEATGWSLVWKVCLWAR 620
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
LH + +++ L+ + GLY NLF+AHPPFQID N G AA+AE LVQS
Sbjct: 621 LHRPDRVQSLLE-LYLRPAEAPDGTARSGLYPNLFSAHPPFQIDGNLGIVAALAECLVQS 679
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
+L LLPALP + G ++GL+AR G + + W DG L
Sbjct: 680 HRGELELLPALP-PMMADGALRGLRARPGIEMDMTWNDGTL 719
>gi|385260919|ref|ZP_10039057.1| gram positive anchor [Streptococcus sp. SK140]
gi|385190192|gb|EIF37641.1| gram positive anchor [Streptococcus sp. SK140]
Length = 1717
Score = 355 bits (910), Expect = 8e-95, Method: Compositional matrix adjust.
Identities = 255/782 (32%), Positives = 393/782 (50%), Gaps = 97/782 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSQDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 199 ALEDGDRQKAKQLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLENVTDYHRGLDISE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSYVNG 189
A Y+ F RE FSS PD V VT ++ +L F N + L+ N Y +
Sbjct: 259 AITTTSYTQDGTSFKRETFSSYPDDVTVTHLTKKGDKTLDFTLWNSLTEDLIANGDY-SW 317
Query: 190 NNQIIMEGRCP--GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGS 247
N +G I K D+ G++F++ L IK G ++A +D L V G+
Sbjct: 318 ENSKYKQGTVSVDSNGILLKGTVKDN--GLKFASYLGIKTD---GQVTA-QDGYLTVTGA 371
Query: 248 DWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLDDYQKL 304
+A LLL A ++F NP ++ +KD E S +++ + Y L H+ DYQ L
Sbjct: 372 SYATLLLSAKTNF---AQNPKTNYRKDIDVEKTVKSIVEAAKAKDYETLKNDHIKDYQSL 428
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
F+RV + L S + T E ++++ + L EL FQ+GRYLLISS
Sbjct: 429 FNRVQLNLGGSKSNQTT-------------KEALQTYNPTKGQKLEELFFQYGRYLLISS 475
Query: 365 SRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E +P+ +++ +
Sbjct: 476 SRNRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMSNLAETAKPMINYIDDM 535
Query: 423 SING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
G SK Q N GW++H + + ++ W P AW+
Sbjct: 536 RYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAANAWMM 590
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTSPEHEFIAP 529
+++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS SPEH
Sbjct: 591 QNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYSPEH----- 644
Query: 530 DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
++ +T D +++ ++F + AA L+ +++ LV +V +L+P I +DG
Sbjct: 645 ----GTITIGNTFDQSLVWQLFHDYMEAANHLKVDQN-LVTEVKAKFDKLKPLHINQDGR 699
Query: 590 IMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP 643
I EW ++ F + E HHRH+SHL GLFPG T+ + P+ +AA TL RG+ G
Sbjct: 700 IKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFGKDQPEYLEAARATLNHRGDGGT 758
Query: 644 GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANF 703
GWS K LWARL D A+R++ + NL+ H PFQID NF
Sbjct: 759 GWSKANKINLWARLLDGNRAHRLLA-----------EQLRSSTLENLWDTHAPFQIDGNF 807
Query: 704 GFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIY 763
G T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK+ +L +
Sbjct: 808 GATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWKEKNLETLSFL 866
Query: 764 SN 765
SN
Sbjct: 867 SN 868
>gi|419527991|ref|ZP_14067534.1| hypothetical protein SPAR51_0989 [Streptococcus pneumoniae GA17719]
gi|379566144|gb|EHZ31135.1| hypothetical protein SPAR51_0989 [Streptococcus pneumoniae GA17719]
Length = 803
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 254/789 (32%), Positives = 387/789 (49%), Gaps = 84/789 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTNP 66
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 67 DAPKA---LSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
+ L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQNQHNFLAEIRQALEKRDYNRAKELAEQHLVGPKTSQYGTYLSFGDIFIEFSQQGTIL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A A Y+ F RE F+S PD ++V + + S +L F + L
Sbjct: 132 SQVTDYQRQLNVSKALATTSYAYKGTRFEREAFASFPDDLLVQRFTKEGSETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTRDLASDGKYEQEKTDYKECQLDITASHILMKGWVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
DK +++ G+ +A L L A + F + K D + + +++ + Y+ L
Sbjct: 247 RVWSDK-VQISGASYANLFLAAKTDFAQNPASNYRKKIDLEQQVKNLVETAKEKGYARLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L ++DT + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDLG-------------SDVDTSTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQGIWN +P W+S H+NINL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGIWNGVDNPPWNSDYHLNINLQMNYWPAYVTNLLETA 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A Y+ +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAATRYVGIVSREGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y + D+D+L ++ YP+L F +L E + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYLFYRDQDYLREKIYPILRETVRFWNAFLHEDNQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ LE + D L E V + L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELELDADLLTE-VKEKFDLLNP 578
Query: 582 TKIAEDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW Q F++ +V HRH SHL GL+PG+ + K D +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQHFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQDYLEAASASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A++++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKSSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS L L ALP D WSSG V GL ARG VS+ W D
Sbjct: 687 PFQIDGNFGATSGMAEMLLQSHTAYLVPLAALP-DAWSSGSVSGLMARGHFEVSMSWADK 745
Query: 756 DLHEVGIYS 764
L ++ I S
Sbjct: 746 KLLQLTILS 754
>gi|225016900|ref|ZP_03706092.1| hypothetical protein CLOSTMETH_00813 [Clostridium methylpentosum
DSM 5476]
gi|224950294|gb|EEG31503.1| hypothetical protein CLOSTMETH_00813 [Clostridium methylpentosum
DSM 5476]
Length = 1565
Score = 353 bits (906), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 261/851 (30%), Positives = 406/851 (47%), Gaps = 137/851 (16%)
Query: 6 STSTTNPLKITFNGPAKHFTDA-------IPIGNGRLGAMVWGGVPSETLKLNEDTLWTG 58
S TNPL++ + PA TD+ +P+GNG +G MV+GG+ E + NE ++WTG
Sbjct: 38 SVRNTNPLRLWYTKPAPVNTDSKQWQYTVLPLGNGYMGGMVFGGISKERVHFNEKSMWTG 97
Query: 59 VPG---------DYTNPDAPKALSDVRSLVDSGQY----AEATAASVKLF----GHPAD- 100
P + T P + L + R+ +D ++A + KL G D
Sbjct: 98 GPSASRPNHNGSNRTEPVTTEWLDEFRAELDDKTNDVWGLSSSAGNNKLLDLIRGPKRDN 157
Query: 101 ------VYQLLGDIELEFDDSHLKY-AEETYRRELDLNTATARVKYSVGNVEFTREHFSS 153
+YQ GDI ++F + + E Y R+LDL TA + V Y +G V +TRE+F+S
Sbjct: 158 WDNGMGMYQDFGDIYMDFARAGITDDMAENYVRDLDLTTALSTVSYDIGGVHYTREYFNS 217
Query: 154 NPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDD 213
PD V+ +++ SE+G L+F+ S+ S + N + EG R + N
Sbjct: 218 YPDNVLAMRLNASEAGKLTFDASITPA---SSTSSTNRTVTAEGDIITLRGQIRDNQ--- 271
Query: 214 PKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD 273
+Q+ A ++K+ ++ GT+ A ED + ++G+D L+L + + + P +D
Sbjct: 272 ---LQYEA--QLKVLNEGGTLKANEDGTISIDGADSVTLILACGTDYKNEW--PKYRGED 324
Query: 274 PTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
P + + + + + LY HL+DYQ+LF RV + L E + +P
Sbjct: 325 PHEAISARIDNAADKGFDALYQTHLEDYQELFSRVDLDLG-------------EELPNIP 371
Query: 334 SAERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWN-EDLSPTWDSAPH 391
+ E +++++ E + SL L +Q GRYL I+ SR T NL G+W S W++ H
Sbjct: 372 TDELIQNYRDGEHNKSLEVLTYQMGRYLTIAGSRENTLPTNLNGVWMIGSASQFWNADYH 431
Query: 392 VNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS-----------GWV 440
N+N +MNYW ++ NL+EC P D++ L G TA S G+
Sbjct: 432 FNVNFQMNYWPTMAANLAECMLPYNDYMESLVEPGRVTAGATAGLSTEPGTPIGEGNGFN 491
Query: 441 IHHKTDIWAKSSADRGKVVWALWPMGGA-WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
H +I+ + +V W +GGA W + +++Y YT D D+L + YP+L+ A
Sbjct: 492 AHTVNNIFGTTGP--YQVQEFGWTLGGASWALENSYDYYAYTQDEDYLRDKIYPMLKEQA 549
Query: 500 SFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAA 558
+F +L + L PS SPE + ST D +I E F I+A+
Sbjct: 550 TFYSKFLWHSDYQNRLVVGPSVSPEQ---------GPTTNGSTFDQSIAWEAFEEAINAS 600
Query: 559 EVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW--------AQDFKDPEVH------- 603
E L +ED L + +L P + ++G I EW AQ EV+
Sbjct: 601 EALGVDED-LRATWAEMQSQLNPIIVGDEGQIKEWYEETTIGKAQAGDLDEVNIPNYNAG 659
Query: 604 ----HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
HRH+SHL GLFPG T+ E P+ +AA+ +L+K+G + GWS K WAR D
Sbjct: 660 YAGPHRHISHLVGLFPG-TLINENTPEWLEAAKYSLEKKGFKATGWSKAHKLNTWARTKD 718
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH---------PPFQIDANFGFTAAVA 710
E+ Y+MV+ + + G+ NLFA+H P FQI+AN+G+T+ +
Sbjct: 719 AENTYKMVQAMLS--------SNYAGIMDNLFASHGQGTNHEGTPVFQIEANYGYTSGIN 770
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNND 770
EMLVQS L + +LPA+P + W G V+G+ ARG + + W SNN
Sbjct: 771 EMLVQSQLGYVDMLPAIP-EAWDEGSVEGIVARGNFELDMEW--------------SNNS 815
Query: 771 HDSFKTLHYRG 781
D F L G
Sbjct: 816 ADRFVILSRAG 826
>gi|336321550|ref|YP_004601518.1| alpha-L-fucosidase [[Cellvibrio] gilvus ATCC 13127]
gi|336105131|gb|AEI12950.1| Alpha-L-fucosidase [[Cellvibrio] gilvus ATCC 13127]
Length = 792
Score = 353 bits (905), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 249/825 (30%), Positives = 390/825 (47%), Gaps = 81/825 (9%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG-------DYTNPD 67
+ F GPA + +A P+GNG +GAMV GG +++N+ T W+G P + D
Sbjct: 5 LRFAGPALRWDEAFPLGNGSVGAMVHGGHRRARVQVNDATAWSGHPAGPGLALAELRRRD 64
Query: 68 -APKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFD------DSHLKYA 120
P+ LS +RS + G+ EA + + G A +Q D+ + D + A
Sbjct: 65 VGPRTLSALRSAIAEGRDDEAARLAQRFQGPYAQAFQPFVDLLVTLSPADPTGDDDVDAA 124
Query: 121 EETYRRELDLNTATAR--VKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
E R LDL V + F+S PD + + + + F++ L+
Sbjct: 125 YEG--RSLDLRDGLVHEAVTFESAGCRVMTTWFTSAPDGCLHARWRAPD---VPFSLELE 179
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRI----------------PPKANANDDPKGIQFSAI 222
+ G + +++E G ++ P + + ++ +
Sbjct: 180 L---RGAQPGGPSALVVEAGVVGAQVRVELPFDVAPGHEPDRPGRIAVGSHASLVGYATV 236
Query: 223 LEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF----DGPFINPSDSKKDPTSES 278
L +D R T S ++V G+ W +L +++ GP +P++++ +
Sbjct: 237 L--VSTDGRATASP---GGVRVAGATWVEAVLATATTTRWPEPGPLAHPAEAEHASRERA 291
Query: 279 MSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV 338
+AL + + RH++D++ L ++L P D++ +P A
Sbjct: 292 RAALPP-SPAAGAVAQRRHVEDHRALADATRLELG-EPADLL-----------LPDA--- 335
Query: 339 KSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEM 398
T P+ F FGRYLL+++SRPG NLQG+WN++ P W S +NINL+M
Sbjct: 336 --LGTAPLPARARAAFAFGRYLLMAASRPGAPPVNLQGVWNDEARPPWSSGYTLNINLQM 393
Query: 399 NYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKS---SADR 455
YW + P L C EPL D + L+ G+ A+ Y +GWV HH +D+W +
Sbjct: 394 AYWPAEPTGLGVCVEPLVDQVRVLAREGAAVARDLYGCAGWVAHHNSDVWGWALPVGDGH 453
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLE 515
G WA W MGGAWLC HLW+ Y Y++D D L + +PLL G A+F++DWL+ G L
Sbjct: 454 GDPSWASWWMGGAWLCRHLWDRYEYSLDEDVL-RDVWPLLRGAAAFVVDWLVPDGRGGLV 512
Query: 516 TNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS 575
+PS+SPE+ G+ + ST+D+A+ R++ S + A ++L +E L + + +
Sbjct: 513 PSPSSSPEN-VRERAGREVALCAGSTVDVALARDLLSHCLEAVDILGLDE-PLAARWVDA 570
Query: 576 LPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+ RL + DG + EW D + + HHRHLSHL GLFP + ++ +AA +L
Sbjct: 571 VARLPRPDVDADGLLREWPDDARAIDPHHRHLSHLVGLFPLDEL-VDDPWGRSEAARASL 629
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG GWS+ WK AL ARL D +++ P+ + GGL N+F+ HP
Sbjct: 630 DARGPGSTGWSMAWKAALRARLGDGPGVDEILRGALTRA-PQDGGSWAGGLLPNMFSTHP 688
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQ+D N G AA+AE L+ ST L +LPALP W G GL+ARG V + W G
Sbjct: 689 PFQVDGNLGLVAAMAEALLSSTRTRLVVLPALP-PSWPDGAATGLRARGALVVDLTWAGG 747
Query: 756 DLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQL 800
L E+ ++ D + + G S V L AG L
Sbjct: 748 RLVELVLHPGA-----DGEREVVVDGVSRHVVLRAGTTVRLGEGL 787
>gi|322387111|ref|ZP_08060722.1| alpha-L-fucosidase [Streptococcus infantis ATCC 700779]
gi|321142098|gb|EFX37592.1| alpha-L-fucosidase [Streptococcus infantis ATCC 700779]
Length = 1840
Score = 352 bits (903), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 255/792 (32%), Positives = 394/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 230 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--DRYKVLAEIRK 287
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 288 ALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKGLENVTDYHRGLDISE 347
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSY--- 186
A + Y+ F RE FSS PD V VT +S +L F N + L+ N Y
Sbjct: 348 AISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGDYSWE 407
Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+N I+++G K N G++F++ L IK G ++A
Sbjct: 408 YSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASYLGIKTD---GQVTA- 450
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D L V G+ +A LLL A ++F NP ++ +KD E + +++ + Y L
Sbjct: 451 QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLEKTVKNIVETAKAKGYEKLK 507
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + S T E + ++ ++ L EL F
Sbjct: 508 EDHVKDYQSLFNRVQLNFGGSKSSQTT-------------KEALHTYNPEKGQKLEELFF 554
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E
Sbjct: 555 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMNNLAETA 614
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 615 KPMVNYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWG 669
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 670 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKSSDRWV-SSPS 728
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + AA L+ ++D LV +V +L
Sbjct: 729 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLKVDQD-LVTEVKAKFDKL 778
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I +DG I EW ++ F + E HHRH+SHL GLFPG T+ + P+ +AA
Sbjct: 779 KPLHINQDGRIKEWYEEDSPRFTNEGIENHHRHVSHLVGLFPG-TLFGKDQPEYLEAARA 837
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ + + NL+
Sbjct: 838 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLA-----------EQLKSSTLENLWDT 886
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 887 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 945
Query: 754 DGDLHEVGIYSN 765
+ +L + SN
Sbjct: 946 EKNLETLSFLSN 957
>gi|429766026|ref|ZP_19298301.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
gi|429185266|gb|EKY26251.1| LPXTG-motif protein cell wall anchor domain protein [Clostridium
celatum DSM 1785]
Length = 1927
Score = 352 bits (902), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 242/785 (30%), Positives = 393/785 (50%), Gaps = 94/785 (11%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-----------YTNPDAP---KALS 73
+PIGNG +G V+G + E + NE TLWTG P D Y N + L
Sbjct: 70 LPIGNGDIGGNVYGEIVHERITFNEKTLWTGGPSDKRPNYNGGNKEYANDGITPMYEILQ 129
Query: 74 DVRS----LVDSGQYAEATAASV--KLFG--HPADVYQLLGDIELEF---DDSHLKYAEE 122
VR D G +ATA+S+ +L G YQ G+I L+F D++++
Sbjct: 130 QVRENFALHTDEG---DATASSLCNQLVGISDGYGAYQAWGEINLDFIGIDENNVT---- 182
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y R+L+L A + V Y+ G+ E+ RE+F S+PD V+V ++ + L+F+VS S
Sbjct: 183 DYVRDLNLRNAISSVNYTYGDTEYIRENFVSHPDDVMVIRVEANGENKLNFDVSFPSKQG 242
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V N+ I +EG ++ K N+ ++KI D G ++ DK L
Sbjct: 243 ATTIVE-NDTITLEGEVSDNQL--KYNS-------------QLKIVSDDGEVTEGTDK-L 285
Query: 243 KVEGSDWAVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
VE + A + + A++ + D P ++ ++ + ++++ SY ++ H+ D
Sbjct: 286 TVENATSATIYISAATDYKNDYPEYRTGETAEELDARVGDVIEALDGKSYEEVKADHIAD 345
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYL 360
Y+ +F RV + L ++ +I TD + S E ++ + + FQ+GRYL
Sbjct: 346 YKSIFDRVDLDLGQALPNIPTDELLSGYGNNTVSEEARRALEV--------MFFQYGRYL 397
Query: 361 LISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
I+SSR +Q+ +NLQG+WN +P W S H+N+NL+MNYW + N++EC PL +++
Sbjct: 398 TIASSREDSQLPSNLQGVWNNKNNPAWSSDYHMNVNLQMNYWPTYSTNMAECATPLVEYI 457
Query: 420 TYLSINGSKTAQV------------NYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
L G +TA++ Y+ + + H + + W P
Sbjct: 458 DSLREPGRETARIYAGVESAKDENGEYIEANGFMAHTQNTPFGWTCPGWSFDWGWSPAAV 517
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
W+ ++WE Y YT D +++ YP+++ + + L+ + + ++P+ SPEH
Sbjct: 518 PWILQNVWEMYEYTGDVEYMRDVIYPMMKEEVNLYENMLVWDEVQQRMVSSPTYSPEH-- 575
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE-KVLKSLPRLRPTKIA 585
+ +T + +I +++ I+AAE L + D +VE K +S +L P +I
Sbjct: 576 -------GPRTVGNTYEQTLIWQLYEDTITAAETLGVDADLVVEWKDTQS--KLDPIQIG 626
Query: 586 EDGSIMEWAQDFKDPEV-----HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE 640
+DG I EW ++ + HRH+SHL GLFPG +I++E P+L AA +L R +
Sbjct: 627 DDGQIKEWFEETTLNSIPSEGYGHRHMSHLLGLFPGDSISVET-PELLDAALVSLNNRTD 685
Query: 641 EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
+ GW + + WAR + AY ++ + V GG YSNL+ AHPPFQID
Sbjct: 686 QSTGWGMGQRINSWARAGEGNKAYELLTKQLKRVGTGQANG--GGTYSNLWDAHPPFQID 743
Query: 701 ANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
NFG TA +AEML+QS + +Y LPALP D W+ G GL ARG V W +G +E+
Sbjct: 744 GNFGATAGIAEMLMQSNMGYVYFLPALP-DTWADGSYDGLLARGNFEVGAKWSNGVAYEL 802
Query: 761 GIYSN 765
+ SN
Sbjct: 803 TVKSN 807
>gi|419844091|ref|ZP_14367392.1| gram positive anchor [Streptococcus infantis ATCC 700779]
gi|385702207|gb|EIG39356.1| gram positive anchor [Streptococcus infantis ATCC 700779]
Length = 1757
Score = 352 bits (902), Expect = 6e-94, Method: Compositional matrix adjust.
Identities = 255/792 (32%), Positives = 394/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 147 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--DRYKVLAEIRK 204
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD++
Sbjct: 205 ALEEGNRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKGLENVTDYHRGLDISE 264
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSY--- 186
A + Y+ F RE FSS PD V VT +S +L F N + L+ N Y
Sbjct: 265 AISTTSYTQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWNSLTEDLIANGDYSWE 324
Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+N I+++G K N G++F++ L IK G ++A
Sbjct: 325 YSNYKQGAVTTDSNGILLKGTV-------KDN------GLKFASYLGIKTD---GQVTA- 367
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D L V G+ +A LLL A ++F NP ++ +KD E + +++ + Y L
Sbjct: 368 QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDIDLEKTVKNIVETAKAKGYEKLK 424
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + S T E + ++ ++ L EL F
Sbjct: 425 EDHVKDYQSLFNRVQLNFGGSKSSQTT-------------KEALHTYNPEKGQKLEELFF 471
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW + NL+E
Sbjct: 472 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMNNLAETA 531
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 532 KPMVNYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW-NYYWG 586
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 587 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKSSDRWV-SSPS 645
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + AA L+ ++D LV +V +L
Sbjct: 646 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLKVDQD-LVTEVKAKFDKL 695
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I +DG I EW ++ F + E HHRH+SHL GLFPG T+ + P+ +AA
Sbjct: 696 KPLHINQDGRIKEWYEEDSPRFTNEGIENHHRHVSHLVGLFPG-TLFGKDQPEYLEAARA 754
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ + + NL+
Sbjct: 755 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLA-----------EQLKSSTLENLWDT 803
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 804 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 862
Query: 754 DGDLHEVGIYSN 765
+ +L + SN
Sbjct: 863 EKNLETLSFLSN 874
>gi|187735615|ref|YP_001877727.1| hypothetical protein Amuc_1120 [Akkermansia muciniphila ATCC
BAA-835]
gi|187425667|gb|ACD04946.1| conserved hypothetical protein [Akkermansia muciniphila ATCC
BAA-835]
Length = 796
Score = 351 bits (901), Expect = 7e-94, Method: Compositional matrix adjust.
Identities = 253/777 (32%), Positives = 366/777 (47%), Gaps = 116/777 (14%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
+ PIGNGR+GAM++ E L LNE +LW+ G Y
Sbjct: 65 AEGYPIGNGRVGAMIFSAPGRERLALNEISLWS------------------GGANPGGGY 106
Query: 85 AEATAASVKLFGHPADVYQLLGDIELEFD--DSHLKYAEETYRRELDLNTATARVKYSVG 142
A FG+ Y GD+ ++F D + E + R LDL +V Y
Sbjct: 107 GYGPDAGTNQFGN----YLPFGDLFVDFKKGDQPASLSVEDFTRSLDLRDGIHKVNYKAD 162
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGK 202
V + RE FSS P V+V S+ G S + S++S L G+ I +G
Sbjct: 163 GVTYDREAFSSTPANVLVLNYKASKPGQFSADFSVNSQLGADISAKGS-VITWKGMLK-- 219
Query: 203 RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
G+ + + I GT+SA DK + V+ +D ++++ + +
Sbjct: 220 ------------NGMNYEG--RVLIRPKGGTLSASGDK-ISVKNADSCMVVIAMETDY-- 262
Query: 263 PFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
D KKD ES S + Y+ L H+ Y+ +F RV + ++
Sbjct: 263 ----LMDYKKDWKGESPSRKLDRYAAKAASADYAALKQAHISQYKSMFDRVKVNFGKT-- 316
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQG 376
EE++ +P+ +R+++++ + DP L E +FQFGRYLL+SSSRPGT ANLQG
Sbjct: 317 --------EEDVAKLPTPKRLEAYKKNPADPDLEETMFQFGRYLLLSSSRPGTLPANLQG 368
Query: 377 IWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN--- 433
+WN+ + P W H NIN++M YW + P NLSEC E L +++ ++ +Q N
Sbjct: 369 LWNDYVKPPWACDYHNNINVQMAYWGAEPANLSECHEALVNYVEAMAPGCRDASQANKGF 428
Query: 434 -----YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
GW + +I+ + W G AW H+WEHY +T DR +LE
Sbjct: 429 NTKDGKPVRGWTVRTSQNIFGGNG-------WQWNIPGAAWYALHIWEHYAFTGDRKYLE 481
Query: 489 KRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHE-----------FIAPDG--- 531
K+AYPL++ F D L E G +G+ +TN E E +AP+G
Sbjct: 482 KQAYPLMKEICHFWEDHLKELGAGGEGF-KTNGKDPSEEEKKDLADVKAGTLVAPNGWSP 540
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRLRPTKIAEDGSI 590
+ D +I E+FS I AA +L K DA K L+ L RL KI ++G++
Sbjct: 541 EHGPREDGVMHDQQLIAELFSNTIKAARILGK--DAAWAKSLEGKLKRLAGNKIGKEGNL 598
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSI 647
EW D + P+ HRH SHLF +FPG+ I+ K P L +AA +L+ RG G W+
Sbjct: 599 QEWMID-RIPKTDHRHTSHLFAVFPGNQISKLKTPKLAEAARLSLEWRGTTGDSRRSWTW 657
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
W+TALWARL + A+ MV+ L N+ HPP Q+D NFG
Sbjct: 658 PWRTALWARLGEGNKAHEMVQGLLKF-----------NTLPNMLTTHPPMQMDGNFGIVG 706
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ EMLVQS L ++P+ P + W G VKGLKARG TV WKDG + V +YS
Sbjct: 707 GICEMLVQSHAGGLDIMPS-PVEAWPEGSVKGLKARGNVTVDFSWKDGKVSNVKLYS 762
>gi|149276069|ref|ZP_01882214.1| hypothetical protein PBAL39_22400 [Pedobacter sp. BAL39]
gi|149233497|gb|EDM38871.1| hypothetical protein PBAL39_22400 [Pedobacter sp. BAL39]
Length = 574
Score = 350 bits (899), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 222/585 (37%), Positives = 312/585 (53%), Gaps = 57/585 (9%)
Query: 218 QFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKD-PTS 276
Q +A+L+++ + LK+ ++ +LL A+++F D K++ T+
Sbjct: 15 QATALLQLEGGSAKVQADPQGGSLLKISEANVMTILLSAATNFS------MDRKQNWKTT 68
Query: 277 ESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDT 331
ES +A L+S SY +L +RHL DYQ+L+ RV + L +S EN
Sbjct: 69 ESAAAKVQRLLKSAAAKSYVELLSRHLKDYQQLYGRVKLDLGQS----------NENTIK 118
Query: 332 VPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPH 391
+P+A+R+ ++ DP L L+FQ+GRYLLISSSR G ANLQG+WNE P W S H
Sbjct: 119 MPTAKRLLEYRKSPDPQLEALIFQYGRYLLISSSRRGGLPANLQGLWNESNDPPWGSDYH 178
Query: 392 VNINLEMNYWQSLPCNLSECQEPLFDFLTYL-SINGSKTAQVNYLASGWVIHHKTDIWAK 450
NIN++MNYW + P NLSEC P D + + + T + GW + +++ +
Sbjct: 179 TNINIQMNYWPAEPANLSECHFPYLDHINSIREVRKINTRKEYPGVRGWTLRTESNPFGG 238
Query: 451 SSADRGKVVWALWPM-GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG 509
S LW G AW LWEHY +T D+ +L+ AYP+L+ F D L
Sbjct: 239 ES--------YLWNTPGSAWYAQALWEHYAFTKDKTYLKDFAYPILKEITEFWDDHLKRR 290
Query: 510 HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALV 569
DG L + SPEH T D I+ ++F AA +L + D
Sbjct: 291 PDGTLVSPMGWSPEH---------GPTEDGVTHDQQIVDDLFINYTEAAAILGIDADYRK 341
Query: 570 EKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCK 629
+ L+P KI + G + EW D DP+ HRH+SHLFGL PG +I+ K P+L K
Sbjct: 342 HIIDLKAHLLQP-KIGKWGQLQEWETDRDDPKDTHRHVSHLFGLHPGRSISTIKTPELAK 400
Query: 630 AAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE-GGLYS 688
AA+ +L RG+E GWS+ WK WARL D +HA+ ++ +LV + E GG+Y+
Sbjct: 401 AAKVSLLARGDESTGWSMAWKINFWARLQDGDHAHTIIHNFISLVGGGGVDYNEGGGIYA 460
Query: 689 NLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV 748
NLF AHPPFQID NFG+TA VAEMLVQS +++ LLPALP WS+G V+GLKARG V
Sbjct: 461 NLFCAHPPFQIDGNFGYTAGVAEMLVQSHADEIQLLPALP-KAWSTGKVQGLKARGDFEV 519
Query: 749 S-ICWKDGDLHEVGIYSN--------YSNNDH----DSFKTLHYR 780
S + W +G L + I S Y N H + KT H++
Sbjct: 520 SDMSWSNGQLISISIKSGSGGSCLLRYGNLKHTVITEKGKTYHFK 564
>gi|307707449|ref|ZP_07643931.1| alpha-fucosidase [Streptococcus mitis NCTC 12261]
gi|307616401|gb|EFN95592.1| alpha-fucosidase [Streptococcus mitis NCTC 12261]
Length = 803
Score = 350 bits (898), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 256/828 (30%), Positives = 397/828 (47%), Gaps = 108/828 (13%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN-- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 16 PASTTYKGWEE---EALPIGNGSLGAKVFGIIGAERIQFNEKSLWSGGPLPDSSDYQGGN 72
Query: 66 -PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYA 120
D L+++R ++ Y A + + P Y GDI +EF +
Sbjct: 73 LQDQYGFLAEIRQALEKRDYNRAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLS 132
Query: 121 EET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD- 178
+ T Y+R+L+++ A A Y +F RE F+S PD ++V + + + +L F + L
Sbjct: 133 QVTDYQRQLNISKALATTSYVYKGTKFEREAFASFPDNLLVQRFTKEGAETLDFTIELSL 192
Query: 179 --SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE 224
L + Y ++ I+M+GR ND +QF++ L
Sbjct: 193 SRDLASDGKYEEEKSDYKECKLDITDSHILMKGRVKD---------ND----LQFASCLA 239
Query: 225 IKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS 284
+ G I DK ++ G+ +A L L A + F + K D + ++
Sbjct: 240 WETD---GDIRVWSDKA-QISGASYANLFLAAKTDFAQNPASNYRKKIDLEKQVKDLVEI 295
Query: 285 IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD 344
+ Y+ L +RH+ DYQ LF RV + L ++DT + +K+++
Sbjct: 296 AKEKGYAQLKSRHIQDYQALFQRVQLDLG-------------ADVDTSTTDNLLKNYKPQ 342
Query: 345 EDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
E +L EL FQ+GRYLLISSSR + ANLQG+WN +P W+S H+NINL+MNYW
Sbjct: 343 EGHALEELFFQYGRYLLISSSRDCSDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWP 402
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSS 452
+ NL E P+ +++ L + G + A Y +GW++H + W
Sbjct: 403 AYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPG 461
Query: 453 ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
D W P AW+ ++E Y++ D+D+L ++ YP+L F D+L E
Sbjct: 462 WD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNDFLHEDQQA 518
Query: 513 Y-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEK 571
++PS SPEH +S +T D ++I ++F I AA+ LE + D L E
Sbjct: 519 QRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELELDADLLTE- 568
Query: 572 VLKSLPRLRPTKIAEDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNP 625
V + L P +I + G I EW Q F++ +V HRH SHL GL+PG+ + K
Sbjct: 569 VKEKFDLLNPLQITQSGRIREWYEEEEQHFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQ 627
Query: 626 DLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGG 685
+ ++A +L RG+ G GWS K LWARL D A++++ + +
Sbjct: 628 EYLESARASLNDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKSS 676
Query: 686 LYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG 745
NL+ +HPPFQID NFG ++ +AEML+QS L L ALP D WS+G V GL ARG
Sbjct: 677 TLPNLWCSHPPFQIDGNFGASSGMAEMLLQSHTAYLVPLAALP-DAWSTGSVSGLMARGH 735
Query: 746 ETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKI 793
+S+ W D L ++ I S S+ + + V+VN K+
Sbjct: 736 FEISMRWADKKLFQLTILSRSGGELRVSYPGIE--NSVVEVNQEKAKV 781
>gi|417846683|ref|ZP_12492676.1| hypothetical protein HMPREF9958_1458 [Streptococcus mitis SK1073]
gi|339458316|gb|EGP70859.1| hypothetical protein HMPREF9958_1458 [Streptococcus mitis SK1073]
Length = 803
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 253/800 (31%), Positives = 387/800 (48%), Gaps = 106/800 (13%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF +
Sbjct: 72 NLQDQYAFLAEIRQALEKRDYNTAKELAEQHLVGPKTSQYGTYLSFGDIHIEFSNQGKTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A A Y +F RE F+S PD +V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALATTSYVYKGTKFERETFASFPDDFLVQRFTKEGAETLDFTIELS 191
Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M+GR ND +QF++ L
Sbjct: 192 LSRDLASDGKYEQEKSDYKECKLDITDSYILMKGRVKD---------ND----LQFASYL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ G I DK +++ G+ +A L L A + F + K D + +
Sbjct: 239 AWETD---GDIRVWSDK-VQISGASYANLFLAAKTDFAQNPASNYRKKIDLEQQVKDLVD 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ + Y+ L +RH++DYQ LF RV + L ++DT + + +K+++
Sbjct: 295 TAKEKGYAQLKSRHIEDYQALFQRVQLDLG-------------ADVDTSTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
E +L E+ FQ+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNYW
Sbjct: 342 QEGQALEEMFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
+ NL E P+ +++ L + G + A Y +GW++H + W
Sbjct: 402 PAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAP 460
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
D W P AW+ ++E Y++ D+D+L ++ YP+L F +L +
Sbjct: 461 GWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQ 517
Query: 512 -GYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
++PS SPEH +S ++ D ++I ++F I AA+ L +ED L E
Sbjct: 518 VQRWVSSPSYSPEH---------GPISIGNSYDQSLIWQLFHDFIQAAQELSLDEDLLTE 568
Query: 571 KVLKSLPRLRPTKIAEDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKN 624
V + L P +I + G I EW Q F++ +V HRH SHL GL+PG+ + K
Sbjct: 569 -VKEKFDLLNPLQITQSGRIREWYEEEEQHFQNEKVEAQHRHASHLVGLYPGNLFSY-KG 626
Query: 625 PDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEG 684
D +AA +L RG+ G GWS K LWARL D A+++ + +
Sbjct: 627 QDYLEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHKLFA-----------EQLKT 675
Query: 685 GLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARG 744
NL+ HPPFQID NFG T+ +AEML+QS L L ALP D WSSG V GL ARG
Sbjct: 676 STLPNLWCTHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSSGSVSGLMARG 734
Query: 745 GETVSICWKDGDLHEVGIYS 764
VS+ W D L ++ I S
Sbjct: 735 HYEVSMRWADKKLLQLTILS 754
>gi|419767010|ref|ZP_14293181.1| alpha-L-fucosidase [Streptococcus mitis SK579]
gi|383353528|gb|EID31137.1| alpha-L-fucosidase [Streptococcus mitis SK579]
Length = 803
Score = 350 bits (897), Expect = 2e-93, Method: Compositional matrix adjust.
Identities = 252/814 (30%), Positives = 391/814 (48%), Gaps = 105/814 (12%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSFDYQGGNLQDQYSFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GD+ +EF + T Y+R+L+++ A
Sbjct: 87 LEKRDYNTAKELAEQHLVGPKTSQYGTYLSFGDLLIEFSRQGKTLFQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNG- 189
A Y+ F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LATTSYAYKGTMFKREAFASFPDDLLVQRFTKEGAETLDFTIELSLTRDLTSDEKYEQKK 206
Query: 190 -----------NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ++F+ L + G I
Sbjct: 207 SDYKECQLEITDSHILMKGRVK-------------DNNLRFAGCLAWQTD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
DK +++ G+ +A L L A + F + K D + +++ + Y+ L +RH+
Sbjct: 251 DK-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLVQQVRDLVETAKEKGYAQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
DYQ LF RV + L ++DT + + +K+++ E +L EL FQ+GR
Sbjct: 310 QDYQALFQRVQLDLG-------------ADVDTSTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQGIWN +P W+S H+NINL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGIWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A Y +GW++H + W D W P
Sbjct: 417 NYIDDLRVYG-RLAAARYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F D+L E ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNDFLHEDRQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L + D L E V + L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGMDADLLTE-VKEKFDLLNPLQIT 582
Query: 586 EDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW Q F++ +V HRH SHL GL+PG+ + K + AA +L RG
Sbjct: 583 QSGRIREWYEEEEQHFQNEKVEAQHRHASHLVGLYPGNLFS-HKGQEYLDAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKSSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHTAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMRWEDKKLLQ 749
Query: 760 VGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKI 793
+ I S + S+ + + ++VN K+
Sbjct: 750 MTILSRSGGDLRVSYPGIE--KSVIEVNQEKAKV 781
>gi|444414515|ref|ZP_21210772.1| hypothetical protein PNI0199_00487 [Streptococcus pneumoniae
PNI0199]
gi|444281657|gb|ELU86965.1| hypothetical protein PNI0199_00487 [Streptococcus pneumoniae
PNI0199]
Length = 803
Score = 350 bits (897), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 249/789 (31%), Positives = 387/789 (49%), Gaps = 84/789 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALSANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A++++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D
Sbjct: 687 PFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDK 745
Query: 756 DLHEVGIYS 764
L ++ I S
Sbjct: 746 KLLQLTILS 754
>gi|418101115|ref|ZP_12738198.1| hypothetical protein SPAR128_1534 [Streptococcus pneumoniae
7286-06]
gi|418183181|ref|ZP_12819739.1| hypothetical protein SPAR78_1589 [Streptococcus pneumoniae GA43380]
gi|418196314|ref|ZP_12832790.1| hypothetical protein SPAR103_1510 [Streptococcus pneumoniae
GA47688]
gi|418223856|ref|ZP_12850496.1| hypothetical protein SPAR127_1557 [Streptococcus pneumoniae
5185-06]
gi|419447313|ref|ZP_13987318.1| fibronectin type III domain protein [Streptococcus pneumoniae
7879-04]
gi|353770615|gb|EHD51127.1| hypothetical protein SPAR128_1534 [Streptococcus pneumoniae
7286-06]
gi|353848164|gb|EHE28181.1| hypothetical protein SPAR78_1589 [Streptococcus pneumoniae GA43380]
gi|353860325|gb|EHE40270.1| hypothetical protein SPAR103_1510 [Streptococcus pneumoniae
GA47688]
gi|353878654|gb|EHE58484.1| hypothetical protein SPAR127_1557 [Streptococcus pneumoniae
5185-06]
gi|379614853|gb|EHZ79563.1| fibronectin type III domain protein [Streptococcus pneumoniae
7879-04]
Length = 778
Score = 349 bits (896), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 249/789 (31%), Positives = 387/789 (49%), Gaps = 84/789 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A++++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D
Sbjct: 687 PFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDK 745
Query: 756 DLHEVGIYS 764
L ++ I S
Sbjct: 746 KLLQLTILS 754
>gi|221232393|ref|YP_002511546.1| hypothetical protein SPN23F_16560 [Streptococcus pneumoniae ATCC
700669]
gi|225857271|ref|YP_002738782.1| alpha-fucosidase [Streptococcus pneumoniae P1031]
gi|298254439|ref|ZP_06978025.1| alpha-fucosidase [Streptococcus pneumoniae str. Canada MDR_19A]
gi|298503399|ref|YP_003725339.1| alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
gi|410477028|ref|YP_006743787.1| hypothetical protein HMPREF1038_01639 [Streptococcus pneumoniae
gamPNI0373]
gi|415700118|ref|ZP_11457832.1| fibronectin type III domain protein [Streptococcus pneumoniae
459-5]
gi|415752860|ref|ZP_11479842.1| fibronectin type III domain protein [Streptococcus pneumoniae SV36]
gi|418079078|ref|ZP_12716300.1| fibronectin type III domain protein [Streptococcus pneumoniae
4027-06]
gi|418081275|ref|ZP_12718485.1| fibronectin type III domain protein [Streptococcus pneumoniae
6735-05]
gi|418083460|ref|ZP_12720657.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44288]
gi|418123978|ref|ZP_12760909.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44378]
gi|418128522|ref|ZP_12765415.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP170]
gi|418178700|ref|ZP_12815283.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41565]
gi|419427712|ref|ZP_13967893.1| fibronectin type III domain protein [Streptococcus pneumoniae
5652-06]
gi|419436452|ref|ZP_13976539.1| fibronectin type III domain protein [Streptococcus pneumoniae
8190-05]
gi|419450962|ref|ZP_13990948.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP02]
gi|419473709|ref|ZP_14013558.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13430]
gi|444394527|ref|ZP_21192078.1| hypothetical protein PNI0002_00522 [Streptococcus pneumoniae
PNI0002]
gi|444398107|ref|ZP_21195590.1| hypothetical protein PNI0006_01690 [Streptococcus pneumoniae
PNI0006]
gi|444402905|ref|ZP_21200052.1| hypothetical protein PNI0008_01502 [Streptococcus pneumoniae
PNI0008]
gi|444404353|ref|ZP_21201309.1| hypothetical protein PNI0009_00413 [Streptococcus pneumoniae
PNI0009]
gi|444407726|ref|ZP_21204393.1| hypothetical protein PNI0010_01147 [Streptococcus pneumoniae
PNI0010]
gi|444409151|ref|ZP_21205749.1| hypothetical protein PNI0076_00189 [Streptococcus pneumoniae
PNI0076]
gi|444412799|ref|ZP_21209118.1| hypothetical protein PNI0153_01181 [Streptococcus pneumoniae
PNI0153]
gi|444422007|ref|ZP_21217672.1| hypothetical protein PNI0446_00358 [Streptococcus pneumoniae
PNI0446]
gi|220674854|emb|CAR69429.1| conserved hypothetical protein [Streptococcus pneumoniae ATCC
700669]
gi|225726032|gb|ACO21884.1| alpha-fucosidase [Streptococcus pneumoniae P1031]
gi|298238994|gb|ADI70125.1| possible alpha-L-fucosidase [Streptococcus pneumoniae TCH8431/19A]
gi|353746605|gb|EHD27265.1| fibronectin type III domain protein [Streptococcus pneumoniae
4027-06]
gi|353752014|gb|EHD32645.1| fibronectin type III domain protein [Streptococcus pneumoniae
6735-05]
gi|353754680|gb|EHD35292.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44288]
gi|353795798|gb|EHD76144.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44378]
gi|353799021|gb|EHD79344.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP170]
gi|353842759|gb|EHE22805.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41565]
gi|379550873|gb|EHZ15969.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13430]
gi|379612891|gb|EHZ77606.1| fibronectin type III domain protein [Streptococcus pneumoniae
8190-05]
gi|379617905|gb|EHZ82585.1| fibronectin type III domain protein [Streptococcus pneumoniae
5652-06]
gi|379622667|gb|EHZ87301.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP02]
gi|381308507|gb|EIC49350.1| fibronectin type III domain protein [Streptococcus pneumoniae SV36]
gi|381314814|gb|EIC55580.1| fibronectin type III domain protein [Streptococcus pneumoniae
459-5]
gi|406369973|gb|AFS43663.1| hypothetical protein HMPREF1038_01639 [Streptococcus pneumoniae
gamPNI0373]
gi|444259769|gb|ELU66078.1| hypothetical protein PNI0002_00522 [Streptococcus pneumoniae
PNI0002]
gi|444260764|gb|ELU67072.1| hypothetical protein PNI0006_01690 [Streptococcus pneumoniae
PNI0006]
gi|444265666|gb|ELU71662.1| hypothetical protein PNI0008_01502 [Streptococcus pneumoniae
PNI0008]
gi|444271322|gb|ELU77073.1| hypothetical protein PNI0010_01147 [Streptococcus pneumoniae
PNI0010]
gi|444274038|gb|ELU79693.1| hypothetical protein PNI0153_01181 [Streptococcus pneumoniae
PNI0153]
gi|444276986|gb|ELU82513.1| hypothetical protein PNI0009_00413 [Streptococcus pneumoniae
PNI0009]
gi|444280076|gb|ELU85452.1| hypothetical protein PNI0076_00189 [Streptococcus pneumoniae
PNI0076]
gi|444288631|gb|ELU93522.1| hypothetical protein PNI0446_00358 [Streptococcus pneumoniae
PNI0446]
Length = 803
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 249/789 (31%), Positives = 387/789 (49%), Gaps = 84/789 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A++++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D
Sbjct: 687 PFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDK 745
Query: 756 DLHEVGIYS 764
L ++ I S
Sbjct: 746 KLLQLTILS 754
>gi|444387033|ref|ZP_21185059.1| hypothetical protein PCS125219_00445 [Streptococcus pneumoniae
PCS125219]
gi|444389242|ref|ZP_21187159.1| hypothetical protein PCS70012_00258 [Streptococcus pneumoniae
PCS70012]
gi|444393004|ref|ZP_21190665.1| hypothetical protein PCS81218_01475 [Streptococcus pneumoniae
PCS81218]
gi|444400918|ref|ZP_21198254.1| hypothetical protein PNI0007_02076 [Streptococcus pneumoniae
PNI0007]
gi|444418365|ref|ZP_21214349.1| hypothetical protein PNI0360_01769 [Streptococcus pneumoniae
PNI0360]
gi|444419893|ref|ZP_21215727.1| hypothetical protein PNI0427_00756 [Streptococcus pneumoniae
PNI0427]
gi|444254243|gb|ELU60689.1| hypothetical protein PCS125219_00445 [Streptococcus pneumoniae
PCS125219]
gi|444257842|gb|ELU64175.1| hypothetical protein PCS70012_00258 [Streptococcus pneumoniae
PCS70012]
gi|444262591|gb|ELU68882.1| hypothetical protein PCS81218_01475 [Streptococcus pneumoniae
PCS81218]
gi|444264795|gb|ELU70844.1| hypothetical protein PNI0007_02076 [Streptococcus pneumoniae
PNI0007]
gi|444281712|gb|ELU87019.1| hypothetical protein PNI0360_01769 [Streptococcus pneumoniae
PNI0360]
gi|444285998|gb|ELU91006.1| hypothetical protein PNI0427_00756 [Streptococcus pneumoniae
PNI0427]
Length = 803
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 249/789 (31%), Positives = 387/789 (49%), Gaps = 84/789 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A++++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D
Sbjct: 687 PFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMIWEDK 745
Query: 756 DLHEVGIYS 764
L ++ I S
Sbjct: 746 KLLQLTILS 754
>gi|197302981|ref|ZP_03168031.1| hypothetical protein RUMLAC_01709 [Ruminococcus lactaris ATCC
29176]
gi|197297976|gb|EDY32526.1| LPXTG-motif cell wall anchor domain protein [Ruminococcus lactaris
ATCC 29176]
Length = 1960
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 254/840 (30%), Positives = 406/840 (48%), Gaps = 122/840 (14%)
Query: 1 MMNAESTSTT-----NPLKITFNGPA---KHFT----DAIPIGNGRLGAMVWGGVPSETL 48
+NAE + T N LK+ + PA K++ ++PIGNG +G V+GG+ E +
Sbjct: 29 QVNAEPAAVTQQTGDNDLKLWYTSPADITKYYEGWQEKSLPIGNGAIGGTVFGGITRERI 88
Query: 49 KLNEDTLWTGVP---------GDYTNPDAPKA-LSDVRSLVDSGQYAEATA-ASVKLFGH 97
+LN+ +LW+G P G+ N A ++ + + +GQ + A + A+ L G
Sbjct: 89 QLNDKSLWSGGPSTSRPNYNGGNLENKGNNGATMTSIHNYFANGQDSSAISLANSNLVGV 148
Query: 98 PADV-------YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREH 150
D Y G++ ++F + Y R+LDL TA A V Y G+ ++RE+
Sbjct: 149 SDDAGTNGYGYYLSWGNMYIDFKNVSSNNDVTNYTRDLDLKTAIAGVNYDKGSTHYSREN 208
Query: 151 FSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG----NNQIIMEGRCPGKRIPP 206
F+S PD VIVT I+ S +S +VS++ S +NG + Q + RI
Sbjct: 209 FTSYPDNVIVTHITADGSEKISLDVSVEPDNSRGSAINGIGDSSYQRTWDTTVSDGRISI 268
Query: 207 KANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFIN 266
D+ ++FS+ ++ I+D+ GT++ D K+ V G+ ++ + + +
Sbjct: 269 NGQLTDNQ--MKFSSQTQV-ITDNAGTVTD-GDGKVSVSGASEVTIITSMGTDYKDEY-- 322
Query: 267 PSDSKKDPTSESMSALQ------SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIV 320
PS + SE + ++ +++ +Y +L H+ DYQ++F+RV + L +
Sbjct: 323 PSYRTGETASELTNRVKWYVDQAAVK--TYEELKANHVSDYQEIFNRVDLNLGQ------ 374
Query: 321 TDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG----------TQ 370
T S + D + SA + + E L +LFQ+GR++ I SSR T
Sbjct: 375 --TVSTKTTDALLSAYKAGTASEAERRQLEVMLFQYGRFMTIESSRETKTDGNGYVRETL 432
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
+NLQG+W + W S H+N+NL+MNYW + N++EC +PL D++ L G TA
Sbjct: 433 PSNLQGLWVGANNSPWHSDYHMNVNLQMNYWPTYSTNMAECAQPLVDYIDALREPGRVTA 492
Query: 431 QVNYLAS-------GWVIHHKTD-------IWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+ S G++ H + + W+ S W P W+ + W
Sbjct: 493 AIYAGVSSADGEENGFMAHTQNNPFGWTCPGWSFS--------WGWSPAAVPWILQNCWA 544
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
+Y YT D +L YP+++ A L+ DG L ++P+ SPEH V
Sbjct: 545 YYEYTGDTSYLRDNIYPMMKEEAKLYDRMLVRDSDGKLVSSPAYSPEH---------GPV 595
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ +T + +I +++ I AAEVL + D + P ++ + G I EW +
Sbjct: 596 TSGNTYEQTLIWQLYEDTIKAAEVLGTDADLVATWKANQADLKGPIEVGDSGQIKEWYTE 655
Query: 597 FK----------DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
+HRH+SHL GLFPG IT E + + AA+ ++Q R +E GW
Sbjct: 656 TTFNHTASGATLGEGYNHRHMSHLLGLFPGDLIT-EDHAEWFAAAKVSMQNRTDESTGWG 714
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP--FQIDANFG 704
+ + WARL D Y+++K LFN GG+Y+NLF H P FQID NFG
Sbjct: 715 MAQRINSWARLGDGNKTYQIIKNLFN-----------GGIYANLFDYHQPKYFQIDGNFG 763
Query: 705 FTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+T+ VAEML+QS + LLPA+P D W++G V GL A+G VS+ WKDG++ I S
Sbjct: 764 YTSGVAEMLLQSNAGYINLLPAVP-DDWANGSVNGLVAQGNFKVSMDWKDGNVTTATILS 822
>gi|419428224|ref|ZP_13968401.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5652-06]
gi|379616100|gb|EHZ80800.1| putative alpha-L-fucosidase [Streptococcus pneumoniae 5652-06]
Length = 707
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 239/721 (33%), Positives = 356/721 (49%), Gaps = 93/721 (12%)
Query: 72 LSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRREL 128
L +R + G+ +A + +F P D Y+LLG++ +E D A Y REL
Sbjct: 3 LKKIREYLLDGEIQKAEELIKLTMFATPRDQSHYELLGELYIEHIDIQ-SCALSLYEREL 61
Query: 129 DLNTATARVKY--SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNH 184
DL+TA + V + + N++ RE+F+S ++ +I S +L+ N++L + ++
Sbjct: 62 DLDTAISNVVFDPNSCNLQIKREYFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDE 121
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
++ I+M G+ KG+QF + K++D G +S L + + +
Sbjct: 122 VSKLDSSTILMSASAGGR------------KGVQFKVVCHSKVTD--GEVSVL-GETIVI 166
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQK 303
+ L L + +++ G +S+LQ ++ Y H+ YQ+
Sbjct: 167 RNATEVFLYLKSMTNYWGNI-------------DISSLQGEFSSIDYFTEKDEHVKKYQE 213
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
F+RV +L S KD ++ I T E K + L LLF +GRYLLIS
Sbjct: 214 QFNRVDFKLDYS-KDCLS-------IPTNLLLENTKKYSN----YLTNLLFHYGRYLLIS 261
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SS+P ANLQGIW ++L+P W S +NIN +MNYW PC+L E + PLFD L +
Sbjct: 262 SSQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMR 321
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G TA+ Y A G+ HH TD + ++ + A+W + WLCTH+WEHY Y D
Sbjct: 322 EPGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQD 381
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
L + + +++ F D+L E DGYL T PS SPE+++ +G SST+D
Sbjct: 382 ERILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTID 439
Query: 544 MAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
I+R + I A+ L N D + V+++ K LP+ TKI +G I EW +D+++ E
Sbjct: 440 NQILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGSNGQIQEWLEDYEEVE 496
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR----------------------- 638
HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 497 PGHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLSSQEREQAINNWLVSG 556
Query: 639 --GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP 696
GWS W +ARL+ E AY + L N NLF HPP
Sbjct: 557 LHASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN-----------NATLGNLFLDHPP 605
Query: 697 FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
FQID N G + + E+LVQS N L L+PALP WS G VKG + RGG VS WK+GD
Sbjct: 606 FQIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGD 664
Query: 757 L 757
+
Sbjct: 665 I 665
>gi|419445162|ref|ZP_13985177.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA19923]
gi|379572855|gb|EHZ37812.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA19923]
Length = 778
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 249/789 (31%), Positives = 387/789 (49%), Gaps = 84/789 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEEGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A++++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D
Sbjct: 687 PFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDK 745
Query: 756 DLHEVGIYS 764
L ++ I S
Sbjct: 746 KLLQLTILS 754
>gi|335029650|ref|ZP_08523157.1| hypothetical protein HMPREF9967_1785 [Streptococcus infantis
SK1076]
gi|334268947|gb|EGL87379.1| hypothetical protein HMPREF9967_1785 [Streptococcus infantis
SK1076]
Length = 806
Score = 349 bits (895), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 255/798 (31%), Positives = 399/798 (50%), Gaps = 102/798 (12%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GD 62
P +++G K A+P+GNG +GA ++G + E ++ NE TLW+G P G+
Sbjct: 14 PTAPSYDGWEKQ---ALPVGNGEMGAKIFGLIGEERIQYNEKTLWSGGPQLDSTDYNGGN 70
Query: 63 YTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLK 118
Y D K L+++R +++G +A + + P + Y GDI + F++
Sbjct: 71 YQ--DRYKVLAEIRKALEAGDRQKAKQLAERNLVGPNNAQYGRYLSFGDIFMVFNNQKKG 128
Query: 119 YAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---N 174
T Y R+LD+ A YS F RE FSS PD V VT +S +L F N
Sbjct: 129 LENVTDYHRDLDITEAITTTSYSQDGTNFKRETFSSYPDDVTVTHLSKKGDKTLDFTLWN 188
Query: 175 VSLDSLLDNHSY---VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDR 231
++LL N Y + Q + G I K D+ G++F++ L IK
Sbjct: 189 SLTENLLANGDYSWEYSNYKQGAVTTDSNG--ILLKGTVKDN--GLKFASYLGIKTD--- 241
Query: 232 GTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNL 288
G ++A +D L V G+ +A LLL +++ NP ++ +KD E+ S +++ +
Sbjct: 242 GQVTA-QDGYLTVTGASYATLLLSVKTNYAQ---NPKTNYRKDIDVENTVKSIVEAAKAK 297
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
Y L H+ DYQ LF+RV + L N + + E ++++ +
Sbjct: 298 DYETLKNNHIKDYQSLFNRVQLNLGG-------------NKSSQTTKEALQTYDPTKGQQ 344
Query: 349 LVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
L EL FQ+GRYLLISSSR T ANLQG+WN +P W+S H+N+NL+MNYW +
Sbjct: 345 LEELFFQYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYMN 404
Query: 407 NLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADR 455
NL+E +P+ +++ + G SK Q N GW++H + + ++
Sbjct: 405 NLAETAKPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GWLVHTQATPFGWTTPGW 460
Query: 456 GKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGY 513
W P AW+ +++++Y +T D +L+++ YP+L+ F +L + D +
Sbjct: 461 -NYYWGWSPAANAWMMQNVYDYYKFTKDEAYLKEKIYPMLKETTKFWNSFLHYDKSSDRW 519
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL 573
+ ++PS SPEH ++ +T D +++ ++F + AA L ++D LV +V
Sbjct: 520 V-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEAANHLNVDQD-LVTEVK 568
Query: 574 KSLPRLRPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDL 627
+L+P I +DG I EW ++ F + E HHRH+SHL G+FPG T+ + +
Sbjct: 569 AKFDKLKPLHINQDGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGIFPG-TLFGKDQHEY 627
Query: 628 CKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLY 687
+AA TL RG+ G GWS K LWARL D A+R++ + +
Sbjct: 628 LEAARATLNHRGDCGTGWSKANKINLWARLLDGNRAHRLLA-----------EQLKSSTL 676
Query: 688 SNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGET 747
NL+ H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG
Sbjct: 677 ENLWDTHEPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFE 735
Query: 748 VSICWKDGDLHEVGIYSN 765
VS+ WK+ +L + SN
Sbjct: 736 VSMKWKERNLETLSFLSN 753
>gi|154503020|ref|ZP_02040080.1| hypothetical protein RUMGNA_00842 [Ruminococcus gnavus ATCC 29149]
gi|153796374|gb|EDN78794.1| fibronectin type III domain protein [Ruminococcus gnavus ATCC
29149]
Length = 2168
Score = 348 bits (894), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 262/825 (31%), Positives = 404/825 (48%), Gaps = 110/825 (13%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTL 55
+++AE + + LK+ + A D ++PIGN +GA V+GGV +E ++LNE +L
Sbjct: 32 VVHAEESQDRSELKLRYTSAAPDSYDGWEKWSLPIGNSGIGASVFGGVQTERIQLNEKSL 91
Query: 56 WTGVPGDYTNPD-----------APKALSDVRSLVDSGQYAEATAASVKLFGHPADV--- 101
W+G P + + PD + + +++ L +G A++ +L G D
Sbjct: 92 WSGGPSE-SRPDYNGGNLEEKGRNGQTVKEIQQLFANGDNDAASSKCGELVGLSDDAGVN 150
Query: 102 ----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
Y G++ L+F K E Y R LDLNTA A V+Y G+ +TRE+F S PD
Sbjct: 151 GYGYYLSYGNMYLDFKGISDKDVE-NYERTLDLNTAIAGVEYDNGDTHYTRENFVSYPDN 209
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM--------EGRCPGKRIPPKAN 209
V+VT+++ L+ +V ++ DN + N I E I
Sbjct: 210 VLVTRLTAEGGDKLNLDVRVEP--DNEAGGGSNKNTIQAQSYQREWETTVKDALISIDGQ 267
Query: 210 ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG----PFI 265
D+ ++FS+ + K+ + GT ED KV D + ++ S D P
Sbjct: 268 LKDNQ--MRFSS--QTKVLTEGGTT---EDGDEKVTVKDAKAVTIITSIGTDYKNDYPVY 320
Query: 266 NPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVT 321
+S++ S + A ++ N SY L H+DDY +F RV++ L + P
Sbjct: 321 RTGESQEQVASRVRAYVDKAADTVVNDSYDTLKQAHVDDYSSIFGRVNLDLGQVP----- 375
Query: 322 DTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP--------GTQVAN 373
SE+ D + A S E L +LFQ+GRYL I SSR T +N
Sbjct: 376 ---SEKTTDKLLKAYNDGSASEQERRYLEVMLFQYGRYLTIESSRETPEDDPSRATLPSN 432
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIW S W S H+N+NL+MNYW + N++EC +PL ++ L G TA++
Sbjct: 433 LQGIWVGANSSAWHSDYHMNVNLQMNYWPTYSTNMAECAQPLISYVDSLREPGRVTAKIY 492
Query: 434 Y-LASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
+ G++ H + + + + S D W P W+ + WE+Y +T D +++
Sbjct: 493 AGVDQGFMAHTQNNPFGWTCPGWSFD-----WGWSPAAVPWILQNCWEYYEFTGDVSYMQ 547
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
YP+++ A F + LI+ G+L ++PS SPEH P + A +Y T+ I
Sbjct: 548 NYIYPMMKEEAIFYDNILIDDGTGHLVSSPSYSPEH---GP--RTAGNTYEQTL----IW 598
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIAEDGSIMEWAQDFKDPEVH---- 603
+++ I AAE L + D LV RL+ P +I + G I EW +++ V+
Sbjct: 599 QLYEDTIKAAETLGVDAD-LVATWKDHQSRLKGPIEIGDSGQIKEW---YEETTVNSMGQ 654
Query: 604 ---HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
HRH+SH+ GLFPG I+ + P+ +AA ++ R +E GW + + WARL D
Sbjct: 655 GYGHRHISHMLGLFPGDLISSD-TPEYFEAARVSMNNRTDESTGWGMGQRINTWARLADG 713
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
AY+++ LF + G+ +NL+ HPPFQID NFG T+ VAEML+QS +
Sbjct: 714 NRAYKLITDLF-----------KNGIMTNLWDTHPPFQIDGNFGMTSGVAEMLLQSNMGY 762
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
+ +LPALP D W+SG V GL ARG VS+ WK+ L I SN
Sbjct: 763 INMLPALP-DAWASGSVSGLVARGNFEVSMNWKNKHLTSAEILSN 806
>gi|336433106|ref|ZP_08612933.1| hypothetical protein HMPREF0991_02052, partial [Lachnospiraceae
bacterium 2_1_58FAA]
gi|336017272|gb|EGN47037.1| hypothetical protein HMPREF0991_02052 [Lachnospiraceae bacterium
2_1_58FAA]
Length = 1786
Score = 348 bits (894), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 262/825 (31%), Positives = 404/825 (48%), Gaps = 110/825 (13%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTL 55
+++AE + + LK+ + A D ++PIGN +GA V+GGV +E ++LNE +L
Sbjct: 32 VVHAEESQDRSELKLRYTSAAPDSYDGWEKWSLPIGNSGIGASVFGGVQTERIQLNEKSL 91
Query: 56 WTGVPGDYTNPD-----------APKALSDVRSLVDSGQYAEATAASVKLFGHPADV--- 101
W+G P + + PD + + +++ L +G A++ +L G D
Sbjct: 92 WSGGPSE-SRPDYNGGNLEEKGRNGQTVKEIQQLFANGDNDAASSKCGELVGLSDDAGVN 150
Query: 102 ----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
Y G++ L+F K E Y R LDLNTA A V+Y G+ +TRE+F S PD
Sbjct: 151 GYGYYLSYGNMYLDFKGISDKDVE-NYERTLDLNTAIAGVEYDNGDTHYTRENFVSYPDN 209
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM--------EGRCPGKRIPPKAN 209
V+VT+++ L+ +V ++ DN + N I E I
Sbjct: 210 VLVTRLTAEGGDKLNLDVRVEP--DNEAGGGSNKNTIQAQSYQREWETTVKDALISIDGQ 267
Query: 210 ANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG----PFI 265
D+ ++FS+ + K+ + GT ED KV D + ++ S D P
Sbjct: 268 LKDNQ--MRFSS--QTKVLTEGGTT---EDGDEKVTVKDAKAVTIITSIGTDYKNDYPVY 320
Query: 266 NPSDSKKDPTSESMS----ALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVT 321
+S++ S + A ++ N SY L H+DDY +F RV++ L + P
Sbjct: 321 RTGESQEQVASRVRAYVDKAADTVVNDSYDTLKQAHVDDYSSIFGRVNLDLGQVP----- 375
Query: 322 DTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP--------GTQVAN 373
SE+ D + A S E L +LFQ+GRYL I SSR T +N
Sbjct: 376 ---SEKTTDKLLKAYNDGSASEQERRYLEVILFQYGRYLTIESSRETPEDDPSRATLPSN 432
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
LQGIW S W S H+N+NL+MNYW + N++EC +PL ++ L G TA++
Sbjct: 433 LQGIWVGANSSAWHSDYHMNVNLQMNYWPTYSTNMAECAQPLISYVDSLREPGRVTAKIY 492
Query: 434 Y-LASGWVIHHKTDIWAKS----SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
+ G++ H + + + + S D W P W+ + WE+Y +T D +++
Sbjct: 493 AGVDQGFMAHTQNNPFGWTCPGWSFD-----WGWSPAAVPWILQNCWEYYEFTGDVSYMQ 547
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
YP+++ A F + LI+ G+L ++PS SPEH P + A +Y T+ I
Sbjct: 548 NYIYPMMKEEAIFYDNILIDDGTGHLVSSPSYSPEH---GP--RTAGNTYEQTL----IW 598
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PTKIAEDGSIMEWAQDFKDPEVH---- 603
+++ I AAE L + D LV RL+ P +I + G I EW +++ V+
Sbjct: 599 QLYEDTIKAAETLGVDAD-LVATWKDHQSRLKGPIEIGDSGQIKEW---YEETTVNSMGQ 654
Query: 604 ---HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQ 660
HRH+SH+ GLFPG I+ + P+ +AA ++ R +E GW + + WARL D
Sbjct: 655 GYGHRHISHMLGLFPGDLISSD-TPEYFEAARVSMNNRTDESTGWGMGQRINTWARLADG 713
Query: 661 EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLND 720
AY+++ LF + G+ +NL+ HPPFQID NFG T+ VAEML+QS +
Sbjct: 714 NRAYKLITDLF-----------KNGIMTNLWDTHPPFQIDGNFGMTSGVAEMLLQSNMGY 762
Query: 721 LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
+ +LPALP D W+SG V GL ARG VS+ WK+ L I SN
Sbjct: 763 INMLPALP-DAWASGSVSGLVARGNFEVSMNWKNKHLTSAEILSN 806
>gi|418198481|ref|ZP_12834939.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47778]
gi|353861591|gb|EHE41526.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47778]
Length = 782
Score = 348 bits (893), Expect = 6e-93, Method: Compositional matrix adjust.
Identities = 246/774 (31%), Positives = 381/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
Y F RE F+S PD ++V + + + +L F + L D S +
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEEGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E N+D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 572
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 573 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 631
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 632 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 680
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 681 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 733
>gi|418137717|ref|ZP_12774555.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11663]
gi|353900672|gb|EHE76223.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11663]
Length = 782
Score = 348 bits (893), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 246/774 (31%), Positives = 381/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
Y F RE F+S PD ++V + + + +L F + L D S +
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E N+D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 572
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 573 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 631
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 632 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 680
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 681 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 733
>gi|419504391|ref|ZP_14044059.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47760]
gi|379605779|gb|EHZ70529.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47760]
Length = 803
Score = 348 bits (893), Expect = 8e-93, Method: Compositional matrix adjust.
Identities = 247/785 (31%), Positives = 386/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDILVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|418190394|ref|ZP_12826903.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47373]
gi|353851653|gb|EHE31644.1| putative alpha-L-fucosidase [Streptococcus pneumoniae GA47373]
Length = 682
Score = 348 bits (892), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 236/698 (33%), Positives = 346/698 (49%), Gaps = 92/698 (13%)
Query: 94 LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY--SVGNVEFTRE 149
+F P D Y+LLG++ +E D A Y RELDL+TA + V + + N++ RE
Sbjct: 1 MFATPRDQSHYELLGELYIEHIDIQ-SCALSLYERELDLDTAISNVVFEPNSCNLQIKRE 59
Query: 150 HFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLDNHSYVNGNNQIIMEGRCPGKRIPPK 207
+F+S ++ +I S +L+ N++L + ++ ++ I+M G+
Sbjct: 60 YFTSFNKNILCCRIVSSVQNTLNLNINLGRNKRFNDEVSKLDSSTILMSASAGGR----- 114
Query: 208 ANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP 267
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 115 -------KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI--- 161
Query: 268 SDSKKDPTSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE 326
+S+LQ ++ Y H+ YQ+ F+RV +L S KD ++
Sbjct: 162 ----------DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYS-KDCLS----- 205
Query: 327 ENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTW 386
I T E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W
Sbjct: 206 --IPTNLLLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIW 259
Query: 387 DSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTD 446
S +NIN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD
Sbjct: 260 GSKYTININTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTD 319
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ + A+W + WLCTH+WEHY Y D L + + +++ F D+L
Sbjct: 320 GFGDTAPQSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYL 378
Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED 566
E DGYL T PS SPE+++ +G SST+D I+R + I A+ L N D
Sbjct: 379 FEV-DGYLMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSD 437
Query: 567 AL--VEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKN 624
+ V+++ K LPR TKI +G I EW +D+++ E HRH+S LFGL+P + I I K
Sbjct: 438 FISRVKELKKKLPR---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKT 494
Query: 625 PDLCKAAEKTLQKR-------------------------GEEGPGWSITWKTALWARLHD 659
P+L +AA+ T+ +R GWS W +ARL+
Sbjct: 495 PELAEAAKITINRRLSNANFLSSQDREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQ 554
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
E AY + L N NLF HPPFQID N G + + E+LVQS N
Sbjct: 555 GEPAYNQINGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHN 603
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
L L+PALP WS G VKG + RGG VS WK+GD+
Sbjct: 604 WLSLIPALP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 640
>gi|419425597|ref|ZP_13965793.1| hypothetical protein SPAR131_1527 [Streptococcus pneumoniae
7533-05]
gi|379619058|gb|EHZ83732.1| hypothetical protein SPAR131_1527 [Streptococcus pneumoniae
7533-05]
Length = 778
Score = 348 bits (892), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 249/789 (31%), Positives = 386/789 (48%), Gaps = 84/789 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEEGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A++++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D
Sbjct: 687 PFQIDGNFGATNGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDK 745
Query: 756 DLHEVGIYS 764
L ++ I S
Sbjct: 746 KLLQLTILS 754
>gi|418085647|ref|ZP_12722826.1| hypothetical protein SPAR90_1552 [Streptococcus pneumoniae GA47281]
gi|418149015|ref|ZP_12785777.1| hypothetical protein SPAR34_1507 [Streptococcus pneumoniae GA13856]
gi|421207087|ref|ZP_15664139.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2090008]
gi|353756356|gb|EHD36957.1| hypothetical protein SPAR90_1552 [Streptococcus pneumoniae GA47281]
gi|353811351|gb|EHD91593.1| hypothetical protein SPAR34_1507 [Streptococcus pneumoniae GA13856]
gi|395574423|gb|EJG35001.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2090008]
Length = 778
Score = 348 bits (892), Expect = 9e-93, Method: Compositional matrix adjust.
Identities = 247/785 (31%), Positives = 386/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|421236745|ref|ZP_15693342.1| fibronectin type III domain protein [Streptococcus pneumoniae
2071004]
gi|395601508|gb|EJG61655.1| fibronectin type III domain protein [Streptococcus pneumoniae
2071004]
Length = 803
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 248/789 (31%), Positives = 386/789 (48%), Gaps = 84/789 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P+ T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPVSTTYMGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYVFLAEIRQALEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ T Y+R+L+++ A Y F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 247 RVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L E N+D + + +K+++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDL-------------EANVDAFTTDDLLKNYKPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A V Y +GW++H + W D W
Sbjct: 413 FPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I A+ L +ED L E KS L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQVAQELGLDEDLLTEVKEKS-DLLNP 578
Query: 582 TKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A++++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D
Sbjct: 687 PFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDK 745
Query: 756 DLHEVGIYS 764
L ++ I S
Sbjct: 746 KLLQLTILS 754
>gi|148997704|ref|ZP_01825268.1| hypothetical protein CGSSp11BS70_02314 [Streptococcus pneumoniae
SP11-BS70]
gi|168491464|ref|ZP_02715607.1| alpha-fucosidase [Streptococcus pneumoniae CDC0288-04]
gi|168575158|ref|ZP_02721121.1| alpha-fucosidase [Streptococcus pneumoniae MLV-016]
gi|225861483|ref|YP_002742992.1| alpha-fucosidase [Streptococcus pneumoniae Taiwan19F-14]
gi|387788703|ref|YP_006253771.1| hypothetical protein MYY_1579 [Streptococcus pneumoniae ST556]
gi|417313133|ref|ZP_12099845.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA04375]
gi|418142169|ref|ZP_12778982.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13455]
gi|418151161|ref|ZP_12787907.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA14798]
gi|418164950|ref|ZP_12801618.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17371]
gi|418171792|ref|ZP_12808416.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19451]
gi|418194221|ref|ZP_12830710.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47439]
gi|418228162|ref|ZP_12854779.1| fibronectin type III domain protein [Streptococcus pneumoniae
3063-00]
gi|419429855|ref|ZP_13970019.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11856]
gi|419438693|ref|ZP_13978761.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13499]
gi|419449442|ref|ZP_13989438.1| fibronectin type III domain protein [Streptococcus pneumoniae
4075-00]
gi|419471542|ref|ZP_14011401.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA07914]
gi|419502307|ref|ZP_14041991.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47628]
gi|419506539|ref|ZP_14046200.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA49194]
gi|419519366|ref|ZP_14058972.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA08825]
gi|421238983|ref|ZP_15695547.1| fibronectin type III domain protein [Streptococcus pneumoniae
2071247]
gi|421245493|ref|ZP_15701991.1| fibronectin type III domain protein [Streptococcus pneumoniae
2081685]
gi|421292526|ref|ZP_15743260.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA56348]
gi|421312462|ref|ZP_15763064.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58981]
gi|421314530|ref|ZP_15765117.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA47562]
gi|147756203|gb|EDK63245.1| hypothetical protein CGSSp11BS70_02314 [Streptococcus pneumoniae
SP11-BS70]
gi|183574240|gb|EDT94768.1| alpha-fucosidase [Streptococcus pneumoniae CDC0288-04]
gi|183578740|gb|EDT99268.1| alpha-fucosidase [Streptococcus pneumoniae MLV-016]
gi|225727028|gb|ACO22879.1| alpha-fucosidase [Streptococcus pneumoniae Taiwan19F-14]
gi|327389841|gb|EGE88186.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA04375]
gi|353806420|gb|EHD86694.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13455]
gi|353814371|gb|EHD94597.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA14798]
gi|353828782|gb|EHE08918.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17371]
gi|353835529|gb|EHE15623.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19451]
gi|353857799|gb|EHE37761.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47439]
gi|353880557|gb|EHE60372.1| fibronectin type III domain protein [Streptococcus pneumoniae
3063-00]
gi|379138445|gb|AFC95236.1| hypothetical protein MYY_1579 [Streptococcus pneumoniae ST556]
gi|379537100|gb|EHZ02285.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13499]
gi|379546258|gb|EHZ11397.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA07914]
gi|379550033|gb|EHZ15135.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11856]
gi|379600520|gb|EHZ65301.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47628]
gi|379608453|gb|EHZ73199.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA49194]
gi|379622060|gb|EHZ86696.1| fibronectin type III domain protein [Streptococcus pneumoniae
4075-00]
gi|379641203|gb|EIA05741.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA08825]
gi|395600626|gb|EJG60781.1| fibronectin type III domain protein [Streptococcus pneumoniae
2071247]
gi|395608020|gb|EJG68116.1| fibronectin type III domain protein [Streptococcus pneumoniae
2081685]
gi|395891833|gb|EJH02827.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA56348]
gi|395909316|gb|EJH20192.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58981]
gi|395913215|gb|EJH24068.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA47562]
Length = 803
Score = 347 bits (891), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 247/785 (31%), Positives = 386/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|307068282|ref|YP_003877248.1| hypothetical protein SPAP_1662 [Streptococcus pneumoniae AP200]
gi|306409819|gb|ADM85246.1| hypothetical protein SPAP_1662 [Streptococcus pneumoniae AP200]
Length = 796
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 247/785 (31%), Positives = 386/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|167749996|ref|ZP_02422123.1| hypothetical protein EUBSIR_00964 [Eubacterium siraeum DSM 15702]
gi|167657017|gb|EDS01147.1| hypothetical protein EUBSIR_00964 [Eubacterium siraeum DSM 15702]
Length = 796
Score = 347 bits (890), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 244/785 (31%), Positives = 383/785 (48%), Gaps = 91/785 (11%)
Query: 14 KITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD--- 67
KI F P K PIGNG +GA +GG+ E + LNE TLW G P + + PD
Sbjct: 24 KILFTAPGEFDKWEQQCQPIGNGYMGASFFGGISKEKIVLNEKTLWAGGPSE-SRPDYNS 82
Query: 68 -----APKALSDVRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYA 120
+ + + V+ L+ G+Y EA A L G YQLL D+ L F + A
Sbjct: 83 GIIEGSYEYVKQVQQLLYDGKYDEAVALLPHLTGATDGFGAYQLLCDMMLTFSNIDETQA 142
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+ Y R LDL+ + +++ RE F++ P VI K+S + + +SLD+L
Sbjct: 143 TD-YTRTLDLDNSIFTTQFTYQGAVHKREAFANYPSNVICLKLSSDKPRRICVKLSLDNL 201
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
NG+ + EG G+++ I K+ + G + +D
Sbjct: 202 QCGSVTANGDT-LTYEGALW-------------DNGLRYCTIF--KVVNKGGELIDAKDS 245
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPS-DSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ VE +D + L AS+ + + P+ + +P++ +++ + + LY HL
Sbjct: 246 -IMVEHADEVYIYLTASTDYSNKY--PTFRTGVNPSAAVNQRIENAVSKGFDALYEEHLA 302
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE----LLFQ 355
DY+ LF RV+++++ DI+ P + + ++ + S+ L FQ
Sbjct: 303 DYKALFDRVTLKINEDTDDII------------PCDKLISEYKENGSRSIANRLETLYFQ 350
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRY+LISSSR G+ ANLQG+WNE P W H+N+NL+MNYW + NLSE PL
Sbjct: 351 FGRYMLISSSRAGSLPANLQGVWNESNCPPWCCDYHINVNLQMNYWGAYNTNLSETVPPL 410
Query: 416 FDFLTYLSINGSKTAQVNY--------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
DFL + +G K+A+ Y +GW H ++ + +A W
Sbjct: 411 VDFLDSMRPSGRKSAEAYYGIKSDEEHPENGWCAHTQSTPFG-WTAPGWDFYWGWSTAAV 469
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
AWL +++EH+ +T D+++ + YP++ F WLI + L ++P+ SPEH
Sbjct: 470 AWLMQNIYEHFEFTGDKEYFAEHIYPIMRESVRFYTQWLIYDDKQKRLVSSPTYSPEH-- 527
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
V+ +T + ++I ++++ I+A+E L +E+ L V + +L+P I++
Sbjct: 528 -------GPVTIGNTYEQSLIEQLYNDFITASEALGTDEE-LRNIVKNQVVQLKPFSISK 579
Query: 587 D-GSIMEWAQ------DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
G + EW + D + +HRH+SHL GL+PG I P+L AA TL RG
Sbjct: 580 KTGLLKEWFEEDDDNFDHSKTQKNHRHISHLLGLYPGKAIN-SNTPELMTAAINTLNDRG 638
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+E GW+ +K LWAR+ D AY +++ L G + NLF HPPFQ+
Sbjct: 639 DESTGWARAYKLNLWARVKDGNRAYSILQGL-----------LRGCTFDNLFDFHPPFQL 687
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG +A +AEML+QS + LLPA P D W +G GL AR G + W++ +
Sbjct: 688 DGNFGGSAGIAEMLIQSHEGYIELLPAAP-DAWRNGAFTGLCARHGFVIDAKWENFNPTA 746
Query: 760 VGIYS 764
V I S
Sbjct: 747 VTIKS 751
>gi|419521584|ref|ZP_14061179.1| hypothetical protein SPAR7_1605 [Streptococcus pneumoniae GA05245]
gi|379538884|gb|EHZ04064.1| hypothetical protein SPAR7_1605 [Streptococcus pneumoniae GA05245]
Length = 803
Score = 347 bits (890), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 247/785 (31%), Positives = 386/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HHRH SHL GL+ G+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAHHRHASHLVGLYSGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|417699038|ref|ZP_12348209.1| hypothetical protein SPAR69_1595 [Streptococcus pneumoniae GA41317]
gi|332199684|gb|EGJ13759.1| hypothetical protein SPAR69_1595 [Streptococcus pneumoniae GA41317]
Length = 757
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 247/785 (31%), Positives = 386/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 186 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 229
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 562 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 620
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 621 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 669
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 670 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 728
Query: 760 VGIYS 764
+ I S
Sbjct: 729 LTILS 733
>gi|383113206|ref|ZP_09933980.1| hypothetical protein BSGG_4923 [Bacteroides sp. D2]
gi|313697388|gb|EFS34223.1| hypothetical protein BSGG_4923 [Bacteroides sp. D2]
Length = 765
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 257/779 (32%), Positives = 388/779 (49%), Gaps = 126/779 (16%)
Query: 13 LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
+K+ + PA+++ T A+PIGNG LG + +GG+ E L+ NE TLWTG
Sbjct: 32 MKLWYTRPAQNWMTSALPIGNGELGGLFFGGIACERLQFNEKTLWTG------------- 78
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
S+ + YQ G++ ++F + + + + Y REL L+
Sbjct: 79 -SETKR----------------------GAYQSFGNLYIDFAEHNGEAVD--YCRELCLD 113
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISG-SESGSLSFNVSLDSLLDNHSYVNGN 190
A V Y + V++ RE+F+S PD+VIV +I+ G L+ +V L+ D+H
Sbjct: 114 NAIGSVSYEMNGVKYRREYFASYPDRVIVMRITTPGMKGRLNLSVRLE---DSHF----- 165
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQ-----FSAILEIKISDDRGTISALEDKKLKVE 245
+ + N + GIQ S ++K+ +++G +S + D +L V
Sbjct: 166 ---------------GQLSVNKNILGIQGQLDLLSYDAQVKVLNEKGQLSVV-DNRLTVC 209
Query: 246 GSDWAVLLLVASSSFDGPFINPSD----SKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
+D +LLVA ++F+ I+ +D S +D E + L + +Y+ L HL DY
Sbjct: 210 DADAVTILLVAGTNFN---ISATDYLGTSSEDLHKELYTRLSNASRKNYAALKNIHLKDY 266
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
Q LF RV + L + ++ P+ E V++ + E L L FQ+GRYL+
Sbjct: 267 QSLFSRVKLDL-------------QADMPEYPTDELVRNHK--ESRYLDMLYFQYGRYLM 311
Query: 362 ISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
+ SSR NLQGIWN D +P W+ H NIN++MNYW + NL EC P FL Y
Sbjct: 312 LGSSRGMNLPNNLQGIWNADNTPPWECDIHSNINIQMNYWPAEITNLPECHLP---FLQY 368
Query: 422 LSI------NGS--KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
+++ NGS + AQ L GW I + +I+ S W + AW CTH
Sbjct: 369 IAVEAVGKPNGSWRRIAQGEGL-RGWTIKTQNNIFGYSD-------WNINRPANAWYCTH 420
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DG 531
LW+HY Y D ++L A+P+++ + D L E DG L SPE P DG
Sbjct: 421 LWQHYAYNNDLEYLRNIAFPVMQSTCKYWFDRLKENKDGKLVAPDEWSPEQ---GPWEDG 477
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSI 590
V+Y+ + + E A+ + +V + ++ V ++ +L + G I
Sbjct: 478 ----VAYAQQLVWQLFNETLHAVEALKKVDIQIDNVFVSELADKFRKLDNGVSVGSWGQI 533
Query: 591 MEWAQDFKDPEVH---HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
EW +D + HRHLS L L+PG+ I+ ++ L AA+ TLQ RG+ G GWS
Sbjct: 534 KEWKEDKGKLDFQGNDHRHLSQLIALYPGNQISYHRDTLLADAAKVTLQSRGDMGTGWSR 593
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNL--VDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
WK A WARL D +HAYR++K +L + + +GG+Y NLF +HPPFQID NFG
Sbjct: 594 AWKIACWARLFDGDHAYRLLKSALSLSTLTVISMDNSKGGVYENLFDSHPPFQIDGNFGA 653
Query: 706 TAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
TA +AEML+QS ++LLPALP WS G V GL+ G T ++ W G L + + S
Sbjct: 654 TAGIAEMLLQSNQGFIHLLPALPL-AWSDGSVAGLRTEGDFTFTMKWNAGWLTQCSVLS 711
>gi|418160358|ref|ZP_12797057.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17227]
gi|353822091|gb|EHE02267.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17227]
Length = 809
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 248/785 (31%), Positives = 385/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW Q F++ +V HHRH SHL GL+ G+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAHHRHASHLVGLYSGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|418119103|ref|ZP_12756060.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA18523]
gi|419453708|ref|ZP_13993678.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP03]
gi|353791055|gb|EHD71436.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA18523]
gi|379625778|gb|EHZ90404.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP03]
Length = 782
Score = 346 bits (888), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 247/785 (31%), Positives = 386/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 186 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 229
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 562 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 620
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 621 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 669
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 670 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 728
Query: 760 VGIYS 764
+ I S
Sbjct: 729 LTILS 733
>gi|417687098|ref|ZP_12336372.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41301]
gi|332073988|gb|EGI84466.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41301]
Length = 782
Score = 346 bits (887), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 247/785 (31%), Positives = 386/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGRIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 126 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 186 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 229
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HHRH SHL GL+ G+ + K + +AA +L RG
Sbjct: 562 QSGRIREWYEEEEQYFQNEKVEAHHRHASHLVGLYSGNLFSY-KGQEYIEAARASLNDRG 620
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 621 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 669
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 670 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 728
Query: 760 VGIYS 764
+ I S
Sbjct: 729 LTILS 733
>gi|417679619|ref|ZP_12329015.1| hypothetical protein SPAR50_1655 [Streptococcus pneumoniae GA17570]
gi|332072484|gb|EGI82967.1| hypothetical protein SPAR50_1655 [Streptococcus pneumoniae GA17570]
Length = 778
Score = 346 bits (887), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 249/785 (31%), Positives = 385/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRYCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|417849512|ref|ZP_12495432.1| hypothetical protein HMPREF9957_1083 [Streptococcus mitis SK1080]
gi|339456106|gb|EGP68701.1| hypothetical protein HMPREF9957_1083 [Streptococcus mitis SK1080]
Length = 803
Score = 346 bits (887), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 249/790 (31%), Positives = 383/790 (48%), Gaps = 84/790 (10%)
Query: 10 TNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN 65
T P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 14 TKPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQG 70
Query: 66 ---PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLK 118
D L+++R ++ Y A + + P Y GDI +EF +
Sbjct: 71 GNLQDQYGFLAEIRQALEKRDYNTAKELAEQHLVGPQTSQYGTYLSFGDIFIEFSNQGKT 130
Query: 119 YAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL 177
++ T Y+R+L+++ A A Y +F RE F+S PD ++V + +L F + L
Sbjct: 131 LSQVTDYQRQLNISKALATTSYVYKGTKFEREAFASFPDDLLVQRFIKEGLETLDFTIEL 190
Query: 178 DSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
D S + C I K D+ +QF++ L + G
Sbjct: 191 SLTRDLASDGKYEQEKYDYKECQLNITASHILMKGRVKDND--LQFASYLTWQTD---GD 245
Query: 234 ISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDL 293
I DK +++ G+ +A L L A + F + K D + + + + + Y+ L
Sbjct: 246 IRVWSDK-IQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYAQL 304
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
+RH++DYQ LF V + L ++D + + +K+++ E +L EL
Sbjct: 305 KSRHIEDYQALFQSVQLDLG-------------SDVDASTTDDLLKNYKPQEGQALEELF 351
Query: 354 FQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
FQ+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNYW + NL E
Sbjct: 352 FQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLET 411
Query: 412 QEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWA 461
P+ +++ L + G + A Y +GW++H + W D W
Sbjct: 412 AFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWG 467
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPST 520
P AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS
Sbjct: 468 WSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSY 527
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPEH +S +T D ++I ++F I AA+ L +ED L E V + L
Sbjct: 528 SPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELSLDEDLLTE-VKEKFDLLN 577
Query: 581 PTKIAEDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKT 634
P +I + G I EW Q F++ +V HRH SHL GL+PG+ + K + +AA +
Sbjct: 578 PLQITQSGRIREWYEEEEQHFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYLEAARAS 636
Query: 635 LQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH 694
L RG+ G GWS K LWARL D A++++ + + NL+ +H
Sbjct: 637 LNDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKSSTLPNLWCSH 685
Query: 695 PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
PPFQID NFG ++ +AEML+QS L L ALP D WS G V GL ARG VS+ W+D
Sbjct: 686 PPFQIDGNFGASSGMAEMLLQSHAAYLVPLAALP-DAWSRGSVSGLMARGHFEVSMRWED 744
Query: 755 GDLHEVGIYS 764
L ++ I S
Sbjct: 745 KKLLQLTILS 754
>gi|148993776|ref|ZP_01823203.1| hypothetical protein CGSSp9BS68_02603 [Streptococcus pneumoniae
SP9-BS68]
gi|168488632|ref|ZP_02712831.1| alpha-fucosidase [Streptococcus pneumoniae SP195]
gi|418234852|ref|ZP_12861428.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA08780]
gi|421220741|ref|ZP_15677580.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070425]
gi|421222994|ref|ZP_15679776.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070531]
gi|421279430|ref|ZP_15730236.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17301]
gi|421294642|ref|ZP_15745363.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA56113]
gi|147927732|gb|EDK78756.1| hypothetical protein CGSSp9BS68_02603 [Streptococcus pneumoniae
SP9-BS68]
gi|183572723|gb|EDT93251.1| alpha-fucosidase [Streptococcus pneumoniae SP195]
gi|353886474|gb|EHE66256.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA08780]
gi|395586651|gb|EJG47018.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070425]
gi|395586974|gb|EJG47336.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070531]
gi|395878923|gb|EJG89985.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17301]
gi|395893211|gb|EJH04198.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA56113]
Length = 803
Score = 345 bits (886), Expect = 4e-92, Method: Compositional matrix adjust.
Identities = 249/785 (31%), Positives = 385/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRYCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|149006721|ref|ZP_01830407.1| hypothetical protein CGSSp18BS74_05667 [Streptococcus pneumoniae
SP18-BS74]
gi|147761636|gb|EDK68600.1| hypothetical protein CGSSp18BS74_05667 [Streptococcus pneumoniae
SP18-BS74]
Length = 803
Score = 345 bits (885), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 247/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|418096757|ref|ZP_12733868.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16531]
gi|353768478|gb|EHD49002.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16531]
Length = 782
Score = 345 bits (885), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 248/774 (32%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW--- 593
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 572
Query: 594 -AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
Q F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 573 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 631
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 632 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 680
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 681 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 733
>gi|225859410|ref|YP_002740920.1| alpha-fucosidase [Streptococcus pneumoniae 70585]
gi|225721936|gb|ACO17790.1| alpha-fucosidase [Streptococcus pneumoniae 70585]
Length = 803
Score = 345 bits (884), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 249/785 (31%), Positives = 385/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQSPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVCFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|307709595|ref|ZP_07646048.1| alpha-fucosidase [Streptococcus mitis SK564]
gi|307619631|gb|EFN98754.1| alpha-fucosidase [Streptococcus mitis SK564]
Length = 803
Score = 345 bits (884), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 250/789 (31%), Positives = 380/789 (48%), Gaps = 84/789 (10%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLSNSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADVYQL---LGDIELEFDDSHLKY 119
D ++++R ++ Y A A L G Y GDI +EF
Sbjct: 72 NLQDQYAFIAEIRQDLEKRDYNRAKELAEQHLVGSKTSQYGTYLSFGDIHIEFSKQGKTL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
++ Y+R+L+++ A A Y F RE F+S PD ++V + + +L F + L
Sbjct: 132 SQVMDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLLVQRFTKEGLETLDFTIELS 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
D S + C I K D+ ++F++ L + G I
Sbjct: 192 LTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDI 246
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
DK +++ G+ +A L L A + F + K D + +++ + Y+ L
Sbjct: 247 RVWSDK-VQISGASYANLFLAAKTDFAQNPASNYRKKIDLEQQVKDLVETAKEKGYAQLK 305
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
+RH++DYQ LF RV + L +D + + +K++ E +L EL F
Sbjct: 306 SRHIEDYQALFQRVQLDLG-------------AEVDASTTDDLLKNYNPQEGQALEELFF 352
Query: 355 QFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNYW + NL E
Sbjct: 353 QYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLEAV 412
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWAL 462
P+ +++ L + G + A Y +GW++H + W D W
Sbjct: 413 FPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWGW 468
Query: 463 WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTS 521
P AW+ ++E Y++ D+D+L ++ YP+L F +L E ++PS S
Sbjct: 469 SPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHEDRQAQRWVSSPSYS 528
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH +S +T D ++I ++F I AA+ L +E L E V + L P
Sbjct: 529 PEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDESLLTE-VKEKFDLLNP 578
Query: 582 TKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
+I + G I EW ++ F++ +V HRH SHL GL+PG+ + K D +AA +L
Sbjct: 579 LQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQDYLEAARASL 637
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D AY+++ + + NL+ +HP
Sbjct: 638 NDRGDGGTGWSKANKINLWARLGDGNRAYKLLA-----------EQLKSSTLPNLWCSHP 686
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D
Sbjct: 687 PFQIDGNFGATSGMAEMLLQSHTAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMRWEDK 745
Query: 756 DLHEVGIYS 764
L ++ I S
Sbjct: 746 KLLQMTILS 754
>gi|418976823|ref|ZP_13524668.1| hypothetical protein HMPREF1048_1234 [Streptococcus mitis SK575]
gi|383350822|gb|EID28673.1| hypothetical protein HMPREF1048_1234 [Streptococcus mitis SK575]
Length = 803
Score = 345 bits (884), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 254/817 (31%), Positives = 393/817 (48%), Gaps = 111/817 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LG ++G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGVKIFGLIGAERIQFNEKSLWSGGPQPDSSDYQGGNLQDQYSFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF + ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNTAKELAEQHLVGPKTSQYGTYLSFGDIHIEFSNQGKTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y +F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTKFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLSRDLSSDGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND +QF++ L + G I
Sbjct: 207 SDYKECQLDISDSYILMKGRV---------KDND----LQFASCLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKK---DPTSESMSALQSIRNLSYSDLYT 295
DK +++ G+ +A L L A + F NP+ + + D + +++ + Y L +
Sbjct: 251 DK-VQISGASYANLFLAAKTDFAQ---NPASNYRKELDLERQVKDLVETAKEKGYDQLKS 306
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
RH+ DYQ LF RV + L +D + + +K+++ E +L EL FQ
Sbjct: 307 RHIQDYQALFQRVQLDLG-------------AEVDASNTDDLLKNYKPQEGQALEELFFQ 353
Query: 356 FGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNYW + NL E
Sbjct: 354 YGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAF 413
Query: 414 PLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALW 463
P+ +++ L + G + A Y +GW++H + W D W
Sbjct: 414 PVINYIDDLRVYG-RLAAARYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWS 469
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYL-ETNPSTSP 522
P AW+ ++E Y + D+D+L ++ YP+L F D+L E ++PS SP
Sbjct: 470 PAANAWMMQTVYEGYTFYRDKDYLREKIYPMLRETVRFWNDFLHEDRQAQRWVSSPSYSP 529
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT 582
EH +S +T D ++I ++F I AA+ L +E L E V + L P
Sbjct: 530 EH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDESLLTE-VKEKFDLLNPL 579
Query: 583 KIAEDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQ 636
+I + G I EW Q F++ +V HRH SHL GL+PG T+ K + +AA +L
Sbjct: 580 QITQSGRIREWYEEEEQHFQNEKVEAQHRHASHLVGLYPG-TLFSYKGKEYLEAARASLN 638
Query: 637 KRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP 696
RG+ G GWS K LWARL D A++++ + + NL+ +HPP
Sbjct: 639 DRGDGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKSSTLPNLWCSHPP 687
Query: 697 FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
FQID NFG T+ +AEML+QS L L ALP D WS G V GL ARG VS+ W+D
Sbjct: 688 FQIDGNFGATSGMAEMLLQSHTAYLVPLAALP-DAWSRGSVSGLIARGHFEVSMRWEDKK 746
Query: 757 LHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKI 793
L ++ I S + S+ + + V+VN K+
Sbjct: 747 LLQLTILSRSGGDLRVSYPGIE--NSVVEVNQEKAKV 781
>gi|417696816|ref|ZP_12345994.1| hypothetical protein SPAR93_1691 [Streptococcus pneumoniae GA47368]
gi|418176443|ref|ZP_12813034.1| hypothetical protein SPAR71_1678 [Streptococcus pneumoniae GA41437]
gi|421232359|ref|ZP_15689000.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2080076]
gi|332200214|gb|EGJ14287.1| hypothetical protein SPAR93_1691 [Streptococcus pneumoniae GA47368]
gi|353840514|gb|EHE20578.1| hypothetical protein SPAR71_1678 [Streptococcus pneumoniae GA41437]
gi|395594862|gb|EJG55097.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2080076]
Length = 778
Score = 345 bits (884), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 246/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|291557898|emb|CBL35015.1| hypothetical protein ES1_21610 [Eubacterium siraeum V10Sc8a]
Length = 796
Score = 344 bits (883), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 243/787 (30%), Positives = 382/787 (48%), Gaps = 95/787 (12%)
Query: 14 KITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD--- 67
KI F P K PIGNG +GA +GG+ E + LNE TLW G P + + PD
Sbjct: 24 KILFTAPGEFDKWEQQCQPIGNGYMGASFFGGISKEKIVLNEKTLWAGGPSE-SRPDYNS 82
Query: 68 -----APKALSDVRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYA 120
+ + + V+ L+ G+Y EA A L G YQLL D+ L F + A
Sbjct: 83 GIIEGSYEYVKQVQQLLYDGKYDEAVALLPHLTGATDGFGAYQLLCDMMLTFSNIDETQA 142
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+ Y R LDL+ + +++ RE F++ P VI K+S + + +SLD+L
Sbjct: 143 TD-YTRTLDLDNSIFTTQFTYQGAVHKREAFANYPSNVICLKLSSDKPRRICVKLSLDNL 201
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
NG+ + EG G+++ I K+ + G + +D
Sbjct: 202 QCGSVTANGDT-LTYEGALW-------------DNGLRYCTIF--KVVNKGGELIDAKDS 245
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPS-DSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ VE +D + L AS+ + + P+ + +P++ +++ + + LY HL
Sbjct: 246 -IMVEHADEVYIYLTASTDYSNKY--PTFRTGVNPSAAVNQRIENAVSKGFDALYEEHLA 302
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE----LLFQ 355
DY+ LF RV+++++ DI+ P + + ++ + S+ L FQ
Sbjct: 303 DYKALFDRVTLKINEDTDDII------------PCDKLISEYKENGSRSIANRLETLYFQ 350
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRY+LISSSR G+ ANLQG+WNE P W H+N+NL+MNYW + NLSE PL
Sbjct: 351 FGRYMLISSSRAGSLPANLQGVWNESNCPPWCCDYHINVNLQMNYWGAYNTNLSETVPPL 410
Query: 416 FDFLTYLSINGSKTAQVNY--------LASGWVIHHKTDI--WAKSSADRGKVVWALWPM 465
DFL + +G K+A+ Y +GW H ++ W D W
Sbjct: 411 VDFLDSMRPSGRKSAEAYYGIKSDEEHPENGWCAHTQSTPFGWTAPGWD---FYWGWSTA 467
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEH 524
AWL +++E++ +T D+++ + YP++ F WLI + L ++P+ SPEH
Sbjct: 468 AVAWLMQNIYEYFEFTGDKEYFAEHIYPIMRESVRFYTQWLIYDDKQKRLVSSPTYSPEH 527
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
V+ +T + ++I ++++ I+A+E L +E+ L V + +L+P +
Sbjct: 528 ---------GPVTIGNTYEQSLIEQLYNDFITASEALGTDEE-LRNIVKNQVVQLKPYSV 577
Query: 585 AED-GSIMEWAQ------DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQK 637
++ G + EW + D + +HRH+SHL GL+PG I P+L AA TL
Sbjct: 578 SKKTGLLKEWFEEDDDNFDHSKTQKNHRHISHLLGLYPGKAIN-SNTPELMTAAINTLND 636
Query: 638 RGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
RG+E GW+ +K LWAR+ D AY +++ L G + NLF HPPF
Sbjct: 637 RGDESTGWARAYKLNLWARVKDGNRAYSILQGL-----------LRGCTFDNLFDFHPPF 685
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
Q+D NFG +A +AEML+QS + LLPA P D W +G GL AR G + W++ +
Sbjct: 686 QLDGNFGGSAGIAEMLIQSHEGYIELLPAAP-DAWRNGAFTGLCARHGFVIDAKWENFNP 744
Query: 758 HEVGIYS 764
V I S
Sbjct: 745 TAVTIKS 751
>gi|168483476|ref|ZP_02708428.1| alpha-fucosidase [Streptococcus pneumoniae CDC1873-00]
gi|418169754|ref|ZP_12806395.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19077]
gi|418219383|ref|ZP_12846048.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP127]
gi|418221685|ref|ZP_12848338.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47751]
gi|418239181|ref|ZP_12865732.1| fibronectin type III domain protein [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|419460465|ref|ZP_14000393.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02270]
gi|419462818|ref|ZP_14002721.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02714]
gi|419489320|ref|ZP_14029069.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44386]
gi|419526372|ref|ZP_14065930.1| hypothetical protein SPAR35_1634 [Streptococcus pneumoniae GA14373]
gi|421273311|ref|ZP_15724151.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR55]
gi|172043064|gb|EDT51110.1| alpha-fucosidase [Streptococcus pneumoniae CDC1873-00]
gi|353833733|gb|EHE13841.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19077]
gi|353873743|gb|EHE53602.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP127]
gi|353874995|gb|EHE54849.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47751]
gi|353892172|gb|EHE71921.1| fibronectin type III domain protein [Streptococcus pneumoniae
NorthCarolina6A-23]
gi|379530250|gb|EHY95490.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02714]
gi|379530601|gb|EHY95840.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02270]
gi|379557012|gb|EHZ22059.1| hypothetical protein SPAR35_1634 [Streptococcus pneumoniae GA14373]
gi|379586862|gb|EHZ51712.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44386]
gi|395873742|gb|EJG84832.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR55]
Length = 803
Score = 344 bits (883), Expect = 9e-92, Method: Compositional matrix adjust.
Identities = 246/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|291530512|emb|CBK96097.1| hypothetical protein EUS_08620 [Eubacterium siraeum 70/3]
Length = 776
Score = 344 bits (883), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 242/785 (30%), Positives = 382/785 (48%), Gaps = 91/785 (11%)
Query: 14 KITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPK 70
KI F P K PIGNG +GA +GG+ E + LNE TLW G P + + PD
Sbjct: 4 KILFTAPGEFDKWEQQCQPIGNGYMGASFFGGISKEKIVLNEKTLWAGGPSE-SRPDYNG 62
Query: 71 ALSD--------VRSLVDSGQYAEATAASVKLFGHP--ADVYQLLGDIELEFDDSHLKYA 120
+ D V+ L+ G+Y EA A L G YQLL D+ L F + A
Sbjct: 63 GIIDGSYEYVKQVQQLLYDGKYDEAVALLPHLTGATDGYGAYQLLCDMMLTFSNIDETQA 122
Query: 121 EETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSL 180
+ Y R LDL+ + +++ RE F++ P VI K+S + + +SLD+L
Sbjct: 123 TD-YTRTLDLDNSIFTTQFTYQGAVHKREAFANYPSNVICIKLSSDKPRRICVKLSLDNL 181
Query: 181 LDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
NG+ + EG G+++ + K+ + G + +D
Sbjct: 182 QCGSVTANGDT-LTYEGALW-------------DNGLRYCTVF--KVVNKGGELIDAKDS 225
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPS-DSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+ VE +D + L AS+ + + P+ + +P++ +++ + ++ LY HL
Sbjct: 226 -IMVEHADEVYIYLTASTDYSNKY--PTFRTGVNPSAAVNQRIENAVSKGFNALYEEHLA 282
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE----LLFQ 355
DY+ LF V+++++ DI+ P + ++ ++ + S+ L FQ
Sbjct: 283 DYKALFDSVTLKINEDTDDII------------PCDKLIREYKENGSRSIANRLETLYFQ 330
Query: 356 FGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPL 415
FGRY+LISSSR G+ ANLQG+WNE P W H+N+NL+MNYW + NLSE PL
Sbjct: 331 FGRYMLISSSRAGSLPANLQGVWNESNCPPWCCDYHINVNLQMNYWGAYNTNLSETVPPL 390
Query: 416 FDFLTYLSINGSKTAQVNY--------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGG 467
DFL + +G K+A+ Y +GW H ++ + +A W
Sbjct: 391 VDFLDSMRPSGRKSAEAYYGIKSDEEHPENGWCAHTQSTPFG-WTAPGWNFYWGWSTAAV 449
Query: 468 AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEF 526
AWL +++E++ +T D+ + + YP++ F WLI + L ++P+ SPEH
Sbjct: 450 AWLMQNIYEYFEFTGDKKYFAEHIYPIMRESVRFYTQWLIYDDKQKRLVSSPTYSPEH-- 507
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
V+ +T + ++I ++++ I+A+E L +E+ L V + +L+P +++
Sbjct: 508 -------GPVTIGNTYEQSLIEQLYNDFITASEALGTDEE-LRNIVKNQVVQLKPFSVSK 559
Query: 587 D-GSIMEWAQ------DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
G + EW + D + +HRH+SHL GL+PG I P+L AA TL RG
Sbjct: 560 KTGLLKEWFEEDDDNFDHSKTQKNHRHISHLLGLYPGKAIN-SHTPELMTAAINTLNDRG 618
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+E GWS +K LWAR+ D AY +++ L G + NLF HPPFQ+
Sbjct: 619 DESTGWSRAYKLNLWARVKDGNRAYSILQGL-----------LRGCTFDNLFDFHPPFQL 667
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG +A +AEML+QS + LLPA P D W +G GL AR G + W++ +
Sbjct: 668 DGNFGGSAGIAEMLIQSHEGYIELLPAAP-DAWRNGAFTGLCARHGFVIDAKWENFNPTA 726
Query: 760 VGIYS 764
V I S
Sbjct: 727 VTIKS 731
>gi|418162701|ref|ZP_12799382.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17328]
gi|353826763|gb|EHE06920.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17328]
Length = 782
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 247/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW--- 593
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 572
Query: 594 -AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
Q F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 573 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 631
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 632 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 680
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 681 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 733
>gi|419491555|ref|ZP_14031293.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47179]
gi|379592917|gb|EHZ57732.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47179]
Length = 803
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 246/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGNGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D AY+++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAYKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|149011485|ref|ZP_01832732.1| hypothetical protein CGSSp19BS75_08742 [Streptococcus pneumoniae
SP19-BS75]
gi|147764475|gb|EDK71406.1| hypothetical protein CGSSp19BS75_08742 [Streptococcus pneumoniae
SP19-BS75]
Length = 803
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 246/785 (31%), Positives = 385/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+ G+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYSGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|289167478|ref|YP_003445747.1| hypothetical protein smi_0630 [Streptococcus mitis B6]
gi|288907045|emb|CBJ21879.1| conserved hypothetical protein [Streptococcus mitis B6]
Length = 803
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 255/827 (30%), Positives = 395/827 (47%), Gaps = 104/827 (12%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEF-DDSHLK 118
D L+D+R ++ Y + + P Y GDI +EF +
Sbjct: 72 NLQDQHNFLTDIRQALEKRDYNRTKELAEQHLVGPKTSQYGTYLSFGDIHIEFSNQGKTL 131
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y Y+R+L+++ A A Y +F RE F+S PD ++V + + +L F + L
Sbjct: 132 YQVTDYQRQLNISKALATASYVYKGTKFERETFASFPDDLLVQRYTKEGLETLDFTIELS 191
Query: 179 ---SLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M+GR ND +QF++ L
Sbjct: 192 LTHDLASDGKYEQEKSDYKECQLDISDSYILMKGRV---------KDND----LQFTSCL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ D S K+++ G+ +A L L A + F + K D + ++
Sbjct: 239 AWETDGDIRVWS----NKVQISGASYANLFLAAKTDFAQNPASNYRKKIDLEKQVKDLVE 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ Y+ L +RH+ DYQ LF RV + L ++DT + + +K+++
Sbjct: 295 IAKEKGYAQLKSRHIQDYQALFQRVQLDLG-------------ADVDTSTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
E L EL FQ+GRYLLISSSR P ANLQGIWN +P W+S H+NINL+MNYW
Sbjct: 342 QEGQVLEELFFQYGRYLLISSSRDCPDALPANLQGIWNAVDNPPWNSDYHLNINLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDIWAKSSA 453
+ NL E P+ +++ L + G + A Y +GW++H + + +A
Sbjct: 402 PAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQEGEENGWLVHTQATPFG-WTA 459
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
W P AWL ++E Y++ D+D+L ++ YP+L F D+L E
Sbjct: 460 PGWNYYWGWSPAANAWLMQTVYEAYSFYSDQDYLREKIYPMLRETVYFWNDFLHEDRQAQ 519
Query: 514 L-ETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
++PS SPEH +S +T D ++I ++F I AA+ L + D L E V
Sbjct: 520 RWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDGDLLTE-V 569
Query: 573 LKSLPRLRPTKIAEDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPD 626
+ L P ++ + G I EW Q F++ +V HRH SHL GL+PG+ + K +
Sbjct: 570 KEKFDLLNPLQLTQSGRIREWYEEEEQHFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQE 628
Query: 627 LCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGL 686
+AA +L RG+ G GWS K LWARL D AY+++ + +
Sbjct: 629 YLEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAYKLLA-----------EQLKTST 677
Query: 687 YSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGE 746
NL+ +HPPFQID NFG ++ +AEML+QS L L ALP D S+G V GL ARG
Sbjct: 678 LPNLWCSHPPFQIDGNFGASSGMAEMLLQSHTAYLVPLAALP-DACSTGSVSGLMARGHF 736
Query: 747 TVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKI 793
+S+ W+D L ++ I S + S+ + + ++VN K+
Sbjct: 737 ELSMRWEDEKLLQLTILSRSGGDLRISYPGIE--KSVIEVNQEKAKV 781
>gi|331082986|ref|ZP_08332105.1| hypothetical protein HMPREF0992_01029 [Lachnospiraceae bacterium
6_1_63FAA]
gi|330399723|gb|EGG79384.1| hypothetical protein HMPREF0992_01029 [Lachnospiraceae bacterium
6_1_63FAA]
Length = 1760
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 241/768 (31%), Positives = 376/768 (48%), Gaps = 74/768 (9%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS- 77
+PIGN +GA V+G + E L N+ TLW G P G+ D + +SDV
Sbjct: 75 LPIGNSFMGANVYGEIGQERLTFNQKTLWNGGPSENRPDYDGGNKETADNGQKMSDVYKE 134
Query: 78 ---LVDSGQYAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLN 131
L G A+A + KL G + YQ GDI ++F LK + E Y R+L+L
Sbjct: 135 IIELYKEGNDAQANELAKKLTGEVNGYGAYQSWGDIYVDFG---LKEEQAENYVRDLNLE 191
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A A V + + + RE+F S PD V+ K + S L F++S +DN V
Sbjct: 192 NAVASVDFDYQDTKMHREYFISYPDNVLAMKFTAEGSEKLDFDISFP--IDNAEGVADKK 249
Query: 192 -QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWA 250
+E I D+ Q ++K+ + G + + KL V G+ A
Sbjct: 250 LGKSVETTVEDDTITVSGEMQDN----QLQLNGKLKVETEGGKVQEKDGDKLHVSGASEA 305
Query: 251 VLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
V+ + A + + P ++ ++ + A+ Y + H+ DY ++F RV
Sbjct: 306 VVYVSADTDYLNKYPDYRTGETAQELDASVERAVDKASKKGYEKVKKEHIKDYSEIFSRV 365
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L ++ D TD + + + ++ E+ +L +LFQ+GRYL I+SSR G
Sbjct: 366 QLDLGQNVPDKTTDIL----LKDYNAGKNTEA----ENRALEVILFQYGRYLTIASSRAG 417
Query: 369 TQVANLQGIWNEDLSP----TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
+NLQG+W + W S H+N+NL+MNYW + N++EC PL D++ L
Sbjct: 418 DLPSNLQGVWQNRVGDHNRIPWASDYHMNVNLQMNYWPTYSTNMAECATPLIDYINSLVE 477
Query: 425 NGSKTAQVNY-LASGWVIHHKTDI---WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
G TA+ + + +G H + W D W P W+ + WE+Y Y
Sbjct: 478 PGKVTAKTYFGVENGGFTAHTQNTPFGWTCPGWD---FSWGWSPAALPWILQNCWEYYEY 534
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKLACVSYS 539
T D ++E+ YP+L+ A LIE G L + P+ SPEH V+
Sbjct: 535 TGDVKYMEEHIYPMLKEAALLYDQILIEDEKTGRLVSAPAYSPEH---------GPVTAG 585
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF-- 597
+T + ++I +++ +AAE+L K+E+ E + +L+P +I E G I EW +
Sbjct: 586 NTYEQSLIWQLYEDAATAAEILSKDEEKAKEWRQRQ-QKLKPIEIGESGQIKEWYTETTL 644
Query: 598 -KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
E HRH+SHL GLFPG I+++ N + AA +L++RGE+ GW + + WAR
Sbjct: 645 GSMGEKGHRHMSHLLGLFPGDLISVD-NAEYMDAAIVSLKERGEKSTGWGMGQRINAWAR 703
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
D A+++++ LF H+ G+Y NL+ H PFQID NFG T+ V+EML+QS
Sbjct: 704 TGDGNQAHKLIQNLF------HD-----GIYPNLWDTHTPFQIDGNFGMTSGVSEMLMQS 752
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
+ + +LP+LP D W++G VKGL ARG VS+ W D +L E + S
Sbjct: 753 NMGYINMLPSLP-DVWANGSVKGLVARGNFEVSMKWADKNLTEATLLS 799
>gi|421211527|ref|ZP_15668509.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070035]
gi|395572635|gb|EJG33230.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070035]
Length = 803
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDAFTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|419495837|ref|ZP_14035554.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47461]
gi|421302877|ref|ZP_15753541.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA17484]
gi|379593923|gb|EHZ58734.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47461]
gi|395901499|gb|EJH12435.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA17484]
Length = 803
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 248/785 (31%), Positives = 385/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + + + +AA +L R
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-RGQEYIEAARASLNDRE 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+A+AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSAMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|418167255|ref|ZP_12803910.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17971]
gi|353829247|gb|EHE09381.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA17971]
Length = 803
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 246/785 (31%), Positives = 385/785 (49%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVTTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+ G+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYSGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKDNKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|418076872|ref|ZP_12714105.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47502]
gi|353747012|gb|EHD27670.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47502]
Length = 803
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y +F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTQFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +A +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAVRASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGFVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|194398489|ref|YP_002038269.1| hypothetical protein SPG_1564 [Streptococcus pneumoniae G54]
gi|418121711|ref|ZP_12758654.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44194]
gi|419532855|ref|ZP_14072370.1| hypothetical protein SPAR107_1539 [Streptococcus pneumoniae
GA47794]
gi|421275369|ref|ZP_15726198.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA52612]
gi|194358156|gb|ACF56604.1| conserved hypothetical protein [Streptococcus pneumoniae G54]
gi|353792547|gb|EHD72919.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44194]
gi|379605375|gb|EHZ70126.1| hypothetical protein SPAR107_1539 [Streptococcus pneumoniae
GA47794]
gi|395873333|gb|EJG84425.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA52612]
Length = 803
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|419480494|ref|ZP_14020298.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19101]
gi|419500201|ref|ZP_14039895.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47597]
gi|379569663|gb|EHZ34630.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19101]
gi|379599509|gb|EHZ64292.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47597]
gi|429316503|emb|CCP36209.1| conserved hypothetical protein [Streptococcus pneumoniae SPN034156]
Length = 803
Score = 343 bits (879), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 246/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +E+ L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDENLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLAEQLKI-----------STLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|419475985|ref|ZP_14015821.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA14688]
gi|379558767|gb|EHZ23799.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA14688]
Length = 778
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|418092259|ref|ZP_12729399.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44452]
gi|353762959|gb|EHD43516.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44452]
Length = 803
Score = 342 bits (878), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|418130799|ref|ZP_12767682.1| hypothetical protein SPAR14_1598 [Streptococcus pneumoniae GA07643]
gi|418187633|ref|ZP_12824156.1| hypothetical protein SPAR92_1607 [Streptococcus pneumoniae GA47360]
gi|353802123|gb|EHD82423.1| hypothetical protein SPAR14_1598 [Streptococcus pneumoniae GA07643]
gi|353849618|gb|EHE29623.1| hypothetical protein SPAR92_1607 [Streptococcus pneumoniae GA47360]
Length = 778
Score = 342 bits (878), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF+ ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFNQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDYP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL++NYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQLNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARAGLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLAEQLKI-----------STLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|149021254|ref|ZP_01835500.1| hypothetical protein CGSSp23BS72_00470 [Streptococcus pneumoniae
SP23-BS72]
gi|147930355|gb|EDK81339.1| hypothetical protein CGSSp23BS72_00470 [Streptococcus pneumoniae
SP23-BS72]
Length = 803
Score = 342 bits (878), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 381/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A ++F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTNFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|225855085|ref|YP_002736597.1| alpha-fucosidase [Streptococcus pneumoniae JJA]
gi|225723201|gb|ACO19054.1| alpha-fucosidase [Streptococcus pneumoniae JJA]
Length = 803
Score = 342 bits (878), Expect = 4e-91, Method: Compositional matrix adjust.
Identities = 252/800 (31%), Positives = 384/800 (48%), Gaps = 106/800 (13%)
Query: 11 NPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN- 65
P T+ G + +A+PIGNG LGA V+G + +E ++ NE +LW+G P DY
Sbjct: 15 QPASTTYKGWEE---EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGG 71
Query: 66 --PDAPKALSDVRSLVDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKY 119
D L+++R ++ Y A + + P Y GDI +EF
Sbjct: 72 NLQDQYGFLAEIRQALEKRDYNTAKELAEEHLVGPKTSQYGTYLSFGDIFIEFSQQGTIL 131
Query: 120 AEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL- 177
++ T Y+R+L+++ A A Y+ F RE F+S PD ++V + + + +L F + L
Sbjct: 132 SQVTDYQRQLNISKALATTSYAYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIKLF 191
Query: 178 --DSLLDNHSYVN------------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL 223
L + Y ++ I+M GR ND ++F+ L
Sbjct: 192 LTRDLASDGKYDQEKSDYKECQLDITDSHILMNGRV---------KDND----LRFAGCL 238
Query: 224 EIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQ 283
+ G I DK +++ G+ +A L L A + F + K D + ++
Sbjct: 239 AWQTD---GDIRVWSDK-VQISGASYANLFLAAKTDFAQNPDSNYRKKIDLEKQVKDLVE 294
Query: 284 SIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT 343
+ Y+ L +RH+ DYQ LF RV + L E ++DT + + +K+++
Sbjct: 295 IAKEKGYAQLKSRHIQDYQALFQRVQLDL-------------EADVDTFTTDDLLKNYKP 341
Query: 344 DEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYW 401
+L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+NINL+MNYW
Sbjct: 342 QAGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNINLQMNYW 401
Query: 402 QSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKS 451
+ NL E P+ +++ L + G + A Y +GW++H + W
Sbjct: 402 PAYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSREGEENGWLVHTQATPFGWTAP 460
Query: 452 SADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD 511
D W P AW+ ++E Y++ D+D+L ++ YP+L F +L E
Sbjct: 461 GWD---YYWGWSPATNAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWTGFLHEDQQ 517
Query: 512 GY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
++PS SPEH +S +T D ++I ++F I A + L + D L E
Sbjct: 518 AQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFYDFIQATQELGLDGDLLTE 568
Query: 571 KVLKSLPRLRPTKIAEDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKN 624
V + L P +I + G I EW Q F++ +V HRH+SHL GL+PG T+ K
Sbjct: 569 -VKEKFDLLNPLQITQSGRIREWYEEEEQHFQNEKVEAQHRHVSHLVGLYPG-TLFSYKG 626
Query: 625 PDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEG 684
+ AA +L RG+ G GWS K LWARL D A++++ L
Sbjct: 627 QEYLDAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHKLLAEQLKL----------- 675
Query: 685 GLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARG 744
NL+ +HPPFQID NFG T+ +AEML+QS L L ALP D WS+G V GL ARG
Sbjct: 676 STLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHTAYLVPLAALP-DAWSTGSVSGLMARG 734
Query: 745 GETVSICWKDGDLHEVGIYS 764
VS+ W++ L ++ I S
Sbjct: 735 HFEVSMRWEEKKLLQMTILS 754
>gi|419487130|ref|ZP_14026892.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44128]
gi|421209422|ref|ZP_15666435.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070005]
gi|379585499|gb|EHZ50355.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44128]
gi|395573518|gb|EJG34108.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070005]
Length = 803
Score = 342 bits (877), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|418230426|ref|ZP_12857025.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP01]
gi|419478291|ref|ZP_14018115.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA18068]
gi|421271063|ref|ZP_15721917.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR48]
gi|353885307|gb|EHE65096.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP01]
gi|379565727|gb|EHZ30719.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA18068]
gi|395867277|gb|EJG78401.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR48]
Length = 803
Score = 342 bits (877), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF+ ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFNQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDYP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL++NYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQLNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARAGLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLAEQLKI-----------STLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|418146897|ref|ZP_12783675.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13637]
gi|353812472|gb|EHD92707.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13637]
Length = 782
Score = 342 bits (877), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 246/774 (31%), Positives = 378/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDYP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 514 SIGNTYDQSLIWQLFYDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 572
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA L RG+ G GWS K
Sbjct: 573 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARAGLNDRGDGGTGWSKANK 631
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + NL+ +HPPFQID NFG T+ +A
Sbjct: 632 INLWARLGDGNRAHKLLAEQLKI-----------STLPNLWCSHPPFQIDGNFGATSGMA 680
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 681 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 733
>gi|418103344|ref|ZP_12740416.1| hypothetical protein SPAR143_1623 [Streptococcus pneumoniae NP070]
gi|421225485|ref|ZP_15682223.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070768]
gi|353774645|gb|EHD55132.1| hypothetical protein SPAR143_1623 [Streptococcus pneumoniae NP070]
gi|395588972|gb|EJG49294.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070768]
Length = 757
Score = 342 bits (877), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 572
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 573 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 631
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 632 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 680
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 681 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 733
>gi|169624315|ref|XP_001805563.1| hypothetical protein SNOG_15415 [Phaeosphaeria nodorum SN15]
gi|160705148|gb|EAT77080.2| hypothetical protein SNOG_15415 [Phaeosphaeria nodorum SN15]
Length = 792
Score = 342 bits (877), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 252/820 (30%), Positives = 396/820 (48%), Gaps = 102/820 (12%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSD 74
+ FN P + ++PIGNGR+ A +G E + +NE+++W+G D N + ALS
Sbjct: 26 LYFNTPGSSLSSSLPIGNGRVAAAAYG-TTLERITINENSVWSGQWQDRGNSQSLNALSS 84
Query: 75 VRSLVDSGQYAEATAASV-KLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+R + G + A ++ + G+P Q +++ D H +Y R LD
Sbjct: 85 IRQKLMDGDMSSAGQQTLDAMAGNPQSPKQYHPTVDMTIDFGH-SGTLGSYTRILDTRQG 143
Query: 134 TARVKYSVGNVEFT-----------REHFSSNPDQVIVTKISGSESGSLSFNVSL---DS 179
TA Y +G V +T RE+ +S P V+ ++ +++G L+ +++L +
Sbjct: 144 TAMTTYILGGVNYTLMGAAHLTSFRREYVASYPAGVLAFRMMANQAGKLNVDIALARSQN 203
Query: 180 LLDNHSYVNGN-NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ N + +GN N I ++G GI F+A E ++ D G+IS +
Sbjct: 204 VASNAASSSGNINSITLKGNG----------------GIPFTA--EARVVSDTGSIS-VN 244
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+K + V+G+ + A +S+ S E + L + Y+ + T +
Sbjct: 245 EKTMSVKGATIVDIFFDAETSYR------YGSASAWELELKNKLDNAVKAGYNAVKTAAV 298
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQF 356
D + + RV+I L S + T P R+ +++ + DP LV L F +
Sbjct: 299 KDAEGILSRVNINLG-----------SSGSAGTQPIPSRLSNYKKNAGADPELVTLYFNY 347
Query: 357 GRYLLISSSRPG---TQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
GR+LL++SSR + ANLQGIWN++ P W S VNIN EMNYW +L NL E +
Sbjct: 348 GRHLLLASSRDTGDRSLPANLQGIWNDNYDPPWQSKYTVNINTEMNYWHALTTNLDETHK 407
Query: 414 PLFDFLTYLSINGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
PLFD + G A+ Y + G+V+HH TD+W ++ P+ T
Sbjct: 408 PLFDLVDMTRAQGRAMAKKMYGCNDGFVVHHNTDLWGDAA-----------PVDKGTPYT 456
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-- 530
HL EHY +T D++FL+ RA+P+L+ A+F +L ++G T PS SPE+ F+ P
Sbjct: 457 HLMEHYRFTQDKNFLQNRAWPVLKDAANFYYCYLFM-YNGSYVTGPSLSPENTFVVPSNM 515
Query: 531 ---GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED 587
GK V + TMD ++ E+F+ +ISA + L D V K L +++ KI
Sbjct: 516 RTAGKTEGVDIAPTMDNELLWELFNNVISAGKALGIT-DITVSKAKDYLSKIKEPKIGSK 574
Query: 588 GSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPG 644
G ++EW ++K+ E HRH SHLFGLFPG +T + L +A++ L R G G
Sbjct: 575 GQLLEWRNEYKEGEPAHRHFSHLFGLFPGSQMTPLVSETLAQASKVALDNRMRAGSGSTG 634
Query: 645 WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP--FQIDAN 702
WS W L+ARL D + + + NL+ + FQID N
Sbjct: 635 WSRVWAMNLYARLLDGANVWSNAVTFLQTYTLD-----------NLWNSGENRWFQIDGN 683
Query: 703 FGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
FGFT+A+AEML+QS + +++LPALP G VKGL ARG V I W G + + +
Sbjct: 684 FGFTSAIAEMLLQSH-SVVHILPALPKSAIPKGSVKGLVARGNFVVDIDWSGGSMTQATV 742
Query: 763 YSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKC 802
+ + G + KV+ GK+YT + +C
Sbjct: 743 TARSGGEVALRVEN----GAAFKVD---GKVYTGTVEDEC 775
>gi|270292150|ref|ZP_06198365.1| fibronectin type III domain protein [Streptococcus sp. M143]
gi|270279678|gb|EFA25520.1| fibronectin type III domain protein [Streptococcus sp. M143]
Length = 1747
Score = 342 bits (877), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 256/792 (32%), Positives = 395/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYQ--ERYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEVGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDENGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + D T E ++ + D+ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGNKTDQTT-------------KEALQGYNPDKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRVAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINKEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|421241117|ref|ZP_15697662.1| fibronectin type III domain protein [Streptococcus pneumoniae
2080913]
gi|395607495|gb|EJG67592.1| fibronectin type III domain protein [Streptococcus pneumoniae
2080913]
Length = 782
Score = 342 bits (877), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 405
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 406 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 462
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 463 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 513
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW--- 593
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW
Sbjct: 514 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 572
Query: 594 -AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
Q F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 573 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 631
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 632 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 680
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 681 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 733
>gi|419778183|ref|ZP_14304079.1| gram positive anchor [Streptococcus oralis SK10]
gi|383187500|gb|EIC79950.1| gram positive anchor [Streptococcus oralis SK10]
Length = 1707
Score = 342 bits (876), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 256/792 (32%), Positives = 398/792 (50%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G+QF++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ ++ Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKSKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + T + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINKEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|417939732|ref|ZP_12583021.1| gram positive anchor [Streptococcus oralis SK313]
gi|343389927|gb|EGV02511.1| gram positive anchor [Streptococcus oralis SK313]
Length = 1727
Score = 342 bits (876), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 255/792 (32%), Positives = 395/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKTKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGSKTGQTT-------------KEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDQTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKTKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINKEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|293364225|ref|ZP_06610951.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037]
gi|307702420|ref|ZP_07639376.1| alpha-fucosidase [Streptococcus oralis ATCC 35037]
gi|291317071|gb|EFE57498.1| conserved hypothetical protein [Streptococcus oralis ATCC 35037]
gi|307624002|gb|EFO02983.1| alpha-fucosidase [Streptococcus oralis ATCC 35037]
Length = 1707
Score = 342 bits (876), Expect = 6e-91, Method: Compositional matrix adjust.
Identities = 256/792 (32%), Positives = 398/792 (50%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G+QF++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ ++ Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKSKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + T + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINKEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|15903541|ref|NP_359091.1| hypothetical protein spr1498 [Streptococcus pneumoniae R6]
gi|116515332|ref|YP_816923.1| hypothetical protein SPD_1467 [Streptococcus pneumoniae D39]
gi|421266644|ref|ZP_15717524.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR27]
gi|15459158|gb|AAL00302.1| Conserved hypothetical protein [Streptococcus pneumoniae R6]
gi|116075908|gb|ABJ53628.1| conserved hypothetical protein [Streptococcus pneumoniae D39]
gi|395866712|gb|EJG77840.1| fibronectin type III domain protein [Streptococcus pneumoniae
SPAR27]
Length = 803
Score = 342 bits (876), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF+ ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFNQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNTFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNSLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|418966783|ref|ZP_13518495.1| hypothetical protein HMPREF1045_1205 [Streptococcus mitis SK616]
gi|383346450|gb|EID24496.1| hypothetical protein HMPREF1045_1205 [Streptococcus mitis SK616]
Length = 803
Score = 342 bits (876), Expect = 7e-91, Method: Compositional matrix adjust.
Identities = 249/803 (31%), Positives = 386/803 (48%), Gaps = 83/803 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSFDYQGGNLQDQYSFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GD+ +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNTAEELAEQHLVGPKTSQYGTYLSFGDLLIEFSRQGKTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y+ F RE F+S PD ++V + + + +L F + L D S +
Sbjct: 147 LATTSYAYKGTMFKRESFASFPDDLLVQRFTKEGAETLDFTIELSLTRDLASDGKYEQKK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F+ L + G I DK +++ G+ +
Sbjct: 207 SDYKECQLEITDSHILMKGRVKDN--NLRFAGCLAWQTD---GDIRVWSDK-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + +++ + Y+ L +RH++D Q LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLVQQVRDLVETAKEKGYAQLKSRHIEDCQTLFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L +D + + +K+++ E SL EL FQ+GRYLLISSSR +
Sbjct: 321 LDLG-------------AEVDASTTDDLLKNYKPQEGQSLEELFFQYGRYLLISSSRDCS 367
Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+NINL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNGVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAFPVINYIDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTIYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L R YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLRDRIYPILRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW--- 593
S +T D ++I ++F I AA+ L +ED L E V + L P +I + G I EW
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE-VKEKFELLNPLQITQSGRIREWYEE 593
Query: 594 -AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
Q F++ +V HRH SHL GL+PG+ + K + AA +L RG+ G GWS K
Sbjct: 594 EEQHFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYLVAASASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKSSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNND 770
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S +
Sbjct: 702 EMLLQSHTAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMRWEDKKLLQMTILSRSGGDL 760
Query: 771 HDSFKTLHYRGTSVKVNLSAGKI 793
S+ + + ++VN K+
Sbjct: 761 RVSYPGIE--KSVIEVNQEKAKV 781
>gi|418192091|ref|ZP_12828593.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA47388]
gi|353855177|gb|EHE35147.1| alpha-fucosidase domain protein [Streptococcus pneumoniae GA47388]
Length = 778
Score = 341 bits (875), Expect = 8e-91, Method: Compositional matrix adjust.
Identities = 246/785 (31%), Positives = 384/785 (48%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T +R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDCQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVMTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND + F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRV---------KDND----LWFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|419482693|ref|ZP_14022480.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40563]
gi|379579285|gb|EHZ44192.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40563]
Length = 803
Score = 341 bits (875), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLPQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|168486978|ref|ZP_02711486.1| alpha-fucosidase [Streptococcus pneumoniae CDC1087-00]
gi|237650661|ref|ZP_04524913.1| alpha-fucosidase [Streptococcus pneumoniae CCRI 1974]
gi|237822420|ref|ZP_04598265.1| alpha-fucosidase [Streptococcus pneumoniae CCRI 1974M2]
gi|418126305|ref|ZP_12763211.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44511]
gi|418214849|ref|ZP_12841583.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA54644]
gi|419458244|ref|ZP_13998186.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02254]
gi|419484883|ref|ZP_14024658.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43257]
gi|419510919|ref|ZP_14050560.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP141]
gi|419530550|ref|ZP_14070077.1| hypothetical protein SPAR62_1499 [Streptococcus pneumoniae GA40028]
gi|421213591|ref|ZP_15670545.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070108]
gi|421215753|ref|ZP_15672674.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070109]
gi|421301508|ref|ZP_15752178.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA19998]
gi|183570104|gb|EDT90632.1| alpha-fucosidase [Streptococcus pneumoniae CDC1087-00]
gi|353796245|gb|EHD76590.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44511]
gi|353869579|gb|EHE49460.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA54644]
gi|379529908|gb|EHY95149.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02254]
gi|379573458|gb|EHZ38413.1| hypothetical protein SPAR62_1499 [Streptococcus pneumoniae GA40028]
gi|379581636|gb|EHZ46520.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43257]
gi|379631522|gb|EHZ96099.1| fibronectin type III domain protein [Streptococcus pneumoniae
NP141]
gi|395578822|gb|EJG39332.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070108]
gi|395579960|gb|EJG40455.1| fibronectin type III domain protein [Streptococcus pneumoniae
2070109]
gi|395899068|gb|EJH10012.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA19998]
Length = 803
Score = 341 bits (875), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 246/785 (31%), Positives = 384/785 (48%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T +R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDCQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 147 LVMTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND + F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LWFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|418144603|ref|ZP_12781398.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13494]
gi|418185405|ref|ZP_12821946.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47283]
gi|353807069|gb|EHD87341.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA13494]
gi|353848689|gb|EHE28701.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47283]
Length = 782
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 246/785 (31%), Positives = 384/785 (48%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T +R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDCQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
Y F RE F+S PD ++V + + + +L F + L L + Y
Sbjct: 126 LVMTSYVYKGTRFEREAFASFPDDLLVQRFTKEGAETLDFTIELSLTCDLASDEKYEQEK 185
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND + F++ L + G I
Sbjct: 186 SDYKECKLDITDSHILMKGRV---------KDND----LWFASYLAWETD---GDIRVWS 229
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 230 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 288
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 289 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 335
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 336 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVI 395
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A V Y +GW++H + W D W P
Sbjct: 396 NYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 451
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 452 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 510
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 511 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 561
Query: 586 EDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW ++ F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 562 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 620
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 621 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQI 669
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 670 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 728
Query: 760 VGIYS 764
+ I S
Sbjct: 729 LTILS 733
>gi|417923725|ref|ZP_12567182.1| hypothetical protein HMPREF9959_1543 [Streptococcus mitis SK569]
gi|342836607|gb|EGU70818.1| hypothetical protein HMPREF9959_1543 [Streptococcus mitis SK569]
Length = 803
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 249/803 (31%), Positives = 386/803 (48%), Gaps = 83/803 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSFDYQGGNLQDQYSFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GD+ +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNTAEELAEQHLVGPKTSQYGTYLSFGDLLIEFSRQGKTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y+ F RE F+S PD ++V + + + +L F + L D S +
Sbjct: 147 LATTSYAYKGTMFKRESFASFPDDLLVQRFTKEGAETLDFTIELSLTRDLASDGKYEQKK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F+ L + G I DK +++ G+ +
Sbjct: 207 SDYKECQLEITDSHILMKGRVKDN--NLRFAGCLAWQTD---GDIRVWSDK-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + +++ + Y+ L +RH++D Q LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLVQQVRDLVETAKEKGYAQLKSRHIEDCQTLFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L +D + + +K+++ E SL EL FQ+GRYLLISSSR +
Sbjct: 321 LDLG-------------AEVDASTTDDLLKNYKPQEGQSLEELFFQYGRYLLISSSRDCS 367
Query: 370 QV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+NINL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNGVDNPPWNSDYHLNINLQMNYWPAYVTNLLETAFPVINYIDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAARYAGIVSQEGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTIYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L R YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLRDRIYPILRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW--- 593
S +T D ++I ++F I AA+ L +ED L E V + L P +I + G I EW
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTE-VKEKFELLNPLQITQSGRIREWYEE 593
Query: 594 -AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
Q F++ +V HRH SHL GL+PG+ + K + AA +L RG+ G GWS K
Sbjct: 594 EEQHFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYLVAASASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKSSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNND 770
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S +
Sbjct: 702 EMLLQSHTAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMRWEDKKLLQMTILSRSGGDL 760
Query: 771 HDSFKTLHYRGTSVKVNLSAGKI 793
S+ + + ++VN K+
Sbjct: 761 RVSYPGIE--KSVIEVNQEKAKV 781
>gi|406576906|ref|ZP_11052529.1| LPXTG cell surface protein, calx-beta domain-containing protein
[Streptococcus sp. GMD6S]
gi|404460587|gb|EKA06837.1| LPXTG cell surface protein, calx-beta domain-containing protein
[Streptococcus sp. GMD6S]
Length = 1707
Score = 341 bits (874), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 255/792 (32%), Positives = 396/792 (50%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ+LF+RV + L N + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQRLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINKEGRIKEWYEEDNPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLIARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|418202865|ref|ZP_12839294.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA52306]
gi|419456006|ref|ZP_13995963.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP04]
gi|421285997|ref|ZP_15736773.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60190]
gi|421307849|ref|ZP_15758491.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60132]
gi|353867422|gb|EHE47317.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA52306]
gi|379627982|gb|EHZ92588.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP04]
gi|395885984|gb|EJG97005.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60190]
gi|395907234|gb|EJH18128.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60132]
Length = 803
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH+SHL GL+PG+ + K + +AA +L R + G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHVSHLVGLYPGNLFSY-KGQEYIEAARASLNDREDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|405760473|ref|YP_006701069.1| hypothetical protein SPNA45_00586 [Streptococcus pneumoniae SPNA45]
gi|404277362|emb|CCM07874.1| conserved hypothetical protein [Streptococcus pneumoniae SPNA45]
Length = 803
Score = 340 bits (873), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 378/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKMSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y +F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTQFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D ++F++ L K G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD--TDLRFASYLAWKTD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKDYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYL--------ASGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAEIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I A+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQVAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V +RH SHL GL+PG+ + K + +AA +L RG G GWS K
Sbjct: 594 EEQYFQNEKVEAQYRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGNGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|322375926|ref|ZP_08050437.1| fibronectin type III domain protein [Streptococcus sp. C300]
gi|321279194|gb|EFX56236.1| fibronectin type III domain protein [Streptococcus sp. C300]
Length = 1707
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 256/792 (32%), Positives = 396/792 (50%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G+QF++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ ++ Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKSKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + T + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I +G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINNEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|15901489|ref|NP_346093.1| hypothetical protein SP_1654 [Streptococcus pneumoniae TIGR4]
gi|111658563|ref|ZP_01409226.1| hypothetical protein SpneT_02000319 [Streptococcus pneumoniae
TIGR4]
gi|421243582|ref|ZP_15700095.1| fibronectin type III domain protein [Streptococcus pneumoniae
2081074]
gi|421247923|ref|ZP_15704402.1| fibronectin type III domain protein [Streptococcus pneumoniae
2082170]
gi|14973145|gb|AAK75733.1| conserved hypothetical protein [Streptococcus pneumoniae TIGR4]
gi|395606587|gb|EJG66691.1| fibronectin type III domain protein [Streptococcus pneumoniae
2081074]
gi|395612939|gb|EJG72972.1| fibronectin type III domain protein [Streptococcus pneumoniae
2082170]
Length = 803
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL++NYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQLNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y + D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YLFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|421290215|ref|ZP_15740965.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA54354]
gi|421305607|ref|ZP_15756261.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA62331]
gi|395887900|gb|EJG98914.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA54354]
gi|395904565|gb|EJH15479.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA62331]
Length = 803
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYET 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L + KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTDVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGNGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|418094446|ref|ZP_12731573.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA49138]
gi|353764942|gb|EHD45490.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA49138]
Length = 803
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 248/785 (31%), Positives = 383/785 (48%), Gaps = 103/785 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +L +G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLCSGGPLPDSSDYQGGNLQDQYGFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVN-- 188
A Y F RE F+S PD ++V + +L F + L L N Y
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCYLASNGKYEQEK 206
Query: 189 ----------GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
++ I+M+GR ND ++F++ L + G I
Sbjct: 207 SDYKECKLDITDSHILMKGRVKD---------ND----LRFASYLAWETD---GDIRVWS 250
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
D+ +++ G+ +A L L A + F + K D + + + + + Y+ L +RH+
Sbjct: 251 DR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHI 309
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGR 358
+DYQ LF RV + L E ++D + + +K+++ E +L EL FQ+GR
Sbjct: 310 EDYQALFQRVQLDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGR 356
Query: 359 YLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
YLLISSSR P ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+
Sbjct: 357 YLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVI 416
Query: 417 DFLTYLSINGSKTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMG 466
+++ L + G + A + Y +GW++H + W D W P
Sbjct: 417 NYVDDLRVYG-RLAALKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAA 472
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYL-ETNPSTSPEHE 525
AW+ ++E Y++ D+D+L ++ YP+L F +L + ++PS SPEH
Sbjct: 473 NAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH- 531
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
+S +T D ++I ++F I AA+ L +ED L E KS L P +I
Sbjct: 532 --------GPISIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQIT 582
Query: 586 EDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+ G I EW Q F++ +V HRH SHL GL+PG+ + K + +AA +L RG
Sbjct: 583 QSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRG 641
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A++++ + + NL+ +HPPFQI
Sbjct: 642 DGGTGWSKANKINLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQI 690
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS L L ALP D WS+G V GL ARG VS+ W+D L +
Sbjct: 691 DGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQ 749
Query: 760 VGIYS 764
+ I S
Sbjct: 750 LTILS 754
>gi|418975961|ref|ZP_13523855.1| gram positive anchor [Streptococcus oralis SK1074]
gi|383346616|gb|EID24639.1| gram positive anchor [Streptococcus oralis SK1074]
Length = 1687
Score = 340 bits (872), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 258/787 (32%), Positives = 393/787 (49%), Gaps = 107/787 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATA-ASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A A LFG Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLFGPNNAQYGRCLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGDYSWE 319
Query: 184 -HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+Y NG+ G I K D+ G++F++ L IK GT++ ++++ L
Sbjct: 320 YSNYKNGHVTTDEHG------ILLKGTVKDN--GLKFASYLGIKTD---GTVT-VQNETL 367
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
V G+ +A L L A ++F NP ++ +KD E +++ + Y L H+
Sbjct: 368 TVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKKAHIK 424
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + LS S T E ++ + ++ L EL FQ+GRY
Sbjct: 425 DYQSLFNRVKLNLSGSKTAQTT-------------KEALQGYNPEKGQKLEELFFQYGRY 471
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E +P+ +
Sbjct: 472 LLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETAKPMIN 531
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 532 YIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 586
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTSPEH 524
AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS SPEH
Sbjct: 587 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKTSDRWV-SSPSYSPEH 645
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F + A L+ ++D LV +V +L+P I
Sbjct: 646 ---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVEAKFDKLKPLHI 695
Query: 585 AEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
+G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA TL R
Sbjct: 696 NNEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARATLNHR 754
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G+ G GWS K LWARL D A+R++ E NL+ H PFQ
Sbjct: 755 GDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDTHAPFQ 803
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WKD +L
Sbjct: 804 IDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLIARGNFEVSMKWKDKNLQ 862
Query: 759 EVGIYSN 765
+ SN
Sbjct: 863 SLSFLSN 869
>gi|260589559|ref|ZP_05855472.1| putative fibronectin type III domain protein [Blautia hansenii DSM
20583]
gi|260540127|gb|EEX20696.1| putative fibronectin type III domain protein [Blautia hansenii DSM
20583]
Length = 1719
Score = 340 bits (871), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 240/774 (31%), Positives = 378/774 (48%), Gaps = 86/774 (11%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS- 77
+PIGN +GA V+G + E L N+ TLW G P G+ D + +S+V
Sbjct: 75 LPIGNSFMGANVYGEIGEERLTFNQKTLWNGGPSESRPNYDGGNKETADNGQKMSEVYKE 134
Query: 78 ---LVDSGQYAEATAASVKLFGHPAD--VYQLLGDIELEFDDSHLKYAE-ETYRRELDLN 131
L G +A + KL G YQ GDI ++F LK + E Y R+L+L
Sbjct: 135 IIKLYKEGNDTQANELAKKLTGEVEGYGAYQSWGDIYVDFG---LKEEQAENYVRDLNLE 191
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A A V + + + RE+F S PD V+ K + + L F++S +DN V
Sbjct: 192 NAVASVDFDYQDTKMHREYFISYPDNVLAMKFTADGNEKLDFDISFP--IDNAEGV---- 245
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGI-------QFSAILEIKISDDRGTISALEDKKLKV 244
+ GK + K DD + Q ++K+ + G + + KL V
Sbjct: 246 ----ADKKLGKSV--KTTVEDDMITVSGEMQDNQLKLNGKLKVETEGGKVQEKDGDKLHV 299
Query: 245 EGSDWAVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
G+ AV+ + A + + P ++ ++ + A+ Y + H+ DY
Sbjct: 300 SGASEAVVYVSADTDYLNKYPDYRTGETAQELDASVEKAVDKASKKGYEKVKKEHIKDYS 359
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
++F RV + L ++ + TD ++ + + ++ E+ +L +LFQ+GRYL I
Sbjct: 360 EIFSRVQLDLGQNVPEKTTDIL----LNDYNAGKNTEA----ENRALEVILFQYGRYLTI 411
Query: 363 SSSRPGTQVANLQGIWNEDLSP----TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
+SSR G +NLQG+W + W S H+N+NL+MNYW + N++EC PL D+
Sbjct: 412 ASSRAGDLPSNLQGVWQNRVGDHNRIPWASDYHMNVNLQMNYWPTYSTNMAECATPLIDY 471
Query: 419 LTYLSINGSKTAQVNY-LASGWVIHHKTDI---WAKSSADRGKVVWALWPMGGAWLCTHL 474
+ L G TA+ + + +G H + W D W P W+ +
Sbjct: 472 INSLVEPGKVTAKTYFGVENGGFTAHTQNTPFGWTCPGWD---FSWGWSPAALPWILQNC 528
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHD-GYLETNPSTSPEHEFIAPDGKL 533
WE+Y YT D ++E+ YP+L+ A LIE G L + P+ SPEH
Sbjct: 529 WEYYEYTGDVKYMEEHIYPMLKEAALLYDQILIEDEKTGRLVSAPAYSPEH--------- 579
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
V+ +T + ++I +++ +AAE+L K+ED E + +L+P +I E G I EW
Sbjct: 580 GPVTAGNTYEQSLIWQLYEDAATAAEILGKDEDKAKEWRQRQ-EKLKPIEIGESGQIKEW 638
Query: 594 AQDF---KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
+ E HRH+SHL GLFPG I+++ N + AA +L++RGE+ GW + +
Sbjct: 639 YTETTLGSMGEKGHRHMSHLLGLFPGDLISVD-NAEYMDAAIVSLKERGEKSTGWGMGQR 697
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
WAR D A+++++ LF H+ G+Y NL+ H PFQID NFG T+ V+
Sbjct: 698 INAWARTGDGNQAHKLIQNLF------HD-----GIYPNLWDTHTPFQIDGNFGMTSGVS 746
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS + + +LP+LP D W++G VKGL ARG VS+ W D +L E + S
Sbjct: 747 EMLMQSNMGYINMLPSLP-DVWANGSVKGLVARGNFEVSMKWADKNLTEASVLS 799
>gi|302346987|ref|YP_003815285.1| hypothetical protein HMPREF0659_A7263 [Prevotella melaninogenica ATCC
25845]
gi|302151004|gb|ADK97265.1| conserved hypothetical protein [Prevotella melaninogenica ATCC 25845]
Length = 1163
Score = 339 bits (870), Expect = 3e-90, Method: Compositional matrix adjust.
Identities = 241/788 (30%), Positives = 374/788 (47%), Gaps = 105/788 (13%)
Query: 11 NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N + + PA ++ T +PIGNG+ GA + G V + ++ N+ TLW+G G T
Sbjct: 341 NKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLWSGKLGGLT----- 395
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+TAA +G+ Y G++ + S Y R LD
Sbjct: 396 -----------------STAA----YGY----YLNFGNLYIR---SRGMSKVTDYVRYLD 427
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD-NHSY-V 187
+N A A VKY++ V ++R +F+SNPD +V + + S++G ++ ++L + N SY V
Sbjct: 428 INDAVAGVKYTMDGVAYSRTYFASNPDSCVVVRYTASQNGKINTTLTLKNQNGRNVSYTV 487
Query: 188 NGNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ NNQ I +G+ A +D S +I D GTI+ ++V
Sbjct: 488 DNNNQATITFDGQV--------ARQDDHGATTPESYYCAARIVTDGGTITKNAKGIIEVN 539
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G++ + L + FD + + + +N Y L H DY+ LF
Sbjct: 540 GANSMTVYLRGLTDFDPDAPTYVSGANLLAGRAAATVNDAQNKGYDALLAAHKADYKSLF 599
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLIS 363
R + LS +I P+ + + S++ ++ +L EL F +GRYLLIS
Sbjct: 600 DRCQLTLSDVKNNI-------------PTPQLISSYRDNQHDNLFLEELYFNYGRYLLIS 646
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL---T 420
SSR + ANLQGIWN++ +P W S H NIN++MNYW + P NLSE P D++
Sbjct: 647 SSRGVSLPANLQGIWNDNNTPAWHSDIHANINVQMNYWPAEPTNLSELHRPFLDYIYREA 706
Query: 421 YLSINGSKTAQ-VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYN 479
+ + AQ + ++ +GW + + +I+ G + + AW C HLW+HY
Sbjct: 707 CVKPTWRRFAQDMGHVNTGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHLWQHYT 761
Query: 480 YTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
YTMD+DFL +A+P ++ + L++ DG E SPEH +
Sbjct: 762 YTMDKDFLRAKAFPAMKSAVDYWFKKLVKAADGTYECPNEWSPEH---------GPTENA 812
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS---------- 589
+ ++ ++F+ A +VL D +V K + K+ +DG
Sbjct: 813 TAHSQQLVWDLFNNTRKAIKVLG---DDVVSKAFRDSLATYFAKL-DDGCHTEVNPADGQ 868
Query: 590 --IMEW--AQDFKDPEV-------HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
+ EW + F +P HRH+SHL GL+P I+ + + + +AA ++L R
Sbjct: 869 TYLREWKYSSQFNNPSKIGVNEYKAHRHISHLMGLYPCTQISEDADKTVFEAARQSLIAR 928
Query: 639 GE-EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
G+ G GWS+ K L AR ++ +H + ++KR GG+Y NL+ AH P+
Sbjct: 929 GDGHGTGWSLGHKINLNARAYEGQHCHNLIKRALQQTWDTGTNEAAGGIYENLWDAHAPY 988
Query: 698 QIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
QID NFG+TA VAEML+QS + L +LPALP W G VKGLKA G TV I W
Sbjct: 989 QIDGNFGYTAGVAEMLLQSHNDKLVILPALPTTFWQKGSVKGLKAVGNFTVDIDWAAAKA 1048
Query: 758 HEVGIYSN 765
+V I SN
Sbjct: 1049 TKVQIVSN 1056
>gi|331265740|ref|YP_004325370.1| LPXTG cell surface protein, calx-beta domain-containing protein
[Streptococcus oralis Uo5]
gi|326682412|emb|CBZ00029.1| LPXTG cell surface protein, calx-beta domain protein [Streptococcus
oralis Uo5]
Length = 1707
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 256/792 (32%), Positives = 394/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G+QF++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLQFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS-KKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E ++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKNNYRKDIDLEKTVKGIVEVAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L + T + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGT-------------KTTQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I +G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINNEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|418139981|ref|ZP_12776806.1| hypothetical protein SPAR28_1620 [Streptococcus pneumoniae GA13338]
gi|418181010|ref|ZP_12817579.1| hypothetical protein SPAR74_1621 [Streptococcus pneumoniae GA41688]
gi|353843082|gb|EHE23127.1| hypothetical protein SPAR74_1621 [Streptococcus pneumoniae GA41688]
gi|353904760|gb|EHE80210.1| hypothetical protein SPAR28_1620 [Streptococcus pneumoniae GA13338]
Length = 778
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQD 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYL---VWETDGDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ Y +L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYTMLRETVRFWNAFLRKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGFVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|429764051|ref|ZP_19296381.1| f5/8 type C domain protein [Clostridium celatum DSM 1785]
gi|429188824|gb|EKY29689.1| f5/8 type C domain protein [Clostridium celatum DSM 1785]
Length = 1566
Score = 339 bits (869), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 242/805 (30%), Positives = 391/805 (48%), Gaps = 123/805 (15%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP------------------ 69
+P+GNG LG+ V+GGV E + N+ TLWTG P NPD
Sbjct: 49 LPLGNGNLGSSVFGGVEKERIHFNDKTLWTGGP---DNPDGTMNDGTQYQGGNRLFEFNE 105
Query: 70 KALSDVRSLVDSGQY---AEATAASVKLFGHPADV--YQLLGDIELEFDD--SHLKYAEE 122
+ +++ S DS T S LF + ++ +Q GDI L+F + S+ K +
Sbjct: 106 EGYNNLISKFDSNDPLVPTGNTGVSSTLFSNRPNLGSWQDFGDIYLDFSEMGSNSKNVD- 164
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD---- 178
Y R LD+ A + V Y + REHF S PD V+VT++S G L F+V L
Sbjct: 165 NYERSLDIKNAISEVIYDYNETTYLREHFVSYPDNVLVTRLSKDGDGKLDFDVELKKSSA 224
Query: 179 -SLLDNHSYVNGNNQII-MEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISA 236
S D + ++ NN I + G G ++ ++SA L++ + T+
Sbjct: 225 LSSNDATTSIDDNNTTIKLIGTLNGNKM-------------KYSASLKVIVDGKESTVEP 271
Query: 237 LEDKKLKVEGSDWAVLLLVASSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
+ +KV +D VL+ + + P ++ ++ T+ + Y+ L
Sbjct: 272 NGNSTIKVRNADEVVLIFSTGTDYKNIYPGYRTGETSEEVTNRVNKVINDAAKKGYNTLL 331
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DY++LF RVS+ L+ ++ TD E + + S +L L+F
Sbjct: 332 ENHVSDYKELFDRVSLDLNEIAPNVPTDELIENYRNGIYS------------KALEALVF 379
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
Q+GRYL I+SSR G+ +NL G+W+ SP W H N+N++MNYW + NL+EC +
Sbjct: 380 QYGRYLTIASSREGSLPSNLAGLWSIG-SPLWSGDYHFNVNVQMNYWPAFSTNLAECGKV 438
Query: 415 LFDFLTYLSINGSKTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWA 461
D+++ L I G K+A+++ A +G++IH + + K+ + G+ +
Sbjct: 439 FADYMSSLVIPGRKSAEMSIGAKTDDFETTPIGEGNGFMIHTANNPFGKTCPN-GEEYYG 497
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
P G W + +++Y +T D+++LE YP+++ A+ + LIE ++ ST
Sbjct: 498 WNPNGATWALQNAFDYYEFTKDKEYLESTIYPMVKEVANMWTNSLIESK---VQKIGSTE 554
Query: 522 PEHEFIAP--DGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PR 578
+ +AP + ++ +T D +++ E+F I AA +LEK+ D + K+ + +
Sbjct: 555 EQRLVVAPSTSAEQGPMTVGTTYDQSLVWEIFEKAIKAANILEKDSDEI--KIWTEMQSK 612
Query: 579 LRPTKIAEDGSIMEWAQDFKDPEV-------------------HHRHLSHLFGLFPGHTI 619
L P I E G I EW Q+ + HRH+SHL GLFPG T+
Sbjct: 613 LDPVIIGEGGQIKEWYQETTAGKYLNNGVTTNIPSFNRDYGGESHRHISHLVGLFPG-TL 671
Query: 620 TIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHE 679
+ N + +AA+ +L +RG + GWS K LWAR D E+ Y++V+ + +
Sbjct: 672 INKDNTEEIEAAKVSLLERGFKATGWSKGHKLNLWARTLDSENTYKVVQSMLST------ 725
Query: 680 KHFEGGLYSNLFAAH---------PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWD 730
G+ NLF +H P FQI+ NFG+T+ +AEML+QS L + LP +P D
Sbjct: 726 --NYAGIMDNLFDSHGFGTDHEQSPGFQIEGNFGYTSGIAEMLLQSQLGYVQFLPTIP-D 782
Query: 731 KWSSGCVKGLKARGGETVSICWKDG 755
+WS G VKGL ARG VS W++G
Sbjct: 783 EWSDGEVKGLVARGNFVVSEKWQNG 807
>gi|345882387|ref|ZP_08833873.1| hypothetical protein HMPREF0666_00049 [Prevotella sp. C561]
gi|345044169|gb|EGW48214.1| hypothetical protein HMPREF0666_00049 [Prevotella sp. C561]
Length = 1163
Score = 338 bits (868), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 240/796 (30%), Positives = 373/796 (46%), Gaps = 98/796 (12%)
Query: 1 MMNAESTSTTNP---LKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLW 56
M+ +T NP + + PA ++ T +PIGNG+ GA + G V + ++ N+ TLW
Sbjct: 328 MVPVSGITTFNPANKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLW 387
Query: 57 TGVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSH 116
+G G T +TAA +G+ L F + +
Sbjct: 388 SGKLGGLT----------------------STAA----YGY-----------YLNFGNLY 410
Query: 117 LKYAEET----YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLS 172
++ E T Y R LD+N A A V+Y++ V + R +F++NPD +V + + SE G ++
Sbjct: 411 IRSRELTKVTDYVRYLDINDAVAGVRYTMDGVAYDRTYFATNPDSCLVIRYTASEKGRIN 470
Query: 173 FNVSLDSLLD-NHSY-VNGNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKIS 228
++L + N +Y V+ NNQ I EG+ A ND S +I
Sbjct: 471 TTLTLKNQNGRNVNYTVDNNNQATITFEGKV--------ARQNDKGATTPESYYCAARIV 522
Query: 229 DDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNL 288
D G+++ ++V G++ + L + FD + + + + N
Sbjct: 523 TDGGSVTKNAKGLIEVSGANSMTVYLRGLTDFDPDAAEYVSGADRLAGRATATVNNAENK 582
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
Y L H DY+ LF R + L+ S +T+P+ + + +++ ++ +
Sbjct: 583 GYDALLAAHKADYKSLFDRCQLTLADSK-------------NTIPTPQLISNYRDNQHDN 629
Query: 349 LV--ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
L EL F +GRYLLISSSR + ANLQGIWN++ +P W S H NIN++MNYW + P
Sbjct: 630 LFLEELYFNYGRYLLISSSRGVSLPANLQGIWNDNNTPAWHSDIHANINVQMNYWPAEPT 689
Query: 407 NLSECQEPLFDFLTYLSINGSKT-----AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
NLSE P D++ Y T + ++ +GW + + +I+ G
Sbjct: 690 NLSELHRPFLDYI-YREACVKPTWRRFAKDMGHVNTGWTLPTENNIYGS-----GTTFAN 743
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTS 521
+ + AW C HLW+HY YTMD++FL +A+P ++ + L++ DG E S
Sbjct: 744 TYTVANAWYCQHLWQHYTYTMDKEFLRTKAFPAMKTAVDYWFKKLVKAADGTYECPNEWS 803
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH P S D+ A++ V + D+L K
Sbjct: 804 PEH---GPTENATAHSQQLVWDLFNNTRKAIAVLGDNVVSKSFRDSLSTYFAKLDDGCHT 860
Query: 582 TKIAEDGS--IMEW--AQDFKDPE-------VHHRHLSHLFGLFPGHTITIEKNPDLCKA 630
DG + EW + F +P ++HRH+SHL GL+P I+ + + + +A
Sbjct: 861 EVNPADGKTYLREWKYSSQFNNPNKIGTKEYINHRHISHLMGLYPCSQISEDADKTVFEA 920
Query: 631 AEKTLQKRGE-EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSN 689
A +L RG+ G GWS+ K L AR ++ H + ++KR GG+Y N
Sbjct: 921 ARTSLIARGDGHGTGWSLGHKINLNARAYEGLHCHNLIKRALQQTWDTGTNEAAGGIYEN 980
Query: 690 LFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
L+ AH P+QID NFG+TA VAEML+QS + L +LPALP W G VKGLKA G TV
Sbjct: 981 LWDAHAPYQIDGNFGYTAGVAEMLLQSYNDKLVILPALPTSFWQKGSVKGLKAVGNFTVD 1040
Query: 750 ICWKDGDLHEVGIYSN 765
I W + ++ I SN
Sbjct: 1041 IDWDNAKATQIRIVSN 1056
>gi|148988700|ref|ZP_01820133.1| hypothetical protein CGSSp6BS73_09249 [Streptococcus pneumoniae
SP6-BS73]
gi|182684597|ref|YP_001836344.1| hypothetical protein SPCG_1627 [Streptococcus pneumoniae CGSP14]
gi|303255977|ref|ZP_07342005.1| hypothetical protein CGSSpBS455_10785 [Streptococcus pneumoniae
BS455]
gi|303258584|ref|ZP_07344564.1| hypothetical protein CGSSp9vBS293_05634 [Streptococcus pneumoniae
SP-BS293]
gi|303262671|ref|ZP_07348611.1| hypothetical protein CGSSp14BS292_00767 [Streptococcus pneumoniae
SP14-BS292]
gi|303263611|ref|ZP_07349533.1| hypothetical protein CGSSpBS397_07359 [Streptococcus pneumoniae
BS397]
gi|303266372|ref|ZP_07352261.1| hypothetical protein CGSSpBS457_04537 [Streptococcus pneumoniae
BS457]
gi|303268245|ref|ZP_07354043.1| hypothetical protein CGSSpBS458_04243 [Streptococcus pneumoniae
BS458]
gi|387759769|ref|YP_006066747.1| hypothetical protein SPNINV200_14790 [Streptococcus pneumoniae
INV200]
gi|419515161|ref|ZP_14054786.1| fibronectin type III domain protein [Streptococcus pneumoniae
England14-9]
gi|421296489|ref|ZP_15747198.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58581]
gi|147925901|gb|EDK76976.1| hypothetical protein CGSSp6BS73_09249 [Streptococcus pneumoniae
SP6-BS73]
gi|182629931|gb|ACB90879.1| hypothetical protein SPCG_1627 [Streptococcus pneumoniae CGSP14]
gi|301802358|emb|CBW35112.1| conserved hypothetical protein [Streptococcus pneumoniae INV200]
gi|302597036|gb|EFL64154.1| hypothetical protein CGSSpBS455_10785 [Streptococcus pneumoniae
BS455]
gi|302636227|gb|EFL66722.1| hypothetical protein CGSSp14BS292_00767 [Streptococcus pneumoniae
SP14-BS292]
gi|302640085|gb|EFL70540.1| hypothetical protein CGSSpBS293_05634 [Streptococcus pneumoniae
SP-BS293]
gi|302642196|gb|EFL72545.1| hypothetical protein CGSSpBS458_04243 [Streptococcus pneumoniae
BS458]
gi|302644072|gb|EFL74330.1| hypothetical protein CGSSpBS457_04537 [Streptococcus pneumoniae
BS457]
gi|302646649|gb|EFL76874.1| hypothetical protein CGSSpBS397_07359 [Streptococcus pneumoniae
BS397]
gi|379635710|gb|EIA00269.1| fibronectin type III domain protein [Streptococcus pneumoniae
England14-9]
gi|395895362|gb|EJH06337.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58581]
Length = 803
Score = 338 bits (868), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 244/774 (31%), Positives = 380/774 (49%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQD 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYL---VWETDGDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ Y +L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYTMLRETVRFWNAFLRKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGFVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|387626851|ref|YP_006063027.1| hypothetical protein INV104_14070 [Streptococcus pneumoniae INV104]
gi|444382288|ref|ZP_21180491.1| hypothetical protein PCS8106_00679 [Streptococcus pneumoniae
PCS8106]
gi|444385525|ref|ZP_21183597.1| hypothetical protein PCS8203_01377 [Streptococcus pneumoniae
PCS8203]
gi|301794637|emb|CBW37088.1| conserved hypothetical protein [Streptococcus pneumoniae INV104]
gi|444249595|gb|ELU56083.1| hypothetical protein PCS8203_01377 [Streptococcus pneumoniae
PCS8203]
gi|444252562|gb|ELU59024.1| hypothetical protein PCS8106_00679 [Streptococcus pneumoniae
PCS8106]
Length = 803
Score = 338 bits (867), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 243/774 (31%), Positives = 379/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD +++ + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLIQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+ G+ + K + +AA +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYSGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|423281388|ref|ZP_17260299.1| hypothetical protein HMPREF1203_04516 [Bacteroides fragilis HMW
610]
gi|404583092|gb|EKA87775.1| hypothetical protein HMPREF1203_04516 [Bacteroides fragilis HMW
610]
Length = 406
Score = 338 bits (867), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 175/378 (46%), Positives = 231/378 (61%), Gaps = 12/378 (3%)
Query: 398 MNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK 457
MNYW + L EC EPLF + L++NGS TA Y GW HH T IW +S G+
Sbjct: 1 MNYWLAETTGLPECSEPLFRLIRELAVNGSATAAKMYNLPGWTSHHITSIWRESGLADGE 60
Query: 458 VVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETN 517
W +W M WLC HLW+HY ++ D+ FL + AYPL+ A F WL+E DG +T
Sbjct: 61 PTWFMWNMSAGWLCRHLWDHYLFSGDKKFLRETAYPLMRDAARFYNAWLVE-KDGMWQTP 119
Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE-----DALVEKV 572
SPE++F+ P+ K + ++ + MDMAIIRE+FS AA +L + D L+ V
Sbjct: 120 LGVSPENQFLTPEKKTSAIAPAPAMDMAIIRELFSNTAEAAAILAADSILPPADTLLLHV 179
Query: 573 LKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAE 632
+ + +L P +I + G IMEW++DF + E HHRHLSHL+G PG IT K P+L A
Sbjct: 180 MGA-KQLVPYRIGKRGQIMEWSEDFDEVEPHHRHLSHLYGFHPGCEITPGKTPELVSAVR 238
Query: 633 KTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVD--PEHEKHFEGGLYSNL 690
+TL+ RG+E GWS+ WK +WAR+HD HAYR+++ LF D PE +H GGLY NL
Sbjct: 239 RTLELRGDEATGWSMGWKINMWARMHDGNHAYRIIRNLFTPTDFGPEVNRH--GGLYKNL 296
Query: 691 FAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSI 750
F AHPPFQID NFG+TA VAEML+QS + +LPALP D W+ G V GL+ARGG + I
Sbjct: 297 FDAHPPFQIDGNFGYTAGVAEMLLQSHDGVIDVLPALP-DVWAEGKVTGLRARGGFIIDI 355
Query: 751 CWKDGDLHEVGIYSNYSN 768
W V ++S N
Sbjct: 356 TWSKSGKTVVKVFSEQGN 373
>gi|315611778|ref|ZP_07886700.1| alpha-L-fucosidase [Streptococcus sanguinis ATCC 49296]
gi|315316193|gb|EFU64223.1| alpha-L-fucosidase [Streptococcus sanguinis ATCC 49296]
Length = 1707
Score = 338 bits (867), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 254/792 (32%), Positives = 396/792 (50%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKVKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L N + E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINKEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|306824549|ref|ZP_07457895.1| possible alpha-L-fucosidase [Streptococcus sp. oral taxon 071 str.
73H25AP]
gi|304433336|gb|EFM36306.1| possible alpha-L-fucosidase [Streptococcus sp. oral taxon 071 str.
73H25AP]
Length = 1749
Score = 338 bits (867), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 256/792 (32%), Positives = 395/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 184 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 241
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 242 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 301
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDNHSYV-- 187
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N +Y
Sbjct: 302 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 361
Query: 188 -----NGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
NG+ N I+++G K N G++F++ L IK G + A+
Sbjct: 362 YSHYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIKTD---GKV-AV 404
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E+ +++ + Y L
Sbjct: 405 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLENTVKGIVEAAKAKDYETLK 461
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++S+ ++ L EL F
Sbjct: 462 QDHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQSYNPEKGQKLEELFF 508
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NLSE
Sbjct: 509 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLSETA 568
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 569 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 623
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 624 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKVSDRWV-SSPS 682
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 683 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 732
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I +G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA
Sbjct: 733 KPLHINNEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 791
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 792 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 840
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG V++ WK
Sbjct: 841 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVNMKWK 899
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 900 DKNLQSLSFLSN 911
>gi|189207008|ref|XP_001939838.1| hypothetical protein PTRG_09506 [Pyrenophora tritici-repentis
Pt-1C-BFP]
gi|187975931|gb|EDU42557.1| hypothetical protein PTRG_09506 [Pyrenophora tritici-repentis
Pt-1C-BFP]
Length = 742
Score = 338 bits (867), Expect = 7e-90, Method: Compositional matrix adjust.
Identities = 256/822 (31%), Positives = 388/822 (47%), Gaps = 123/822 (14%)
Query: 4 AESTSTTNPLKITFNGPAKH--FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
A S ++ ++ + PA+ +T+A+PIGNGRLGAMV+G E + LNE+T+W+G
Sbjct: 14 ASLASASDNTRLWYKTPAQSSAWTNALPIGNGRLGAMVFGIPLQERIALNEETIWSGGQQ 73
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATA-ASVKLFGHPADV--YQLLGDIELEFDDSHLK 118
D D+P+ +S+VR L+ G+ +A A++ + G P YQ LGD+++ FD +
Sbjct: 74 DRIGQDSPQTVSEVRDLLAQGRAGDAEKLANMGMMGTPQSCRNYQPLGDMDIFFDGT-TG 132
Query: 119 YAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD 178
Y TY+R LD++TA A V++ V + RE F S PD V V + + SG LSF + +
Sbjct: 133 YDNATYKRWLDVDTALAGVQFQVNGTLYEREMFVSAPDDVFVHHLKATGSGKLSFQIRV- 191
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ GN E G DP I F+ L ++ SD G + L
Sbjct: 192 ----HRPDKGGNEAADHEWNANGLAYMTGGAGGIDP--IVFTTALAVQ-SD--GHVKNL- 241
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
+ VE + A + AS+S+ D + S +Q R +Y +L RH+
Sbjct: 242 GPFIVVENATEATAIFAASTSY---------RHNDTRAAVESTIQQARQHTYEELRQRHI 292
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFG 357
DY L++ + LS S+ ++P+ R+ + + DP+L L + +G
Sbjct: 293 ADYAPLYNASVLDLS----------GSDLKASSLPTDARINATREGASDPALTALSYNYG 342
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLLI+SSR G +NLQGIWN++ +P W S VNINL+MNYW + +LS EPLFD
Sbjct: 343 RYLLIASSRAGNLPSNLQGIWNKEFAPQWGSKYTVNINLQMNYWPAEVTSLSSLHEPLFD 402
Query: 418 FLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
L + +TD EH
Sbjct: 403 LLDLM---------------------RTD-----------------------------EH 412
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETNPSTSPEHEFIAPDGKL 533
Y YT D+ FL + + E A F LD L I G YL TNPS SPE+ ++ D
Sbjct: 413 YWYTGDKAFLASKLDVVTEAIA-FYLDILQPYSINGTQ-YLVTNPSVSPENSYLDADNNT 470
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKN--EDALVEKVLKSLPRLRPTKIAED--GS 589
+ T D+ I+ E+F+ ++A L + + ++ + +L P + ++ G+
Sbjct: 471 YHFDIAPTCDIEILNELFTNYLNAVATLPNYTVDSTFLTRIRDTQAQLPPYRYSKRYPGT 530
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNP----DLCKAAEKTLQKR---GEEG 642
+ EW QD++ E+ HRH+SHL+ L+PG I P L AA TL+ R G
Sbjct: 531 LQEWMQDYEQAELGHRHVSHLYALYPGTQILPPGAPGYDAKLFNAAAGTLEGRLSHNGAG 590
Query: 643 PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP-FQIDA 701
GWS W +ARL + V + FN +Y+NL + FQID
Sbjct: 591 TGWSRAWTINWYARLQNSTAVAGNVYQFFNT-----------SVYNNLMDVNEGVFQIDG 639
Query: 702 NFGFTAAVAEMLVQSTLND------LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
N GF + VAE L+QS + D ++LLP LP ++W++G V GL ARGG I W DG
Sbjct: 640 NLGFVSGVAEALIQSHIVDAEGVREVWLLPVLP-EQWNTGSVNGLAARGGFVFDITWADG 698
Query: 756 DLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFN 797
+ ++ + S +K T+ ++ AG + F+
Sbjct: 699 AISKMKMESRVGGTVVLRYKGGSGNSTTTRLETKAGDVKEFD 740
>gi|418098974|ref|ZP_12736071.1| fibronectin type III domain protein [Streptococcus pneumoniae
6901-05]
gi|353768956|gb|EHD49478.1| fibronectin type III domain protein [Streptococcus pneumoniae
6901-05]
Length = 795
Score = 338 bits (866), Expect = 8e-90, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 377/774 (48%), Gaps = 89/774 (11%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P +Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGIYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN D H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 418
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 419 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 475
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYL-ETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 476 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 527 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 585
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 586 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 644
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 645 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 693
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 694 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 746
>gi|385261489|ref|ZP_10039611.1| Gram-positive signal peptide protein, YSIRK family, partial
[Streptococcus sp. SK643]
gi|385193017|gb|EIF40405.1| Gram-positive signal peptide protein, YSIRK family, partial
[Streptococcus sp. SK643]
Length = 1474
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 255/791 (32%), Positives = 387/791 (48%), Gaps = 115/791 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 152 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPRPDSTDYNGGNYQ--ERYKVLAEIRK 209
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 210 ALEEGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLENVTDYHRGLDITE 269
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV--SL-DSLLDNHSY--- 186
AT Y+ F RE FSS PD V VT ++ L F V SL + LL N +Y
Sbjct: 270 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTQKGDKKLDFTVWNSLTEDLLANGNYSAE 329
Query: 187 ---------VNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
N I+++G K N G++F++ L IK G ++
Sbjct: 330 YSHYKSGHVTTDPNGILLKGTV-------KDN------GLRFASYLGIKTD---GKVTVH 373
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
ED L V G+ +A LLL + ++F NP ++ +KD E +++ R Y L
Sbjct: 374 EDS-LTVTGASYATLLLSSKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAARGKDYETLK 429
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++++ + L EL F
Sbjct: 430 KNHIKDYQSLFNRVKLNLGGSNTAQTT-------------KEALQTYNPTKGQKLEELFF 476
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 477 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 536
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 537 KPMINYIDDMRYYGRIAAKEYAGIKSKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 591
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG-YLETNPST 520
P AW+ +++++Y +T D +L+++ YP+L+ A F +L D ++PS
Sbjct: 592 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKDSDRWVSSPSY 651
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L+
Sbjct: 652 SPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLK 701
Query: 581 PTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKT 634
P I ++G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA T
Sbjct: 702 PLHINKEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARAT 760
Query: 635 LQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH 694
L RG+ G GWS K LWARL D A+R++ E NL+ H
Sbjct: 761 LNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDTH 809
Query: 695 PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WKD
Sbjct: 810 APFQIDGNFGATSGIAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWKD 868
Query: 755 GDLHEVGIYSN 765
+L + SN
Sbjct: 869 KNLQSLSFLSN 879
>gi|419779913|ref|ZP_14305766.1| gram positive anchor [Streptococcus oralis SK100]
gi|383185738|gb|EIC78231.1| gram positive anchor [Streptococcus oralis SK100]
Length = 1707
Score = 337 bits (865), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 255/792 (32%), Positives = 392/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y N K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYKN--RYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRNLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++ ++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKLASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++ + ++ L EL F
Sbjct: 420 KAHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDKD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I +G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINNEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|168493554|ref|ZP_02717697.1| alpha-fucosidase [Streptococcus pneumoniae CDC3059-06]
gi|418074476|ref|ZP_12711729.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11184]
gi|418087331|ref|ZP_12724500.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47033]
gi|418090009|ref|ZP_12727163.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43265]
gi|418105755|ref|ZP_12742811.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44500]
gi|418115170|ref|ZP_12752156.1| fibronectin type III domain protein [Streptococcus pneumoniae
5787-06]
gi|418117327|ref|ZP_12754296.1| fibronectin type III domain protein [Streptococcus pneumoniae
6963-05]
gi|418217095|ref|ZP_12843775.1| fibronectin type III domain protein [Streptococcus pneumoniae
Netherlands15B-37]
gi|419432029|ref|ZP_13972162.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP05]
gi|419433932|ref|ZP_13974050.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40183]
gi|419440837|ref|ZP_13980882.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40410]
gi|419464830|ref|ZP_14004721.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA04175]
gi|419469454|ref|ZP_14009322.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA06083]
gi|419498019|ref|ZP_14037726.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47522]
gi|421281641|ref|ZP_15732438.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA04672]
gi|183576395|gb|EDT96923.1| alpha-fucosidase [Streptococcus pneumoniae CDC3059-06]
gi|353748545|gb|EHD29197.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA11184]
gi|353758347|gb|EHD38939.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47033]
gi|353761200|gb|EHD41772.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43265]
gi|353775931|gb|EHD56410.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA44500]
gi|353785254|gb|EHD65673.1| fibronectin type III domain protein [Streptococcus pneumoniae
5787-06]
gi|353788008|gb|EHD68406.1| fibronectin type III domain protein [Streptococcus pneumoniae
6963-05]
gi|353870368|gb|EHE50241.1| fibronectin type III domain protein [Streptococcus pneumoniae
Netherlands15B-37]
gi|379536430|gb|EHZ01616.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA04175]
gi|379544258|gb|EHZ09403.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA06083]
gi|379576933|gb|EHZ41857.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40183]
gi|379577907|gb|EHZ42824.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA40410]
gi|379598852|gb|EHZ63637.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47522]
gi|379629110|gb|EHZ93711.1| fibronectin type III domain protein [Streptococcus pneumoniae
EU-NP05]
gi|395880906|gb|EJG91957.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA04672]
Length = 795
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 376/774 (48%), Gaps = 89/774 (11%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN D H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 418
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 419 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 475
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYL-ETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 476 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 527 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 585
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 586 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 644
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 645 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 693
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 694 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 746
>gi|306830121|ref|ZP_07463305.1| possible alpha-L-fucosidase [Streptococcus mitis ATCC 6249]
gi|304427647|gb|EFM30743.1| possible alpha-L-fucosidase [Streptococcus mitis ATCC 6249]
Length = 1685
Score = 337 bits (864), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 259/813 (31%), Positives = 399/813 (49%), Gaps = 107/813 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 199 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 259 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGDYSWE 318
Query: 184 -HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+Y NG+ G I K D+ G++F++ L IK GT++ ++++ L
Sbjct: 319 YSNYKNGHVTTDEHG------ILLKGTVKDN--GLKFASYLGIKTD---GTVT-VQNETL 366
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
V G+ +A L L A ++F NP ++ +KD E +++ + Y L H+
Sbjct: 367 TVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKKDHIK 423
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + L N T + E ++S+ + L EL FQ+GRY
Sbjct: 424 DYQSLFNRVKLNLGG-------------NKTTQTTKEALQSYNPSKGQKLEELFFQYGRY 470
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E +P+ +
Sbjct: 471 LLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETAKPMIN 530
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 531 YIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 585
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHE 525
AW+ +++++Y +T D +L+++ YP+L+ A F +L + ++PS SPEH
Sbjct: 586 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKESDRWVSSPSYSPEH- 644
Query: 526 FIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
++ +T D +++ ++F + A L+ ++D LV +V +L+P I
Sbjct: 645 --------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLKPLHIN 695
Query: 586 EDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRG 639
+G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA TL RG
Sbjct: 696 NEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARATLNHRG 754
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQI 699
+ G GWS K LWARL D A+R++ E NL+ H PFQI
Sbjct: 755 DGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDTHAPFQI 803
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
D NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WKD +L
Sbjct: 804 DGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLIARGNFEVSMKWKDKNLQS 862
Query: 760 VGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGK 792
+ SN + + + + VKVN A K
Sbjct: 863 LSFLSNVGGDLVVDYPNIE--ASQVKVNGKAVK 893
>gi|419781688|ref|ZP_14307504.1| gram positive anchor [Streptococcus oralis SK610]
gi|383183996|gb|EIC76526.1| gram positive anchor [Streptococcus oralis SK610]
Length = 1707
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 253/791 (31%), Positives = 393/791 (49%), Gaps = 115/791 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++ + ++ L EL F
Sbjct: 420 NAHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQGYNPEKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPST 520
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKESDRWVSSPSY 641
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR 580
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L+
Sbjct: 642 SPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLK 691
Query: 581 PTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKT 634
P I ++G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA T
Sbjct: 692 PLHINKEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARAT 750
Query: 635 LQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH 694
L RG+ G GWS K LWARL D A+R++ E NL+ H
Sbjct: 751 LNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDTH 799
Query: 695 PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WKD
Sbjct: 800 APFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWKD 858
Query: 755 GDLHEVGIYSN 765
+L + SN
Sbjct: 859 KNLQSLSFLSN 869
>gi|418134701|ref|ZP_12771558.1| hypothetical protein SPAR23_0971 [Streptococcus pneumoniae GA11426]
gi|419535112|ref|ZP_14074611.1| hypothetical protein SPAR46_1654 [Streptococcus pneumoniae GA17457]
gi|353901938|gb|EHE77468.1| hypothetical protein SPAR23_0971 [Streptococcus pneumoniae GA11426]
gi|379563273|gb|EHZ28277.1| hypothetical protein SPAR46_1654 [Streptococcus pneumoniae GA17457]
Length = 770
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 376/774 (48%), Gaps = 89/774 (11%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN D H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 418
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 419 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 475
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYL-ETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 476 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 526
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 527 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 585
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 586 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 644
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 645 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 693
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 694 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 746
>gi|421488290|ref|ZP_15935682.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
oralis SK304]
gi|400368666|gb|EJP21674.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
oralis SK304]
Length = 1687
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 260/819 (31%), Positives = 404/819 (49%), Gaps = 119/819 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 199 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITD 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 259 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 318
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 319 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 361
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINPSDS-KKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP S +KD E +++ + Y L
Sbjct: 362 QDETLTVTGASYATLYLSAKTNFAQ---NPKTSYRKDIDLEKTVKGIVEAAKAKDYETLK 418
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L N + E ++ + ++ L EL F
Sbjct: 419 KAHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 465
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 466 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 525
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 526 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 580
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 581 WSPAANAWMMQNVYDYYKFTKDESYLKEKIYPMLKETAKFWNSFLHYDKTSDRWV-SSPS 639
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L+ ++D LV +V +L
Sbjct: 640 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKL 689
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA
Sbjct: 690 KPLHINKEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 748
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LW RL D A+R++ E NL+
Sbjct: 749 TLNHRGDGGTGWSKANKINLWVRLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 797
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 798 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLIARGNFEVSMKWK 856
Query: 754 DGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGK 792
D +L + SN + + + + VKVN A K
Sbjct: 857 DKNLQSLSFLSNVGGDLVVDYPNIE--ASQVKVNGKAVK 893
>gi|418174043|ref|ZP_12810655.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41277]
gi|353837999|gb|EHE18080.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41277]
Length = 774
Score = 337 bits (863), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 245/774 (31%), Positives = 376/774 (48%), Gaps = 89/774 (11%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+PIGNG LGA V+G + SE ++ NE +LW+G P DY D L+++R
Sbjct: 6 EALPIGNGSLGAKVFGLIGSERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQA 65
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 66 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 125
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F RE F+S PD ++V + +L F + L D S +
Sbjct: 126 LATTSYVYKGTRFEREAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 185
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 186 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 239
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 240 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 299
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 300 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 346
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN D H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 347 DALPANLQGVWNSDY--------HLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG- 397
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 398 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 454
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 455 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 505
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 506 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 564
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K
Sbjct: 565 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSEANK 623
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 624 INLWARLGDGNRAHKLLA-----------EQLKTSTLQNLWCSHPPFQIDGNFGATSGMA 672
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 673 EMLLQSHAAYLVPLAALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 725
>gi|417794521|ref|ZP_12441772.1| gram positive anchor [Streptococcus oralis SK255]
gi|334269196|gb|EGL87623.1| gram positive anchor [Streptococcus oralis SK255]
Length = 1707
Score = 336 bits (862), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 259/817 (31%), Positives = 404/817 (49%), Gaps = 115/817 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSLV 79
A+P+GNG +GA V+G + E ++ NE TLW+G P DY D K L+++R +
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSSDYNGGNYKDRYKVLAEIRKAL 201
Query: 80 DSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNTAT 134
+ G +A + + P + Y GDI + F++ T Y R LD+ AT
Sbjct: 202 EDGDRQKAKRLAERNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYYRGLDITEAT 261
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN-------H 184
Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 262 TTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWEYS 321
Query: 185 SYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+Y NG+ N I+++G K N G++F++ L IK GT++ +++
Sbjct: 322 NYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIKTD---GTVT-VQN 364
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTR 296
+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 365 ETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKKA 421
Query: 297 HLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQF 356
H+ DYQ LF+RV + L N + E ++ + ++ L EL FQ+
Sbjct: 422 HIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFFQY 468
Query: 357 GRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E +P
Sbjct: 469 GRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMNNLAETAKP 528
Query: 415 LFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 529 MINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWS 583
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTS 521
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS S
Sbjct: 584 PAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPSYS 642
Query: 522 PEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP 581
PEH ++ +T D +++ ++F + A L+ ++D LV +V +L+P
Sbjct: 643 PEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLKP 692
Query: 582 TKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
I ++G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA TL
Sbjct: 693 LHINKEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARATL 751
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP 695
RG+ G GWS K LWARL D A+R++ E NL+ H
Sbjct: 752 NHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDTHA 800
Query: 696 PFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WKD
Sbjct: 801 PFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWKDK 859
Query: 756 DLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGK 792
+L + SN + + + + VKVN A K
Sbjct: 860 NLQSLSFLSNVGGDLVVDYPNIE--ASQVKVNGKAVK 894
>gi|401683949|ref|ZP_10815833.1| LPXTG cell wall anchor domain protein [Streptococcus sp. BS35b]
gi|400186628|gb|EJO20836.1| LPXTG cell wall anchor domain protein [Streptococcus sp. BS35b]
Length = 1687
Score = 336 bits (861), Expect = 3e-89, Method: Compositional matrix adjust.
Identities = 255/792 (32%), Positives = 392/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 122 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPRPDSTDYNGGNYK--DRYKVLAEIRK 179
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 180 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 239
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + L F N + LL N
Sbjct: 240 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKKLDFTLWNSLTEDLLANGEYSWE 299
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 300 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 342
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 343 QDETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 399
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++S+ + L EL F
Sbjct: 400 QDHIKDYQNLFNRVKLNLGGSKTAQTT-------------KEALQSYNPSKGQKLEELFF 446
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 447 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 506
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 507 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 561
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 562 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 620
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 621 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 670
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA
Sbjct: 671 KPLHINKEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 729
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 730 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 778
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL RG VS+ WK
Sbjct: 779 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLVTRGNFEVSMKWK 837
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 838 DKNLQSLSFLSN 849
>gi|358383778|gb|EHK21440.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
Length = 794
Score = 335 bits (860), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 244/801 (30%), Positives = 366/801 (45%), Gaps = 90/801 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
T +PIGN RLGA ++GG +E + +NEDT+W G D + AL VR ++ +
Sbjct: 39 TGVLPIGNSRLGAAIFGG-GNEVVTINEDTIWDGPLQDRIPANGLAALPKVRQMLMANNL 97
Query: 85 AEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
+A + PA + G++ L F Y R LD + V Y+
Sbjct: 98 TDAGNLVLSQM-TPASCCERQFSYFGNLNLNFGHGS---GISNYIRSLDTRQGNSSVSYT 153
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDNHSYVNGN-NQIIME 196
V +TRE+ +SNPD VI + + S++G+LS + + ++++L N + +G N + ++
Sbjct: 154 FNGVTYTREYVASNPDGVIAARYTASKAGALSVSATFSRINNILSNVASTSGGVNSVTLQ 213
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G G+ P I F+ + + T SA L +
Sbjct: 214 GTS-GQSTNP----------ILFTG--KARFVASGATFSA-----------SGGTLTITG 249
Query: 257 SSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+++ D F++ + + PT+ +++A L + + + ++ + D L R +I
Sbjct: 250 ATTID-VFVDVETNYRYPTASALAAEVDNKLNAAVSKGFPAVHNSAIADSSALLGRANIN 308
Query: 312 LSRSPK---DIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRP 367
L SP D+ TD +RVKS ++ DP L+ L + +GR+LL++SSR
Sbjct: 309 LGTSPNGLADLSTD-------------QRVKSARSAFNDPQLIVLAWNYGRHLLVASSRD 355
Query: 368 GTQVA----NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
+ NLQG+WN S W +NIN EMN W + NL E Q PLFD L
Sbjct: 356 TSAAIDMPPNLQGVWNNATSAPWGGKFTININTEMNLWPAGQTNLIETQLPLFDLLKVAQ 415
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G + AQ Y +G V HH D+W + +WPMG WL H+ E Y +T D
Sbjct: 416 PRGQEMAQKLYGCNGTVFHHNLDVWGDPAPTDNYTSSTMWPMGATWLVQHMMEQYRFTGD 475
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC-----VSY 538
+FL AYP L + FL + G T PS SPE+ ++ P G +
Sbjct: 476 LNFLRNTAYPYLLDISKFLQCYTFT-WQGNRVTGPSLSPENTYVVPSGANKAGTQEPMDM 534
Query: 539 SSTMDMAIIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
+ MD ++R+V ++I+ AA L + D+ V+ LP +R +I G I+EW ++
Sbjct: 535 APEMDNQLMRDVMTSILEAAAALGISSSDSNVQAATNFLPLIRTPRIGSYGQILEWRSEY 594
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALW 654
+ + HRHLS L+GL PG + N L AA+ L R G GWS TW +
Sbjct: 595 GETDPGHRHLSPLYGLHPGSQFSPLVNSTLSAAAKALLDHRVAGGSGSTGWSRTWLLNQY 654
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV 714
ARL ++ + F + + GG FQID NFGFT+ V EML+
Sbjct: 655 ARLFSGADVWKHIVAWFATYPTPNLWNTNGG---------STFQIDGNFGFTSGVTEMLL 705
Query: 715 QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSF 774
QS ++LLPALP +G V+GL ARGG V I W+ G + S
Sbjct: 706 QSQTGTVHLLPALPGSNLPTGNVRGLLARGGFQVDIDWQSGAFKSATVTSTRGGQ----L 761
Query: 775 KTLHYRGTSVKVNLSAGKIYT 795
K G S KVN G YT
Sbjct: 762 KLRVANGQSFKVN---GATYT 779
>gi|415750047|ref|ZP_11477991.1| fibronectin type III domain protein [Streptococcus pneumoniae SV35]
gi|381318341|gb|EIC59066.1| fibronectin type III domain protein [Streptococcus pneumoniae SV35]
Length = 803
Score = 335 bits (860), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 243/774 (31%), Positives = 378/774 (48%), Gaps = 81/774 (10%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP----GDYTN---PDAPKALSDVRSL 78
+A+ IGNG LGA V+G + +E ++ NE +LW+G P DY D L+++R
Sbjct: 27 EALLIGNGSLGAKVFGLIGAERIQFNEKSLWSGGPLPDSSDYQGGNLQDQYVFLAEIRQD 86
Query: 79 VDSGQYAEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEET-YRRELDLNTA 133
++ Y A + + P Y GDI +EF ++ T Y+R+L+++ A
Sbjct: 87 LEKRDYNLAKELAEQHLIGPKTSQYGTYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKA 146
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A Y F R+ F+S PD ++V + +L F + L D S +
Sbjct: 147 LATTSYVYKGTRFERKAFASFPDDLLVQCFTKEGLETLDFTIELSLTCDLASDGKYEQEK 206
Query: 194 IMEGRCP----GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
C I K D+ ++F++ L + G I D+ +++ G+ +
Sbjct: 207 SDYKECKLDITDSHILMKGRVKDND--LRFASYLAWETD---GDIRVWSDR-VQISGASY 260
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L L A + F + K D + + + + + Y+ L +RH++DYQ LF RV
Sbjct: 261 ANLFLAAKTDFAQNPASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQ 320
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--P 367
+ L E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P
Sbjct: 321 LDL-------------EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCP 367
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
ANLQG+WN +P W+S H+N+NL+MNYW + NL E P+ +++ L + G
Sbjct: 368 DALPANLQGVWNAVDNPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG- 426
Query: 428 KTAQVNYLA--------SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEH 477
+ A V Y +GW++H + W D W P AW+ ++E
Sbjct: 427 RLAAVKYAGIVSQKGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEA 483
Query: 478 YNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACV 536
Y++ D+D+L ++ YP+L F +L + ++PS SPEH +
Sbjct: 484 YSFYRDQDYLREKIYPMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPI 534
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
S +T D ++I ++F I AA+ L +ED L E KS L P +I + G I EW ++
Sbjct: 535 SIGNTYDQSLIWQLFHDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEE 593
Query: 597 ----FKDPEV--HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
F++ +V HRH SHL GL+PG+ + K + +A +L RG+ G GWS K
Sbjct: 594 EEQYFQNEKVEAQHRHASHLVGLYPGNLFSY-KGQEYIEAVRASLNDRGDGGTGWSKANK 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWARL D A++++ + + NL+ +HPPFQID NFG T+ +A
Sbjct: 653 INLWARLGDGNRAHKLLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMA 701
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS L L ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 702 EMLLQSHAAYLVPLAALP-DAWSTGFVSGLMARGHFEVSMSWEDKKLLQLTILS 754
>gi|417915380|ref|ZP_12558993.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
mitis bv. 2 str. SK95]
gi|342834366|gb|EGU68637.1| Gram-positive signal peptide protein, YSIRK family [Streptococcus
mitis bv. 2 str. SK95]
Length = 1686
Score = 335 bits (860), Expect = 5e-89, Method: Compositional matrix adjust.
Identities = 251/792 (31%), Positives = 394/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 141 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 198
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 199 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRSLDITE 258
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 259 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 318
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 319 YSNYKNGHVTTDENGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 361
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+++ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 362 QNETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYKTLK 418
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L N + E ++ + ++ L EL F
Sbjct: 419 KAHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 465
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 466 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 525
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 526 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 580
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 581 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDQASDRWV-SSPS 639
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV ++ +L
Sbjct: 640 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDKD-LVTEIKAKFDKL 689
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E HHRH+SHL GLFPG T+ + + +AA
Sbjct: 690 KPLHINKEGRIKEWYEEDSPQFTNEGIENHHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 748
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 749 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 797
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEM++QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 798 HAPFQIDGNFGATSGMAEMILQSHTGYIAPLPALP-DAWKDGQVSGLVARGNFEVSMKWK 856
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 857 DKNLQSLSFLSN 868
>gi|358463765|ref|ZP_09173746.1| signal peptide protein, YSIRK family [Streptococcus sp. oral taxon
058 str. F0407]
gi|357067821|gb|EHI77905.1| signal peptide protein, YSIRK family [Streptococcus sp. oral taxon
058 str. F0407]
Length = 1707
Score = 335 bits (858), Expect = 7e-89, Method: Compositional matrix adjust.
Identities = 254/792 (32%), Positives = 393/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEDGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRSLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 319
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 320 YSNYKNGHVTTDENGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 362
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+++ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 363 QNETLTVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 419
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L S T E ++ + + L EL F
Sbjct: 420 KDHIKDYQSLFNRVKLNLGGSKTAQTT-------------KEALQGYNPSKGQKLEELFF 466
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 467 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 526
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 527 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 581
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ A F +L + D ++ ++PS
Sbjct: 582 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETAKFWNSFLHYDKASDRWV-SSPS 640
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 641 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 690
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA
Sbjct: 691 KPLHINKEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDRAEYLEAARA 749
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 750 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 798
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 799 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLIARGNFEVSMKWK 857
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 858 DKNLQSLSFLSN 869
>gi|414159134|ref|ZP_11415425.1| YSIRK family Gram-positive signal peptide [Streptococcus sp. F0441]
gi|410868266|gb|EKS16233.1| YSIRK family Gram-positive signal peptide [Streptococcus sp. F0441]
Length = 1707
Score = 335 bits (858), Expect = 8e-89, Method: Compositional matrix adjust.
Identities = 252/787 (32%), Positives = 391/787 (49%), Gaps = 107/787 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y D K L+++R
Sbjct: 142 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--DRYKVLAEIRK 199
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
++ G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 200 ALEGGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 259
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF----NVSLDSLLDN----- 183
AT Y+ F RE FSS PD V VT ++ + +L F N++ D L +
Sbjct: 260 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNNLTEDLLANGDYSWE 319
Query: 184 -HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+Y NG+ G I K D+ G++F++ L IK GT++ ++++ L
Sbjct: 320 YSNYKNGHVTTDEHG------ILLKGTVKDN--GLKFASYLGIKTD---GTVT-VQNETL 367
Query: 243 KVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLYTRHLD 299
V G+ +A L L A ++F NP ++ +KD E +++ + Y L H+
Sbjct: 368 TVTGASYATLYLSAKTNFAQ---NPKTNYRKDIDLEKTVKGIVEAAKAKDYETLKQDHIK 424
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ LF+RV + L S T E ++S+ + L EL FQ+GRY
Sbjct: 425 DYQSLFNRVKLNLGGSKTAQTT-------------KEALQSYNPSKGQKLEELFFQYGRY 471
Query: 360 LLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
LLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E +P+ +
Sbjct: 472 LLISSSRDKTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETAKPMIN 531
Query: 418 FLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMG 466
++ + G SK Q N GW++H + + ++ W P
Sbjct: 532 YIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWGWSPAA 586
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPSTSPEH 524
AW+ +++++Y +T D +L+++ YP+L+ F +L + D ++ ++PS SPEH
Sbjct: 587 NAWMMQNVYDYYKFTKDETYLKEKIYPMLKETTKFWNSFLHYDQASDRWV-SSPSYSPEH 645
Query: 525 EFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKI 584
++ +T D +++ ++F + A L+ ++D LV +V +L+P I
Sbjct: 646 ---------GTITIGNTFDQSLVWQLFHDYMEVANHLKVDQD-LVTEVKAKFDKLKPLHI 695
Query: 585 AEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
+G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA TL R
Sbjct: 696 NNEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARATLNHR 754
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
G+ G GWS K LWARL D A+R++ E NL+ H PFQ
Sbjct: 755 GDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDTHAPFQ 803
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WKD +L
Sbjct: 804 IDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLIARGNFEVSMKWKDKNLQ 862
Query: 759 EVGIYSN 765
+ SN
Sbjct: 863 SLSFLSN 869
>gi|336439275|ref|ZP_08618890.1| hypothetical protein HMPREF0990_01284 [Lachnospiraceae bacterium
1_1_57FAA]
gi|336016192|gb|EGN45981.1| hypothetical protein HMPREF0990_01284 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1977
Score = 334 bits (857), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 253/853 (29%), Positives = 407/853 (47%), Gaps = 125/853 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY----------TNPDAPKALSDVR 76
A+P+GN +GA V+GGV +E ++LNE +LW+G P D + K ++ ++
Sbjct: 68 ALPLGNSAIGASVFGGVQTERIQLNEKSLWSGGPSDSRPEYNGGNIESKGQNGKVMAQLK 127
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
+ SGQ ++ A +L G D Y G++ L+F + K Y R+LD
Sbjct: 128 EKLKSGQGFDSNLAG-QLIGVSDDAGVQGYGYYLSYGNMYLDFKNV-TKNNVSGYSRDLD 185
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V Y + +TRE+F S PD V+VT+++ ++ G+L F+V ++ +
Sbjct: 186 LRTAVAGVNYDLNGAHYTRENFVSYPDNVLVTRLTATDGGTLDFDVRVEP---DEEKGGS 242
Query: 190 NNQIIME--GRCPGKRIPPKANANDDP---KGIQFSAILEIKISDDRGTISALED--KKL 242
NQ + R K++ A A D ++FS+ K+ D GT ++D K
Sbjct: 243 QNQPGADSYARTFDKKVSDNAIAIDGQLTDNQLKFSSY--TKVIKDDGTAGQIKDDSKNG 300
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL---------QSIRNLSYSDL 293
K+ S + ++ S D P + T E ++AL ++ Y L
Sbjct: 301 KITVSGAKAITIITSIGTDYKNDYPK-YRTGETKEQLAALVKGYVSGAEAKVKAGGYETL 359
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H++DY +F R+ + + ++ D TD E A + + E L +L
Sbjct: 360 KEDHVNDYDHIFGRLDLNIGQAVSDKTTDKLLE--------AYKKGTASETEKRYLELML 411
Query: 354 FQFGRYLLISSSRP-------------GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
FQ+GRYL + SSR T +NLQGIW + W S H+N+NL+MNY
Sbjct: 412 FQYGRYLTMGSSRETPVNEDGTKNERRATLPSNLQGIWVGANNSAWHSDYHMNVNLQMNY 471
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWVIHHKTDIWAKS 451
W + N++EC EPL D++ L G TA++ Y +G++ H + + + +
Sbjct: 472 WPTYTTNMAECAEPLIDYVDSLREPGRITAKI-YAGVESTEANPENGFMAHTQNNPYGWT 530
Query: 452 SADRGKVV-WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH 510
+ G V W P G W+ + WE+Y +T D ++++ YP+++ A+ L+ +
Sbjct: 531 NP--GWVFDWGWSPAGVPWILQNCWEYYEFTGDTEYMQTHIYPMMKEEATLYDQMLMRDN 588
Query: 511 DGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
DG L + PS SPEH + +T + ++I +++ I+AAE L +E A V
Sbjct: 589 DGKLVSVPSYSPEH---------GPRTAGNTYEHSLIWQLYEDTITAAETLGVDE-AKVA 638
Query: 571 KVLKSLPRLR-PTKIAEDGSIMEWAQDFK----------DPEVHHRHLSHLFGLFPGHTI 619
+ K+ L+ P ++ G I EW + HRH+SH+ GL+PG I
Sbjct: 639 QWKKNQADLKGPIEVGASGQIKEWYNETTLNTDENGNQMGQGYGHRHISHMLGLYPGDLI 698
Query: 620 TIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHE 679
++ + AA+ ++Q R +E GW++ + A WARL + + AY ++ ++
Sbjct: 699 A--QSDEWLAAAKVSMQNRTDETTGWAMAQRVATWARLAEGDKAYDVLSKMVT------- 749
Query: 680 KHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKG 739
G + +NL+ H PFQID NFG+TAAVAEMLVQS + + L+PA+P W +G VKG
Sbjct: 750 ---SGKIMTNLWDTHAPFQIDGNFGYTAAVAEMLVQSNMGHIDLMPAVP-KAWGTGNVKG 805
Query: 740 LKARGGETVSICWKDGDLHEVGIYSN--------YSN--------NDHDSFKTLHYRGTS 783
L ARG V + W D L E I+SN Y+N +D + +
Sbjct: 806 LLARGNFAVDMAWADNKLTEASIHSNNGGEAVVQYANLSLATVKDSDGNLVEITPVTSDR 865
Query: 784 VKVNLSAGKIYTF 796
+ N AGK YT
Sbjct: 866 ISFNTEAGKTYTI 878
>gi|317500980|ref|ZP_07959190.1| hypothetical protein HMPREF1026_01133 [Lachnospiraceae bacterium
8_1_57FAA]
gi|316897683|gb|EFV19744.1| hypothetical protein HMPREF1026_01133 [Lachnospiraceae bacterium
8_1_57FAA]
Length = 1966
Score = 334 bits (857), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 254/853 (29%), Positives = 409/853 (47%), Gaps = 125/853 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY----------TNPDAPKALSDVR 76
A+P+GN +GA V+GGV +E ++LNE +LW+G P D + K ++ ++
Sbjct: 68 ALPLGNSAIGASVFGGVQTERIQLNEKSLWSGGPSDSRPEYNGGNIESKGQNGKVMAQLK 127
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
+ SGQ ++ A +L G D Y G++ L+F + K Y R+LD
Sbjct: 128 EKLKSGQGFDSNLAG-QLIGVSDDAGVQGYGYYLSYGNMYLDFKNV-TKNNVSGYSRDLD 185
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V Y + +TRE+F S PD V+VT+++ ++ G+L F+V ++ +
Sbjct: 186 LRTAVAGVNYDLNGAHYTRENFVSYPDNVLVTRLTATDGGTLDFDVRVEP---DEEKGGS 242
Query: 190 NNQIIME--GRCPGKRIPPKANANDDP---KGIQFSAILEIKISDDRGTISALED--KKL 242
NQ + R K++ A A D ++FS+ ++ I DD GT ++D K
Sbjct: 243 QNQPGADSYARTFDKKVSDNAIAIDGQLTDNQLKFSSYTKV-IKDD-GTAGQIKDDSKNG 300
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL---------QSIRNLSYSDL 293
K+ S + ++ S D P + T E ++AL ++ Y L
Sbjct: 301 KITVSGAKAITIITSIGTDYKNDYPK-YRTGETKEQLAALVKGYVSGAEAKVKAGGYETL 359
Query: 294 YTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
H++DY +F R+ + + ++ D TD E A + + E L +L
Sbjct: 360 KEDHVNDYDHIFGRLDLNIGQAVSDKTTDKLLE--------AYKKGTASETEKRYLELML 411
Query: 354 FQFGRYLLISSSRP-------------GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
FQ+GRYL + SSR T +NLQGIW + W S H+N+NL+MNY
Sbjct: 412 FQYGRYLTMGSSRETPVNEDGTKNERRATLPSNLQGIWVGANNSAWHSDYHMNVNLQMNY 471
Query: 401 WQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWVIHHKTDIWAKS 451
W + N++EC EPL D++ L G TA++ Y +G++ H + + + +
Sbjct: 472 WPTYTTNMAECAEPLIDYVDSLREPGRITAKI-YAGVESTEANPENGFMAHTQNNPYGWT 530
Query: 452 SADRGKVV-WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH 510
+ G V W P G W+ + WE+Y +T D ++++ YP+++ A+ L+ +
Sbjct: 531 NP--GWVFDWGWSPAGVPWILQNCWEYYEFTGDTEYMQTHIYPMMKEEATLYDQMLMRDN 588
Query: 511 DGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVE 570
DG L + PS SPEH + +T + ++I +++ I+AAE L +E A V
Sbjct: 589 DGKLVSVPSYSPEH---------GPRTAGNTYEHSLIWQLYEDTITAAETLGVDE-AKVA 638
Query: 571 KVLKSLPRLR-PTKIAEDGSIMEWAQDFK----------DPEVHHRHLSHLFGLFPGHTI 619
+ K+ L+ P ++ G I EW + HRH+SH+ GL+PG I
Sbjct: 639 QWKKNQADLKGPIEVGASGQIKEWYNETTLNTDENGNQMGQGYGHRHISHMLGLYPGDLI 698
Query: 620 TIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHE 679
++ + AA+ ++Q R +E GW++ + A WARL + + AY ++ ++
Sbjct: 699 A--QSDEWLAAAKVSMQNRTDETTGWAMAQRVATWARLAEGDKAYDVLSKMVT------- 749
Query: 680 KHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKG 739
G + +NL+ H PFQID NFG+TAAVAEMLVQS + + L+PA+P W +G VKG
Sbjct: 750 ---SGKIMTNLWDTHAPFQIDGNFGYTAAVAEMLVQSNMGHIDLMPAVP-KAWGTGNVKG 805
Query: 740 LKARGGETVSICWKDGDLHEVGIYSN--------YSN--------NDHDSFKTLHYRGTS 783
L ARG V + W D L E I+SN Y+N +D + +
Sbjct: 806 LLARGNFAVDMAWADNKLTEASIHSNNGGEAVVQYANLSLATVKDSDGNLVEITPVTSDR 865
Query: 784 VKVNLSAGKIYTF 796
+ N AGK YT
Sbjct: 866 ISFNTEAGKTYTI 878
>gi|417934856|ref|ZP_12578176.1| gram positive anchor [Streptococcus mitis bv. 2 str. F0392]
gi|340771426|gb|EGR93941.1| gram positive anchor [Streptococcus mitis bv. 2 str. F0392]
Length = 1668
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 252/792 (31%), Positives = 394/792 (49%), Gaps = 117/792 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
A+P+GNG +GA V+G + E ++ NE TLW+G P G+Y + K L+++R
Sbjct: 103 ALPVGNGEMGAKVFGLIGEERIQYNEKTLWSGGPQPDSTDYNGGNYK--ERYKVLAEIRK 160
Query: 78 LVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEET-YRRELDLNT 132
+++G +A + + P + Y GDI + F++ T Y R LD+
Sbjct: 161 ALEAGDRQKAKQLAEQNLVGPNNAQYGRYLAFGDIFMVFNNQKKGLDTVTDYHRGLDITE 220
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF---NVSLDSLLDN------ 183
AT Y+ F RE FSS PD V VT ++ + +L F N + LL N
Sbjct: 221 ATTTTSYTQDGTTFKRETFSSYPDDVTVTHLTKKGNKTLDFTLWNSLTEDLLANGNYSWE 280
Query: 184 -HSYVNGN-----NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISAL 237
+Y NG+ N I+++G K N G++F++ L IK +D + T+
Sbjct: 281 YSNYKNGHVTTDANGILLKGTV-------KDN------GLKFASYLGIK-TDGKVTV--- 323
Query: 238 EDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDPTSESM--SALQSIRNLSYSDLY 294
+D+ L V G+ +A L L A ++F NP ++ +KD E +++ + Y L
Sbjct: 324 QDETLTVTGASYATLYLSAKTNF---AQNPKTNYRKDIDLEKTVKGIVEAAKAKDYETLK 380
Query: 295 TRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLF 354
H+ DYQ LF+RV + L N + E ++ + ++ L EL F
Sbjct: 381 KDHIKDYQSLFNRVKLNLGG-------------NKTAQTTKEALQGYNPEKGQKLEELFF 427
Query: 355 QFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
Q+GRYLLISSSR T ANLQG+WN +P W++ H+N+NL+MNYW + NL+E
Sbjct: 428 QYGRYLLISSSRDRTDALPANLQGVWNAVDNPPWNADYHLNVNLQMNYWPAYMSNLAETA 487
Query: 413 EPLFDFLTYLSING-----------SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWA 461
+P+ +++ + G SK Q N GW++H + + ++ W
Sbjct: 488 KPMINYIDDMRYYGRIAAKEYAGIESKDGQEN----GWLVHTQATPFGWTTPGW-NYYWG 542
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL--IEGHDGYLETNPS 519
P AW+ +++++Y +T D +L+++ YP+L+ F +L + D ++ ++PS
Sbjct: 543 WSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETTKFWNSFLHYDQASDRWV-SSPS 601
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
SPEH ++ +T D +++ ++F + A L ++D LV +V +L
Sbjct: 602 YSPEH---------GTITIGNTFDQSLVWQLFHDYMEVANHLNVDQD-LVTEVKAKFDKL 651
Query: 580 RPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEK 633
+P I ++G I EW ++ F + E +HRH+SHL GLFPG T+ + + +AA
Sbjct: 652 KPLHINKEGRIKEWYEEDSPQFTNEGIENNHRHVSHLVGLFPG-TLFSKDQAEYLEAARA 710
Query: 634 TLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAA 693
TL RG+ G GWS K LWARL D A+R++ E NL+
Sbjct: 711 TLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLAEQLKYSTLE-----------NLWDT 759
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWK 753
H PFQID NFG T+ +AEML+QS + LPALP D W G V GL ARG VS+ WK
Sbjct: 760 HAPFQIDGNFGATSGMAEMLLQSHTGYIAPLPALP-DAWKDGQVSGLIARGNFEVSMKWK 818
Query: 754 DGDLHEVGIYSN 765
D +L + SN
Sbjct: 819 DKNLQSLSFLSN 830
>gi|374373770|ref|ZP_09631430.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
gi|373234743|gb|EHP54536.1| Alpha-L-fucosidase [Niabella soli DSM 19437]
Length = 733
Score = 334 bits (856), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 237/760 (31%), Positives = 350/760 (46%), Gaps = 108/760 (14%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA 85
+ +PIGNGRLGAM+ GGV ++T++ NE +LW+G N D D
Sbjct: 39 EGLPIGNGRLGAMMMGGVANDTIQFNEQSLWSGD----NNWDGAYETGD----------- 83
Query: 86 EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVE 145
H Y+ G + + FD + YRR L+L ++ +
Sbjct: 84 -----------HGFGSYRNFGALVVNFDGDK---SSSGYRRGLNLTDGIYTASLTINKTQ 129
Query: 146 FTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIP 205
+ RE F+S+PDQV+V + + +++G LS +SL S + GN+
Sbjct: 130 YKREAFASHPDQVMVFRYT-AQNGRLSGRISLHSAQGASARATGNSLQF----------- 177
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFI 265
A P +Q++A ++ + + GT++ L D +L G L L A +++ P
Sbjct: 178 ----AGTMPNQLQYAA--KMLLQQEGGTVTTL-DSQLVFTGCKTLTLYLDARTNYK-PDY 229
Query: 266 NPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCS 325
P L + +Y L H+ D+ L I + +P +
Sbjct: 230 TADWRGAAPRPVIEKELAAALRKTYEQLRAAHIKDFTALAAAAHIDVGTTPVAL------ 283
Query: 326 EENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSP 384
+P+ R++ + DP L E +FQFGRYLLISSSRPG ANLQG+WN +P
Sbjct: 284 ----RALPTDLRLQKYAAGGADPDLEETVFQFGRYLLISSSRPGGLPANLQGLWNNSNTP 339
Query: 385 TWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLAS--GWVIH 442
W S H NIN++MNYW + NLS C PL D++ + + + A+ GW
Sbjct: 340 PWASDYHNNINIQMNYWAAENTNLSACHIPLIDYIVAQAEPCRIATRKAFGAATRGWTAR 399
Query: 443 HKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFL 502
I+ + W AW H++EH+ +T DRD+L+K AYP+L+ +F
Sbjct: 400 TSQSIFGGNG-------WEWNIPASAWYAHHVFEHWAFTKDRDYLKKTAYPVLKEICNFW 452
Query: 503 LDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
D L + DG L SPEH DG + D ++ ++F + AA+ L
Sbjct: 453 EDRLKQLPDGSLVVPNGWSPEHG-PREDGVM--------HDQQLVWDLFQNYLDAAKALN 503
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIE 622
+ A KV RL P KI + G + EW +D DP HRH SHLF ++PG I++
Sbjct: 504 -TDPAYQLKVADMQRRLAPNKIGKWGQLQEWQEDRDDPNDQHRHTSHLFAVYPGRQISLT 562
Query: 623 KNPDLCKAAEKTLQKR------------------GEEGPGWSITWKTALWARLHDQEHAY 664
+ P+L KAA +L+ R G+ W+ W+ ALWARL + E A
Sbjct: 563 QTPELAKAAIISLRSRSGNYGKNIDKPFTVASTIGDSRRSWTWPWRCALWARLGEGEKAG 622
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
MV+ L + NL A HPP Q+D NFG + A+ EML+QS ++ LL
Sbjct: 623 MMVRGLLTY-----------NMLPNLLATHPPLQLDGNFGISGAIPEMLLQSHAGEISLL 671
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
PA+P +G GL+ARGG TVS WK G + I S
Sbjct: 672 PAIPESWKQAGSFNGLRARGGFTVSCSWKAGRVTGYHIVS 711
>gi|288803110|ref|ZP_06408545.1| fibronectin type III domain protein [Prevotella melaninogenica D18]
gi|288334371|gb|EFC72811.1| fibronectin type III domain protein [Prevotella melaninogenica D18]
Length = 1163
Score = 333 bits (853), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 238/789 (30%), Positives = 373/789 (47%), Gaps = 107/789 (13%)
Query: 11 NPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
N + + PA ++ T +PIGNG+ GA + G V + ++ N+ TLW+G G T
Sbjct: 341 NKYTLWYTKPATNWMTSCLPIGNGQFGATLMGDVAIDDVQFNDKTLWSGKLGGLT----- 395
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+TAA +G+ Y G++ + S Y R LD
Sbjct: 396 -----------------STAA----YGY----YLNFGNLYIR---SRGMSKVTDYVRYLD 427
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD-NHSY-V 187
+N A A V+Y++ V ++R +F+SNPD +V + + S++G ++ ++L + N SY V
Sbjct: 428 INDAVAGVRYTMDGVAYSRTYFASNPDSCVVVRYTASQNGKINTTLTLKNQNGRNVSYTV 487
Query: 188 NGNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
+ NNQ I +G+ A +D S +I D GTI+ ++V
Sbjct: 488 DNNNQATITFDGQI--------ARQDDHGATTPESYYCVARIVTDGGTITKNAKGVIEVN 539
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
G++ + L + FD + + + + +N Y L+ H DY+ LF
Sbjct: 540 GANSMTVYLRGLTDFDPDAPTYVSGANLLAARAAATVNGAQNKGYDALFAAHKTDYKSLF 599
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLIS 363
R + L +I P+ + + S++ ++ +L EL F +GRYLLIS
Sbjct: 600 DRCQLTLGDVKNNI-------------PTPQLISSYRNNQHDNLFLEELYFNYGRYLLIS 646
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR + ANLQGIWN++ +P W + H NIN++MNYW + P NLSE P D++ Y
Sbjct: 647 SSRGISLPANLQGIWNDNNTPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYI-YRE 705
Query: 424 INGSKTAQ-----VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
T + + ++ +GW + + +I+ G + + AW C HLW+HY
Sbjct: 706 ACVKPTWRRFAPDMGHVNTGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHLWQHY 760
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
YTMD+DFL +A+P ++ + L++ DG E SPEH
Sbjct: 761 TYTMDKDFLRTKAFPAMKSAVDYWFKKLVKAADGTYECPNEWSPEH---------GPTEN 811
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS--------- 589
++ ++ ++F+ A +VL D +V K + K+ +DG
Sbjct: 812 ATAHSQQLVWDLFNNTRKAIKVLG---DDVVSKAFRDSLATYFAKL-DDGCHTEVNPADG 867
Query: 590 ---IMEW--AQDFKDPEV-------HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQK 637
+ EW + F +P HRH+SHL GL+P I+ + + + +AA ++L
Sbjct: 868 QTYLREWKYSSQFNNPSKIGVNEYKAHRHISHLMGLYPCTQISEDADKTVFEAARQSLIA 927
Query: 638 RGE-EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP 696
RG+ G GWS+ K L AR ++ H + ++KR GG+Y NL+ AH P
Sbjct: 928 RGDGHGTGWSLGHKINLNARAYEGLHCHNLIKRALQQTWDTGTNEAAGGIYENLWDAHAP 987
Query: 697 FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
+QID NFG+TA VAEML+QS + L +LPALP W G VKGLKA G TV I W
Sbjct: 988 YQIDGNFGYTAGVAEMLLQSHNDKLVILPALPTTFWQKGSVKGLKAVGNFTVDIDWAAAK 1047
Query: 757 LHEVGIYSN 765
+V I SN
Sbjct: 1048 ATKVQIVSN 1056
>gi|336430063|ref|ZP_08610019.1| hypothetical protein HMPREF0994_06025 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336001234|gb|EGN31379.1| hypothetical protein HMPREF0994_06025 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 782
Score = 332 bits (850), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 238/801 (29%), Positives = 384/801 (47%), Gaps = 67/801 (8%)
Query: 15 ITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN-PDAPKALS 73
+ + PA ++ +A+P+GNGRLGAM +GG ETL+L+E T W+G + N D+ + L+
Sbjct: 5 LMYKQPAGNWKEALPLGNGRLGAMDFGGAWRETLQLDESTYWSGEASEENNRADSRELLA 64
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLL------------GDIELEFDDSHLKYAE 121
+R + Y A G+ + L G E E++++
Sbjct: 65 QIREALLEEDYERADELGHGFVGNKNNYGTNLPVGNFYIDCFPEGRPEKEWEEAAGADTV 124
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
+ R L L A + V + G + RE F SNP Q V + + + + +
Sbjct: 125 TDFVRRLYLEEARSEVSFKAGGSTYRREVFLSNPAQTAVIHMDTRPGKPFALRIRFEGIA 184
Query: 182 DNHSYVNGNNQ-IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK 240
Q ++ G+ + +D G+ + I++ D L++
Sbjct: 185 SRVGITEERQQDYLIRGQAR------ETLHSDGFTGVNLAG--RIRVVTD--GYHHLKES 234
Query: 241 KLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ VE + A LL+ + P DP + L+ Y L H+ D
Sbjct: 235 GIWVENATRATLLVDLETDMFQP---------DPEETAGRRLEEAWQKGYEQLRQEHIQD 285
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRY 359
L++R+ I L E++ +P+ ER+ K + EDP L LLFQ+GRY
Sbjct: 286 VSALYNRMDISLG------------AEDMRELPTDERLRKQTEGKEDPGLAALLFQYGRY 333
Query: 360 LLISSSRPGTQV-ANLQGIWNEDLSPTWDSAP--HVNINLEMNYWQSLPCNLSECQEPLF 416
LLISSSR + + ++ GIWN+++ D HV++NL+M YW + C L EC +P F
Sbjct: 334 LLISSSREDSPLPTHMGGIWNDNIYNNIDCTQDMHVDMNLQMQYWLAALCALPECYQPAF 393
Query: 417 DFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
++ + + +G KTA Y A GW H T+ W +S W +W +GG W +W
Sbjct: 394 AYMRDILVPSGEKTAAGVYGARGWTAHVVTNPWGFTSLG-WSYNWGVWSLGGVWCAALIW 452
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLA 534
++Y +T D+DFL + +P+L+G A F D++ + G+ T PS SPE+ F + +GK
Sbjct: 453 DYYEFTGDKDFL-REWWPVLKGAAEFAADYVFPDEKSGFYMTGPSYSPENMF-SVEGKEY 510
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWA 594
+S S+ D ++RE+ I + L D+ +EK ++ L P +I G + EW
Sbjct: 511 FLSLSTACDCILVREILDIIAKGYQELSLERDSFLEKCVEIRENLPPYRIGSRGQLQEWF 570
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE--EGPGWSITWKTA 652
DF +P +HRH SHL GL+P I E+ P L +AA +++++R E E W +
Sbjct: 571 HDFDEPIPNHRHTSHLLGLYPFSQIRPEEQPQLAQAAYESIRRRLEDFEITSWGMNMLMG 630
Query: 653 LWARLHDQEHAYRMVK-RLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
+ARL D E A + + L LV P ++++A +++D N G TA++AE
Sbjct: 631 YYARLCDGEKALAIYQDTLRRLVKPNLSSVMSD--ETSMWAG--TWELDGNTGLTASMAE 686
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDH 771
MLVQS + + +LPALP D+W +G VKG+ RGG+ I WKDG +V +
Sbjct: 687 MLVQSHGDVIRILPALP-DEWRNGYVKGICLRGGQKADIYWKDGIPEKVVLVCG-----K 740
Query: 772 DSFKTLHYRGTSVKVNLSAGK 792
D + L Y +++L G+
Sbjct: 741 DEKRILCYGDQKQEIDLKTGE 761
>gi|325269425|ref|ZP_08136042.1| fibronectin type III domain protein [Prevotella multiformis DSM
16608]
gi|324988346|gb|EGC20312.1| fibronectin type III domain protein [Prevotella multiformis DSM
16608]
Length = 847
Score = 330 bits (847), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 230/774 (29%), Positives = 356/774 (45%), Gaps = 104/774 (13%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T +P+GNG+ GA V G + + ++ N+ TLW+G G T+ A G
Sbjct: 85 MTSCLPVGNGQFGATVMGQIVVDDVQFNDKTLWSGKLGGLTSTAA------------YGS 132
Query: 84 YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
Y FG+ L +K + Y R LD+N A A V++S+
Sbjct: 133 YLN--------FGN------------LLIRSRGMKGVTD-YVRYLDINDAVAGVRFSMDG 171
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH-SYV---NGNNQIIMEGRC 199
V ++R +F+SNPD +V + + + G ++ ++L +H SY G I +G+
Sbjct: 172 VGYSRTYFASNPDSCVVIRYTATRGGMINTTLALKDQNGSHVSYTVDGPGRATITFDGQV 231
Query: 200 PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSS 259
ND+ + S +I D GT++ + ++V ++ + L +
Sbjct: 232 --------GRQNDEGEATPESYCCAARIVADGGTVTKNAEGLVEVSDANSMTVYLRGLTD 283
Query: 260 FDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDI 319
FD + +M+A+ R Y L H DY+ LF R + L + D
Sbjct: 284 FDAAAPEYVSGTEQLAGRAMAAVDGARRKGYDALLAAHKADYKSLFDRCLLTLCSTGSD- 342
Query: 320 VTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQGI 377
VP+ + + ++ D +L EL F +GRYLLISSSR + ANLQGI
Sbjct: 343 ------------VPTPQLISGYRADPQGNLFLEELYFSYGRYLLISSSRGVSLPANLQGI 390
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK----TAQVN 433
WN +P W + H NIN++MNYW + P NLSE P D++ + +
Sbjct: 391 WNNSNAPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVKPAWRRFARDMG 450
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
+ +GW + + +I+ G + + AW C HLW+HY YT+DR++L ++A+P
Sbjct: 451 KVDAGWTLPTENNIYGS-----GTTFANTYTVANAWYCQHLWQHYAYTLDREYLRRQAFP 505
Query: 494 LLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSA 553
+++ + L L++G DG E SPEH P ++ ++ ++F+
Sbjct: 506 VMKSAVDYWLRKLVKGADGTYECPEEWSPEH---GP------TENATAHSQQLVWDLFNN 556
Query: 554 IISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS------------IMEW--AQDFKD 599
A EVL D +V + + T + +DG + EW F +
Sbjct: 557 TRKAIEVL---GDEVVSRTFRDSLAAYFT-LLDDGCHTEVNPADGQTYLREWKYTSQFNN 612
Query: 600 PE-------VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE-EGPGWSITWKT 651
P HRH+SHL GL+P I+ + + + +AA +L RG+ G GWS+ K
Sbjct: 613 PGKIGVDEYRAHRHISHLMGLYPCSQISGDADKAVFQAARTSLIARGDGHGTGWSLGHKI 672
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
L AR H+ +H + +++R GG+Y NL+ AH P+QID NFG+TA VAE
Sbjct: 673 NLNARAHEGQHCHNLIRRALQQTWTTDVNEGAGGIYENLWDAHAPYQIDGNFGYTAGVAE 732
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
ML+QS L LLPALP W G VKGLKA G TV I W+ +V I S
Sbjct: 733 MLLQSYSGKLVLLPALPAAFWDKGSVKGLKAVGNFTVDIAWEKARAAKVRIVSG 786
>gi|340520176|gb|EGR50413.1| glycoside hydrolase family 95 [Trichoderma reesei QM6a]
Length = 794
Score = 330 bits (847), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 240/801 (29%), Positives = 363/801 (45%), Gaps = 93/801 (11%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
T +PIGN RLG ++GG +E + +NEDTLW G + + AL VR ++ +
Sbjct: 39 TGVLPIGNSRLGGAIFGG-GNEVITINEDTLWDGPLQNRIPANGLAALPKVRQMLLANNL 97
Query: 85 AEATAASVKLFGHPA----DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
+A + PA + G++ L F Y R LD + V Y+
Sbjct: 98 TDAGNLVLSQM-MPAVGGERQFSYFGNLNLNFGHGS---GISNYIRSLDTRQGNSSVSYT 153
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDNHSYVNGN-NQIIME 196
V +TRE+ +S P VI + + S++G+LS + + + ++L N + +G N + ++
Sbjct: 154 FNGVTYTREYVASAPVGVIAARFTASKAGALSVSATFSRISNILSNVASTSGGVNSVTLQ 213
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G + P I F+ + + G++SA L +
Sbjct: 214 GTSGQAQNP-----------ILFTG--KARFVPQGGSVSA-----------SGGTLTITG 249
Query: 257 SSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQKLFHRVSIQ 311
+++ D FI+ + + PT+ +++A + + + + ++ + D L R +I
Sbjct: 250 ATTID-VFIDVETNYRYPTASALAAEVDNKINTAVSQGFQKVHDDAIADSSALLGRANIN 308
Query: 312 LSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGTQ 370
L SP I P+ +RVKS ++ DP L+ L + +GR+LL++SSR +
Sbjct: 309 LGTSPNGIANQ----------PTDQRVKSARSAFNDPQLIVLAWNYGRHLLVASSRDTSA 358
Query: 371 V----ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
NLQG+WN S W +NIN EMN W + NL E Q PLFD L G
Sbjct: 359 AIDMPPNLQGVWNNATSAPWGGKFTININTEMNLWPAGQTNLIETQLPLFDLLKVAQPRG 418
Query: 427 SKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
+ AQ Y +G V HH D+W + ++WPMG WL H+ E Y +T D DF
Sbjct: 419 QEMAQKLYGCNGTVFHHNLDVWGDPAPTDNYPSSSMWPMGATWLVQHMMEQYRFTGDLDF 478
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA- 545
L AYP L + FL + G T PS SPE+ + P G MDMA
Sbjct: 479 LRNTAYPYLLDISKFLQCYTFT-WQGNRVTGPSLSPENTYAVPQGA-NVAGQQEPMDMAP 536
Query: 546 -----IIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
++R+V SAI+ AA L + DA V+ LP +R +I G I+EW ++ +
Sbjct: 537 EMDNQLMRDVMSAIVEAAAALGISSSDANVKAASDFLPLIRTPRIGSYGQILEWRAEYPE 596
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTALWAR 656
+ HRHLS L+GL P + N L AA+ L R G GWS TW +AR
Sbjct: 597 TDPGHRHLSPLYGLHPSSQFSPLVNSTLSAAAKALLDHRVASGSGSTGWSRTWLMNQYAR 656
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L ++ + F + + GG FQID NFGFT+ V EML+QS
Sbjct: 657 LFSGADVWKHIVAWFATYPTPNLWNTNGG---------STFQIDGNFGFTSGVTEMLLQS 707
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKT 776
++LLPALP +G V+GL ARGG V I W+ G + S
Sbjct: 708 QTGTVHLLPALPGSNLPTGNVRGLLARGGFQVDIDWQGGSFKSATVTST----------- 756
Query: 777 LHYRGTSVKVNLSAGKIYTFN 797
RG +K+ ++ G+ + N
Sbjct: 757 ---RGGQLKLRVANGQSFNVN 774
>gi|153816042|ref|ZP_01968710.1| hypothetical protein RUMTOR_02288 [Ruminococcus torques ATCC 27756]
gi|331089120|ref|ZP_08338023.1| hypothetical protein HMPREF1025_01606 [Lachnospiraceae bacterium
3_1_46FAA]
gi|145846689|gb|EDK23607.1| LPXTG-motif cell wall anchor domain protein [Ruminococcus torques
ATCC 27756]
gi|330405897|gb|EGG85423.1| hypothetical protein HMPREF1025_01606 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 1966
Score = 329 bits (844), Expect = 3e-87, Method: Compositional matrix adjust.
Identities = 250/850 (29%), Positives = 404/850 (47%), Gaps = 119/850 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY----------TNPDAPKALSDVR 76
A+P+GN +GA V+GGV +E ++LNE +LW+G P D + K ++ ++
Sbjct: 68 ALPLGNSAIGASVFGGVQTERIQLNEKSLWSGGPSDSRPEYNGGNIESKGQNGKVMAQLK 127
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
+ SGQ ++ A +L G D Y G++ L+F + K Y R+LD
Sbjct: 128 EKLKSGQGFDSNLAG-QLIGVSDDAGVQGYGYYLSYGNMYLDFKNV-TKNNVSGYSRDLD 185
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L TA A V Y + +TRE+F S PD V+VT+++ ++ G+L F+V ++ + N
Sbjct: 186 LRTAVAGVNYDLNGAHYTRENFVSYPDNVLVTRLTATDGGTLDFDVRVEPDEEKGGSQN- 244
Query: 190 NNQIIMEGRCPGKRIPPKANANDDP---KGIQFSAILEIKISDDRGTISALED--KKLKV 244
+ R K++ A A D ++FS+ ++ I DD GT ++D K K+
Sbjct: 245 KPEADSYARTFDKKVSDNAIAIDGQLTDNQLKFSSYTKV-IKDD-GTAGQIKDDSKNGKI 302
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL---------QSIRNLSYSDLYT 295
S + ++ S D P + T E ++AL ++ Y L
Sbjct: 303 TVSGAKAITIITSIGTDYKNDYPK-YRTGETKEQLAALVKGYVSGAEAKVKAGGYETLKE 361
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQ 355
H++DY +F R+ + + ++ D TD E A + + E L +LFQ
Sbjct: 362 DHVNDYDHIFGRLDLNIGQAVSDKTTDKLLE--------AYKKGTASETEKRYLELMLFQ 413
Query: 356 FGRYLLISSSRP-------------GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQ 402
+GRYL + SSR T +NLQGIW + W S H+N+NL+MNYW
Sbjct: 414 YGRYLTMGSSRETPVNEDGTKNERRATLPSNLQGIWVGANNSAWHSDYHMNVNLQMNYWP 473
Query: 403 SLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---------SGWVIHHKTDIWAKSSA 453
+ N++EC EPL D++ L G TA++ Y +G++ H + + + ++
Sbjct: 474 TYTTNMAECAEPLIDYVDSLREPGRITAKI-YAGVESTEANPENGFMAHTQNNPYGWTNP 532
Query: 454 DRGKVV-WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
G V W P G W+ + WE+Y +T D ++++ YP+++ A+ L+ +G
Sbjct: 533 --GWVFDWGWSPAGVPWILQNCWEYYEFTGDTEYMQTHIYPMMKEEATLYDQMLMRDSEG 590
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
L + PS SPEH + +T + ++I +++ I+AAE L +E + +
Sbjct: 591 KLVSVPSYSPEH---------GPRTAGNTYEHSLIWQLYEDTITAAETLGVDEAKVAQWK 641
Query: 573 LKSLPRLRPTKIAEDGSIMEWAQDF---------KDPEVH-HRHLSHLFGLFPGHTITIE 622
P +I + G I EW + K E + HRH+SH+ GL+PG I
Sbjct: 642 QNQADLKGPIEIGDSGQIKEWYNETTLNTDENGQKMGEGYGHRHISHMLGLYPGDLIA-- 699
Query: 623 KNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHF 682
+N + AA+ ++Q R + GW++ + A WARL + + AY ++ ++
Sbjct: 700 QNDEWLAAAKVSMQNRTDVTTGWAMAQRVATWARLAEGDKAYDVLSKMIT---------- 749
Query: 683 EGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKA 742
+ +NL+ H PFQID NFG+TAAVAEMLVQS + + L+PA+P W +G VKGL A
Sbjct: 750 NNKIMTNLWDTHAPFQIDGNFGYTAAVAEMLVQSNMGHIDLMPAVP-KAWGTGNVKGLLA 808
Query: 743 RGGETVSICWKDGDLHEVGIYSN--------YSN--------NDHDSFKTLHYRGTSVKV 786
RG V + W D L E I+SN Y+N +D + + +
Sbjct: 809 RGNFAVDMAWADNKLTEASIHSNNGGEAVVQYANLSLATVKDSDGNLVEITPVTSDRISF 868
Query: 787 NLSAGKIYTF 796
N AGK YT
Sbjct: 869 NTEAGKTYTI 878
>gi|332982836|ref|YP_004464277.1| alpha-L-fucosidase [Mahella australiensis 50-1 BON]
gi|332700514|gb|AEE97455.1| Alpha-L-fucosidase [Mahella australiensis 50-1 BON]
Length = 816
Score = 328 bits (842), Expect = 5e-87, Method: Compositional matrix adjust.
Identities = 240/769 (31%), Positives = 372/769 (48%), Gaps = 64/769 (8%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + DAIP GNG +GA+V+G + +E + LN + L+ N + LS +R ++
Sbjct: 13 PAIRWQDAIPCGNGSIGALVYGHIKNEIITLNHEALFLKSQKPQIN-SIYEYLSQLRKML 71
Query: 80 DSGQYAEATAASVKLFGH------PADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
G+Y E + D YQ DI++ DS A Y R LD T
Sbjct: 72 MEGKYNEGAQFFERKLKENYIGIARTDPYQPAFDIKI---DSETHEAFTGYCRYLDFETG 128
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
A V++S GN + R+ F S D ++ +I+ S ++ +SL V G +
Sbjct: 129 EAVVRWSEGNTNYHRDLFVSRVDDAVILRINAVGSEKVNCVISLVP-----CRVEGATGM 183
Query: 194 IMEGRCPGKRIPPKANANDD----------PKGIQFSAILEIKISDDRGTISALEDKKLK 243
G ++P + A+ + P G +F + + ++ G + +E +
Sbjct: 184 GSGKDVKGDKLPFEWQASSEENWISFEAQYPDGNEFGGVARLIVNG--GCMEGIEAQNNC 241
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ D +L++ F+N K T E+ + ++ Y L ++H+ +++
Sbjct: 242 IYIKDATEVLMMVKV-----FVN---EKSKTTIENTKSQLEKMDVCYEALLSKHVYQHRE 293
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L+ RV+I+ +D + E + ++S+ +L++ +F FGRYLLIS
Sbjct: 294 LYKRVNIEFHEQREDKLAKQKFNEEL-------LLESYNGQIPTALIQRMFYFGRYLLIS 346
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSRPG ANLQGIWN D P W S H + N+EMNYW +LP NL E P FD+ +
Sbjct: 347 SSRPGGLPANLQGIWNGDYVPAWASDYHNDENIEMNYWAALPGNLPETTLPYFDYYMSML 406
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ A+V Y G + D +WA W G WL ++++ +T D
Sbjct: 407 EDFRTNAKVIYGCRGILAPIAQTTHGLVYTDP---IWATWTAGAGWLSQLFYDYWLFTGD 463
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMD 543
DFL+ +A P ++ A F D+L+EG DG PS SPE+ P+ L V+ ++TMD
Sbjct: 464 MDFLKNKAIPFMKEIALFYEDFLVEGEDGKFMFIPSLSPENTPPIPNASL--VTINATMD 521
Query: 544 MAIIREVFSAIISAAEVL--EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
+AI REV + + +A + L EK + + +L LP ++ EDG+I EW
Sbjct: 522 IAIAREVLANLCAACKYLGIEKENVKIWKHMLSKLPEY---QVNEDGAIKEWIHSDLPDN 578
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG----PGWSITWKTALWARL 657
HHRH SH++ LFPG +T E NP L A + ++KR G GWS+ ++ARL
Sbjct: 579 YHHRHQSHIYPLFPGFEVTEETNPSLFHAMKVAVEKRLVVGLTSQTGWSLAHMANIYARL 638
Query: 658 HDQEHAYRMVKRLF------NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
D + A + ++ + NL ++ +G + PPFQIDANFG TAA+ E
Sbjct: 639 GDGDGAIQCLETMCRSCVGTNLFTYHNDWRSQGLTMFWGHGSQPPFQIDANFGLTAAIFE 698
Query: 712 MLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
MLV S+ + LLPALP KW G +G+ RG VS+ W D D +E+
Sbjct: 699 MLVFSSPGIIKLLPALP-SKWIKGKAEGITCRGCIEVSVEW-DMDKNEL 745
>gi|153815077|ref|ZP_01967745.1| hypothetical protein RUMTOR_01294 [Ruminococcus torques ATCC 27756]
gi|145847645|gb|EDK24563.1| F5/8 type C domain protein [Ruminococcus torques ATCC 27756]
Length = 1812
Score = 328 bits (841), Expect = 7e-87, Method: Compositional matrix adjust.
Identities = 262/887 (29%), Positives = 409/887 (46%), Gaps = 151/887 (17%)
Query: 13 LKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-- 60
LK+ + PAK T ++P+GNG LG +++GG+ E + NE TLWTG P
Sbjct: 57 LKLWYGSPAKINTAESSGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSS 116
Query: 61 -------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKLFGHP--- 98
G+ + + R L+D G Y A ++ G
Sbjct: 117 SRPNYQFGNKATAYTATEIENYRKLLDDKSSNVFNDDQSLGGYG--MGAKIRFPGEDNLN 174
Query: 99 ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
YQ GDI L+F + + YRREL+L T A ++S NV + REHF S+PDQ
Sbjct: 175 KGSYQDFGDIWLDFSAMGITDDNVQNYRRELNLQTGIASTEFSYKNVSYKREHFVSSPDQ 234
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
V+VT +S SE G L+F+ ++ L+N N ++ + R I K ND +
Sbjct: 235 VMVTNLSASEKGKLNFSAKME--LNND---NLEGKLTFDVRNQTCTIEGKVKDND----L 285
Query: 218 QFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
+F +++ ++ G I+A E ++ +++ +D +++ A + + + D +K+ ++
Sbjct: 286 KFRTTMKLLLTG--GEITADEKNQVYRIKNADQVTIIMAAETDYKNDYPTYRDKEKNLSN 343
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
+ + SY +L H++D+Q LF RVS+ L + TD ID +
Sbjct: 344 VIDTRINDSSKKSYDELKQTHIEDHQSLFDRVSLDLGEFQTSVPTDQL----IDEYRNGS 399
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPT-WDSAPHVNIN 395
+T L FQ+GRYL I+ SR GT +NL G+W + P+ W H N+N
Sbjct: 400 YSHYLET--------LAFQYGRYLTIAGSR-GTLPSNLVGLWT--VGPSAWTGDYHFNVN 448
Query: 396 LEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLASGWVIHHKTD 446
++MNYW NL+EC D+ LT ++G K A N+ +G+ +H + +
Sbjct: 449 VQMNYWPVYSTNLAECGTTFVDYMDKLREPGRLTAERVHGIKGAVDNH--TGFTVHTENN 506
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ + + P G AW +LW HY +T D +L+ YP+++ A F +L
Sbjct: 507 PFGMTAPTNAQE-YGWNPTGAAWAVQNLWWHYEFTQDEAYLKNTIYPIMKEAAQFWDSYL 565
Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS---------STMDMAIIREVFSAIISA 557
Y + N TSP H + +A S+S +T D ++I E+++ I A
Sbjct: 566 WTSE--YQKINDETSPYH---GENRLVAAPSFSEEQGPTAIGTTYDQSLIWELYNECIQA 620
Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW------------------AQDFKD 599
+++ ++E A+++ + + +L P +I I EW A D +
Sbjct: 621 GKIVGEDE-AVLQSWEEKMQKLDPIEINATNGIKEWYEETRVGQETGHNKSYAKAGDLAE 679
Query: 600 PEV-----------HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
V RH SHL GLFPG I E NP AA ++L +RGE GWS
Sbjct: 680 IAVPNSGWNIGHNGEQRHASHLVGLFPGTLINKE-NPTYMNAAIQSLTERGEYSTGWSKA 738
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH------------PP 696
K LWAR + E AY++ L NL+ GL NLF +H P
Sbjct: 739 NKINLWARAENGEKAYKL---LNNLIGGNS-----SGLQHNLFDSHGSGGGDTMMNGTPV 790
Query: 697 FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
+QID NFG T+ VAEMLVQS LPA+P D W G V+GLKARG T+ W +G
Sbjct: 791 WQIDGNFGLTSGVAEMLVQSQSGYTQFLPAIP-DAWEKGEVRGLKARGNFTIGEKWANGI 849
Query: 757 LHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCT 803
+ Y N + T Y+ N+++ KIY ++++ T
Sbjct: 850 AEAFTV--RYDGNKDSAVFTGSYK------NITSAKIYEDGKEVQVT 888
>gi|317501845|ref|ZP_07960030.1| hypothetical protein HMPREF1026_01974 [Lachnospiraceae bacterium
8_1_57FAA]
gi|336439520|ref|ZP_08619132.1| hypothetical protein HMPREF0990_01526 [Lachnospiraceae bacterium
1_1_57FAA]
gi|316896735|gb|EFV18821.1| hypothetical protein HMPREF1026_01974 [Lachnospiraceae bacterium
8_1_57FAA]
gi|336015952|gb|EGN45750.1| hypothetical protein HMPREF0990_01526 [Lachnospiraceae bacterium
1_1_57FAA]
Length = 1802
Score = 328 bits (840), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 262/887 (29%), Positives = 409/887 (46%), Gaps = 151/887 (17%)
Query: 13 LKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-- 60
LK+ + PAK T ++P+GNG LG +++GG+ E + NE TLWTG P
Sbjct: 47 LKLWYGSPAKINTAESSGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSS 106
Query: 61 -------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKLFGHP--- 98
G+ + + R L+D G Y A ++ G
Sbjct: 107 SRPNYQFGNKATAYTATEIENYRKLLDDKSSNVFNDDQSLGGYG--MGAKIRFPGEDNLN 164
Query: 99 ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
YQ GDI L+F + + YRREL+L T A ++S NV + REHF S+PDQ
Sbjct: 165 KGSYQDFGDIWLDFSAMGITDDNVQNYRRELNLQTGIASTEFSYKNVSYKREHFVSSPDQ 224
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
V+VT +S SE G L+F+ ++ L+N N ++ + R I K ND +
Sbjct: 225 VMVTNLSASEKGKLNFSAKME--LNND---NLEGKLTFDVRNQTCTIEGKVKDND----L 275
Query: 218 QFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
+F +++ ++ G I+A E ++ +++ +D +++ A + + + D +K+ ++
Sbjct: 276 KFRTTMKLLLTG--GEITADEKNQVYRIKNADQVTIIMAAETDYKNDYPTYRDKEKNLSN 333
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
+ + SY +L H++D+Q LF RVS+ L + TD ID +
Sbjct: 334 VIDTRINDSSKKSYDELKQTHIEDHQSLFDRVSLDLGEFQTSVPTDQL----IDEYRNGS 389
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPT-WDSAPHVNIN 395
+T L FQ+GRYL I+ SR GT +NL G+W + P+ W H N+N
Sbjct: 390 YSHYLET--------LAFQYGRYLTIAGSR-GTLPSNLVGLWT--VGPSAWTGDYHFNVN 438
Query: 396 LEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLASGWVIHHKTD 446
++MNYW NL+EC D+ LT ++G K A N+ +G+ +H + +
Sbjct: 439 VQMNYWPVYSTNLAECGTTFVDYMDKLREPGRLTAERVHGIKGAVDNH--TGFTVHTENN 496
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ + + P G AW +LW HY +T D +L+ YP+++ A F +L
Sbjct: 497 PFGMTAPTNAQE-YGWNPTGAAWAVQNLWWHYEFTQDEAYLKNTIYPIMKEAAQFWDSYL 555
Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS---------STMDMAIIREVFSAIISA 557
Y + N TSP H + +A S+S +T D ++I E+++ I A
Sbjct: 556 WTSE--YQKINDETSPYH---GENRLVAAPSFSEEQGPTAIGTTYDQSLIWELYNECIQA 610
Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW------------------AQDFKD 599
+++ ++E A+++ + + +L P +I I EW A D +
Sbjct: 611 GKIVGEDE-AVLQSWEEKMQKLDPIEINATNGIKEWYEETRVGQETGHNKSYAKAGDLAE 669
Query: 600 PEV-----------HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
V RH SHL GLFPG I E NP AA ++L +RGE GWS
Sbjct: 670 IAVPNSGWNIGHNGEQRHASHLVGLFPGTLINKE-NPTYMNAAIQSLTERGEYSTGWSKA 728
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH------------PP 696
K LWAR + E AY++ L NL+ GL NLF +H P
Sbjct: 729 NKINLWARAENGEKAYKL---LNNLIGGNS-----SGLQHNLFDSHGSGGGDTMMNGTPV 780
Query: 697 FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
+QID NFG T+ VAEMLVQS LPA+P D W G V+GLKARG T+ W +G
Sbjct: 781 WQIDGNFGLTSGVAEMLVQSQSGYTQFLPAIP-DAWEKGEVRGLKARGNFTIGEKWANGI 839
Query: 757 LHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCT 803
+ Y N + T Y+ N+++ KIY ++++ T
Sbjct: 840 AEAFTV--RYDGNKDSAVFTGSYK------NITSAKIYEDGKEVQVT 878
>gi|331091988|ref|ZP_08340820.1| hypothetical protein HMPREF9477_01463 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330402887|gb|EGG82454.1| hypothetical protein HMPREF9477_01463 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1785
Score = 327 bits (839), Expect = 1e-86, Method: Compositional matrix adjust.
Identities = 253/861 (29%), Positives = 404/861 (46%), Gaps = 139/861 (16%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD----------APKALSDVR 76
++P+GNG LG +++GG+ E + NE TLWTG P + T PD K + R
Sbjct: 71 SLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSE-TRPDYQFGNKKTAYTDKEIEAYR 129
Query: 77 SLVDSGQY----------AEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAE-E 122
L+D + +K G YQ GDI ++F ++ ++ +
Sbjct: 130 KLLDDKSKNVFNDDTSLGKPGMSGKIKFPGEDNLNKGSYQDFGDIWIDFSETGIRDDNVK 189
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRRELDL T A +S V++ REHF S+PDQV+VT++S S+ L ++ ++
Sbjct: 190 NYRRELDLQTGVAATTFSHQGVDYKREHFVSSPDQVMVTELSASKEKKLDVSIKMEL--- 246
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
N+S + G + E I K N G++F + KI G I+A E +L
Sbjct: 247 NNSGLEGTAKFDAEQNMY--TIFGKVKDN----GLKFRTTM--KIVQSGGDITADEKNQL 298
Query: 243 -KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDY 301
KVE +D ++++ A + + + D+KKD + ++ SY +L H++D+
Sbjct: 299 YKVENADKIMIVMAAETDYKNDYPTYRDTKKDLEKVVVERVKRASEKSYQELKENHIEDH 358
Query: 302 QKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYL 360
Q LF RVS+ L EN +P+ E + +++ +E+L FQ+GRYL
Sbjct: 359 QGLFDRVSLDLG-------------ENRSNIPTNELIDAYRKGSYSKYLEVLAFQYGRYL 405
Query: 361 LISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
I+ SR GT +NL G+W S W H N+N++MNYW NL+EC + D++
Sbjct: 406 TIAGSR-GTLPSNLVGLWTMGAS-AWTGDYHFNVNVQMNYWPVYVTNLAECGTTMVDYME 463
Query: 421 YLSINGSKTAQ-------VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L G TA+ +G+ +H + + + ++ + + P G AW +
Sbjct: 464 NLREPGRLTAERVHGIEDATTKKNGFTVHTENNPFGMTAPTNNQ-EYGWNPTGAAWAIQN 522
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-------IEGHDGYLETNPSTSPEHEF 526
LW HY +T ++D+L+ YP+++ A F ++L + + + P F
Sbjct: 523 LWAHYEFTQNKDYLKNTIYPIMKEAAQFWDNYLWTSDYQKVHDKNSKYDGQPRLVVVPSF 582
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS----LPRLRPT 582
A G A +T D +++ E+++ I A +++ ED E VLKS + RL P
Sbjct: 583 SAEQGPTAV---GTTYDQSLVWELYNECIKAGKIV--GED---ETVLKSWEEKMQRLDPI 634
Query: 583 KIAEDGSIMEWAQDFK--DPEVHH---------------------------RHLSHLFGL 613
++ I EW ++ + HH RH SHL GL
Sbjct: 635 EMNATNGIKEWYEETRVGTETGHHQSYAKAGNLAEIPVPNSGWNIGHLGEQRHASHLVGL 694
Query: 614 FPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNL 673
FPG T+ + N + AA ++L++RGE GWS K LWAR + + AYR+ L NL
Sbjct: 695 FPG-TLIHKDNEEYMDAAIQSLEERGEYSTGWSKANKINLWARTGNGDKAYRL---LNNL 750
Query: 674 VDPEHEKHFEGGLYSNLFAAH------------PPFQIDANFGFTAAVAEMLVQSTLNDL 721
+ GL NLF +H P +QID N+G T+ VAEML+QS L +
Sbjct: 751 IGGNT-----SGLQYNLFDSHGSQGGDTMMNGTPVWQIDGNYGLTSGVAEMLLQSQLGYV 805
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRG 781
LPA+P W+ G VKGLKARG T+S WK+ + + Y + +S T Y+
Sbjct: 806 QFLPAIP-SAWTDGEVKGLKARGNFTISEKWKNNMAEKFTV--RYDGEEKESTFTGEYK- 861
Query: 782 TSVKVNLSAGKIYTFNRQLKC 802
+++ K+Y ++++
Sbjct: 862 -----DITNAKVYQDGKEVRV 877
>gi|331088642|ref|ZP_08337553.1| hypothetical protein HMPREF1025_01136 [Lachnospiraceae bacterium
3_1_46FAA]
gi|330407599|gb|EGG87099.1| hypothetical protein HMPREF1025_01136 [Lachnospiraceae bacterium
3_1_46FAA]
Length = 1802
Score = 327 bits (838), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 260/887 (29%), Positives = 407/887 (45%), Gaps = 151/887 (17%)
Query: 13 LKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-- 60
LK+ + PAK T ++P+GNG LG +++GG+ E + NE TLWTG P
Sbjct: 47 LKLWYGSPAKINTAESSGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSS 106
Query: 61 -------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKLFGHP--- 98
G+ + + R L+D G Y A ++ G
Sbjct: 107 SRPNYQFGNKATAYTATEIENYRKLLDDKSSNVFNDDQSLGGYG--MGAKIRFPGEDNLN 164
Query: 99 ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQ 157
YQ GDI L+F + + YRREL+L T A ++S NV + REHF S+PDQ
Sbjct: 165 KGSYQDFGDIWLDFSAMGITDDNVQNYRRELNLQTGIASTEFSYKNVSYKREHFVSSPDQ 224
Query: 158 VIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGI 217
V+VT +S SE G L+F+ ++ L+N N ++ + R I K ND +
Sbjct: 225 VMVTNLSASEKGKLNFSAKME--LNND---NLEGKLTFDVRNQTCTIEGKVKDND----L 275
Query: 218 QFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTS 276
+F +++ ++ G I+A E ++ +++ +D +++ A + + + D +K+ ++
Sbjct: 276 KFRTTMKLLLTG--GEITADEKNQVYRIKNADQVTIIMAAETDYKNDYPTYRDKEKNLSN 333
Query: 277 ESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAE 336
+ + SY +L H++D+Q LF RVS+ L + TD ID +
Sbjct: 334 VIDTRINDSSKKSYDELKQTHIEDHQSLFDRVSLDLGEFQTSVPTDQL----IDEYRNGS 389
Query: 337 RVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPT-WDSAPHVNIN 395
+T L FQ+GRYL I+ SR GT +NL G+W + P+ W H N+N
Sbjct: 390 YSHYLET--------LAFQYGRYLTIAGSR-GTLPSNLVGLWT--VGPSAWTGDYHFNVN 438
Query: 396 LEMNYWQSLPCNLSECQEPLFDF---------LTYLSINGSKTAQVNYLASGWVIHHKTD 446
++MNYW NL+EC D+ LT ++G K A N+ +G+ +H + +
Sbjct: 439 VQMNYWPVYSTNLAECGTTFVDYMDKLREPGRLTAERVHGIKGAVDNH--TGFTVHTENN 496
Query: 447 IWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
+ ++ + + P G AW +LW HY +T D +L+ YP+++ A F +L
Sbjct: 497 PFGMTAPTNAQE-YGWNPTGAAWAVQNLWWHYEFTQDEAYLKNTIYPIMKEAAQFWDSYL 555
Query: 507 IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS---------STMDMAIIREVFSAIISA 557
Y + N TSP H + +A S+S +T D ++I E+++ I A
Sbjct: 556 WTSE--YQKINDETSPYH---GENRLVAAPSFSEEQGPTAIGTTYDQSLIWELYNECIQA 610
Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW------------------AQDFKD 599
+++ ++E A+++ + + +L P +I I EW A D +
Sbjct: 611 GKIVGEDE-AVLQSWEEKMQKLDPIEINATNGIKEWYEETRVGQETGHNKSYAKAGDLAE 669
Query: 600 PEV-----------HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
V RH SHL GLFPG I E NP AA ++L +RGE GWS
Sbjct: 670 IAVPNSGWNIGHNGEQRHASHLVGLFPGTLINKE-NPTYMNAAIQSLTERGECSTGWSKA 728
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH------------PP 696
K LWAR + E AY+++ L GL NLF +H P
Sbjct: 729 NKINLWARAENGEKAYKLLNNLIG--------GNSSGLQHNLFDSHGSGGGDTMMNGTPV 780
Query: 697 FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
+QID NFG T+ VAEMLVQS LPA+P D W G V+GLKARG T+ W +G
Sbjct: 781 WQIDGNFGLTSGVAEMLVQSQSGYTQFLPAIP-DAWEKGEVRGLKARGNFTIGEKWANGI 839
Query: 757 LHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCT 803
+ Y N + T Y+ N+++ KIY ++++ T
Sbjct: 840 AEAFTV--RYDGNKDSAVFTGSYK------NITSAKIYEDGKEVQVT 878
>gi|317141175|ref|XP_001817567.2| hypothetical protein AOR_1_3054174 [Aspergillus oryzae RIB40]
Length = 770
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 238/768 (30%), Positives = 375/768 (48%), Gaps = 94/768 (12%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ P F ++P+GNGRLG ++ +P+E + NED++W+G D N +A VR
Sbjct: 34 YDTPGTRFNASLPVGNGRLGGTLYC-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEF----DDSHLKYAEETYRRELD 129
+L+ +G A ++ + G D YQ+L ++ ++ D ++L + Y L+
Sbjct: 93 NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQRGDATNLVW----YLDTLE 148
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
TA +Y V +TRE +S P V+ +I + S +++ N + NG
Sbjct: 149 GYTA---CEYGFDGVSYTRELIASAPSGVLGFRIQTNTSRAINLN----------AVANG 195
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
I+M+ R + F+A + + + D G ++A DK L V G+
Sbjct: 196 IASIVMKAR------------TGEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATT 240
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
V L A SS+ + D +E L + L Y L + D++ L RV+
Sbjct: 241 VVFFLDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVT 294
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRP 367
+ L S D + +P ER+ ++++ D D L+F +GR+LLI+SSR
Sbjct: 295 LDLGSSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRR 344
Query: 368 GTQVA---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
+ + LQGIWN+D SP+W + VNINLEMNYW + NL+E PL+D L +
Sbjct: 345 TRERSLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQE 404
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G A+ + G+V+HH TD+W S +++WPMGGAWL H+ EHY +T D+
Sbjct: 405 RGGDVAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDK 464
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYS 539
FL+++A P+ + F +L + DGYL T PS SPE+ F P GK ++ S
Sbjct: 465 TFLKEQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMS 523
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
T+D +++ E+ +A+ ++LE + D L V + + +GS + F +
Sbjct: 524 PTLDNSMLFELLTALNETHQILEIDND-LSGSV----------QTSSNGS-----RSFAE 567
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWAR 656
+ HR S LFGLFPG +T + L AA L +R G GWS W +L+AR
Sbjct: 568 TDPAHRQFSPLFGLFPGTQLTPLASTKLADAAGVLLDRRMNSGGGSRGWSRAWSISLYAR 627
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
L+ + A+ V+ + L+++ FQID N + AA+ E+L+Q+
Sbjct: 628 LYRGDEAWDNVQAWI-------QTFLLTNLWNSDKGGSTVFQIDGNLDYAAAIPELLLQN 680
Query: 717 TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
++LLPALP +G V GL ARGG V I W+DG L I S
Sbjct: 681 HPGVVHLLPALP-SAVPTGSVSGLVARGGFEVDIAWEDGALTNATITS 727
>gi|359406206|ref|ZP_09198915.1| hypothetical protein HMPREF0673_02149 [Prevotella stercorea DSM
18206]
gi|357556624|gb|EHJ38213.1| hypothetical protein HMPREF0673_02149 [Prevotella stercorea DSM
18206]
Length = 1013
Score = 326 bits (836), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 258/825 (31%), Positives = 390/825 (47%), Gaps = 136/825 (16%)
Query: 8 STTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD 67
+TT L G + A+PIG+G+ GA ++GGV + ++ NE TLW+G P
Sbjct: 216 ATTAKLYSGGQGYSNWMEYALPIGDGQFGACLFGGVYRDEIQFNEKTLWSGTP------- 268
Query: 68 APKALSDVRSLVDSGQYAEATAASVKLFGHPADVY--QLLGDIELEFDDSHLKYAEETYR 125
RS Y + + + +Y L G+ L D A Y
Sbjct: 269 -------ARSSQGGKGYGK--------YENFGSIYAKDLSGEFGLTTDK-----AASNYV 308
Query: 126 RELDLNTATARVKY-SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL--DSLLD 182
R LDL TAT + + S VE+TRE+ +SNP +V+V + S+ G LSF ++ S+
Sbjct: 309 RLLDLTTATGKTMFKSAAGVEYTREYIASNPARVVVAHYTASKGGKLSFRFTMAAGSITA 368
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ +Y +G EG GK NA +K+ GT++ +D+ +
Sbjct: 369 DPTYADG------EGTFSGKLETISYNA-------------RMKVVPVGGTMTT-DDEGI 408
Query: 243 KVEGSDWAVLLLVASSSFDG---PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
+V G+D +++L + FD + + + S+ ++A + S+ DLY H+
Sbjct: 409 EVIGADEIMVVLGGGTDFDAYESTYTKNTSALAQTISDRVAAAAA---KSWKDLYAEHVA 465
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
DYQ F+R L+ + D+ T+ IDT S + L +L F +GRY
Sbjct: 466 DYQSFFNRCEFDLAGTKNDMTTNRL----IDTYNSGRGADALM------LEQLYFAYGRY 515
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
L ISSSR +NLQGIWN W+S H NIN++MNYW + P NLSE P FL
Sbjct: 516 LEISSSRGVDSPSNLQGIWNNINGVAWNSDIHSNINVQMNYWPAEPTNLSEMHLP---FL 572
Query: 420 TYLSINGSKTAQVNYLAS------GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
Y+ K Q A GW + +I+ SA + V + AW TH
Sbjct: 573 NYIWAMAEKQPQWKQWAKLQGQDRGWTCFTENNIFGGVSAFKNNYV-----IANAWYTTH 627
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
LW+HY YT+DR++L KR +P + + F +D L DG E SPEH + +G
Sbjct: 628 LWQHYRYTLDREYL-KRVFPAMLSASQFWMDRLKLASDGTYECPNEWSPEHGPESENG-- 684
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED------ 587
V+++ + + ++FS ++A +VL +DA V + + R +K+ +
Sbjct: 685 --VAHAQQL----VYDLFSNTLAAIDVL--GDDAEVSATDLTTLKDRFSKLDKGLATETY 736
Query: 588 ----GS--------IMEWA-QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKT 634
GS + EW + E HRH+SHL L+P IE +L AA +
Sbjct: 737 TGYFGSAIPTGTKILREWKYSTYTRGENGHRHMSHLMCLYP--FSQIEPGTELFDAAVNS 794
Query: 635 LQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEG--GLYSNLFA 692
++ RG+ GWS+ WK LWAR D +HA ++ H G G++ NLF
Sbjct: 795 MKLRGDGATGWSMGWKMNLWARALDGDHARTILNNAL--------AHSNGGAGVFYNLFD 846
Query: 693 AHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
+H PFQID NFG A +AEM++QS + +LPALP W+ G + G+KA G TVSI W
Sbjct: 847 SHAPFQIDGNFGACAGIAEMIMQSNSGLIRILPALP-SAWTEGHMHGMKAVGDVTVSIDW 905
Query: 753 KDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFN 797
K+G+ V + +NN + + +HY+ NL+ K+Y N
Sbjct: 906 KNGEATRVTL----TNNQGQTMR-VHYK------NLAKAKVYVDN 939
>gi|330819167|ref|YP_004348029.1| hypothetical protein bgla_2g00360 [Burkholderia gladioli BSR3]
gi|327371162|gb|AEA62517.1| hypothetical protein bgla_2g00360 [Burkholderia gladioli BSR3]
Length = 796
Score = 325 bits (834), Expect = 5e-86, Method: Compositional matrix adjust.
Identities = 245/780 (31%), Positives = 377/780 (48%), Gaps = 110/780 (14%)
Query: 13 LKITFN---GPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
L+++++ G + + +P+GNGRLGA+ G E L LNE TLW+G D +P
Sbjct: 65 LRLSYSQAAGESNILFEGLPLGNGRLGALTGGSPVREALYLNEITLWSGQK-DAVDP--- 120
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
Y A S YQ+LG + +E H + Y R LD
Sbjct: 121 -------------AYTAAGMGS----------YQMLGKLYVELP-GHAQ--ASGYSRSLD 154
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
++ A AR +Y G + RE F S+PD+V+V ++S S+ GS +SL + + V G
Sbjct: 155 ISNAVARTQYVAGGHTYRREVFCSHPDKVLVMRLS-SDGGSHDGTISL--VDGQGASVTG 211
Query: 190 NNQIIM-EGRCPG--KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
+N I++ +G+ G +R A D +++ A +G ++ L
Sbjct: 212 SNGILLAQGKLDGVGERYATHVLAMPDSGTVKYDA--------SKGVLTMSRCPAL---- 259
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
L++ A +++ G DP + + + +L Y +L RHL DY LF
Sbjct: 260 ----TLIIAARTNYSGIEAEGYLGATDPAALARADASGAAHLPYRNLLERHLRDYTALFG 315
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSS 365
R S+ L +S + T+P + ++ D DP L L QFGRYL I+SS
Sbjct: 316 RFSLDLGKS--------SDAQRAMTIPDRLKARTASPDIADPELEALYVQFGRYLTIASS 367
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN 425
R G ANLQG+W+ + +P W + H +IN++MNYW + L ECQ+P D++ +
Sbjct: 368 R-GPLPANLQGLWSVNNTPPWMADYHTDINVQMNYWLADRAGLPECQKPFADYVLSQLPS 426
Query: 426 GSKTAQVNY-------------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
+++ Q ++ +GW I T I+ G + W P AW C
Sbjct: 427 WARSTQAHFNDAANSNYSNSSGKVAGWTIAISTGIY-------GGIGWDWSPPASAWYCR 479
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDG 531
LW HY YT+DRD+L + YP+L+ F LI + G L + SPEH D
Sbjct: 480 TLWNHYQYTLDRDYL-RAIYPVLKSACEFWQARLIVDPASGLLVDDRDWSPEHG----DH 534
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED-ALVEKVLKS---LPRLRPTKIAED 587
+ ++Y+ + + ++F+ +A+ L + D A L+S LP++ PT
Sbjct: 535 QELGITYAQEL----VWDLFTNYGTASGTLNLDTDFAATIAGLRSRLYLPKISPTT---- 586
Query: 588 GSIMEWAQDFKDP-EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWS 646
G + EW +D D + HRHLS L G F G I + +P L AA+ L RG + GW
Sbjct: 587 GQLQEWMEDKVDTGDPQHRHLSPLIGWFEGERIAYDSDPALVAAAKALLTARGTDSFGWG 646
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP--FQIDANFG 704
+ W+ A WA+ D Y MV++L + G ++N+F A+ FQIDANFG
Sbjct: 647 LAWRIACWAKFRDAATCYSMVQKLLRFASGSDSTN---GTFTNMFDAYGGNIFQIDANFG 703
Query: 705 FTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
AA+ EMLVQS+++ + LLPALP +W++G VKG++ +GG +V + WKDG L I S
Sbjct: 704 GPAAILEMLVQSSMDSIVLLPALP-PQWNTGSVKGVRVKGGFSVDLAWKDGRLTSAAITS 762
>gi|325855022|ref|ZP_08171738.1| hypothetical protein HMPREF9303_0271 [Prevotella denticola CRIS
18C-A]
gi|325484000|gb|EGC86940.1| hypothetical protein HMPREF9303_0271 [Prevotella denticola CRIS
18C-A]
Length = 753
Score = 325 bits (833), Expect = 7e-86, Method: Compositional matrix adjust.
Identities = 240/801 (29%), Positives = 364/801 (45%), Gaps = 111/801 (13%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T +PIGNG+ GA + G V + ++ N+ TLW+G G T S D G
Sbjct: 1 MTSCLPIGNGQFGATLMGQVAVDDVQFNDKTLWSGKLGGLT------------STADYGS 48
Query: 84 YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
Y FG+ F SH Y R LD+N A A V++ +
Sbjct: 49 YLN--------FGNL-------------FISSHGMKKVTDYVRYLDINNAVAGVQFCMDG 87
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY----VNGNNQ--IIMEG 197
V + R +F+SNPD IV + + S+ G +S ++L + N Y V+ NQ I +G
Sbjct: 88 VAYRRTYFASNPDSCIVIRYTASQRGKISTTLAL--MDQNGGYVRYVVDKVNQATITFDG 145
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
+ A D S ++ + G + ++V +D + L
Sbjct: 146 QI--------ARQKDGGAATPESYCCTARVVTEGGKVRKNAKGLIEVSNADCMTIYLRGL 197
Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
+ FD S + + + S + Y+ L H DY+ LF R L S
Sbjct: 198 TDFDPDAPEYVAGSGRLASRAAATVDSAQRKGYAALLAAHKADYRSLFDRCQFTLGDSKA 257
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQ 375
DI T + + S++ + +L EL F +GRYLLISSSR + ANLQ
Sbjct: 258 DIST-------------PQLISSYRDNPHDNLFLEELYFSYGRYLLISSSRGISLPANLQ 304
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL---TYLSINGSKTAQ- 431
GIWN +P W + H NIN++MNYW + P NLSE P D++ + + + A+
Sbjct: 305 GIWNNSNTPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVRPSWHRFAKD 364
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
+ ++ +GW + + +I+ G + + AW C HLW+HY YTMDR++L RA
Sbjct: 365 MGHVDAGWTLPTENNIYGS-----GTTFADTYTVANAWYCQHLWQHYMYTMDREYLRTRA 419
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
+ +++ + L L++ DG E SPEH P ++ ++ ++F
Sbjct: 420 FSVMKSAVDYWLRKLVKASDGTYECPDEWSPEH---GP------TENATAHSQQLVWDLF 470
Query: 552 SAIISAAEVLEKNEDALVEKVLK-----SLPRLRPTKIAE----DGS--IMEW--AQDFK 598
++ A +VL D +V + + RL E DG + EW F
Sbjct: 471 NSTRKAIKVL---GDDMVSRTFRDSLAGCFARLDDGCHTEVNPADGQTYLREWKYTSQFD 527
Query: 599 DPE-------VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE-EGPGWSITWK 650
+P+ HRH+SHL GL+P I+ + + + +AA +L RG+ G GWS+ K
Sbjct: 528 NPDRVGVDEYRTHRHISHLMGLYPCSQISEDGDMTVFRAARTSLLARGDGHGTGWSLGHK 587
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
L AR H+ H + +++R GG+Y NL+ AH P+QID NFG+TA +A
Sbjct: 588 INLNARAHEGLHCHNLIRRALQQTWSTDVDERAGGIYENLWDAHAPYQIDGNFGYTAGIA 647
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNND 770
EML+QS L +LPALP D W+ G VKGLKA G TV I W E+ I S+
Sbjct: 648 EMLLQSYNGKLVILPALPTDFWTKGAVKGLKAVGNFTVDITWAKARAEEIRIVSHAG--- 704
Query: 771 HDSFKTLHYRGTSVKVNLSAG 791
+ + Y G + L+AG
Sbjct: 705 --TVCVVKYAGVADDFKLTAG 723
>gi|169604462|ref|XP_001795652.1| hypothetical protein SNOG_05244 [Phaeosphaeria nodorum SN15]
gi|160706577|gb|EAT87635.2| hypothetical protein SNOG_05244 [Phaeosphaeria nodorum SN15]
Length = 771
Score = 323 bits (829), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 231/765 (30%), Positives = 360/765 (47%), Gaps = 79/765 (10%)
Query: 6 STSTTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN 65
S STT I F P +TDA+P+GNGRLGA++ GG E + LNED++W+G N
Sbjct: 21 SASTT----IWFGKPGVIWTDALPVGNGRLGAVIHGGYGMEQVGLNEDSIWSGGLQKRIN 76
Query: 66 PDAPKALSDVRSLVDSGQYAEATAA---SVKLFGHPADVYQLLGDIELEFDDSHLKYAEE 122
+A A + +G ++A ++K G YQ G++ +EF + +
Sbjct: 77 SNALAAFPGIPEAFTNGNISKADEIWHNNLKGTGTQVRQYQPAGNMMIEFGQN--VSSVS 134
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
Y R LDL T V Y+ +V + R+ +S P + + + ++G+L +SL
Sbjct: 135 GYNRSLDLTTGENHVSYTRNDVTYLRQALASYPHDTLGFRYTADKAGALDMKISLT---- 190
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V G ++ I +D ++F + I++ D G K++
Sbjct: 191 RNESVTG-----LKVDLEKLSITMYGQGTNDSS-LKF--VHSIRVVADTG------GKEV 236
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
++ A ++F + +++ + + L + + + + ++ ++DY+
Sbjct: 237 RI--------YYGAETTFRHANVEAAEAAMN------AKLDAAVAVPWEEFKSKAIEDYK 282
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD----EDPSLVELLFQFGR 358
L RV + D S I + + +R+K++ T DP L+ L + +GR
Sbjct: 283 NLADRVQL-----------DVGSSGEIGRLDTGQRLKNWNTTGNATSDPELMALTYNYGR 331
Query: 359 YLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
+LLI SSR G+ +NLQG+WN+ P W S +NIN EMNYW + NL+E P+FD
Sbjct: 332 FLLIGSSRIGSLPSNLQGVWNDKFKPPWGSRFTININTEMNYWPAETTNLAETHLPVFDH 391
Query: 419 LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
L + G A+ Y SGWV HH TD+W + WA P+GGAWL HL EH+
Sbjct: 392 LLRMQEQGRYVAKGMYNMSGWVCHHNTDLWGDCVPVDDQTYWAANPVGGAWLALHLIEHF 451
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK-----L 533
+ + + A P+L +F D+ I+ D Y +SPE+ + P K
Sbjct: 452 RFNGNTTWASSTALPILSDALTFFYDFSIKKGD-YNALIYDSSPENSYHIPSNKQVPNAT 510
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
+ S ++ E+FS I +E + V K L + P +A DG ++EW
Sbjct: 511 TGIDQGSAHPRQVLHELFSGFIEMSEATGSIDG--VAKAKDYLAHIEPPNVATDGHLLEW 568
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWK 650
+ DF++ E HRHLSHL G++PG I+ N AA +L R + GWS W
Sbjct: 569 SGDFRETEPGHRHLSHLLGVYPGGHISPLINKTASDAALVSLDNRIAASTDPIGWSKVWA 628
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH-PPFQIDANFGFTAAV 709
++ARL D + K F+L D L NLF + FQID N GFT ++
Sbjct: 629 AGIYARLFDGD------KAAFHLCDL-----ISNYLAGNLFDLNIGVFQIDGNLGFTGSM 677
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
E+ +QS ++L PALP + G V GL ARGG VS+ WKD
Sbjct: 678 TELFLQSHAGVVHLAPALPSNLIPEGSVSGLVARGGFVVSVKWKD 722
>gi|327313293|ref|YP_004328730.1| hypothetical protein HMPREF9137_1029 [Prevotella denticola F0289]
gi|326946180|gb|AEA22065.1| conserved hypothetical protein [Prevotella denticola F0289]
Length = 753
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 239/801 (29%), Positives = 364/801 (45%), Gaps = 111/801 (13%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T +PIGNG+ GA + G V + ++ N+ TLW+G G T S D G
Sbjct: 1 MTSCLPIGNGQFGATLMGQVAVDDVQFNDKTLWSGKLGGLT------------STADYGS 48
Query: 84 YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
Y FG+ F SH Y R LD+N A A V++ +
Sbjct: 49 YLN--------FGNL-------------FISSHGMRKVTDYVRYLDINNAVAGVQFCIDG 87
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY----VNGNNQ--IIMEG 197
V + R +F+S+PD IV + + S+ G +S ++L + N Y V+ NQ I +G
Sbjct: 88 VAYRRTYFASSPDSCIVIRYTASQRGKISTTLAL--MDQNGGYVRYVVDKVNQATITFDG 145
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
+ A D S ++ + G + ++V +D + L
Sbjct: 146 QI--------ARQKDGGAATPESYCCTARVVTEGGKVRKNARGLIEVINADCMTVYLRGL 197
Query: 258 SSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
+ FD + + + S + Y+ L H DY+ LF R + L S
Sbjct: 198 TDFDPDAPEYVAGAGRLAGRAAATVDSAQRRGYAALLAAHKADYRSLFDRCQLTLGDSKA 257
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQ 375
DI T + + S++ + +L EL F +GRYLLISSSR + ANLQ
Sbjct: 258 DIST-------------PQLISSYRDNPHDNLFLEELYFSYGRYLLISSSRGVSLPANLQ 304
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL---TYLSINGSKTAQ- 431
GIWN +P W + H NIN++MNYW + P NLSE P D++ + + + A+
Sbjct: 305 GIWNNSNTPAWHADIHANINVQMNYWPAEPTNLSELHRPFLDYIYREACVRPSWHRFAKD 364
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
+ ++ +GW + + +I+ G + + AW C HLW+HY YTMDR++L RA
Sbjct: 365 MGHVDAGWTLPTENNIYGS-----GTTFADTYTVANAWYCQHLWQHYMYTMDREYLRTRA 419
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
+P+++ + L L++ DG E SPEH P ++ ++ ++F
Sbjct: 420 FPVMKSAVDYWLRKLVKASDGTYECPDEWSPEH---GP------TENATAHSQQLVWDLF 470
Query: 552 SAIISAAEVLEKNEDALVEKVLK-----SLPRLRPTKIAE----DGS--IMEW--AQDFK 598
++ A +VL D +V + + RL E DG + EW F
Sbjct: 471 NSTRKAIKVL---GDDMVSRTFRDSLAGCFARLDDGCHTEVNPADGQTYLREWKYTSQFD 527
Query: 599 DPE-------VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE-EGPGWSITWK 650
+P HRH+SHL GL+P I+ + + + +AA +L RG+ G GWS+ K
Sbjct: 528 NPGRVGVDEYRTHRHISHLMGLYPCSQISEDGDKTVFRAARTSLLARGDGHGTGWSLGHK 587
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
L AR H+ H + +++R GG+Y NL+ AH P+QID NFG+TA +A
Sbjct: 588 INLNARAHEGLHCHNLIRRALQQTWSTDVDERAGGIYENLWDAHAPYQIDGNFGYTAGIA 647
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNND 770
EML+QS L +LPALP D W+ G VKGLKA G TV I W E+ I S+
Sbjct: 648 EMLLQSYNGKLVILPALPTDFWTKGAVKGLKAVGNFTVDITWVKARAEEIRIVSHAG--- 704
Query: 771 HDSFKTLHYRGTSVKVNLSAG 791
+ + Y G + L+AG
Sbjct: 705 --TVCVVKYAGVADDFKLTAG 723
>gi|336431570|ref|ZP_08611417.1| hypothetical protein HMPREF0991_00536, partial [Lachnospiraceae
bacterium 2_1_58FAA]
gi|336011929|gb|EGN41858.1| hypothetical protein HMPREF0991_00536 [Lachnospiraceae bacterium
2_1_58FAA]
Length = 1869
Score = 323 bits (827), Expect = 3e-85, Method: Compositional matrix adjust.
Identities = 253/870 (29%), Positives = 406/870 (46%), Gaps = 130/870 (14%)
Query: 6 STSTTNPLKITFNGPAKHFTD----------AIPIGNGRLGAMVWGGVPSETLKLNEDTL 55
+ S + LK+ + PA T ++P+GNG LG +++GG+ E + NE TL
Sbjct: 40 TESISQSLKLWYTSPANINTQETNGGEWMQQSLPLGNGNLGNLIFGGISKERIHFNEKTL 99
Query: 56 WTGVP---------GDYTNPDAPKALSDVRSLVDS------------GQYAEATAASVKL 94
WTG P G+ + + + R L+D G Y A +K
Sbjct: 100 WTGGPSPSRPGYQFGNKATAYTDEEIENYRKLLDDKSTKVFNDDQSLGGYG--MGAQIKF 157
Query: 95 FGHP---ADVYQLLGDIELEFDDSHLKYAE-ETYRRELDLNTATARVKYSVGNVEFTREH 150
G YQ GDI L+F L+ + YRRELDL T A ++S +V + REH
Sbjct: 158 PGENNLNKGSYQDFGDIWLDFSKMGLQDQNVKNYRRELDLQTGVASTEFSYEDVNYKREH 217
Query: 151 FSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGNNQIIMEGRCPGKRIPPK 207
F SNPDQ++VTK+S SESG L +V ++ + L+ + + NQ C I K
Sbjct: 218 FVSNPDQIMVTKLSASESGKLDLSVKMELNNNGLEGKTTFDSENQT-----CT---IEGK 269
Query: 208 ANANDDPKGIQFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPFIN 266
ND ++F +++ + + G + E ++ ++E ++ ++++ A + + +
Sbjct: 270 VKDND----LKFYTTMKLVL--EGGDLEVDEKNQVYQIEDANQVMIVMAAETDYKNDYPT 323
Query: 267 PSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSE 326
D +K+ + S SY L +H+ D+QKLF RVS+ L +I
Sbjct: 324 YRDKEKNLKKMVDDRVNSNAKKSYQKLKEKHIADHQKLFDRVSLDLGEQRTNI------- 376
Query: 327 ENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYLLISSSRPGTQVANLQGIWNEDLSPT 385
P+ + V ++ +E+L FQ+GRYL I+ SR GT +NL G+W S
Sbjct: 377 ------PTNQLVDEYRNGTYSHYLEVLAFQYGRYLTIAGSR-GTLPSNLVGLWTVGDSA- 428
Query: 386 WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ-VNYLA------SG 438
W H N+N++MNYW NL+EC D++ L G TA+ V+ + +G
Sbjct: 429 WTGDYHFNVNVQMNYWPVYTTNLAECGVTFVDYMDKLREPGRLTAERVHGIEGAVENHTG 488
Query: 439 WVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGC 498
+ +H + + + ++ + + P G AW +LW HY +T + D+L+ YP+++
Sbjct: 489 FTVHTENNPFGMTAPTNAQE-YGWNPTGAAWAIQNLWWHYEFTQNEDYLKNTIYPIMKEA 547
Query: 499 ASFLLD--WLIEGHDGYLETNPSTSPEHEFIAPD--GKLACVSYSSTMDMAIIREVFSAI 554
A F W E E++P + +AP + + +T D +++ E++
Sbjct: 548 AQFWDSYLWTSEYQKINDESSPYNGQDRLVVAPSFSEEQGPTAIGTTYDQSLVWELYKEC 607
Query: 555 ISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD--------------- 599
I A +++ ++E AL++ +++ +L P +I E I EW ++ +
Sbjct: 608 IQAGKIVGEDE-ALLKSWEENMQKLDPIEINETNGIKEWYEETRVGQKNGHNRSYAKAGN 666
Query: 600 -PEVH-------------HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGW 645
PE+ RH SHL GLFPG I E N + AA ++L +RGE GW
Sbjct: 667 LPEIEVPNSGWDIGHPGEQRHSSHLVGLFPGTLINKE-NKEYMDAAIQSLTERGEYSTGW 725
Query: 646 SITWKTALWARLHDQEHAYRMVKRL---------FNLVDPEHEKHFEGGLYSNLFAAHPP 696
S K LWAR + E AY+++ L +NL D H GG + +P
Sbjct: 726 SKANKINLWARTENGEKAYKLLNNLIGGNSSGLQYNLFDS----HGSGG-GETMKNGNPV 780
Query: 697 FQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGD 756
+QID NFG T+ VAEMLVQS LPA+P + W G ++GLKARG T+ W +G
Sbjct: 781 WQIDGNFGLTSGVAEMLVQSQSGYTQFLPAIP-NAWEEGNIQGLKARGNFTIGEKWANG- 838
Query: 757 LHEVGIYSNYSNNDHDSFKTLHYRGTSVKV 786
+ E N+ ++F + TS KV
Sbjct: 839 VAETFTVRYDGENESNTFTGSYKNITSAKV 868
>gi|358399331|gb|EHK48674.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
206040]
Length = 797
Score = 322 bits (824), Expect = 7e-85, Method: Compositional matrix adjust.
Identities = 233/805 (28%), Positives = 361/805 (44%), Gaps = 81/805 (10%)
Query: 17 FNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDV 75
+ PA + T +PIGN RLGA ++GG +E + +NEDTLW G + + AL V
Sbjct: 30 YTSPATDWETGVLPIGNSRLGAAIFGGA-NEVVTINEDTLWDGPLQNRIPANGLAALPKV 88
Query: 76 RSLVDSGQYAEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
R ++++ A + P + G++ L F H Y R LD
Sbjct: 89 RQMLEANSLTAAGNLVLSQMTPPISGERQFSYFGNLNLNF--GHSSGGISNYIRSLDTRQ 146
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS---LDSLLDN-HSYVN 188
+ V Y+ V +TRE+ +S P VI + + S++G+LS + + + ++L N S
Sbjct: 147 GNSSVSYTYNGVTYTREYVASTPAGVIAARFTASKAGALSVSATFSRISNILSNVASTSG 206
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
G N + ++G A+D+P I F+ + S G + L + G+
Sbjct: 207 GANTLTLQGSS-------GQAASDNP--ILFTGTAQFVAS---GATFSTSGGTLTISGAT 254
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ + +S+ P S D ++ S L + + + ++ + D L R
Sbjct: 255 TIDVFIDVETSYRYP------SASDLAAQVNSKLSAAVSQGFQKIHDGAIADASALLGRA 308
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRP 367
+I L SP + + + + +RVK+ ++ DP L L + +GR+LL++SSR
Sbjct: 309 NINLGTSPNGLAS----------LSTDQRVKNARSSFNDPQLAVLAWNYGRHLLVASSR- 357
Query: 368 GTQVA-----NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
T A NLQG+WN S W +NIN EMN W + NL E Q PLFD +
Sbjct: 358 NTSAAIDMPPNLQGVWNNQTSAPWGGKFTININTEMNLWPAGQTNLIETQLPLFDLMKVA 417
Query: 423 SINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
G + AQ Y +G V HH D+W + +WPMG WL H+ E Y +
Sbjct: 418 QPRGQQMAQDLYGCNGTVFHHNLDVWGDPAPTDNYTSSTMWPMGATWLVQHMIEQYRFGG 477
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAP-----DGKLACVS 537
D + L YP L + FL + G L T PS SPE+ ++ P G+ +
Sbjct: 478 DLNLLRSATYPYLLDISKFLQCYTFS-WQGNLVTGPSLSPENTYVVPSNATVSGQQEPMD 536
Query: 538 YSSTMDMAIIREVFSAIISAAEVLE-KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
+ MD ++R+V II AA L + D+ V+ +P++R +I G I+EW +
Sbjct: 537 LAPEMDNQLMRDVMKGIIEAAAALGISSSDSNVQAATNFIPQIRTPRIGSYGQILEWRYE 596
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWSITWKTAL 653
+ + + HRHLS ++GL P + + N L AA+ L R G GWS TW
Sbjct: 597 YGETDPGHRHLSPMYGLHPSNQFSPLVNTTLSAAAKALLDHRVASGSGSTGWSRTWLMNQ 656
Query: 654 WARLHDQEHAYR-MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
+ARL ++ +V P +G FQID NFG T+ + EM
Sbjct: 657 YARLFSGADVWKHLVAWFAEYPTPNLWNTNDGST----------FQIDGNFGLTSGLTEM 706
Query: 713 LVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHD 772
L+QS ++LLPALP +G +GL ARGG V I W G L + S
Sbjct: 707 LLQSQTGTVHLLPALPGSNIPTGSAQGLMARGGFEVDINWSGGSLTSATVTST------- 759
Query: 773 SFKTLHYRGTSVKVNLSAGKIYTFN 797
RG S+ + ++ G+ + N
Sbjct: 760 -------RGGSLTLRVAGGQSFKVN 777
>gi|210614863|ref|ZP_03290362.1| hypothetical protein CLONEX_02576 [Clostridium nexile DSM 1787]
gi|210150505|gb|EEA81514.1| hypothetical protein CLONEX_02576 [Clostridium nexile DSM 1787]
Length = 1797
Score = 321 bits (823), Expect = 9e-85, Method: Compositional matrix adjust.
Identities = 261/896 (29%), Positives = 413/896 (46%), Gaps = 149/896 (16%)
Query: 8 STTNPLKITFNGPAKHFT----------DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
S LK+ + PAK T ++P+GNG LG +++GG+ E + NE TLWT
Sbjct: 43 SINQELKLWYTSPAKIDTAETNGGEWMQQSLPLGNGNLGNLIFGGIAKERIHFNEKTLWT 102
Query: 58 GVPG----DYTNPDAPKALSDV-----RSLVDS------------GQYAEATAASVKLFG 96
G P +Y + A +D R L+D G Y A +K G
Sbjct: 103 GGPSSSRPNYQFGNKATAYTDTEIEEYRKLLDDKSTNVFNDDKSLGGYG--MGAKIKFPG 160
Query: 97 HP---ADVYQLLGDIELEF-----DDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTR 148
YQ GDI L+F +D+++K YRRELD+ T A ++S +V + R
Sbjct: 161 ENNLNKGSYQDFGDIWLDFSKMGINDNNVK----DYRRELDIQTGIAATEFSCKDVTYKR 216
Query: 149 EHFSSNPDQVIVTKISGSESGSLSFNVSLD---SLLDNHSYVNGNNQIIMEGRCPGKRIP 205
EHF SNPDQV+VT++S SE G L NV ++ S L+ + + NQ C I
Sbjct: 217 EHFVSNPDQVMVTELSASEKGKLDLNVKMELNNSGLEGKTTFDEKNQT-----CT---IE 268
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL-KVEGSDWAVLLLVASSSFDGPF 264
K ND ++F +++ ++ G +SA E ++ +++ +D ++++ A + + +
Sbjct: 269 GKVKDND----LKFCTTMKLVLTG--GKLSADEKNQVYQIQDADCVMIVMAAETDYKNDY 322
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
D KD + + SY +L H+ D+Q LF RVS+ L
Sbjct: 323 PTYRDKNKDLKKVVADRVNNGTKKSYDELKETHIADHQGLFDRVSLDLG----------- 371
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELL-FQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
E +VP+ + V ++ +E+L FQ+GRYL I+ SR GT +NL G+W S
Sbjct: 372 --EQRTSVPTNQLVDEYRNGNYSHYLEVLAFQYGRYLTIAGSR-GTLPSNLVGLWTVGNS 428
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ-VNYLA------ 436
W H N+N++MNYW NL+EC D++ L G TA+ V+ +
Sbjct: 429 A-WTGDYHFNVNVQMNYWPVYATNLAECGTTFVDYMDKLREPGRLTAERVHGIEGAVKNH 487
Query: 437 SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLE 496
+G+ +H + + + ++ + + P G AW +LW HY +T D +L+ YP+++
Sbjct: 488 TGFTVHTENNPFGMTAPTNAQ-EYGWNPTGAAWAIQNLWWHYEFTQDEAYLKNTIYPIMK 546
Query: 497 GCA----SFLLDWLIEGHDGYLETNPSTSPEHEFIAP--DGKLACVSYSSTMDMAIIREV 550
A S+L W E E +P +AP + + +T D +++ E+
Sbjct: 547 EAALFWDSYL--WTSEYQKINDENSPYNGQNRLVVAPSFSEEQGPTAVGTTYDQSLVWEL 604
Query: 551 FSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW----------------- 593
++ I A +++ ++E AL++ + + +L P +I + I EW
Sbjct: 605 YNECIKAGKIVGEDE-ALLKSWEEKMQKLDPIEINDTNGIKEWYEETRVGQKNGHNQSYA 663
Query: 594 -AQDFKDPEV-----------HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEE 641
A D + EV RH SHL GLFPG T+ + N + AA ++L +RGE
Sbjct: 664 QAGDLAEIEVPNSGWNIGHLGEQRHASHLVGLFPG-TLINKDNEEYMNAAIQSLTERGEY 722
Query: 642 GPGWSITWKTALWARLHDQEHAYRMVKRL---------FNLVDPEHEKHFEGGLYSNLFA 692
GWS K LWAR + E AY ++ L +NL D H GG +
Sbjct: 723 STGWSKANKINLWARTENGEKAYTLLNHLIGGNSSGLQYNLFDS----HGSGG-GDTMMN 777
Query: 693 AHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
P +QID NFG T+ VAEMLVQS LPA+P W G V+GLKARG T+ W
Sbjct: 778 GTPVWQIDGNFGLTSGVAEMLVQSQSGYTQFLPAIP-SAWEEGSVQGLKARGNFTIGEKW 836
Query: 753 KDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQS 808
+G + Y + S T Y ++++ K+Y ++++ T ++
Sbjct: 837 ANGVAETFTVC--YDGDKESSTFTGSYE------DITSAKVYADGKEIEVTKEEET 884
>gi|154505582|ref|ZP_02042320.1| hypothetical protein RUMGNA_03121 [Ruminococcus gnavus ATCC 29149]
gi|153794240|gb|EDN76660.1| hypothetical protein RUMGNA_03121, partial [Ruminococcus gnavus
ATCC 29149]
Length = 1873
Score = 320 bits (821), Expect = 1e-84, Method: Compositional matrix adjust.
Identities = 247/839 (29%), Positives = 396/839 (47%), Gaps = 120/839 (14%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDAPKALSDVRS 77
++P+GNG LG +++GG+ E + NE TLWTG P G+ + + + R
Sbjct: 4 SLPLGNGNLGNLIFGGISKERIHFNEKTLWTGGPSPSRPGYQFGNKATAYTDEEIENYRK 63
Query: 78 LVDS------------GQYAEATAASVKLFGHP---ADVYQLLGDIELEFDDSHLKYAE- 121
L+D G Y A +K G YQ GDI L+F L+
Sbjct: 64 LLDDKSTKVFNDDQSLGGYG--MGAQIKFPGENNLNKGSYQDFGDIWLDFSKMGLQDQNV 121
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD--- 178
+ YRRELDL T A ++S +V + REHF SNPDQ++VTK+S SESG L +V ++
Sbjct: 122 KNYRRELDLQTGVASTEFSYEDVNYKREHFVSNPDQIMVTKLSASESGKLDLSVKMELNN 181
Query: 179 SLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ L+ + + NQ C I K ND ++F +++ + + G + E
Sbjct: 182 NGLEGKTTFDPENQT-----CT---IEGKVKDND----LKFYTTMKLVL--EGGDLEVDE 227
Query: 239 DKKL-KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
++ ++E ++ ++++ A + + + D +K+ + S SY L +H
Sbjct: 228 KNQVYQIEDANQVMIVMAAETDYKNDYPTYRDKEKNLKKMVDDRVNSNAKKSYQKLKEKH 287
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL-FQF 356
+ D+QKLF RVS+ L +I P+ + V ++ +E+L FQ+
Sbjct: 288 IADHQKLFDRVSLDLGEQRTNI-------------PTNQLVDEYRNGTYSHYLEVLAFQY 334
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GRYL I+ SR GT +NL G+W S W H N+N++MNYW NL+EC
Sbjct: 335 GRYLTIAGSR-GTLPSNLVGLWTVGDSA-WTGDYHFNVNVQMNYWPVYTTNLAECGVTFV 392
Query: 417 DFLTYLSINGSKTAQ-VNYLA------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
D++ L G TA+ V+ + +G+ +H + + + ++ + + P G AW
Sbjct: 393 DYMDKLREPGRLTAERVHGIEGAVENHTGFTVHTENNPFGMTAPTNAQE-YGWNPTGAAW 451
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD--WLIEGHDGYLETNPSTSPEHEFI 527
+LW HY +T + D+L+ YP+++ A F W E E++P + +
Sbjct: 452 AIQNLWWHYEFTQNEDYLKNTIYPIMKEAAQFWDSYLWTSEYQKINDESSPYNGQDRLVV 511
Query: 528 APD--GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIA 585
AP + + +T D +++ E++ I A +++ ++E AL++ +++ +L P +I
Sbjct: 512 APSFSEEQGPTAIGTTYDQSLVWELYKECIQAGKIVGEDE-ALLKSWEENMQKLDPIEIN 570
Query: 586 EDGSIMEWAQDFKD----------------PEVH-------------HRHLSHLFGLFPG 616
E I EW ++ + PE+ RH SHL GLFPG
Sbjct: 571 ETNGIKEWYEETRVGQKNGHNRSYAKAGNLPEIEVPNSGWDIGHPGEQRHSSHLVGLFPG 630
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRL------ 670
I E N + AA ++L +RGE GWS K LWAR + E AY+++ L
Sbjct: 631 TLINKE-NKEYMDAAIQSLTERGEYSTGWSKANKINLWARTENGEKAYKLLNNLIGGNSS 689
Query: 671 ---FNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPAL 727
+NL D H GG + +P +QID NFG T+ VAEMLVQS LPA+
Sbjct: 690 GLQYNLFDS----HGSGG-GETMKNGNPVWQIDGNFGLTSGVAEMLVQSQSGYTQFLPAI 744
Query: 728 PWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKV 786
P + W G ++GLKARG T+ W +G + E N+ ++F + TS KV
Sbjct: 745 P-NAWEEGNIQGLKARGNFTIGEKWANG-VAETFTVRYDGENESNTFTGSYKNITSAKV 801
>gi|282881164|ref|ZP_06289851.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1]
gi|281304968|gb|EFA97041.1| conserved hypothetical protein [Prevotella timonensis CRIS 5C-B1]
Length = 1008
Score = 320 bits (820), Expect = 2e-84, Method: Compositional matrix adjust.
Identities = 232/765 (30%), Positives = 360/765 (47%), Gaps = 101/765 (13%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T +PIGNG+ G V GGV + ++ N+ TLW G V ++V +
Sbjct: 206 MTSTLPIGNGQFGGCVMGGVKRDEVQFNDKTLWKG---------------HVGAVVGNPN 250
Query: 84 YAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGN 143
Y Y G++ + DS L A YRR LD++ A A V Y+
Sbjct: 251 YGS---------------YLNFGNLYITSTDSRLN-AATNYRRWLDIDQAKAGVAYTANG 294
Query: 144 VEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ---IIMEGRCP 200
V++ RE+ S PD+VI SE G +S N+ L + N N I +G P
Sbjct: 295 VDYQREYICSFPDKVIAIHYKASEKGKISNNIILFNQNGKTPTYNMNGTTGVITFQGEVP 354
Query: 201 GKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSF 260
PKG + + ++ GTI+ +D + V+ +D + L +++F
Sbjct: 355 ---------RTGTPKGESY--YCKAYVTAKGGTIAVGKDGGIDVKNADEMFIYLYGTTNF 403
Query: 261 DGP---FINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPK 317
D +I SD+ P S + + + Y+ + H++DY+ L+ R + ++++
Sbjct: 404 DASNDEYI--SDAALLP-SHVTGVVDAALSKGYAAICDAHVEDYKALYDRCQLNITKA-- 458
Query: 318 DIVTDTCSEENIDTVPSAERVKSFQTDEDPSLV--ELLFQFGRYLLISSSRPGTQVANLQ 375
+ +V + + + F +L+ E+ F +GRYL+ISSSR +NLQ
Sbjct: 459 -----------MPSVTTRKLIADFAISPADNLLLEEIYFCYGRYLMISSSRGVDLPSNLQ 507
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING-------SK 428
GIWN +P W+S H NIN++MNYW + NLSE P FL Y+ +
Sbjct: 508 GIWNNVNNPAWNSDIHSNINVQMNYWPAEITNLSELHLP---FLKYIHREACERPQWRAN 564
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL-WPMGGAWLCTHLWEHYNYTMDRDFL 487
Q+ GW + + +I+ S W + + AW C HLW+HY +T+D+++L
Sbjct: 565 ARQIAGQTVGWTLTTENNIYGSGSN------WMQNYTIANAWYCMHLWQHYRFTLDKEYL 618
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ AYP + CA + L L++ DG E SPEH P + A + ++
Sbjct: 619 KNIAYPAMRSCAEYWLQRLVKAADGTYECPNEFSPEH---GPGSENA-----TAHSQQLV 670
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS-----IMEW---AQDFKD 599
++F+ + A L +EDA+ L + + T +A + + EW +Q
Sbjct: 671 WDLFNNTLQAIAELGISEDAIFLNDLNNKFKKLDTGLAIENVNGQPLLREWKYTSQASVS 730
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHD 659
HRH+SHL GL+PG+ I + + ++ +AA +L+ RG EG GWS+ WK L AR +
Sbjct: 731 SYNSHRHMSHLMGLYPGNQIGRDIDANIYEAALNSLKTRGYEGTGWSMGWKVNLHARARN 790
Query: 660 QEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLN 719
R++K + D GG+Y NL+ AH P+QID NFG A +AEML+QS L
Sbjct: 791 GNVCQRLLKTALHFQDYTGNSE-GGGVYENLWDAHTPYQIDGNFGACAGMAEMLLQSHLG 849
Query: 720 DLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L +LPALP W +G VKGL A VSI WK+ + I S
Sbjct: 850 KLDILPALP-SMWKNGSVKGLCAVDNFEVSIEWKNNKAVSIEIVS 893
>gi|260588898|ref|ZP_05854811.1| putative fibronectin type III domain protein [Blautia hansenii DSM
20583]
gi|260540677|gb|EEX21246.1| putative fibronectin type III domain protein [Blautia hansenii DSM
20583]
Length = 744
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 235/786 (29%), Positives = 369/786 (46%), Gaps = 85/786 (10%)
Query: 21 AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVD 80
AK + +P+GNG+ GA++ GGV E + LNE++LW G + + L VR L++
Sbjct: 11 AKSWEQGLPVGNGQQGAVLLGGVQQERIVLNEESLWYGGKRERAVEAGKEKLEKVRELLE 70
Query: 81 SGQYAEATAASVKLF-GHP--ADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
G+ ++A + F G+P + Y + L F+ K E Y R +DL A V
Sbjct: 71 KGEASKAQTLCSRWFVGNPRYTNPYHPAAEAVLNFEPFG-KVKE--YFRGIDLEKGEAGV 127
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
K N + RE FSS QV ++ + +SF++ L+
Sbjct: 128 KICFDNCKTVREIFSSVKYQVTALRMETDKEQGMSFSLGLN------------------- 168
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVAS 257
R P + NA + + I + + D D ++ VEG LLV
Sbjct: 169 -----RRPFEENAEVEDREISLNGHSGDGVCYDVRCRVGKTDGRVCVEGG----YLLVER 219
Query: 258 SSFDGPF--INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
+S+ F + K+ + L++ + + ++ H+++Y +L++ + +++ +
Sbjct: 220 ASYVEIFFCVRTDYESKECLDKCSRLLKAAAKVGFEEIKKAHIEEYGRLYNNMRLEIEGA 279
Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS----LVELLFQFGRYLLISSSRPGTQV 371
E + +P+ E +K E+P L+ L+F + RYLLISSS
Sbjct: 280 -----------EELAQIPADELLKRC---EEPKVQGYLIWLMFSYARYLLISSSYGCALP 325
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
ANLQGIWN +P W+S +NINL+MNYW + L C E F+ + + NG KTA+
Sbjct: 326 ANLQGIWNGSFTPPWESGYTININLQMNYWMADRAGLGVCYESFFNLIEKMLPNGRKTAK 385
Query: 432 VNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRA 491
Y G+V HH T++W + + LWPMGGAW+ L+ H + + + +R
Sbjct: 386 KVYACRGFVAHHNTNLWGDTDITGLWLPAFLWPMGGAWMANQLYHHSEFEENPKEIRERV 445
Query: 492 YPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+++ C F D+L D + P+ SPE+ + DG+ A V+ MD IIRE+
Sbjct: 446 LPVMKECILFFYDYLYRKSDKMWISGPTVSPENTYRLLDGQEASVAMGVAMDHQIIRELA 505
Query: 552 SAIISAAEVL-----EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+ E + + +++L+ LP PTKI + G I+EW +++++ E HRH
Sbjct: 506 ENYLEGCRRYNTGSPEYETEKMAQEILEHLP---PTKIGKSGRILEWQEEYEEVEKGHRH 562
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHA 663
+SHL+GL PG I+ E P L +AA++TL+ R E G GWS W +ARL D++
Sbjct: 563 ISHLYGLHPGREIS-EDTPALFEAAKRTLEYRLEHGGGHTGWSKAWIMCFYARLKDKKKF 621
Query: 664 -YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLY 722
+M + L N VD NL+ HPPFQID NFG AV E L + +
Sbjct: 622 DEQMRQFLANSVD------------ENLWDIHPPFQIDGNFGMAKAVLEALASRRGDVVE 669
Query: 723 LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGT 782
LL +P + +G V GL G V WK G L ++ + S + L Y G
Sbjct: 670 LLRIIP-EGMETGMVTGLCLEGRLKVDFAWKCGKLTKISLSSGKTQTIE-----LRYCGI 723
Query: 783 SVKVNL 788
V L
Sbjct: 724 RRSVTL 729
>gi|67902324|ref|XP_681418.1| hypothetical protein AN8149.2 [Aspergillus nidulans FGSC A4]
gi|74593077|sp|Q5AU81.1|AFCA_EMENI RecName: Full=Alpha-fucosidase A; AltName: Full=Alpha-L-fucoside
fucohydrolase A; Flags: Precursor
gi|40739981|gb|EAA59171.1| hypothetical protein AN8149.2 [Aspergillus nidulans FGSC A4]
gi|95025957|gb|ABF50892.1| alpha-fucosidase [Emericella nidulans]
gi|259480915|tpe|CBF73981.1| TPA: Alpha-fucosidasePutative uncharacterized protein ;
[Source:UniProtKB/TrEMBL;Acc:Q5AU81] [Aspergillus
nidulans FGSC A4]
Length = 809
Score = 319 bits (817), Expect = 4e-84, Method: Compositional matrix adjust.
Identities = 243/782 (31%), Positives = 380/782 (48%), Gaps = 90/782 (11%)
Query: 30 IGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KALSDVRSLVDSG 82
IGNG+LG + +G +E L LN D+LW+G P +YT NP +P AL +R +
Sbjct: 46 IGNGKLGVIPFGPPDTEKLNLNVDSLWSGGPFEVENYTGGNPSSPIYDALPGIRERI--- 102
Query: 83 QYAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYS 140
+ T +L G + ++LG+I + D A Y+R LDL+ R ++
Sbjct: 103 -FENGTGGMEELLGSGNHYGSSRVLGNITIALDGVE---AYSKYKRTLDLSDGVHRTSFT 158
Query: 141 VGN---VEFTREHFSSNPDQVIVTKISGSESGSL-SFNVSLDSLLDNHSYVNGNNQIIME 196
+ N F S PDQV V + + L +S+++LL N S + + + E
Sbjct: 159 IANRTTAALKSSIFCSYPDQVCVYHLESASDARLPKVTISIENLLVNQSLLQTSCE--SE 216
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLV- 255
+ R A P+G++++A+ E+ ++ + L + L++ + +++
Sbjct: 217 AKRAVLRHSGVTQAGP-PEGMKYAAVAEV-VNPRSSVTTCLGEGALQISSRKKQLTIIIG 274
Query: 256 ASSSFDGPFINPSD-----SKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
A++++D N + KDP S + Y L RH+ DY+KL S+
Sbjct: 275 AATNYDQKAGNAKSGWSFKNAKDPASIVDGIASAAGWKGYQRLLDRHVKDYKKLMGDFSL 334
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+L DT + DT E+ +P L LL + R+LL+SSSRP +
Sbjct: 335 ELP--------DTTDSASKDTSELIEKYSYASATGNPYLENLLLDYARHLLVSSSRPNSL 386
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NGSKT 429
ANLQG W E L+P+W + H NINL+MNYW + L E Q L++++ + G++T
Sbjct: 387 PANLQGRWTESLTPSWSADYHANINLQMNYWLADQTGLGETQHALWNYMADTWVPRGTET 446
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A++ Y ASGWV+H++ +I+ +A + WA +P AW+ H+W++++YT D +L
Sbjct: 447 ARLLYNASGWVVHNEINIFG-FTAMKEDAGWANYPAAAAWMMQHVWDNFDYTHDTAWLVS 505
Query: 490 RAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+ Y LL+G ASF L L E +DG L NP SPE P C Y +
Sbjct: 506 QGYALLKGIASFWLSSLQEDKFFNDGSLVVNPCNSPE---TGPT-TFGCTHYQQ-----L 556
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEWAQDFKDPEVH-- 603
I +VF +++A E + +++ V+ V +L RL ++ G + EW K P+ +
Sbjct: 557 IHQVFETVLAAQEYIHESDTKFVDSVASALERLDTGLHLSSWGGLKEW----KLPDSYGY 612
Query: 604 -----HRHLSHLFGLFPGHTITI----EKNPDLCKAAEKTLQKRG-----EEGPGWSITW 649
HRHLSHL G +PG++I+ +N + A ++TL RG + GW+ W
Sbjct: 613 DNMSTHRHLSHLAGWYPGYSISSFAHGYRNKTIQDAVKETLTARGMGNAADANAGWAKVW 672
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
+ A WARL+D AY ++ +++F G S + A PPFQIDANFGF AV
Sbjct: 673 RAACWARLNDSSMAYDELRYAI-------DENFVGNGLSMYWGASPPFQIDANFGFAGAV 725
Query: 710 AEMLV--------QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW-KDGDLHEV 760
MLV + L PA+P W G KGL+ RGG V W K G ++ V
Sbjct: 726 LSMLVVDLPTPRSDPGQRTVVLGPAIP-SAWGGGRAKGLRLRGGAKVDFGWDKRGVVNWV 784
Query: 761 GI 762
I
Sbjct: 785 NI 786
>gi|152968134|ref|YP_001363918.1| twin-arginine translocation pathway signal [Kineococcus
radiotolerans SRS30216]
gi|151362651|gb|ABS05654.1| twin-arginine translocation pathway signal [Kineococcus
radiotolerans SRS30216]
Length = 808
Score = 318 bits (815), Expect = 8e-84, Method: Compositional matrix adjust.
Identities = 254/773 (32%), Positives = 357/773 (46%), Gaps = 65/773 (8%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
++ + GPA + +A+P+G+GRLGA+ WG E L LN+D W+G G +P P
Sbjct: 5 RLRYEGPATTWLEALPVGDGRLGAVCWGLADGERLSLNDDRAWSGPVGGPHHPTPPDHPD 64
Query: 74 DV---RSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDL 130
V R+ V +G A + H + +GD+ + A R LDL
Sbjct: 65 RVEAARAAVLAGDPTRAGELLEPVVHH-TQAFLPVGDLLVTT----AAAAAPGVVRGLDL 119
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHS---YV 187
TATA + V T H +S V+V +++ +G+ ++L S L V
Sbjct: 120 GTATAWSQRPVPG--GTVRHETSVGAGVLVHRVTAPGAGT-GLRLALASPLRPAGSTLRV 176
Query: 188 NGNNQIIMEGRC----PGKRIPPKANANDDP-----KGIQFSAILEIKISDDRGTISALE 238
+ +E R P P + ++DP G + + GT A
Sbjct: 177 PDGDPGGLEWRTLLDLPEDVHPWHPDQHEDPVRWAAPGTPSRQVAVVVRVRCDGTPRAAP 236
Query: 239 DKKLKVEGSDWAVL----LLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLY 294
D VEG W + ++VA + D P +P+ P E+ +A + +
Sbjct: 237 DPAGPVEGPAWDGVREAHVVVAVETPD-PATDPTGR---PDVEAAAARAAAAVADPGAVR 292
Query: 295 TRHLDDYQKLFHRVSIQLS-RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
RH ++ +LF R + L R P TD V + DED + V
Sbjct: 293 ERHRREHAELFGRSDLDLGGRVPAGTTTDAL-------------VGLAEHDEDAARVLAA 339
Query: 354 FQFG--RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
RYLL++ SRPGT LQGIWNE+L P W S +N+NL M YW P L EC
Sbjct: 340 LAVAHARYLLVTGSRPGTLPLTLQGIWNEELQPPWSSNYTLNVNLPMAYWPVQPWGLPEC 399
Query: 412 QEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGK---VVWALWPMGGA 468
EPL F L+ G+ TA Y A GWV HH +D WA++ + G W+ WP GG
Sbjct: 400 AEPLLAFAERLAAAGTATAAEMYGARGWVAHHNSDGWAQTRSVGGGWNDPAWSAWPYGGV 459
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
WL +L + ++ D L +R P++EG F LD L+ DG L T PSTSPE+ ++
Sbjct: 460 WLSLNLLDALDFAADPGPLARRVLPVVEGAVRFCLDRLVVLPDGTLGTAPSTSPENHWLD 519
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAA-----EVLEKNEDALVEKVLKSLPRLRPTK 583
G V SST D+ + R + + A + + A VE L LP
Sbjct: 520 AAGNAQAVERSSTCDLELTRGLLTGWSRWAGRQTHAPVPADLRAEVEAALAGLPH---PG 576
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP 643
G ++EW + + E HRH SHL GL+P TI + AA ++L RG E
Sbjct: 577 TGARGELLEWHAELAEAEPEHRHTSHLVGLYPLGTIAAGTS--AAAAAARSLDLRGPEST 634
Query: 644 GWSITWKTALWARLHDQEHAYRMVKRLFN----LVDPEHEKHFEGGLYSNLFAAHPPFQI 699
GW++ W+TAL ARL D +V+R GGLY NLF+AHPPFQ+
Sbjct: 635 GWALAWRTALRARLRDGAAVGDLVRRCLRPATDGHGTGGGAAHRGGLYPNLFSAHPPFQV 694
Query: 700 DANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
D N GF AAVAE+LVQS + + LLPALP +W G V+GL+ R G V + W
Sbjct: 695 DGNLGFAAAVAEVLVQSGADRVDLLPALP-PQWPEGRVRGLRTRAGVEVDLTW 746
>gi|323345397|ref|ZP_08085620.1| fibronectin type III domain protein [Prevotella oralis ATCC 33269]
gi|323093511|gb|EFZ36089.1| fibronectin type III domain protein [Prevotella oralis ATCC 33269]
Length = 801
Score = 316 bits (809), Expect = 4e-83, Method: Compositional matrix adjust.
Identities = 238/783 (30%), Positives = 364/783 (46%), Gaps = 125/783 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
A+PIGNG+LGAM++GG+ + ++ NE TLWTG S + G Y
Sbjct: 49 ALPIGNGQLGAMIYGGIRQDIVQFNEKTLWTG------------------SAEERGSYQN 90
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV--GNV 144
A ++ G D + Y R LDL+ ATA +S G+
Sbjct: 91 FGALVIENIGGSYD-----------------RRGVYNYYRNLDLSNATAVASWSTADGDT 133
Query: 145 EFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
+TRE+ +SNP Q +V + S +++ L+ + +Y G EG GK
Sbjct: 134 VYTREYIASNPAQCVVIHMKASVPRAINNRFYLNDVHGRETYYQGK-----EGMFAGKLT 188
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
S +K++ GT++ D + V+ +D +++L A + ++
Sbjct: 189 -------------TVSYCARMKVAAVGGTVTTTNDG-IVVKHADEVMVILAAGTDYNAVA 234
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ S + + S ++ + LY+RH++DY+ + R +QL I TD
Sbjct: 235 PSYISHTTLLPSRIKNTVDSAVSMGWQALYSRHVEDYKAFYDRTDLQLGGVTNTIPTDKL 294
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVE-LLFQFGRYLLISSSRPGTQVANLQGIWNEDLS 383
ID ++++ D L+E L FQ+GRYLLISSSR NLQGIWN
Sbjct: 295 ----IDGY-----AENYEHDNRYRLIEQLYFQYGRYLLISSSRGIDLPNNLQGIWNNSNE 345
Query: 384 PTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI---NGSKTAQVNYL-ASGW 439
P W H +IN++MNYW + NLSE E L +++ +++ A+V +GW
Sbjct: 346 PAWQCDMHADINVQMNYWLANSTNLSEMNEKLLNYIYNMALVQPQWKSYARVRLRQQNGW 405
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
+ +I+ +A + A GAWLC HLW+HY YT+DR+FL +A P++
Sbjct: 406 ACFTENNIFGHCTAWQNNYCAA-----GAWLCAHLWQHYRYTLDREFLLHKALPVMVSQC 460
Query: 500 SFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA------IIREVFSA 553
F L+ L++ DG E SPEH P + A Y+ + A +++ +FSA
Sbjct: 461 EFWLERLVKATDGTYECPDEYSPEH---GPGTESAPGVYAIKPENATAHAQQLVKYLFSA 517
Query: 554 IISAAEVLEKNEDALVEKVLKSLPRLRPTKI---------------------AEDGSIME 592
+ A ++ N+ A V+++ + R + A D + E
Sbjct: 518 TLKAISIV-GNKAACVDRMFVKALKERLLGLDTGLHNEVYTGKWGNVYNGVTAGDSILRE 576
Query: 593 WA-QDFKD---PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSIT 648
W D+ + E HRHLSHL L+P I+ K+P A +L+ RG + GWS+
Sbjct: 577 WKYTDYANGNGKERDHRHLSHLMELYPLDGIS-PKSPYFLSAV-NSLRLRGIQSQGWSMG 634
Query: 649 WKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHF-------EGGLYSNLFAAHPPFQIDA 701
WK LWAR D + ++ K F +H K++ GG+Y N+ AH PFQID
Sbjct: 635 WKINLWARAFDGDVCAKIFKMAF-----QHSKYYTLNMSPEAGGIYYNMLDAHSPFQIDG 689
Query: 702 NFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVG 761
NFG A +AEML+QS + ++LLPALP WS G V+GL A +S W D L EV
Sbjct: 690 NFGVAAGMAEMLLQSCTDTIHLLPALP-KIWSEGTVRGLCAVNRFEISETWADMQLTEVT 748
Query: 762 IYS 764
+ S
Sbjct: 749 VKS 751
>gi|421287924|ref|ZP_15738687.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58771]
gi|395886487|gb|EJG97503.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA58771]
Length = 717
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 221/688 (32%), Positives = 338/688 (49%), Gaps = 70/688 (10%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + + + +L F + L D S + C I K D+
Sbjct: 87 VQRFTKEGAETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E N+D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EANVDAFTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLN 307
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 308 VNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423
Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 474
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEW----AQDFKDPEV--HHRHLSHLFGLFPG 616
+ED L E KS L P +I + G I EW Q F++ +V HRH SHL GL+PG
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPG 533
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
+ + K + +AA +L RG+ G GWS K LWARL D A++++
Sbjct: 534 NLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-------- 584
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ + NL+ +HPPFQID NFG T+ +AEML+QS L L ALP D WS+G
Sbjct: 585 ---EQLKTSTLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGS 640
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYS 764
V GL ARG VS+ W+D L ++ I S
Sbjct: 641 VSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|336399821|ref|ZP_08580621.1| Alpha-L-fucosidase [Prevotella multisaccharivorax DSM 17128]
gi|336069557|gb|EGN58191.1| Alpha-L-fucosidase [Prevotella multisaccharivorax DSM 17128]
Length = 1111
Score = 315 bits (808), Expect = 5e-83, Method: Compositional matrix adjust.
Identities = 223/799 (27%), Positives = 369/799 (46%), Gaps = 108/799 (13%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+++ S + N + + PA+++ T +PIG+G+ GA + G + + ++ N+ TLW+G
Sbjct: 334 VISIASYTPKNKYTLWYTQPAENWMTSCLPIGDGQFGATLMGQIAVDDIQFNDKTLWSGK 393
Query: 60 PGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKY 119
G T+ D +G Y G++ + H
Sbjct: 394 LGARTSSDN--------------------------YG----FYLNFGNLYIMSKGMH--- 420
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-- 177
+ Y R LD+N A A V ++ V++ R +F+SNPD IV + S++G ++ + L
Sbjct: 421 SATNYVRYLDINDAIAGVNFTSDGVDYQRSYFASNPDSCIVIRYKASQNGHINAVLRLKN 480
Query: 178 ----DSL--LDN--HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
DS +DN + ++ N I +G G + P+ S + ++
Sbjct: 481 QNGKDSCYNIDNSQQATISFNGTIARQGD-SGVTVEPE------------SYVCSARVVI 527
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLS 289
D G++ ++V G++ ++ L + +D + + +Q +
Sbjct: 528 DGGSLKKNSAGLIEVIGANSMIIYLRGLTDYDPDAPQYVSGAALLPTRVAAIVQKAQKKG 587
Query: 290 YSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL 349
Y L H DY++ F R + LS + +I P+ + +++ D +L
Sbjct: 588 YETLLAAHKADYKQWFDRCQLTLSNAKNNI-------------PTPTLIANYKNDPKANL 634
Query: 350 V--ELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCN 407
EL F +GRYLLISSSR + ANLQGIWN + +P W + H NIN++MNYW + P N
Sbjct: 635 FLEELYFSYGRYLLISSSRGVSLPANLQGIWNNNNTPAWHADIHSNINVQMNYWPAEPTN 694
Query: 408 LSECQEPLFDFLTYLSINGSKTAQ----VNYLASGWVIHHKTDIWAKSSADRGKVVWALW 463
LSE P +++ + Q + + +GW + + +I+ G +
Sbjct: 695 LSELHMPFLNYIYREACVKPTWRQYAKDMGGVNAGWTLPTENNIYGS-----GTTFAPTY 749
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
+ AW C HLW+HY YT+D+D+L ++A+P ++ C + L++ +DG E SPE
Sbjct: 750 TIANAWYCQHLWQHYQYTLDKDYLRRQAFPAMKSCVEYWFQKLVKANDGTYECPDEWSPE 809
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKN------EDALVEKVLKSLP 577
H ++ ++ +F+ A VL K+ + L ++K
Sbjct: 810 H---------GPTENATAHSQQLVWNLFNNTRKAIAVLGKSVASKEFRNKLNNYLVKVDD 860
Query: 578 RLRPTKIAEDGS--IMEW--AQDFKDPEV-------HHRHLSHLFGLFPGHTITIEKNPD 626
K DG + EW F +P+ +HRH+SHL GL+P I + N
Sbjct: 861 GCHTEKNPLDGKTYLREWKYTSQFNNPQKIGIYEYKNHRHISHLMGLYPCDEIGPDINRA 920
Query: 627 LCKAAEKTLQKRGEE-GPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGG 685
+ AA +L RG++ G GWS+ K L AR + +H + ++KR GG
Sbjct: 921 IFDAARTSLIARGDDHGTGWSLGHKMNLNARAYLGDHCHNLIKRALQQTWTTSVNEAAGG 980
Query: 686 LYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG 745
+Y NL+ AH P+QID NFGFTA +AEML+QS + L +LPALP + W G V GL+A G
Sbjct: 981 IYENLWDAHAPYQIDGNFGFTAGIAEMLLQSRFDKLEILPALPTEYWLKGSVSGLRAVGN 1040
Query: 746 ETVSICWKDGDLHEVGIYS 764
TV I W + ++ I S
Sbjct: 1041 FTVDITWDNAIAQKITIVS 1059
>gi|336429327|ref|ZP_08609294.1| hypothetical protein HMPREF0994_05300 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336002938|gb|EGN33035.1| hypothetical protein HMPREF0994_05300 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 779
Score = 314 bits (804), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 250/813 (30%), Positives = 374/813 (46%), Gaps = 93/813 (11%)
Query: 21 AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPD-APKALSDVRSLV 79
A+ + +A +GNGR+GA V+GGV ET+ L+E T ++G N A A ++RSL+
Sbjct: 11 AERWQEAYLLGNGRMGAAVYGGVFEETVDLSEITFFSGSSSSENNQKGAALAFQEMRSLL 70
Query: 80 DSGQYAEATAASVKLFGHPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
G+ A + G + L G +++ ++S K + Y R LDL T +
Sbjct: 71 QEGKEEAAMERASDFIGIRENYGTNLPVGRLKIMLENSGEK--PDGYVRRLDLQTGLFSM 128
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+Y R F S PDQV +I + SLS + ++ G N
Sbjct: 129 EYRQEGSTVVRNAFVSWPDQVFCYEIKTGKPESLSGRIWVE---------GGENPFSART 179
Query: 198 RCPGKRIPPKANA---NDDPKGIQFSAILEI-----KISDDRGTISALEDKKLKVEGSDW 249
R +A +D G+ S +++ KIS GTI+ +L +
Sbjct: 180 EEEEYRFQVQAREKLHSDGSCGVDLSGMVKAWCEDGKISCSGGTIAFTGCSRLLIG---- 235
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
L + D K +S+ Y + +RH++D + RVS
Sbjct: 236 --LWMETDYEEKAGLTACKDKKAGCAKQSLPK-------EYDRIRSRHMEDVKSRMERVS 286
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERV-KSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ L + +E+ VP+ ERV S Q EDP L L FQFGRYLL SSR
Sbjct: 287 LCLGTKEE--------QEDAAAVPTDERVLASRQGKEDPLLFALAFQFGRYLLQCSSRED 338
Query: 369 TQV-ANLQGIWNEDLSPT--WDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI- 424
+ + A+LQG+WN++++ W H++IN +MNYW S P NL EC+ PLF ++ L I
Sbjct: 339 SPLPAHLQGVWNDNVACRIGWTCDMHLDINTQMNYWLSGPGNLPECRRPLFAWMEKLLIP 398
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+G +A+ +Y GW ++ W S+ + + + P GG W + EHY YT D
Sbjct: 399 SGRISARESYGRKGWSADLVSNAWGFSAPYWSRTI-SPCPTGGIWQASDYMEHYRYTRDE 457
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
F + AYP++ F ++ EG DG + PS SPE+ +I +G+ S T ++
Sbjct: 458 AFAREHAYPVIREAVEFFTGYVFEGEDGCYLSGPSISPENAYIK-EGEKRFFSNGCTYEI 516
Query: 545 AIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
+IRE+ + A L + + ALV + K LPRL P +I DG++ EWA +
Sbjct: 517 LMIRELLEEFLELASFLPDLAEKDRALVMQAEKILPRLLPYRILPDGTLAEWAHSHPAAD 576
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR-----GEEGPGWSITWKTALWAR 656
HRH SHL G+FP IT E P+L +AA K+++ R E GW+ + AR
Sbjct: 577 SQHRHTSHLLGVFPYAQITPEGTPELAEAAWKSMESRLCPEDNWEDTGWARSLLLLYSAR 636
Query: 657 LHDQE----HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP----------FQIDAN 702
L +E H M K L + NL HPP +++D N
Sbjct: 637 LRKKEAVSHHLRSMQKEL---------------THPNLLVMHPPTRGAGSFMEVYELDGN 681
Query: 703 FGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
G + +AEML+QS +L LLP LP ++W G V GL ARG V I W++G L E
Sbjct: 682 TGLSMGIAEMLLQSHSGELRLLPCLP-EEWDCGSVDGLLARGNVRVGIRWQEGRLEEARF 740
Query: 763 YSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYT 795
+ + +L YRG ++L AG T
Sbjct: 741 TAA-----REMLISLEYRGIHRPLSLKAGVTET 768
>gi|421230262|ref|ZP_15686926.1| fibronectin type III domain protein [Streptococcus pneumoniae
2061376]
gi|395593788|gb|EJG54030.1| fibronectin type III domain protein [Streptococcus pneumoniae
2061376]
Length = 717
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 221/699 (31%), Positives = 344/699 (49%), Gaps = 92/699 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALVTTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + + + +L F + L L + Y ++ I+M+GR
Sbjct: 87 VQRFTKEGAETLDFTIELSLTCDLASDEKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDPEV--HHR 605
I AA+ L +ED L E KS L P +I + G I EW ++ F++ +V HR
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHR 522
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H SHL GL+PG+ + K + +AA +L RG+ G GWS K LWARL D A++
Sbjct: 523 HASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHK 581
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ + + NL+ +HPPFQID NFG T+ +AEML+QS L L
Sbjct: 582 LLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLA 630
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 631 ALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|367026916|ref|XP_003662742.1| glycoside hydrolase family 95 protein [Myceliophthora thermophila
ATCC 42464]
gi|347010011|gb|AEO57497.1| glycoside hydrolase family 95 protein [Myceliophthora thermophila
ATCC 42464]
Length = 834
Score = 313 bits (803), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 245/824 (29%), Positives = 374/824 (45%), Gaps = 118/824 (14%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
F +PIGNGRL A V+G +E L LNE+++W+G D NP++ A+ +R ++ SG
Sbjct: 36 FKSTLPIGNGRLAAAVYG-TGTEKLVLNENSVWSGPWLDRANPNSKDAVPKIREMLISGN 94
Query: 84 YAEATAASV-KLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVG 142
A A++ + G+P + L D H + Y R LD TA V Y+
Sbjct: 95 ITGAGQAALDNMAGNPISPRAYHPLVNLGIDFGHGSGISD-YTRWLDTFQGTAAVNYTYH 153
Query: 143 NVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGK 202
++RE+ +S P V+ ++S + G L+ N SL +V + +G G
Sbjct: 154 GTSYSREYVASYPHGVLAFRLSADQPGKLNANFSLS----RSQWVLSRRASVSDGEG-GH 208
Query: 203 RIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG 262
+ A++ I F + E +I + G ++ + + + G+D + A +S+
Sbjct: 209 TVALSADSGQPSDAITFWS--EARIVNSGGNATS-DGTTVFITGADTVDVFFDAETSYRH 265
Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
P +D+ + E L + Y + ++D+ L RV + L S
Sbjct: 266 P---DADAAQ---RELKRKLDAAVAAGYPAVRDGAVEDFSSLMGRVRLDLGSS------G 313
Query: 323 TCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISSSR---PGTQVANLQGI 377
+ E+ + T R+ +F+ D DP L+ L+F FGR+LL +SSR P + ANLQGI
Sbjct: 314 SAGEQPVPT-----RLSNFRQDPDADPELMTLVFNFGRHLLAASSRDTGPRSLPANLQGI 368
Query: 378 WNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY-LA 436
WN+D P W S +NIN+EMNYW +L NL+E +PLFD + G A+ Y
Sbjct: 369 WNDDYDPPWQSKYTININIEMNYWPALVTNLAETHKPLFDLIDMAIPRGRDVARTMYGCE 428
Query: 437 SGWVIHHKTDIWAKSS-ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLL 495
G+V+HH TD+W ++ DRG + +WPMG AWL TH EHY +T +R FL + A+P+L
Sbjct: 429 RGFVLHHNTDLWGDAAPVDRG-TPYTVWPMGAAWLATHAMEHYRFTRNRTFLAEVAWPVL 487
Query: 496 EGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLAC-----VSYSSTMDMAIIREV 550
A F +L E D Y T PS SPEH FI P G + S MD ++ ++
Sbjct: 488 RETARFYHCYLFE-WDSYWTTGPSLSPEHSFIVPPGMTTAGAAEGLDISPEMDNQLLHQL 546
Query: 551 FSAIISAAEVL-----------EKNEDALVEKVLKSLPRLRPTKI-AEDGSIMEW-AQDF 597
F+ + A L + + + LPR+RP + G I EW + ++
Sbjct: 547 FTDVTEACARLGLFSSSSSDDDDDDAETCTTTAETYLPRIRPPAVHPTTGRIQEWRSPEY 606
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL-------------------QKR 638
D E HRH S L+GL+PG + + + ++ +
Sbjct: 607 ADTEPGHRHFSPLWGLYPGRQLLLTRAGSGSGSSASGSDSASANLTTAAAAALLDHRMES 666
Query: 639 GEEGPGWSITWKTALWARLHDQ-EHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPF 697
G GWS W AL+AR+ + A+R ++L G L+++ F
Sbjct: 667 GSGSTGWSRAWAAALYARVPGRGRDAWRHARQLV-------ATFLLGNLWNSDSGGDSVF 719
Query: 698 QIDANFGFTAAVAEMLVQS-----------------------------------TLNDLY 722
QID NFGF AA+AEML+QS + ++
Sbjct: 720 QIDGNFGFVAALAEMLLQSHETAPASMRGSPGNNNRRTGVRQGEQQQQEEEEEKEVFVVH 779
Query: 723 LLPALPWDKWSSGCVKGLKARGGETV-SICWKDGDLHEVGIYSN 765
LLPALP D+ G V GL ARGG V + W G + +
Sbjct: 780 LLPALPGDEVPDGRVDGLVARGGFVVRELVWAGGKFARASVLAQ 823
>gi|111658272|ref|ZP_01408963.1| hypothetical protein SpneT_02000541 [Streptococcus pneumoniae
TIGR4]
Length = 576
Score = 313 bits (802), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 201/571 (35%), Positives = 285/571 (49%), Gaps = 73/571 (12%)
Query: 215 KGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDP 274
KG+QF + K++D G +S L + + + + L L + + + G
Sbjct: 9 KGVQFKVVCHSKVTD--GEVSVL-GETIVIRNATEVFLYLKSMTDYWGNI---------- 55
Query: 275 TSESMSALQS-IRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVP 333
+S+LQ ++ Y H+ YQ+ F+RV +L S + +I T
Sbjct: 56 ---DISSLQGEFSSIDYFTEKDEHVKKYQEQFNRVDFKLDYSKGCL--------SIPTNL 104
Query: 334 SAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVN 393
E K + L LLF +GRYLLISSS+P ANLQGIW ++L+P W S +N
Sbjct: 105 LLENTKKYSN----YLTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSKYTIN 160
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSA 453
IN +MNYW PC+L E + PLFD L + G TA+ Y A G+ HH TD + ++
Sbjct: 161 INTQMNYWMVGPCDLPEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFGDTAP 220
Query: 454 DRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGY 513
+ A+W + WLCTH+WEHY Y D L + + +++ F D+L E DGY
Sbjct: 221 QSHAMGAAIWVLTIPWLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFE-VDGY 278
Query: 514 LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEK 571
L T PS SPE+++ +G SST+D I+R + I A+ L N D + V++
Sbjct: 279 LMTGPSVSPENKYRLKNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFISRVKE 338
Query: 572 VLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAA 631
+ K LP+ TKI +G I EW +D+++ E HRH+S LFGL+P + I I K P+L +AA
Sbjct: 339 LKKKLPK---TKIGSNGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPELAEAA 395
Query: 632 EKTLQKR-------------------------GEEGPGWSITWKTALWARLHDQEHAYRM 666
+ T+ +R GWS W +ARL+ E AY
Sbjct: 396 KITINRRLSNANFLSSQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEPAYNQ 455
Query: 667 VKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPA 726
+ L N NLF HPPFQID N G + + E+LVQS N L L+PA
Sbjct: 456 INGLLN-----------NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLSLIPA 504
Query: 727 LPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LP WS G VKG + RGG VS WK+GD+
Sbjct: 505 LP-SAWSEGEVKGFRVRGGYKVSFAWKNGDI 534
>gi|418157949|ref|ZP_12794665.1| hypothetical protein SPAR41_1732 [Streptococcus pneumoniae GA16833]
gi|353824397|gb|EHE04571.1| hypothetical protein SPAR41_1732 [Streptococcus pneumoniae GA16833]
Length = 692
Score = 313 bits (802), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 223/699 (31%), Positives = 343/699 (49%), Gaps = 92/699 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDPEV--HHR 605
I AA+ L +ED L E KS L P +I + G I EW ++ F++ +V HR
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHR 522
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H SHL GL+PG+ + K + +AA +L RG+ G GWS K LWARL D A++
Sbjct: 523 HASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHK 581
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ + + NL+ +HPPFQID NFG T+ +AEML+QS L L
Sbjct: 582 LLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLA 630
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 631 ALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|302345048|ref|YP_003813401.1| hypothetical protein HMPREF0659_A5282 [Prevotella melaninogenica
ATCC 25845]
gi|302149037|gb|ADK95299.1| conserved hypothetical protein [Prevotella melaninogenica ATCC
25845]
Length = 775
Score = 313 bits (801), Expect = 3e-82, Method: Compositional matrix adjust.
Identities = 230/780 (29%), Positives = 366/780 (46%), Gaps = 110/780 (14%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
+ PIGNGRL A V+ G + LNE + W+G + + + D G
Sbjct: 48 AEGYPIGNGRLAASVFHGDERDRYSLNEVSFWSG----------GRNTGTINNKGDKGYD 97
Query: 85 AEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNV 144
+ + K FG YQ +GD+ ++++ + + R++ L+
Sbjct: 98 VSGSDVTDKGFGS----YQPVGDLIVDYN----ALVQSDFVRQITLDKGLVESSALRQGN 149
Query: 145 EFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRI 204
F S +QV+V + + L S GN + + R
Sbjct: 150 MIRSLAFCSYSNQVMVIRYESQKRRKLDLRFSFAIQRKEDVISVGNKGLSLYSRLK---- 205
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
G++ E+K+ + G + A + + L+++ +D LL+ +++++
Sbjct: 206 ----------NGVECQT--EVKVLHEGGELVA-DKEGLQLKNADNCTLLVFIATNYE--- 249
Query: 265 INPSDSKKDPTSESMSALQSIRN--LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
+N + + +E Q + L Y+ L HL DYQ L+ R + ++ +
Sbjct: 250 MNAAQKFRGIPAEERLKQQMAKTAALPYAKLLKNHLSDYQSLYQRQELNIAHTA------ 303
Query: 323 TCSEENIDTVPSAERVKSF-QTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNED 381
+++DT+P+A R++++ ++ D L EL+F+FGRYL+I +SRPG+ A LQGIWN
Sbjct: 304 ----DSLDTLPTARRLEAYRKSHTDNGLEELVFRFGRYLMIQTSRPGSLPAGLQGIWNGM 359
Query: 382 LSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT------------YLSINGSKT 429
++ W + H NIN +M YW NLSEC P+ D+L YL G T
Sbjct: 360 VAAPWGNDYHSNINFQMVYWLPEVGNLSECHLPMLDYLKAMRMPFQENTREYLKAIGEST 419
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
++ GW+++ S G W + G AW HLWEHY +T D +L +
Sbjct: 420 DEIEN-NEGWIVY-------TSHNPFGAGGWQVNLPGAAWYGLHLWEHYAFTNDTIYLRQ 471
Query: 490 RAYPLLEGCASFL---LDWLIEGHDG----YLETNPSTSPEHEFIAPDGKLACVSYSS-- 540
AYP+++ + L L E +G YL + S PE + + + +S
Sbjct: 472 HAYPMMKELCHYWQKHLKALGEAGEGFCSNYLPVDISKYPELKRVKAGTLVVPAGWSPEH 531
Query: 541 --------TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
D I+ E+F I AA +L K ++ V+ + + RL +I + G++ME
Sbjct: 532 GPRGEDGVAHDQEIVAELFQNTIKAAHIL-KTDELWVKGLQEMAARLYSPQIGKKGNLME 590
Query: 593 WAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL---QKRGEEGPGWSITW 649
W D +DPE HRH SHLF +FPG TI+I K P L +AA K+L + G+ W+ TW
Sbjct: 591 WMVD-RDPETDHRHTSHLFAVFPGSTISISKTPALAEAARKSLMYCKTTGDSRRSWAWTW 649
Query: 650 KTALWARLHDQEHAYRMVKRLF--NLVDPEHEKHFEGGLYSNLFAAHP-PFQIDANFGFT 706
++ LWARLHD E A+ M+K L N++D NLF +H P QID N+G
Sbjct: 650 RSLLWARLHDGEQAHNMIKGLISHNMLD-------------NLFTSHKIPLQIDGNYGIA 696
Query: 707 AAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNY 766
AA+ EML+QS + + LLPA P +W G V+GLKARG V W++ + +YS+Y
Sbjct: 697 AAMIEMLIQSHSDVIELLPA-PCQQWKDGNVRGLKARGNIEVDFSWENNRVTSWKLYSSY 755
>gi|421299116|ref|ZP_15749803.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60080]
gi|395900587|gb|EJH11525.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA60080]
Length = 717
Score = 312 bits (800), Expect = 4e-82, Method: Compositional matrix adjust.
Identities = 225/699 (32%), Positives = 341/699 (48%), Gaps = 92/699 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L E ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVCFWNAFLHKEQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW----AQDFKDPEV--HHR 605
I AA+ L +ED L E KS L P +I + G I EW Q F++ +V HR
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHR 522
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H SHL GL+PG+ + K + +AA +L RG+ G GWS K LWARL D A++
Sbjct: 523 HASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHK 581
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ + NL+ +HPPFQID NFG T+ +AEML+QS L L
Sbjct: 582 LLAEQLKI-----------STLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLA 630
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 631 ALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|417677381|ref|ZP_12326788.1| hypothetical protein SPAR148_1583 [Streptococcus pneumoniae
GA17545]
gi|418226036|ref|ZP_12852664.1| hypothetical protein SPAR141_1569 [Streptococcus pneumoniae NP112]
gi|419467267|ref|ZP_14007148.1| BH0842-like protein [Streptococcus pneumoniae GA05248]
gi|332072822|gb|EGI83303.1| hypothetical protein SPAR148_1583 [Streptococcus pneumoniae
GA17545]
gi|353881233|gb|EHE61047.1| hypothetical protein SPAR141_1569 [Streptococcus pneumoniae NP112]
gi|379543014|gb|EHZ08166.1| BH0842-like protein [Streptococcus pneumoniae GA05248]
Length = 692
Score = 312 bits (800), Expect = 5e-82, Method: Compositional matrix adjust.
Identities = 223/699 (31%), Positives = 342/699 (48%), Gaps = 92/699 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVCFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDPEV--HHR 605
I AA+ L +ED L E KS L P +I + G I EW ++ F++ +V HR
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHR 522
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H SHL GL+PG+ + K + +AA +L RG+ G GWS K LWARL D A++
Sbjct: 523 HASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHK 581
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ + NL+ +HPPFQID NFG T+ +AEML+QS L L
Sbjct: 582 LLAEQLKI-----------STLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLA 630
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 631 ALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|418112994|ref|ZP_12749994.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41538]
gi|418153393|ref|ZP_12790131.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16121]
gi|418155639|ref|ZP_12792366.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16242]
gi|419513045|ref|ZP_14052677.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA05578]
gi|419517252|ref|ZP_14056868.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02506]
gi|421283791|ref|ZP_15734577.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA04216]
gi|353783356|gb|EHD63785.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41538]
gi|353816944|gb|EHD97152.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16121]
gi|353819888|gb|EHE00077.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA16242]
gi|379634210|gb|EHZ98775.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA05578]
gi|379639325|gb|EIA03869.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA02506]
gi|395880477|gb|EJG91529.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA04216]
Length = 717
Score = 312 bits (799), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 224/699 (32%), Positives = 341/699 (48%), Gaps = 92/699 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVCFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW----AQDFKDPEV--HHR 605
I AA+ L +ED L E KS L P +I + G I EW Q F++ +V HR
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHR 522
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H SHL GL+PG+ + K + +AA +L RG+ G GWS K LWARL D A++
Sbjct: 523 HASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHK 581
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ + NL+ +HPPFQID NFG T+ +AEML+QS L L
Sbjct: 582 LLAEQLKI-----------STLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLA 630
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 631 ALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|418108082|ref|ZP_12745119.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41410]
gi|419423496|ref|ZP_13963709.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43264]
gi|353778359|gb|EHD58827.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA41410]
gi|379586068|gb|EHZ50922.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA43264]
Length = 717
Score = 312 bits (799), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 220/688 (31%), Positives = 338/688 (49%), Gaps = 70/688 (10%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + +L F + L D S + C I K D+
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E ++D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLN 307
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 308 VNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423
Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 474
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPG 616
+ED L E KS L P +I + G I EW ++ F++ +V HRH SHL GL+PG
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPG 533
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
+ + K + +AA +L RG+ G GWS K LWARL D A++++
Sbjct: 534 NLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-------- 584
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ + NL+ +HPPFQID NFG T+ +AEML+QS L L ALP D WS+G
Sbjct: 585 ---EQLKTSTLQNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGS 640
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYS 764
V GL ARG VS+ W+D L ++ I S
Sbjct: 641 VSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|417694531|ref|ZP_12343718.1| hypothetical protein SPAR120_1586 [Streptococcus pneumoniae
GA47901]
gi|418110621|ref|ZP_12747640.1| hypothetical protein SPAR113_1694 [Streptococcus pneumoniae
GA49447]
gi|332201080|gb|EGJ15151.1| hypothetical protein SPAR120_1586 [Streptococcus pneumoniae
GA47901]
gi|353781242|gb|EHD61687.1| hypothetical protein SPAR113_1694 [Streptococcus pneumoniae
GA49447]
Length = 692
Score = 311 bits (798), Expect = 6e-82, Method: Compositional matrix adjust.
Identities = 220/688 (31%), Positives = 338/688 (49%), Gaps = 70/688 (10%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + +L F + L D S + C I K D+
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E ++D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVDNPPWNSDYHLN 307
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 308 VNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423
Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 474
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPG 616
+ED L E KS L P +I + G I EW ++ F++ +V HRH SHL GL+PG
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPG 533
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
+ + K + +AA +L RG+ G GWS K LWARL D A++++
Sbjct: 534 NLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHKLLA-------- 584
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ + NL+ +HPPFQID NFG T+ +AEML+QS L L ALP D WS+G
Sbjct: 585 ---EQLKTSTLQNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGS 640
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYS 764
V GL ARG VS+ W+D L ++ I S
Sbjct: 641 VSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|429725254|ref|ZP_19260100.1| hypothetical protein HMPREF9999_00368 [Prevotella sp. oral taxon
473 str. F0040]
gi|429150389|gb|EKX93300.1| hypothetical protein HMPREF9999_00368 [Prevotella sp. oral taxon
473 str. F0040]
Length = 1038
Score = 311 bits (798), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 258/803 (32%), Positives = 380/803 (47%), Gaps = 121/803 (15%)
Query: 11 NPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT- 64
NPL + + PA + ++PIGNG+LGA ++GGV ++ ++ NE TLW G P D
Sbjct: 201 NPLTLWYPSPANAGPNPWMEYSLPIGNGQLGACIFGGVKTDEIQFNEKTLWWGTPKDMQR 260
Query: 65 -NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEET 123
N D P V FG Y G + ++ +++L ++
Sbjct: 261 QNGDGP----------------------VSGFG----CYLNFGGLFVQNLNANLSQVKD- 293
Query: 124 YRRELDLNTATARVKYS-VGNVEFTREHFSSNPDQVIVT--KISGSESGSLSFN-VSLDS 179
Y R LD+ TA A VK++ ++TR + SS PD VI + +G L F +S D+
Sbjct: 294 YVRYLDIQTAVAGVKFTDEAGTQYTRRYLSSQPDGVIAALYEANGKNKLDLQFTLISGDT 353
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
L + + G+ P I A P G GT++A D
Sbjct: 354 LKTKKTEYTADGSGWFAGKLP--TIFHNARFKVVPVG---------------GTLTATAD 396
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSAL-QSIRNLSYSDLYTRHL 298
+ V+G++ +++L +SF + D + ++AL + S+ + ++
Sbjct: 397 G-IVVKGAEKVMVILAGGTSFAPTLPERTKGTADDLNARITALVDNAAKKSFEAIEAANI 455
Query: 299 DDYQKLFHRVSIQL-----SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELL 353
D+Q RV+ L R+ KD+V + N + T + L +L
Sbjct: 456 ADHQSYMSRVAFHLEGAASQRNTKDLVDYYSAAPN-----------NRNTADGLFLEQLY 504
Query: 354 FQFGRYLLISSSRPGTQVAN-LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
F FGRYL ISSSR V N LQGIWN W+S H NIN++MNYW + P NLS+C
Sbjct: 505 FNFGRYLSISSSRGSMPVPNNLQGIWNNRHDAPWNSDVHNNINVQMNYWPAEPTNLSDCH 564
Query: 413 EPLFDFLTYLSINGSKTAQVNYLA-----------SGWVIHHKTDIWAKSSADRGKVVWA 461
P FL Y+ IN S++ A GW + +++I+ G W+
Sbjct: 565 MP---FLNYI-INNSQSEGWQRAAREFNKINGKSNKGWTVFTESNIFG------GMSTWS 614
Query: 462 L-WPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPST 520
+ + AWL HLW+HY YT+D+DFL +RA+P + G A F + L + +DG E
Sbjct: 615 SNYCVANAWLVYHLWQHYRYTLDQDFL-RRAWPAIWGSAEFWIHRLKKANDGTYEAPNEW 673
Query: 521 SPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNED-ALVEKVLKSLPR- 578
SPE+ DG +A T ++ I +V I+ A V +ED L+ L L +
Sbjct: 674 SPEYG-PKQDG-VAHAQQLITENLQIAHDVVE-ILGAKNVGISDEDLKLLNDRLTHLDKG 730
Query: 579 --------------LRPTKIAEDGSIM-EWA-QDFK-DPEVHHRHLSHLFGLFPGHTITI 621
R I++D ++ EW D++ +V+HRHLSHL L+P +
Sbjct: 731 LRIEKYRNDWAQREARERGISKDTPLLKEWKYSDYRAGGDVNHRHLSHLMCLYPFSQVQ- 789
Query: 622 EKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH 681
E + +AA+ +L RG++ GWS+ WKT LWAR D HA R++ H
Sbjct: 790 EGDQGFYEAAKNSLALRGDDATGWSMGWKTNLWARAKDGNHARRILSNALKHAQATHVVM 849
Query: 682 FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLK 741
GG+Y NL+ AHP FQID NFG TA VAEML+QS + L +LPALP D W++G + GLK
Sbjct: 850 SGGGVYYNLWDAHPSFQIDGNFGVTAGVAEMLLQSQNDVLEILPALPSD-WTAGSITGLK 908
Query: 742 ARGGETVSICWKDGDLHEVGIYS 764
A G TV + W G V I S
Sbjct: 909 AVGNFTVDMTWNAGKPTMVNITS 931
>gi|294806382|ref|ZP_06765225.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
gi|294446397|gb|EFG15021.1| conserved hypothetical protein [Bacteroides xylanisolvens SD CC 1b]
Length = 562
Score = 311 bits (797), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 202/582 (34%), Positives = 302/582 (51%), Gaps = 57/582 (9%)
Query: 13 LKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
LK+ ++ PAK++++A+PIGN RLGAMV+GG E L+LNE+T W G P + NP+A L
Sbjct: 22 LKLWYSQPAKNWSEALPIGNSRLGAMVYGGTEREELQLNEETFWAGSPYNNNNPNAVHVL 81
Query: 73 SDVRSLVDSGQYAEATA---ASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
VR L+ G+ EA A+ H Y LG++ LEF K A++ YR +L+
Sbjct: 82 PIVRKLIFEGRNKEAQRLIDANFLTRQHGMS-YLTLGNLYLEFPGH--KDADDFYR-DLN 137
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L AT +Y V + +TR F+S D VI+ I S+ +L+FNVS + L N V
Sbjct: 138 LENATTTTRYQVNGINYTRTTFASFTDNVIIMHIKASQPNALNFNVSYNCPLKNEVNVQN 197
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+ II C GK + +G++ + E ++ I L++ G
Sbjct: 198 DKLIIT---CQGK----------EQEGMKAALRAECQVQVKTDGIIHPAGNILQINGGTE 244
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
A L + A++++ +N + D + + L+ + Y H+ Y+K F RV
Sbjct: 245 ATLYISAATNY----VNYQNVSADESRRTTDYLEEAILIPYEKALKEHIAFYKKQFDRVQ 300
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L S + + R+++F D ++ LLFQ+GRYLLISSS+PG
Sbjct: 301 LHLPSS------------EASQIETPRRIENFGQGNDMAMAALLFQYGRYLLISSSQPGG 348
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
Q ANLQGIWN WDS +NIN EMNYW + NLSE PLF L LS+ G++T
Sbjct: 349 QPANLQGIWNNSTHAPWDSKYTININTEMNYWPAEVTNLSETHSPLFSMLKDLSVTGAET 408
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWA---LWPMGGAWLCTHLWEHYNYTMDRDF 486
A+ Y GWV HH TD+W G V +A +WP GGAWL H+W+HY +T +++F
Sbjct: 409 ARTMYDCWGWVAHHNTDLWRIC----GVVDFAAAGMWPSGGAWLAQHIWQHYLFTGNKEF 464
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGY--LETNPSTSPEHEFIAPDGKLACVSYSSTMDM 544
L K YP+L+G A F +D+L+E H Y L +PS SPEH ++ TMD
Sbjct: 465 L-KEYYPILKGTAQFYMDFLVE-HPTYKWLVVSPSVSPEH---------GPITAGCTMDN 513
Query: 545 AIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE 586
I + + A+ + + + + + ++L +L P +I +
Sbjct: 514 QIAFDALHNTLLASYIAGE-APSFQDSLKQTLEKLPPMQIGK 554
>gi|418189889|ref|ZP_12826401.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47373]
gi|419493782|ref|ZP_14033507.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47210]
gi|353853616|gb|EHE33597.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47373]
gi|379592355|gb|EHZ57171.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA47210]
Length = 717
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 222/699 (31%), Positives = 341/699 (48%), Gaps = 92/699 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + D S ++++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETDGDIRVWSY----RVQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVRDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDPEV--HHR 605
I AA+ L +ED L E KS L P +I + G I EW ++ F++ +V HR
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHR 522
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H SHL GL+PG+ + K + +AA +L RG+ G GWS K LWARL D A++
Sbjct: 523 HASHLVGLYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHK 581
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ + + NL+ +HPPFQID NFG T+ +AEML+QS L L
Sbjct: 582 LLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLA 630
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 631 ALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|160884726|ref|ZP_02065729.1| hypothetical protein BACOVA_02715 [Bacteroides ovatus ATCC 8483]
gi|156109761|gb|EDO11506.1| hypothetical protein BACOVA_02715 [Bacteroides ovatus ATCC 8483]
Length = 795
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 239/794 (30%), Positives = 371/794 (46%), Gaps = 117/794 (14%)
Query: 13 LKITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
L + ++ P+ ++ D ++PIGNG+LGAM++GG+ + ++ NE T+WTG P
Sbjct: 50 LTLWYDQPSDNWMDLSLPIGNGQLGAMIFGGIGCDEIQFNEKTVWTGRP----------- 98
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLN 131
+ Y E Y+ G++ + YRR LD+
Sbjct: 99 ----NGIEKKANYGE---------------YRNFGNLYISHRGIKTDTKITDYRRWLDIR 139
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNN 191
A A + YS+ V + RE+ +S+PD +I + S G NV L L D ++ NG
Sbjct: 140 NAVAGMTYSIDGVRYDREYIASSPDGMIAVMLRAS--GKEKINVDL-LLKDGNTDYNGT- 195
Query: 192 QIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD-DRGTISALEDKKLKVEGSDWA 250
G +I K N K S + ++ + ++ D L + +D
Sbjct: 196 -------ASGTKID-KGNMTFKGKLTYLSYYCRVAVTPYGKKAKVSINDSALTITKADSL 247
Query: 251 VLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
++LL +++ N ++ + +Y+ L TR ++ LF R
Sbjct: 248 LVLLSGGTNYSTETANYRTNESVLHQRIDDIINKALAKNYTTLKTRQQKSHRMLFDRC-- 305
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSF-QTD----EDPSLVELLFQFGRYLLISSS 365
QLS +P D +T P+ + V + +TD ++ L EL F +GRYLLIS +
Sbjct: 306 QLSITPDDC----------NTKPTPQLVADYNKTDSSYLDNHFLEELYFNYGRYLLISCA 355
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL------ 419
+ +NLQGIWN S W H NIN++MNYW + NLSE L D++
Sbjct: 356 QGIALPSNLQGIWNYSNSAVWHCDIHANINVQMNYWPAEVTNLSELHNNLLDYIYNEALI 415
Query: 420 ------TYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWAL--WPMGGAWLC 471
++ S N G+ +I+ G W L + + AW C
Sbjct: 416 HTQWRDNVNTVLRSANKNENQKPGGFFCSTANNIFG------GGTEWKLQEYAVVNAWYC 469
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPD 530
H +EH+ YT D+ FL ++A P++ F + LI + +DG SPE P
Sbjct: 470 LHFYEHWLYTGDKTFLREKALPVMLSAVEFWKNRLIRDENDGKWICPREFSPEQ---GPT 526
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKN------EDALVEKVLKSLPRLRPTKI 584
GK+ + +++ +FS + A + L+K+ E ++ ++ T+I
Sbjct: 527 GKVTAHA------QQLVKSLFSNTLKACKALDKDCPLRAEELEVINDYHNNIDDGLYTEI 580
Query: 585 AE--DGSIM--EWAQDFKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
DG ++ EW +D + HRH+SHLF L+P + I N + +AA ++L+ R
Sbjct: 581 VNKADGELLLKEWKYAGQDSIGSLTHRHVSHLFALYPLNEIDKTSNDSIYQAALRSLKWR 640
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE--------GGLYSNL 690
G + GW+I+WK LWAR D +A R++K + H H++ GG+Y+NL
Sbjct: 641 GPQATGWAISWKMNLWARAQDGGYARRLLKSALH-----HSTHYQMKASTSSPGGIYNNL 695
Query: 691 FAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSI 750
F AHPPFQID NFG TA +AEML+QS ++LLPALP D W+ G VKGLKARGG +SI
Sbjct: 696 FDAHPPFQIDGNFGTTAGIAEMLMQSHAGYIHLLPALPPD-WTKGSVKGLKARGGYEISI 754
Query: 751 CWKDGDLHEVGIYS 764
WKDG + I S
Sbjct: 755 DWKDGKVTHTTIKS 768
>gi|187734699|ref|YP_001876811.1| glycoside hydrolase family protein [Akkermansia muciniphila ATCC
BAA-835]
gi|187424751|gb|ACD04030.1| glycoside hydrolase family 95 [Akkermansia muciniphila ATCC
BAA-835]
Length = 788
Score = 311 bits (796), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 230/772 (29%), Positives = 352/772 (45%), Gaps = 75/772 (9%)
Query: 12 PLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKA 71
P+++T + PA+ +T+ GNGRLG + +G P ET+ LNE +++ A +A
Sbjct: 28 PMQVTASTPARVWTEGYGTGNGRLGILSFGVFPKETVVLNEGSIFA-KKNFQMREGAAEA 86
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDDSHLKYAEETYRREL 128
L R L G+Y A K P ++ YQ G +++EF + +Y+R L
Sbjct: 87 LDKARELCKEGKYRSADQLFRKNILPPGNIAGDYQQGGRLQVEFQGLP---SPSSYQRTL 143
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
D+ A + G E T E ++ I+ + +++L+ + V
Sbjct: 144 DMRRGKATTRAQFGTGELTTEILAAPSSDCAAYHIACTMPSGCRVSLNLEHPDPSARIVA 203
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
N ++EG+ +N + IL S R + + D +V
Sbjct: 204 QPNGWVLEGQ----------GSNGGTRFENTVVILAPGASVTRKGSTIILDSAREV---- 249
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDDYQK 303
++++S S D P + P + S++A L + + L D + +
Sbjct: 250 ----MVLSSISTDYNIRKP----EAPLTHSLAAKNARILAKAQKAGWKKLAAETEDYFSR 301
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L R + L SP + T ++ ERVK Q +DP L+E LFQFGR+ I+
Sbjct: 302 LMTRCQVDLGDSPAGVSAMTTAQR-------LERVK--QGKKDPDLLEQLFQFGRFCTIA 352
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
+RPG LQG+WN +L W +NIN +MN W S L E Q DF+ L
Sbjct: 353 HTRPGQLPCGLQGLWNPELRAAWMGCYFLNINSQMNQWPSHVTGLGEFQSSYLDFVRSLR 412
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+G + A+ G+ H TD W ++ W M GAW C HL + Y +T D
Sbjct: 413 PHGEEFARF-IKRDGFCFGHYTDCWKRTYFSGNNPEWGASLMNGAWACAHLVDSYRFTGD 471
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK----LACVSYS 539
R+ L K++ P+LE A F++ W + +G + P SPE F APDG L+ VS
Sbjct: 472 REDL-KKSLPILESNARFIMSWFEDDGEGRYLSGPGVSPETGFYAPDGTGPNVLSYVSNG 530
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKD 599
++ D + RE I A L L+ K ++ L ++ I DG + EW Q F++
Sbjct: 531 TSHDQLLGREALRNYIYACGELGIRTPTLL-KAVQFLRKIPQPAIGPDGRVQEWRQPFEE 589
Query: 600 PEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR------GEEG--PGWSITWKT 651
+ HRH+SHL+GLFPG + P+ +A K+ R G G GWS W
Sbjct: 590 MQKGHRHISHLYGLFPGTEWDVLNTPEYAEAVRKSADFRRKYADMGNNGIRTGWSTAWLI 649
Query: 652 ALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
L+A L D A R++ ++ +H+ + SNLF HPPFQI+ NFGF++ VAE
Sbjct: 650 NLYAALGDGNAAE---DRMYTML-----RHY---INSNLFDLHPPFQIEGNFGFSSGVAE 698
Query: 712 MLVQSTLND-----LYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
L+QS + + L PAL D W G GL+ RGG V + W+DG +
Sbjct: 699 CLIQSRIMQDGFQVILLAPALA-DDWKKGSATGLRTRGGLKVDLSWQDGRVQ 749
>gi|325261390|ref|ZP_08128128.1| fibronectin type III domain protein [Clostridium sp. D5]
gi|324032844|gb|EGB94121.1| fibronectin type III domain protein [Clostridium sp. D5]
Length = 1783
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 232/786 (29%), Positives = 370/786 (47%), Gaps = 91/786 (11%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD----------YTNPDAPKALSDVR 76
++PIGN +GA V+GGV E ++LNE +LW+G P D N + ++
Sbjct: 73 SLPIGNSAIGASVFGGVDIERIQLNEKSLWSGGPSDSRPDYNGGNIQQNGQDGATMKQIQ 132
Query: 77 SLVDSGQYAEATAASVKLFGHPADV-------YQLLGDIELEFDDSHLKYAEETYRRELD 129
L G + A+A KL G D Y G++ L+F D E Y R+L+
Sbjct: 133 ELFKEGNNSAASALCNKLIGVSDDAGDKGYGYYLSYGNMYLDFQDGASPDNVENYSRDLN 192
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
L A + V Y + RE+F S PD V+VT+++ +E G+L F+V ++ D+
Sbjct: 193 LRNAVSSVDYDYKGTHYHREYFVSYPDNVLVTRLT-AEGGTLDFDVRVEP--DDQKGGGS 249
Query: 190 NNQIIME-GRCPGKRIPPKA---NANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE 245
NN GR + N ++FS+ K+ D G +K+ V
Sbjct: 250 NNPSAESYGRSWDTDVKDGVISINGELTDNQMKFSS--HTKVVADEGGKVKDGTEKVSVS 307
Query: 246 GSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSA-----LQSIRNLSYSDLYTRHLDD 300
G+ + + + + + + T+E +SA + Y + H D
Sbjct: 308 GAKEVTIYTSIGTDYKNEY---PEYRTGQTAEEVSARIKAYVDQAAVKGYEAVKEAHTKD 364
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTC-SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
+ +F RV + L ++ D TD+ + N ER + L +LFQ+GRY
Sbjct: 365 FDSIFGRVDLNLGQTVSDRATDSLLAAYNSGKASEGERRQ---------LEVMLFQYGRY 415
Query: 360 LLISSSR------PGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSEC 411
L I SSR P + +NLQGIW + W + H+N+NL+MNYW + N++EC
Sbjct: 416 LTIESSRETPDDDPSRETLPSNLQGIWVGANNSAWHADYHMNVNLQMNYWPTYSTNMAEC 475
Query: 412 QEPLFDFLTYLSINGSKTAQV------NYLASGWVIHHKTD--IWAKSSADRGKVVWALW 463
+PL ++ L G TA++ +G++ H + + W D W
Sbjct: 476 AQPLISYVDSLREPGRVTAKIYAGIGDGKSETGFMAHTQNNPFGWTCPGWD---FSWGWS 532
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
P W+ + W++Y++T D ++L YP++ A L++ G L ++PS SPE
Sbjct: 533 PAAVPWILQNCWDYYDFTGDTEYLRNVIYPIMREEALLYDQMLVDDGTGKLVSSPSFSPE 592
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLR-PT 582
H P + A +Y T+ I +++ I AAE+L + + VE RL+ P
Sbjct: 593 H---GP--RTAGNTYEQTL----IWQLYEDTIQAAEILGTDAEQ-VEVWKDKQSRLKGPI 642
Query: 583 KIAEDGSIMEWAQDFK----DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR 638
+I + G I EW ++ +HRHLSH+ G+FPG I+ + P+ +AA+ ++ R
Sbjct: 643 EIGDSGQIKEWYEETTVNSLGEGFNHRHLSHMLGVFPGDLISSD-TPEWYEAAKISMNNR 701
Query: 639 GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
+E GW + + WARL D AY+++ LF+ G+ +NL+ H P+Q
Sbjct: 702 TDESTGWGMGQRINTWARLGDGNRAYKLITDLFHK-----------GILTNLWDTHAPYQ 750
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLH 758
ID NFG T+ VAEML+QS + LLPALP D+W+ G V GL ARG +++ W +G +
Sbjct: 751 IDGNFGMTSGVAEMLLQSNQGYMNLLPALP-DEWADGSVNGLTARGNFVLNMSWGEGVVK 809
Query: 759 EVGIYS 764
I S
Sbjct: 810 TAEILS 815
>gi|421218284|ref|ZP_15675178.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070335]
gi|395583053|gb|EJG43502.1| alpha-fucosidase domain protein [Streptococcus pneumoniae 2070335]
Length = 692
Score = 310 bits (794), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 220/688 (31%), Positives = 336/688 (48%), Gaps = 70/688 (10%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + +L F + L D S + C I K D+
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDN-- 144
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 145 DLRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E ++D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN +P W+S H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDYPDALPANLQGVWNAVDNPPWNSDYHLN 307
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 308 VNLQMNYWPAYVTNLLETVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 366
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 367 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 423
Query: 504 DWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 424 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFYDFIQAAQELG 474
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPG 616
+ED L E KS L P +I + G I EW ++ F++ +V HRH SHL GL+PG
Sbjct: 475 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPG 533
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
+ + K + +AA L RG+ G GWS K LWARL D A++++ +
Sbjct: 534 NLFSY-KGQEYIEAARAGLNDRGDGGTGWSKANKINLWARLGDGNRAHKLLAEQLKI--- 589
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
NL+ +HPPFQID NFG T+ +AEML+QS L L ALP D WS+G
Sbjct: 590 --------STLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGS 640
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYS 764
V GL ARG VS+ W+D L ++ I S
Sbjct: 641 VSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|148984088|ref|ZP_01817383.1| hypothetical protein CGSSp3BS71_02667 [Streptococcus pneumoniae
SP3-BS71]
gi|418232655|ref|ZP_12859241.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA07228]
gi|418237110|ref|ZP_12863676.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19690]
gi|147923377|gb|EDK74490.1| hypothetical protein CGSSp3BS71_02667 [Streptococcus pneumoniae
SP3-BS71]
gi|353885968|gb|EHE65752.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA07228]
gi|353891548|gb|EHE71302.1| fibronectin type III domain protein [Streptococcus pneumoniae
GA19690]
Length = 717
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 223/699 (31%), Positives = 341/699 (48%), Gaps = 92/699 (13%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVCKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLD---SLLDNHSYVN------------GNNQIIMEGRCPGKRI 204
V + +L F + L L N Y ++ I+M+GR
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASNGKYEQEKSDYKECKLDITDSHILMKGRVKD--- 143
Query: 205 PPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPF 264
ND ++F++ L + G I D+ +++ G+ +A L L A + F
Sbjct: 144 ------ND----LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNP 189
Query: 265 INPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTC 324
+ K D + + + + + Y+ L +RH++DYQ LF RV + L
Sbjct: 190 ASNYRKKLDLEQQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL------------ 237
Query: 325 SEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDL 382
E ++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 238 -EADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNAVD 296
Query: 383 SPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA------ 436
+P W+S H+N+NL+MNYW + NL E P+ +++ L + G + A V Y
Sbjct: 297 NPPWNSDYHLNVNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKG 355
Query: 437 --SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAY 492
+GW++H + W D W P AW+ ++E Y++ D+D+L ++ Y
Sbjct: 356 EENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIY 412
Query: 493 PLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVF 551
P+L F +L + ++PS SPEH +S +T D ++I ++F
Sbjct: 413 PMLRETVRFWNAFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLF 463
Query: 552 SAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW----AQDFKDPEV--HHR 605
I AA+ L +ED L E KS L P +I + G I EW Q F++ +V HR
Sbjct: 464 HDFIQAAQELGLDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHR 522
Query: 606 HLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYR 665
H SHL L+PG+ + K + +AA +L RG+ G GWS K LWARL D A++
Sbjct: 523 HASHLVELYPGNLFSY-KGQEYIEAARASLNDRGDGGTGWSKANKINLWARLGDGNRAHK 581
Query: 666 MVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLP 725
++ + + NL+ +HPPFQID NFG T+ +AEML+QS L L
Sbjct: 582 LLA-----------EQLKTSTLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLA 630
Query: 726 ALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
ALP D WS+G V GL ARG VS+ W+D L ++ I S
Sbjct: 631 ALP-DAWSTGSVSGLMARGHFEVSMSWEDKKLLQLTILS 668
>gi|400594907|gb|EJP62734.1| alpha-fucosidase [Beauveria bassiana ARSEF 2860]
Length = 798
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 235/799 (29%), Positives = 365/799 (45%), Gaps = 86/799 (10%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
T + IGNGR+GA ++G +E + LNED++W+G + +AL +R +
Sbjct: 42 TGVLAIGNGRIGAAIFGS-GNEVITLNEDSIWSGPLQNRMPTRGLQALPKIRQQLVEDNI 100
Query: 85 AEATAASVKLFGHPAD---VYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV 141
EAT++ + VY G++ L+F Y R LD A + Y+
Sbjct: 101 TEATSSIMNDMMPSVSRERVYSYFGNLHLDFGHER---GMTNYVRWLDTRQGNAGISYTY 157
Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL---DSLLDNHSYVNGNNQIIMEGR 198
+ +TRE+ +S P ++ + + S++G+LSFN + ++L N + N ++
Sbjct: 158 NGINYTREYIASFPAGILAARFTASKAGALSFNTTFTRESNILANSASATTNGGLLTMRG 217
Query: 199 CPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
G+ + +DP I F+ + I+D+ T ++ L + G+ L +
Sbjct: 218 SSGQ------STKNDP--ILFTGKGQF-IADNAHT--SVSGSTLSITGATEVDLFFDIET 266
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
S+ +++ +E L++ Y+D+ + D L R SI +SP
Sbjct: 267 SYR------HQTQQKLEAEVDRKLKASIAKGYTDIRDGAIADATALLGRASINFGKSPNG 320
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPG----TQVAN 373
+P+ +R+K + +D L L + +GR+LL++SSR + AN
Sbjct: 321 AAN----------LPTDKRIKMARKGLDDTQLAVLAWNYGRHLLVASSRHNDADVSLPAN 370
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVN 433
L G+WN + W +N+NLEMNYW + N+ E QE +F L G + AQ
Sbjct: 371 LLGLWNNRTTSAWGGKFTINVNLEMNYWPAGQTNIIETQESMFSLLKIAKPRGEEMAQKL 430
Query: 434 YLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYP 493
Y +G V HH D+W ++ +WPMG AW H+ +HY +T D FL AYP
Sbjct: 431 YGCNGTVFHHNLDLWGDAAPSDNNTSATMWPMGAAWTVQHMMDHYRFTGDAGFLLHTAYP 490
Query: 494 LLEGCASFL----LDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMDM 544
L ASF DW G T PS SPE+ FI P G + MD
Sbjct: 491 FLTDVASFYRCYAFDW-----QGSKVTGPSVSPENSFIVPKNASVAGSRKAYDIAPEMDN 545
Query: 545 AIIREVFSAIISAAEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPE 601
++R+V +++ AA+ L + +ED V++ K LP +R I G I+EW ++K+ E
Sbjct: 546 QLMRDVMESLLEAAKALNIPQTDED--VKEATKFLPLIRRPAIGSYGQILEWRSEYKEAE 603
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP---GWSITWKTALWARLH 658
HRHLS L+GL P + N L +AA L R G GWS W +ARL
Sbjct: 604 PGHRHLSPLYGLHPSFQFSPLVNETLSRAANVLLNHRVANGSGHTGWSRAWLINQYARLF 663
Query: 659 DQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTL 718
A++ V+ F + + + G FQID NFG T+ + EM++QS
Sbjct: 664 SGAKAWKHVEAWFAKYPTSNLWNTDSG---------QGFQIDGNFGITSGITEMILQSHA 714
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
+++LPALP +G +GL ARGG V I WK+G + I L
Sbjct: 715 GIVHILPALPAAALPTGNARGLLARGGFEVDIDWKEGTFQKAAIRPQRGGR-------LQ 767
Query: 779 YR---GTSVKVNLSAGKIY 794
R GTS KVN G++Y
Sbjct: 768 LRVSDGTSFKVN---GELY 783
>gi|427386362|ref|ZP_18882559.1| hypothetical protein HMPREF9447_03592 [Bacteroides oleiciplenus YIT
12058]
gi|425726402|gb|EKU89267.1| hypothetical protein HMPREF9447_03592 [Bacteroides oleiciplenus YIT
12058]
Length = 817
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 230/775 (29%), Positives = 363/775 (46%), Gaps = 128/775 (16%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
++PIGNG +GA ++G E ++L E T+ G G Y
Sbjct: 84 SLPIGNGAMGACIFGRTDVERIQLAEKTM--GNKGAY----------------------- 118
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
S+ F + A++Y D H YA+ Y+R L LN A + V Y E+
Sbjct: 119 ----SMGGFTNFAEIYL----------DIHHNYAQ-NYKRTLRLNDAISTVSYIHEGTEY 163
Query: 147 TREHFSSNPDQVIVTKISGSESGSLSFNVS-----LDSLLDNHSYVNGNNQ-----IIME 196
RE+F+SNP VI K+ S+ G +SF V L S + + +G+ Q I +E
Sbjct: 164 NREYFASNPANVIAVKLKASQPGMISFTVRPVLPYLHSFNNEQTGRSGHVQAEKDLITLE 223
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE----DKKLKVEGSDWAVL 252
G +P + +IKI + GT+S++ + + V +D +L
Sbjct: 224 GEIQYFHLPYEG---------------QIKIINYGGTLSSVNKGDNNSFINVSKADSVIL 268
Query: 253 LLVASSSF---DGPFINPSDSK----KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ ++S+ D F+ P+ K P + ++ Y L ++H+ DYQ F
Sbjct: 269 YITVATSYELKDSVFLLPNAEKFKGNAHPHGQVSKRIREAIEKGYECLRSKHIADYQHFF 328
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISS 364
+RV +QL+ E+ ++P+ + + ++ + D L EL FQ+GRYLLISS
Sbjct: 329 NRVDLQLT-------------EHTPSIPTDKLLNQYRNGKHDTYLEELFFQYGRYLLISS 375
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF------ 418
SR G+ ANLQG+WN+ W N+N++MNYW + NL+E P D+
Sbjct: 376 SRQGSLPANLQGVWNQYEFAPWSGGYWHNVNVQMNYWPAFNTNLAELFIPYMDYNEAFRK 435
Query: 419 ------LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
+ Y++ N + +GW I + S G +
Sbjct: 436 AATGKAVDYITQNNPEALDPTVEENGWTIGTGATAFGISGPGGHSGP-----GTGGFTTK 490
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGK 532
W++Y++T D+ L+ YP L G A FL L DG L +PS SPE I G
Sbjct: 491 LFWDYYDFTRDKQLLKDHVYPALMGMAKFLSKTLKPQPDGTLLVDPSFSPEQ--IHQQGY 548
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIME 592
S D ++I E + ++ AA++L +++ ++ V + + +L +I E G I E
Sbjct: 549 YR--SKGCIFDQSMILETYRDLLIAAKILN-DKNPFLKTVKEQIGKLDAIQIGESGQIKE 605
Query: 593 WAQDFKDPEV---HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
+ ++ K E+ HRH+S L ++PG TI P+ +AA+ TLQ+RG++ GW++
Sbjct: 606 FREEKKYGEIGQYQHRHISQLCAMYPGTTINAS-TPEWLEAAKVTLQERGDKSTGWAMAH 664
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
+ LWAR + AY++ + + G NL+ +HPPFQIDANFG TA +
Sbjct: 665 RLNLWARAKNGNRAYKLYQDILTY-----------GTLENLWGSHPPFQIDANFGATAGM 713
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
AEML+QS + LPA+P D WS G GL ARG VS+ W++G + + I S
Sbjct: 714 AEMLLQSHEGYIEPLPAIP-DNWSKGSFNGLMARGNFKVSVKWENGTIQSIQILS 767
>gi|429860747|gb|ELA35469.1| glycoside hydrolase family 95 protein [Colletotrichum
gloeosporioides Nara gc5]
Length = 797
Score = 306 bits (785), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 237/770 (30%), Positives = 365/770 (47%), Gaps = 93/770 (12%)
Query: 29 PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAPK--ALSDVRSLVDS 81
P+GNG+LGA+ +G SE + LN D+LW G P +YT NP PK AL ++R+ +
Sbjct: 44 PVGNGKLGAIPFGPPGSEKVNLNIDSLWAGGPFGASNYTGGNPTEPKYEALPEIRATI-- 101
Query: 82 GQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY 139
+ T L G D ++L ++ + Y++ YRR LDL T K+
Sbjct: 102 --FENGTGDVSPLLGVGDDYGSNRVLANLTVNIQGIS-DYSD--YRRTLDLKTGVHTTKF 156
Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL---DNHSYVNGNNQIIME 196
+ F HF S PDQV V I+ SE + V ++ L D + G++ +
Sbjct: 157 TANGAAFEISHFCSYPDQVCVYHIA-SEGALPAVEVGYENQLVEQDTFNVSCGDDHVRFA 215
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G + PP+ D I A + S + T++ +D+K +++
Sbjct: 216 GLT--QLGPPEGMKFDSIARINKGAAITNCTSANFLTVTPEKDQKA-------LTIIIGG 266
Query: 257 SSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQL 312
+++D N S DP + S+ + H+ DYQKL + L
Sbjct: 267 ETNYDQKNGNAESDYSFKGGDPGPIVEKTTSDAASKSFHTILKDHIADYQKLESACELNL 326
Query: 313 SRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE--DPSLVELLFQFGRYLLISSSRPGTQ 370
DT E +T + + + + DP + LLF + RYLLI+SSR +
Sbjct: 327 P--------DTQGSEEKET---GQLISDYVYTDGGDPYVEALLFDYSRYLLITSSRANSL 375
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NGSKT 429
ANLQG W E L P W + H NIN++MNYW + L E Q L+D++ + G++T
Sbjct: 376 PANLQGRWTEQLWPAWSADYHANINIQMNYWAADQTGLGETQTALWDYMEDTWVPRGAET 435
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEK 489
A++ Y ASGWV+H++ + + ++ G WA +P AW+ H+W+++ YT D ++ +
Sbjct: 436 AKLLYNASGWVVHNEMNTFGHTAMKEGS-SWANYPAAAAWMMQHVWDNFEYTQDLEWFIR 494
Query: 490 RAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
+ YPL++G A F L L E +DG L NP SPEH P C Y +
Sbjct: 495 QGYPLIKGVAEFWLSQLQEDLYFNDGTLVVNPCNSPEH---GPT-TFGCTHYHQ-----M 545
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW--AQDFKDPEVH 603
I +VF A++ A + +E V +L RL + + E G + EW + ++ E+
Sbjct: 546 IHQVFEAVLHGATFVSTK---FIEDVPPNLNRLDKGVHVTEWGGLKEWKLSDNYGYDEMS 602
Query: 604 -HRHLSHLFGLFPGHTITI----EKNPDLCKAAEKTLQKRG-----EEGPGWSITWKTAL 653
HRHLSHL G PG++++ N + A +TL RG + GW+ W+TA
Sbjct: 603 THRHLSHLTGWHPGYSVSSFLGGYTNATIQSAVRETLISRGLGNADDANAGWAKVWRTAC 662
Query: 654 WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEML 713
WARL++ + AY ++ ++ +F +S +A PPFQIDANFG AV ML
Sbjct: 663 WARLNETDRAYEQLRYAIDV-------NFAPNGFSMYWALSPPFQIDANFGLGGAVLSML 715
Query: 714 V---------QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
V + + + L PA+P KW G VKGL+ RGG V W +
Sbjct: 716 VVDLPLPYASREDVRTVVLGPAIP-KKWGGGSVKGLRVRGGGIVDFSWDE 764
>gi|46140003|ref|XP_391692.1| hypothetical protein FG11516.1 [Gibberella zeae PH-1]
Length = 798
Score = 306 bits (784), Expect = 3e-80, Method: Compositional matrix adjust.
Identities = 240/802 (29%), Positives = 395/802 (49%), Gaps = 92/802 (11%)
Query: 1 MMNAESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV 59
+++A+ ++ P T G A++ P+GNG+LGA+ +G E + LN D+LW+G
Sbjct: 16 LVSAKELWSSKPASYTKQGSAEYLLRTGYPVGNGKLGAIHFGPPGREKINLNVDSLWSGG 75
Query: 60 PGD---YT--NPDAPK--ALSDVRSLVDSGQYAEATAASVKLFGHPADV--YQLLGDIEL 110
P + YT NP +PK L +R + + AT +L G + ++LG++ +
Sbjct: 76 PFEVDGYTGGNPSSPKFQYLPAIRDRI----FTNATGEMEELMGSGSHFGSNRVLGNLTI 131
Query: 111 EFDDSHLKYAEETYRRELDLNTATARVKYSV--GNVEFTREHFSSNPDQVIVTKISGSES 168
+FD +Y++ YRR LD+ T ++ G +F F S DQV V + + +
Sbjct: 132 QFDGLD-EYSD--YRRSLDMKTGIYETSFASKDGGSKFISSVFCSYSDQVCVYFLK-ANT 187
Query: 169 GSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP-GKRIPPKANANDDPKGIQFSAILEIKI 227
+ + +++ L Q +++ C G + P+G++++A L +
Sbjct: 188 RLPNIKIGIENKL--------VKQDLIKTTCKNGMALHTGMTQTGPPEGMKYAAALSVDR 239
Query: 228 SDDRGTISALEDKKLKVEGSDWAVLLL-VASSSFDGPFINPSDS----KKDPTSESMSAL 282
S GT++ L D ++ V+ + + + A +++D N D DP A
Sbjct: 240 S--LGTVTCLNDGQIIVKPKNKRMAIFWAAETNYDQKAGNTDDGWAFKGPDPVPRVKKAS 297
Query: 283 QSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ 342
++ Y+ L H++D++KL ++ L DT + ++++T A+ +++++
Sbjct: 298 KTAATKGYAKLRKVHVEDFKKLEEAFTLNLP--------DTQNSKDVET---ADLIQAYK 346
Query: 343 TDE--DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNY 400
D DP L +LF RYLLI+SSR + ANLQG W E L W + H NINL+MNY
Sbjct: 347 YDGPGDPFLEGILFDLSRYLLITSSRENSLPANLQGRWTELLQAAWGADYHANINLQMNY 406
Query: 401 WQSLPCNLSECQEPLFDFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVV 459
W + L+ Q+ +++++T + G++TA++ Y A+GWV+H++ +I+ +A +
Sbjct: 407 WVADQTGLAATQKSVWNYMTDTWVPRGTETAKLLYNATGWVVHNEMNIFGH-TAMKEVAG 465
Query: 460 WALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLET 516
WA +P+ AW+ H+W+ ++YT D+ +L + YPL++G A F + L E DG L
Sbjct: 466 WANYPVAPAWMMQHVWDAFDYTQDKKWLSSQGYPLIKGVAEFWVSQLQEDAYTEDGSLVA 525
Query: 517 NPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL 576
P S E P CV Y +I +V + + AA+++ + + V+ V +L
Sbjct: 526 IPCNSAE---TGPT-TFGCVHYQQ-----LIHQVLDSTLIAADIVSEPDSDFVDSVSSTL 576
Query: 577 PRL-RPTKIAEDGSIMEWAQDFK---DPEVHHRHLSHLFGLFPGHTITIEK----NPDLC 628
RL + A G + EW K D HRHLSHL G FPG++I+ N +
Sbjct: 577 KRLDKGLHFASWGGLKEWKIPEKLGYDKPSTHRHLSHLNGWFPGYSISSFANGYVNETIQ 636
Query: 629 KAAEKTLQKRG-----EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE 683
A KTL RG + GW+ W++A WARL+D E AY ++ E++F
Sbjct: 637 DAIRKTLISRGMGNAEDANAGWAKVWRSACWARLNDTEKAYDHLRYAI-------EQNFV 689
Query: 684 GGLYSNLFAAHPPFQIDANFGFTAAVAEMLV--------QSTLNDLYLLPALPWDKWSSG 735
G S A +PPFQIDAN GF AV ML + L PA+P +W G
Sbjct: 690 GNGLSMYSARNPPFQIDANLGFGGAVLSMLAVDIPLPHGSKGKRTVILGPAIP-SQWGPG 748
Query: 736 CVKGLKARGGETVSICWKDGDL 757
VKGL+ RGG V W + L
Sbjct: 749 NVKGLRIRGGGVVDFEWNEKGL 770
>gi|225018139|ref|ZP_03707331.1| hypothetical protein CLOSTMETH_02076 [Clostridium methylpentosum
DSM 5476]
gi|224949136|gb|EEG30345.1| hypothetical protein CLOSTMETH_02076 [Clostridium methylpentosum
DSM 5476]
Length = 1556
Score = 306 bits (783), Expect = 4e-80, Method: Compositional matrix adjust.
Identities = 225/808 (27%), Positives = 375/808 (46%), Gaps = 105/808 (12%)
Query: 11 NPLKITFNGPAKHFT-DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTN---- 65
N L++ + PA ++T D + IGNG G +++ GV + + NE TLW G PG +N
Sbjct: 57 NTLRMWYTKPASNWTNDCLVIGNGSTGGVLFSGVGRDRVHFNEKTLWNGGPGSVSNYNGG 116
Query: 66 ----PDAPKALSDVRSLVD---SGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSH 116
P + L +R D + + T G+ + + YQ GD+ L+F +
Sbjct: 117 NRTIPTTKEQLDAIREQADDHSTSVFPLGTGGVRDFMGNGSGMGQYQDFGDLYLDFSKTG 176
Query: 117 LKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV 175
+ A T Y R+LD+ TA + + Y V + RE+F S+PD+V+ +++ SE+G L+F+
Sbjct: 177 MTDANATNYVRDLDMRTAVSSLNYDYDGVHYEREYFVSHPDKVMAVRLTASEAGKLTFDA 236
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S V + + RI ++ + A ++ ++ GT++
Sbjct: 237 S----------VAAASGLTTTATAQDGRITLAGTVRNNGMKCEMQA----QVINEGGTLT 282
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+ +D + VEG+D ++L + + + P+ DP E + + + SY +L
Sbjct: 283 SNDDGTVSVEGADAVTIVLTTGTDYANDW--PTYRTDDPHDELTATVDAAAAKSYQELKD 340
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDP-SLVELLF 354
HL DYQ+LF R+ I L C + VP+ E +K+++ E + E+++
Sbjct: 341 AHLADYQELFSRLEIDLGGE--------CPQ-----VPTDEMMKAYRRGETSHAAEEMVY 387
Query: 355 QFGRYLLISSSRPGTQV-ANLQGIW-NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQ 412
QFGRYL I+ SR G ++ NL G+W W + H N+N++MNYW + NL+EC
Sbjct: 388 QFGRYLTIAGSREGDELPTNLCGLWLIGSAGSYWGADFHFNVNVQMNYWPAYQTNLAECG 447
Query: 413 EPLFDFLTYLSINGSKTAQVNYL-----------ASGWVIHHKTDIWAKSSADRGKVVWA 461
D++ L G TA + +G++++ + + + +A G +
Sbjct: 448 SVFTDYMESLVEPGRVTAGASAALPTEPGTPIGEGNGFLVNTQNNPFG-CTAPFGSQEYG 506
Query: 462 LWPMGG-AWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPS 519
W +GG +W ++++ Y YT D++ L+ + YP+L+ A+F +L + G L PS
Sbjct: 507 -WNIGGSSWALQNVYDQYLYTGDKELLKNKIYPMLKEQANFWNQFLWYSDYQGRLVVGPS 565
Query: 520 TSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL 579
S E +T D +I+ E++ I A+E+L +ED K +L
Sbjct: 566 VSAEQ---------GPTVNGTTYDQSIVWELYKMAIEASEILGVDEDQRAVWEDKQ-SQL 615
Query: 580 RPTKIAEDGSIMEWAQ----------DFKDPEVH-------------HRHLSHLFGLFPG 616
P I G + EW + D + + HRH S L GL+PG
Sbjct: 616 NPIIIGSQGQVKEWYEESTLGKGQVDDLAEVNIPNFGAGGSANAGSVHRHTSQLIGLYPG 675
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
T+ + P+ AA +LQ+R G GWS K ++AR E Y +V +
Sbjct: 676 -TLINQDTPEWMDAAVVSLQQRNMGGTGWSKAHKINMYARTGRAEDTYSLVTGMI----- 729
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ G+ NL +HPPFQID N+G TA + EML+QS LP LP W++G
Sbjct: 730 ---AGNQNGILDNLLDSHPPFQIDGNYGLTAGMNEMLIQSQAGYTEFLPTLP-QAWATGS 785
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYS 764
+ G+ ARG + + W +G+ I S
Sbjct: 786 ISGVMARGNFEIDMDWSNGEADRFVITS 813
>gi|242815430|ref|XP_002486567.1| alpha-L-fucosidase 2 precursor, putative [Talaromyces stipitatus
ATCC 10500]
gi|218714906|gb|EED14329.1| alpha-L-fucosidase 2 precursor, putative [Talaromyces stipitatus
ATCC 10500]
Length = 773
Score = 305 bits (781), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 230/777 (29%), Positives = 377/777 (48%), Gaps = 82/777 (10%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD--YTNPDAPKA 71
K+ ++ PA+ + D +PIGNG +GA++ SE N + W+G +A
Sbjct: 5 KLWYDQPAQKWQDGLPIGNGHMGAVIISQPSSEIWSFNNISFWSGRSESTPVIEYGGREA 64
Query: 72 LSDVRSLVDSGQYAEATAASVKLFGHPADVYQ---LLGDIELEFDDSHLKYAEETYRREL 128
L +R + Y + K Y ++ I L + + + +RREL
Sbjct: 65 LDKIRKEYFADNYEHGKRLTEKYLQPEKGNYGTNLMVARIYLALEHGGEEPSFTDFRREL 124
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
+L+ A R +Y +V F RE F+S P QV++ ++ ++ + + + S +
Sbjct: 125 NLDEAIVRTEYKSKSVLFRREVFASYPHQVLMARLRTECLEGMNLKLGVSGVTKEFSISD 184
Query: 189 GNNQ--IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEG 246
G ++ E + + I +GI ++ G++ + D +L+V+
Sbjct: 185 GETTDCLVFETQAV-EEIHSNGTCGVRGRGI-------VQAHTVGGSVHIV-DGELRVKN 235
Query: 247 SDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ ++ + SF F + +D D + L ++ + SY +L H+ DYQ L+
Sbjct: 236 ASEVIIKV----SFQTDFRSLND---DWKLRVQTLLDNVWDTSYEELRALHVRDYQSLYR 288
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFGRYLLISS 364
RV I L + P +R SFQ DPSL YL IS
Sbjct: 289 RVHIDLGHTEDS------------NFPLNKRKASFQKSGYNDPSL---------YLTISG 327
Query: 365 SRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTY 421
+R + + +LQGIWN E + W H++IN +MNY+ + NL + Q PL + Y
Sbjct: 328 TRATSPLPLHLQGIWNDGEANAMNWSCDYHLDINTQMNYFPTETTNLGDLQGPLMRYCEY 387
Query: 422 LSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTHLWEHYNY 480
L+ +G K+A+ Y A GWV H +++W + D G + W L GG W+ TH+ EHY Y
Sbjct: 388 LASSGKKSARNFYGAGGWVAHVFSNVWGYT--DPGWETSWGLNITGGLWMATHMIEHYEY 445
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWL-IEGHDGYLETNPSTSPEHEFI----APDGKLAC 535
++DR+FL +AYP+L A F LD++ I+ GYL T PS SPE+ F +P K
Sbjct: 446 SLDRNFLTTQAYPVLREAAEFFLDYMTIDPRTGYLVTGPSNSPENSFYPSTQSPREKQE- 504
Query: 536 VSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQ 595
+S T+D+ ++R++F I + + L NE +V ++L +L P +I + G + EW +
Sbjct: 505 LSLGPTIDITLVRDLFKFCIFSVDELGLNESEFAARVHEALAKLPPFRIGKRGQLQEWFE 564
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL-- 653
D+++ + HRHLSH+ GL I+ P+L A + TL R E+ I + AL
Sbjct: 565 DYEEAQPDHRHLSHIIGLCRSDQISRRHTPELADAVQVTLACRQEQADLEDIEFTAALLG 624
Query: 654 --WARLHDQEHAYRMVKRLF------NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
+ARL+D +A++ + L NL+ + K G + +F A D N+G
Sbjct: 625 LAYARLNDGGNAFKQIAHLIYDLSFDNLLT--YSKPGIAGAETTIFVA------DGNYGG 676
Query: 706 TAAVAEMLVQS-----TLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
TA +AEML++S +++ LLPALP +W++G VKGL+ARG + I W +G L
Sbjct: 677 TAVIAEMLIRSLSRGKNGSEIELLPALP-TQWATGSVKGLRARGNIEIDIEWAEGTL 732
>gi|225017021|ref|ZP_03706213.1| hypothetical protein CLOSTMETH_00943 [Clostridium methylpentosum
DSM 5476]
gi|224950188|gb|EEG31397.1| hypothetical protein CLOSTMETH_00943 [Clostridium methylpentosum
DSM 5476]
Length = 1158
Score = 305 bits (781), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 240/844 (28%), Positives = 395/844 (46%), Gaps = 143/844 (16%)
Query: 4 AESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD 62
++S++ N L+I ++ PA + T+A+ IGNG +G MV+GGV + + +NE T+W G P +
Sbjct: 35 SQSSANDNLLRIWYDEPATDWQTEALAIGNGYMGGMVFGGVKRDKVHINEKTVWNGGPTE 94
Query: 63 ------YTNPDAPKALSDVRSLVD--SGQYAEATAASVKLFGHPADVYQ----------- 103
Y N + + D++ + D + + S +FG D YQ
Sbjct: 95 NNNRYNYGNTNPTETEEDLQKIKDDLNAIREKLDDKSEFVFGFDEDSYQSSGTSTRGEAM 154
Query: 104 -----LLGDIE-----LEFDDSHL------KYAEETYRRELDLNTATARVKYSVGNVEFT 147
L+GD+ ++ D + + A Y R+LD+ T A V Y V +T
Sbjct: 155 DWLNKLMGDLTGYSAPQDYADLFITNNAIDESAVTNYIRDLDMRTGLATVSYDYDGVHYT 214
Query: 148 REHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPK 207
RE+F+S PD V+V +++ + G ++FN +L GNN + G I K
Sbjct: 215 REYFNSYPDNVLVVRLTADQGGKINFNTNL------TDKTRGNN---LTNTAEGDTITMK 265
Query: 208 ANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP 267
++ + G++ A ++K+ + G IS ++ + V +D A L+L + + P
Sbjct: 266 SSLRSN--GLKVEA--QLKVVPEGGDIS-VDGSSINVANADAATLILACGTDYKMEL--P 318
Query: 268 SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEE 327
+ +DP + + + Y+DL H+ D+ LF R+ I + E
Sbjct: 319 TFRGEDPHAAVTGRISAAAEKGYADLKEDHVADHSALFSRMEIGFN-------------E 365
Query: 328 NIDTVPSAERVKSFQ-----------TDEDPSLVELL-FQFGRYLLISSSRPGTQVANLQ 375
I +P+ E +K ++ T+ + +E++ +QFGRYL I+ SR G+ NLQ
Sbjct: 366 EIPQIPTDELIKKYRNMVDNNGGEVPTEAEQRALEIICYQFGRYLTIAGSREGSLPTNLQ 425
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
G+W E S W H NIN++MNYW ++ NL+EC P D+L L G A +
Sbjct: 426 GVWGEG-SFAWGGDYHFNINVQMNYWPTMASNLAECHVPYNDYLNVLREAGRGAAAAAFG 484
Query: 436 -------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
+GW++ + + ++ + P G AW + +E+Y ++ D ++L+
Sbjct: 485 IKSEPGEENGWLVGCFSTPYMFATMGQKNNAAGWNPTGSAWALLNSYEYYLFSGDTEYLK 544
Query: 489 KRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAI 546
YP ++ A+F + L E Y+ + PS SPE+ + ++ D
Sbjct: 545 NELYPSMKEVANFWNEALYWSEYQQRYV-SGPSYSPEN---------GPIVNGASYDQQF 594
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW----------AQD 596
I + F I AAE L +ED LV + +L P + +DG + EW A D
Sbjct: 595 IWQHFENTIQAAETLGVDED-LVATWREKQSKLDPVIVGDDGQVKEWFEETTFGKAQAGD 653
Query: 597 FKDPEVH----------------HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGE 640
++ ++ HRHLSHL L+P + I+ + NP+ AA TL +RG
Sbjct: 654 LEEIDIPQWRQSLGASTSGQEPPHRHLSHLMALYPCNIIS-KDNPEYMDAAMVTLNERGL 712
Query: 641 EGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH------ 694
+ GWS K LWAR + A+++V+ G +NLF++H
Sbjct: 713 DATGWSKAHKLNLWARTGHSDEAFQIVQSAVG--------GGNSGFLTNLFSSHGGGANY 764
Query: 695 ---PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSIC 751
P FQID N+G+TA V EML+QS L + LPALP ++W++G VKG+ ARG + +
Sbjct: 765 KAYPIFQIDGNYGYTAGVNEMLLQSQLGYVQFLPALP-EEWNTGFVKGMVARGNFEIDMD 823
Query: 752 WKDG 755
W DG
Sbjct: 824 WADG 827
>gi|225019386|ref|ZP_03708578.1| hypothetical protein CLOSTMETH_03339 [Clostridium methylpentosum
DSM 5476]
gi|224947849|gb|EEG29058.1| hypothetical protein CLOSTMETH_03339 [Clostridium methylpentosum
DSM 5476]
Length = 1796
Score = 305 bits (781), Expect = 6e-80, Method: Compositional matrix adjust.
Identities = 209/679 (30%), Positives = 339/679 (49%), Gaps = 81/679 (11%)
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y+R LDLNTA V Y + V +TR+ F++ PD V+V K+ S+ G+L F V + + D
Sbjct: 185 YQRYLDLNTAVTGVSYDIDGVTYTRQMFANFPDNVMVYKMDASKEGALDFTVRPE-IPDM 243
Query: 184 HSYVNGNNQIIMEGRCPGKRIPPKANANDDPKG-IQFSAIL---EIKISDDRGTISALED 239
S +GN G+ + + N +G ++ + +L + K+ D GT++A D
Sbjct: 244 VSKASGNYDKTTMGKE--GTVFAEENGLITLRGTLKHNGMLFEGQYKVIPDGGTMTASND 301
Query: 240 K-----KLKVEGSDWAVLLLVASSSFDGPFINPSDSK---KDPTSESMSALQSIRNLSYS 291
+ ++ V G++ A +++ +++ +N D +DP + + + + L +
Sbjct: 302 ENNDHGQITVSGANSAYIIIALGTNY----VNDYDKDYVGEDPHDDVTARIANAEALGFD 357
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRS--PKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSL 349
+LY+RH DY LF R ++ L+ + P D TD +E + R + +
Sbjct: 358 ELYSRHKADYTALFDRATLSLNGATFPADKTTDQLLKE----YKAGSRSQYLE------- 406
Query: 350 VELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLS 409
+L FQFGRYLLI++SR T NLQG+WN+ +P+W S H NINL+MNYW ++ NLS
Sbjct: 407 -QLYFQFGRYLLIAASRGDTLPTNLQGVWNDSETPSWQSDYHTNINLQMNYWPAMETNLS 465
Query: 410 ECQEPLFDFLTYLSINGSKTAQVNY--------LASGWVIHHKTDIWAKSSADRGKVVWA 461
E PL +++ L G T Q + SGW+++ + +
Sbjct: 466 ETAIPLVEYIDSLRKPGRVTFQKTWGIEPAEGDEESGWIVNCSNGPMGFTGNINSNA--S 523
Query: 462 LWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL----IEGHDGYLETN 517
G A++ +L+++Y +T D+D+L YP+L+ + + L E L
Sbjct: 524 FTATGAAFINQNLFDYYQFTQDKDYLRSTIYPILKESSKTYMQILEPGRTEADKDKLYMV 583
Query: 518 PSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP 577
PS S E G +Y D +I + F+ AA+ L + D E + + +P
Sbjct: 584 PSYSSEQ------GPWTVGAY---FDQQLIYQCFNDTALAADELGIDSDFAAE-LRELMP 633
Query: 578 RLRPTKIAEDGSIMEWAQD-----------FKDPEVHHRHLSHLFGLFPGHTITIEKNPD 626
+L P +I + G I EW Q+ + HRH S L L+PG+ IT ++ P+
Sbjct: 634 KLDPIQIGDSGQIKEWQQETTYNRDQHGNTLGESAGKHRHNSQLIALYPGNFIT-DRTPE 692
Query: 627 LCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGL 686
+AA+ TL RG++ GWS+ K LWAR D HAY+++ L + G
Sbjct: 693 WMEAAKTTLNFRGDDATGWSMGHKLNLWARTGDGNHAYKLLNNLLS-----------NGT 741
Query: 687 YSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGE 746
Y+NLF HPPFQID N+G TA + EML+QS + +LPA+P D W++G GL ARG
Sbjct: 742 YNNLFDYHPPFQIDGNYGGTAGITEMLLQSQGGYIDILPAIP-DAWNAGSYNGLLARGNF 800
Query: 747 TVSICWKDGDLHEVGIYSN 765
+ + W++ +++ + SN
Sbjct: 801 EIGVSWENQVANQITVKSN 819
>gi|393222962|gb|EJD08446.1| glycoside hydrolase family 95 protein [Fomitiporia mediterranea
MF3/22]
Length = 842
Score = 305 bits (781), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 245/788 (31%), Positives = 374/788 (47%), Gaps = 107/788 (13%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP-------------DAPKALSD 74
+P+GNG LGAM+ GG E+ +LN ++LW+G P + +P + +A+
Sbjct: 56 LPVGNGFLGAMISGGTTQESTQLNIESLWSGGP--FADPGYNGGNKQLDEQSEIGQAMRS 113
Query: 75 VRSLVDSGQYA-----EATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+R + ++ +A A + +G+ + L+ + + A Y R LD
Sbjct: 114 IRQKIFKSKHGTIDNVDALMAPIGAYGNYSSAGFLVSTLT-----NTPSSAISDYARFLD 168
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD---NHSY 186
L T AR ++ GN +FTRE F S P Q S + S +L +++ +
Sbjct: 169 LETGVARTIWTHGNYQFTRETFCSYPAQACAQNTSSTNPSGFSQTYALGAIIGLPPPNVT 228
Query: 187 VNGNNQIIMEGRC--PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV 244
N+ + G PG A + P GI +E + + L +
Sbjct: 229 CADNSTLRSSGLVSNPGMAYEILATVSVSPGGI-----IECNTVPNVNHTRKASNATLTI 283
Query: 245 EGSDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ ++ V +++D + + S DP S L S SYS+ H+ D
Sbjct: 284 SNATSMSIMWVGGTNYDAGAGDAAHSFSFRGSDPHEGLSSLLISASEKSYSEFVAEHISD 343
Query: 301 YQKLFH-RVSIQLSRSPKDIVTDTCSEENID-TVPSAERVKSFQTDE-DPSLVELLFQFG 357
++ + S+ L +NI+ VP+ + ++ D+ DP L LLF +G
Sbjct: 344 FKSALNPSFSLNLG-------------QNINLKVPTDKLKDVYRVDKGDPYLEWLLFNYG 390
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLL+SS+R G ANLQG W D W + HVNINL+MNYW + NL + + LFD
Sbjct: 391 RYLLVSSAR-GALPANLQGKWARDAGNPWSADYHVNINLQMNYWFAESTNL-DVTKSLFD 448
Query: 418 FL--TYLSINGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
F+ T++S G+ TAQV Y ++ GWV+H++ +I+ + +G WA +P AW+ H+
Sbjct: 449 FIEETWVS-RGTYTAQVLYNSTQGWVLHNEINIFGHTGMKQGDAEWADYPESNAWMMIHV 507
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDG 531
W+H+++T D + + + YPL++G ASF L+ LI DG L P SPE P
Sbjct: 508 WDHFDFTNDVAWWKAQGYPLVKGAASFHLNKLIPDERFKDGTLVVAPCNSPEQ----PPI 563
Query: 532 KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSI 590
LAC +I ++F+A+ A + ++A + ++ R+ + I G +
Sbjct: 564 TLACAHAQQ-----VIWQLFNAVEKGAAAAGETDEAFLNEIKSKKGRMDKGIHIGSWGQL 618
Query: 591 MEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL---------CKAAEKT-LQKRGE 640
EW D P HRH+SHL GL+PG+ I+ NPD+ +AA +T L RG
Sbjct: 619 QEWKVDMDSPTDTHRHMSHLVGLYPGYAIS-NYNPDIQGLKYSVADVRAAARTSLIHRGN 677
Query: 641 -EGP----GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYS--NLFAA 693
GP GW W+ A WA+ D + Y L VD ++F L+S N F
Sbjct: 678 GTGPDADSGWEKVWRAACWAQFADPDKFYH---ELTYAVD----RNFAANLFSIYNPFDP 730
Query: 694 HPPFQIDANFGFTAAVAEMLVQ-----STLNDL--YLLPALPWDKWSSGCVKGLKARGGE 746
P FQIDANFG+TAAV L+Q ST L LLPALP WS+G + G + RGG
Sbjct: 731 DPIFQIDANFGYTAAVMNALIQAPDVASTTIPLTITLLPALP-SAWSTGSISGARVRGGI 789
Query: 747 TVSICWKD 754
TV + W D
Sbjct: 790 TVDMAWVD 797
>gi|210632036|ref|ZP_03297176.1| hypothetical protein COLSTE_01069 [Collinsella stercoris DSM 13279]
gi|210159752|gb|EEA90723.1| F5/8 type C domain protein [Collinsella stercoris DSM 13279]
Length = 1203
Score = 305 bits (780), Expect = 8e-80, Method: Compositional matrix adjust.
Identities = 234/799 (29%), Positives = 367/799 (45%), Gaps = 111/799 (13%)
Query: 26 DAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG------DYTNPD---APKALSDVR 76
DA+ IGNG+ GA+++G V + + NE TLWTG P D N D L +R
Sbjct: 72 DALVIGNGKTGAILFGQVAQDKVHFNEKTLWTGGPSKSRPNYDGGNKDQAVTKHQLDALR 131
Query: 77 SLVDSGQ---YAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAE-ETYRRELDL 130
+ +D + T +++G + YQ GD+E +F + + Y R+LD+
Sbjct: 132 AKMDDHSKDVFPMGTQIPTEVWGDGNGMGAYQDFGDLEFDFSPMGATNSNIQNYERDLDM 191
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
TA + V Y V +TRE+ +S+P V+ ++ S+ G +SF++ + S + + +
Sbjct: 192 RTAVSTVSYDFNGVHYTREYLASHPAGVVAVRLDASKDGEISFDLGVGSAKGLNVRASAD 251
Query: 191 -NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
+++ G + + A P+G G+I A E V +D
Sbjct: 252 AGDLVLAGNVADNGMLCEMRARVLPEG---------------GSIKASESGGFSVRDADA 296
Query: 250 AVLLLVASSSFDGPFINPSDSKKDPTSESMSALQS----IRNLSYSDLYTRHLDDYQKLF 305
+L + ++ + PS + +AL+ +SY +L +H+DD++ LF
Sbjct: 297 VTVLYATETDYENAY--PSYRSGQTLEQVDAALKEKLDVAAGISYDELKKQHIDDHRSLF 354
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLFQFGRYLLISS 364
RV I L P TD + +K ++ + DP + E+LFQFGRYL I+S
Sbjct: 355 ERVEIDLGGVPAQKPTD-------------QMMKDYRAGNNDPFIEEMLFQFGRYLTIAS 401
Query: 365 SRPGTQV-ANLQGIWN-EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SR G ++ +NL GIW D W H N+N++MNYW + NLSEC D++ L
Sbjct: 402 SREGDELPSNLCGIWMMGDAGRFWGGDFHFNVNVQMNYWPAYMTNLSECGSVFTDYMESL 461
Query: 423 SINGSKTAQVNYL-------------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW 469
+ G TA+ + G++++ + + + +A G + G +W
Sbjct: 462 VVPGRVTAERSAAMKTENHATTPVGQGKGFLVNTQNNPFG-CTAPFGSQEYGWNVTGSSW 520
Query: 470 LCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIA 528
++++ Y +T D + L R YP+L+ +F +L + L PS S E
Sbjct: 521 ALQNVYDEYLFTRDENLLRTRIYPMLKEMTTFWDGFLWWSDYQKRLVVGPSFSAEQ---- 576
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDG 588
ST D +++ E+++ I A+E L +ED L + K+ +L P I E+G
Sbjct: 577 -----GPTVNGSTYDQSLVWELYTMAIDASERLGVDED-LRAEWKKTRDKLNPIIIGEEG 630
Query: 589 SIMEW--------AQDFKDPEVH---------------HRHLSHLFGLFPGHTITIEKNP 625
+ EW AQ PEV HRH S L GL+PG T+ + N
Sbjct: 631 QVKEWFEETSTGKAQAGSLPEVAIPNFGAGGGANQGALHRHTSQLIGLYPG-TLVNKDNK 689
Query: 626 DLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGG 685
AA KTL+ RG G GWS K +WAR E Y +++ + + G
Sbjct: 690 AWMDAAIKTLEIRGLGGTGWSKAHKINMWARTGKAETTYELIRAMI--------AGNKNG 741
Query: 686 LYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGG 745
+ NL +HPPFQID NFG TA +AE L+QS L LLPALP + W G V+G+ ARG
Sbjct: 742 ILDNLLDSHPPFQIDGNFGLTAGIAECLLQSQLGYAQLLPALP-EAWGYGSVEGIVARGN 800
Query: 746 ETVSICWKDGDLHEVGIYS 764
+ + W G L V + S
Sbjct: 801 FVIDMDWSAGTLDGVNVES 819
>gi|421310055|ref|ZP_15760680.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA62681]
gi|395909670|gb|EJH20545.1| alpha-L-fucosidase, putative, afc95A [Streptococcus pneumoniae
GA62681]
Length = 709
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 218/688 (31%), Positives = 334/688 (48%), Gaps = 78/688 (11%)
Query: 101 VYQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVI 159
Y GDI +EF ++ T Y+R+L+++ A A Y F RE F+S PD ++
Sbjct: 27 TYLSFGDIHIEFSQQGTTLSQVTDYQRQLNISKALATTSYVYKGTRFEREAFASFPDDLL 86
Query: 160 VTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCP----GKRIPPKANANDDPK 215
V + +L F + L D S + C I K D+
Sbjct: 87 VQCFTKEGLETLDFTIELSLTCDLASDGKYEQEKSDYKECKLDITDSHILMKGRVKDND- 145
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPT 275
++F++ L + G I D+ +++ G+ +A L L A + F + K D
Sbjct: 146 -LRFASYLAWETD---GDIRVWSDR-VQISGASYANLFLAAKTDFAQNPASNYRKKLDLE 200
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
+ + + + + Y+ L +RH++DYQ LF RV + L E ++D +
Sbjct: 201 QQVIDLVDTAKEKGYTQLKSRHIEDYQALFQRVQLDL-------------EADVDASTTD 247
Query: 336 ERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNEDLSPTWDSAPHVN 393
+ +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN D H+N
Sbjct: 248 DLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNSDY--------HLN 299
Query: 394 INLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA--------SGWVIHHKT 445
+NL+MNYW + NL E P+ +++ L + G + A V Y +GW++H +
Sbjct: 300 VNLQMNYWPAYVTNLLEAVFPVINYVDDLRVYG-RLAAVKYAGIVSQKGEENGWLVHTQA 358
Query: 446 DI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLL 503
W D W P AW+ ++E Y++ D+D+L ++ YP+L F
Sbjct: 359 TPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREKIYPMLRETVRFWN 415
Query: 504 DWLIEGHDGYL-ETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLE 562
+L + ++PS SPEH +S +T D ++I ++F I AA+ L
Sbjct: 416 AFLHKDQQAQRWVSSPSYSPEH---------GPISIGNTYDQSLIWQLFHDFIQAAQELG 466
Query: 563 KNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDPEV--HHRHLSHLFGLFPG 616
+ED L E KS L P +I + G I EW ++ F++ +V HRH SHL GL+PG
Sbjct: 467 LDEDLLTEVKEKS-DLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQHRHASHLVGLYPG 525
Query: 617 HTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDP 676
+ + K + +AA +L RG+ G GWS K LWARL D A++++
Sbjct: 526 NLFSY-KGQEYIEAARASLNDRGDGGTGWSEANKINLWARLGDGNRAHKLLA-------- 576
Query: 677 EHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGC 736
+ + NL+ +HPPFQID NFG T+ +AEML+QS L L ALP D WS+G
Sbjct: 577 ---EQLKTSTLQNLWCSHPPFQIDGNFGATSGMAEMLLQSHAAYLVPLAALP-DAWSTGS 632
Query: 737 VKGLKARGGETVSICWKDGDLHEVGIYS 764
V GL ARG VS+ W+D L ++ I S
Sbjct: 633 VSGLMARGHFEVSMSWEDKKLLQLTILS 660
>gi|229829382|ref|ZP_04455451.1| hypothetical protein GCWU000342_01471 [Shuttleworthia satelles DSM
14600]
gi|229792545|gb|EEP28659.1| hypothetical protein GCWU000342_01471 [Shuttleworthia satelles DSM
14600]
Length = 1622
Score = 303 bits (775), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 245/858 (28%), Positives = 388/858 (45%), Gaps = 156/858 (18%)
Query: 6 STSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY- 63
+ + N L++ ++ PA + T ++ IGNG +G +V+GG+ + + +NE T+W G P
Sbjct: 39 NAKSDNLLRLWYDKPASDWQTQSLAIGNGYMGGLVFGGINQDRIHINEKTVWEGGPDGKS 98
Query: 64 ------TNPDAPKA--------LSDVRSLVDSGQYAEATAASVKLFGHPADVYQ------ 103
TNP + + L+++R +D S +FG + YQ
Sbjct: 99 TYSYGTTNPISTEEDLQKIKDNLNEIRQKLDD--------KSEHVFGFDENSYQASGTDT 150
Query: 104 ----------LLGDIELEFDDSHLKYAE------------ETYRRELDLNTATARVKYSV 141
L+GD L+ D+ YA Y R+LD+ TA A V Y
Sbjct: 151 KGEAMDALNKLMGD--LKGYDAPTDYANLYISNDQDPSKVTNYVRDLDMRTALATVSYDY 208
Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG 201
V + RE+F+S PD ++ ++S + G +SF +L++L+ +Y N ++ G
Sbjct: 209 EGVHYCREYFNSYPDNIMAVRLSADKDGKISFKTNLENLIGGDAYTN-----VVRGDTIT 263
Query: 202 KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDK---KLKVEGSDWAVLLLVASS 258
R D +G A ++K+ ++ G+IS+ E+ ++V G++ L+ +
Sbjct: 264 MR--------DALRGNGLKAEAQLKVINEGGSISSDENDGKPAIRVSGANAVTLIFACGT 315
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
+ P+ +DP +Q+ Y L H++D+ LF R+ +
Sbjct: 316 DYKMEL--PNFRGEDPHKAVKKRIQAAAKKGYQVLKKDHVEDHSALFSRMELGFDEEIPQ 373
Query: 319 IVTD-------TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV 371
I TD E N +P + E +L + +QFGRYL I+ SR G+
Sbjct: 374 IPTDELIRRYRNMVENNGGQIP--------MSAEQRALEVMCYQFGRYLTIAGSREGSLP 425
Query: 372 ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQ 431
NLQG+W E TW H NIN++MNYW ++ NL EC +P DFL L G A
Sbjct: 426 TNLQGVWGEGFF-TWYGDYHFNINVQMNYWPTMASNLGECMKPYNDFLNVLKEAGRNAAA 484
Query: 432 VNY-------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
+Y +GW++ + + S+ + P+G AW + +E+Y YT D
Sbjct: 485 ASYGIKSREGEENGWLVGCFSTPYMFSALGQKNNAAGWNPIGSAWALLNSYEYYLYTGDT 544
Query: 485 DFLEKRAYPLLEGCASF---LLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
+L ++ YP ++ A+F L W E Y+ + PS SPE+ + ++
Sbjct: 545 QYL-RQLYPSMKEVANFWNKALYW-SEYQQRYV-SAPSYSPEN---------GPIVNGAS 592
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW-------- 593
D I + I AAE L + D LV + + +L P + + G + EW
Sbjct: 593 YDQQFIWQHLENTIHAAETLGLDGD-LVAEWKEKQSKLDPVIVGKSGQVKEWFEETSFGK 651
Query: 594 AQDFKDPEVH------------------HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
AQ PE+ HRHLSHL L+P + I+ +K P+ AA +L
Sbjct: 652 AQAGNLPEIDIPQWRQSLGAQNSGVQPPHRHLSHLMALYPCNLISKDK-PEYMNAAIVSL 710
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH- 694
++RG + GWS K LWAR E A+++V+ + G +NLF +H
Sbjct: 711 KERGLDATGWSKAHKLNLWARTGHAEEAFKLVQSDVGGGNS--------GFLTNLFCSHG 762
Query: 695 --------PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGE 746
P FQID NFG+TA V EML+QS L + LPALP D+WS+G VKG+ ARG
Sbjct: 763 SGANYKEKPIFQIDGNFGYTAGVNEMLLQSQLGYVQFLPALP-DQWSTGHVKGIVARGNF 821
Query: 747 TVSICWKDGDLHEVGIYS 764
+++ W +G I S
Sbjct: 822 EINMDWSNGKADRFEITS 839
>gi|225019012|ref|ZP_03708204.1| hypothetical protein CLOSTMETH_02963 [Clostridium methylpentosum
DSM 5476]
gi|224948237|gb|EEG29446.1| hypothetical protein CLOSTMETH_02963 [Clostridium methylpentosum
DSM 5476]
Length = 1657
Score = 302 bits (773), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 236/796 (29%), Positives = 362/796 (45%), Gaps = 136/796 (17%)
Query: 13 LKITFNGPAKHFTDA------IPIGNGRLGAMVWGGVPSETLKLNEDTLW--TGVPGDYT 64
LK+ ++ PA + +DA +P+G G +GA V+G +E ++L E++L G G
Sbjct: 53 LKLWYDEPAPN-SDAGWEQWSLPLGCGYMGANVFGITDTERIQLTENSLCGNNGFEGGLN 111
Query: 65 NPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE-- 122
N F +++L + +
Sbjct: 112 N----------------------------------------------FSETYLDFGHDYS 125
Query: 123 ---TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVS--- 176
Y R+L LN ATA V+Y G V ++RE+F+S PD+V+ K+S SESG LSF +
Sbjct: 126 GVSNYTRDLILNDATAHVRYDYGGVTYSREYFTSYPDKVMAIKLSASESGKLSFTLRPTI 185
Query: 177 --LDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTI 234
L+ G+ I + GR G + + P G S D GTI
Sbjct: 186 PYLNEKKSGTVSAQGDT-ITLSGRMHGYEVDFEGQYKVIPSGGSASMQAANDADGDNGTI 244
Query: 235 SALEDKKLKVEGSDWAVLLLVASSSFD---GPFINPSDSK----KDPTSESMSALQSIRN 287
+V G+D AV+L+ ++++ F+NP +K + P ++ ++
Sbjct: 245 --------QVTGADSAVILIAIGTNYEFDPQVFLNPDATKLEGFEHPHAKVTERIEQASA 296
Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DED 346
SY L + H DYQ LF R L + + TD E + +++ D
Sbjct: 297 QSYEQLRSNHTADYQNLFDRTRFDLGGAVPQLTTD-------------ELMNAYKAGSND 343
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
L EL FQ+GRYLLISSSR G NLQG+WN W + NIN++MNYW
Sbjct: 344 RYLEELYFQYGRYLLISSSRKGALPPNLQGVWNMYEQAPWTAGYWHNINIQMNYWPVFST 403
Query: 407 NLSECQEPLFDFL-TYLSINGSKTAQV-------NYLASGWVIHHKTDIWAKSSADRGKV 458
NL+E + D+ YL + + Q NY G + W+ +
Sbjct: 404 NLAELFDSYIDYYNAYLPAVRNSSNQFIAQQHPDNYDPGG------DNGWSIGTGAGPYS 457
Query: 459 VWALWPMG------GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDG 512
V+A G GA + WE+Y++T D D LE YP + G A+F + ++E H
Sbjct: 458 VYAPNGQGTDGNGTGALMAQVFWEYYDFTRDPDILENITYPAVSGAANF-MSRVMEPHGD 516
Query: 513 YLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKV 572
YL +PS SPE +G V+ + D + E+ + AAE+L + ++AL +++
Sbjct: 517 YLLADPSASPEQ---MENGNY-VVTVGTAWDQQLAYEMEQNTLEAAELLGRQDEALPQRL 572
Query: 573 LKSLPRLRPTKIAEDGSIMEWAQD---FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCK 629
+ +L P ++ G I E+ ++ + E +HRH+S L GL+PG T+ P
Sbjct: 573 ADQIDKLDPVQVGFSGQIKEFREENFYGEIAEYNHRHISQLVGLYPG-TLINSTTPAWMD 631
Query: 630 AAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSN 689
AA+ +L RG++ GW++ + WAR D Y + + L + G +N
Sbjct: 632 AAKVSLNLRGDKSTGWAMAHRLNAWARTKDGNRTYSIYQTL-----------LKNGTLNN 680
Query: 690 LFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
L+ HPPFQID NFG TA V+EML+QS + +PA+P D W+ G +GL ARG TV
Sbjct: 681 LWDTHPPFQIDGNFGGTAGVSEMLLQSHEGYIAPMPAIP-DAWAQGSYRGLVARGNFTVG 739
Query: 750 ICWKDGDLHEVGIYSN 765
W +G + I SN
Sbjct: 740 ADWSNGQADQFTITSN 755
>gi|310286736|ref|YP_003937994.1| cell wall protein containing Ig-like domains (group2 and 3)
[Bifidobacterium bifidum S17]
gi|309250672|gb|ADO52420.1| cell wall protein containing Ig-like domains (group2 and 3)
[Bifidobacterium bifidum S17]
Length = 1959
Score = 301 bits (772), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 246/877 (28%), Positives = 397/877 (45%), Gaps = 149/877 (16%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q+ N Y+ + H+DD+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQAAANKGYTAVKKAHIDDHSAIYDRVKINLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGNTTDCSADNWAKGDNGNFTD 1200
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW-------------AQDFKDPEVHHRHLSHL 610
KSL L+P ++ + G I EW A + HRH+SHL
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGDSGQIKEWYFEGALGKKKDGSAISGYQADNQHRHMSHL 1258
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAY 664
GLFPG ITI+ N + AA+ +L+ R +G GW+I + WAR D Y
Sbjct: 1259 LGLFPGDLITID-NSEYMDAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTY 1317
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST------- 717
++V E + +Y+NLF H PFQID NFG T+ V EML+QS
Sbjct: 1318 KLV-----------ELQLKNAMYANLFDYHAPFQIDGNFGNTSGVDEMLLQSNSTFTDTD 1366
Query: 718 ----LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+N +LPALP D W+ G V GL ARG TV WK+G EV + SN
Sbjct: 1367 GKKYVNYTNILPALP-DAWADGSVSGLVARGNFTVGTTWKNGKATEVKLTSN-------- 1417
Query: 774 FKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 1418 ------KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 1448
>gi|146386777|pdb|2EAB|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum (Apo Form)
gi|146386778|pdb|2EAB|B Chain B, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum (Apo Form)
gi|146386779|pdb|2EAC|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complex With
Deoxyfuconojirimycin
gi|146386780|pdb|2EAC|B Chain B, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complex With
Deoxyfuconojirimycin
Length = 899
Score = 301 bits (772), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 248/872 (28%), Positives = 397/872 (45%), Gaps = 139/872 (15%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 52 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 111
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 112 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 169
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 170 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 226
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ N G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 227 DTLTVKGALGNN------GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 280
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+DD+ ++ RV I L +
Sbjct: 281 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 340
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 341 SGHSSDGAVAT----DALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 396
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 397 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 456
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 457 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 515
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 516 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 574
Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
G +Y S++ ++ + A + +A+ KN+ DA +
Sbjct: 575 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 630
Query: 572 ---VLKSLPRLRPTKIAEDGSIMEWAQDF-----KDPEV--------HHRHLSHLFGLFP 615
KSL L+P ++ + G I EW + KD HRH+SHL GLFP
Sbjct: 631 SWSCAKSL--LKPIEVGDSGQIKEWYFEGALGKKKDGSTISGYQADNQHRHMSHLLGLFP 688
Query: 616 GHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAYRMVKR 669
G ITI+ N + AA+ +L+ R +G GW+I + WAR D Y++V
Sbjct: 689 GDLITID-NSEYMDAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTYQLV-- 745
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST-----------L 718
E + +Y+NLF H PFQID NFG T+ V EML+QS +
Sbjct: 746 ---------ELQLKNAMYANLFDYHAPFQIDGNFGNTSGVDEMLLQSNSTFTDTAGKKYV 796
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
N +LPALP D W+ G V GL ARG TV WK+G EV + SN
Sbjct: 797 NYTNILPALP-DAWAGGSVSGLVARGNFTVGTTWKNGKATEVRLTSN------------- 842
Query: 779 YRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 843 -KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 873
>gi|383115618|ref|ZP_09936374.1| hypothetical protein BSGG_2513 [Bacteroides sp. D2]
gi|313694978|gb|EFS31813.1| hypothetical protein BSGG_2513 [Bacteroides sp. D2]
Length = 793
Score = 301 bits (772), Expect = 7e-79, Method: Compositional matrix adjust.
Identities = 234/774 (30%), Positives = 350/774 (45%), Gaps = 124/774 (16%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
++PIGNG +GA ++G +E ++L E T GV G Y
Sbjct: 58 SLPIGNGYMGACIFGRTDTERIQLTEKTF--GVKGPYKKGG------------------- 96
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
G+ A++Y IE D L Y+R L LN A +RV Y V +
Sbjct: 97 --------IGNFAEIY-----IEGIHHDQPL-----NYKRSLRLNDAISRVNYQYEGVNY 138
Query: 147 TREHFSSNPDQVIVTKISGSESGSLSFNVS-----LDSLLDNHSYVNG-----NNQIIME 196
TRE+F++ P VIV K+ + G +SF + L D + G N+ I +
Sbjct: 139 TREYFANYPSNVIVVKLKADQPGKISFTLRPVLPYLHEYNDEGTGRTGKVSAQNDLITLT 198
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVA 256
G R+P +A P G Q A+ +D+ G + ++++ +D VLL+ A
Sbjct: 199 GDIQFFRLPYEAQIKVIPSGGQLKAM-----NDELGN-----NGTIRIQQADSVVLLINA 248
Query: 257 -------SSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVS 309
SS F N + P +Q + Y L H+ DYQ LF RV
Sbjct: 249 QTAYQLKSSVFTASPENKFTGNEHPHRAVSQCIQKAADKGYEALCKEHIADYQSLFSRVD 308
Query: 310 IQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGT 369
+ L I TD+ + +R K E + ELLFQ+GRYLLI+SSR G+
Sbjct: 309 LHLCNETPGIPTDSLLHD-------YQRGK-----ESLYMDELLFQYGRYLLIASSRKGS 356
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKT 429
+LQG W++ W NIN++MNYW + NL+E F+ Y+ N +
Sbjct: 357 LPPHLQGAWSQYEYAPWSGGYWHNINIQMNYWAAFNTNLAEV------FIPYVEYNEAFR 410
Query: 430 AQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAW---------------LCTHL 474
N A+G++ + D + + G W + A+ T L
Sbjct: 411 QSANEKATGYIKKNNPDALSAIPEENG---WTIGTGANAFSIDSPGGHSGPGTGGFTTKL 467
Query: 475 -WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKL 533
W++Y++T D D L+K +YP + G A FL L + YL +PS+SPE +
Sbjct: 468 FWDYYDFTRDEDILKKHSYPAMLGMAKFLSKTLKPTEEEYLLADPSSSPEQYHNGTTYQT 527
Query: 534 ACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW 593
++ D +I E F ++ AA++L K E + + + + +L +I E G I E+
Sbjct: 528 KGCAF----DQGMIWESFHDVLKAADIL-KEESPFLRTIKEQIGKLDAIQIGESGQIKEY 582
Query: 594 AQDFKDPEV---HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWK 650
++ K ++ HRH+SHL L+PG I E P+ KAA TL RG++ GW + +
Sbjct: 583 REEKKYSDIGDPRHRHISHLCALYPGTLINAE-TPEWLKAATVTLNNRGDKSTGWGVAHR 641
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
LWAR+ D + AY+ + L + NL+ HPPFQID N G TA VA
Sbjct: 642 LNLWARVKDGDMAYQRYQLLLKKY-----------ILENLWNMHPPFQIDGNLGGTAGVA 690
Query: 711 EMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
EML+QS + LPALP W G +GL ARG VS+ WK G + ++ + S
Sbjct: 691 EMLIQSHEGYIDPLPALP-AAWRDGSYEGLVARGNFVVSVFWKQGLMTQMNVLS 743
>gi|358368279|dbj|GAA84896.1| similar to glycoside hydrolase family 95 protein [Aspergillus
kawachii IFO 4308]
Length = 810
Score = 301 bits (771), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 238/792 (30%), Positives = 359/792 (45%), Gaps = 113/792 (14%)
Query: 27 AIPIGNGRLG--------------------AMVWGGVPSETLKLNEDTLWTGVPGD---Y 63
A P+GNGRLG AM G E + LN D+LW G P + Y
Sbjct: 38 AFPLGNGRLGGSYFDQTSKGYYGRILKCSLAMPVGSYDKEIVNLNVDSLWRGGPFESPTY 97
Query: 64 T--NPDAPKA--LSDVRSLVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDD-SH 116
+ NP+ KA L +R + + T L G +P YQ+L ++ ++ S
Sbjct: 98 SGGNPNVSKAGALPGIREWI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGQLSD 153
Query: 117 LKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV 175
+ + YRR LDL++A +S G RE F S PD V V K+S + S ++F +
Sbjct: 154 I----DGYRRNLDLSSAVYSDHFSTGETYIEREAFCSYPDNVCVYKLSSNSSLPGITFGL 209
Query: 176 --SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
L S N S +GN+ + G+ P G+ ++A + + +
Sbjct: 210 ENQLTSPAPNVS-CHGNSISLY-----GQTYPVI--------GMIYNARVTVVVPGSSNA 255
Query: 234 ISALEDKKLKV-EGSDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNL 288
+KV EG L+ A +++D N S ++P ++ + A +
Sbjct: 256 SDLCSSLTIKVPEGEKEVFLVFAADTNYDASNGNSKASFSFKGENPYTKVLQAATNAAKK 315
Query: 289 SYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPS 348
+YS L + H+ DYQ +F+ ++ L P+ E + S+ DP
Sbjct: 316 TYSALKSSHVKDYQGVFNEFTLTLP-----------DPNGSADRPTTELLSSYSQPGDPY 364
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
+ LLF +GRYL ISSSRPG+ NLQG+W E SP W H NINL+MN+W L
Sbjct: 365 VENLLFDYGRYLFISSSRPGSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVEQTGL 424
Query: 409 SECQEPLFDFLTYLSI-NGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMG 466
E EPL+ ++ + G++TA++ Y S GWV H + + + +A + WA +P
Sbjct: 425 GELTEPLWTYMAETWMPRGAETAELLYGTSEGWVTHDEMNTFGH-TAMKDVAQWADYPAT 483
Query: 467 GAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPE 523
AW+ H+W+H++Y+ D + ++ YP+L+G A F L L++ DG L NP SPE
Sbjct: 484 NAWMSHHVWDHFDYSQDSTWYREKGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPE 543
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-T 582
H P C Y +I EVF ++ ++ + + L L P
Sbjct: 544 H---GPT-TFGCTHYQQ-----LIWEVFGHVLQGWTASGDDDTSFKNAITSKLSTLDPGI 594
Query: 583 KIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITI--EKNPDLCKAAEKTLQKRG- 639
I G I EW D HRHLS+L+G +PG+ I+ N + A E TL RG
Sbjct: 595 HIGSWGQIQEWKLDIDVKNDTHRHLSNLYGWYPGYVISSVHGSNKTITDAVETTLYSRGT 654
Query: 640 ---EEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP 696
+ GW+ W++A WA L+ + AY + + D E F+ +++ PP
Sbjct: 655 GVEDSNTGWAKVWRSACWALLNVTDEAYSELS--LAIQDNFAENGFD------MYSGSPP 706
Query: 697 FQIDANFGFTAAVAEMLVQ-----------STLNDLYLLPALPWDKWSSGCVKGLKARGG 745
FQIDANFG A+ +ML++ + L PA+P W G V GL+ RGG
Sbjct: 707 FQIDANFGLVGAMVQMLIRDLDRSNADARAGKTQAVLLGPAIP-AAWGGGSVDGLRLRGG 765
Query: 746 ETVSICWKDGDL 757
VS W D L
Sbjct: 766 GVVSFSWDDNGL 777
>gi|418222212|ref|ZP_12848861.1| hypothetical protein SPAR104_2201 [Streptococcus pneumoniae
GA47751]
gi|353872607|gb|EHE52471.1| hypothetical protein SPAR104_2201 [Streptococcus pneumoniae
GA47751]
Length = 461
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 172/436 (39%), Positives = 235/436 (53%), Gaps = 44/436 (10%)
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
+ LLF +GRYLLISSS+P ANLQGIW ++L+P W S +NIN +MNYW PC+L
Sbjct: 1 MTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDL 60
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGA 468
E + PLFD L + G TA+ Y A G+ HH TD ++ ++ + A+W +
Sbjct: 61 PEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFSDTAPQSHAMGAAIWVLTIP 120
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
WLCTH+WEHY Y D L + + +++ F D+L E DGYL T PS SPE+++
Sbjct: 121 WLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRL 178
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAE 586
+G SST+D I+R + I A+ L N D + V+++ K LP+ TKI
Sbjct: 179 KNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGS 235
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR-------- 638
+G I EW +D+++ E HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 236 NGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLS 295
Query: 639 -----------------GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH 681
GWS W +ARL+ E AY + L N
Sbjct: 296 SQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN--------- 346
Query: 682 FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLK 741
NLF HPPFQID N G + + E+LVQS N L L+PALP WS G VKG +
Sbjct: 347 --NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFR 403
Query: 742 ARGGETVSICWKDGDL 757
RGG VS WK+GD+
Sbjct: 404 VRGGYKVSFAWKNGDI 419
>gi|418165478|ref|ZP_12802140.1| hypothetical protein SPAR45_2154 [Streptococcus pneumoniae GA17371]
gi|353827258|gb|EHE07411.1| hypothetical protein SPAR45_2154 [Streptococcus pneumoniae GA17371]
Length = 461
Score = 300 bits (768), Expect = 2e-78, Method: Compositional matrix adjust.
Identities = 172/436 (39%), Positives = 234/436 (53%), Gaps = 44/436 (10%)
Query: 349 LVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNL 408
+ LLF +GRYLLISSS+P ANLQGIW ++L+P W S +NIN +MNYW PC+L
Sbjct: 1 MTNLLFHYGRYLLISSSQPNGLPANLQGIWCDELNPIWGSKYTININTQMNYWMVGPCDL 60
Query: 409 SECQEPLFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGA 468
E + PLFD L + G TA+ Y A G+ HH TD + ++ + A+W +
Sbjct: 61 PEVEYPLFDMLERMREPGRLTAKKMYGARGFTAHHNTDGFGDTAPQSHAMGAAIWVLTIP 120
Query: 469 WLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIA 528
WLCTH+WEHY Y D L + + +++ F D+L E DGYL T PS SPE+++
Sbjct: 121 WLCTHIWEHYLYFQDERILTEH-FEMIKEAFLFFEDYLFEV-DGYLMTGPSVSPENKYRL 178
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDAL--VEKVLKSLPRLRPTKIAE 586
+G SST+D I+R + I A+ L N D + V+++ K LP+ TKI
Sbjct: 179 KNGIEGNACLSSTIDNQILRYFCDSCIGIAKQLGDNSDFISRVKELKKKLPK---TKIGS 235
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR-------- 638
+G I EW +D+++ E HRH+S LFGL+P + I I K P+L +AA+ T+ +R
Sbjct: 236 NGQIQEWLEDYEEVEPGHRHISPLFGLYPYNEIDIHKTPELAEAAKITINRRLSNANFLS 295
Query: 639 -----------------GEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH 681
GWS W +ARL+ E AY + L N
Sbjct: 296 SQEREQAINNWLVSGLHASTQTGWSAAWLIHFFARLYQGEPAYNQINGLLN--------- 346
Query: 682 FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLK 741
NLF HPPFQID N G + + E+LVQS N L L+PALP WS G VKG +
Sbjct: 347 --NATLGNLFLDHPPFQIDGNLGLVSGICELLVQSHHNWLSLIPALP-SAWSEGEVKGFR 403
Query: 742 ARGGETVSICWKDGDL 757
RGG VS WK+GD+
Sbjct: 404 VRGGYKVSFAWKNGDI 419
>gi|34451973|gb|AAQ72464.1| alpha-fucosidase [Bifidobacterium bifidum JCM 1254]
Length = 1959
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 248/872 (28%), Positives = 398/872 (45%), Gaps = 139/872 (15%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+DD+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
G +Y S++ ++ + A + +A+ KN+ DA +
Sbjct: 1150 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 1205
Query: 572 ---VLKSLPRLRPTKIAEDGSIMEW-----AQDFKDPEV--------HHRHLSHLFGLFP 615
KSL L+P ++ + G I EW KD HRH+SHL GLFP
Sbjct: 1206 SWSCAKSL--LKPIEVGDSGQIKEWYFEGALGKKKDGSTISGYQADNQHRHMSHLLGLFP 1263
Query: 616 GHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAYRMVKR 669
G ITI+ N + AA+ +L+ R +G GW+I + WAR D Y++V
Sbjct: 1264 GDLITID-NSEYMDAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTYQLV-- 1320
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST-----------L 718
E + +Y+NLF H PFQID NFG T+ V EML+QS +
Sbjct: 1321 ---------ELQLKNAMYANLFDYHAPFQIDGNFGNTSGVDEMLLQSNSTFTDTAGKKYV 1371
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
N +LPALP D W+ G V GL ARG TV WK+G EV + SN
Sbjct: 1372 NYTNILPALP-DAWAGGSVSGLVARGNFTVGTTWKNGKATEVRLTSN------------- 1417
Query: 779 YRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 1418 -KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 1448
>gi|146386781|pdb|2EAD|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complex With Substrate
gi|146386782|pdb|2EAD|B Chain B, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complex With Substrate
Length = 899
Score = 300 bits (767), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 247/872 (28%), Positives = 398/872 (45%), Gaps = 139/872 (15%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 52 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 111
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 112 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 169
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 170 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 226
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 227 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 280
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+DD+ ++ RV I L +
Sbjct: 281 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 340
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 341 SGHSSDGAVAT----DALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 396
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 397 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 456
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 457 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 515
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SP + D
Sbjct: 516 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPAQGPLGTD 574
Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
G +Y S++ ++ + A + +A+ KN+ DA +
Sbjct: 575 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 630
Query: 572 ---VLKSLPRLRPTKIAEDGSIMEWAQDF-----KDPEV--------HHRHLSHLFGLFP 615
KSL L+P ++ + G I EW + KD HRH+SHL GLFP
Sbjct: 631 SWSCAKSL--LKPIEVGDSGQIKEWYFEGALGKKKDGSTISGYQADNQHRHMSHLLGLFP 688
Query: 616 GHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAYRMVKR 669
G ITI+ N + AA+ +L+ R +G GW+I + WAR D Y++V
Sbjct: 689 GDLITID-NSEYMDAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTYQLV-- 745
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST-----------L 718
E + +Y+NLF H PFQID NFG T+ V EML+QS +
Sbjct: 746 ---------ELQLKNAMYANLFDYHAPFQIDGNFGNTSGVDEMLLQSNSTFTDTAGKKYV 796
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
N +LPALP D W+ G V GL ARG TV WK+G EV + SN
Sbjct: 797 NYTNILPALP-DAWAGGSVSGLVARGNFTVGTTWKNGKATEVRLTSN------------- 842
Query: 779 YRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 843 -KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 873
>gi|291549437|emb|CBL25699.1| Uncharacterised Sugar-binding Domain [Ruminococcus torques L2-14]
Length = 1637
Score = 299 bits (766), Expect = 4e-78, Method: Compositional matrix adjust.
Identities = 238/862 (27%), Positives = 388/862 (45%), Gaps = 153/862 (17%)
Query: 5 ESTSTTNPLKITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDY 63
E+ N L++ ++ PA + T ++ IGNG +G++V+GG+ + + +NE T+W G P Y
Sbjct: 38 ETAKNDNLLRVWYDEPATDWQTQSLAIGNGYMGSLVFGGINKDKIHINEKTVWEGGPTSY 97
Query: 64 ------------TNPDAPKALSDVRS----LVDSGQYA--------EATAASVKLFGHPA 99
T+ D K D+ + L D +Y EA+ + K G
Sbjct: 98 NGYSYGTTNKTETDADLQKIKDDLNAIREKLDDKSEYVFGFNEDSYEASGTNTK--GEAM 155
Query: 100 D-VYQLLGDI----------ELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTR 148
D + +L+GD+ L ++ Y R+LD+ TA A V Y V +TR
Sbjct: 156 DWLNKLMGDLVGYSAPKDYANLYISNNQDSSKVSNYVRDLDMRTALATVNYDYEGVHYTR 215
Query: 149 EHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSY---VNGNNQIIMEGRCPGKRIP 205
E+F S PD V+ ++S + G ++F+ +L SL+ ++ V+G+ I M G +
Sbjct: 216 EYFDSYPDNVMAVRLSADQKGKINFDTNLQSLIGGRTHKSTVDGDT-ITMRDALGGNGLN 274
Query: 206 PKANANDDPKGIQFSAILEIKISDDRGTISA---LEDKKLKVEGSDWAVLLLVASSSFDG 262
+A ++K+ ++ G++S+ + + V +D L+ + +
Sbjct: 275 IEA---------------QLKVINEGGSLSSNTNGSNPSITVSDADAVTLIFACGTDYKM 319
Query: 263 PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
PS +DP + + + Y L H+ D+ LF R+ + +
Sbjct: 320 EL--PSFRGEDPHDAVTARINAAAKKGYEALKKDHVADHDALFSRMELGFN--------- 368
Query: 323 TCSEENIDTVPSAERVKSFQT------------DEDPSLVELLFQFGRYLLISSSRPGTQ 370
E + T+P+ E +K ++ E +L + +QFGRYL I+ SR G
Sbjct: 369 ----EEVPTIPTDELIKKYRNMVDNNGGEVPTESEQRALEVICYQFGRYLTIAGSREGAL 424
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
NLQG+W E W H NIN++MNYW +L NL+ECQ D+L L G A
Sbjct: 425 PTNLQGVWGEGYFQ-WGGDYHFNINVQMNYWPTLASNLAECQTAYNDYLNVLKEAGRYAA 483
Query: 431 QVNY-------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
+ +GW++ + + S+ + P+G AW + +E+Y YT D
Sbjct: 484 AAAFGIKSDEGEENGWLVGCFSTPYMFSALGQKNNAAGWNPIGSAWALLNAYEYYLYTED 543
Query: 484 RDFLEKRAYPLLEGCASFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
D+L+ YP L+ A+F + L E Y+ PS SPE+ + ++
Sbjct: 544 TDYLKNELYPSLKEVANFWNEALYWSEYQQRYVSA-PSYSPEN---------GPIVNGAS 593
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW-------- 593
D I + F I AAE L + D LVE+ + +L P + +DG + EW
Sbjct: 594 YDQQFIWQHFENTIQAAETLGVDAD-LVEQWKEKQSKLDPVLVGDDGQVKEWYEETHFGK 652
Query: 594 --AQDFKDPEVH----------------HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTL 635
A D + ++ HRHLSHL L+P + I+ + NP+ AA +L
Sbjct: 653 AQAGDLGEIDIPQWRQSLGAQSGGVQPPHRHLSHLMALYPCNMIS-KDNPEFMDAAIVSL 711
Query: 636 QKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAH- 694
+RG + GWS K LWAR + A+++V+ G +NL ++H
Sbjct: 712 NERGLDATGWSKAHKLNLWARTGHSDEAFQIVQSAVG--------GGNSGFLTNLLSSHG 763
Query: 695 --------PPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGE 746
P FQID NFG+TA V EML+QS L + LPA+P ++W++G V+G+ ARG
Sbjct: 764 GGANYKGYPIFQIDGNFGYTAGVNEMLLQSQLGYVQFLPAIP-EQWNTGHVEGIVARGNF 822
Query: 747 TVSICWKDGDLHEVGIYSNYSN 768
+++ W +G I S N
Sbjct: 823 EINMNWSEGKADRFEIKSRNGN 844
>gi|311063634|ref|YP_003970359.1| 1,2-A-L-fucosidase [Bifidobacterium bifidum PRL2010]
gi|310865953|gb|ADP35322.1| 1,2-A-L-Fucosidase [Bifidobacterium bifidum PRL2010]
Length = 1959
Score = 299 bits (765), Expect = 5e-78, Method: Compositional matrix adjust.
Identities = 245/877 (27%), Positives = 397/877 (45%), Gaps = 149/877 (16%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTRYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGADGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGKGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSANNWAKGDNGNFTD 1200
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW-------------AQDFKDPEVHHRHLSHL 610
KSL L+P ++ + G I EW A + HRH+SHL
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGDSGQIKEWYFEGALGKKKDGSAISGYQADNQHRHMSHL 1258
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAY 664
GLFPG ITI+ N + +AA+ +L+ R +G GW+I + WAR D Y
Sbjct: 1259 LGLFPGDLITID-NSEYMEAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTY 1317
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST------- 717
++V E + +Y+NLF H PFQID NFG T+ V EML+QS
Sbjct: 1318 QLV-----------ELQLKNAMYANLFDYHAPFQIDGNFGNTSGVDEMLLQSNSTFTDTA 1366
Query: 718 ----LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+N +LPALP D W+ G V GL ARG TV WK+G EV + SN
Sbjct: 1367 GKKYVNYTNILPALP-DAWAGGSVSGLVARGNFTVGTTWKNGKATEVKLTSN-------- 1417
Query: 774 FKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 1418 ------KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 1448
>gi|340514861|gb|EGR45120.1| glycoside hydrolase family 95 [Trichoderma reesei QM6a]
Length = 795
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 230/803 (28%), Positives = 378/803 (47%), Gaps = 90/803 (11%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGV-----PGDYTNPDA 68
++ + P+ F ++P+GNGR A V E L LNE + W+G G P+
Sbjct: 6 RLFYTTPSTAFPTSLPLGNGRFAASVLSSPSKEVLILNEVSFWSGKEQPAGAGLSHKPER 65
Query: 69 PK-ALSDVRSLVDSGQYAEATAASVKL-------FGHPADVYQLLGDIELEFDDSH-LKY 119
K L + + SG YA+ + + FG V G +E+ + +
Sbjct: 66 AKDELRETQRCYLSGDYAQGKKRAERFLESRKTNFGTNLGV----GRLEIAVNGQETIDG 121
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
+ REL L+ A +Y++ +F R F S+P QV+V ++ G + L V +
Sbjct: 122 VVSGFERELRLDEAVTETRYTLSGRQFKRRCFLSHPHQVLVVQLEGDDLQGLEIEVDVQG 181
Query: 180 LLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED 239
+N ++ + N +G+ + +D G++ ++ + D G + +
Sbjct: 182 --ENEAFTSNVN---ADGKLEFNVQALETVHSDGTCGVKGYGLIAATV--DEGKVQR-RN 233
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
KL + +L+ +F+ + P D+ + T M A LS SDL+ HL
Sbjct: 234 GKLVISAKKSITILV----TFNTDYAEPGDAWRRRTVAQMDA---ALELSASDLFQAHLQ 286
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFG 357
D+Q L+ RVSI L +++CS + P+ +R +SF+ D + L F +
Sbjct: 287 DFQPLYRRVSISLG-------SESCS---TASAPTDQRRQSFEASGYADAGMFALYFHYA 336
Query: 358 RYLLISSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
RYL I+ +R + + +LQG+WN E W H++IN +MNY+ + LS+ +P
Sbjct: 337 RYLTIAGTRHDSPLPLHLQGLWNDGEACKMGWSCDYHLDINTQMNYFAIMNSGLSDLMQP 396
Query: 415 LFDFLTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTH 473
L ++L L +G TA+V Y GWV H +++W + D G +V + L GG WL +H
Sbjct: 397 LINYLVRLGESGQDTARVCYGCPGWVAHVFSNVWGFT--DPGWEVSYGLNVTGGLWLASH 454
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF--IAPD 530
L E + Y++D F A+ +L G + F LD++IE G+L T PS SPE+ F + D
Sbjct: 455 LIEMFEYSLDDSFTRNEAWSVLLGASKFFLDYMIEDPKTGWLLTGPSVSPENSFFVVKED 514
Query: 531 GKLA--CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL---KSLPRLRPTKIA 585
G+ + + T+D+ ++R++F+ A L+ E E V ++L +L P +I
Sbjct: 515 GEKEEHYAALAPTLDIVLVRDLFAFCEYALTKLDCQESNYKEDVRMYREALAKLPPFQIG 574
Query: 586 EDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGW 645
++G + EW DF++ + +HRHLSH L I+ PDL +A TL++R
Sbjct: 575 KNGQLQEWLHDFEEAQPYHRHLSHTMALCRSAQISARHQPDLAEAVRVTLERRQGRDDLE 634
Query: 646 SITWKTAL----WARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP----- 696
I + AL +ARL D E A + L + + NL + P
Sbjct: 635 DIEFTAALFAQNYARLGDAEKAVAQIGHLVGELS-----------FDNLLSYSKPGVAGA 683
Query: 697 ----FQIDANFGFTAAVAEMLVQSTLNDLY------LLPALPWDKWSSGCVKGLKARGGE 746
F ID N G AA+AEML++S + L LLPALP W+ G VKG++ RGG
Sbjct: 684 EKDIFVIDGNLGGAAAIAEMLIRSIIPRLGGPVEVDLLPALP-AAWAEGNVKGMRIRGGL 742
Query: 747 TVSICWKDGDLHEVGIYSNYSNN 769
W+ G L V + ++ +++
Sbjct: 743 EADFSWQGGKLDGVTLRASAASS 765
>gi|146386783|pdb|2EAE|A Chain A, Crystal Structure Of 1,2-A-L-Fucosidase From
Bifidobacterium Bifidum In Complexes With Products
Length = 898
Score = 298 bits (764), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 247/872 (28%), Positives = 396/872 (45%), Gaps = 139/872 (15%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 51 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 110
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 111 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 168
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 169 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 225
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ N G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 226 DTLTVKGALGNN------GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 279
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+DD+ ++ RV I L +
Sbjct: 280 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIDDHSAIYDRVKIDLGQ 339
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 340 SGHSSDGAVAT----DALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 395
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 396 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 455
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 456 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 514
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 515 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 573
Query: 531 GKLACVSYSSTMDMAIIREVFSAIIS--------------AAEVLEKNE-----DALVEK 571
G +Y S++ ++ + A + +A+ KN+ DA +
Sbjct: 574 GN----TYESSLVWQMLNDAIEAAKAKGDPDGLVGNTTDCSADNWAKNDSGNFTDANANR 629
Query: 572 ---VLKSLPRLRPTKIAEDGSIMEWAQDF-----KDPEV--------HHRHLSHLFGLFP 615
KSL L+P ++ + G I EW + KD HRH+SHL GLFP
Sbjct: 630 SWSCAKSL--LKPIEVGDSGQIKEWYFEGALGKKKDGSTISGYQADNQHRHMSHLLGLFP 687
Query: 616 GHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAYRMVKR 669
G ITI+ N + AA+ +L+ R +G GW+I + WAR D Y++V
Sbjct: 688 GDLITID-NSEYMDAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTYQLV-- 744
Query: 670 LFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST-----------L 718
E + +Y+NLF H PFQI NFG T+ V EML+QS +
Sbjct: 745 ---------ELQLKNAMYANLFDYHAPFQIAGNFGNTSGVDEMLLQSNSTFTDTAGKKYV 795
Query: 719 NDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLH 778
N +LPALP D W+ G V GL ARG TV WK+G EV + SN
Sbjct: 796 NYTNILPALP-DAWAGGSVSGLVARGNFTVGTTWKNGKATEVRLTSN------------- 841
Query: 779 YRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 842 -KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 872
>gi|358400122|gb|EHK49453.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
206040]
Length = 788
Score = 298 bits (762), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 239/819 (29%), Positives = 371/819 (45%), Gaps = 108/819 (13%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KAL 72
PA A P+GNG+LGAM G V + + LNE +LW+G P DY NP P AL
Sbjct: 29 PANIIMTAYPLGNGKLGAMPLGLVGEDIVVLNEHSLWSGGPFQNPDYIGGNPPGPVYTAL 88
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVY----QLLGDIELEFDDSHLKYAEETYRREL 128
+R + Q + L+G PAD Y + LG++ ++ +Y +Y R L
Sbjct: 89 PGIRDTIWQTQINNDIS---PLYGDPADYYYGNYETLGNLTVKIAGLS-QYT--SYNRAL 142
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-------------GSLSFNV 175
DL T + + FT F + PDQV V + +++ S + N+
Sbjct: 143 DLETGIHQTVFRSNGASFTTTTFCTFPDQVCVHNVQSTKALPAITIGLQDNARSSPASNL 202
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS 235
S D+ N ++ G Q + G + PKG +A EI I D T S
Sbjct: 203 SCDA---NGVHLRGQTQQDI-----GMIFDARVQVLSRPKGAACTASHEIVIPADSKTKS 254
Query: 236 ALEDKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYT 295
+ G+D+ +S++ S DP +S +++ SY+ LY
Sbjct: 255 V---TVIYAAGTDYDQKKGTKASNY-------SFKGVDPAPAVLSTIKAAAKESYNSLYN 304
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE-LLF 354
H+ D+ LF + ++ L S +N ++P+A+ ++ + D + +E LLF
Sbjct: 305 SHVKDHNALFSQFTLNLPDS-----------DNSASIPTAKLMEDYDDDIGNTFIENLLF 353
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
+GRYL I S RPG+ NLQGIW E L+P W + HV++N++MN+W + L + Q P
Sbjct: 354 DYGRYLFIGSCRPGSLPPNLQGIWTESLTPAWSADYHVDVNVQMNHWHTEQTGLGDIQGP 413
Query: 415 LFDFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTH 473
L+DF+T + G++TA + Y A G+V + + + VW+ +P AWL +
Sbjct: 414 LWDFITDTWVPRGTETAALLYDAPGFVGFSNLNTFG-FTGQMNAAVWSDYPASAAWLMQN 472
Query: 474 LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPD 530
+W+ Y+Y D + YPL++ A + + ++ +DG L P SPEH +
Sbjct: 473 VWDRYDYGRDTTWYRATGYPLMKAVAEYWIHEMVPDLYSNDGTLVAAPCNSPEHGWTT-- 530
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGS 589
C Y ++ E+F II + + +E V ++ +L P I G
Sbjct: 531 --FGCTHYQQ-----LVWELFDHIIQSWDATGDKNTTFLETVKETQAKLSPGIIIGWFGQ 583
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEK-NPDLCKAAEKTLQKRG----EEGPG 644
I EW + P HRHLS L G +PG++I N + A TL RG + G
Sbjct: 584 IQEWKIGWDQPNDEHRHLSQLVGWYPGYSIGANMWNKTVTDAVNITLTARGNGTADSNTG 643
Query: 645 WSITWKTALWARLHDQEHAYRMVKRL--FNLVDPEHEKHFEGGLYSNLFAAHPPFQIDAN 702
W W+ A WA+L++ + AY +K N D + G L A PFQIDAN
Sbjct: 644 WEKVWRVACWAQLNNTDIAYTYLKYAIGMNYADNGFSVYTAGSWPYELAA---PFQIDAN 700
Query: 703 FGFTAAVAEMLV--------QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
FG+TAAV ML+ ++ + L PA+P +W++G V G++ RGG +V W
Sbjct: 701 FGYTAAVLAMLITDLPVPSASKAVHTVILGPAIP-SEWANGSVTGMRIRGGGSVDFSWDK 759
Query: 755 GDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKI 793
L + TLH S+K+ GK+
Sbjct: 760 NGLA--------------THATLHNHKASIKIVDVNGKV 784
>gi|313139434|ref|ZP_07801627.1| alpha-fucosidase [Bifidobacterium bifidum NCIMB 41171]
gi|313131944|gb|EFR49561.1| alpha-fucosidase [Bifidobacterium bifidum NCIMB 41171]
Length = 1959
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 245/877 (27%), Positives = 396/877 (45%), Gaps = 149/877 (16%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTAGDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L+ R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLD-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSTDNWAKGDNGNFAD 1200
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW-------------AQDFKDPEVHHRHLSHL 610
KSL L+P ++ G I EW A + HRH+SHL
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGNSGQIKEWYFEGALGKKKDGSAISGYQADNQHRHMSHL 1258
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAY 664
GLFPG ITI+ N + +AA+ +L+ R +G GW+I + WAR D Y
Sbjct: 1259 LGLFPGDLITID-NSEYMEAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTY 1317
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST------- 717
++V E + +Y+NLF H PFQID NFG T+ V EML+QS
Sbjct: 1318 QLV-----------ELQLKNAMYANLFDYHAPFQIDGNFGNTSGVDEMLLQSNSTFTDTD 1366
Query: 718 ----LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+N +LPALP D W+ G V GL ARG TV WK+G EV + SN
Sbjct: 1367 GKKYVNYTNILPALP-DAWADGSVSGLVARGNFTVGTTWKNGKATEVKLTSN-------- 1417
Query: 774 FKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 1418 ------KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 1448
>gi|390936092|ref|YP_006393651.1| alpha-L-fucosidase [Bifidobacterium bifidum BGN4]
gi|389889705|gb|AFL03772.1| alpha-L-fucosidase [Bifidobacterium bifidum BGN4]
Length = 1959
Score = 297 bits (760), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 245/877 (27%), Positives = 395/877 (45%), Gaps = 149/877 (16%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 627 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 686
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 687 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 744
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 745 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 801
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 802 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGSDGASLKVSDAKAVTLYIAA 855
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 856 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 915
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 916 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 971
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 972 LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1031
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1032 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1090
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1091 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1149
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1150 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSADNWAKGDNGNFTD 1200
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW-------------AQDFKDPEVHHRHLSHL 610
KSL L+P ++ + G I EW A + HRH+SHL
Sbjct: 1201 ANANRSWSCAKSL--LKPIEVGDSGQIKEWYFEGALGKKKDGSAISGYQADNQHRHMSHL 1258
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAY 664
GLFPG ITI+ N + AA+ +L+ R +G GW+I + WAR D Y
Sbjct: 1259 LGLFPGDLITID-NSEYMDAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTY 1317
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST------- 717
++V E + +Y+NLF H PFQID NFG T+ V EML+QS
Sbjct: 1318 KLV-----------ELQLKNAMYANLFDYHAPFQIDGNFGNTSGVDEMLLQSNSTFTDTD 1366
Query: 718 ----LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+N +LPALP D W+ G V GL ARG TV WK+G EV + SN
Sbjct: 1367 GKKYVNYTNILPALP-DAWADGSVSGLVARGNFTVGTTWKNGKATEVRLTSN-------- 1417
Query: 774 FKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 1418 ------KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 1448
>gi|189466378|ref|ZP_03015163.1| hypothetical protein BACINT_02753 [Bacteroides intestinalis DSM
17393]
gi|189434642|gb|EDV03627.1| hypothetical protein BACINT_02753 [Bacteroides intestinalis DSM
17393]
Length = 792
Score = 296 bits (759), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 232/816 (28%), Positives = 370/816 (45%), Gaps = 138/816 (16%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
++PIGNG +G ++G E ++L E T+ G G Y
Sbjct: 59 SLPIGNGAMGVCIFGRTDVERIQLAEKTM--GNKGAY----------------------- 93
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
+ F + A++Y D H YA++ Y+R L LN A + V Y +E+
Sbjct: 94 ----GMGGFTNFAEIYL----------DIHHNYAQD-YKRALRLNDAISTVNYKHEEIEY 138
Query: 147 TREHFSSNPDQVIVTKISGSESGSLSFNVS-----LDSLLDNHSYVNGN-----NQIIME 196
RE+F+S P +I K+ S+ G +SF + L S D + +G + I ++
Sbjct: 139 DREYFASYPANIIAVKLKASQPGKVSFTLRPVLPYLHSFNDEQTGRSGQAHAEKDLITLK 198
Query: 197 GRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTIS----ALEDKKLKVEGSDWAVL 252
G +P + +IK+ + GT+S + + + +D +L
Sbjct: 199 GEIQYFHLPYEG---------------QIKVVNYGGTLSCSNKGENNSTIDISKADSVIL 243
Query: 253 LLVASSSF---DGPFINPSDSK----KDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
+ A++S+ D F+ P+ K P + + Y L H+ DYQ+LF
Sbjct: 244 YISAATSYQLKDSVFLLPNAEKFKGNTHPHKQVSECIGRAVEKGYEVLRKEHIADYQQLF 303
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISS 364
+RV+ QL+ E+I ++P+ + + ++ + D L EL FQ+GRYLLI+S
Sbjct: 304 NRVNFQLT-------------EDIPSIPTDKLLYQYRNGKRDAYLEELFFQYGRYLLIAS 350
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF------ 418
SR G+ NLQG WN+ W N+N++MNYW NL+E P D+
Sbjct: 351 SRQGSLPPNLQGAWNQYEFAPWSGGYWHNVNVQMNYWPVFNTNLTELFIPYADYNEAFRK 410
Query: 419 ------LTYLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
+ Y++ N + +GW I +A G +
Sbjct: 411 AATQKAVDYITQNNPEALNPIAEENGWTIGTGATAFAIEGPGGHSGP-----GTGGFTTK 465
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE--HEFIAPD 530
W++Y++T D+ L+ YP L G A FL L DG L +PS SPE H+ +
Sbjct: 466 LFWDYYDFTRDKQLLKDHVYPALMGMAKFLSKTLKPQPDGTLLVDPSFSPEQVHQQVYYR 525
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
K C+ D ++I E + ++ AAE+L K++D ++ V + + +L I E G I
Sbjct: 526 SK-GCI-----FDQSMILETYRDLLHAAEIL-KDKDPFLKTVKEQIGKLDAILIGESGQI 578
Query: 591 MEWAQDFKDPEV---HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
E+ ++ K E+ HRH+S L ++PG TI P+ +AA+ TL++RG++ GW++
Sbjct: 579 KEFREENKYGEIGQYQHRHISQLCAMYPG-TIINADTPEWLEAAKVTLKERGDKSTGWAM 637
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
+ LWAR + AY++ + + G NL+ +HPPFQIDANFG TA
Sbjct: 638 AHRQNLWARAKNGNRAYKLYQDILTY-----------GTLENLWGSHPPFQIDANFGATA 686
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYS 767
+AEML+QS + LPA+P D W G GL ARG VS W++G + + I SN
Sbjct: 687 GIAEMLLQSHEGYIEPLPAIP-DNWDKGSFSGLMARGNFQVSATWENGAIQSIRILSNKG 745
Query: 768 N------NDHDSFKTLHYRGTSVKVNLSAGKIYTFN 797
S + +K+ LS I+ FN
Sbjct: 746 ELCRIKYCKAASAQVTDKYNKPIKIKLSGNDIFEFN 781
>gi|421734699|ref|ZP_16173762.1| alpha-L-fucosidase [Bifidobacterium bifidum LMG 13195]
gi|407077388|gb|EKE50231.1| alpha-L-fucosidase [Bifidobacterium bifidum LMG 13195]
Length = 1954
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 245/877 (27%), Positives = 394/877 (44%), Gaps = 149/877 (16%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 622 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 681
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 682 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 739
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 740 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 796
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 797 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGADGASLKVSDAKAVTLYIAA 850
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 851 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 910
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 911 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 966
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 967 LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1026
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1027 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1085
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1086 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1144
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1145 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGDTTDCSADNWAKGDNGNFTD 1195
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW-------------AQDFKDPEVHHRHLSHL 610
KSL L+P ++ G I EW A + HRH+SHL
Sbjct: 1196 ANANRSWSCAKSL--LKPIEVGNSGQIKEWYFEGALGKKKDGSAISGYQADNQHRHMSHL 1253
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAY 664
GLFPG ITI+ N + AA+ +L+ R +G GW+I + WAR D Y
Sbjct: 1254 LGLFPGDLITID-NSEYMDAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTY 1312
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST------- 717
++V E + +Y+NLF H PFQID NFG T+ V EML+QS
Sbjct: 1313 KLV-----------ELQLKNAMYANLFDYHAPFQIDGNFGNTSGVDEMLLQSNSTFTDTD 1361
Query: 718 ----LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+N +LPALP D W+ G V GL ARG TV WK+G EV + SN
Sbjct: 1362 GKKYVNYTNILPALP-DAWADGSVSGLVARGNFTVGTTWKNGKATEVKLTSN-------- 1412
Query: 774 FKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 1413 ------KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 1443
>gi|298351514|sp|A2R797.1|AFCA_ASPNC RecName: Full=Probable alpha-fucosidase A; AltName:
Full=Alpha-L-fucoside fucohydrolase A; Flags: Precursor
gi|134083134|emb|CAK48586.1| unnamed protein product [Aspergillus niger]
Length = 793
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 235/775 (30%), Positives = 360/775 (46%), Gaps = 89/775 (11%)
Query: 21 AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPKA--LS 73
+ T A P+GNGRLGAM G E + LN D+LW G P + Y+ NP+ KA L
Sbjct: 32 SSFITTAFPLGNGRLGAMPIGSYDKEIVNLNVDSLWRGGPFESPTYSGGNPNVSKAGALP 91
Query: 74 DVRSLVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDD-SHLKYAEETYRRELDL 130
+R + + T L G +P YQ+L ++ ++ + S + + YRR LDL
Sbjct: 92 GIREWI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGELSDI----DGYRRNLDL 143
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV--SLDSLLDNHSYV 187
++A +S G RE F S PD V V ++S + S ++F + L S N S
Sbjct: 144 DSAVYSDHFSTGETYIEREAFCSYPDNVCVYRLSSNSSLPEITFGLENQLTSPAPNVS-C 202
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV-EG 246
+GN+ + G+ P G+ ++A + + + T +KV EG
Sbjct: 203 HGNSISLY-----GQTYPVI--------GMIYNARVTVVVPGSSNTTDLCSSSTVKVPEG 249
Query: 247 SDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
L+ A ++++ N S ++P + + + SYS L + H+ DYQ
Sbjct: 250 EKEVFLVFAADTNYEASNGNSKASFSFKGENPYMKVLQTATNAAKKSYSALKSSHVKDYQ 309
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+F++ ++ L P+ E + S+ DP + LLF +GRYL I
Sbjct: 310 GVFNKFTLTLP-----------DPNGSADRPTTELLSSYSQPGDPYVENLLFDYGRYLFI 358
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSSRPG+ NLQG+W E SP W H NINL+MN+W L E EPL+ ++
Sbjct: 359 SSSRPGSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVDQTGLGELTEPLWTYMAET 418
Query: 423 SI-NGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ G++TA++ Y S GWV H + + + +A + WA +P AW+ H+W+H++Y
Sbjct: 419 WMPRGAETAELLYGTSEGWVTHDEMNTFGH-TAMKDVAQWADYPATNAWMSHHVWDHFDY 477
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVS 537
+ D + + YP+L+G A F L L++ DG L NP SPEH C
Sbjct: 478 SQDSAWYRETGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPEHGPTLTPQTFGCTH 537
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQD 596
Y +I E+F ++ ++ + + L P I G I EW D
Sbjct: 538 Y-----QQLIWELFDHVLQGWTASGDDDTSFKNAITSKFSTLDPGIHIGSWGQIQEWKLD 592
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITI--EKNPDLCKAAEKTLQKRG----EEGPGWSITWK 650
HRHLS+L+G +PG+ I+ N + A E TL RG + GW+ W+
Sbjct: 593 IDVKNDTHRHLSNLYGWYPGYIISSVHGSNKTITDAVETTLYSRGTGVEDSNTGWAKVWR 652
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
+A WA L+ + AY + + D E F+ +++ PPFQIDANFG A+
Sbjct: 653 SACWALLNVTDEAYSELS--LAIQDNFAENGFD------MYSGSPPFQIDANFGLVGAMV 704
Query: 711 EMLVQST-----------LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
+ML++ + D+ L PA+P W G V GL+ RGG VS W D
Sbjct: 705 QMLIRDSDRSSADASAGKTQDVLLGPAIP-AAWGGGSVGGLRLRGGGVVSFSWND 758
>gi|83764453|dbj|BAE54597.1| unnamed protein product [Aspergillus oryzae RIB40]
Length = 513
Score = 296 bits (758), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 181/471 (38%), Positives = 248/471 (52%), Gaps = 32/471 (6%)
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
D++ L RV + L+ S + N+ T ER K+ D DP LV L+FQFGRY
Sbjct: 15 DHEALAGRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRY 65
Query: 360 LLISSSRP-GTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
LI+SSR GT NLQG+WNED P W VNINLEMNYW + NL+E PL
Sbjct: 66 SLIASSRKTGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLI 125
Query: 417 DFLTYLSINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
L + G A+ Y G+V+HH TDIW + W +WPMGGAWL +L
Sbjct: 126 FLLETVKPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANL 185
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---- 530
E+Y +T D + L++R +PLL A F ++ +GYL T PS+SPE+ F+ P+
Sbjct: 186 MEYYRFTQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSE 244
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
G + + TMD ++ E+F +II +VL N + K SLP ++ +I G
Sbjct: 245 SGNEEGIDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQ 303
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWS 646
I+EW ++++ E HRH+S +FGL+PG +T N L AA L R G GWS
Sbjct: 304 ILEWRHEYQETEPGHRHMSPIFGLYPGSQMTPLVNSTLAAAATVLLDHRIAHGSGSTGWS 363
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFT 706
W +L++RL D + A+ + + + L++ FQID NFGFT
Sbjct: 364 RAWTISLYSRLFDGDAAWNHTQVFL-------KTYPSANLWNTDSGPGSAFQIDGNFGFT 416
Query: 707 AAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
A +AEML+QS ++LLPALP G V GL ARG V + W DG L
Sbjct: 417 AGIAEMLLQSHAGVVHLLPALP-SAVPHGKVSGLVARGNFVVDMEWSDGKL 466
>gi|357061269|ref|ZP_09122028.1| hypothetical protein HMPREF9332_01585 [Alloprevotella rava F0323]
gi|355374778|gb|EHG22070.1| hypothetical protein HMPREF9332_01585 [Alloprevotella rava F0323]
Length = 1118
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 233/773 (30%), Positives = 356/773 (46%), Gaps = 121/773 (15%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
++PIGNG+LGA ++ GV + ++ NE TLWTG D N + A + SL +AE
Sbjct: 303 SLPIGNGQLGASLFNGVYKDEVQFNEKTLWTGSSTD--NGSSYGAYQNFGSL-----FAE 355
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV--GNV 144
L GD + D + Y R LDL++ ++ G+
Sbjct: 356 ----------------DLSGDFDFGSDKK-----VKNYYRALDLSSGLGSTHFTNADGSK 394
Query: 145 EFTREHFSSNPDQVIVTKISGSESGSLSFNVSLD-SLLDNHSYVNGNNQIIMEGRCPGKR 203
+ R + +S PD+VI + + + GS+S +L + SY +G EG GK
Sbjct: 395 TYDRTYLASFPDRVIAVRYACDKPGSISLRFTLKPGVKATPSYADG------EGMFSGKL 448
Query: 204 IPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG- 262
NA +K+ GT++ + ++V +D + L A + FD
Sbjct: 449 TTVTFNA-------------RMKVVPVGGTMTT-DANGVEVRNADEVCVYLAAGTDFDAY 494
Query: 263 --PFIN-----PSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRS 315
+I+ PS K+ + + + +I T H+ DY+ F RV L
Sbjct: 495 KTTYISNTAALPSTMKERVDAAAQKGMAAI--------LTDHVADYRNYFDRVDFSL--- 543
Query: 316 PKDIVTDTCSEENIDTVPSAERVKSFQTD----EDPSLV--ELLFQFGRYLLISSSRPGT 369
E + + +P+ + + ++ D + SL+ +L F +GRYL I+SSR
Sbjct: 544 ----------EGSENAIPTNKLIDAYSADATGLKGSSLMLEQLYFAYGRYLEIASSRGVD 593
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS-- 427
+NLQGIWN +P W S H NIN++MNYW + P NLSE P +++T +++N S
Sbjct: 594 LPSNLQGIWNNSNTPPWASDIHSNINVQMNYWPAEPTNLSEMHLPFLNYITNMAMNHSQW 653
Query: 428 -KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDF 486
K A+ GW + + +I+ V + AW THLW+HY YT+DRDF
Sbjct: 654 QKYAKDAGQTKGWTCYTENNIFGGVGGFMHNYV-----IANAWYATHLWQHYRYTLDRDF 708
Query: 487 LEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEH----EFIAPDGKLACVSYSSTM 542
L A+P + + F ++ L DG E SPEH +A +L +T
Sbjct: 709 LLS-AFPTMWSASQFWIERLRLAADGTYECPSEYSPEHGPTENAVAHAQQLVVELLQNTK 767
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAED-GS-----------I 590
D A I + + A + + ++ L +++ K+ L K GS +
Sbjct: 768 DAADI------LGNDANISDADKTKLEDRLAKADKGLAIEKYTGKWGSPHHGVRTGQDLL 821
Query: 591 MEWA-QDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITW 649
EW + E HRH SHL L+P + +T KAA +L+ R +E GWS+ W
Sbjct: 822 REWKYSSYTRGEDGHRHQSHLMCLYPFNQVT--PGSPYFKAAVNSLKLRSDESTGWSMGW 879
Query: 650 KTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
+ LWAR D +HA ++ R + GG+Y NL+ AH PFQID NFG A +
Sbjct: 880 RINLWARAQDGDHARVILHRALRHATSFGTNQYAGGIYYNLYDAHAPFQIDGNFGACAGI 939
Query: 710 AEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGI 762
AEML+QS + + +LPALP W +G +KGLKA G TV I WK G + +
Sbjct: 940 AEMLMQSATDTIVVLPALP-SVWKAGHIKGLKAIGNYTVDIAWKAGKATRITV 991
>gi|340514441|gb|EGR44703.1| glycoside hydrolase family 95 [Trichoderma reesei QM6a]
Length = 755
Score = 296 bits (757), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 225/767 (29%), Positives = 352/767 (45%), Gaps = 76/767 (9%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KALSDVRSLV 79
A P+GNG+LGAM G V + + LNE +LW G P DY NP AP AL +R +
Sbjct: 3 AYPLGNGKLGAMPLGVVGEDIVVLNEHSLWAGGPFQSPDYIGGNPPAPVYTALPGIRETI 62
Query: 80 DSGQYAEATAASVKLFGHPADVY----QLLGDIELEFDDSHLKYAEETYRRELDLNTATA 135
Q +A L+G PA Y + LG++ + KY +Y R LDL T
Sbjct: 63 WKTQINNDISA---LYGDPAYYYYGNYETLGNLTVNIAGVS-KYT--SYNRALDLETGIH 116
Query: 136 RVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIM 195
++ +FT F + PDQV I S+ DSL N +
Sbjct: 117 TTEFKANGAKFTITTFCTFPDQVCAYNIQSSKPLPAVTIGLRDSLRSNPA---------S 167
Query: 196 EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAV-LLL 254
C + + D G+ F A ++ R T ++ + +G ++ ++
Sbjct: 168 NLTCDANGVHLRGQTQQD-IGMIFDARAQLINRPKRATCTSSHGLSVPSDGRTTSLTVVY 226
Query: 255 VASSSFD----GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
A +++D N S DP +S ++ + S++ +Y H+ D+ LF + S+
Sbjct: 227 AAGTNYDQKKGTKASNYSFKGVDPAPAVLSTIKKVSQKSFNSMYNAHIKDHNGLFSQFSL 286
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISSSRPGT 369
L K +VP+A ++++ D DP + LLF +GRYL I S R G+
Sbjct: 287 DLPDPEKSA-----------SVPTATLMENYDYDLGDPFVENLLFDYGRYLFIGSCRDGS 335
Query: 370 QVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-NGSK 428
NLQGIW E L+P W + HV++N++MN+W + L E Q PL+DF+ + G++
Sbjct: 336 LPPNLQGIWTESLTPAWSADYHVDVNVQMNHWHTEQTGLGEIQGPLWDFIIDTWVPRGTE 395
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
TA + Y A G+V + + + VW+ +P AWL ++W Y+Y+ D + +
Sbjct: 396 TAALLYDAPGFVGFSNLNTFG-FTGQMNAAVWSNYPASAAWLMQNVWNRYDYSRDTHWWK 454
Query: 489 KRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMA 545
YPL++ A + + ++ +DG L P SPEH + C Y
Sbjct: 455 TVGYPLMKSIAEYWIHEMVPDLYSNDGTLVAAPCNSPEHGWTT----FGCTHYQQ----- 505
Query: 546 IIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQDFKDPEVHH 604
++ EVF +I E +E V ++ +L P I G I EW + P H
Sbjct: 506 LVWEVFDHVIEGWEASGDKNTTFLETVKETQSKLSPGIIIGWFGQIQEWKIGWDQPNDEH 565
Query: 605 RHLSHLFGLFPGHTITIEK-NPDLCKAAEKTLQKRG----EEGPGWSITWKTALWARLHD 659
RHLSHL G +PG++I N + A +L RG + GW W+ A WA+L++
Sbjct: 566 RHLSHLVGWYPGYSIGTHMWNKTVTDAVNVSLTARGNGTADSNTGWEKVWRVACWAQLNN 625
Query: 660 QEHAYRMVKRLFNLVDPEHE-KHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLV---- 714
+ AY +K ++ + + G + AA PFQIDANFG++AAV ML+
Sbjct: 626 TDIAYTYLKYAIDMNYANNGFSVYTTGSWPYELAA--PFQIDANFGYSAAVLAMLITDLP 683
Query: 715 ----QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
++ + L PA+P +W G V+G++ RGG +V W D L
Sbjct: 684 VPSASKAIHTVILGPAIP-PEWKGGSVRGMRIRGGGSVDFSWDDNGL 729
>gi|350633541|gb|EHA21906.1| hypothetical protein ASPNIDRAFT_184037 [Aspergillus niger ATCC
1015]
Length = 758
Score = 295 bits (756), Expect = 5e-77, Method: Compositional matrix adjust.
Identities = 236/775 (30%), Positives = 362/775 (46%), Gaps = 93/775 (12%)
Query: 21 AKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPKA--LS 73
+ T A P+GNGRLGAM G E + LN D+LW G P + Y+ NP+ KA L
Sbjct: 32 SSFITTAFPLGNGRLGAMPIGSYDKEIVNLNVDSLWRGGPFESPTYSGGNPNVSKAGALP 91
Query: 74 DVRSLVDSGQYAEATAASVKLFG-HPA-DVYQLLGDIELEFDD-SHLKYAEETYRRELDL 130
+R + + T L G +P YQ+L ++ ++ + S + + YRR LDL
Sbjct: 92 GIREWI----FQNGTGNVSALLGEYPYYGSYQVLANLTIDMGELSDI----DGYRRNLDL 143
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNV--SLDSLLDNHSYV 187
++A +S G RE F S PD V V ++S + S ++F + L S N S
Sbjct: 144 DSAVYSDHFSTGETYIEREAFCSYPDNVCVYRLSSNSSLPEITFGLENQLTSPAPNVS-C 202
Query: 188 NGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV-EG 246
+GN+ + G+ P G+ ++A + + + T +KV EG
Sbjct: 203 HGNSISLY-----GQTYPVI--------GMIYNARVTVVVPGSSNTTDLCSSSTVKVPEG 249
Query: 247 SDWAVLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
L+ A ++++ N S ++P + + + SYS L + H+ DYQ
Sbjct: 250 EKEVFLVFAADTNYEASNGNSKASFSFKGENPYMKVLQTATNAAKKSYSALKSSHVKDYQ 309
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLI 362
+F++ ++ L P+ E + S+ DP++ LLF +GRYL I
Sbjct: 310 GVFNKFTLTLP-----------DPNGSADRPTTELLSSYSQPGDPNVENLLFDYGRYLFI 358
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
SSSRPG+ NLQG+W E SP W H NINL+MN+W L E EPL+ ++
Sbjct: 359 SSSRPGSLPPNLQGLWTESYSPAWSGDYHANINLQMNHWAVDQTGLGELTEPLWTYMAET 418
Query: 423 SI-NGSKTAQVNYLAS-GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+ G++TA++ Y S GWV H + + + +A + WA +P AW+ H+W+H++Y
Sbjct: 419 WMPRGAETAELLYGTSKGWVTHDEMNTFGH-TAMKDVAQWADYPATNAWMSHHVWDHFDY 477
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVS 537
+ D + + YP+L+G A F L L++ DG L NP SPEH P C
Sbjct: 478 SQDSAWYRETGYPILKGAAQFWLSQLVKDEYFKDGTLVVNPCNSPEH---GPT-TFGCTH 533
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQD 596
Y +I E+F ++ ++ + + L P I G I EW D
Sbjct: 534 YQQ-----LIWELFDHVLQGWTASGDDDTSFKNAITSKFSTLDPGIHIGSWGQIQEWKLD 588
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITI--EKNPDLCKAAEKTLQKRG----EEGPGWSITWK 650
HRHLS+L+G +PG+ I+ N + A E TL RG + GW+ W+
Sbjct: 589 IDVKNDTHRHLSNLYGWYPGYIISSVHGSNKTITDAVETTLYSRGTGVEDSNTGWAKVWR 648
Query: 651 TALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVA 710
+A WA L+ + AY + + D E F+ +++ PPFQIDANFG A+
Sbjct: 649 SACWALLNVTDEAYSELS--LAIQDNFAENGFD------MYSGSPPFQIDANFGLVGAMV 700
Query: 711 EMLVQST-----------LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
+ML++ + D+ L PA+P W G V GL+ RGG VS W D
Sbjct: 701 QMLIRDSDRSSADASAGKTQDVLLGPAIP-AAWGGGSVGGLRLRGGGVVSFSWND 754
>gi|421735948|ref|ZP_16174814.1| alpha-L-fucosidase, partial [Bifidobacterium bifidum IPLA 20015]
gi|407296769|gb|EKF16285.1| alpha-L-fucosidase, partial [Bifidobacterium bifidum IPLA 20015]
Length = 1935
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 244/877 (27%), Positives = 395/877 (45%), Gaps = 149/877 (16%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYT------NPDAPKALSDVRSLVDS 81
+P GNG++G VWG V E + NE+TLWTG PG T N + + +R+L
Sbjct: 622 LPFGNGKIGGTVWGEVSRERVTFNEETLWTGGPGSSTSYNGGNNETKGQNGATLRALNKQ 681
Query: 82 GQYAEATAASVKLFG--HPADVYQLL--GDIELEFDDSHLKYAEETYRRELDLNTATARV 137
T L G + A+ L GDI L++ + E YRR+L+L+ A V
Sbjct: 682 LANGAETVNPGNLTGGENAAEQGNYLNWGDIYLDYGFNDTTVTE--YRRDLNLSKGKADV 739
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEG 197
+ V +TRE+F+SNPD V+V +++ S++G L+FNVS+ + N +Y ++G
Sbjct: 740 TFKHDGVTYTREYFASNPDNVMVARLTASKAGKLNFNVSMPT---NTNYSKTGETTTVKG 796
Query: 198 RCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALED-KKLKVEGSDWAVLLLVA 256
+ K ++ G+ +++ +++ + + GT+S D LKV + L + A
Sbjct: 797 DT----LTVKGALGNN--GLLYNSQIKVVLDNGEGTLSEGADGASLKVSDAKAVTLYIAA 850
Query: 257 SSSFDG--PFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSR 314
++ + P ++ + + +Q N Y+ + H+ D+ ++ RV I L +
Sbjct: 851 ATDYKQKYPSYRTGETAAEVNTRVAKVVQDAANKGYTAVKKAHIADHSAIYDRVKIDLGQ 910
Query: 315 SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-AN 373
S + D + A + S T + L L++++GRYL I SSR +Q+ +N
Sbjct: 911 SGH----SSDGAVATDALLKAYQRGSATTAQKRELETLVYKYGRYLTIGSSRENSQLPSN 966
Query: 374 LQGIW------NEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
LQGIW N + W S H+N+NL+MNYW + N+ E EPL +++ L G
Sbjct: 967 LQGIWSVTADDNAHGNTPWGSDFHMNVNLQMNYWPTYSANMGELAEPLIEYVEGLVKPGR 1026
Query: 428 KTAQVNYLA-------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
TA+V A G++ H + + ++ + W P W+ ++
Sbjct: 1027 VTAKVYAGAETTNPETTPIGEGEGYMAHTENTAYGWTAPGQ-SFSWGWSPAAVPWILQNV 1085
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLI----EGHDGYLETNPSTSPEHEFIAPD 530
+E Y Y+ D L R Y LL+ + F +++++ L T + SPE + D
Sbjct: 1086 YEAYEYSGDPALLN-RVYALLKEESHFYVNYMLHKAGSSSGDRLTTGVAYSPEQGPLGTD 1144
Query: 531 GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVL----------------- 573
G +T + +++ ++ + I AA+ + + D LV
Sbjct: 1145 G--------NTYESSLVWQMLNDAIEAAKA-KGDPDGLVGNTTDCSADNWAKGDNGNFTD 1195
Query: 574 ----------KSLPRLRPTKIAEDGSIMEW-------------AQDFKDPEVHHRHLSHL 610
KSL L+P ++ + G I EW A + HRH+SHL
Sbjct: 1196 ANANRSWSCAKSL--LKPIEVGDSGQIKEWYFEGALGKKKDGSAISGYQADNQHRHMSHL 1253
Query: 611 FGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG------PGWSITWKTALWARLHDQEHAY 664
GLFPG ITI+ N + +AA+ +L+ R +G GW+I + WAR D Y
Sbjct: 1254 LGLFPGDLITID-NSEYMEAAKTSLRYRCFKGNVLQSNTGWAIGQRINSWARTGDGNTTY 1312
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQST------- 717
++V E + +Y+NLF H PFQID NFG T+ V EML+QS
Sbjct: 1313 QLV-----------ELQLKNAMYANLFDYHAPFQIDGNFGNTSGVDEMLLQSNSTFTDTD 1361
Query: 718 ----LNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDS 773
+N +LPALP W+ G V GL ARG TV WK+G EV + SN
Sbjct: 1362 GKKYVNYTNILPALP-GAWADGSVSGLVARGNFTVGTTWKNGKATEVKLTSN-------- 1412
Query: 774 FKTLHYRGTSVKVNLSAGKIYTFNRQLKCTNLHQSIV 810
+G V ++AG + + T ++ +V
Sbjct: 1413 ------KGKQAAVKITAGGAQNYEVKNGDTAVNAKVV 1443
>gi|451852884|gb|EMD66178.1| glycoside hydrolase family 95 protein [Cochliobolus sativus ND90Pr]
Length = 805
Score = 294 bits (753), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 233/785 (29%), Positives = 360/785 (45%), Gaps = 97/785 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVRSLVDSGQ 83
A+P+GNGRL AM G +ETL LN D+LW+G P +YT + ++ +
Sbjct: 38 ALPVGNGRLAAMPIGSPSAETLTLNLDSLWSGGPFEASNYTGGNPESSIDSTLPGIRDWI 97
Query: 84 YAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV 141
+ T KL G + Y++L ++ + S + Y R+LDL ++
Sbjct: 98 FTNGTGNVTKLLGTNDNYGSYRVLANLTVTIP-SLVGIQVSNYTRKLDLTNGLHSTSFNT 156
Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSL-----DSLLDNHSYV-NGNNQIIM 195
+ + F S PDQV V I S S +F + L D+ L+N + V NG
Sbjct: 157 NDTQLESTVFCSYPDQVCVYTIQSSRSLP-AFELKLGNELVDAKLENITCVANGTGADSG 215
Query: 196 EGRCPG-KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV---EGSDWAV 251
R G ++ P P+G+ + I + + D T LKV G+ A
Sbjct: 216 HVRLRGVTQLGP-------PEGMLYDTIARLLPNSDVKTTCDSNTGILKVTPENGAKSAT 268
Query: 252 LLLVASSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
+++ A +++D S DP +Q + + +L + HL+D+ L R
Sbjct: 269 VIIGAETNYDMKKGTAEHQYSFRGNDPGPAVEETIQKVSMKTLEELKSSHLEDFTSLTGR 328
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ---TDEDPSLVELLFQFGRYLLISS 364
L P + N VP+ E + S+ T DP + LLF + +YLLISS
Sbjct: 329 FEFHL---PDPL--------NSAQVPTPELIASYDSNVTSGDPFVESLLFDYAQYLLISS 377
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+ NLQG W E ++P W + H NINL+MNYW + L+E Q PL+D++ +
Sbjct: 378 SRPGSLPTNLQGRWTEQMAPDWSADYHANINLQMNYWTADQTGLTETQTPLWDYMINTWV 437
Query: 425 -NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G +TA + Y A GWV+H++ +I+ + G+ WA +P AW+ H++++++YT D
Sbjct: 438 PRGHETAMLLYGAPGWVVHNEMNIFGHTGMKDGE-GWANYPAAPAWMMLHVFDYWDYTRD 496
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEGH------DGYLETNPSTSPEHEFIAPDGKLACVS 537
+L + YPL++ A F WL + H D L NP +SPEH P C
Sbjct: 497 TTWLRTQGYPLIKSVAQF---WLSQLHADSFTNDNTLVVNPCSSPEH---GPT-TFGCAH 549
Query: 538 YSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW--- 593
Y +I +VF A+++ + +++ + + +L RL + + I EW
Sbjct: 550 YQQ-----LIHQVFEAVLTTHSLAGESDTSFTSNISSTLSRLDKGFHVGSWSQIKEWKLP 604
Query: 594 ---AQDFKDPEVHHRHLSHLFGLFPGHTITI----EKNPDLCKAAEKTLQKRG-EEGP-- 643
+F++ HRH+S L G PG++++ N + A L RG GP
Sbjct: 605 DSFGYEFQNDT--HRHISELVGWHPGYSLSSFLGGYSNTTVQSAVRNKLISRGIGNGPDA 662
Query: 644 --GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDA 701
GW W+ A WARL+D A+ ++ E++F G +S PFQIDA
Sbjct: 663 NSGWEKVWRGACWARLNDTAQAHLELRYAI-------EQNFVGNGFSMYKGERTPFQIDA 715
Query: 702 NFGFTAAVAEMLV---------QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
N+G+ V MLV Q L PA+P + W G VKGL+ RGG V W
Sbjct: 716 NYGYGGLVLSMLVVDLPAPAEGQEGKRRAVLGPAIP-ESWKGGKVKGLRIRGGGVVDFGW 774
Query: 753 KDGDL 757
DG +
Sbjct: 775 DDGGV 779
>gi|391873884|gb|EIT82888.1| alpha-L-fucosidase 2 precursor, putative [Aspergillus oryzae 3.042]
Length = 513
Score = 294 bits (752), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 180/471 (38%), Positives = 247/471 (52%), Gaps = 32/471 (6%)
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
D++ L RV + L+ S + N+ T ER K+ D DP LV L+FQFGRY
Sbjct: 15 DHEALAGRVHLDLASS--------GAAGNLPTDVRLERYKT-HPDADPELVTLMFQFGRY 65
Query: 360 LLISSSR-PGTQV--ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
LI+SSR GT NLQG+WNED P W VNINLEMNYW + NL+E PL
Sbjct: 66 SLIASSRETGTSPLPPNLQGLWNEDYEPAWGGRYTVNINLEMNYWPAGVTNLAETLGPLI 125
Query: 417 DFLTYLSINGSKTAQVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
L + G A+ Y G+V+HH TDIW + W +WPMGGAWL +L
Sbjct: 126 FLLETVKPRGQDIARRMYNCDNGGYVLHHNTDIWGDAVPVNNGTKWTMWPMGGAWLSANL 185
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD---- 530
E+Y +T D + L++R +PLL A F ++ +GYL T PS+SPE+ F+ P+
Sbjct: 186 MEYYRFTQDTNLLKERIWPLLRSAAQFYHCYVFS-FNGYLSTGPSSSPENAFVVPNDMSK 244
Query: 531 -GKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGS 589
G + + TMD ++ E+F +II +VL N + K SLP ++ +I G
Sbjct: 245 SGNEEGIDIAPTMDNTLLSELFHSIIETGKVLGIN-NTDTTKAASSLPLIKLPQIGSYGQ 303
Query: 590 IMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKR---GEEGPGWS 646
I+EW ++++ E HRH+S +FGL+PG +T N L AA L R G GWS
Sbjct: 304 ILEWRHEYQETEPGHRHMSPIFGLYPGSQMTPLVNSTLAAAARVLLDHRIAHGSGSTGWS 363
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFT 706
W +L++RL D + A+ + + + L++ FQID NFGFT
Sbjct: 364 RAWTISLYSRLFDGDAAWNHTQVFL-------KTYPSANLWNTDSGPGSAFQIDGNFGFT 416
Query: 707 AAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
A +AEML+QS ++LLPALP G V GL ARG V + W G L
Sbjct: 417 AGIAEMLLQSHAGVVHLLPALP-SAVPHGKVSGLVARGNFVVDMEWSGGKL 466
>gi|358388157|gb|EHK25751.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
Length = 794
Score = 293 bits (751), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 219/770 (28%), Positives = 364/770 (47%), Gaps = 71/770 (9%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDYTNPDAPKA-LSDVRSLVDS 81
+P+GNGR A V ET LNE + W+G G P+ PKA L + + +
Sbjct: 20 LPLGNGRFAASVLSSPAKETFILNEVSFWSGETQKAGGGLAERPEDPKAELRETQKCYLN 79
Query: 82 GQYAEATAASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEETYRRELDLNTATARVK 138
G YA+ + K + +G +++ + + REL L+ A A +
Sbjct: 80 GDYAKGKKRAEKYLESKKRNFGTNLGVGTLDIVVNGHESIGQVNGFERELRLDEAVAETR 139
Query: 139 YSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGR 198
Y++ +F R F S+P+QV+V + G + L V + +N ++ + N +G+
Sbjct: 140 YTIDGRQFKRRSFLSHPNQVLVVQFDGDDLSGLEVVVGVQG--ENEAFTSKIND---DGK 194
Query: 199 CPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASS 258
+ +D G++ I+ + D G + D KL + +L+
Sbjct: 195 LEFNAQALETVHSDGTCGVKGYGIIAATV--DEGKVEH-RDTKLVISAKKNITILV---- 247
Query: 259 SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKD 318
+F+ + P++ + T+ L+ LS +DL HL+D+Q L+ R+SI L
Sbjct: 248 TFNTDYSEPNEEWRKRTTLQ---LEEALKLSAADLLKAHLEDFQPLYRRMSISLGSKSST 304
Query: 319 IVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVA-NLQGI 377
+ + + PS DPS+ L F + RYL I+ +R + + +LQG+
Sbjct: 305 TASIRTDQRRQNFEPSGY--------ADPSMFALYFHYARYLTIAGTRHDSPLPLHLQGL 356
Query: 378 WN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYL 435
WN E W H++IN +MNY+ L S+ +PL ++L L+ +G A+ Y
Sbjct: 357 WNDGEACKMGWSCDYHLDINTQMNYFAILNGGFSDLMQPLINYLIRLAASGQHAARACYG 416
Query: 436 ASGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPL 494
+ GWV H +++W AD G +V + L GG W+ HL E + Y++D F+ A+PL
Sbjct: 417 SEGWVAHVFSNVWG--FADPGWEVSYGLNVTGGLWMANHLIEMFEYSLDEGFMANDAWPL 474
Query: 495 LEGCASFLLDWLIEG-HDGYLETNPSTSPEHEFIAPDG----KLACVSYSSTMDMAIIRE 549
L G + F L++++E G+L T PS SPE+ F +G + + + T+D+ ++R+
Sbjct: 475 LAGASKFFLNYMVEDPKTGWLLTGPSVSPENSFFVVNGDGEKEEHYAALAPTLDVVLVRD 534
Query: 550 VFS---AIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+ + +++ + N + +++ ++ +L P +I ++G + EW DF++ + +HRH
Sbjct: 535 LLAFCEYVVTKFNAGKSNWEDDIQQYQEAQAKLPPFQIGKNGQLQEWLHDFEEAQPYHRH 594
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTAL----WARLHDQEH 662
LSH L I+ PDL +AA TL++R I + AL +ARL D E
Sbjct: 595 LSHTMALCRSALISARHQPDLAEAARVTLERRQGRDDLEDIEFTAALFALNYARLGDAEK 654
Query: 663 AYRMVKRLF------NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
A + L NL+ + K G +N+F ID NFG AA+AEML++S
Sbjct: 655 AVAQIGHLVGELSFDNLLS--YSKPGVAGAEANIFV------IDGNFGGAAAIAEMLIRS 706
Query: 717 TLNDLY------LLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEV 760
+ L LLPALP WS G V G++ RGG W DG L V
Sbjct: 707 IIPRLGGPVEVDLLPALP-AAWSEGTVDGMRVRGGLEAHFEWHDGKLDGV 755
>gi|358381765|gb|EHK19439.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
Length = 788
Score = 292 bits (748), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 221/774 (28%), Positives = 360/774 (46%), Gaps = 76/774 (9%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KAL 72
PA A P+GNG+LGAM G V + + LNE +LW+G P DY NP AP AL
Sbjct: 29 PANIIMTAYPLGNGKLGAMPLGLVGEDIVVLNEHSLWSGGPFESPDYIGGNPPAPVYTAL 88
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADV----YQLLGDIELEFDDSHLKYAEETYRREL 128
+R + + Q +A L+G P Y+ LG++ ++ +Y+ +Y R L
Sbjct: 89 PGIRETIWNTQINNDISA---LYGDPTYYHYGNYETLGNLTVKIAGVS-RYS--SYNRAL 142
Query: 129 DLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVN 188
DL T + ++ +FT F + PDQV + ++ L DN
Sbjct: 143 DLETGIHQTAFTSNGAKFTITTFCTFPDQVCAYNVQSNKP----LPAVTIGLQDNQ---- 194
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSD 248
+ C + + D G+ F A ++ + T ++ + + +G
Sbjct: 195 -RSSPSSNSSCDANGVRLRGQTQQD-IGMIFDARAQVLNRPRKATCTSSHELLVPSDGKT 252
Query: 249 WAV-LLLVASSSFD----GPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+V ++ A +++D N S DP +S +Q++ S+S +Y H+ D+
Sbjct: 253 ASVTVVYAAGTNYDQKKGTKASNYSFKGVDPAPAVVSTIQAVEKKSFSSMYNAHVKDHNT 312
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLI 362
LF + ++ L S + +VP+A ++++ + DP + LLF +GRYL I
Sbjct: 313 LFSQFTLNLPDSEHSV-----------SVPTATLMENYDYNVGDPFVENLLFDYGRYLFI 361
Query: 363 SSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYL 422
S R G+ NLQGIW E+ P W S HV++N++MN+W + L + Q PL+DF+
Sbjct: 362 GSCRDGSLPPNLQGIWTENQFPAWSSDYHVDVNVQMNHWHTEQTGLGDIQGPLWDFIIDT 421
Query: 423 SI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYT 481
+ G++TA++ Y A G+V + + + VW+ +P AWL ++W Y+Y
Sbjct: 422 WVPRGTETAELLYDAPGFVGFSNLNTFG-FTGQMNSAVWSNYPASAAWLMQNVWNRYDYG 480
Query: 482 MDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSY 538
D + + YPL++ A + + ++ +DG L P SPEH + C Y
Sbjct: 481 RDTHWWKTVGYPLMKSVAEYWIHEMVPDLYSNDGTLVAAPCNSPEHGWTT----FGCTHY 536
Query: 539 SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQDF 597
++ EVF II + E +E V ++ +L P I G I EW +
Sbjct: 537 QQ-----LVWEVFDHIIDSWEDSGDTNTTFLETVKETQSKLSPGIIIGWFGQIQEWKIGW 591
Query: 598 KDPEVHHRHLSHLFGLFPGHTITIEK-NPDLCKAAEKTLQKRG----EEGPGWSITWKTA 652
P HRHLSHL G +PG++I N + A +L RG + GW W+ A
Sbjct: 592 DQPNDEHRHLSHLVGWYPGYSIGTHMWNKTVTDAVNVSLTARGNGTADSNTGWEKVWRVA 651
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHE-KHFEGGLYSNLFAAHPPFQIDANFGFTAAVAE 711
WA+L++ + AY +K ++ + + G + AA PFQIDANFG++AAV
Sbjct: 652 CWAQLNNTDIAYTYLKYAIDMNYANNGFSVYTSGSWPYELAA--PFQIDANFGYSAAVLA 709
Query: 712 MLV--------QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
ML+ + ++ + L PA+P W G V+G++ RGG +V W + L
Sbjct: 710 MLITDLPVPSASNAIHTVILGPAIP-SAWKGGSVQGMRIRGGGSVDFSWDNNGL 762
>gi|210613380|ref|ZP_03289700.1| hypothetical protein CLONEX_01907 [Clostridium nexile DSM 1787]
gi|210151222|gb|EEA82230.1| hypothetical protein CLONEX_01907 [Clostridium nexile DSM 1787]
Length = 1389
Score = 292 bits (747), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 218/704 (30%), Positives = 330/704 (46%), Gaps = 119/704 (16%)
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSE-SGS------LSFNVS 176
Y R LD++TA A V Y N + RE+F+S PD VI K++ E GS L F VS
Sbjct: 460 YERALDIDTALATVSYDRDNTHYYREYFASYPDNVIAMKLTAEEIKGSEGEMRPLEFEVS 519
Query: 177 L-------DSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISD 229
SL +Y ++ II+ G K ND ++ + L++ D
Sbjct: 520 FPVDQPGDKSLGKEVTYTTEDDSIIVAG---------KMKDND----LKLNGRLKVVTKD 566
Query: 230 DRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP----SDSKKDPTSESMSALQSI 285
G ++ +E K+ + SD + + ++ D ++P + + E +
Sbjct: 567 --GEVTPVEGKEGTLLVSDATEVYIYVTADTDYEMVHPEYRTGQTDQQLADEVKKVMDDA 624
Query: 286 RNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE 345
Y + DY+ ++ RV I + S++ ID + A + + T+E
Sbjct: 625 TKQGYDQVKENAQADYKNIYDRVKIDFGQE--------ASDKTIDELIKAYKDGNASTEE 676
Query: 346 DPSLVELLFQFGRYLLISSSRPGTQV-ANLQGIW----NEDLSPT-WDSAPHVNINLEMN 399
L ++FQ+GRYL ISSSR G ++ ANLQG+W SP W S H+N+NL+MN
Sbjct: 677 KAYLETMIFQYGRYLQISSSREGDKLPANLQGVWLDCTGAANSPVAWGSDYHMNVNLQMN 736
Query: 400 YWQSLPCNLSECQEPLFDFL------------TYLSINGSKTAQVNYLAS------GWVI 441
YW + N++EC EPL D++ TY I+ S Q ++A+ GW
Sbjct: 737 YWPTYVTNMAECAEPLIDYVEGLREPGRITASTYFGIDNSDGKQNGFMANTQNTPFGWTC 796
Query: 442 HHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASF 501
WA S W P W+ +++E Y Y+ D + LE +P++E A F
Sbjct: 797 PG----WAFS--------WGWSPAAVPWILQNVYEAYEYSGDVEKLESEIFPMMEEEAKF 844
Query: 502 LLDWLIE-----GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIIS 556
+ L E G Y+ T P+ SPEH + + + ++ ++F+ I
Sbjct: 845 YMSILKEVTDADGTKRYV-TVPAYSPEH---------GPYTAGNVYENVLVWQLFNDCIE 894
Query: 557 AAEVLEKNEDALVEKV-----LKSLPRLRPTKIAEDGSIMEWAQDFK----------DPE 601
AAE L NE V K K L+P +I + G I EW + + +
Sbjct: 895 AAEALNANEAGTVSKEQIDEWTKYRDGLKPIEIGDSGQIKEWYDETEFGQTANGAIPSFD 954
Query: 602 VHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQE 661
HRH+SHL G++PG +T++ N AA+ +L RG+ GW I + WAR D
Sbjct: 955 AKHRHMSHLLGVYPGDLVTVD-NKQYMDAAKVSLTARGDNATGWGIAQRLNTWARTGDGN 1013
Query: 662 HAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDL 721
H+Y+++ + G+YSNL+ +H P+QID NFGFT+ VAEML+QS +
Sbjct: 1014 HSYQIINQFIKT-----------GIYSNLWDSHAPYQIDGNFGFTSGVAEMLLQSNAGYI 1062
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSN 765
LLPA+P ++W++G V GL ARG VS WKDG L E I SN
Sbjct: 1063 NLLPAMPDEQWTTGSVSGLVARGNFEVSESWKDGALTEAKIVSN 1106
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 52/191 (27%), Positives = 89/191 (46%), Gaps = 46/191 (24%)
Query: 11 NPLKITFN---------GPAKHFTD-----------AIPIGNGRLGAMVWGGVPSETLKL 50
+P+KI F+ G + +FT ++PIGN +GA ++G V E L
Sbjct: 47 DPMKIRFDEPLSKGKLTGSSGNFTKPGSDTDWWQQLSLPIGNSYMGANIYGEVEKEHLTF 106
Query: 51 NEDTLWTGVPGD---YTNPDAP----KALSD-VRSLVDSGQYAEATAASV--KLFGHPA- 99
N+ TLW G P + YT + +++SD V+S+ ++ ++ A+S+ KL G +
Sbjct: 107 NQKTLWNGGPSETQPYTGGNISTVNGQSMSDYVKSVQNAFLTGDSNASSMCEKLVGTSSR 166
Query: 100 --DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV-------GNVEFTREH 150
YQ GDI L+FD EE E ++ + +KY + E EH
Sbjct: 167 EYGAYQGWGDIYLDFD------REEPQEEEKIISDTSDEIKYESMWHSYPQPDWEGGSEH 220
Query: 151 FSSNPDQVIVT 161
++++P + V+
Sbjct: 221 YTNDPGKFTVS 231
>gi|309798858|ref|ZP_07693119.1| alpha-fucosidase [Streptococcus infantis SK1302]
gi|308117507|gb|EFO54922.1| alpha-fucosidase [Streptococcus infantis SK1302]
Length = 627
Score = 292 bits (747), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 210/645 (32%), Positives = 323/645 (50%), Gaps = 81/645 (12%)
Query: 102 YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIV 160
Y GDI + F++ T Y R LD++ A Y+ F RE FSS PD V V
Sbjct: 12 YLAFGDIFMVFNNQKKGLENVTDYHRGLDISEAITTTSYTQDGTSFKRETFSSYPDDVTV 71
Query: 161 TKISGSESGSLSF---NVSLDSLLDNHSYVNGNNQIIMEGRCP--GKRIPPKANANDDPK 215
T ++ +L F N + L+ N Y + N +G I K D+
Sbjct: 72 THLTKKGDKTLDFTLWNSLTEDLIANGDY-SWENSKYKQGTVSVDSNGILLKGTVKDN-- 128
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDGPFINP-SDSKKDP 274
G+QF++ L IK G ++A +D L V G+ +A LLL A ++F NP ++ +KD
Sbjct: 129 GLQFASYLGIKTD---GQVTA-QDGYLTVTGASYATLLLSAKTNFAQ---NPKTNYRKDI 181
Query: 275 TSESM--SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTV 332
E S +++ + Y L H+ DYQ LF+RV + L S + T
Sbjct: 182 DVEKTVKSIVEAAKAKDYETLKNDHIKDYQSLFNRVQLNLGGSKSNQTT----------- 230
Query: 333 PSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV--ANLQGIWNEDLSPTWDSAP 390
E ++++ + L EL FQ+GRYLLISSSR T ANLQG+WN +P W+S
Sbjct: 231 --KEALQTYNPTKGQKLEELFFQYGRYLLISSSRNRTDALPANLQGVWNAVDNPPWNSDY 288
Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING-----------SKTAQVNYLASGW 439
H+N+NL+MNYW + NL+E +P+ +++ + G SK Q N GW
Sbjct: 289 HLNVNLQMNYWPAYMSNLAETAKPMINYIDDMRYYGRIAAKEYAGIESKEGQEN----GW 344
Query: 440 VIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCA 499
++H + + ++ W P AW+ +++++Y +T D +L+++ YP+L+ A
Sbjct: 345 LVHTQATPFGWTTPGW-NYYWGWSPAANAWMMQNVYDYYKFTKDETYLKEKIYPMLKETA 403
Query: 500 SFLLDWLI--EGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISA 557
F +L + D ++ ++PS SPEH ++ +T D +++ ++F + A
Sbjct: 404 KFWNSFLHYDKASDRWV-SSPSYSPEH---------GTITIGNTFDQSLVWQLFHDYMEA 453
Query: 558 AEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD----FKDP--EVHHRHLSHLF 611
A L+ ++D LV +V +L+P I +DG I EW ++ F + E HHRH+SHL
Sbjct: 454 ANHLKVDQD-LVTEVKTKFDKLKPLHINQDGRIKEWYEEDSPQFTNEGIENHHRHVSHLV 512
Query: 612 GLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLF 671
GLFPG T+ + P+ +AA TL RG+ G GWS K LWARL D A+R++
Sbjct: 513 GLFPG-TLFGKDQPEYLEAARATLNHRGDGGTGWSKANKINLWARLLDGNRAHRLLA--- 568
Query: 672 NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQS 716
+ NL+ H PFQID NFG T+ +AEML+QS
Sbjct: 569 --------EQLRSSTLENLWDTHAPFQIDGNFGATSGMAEMLLQS 605
>gi|329957719|ref|ZP_08298194.1| fibronectin type III domain protein [Bacteroides clarus YIT 12056]
gi|328522596|gb|EGF49705.1| fibronectin type III domain protein [Bacteroides clarus YIT 12056]
Length = 922
Score = 291 bits (746), Expect = 8e-76, Method: Compositional matrix adjust.
Identities = 249/805 (30%), Positives = 364/805 (45%), Gaps = 153/805 (19%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAE 86
++PIGNG +GA ++GG +E L+L + TL+ +R L
Sbjct: 66 SLPIGNGYMGASIFGGTSTERLQLTDKTLY------------------IRGLWG------ 101
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEF 146
A+ GD+ L+F YRR L+LN A V Y V++
Sbjct: 102 ------------AETQTSFGDLYLDF----FHDLRSDYRRSLNLNKGIAEVSYQYQGVKY 145
Query: 147 TREHFSSNPDQVIVTKISGSESGSLSFNVS--------------LDSLLDNHSYVNGNNQ 192
RE+F S PD V+V K++ + GSL+F V D++ Y++G Q
Sbjct: 146 HREYFMSYPDNVLVIKLTADKPGSLTFTVRPQIAHLVPFGPLQRTDTM--TIGYLSGPTQ 203
Query: 193 IIM-----EGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK-----L 242
EG+ K + + + A ++K+ G++SA D +
Sbjct: 204 TRFSYNGREGKVFAKDDMITLRGQTEYLKLIYEA--QVKVIPINGSMSAWNDSNADHGTI 261
Query: 243 KVEGSDWAVLLLVASSSFD---GPFIN-PSDSKK---DPTSESMSALQSIRNLSYSDLYT 295
+VE +D AV+LL +++ F N P++ K DP +E L YS L T
Sbjct: 262 RVENADSAVILLALGTNYRLSPQVFANKPAEKLKGYPDPHTEISQRLIKATQKGYSQLRT 321
Query: 296 RHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQT-DEDPSLVELLF 354
H++D+ L RV QL+ PK + P+ + +++ +D L EL F
Sbjct: 322 THINDFSSLTERV--QLNIGPKSYL------------PTDRLLAAYKAGKQDTYLEELFF 367
Query: 355 QFGRYLLISSSRPGTQVANLQGIWNE-DLSPTWDSAPHVNINLEMNYWQSLPCNLSECQE 413
+GRYLLISS+R G LQG+WN+ +L+P W+ NIN++MNYW + NL+E
Sbjct: 368 HYGRYLLISSARKGALPPTLQGVWNQYELAP-WNGNYTHNINIQMNYWPAFNTNLTEL-- 424
Query: 414 PLFDFLTYLSINGSKTAQVNYLASGWV-IHHKTDIWAKSSADRGKVVWALWPMGGAWLCT 472
F +Y + + AS ++ IHH S + G W + GA++
Sbjct: 425 ----FESYSDYHKAYKPMAEQFASKYIKIHHPQHF----SDEPGGNGWTMGTGAGAYMVG 476
Query: 473 H----------------LWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLET 516
W++Y +T D+ L++ +YP + G A FL + G L
Sbjct: 477 MPGGHSGPGMAAFTSKLFWDYYAFTNDKQILKETSYPAILGVADFLSK-VTTDTLGLLLA 535
Query: 517 NPSTSPEHEFIA---PDGKLACVSYSSTMDMAIIREVFSAIISAAEVL-EKNEDALVEKV 572
NPS SPE A P + C D +I E I AA +L E NE+ + K
Sbjct: 536 NPSASPEQYAKATNRPYPTIGCA-----FDQQMIYENHQDAIRAANLLGEHNENIRLFK- 589
Query: 573 LKSLPRLRPTKIAEDGSIMEWAQD--FKDP--EVHHRHLSHLFGLFPGHTITIEKNPDLC 628
+ RL P +I G I E+ ++ + D E HHRHLS L GL+PG T+ E P
Sbjct: 590 -EQSKRLDPVQIGYSGQIKEYREEKYYGDIVLEQHHRHLSQLIGLYPG-TLINENTPAWL 647
Query: 629 KAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYS 688
AA+ TL +RG+ GWS+ K LWAR + A+ +V L G+
Sbjct: 648 DAAKVTLNRRGDVSTGWSMAHKINLWARAKEGNRAHDLVAALLT-----------NGIRE 696
Query: 689 NLFAA-----HPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKAR 743
NL+A PFQIDANFG TA +AEML+QS +++LPALP D W G KGL AR
Sbjct: 697 NLWATCLAVLRSPFQIDANFGGTAGIAEMLLQSHEGYIHILPALP-DAWKDGSYKGLTAR 755
Query: 744 GGETVSICWKDGDLHEVGIYSNYSN 768
G VS WK+G L E + S +N
Sbjct: 756 GNFEVSASWKEGRLTEAKVLSKQNN 780
>gi|225018990|ref|ZP_03708182.1| hypothetical protein CLOSTMETH_02941 [Clostridium methylpentosum
DSM 5476]
gi|224948215|gb|EEG29424.1| hypothetical protein CLOSTMETH_02941 [Clostridium methylpentosum
DSM 5476]
Length = 1743
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 242/837 (28%), Positives = 373/837 (44%), Gaps = 139/837 (16%)
Query: 7 TSTTNPLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPG 61
T+ T L++ ++ PA + ++P+G G +GA V+G +E +++ E++L
Sbjct: 44 TTGTKELRLWYDEPAPDSDNGWEQWSLPLGCGYMGANVFGRTDTERIQITENSL------ 97
Query: 62 DYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAE 121
NP P + ++VY ++F+ ++
Sbjct: 98 --ANPYNPG------------------------LNNFSEVY-------IDFNHAN----P 120
Query: 122 ETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL 181
Y R+LD+ A A V Y +TRE+F+S PD+V+ ++S S++G LSF +L
Sbjct: 121 SNYTRDLDIREAVAHVNYDWEGTTYTREYFTSYPDKVMAIRLSASDAGKLSF-----TLR 175
Query: 182 DNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAIL---------EIKISDDRG 232
+V N PG + + + + I S + ++K+ G
Sbjct: 176 PTVPFVKDYN------TTPGDGMGKSGSVSAEGDTITLSGNMHYYDIDFEGQLKVIPTGG 229
Query: 233 TISALEDKK-----LKVEGSDWAVLLLVASSSFDGP---FINPSDSKK-----DPTSESM 279
++ A D + VE +D AV+L+ +++ F P KK P ++
Sbjct: 230 SMRANNDDNGVNGTITVENADSAVILMAVGTNYQMESRVFTEPDAKKKLDGYEHPHAKVT 289
Query: 280 SALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVK 339
+Q S+ +L H DYQ+ F+RV++ L + TD +
Sbjct: 290 QYIQDASQKSFDELLEAHKADYQQYFNRVNLNLGAEVPQVTTDVL-------------LN 336
Query: 340 SFQT-DEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNE-DLSPTWDSAPHVNINLE 397
+++ D L EL FQ+GRYLLI+SSR GT NLQGIWN D SP W + NIN++
Sbjct: 337 NYKKGDTSQYLDELYFQYGRYLLIASSRKGTLPGNLQGIWNRYDQSP-WSAGYWHNINIQ 395
Query: 398 MNYWQSLPCNLSECQEPLFDFL------------TYLSINGSK-TAQVNYLASGWVIHHK 444
MNYW + NL+E E D+ YL GSK A+ +GW I
Sbjct: 396 MNYWPAFSTNLAEMFESYADYNEAFREAAQQNADQYLKQTGSKLMAEAGTGENGWAI--G 453
Query: 445 TDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD 504
T W A+ P GA+ W++Y++T D D L YP +EG A FL
Sbjct: 454 TGTWPY-RAEAPSATGHSGPGTGAFTTKLFWDYYDFTRDEDVLRDTTYPAIEGMAKFLSK 512
Query: 505 WLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKN 564
LIE DG PS SPE G + D +I E + +I AA++L +
Sbjct: 513 TLIE-EDGKQLAYPSASPEQR----QGSGYYRTTGCAFDQQMIYENHNDLIKAADILGID 567
Query: 565 EDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV---HHRHLSHLFGLFPGHTITI 621
+V+ + + +L P + G + E+ ++ E+ HRH+S L GL PG T+
Sbjct: 568 SQ-IVDTCKEQIDKLDPVNVGYSGQVKEYREENYYGEIGEYQHRHISQLVGLQPG-TLIN 625
Query: 622 EKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKH 681
P AA+ TL KRG++ GW++ + LWAR D +Y + + L
Sbjct: 626 SSTPAWMDAAKVTLNKRGDKSTGWAMAHRLNLWARTGDGNRSYTLFQNL----------- 674
Query: 682 FEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLK 741
+ G +NL+ HPPFQID N+G TA VAEML+QS + L A P D W++G +GL
Sbjct: 675 LKNGTLTNLWDTHPPFQIDGNYGGTAGVAEMLLQSQEGVIMPLAARP-DAWANGSYQGLV 733
Query: 742 ARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNR 798
ARG VS W +G + I SN K +Y V S G++ +F +
Sbjct: 734 ARGNFEVSADWANGQATKFEITSNKGG----ECKLSYYNIADAVVKTSDGQVVSFTK 786
>gi|320537187|ref|ZP_08037155.1| hypothetical protein HMPREF9554_01893 [Treponema phagedenis F0421]
gi|320145965|gb|EFW37613.1| hypothetical protein HMPREF9554_01893 [Treponema phagedenis F0421]
Length = 735
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 223/719 (31%), Positives = 340/719 (47%), Gaps = 89/719 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---------GDYTNPDA-----PKAL 72
++PIGNG +GA ++GG+ E L LNE TLWTG P G+ T D
Sbjct: 57 SLPIGNGFIGASIFGGIRREYLHLNEKTLWTGGPCKKRPNYSGGNKTGVDENGYTPADYF 116
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPAD----VYQLLGDIELEFDDS-HLKYAE-----E 122
+ +R+L G+ AEA A KL G A YQ G ++F S H +E +
Sbjct: 117 AKIRTLFSEGKDAEAAALCDKLVGEKASEGYGAYQSFGKFFIDFYYSAHTALSEPPAEIK 176
Query: 123 TYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLD 182
YRRELDLN A V+Y E+ R +F++ P V+ KI+ S L +V +S
Sbjct: 177 AYRRELDLNQALVEVRYQYNTTEYRRMYFANYPSNVLAGKITASNP-VLHCSVHFESD-Q 234
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
S N + G K ND ++F +L +I D I+ DK +
Sbjct: 235 GGSISYTQNGFTLSG---------KVEDND----LEF--LLRCRIRTD--GITTCSDKGI 277
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+ + + L +++ + + P P + L N S+ L H+ DY
Sbjct: 278 SITQASFLEFFLCSATDYSDSY--PKYRTGFPPHIDEANL----NKSFDALLAEHIKDYC 331
Query: 303 KLFHRVSIQLSR-SPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLL 361
LF R + + + S D+ TD E + S + L +LLFQ+GRYLL
Sbjct: 332 PLFDRCRLNIGQDSEPDMPTDVLLSEYKNGKFSRK------------LEDLLFQYGRYLL 379
Query: 362 ISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLT 420
+SSSR + ANLQG+WN SP W S H+NINL+MNYW + L EC PL ++
Sbjct: 380 LSSSREKNILPANLQGMWNNSNSPPWASDYHLNINLQMNYWLACVTGLPECCIPLVKYVA 439
Query: 421 YLSINGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
L +TA+ G ++ H + + W P W+ +LW++Y
Sbjct: 440 ALEKPAERTAKAYTGLDGGLMIHTQNTPFGWTCPGWSFDWGWSPAAFPWILQNLWQYYCA 499
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLI-EGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
+ D L++ YPL + F L+ + L ++P+ SPEH P +
Sbjct: 500 SGDFTRLKEIIYPLFKKEIQFYTAVLVFDKKQNRLVSSPTYSPEH---GPR------TNG 550
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFK- 598
+T + ++I E+F I AA++ + + AL+ + K L+P I + I+EW + +
Sbjct: 551 NTYEQSLIWELFKQGIEAAKLCGEKK-ALIAQWKKVQENLKPIVIGKSRQILEWYTEEEL 609
Query: 599 --DPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWAR 656
E HHRH+SHL G++PG IT E + DL AA+++L+ RG++ GW++ + WAR
Sbjct: 610 GSIGEKHHRHISHLLGVYPGTLITKE-DTDLAAAAKRSLEARGDKSTGWAMAQRILTWAR 668
Query: 657 LHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ 715
L + + AY +++ + +Y NL A HPPFQID NFG TAA+AE+ +
Sbjct: 669 LGEGKRAYAILQTMIQTC-----------IYDNLLATHPPFQIDGNFGLTAAIAELFLH 716
>gi|452988935|gb|EME88690.1| glycoside hydrolase family 95 protein [Pseudocercospora fijiensis
CIRAD86]
Length = 646
Score = 290 bits (742), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 170/394 (43%), Positives = 223/394 (56%), Gaps = 32/394 (8%)
Query: 376 GIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNY- 434
G+WN D P W S NIN++MNYW + NLSEC E LF FL L+ G KTA+ Y
Sbjct: 227 GLWNRDEKPVWGSKYTANINVQMNYWPAEITNLSECHEVLFTFLKRLAARGKKTAKEMYG 286
Query: 435 LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCT-HLWEHYNYTMDRDFLEKRAYP 493
+ GWV HH TDIWA + + W + GAWL H+WE Y ++ D FL + +
Sbjct: 287 IDRGWVSHHNTDIWADPTPQDRSICATYWNLSGAWLVVGHIWERYLFSRDEGFL-RENWD 345
Query: 494 LLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDG----KLACVSYSSTMDMAI 546
+++G A F +++L+E DG L T+PS S E+ + DG ++ V T D I
Sbjct: 346 IMKGSAEFFVEFLVEDGGKKDGKLVTSPSVSAENSYFYVDGEGKRQVGSVCAGPTWDSQI 405
Query: 547 IREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRH 606
+RE+F A + A +L + E E VL LP+ +I G IMEW +DF++ E HRH
Sbjct: 406 LRELFGACVQAGRILGE-ETGEFEGVLGRLPQ---DEIGMFGQIMEWREDFEEVEPGHRH 461
Query: 607 LSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG---WSITWKTALWARLHDQEHA 663
+SHL+GLFPG +I ++ D AA TL++R E G G WS+ W L ARL D+E A
Sbjct: 462 VSHLWGLFPGTSIQAKEMKD---AARVTLKRRLEAGGGHTSWSLAWIQCLCARLRDEELA 518
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
MV ++ G + NLFA HPPFQID NFG+TAAVAEML+QS + L
Sbjct: 519 QEMVGKM------------SGAVLENLFANHPPFQIDGNFGYTAAVAEMLLQSHEGPIDL 566
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDL 757
LP L D G VKGL+ARG V I WKDG L
Sbjct: 567 LPCLLADWAEGGSVKGLRARGNVVVDISWKDGKL 600
Score = 74.3 bits (181), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/157 (34%), Positives = 73/157 (46%), Gaps = 26/157 (16%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
+ PA + D +PIGNGRLGAMV G E L LNED++W G P + NP A K L VR
Sbjct: 8 YTTPANLWEDGLPIGNGRLGAMVRGTTNVERLWLNEDSVWYGGPQERVNPGALKNLDRVR 67
Query: 77 SLVDSGQYAEATAASVKLFGHPADV---YQLLGDIELEFDD-----SHLKYA-------- 120
L++ + +EA + F + Y+ LGD+ L F H ++
Sbjct: 68 DLINQRRISEAENLMSRTFTAMPECMRHYEPLGDLMLYFGHGVDPPGHHQHVVGIPQFEN 127
Query: 121 ---------EET-YRRELDLNTATARVKYSVGNVEFT 147
E T Y+RELDL T V+Y + T
Sbjct: 128 QKWSGGGGKEVTGYKRELDLRTGVVSVEYECDDQAMT 164
>gi|396466146|ref|XP_003837624.1| similar to glycoside hydrolase family 95 protein [Leptosphaeria
maculans JN3]
gi|312214186|emb|CBX94180.1| similar to glycoside hydrolase family 95 protein [Leptosphaeria
maculans JN3]
Length = 807
Score = 289 bits (739), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 230/775 (29%), Positives = 354/775 (45%), Gaps = 93/775 (12%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YTNPDAPKALSDVRSLVDSGQ 83
A P+GNGRLGAM +G ET+ LN D+LW+G P + YT + A++ +
Sbjct: 46 AYPLGNGRLGAMPFGPAGQETVNLNLDSLWSGGPFETVSYTGGNPTSAVAQALPGIRDWI 105
Query: 84 YAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEET-YRRELDLNTATARVKYS 140
+ T +L G + Y++LG++ + + T + R LD+ +Y
Sbjct: 106 FTNGTGNVTELLGEDGNFGSYRVLGNLSVSIPSLQIGNVSITGFTRTLDIVNGIHTTRYK 165
Query: 141 VGNVEFTREHFSSNPDQVIVTKISGSESGSLS-FNVSLDSLLD-----------NHSYVN 188
V E F S PDQV V S SG L +SLD+ L +H +
Sbjct: 166 VDENEINTTVFCSYPDQVCV--YSAQSSGQLPVLQLSLDNELVTSELKTRTCEVDHVRMR 223
Query: 189 GNNQIIMEGRCPGKRIPPKANANDDPKGIQFS-----AILEIKISDDRGTISALEDKKLK 243
G Q+ G G R A P+GI+ S AIL I ++ +++ + +
Sbjct: 224 GVTQV---GPPEGMRYDAIARVAS-PEGIKMSCINGTAILNITPNNGTNSVTVILGAETD 279
Query: 244 VEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQK 303
+ ++ FD F +DP + Q + +L H++D+
Sbjct: 280 YDQKK-------GTAEFDYSF-----RGEDPGPTVEATTQKAAAKTSVELVGAHVEDFTS 327
Query: 304 LFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLIS 363
L R + L TDT + T+ ER S T+ DP L LLF + YL IS
Sbjct: 328 LSERFKLSL--------TDTLNSLQTPTLDLIERYDSEDTNGDPYLESLLFDYSNYLFIS 379
Query: 364 SSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLS 423
SSR G+ NLQG W+E L W H NINL+MN+W + L++ Q PL+D++
Sbjct: 380 SSRAGSLPPNLQGRWSEGLYAAWSGDYHANINLQMNHWTADQTGLTDLQSPLWDYMADTW 439
Query: 424 I-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ G++TA++ Y A GWV+H++ +I+ + G A + AW+ H+++H++Y+
Sbjct: 440 VPRGTETAELLYDAPGWVVHNEMNIFGHTGMKSGASW-ANYAAAAAWMMQHVYDHWDYSR 498
Query: 483 DRDFLEKRAYPLLEGCASFLLDWL---IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYS 539
D +L+ + YPLL+G A F L L + +D L P SPEH P AC +
Sbjct: 499 DTAWLKSQGYPLLKGVAKFWLHQLQLDMFSNDNSLVVIPCNSPEH---GPT-TFACAHFQ 554
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPT-KIAEDGSIMEW----A 594
+I ++F AI++ + ++ +++ A + SL L I G I EW +
Sbjct: 555 Q-----VIHQLFDAILTLSPIVSESDTAFTTNISSSLKFLDTGFHIGSFGQIKEWKLPDS 609
Query: 595 QDFKDPEVHHRHLSHLFGLFPGHTITI----EKNPDLCKAAEKTLQKRGE-EGP----GW 645
+ P HRHLS L G +PG++++ N + A + L RG GP GW
Sbjct: 610 FGYDIPNDTHRHLSELVGWYPGYSLSSFLSGYTNKTIASAIRQKLISRGNGNGPDANAGW 669
Query: 646 SITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGF 705
W+ A WARL+D + A+ ++ +++F G +S PFQIDANFG
Sbjct: 670 GKVWRAACWARLNDTQQAHYHLRYAI-------QENFAGNGFSMYSGTGAPFQIDANFGL 722
Query: 706 TAAVAEMLV--------QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
AV MLV + + L PA+P W +G V+GL+ RGG V W
Sbjct: 723 GGAVLSMLVVDLPQVVGDERVKSVVLGPAIP-KAWGAGSVEGLRVRGGGVVGFEW 776
>gi|256832984|ref|YP_003161711.1| hypothetical protein Jden_1765 [Jonesia denitrificans DSM 20603]
gi|256686515|gb|ACV09408.1| conserved hypothetical protein [Jonesia denitrificans DSM 20603]
Length = 819
Score = 288 bits (737), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 233/848 (27%), Positives = 356/848 (41%), Gaps = 113/848 (13%)
Query: 9 TTNPLKITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD-YTNP- 66
T +P +++ N P + +A+P+GNG LG M TL +N W+G P Y P
Sbjct: 15 TDSPEQLSLNAPCTTWVEALPLGNGILGVMDGAHAAHTTLWINHHATWSGHPATAYQLPP 74
Query: 67 --DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETY 124
D P L + R + Y T ++L + + L A T
Sbjct: 75 AADNPTWLIEARLALARQDYPTIT--------------RILKSTQTPHSQAFLPLAHLTL 120
Query: 125 ---------RRELDLNTATARVKYSVGNVEFTRE--------------HFSSNPD----- 156
R LD +TAT+ Y+ + H P
Sbjct: 121 TPTHSVTFISRHLDFSTATSHAIYATADNSTIHHRTWVPRADNYSPPFHLPDTPHAPPGD 180
Query: 157 -QVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPK 215
I+ I+ +L + +S D+LL H+ + ++ + R P P +
Sbjct: 181 GSAIIHTITNHSPHTLHYTISTDTLLRPHTQ-HTTHRPHLTVRLPSDVAPTHETTDHHIT 239
Query: 216 GIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFD-----GPFINPSDS 270
SA + + G +L+L A++ D P I +
Sbjct: 240 YDHTSASQTLTWATTSAATPTTLTIAPHTTG----ILVLTANTPADPTEPTAPVITHLHT 295
Query: 271 KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENID 330
+ ++++ + + Y RH+ +++++ R S+ ++ P
Sbjct: 296 HAERIRDALTNAGTPPTAELAGPYARHVAAHRQMYTRTSLHIAADPH------------- 342
Query: 331 TVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAP 390
A R F GR+LLI++ P LQG+WN +L P W S
Sbjct: 343 ----ATRQ---------------FHMGRHLLITTLHPNALPITLQGLWNAELPPPWSSNY 383
Query: 391 HVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSIN-GSKTAQVNYLASGWVIHHKTDIWA 449
+NIN MNYW + L E L +LT + G A Y A G+V+HH +D W
Sbjct: 384 TLNINTPMNYWAADQVGLGEHHTQLRHWLTRAAAGPGRYIANALYHAPGFVLHHNSDRWG 443
Query: 450 KSS---ADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWL 506
++ A G W+ WPMGG WL W+H YT D +PL+EG A F L WL
Sbjct: 444 YATPAGAGHGDPAWSFWPMGGLWLTLTAWDHITYTDDLTD-AAHLWPLIEGAAHFALHWL 502
Query: 507 IEGHDGYL-ETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE 565
HDG + PSTSPEH F DG ++ + TMD+A++ E+ AA +L K+
Sbjct: 503 T--HDGTTTHSAPSTSPEHTFTH-DGTTTAITDTPTMDIALLTELHQVATHAAAMLNKDA 559
Query: 566 D--ALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEK 623
A + +++ LP R I G + EW + E +HRHLSHL GL+P +T
Sbjct: 560 PWLAPLGRLIADLPTPR---ITTSGHLAEWTHNHPSAEPNHRHLSHLIGLYPFRHLT--- 613
Query: 624 NPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFE 683
P+L AA +L RG E GW++ W+ AL AR E A + R + +H
Sbjct: 614 TPELRDAAMASLNARGPESTGWALAWRIALSARARRNEDAATWIARSLRPMT-QHTGPHH 672
Query: 684 GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKAR 743
GGLY +L +AHPPFQID N G+ A V L+ +T + + LLPALP W+ G + GL
Sbjct: 673 GGLYPSLLSAHPPFQIDGNLGYLAGVCACLIDATTDTITLLPALP-PAWTQGHITGLHLP 731
Query: 744 GGETVSICWKDG--DLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVNLSAGKIYTFNRQLK 801
G T I W++ DL V +++ + +T+ + T + ++ G+ F +
Sbjct: 732 GRLTCEITWRNAAPDLVTVTLHAQARQ---PARRTISFGTTQRSITVTPGETLRFTGRHL 788
Query: 802 CTNLHQSI 809
N Q I
Sbjct: 789 QENTTQPI 796
>gi|429725255|ref|ZP_19260101.1| hypothetical protein HMPREF9999_00369 [Prevotella sp. oral taxon
473 str. F0040]
gi|429150390|gb|EKX93301.1| hypothetical protein HMPREF9999_00369 [Prevotella sp. oral taxon
473 str. F0040]
Length = 1045
Score = 288 bits (737), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 232/796 (29%), Positives = 369/796 (46%), Gaps = 114/796 (14%)
Query: 12 PLKITFNGPAKHFTD-----AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNP 66
PL + + PA ++ ++P+GNG LGA ++GG+ + ++LNE T+WTG P D +
Sbjct: 196 PLTLWYTKPAMGVSNPWMEYSLPLGNGHLGASLFGGIQVDQIQLNEKTIWTGTPTDMGHY 255
Query: 67 DAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRR 126
+ L + F H D+ FD + K Y R
Sbjct: 256 GGYRNLGGI-------------------FVH---------DLSGNFDKTTKK--ANGYSR 285
Query: 127 ELDLNTATARVKYS-VGNVEFTREHFSSNPDQVIVT--KISGSESGSLSFN-VSLDSLLD 182
LD+ V +S ++ R +FSS PD V+ K +G L F V+ + +
Sbjct: 286 FLDIERGIGGVDFSDSQGTKYERRYFSSAPDDVVAAHYKATGDNKLHLRFALVAGEEINA 345
Query: 183 NHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ + N + G+ P + ++A +K+ GT++ ++ +
Sbjct: 346 SDPSYDKNGEAFFAGKLP---------------TVYYNA--RMKVVPTGGTMTVTKE-GI 387
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNL---SYSDLYTRHLD 299
+V+ + ++ A+S+FD PS S D T+ + + S+++L + H+
Sbjct: 388 EVKDATEVKVIFSAASTFDSNV--PSRSSGDATTMATKVQDIVTKAAAKSWAELESAHVA 445
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRY 359
D++ RV + L D V+ +E I + R + + E L +L F +GRY
Sbjct: 446 DFESYMGRVKLNLD----DAVSRKHTESLIGFYNTNTRNRD--SKEGLFLEQLYFNYGRY 499
Query: 360 LLISSSRPGTQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDF 418
L+ISSSR V +NLQGIWN+ + W+S H NIN++MNYW + NLS+C P F
Sbjct: 500 LMISSSRGAINVPSNLQGIWNDKANAPWNSDIHTNINVQMNYWPAETTNLSDCHLP---F 556
Query: 419 LTYLSINGSKTAQVNYL-------ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLC 471
L Y+ N + N GW + +++I+ S R + AW C
Sbjct: 557 LNYILDNYKEKGWQNAARWGQDGQKVGWTVFTESNIFGGMSQFRTN-----YKEVNAWYC 611
Query: 472 THLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIA 528
THLW+HY +T D FL K A+P + A F ++ +I+ DG SPE +
Sbjct: 612 THLWDHYRFTRDEAFLRK-AFPAIWQSAQFWMERMIQDKVKKDGTFVAPNEYSPEQDNHP 670
Query: 529 PDGKLACVSYSSTMDMAIIREVFSAI------ISAAEV------LEKNEDALVEKVLK-- 574
+ A T ++ I +E + + +SAA+V +EK + L + K
Sbjct: 671 TEDGTAHAQQLITANLQIAQEAINILGAESLGLSAADVAQLKKYVEKTDKGLHIEEYKGD 730
Query: 575 ------SLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLC 628
+L + TK+ ++ ++A + HRH+SHL L+P + +E+ D
Sbjct: 731 WGNWATNLGINKGTKLLKE---WKYASYSVSGDKGHRHMSHLMCLYPLN--QVERGDDYF 785
Query: 629 KAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYS 688
+ A L RG+E GWS+ WK LWAR D +HA R++ + + GG+Y
Sbjct: 786 QPAVNALALRGDEATGWSMGWKVNLWARAKDGDHARRILNNALKHSTAYNTDQYRGGIYY 845
Query: 689 NLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV 748
NL+ +H PFQID NFG A +AEML+QS + + LLPALP W +G + GLKA G TV
Sbjct: 846 NLYDSHAPFQIDGNFGVCAGIAEMLLQSQNDVIELLPALP-RAWKNGSITGLKAVGNFTV 904
Query: 749 SICWKDGDLHEVGIYS 764
+ WK+ EV I S
Sbjct: 905 DVAWKNLLPSEVKIVS 920
>gi|358396613|gb|EHK45994.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
206040]
Length = 793
Score = 286 bits (731), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 215/775 (27%), Positives = 354/775 (45%), Gaps = 78/775 (10%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVR 76
P IGNGR G + G + L LN+D++W G P YT + +L+
Sbjct: 28 PGNVLMTGYTIGNGRQGGLPLGIPGDDLLCLNDDSVWRGGPFSNSSYTGGNPSSSLAHFL 87
Query: 77 SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
+ + T L+G +D Y+ L ++ + KY+ Y+R LDL TA
Sbjct: 88 PGIQEFIFQNGTGDESALYGGSSDYGSYEALANLTVSIAGV-TKYSN--YKRTLDLETAL 144
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNVSLDSLLDNHSYVNGNNQI 193
+++ F F + PDQV V +S ++ ++F L+DN+ N
Sbjct: 145 HSAEFTANGASFQTVQFCTFPDQVCVYHVSSNKPLPDITF-----GLVDNYRT---NPAS 196
Query: 194 IMEGRCPGKRIPPKANANDDPK--GIQFSAILE-IKISDDRGTISALEDKKLKVEGSDWA 250
++ G + + A+D G++ A + S + T ++ L + A
Sbjct: 197 TVQCSSSGIWLSGRTVADDGEGLIGMKIDAQASALSSSGLKATCNSRGQTVLSTKSVKSA 256
Query: 251 VLLLVASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+++ + + +D N +++ DP + + ++ SY+ + RH+ D+ + F+
Sbjct: 257 TIVVASGTEYDAEKGNAANNYSFRGADPHPGVVKTINAVSKKSYNAILQRHVADHGEWFN 316
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
+ ++ L N V S E + ++ TD+ DP + LL +G+Y+ I+SS
Sbjct: 317 KFTLDLP-----------DPNNSAEVDSMELLTNYSTDKGDPFVEGLLIDYGKYMFIASS 365
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI- 424
RPG+ NLQG W D +P W S H+++N++MN+W L +PL+DF+TY +
Sbjct: 366 RPGSLPPNLQGSWAPDGNPAWSSDYHIDVNVQMNHWHVEKMGLGGLTDPLWDFMTYTWVP 425
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TA++ Y ASGWV T+I+ +A W+ AW+ H+W+ Y+Y D+
Sbjct: 426 RGTETARLWYNASGWVAFTNTNIFGH-TAQENDATWSDVAHDIAWMMAHVWDRYDYGRDK 484
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDG--KLACVSYS 539
++ YPL++G ASF +D L++ DG L NP SPEH P G C +
Sbjct: 485 NWYASVGYPLMKGVASFWMDLLVQDDYFKDGTLVANPCNSPEH---GPTGFQTFGCAQFQ 541
Query: 540 STMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQDFK 598
+I E+F II + + ++++ +S +L P + G I EW D
Sbjct: 542 Q-----VIWELFDHIIKDWNASGDRDASFLKRLKESYGKLDPGVHVGSWGQIQEWKLDID 596
Query: 599 DPEVHHRHLSHLFGLFPGHTITI--EKNPDLCKAAEKTLQKRG----EEGPGWSITWKTA 652
HRHLSHL+G +PG+ I+ N + A +L RG + GW W+ A
Sbjct: 597 VKNDTHRHLSHLYGFYPGYVISSVHGDNKTIMDAVATSLYSRGNGTDDSNTGWEKVWRGA 656
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP-----PFQIDANFGFTA 707
W +L + AY+ +K ++ GL + P PFQIDANFG +A
Sbjct: 657 CWGQLGVTDEAYKELKYTIDM------NFAANGLSVYTAGSWPYELALPFQIDANFGLSA 710
Query: 708 AVAEMLV--------QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
ML +++ + L PA+P +W+ G VKG RGG TV W D
Sbjct: 711 NALAMLYTDLPKKWGDNSVQKVILGPAIP-AEWAGGSVKGASLRGGGTVDFGWDD 764
>gi|452002453|gb|EMD94911.1| glycoside hydrolase family 95 protein [Cochliobolus heterostrophus
C5]
Length = 805
Score = 285 bits (729), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 235/790 (29%), Positives = 364/790 (46%), Gaps = 107/790 (13%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDAP--KALSDVRSLV 79
A+P+GNGRL AM G +ETL LN D+LW+G P +YT NP + AL +R +
Sbjct: 38 ALPVGNGRLAAMPIGPPSAETLTLNLDSLWSGGPFEASNYTGGNPQSSIDSALPGIRDWI 97
Query: 80 DSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARV 137
+ T KL G + Y++L ++ + S + Y R+LDL
Sbjct: 98 ----FTNGTGNVTKLLGTNDNYGSYRVLANLTVAIP-SLVGSQVSNYTRKLDLANGLHST 152
Query: 138 KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSL-SFNVSL-----DSLLDNHSYV-NGN 190
++ + + F S PDQ+ V + SGSL +F + L D+ L+N + V NG
Sbjct: 153 SFNTNDTQLETTVFCSYPDQICVYTVQ--SSGSLPAFELKLGNELVDAKLENKTCVANGT 210
Query: 191 NQIIMEGRCPG-KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKV---EG 246
R G ++ P P+G+ + I + + D L V +G
Sbjct: 211 GADSGHLRLRGVTQLGP-------PEGMLYDTIARLLPNSDVKATCDSNTGILTVTPGDG 263
Query: 247 SDWAVLLLVASSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQ 302
+ A +++ A +++D S DP ++ + +L + HL+D+
Sbjct: 264 AKSATVIIGAETNYDMKKGTAEHQYSFRGNDPGPVVEETIRKASTKTLEELKSSHLEDFT 323
Query: 303 KLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQ---TDEDPSLVELLFQFGRY 359
L R L P + N VP+ E + S+ T DP + LLF + +Y
Sbjct: 324 SLTGRFEFLL---PDPL--------NSAQVPTPELMASYDSNVTSGDPFVENLLFDYAQY 372
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSRPG+ NLQG W E ++P W + H NINL+MNYW + L+E Q PL+D++
Sbjct: 373 LLISSSRPGSLPTNLQGRWTEQMAPDWSADYHANINLQMNYWTADQTGLTETQTPLWDYM 432
Query: 420 TYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHY 478
+ G +TA + Y A GWV+H++ +I+ ++ G+ WA +P AW+ H+++++
Sbjct: 433 INTWVPRGHETAMLLYGAPGWVVHNEMNIFGHTAMKDGE-GWANYPAAPAWMMLHVFDYW 491
Query: 479 NYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH------DGYLETNPSTSPEHEFIAPDGK 532
+YT D +L + YPL+ A F WL + H D L NP +SPEH P
Sbjct: 492 DYTRDTTWLRTQGYPLIRSVAQF---WLSQLHADSFTNDNTLVVNPCSSPEH---GPT-T 544
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIM 591
C Y +I +VF A+++ ++ +++ V +L RL + + I
Sbjct: 545 FGCAHYQQ-----LIHQVFEAVLTTHSLVGESDTEFTSNVSSTLSRLDKGFHVGSWSQIK 599
Query: 592 EW------AQDFKDPEVHHRHLSHLFGLFPGHTITI----EKNPDLCKAAEKTLQKRG-E 640
EW +F++ HRH+S L G PG++++ N + A L RG
Sbjct: 600 EWKLPDSFGYEFQNDT--HRHISELVGWHPGYSLSSFLGGYSNTTVQSAVRNKLISRGIG 657
Query: 641 EGP----GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP 696
GP GW W+ A WARL+D A+ ++ E++F G +S P
Sbjct: 658 NGPDANSGWEKVWRGACWARLNDTAQAHLELRYAI-------EQNFVGNGFSMYKGERTP 710
Query: 697 FQIDANFGFTAAVAEMLVQ---------STLNDLYLLPALPWDKWSSGCVKGLKARGGET 747
FQIDAN+G+ V MLV + L PA+P + W G VKGL+ RGG
Sbjct: 711 FQIDANYGYGGLVLSMLVVDLPAPAEGLEGKRRVVLGPAIP-ESWKGGKVKGLRIRGGGV 769
Query: 748 VSICWKDGDL 757
V W DG +
Sbjct: 770 VDFGWDDGGV 779
>gi|358383160|gb|EHK20828.1| glycoside hydrolase family 95 protein [Trichoderma virens Gv29-8]
Length = 791
Score = 285 bits (729), Expect = 8e-74, Method: Compositional matrix adjust.
Identities = 211/773 (27%), Positives = 353/773 (45%), Gaps = 77/773 (9%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYTNPDAPKALSDVR 76
P IGNGR G + G ++ L LN+D++W G P YT + +L+
Sbjct: 29 PGNVLMTGYTIGNGRQGGLPLGIPGNDLLCLNDDSIWRGGPFANSSYTGGNPSSSLAHFL 88
Query: 77 SLVDSGQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTAT 134
+ + T +L+G AD Y+ L ++ + Y++ Y+R LDL TA
Sbjct: 89 PGIQEAIFQNGTGDESELYGGTADYGSYEALANLTVSIAGV-TNYSK--YKRTLDLETAL 145
Query: 135 ARVKYSVGNVEFTREHFSSNPDQVIVTKISGSES-GSLSFNVSLDSLLDNHSYVNGNNQI 193
+++ F+ F S PDQV V +S ++ ++F L+DN+ N
Sbjct: 146 HSAEFTANGATFSTVQFCSFPDQVCVYHVSSNKPLPQITF-----GLVDNYRT---NPPS 197
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKK---LKVEGSDWA 250
++ G + + AND I + + G + + L + + A
Sbjct: 198 TVKCSSSGIWLSGRTVANDGEGLIGMKIDAQARALPSAGLKAICNSQGQTVLSTKSAKSA 257
Query: 251 VLLLVASSSFDGPFINPSDSKK----DPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+++ + + +D N + + DP + + ++ SY+ + H+ D+ + F+
Sbjct: 258 TIVVASGTEYDATKGNAAHNYSFRGVDPYPGVVKTINAVSKKSYNTILQSHVKDHGEWFN 317
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFGRYLLISSS 365
+ ++ L D + ++DT+ E + ++ T++ DP + LL ++G+Y+ I+SS
Sbjct: 318 KFTLDLP--------DPHNSADVDTM---ELLTNYTTEKGDPFVENLLIEYGQYMFIASS 366
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI- 424
RPG+ NLQG W D +P W S H+++N++MN+W L +PL+DF+TY +
Sbjct: 367 RPGSLPPNLQGSWAPDGNPAWSSDYHIDVNVQMNHWHVEKMGLGGLTDPLWDFMTYTWVP 426
Query: 425 NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDR 484
G++TA + Y SGWV T+I+ +A W+ AW+ H+W+ Y+Y D+
Sbjct: 427 RGTETASLWYNVSGWVAFTNTNIFGH-TAQENDATWSNVAHDIAWMMAHVWDRYDYGRDK 485
Query: 485 DFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSPEHEFIAPDGKLACVSYSST 541
+ YPL++G ASF +D ++ DG L NP SPEH P C +
Sbjct: 486 KWYASVGYPLMKGVASFWVDMMVPDEYFKDGTLVANPCNSPEH---GPT-TFGCAQFQQ- 540
Query: 542 MDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRP-TKIAEDGSIMEWAQDFKDP 600
++ E+F II + + A +++V +S +L P + G I EW D
Sbjct: 541 ----VVWELFDHIIKDWDASGDTDTAFLKRVKESYSKLDPGVHVGSWGQIQEWKMDIDVK 596
Query: 601 EVHHRHLSHLFGLFPGHTIT--IEKNPDLCKAAEKTLQKRG----EEGPGWSITWKTALW 654
HRHLSHL+G +PG+ I+ N + A +L RG + GW W+ A W
Sbjct: 597 NDTHRHLSHLYGFYPGYIISSVYADNKTVMDAVATSLYSRGNGTEDSNTGWEKVWRGACW 656
Query: 655 ARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHP-----PFQIDANFGFTAAV 709
+L + AY+ +K ++ GL + P PFQIDANFG +A
Sbjct: 657 GQLGVTDEAYKELKYTIDM------NFAANGLSVYTTGSWPYEVTLPFQIDANFGLSANA 710
Query: 710 AEMLV--------QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
ML +++ + L PA+P +W+ G VKG RGG TV W D
Sbjct: 711 LAMLYTDLPKKWGDNSIQKVILGPAIP-KEWAGGSVKGGSLRGGGTVDFSWDD 762
>gi|393222468|gb|EJD07952.1| glycoside hydrolase family 95 protein [Fomitiporia mediterranea
MF3/22]
Length = 835
Score = 283 bits (724), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 235/804 (29%), Positives = 372/804 (46%), Gaps = 115/804 (14%)
Query: 17 FNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-------GDYTNPDA 68
++ P + +T +P+GNG L AM GG E+ +LN ++LW+G P G PD
Sbjct: 36 YDAPGQIWTQHYLPLGNGFLAAMTPGGTLQESTQLNIESLWSGGPFADPAYNGGNKQPDE 95
Query: 69 PKALSDVRSLVDSGQYAEATAAS--VKLFGHPADVY---QLLGDIELEFDDSHLKYAEET 123
A++ + + +T + V + P D Y G + +S L +
Sbjct: 96 QAAMAQAMQSIRQSIFNSSTGITDNVDVLMTPIDAYGSYSGAGFLVSTLQNSSLSNISD- 154
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS---L 180
+ R LDL++ + ++ N +F+RE F S+P Q V S + S + +L + L
Sbjct: 155 FGRFLDLDSGLTKTIWNEDNAQFSRETFCSHPTQACVQNTSTAASSGFTQTYALAAASGL 214
Query: 181 LDNHSYVNGNNQIIMEGRC--PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ N + + G PG A P G L+ + + T +
Sbjct: 215 PAPNVTCTDNATLRLNGLVAEPGMAYELLATVAVPPGGT-----LKCTVVPNMDTTDNVV 269
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDS-------KKDPTSESMSALQSIRNLSYS 291
+ + V A ++ V +++D IN D+ DP + + L S SYS
Sbjct: 270 NATITVSNVTSASVVWVGGTNYD---INAGDAVHNFSFRGPDPHDDLVPLLSSASKKSYS 326
Query: 292 DLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE 351
+L + H+ DY+ H S+ L + + ++DT + + + ++ D+ VE
Sbjct: 327 ELLSDHVADYEATLHAFSLDLGQ-----------KADLDT-STDKLINAYTVDKGDVYVE 374
Query: 352 -LLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSE 410
LLF +GR+LL SSSR G ANLQG W D P W + H++IN+EMNYW + NL +
Sbjct: 375 WLLFNYGRHLLASSSR-GILPANLQGKWAVDAFPAWGADYHLDINVEMNYWLAEMTNL-D 432
Query: 411 CQEPLFDFL--TYLSINGSKTAQVNY-LASGWVIHHKT--DIWAKSSADRGKVVWALWPM 465
+PLF+++ TY + G+ TAQV Y + GWV+H + I+ + G+ W +P
Sbjct: 433 VSKPLFNYIAKTY-APRGAYTAQVLYNITQGWVVHTEVMFKIFGYTGMKVGEAEWYDYPE 491
Query: 466 GGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGH---DGYLETNPSTSP 522
AWL ++W+H++YT D + + + YPLL+G A F L+ LI DG L P SP
Sbjct: 492 PNAWLMLNVWDHFDYTNDVAWWKAQGYPLLKGVALFHLEKLIPDEHFLDGTLVVAPCNSP 551
Query: 523 EHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RP 581
E I LAC +I ++ +AI A + +++ + V + ++ +
Sbjct: 552 EQAPI----TLACAH-----SQQLIWQLLNAIEKGAAAAGETDESFLNDVRAKIAQMDKG 602
Query: 582 TKIAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCK----------AA 631
I G + EW D P HRHLSHL GL+PG+ ++ NPD+ K AA
Sbjct: 603 IHIGSWGQLQEWKVDMDSPTDTHRHLSHLVGLYPGYAVS-NYNPDVQKLNYSVNDVRDAA 661
Query: 632 EKTLQKRGE-EGP----GWSITWKTALWARLHDQEHAY---------RMVKRLFNLVDPE 677
+L RG GP GW W+ A WA+ D + Y + LF++ DP
Sbjct: 662 RTSLIHRGNGTGPDADAGWEKVWRAACWAQFADSDMFYHELTYAVDRNFAENLFSIYDPA 721
Query: 678 HEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQ----STLN---DLYLLPALPWD 730
+P FQIDANFG+TAA L+Q ++L+ + +LPALP
Sbjct: 722 DP--------------NPVFQIDANFGYTAAAMNALLQAPDVASLDIPLTVTILPALP-S 766
Query: 731 KWSSGCVKGLKARGGETVSICWKD 754
WS+G + G + RGG + + W+D
Sbjct: 767 AWSTGSILGARVRGGIMLDMSWED 790
>gi|302883112|ref|XP_003040458.1| hypothetical protein NECHADRAFT_122680 [Nectria haematococca mpVI
77-13-4]
gi|256721342|gb|EEU34745.1| hypothetical protein NECHADRAFT_122680 [Nectria haematococca mpVI
77-13-4]
Length = 812
Score = 282 bits (722), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 230/787 (29%), Positives = 357/787 (45%), Gaps = 90/787 (11%)
Query: 29 PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGD---YT--NPDAPK--ALSDVRSLVDS 81
P+GNG L +G E + N D+LW+G P + YT NP K AL +R +
Sbjct: 47 PVGNGILAGTHFGDPGHEKIVFNVDSLWSGGPFENSAYTGGNPTTSKSTALPGIREYI-- 104
Query: 82 GQYAEATAASVKLFG--HPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY 139
+ + T L G + Y++LG++ + + Y Y R LD +T Y
Sbjct: 105 --FDQGTGNVSALLGSGNYYGSYRVLGNLSIIIGHA-TDYTN--YTRSLDPSTGVHTTTY 159
Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRC 199
+V +T F SNP V +++ E + N+ ++L + S N + C
Sbjct: 160 LADSVNYTTTLFCSNPADACVYRVTSDED-LPNINIQFENLAVSSSLANPS--------C 210
Query: 200 --PGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVE---GSDWAVLLL 254
P R D P+G+++ AI + D +S + L + G +++
Sbjct: 211 NHPYTRFRGVTQLGD-PEGMKYEAIARFVDNRDGDGVSCATNGSLTIARSPGFKTVDVII 269
Query: 255 VASSSFDGPFINPSDS----KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
A +++D N + DP + S Y L H++DYQ LF ++
Sbjct: 270 SAGTNYDATKGNAENDYSFRGDDPAEAVQRSTSSGAQQGYDKLLKAHIEDYQSLFGTFTL 329
Query: 311 QLSRSPKDIVTDTC---SEENIDTVPSAE-RVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
L + K +T S + + + R+ DP L LLF + RYLLI+SSR
Sbjct: 330 TLPDAQKSAGHETAVLISNYSSNGIGDPYIRIYYISKSRDPYLESLLFDYSRYLLIASSR 389
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI-N 425
+ ANLQG W E ++P+W S H NIN++MNYW + L + L++++ +
Sbjct: 390 ENSLPANLQGKWTEQMNPSWSSDYHANINIQMNYWAADQTGLGKTSVALWNYMRNTWVPR 449
Query: 426 GSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRD 485
G++TA++ Y A GWV+H++ +I+ + +G WA +P+ AW+ H+W++Y Y
Sbjct: 450 GTETAKLLYDAPGWVVHNEMNIFGHTGM-KGSATWANYPVAAAWMMQHVWDNYEYGRSLT 508
Query: 486 FLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
+L + YPLL+ A F + L E +DG L NP S EH P C Y
Sbjct: 509 WLRQEGYPLLKEVAQFWISQLQEDEFNNDGTLVVNPCNSAEH---GPT-TFGCTHYQQ-- 562
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEWA---QDFK 598
+I +V A +++ + +++ ++ L +L + G I EW
Sbjct: 563 ---LIHQVLEATLNSITYIGEDDQDFTSELKTVLKKLDKGLHYTSWGGIKEWKLPDSAGY 619
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEK----NPDLCKAAEKTLQKRG----EEGPGWSITWK 650
D + HRHLSHL G +PG++I+ + N + A E TL RG ++ GW W+
Sbjct: 620 DTKNTHRHLSHLVGWYPGYSISSFQGGYWNSTVQAAVEATLVARGNGVQDQDTGWGKAWR 679
Query: 651 TALWARLHDQEHAYRMVKRLF-NLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAV 709
A WARL++ AY ++ L N P ++G PPFQIDANFG AV
Sbjct: 680 VACWARLNNTSQAYDELRLLIDNNFAPNGFDMYQG--------QKPPFQIDANFGLGGAV 731
Query: 710 AEMLV----QSTLND-----LYLLPALPWDKWSSGCVKGLKARGGETVSICW-KDGD--- 756
MLV S +N+ + L PA+P +W G VK L+ RGG V W DG
Sbjct: 732 LSMLVVDLPNSYVNEDKTRTIVLGPAIP-PRWGGGNVKNLRLRGGSAVDFEWDSDGKVTH 790
Query: 757 --LHEVG 761
LHE G
Sbjct: 791 ATLHETG 797
>gi|149199701|ref|ZP_01876733.1| hypothetical protein LNTAR_25135 [Lentisphaera araneosa HTCC2155]
gi|149137218|gb|EDM25639.1| hypothetical protein LNTAR_25135 [Lentisphaera araneosa HTCC2155]
Length = 1754
Score = 282 bits (721), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 229/777 (29%), Positives = 360/777 (46%), Gaps = 134/777 (17%)
Query: 29 PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYAEAT 88
PIGNG GA ++G +E +++ + TL + G+Y +
Sbjct: 63 PIGNGYTGANIFGRTDTERIQITDKTLH-----------------------NRGKYNKGG 99
Query: 89 AASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSVGNVEFTR 148
S E++ D H K+++ YRR L+LN A V Y+ V +TR
Sbjct: 100 LTSF---------------AEIKLDFRHHKFSK--YRRSLNLNEGIAHVAYNYRGVNYTR 142
Query: 149 EHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKA 208
E+F+S PD VIV +++ + +LSF + + +G+
Sbjct: 143 EYFASYPDNVIVIRLTADKKAALSFEIRPEIPYLERKERSGS-----------------I 185
Query: 209 NANDDPKGIQFSAIL-------EIKISDDRGTISA-LEDKKLKVEGSDWAVLLLVASSSF 260
+A DD ++ S L +IK+ ++ GT+ A + ++V +D +L+ +++
Sbjct: 186 SAKDDLLTLKGSIALFSCNFDGQIKVLNEGGTLKANAKQGSIEVSKADAVTILIATGTNY 245
Query: 261 ---DGPFINPSDSKKDPT----SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
+ F N S K +P +E + +Q+ +N Y L RHL DYQ LF RV++ L+
Sbjct: 246 RLHEDTFRNTSAKKLNPKEFPHNEVSARIQAAQNRGYEQLKERHLKDYQNLFGRVAVNLN 305
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQVAN 373
P + T E+ K+ +T+ L EL+FQ+GRYLLISSSR + AN
Sbjct: 306 SRPSNDPTHIL----------LEKYKAGKTNN--WLEELMFQYGRYLLISSSREKSLPAN 353
Query: 374 LQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL-TYLSINGSKTAQV 432
LQG W++D W NIN++MNYW S+ NL+EC + +F YL I ++
Sbjct: 354 LQGAWSQDYYTPWSGGFWHNINVQMNYWGSMSTNLAECFQSYTNFYKAYLPI--AREHAT 411
Query: 433 NYLA------------SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNY 480
+Y+ +GW+I + + SA G + L ++Y +
Sbjct: 412 DYVQKYNPSQVTKGGDNGWIIGTGANAYYIPSAGGHSGP-----GTGGFTAKLLMDYYLF 466
Query: 481 TMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD------GKLA 534
T D+ +LE+ AYP + + F LI H L PS SPE + P+ GKL
Sbjct: 467 TQDKQYLEEVAYPAMLSLSKFYSKVLIP-HGDKLLVEPSASPE-QLAKPEQVKNMPGKLK 524
Query: 535 CVSY----SSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSI 590
Y T D + E F+ ++ A+ L +ED ++ + + + +L P I DG I
Sbjct: 525 GGKYYVTAGCTFDQGFVWESFADTLTLADAL-GSEDPFLDTIREQITKLDPILIGADGQI 583
Query: 591 MEWAQDFKDPEV---HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSI 647
E+ ++ ++ HRH+SHL LFPG I+ + D +AA KTL RG++ GW++
Sbjct: 584 KEYREENNYSDIGDKKHRHISHLCPLFPGTLIS--QKSDWLQAASKTLDLRGDKTTGWAL 641
Query: 648 TWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTA 707
+ ARL + E A+++ +R E+ + NL+ HPPFQID + G A
Sbjct: 642 AHRMNSRARLGEGEKAHKVYQRFIK------ERTVQ-----NLWTLHPPFQIDGSLGTMA 690
Query: 708 AVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
VAEML+QS + + +LPALP W G GL ARG +S W E I S
Sbjct: 691 GVAEMLLQSHEDTIKILPALP-KAWEDGHFDGLVARGNFAISAKWNKVRASEFSIES 746
>gi|358390062|gb|EHK39468.1| glycoside hydrolase family 95 protein [Trichoderma atroviride IMI
206040]
Length = 797
Score = 281 bits (719), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 225/793 (28%), Positives = 366/793 (46%), Gaps = 87/793 (10%)
Query: 14 KITFNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-----GDYTNPDA 68
++ + P+ F ++ +GNGR A V ET LNE T W+G G P+
Sbjct: 6 RLYYTTPSTSFPTSLALGNGRFAASVLSSPEHETFLLNEVTFWSGEARNAGEGLAERPED 65
Query: 69 PKA-LSDVRSLVDSGQYAEATAASVKLFGHPADVYQL---LGDIELEFDDSHLKYAEETY 124
PKA L ++ +G YA+ + K + + +G +++ + +
Sbjct: 66 PKAELRKTQNCYLNGDYAQGKKRAEKYLESKKNNFGTNLGVGKLDIAVTGHGNPADIQDF 125
Query: 125 RRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNH 184
REL + A +Y V ++ R F S+P QV+V + G + L VS
Sbjct: 126 ERELRFDEAITETRYKVNGHQYKRRAFLSHPHQVLVIQFDGDDLSGLEVAVS-------- 177
Query: 185 SYVNGNNQIIMEGRCPGKRIPPKANA-----NDDPKGIQFSAILEIKISDDRGTISALED 239
V G N+ R+ A A +D G++ I+ K+++ + +D
Sbjct: 178 --VQGENEAFTSKVNSESRLEFDAQALETVHSDGTCGVKGFGIVAAKVNEGK---VEQKD 232
Query: 240 KKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLD 299
KL + + + ++ ++ +S+ + ++ ++ + L DL HL
Sbjct: 233 GKLTISAQKSITIFVAFNTDYN-------ESRNEWRERTLLQIEDVLQLPIDDLLKEHLG 285
Query: 300 DYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD--EDPSLVELLFQFG 357
DYQ L+ R+ I+L PK S N +P+ +R +F++ DP + L F +
Sbjct: 286 DYQPLYRRMDIRLG--PK-------SNPN-SNIPTDQRRGNFESSGYADPGMFALYFHYS 335
Query: 358 RYLLISSSRPGTQVA-NLQGIWN--EDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEP 414
RYL I+ +R + + +LQG+WN E W H++IN +MNY+ L L++ +P
Sbjct: 336 RYLTIAGTREDSPLPLHLQGLWNDGEACKMGWSCDYHLDINTQMNYFAILNSGLADLMKP 395
Query: 415 LFDFLTYLSINGSKTAQVNYLA-SGWVIHHKTDIWAKSSADRG-KVVWALWPMGGAWLCT 472
L+ ++ L++ G +TA+ Y + GWV H ++ W + D G ++ + L GG W+
Sbjct: 396 LYKYIFKLAVKGQQTARTCYGSREGWVAHVFSNAWGFT--DPGWEISYGLNVTGGLWMAA 453
Query: 473 HLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEG-HDGYLETNPSTSPEHEF--IAP 529
L E Y YT+D + +PLL G F LD++IE G+L T PS SPE+ F +
Sbjct: 454 PLIEMYEYTLDDGLMMTNLWPLLFGATKFWLDYMIEDPKTGWLLTGPSVSPENSFFVVNE 513
Query: 530 DG--KLACVSYSSTMDMAIIREVFSAIISAAEVLEKNE----DALVEKVLKSLPRLRPTK 583
DG + S T+D+ ++R++F+ A L+ D +++ K L +L P +
Sbjct: 514 DGTKEEHSADLSPTLDVVLLRDLFAFCEYFAGKLKTMTGFPWDEDIKEYQKVLAKLPPLQ 573
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP 643
I ++G + EW D+++ + +HRHLSH L I+ PDL +A +L++R
Sbjct: 574 IGKNGQLQEWLHDYEEAQPYHRHLSHTMALCRSALISARHQPDLAEAVRVSLERRQGRDD 633
Query: 644 GWSITWKTAL----WARLHDQEHAYRMVKRLF------NLVDPEHEKHFEGGLYSNLFAA 693
I + AL +ARL D E A V L NL+ + K G N+F
Sbjct: 634 LEDIEFTAALFALNYARLGDAEKAVAQVGHLVGELSFDNLLS--YSKPGVAGAEKNIFV- 690
Query: 694 HPPFQIDANFGFTAAVAEMLVQSTLNDLY------LLPALPWDKWSSGCVKGLKARGGET 747
ID NFG AA+AEML++S + L LLPALP WS G V G++ RGG
Sbjct: 691 -----IDGNFGGAAAIAEMLIRSIIPRLGRPVEIDLLPALP-AAWSEGSVSGMRIRGGLE 744
Query: 748 VSICWKDGDLHEV 760
S W G L V
Sbjct: 745 ASFAWSKGKLEGV 757
>gi|225019811|ref|ZP_03709003.1| hypothetical protein CLOSTMETH_03764, partial [Clostridium
methylpentosum DSM 5476]
gi|224947447|gb|EEG28656.1| hypothetical protein CLOSTMETH_03764 [Clostridium methylpentosum
DSM 5476]
Length = 1411
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 248/853 (29%), Positives = 386/853 (45%), Gaps = 171/853 (20%)
Query: 4 AESTSTTNPLKITFNGPAKHFTD------AIPIGNGRLGAMVWGGVPSETLKLNEDTLWT 57
AE + LK+ ++ PA +D +IP+GNG +G ++GGV +E +++ E++L
Sbjct: 38 AEPLAAAKQLKLWYDEPAPS-SDIGWREWSIPMGNGYMGVNLFGGVQTERIQITENSL-- 94
Query: 58 GVPGDYTNPDAPKALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHL 117
+ + SV + ++ Y I+ E D
Sbjct: 95 ----------------------------QDSNTSVGGLNNFSETY-----IDFEHSDP-- 119
Query: 118 KYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNV-- 175
+ Y+REL+L+ A V Y V + R++F+ PD+V+V ++S SE+G LSF +
Sbjct: 120 ----QNYQRELNLSEGVASVVYDSDGVRYERQYFTDYPDKVMVIRLSASEAGKLSFTLRP 175
Query: 176 SLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANAND-------DPKGIQFSAILEIKIS 228
++ L D H + G GK KA + + ++F + K+
Sbjct: 176 TIPYLCDYH---------VEPGDNRGKHGTVKAEGDTITLAGAMEYYNVEFEG--QYKVL 224
Query: 229 DDRGTISALEDKK-----LKVEGSDWAVLLLVASSSFD-GPFINPSDSKKD-------PT 275
GT++A D+ + V+ +D AV+L+ ++++ + ++++ D P
Sbjct: 225 PTGGTMTAQNDQNGDNGTISVQNADSAVILIGIGTNYELKSSVFTANNRLDKLKGNAHPH 284
Query: 276 SESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSA 335
++ +Q SY +L H +DY+ LF RVS+ + TD
Sbjct: 285 AKVTKIIQDASAKSYDELLASHQEDYKGLFDRVSVDFGGQMPTVTTD------------- 331
Query: 336 ERVKSFQTDE-DPSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNI 394
E +K++Q + DP L EL +QFGRY+LI SSR G NLQG+WN P W S NI
Sbjct: 332 ELLKNYQNGQSDPYLEELFYQFGRYMLICSSRKGALPPNLQGVWNVFNDPPWRSGYWHNI 391
Query: 395 NLEMNYWQSLPCNLSECQEPLFDFL-TYLSI------------NGSKTAQVNYLASGWVI 441
NL+MNYW + NL E E D+ YL N S +VN +GW +
Sbjct: 392 NLQMNYWPAFTGNLPELFEAYADYQKAYLEKAEQYAVSNIQKNNPSALDKVNTKENGWAL 451
Query: 442 HHKTDIW----AKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEG 497
+ T W + S++ G GA+ W++Y+YT D LE AYP + G
Sbjct: 452 GNST--WPYNISGSASHSGFGT-------GAFTSIMFWDYYDYTRDASVLEDTAYPAVSG 502
Query: 498 CASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISA 557
A F L +++ DGYL +PS SPE++ K ++ D +I E + A
Sbjct: 503 MAKF-LSKIVQPIDGYLLASPSYSPENQHNGGSYKTVGCAF----DQQMIYENHLDTLKA 557
Query: 558 AEVL---EKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD--FKD-PEVHHRHLSHLF 611
A+ L ++E AL + + LP L P ++ G I E+ ++ + D E HRH+S L
Sbjct: 558 ADALGLTAEDEPALA-TLEQQLPLLDPVQVGASGQIKEYREEKFYGDIGEYDHRHISQLV 616
Query: 612 GLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAYRMVKRLF 671
G +PG T+ P A + +LQ RG+ GWS +TA+WAR+ + + AYR
Sbjct: 617 GAYPG-TMINSSTPAWQDAVKVSLQSRGDGSKGWSKAHRTAVWARVFEGDEAYRT----- 670
Query: 672 NLVDPEHEKHFEGGLYSNLFAAH--------PPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
++ +NLF H FQ D NFG TA V+EML+QS L
Sbjct: 671 ------YQLQLRTHTMNNLFNDHNGSKNSSSKLFQCDGNFGATAGVSEMLLQSHEGFLAP 724
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTS 783
LPA+P W +G +GL ARG VS W +G + F+ L G S
Sbjct: 725 LPAMP-QAWDTGSYRGLLARGNFEVSADWAEGQATK--------------FEILSKSGES 769
Query: 784 VKV---NLSAGKI 793
KV NL++ K+
Sbjct: 770 CKVKYDNLASAKL 782
>gi|307707033|ref|ZP_07643830.1| alpha-fucosidase [Streptococcus mitis SK321]
gi|307617559|gb|EFN96729.1| alpha-fucosidase [Streptococcus mitis SK321]
Length = 539
Score = 279 bits (714), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 183/521 (35%), Positives = 269/521 (51%), Gaps = 62/521 (11%)
Query: 266 NPSDS---KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTD 322
NP+ + K D + L + + Y+ L +RH+ DYQ LF RV + L
Sbjct: 10 NPASNYRKKIDLEQQVKDLLDTAKEKGYAQLKSRHIQDYQALFQRVQLDLG--------- 60
Query: 323 TCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR--PGTQVANLQGIWNE 380
++D + + +K+++ E +L EL FQ+GRYLLISSSR P ANLQG+WN
Sbjct: 61 ----ADVDASTTDDLLKNYKPQEGQALEELFFQYGRYLLISSSRDCPDALPANLQGVWNA 116
Query: 381 DLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQVNYLA---- 436
+P W+S H+NINL+MNYW S NL E P+ +++ L + G + A Y
Sbjct: 117 VDNPPWNSDYHLNINLQMNYWPSYVTNLLETAFPVINYIDDLRVYG-RLAAARYAGIVSQ 175
Query: 437 ----SGWVIHHKTDI--WAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKR 490
+GW++H + W D W P AW+ ++E Y++ D+D+L ++
Sbjct: 176 EGEENGWLVHTQATPFGWTAPGWD---YYWGWSPAANAWMMQTVYEAYSFYRDQDYLREK 232
Query: 491 AYPLLEGCASFLLDWLIEGHDGY-LETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIRE 549
YP+L F D+L E H ++PS SPEH +S +T D +++ +
Sbjct: 233 IYPMLRETVRFWNDFLHEDHQAQRWVSSPSYSPEH---------GPISIGNTYDQSLLWQ 283
Query: 550 VFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEW----AQDFKDPEV--H 603
+F I AA+ L +E AL+ +V + L P +I + G I EW Q F++ +V
Sbjct: 284 LFHDFIQAAQELGLDE-ALLTEVKEKFDLLNPLQITQSGRIREWYEEEEQYFQNEKVEAQ 342
Query: 604 HRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHA 663
HRH SHL GL+PG+ + K + +AA +L RG+ G GWS K LWARL D A
Sbjct: 343 HRHASHLVGLYPGNLFSY-KGQEYLEAARASLNDRGDGGTGWSKANKINLWARLGDGNRA 401
Query: 664 YRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYL 723
++++ + + NL+ +HPPFQID NFG T+ +AEML+QS L
Sbjct: 402 HKLLA-----------EQLKSSTLPNLWCSHPPFQIDGNFGATSGMAEMLLQSHTAYLVP 450
Query: 724 LPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
L ALP D WS+G V GL ARG VS+ W D L ++ I S
Sbjct: 451 LAALP-DAWSTGSVSGLMARGHFEVSMSWADKKLLQLTILS 490
>gi|346725241|ref|YP_004851910.1| hypothetical protein XACM_2350 [Xanthomonas axonopodis pv.
citrumelo F1]
gi|346649988|gb|AEO42612.1| hypothetical protein XACM_2350 [Xanthomonas axonopodis pv.
citrumelo F1]
Length = 803
Score = 278 bits (710), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 234/795 (29%), Positives = 360/795 (45%), Gaps = 119/795 (14%)
Query: 13 LKITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
L++ + PA + + +PIGNGRLGA+ G ETL ++E +LW+G
Sbjct: 57 LRLPYAAPAADAQLLREGLPIGNGRLGALCGGAPACETLFVSEGSLWSG----------- 105
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+L D GQ+A + + FG + LL + +E + H + Y+RELD
Sbjct: 106 ---GSNAALQDDGQFAY----TKEDFGS----FMLLAKLFVELE-GHAQAQVSDYQRELD 153
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
++ RV+Y +G +TR F+S+PD IV ++ +GS + L +D H+
Sbjct: 154 MSNGYVRVRYRIGETRYTRTLFASHPDAAIVLRLDCEGAGSHRGRIRL---IDTHAGA-- 208
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
GR G A D+ G++++A L + D R D L+
Sbjct: 209 -------GRADGDAGLRFAGQLDN--GLRYAAALRVHSDDGRLETG---DGLLQFRDCRG 256
Query: 250 AVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
++L + + DG D +DP + + Q+ ++ + L H+ D++ LF
Sbjct: 257 LTIVLCGDTDYAADGAR-GWRDPTRDPLARARHRAQAAASVPAALLLDTHVADHRALFDT 315
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
+ ++L +S ++ ++T + + DP L QFGRYL I++SR
Sbjct: 316 LQVELGQSSD-------AQRGLETWQRIQARAAAPALPDPELEVAYLQFGRYLTIAASRD 368
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G NLQG+W E+ P W S H ++NL+MNYW + P L C + L + + +
Sbjct: 369 GLPT-NLQGLWLENNEPPWMSDYHSDVNLQMNYWLADPSGLGTCVDALTRYCLAQLPSWT 427
Query: 428 KTAQVNY------------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
+ Q ++ +GW + A S+ G W P G AWLC LW
Sbjct: 428 RITQAHFNDPRNRFRNTSGKIAGWTV-------AISTNPFGGNGWYWHPAGNAWLCDSLW 480
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASF----LLDWLIEGHDGY----LETNPSTSPEHEFI 527
+HY +T +RD L R YPLL+G F L+ + DG L + SPEH
Sbjct: 481 QHYEFTQNRDDL-TRIYPLLKGACQFWQARLIAMEVTDADGRTRQCLVDDHDWSPEH--- 536
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE- 586
P+ ++Y+ + + +F A+ +L ++ A V RL +I+
Sbjct: 537 GPENARG-IAYAQEL----VWTLFGQYRQASALLGRDA-AYAATVATLQQRLYLPEISPL 590
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL-CKAAEKTLQKRGEEGPGW 645
G + EW E HHRHLS L GLFPGH + + P +AA + L+ RG + GW
Sbjct: 591 SGQLQEWMSPTDLGEAHHRHLSPLMGLFPGHRLHPDLGPPAQVEAARRLLEARGMQSFGW 650
Query: 646 SITWKTALWARLHDQEHAYRMV---------------KRLFNLVD-PEHEKHFEGGLYSN 689
+ W+ WARL D E AY +V LF++ D +H GG+
Sbjct: 651 ACAWRALCWARLGDAERAYALVLTNLKPSIGHSNGTAPNLFDIYDLSQHGDPTLGGV--- 707
Query: 690 LFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
FQIDANFG AA+ EML+ S + LLPALP + G V GL ARGG TV
Sbjct: 708 -------FQIDANFGTPAAMLEMLLYSRPGQITLLPALPKAWAAQGRVTGLGARGGFTVD 760
Query: 750 ICWKDGDLHEVGIYS 764
+ W++G +V + S
Sbjct: 761 MAWRNGVPTQVSVRS 775
>gi|440695005|ref|ZP_20877568.1| hypothetical protein STRTUCAR8_09907 [Streptomyces turgidiscabies
Car8]
gi|440282898|gb|ELP70288.1| hypothetical protein STRTUCAR8_09907 [Streptomyces turgidiscabies
Car8]
Length = 902
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 210/686 (30%), Positives = 312/686 (45%), Gaps = 83/686 (12%)
Query: 124 YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDN 183
Y+R LD ++ RE F+S V+V + + LS +SL S +
Sbjct: 274 YQRALDFVEGVHVTRFGAPRHRVLREAFASRSADVMVFRYTSDSDQGLSGAISLTSGQEG 333
Query: 184 H-SYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL 242
+ V+ + ++I G G++ + + + +D G S + L
Sbjct: 334 APTTVDADARLIAFRGVMGN-------------GLKHACTIRVAHAD--GAFST-DGSVL 377
Query: 243 KVEGSDWAVLLLVASSSFDGPFINPSDSKK--DPTSESMSALQSIRNLSYSDLYTRHLDD 300
+ G LLL A + + ++ + + DP AL SY L H
Sbjct: 378 RFSGCRTLTLLLDARTDYR---LDAAAGWRGADPEPAIGRALAKAAARSYDKLRAEHTAA 434
Query: 301 YQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRY 359
+ L +RVS++ S +V+ +P+ R+ + +DP+L + +F +GRY
Sbjct: 435 TRALMNRVSVRWGTSDTAVVS----------LPTQARLARYAAGGQDPTLEQTMFDYGRY 484
Query: 360 LLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL 419
LLISSSRP ANLQG+WN+ +P W S H NIN++MNYW + NL EC E L +F+
Sbjct: 485 LLISSSRPNGLPANLQGLWNDSNAPAWASDYHTNINIQMNYWGAETTNLPECHEALVEFI 544
Query: 420 TYLSINGSKTAQVNYL---ASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWE 476
+++ S+ A N + GW I+ G W AW HL+E
Sbjct: 545 RQVAVP-SRVATRNAFGEDSRGWTARTSQSIF-------GGNAWEWNTTASAWYAQHLYE 596
Query: 477 HYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACV 536
H+ +T D+ +L A+P+++ F L E DG L SPEH DG +
Sbjct: 597 HWAFTQDKVYLRTVAHPMIKEICEFWEGHLKEREDGLLVAPNGWSPEHG-PREDGVM--- 652
Query: 537 SYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQD 596
D II ++F + VL+ ++ A KV RL P +I + G + EW +D
Sbjct: 653 -----YDQQIIWDLFQNYLDCEAVLD-SDPAYRAKVTDLQSRLAPNRIGKWGQLQEWQED 706
Query: 597 FKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG------------ 644
P HRH SHLF ++PG IT + PDL AA +L+ R E G
Sbjct: 707 IDSPTDIHRHTSHLFAVYPGRQITPD-TPDLAAAALVSLKARCGEKEGVPFTAATVSGDS 765
Query: 645 ---WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDA 701
W+ W+ AL+ARL D + A M++ L NLF HPPFQ+D
Sbjct: 766 RRSWTWPWRAALFARLGDGQRAQVMLRGLLTY-----------NTLPNLFCNHPPFQMDG 814
Query: 702 NFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVG 761
NFG T AVAEML+QS L+LLPALP D SG GL+ARGG VS W++G +
Sbjct: 815 NFGITGAVAEMLLQSHNGVLHLLPALPDDWRPSGSFTGLRARGGYEVSCEWRNGKVTSYR 874
Query: 762 IYSNYSNNDHDSFKTLHYRGTSVKVN 787
I ++ +++ + T+ G KV
Sbjct: 875 IVADRASSRREV--TVRVNGVDRKVK 898
>gi|78048096|ref|YP_364271.1| hypothetical protein XCV2540 [Xanthomonas campestris pv.
vesicatoria str. 85-10]
gi|78036526|emb|CAJ24217.1| conserved hypothetical protein [Xanthomonas campestris pv.
vesicatoria str. 85-10]
Length = 803
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 234/795 (29%), Positives = 363/795 (45%), Gaps = 119/795 (14%)
Query: 13 LKITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
L++ + PA + + +PIGNGRLGA+ G ETL ++E +LW+G
Sbjct: 57 LRLPYAAPAADAQLLREGLPIGNGRLGALCGGAPACETLFVSEGSLWSG----------- 105
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
L D GQ+A + + FG + LL + +E + H + Y+RELD
Sbjct: 106 ---GSNAVLQDDGQFAY----TKEEFGS----FMLLAKLFVELE-GHAQAQVFDYQRELD 153
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
++ RV+Y +G+ +TR F+S+PD IV ++ +GS + L +D H+
Sbjct: 154 MSNGCVRVRYRIGDTRYTRTLFASHPDAAIVLRLDCEGAGSHRGRIRL---IDTHAGA-- 208
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
GR G A D+ G++++A L ++ D G++ D L+
Sbjct: 209 -------GRADGDAGLRFAGQLDN--GLRYAAAL--RVHSDDGSLET-GDGLLQFRDCRG 256
Query: 250 AVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
++L + + DG D +DP + + Q+ ++ + L H+ D++ LF
Sbjct: 257 LTIVLCGDTDYAADGAR-GWRDPTRDPLARARHRAQAAASVPAALLLDTHVADHRALFDT 315
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
+ ++L +S ++ ++T + + DP L QFGRYL I++SR
Sbjct: 316 LQVELGQSSD-------AQRGLETWQRIQARAAAPALPDPELEVAYLQFGRYLTIAASRD 368
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G NLQG+W E+ P W S H ++NL+MNYW + P L C + L + + +
Sbjct: 369 GLPT-NLQGLWLENNEPPWMSDYHSDVNLQMNYWLADPSGLGTCVDALTRYCLAQLPSWT 427
Query: 428 KTAQVNY------------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
+ Q ++ +GW + A S+ G W P G AWLC LW
Sbjct: 428 RITQAHFNDPRNRFRNTSGKIAGWTV-------AISTNPFGGNGWYWHPAGNAWLCDSLW 480
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASF----LLDWLIEGHDGY----LETNPSTSPEHEFI 527
+HY +T +RD L R YPLL+G F L+ + DG L + SPEH
Sbjct: 481 QHYEFTQNRDDL-TRIYPLLKGACQFWQAPLIAMEVTDADGRTRQCLVDDHDWSPEH--- 536
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE- 586
P+ ++Y+ + + +F A+ +L ++ A V RL +I+
Sbjct: 537 GPENARG-IAYAQEL----VWTLFGQYRQASALLGRDA-AYAATVATLQQRLYLPEISPL 590
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL-CKAAEKTLQKRGEEGPGW 645
G + EW E HHRHLS L GLFPGH + + P +AA + L+ RG + GW
Sbjct: 591 SGQLQEWMSPTDLGEAHHRHLSPLMGLFPGHRLHPDLGPPAQVEAARRLLEARGMQSFGW 650
Query: 646 SITWKTALWARLHDQEHAYRMV---------------KRLFNLVD-PEHEKHFEGGLYSN 689
+ W+ WARL D E AY +V LF++ D +H GG+
Sbjct: 651 ACAWRALCWARLGDAERAYALVLTNLKPSIGHSNGTAPNLFDIYDLSQHGDPTLGGV--- 707
Query: 690 LFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
FQIDANFG AA+ EML+ S + LLPALP + G V GL ARGG TV
Sbjct: 708 -------FQIDANFGTPAAMLEMLLYSRPGQITLLPALPKAWAAQGRVTGLGARGGFTVD 760
Query: 750 ICWKDGDLHEVGIYS 764
+ W++G +V + S
Sbjct: 761 MAWRNGVPTQVSVRS 775
>gi|325926465|ref|ZP_08187785.1| hypothetical protein XPE_1772 [Xanthomonas perforans 91-118]
gi|325543114|gb|EGD14557.1| hypothetical protein XPE_1772 [Xanthomonas perforans 91-118]
Length = 754
Score = 276 bits (707), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 234/795 (29%), Positives = 362/795 (45%), Gaps = 119/795 (14%)
Query: 13 LKITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
L++ + PA + + +PIGNGRLGA+ G ETL ++E +LW+G
Sbjct: 8 LRLPYAAPAADAQLLREGLPIGNGRLGALCGGAPACETLFVSEGSLWSG----------- 56
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
+L D GQ+A + + FG + LL + +E + H + Y+RELD
Sbjct: 57 ---GSNAALQDDGQFAY----TKEDFGS----FMLLAKLFVELE-GHAQAQVSDYQRELD 104
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
++ RV+Y +G +TR F+S+PD IV ++ +GS + L +D H+
Sbjct: 105 MSNGCVRVRYRIGETRYTRTLFASHPDAAIVLRLDCEGAGSHRGRIRL---IDTHAGA-- 159
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
GR G A D+ G++++A L ++ D G++ D L+
Sbjct: 160 -------GRADGDAGLRFAGQLDN--GLRYAAAL--RVHSDDGSLET-GDGLLQFRDCRG 207
Query: 250 AVLLLVASSSF--DGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHR 307
++L + + DG D +DP + + Q+ ++ + L H+ D++ LF
Sbjct: 208 LTIVLCGDTDYAADGAR-GWRDPTRDPLARARHRAQAAASVPAALLLDTHVADHRALFDT 266
Query: 308 VSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRP 367
+ ++L +S ++ ++T + + DP L QFGRYL I++SR
Sbjct: 267 LQVELGQSSD-------AQRGLETWQRIQARAAAPALPDPELEVAYLQFGRYLTIAASRD 319
Query: 368 GTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
G NLQG+W E+ P W S H ++NL+MNYW + P L C + L + + +
Sbjct: 320 GLPT-NLQGLWLENNDPPWMSDYHSDVNLQMNYWLADPSGLGNCVDALTRYCLAQLPSWT 378
Query: 428 KTAQVNY------------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
+ Q ++ +GW + A S+ G W P G AWLC LW
Sbjct: 379 RITQTHFNDPRNRFRNTSGKIAGWTV-------AISTNPFGGNGWYWHPAGNAWLCDSLW 431
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASF----LLDWLIEGHDGY----LETNPSTSPEHEFI 527
+HY +T DR L R YPLL+G F L+ + DG+ L + SPEH
Sbjct: 432 QHYEFTQDRGQL-TRIYPLLKGACEFWQARLIAMEVTDADGHTRQCLVDDHDWSPEH--- 487
Query: 528 APDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAE- 586
P+ ++Y+ + + +F A+ +L ++ A V RL +I+
Sbjct: 488 GPENARG-IAYAQEL----VWTLFGQYRQASALLGRDA-AYAATVATLQQRLYLPEISPL 541
Query: 587 DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDL-CKAAEKTLQKRGEEGPGW 645
G + EW E HHRHLS L GLFP H + + P +AA K L+ RG + GW
Sbjct: 542 SGQLQEWMSPTDLGEAHHRHLSPLMGLFPCHRLHPDLGPPAQVEAARKLLEARGMQSFGW 601
Query: 646 SITWKTALWARLHDQEHAYRMV---------------KRLFNLVD-PEHEKHFEGGLYSN 689
+ W+ WARL D E AY +V LF++ D +H GG+
Sbjct: 602 ACAWRALCWARLGDAERAYALVLTNLKSSIGHSNGTAPNLFDIYDLSQHGDPTLGGV--- 658
Query: 690 LFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVS 749
FQIDANFG AA+ EML+ S + LLPALP + G V GL ARGG TV
Sbjct: 659 -------FQIDANFGTPAAMLEMLLYSRPGQITLLPALPKAWAAQGRVTGLGARGGFTVD 711
Query: 750 ICWKDGDLHEVGIYS 764
+ W++G +V + S
Sbjct: 712 MAWRNGVPTQVSVRS 726
>gi|115384756|ref|XP_001208925.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
gi|114196617|gb|EAU38317.1| conserved hypothetical protein [Aspergillus terreus NIH2624]
Length = 1276
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 222/769 (28%), Positives = 346/769 (44%), Gaps = 120/769 (15%)
Query: 24 FTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQ 83
T A P+GNGRLG + G G+ N A +AL +R +
Sbjct: 556 ITTAFPLGNGRLGEKAYAG------------------GNPNNCRA-EALPGIRDFI---- 592
Query: 84 YAEATAASVKLFGH-PA-DVYQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKYSV 141
+ T L G P+ YQ+LG++ ++ + YRR LD+ + ++V
Sbjct: 593 FQNGTGNVSALLGEFPSYGSYQVLGNLTIDLGELE---NVRGYRRRLDMKSGVYTDGFAV 649
Query: 142 GNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPG 201
GN + R F S PDQV V IS + + S + L+ NQ++ P
Sbjct: 650 GNALYNRTAFCSYPDQVCVYHISSANASLPSVEIGLE------------NQVV----SPA 693
Query: 202 KRIPPKANA-----NDDPK-GIQFSA----ILEIKISDD--RGTISALEDKKLKVEGSDW 249
+ AN+ P G+ ++A ++ K S D GT+ + + +V
Sbjct: 694 PNVTCHANSISLYGQTFPTIGMIYNARATVVVPGKSSGDFCAGTVVRVPSGQKEV----- 748
Query: 250 AVLLLVASSSFDGPFINP----SDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
++L A +++D N S DP + + SY+ L + H+ D++ +
Sbjct: 749 -YIVLAADTNYDASKGNAAAKFSFRGSDPYEKVLQTASKAAKKSYAQLKSSHVKDFRAIS 807
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSS 365
++ L D+ + P+ E + ++ DP + LLF +GRYL +SSS
Sbjct: 808 DGFTLTLPDR-----RDSAGK------PTTELIAAYTQPGDPFIEGLLFDYGRYLFMSSS 856
Query: 366 RPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFL--TYLS 423
R G+ NLQG+W E SP W + H NINL+MN+W L E EPL+ ++ T+L
Sbjct: 857 RAGSLPPNLQGLWTEQASPAWSADYHANINLQMNHWAVEQVGLGELTEPLWKYMADTWLP 916
Query: 424 INGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G +TA++ Y GWV H + +++ +A + WA +P AW+ H+W+H++YT D
Sbjct: 917 -RGQETARLLYGGEGWVTHDEMNVFGH-TAMKNDAQWANYPAVNAWMSQHVWDHFDYTQD 974
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
+ + YP+L+G A F L L++ +DG NP SPEH P C +Y
Sbjct: 975 AAWYQSMGYPILKGAAQFWLSQLVQDEHFNDGTWVVNPCNSPEH---GPT-TFGCTNYQQ 1030
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKS-LPRL-RPTKIAEDGSIMEWAQDFK 598
+I E+F ++ ++D L + + S L I G I EW D
Sbjct: 1031 -----LIWELFDHVLRGWTA-SGDKDRLFRRAIASKFAALDNGIHIGSWGQIQEWKLDLD 1084
Query: 599 DPEVHHRHLSHLFGLFPGHTITIEKN--PDLCKAAEKTLQKRG----EEGPGWSITWKTA 652
P HRHLS+L +PG+ + N ++ +A TL+ RG ++ GW W++A
Sbjct: 1085 TPNDTHRHLSNLHAWYPGYAMHALNNQYTNVSQAVATTLRSRGDGVADQNTGWGKMWRSA 1144
Query: 653 LWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEM 712
WA L+ E AY M L GL +++ PPFQIDANFG AV +
Sbjct: 1145 CWALLNHTETAYSM------LTLAVQNNFAANGL--SMYTGAPPFQIDANFGIMGAVTSL 1196
Query: 713 LV---------QSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICW 752
LV Q+ + + L PA+P W G V+GL+ RGG +V W
Sbjct: 1197 LVRDLDRPASDQTKVQRVVLGPAIP-SAWGGGSVEGLRLRGGGSVRFGW 1244
>gi|386346135|ref|YP_006044384.1| alpha-L-fucosidase [Spirochaeta thermophila DSM 6578]
gi|339411102|gb|AEJ60667.1| alpha-L-fucosidase 2 precursor [Spirochaeta thermophila DSM 6578]
Length = 784
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 225/758 (29%), Positives = 343/758 (45%), Gaps = 93/758 (12%)
Query: 20 PAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLV 79
PA + D P+GNGRL A+V GGV E + LN + LW G D + + VR
Sbjct: 13 PAGVWRDGYPVGNGRLAALVLGGVGEERIHLNHEWLWRGWYRDRVAEERAHLVGWVREAF 72
Query: 80 DSGQYAEATAASVKLFGHPADV---------YQLLGDIELEFDDSHLKYAEETYRRELDL 130
+G + E T + + FG V YQ G + L ++ E YRRELDL
Sbjct: 73 FTGDWEEGTRRANEAFGGGGGVSGRTCRVGAYQPAGTLVLRWEGME----EAEYRRELDL 128
Query: 131 NTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGN 190
RV+ ++E P + ++SG G + + ++ G+
Sbjct: 129 EEGVVRVRRGE-SLEEVMAVLGGGP---VGVRVSGWGKGWVGLGREVQEGVEVRVEC-GD 183
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFS--AILEIKISDDRGTISALEDKKLKVEGSD 248
++ +EGR +GI + A++E + + G +E +++ V
Sbjct: 184 GRVRLEGRFE--------------EGIVWEVLAVVEGGVCREEGKGVWVEGEEVVVWVVV 229
Query: 249 WAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRV 308
+ S PS + E A++ RH++ Y +LF RV
Sbjct: 230 DVWEEVGGSRRR-----LPSYGPPEVPGEGWEAVRR-----------RHVEAYGQLFGRV 273
Query: 309 SIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPG 368
+ + EE + +P+ R + D DP L LLF +GRYLLISSS PG
Sbjct: 274 RLVVE-----------GEEPL--LPTGRR----RGDPDPLLPVLLFDYGRYLLISSSAPG 316
Query: 369 TQV-ANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGS 427
+ ANLQG WN L P WD+ H++INL+MNYW + L EC PL ++ + +
Sbjct: 317 CDLPANLQGKWNPLLEPPWDADYHMDINLQMNYWLAEGAGLGECVTPLVRYVVRMMPSAR 376
Query: 428 KTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFL 487
+ A+ + G +D WA+++ + W +W AW+ HL Y Y+ D FL
Sbjct: 377 EAARRLFGCRGIWFPLTSDAWARATPE--AYGWDVWVGAAAWMAQHLVWRYLYSGDEGFL 434
Query: 488 EKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAII 547
+ YP LE A F D+L+E +G L+ PS SPEH + +G + SS +D+ ++
Sbjct: 435 RETVYPFLEEVALFFEDFLVEDGEGVLQVVPSQSPEHRWEGLEGFPVGLCVSSAVDVQLV 494
Query: 548 REVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
R V + L +E + ++ L RLR + DG ++EW ++ + E HRHL
Sbjct: 495 RWVLRMAVELGGRL-GDEVSRWREMEGRLARLR---VGRDGVLLEWGRELPEAEPGHRHL 550
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEG---PGWSITWKTALWARLHDQEHAY 664
S L+G FPG + ++ P++ + A + L++R G GWS L A L E A+
Sbjct: 551 SPLWGFFPGDVLW-DEAPEVREGAVRLLERRVRHGCGRTGWSRAHLACLCAALGRGEDAW 609
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPP--FQIDANFGFTAAVAEMLVQSTLND-L 721
V L E +L HP FQ+DA G AAV ML+Q + L
Sbjct: 610 EHVCVLLREFTTE-----------SLLGLHPVDLFQVDAGLGGAAAVLLMLLQVRPDGVL 658
Query: 722 YLLPALPWDKWSSGCVKGLKARGGETVSICWKDGDLHE 759
LLPALP W G V+G++A GG V + W+ G++ E
Sbjct: 659 RLLPALP-RAWGRGRVEGMRAPGGWCVGVWWEGGEVRE 695
>gi|257069951|ref|YP_003156206.1| hypothetical protein Bfae_28510 [Brachybacterium faecium DSM 4810]
gi|256560769|gb|ACU86616.1| hypothetical protein Bfae_28510 [Brachybacterium faecium DSM 4810]
Length = 773
Score = 275 bits (702), Expect = 9e-71, Method: Compositional matrix adjust.
Identities = 235/812 (28%), Positives = 351/812 (43%), Gaps = 113/812 (13%)
Query: 15 ITFNGPAKHFTDA-IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
+ + PA + +PIGNGR+GA WG ++LNE +LW+G DY N A
Sbjct: 31 LALDAPATDWAGGTLPIGNGRVGATFWGDPVHGVIQLNEISLWSGTI-DYDNALHGHAER 89
Query: 74 DVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
D+ + S+ FG +LL D+ D S RRELD++T
Sbjct: 90 DMDT-------------SMTGFGSFLSGGRLLLDVR-GADGSAAPVDGAPLRRELDVSTG 135
Query: 134 TARV-KYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
+ + G++ +E F+S P ++V + L +++L+S + + Q
Sbjct: 136 LHTIHSRAPGDIAVHQEAFASAPADLLVLALEAE--APLRIDLALESDQEGTTLWAEEQQ 193
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
+ G++ + + + D ++A + ++ + VL
Sbjct: 194 RTLWA------------TGTLGNGLRHATAVHLLEHDGTARVAA-DGSGAQLHDATRLVL 240
Query: 253 LLVASSSFDGPFINPSDS--KKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSI 310
L+ ++ + +P +DP + + L ++ L HL L RVS+
Sbjct: 241 LVDQATDY---LRDPEQGWRGEDPVTAVRTRLADASRTGHAALRRAHLAHLTALTSRVSL 297
Query: 311 QLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQ 370
+ SP +++ + I+ V + ER DPSL LLF +GRYLL+SSSRPG
Sbjct: 298 RGEASPAEVLALPV-DRRIERVAAGER--------DPSLERLLFAYGRYLLLSSSRPGGL 348
Query: 371 VANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTA 430
ANLQG W+ P W S H NIN++M YW + L E E L +L S + + A
Sbjct: 349 PANLQGPWSHSNHPQWSSDYHSNINVQMAYWPAEVTGLPETHEALIGWL-LASRDALRRA 407
Query: 431 QVNYLA--SGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
+ GW W G W + AW H+ EH+++T D +F
Sbjct: 408 TRHTFGPVRGWTARTSQSPW-------GGNAWEWNTVSSAWYAIHVLEHWDFTRDAEFAR 460
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIR 548
A+P ++ F D LIEG DG L SPEH + D I+R
Sbjct: 461 AIAWPFVDEVCQFWEDRLIEGEDGTLLAPDGWSPEH---------GPREHGVMHDQQIVR 511
Query: 549 EVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIAEDGSIMEWAQDFKDPEVHHRHL 607
E+F + AE E D L+++ RL KI G + EW +D DP HRH
Sbjct: 512 ELFGRAGALAE--EVGADETRRAALRTIAERLGGEKIGAWGQLQEWQEDRDDPADLHRHT 569
Query: 608 SHLFGLFPGHTITIEKNPDLCKAAEKTLQKR--------GEEGPG--------------- 644
SHLF L+PG I I P L +AA +L R G E P
Sbjct: 570 SHLFSLYPGSHI-IRAAPALQRAARVSLLARCGLPPSEDGSEQPADQPVPEDLETTVSGD 628
Query: 645 ----WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
W+ W+ AL+ARL D + A+ M++ L NL+A HPPFQ+D
Sbjct: 629 SRRSWTWPWRAALFARLGDGDGAHAMLRGLLRC-----------STLPNLWATHPPFQLD 677
Query: 701 ANFGFTAAVAEMLVQSTLND------LYLLPALPWDKWSSGCVKGLKARGGETVSICWKD 754
NFG TAA+AEMLVQS + LLPALP SG V+GL+ARGG V + W++
Sbjct: 678 GNFGITAAIAEMLVQSHERTEDGQVLVRLLPALPTAWAGSGAVQGLRARGGLVVDVAWEE 737
Query: 755 GDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKV 786
G + + + + S ++ + T V+V
Sbjct: 738 GAVTDWSLAAVSSGAVREAVVVIGEAETVVEV 769
>gi|290955162|ref|YP_003486344.1| hypothetical protein SCAB_5761 [Streptomyces scabiei 87.22]
gi|260644688|emb|CBG67773.1| conserved hypothetical protein [Streptomyces scabiei 87.22]
Length = 1072
Score = 275 bits (702), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 212/699 (30%), Positives = 303/699 (43%), Gaps = 88/699 (12%)
Query: 114 DSHLKYAEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSF 173
D+ + Y R LD ++ RE F+ V+V + + LS
Sbjct: 433 DTRAQRTVVDYERGLDFVKGLHVTRFGPPGRRVLREAFAVRSADVMVFRYTSDSPRGLSG 492
Query: 174 NVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGT 233
++L S D R P +A D + I F+ ++ +
Sbjct: 493 AIALTSGQD--------------------RAPTSVDA--DARRISFAGVMGNGLKHACTV 530
Query: 234 ISALEDKKLKVEGS-----DWAVLLLVASSSFDGPFINPSDSKK-DPTSESMSALQSIRN 287
D V+GS D L L+ + D + + DP + AL
Sbjct: 531 RVVDTDGDFDVDGSTLRFSDCTTLTLLLDARTDYRLDAAAGWRGGDPRAAVDRALAKAAA 590
Query: 288 LSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-D 346
Y+ L RH+ + L +RVS+ S+ + +P+A R+ + + D
Sbjct: 591 RPYARLRDRHISRTRALMNRVSVDWG----------TSDAGVMALPTAARLARYAAGKAD 640
Query: 347 PSLVELLFQFGRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPC 406
P+L + +F +GRYLLISSSRP ANLQG+WN+ P W S H NIN++MNYW +
Sbjct: 641 PTLEQAMFDYGRYLLISSSRPDGLPANLQGLWNDSNQPAWASDYHTNINIQMNYWGAETT 700
Query: 407 NLSECQEPLFDFLTYLSINGSKTAQVNYLAS---GWVIHHKTDIWAKSSADRGKVVWALW 463
NLSEC + L F+ +++ S+ A N + GW I+ G W
Sbjct: 701 NLSECHKALVAFIEQVAVP-SRVATRNAFGARTRGWTARTSQSIF-------GGNAWEWN 752
Query: 464 PMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPE 523
+ AW HL+EH+ +T D D+L A+P+++ F D L E DG L SPE
Sbjct: 753 TVASAWYAQHLYEHWAFTQDMDYLRTVAHPMIKEICEFWEDHLKERADGLLVAPDGWSPE 812
Query: 524 HEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTK 583
H DG + D II ++F + VL+ + A KV RL P K
Sbjct: 813 HG-PREDGVM--------YDQQIIWDLFQNYLDCEAVLDADP-AYRAKVADMQERLAPNK 862
Query: 584 IAEDGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP 643
I + G + EW +D P HRH SHLF ++PG IT K D AA +L+ R E
Sbjct: 863 IGKWGQLQEWQEDIDSPTDIHRHTSHLFAVYPGRQIT-PKERDFAAAALVSLKARCGEKD 921
Query: 644 G---------------WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYS 688
G W+ W+ AL+ARL D + A M++ L
Sbjct: 922 GVPFTAATVSGDSRRSWTWPWRAALFARLGDGQRAQVMLRGLLTY-----------NTLP 970
Query: 689 NLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETV 748
NLF HPPFQ+D NFG + AVAEML+QS + LLPALP D + G GL+ARGG V
Sbjct: 971 NLFCNHPPFQMDGNFGISGAVAEMLLQSHDGVIDLLPALPDDWKAKGSFTGLRARGGYEV 1030
Query: 749 SICWKDGDLHEVGIYSNYSNNDHDSFKTLHYRGTSVKVN 787
W+DG + I ++ + D T+ GT KV
Sbjct: 1031 RCEWRDGKVTSYEIVADRA-PDRKKKVTVRVNGTEKKVR 1068
Score = 48.9 bits (115), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 28/61 (45%), Positives = 39/61 (63%), Gaps = 2/61 (3%)
Query: 15 ITFNGPAKHF-TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALS 73
+T+ PA + + A+PIGNGRLGAM++G E ++ NE +LW GV +Y N A K S
Sbjct: 58 LTYRVPATDWQSQALPIGNGRLGAMLFGDPDEERIQFNEQSLWGGV-NNYDNALAGKPDS 116
Query: 74 D 74
D
Sbjct: 117 D 117
>gi|294624936|ref|ZP_06703590.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|294665903|ref|ZP_06731169.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292600773|gb|EFF44856.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292604307|gb|EFF47692.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 801
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 237/801 (29%), Positives = 367/801 (45%), Gaps = 133/801 (16%)
Query: 13 LKITFNGPA---KHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAP 69
L++ + PA + + +PIGNGRLGA+ G ETL ++E +LW+G G P
Sbjct: 57 LRLPYAAPAADAQLLREGLPIGNGRLGALCGGAPACETLFVSEGSLWSG--GSNAVPQ-- 112
Query: 70 KALSDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELD 129
D GQ+A + + FG + LL + +E H + ++ Y+RELD
Sbjct: 113 ----------DDGQFAY----TKEDFGS----FMLLAKLFVELQ-GHAQVSD--YQRELD 151
Query: 130 LNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNG 189
++ RV+Y +G+ +TR F+S+PD IV ++ +GS + L +D H+
Sbjct: 152 MSNGCVRVRYRIGDTRYTRILFASHPDAAIVLRLDCEGAGSHRGRIRL---IDTHAGA-- 206
Query: 190 NNQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDW 249
GR G A D+ G++++A L + D R LE ++ D
Sbjct: 207 -------GRADGHAGLRFAGQLDN--GLRYAAALRVHSDDGR-----LETGDGLLQFHDC 252
Query: 250 AVLLLVASSSFDGPFINPS---DSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFH 306
+ L +V D D+ +DP + + + Q+ ++ + L H+ D++ LF
Sbjct: 253 SGLTIVLCGDTDYAADGARGWRDATRDPLALARTRAQAAASVPAALLLDTHVADHRALFD 312
Query: 307 RVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISSSR 366
+ ++L +S + ++ ++T + + DP L QFGRYL I++SR
Sbjct: 313 TLQVELGQSSE-------AQRGLETWQRIQARAAAPALPDPELEVAYLQFGRYLTIAASR 365
Query: 367 PGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSING 426
G NLQG+W E+ P W S H ++NL+MNYW + P L C + L + +
Sbjct: 366 DGLPT-NLQGLWLENNEPPWMSDYHSDVNLQMNYWLADPSGLGTCVDALTRYCLAQLPSW 424
Query: 427 SKTAQVNY------------LASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
++ Q ++ +GW + A S+ G W P G AWLC L
Sbjct: 425 TRITQAHFNDPRNRFRNTSGKIAGWTV-------AISTNPFGGNGWYWHPAGNAWLCDSL 477
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASF----LLDWLIEGHDGY----LETNPSTSPEHEF 526
W+HY +T +RD L R YPLL+G F L+ + DG L + SPEH
Sbjct: 478 WQHYEFTQNRDDL-TRIYPLLKGACQFWQARLIAMEVTDADGRTRQCLVDDHDWSPEH-- 534
Query: 527 IAPDGKLACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLP-RLRPTKIA 585
P+ ++Y+ + + +F A+ +L + DA + +L RL +I+
Sbjct: 535 -GPENARG-IAYAQEL----VWTLFGQYRQASALLGR--DAAYAATIATLQQRLYLPQIS 586
Query: 586 E-DGSIMEWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLC-----KAAEKTLQKRG 639
G + EW E HHRHLS L G+FPGH + +PDL +AA K L+ RG
Sbjct: 587 PLSGQLQEWMSPTDLGEAHHRHLSPLMGVFPGHRL----HPDLAPPAQVEAARKLLEARG 642
Query: 640 EEGPGWSITWKTALWARLHDQEHAYRMV---------------KRLFNLVD-PEHEKHFE 683
+ GW+ W+ WARL D E AY +V LF++ D +H
Sbjct: 643 MQSFGWACAWRALCWARLGDAERAYALVLTNLKPSIGHSNGSAPNLFDIYDLSQHGDPTL 702
Query: 684 GGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKAR 743
GG+ FQIDANFG AA+ EML+ S + LLPALP G V GL AR
Sbjct: 703 GGV----------FQIDANFGTPAAMLEMLLYSRPGQITLLPALPKAWAEQGRVTGLGAR 752
Query: 744 GGETVSICWKDGDLHEVGIYS 764
GG V + W++G ++ + S
Sbjct: 753 GGFVVDMAWRNGVPTQISVRS 773
>gi|386070626|ref|YP_005985522.1| hypothetical protein TIIST44_05070 [Propionibacterium acnes ATCC
11828]
gi|353454992|gb|AER05511.1| hypothetical protein TIIST44_05070 [Propionibacterium acnes ATCC
11828]
Length = 736
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 226/775 (29%), Positives = 339/775 (43%), Gaps = 118/775 (15%)
Query: 14 KITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
++ + PA + +PIGNGRLGA++ G + + ++ NE++LW G +Y N L
Sbjct: 7 RLHYRSPAAEWEAYGLPIGNGRLGAVLRGDIAHDVVQFNENSLWAG-SNNYDN-----GL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
V + S+ FG Y G + + F + Y R LDL
Sbjct: 61 CGVAD--------DVFDTSMHGFGR----YLDFGRVTISFAGLE-ESTVSGYERGLDLRR 107
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A + G V R F+S VIV + S S V L+S S V G+
Sbjct: 108 AVAHTCFDAGGVRHQRSAFASREADVIVLRYSA--SAPFGCTVRLESAQGVPSRVAGDTS 165
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
++ +G G+++ A L + D R A D+ + + + A++
Sbjct: 166 VVFDGVL--------------GNGLRYCASLVVLECDGRSI--AHGDRIVVADATTLALV 209
Query: 253 L-------LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
L L A + + G +NP + +M+ L + L+ H+ ++ +
Sbjct: 210 LDAGTDYALSAVAGWRG--VNPRPVVDERICSAMA-------LGWGRLHDAHVTNFSAVM 260
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISS 364
R ++ RS + +D P+ ER++ ++ D L +L GRYLL+SS
Sbjct: 261 DRCRLRWGRSVPE----------LDAQPTDERLRRYRDGAADVGLEQLAVVLGRYLLVSS 310
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SR ANLQG+WN+ P W S H NIN++MNYW + SE L +F+ +++
Sbjct: 311 SRAEGLPANLQGLWNDSNDPAWGSDYHTNINVQMNYWGAEVTGQSEEHMALLNFVEEVAV 370
Query: 425 --NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ A GW S + G W M AW H++EH+ +T
Sbjct: 371 PSRSATRAMCGPDVPGWTAR-------TSQSPLGGNGWKPNTMASAWYAHHVYEHWAFTR 423
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D ++L R P+L F L+E DG + SPEH DG V+Y
Sbjct: 424 DDEWLRTRGLPMLAEICRFWEHQLVERDDGMIVAPAGWSPEHG-PREDG----VAY---- 474
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D I+ ++F+ ++ + L ED L +V + RL P ++ G + EW D DP
Sbjct: 475 DQQIVWDLFTNLLECSRAL-GVEDDLYYRVERLRDRLAPNQVGCWGQLQEWQDDRDDPTE 533
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP------------------- 643
HRH SHLF ++PG IT + P+L AA +L+ R E P
Sbjct: 534 LHRHTSHLFAVYPGRQITTD-TPELQAAALVSLKARCGEPPPVVGAPTAAPFRAEMVVGD 592
Query: 644 ---GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
W+ W+ AL+ARL D A MV+ L + NL+ HPPFQ+D
Sbjct: 593 SRRSWTWPWRAALFARLGDGYRAGEMVRGLLTY-----------NMLPNLWTTHPPFQVD 641
Query: 701 ANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
N G AVAEML+QS + LLPALP + G V GL+ARGG VS+ W+DG
Sbjct: 642 GNLGLVGAVAEMLLQSHDGRIRLLPALPPAWEAEGEVIGLRARGGYRVSMQWRDG 696
>gi|422389510|ref|ZP_16469607.1| fibronectin type III domain protein [Propionibacterium acnes
HL103PA1]
gi|422463533|ref|ZP_16540146.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1]
gi|422565850|ref|ZP_16641489.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2]
gi|314965492|gb|EFT09591.1| conserved hypothetical protein [Propionibacterium acnes HL082PA2]
gi|315094542|gb|EFT66518.1| conserved hypothetical protein [Propionibacterium acnes HL060PA1]
gi|327329037|gb|EGE70797.1| fibronectin type III domain protein [Propionibacterium acnes
HL103PA1]
Length = 736
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 226/775 (29%), Positives = 339/775 (43%), Gaps = 118/775 (15%)
Query: 14 KITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
++ + PA + +PIGNGRLGA++ G + + ++ NE++LW G +Y N L
Sbjct: 7 RLHYRSPAAEWEAYGLPIGNGRLGAVLRGDIAHDVVQFNENSLWAG-SNNYDN-----GL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
V + S+ FG Y G + + F + Y R LDL
Sbjct: 61 CGVAD--------DVFDTSMHGFGR----YLDFGRVTISFAGLE-ESTVSGYERGLDLRR 107
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A + G V R F+S VIV + S S V L+S S V G+
Sbjct: 108 AVAHTCFDAGGVRHQRSAFASREADVIVLRYSA--SAPFGCTVRLESAQGVPSRVAGDTS 165
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
++ +G G+++ A L + D R A D+ + + + A++
Sbjct: 166 VVFDGVL--------------GNGLRYCASLVVLECDGRSI--AHGDRIVVADATTLALV 209
Query: 253 L-------LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
L L A + + G +NP + +M+ L + L+ H+ ++ +
Sbjct: 210 LDAGTDYALSAVAGWRG--VNPRPVVDERICSAMA-------LGWGRLHDAHVTNFSAVM 260
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISS 364
R ++ RS + +D P+ ER++ ++ D L +L GRYLL+SS
Sbjct: 261 DRCRLRWGRSVPE----------LDAQPTDERLRRYRDGAADVGLEQLAVVLGRYLLVSS 310
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SR ANLQG+WN+ P W S H NIN++MNYW + SE L +F+ +++
Sbjct: 311 SRAEGLPANLQGLWNDSNDPAWGSDYHTNINVQMNYWGAEVTGQSEEHMALLNFVEEVAV 370
Query: 425 --NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ A GW S + G W M AW H++EH+ +T
Sbjct: 371 PSRSATRAMCGPDVPGWTAR-------TSQSPLGGNGWQPNTMASAWYAHHVYEHWAFTR 423
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D ++L R P+L F L+E DG + SPEH DG V+Y
Sbjct: 424 DDEWLRTRGLPMLAEICRFWEHQLVERDDGMIVAPAGWSPEHG-PREDG----VAY---- 474
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D I+ ++F+ ++ + L ED L +V + RL P ++ G + EW D DP
Sbjct: 475 DQQIVWDLFTNLLECSRAL-GVEDDLYYRVERLRDRLAPNQVGCWGQLQEWQDDRDDPTE 533
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP------------------- 643
HRH SHLF ++PG IT + P+L AA +L+ R E P
Sbjct: 534 LHRHTSHLFAVYPGRQITTD-TPELQAAALVSLKARCGEPPPVVGAPTAAPFRAEMVVGD 592
Query: 644 ---GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
W+ W+ AL+ARL D A MV+ L + NL+ HPPFQ+D
Sbjct: 593 SRRSWTWPWRAALFARLGDGYRAGEMVRGLLTY-----------NMLPNLWTTHPPFQVD 641
Query: 701 ANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
N G AVAEML+QS + LLPALP + G V GL+ARGG VS+ W+DG
Sbjct: 642 GNLGLVGAVAEMLLQSHDGRIRLLPALPPAWEAEGEVIGLRARGGYRVSMQWRDG 696
>gi|282853132|ref|ZP_06262469.1| conserved hypothetical protein [Propionibacterium acnes J139]
gi|282582585|gb|EFB87965.1| conserved hypothetical protein [Propionibacterium acnes J139]
Length = 736
Score = 274 bits (700), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 226/775 (29%), Positives = 339/775 (43%), Gaps = 118/775 (15%)
Query: 14 KITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
++ + PA + +PIGNGRLGA++ G + + ++ NE++LW G +Y N L
Sbjct: 7 RLHYRSPAAEWEAYGLPIGNGRLGAVLRGDIAHDVVQFNENSLWAG-SNNYDN-----GL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
V + S+ FG Y G + + F + Y R LDL
Sbjct: 61 CGVAD--------DVFDTSMHGFGR----YLDFGRVTISFAGLE-ESTVSGYERGLDLRR 107
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A + G V R F+S VIV + S S V L+S S V G+
Sbjct: 108 AVAHTCFDAGGVRHQRSAFASREADVIVLRYSA--SAPFGCTVRLESAQGVPSRVAGDTS 165
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
++ +G G+++ A L + D R A D+ + + + A++
Sbjct: 166 VVFDGVL--------------GNGLRYCASLVVLECDGRSI--AHGDRIVVADATALALV 209
Query: 253 L-------LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
L L A + + G +NP + +M+ L + L+ H+ ++ +
Sbjct: 210 LDAGTDYALSAVAGWRG--VNPRPVVDERICSAMA-------LGWGRLHDAHVTNFSAVM 260
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISS 364
R ++ RS + +D P+ ER++ ++ D L +L GRYLL+SS
Sbjct: 261 DRCRLRWGRSVPE----------LDAQPTDERLRRYRDGAADVGLEQLAVVLGRYLLVSS 310
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SR ANLQG+WN+ P W S H NIN++MNYW + SE L +F+ +++
Sbjct: 311 SRAEGLPANLQGLWNDSNDPAWGSDYHTNINVQMNYWGAEVTGQSEEHMALLNFVEEVAV 370
Query: 425 --NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ A GW S + G W M AW H++EH+ +T
Sbjct: 371 PSRSATRAMCGPDVPGWTAR-------TSQSPLGGNGWKPNTMASAWYAHHVYEHWAFTR 423
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D ++L R P+L F L+E DG + SPEH DG V+Y
Sbjct: 424 DDEWLRTRGLPMLAEICRFWEHQLVERDDGMIVAPAGWSPEHG-PREDG----VAY---- 474
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D I+ ++F+ ++ + L ED L +V + RL P ++ G + EW D DP
Sbjct: 475 DQQIVWDLFTNLLECSRAL-GVEDDLYYRVERLRDRLAPNQVGCWGQLQEWQDDRDDPTE 533
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP------------------- 643
HRH SHLF ++PG IT + P+L AA +L+ R E P
Sbjct: 534 LHRHTSHLFAVYPGRQITTD-TPELQAAALVSLKARCGEPPPVVGAPTAAPFRAEMVVGD 592
Query: 644 ---GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
W+ W+ AL+ARL D A MV+ L + NL+ HPPFQ+D
Sbjct: 593 SRRSWTWPWRAALFARLGDGYRAGEMVRGLLTY-----------NMLPNLWTTHPPFQVD 641
Query: 701 ANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
N G AVAEML+QS + LLPALP + G V GL+ARGG VS+ W+DG
Sbjct: 642 GNLGLVGAVAEMLLQSHDGRIRLLPALPPAWEAEGEVIGLRARGGYRVSMQWRDG 696
>gi|319792118|ref|YP_004153758.1| alpha-L-fucosidase [Variovorax paradoxus EPS]
gi|315594581|gb|ADU35647.1| Alpha-L-fucosidase [Variovorax paradoxus EPS]
Length = 938
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 204/657 (31%), Positives = 302/657 (45%), Gaps = 79/657 (12%)
Query: 120 AEETYRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDS 179
A YRR LDL T ++S + RE F+S V+V + + S+S + S ++L S
Sbjct: 308 ATTGYRRTLDLGTGVHTTEFSTSGRKIVREAFASKVADVMVFRYTASDSRAFSGTLTLTS 367
Query: 180 LLDNHSYVNGN-NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALE 238
+ + + Q+ G A AN ++++ +++ D + +S
Sbjct: 368 MQGATATADAATGQVSFSG----------AMANS----LKYACAVQVVKEDGQLAVSG-- 411
Query: 239 DKKLKVEGSDWAVLLLVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHL 298
L + LL+ A + + + S DP +AL + + +Y+ L H+
Sbjct: 412 -NALSFDQCTSLTLLVDARTDYKLDYAAGWRST-DPAPRVQAALAAAASKTYAALRQAHV 469
Query: 299 DDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDE-DPSLVELLFQFG 357
D+ + R S+ S +V T + +R++ + DP L + +F +G
Sbjct: 470 ADFGAVMSRASVTWGNSDAAVVGLT----------TRQRLERYAGGAADPGLEQAMFDYG 519
Query: 358 RYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFD 417
RYLL+SSSR G ANLQG+WN SP W S H NIN++MNYW + L +C PL D
Sbjct: 520 RYLLVSSSRQGGLPANLQGLWNNSNSPAWASDYHTNINVQMNYWGAESTGLPDCHTPLVD 579
Query: 418 FLTYLSINGSKTAQVNYLAS---GWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHL 474
F++ ++ S+ A N + GW I+ G W + AW HL
Sbjct: 580 FVSQVA-GPSRIATRNAFGANTRGWTARTSQSIF-------GGNAWNWNNVSSAWYAQHL 631
Query: 475 WEHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLA 534
+EH+ +T D ++L AYP+L+ F D L DG L SPEH DG +
Sbjct: 632 YEHFAFTQDLNYLRNTAYPMLKEICQFWEDRLKLRADGLLVAPNGWSPEHG-PTEDGVM- 689
Query: 535 CVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSL-PRLRPTKIAEDGSIMEW 593
D II ++F + AA L N DA + + + +L P KI + G + EW
Sbjct: 690 -------YDQQIIWDLFQNYLDAARTL--NVDAAYQTTVAGMQAKLAPNKIGKWGQLQEW 740
Query: 594 AQDFKDPEVHHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPG--------- 644
D DP+ HHRH SHLF ++PG +T K P AA +L+ R E G
Sbjct: 741 QGDIDDPKDHHRHTSHLFAVYPGRQVTPAKTPAFAAAALVSLKARCGEVAGQPFTASMVT 800
Query: 645 ------WSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQ 698
W+ W+ AL+ARL D A M++ L NLF HPPFQ
Sbjct: 801 GDSRRSWTWPWRCALFARLGDAGRAQTMLRGLLTY-----------NTLQNLFCNHPPFQ 849
Query: 699 IDANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
+D NFG + A+ EML+QS + LLPA P D ++G GL+ARGG VS WK+G
Sbjct: 850 MDGNFGISGALTEMLLQSHEGVIVLLPACPDDWKAAGAFNGLRARGGYRVSCVWKNG 906
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/88 (38%), Positives = 46/88 (52%), Gaps = 18/88 (20%)
Query: 25 TDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQY 84
+ A+PIGN RLGAM++GG +E ++ NE +LW GV +Y N A G+
Sbjct: 88 SQALPIGNARLGAMLFGGAFNERIQFNEQSLWGGV-NNYDNALA-------------GKN 133
Query: 85 AEATAASVKLFGHPADVYQLLGDIELEF 112
+A SV FG Y+ GDI L F
Sbjct: 134 DDAFDTSVTGFGS----YRAFGDIALAF 157
>gi|422457861|ref|ZP_16534519.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2]
gi|315104961|gb|EFT76937.1| conserved hypothetical protein [Propionibacterium acnes HL050PA2]
Length = 736
Score = 273 bits (699), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 226/775 (29%), Positives = 339/775 (43%), Gaps = 118/775 (15%)
Query: 14 KITFNGPAKHFTD-AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKAL 72
++ + PA + +PIGNGRLGA++ G + + ++ NE++LW G +Y N L
Sbjct: 7 RLHYRSPAAEWEAYGLPIGNGRLGAVLRGDIAHDVVQFNENSLWAG-SNNYDN-----GL 60
Query: 73 SDVRSLVDSGQYAEATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEETYRRELDLNT 132
V + S+ FG Y G + + F + Y R LDL
Sbjct: 61 CGVAD--------DVFDTSMHGFGR----YLDFGRVTISFAGLE-ESTVSGYERGLDLRR 107
Query: 133 ATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQ 192
A A + G V R F+S VIV + S S V L+S S V G+
Sbjct: 108 AVAHTCFDAGGVRHQRSAFASREADVIVLRYSA--SAPFGCTVRLESAQGVPSRVAGDTS 165
Query: 193 IIMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVL 252
++ +G G+++ A L + D R A D+ + + + A++
Sbjct: 166 VVFDGVL--------------GNGLRYCASLVVLECDGRSI--AHGDRIVVADATTLALV 209
Query: 253 L-------LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLF 305
L L A + + G +NP + +M+ L + L+ H+ ++ +
Sbjct: 210 LDAGTDYALSAVAGWRG--VNPRPVVDERICSAMA-------LGWGRLHDAHVTNFSAVM 260
Query: 306 HRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTD-EDPSLVELLFQFGRYLLISS 364
R ++ RS + +D P+ ER++ ++ D L +L GRYLL+SS
Sbjct: 261 DRCRLRWGRSVPE----------LDAQPTDERLRRYRDGAADVGLEQLAVVLGRYLLVSS 310
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SR ANLQG+WN+ P W S H NIN++MNYW + SE L +F+ +++
Sbjct: 311 SRAEGLPANLQGLWNDSNDPAWGSDYHTNINVQMNYWGAEVTGQSEEHMALLNFVEEVAV 370
Query: 425 --NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTM 482
+ A GW S + G W M AW H++EH+ +T
Sbjct: 371 PSRSATRAMCGPDVPGWTAR-------TSQSPLGGNGWQPNTMASAWYAHHVYEHWAFTR 423
Query: 483 DRDFLEKRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTM 542
D ++L R P+L F L+E DG + SPEH DG V+Y
Sbjct: 424 DDEWLRTRGLPMLAEICRFWEHQLVERDDGMIVAPAGWSPEHG-PREDG----VAY---- 474
Query: 543 DMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDFKDPEV 602
D I+ ++F+ ++ + L ED L +V + RL P ++ G + EW D DP
Sbjct: 475 DQQIVWDLFTNLLECSRAL-GVEDDLYYRVERLRDRLAPNQVGCWGQLQEWQDDRDDPTE 533
Query: 603 HHRHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGP------------------- 643
HRH SHLF ++PG IT + P+L AA +L+ R E P
Sbjct: 534 LHRHTSHLFAVYPGRQITTD-TPELQAAALVSLKVRCGEPPPVVGAPTAAPFRAEMVVGD 592
Query: 644 ---GWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQID 700
W+ W+ AL+ARL D A MV+ L + NL+ HPPFQ+D
Sbjct: 593 SRRSWTWPWRAALFARLGDGYRAGEMVRGLLTY-----------NMLPNLWTTHPPFQVD 641
Query: 701 ANFGFTAAVAEMLVQSTLNDLYLLPALPWDKWSSGCVKGLKARGGETVSICWKDG 755
N G AVAEML+QS + LLPALP + G V GL+ARGG VS+ W+DG
Sbjct: 642 GNLGLVGAVAEMLLQSHDGRIRLLPALPPAWEAEGEVIGLRARGGYRVSMQWRDG 696
>gi|238482887|ref|XP_002372682.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
gi|220700732|gb|EED57070.1| conserved hypothetical protein [Aspergillus flavus NRRL3357]
Length = 608
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 182/594 (30%), Positives = 294/594 (49%), Gaps = 60/594 (10%)
Query: 17 FNGPAKHFTDAIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVR 76
++ P F ++P+GNGRLG ++ +P+E + NED++W+G D N +A VR
Sbjct: 34 YDTPGTRFNASLPVGNGRLGGTLYY-LPTEIVTWNEDSVWSGTFQDRVNSNALDGFPKVR 92
Query: 77 SLVDSGQYAEATAASVK-LFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTA 133
+L+ +G A ++ + G D YQ+L ++ ++ + R LD
Sbjct: 93 NLLVNGNITAAGELALSDMTGSSVDQREYQVLSNLYVDLGQ---RGDATNLVRYLDTLEG 149
Query: 134 TARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLLDNHSYVNGNNQI 193
+Y V +TRE +S P V+ +I + S +++ N + NG I
Sbjct: 150 YTACEYGFDGVSYTRELIASAPSGVLGFRIQANTSRAINLN----------AVANGIASI 199
Query: 194 IMEGRCPGKRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKLKVEGSDWAVLL 253
+M+ R + F+A + + + D G ++A DK L V G+ V
Sbjct: 200 VMKAR------------TGEADYSTFTAGVRVVV--DGGNVTANGDK-LYVTGATTVVFF 244
Query: 254 LVASSSFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLS 313
L A SS+ + D +E L + L Y L + D++ L RV++ L
Sbjct: 245 LDAESSYR------YATDSDQETELNRKLDAATELGYEALRKEAITDHKDLAGRVTLDLG 298
Query: 314 RSPKDIVTDTCSEENIDTVPSAERVKSFQT--DEDPSLVELLFQFGRYLLISSSRPGTQV 371
S D + +P ER+ ++++ D D L+F +GR+LLI+SSR +
Sbjct: 299 SSTDDAAS----------LPPNERMTNYRSSPDHDVQFATLVFNYGRHLLIASSRRTRER 348
Query: 372 A---NLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSK 428
+ LQGIWN+D SP+W + VNINLEMNYW + NL+E PL+D L + G
Sbjct: 349 SLSPGLQGIWNQDYSPSWGAKYTVNINLEMNYWPAETTNLNELTSPLWDLLALIQERGGD 408
Query: 429 TAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLE 488
A+ + G+V+HH TD+W S +++WPMGGAWL H+ EHY +T D+ FL+
Sbjct: 409 VAEKMHGCPGFVLHHNTDLWGDSVPVHNGTKYSIWPMGGAWLALHMMEHYRFTGDKTFLK 468
Query: 489 KRAYPLLEGCASFLLDWLIEGHDGYLETNPSTSPEHEFIAPD-----GKLACVSYSSTMD 543
++A P+ + F +L + DGYL T PS SPE+ F P GK ++ S T+D
Sbjct: 469 EQACPIFKSAFEFFECYLFD-VDGYLTTGPSCSPENAFQIPSDMTVAGKEEALTMSPTLD 527
Query: 544 MAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRLRPTKIAEDGSIMEWAQDF 597
+++ E+ +A+ ++LE + D L V L ++RP +I DG I+EW ++F
Sbjct: 528 NSMLFELLTALNETHQILEIDND-LSGSVQTYLGKIRPPRIGSDGQILEWIEEF 580
>gi|189208288|ref|XP_001940477.1| alpha-fucosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
gi|187976570|gb|EDU43196.1| alpha-fucosidase [Pyrenophora tritici-repentis Pt-1C-BFP]
Length = 814
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 218/728 (29%), Positives = 336/728 (46%), Gaps = 80/728 (10%)
Query: 29 PIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP---GDYT--NPDA--PKALSDVRSLVDS 81
P+GNGRLGAM G +ETL LN D+LW+G P +YT NP AL +R +
Sbjct: 41 PLGNGRLGAMPVGPPAAETLTLNLDSLWSGGPFNISNYTGGNPHTLIASALPGIRDWI-- 98
Query: 82 GQYAEATAASVKLFGHPADV--YQLLGDIELEFDDSHLKYAEETYRRELDLNTATARVKY 139
+ T L G + YQ+LG++ ++ Y R+LD++T T +
Sbjct: 99 --FTNGTGNVSALLGSNDNYGSYQVLGNLTVKIPSLSSDIVSN-YTRKLDMSTGTHTTTF 155
Query: 140 SVGNVEFTREHFSSNPDQVIVTKISGSESGSLS-FNVSLDSLL-----DNHSYVNGNNQI 193
+ F S PDQV V + + +G + V+LD++L N + V G+
Sbjct: 156 IANGNDLETTGFCSFPDQVCVYTVQSTGAGDVPPLEVTLDNVLVSPQLQNVTCVEGDTTK 215
Query: 194 IMEGRCPG-KRIPPKANANDDPKGIQFSAILEIKISDDRGTISALEDKKL----KVEGSD 248
R G ++ P P+G+++ +I + +S+ +S E+ L G+
Sbjct: 216 PAHLRLRGVTQLGP-------PEGMRYDSIARV-VSNSNTDVSCDENTGLLSIAPRSGTK 267
Query: 249 WAVLLLVASSSFDGPFI----NPSDSKKDPTSESMSALQSIRNLSYSDLYTRHLDDYQKL 304
+++ A +++D N S +DP + + L RH+DD+ L
Sbjct: 268 SVSIVIGAGTNYDAKKGTAEHNYSFRGEDPALIVEATTLKAATKTLDQLRGRHIDDFTAL 327
Query: 305 FHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVELLFQFGRYLLISS 364
+ L D + T R T DP L LL + RYL ISS
Sbjct: 328 TGLFELSLP--------DPLNSSQTQTSELINRYTVNNTSGDPYLESLLMENSRYLFISS 379
Query: 365 SRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLFDFLTYLSI 424
SRPG+ NLQG W+E L W + H NIN +MN+W S L++ Q PL+D++T +
Sbjct: 380 SRPGSLPPNLQGRWSEGLETDWSADYHANINFQMNHWTSDQTGLTDLQSPLWDYMTDTWM 439
Query: 425 -NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMD 483
G++TA + Y A GWV+H++ +I+ +A + WA +P+ AW+ H+++H++Y+ +
Sbjct: 440 PRGAETATLLYNAPGWVVHNEMNIFGH-TAMKSAAEWANYPIAAAWMMQHVFDHWDYSRN 498
Query: 484 RDFLEKRAYPLLEGCASFLLDWLIEG---HDGYLETNPSTSPEHEFIAPDGKLACVSYSS 540
+L K+ YPLL+G A F LD L + DG L NP SPEH C Y
Sbjct: 499 ATWLLKQGYPLLKGVAMFWLDQLQQDGYYKDGSLVVNPCNSPEHGGTT----FGCAHYQQ 554
Query: 541 TMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIMEW----AQ 595
+I +VF +I++ + + + + SL RL + I EW +
Sbjct: 555 -----LIHQVFHSILAVQPTVADPDTVFLTNLTSSLHRLDKGFHTGSFSQIKEWKIPDSY 609
Query: 596 DFKDPEVHHRHLSHLFGLFPGHTITIEK----NPDLCKAAEKTLQKRGE-EGP----GWS 646
+ P HRHLS L G PG +++ + N + A + L RG +GP W+
Sbjct: 610 TYDRPNDTHRHLSELVGWHPGFSLSALQHGYSNATIASAVRQKLISRGPGKGPDGNSAWA 669
Query: 647 ITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFT 706
W++A WARL+D EHA+ ++ E ++ S F PFQID NFGF
Sbjct: 670 KVWRSACWARLNDTEHAHWELRFAI-------ETNWAPNGLSMYFGDKIPFQIDGNFGFG 722
Query: 707 AAVAEMLV 714
AV MLV
Sbjct: 723 GAVLGMLV 730
>gi|393247026|gb|EJD54534.1| glycoside hydrolase family 95 protein [Auricularia delicata
TFB-10046 SS5]
Length = 861
Score = 273 bits (698), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 222/790 (28%), Positives = 353/790 (44%), Gaps = 124/790 (15%)
Query: 28 IPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVP-GDYTNPDAPKALSDVRSLVDSGQYAE 86
+P+GNG +G M + + LN ++LWTG P N + L+ V + V E
Sbjct: 103 LPVGNGYMGMMQSSRPDFDDVVLNLESLWTGGPYNSANNYNGGNPLTAVNASVR-----E 157
Query: 87 ATAASVKLFGHPADVYQLLGDIELEFDDSHLKYAEE---------------TYRRELDLN 131
A++ G P D+ D SH Y R LD N
Sbjct: 158 NIRATIWANGSP--------DLTPLVDGSHYGSLSSPGSLHISRSIGNDVTGYERALDFN 209
Query: 132 TATARVKYSVGNVEFTREHFSSNPDQVIVTKISGSESGSLSFNVSLDSLL-DNHSYVNGN 190
T + G+ + R +F S PDQV V G+ + + + SLD+L +++ V
Sbjct: 210 DGTISATWKEGSNSYLRTYFCSFPDQVCVVNTEGTGNDTAIY--SLDTLRPRDYASVACL 267
Query: 191 NQIIMEGRCPGKRIPPKANANDDPKGIQFSAILE-IKISDDRGTISALEDKKLKVEGSDW 249
++ + R + G+ + ++ I S D T S + L G+
Sbjct: 268 DKSTLAYRGLA-----------ESSGMTYEILVRLISSSPDSVTCSGAGNATLTGSGARQ 316
Query: 250 AVLLLVASS------------SFDGPFINPSDSKKDPTSESMSALQSIRNLSYSDLYTRH 297
VL+ A++ SF GP DP + ++++L SY L +RH
Sbjct: 317 MVLITGATNYNIDAGTRAHNFSFAGP---------DPHASALNSLSKASRSSYEALLSRH 367
Query: 298 LDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAERVKSFQTDEDPSLVE-LLFQF 356
+DDY LFH + L + P D+V P+ + V + T +E LLF
Sbjct: 368 IDDYSALFHGFELDLGQKP-DVVK-----------PTDQLVAEYVTGTGNVYLEWLLFNL 415
Query: 357 GRYLLISSSRPGTQVANLQGIWNEDLSPTWDSAPHVNINLEMNYWQSLPCNLSECQEPLF 416
GR+++I+ +R G + LQ +W L W H NINL+MNYW + NL PL+
Sbjct: 416 GRFMMITGAR-GVLPSGLQSVWTTGLEAPWGGDYHANINLQMNYWGAEETNLGAVTGPLW 474
Query: 417 DFLTYLSI-NGSKTAQVNYLASGWVIHHKTDIWAKSSADRGKVVWALWPMGGAWLCTHLW 475
+++ + GS+TAQ+ Y + G+V+H++ +I+ + G WA +P W+ H+W
Sbjct: 475 NYMRKTWVPRGSETAQLVYGSRGFVVHNEMNIFGHTGMKLGDPQWADYPAAATWMMLHVW 534
Query: 476 EHYNYTMDRDFLEKRAYPLLEGCASFLLDWLIE---GHDGYLETNPSTSPEHEFIAPDGK 532
+H+++T D ++ + + LL+ A F LD L E DG L P SPE+ + P
Sbjct: 535 DHFDFTGDLNWFRSQGWSLLKAQAEFWLDNLFEDSASKDGTLVAVPCNSPENGIVGP--- 591
Query: 533 LACVSYSSTMDMAIIREVFSAIISAAEVLEKNEDALVEKVLKSLPRL-RPTKIAEDGSIM 591
+Y +I E+F I ++ + + ++++ L +L R +I G +
Sbjct: 592 ----TYGCAHFQQLIWELFHNIQKGFKLSGDADQSFLKEIEAKLSKLDRGVRIGSWGQMQ 647
Query: 592 EWAQDFKDPEVHHRHLSHLFGLFPGHTITIEKNP-----DLCKAAEKTLQKRG----EEG 642
EW +D P HRH+SHL GL+PG+ + P ++ KAA T+ RG +
Sbjct: 648 EWKRDLDQPGDLHRHISHLMGLYPGYAVASWNEPSPSRQEVMKAAATTVAHRGPGIADSD 707
Query: 643 PGWSITWKTALWARLHDQEHAYRMVKRLFNLVDPEHEKHFEGGLYSNLF-----AAHPPF 697
GW ++ LW++L + AY ++ E +NLF A+ F
Sbjct: 708 AGWEKMVRSVLWSQLGNASGAYY-----------AYQLSLERDYGANLFDMYSGEANSLF 756
Query: 698 QIDANFGFTAAVAEMLVQST----LND---LYLLPALPWDKWSSGCVKGLKARGGETVSI 750
QIDANFG AV M+VQ+T L+D + LLPALP WS+G VK + R G +S+
Sbjct: 757 QIDANFGAVGAVINMIVQATNTPSLSDPLVINLLPALP-GAWSTGSVKNARVRNGIGLSM 815
Query: 751 CWKDGDLHEV 760
W G + V
Sbjct: 816 SWSAGTVKSV 825
>gi|331092429|ref|ZP_08341254.1| hypothetical protein HMPREF9477_01897 [Lachnospiraceae bacterium
2_1_46FAA]
gi|330401272|gb|EGG80861.1| hypothetical protein HMPREF9477_01897 [Lachnospiraceae bacterium
2_1_46FAA]
Length = 1317
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 200/700 (28%), Positives = 329/700 (47%), Gaps = 79/700 (11%)
Query: 108 IELEFDDSHLKYAEET-YRRELDLNTATARVKYSVGNVEFTREHFSSNPDQVIVTKI--- 163
I D S ++ E T Y R LD+++A A V + + RE+F+S PD VI K+
Sbjct: 433 IVTSMDKSKPEHTEVTNYERALDIDSALATVSFDRDYTHYYREYFASYPDNVIAMKLTAE 492
Query: 164 ----SGSESGSLSFNVSLDSLLDNHSYVNGNNQIIMEGRCPGKRIPPKANANDDPKGIQF 219
S E L F VS +D S ++ E G I + D+ G+ F
Sbjct: 493 ALKGSQKEMKPLEFEVSFP--VDQPSEAALGKEVKYETTEDG-TIVVSGHMRDN--GLLF 547
Query: 220 SAILEIKISDDRGTISALEDKKLKVEGSDWAVLLLVASSSFDG--PFINPSDSKKDPTSE 277
+ L++ D + A ++ L V G+ + + A + + P + + +++
Sbjct: 548 NGRLQVVTKDGKVEQIANKEGTLLVSGATEVYIYVTADTDYKMTYPKYRSGITADELSTQ 607
Query: 278 SMSALQSIRNLSYSDLYTRHLDDYQKLFHRVSIQLSRSPKDIVTDTCSEENIDTVPSAER 337
+ L Y + + DY+K++ RV + L + ++ +D + ++ +
Sbjct: 608 VKTVLDKAVKKGYKAVKDDAVADYKKIYDRVKLDLGQG--------AYKKTVDELIASYK 659
Query: 338 VKSFQTDEDPSLVELLFQFGRYLLISSSRPGTQV-ANLQGIW-----NEDLSPTWDSAPH 391
+E L +LFQ+GRYL ISS+R G ++ ANLQG+W + W S H
Sbjct: 660 SNKASAEEKAYLEAILFQYGRYLQISSTREGDKLPANLQGVWLDCTGKANAPIAWGSDYH 719
Query: 392 VNINLEMNYWQSLPCNLSECQEPLFDFLTYLSINGSKTAQV-------NYLASGWVIHHK 444
+N+NL+MNYW + N++EC EP+ ++ L G TA N +G+ H +
Sbjct: 720 MNVNLQMNYWPTYVTNMAECAEPMIKYIEGLREPGRVTASTYFGIDNSNGQKNGFTAHTQ 779
Query: 445 TDIWAKSSADRGKVVWALWPMGGAWLCTHLWEHYNYTMDRDFLEKRAYPLLEGCASFLLD 504
+ + + W P W+ +++E Y Y+ + + LEK +P+++ A F +
Sbjct: 780 NTPFGWTCPGW-EFSWGWSPAAVPWMLQNVYEAYEYSGNIEKLEKDIFPMMQEQAKFYMS 838
Query: 505 WL-----IEGHDGYLETNPSTSPEHEFIAPDGKLACVSYSSTMDMAIIREVFSAIISAAE 559
L +G + Y+ T P+ SPEH + + + ++ ++F+ I AA+
Sbjct: 839 ILKKVTTADGKERYV-TIPAYSPEH---------GPYTAGNVYENVLVWQLFNDCIEAAD 888
Query: 560 VLEKNEDALV--EKVLK---SLPRLRPTKIAEDGSIMEWAQD----------FKDPEVHH 604
L N+ V E++ + L+P +I + G I EW + + H
Sbjct: 889 ALNANKAGTVSEEQITQWKEYRAGLKPIEIGQSGQIKEWYDETTLGHNTKGNIPKYQKGH 948
Query: 605 RHLSHLFGLFPGHTITIEKNPDLCKAAEKTLQKRGEEGPGWSITWKTALWARLHDQEHAY 664
RH+SHL ++PG +T++ + AA+ +L RG+ GW I + WAR D HAY
Sbjct: 949 RHMSHLLAVYPGDLVTVDDEKTM-DAAKVSLNDRGDNATGWGIAQRLNTWARTGDGNHAY 1007
Query: 665 RMVKRLFNLVDPEHEKHFEGGLYSNLFAAHPPFQIDANFGFTAAVAEMLVQSTLNDLYLL 724
+++ + + G+YSNL+ AHPPFQID NFG+T+ VAEML+QS + LL
Sbjct: 1008 KII-----------DSFIKNGIYSNLWDAHPPFQIDGNFGYTSGVAEMLLQSNAGYINLL 1056
Query: 725 PALPWDKWSSGCVKGLKARGGETVSICWKDGDLHEVGIYS 764
PA+P ++W SG V GL ARG VS W G L E I S
Sbjct: 1057 PAMPENQWQSGSVSGLVARGNFVVSENWDKGVLTEATIES 1096
Score = 44.3 bits (103), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 41/158 (25%), Positives = 63/158 (39%), Gaps = 34/158 (21%)
Query: 27 AIPIGNGRLGAMVWGGVPSETLKLNEDTLWTGVPGDYTNPDAPKALSDVRSLVDSGQYA- 85
++PIGN +GA V+G V E L N TLW G P D P ++ + D A
Sbjct: 79 SLPIGNSYMGANVYGEVGKEHLTFNHKTLWNGGP----TADKPHTGGNINKVGDKSMAAY 134
Query: 86 -------------EATAASVKLFGHPA---DVYQLLGDIELEFDDSHLKYAEETYRRELD 129
A+ +L G YQ GDI L+FD K
Sbjct: 135 LESVQQAFLDGKSNASEMCNQLIGQNTREYGAYQGWGDIYLDFDRESAK------EDATI 188
Query: 130 LNTATARVKYSVGNVEFTR-------EHFSSNPDQVIV 160
++ + ++KY G E+ + EH++ NP ++ +
Sbjct: 189 ISDKSDKIKYGQGWGEWPQPTWEAGSEHYAMNPARLEI 226
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.316 0.133 0.407
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 13,688,275,876
Number of Sequences: 23463169
Number of extensions: 602940244
Number of successful extensions: 1377937
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 1339
Number of HSP's successfully gapped in prelim test: 94
Number of HSP's that attempted gapping in prelim test: 1366170
Number of HSP's gapped (non-prelim): 1907
length of query: 810
length of database: 8,064,228,071
effective HSP length: 151
effective length of query: 659
effective length of database: 8,816,256,848
effective search space: 5809913262832
effective search space used: 5809913262832
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.6 bits)
S2: 81 (35.8 bits)