BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 014889
(416 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|359473865|ref|XP_002275105.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Vitis vinifera]
gi|296085565|emb|CBI29297.3| unnamed protein product [Vitis vinifera]
Length = 444
Score = 542 bits (1396), Expect = e-151, Method: Compositional matrix adjust.
Identities = 261/392 (66%), Positives = 310/392 (79%), Gaps = 3/392 (0%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
M Y + D D LV+ + + A+ +E L P + S + +RRL+SLD
Sbjct: 1 MGMYETVRRD-EDPLVLDANTPENLSAD--VESSLLNSPRSDGSGRGGGNASKRRLVSLD 57
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
VFRGLTVA+MILVDD GGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL YKN
Sbjct: 58 VFRGLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALAYKNLSSGY 117
Query: 121 VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
+AT+ A++RAL L + G+FLQGG+FHG+NNL YGVDI QIR G+LQRIA+AY +AA+CE
Sbjct: 118 LATKMAVVRALKLLVFGLFLQGGYFHGLNNLTYGVDIEQIRLAGILQRIAVAYFLAAVCE 177
Query: 181 IWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
IWLKGD +V S SL +KY+ W V LVLT Y LLYGLYVPDW+Y P ETSSS+ I
Sbjct: 178 IWLKGDSNVKSGSSLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSIPSETSSSALKI 237
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
F V CGVR TGPACNAVGMIDR +LGIQHLY++PIY+R KQCSINSPDYGP+P +AP+W
Sbjct: 238 FKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPDYGPLPPNAPTW 297
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV 360
CQAPFDPEGLLSSVMA VTCL+GLH+GH+IVHFKDH+DR+L+WI+ SSCL+ LG +LDF
Sbjct: 298 CQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSCLLVLGFALDFF 357
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
GMH+NKALY+LSY C+TAGA+G+L AGIY MV
Sbjct: 358 GMHVNKALYTLSYMCVTAGAAGILFAGIYLMV 389
>gi|255543288|ref|XP_002512707.1| conserved hypothetical protein [Ricinus communis]
gi|223548668|gb|EEF50159.1| conserved hypothetical protein [Ricinus communis]
Length = 426
Score = 539 bits (1389), Expect = e-151, Method: Compositional matrix adjust.
Identities = 256/339 (75%), Positives = 287/339 (84%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
H RL+SLDVFRGLTVALMILVD GGILPAINHSPWNGLTLAD VMPFFLFIVGVSL
Sbjct: 47 HPHHRLLSLDVFRGLTVALMILVDYAGGILPAINHSPWNGLTLADLVMPFFLFIVGVSLG 106
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
LTYK PCK VATRKAILR L L LG FLQGG+ HG+N+L YGV++ ++R MG+LQRIA
Sbjct: 107 LTYKKLPCKAVATRKAILRTLKLLTLGFFLQGGYLHGLNDLTYGVNVEKLRLMGILQRIA 166
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
IAYLV ALCEIWLKGD HV S SL RKYR W +ALVL + YL L+YGLYVPDW+Y+ P
Sbjct: 167 IAYLVGALCEIWLKGDDHVDSCSSLLRKYRFQWAMALVLISTYLSLIYGLYVPDWEYQIP 226
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
E SSS IF V CGVRG+TGPACNAVG+IDR LGIQHLY KP+Y+RTK CSINSPDY
Sbjct: 227 AEASSSPAKIFLVKCGVRGNTGPACNAVGLIDRTTLGIQHLYGKPVYARTKLCSINSPDY 286
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
GP+P DAPSWCQAPFDPEG+LSSVMA VTCLIGLH+GH+IVHFKDHR+RML+W+I S CL
Sbjct: 287 GPLPADAPSWCQAPFDPEGILSSVMAVVTCLIGLHYGHIIVHFKDHRNRMLHWMIPSICL 346
Query: 351 IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
IGLGL+LDF+GMH+NKALYS SY +TAGA+G+L GIY
Sbjct: 347 IGLGLALDFLGMHVNKALYSFSYMSVTAGAAGILFTGIY 385
>gi|224057870|ref|XP_002299365.1| predicted protein [Populus trichocarpa]
gi|222846623|gb|EEE84170.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 519 bits (1336), Expect = e-144, Method: Compositional matrix adjust.
Identities = 247/338 (73%), Positives = 287/338 (84%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
RL+SLDVFRGLTVALMILVDD GG+LPAINHSPWNGLTLAD VMPFFLFIVGVSL LTYK
Sbjct: 1 RLVSLDVFRGLTVALMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFIVGVSLGLTYK 60
Query: 115 NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYL 174
CK VATRKAILR L L ++G+FLQGGF HG+N+L YGVD+ QIRWMG+LQRIAI YL
Sbjct: 61 KLSCKAVATRKAILRTLKLLIIGLFLQGGFLHGLNDLTYGVDMTQIRWMGILQRIAIGYL 120
Query: 175 VAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETS 234
V A+CEIWLKG HV+S LS+ RKY+ W L+ T+YL LLYGL+VPDW+Y+ PV S
Sbjct: 121 VGAMCEIWLKGGNHVTSGLSMLRKYQFQWAAVLMFVTIYLSLLYGLHVPDWEYQIPVAAS 180
Query: 235 SSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMP 294
+S+P IF V CGVRG TGPACNA GMIDR ILGIQHLYRKPIY+RTK CSINSP YGP+P
Sbjct: 181 ASTPKIFPVKCGVRGHTGPACNAGGMIDRTILGIQHLYRKPIYARTKPCSINSPGYGPLP 240
Query: 295 LDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLG 354
DAPSWCQAPFDPEGLLSSVMA VTCL+GLH+GH+IVHFK+H+DR L+W++ S+C + LG
Sbjct: 241 PDAPSWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRTLHWMVPSTCFLVLG 300
Query: 355 LSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L LD +GMH+NKALY+ SY C+TAGA+G++ GIY +V
Sbjct: 301 LVLDLLGMHVNKALYTFSYMCVTAGAAGIVFTGIYLLV 338
>gi|147844298|emb|CAN82113.1| hypothetical protein VITISV_031338 [Vitis vinifera]
Length = 401
Score = 516 bits (1330), Expect = e-144, Method: Compositional matrix adjust.
Identities = 256/401 (63%), Positives = 304/401 (75%), Gaps = 18/401 (4%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
M Y + D D LV+ + + A+ +E L P + S + +RRL+SLD
Sbjct: 1 MGMYETVRRD-EDPLVLDANTPENLSAD--VESSLLNSPRSDGSGRGGGNASKRRLVSLD 57
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
VFRGLTVA+MILVDD GGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL YKN
Sbjct: 58 VFRGLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALAYKNLSSGY 117
Query: 121 VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
+AT+ A GG+FHG+NNL YGVDI QIR G+LQRIA+AY +AA+CE
Sbjct: 118 LATKMA--------------SGGYFHGLNNLTYGVDIEQIRLAGILQRIAVAYFLAAVCE 163
Query: 181 IWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
IWLKGD +V S SL +KY+ W V LVLT Y LLYGLYVPDW+Y P ETSSS+ I
Sbjct: 164 IWLKGDXNVKSGSSLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSIPSETSSSALKI 223
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
F V CGVR TGPACNAVGMIDR +LGIQHLY++PIY+R KQCSINSPDYGP+P +AP+W
Sbjct: 224 FKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPDYGPLPPNAPTW 283
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV 360
CQAPFDPEGLLSSVMA VTCL+GLH+GH+IVHFKDH+DR+L+WI+ SSCL+ LG +LDF
Sbjct: 284 CQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSCLLVLGFALDFF 343
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLML 401
GMH+NKALY+LSY C+TAGA+G+L AGIY MV + +HL+L
Sbjct: 344 GMHVNKALYTLSYMCVTAGAAGILFAGIYLMVGPL-THLIL 383
>gi|449446789|ref|XP_004141153.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 494
Score = 508 bits (1307), Expect = e-141, Method: Compositional matrix adjust.
Identities = 255/380 (67%), Positives = 305/380 (80%), Gaps = 5/380 (1%)
Query: 15 LVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVD 74
L I + + P + K++ S+S R Q RL+SLDVFRG+TVALMI+VD
Sbjct: 63 LPIHNANPLTTPVSSKIDEPQF---SSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVD 119
Query: 75 DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLF 134
GG++PAINHSPW+GLTLAD VMPFFLFIVGVSLAL YK P + +AT+KA+LR L L
Sbjct: 120 YAGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLL 179
Query: 135 LLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLS 194
LG+FLQGGF HG+NNL YGVDI QIRWMG+LQRIAIAY +AALCEIWLKG +V+S+ +
Sbjct: 180 FLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYVNSETA 239
Query: 195 LFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSS--SSPWIFNVTCGVRGSTG 252
L RKY+ V A+VLT LYL L YGLYVPDW+Y+ P T+S +SP IF+V CG RG TG
Sbjct: 240 LRRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVKCGTRGDTG 299
Query: 253 PACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLS 312
PACNAVGMIDRKI GIQHLY++PIY+RT+QCSIN+PDYGP+P DAPSWCQAPFDPEGLLS
Sbjct: 300 PACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLS 359
Query: 313 SVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLS 372
+VMA VTCL+GLH+GH+IVHFKDHRDRML+WII SSCLI L + LDF+GMH+NK LY++S
Sbjct: 360 TVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKVLYTVS 419
Query: 373 YTCLTAGASGVLLAGIYFMV 392
Y +TAGA+G+L GIY MV
Sbjct: 420 YMSVTAGAAGLLFTGIYLMV 439
>gi|388508176|gb|AFK42154.1| unknown [Lotus japonicus]
Length = 467
Score = 496 bits (1278), Expect = e-138, Method: Compositional matrix adjust.
Identities = 247/345 (71%), Positives = 287/345 (83%), Gaps = 1/345 (0%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGV 107
+PQ Q +RL+S+DVFRGLTVALMILVDD GG+LPA+NHSPW+GLT+ADFVMP FLFIVG+
Sbjct: 69 KPQSQSQRLVSIDVFRGLTVALMILVDDAGGLLPALNHSPWDGLTIADFVMPLFLFIVGL 128
Query: 108 SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
SLALTYK C V+ATRKAILRAL L LG+FLQGG+FH IN+L +GVD+ QIR MG+LQ
Sbjct: 129 SLALTYKKLSCPVIATRKAILRALKLLALGLFLQGGYFHRINDLTFGVDMKQIRLMGILQ 188
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIAIAYL+ ALCEIWLK D V S SL RKYR W VA VL+ YL LLYGLYVPDW+Y
Sbjct: 189 RIAIAYLLTALCEIWLKCDDIVKSGSSLLRKYRYQWAVAFVLSGFYLCLLYGLYVPDWEY 248
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
+ P + SSS P F+V CGV TGPACN VGMIDRKILGIQHLYR+PIY+R +CSINS
Sbjct: 249 QIPTD-SSSVPKTFSVKCGVWADTGPACNVVGMIDRKILGIQHLYRRPIYARMPECSINS 307
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
PDYGP+P DAP+WCQAPFDPEGLLSSVMA VTCLIGLH+GH+IVH+KDHR R+++W+I +
Sbjct: 308 PDYGPLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHYKDHRVRIIHWMIPT 367
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
SCLI G +L GMH+NK LYS SYTC+TAGA+G+LL IY MV
Sbjct: 368 SCLIVFGFALHLFGMHVNKVLYSFSYTCVTAGAAGILLVAIYLMV 412
>gi|356527477|ref|XP_003532336.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 463
Score = 496 bits (1277), Expect = e-138, Method: Compositional matrix adjust.
Identities = 259/384 (67%), Positives = 305/384 (79%), Gaps = 10/384 (2%)
Query: 18 QISKSTSAPANEKLERDPLLPPSNSN---------SKQTRPQHQQRRLISLDVFRGLTVA 68
QIS+S SA + + + L +N Q +PQ + RL+SLDVFRGLTVA
Sbjct: 26 QISESRSATVSPTIAQTTPLHLHINNIIEEQHIIARHQPQPQPKSPRLVSLDVFRGLTVA 85
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
LMILVDD GG++PA+NHSPWNGLTLAD+VMPFFLFIVGVSLALTYK C V A+RKA L
Sbjct: 86 LMILVDDAGGLIPALNHSPWNGLTLADYVMPFFLFIVGVSLALTYKKLSCGVDASRKASL 145
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
RAL L +LG+FLQGG+FH +N+L YGVD+ QIRWMG+LQRI +AYLVAALCEIWLK D
Sbjct: 146 RALKLLVLGLFLQGGYFHRVNDLTYGVDLKQIRWMGILQRIGVAYLVAALCEIWLKSDDT 205
Query: 189 VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVR 248
V+S SL RKYR W VAL+L+ LYL LLYGLYVPDW Y+ E SS P F+V CGVR
Sbjct: 206 VNSGPSLLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQTE-PSSEPKTFSVKCGVR 264
Query: 249 GSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPE 308
G+TGPACNAVGMIDR ILGI HLY++PIY+R +CSINSP+YGP+P DAP+WCQAPFDPE
Sbjct: 265 GNTGPACNAVGMIDRTILGIHHLYQRPIYARMPECSINSPNYGPLPPDAPAWCQAPFDPE 324
Query: 309 GLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKAL 368
GLLSSVMA VTCLIGLH+GH+IVHFKDHR R++ W+I +SCL+ GL+LD GMH+NK L
Sbjct: 325 GLLSSVMAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLVVFGLALDLFGMHINKVL 384
Query: 369 YSLSYTCLTAGASGVLLAGIYFMV 392
YSLSYTC+TAGA+G+L GIY MV
Sbjct: 385 YSLSYTCVTAGAAGILFVGIYLMV 408
>gi|224072443|ref|XP_002303734.1| predicted protein [Populus trichocarpa]
gi|222841166|gb|EEE78713.1| predicted protein [Populus trichocarpa]
Length = 381
Score = 494 bits (1273), Expect = e-137, Method: Compositional matrix adjust.
Identities = 238/324 (73%), Positives = 278/324 (85%)
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
LMILVDD GG+LPAINHSPWNGLTLAD VMPFFLF+VGVSL LTYK P K VATRKAIL
Sbjct: 3 LMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFMVGVSLGLTYKKLPSKAVATRKAIL 62
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
RAL L ++G+FLQGGF HG+N+L +GVD+ QIRWMG+LQRIAI YL+ A+CEIWLKGD H
Sbjct: 63 RALKLLVIGLFLQGGFLHGLNDLTFGVDMVQIRWMGILQRIAIGYLIGAMCEIWLKGDNH 122
Query: 189 VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVR 248
V+S LS+ RKY+ W +VL +LYL LLYGLYVPDW+YE PV SSSSP IF V CGVR
Sbjct: 123 VASGLSMLRKYQLQWGAVVVLVSLYLSLLYGLYVPDWEYEIPVAASSSSPKIFRVKCGVR 182
Query: 249 GSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPE 308
G+TG ACNAVGMIDR +LGIQHLYRKPIY+RTK CSINSPDYGP+P DAPSWCQAPFDPE
Sbjct: 183 GTTGSACNAVGMIDRTVLGIQHLYRKPIYARTKACSINSPDYGPLPPDAPSWCQAPFDPE 242
Query: 309 GLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKAL 368
GLLSSVMA VTCL+GLH+GH+IVHFK+H+DR+L+W++ S+C + LGL LD GMH+NKAL
Sbjct: 243 GLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRILHWMVPSTCFVVLGLVLDLSGMHVNKAL 302
Query: 369 YSLSYTCLTAGASGVLLAGIYFMV 392
Y+ SY C+TAGA+G++ GIY +V
Sbjct: 303 YTFSYMCVTAGAAGIVFTGIYMLV 326
>gi|186530230|ref|NP_199601.2| uncharacterized protein [Arabidopsis thaliana]
gi|332008203|gb|AED95586.1| uncharacterized protein [Arabidopsis thaliana]
Length = 440
Score = 491 bits (1263), Expect = e-136, Method: Compositional matrix adjust.
Identities = 254/388 (65%), Positives = 304/388 (78%), Gaps = 18/388 (4%)
Query: 8 NNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
+ND ND Q + + ++ R LPP + RL+SLDVFRGLTV
Sbjct: 13 DNDDNDH---QWREKKDIESALQISRSSSLPP------------DKERLVSLDVFRGLTV 57
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI 127
A MILVDDVGGILP+INHSPW+G+TLADFVMPFFLFIVGVSLA YKN C+ VATRKA+
Sbjct: 58 AFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRKAL 117
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
+R+L L LLG+FLQGGF HG+NNL YG+D+ +IR MG+LQRIAIAYLV ALCEIWLKG+
Sbjct: 118 IRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCEIWLKGNH 177
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF---NVT 244
+VSS+LS+ +KYR HWVVA V+TT+YL LLYGLYVPDW+Y+ E S+ F V
Sbjct: 178 NVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFLNLKVK 237
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
CGVRG TGP CNAVGM+DR LGIQHLYRKP+Y+RTKQCSIN P+ GP+P DAPSWCQAP
Sbjct: 238 CGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSWCQAP 297
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHL 364
FDPEGLLSS+MATVTCL+GLH+GH+I+HFKDH+ R+ WI+ S CL+ LGL+L+ GMHL
Sbjct: 298 FDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRSFCLLMLGLALNLFGMHL 357
Query: 365 NKALYSLSYTCLTAGASGVLLAGIYFMV 392
NK LY+LSY C+T+GASG LL+ IY MV
Sbjct: 358 NKPLYTLSYMCVTSGASGFLLSAIYLMV 385
>gi|356569086|ref|XP_003552737.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 461
Score = 488 bits (1256), Expect = e-135, Method: Compositional matrix adjust.
Identities = 258/380 (67%), Positives = 304/380 (80%), Gaps = 6/380 (1%)
Query: 18 QISKSTSAPANEKLERDPLLPPSNSNSKQT-----RPQHQQRRLISLDVFRGLTVALMIL 72
QIS+S SA + + + L N +Q +PQ + RL+SLDVFRGLTVALMIL
Sbjct: 28 QISESRSATVSSPIGQTTPLHIHNIIEEQRIISRHQPQPKSPRLVSLDVFRGLTVALMIL 87
Query: 73 VDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALN 132
VDD GG++PA+NHSPWNGLTLAD+VMPFFLFIVGVSLAL+YK C V A+RKA LRAL
Sbjct: 88 VDDAGGLIPALNHSPWNGLTLADYVMPFFLFIVGVSLALSYKKLSCGVDASRKASLRALK 147
Query: 133 LFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSK 192
L LG+FLQGG+FH +N+L +GVDI QIRWMG+LQRIA+AYLV ALCEIWLK D V+S
Sbjct: 148 LLALGLFLQGGYFHRVNDLTFGVDIKQIRWMGILQRIAVAYLVVALCEIWLKSDDTVNSG 207
Query: 193 LSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG 252
SL RKYR W VAL+L+ LYL LLYGLYVPDW Y+ E S+ P F+V CGVRG+TG
Sbjct: 208 PSLLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQTE-PSAEPKTFSVKCGVRGNTG 266
Query: 253 PACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLS 312
PACN VGMIDR ILGIQHLY++PIY+R +CSINSP+YGP+P DAP+WCQAPFDPEGLLS
Sbjct: 267 PACNVVGMIDRMILGIQHLYKRPIYARMPECSINSPNYGPLPPDAPAWCQAPFDPEGLLS 326
Query: 313 SVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLS 372
SVMA VTCLIGLH+GH+IVHFKDHR R++ W+I +SCL+ GL+LD GMH+NK LYSLS
Sbjct: 327 SVMAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLLVFGLALDLFGMHINKVLYSLS 386
Query: 373 YTCLTAGASGVLLAGIYFMV 392
YTC+TAGA+GVL GIY MV
Sbjct: 387 YTCVTAGAAGVLFVGIYLMV 406
>gi|242065256|ref|XP_002453917.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
gi|241933748|gb|EES06893.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
Length = 439
Score = 485 bits (1248), Expect = e-134, Method: Compositional matrix adjust.
Identities = 232/392 (59%), Positives = 290/392 (73%), Gaps = 8/392 (2%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
M +Y + +D A A++ P P + S P +Q RL+SLD
Sbjct: 1 MGRYELVRSDDAAAGATGTDLECGASASKASTTSPAAPSTTS------PAARQPRLVSLD 54
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
VFRG+TV LMI+VDD GG LP++NHSPW+G+T+ADFVMPFFLFIVGVSL L YK P K+
Sbjct: 55 VFRGITVLLMIIVDDAGGFLPSLNHSPWDGVTIADFVMPFFLFIVGVSLTLAYKRVPDKL 114
Query: 121 VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
AT+KA+LRAL LF LG+ LQGGFFHG+++L +GVD+ +IR MG+LQRIAIAYL+AA+CE
Sbjct: 115 EATKKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAIAYLLAAICE 174
Query: 181 IWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
IWLKGD V S L R+YR V LVL+ Y +LLYG+YVPDW+Y+ + S+
Sbjct: 175 IWLKGDDDVDSGYGLLRRYRYQLFVGLVLSIAYTILLYGIYVPDWEYK--ISGPGSTEKS 232
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
F+V CGVRG TGPACNAVGM+DR ILGI HLYR+P+Y+RTK+CSIN P+ GP+P DAPSW
Sbjct: 233 FSVKCGVRGDTGPACNAVGMVDRTILGIDHLYRRPVYARTKECSINYPENGPLPPDAPSW 292
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV 360
CQAPFDPEGLLSSVMA VTCLIGL FGH+I+HF+ HR R+ NW+I S ++ L +DF
Sbjct: 293 CQAPFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLIPSFSMLALAFLMDFS 352
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
GM +NK LY++SYT TAGA+G+L AGIY +V
Sbjct: 353 GMRMNKPLYTISYTLATAGAAGLLFAGIYALV 384
>gi|297791891|ref|XP_002863830.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
lyrata]
gi|297309665|gb|EFH40089.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
lyrata]
Length = 432
Score = 484 bits (1246), Expect = e-134, Method: Compositional matrix adjust.
Identities = 254/392 (64%), Positives = 307/392 (78%), Gaps = 19/392 (4%)
Query: 4 YNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFR 63
Y I + +DQ Q + + ++ R P LPP + RL+SLDVFR
Sbjct: 2 YELIKSCDDDQ---QWREKKDIESALQISR-PSLPP------------DKERLVSLDVFR 45
Query: 64 GLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVAT 123
GLTVALMILVDDVG ILP+INHSPW+G+TLADFVMPFFLFIVGVSLA YKN C+ VAT
Sbjct: 46 GLTVALMILVDDVGEILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVAT 105
Query: 124 RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL 183
RKA++R+L L LLG+FLQGGF HG+NNL YG+D+ +IR+MG+LQRIAIAYLVAALCEIWL
Sbjct: 106 RKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRFMGILQRIAIAYLVAALCEIWL 165
Query: 184 KGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF-- 241
KG+ +VSS+LS+ +KYR HWVVA V+TT+YL LLYGLYV DW+Y+ E S+ F
Sbjct: 166 KGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVSDWEYQISTEDQGSTLTTFLN 225
Query: 242 -NVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
V CGVRG TGP CNAVGM+DR LGIQHLYRKP+Y+RTKQCSI+SP+ GP+P DAPSW
Sbjct: 226 LKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSISSPNNGPLPPDAPSW 285
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV 360
CQAPFDPEGLLSS+MA VTCL+GLH+GH+I+HFKDH+ R+ WI+ S CL+ LGL+L+
Sbjct: 286 CQAPFDPEGLLSSLMAIVTCLVGLHYGHIIIHFKDHKKRLNQWILRSFCLLMLGLALNLF 345
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
GMHLNK LY+LSY C+T+GASG LL+ IY MV
Sbjct: 346 GMHLNKPLYTLSYMCVTSGASGFLLSAIYLMV 377
>gi|219885579|gb|ACL53164.1| unknown [Zea mays]
gi|413937084|gb|AFW71635.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
Length = 482
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 223/341 (65%), Positives = 277/341 (81%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
+Q+RL SLDVFRG+TV LMI+VDD GG LPA+NHSPW+G+T+ADF+MPFFLFIVGVSL L
Sbjct: 87 RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 146
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
YK P +V ATRKA+LRAL LF LG+ LQGGFFHG+++L +GVD+ +IR MG+LQRIAI
Sbjct: 147 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 206
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
AYL+AA+CEIWLKGD V S L R+YR V LVL+ Y +LLYG+YVPDW+Y+
Sbjct: 207 AYLLAAVCEIWLKGDDDVDSGYGLLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQIAG 266
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
SSS+ F+V CGVRG TGPACNAVGM+DR +LGI HLYR+P+Y+RTK+CSI+ P+ G
Sbjct: 267 PGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYPENG 326
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
P+P DAPSWCQAPFDPEGLLSSVMA VTCLIGL FGH+I+HF+ HR R+ +W++ S ++
Sbjct: 327 PLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSFSML 386
Query: 352 GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L +DFVGM +NK LY++SYT TAGA+G+L AGIY +V
Sbjct: 387 ALAFVMDFVGMRMNKPLYTMSYTLATAGAAGLLFAGIYALV 427
>gi|226509496|ref|NP_001144452.1| uncharacterized protein LOC100277415 [Zea mays]
gi|195642330|gb|ACG40633.1| hypothetical protein [Zea mays]
Length = 441
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 223/341 (65%), Positives = 277/341 (81%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
+Q+RL SLDVFRG+TV LMI+VDD GG LPA+NHSPW+G+T+ADF+MPFFLFIVGVSL L
Sbjct: 46 RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 105
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
YK P +V ATRKA+LRAL LF LG+ LQGGFFHG+++L +GVD+ +IR MG+LQRIAI
Sbjct: 106 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
AYL+AA+CEIWLKGD V S L R+YR V LVL+ Y +LLYG+YVPDW+Y+
Sbjct: 166 AYLLAAVCEIWLKGDDDVDSGYGLLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQIAG 225
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
SSS+ F+V CGVRG TGPACNAVGM+DR +LGI HLYR+P+Y+RTK+CSI+ P+ G
Sbjct: 226 PGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYPENG 285
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
P+P DAPSWCQAPFDPEGLLSSVMA VTCLIGL FGH+I+HF+ HR R+ +W++ S ++
Sbjct: 286 PLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRITSWLVPSFSML 345
Query: 352 GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L +DFVGM +NK LY++SYT TAGA+G+L AGIY +V
Sbjct: 346 ALAFVMDFVGMRMNKPLYTMSYTLATAGAAGLLFAGIYALV 386
>gi|413937082|gb|AFW71633.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
Length = 441
Score = 481 bits (1239), Expect = e-133, Method: Compositional matrix adjust.
Identities = 223/341 (65%), Positives = 277/341 (81%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
+Q+RL SLDVFRG+TV LMI+VDD GG LPA+NHSPW+G+T+ADF+MPFFLFIVGVSL L
Sbjct: 46 RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 105
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
YK P +V ATRKA+LRAL LF LG+ LQGGFFHG+++L +GVD+ +IR MG+LQRIAI
Sbjct: 106 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
AYL+AA+CEIWLKGD V S L R+YR V LVL+ Y +LLYG+YVPDW+Y+
Sbjct: 166 AYLLAAVCEIWLKGDDDVDSGYGLLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQIAG 225
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
SSS+ F+V CGVRG TGPACNAVGM+DR +LGI HLYR+P+Y+RTK+CSI+ P+ G
Sbjct: 226 PGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYPENG 285
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
P+P DAPSWCQAPFDPEGLLSSVMA VTCLIGL FGH+I+HF+ HR R+ +W++ S ++
Sbjct: 286 PLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSFSML 345
Query: 352 GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L +DFVGM +NK LY++SYT TAGA+G+L AGIY +V
Sbjct: 346 ALAFVMDFVGMRMNKPLYTMSYTLATAGAAGLLFAGIYALV 386
>gi|449528551|ref|XP_004171267.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 380
Score = 475 bits (1222), Expect = e-131, Method: Compositional matrix adjust.
Identities = 234/325 (72%), Positives = 276/325 (84%), Gaps = 2/325 (0%)
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILR 129
MI+VD GG++PAINHSPW+GLTLAD VMPFFLFIVGVSLAL YK P + +AT+KA+LR
Sbjct: 1 MIVVDYAGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLR 60
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHV 189
L L LG+FLQGGF HG+NNL YGVDI QIRWMG+LQRIAIAY +AALCEIWLKG +V
Sbjct: 61 TLKLLFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLKGSDYV 120
Query: 190 SSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSS--SSPWIFNVTCGV 247
+S+ +L RKY+ V A+VLT LYL L YGLYVPDW+Y+ P T+S +SP IF+V CG
Sbjct: 121 NSETALRRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVKCGT 180
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP 307
RG TGPACNAVGMIDRKI GIQHLY++PIY+RT+QCSIN+PDYGP+P DAPSWCQAPFDP
Sbjct: 181 RGDTGPACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDP 240
Query: 308 EGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKA 367
EGLLS+VMA VTCL+GLH+GH+IVHFKDHRDRML+WII SSCLI L + LDF+GMH+NK
Sbjct: 241 EGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDFLGMHINKV 300
Query: 368 LYSLSYTCLTAGASGVLLAGIYFMV 392
LY++SY +TAGA+G+L GIY MV
Sbjct: 301 LYTVSYMSVTAGAAGLLFTGIYLMV 325
>gi|334188248|ref|NP_001190487.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008209|gb|AED95592.1| uncharacterized protein [Arabidopsis thaliana]
Length = 435
Score = 474 bits (1220), Expect = e-131, Method: Compositional matrix adjust.
Identities = 254/425 (59%), Positives = 304/425 (71%), Gaps = 55/425 (12%)
Query: 8 NNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
+ND ND Q + + ++ R LPP + RL+SLDVFRGLTV
Sbjct: 13 DNDDNDH---QWREKKDIESALQISRSSSLPP------------DKERLVSLDVFRGLTV 57
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI 127
A MILVDDVGGILP+INHSPW+G+TLADFVMPFFLFIVGVSLA YKN C+ VATRKA+
Sbjct: 58 AFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRKAL 117
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
+R+L L LLG+FLQGGF HG+NNL YG+D+ +IR MG+LQRIAIAYLV ALCEIWLKG+
Sbjct: 118 IRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCEIWLKGNH 177
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF---NVT 244
+VSS+LS+ +KYR HWVVA V+TT+YL LLYGLYVPDW+Y+ E S+ F V
Sbjct: 178 NVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFLNLKVK 237
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
CGVRG TGP CNAVGM+DR LGIQHLYRKP+Y+RTKQCSIN P+ GP+P DAPSWCQAP
Sbjct: 238 CGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSWCQAP 297
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFK------------------------------ 334
FDPEGLLSS+MATVTCL+GLH+GH+I+HFK
Sbjct: 298 FDPEGLLSSLMATVTCLVGLHYGHIIIHFKRNGSKGQVYNEPSISIRPFFFILSETYLLL 357
Query: 335 -------DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAG 387
DH+ R+ WI+ S CL+ LGL+L+ GMHLNK LY+LSY C+T+GASG LL+
Sbjct: 358 YVINFLQDHKKRLNQWILRSFCLLMLGLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSA 417
Query: 388 IYFMV 392
IY MV
Sbjct: 418 IYLMV 422
>gi|242075654|ref|XP_002447763.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
gi|241938946|gb|EES12091.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
Length = 446
Score = 471 bits (1213), Expect = e-130, Method: Compositional matrix adjust.
Identities = 232/397 (58%), Positives = 288/397 (72%), Gaps = 11/397 (2%)
Query: 1 MAKYNPINNDANDQLVIQI---SKSTSAPANEKL--ERDPLLPPSNSNSKQTRPQHQQRR 55
M Y + +D I + + T+AP + R PS TRPQ R
Sbjct: 1 MGGYELVRSDDAPAAAIAVDLEAGGTTAPCGDDYPKRRRGSSTPSPPAPASTRPQ----R 56
Query: 56 LISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKN 115
L+SLDVFRG+TV LMI+VDD G +PA+NHSPW+G+T+ADFVMPFFLFIVGV+LAL YK
Sbjct: 57 LVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKR 116
Query: 116 FPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLV 175
P K+ AT+KA+LRAL LF LG+ LQGGFFHG+ +L +GVD+ +IR MG+LQRIAIAYL+
Sbjct: 117 VPDKLDATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLL 176
Query: 176 AALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSS 235
ALCEIWLKGD V L ++YR +V V+ Y+ LLYG YVPDW+Y+
Sbjct: 177 TALCEIWLKGDEDVDYGYDLLKRYRYQLLVGAVVAITYMCLLYGTYVPDWEYQ--TSGPG 234
Query: 236 SSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
S F V CGVRG T P CNAVGMIDRKILGIQHLY +P+Y+R+KQCSI+SP GP+P
Sbjct: 235 SIEKSFFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNGPLPP 294
Query: 296 DAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGL 355
DAPSWCQAPFDPEGLLSSVMA VTCLIGL +GH+IVHF+ HR+R++NW+I S ++ L
Sbjct: 295 DAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSFSMLVLAF 354
Query: 356 SLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++DF G+H+NK LYSLSYT TAGA+G+L +GIY +V
Sbjct: 355 AMDFFGLHMNKPLYSLSYTLATAGAAGLLFSGIYTLV 391
>gi|224033113|gb|ACN35632.1| unknown [Zea mays]
gi|413918233|gb|AFW58165.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
Length = 444
Score = 468 bits (1204), Expect = e-129, Method: Compositional matrix adjust.
Identities = 225/392 (57%), Positives = 289/392 (73%), Gaps = 3/392 (0%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
M +Y + +D + + T+AP + +R P + + +++QR L+SLD
Sbjct: 1 MGRYELVRSDDAPAVAADLEAGTAAPCDGYPKRRVFTPSPSPSPAPASTRYRQR-LVSLD 59
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
VFRG+TV LMI+VDD G +PA+NHSPW+G+T+ADFVMPFFLFIVGV+LAL YK P K+
Sbjct: 60 VFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKL 119
Query: 121 VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
A+RKA+LRAL LF LG+ LQGGFFHG+ +L +GVD+ +IR MGVLQRIAIAYL+ ALCE
Sbjct: 120 DASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAYLLTALCE 179
Query: 181 IWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
IW++GD V L ++YR V V+ Y+ LLYG YVPDW+Y+ S+
Sbjct: 180 IWIRGDEDVDYGYDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTSAPGSTEKHLF 239
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
V CGVRG T P CNAVGMIDRKILGIQHLY +P+Y+R+KQCSI+SP GP+P DAPSW
Sbjct: 240 --VKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNGPLPSDAPSW 297
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV 360
CQAPFDPEGLLSSVMA VTCLIGL +GH+IVHF+ HR+RM+NW+I S ++ L ++DF
Sbjct: 298 CQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSMLVLAFAMDFF 357
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+H+NK LY+LSYT TAGA+G+L +GIY +V
Sbjct: 358 GLHMNKPLYTLSYTLGTAGAAGLLFSGIYTLV 389
>gi|226494648|ref|NP_001146383.1| uncharacterized protein LOC100279961 [Zea mays]
gi|219886923|gb|ACL53836.1| unknown [Zea mays]
gi|413918231|gb|AFW58163.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
Length = 469
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 225/392 (57%), Positives = 289/392 (73%), Gaps = 3/392 (0%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
M +Y + +D + + T+AP + +R P + + +++QR L+SLD
Sbjct: 26 MGRYELVRSDDAPAVAADLEAGTAAPCDGYPKRRVFTPSPSPSPAPASTRYRQR-LVSLD 84
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
VFRG+TV LMI+VDD G +PA+NHSPW+G+T+ADFVMPFFLFIVGV+LAL YK P K+
Sbjct: 85 VFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKL 144
Query: 121 VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
A+RKA+LRAL LF LG+ LQGGFFHG+ +L +GVD+ +IR MGVLQRIAIAYL+ ALCE
Sbjct: 145 DASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAYLLTALCE 204
Query: 181 IWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
IW++GD V L ++YR V V+ Y+ LLYG YVPDW+Y+ S+
Sbjct: 205 IWIRGDEDVDYGYDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTSAPGSTEKHLF 264
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
V CGVRG T P CNAVGMIDRKILGIQHLY +P+Y+R+KQCSI+SP GP+P DAPSW
Sbjct: 265 --VKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNGPLPSDAPSW 322
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV 360
CQAPFDPEGLLSSVMA VTCLIGL +GH+IVHF+ HR+RM+NW+I S ++ L ++DF
Sbjct: 323 CQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSMLVLAFAMDFF 382
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+H+NK LY+LSYT TAGA+G+L +GIY +V
Sbjct: 383 GLHMNKPLYTLSYTLGTAGAAGLLFSGIYTLV 414
>gi|326505544|dbj|BAJ95443.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 429
Score = 467 bits (1201), Expect = e-129, Method: Compositional matrix adjust.
Identities = 219/339 (64%), Positives = 272/339 (80%), Gaps = 2/339 (0%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY 113
+RL+SLDVFRG+TV LMI+VDD G LPA+NHSPW G+T+ADFVMPFFLFIVGV+LAL Y
Sbjct: 38 QRLVSLDVFRGITVLLMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAY 97
Query: 114 KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAY 173
K P K+ ATRKA LRAL LF +G+ LQGGFFHG+ +L +GVDIAQIR MG+LQRIAIAY
Sbjct: 98 KRVPDKLDATRKATLRALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAY 157
Query: 174 LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVET 233
LV ALC+IWLKGD V S L L ++Y+ + L++T Y+ LLYG YVPDW+Y +
Sbjct: 158 LVTALCQIWLKGDDDVDSGLDLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYR--ISG 215
Query: 234 SSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
+ F V CGVRG +GP CNAVGMIDRKILGIQHLY +P+Y+R++QCSI+SP GP+
Sbjct: 216 PGFTEKTFTVRCGVRGDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQNGPL 275
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
P DAPSWCQAPFDPEGLLSSVMA VTCLIGL +GH+IVHF+ H++R+++W++ S ++ L
Sbjct: 276 PPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGMLVL 335
Query: 354 GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++DF GMH+NK LY+LSYT TAGA+G+L AGIY +V
Sbjct: 336 AFAMDFFGMHMNKPLYTLSYTLCTAGAAGLLFAGIYTLV 374
>gi|219886509|gb|ACL53629.1| unknown [Zea mays]
gi|414587417|tpg|DAA37988.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
Length = 438
Score = 466 bits (1199), Expect = e-129, Method: Compositional matrix adjust.
Identities = 221/392 (56%), Positives = 286/392 (72%), Gaps = 9/392 (2%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
M +Y + +D + + + T+AP ++ P ++ P ++RL+SLD
Sbjct: 1 MGRYELVRSDDTPAIAVDLEAGTAAPRDD-------YPKRRGSAPSPAPAPTRQRLVSLD 53
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
VFRG+TV LMI+VDD G +PA+NHSPW+G+T+ADFVMPFFLFIVGV+LAL YK P K+
Sbjct: 54 VFRGITVLLMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKL 113
Query: 121 VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
AT+KA+LRAL LF LG+ LQGGFFHG+ +L +GVD+ +IR MG+LQRIAIAYL+ ALCE
Sbjct: 114 DATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCE 173
Query: 181 IWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
IWLKGD V L ++YR V ++ Y+ LLYG YV DW+Y+ S
Sbjct: 174 IWLKGDEDVDYGYDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQ--TSGPGSIEKS 231
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
F V CGVRG T P CNAVGMIDR+ILGIQHLY +P+Y+R+KQCSI+SP GP+P DAPSW
Sbjct: 232 FFVKCGVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSW 291
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV 360
CQAPFDPEGLLSSVMA VTCLIGL +GH+IVHF+ HR+R++NW+I S ++ L ++DF+
Sbjct: 292 CQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSMLVLAFAMDFL 351
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+ +NK LY+LSYT TAGA+G+L GIY +V
Sbjct: 352 GLRMNKPLYTLSYTLATAGAAGLLFCGIYTLV 383
>gi|125582342|gb|EAZ23273.1| hypothetical protein OsJ_06967 [Oryza sativa Japonica Group]
Length = 423
Score = 453 bits (1165), Expect = e-125, Method: Compositional matrix adjust.
Identities = 215/369 (58%), Positives = 276/369 (74%), Gaps = 18/369 (4%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTV----------------ALMILVDDVGGILPAI 83
SNS +K+ R + Q+ + R + V LMI+VDD G LPA+
Sbjct: 2 SNSTNKRKRRKDTQKSRMCAASMRTVLVRSPSSDKILKIFVWLFQLMIIVDDAGAFLPAL 61
Query: 84 NHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGG 143
NHSPW+G+T+ADFVMPFFLF+VG+SL L YK P K+ AT+KA+LRAL LF LG+ LQGG
Sbjct: 62 NHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVLRALKLFCLGLVLQGG 121
Query: 144 FFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHW 203
FFHG+ +L +GVDI +IR MG+LQRIAIAYL+AA+CEIWLKGD V L + R+YR
Sbjct: 122 FFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDDVDCGLDVIRRYRYQL 181
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
VVAL+L+T+Y ++L G+YVPDW+Y+ + S+ F+V CGVRG TGPACNAVGM+DR
Sbjct: 182 VVALLLSTMYTVILNGVYVPDWEYQ--ISGPGSTEKSFSVRCGVRGDTGPACNAVGMLDR 239
Query: 264 KILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
ILGI HLYR+P+Y+RTKQCSIN P GP+P DAPSWCQAPFDPEGLLSSVMA VTCLIG
Sbjct: 240 TILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIG 299
Query: 324 LHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
L FGH+I+HF+ H+ R++NW+I S ++ L S+DF+G+ +NK LY++SY T+GA+G+
Sbjct: 300 LQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFIGIRMNKPLYTISYALATSGAAGL 359
Query: 384 LLAGIYFMV 392
L AGIY +V
Sbjct: 360 LFAGIYTLV 368
>gi|115485801|ref|NP_001068044.1| Os11g0543500 [Oryza sativa Japonica Group]
gi|77551354|gb|ABA94151.1| expressed protein [Oryza sativa Japonica Group]
gi|113645266|dbj|BAF28407.1| Os11g0543500 [Oryza sativa Japonica Group]
gi|125577433|gb|EAZ18655.1| hypothetical protein OsJ_34172 [Oryza sativa Japonica Group]
gi|215701389|dbj|BAG92813.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 448
Score = 452 bits (1162), Expect = e-124, Method: Compositional matrix adjust.
Identities = 225/369 (60%), Positives = 278/369 (75%), Gaps = 10/369 (2%)
Query: 27 ANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHS 86
A++ +E L + + TR +RL+SLDVFRG+TVALMILVDDVGGI+PAI+HS
Sbjct: 32 ADDDVEAGVLGRDGEAAATTTR-----QRLVSLDVFRGITVALMILVDDVGGIVPAISHS 86
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFH 146
PW+G+TLADFV PFFLFIVGVSLA YK P K++AT+KA+LRA+ LF++G+ LQGGFFH
Sbjct: 87 PWDGVTLADFVFPFFLFIVGVSLAFAYKKVPDKMLATKKAMLRAVKLFIVGLILQGGFFH 146
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK---GDGHVSSKLSLFRKYRGHW 203
GI+ L YGVDI +IR MGVLQRIAIAYLV ALCEIWL+ G++ S L +Y
Sbjct: 147 GIHELTYGVDIRKIRLMGVLQRIAIAYLVVALCEIWLRRVSSGGNIGSGSMLITRYHHQM 206
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
V LVL YL++LYGL+VPDW+YE V + S+ F V CGV+G TGP CNAVGMIDR
Sbjct: 207 FVGLVLVVTYLVILYGLHVPDWEYE--VTSPDSTVKHFLVKCGVKGDTGPGCNAVGMIDR 264
Query: 264 KILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
+LGIQHLY P+Y +T+QCS+ SP GP+P +APSWC+APFDPEGLLSS+MA VTCLIG
Sbjct: 265 SVLGIQHLYAHPVYLKTEQCSMASPRNGPLPPNAPSWCEAPFDPEGLLSSLMAIVTCLIG 324
Query: 324 LHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
L GH+IVHFK H +R+ W ILS CL+ LG SL G+H+NK+LYSLSYTC+T G +G+
Sbjct: 325 LQIGHVIVHFKKHNERIKRWSILSLCLLTLGFSLHLFGLHMNKSLYSLSYTCVTTGTAGL 384
Query: 384 LLAGIYFMV 392
IY +V
Sbjct: 385 FFVAIYLLV 393
>gi|218190872|gb|EEC73299.1| hypothetical protein OsI_07466 [Oryza sativa Indica Group]
Length = 454
Score = 449 bits (1156), Expect = e-123, Method: Compositional matrix adjust.
Identities = 207/324 (63%), Positives = 262/324 (80%), Gaps = 2/324 (0%)
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
LMI+VDD G LPA+NHSPW+G+T+ADFVMPFFLF+VG+SL L YK P K+ AT+KA+L
Sbjct: 78 LMIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVL 137
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
RAL LF LG+ LQGGFFHG+ +L +GVDI +IR MG+LQRIAIAYL+AA+CEIWLKGD
Sbjct: 138 RALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDD 197
Query: 189 VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVR 248
V L + R+YR VVAL+L+T+Y ++L G+YVPDW+Y+ + S+ F+V CGVR
Sbjct: 198 VDCGLDVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQ--ISGPGSTEKSFSVRCGVR 255
Query: 249 GSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPE 308
G TGPACNAVGM+DR ILGI HLYR+P+Y+RTKQCSIN P GP+P DAPSWCQAPFDPE
Sbjct: 256 GDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPFDPE 315
Query: 309 GLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKAL 368
GLLSSVMA VTCLIGL FGH+I+HF+ H+ R++NW+I S ++ L S+DF+G+ +NK L
Sbjct: 316 GLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFIGIRMNKPL 375
Query: 369 YSLSYTCLTAGASGVLLAGIYFMV 392
Y++SY T+GA+G+L AGIY +V
Sbjct: 376 YTISYALATSGAAGLLFAGIYTLV 399
>gi|116309454|emb|CAH66526.1| H0502B11.6 [Oryza sativa Indica Group]
gi|218194797|gb|EEC77224.1| hypothetical protein OsI_15768 [Oryza sativa Indica Group]
Length = 448
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 220/340 (64%), Positives = 267/340 (78%), Gaps = 2/340 (0%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112
++RL+SLDVFRG+TV LMILVDD G LPAINHSPW+G+TLADFVMPFFLFIVGV+LAL
Sbjct: 56 RQRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALA 115
Query: 113 YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIA 172
YK P K+ ATRKAILRAL LF +G+ LQGGFFHG+ +L +G+D+ +IR MG+LQRIAIA
Sbjct: 116 YKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIA 175
Query: 173 YLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVE 232
Y+V ALCEIWLKGD V S L ++ R + L++ Y+ LYG YVPDW+Y V
Sbjct: 176 YIVTALCEIWLKGDDDVDSGFDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRISVP 235
Query: 233 TSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGP 292
S+ F V C VRG TGP CNAVGMIDRKILGIQHLY +P+Y+R+KQCSINSP GP
Sbjct: 236 GSTEKS--FFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQNGP 293
Query: 293 MPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIG 352
+ DAPSWCQAPFDPEGLLSSVMA VTCLIGL +GH+IVHF+ H++R++ W+I S ++
Sbjct: 294 LRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSMLI 353
Query: 353 LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L SLDF GMH+NK LY++SY TAGA+G+L AGIY +V
Sbjct: 354 LAFSLDFFGMHMNKPLYTVSYALATAGAAGLLFAGIYALV 393
>gi|115458212|ref|NP_001052706.1| Os04g0404900 [Oryza sativa Japonica Group]
gi|113564277|dbj|BAF14620.1| Os04g0404900 [Oryza sativa Japonica Group]
gi|222628804|gb|EEE60936.1| hypothetical protein OsJ_14685 [Oryza sativa Japonica Group]
Length = 447
Score = 447 bits (1151), Expect = e-123, Method: Compositional matrix adjust.
Identities = 220/340 (64%), Positives = 267/340 (78%), Gaps = 2/340 (0%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112
++RL+SLDVFRG+TV LMILVDD G LPAINHSPW+G+TLADFVMPFFLFIVGV+LAL
Sbjct: 55 RQRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALA 114
Query: 113 YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIA 172
YK P K+ ATRKAILRAL LF +G+ LQGGFFHG+ +L +G+D+ +IR MG+LQRIAIA
Sbjct: 115 YKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIA 174
Query: 173 YLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVE 232
Y+V ALCEIWLKGD V S L ++ R + L++ Y+ LYG YVPDW+Y V
Sbjct: 175 YIVTALCEIWLKGDDDVDSGFDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRISVP 234
Query: 233 TSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGP 292
S+ F V C VRG TGP CNAVGMIDRKILGIQHLY +P+Y+R+KQCSINSP GP
Sbjct: 235 GSTEKS--FFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQNGP 292
Query: 293 MPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIG 352
+ DAPSWCQAPFDPEGLLSSVMA VTCLIGL +GH+IVHF+ H++R++ W+I S ++
Sbjct: 293 LRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSMLI 352
Query: 353 LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L SLDF GMH+NK LY++SY TAGA+G+L AGIY +V
Sbjct: 353 LAFSLDFFGMHMNKPLYTVSYALATAGAAGLLFAGIYALV 392
>gi|115446433|ref|NP_001046996.1| Os02g0526000 [Oryza sativa Japonica Group]
gi|49388281|dbj|BAD25399.1| unknown protein [Oryza sativa Japonica Group]
gi|49388287|dbj|BAD25402.1| unknown protein [Oryza sativa Japonica Group]
gi|113536527|dbj|BAF08910.1| Os02g0526000 [Oryza sativa Japonica Group]
Length = 376
Score = 446 bits (1148), Expect = e-123, Method: Compositional matrix adjust.
Identities = 206/323 (63%), Positives = 261/323 (80%), Gaps = 2/323 (0%)
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILR 129
MI+VDD G LPA+NHSPW+G+T+ADFVMPFFLF+VG+SL L YK P K+ AT+KA+LR
Sbjct: 1 MIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVLR 60
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHV 189
AL LF LG+ LQGGFFHG+ +L +GVDI +IR MG+LQRIAIAYL+AA+CEIWLKGD V
Sbjct: 61 ALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDDV 120
Query: 190 SSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRG 249
L + R+YR VVAL+L+T+Y ++L G+YVPDW+Y+ + S+ F+V CGVRG
Sbjct: 121 DCGLDVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQ--ISGPGSTEKSFSVRCGVRG 178
Query: 250 STGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEG 309
TGPACNAVGM+DR ILGI HLYR+P+Y+RTKQCSIN P GP+P DAPSWCQAPFDPEG
Sbjct: 179 DTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPFDPEG 238
Query: 310 LLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALY 369
LLSSVMA VTCLIGL FGH+I+HF+ H+ R++NW+I S ++ L S+DF+G+ +NK LY
Sbjct: 239 LLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDFIGIRMNKPLY 298
Query: 370 SLSYTCLTAGASGVLLAGIYFMV 392
++SY T+GA+G+L AGIY +V
Sbjct: 299 TISYALATSGAAGLLFAGIYTLV 321
>gi|218185886|gb|EEC68313.1| hypothetical protein OsI_36402 [Oryza sativa Indica Group]
Length = 450
Score = 440 bits (1132), Expect = e-121, Method: Compositional matrix adjust.
Identities = 225/369 (60%), Positives = 278/369 (75%), Gaps = 10/369 (2%)
Query: 27 ANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHS 86
A++ +E L + + TR +RL+SLDVFRG+TVALMILVDDVGGI+PAI+HS
Sbjct: 34 ADDDVEAGVLGRDGEAAATTTR-----QRLVSLDVFRGITVALMILVDDVGGIVPAISHS 88
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFH 146
PW+G+TLADFV PFFLFIVGVSLA YK P K++AT+KA+LRA+ LF++G+ LQGGFFH
Sbjct: 89 PWDGVTLADFVFPFFLFIVGVSLAFAYKKVPDKMLATKKAMLRAVKLFIVGLILQGGFFH 148
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK---GDGHVSSKLSLFRKYRGHW 203
GI+ L YGVDI +IR MGVLQRIAIAYLV ALCEIWL+ G + S L +Y
Sbjct: 149 GIHELTYGVDIRKIRLMGVLQRIAIAYLVVALCEIWLRRVSSGGDIGSGSMLITRYHHQM 208
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
V LVL YL++LYGL+VPDW+YE V + S+ F V CGV+G TGP CNAVGMIDR
Sbjct: 209 FVGLVLVVTYLVILYGLHVPDWEYE--VTSLDSTVKHFLVKCGVKGDTGPGCNAVGMIDR 266
Query: 264 KILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
+LGIQHLY P+Y +T+QCS++SP GP+P +APSWC+APFDPEGLLSS+MA VTCLIG
Sbjct: 267 SVLGIQHLYAHPVYLKTEQCSMDSPRNGPLPPNAPSWCEAPFDPEGLLSSLMAIVTCLIG 326
Query: 324 LHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
L GH+IVHFK H +R+ W LS CL+ LG SL G+H+NK+LYSLSYTC+TAG +G+
Sbjct: 327 LQIGHVIVHFKKHNERIKRWSTLSLCLLTLGFSLHLFGLHMNKSLYSLSYTCVTAGTAGL 386
Query: 384 LLAGIYFMV 392
IY +V
Sbjct: 387 FFVAIYLLV 395
>gi|186530235|ref|NP_001119392.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008204|gb|AED95587.1| uncharacterized protein [Arabidopsis thaliana]
Length = 359
Score = 432 bits (1112), Expect = e-118, Method: Compositional matrix adjust.
Identities = 219/330 (66%), Positives = 259/330 (78%), Gaps = 18/330 (5%)
Query: 8 NNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
+ND ND Q + + ++ R LPP + RL+SLDVFRGLTV
Sbjct: 13 DNDDNDH---QWREKKDIESALQISRSSSLPP------------DKERLVSLDVFRGLTV 57
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI 127
A MILVDDVGGILP+INHSPW+G+TLADFVMPFFLFIVGVSLA YKN C+ VATRKA+
Sbjct: 58 AFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRKAL 117
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
+R+L L LLG+FLQGGF HG+NNL YG+D+ +IR MG+LQRIAIAYLV ALCEIWLKG+
Sbjct: 118 IRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCEIWLKGNH 177
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF---NVT 244
+VSS+LS+ +KYR HWVVA V+TT+YL LLYGLYVPDW+Y+ E S+ F V
Sbjct: 178 NVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFLNLKVK 237
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
CGVRG TGP CNAVGM+DR LGIQHLYRKP+Y+RTKQCSIN P+ GP+P DAPSWCQAP
Sbjct: 238 CGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSWCQAP 297
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
FDPEGLLSS+MATVTCL+GLH+GH+I+HFK
Sbjct: 298 FDPEGLLSSLMATVTCLVGLHYGHIIIHFK 327
>gi|255548527|ref|XP_002515320.1| conserved hypothetical protein [Ricinus communis]
gi|223545800|gb|EEF47304.1| conserved hypothetical protein [Ricinus communis]
Length = 460
Score = 432 bits (1110), Expect = e-118, Method: Compositional matrix adjust.
Identities = 209/375 (55%), Positives = 275/375 (73%), Gaps = 16/375 (4%)
Query: 28 NEKLERDPLLPPSNSNSKQTR---PQHQQRRLISLDVFRGLTVALMILVDDVGGILPAIN 84
N++ E++ + PS+S+S + P +RL+SLDVFRGLT+ALMILVDD GG P+IN
Sbjct: 21 NDEDEKEEEIAPSSSSSDEREALPPPTPNQRLMSLDVFRGLTIALMILVDDAGGAFPSIN 80
Query: 85 HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGF 144
HSPW G+TLADFVMPFFLF VGVS++L +K K VAT+K +LR + LFLLG+ LQGG+
Sbjct: 81 HSPWFGVTLADFVMPFFLFGVGVSISLVFKKISSKSVATKKVMLRTIKLFLLGVLLQGGY 140
Query: 145 FHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWV 204
FHG N+L YG+D+ +IRW+GVLQRI+I YL A++ EIWL V S L+ +KY W+
Sbjct: 141 FHGRNHLTYGIDVLKIRWLGVLQRISIGYLFASISEIWLVNHCIVDSPLAFMKKYYAQWM 200
Query: 205 VALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF-------NVTCGVRGSTGPACNA 257
V+L+L +LY LLY L+VP+W++E +SS +F V CGVRGS P CNA
Sbjct: 201 VSLILCSLYTCLLYFLFVPNWEFE------ASSINLFGYGSGTQTVICGVRGSLEPPCNA 254
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
VG+IDR +LG HLY++P+Y RTKQCS+NSPDYGP+P ++P WC APFDPEG+LSS+MA
Sbjct: 255 VGLIDRFLLGEHHLYQRPVYRRTKQCSVNSPDYGPLPPNSPPWCLAPFDPEGILSSLMAA 314
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
VTCL+GL FGH++VH KDH R+L W+I S L+ G L +G+ +K LY+LSYTC+T
Sbjct: 315 VTCLLGLQFGHVLVHLKDHMQRILVWLISSFSLLVTGFVLKLIGIPFSKPLYTLSYTCIT 374
Query: 378 AGASGVLLAGIYFMV 392
GASG+LL I++ V
Sbjct: 375 TGASGLLLTIIFYAV 389
>gi|238481501|ref|NP_001154765.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008206|gb|AED95589.1| uncharacterized protein [Arabidopsis thaliana]
Length = 435
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 219/330 (66%), Positives = 259/330 (78%), Gaps = 18/330 (5%)
Query: 8 NNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
+ND ND Q + + ++ R LPP + RL+SLDVFRGLTV
Sbjct: 7 DNDDNDH---QWREKKDIESALQISRSSSLPP------------DKERLVSLDVFRGLTV 51
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI 127
A MILVDDVGGILP+INHSPW+G+TLADFVMPFFLFIVGVSLA YKN C+ VATRKA+
Sbjct: 52 AFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRKAL 111
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
+R+L L LLG+FLQGGF HG+NNL YG+D+ +IR MG+LQRIAIAYLV ALCEIWLKG+
Sbjct: 112 IRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCEIWLKGNH 171
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF---NVT 244
+VSS+LS+ +KYR HWVVA V+TT+YL LLYGLYVPDW+Y+ E S+ F V
Sbjct: 172 NVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFLNLKVK 231
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
CGVRG TGP CNAVGM+DR LGIQHLYRKP+Y+RTKQCSIN P+ GP+P DAPSWCQAP
Sbjct: 232 CGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSWCQAP 291
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
FDPEGLLSS+MATVTCL+GLH+GH+I+HFK
Sbjct: 292 FDPEGLLSSLMATVTCLVGLHYGHIIIHFK 321
>gi|357149263|ref|XP_003575052.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Brachypodium distachyon]
Length = 432
Score = 429 bits (1104), Expect = e-117, Method: Compositional matrix adjust.
Identities = 223/392 (56%), Positives = 289/392 (73%), Gaps = 15/392 (3%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
MA Y + +D D + + A++ PL P + ++Q RL+SLD
Sbjct: 1 MAGYELVRSD--DGAAAATTPDLESGASKA---SPLPTPVSPAARQ--------RLVSLD 47
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
VFRG+TV LMI+VDD GG LPA+NHSPW+G+T+ DFVMPFFLFIVGVSL L YK P ++
Sbjct: 48 VFRGITVLLMIIVDDAGGFLPALNHSPWDGVTIGDFVMPFFLFIVGVSLTLAYKRVPERL 107
Query: 121 VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
AT+KA+LRAL LF LG+ LQGGFFHG+ +L +GVDI +IR MG+LQRIAIAYL+AA+CE
Sbjct: 108 EATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDITEIRLMGILQRIAIAYLIAAICE 167
Query: 181 IWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
IWLKG+ V L L R+YR V L+L+ +Y +LLYG+YVPDW+Y+ S+ +
Sbjct: 168 IWLKGNDEVDRGLDLLRRYRYQLFVGLLLSVMYTVLLYGIYVPDWEYQITGPGSTEKSLL 227
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
V CGVRG TGP CNAVGM+DR +LGI HLYR+P+Y+RTK+CSI+ P+ GP+P DAPSW
Sbjct: 228 --VKCGVRGDTGPGCNAVGMVDRTMLGIDHLYRRPVYARTKECSIDYPENGPLPPDAPSW 285
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV 360
CQAPFDPEGLLSSVMA VTCL+GL FGH+I+HF+ H++R++NW+I S ++ L +DF+
Sbjct: 286 CQAPFDPEGLLSSVMAIVTCLMGLQFGHVIIHFEKHKERIINWLIPSFSMLALAFLMDFI 345
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
GM +NK LY++SYT TAGA+G AGIY +V
Sbjct: 346 GMRMNKPLYTISYTFATAGAAGFFFAGIYTLV 377
>gi|212723180|ref|NP_001132467.1| uncharacterized protein LOC100193923 [Zea mays]
gi|194694464|gb|ACF81316.1| unknown [Zea mays]
gi|414587418|tpg|DAA37989.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
Length = 391
Score = 429 bits (1103), Expect = e-117, Method: Compositional matrix adjust.
Identities = 203/325 (62%), Positives = 253/325 (77%), Gaps = 2/325 (0%)
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI 127
ALMI+VDD G +PA+NHSPW+G+T+ADFVMPFFLFIVGV+LAL YK P K+ AT+KA+
Sbjct: 14 ALMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKAV 73
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
LRAL LF LG+ LQGGFFHG+ +L +GVD+ +IR MG+LQRIAIAYL+ ALCEIWLKGD
Sbjct: 74 LRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCEIWLKGDE 133
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGV 247
V L ++YR V ++ Y+ LLYG YV DW+Y+ S F V CGV
Sbjct: 134 DVDYGYDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQ--TSGPGSIEKSFFVKCGV 191
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP 307
RG T P CNAVGMIDR+ILGIQHLY +P+Y+R+KQCSI+SP GP+P DAPSWCQAPFDP
Sbjct: 192 RGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAPFDP 251
Query: 308 EGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKA 367
EGLLSSVMA VTCLIGL +GH+IVHF+ HR+R++NW+I S ++ L ++DF+G+ +NK
Sbjct: 252 EGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSMLVLAFAMDFLGLRMNKP 311
Query: 368 LYSLSYTCLTAGASGVLLAGIYFMV 392
LY+LSYT TAGA+G+L GIY +V
Sbjct: 312 LYTLSYTLATAGAAGLLFCGIYTLV 336
>gi|195642128|gb|ACG40532.1| hypothetical protein [Zea mays]
Length = 379
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 203/324 (62%), Positives = 253/324 (78%), Gaps = 2/324 (0%)
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
LMI+VDD G +PA+NHSPW+G+T+ADFVMPFFLFIVGV+LAL YK P K+ AT+KA+L
Sbjct: 3 LMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKAVL 62
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
RAL LF LG+ LQGGFFHG+ +L +GVD+ +IR MG+LQRIAIAYL+ ALCEIWLKGD
Sbjct: 63 RALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCEIWLKGDED 122
Query: 189 VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVR 248
V L ++YR +V V+ Y+ LLYG YVPD +Y+ S F V CGVR
Sbjct: 123 VDYGYDLLKRYRYQLLVGAVVAITYMSLLYGTYVPDCEYQ--TSGPGSIEKSFFVKCGVR 180
Query: 249 GSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPE 308
G T P CNAVGMIDR+ILGIQHLY +P+Y+R+KQCSI+SP GP+P DAPSWCQAPFDPE
Sbjct: 181 GDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAPFDPE 240
Query: 309 GLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKAL 368
GLLSSVMA VTCLIGL +GH+IVHF+ HR+R++NW+I S ++ L ++DF+G+ +NK L
Sbjct: 241 GLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSFSMLVLAFAMDFLGLRMNKPL 300
Query: 369 YSLSYTCLTAGASGVLLAGIYFMV 392
Y+LSYT TAGA+G+L GIY +V
Sbjct: 301 YTLSYTLATAGAAGLLFCGIYTLV 324
>gi|242071239|ref|XP_002450896.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
gi|241936739|gb|EES09884.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
Length = 455
Score = 427 bits (1097), Expect = e-117, Method: Compositional matrix adjust.
Identities = 222/356 (62%), Positives = 270/356 (75%), Gaps = 8/356 (2%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
+ +RL SLDVFRG+TV LMILVDDVGG++PAI+HSPW+G+TLADFV PFFLFIVGVSLA
Sbjct: 60 RGQRLASLDVFRGITVVLMILVDDVGGLVPAISHSPWDGVTLADFVFPFFLFIVGVSLAF 119
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
YK P K +AT+KA++RA LFLLG+ LQGG+FH I++L YGVD+ +IR MG+LQRIAI
Sbjct: 120 AYKRVPNKTLATKKALIRASKLFLLGLLLQGGYFHTIHDLSYGVDLHKIRLMGILQRIAI 179
Query: 172 AYLVAALCEIWLKGDG--HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
AY ALCEIWL+G + + L R+YR V LVLT Y +LLYG+YVPDW+Y
Sbjct: 180 AYFAVALCEIWLRGGASDNGAGGYVLIRRYRHQLFVGLVLTVTYTVLLYGMYVPDWEYV- 238
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
V + ++ F V CGVRG TGP CNAVGMIDR +LGIQHLY P+Y +T QCSINSP
Sbjct: 239 -VTSPDTTLKNFMVKCGVRGDTGPGCNAVGMIDRCVLGIQHLYAHPVYLKTAQCSINSPR 297
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
GP+P DAP+WC+APFDPEGLLSS+MA VTCLIGL GH+IVHFK H R++ W I S
Sbjct: 298 NGPLPSDAPTWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKQHSKRIVRWSIPSLI 357
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
L+ LG+SLD GMH+NK+LYSLSYTC+T G++G+ AGIY +V KKPF
Sbjct: 358 LLILGVSLDLFGMHMNKSLYSLSYTCVTTGSAGLFFAGIYLLVDV----YFYKKPF 409
>gi|38346153|emb|CAE02025.2| OSJNBb0118P14.13 [Oryza sativa Japonica Group]
Length = 415
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 215/357 (60%), Positives = 262/357 (73%), Gaps = 12/357 (3%)
Query: 46 QTRPQHQQRRLISLDVFRGL----------TVALMILVDDVGGILPAINHSPWNGLTLAD 95
R Q Q ++ +DV G + LMILVDD G LPAINHSPW+G+TLAD
Sbjct: 6 DNRNQRQGIQVSVVDVAEGQWLTCVHLFMPEMPLMILVDDAGAFLPAINHSPWDGVTLAD 65
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV 155
FVMPFFLFIVGV+LAL YK P K+ ATRKAILRAL LF +G+ LQGGFFHG+ +L +G+
Sbjct: 66 FVMPFFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGI 125
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLL 215
D+ +IR MG+LQRIAIAY+V ALCEIWLKGD V S L ++ R + L++ Y+
Sbjct: 126 DMEKIRLMGILQRIAIAYIVTALCEIWLKGDDDVDSGFDLLKRNRYQLFIGLIVMITYMG 185
Query: 216 LLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
LYG YVPDW+Y V S+ F V C VRG TGP CNAVGMIDRKILGIQHLY +P
Sbjct: 186 FLYGTYVPDWEYRISVPGSTEKS--FFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRP 243
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
+Y+R+KQCSINSP GP+ DAPSWCQAPFDPEGLLSSVMA VTCLIGL +GH+IVHF+
Sbjct: 244 VYARSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQK 303
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
H++R++ W+I S ++ L SLDF GMH+NK LY++SY TAGA+G+L AGIY +V
Sbjct: 304 HKERIMKWLIPSFSMLILAFSLDFFGMHMNKPLYTVSYALATAGAAGLLFAGIYALV 360
>gi|32487909|emb|CAE05368.1| OJ000315_02.13 [Oryza sativa Japonica Group]
Length = 452
Score = 412 bits (1059), Expect = e-112, Method: Compositional matrix adjust.
Identities = 215/381 (56%), Positives = 262/381 (68%), Gaps = 36/381 (9%)
Query: 46 QTRPQHQQRRLISLDVFRGL----------TVALMILVDDVGGILPAINHSPWNGLTLAD 95
R Q Q ++ +DV G + LMILVDD G LPAINHSPW+G+TLAD
Sbjct: 6 DNRNQRQGIQVSVVDVAEGQWLTCVHLFMPEMPLMILVDDAGAFLPAINHSPWDGVTLAD 65
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV 155
FVMPFFLFIVGV+LAL YK P K+ ATRKAILRAL LF +G+ LQGGFFHG+ +L +G+
Sbjct: 66 FVMPFFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGI 125
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLL 215
D+ +IR MG+LQRIAIAY+V ALCEIWLKGD V S L ++ R + L++ Y+
Sbjct: 126 DMEKIRLMGILQRIAIAYIVTALCEIWLKGDDDVDSGFDLLKRNRYQLFIGLIVMITYMG 185
Query: 216 LLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
LYG YVPDW+Y V S+ F V C VRG TGP CNAVGMIDRKILGIQHLY +P
Sbjct: 186 FLYGTYVPDWEYRISVPGSTEKS--FFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRP 243
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLL------------------------ 311
+Y+R+KQCSINSP GP+ DAPSWCQAPFDPEGLL
Sbjct: 244 VYARSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLRLQQYNISFANFAKFSLFFLDSRI 303
Query: 312 SSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSL 371
SSVMA VTCLIGL +GH+IVHF+ H++R++ W+I S ++ L SLDF GMH+NK LY++
Sbjct: 304 SSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSMLILAFSLDFFGMHMNKPLYTV 363
Query: 372 SYTCLTAGASGVLLAGIYFMV 392
SY TAGA+G+L AGIY +V
Sbjct: 364 SYALATAGAAGLLFAGIYALV 384
>gi|326512130|dbj|BAJ96046.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 409 bits (1052), Expect = e-111, Method: Compositional matrix adjust.
Identities = 199/379 (52%), Positives = 267/379 (70%), Gaps = 15/379 (3%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTL 93
PL P +++K Q + +R+ SLDVFRGLTVA+MILVDD GG P INH+PW G+T+
Sbjct: 34 SPLPQPPGADAKPG--QQKPQRVASLDVFRGLTVAMMILVDDAGGAWPGINHAPWLGVTV 91
Query: 94 ADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKY 153
ADFVMP FLFI+GVS AL +K P K+ ++KA RA+ LF+LG+ LQGG+ HG + L Y
Sbjct: 92 ADFVMPAFLFIIGVSAALVFKKTPNKIATSKKAACRAIKLFILGVILQGGYIHGRHKLTY 151
Query: 154 GVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLY 213
GVD+ QIRW+GVLQRIAI Y +AA+ EIWL + V S +S +KY W++A++++ LY
Sbjct: 152 GVDLDQIRWLGVLQRIAIGYFLAAISEIWLVNNTSVDSPVSFVKKYFMEWIMAIIISALY 211
Query: 214 LLLLYGLYVPDWQYEFPVETSSSS-------PWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
+ L++GLYVP+W EF V+TSSS+ + CG+ GS GP CNAVG +DR +L
Sbjct: 212 IGLVFGLYVPNW--EFKVQTSSSTFSNPSNDVGFKTIQCGLTGSLGPPCNAVGFVDRVLL 269
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G HLY+ P+Y RTK+CSINSPDYGP+P +AP WC APFDPEGLLS++MA V+C +GLHF
Sbjct: 270 GESHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAAVSCFVGLHF 329
Query: 327 GHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
GH+++H K H RM++W++ S+ L G L +GM +K LY++SY LT G SG +L
Sbjct: 330 GHVLIHCKTHSQRMMSWLLASTVLTVSGFLLQLLGMPFSKPLYTVSYMLLTGGVSGFVLL 389
Query: 387 GIYFMVRYISSHLMLKKPF 405
+Y +V I +KKP
Sbjct: 390 LLYCIVDVIH----IKKPL 404
>gi|357134575|ref|XP_003568892.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Brachypodium distachyon]
Length = 495
Score = 409 bits (1051), Expect = e-111, Method: Compositional matrix adjust.
Identities = 196/373 (52%), Positives = 260/373 (69%), Gaps = 13/373 (3%)
Query: 39 PSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVM 98
P +P+ + R+ SLDVFRGLTVA+MILVDD GG P INH+PW G+T+ADFVM
Sbjct: 34 PRQPPGTDAKPERKPHRVASLDVFRGLTVAMMILVDDAGGAWPGINHAPWLGVTVADFVM 93
Query: 99 PFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
P FLFI+GVS AL +K K+ ++KA RA LF+LG+ LQGG+ HG +NL YGVD+
Sbjct: 94 PAFLFIIGVSAALVFKRTQNKIATSKKAAYRAFKLFILGVILQGGYIHGRHNLTYGVDLD 153
Query: 159 QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLY 218
IRW+GVLQRIAI Y +AA+ EIWL + V S +S +KY WV+A++++ LY+ L++
Sbjct: 154 HIRWLGVLQRIAIGYFLAAMSEIWLVNNISVDSPVSFVKKYFMEWVMAIMISALYISLIF 213
Query: 219 GLYVPDWQYEFPVETS-------SSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
GLYVP+W EF V+TS S+ V CG+RGS GP CNAVG +DR +LG HL
Sbjct: 214 GLYVPNW--EFKVQTSNLTFSNGSNEIGFKTVQCGLRGSLGPPCNAVGFVDRVLLGENHL 271
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
Y+ P+Y RTK+CS+NSPDYG +P +AP WC APFDPEGLLS++MA V+C +GLHFGH+++
Sbjct: 272 YKNPVYKRTKECSVNSPDYGALPPNAPDWCLAPFDPEGLLSTLMAAVSCFVGLHFGHVLI 331
Query: 332 HFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
H ++H RML+W++ S+ L G L GM +K LY++SY LT G SG LL +Y++
Sbjct: 332 HCQNHSQRMLSWLLASTVLTASGFLLQLSGMPFSKPLYTVSYMLLTGGVSGFLLLLLYYI 391
Query: 392 VRYISSHLMLKKP 404
V I +KKP
Sbjct: 392 VDVIQ----IKKP 400
>gi|212724122|ref|NP_001131867.1| uncharacterized protein LOC100193245 [Zea mays]
gi|194692766|gb|ACF80467.1| unknown [Zea mays]
gi|413948803|gb|AFW81452.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
gi|413948804|gb|AFW81453.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
Length = 492
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 197/375 (52%), Positives = 262/375 (69%), Gaps = 13/375 (3%)
Query: 38 PPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFV 97
PP ++ + Q Q + R+ SLDVFRG TVA+MILVDD GG P INH+PW G+T+ADFV
Sbjct: 40 PPLDAAATQLEEQRKPERVASLDVFRGFTVAMMILVDDAGGAWPGINHAPWFGVTVADFV 99
Query: 98 MPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
MP FLFI+GVS AL +K K AT+KA +RA LF+LG+ LQGG+ HG + L YGVD+
Sbjct: 100 MPAFLFIIGVSAALVFKKMANKTAATKKAAIRASKLFILGVILQGGYIHGRHKLTYGVDL 159
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
IRW+GVLQRIAI Y VAA+ EIWL + V S + +KY W +A+ +T LY+ L+
Sbjct: 160 DHIRWLGVLQRIAIGYFVAAMSEIWLVNNNLVDSPVPFVKKYFIEWFMAIAITVLYVALV 219
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFN-------VTCGVRGSTGPACNAVGMIDRKILGIQH 270
+GLYV +W++E ++TS+S+ I + + CGVRGS GP CNAVG++DR +LG H
Sbjct: 220 FGLYVANWEFE--IQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAVGLVDRVLLGENH 277
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
LY+ P+Y RTK+CSINSPDYGP+P +AP WC APFDPEGLLS++MA VTC +GL FGH++
Sbjct: 278 LYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAVVTCFVGLFFGHVL 337
Query: 331 VHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
+H K+H RML W++ S L + +GM +K LY+++Y LT G SG LL +Y+
Sbjct: 338 IHCKNHSQRMLIWLLASVVLTISAYLVLLLGMPFSKPLYTVNYMLLTGGVSGFLLLLLYY 397
Query: 391 MVRYISSHLMLKKPF 405
+V I +KKPF
Sbjct: 398 IVDVIH----IKKPF 408
>gi|115462187|ref|NP_001054693.1| Os05g0155700 [Oryza sativa Japonica Group]
gi|54291854|gb|AAV32222.1| unknown protein [Oryza sativa Japonica Group]
gi|113578244|dbj|BAF16607.1| Os05g0155700 [Oryza sativa Japonica Group]
gi|215694847|dbj|BAG90038.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218196128|gb|EEC78555.1| hypothetical protein OsI_18526 [Oryza sativa Indica Group]
gi|222630256|gb|EEE62388.1| hypothetical protein OsJ_17178 [Oryza sativa Japonica Group]
Length = 491
Score = 405 bits (1041), Expect = e-110, Method: Compositional matrix adjust.
Identities = 217/415 (52%), Positives = 281/415 (67%), Gaps = 31/415 (7%)
Query: 13 DQLVIQISKSTSAPANEKLERDPLLPPSNSNSK-------QTRPQH------QQRRLISL 59
D V+ I+ A A R PLL ++ + + PQH + RR+ SL
Sbjct: 2 DAAVVAIAGDGDADAGH---RRPLLASADDDDEIRPYPASSPSPQHPAGAERKPRRVASL 58
Query: 60 DVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCK 119
DVFRGLTVA+MILVDD GG P +NHSPW G+T+ADFVMP FLFI+GVS AL +K P K
Sbjct: 59 DVFRGLTVAMMILVDDAGGAWPGMNHSPWLGVTVADFVMPAFLFIIGVSAALVFKKTPNK 118
Query: 120 VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
VAT+KA +RA+ LF+LG+ LQGG+ HG +NL YG+D+ IRW+GVLQRIAI Y +AA+
Sbjct: 119 TVATKKAAIRAIKLFILGVILQGGYIHGRHNLTYGIDLDHIRWLGVLQRIAIGYFLAAIS 178
Query: 180 EIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
EIWL + V S +S +KY W+VA++++ LY+ LL GLYV +W EF V+TS+S
Sbjct: 179 EIWLVNNISVDSAISFVKKYFMEWIVAVMISALYVGLLLGLYVSNW--EFKVQTSNSILT 236
Query: 240 IFN---------VTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
I + CGVRGS GP CNAVG +DR +LG HLY+ P+Y RTK+CS+NSPDY
Sbjct: 237 IPTPGNEIGMKMIQCGVRGSLGPPCNAVGFVDRVLLGENHLYKNPVYKRTKECSVNSPDY 296
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
GP+P +AP WC APFDPEGLLS++MA VTC +GLHFGH++VH KDH RML W++ S+ L
Sbjct: 297 GPLPPNAPDWCLAPFDPEGLLSTLMAAVTCFVGLHFGHVLVHCKDHSPRMLLWLLASTVL 356
Query: 351 IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
G L +GM +K LY++SY LT G SG LL +Y++V I+ +KKPF
Sbjct: 357 TVSGFLLQLLGMPFSKPLYTVSYMLLTGGVSGFLLLLLYYIVDVIN----IKKPF 407
>gi|195652797|gb|ACG45866.1| hypothetical protein [Zea mays]
Length = 492
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 196/375 (52%), Positives = 261/375 (69%), Gaps = 13/375 (3%)
Query: 38 PPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFV 97
PP ++ + Q Q + R+ SLDVFRG TVA+ ILVDD GG P INH+PW G+T+ADFV
Sbjct: 40 PPLDAAATQLEEQRKPERVASLDVFRGFTVAMXILVDDAGGAWPGINHAPWFGVTVADFV 99
Query: 98 MPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
MP FLFI+GVS AL +K K AT+KA +RA LF+LG+ LQGG+ HG + L YGVD+
Sbjct: 100 MPAFLFIIGVSAALVFKKMANKTAATKKAAIRASKLFILGVILQGGYIHGRHKLTYGVDL 159
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
IRW+GVLQRIAI Y VAA+ EIWL + V S + +KY W +A+ +T LY+ L+
Sbjct: 160 DHIRWLGVLQRIAIGYFVAAMSEIWLVNNNLVDSPVPFVKKYFIEWFMAIAITVLYVALV 219
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFN-------VTCGVRGSTGPACNAVGMIDRKILGIQH 270
+GLYV +W++E ++TS+S+ I + + CGVRGS GP CNAVG++DR +LG H
Sbjct: 220 FGLYVANWEFE--IQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAVGLVDRVLLGENH 277
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
LY+ P+Y RTK+CSINSPDYGP+P +AP WC APFDPEGLLS++MA VTC +GL FGH++
Sbjct: 278 LYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAVVTCFVGLFFGHVL 337
Query: 331 VHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
+H K+H RML W++ S L + +GM +K LY+++Y LT G SG LL +Y+
Sbjct: 338 IHCKNHSQRMLIWLLASVVLTISAYLVLLLGMPFSKPLYTVNYMLLTGGVSGFLLLLLYY 397
Query: 391 MVRYISSHLMLKKPF 405
+V I +KKPF
Sbjct: 398 IVDVIH----IKKPF 408
>gi|326510109|dbj|BAJ87271.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 304
Score = 403 bits (1036), Expect = e-110, Method: Compositional matrix adjust.
Identities = 187/293 (63%), Positives = 232/293 (79%), Gaps = 2/293 (0%)
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
LMI+VDD G LPA+NHSPW G+T+ADFVMPFFLFIVGV+LAL YK P K+ ATRKA L
Sbjct: 8 LMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAYKRVPDKLDATRKATL 67
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
RAL LF +G+ LQGGFFHG+ +L +GVDIAQIR MG+LQRIAIAYLV ALC+IWLKGD
Sbjct: 68 RALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAYLVTALCQIWLKGDDD 127
Query: 189 VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVR 248
V S L L ++Y+ + L++T Y+ LLYG YVPDW+Y + + F V CGVR
Sbjct: 128 VDSGLDLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYR--ISGPGFTEKTFTVRCGVR 185
Query: 249 GSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPE 308
G +GP CNAVGMIDRKILGIQHLY +P+Y+R++QCSI+SP GP+P DAPSWCQAPFDPE
Sbjct: 186 GDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQNGPLPPDAPSWCQAPFDPE 245
Query: 309 GLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG 361
GLLSSVMA VTCLIGL +GH+IVHF+ H++R+++W++ S ++ L ++DF G
Sbjct: 246 GLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGMLVLAFAMDFFG 298
>gi|224131042|ref|XP_002320987.1| predicted protein [Populus trichocarpa]
gi|222861760|gb|EEE99302.1| predicted protein [Populus trichocarpa]
Length = 481
Score = 399 bits (1025), Expect = e-108, Method: Compositional matrix adjust.
Identities = 209/387 (54%), Positives = 267/387 (68%), Gaps = 20/387 (5%)
Query: 23 TSAPANEKLERDPLL----------------PPSNSNSKQTRPQHQQRRLISLDVFRGLT 66
T+ +E+ +R+PLL PS S+S + P Q RL+SLDVFRGLT
Sbjct: 5 TTTELDER-QREPLLHNPRSLSNEEEEEITNTPSTSSSNASPPPTQ--RLLSLDVFRGLT 61
Query: 67 VALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKA 126
VALMILVDD GG P INHSPW G+TLADFVMPFFLF+VGVS++L +K K +AT+K
Sbjct: 62 VALMILVDDAGGAFPCINHSPWFGVTLADFVMPFFLFVVGVSISLVFKKVSSKPMATKKV 121
Query: 127 ILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGD 186
I R + LFLLG+ LQGG+FHG +NL YGVD+ +IRWMGVLQRI+I YL AA+ EIWL
Sbjct: 122 IQRTIKLFLLGLLLQGGYFHGRHNLTYGVDVGKIRWMGVLQRISIGYLFAAMSEIWLVDS 181
Query: 187 GHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFN-VTC 245
V S ++ +KY W+VA + T Y+ LLYGLYVPDW++E P + V C
Sbjct: 182 ITVDSPMAFVKKYYIQWMVAFLFCTFYMCLLYGLYVPDWEFEVPSTNLFEHEFGTKIVNC 241
Query: 246 GVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPF 305
GVRGS P CNAVG+IDR LG HLY+ P+Y RTK CS+NSPDYGP+P ++P WC APF
Sbjct: 242 GVRGSLEPPCNAVGLIDRFFLGEHHLYQHPVYRRTKHCSVNSPDYGPLPPNSPGWCLAPF 301
Query: 306 DPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLN 365
DPEG+LSS+MA +TC +GL FGH++VHFK H R+ W + S ++ G + +G+ L
Sbjct: 302 DPEGILSSLMAAITCFLGLQFGHILVHFKGHMQRLCLWSVCSFIILITGYVFELLGVPLC 361
Query: 366 KALYSLSYTCLTAGASGVLLAGIYFMV 392
K LY+LSY C+TAGASG+ L I+++V
Sbjct: 362 KPLYTLSYMCITAGASGLALTIIFYIV 388
>gi|302796996|ref|XP_002980259.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
gi|300151875|gb|EFJ18519.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
Length = 401
Score = 395 bits (1016), Expect = e-107, Method: Compositional matrix adjust.
Identities = 189/325 (58%), Positives = 237/325 (72%), Gaps = 8/325 (2%)
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
+MILVD+ GG PAINHSPWNG+TLAD VMPFFLFIVGV+LAL YK P K+ +TRKAIL
Sbjct: 1 MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL-KGDG 187
R+L LF LG+FLQGG+FHG N+L YGVD+ IRW G+LQRIA YLV ALCE+WL + G
Sbjct: 61 RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYLVVALCEVWLPRVQG 120
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGV 247
S + Y HW+ +V T+YL LLYGL VPDWQ+E P + + VTCG
Sbjct: 121 ---SYFGFMQNYLFHWIFVVVTLTVYLSLLYGLKVPDWQFELPNNRNIT----MTVTCGT 173
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP 307
R + P CNAVG +DR+ILG+ HL ++P++ RT+ CSINSPDYGP+P DAP WC APFDP
Sbjct: 174 RSNLDPPCNAVGYVDRQILGVNHLDQRPVFIRTESCSINSPDYGPLPADAPVWCHAPFDP 233
Query: 308 EGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKA 367
EG+LSSV A VTC IGLH+GH IV K+H+ R++N+I+ + L+ LG L +G+ +NK
Sbjct: 234 EGILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAFILLALGYVLHLLGIKMNKP 293
Query: 368 LYSLSYTCLTAGASGVLLAGIYFMV 392
LYS SY C TAGA+G + +Y +V
Sbjct: 294 LYSFSYMCFTAGAAGAVFCLLYILV 318
>gi|302759310|ref|XP_002963078.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
gi|300169939|gb|EFJ36541.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
Length = 401
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 187/325 (57%), Positives = 238/325 (73%), Gaps = 8/325 (2%)
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
+MILVD+ GG PAINHSPWNG+TLAD VMPFFLFIVGV+LAL YK P K+ +TRKAIL
Sbjct: 1 MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL-KGDG 187
R+L LF LG+FLQGG+FHG N+L YGVD+ IRW G+LQRIA Y++ ALCE+WL + G
Sbjct: 61 RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYVIVALCEVWLPRVQG 120
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGV 247
S + + Y HW+ +V T+YL LLYGL VP WQ+E P + + VTCG
Sbjct: 121 ---SYFGIMQNYLFHWIFVVVTLTVYLSLLYGLKVPHWQFELPNNRNIT----MTVTCGT 173
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP 307
R + PACNAVG +DR+ILG+ HL ++P++ RT+ CSINSPDYGP+P DAP WC APFDP
Sbjct: 174 RSNLDPACNAVGYVDRQILGVNHLDQQPVFIRTESCSINSPDYGPLPADAPVWCHAPFDP 233
Query: 308 EGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKA 367
EG+LSSV A VTC IGLH+GH IV K+H+ R++N+I+ + L+ LG L +G+ +NK
Sbjct: 234 EGILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAVILLALGYVLHLLGIKMNKP 293
Query: 368 LYSLSYTCLTAGASGVLLAGIYFMV 392
LYS SY C TAGA+G + +Y +V
Sbjct: 294 LYSFSYMCFTAGAAGAVFCLLYILV 318
>gi|168035930|ref|XP_001770461.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678169|gb|EDQ64630.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 487
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 194/376 (51%), Positives = 261/376 (69%), Gaps = 12/376 (3%)
Query: 29 EKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPW 88
E + PL+ + + RL SLDVFRGL++A+MILVD+ GG+ P+INHSPW
Sbjct: 19 EPVLLQPLVEGGGGSGYAATAAEKSPRLASLDVFRGLSIAVMILVDNAGGVWPSINHSPW 78
Query: 89 NGLTLADFVMPFFLFIVGVSLALTYKNFP-CKVVATRKAILRALNLFLLGIFLQGGFFHG 147
G+TLADFVMPFFLFIVGV+LALTYK K VA++KA+ R L ++G+ +QGG+FHG
Sbjct: 79 TGITLADFVMPFFLFIVGVALALTYKRITRDKKVASQKALGRTAKLLIVGLVIQGGYFHG 138
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL--KGDGHVSSKLSLFRKYRGHWVV 205
+++ YGVD+ +IRW GVLQRIA+AY+V ALCEIW + + ++F+ Y HW V
Sbjct: 139 LHDTSYGVDLERIRWCGVLQRIALAYMVVALCEIWAPRRRQDVSNDNFAIFKTYHFHWAV 198
Query: 206 ALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF---------NVTCGVRGSTGPACN 256
A + YL LLYG+YVPDW + P +S++ + V CGVRG+ GPACN
Sbjct: 199 AAAIVATYLALLYGVYVPDWDFIPPTVLNSTALHVSVVRVNGSMSEVHCGVRGNIGPACN 258
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
AVG +DR ILG+ HLY++P++ RT CS+NSPDYGP+P AP WC+APFDPEGLLSS+ A
Sbjct: 259 AVGYLDRTILGVSHLYQRPVFRRTPACSVNSPDYGPLPSGAPDWCKAPFDPEGLLSSLSA 318
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCL 376
+C +GLHFGH++VH K+H R+ +W+I+S L+ +GL L +G+ NK LYS+SY
Sbjct: 319 VGSCFLGLHFGHVLVHRKEHIARLWDWMIMSLVLLIVGLLLHLLGVPFNKPLYSVSYMLF 378
Query: 377 TAGASGVLLAGIYFMV 392
T GA+GV+ AG Y +V
Sbjct: 379 TGGAAGVVFAGFYLLV 394
>gi|357442361|ref|XP_003591458.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
gi|355480506|gb|AES61709.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
Length = 476
Score = 383 bits (984), Expect = e-104, Method: Compositional matrix adjust.
Identities = 195/346 (56%), Positives = 259/346 (74%), Gaps = 10/346 (2%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112
+RL+SLDVFRGLTVALMILVDDVG P++NHSPW G+TLADFVMPFFLF VGVS+AL
Sbjct: 42 NQRLVSLDVFRGLTVALMILVDDVGRAFPSLNHSPWFGVTLADFVMPFFLFGVGVSIALV 101
Query: 113 YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIA 172
+K K AT+K I R + LFLLG+ LQGG+FHG NL YG+D+ ++RW GVLQRI+I
Sbjct: 102 FKKVSSKQNATKKIISRTIKLFLLGLLLQGGYFHGRGNLTYGLDLTKLRWFGVLQRISIG 161
Query: 173 YLVAALCEIWL-KGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYE--- 228
Y +A++ EIWL G+ V S + RKY W+ +++L ++YL LLYGLYVP+W++E
Sbjct: 162 YFLASMSEIWLVNGNILVDSPAAFVRKYSIQWIFSILLCSVYLCLLYGLYVPNWEFEHSN 221
Query: 229 --FPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
+P S+ I NV C +RGS P CNAVG IDR ILG H+Y++P+Y RTK+CS+N
Sbjct: 222 LLWPGRVST----IQNVHCDMRGSLDPPCNAVGFIDRLILGEDHMYQRPVYRRTKECSVN 277
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
SPDYGP+P D+P WC APFDPEG+LSS+MA +TC +GL FGH++V F+ H+ R+L W +
Sbjct: 278 SPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFVGLQFGHILVIFQAHKQRVLLWSVF 337
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
S L+ +G L+ +G+ L+KALY+LS+ +TAGASG++L IY++V
Sbjct: 338 SFSLLVVGYVLEILGIPLSKALYTLSFMFITAGASGLVLTAIYYIV 383
>gi|224080634|ref|XP_002306188.1| predicted protein [Populus trichocarpa]
gi|222849152|gb|EEE86699.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 381 bits (979), Expect = e-103, Method: Compositional matrix adjust.
Identities = 197/366 (53%), Positives = 249/366 (68%), Gaps = 11/366 (3%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTL 93
D L P +Q R + R+ SLDVFRGL V LM+LVD G I+P I HSPWNGL L
Sbjct: 3 DLLHKPLLDIEEQPRTSKKTPRVASLDVFRGLCVFLMMLVDYGGAIVPIIAHSPWNGLHL 62
Query: 94 ADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKY 153
ADFVMPFFLFI GVSLAL YK P ++ ATRKA+LRA+ LFLLG+ LQGG+FHGIN L Y
Sbjct: 63 ADFVMPFFLFIAGVSLALVYKRVPNRIEATRKAVLRAVELFLLGVILQGGYFHGINFLTY 122
Query: 154 GVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLY 213
GVD+ +IRW+G+LQRI+I Y+ AALCEIWL +S + Y HW A L+ +Y
Sbjct: 123 GVDMKRIRWLGILQRISIGYIFAALCEIWLSCRSR--RDVSFLKSYYWHWGAAFSLSAIY 180
Query: 214 LLLLYGLYVPDWQYEFPVETSSSSP-------WIFNVTCGVRGSTGPACNAVGMIDRKIL 266
L LLYGLYVPDWQ+E TSS P + V C VRG GPACN+ GMIDR +L
Sbjct: 181 LGLLYGLYVPDWQFEMSNATSSVFPTNHSYVYMLTQVKCSVRGDLGPACNSAGMIDRYVL 240
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
GI HLY+KP+Y K+C++++ G +P APSWC APFDPEG+LSS+ A V C+IGL +
Sbjct: 241 GIDHLYKKPVYRNLKECNMSTN--GQVPESAPSWCHAPFDPEGVLSSITAAVACIIGLQY 298
Query: 327 GHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
GH + H +DH+ RM NWI+ S L+ +GL L VG +NK+LY+ Y +T ++G+ +
Sbjct: 299 GHSLAHLQDHKQRMQNWILFSLSLLLVGLLLAVVGDPVNKSLYTFGYMLITCASAGITYS 358
Query: 387 GIYFMV 392
IY +V
Sbjct: 359 AIYLLV 364
>gi|356534906|ref|XP_003535992.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 489
Score = 380 bits (976), Expect = e-103, Method: Compositional matrix adjust.
Identities = 203/361 (56%), Positives = 260/361 (72%), Gaps = 6/361 (1%)
Query: 37 LPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADF 96
LP SN + QR L SLDVFRGLTVALMILVD+VG P++NHSPW G+TLADF
Sbjct: 37 LPQSNPTDTSSLSLPNQR-LSSLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGVTLADF 95
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
VMPFFLF+VGVS+ L +K K AT+K I R L LFLLG+ LQGG+FHG L YGVD
Sbjct: 96 VMPFFLFVVGVSIGLVFKKVSSKPNATKKVISRTLKLFLLGLLLQGGYFHGHGKLTYGVD 155
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDG-HVSSKLSLFRKYRGHWVVALVLTTLYLL 215
+++IRW+GVLQRI+I Y A++ EIWL V S RKY W+ +++L ++YL
Sbjct: 156 LSKIRWLGVLQRISIGYFFASISEIWLVNHNILVDSPAGFVRKYSIQWMFSILLCSVYLC 215
Query: 216 LLYGLYVPDWQYEFPVETSSSSPW----IFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
LLYGLYVP+W+++ SSS I NV C VRGS P CN VG IDR ILG H+
Sbjct: 216 LLYGLYVPNWKFKHSNLLSSSDSSHLSIIQNVHCEVRGSLEPPCNVVGFIDRLILGEDHM 275
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
Y++P+Y RTK+CS+NSPDYGP+P D+P WC APFDPEG+LSS+MA +TC +GL +GH+IV
Sbjct: 276 YQRPVYIRTKECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQYGHIIV 335
Query: 332 HFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
H + H+ R+L W + S L+ +G L+ +GM L+KALY+LSYTC+TAGASG++L IY++
Sbjct: 336 HLQGHKQRVLLWSVFSFSLLLIGYILEILGMPLSKALYTLSYTCITAGASGLVLTAIYYI 395
Query: 392 V 392
V
Sbjct: 396 V 396
>gi|224103167|ref|XP_002312951.1| predicted protein [Populus trichocarpa]
gi|222849359|gb|EEE86906.1| predicted protein [Populus trichocarpa]
Length = 419
Score = 380 bits (975), Expect = e-103, Method: Compositional matrix adjust.
Identities = 190/363 (52%), Positives = 252/363 (69%), Gaps = 8/363 (2%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTL 93
D L P +Q + R SLDVFRGL V LM+LVD G I+P I HSPWNGL L
Sbjct: 3 DLLHKPLLDIEEQLHTSKKPPRAASLDVFRGLCVFLMMLVDYGGAIIPIIAHSPWNGLHL 62
Query: 94 ADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKY 153
AD VMPFFLFI GVSLAL YK P ++ AT KA+L+A+ LFLLG+ +QGG+FHGIN+L Y
Sbjct: 63 ADSVMPFFLFIAGVSLALVYKKVPNRIEATWKAVLKAIKLFLLGVVIQGGYFHGINSLTY 122
Query: 154 GVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLY 213
GVD+ +IRW+G+LQ+I++ Y+VAALCEIWL +S + Y HW VA L+ +Y
Sbjct: 123 GVDMKRIRWLGILQKISVGYIVAALCEIWL--SCRTRRGVSFLKSYYWHWCVAFSLSAIY 180
Query: 214 LLLLYGLYVPDWQYEFPVETSSSSPW----IFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
L LLYGLYVPDWQ+E TSS P ++ V C +RG GPACN+ GMIDR ILGI
Sbjct: 181 LGLLYGLYVPDWQFEMSNATSSVFPTNHSNVYMVKCSLRGDLGPACNSAGMIDRYILGID 240
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
HLY+KP+Y K+C++++ G +P ++ SWC APFDPEG+LSS+ A VTC+IGL +GHL
Sbjct: 241 HLYKKPVYRNLKECNMSTD--GQVPDNSASWCHAPFDPEGVLSSLTAAVTCIIGLQYGHL 298
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
+ H +DH+ RM NW + S L+ +GL L +G +NK+LY+ SY +T+ ++G+ + +Y
Sbjct: 299 LAHLQDHKGRMENWTLFSFSLLVVGLLLVVIGDPVNKSLYTFSYMLITSASAGITYSALY 358
Query: 390 FMV 392
+V
Sbjct: 359 LLV 361
>gi|356516509|ref|XP_003526936.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 416
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 190/347 (54%), Positives = 242/347 (69%), Gaps = 8/347 (2%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
Q Q R+ SLDVFRGL+V LMI VD I P I H+PWNG+ LADFVMPFFLFI G+SL
Sbjct: 16 QFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGIHLADFVMPFFLFIAGISL 75
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
AL YK P + AT KA RALNLF LGI LQGG+FHG+ +L +GVDI +IRW+G+LQRI
Sbjct: 76 ALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQRIRWLGILQRI 135
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
+I Y+VAALCEIWL +L + Y W VA++L LY LLYGLYVPDWQ++
Sbjct: 136 SIGYIVAALCEIWLPAPRW--KELGFVKSYYWQWFVAVILLALYSGLLYGLYVPDWQFDV 193
Query: 230 PVETSSSSPW----IFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
TSS P I+ V C VRG GPACN+ GMIDR ILG+ HLYRKP+Y K C++
Sbjct: 194 SASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHLYRKPVYRNLKGCNM 253
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
++ G + +PSWC APFDPEG+LSS+ A V+C+IGL +GH++ H +DH+ R+ NW+
Sbjct: 254 SAK--GQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLAHLQDHKGRLYNWMC 311
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
S + LGL L +G+ LNK+LY++SY LT+ ASG+ +YF+V
Sbjct: 312 FSLSFLALGLFLALIGIPLNKSLYTVSYMLLTSAASGLTFIALYFLV 358
>gi|238481505|ref|NP_001154767.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008208|gb|AED95591.1| uncharacterized protein [Arabidopsis thaliana]
Length = 340
Score = 374 bits (959), Expect = e-101, Method: Compositional matrix adjust.
Identities = 191/285 (67%), Positives = 234/285 (82%), Gaps = 3/285 (1%)
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+++ P + VATRKA++R+L L LLG+FLQGGF HG+NNL YG+D+ +IR MG+LQRIA
Sbjct: 1 MSFAVLPSQFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIA 60
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
IAYLV ALCEIWLKG+ +VSS+LS+ +KYR HWVVA V+TT+YL LLYGLYVPDW+Y+
Sbjct: 61 IAYLVVALCEIWLKGNHNVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 120
Query: 231 VETSSSSPWIF---NVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
E S+ F V CGVRG TGP CNAVGM+DR LGIQHLYRKP+Y+RTKQCSIN
Sbjct: 121 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 180
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
P+ GP+P DAPSWCQAPFDPEGLLSS+MATVTCL+GLH+GH+I+HFKDH+ R+ WI+ S
Sbjct: 181 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRS 240
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
CL+ LGL+L+ GMHLNK LY+LSY C+T+GASG LL+ IY MV
Sbjct: 241 FCLLMLGLALNLFGMHLNKPLYTLSYMCVTSGASGFLLSAIYLMV 285
>gi|242059773|ref|XP_002459032.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
gi|241931007|gb|EES04152.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
Length = 481
Score = 369 bits (948), Expect = 1e-99, Method: Compositional matrix adjust.
Identities = 181/378 (47%), Positives = 245/378 (64%), Gaps = 7/378 (1%)
Query: 26 PANEKLERDPLL---PPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPA 82
P + E DP P ++ + + RR+ SLDVFRGLTVALMILVD GG P
Sbjct: 13 PHGDVAEEDPARRGGPGTDEADDNEKAPRRSRRVASLDVFRGLTVALMILVDGAGGEWPV 72
Query: 83 INHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQG 142
I H+PWNG LADFVMPFFLFIVG+++ L+ K P + A R+ ++R L L GI LQG
Sbjct: 73 IGHAPWNGCNLADFVMPFFLFIVGMAIPLSLKRIPDRGRAVRRVVIRTLKLLFWGILLQG 132
Query: 143 GFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG---DGHVSSKLSLFRKY 199
+ H + L YGVD+ +RW G+LQRIA+AYLV A+ EI K SS S+FR Y
Sbjct: 133 RYSHAPDELTYGVDMKHVRWGGILQRIALAYLVVAVLEIVTKDAKIQDQSSSGFSIFRMY 192
Query: 200 RGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVG 259
W+VA + +YL L+YG+YVPDW++ S + + VTCG RG P CNAVG
Sbjct: 193 LSQWIVACCILVIYLALVYGIYVPDWEFRVRNVDSPNYGKVLTVTCGTRGILDPPCNAVG 252
Query: 260 MIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVT 319
IDRK+LGI H+Y+KP + R + C+ +SP G DAP+WC APF+PEG+LSS+ A ++
Sbjct: 253 YIDRKVLGINHMYQKPAWRRHRACTDDSPHEGHFRNDAPAWCVAPFEPEGILSSLSAVLS 312
Query: 320 CLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYTCLTA 378
+IG+H+GH++VH K H DR+ W+ + CL+ LG+ L F + LNK LY+ SY C+TA
Sbjct: 313 TIIGVHYGHVLVHMKSHTDRLRQWVTMGICLLVLGIILHFSHAIPLNKQLYTFSYICVTA 372
Query: 379 GASGVLLAGIYFMVRYIS 396
GA+GV+ + +YF+V +S
Sbjct: 373 GAAGVVFSVLYFLVDIVS 390
>gi|413918232|gb|AFW58164.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
Length = 423
Score = 369 bits (947), Expect = 2e-99, Method: Compositional matrix adjust.
Identities = 190/392 (48%), Positives = 247/392 (63%), Gaps = 49/392 (12%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
M +Y + +D + + T+AP + +R P + + +++QR L+SLD
Sbjct: 26 MGRYELVRSDDAPAVAADLEAGTAAPCDGYPKRRVFTPSPSPSPAPASTRYRQR-LVSLD 84
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
VFRG+TV P K+
Sbjct: 85 VFRGITVL----------------------------------------------RVPDKL 98
Query: 121 VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
A+RKA+LRAL LF LG+ LQGGFFHG+ +L +GVD+ +IR MGVLQRIAIAYL+ ALCE
Sbjct: 99 DASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAYLLTALCE 158
Query: 181 IWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
IW++GD V L ++YR V V+ Y+ LLYG YVPDW+Y+ S+
Sbjct: 159 IWIRGDEDVDYGYDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTSAPGSTEKHLF 218
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
V CGVRG T P CNAVGMIDRKILGIQHLY +P+Y+R+KQCSI+SP GP+P DAPSW
Sbjct: 219 --VKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNGPLPSDAPSW 276
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV 360
CQAPFDPEGLLSSVMA VTCLIGL +GH+IVHF+ HR+RM+NW+I S ++ L ++DF
Sbjct: 277 CQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSMLVLAFAMDFF 336
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+H+NK LY+LSYT TAGA+G+L +GIY +V
Sbjct: 337 GLHMNKPLYTLSYTLGTAGAAGLLFSGIYTLV 368
>gi|449458622|ref|XP_004147046.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
gi|449489633|ref|XP_004158370.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 418
Score = 365 bits (936), Expect = 3e-98, Method: Compositional matrix adjust.
Identities = 176/355 (49%), Positives = 243/355 (68%), Gaps = 9/355 (2%)
Query: 43 NSKQTRPQHQQR--RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPF 100
++Q P + R++SLDVFRGL+V +M+LVD G LP I+HSPW GL LADFVMP+
Sbjct: 9 KNQQELPASSGKAPRVVSLDVFRGLSVFMMMLVDYGGSFLPIISHSPWIGLHLADFVMPW 68
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
FLFI GVS+AL YK KV A R A R L LFLLG+FLQGG+FHGI +L YGVD+ I
Sbjct: 69 FLFIAGVSVALVYKEVESKVAAARNAACRGLYLFLLGVFLQGGYFHGITSLTYGVDLESI 128
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
RW+G+LQRI+I YL+AALCEIWL ++ + + + HW + L +LY+ L YGL
Sbjct: 129 RWLGILQRISIGYLIAALCEIWLTRCTREEAQHT--KSFSWHWCIIFFLLSLYMGLSYGL 186
Query: 221 YVPDWQYEF---PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
YVPDW ++ S +++ V C +RG GPACN+ GMIDR +LGI HLY KP+Y
Sbjct: 187 YVPDWDFKISAPSSSLPLSGSYVYKVNCSLRGDLGPACNSAGMIDRYVLGIHHLYTKPVY 246
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
K+C+I+S G P +PSWC+APF+PEGLLSS+ ATV C+IGL +GH++ +DH+
Sbjct: 247 RNLKECNISSS--GQFPETSPSWCRAPFEPEGLLSSLTATVACIIGLQYGHILARAQDHK 304
Query: 338 DRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
R W +LS ++ G+ L F+G+ +NK+LY++SY +T+ ++G++ +Y +V
Sbjct: 305 TRTNGWFLLSFKILAFGIFLVFIGIPVNKSLYTVSYMLITSASAGIIFCALYILV 359
>gi|168007055|ref|XP_001756224.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692734|gb|EDQ79090.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 411
Score = 364 bits (934), Expect = 5e-98, Method: Compositional matrix adjust.
Identities = 180/327 (55%), Positives = 237/327 (72%), Gaps = 10/327 (3%)
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF-PCKVVATRKAIL 128
MILVD GGI PAINHSPW+G+TLADFV+PFFLFIVGV+LALTYK K +A++KAI
Sbjct: 1 MILVDYAGGIWPAINHSPWDGVTLADFVLPFFLFIVGVALALTYKKIINEKQLASQKAIG 60
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
R+L L ++G+F+QGG+FHG++N YGVD+ IRW GVLQRIA+AY+V ALCEIW GH
Sbjct: 61 RSLKLVIVGLFIQGGYFHGVHNTSYGVDLESIRWCGVLQRIALAYMVVALCEIWAP-RGH 119
Query: 189 VSSK---LSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTC 245
S + R++ VA + +YL+LLYG+YVPDW++ S++ +F V C
Sbjct: 120 YDSMNVYIKSTRRFGTFRAVAAAIVAIYLVLLYGVYVPDWEF-----VSAADSTVFQVKC 174
Query: 246 GVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPF 305
GVRG GP+CN VG +DR +LG+ HLY+K +Y R CS+ SPDYGP+P AP WC+APF
Sbjct: 175 GVRGDVGPSCNVVGYLDRTLLGLSHLYQKAVYRRAPACSVLSPDYGPLPAGAPVWCKAPF 234
Query: 306 DPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLN 365
DPEGLLSS+ A V+C +GLHFGH++VH K+H R+ +W+++S L+ G L +GM N
Sbjct: 235 DPEGLLSSMSAIVSCFLGLHFGHVLVHHKEHNARLKDWVLMSLTLLVTGALLHVLGMPWN 294
Query: 366 KALYSLSYTCLTAGASGVLLAGIYFMV 392
K LYS+SY T GA+G++ AG YF+V
Sbjct: 295 KPLYSVSYMLFTGGAAGLVFAGYYFLV 321
>gi|224069583|ref|XP_002326379.1| predicted protein [Populus trichocarpa]
gi|222833572|gb|EEE72049.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 363 bits (933), Expect = 6e-98, Method: Compositional matrix adjust.
Identities = 173/390 (44%), Positives = 247/390 (63%), Gaps = 9/390 (2%)
Query: 9 NDANDQLVIQISKSTSAPANEKLERDPLLP--PSNSNSKQTRPQHQQRRLISLDVFRGLT 66
+D N L + K+ NEK ER + +Q + + +R+ +LD FRGLT
Sbjct: 15 DDENIHLSEKEGKTDGGDDNEKEERRAVHDHLAEREGDRQPVVKQKSKRVATLDAFRGLT 74
Query: 67 VALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKA 126
+ LMILVDD GG+ P I+HSPWNG TLADFVMPFFLFIVGV++AL +K P + A +K
Sbjct: 75 IVLMILVDDAGGVYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALAFKRIPKRRDAVKKI 134
Query: 127 ILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL-QRIAIAYLVAALCEIWLKG 185
ILR L L G+ LQGG+ H ++L YGVD+ IRW G+L QRIA+ Y+V AL E +
Sbjct: 135 ILRTLKLLFWGVLLQGGYSHAPSDLAYGVDMKLIRWFGILQQRIALVYMVVALIEALIPK 194
Query: 186 DGHV--SSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNV 243
+ ++F YR W+ + +Y++ + LYVPDW + + + V
Sbjct: 195 NRQTIEPDHFTIFTAYRWQWIAGFISFVIYMVTTFALYVPDWSFTVDEDHERRR---YTV 251
Query: 244 TCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQA 303
CG+RG GPACNAVG +DR++ GI HLY+ P++SR K C+++SP GP DAPSWC+A
Sbjct: 252 ECGMRGHLGPACNAVGYVDREVWGINHLYQYPVWSRLKACTLSSPGSGPFRKDAPSWCRA 311
Query: 304 PFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GM 362
PF+PEGLLSS+ A ++ IG+H+GH+++HFK H +R+ W+ + L+ + + L F +
Sbjct: 312 PFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHAERLRQWVSMGVILLIVAIILHFTDAI 371
Query: 363 HLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+NK LYS SY C TAGA+G++ +G Y ++
Sbjct: 372 PINKQLYSFSYVCFTAGAAGIVFSGFYVLI 401
>gi|302754694|ref|XP_002960771.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
gi|300171710|gb|EFJ38310.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
Length = 493
Score = 359 bits (921), Expect = 2e-96, Method: Compositional matrix adjust.
Identities = 184/378 (48%), Positives = 252/378 (66%), Gaps = 19/378 (5%)
Query: 28 NEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSP 87
+ ++++ LL N + P + R+ +LDVFRGLTVALM+LVDD GG P INHSP
Sbjct: 27 DRHVKKESLL---NEDQAVAVPLKKPVRIATLDVFRGLTVALMVLVDDAGGEWPRINHSP 83
Query: 88 WNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHG 147
WNG TLAD VMPFFLFIVGV++AL K P +V AT+K ++R L L G+ LQGGF H
Sbjct: 84 WNGCTLADLVMPFFLFIVGVAIALALKRIPDQVAATQKVVIRTLKLLFWGLLLQGGFSHA 143
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVS---SKLSLFRKYRGHWV 204
++L YGVD+ +IRW G+LQRIA YL+ AL EI + + +F+ Y+ HW
Sbjct: 144 PDDLSYGVDMRKIRWCGILQRIAFGYLIVALVEIATTKSRSLELPKGQFGIFKLYKWHWA 203
Query: 205 VALVLTTLYLLLLYGLYVPDWQY-----EFPVETSSSSPWIF----NVTCGVRGSTGPAC 255
AL + +Y + YGLYVPDW + F V S + ++F NV CGVRG GPAC
Sbjct: 204 CALAVVIIYHSVAYGLYVPDWHFIDSGHRFVV---SLAKFVFSSQINVQCGVRGDIGPAC 260
Query: 256 NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVM 315
NAVG IDR ILGI HLY+ P ++RT+ C ++SP G P +AP+WC+APF+PEG+LSS+
Sbjct: 261 NAVGHIDRTILGINHLYQSPEWTRTQSCDLDSPAEGDPPANAPAWCKAPFEPEGILSSIS 320
Query: 316 ATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYT 374
A ++C+IG+H+GH+++HFK H R+L+W I ++ L+ L L F + LNK LYS SY
Sbjct: 321 AILSCIIGIHYGHVLIHFKGHMKRVLHWTIPAAALLVLATILHFTHAIPLNKQLYSFSYV 380
Query: 375 CLTAGASGVLLAGIYFMV 392
C TAGA+G++ + +Y ++
Sbjct: 381 CFTAGAAGMIFSLLYVVI 398
>gi|326493552|dbj|BAJ85237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511587|dbj|BAJ91938.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 486
Score = 358 bits (919), Expect = 3e-96, Method: Compositional matrix adjust.
Identities = 175/340 (51%), Positives = 232/340 (68%), Gaps = 5/340 (1%)
Query: 58 SLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFP 117
SLDVFRGLTVALMILVD GG P I H+PW+G LADFVMPFFLFIVG+++ L+ K P
Sbjct: 52 SLDVFRGLTVALMILVDGAGGEWPVIGHAPWHGCNLADFVMPFFLFIVGMAIPLSLKRIP 111
Query: 118 CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAA 177
+ A R+ ++R L L GI LQGG+ H + L YGVD+ IRW G+LQRIA+AYLV A
Sbjct: 112 DRGWAVRRVVIRTLKLLFWGILLQGGYSHAPDELTYGVDMKHIRWCGILQRIALAYLVVA 171
Query: 178 LCEIWLKG----DGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVET 233
+ EI K D S S+FR Y W+VA + +YL L+YG+YVPDW++
Sbjct: 172 VIEIATKDARVQDQSSSGFFSVFRLYLSQWIVACCILLIYLSLVYGVYVPDWEFTVRNVD 231
Query: 234 SSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
S + + VTCG RG+ P CNAVG IDRK+LGI HLY+KP + R + C+ +SP GP
Sbjct: 232 SPNYGKVLTVTCGTRGNLSPPCNAVGYIDRKVLGINHLYQKPAWRRHRDCTDDSPHEGPF 291
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
DAP+WC +PF+PEGLLSS A ++ +IG+H+GH++VH K H DR+ W+ + L+ L
Sbjct: 292 KRDAPAWCASPFEPEGLLSSFSAVLSTIIGVHYGHVLVHMKSHMDRLKQWVTMGVALLLL 351
Query: 354 GLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+ L F + LNK LY+LSY C+TAGA+G++ + +YF+V
Sbjct: 352 GIILHFSHAIPLNKQLYTLSYICVTAGAAGIIFSMLYFLV 391
>gi|302804288|ref|XP_002983896.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
gi|300148248|gb|EFJ14908.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
Length = 493
Score = 357 bits (917), Expect = 4e-96, Method: Compositional matrix adjust.
Identities = 184/378 (48%), Positives = 251/378 (66%), Gaps = 19/378 (5%)
Query: 28 NEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSP 87
+ ++++ LL N + P + R+ +LDVFRGLTVALM+LVDD GG P INHSP
Sbjct: 27 DHHVKKESLL---NGDQAVAVPLKKPVRIATLDVFRGLTVALMVLVDDAGGEWPRINHSP 83
Query: 88 WNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHG 147
WNG TLAD VMPFFLFIVGV++AL K P +V AT+K ++R L L G+ LQGGF H
Sbjct: 84 WNGCTLADLVMPFFLFIVGVAIALALKRIPDQVAATQKVVIRTLKLLFWGLLLQGGFSHA 143
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVS---SKLSLFRKYRGHWV 204
++L YGVD+ +IRW G+LQRIA YL+ AL EI + +F+ Y+ HW
Sbjct: 144 PDDLSYGVDMRKIRWCGILQRIAFGYLIVALVEIATTKSRSLELPKGHFGIFKLYKWHWA 203
Query: 205 VALVLTTLYLLLLYGLYVPDWQY-----EFPVETSSSSPWIF----NVTCGVRGSTGPAC 255
AL + +Y + YGLYVPDW + F V S + ++F NV CGVRG GPAC
Sbjct: 204 CALAVVIIYHSVAYGLYVPDWHFIDSGHRFVV---SLAKFVFSSQINVQCGVRGDIGPAC 260
Query: 256 NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVM 315
NAVG IDR ILGI HLY+ P ++RT+ C ++SP G P +AP+WC+APF+PEG+LSS+
Sbjct: 261 NAVGHIDRTILGINHLYQSPEWTRTQSCDLDSPAEGDPPANAPAWCKAPFEPEGILSSIS 320
Query: 316 ATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYT 374
A ++C+IG+H+GH+++HFK H R+L+W I ++ L+ L L F + LNK LYS SY
Sbjct: 321 AILSCIIGIHYGHVLIHFKGHMKRVLHWTIPAAALLVLATILHFTHAIPLNKQLYSFSYV 380
Query: 375 CLTAGASGVLLAGIYFMV 392
C TAGA+G++ + +Y ++
Sbjct: 381 CFTAGAAGMIFSLLYVVI 398
>gi|357467537|ref|XP_003604053.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
gi|355493101|gb|AES74304.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
Length = 421
Score = 357 bits (917), Expect = 5e-96, Method: Compositional matrix adjust.
Identities = 177/349 (50%), Positives = 242/349 (69%), Gaps = 10/349 (2%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ Q R+ S+DVFRGL+V LMI VD G I P I+H+PWNGL LADFVMPFFLF+VG+SL
Sbjct: 14 ETQFPRVASVDVFRGLSVFLMIFVDYGGSIFPIISHAPWNGLHLADFVMPFFLFLVGISL 73
Query: 110 ALTYKNFPCKVV--ATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
AL YKN + +T K +LR+ LF+LGI LQGG+FHGI++ YGVD+ IR+ GVLQ
Sbjct: 74 ALVYKNKRSRPTQSSTWKPLLRSFQLFILGILLQGGYFHGIHSFTYGVDVQTIRFFGVLQ 133
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RI+I Y+VAALC+I L S S F+ Y HW VA +L ++ LLYGL+VPDWQ+
Sbjct: 134 RISIGYIVAALCQICLPTLP--SKHTSFFKTYYSHWFVAAILLAIHSGLLYGLHVPDWQF 191
Query: 228 EFPVETSSSSPW----IFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
+ + TSS P ++ V C VRG GPACN+ GMIDR ILG+ HLY+KP++ K+C
Sbjct: 192 DASLSTSSLPPIQAGNVYTVNCSVRGDLGPACNSAGMIDRYILGLDHLYKKPVFRNLKEC 251
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
+++S G + +PSWC APFDPEG+LSS+ A V+C+IGL +GH++ + +DH+ R+ W
Sbjct: 252 NMSST--GQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHILANLEDHKGRLNQW 309
Query: 344 IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ S + LG L +G+ LNK+LY++SY L++ ASG+ +Y +V
Sbjct: 310 LGFSVSFLALGWFLALIGIPLNKSLYTVSYMLLSSAASGLTFMALYILV 358
>gi|357126662|ref|XP_003565006.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Brachypodium distachyon]
Length = 485
Score = 357 bits (915), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 185/390 (47%), Positives = 251/390 (64%), Gaps = 16/390 (4%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRP-------QHQQRRLISLDVFRGLTVALMI 71
++K A A+E+ DP P + + + RR+ SLDVFRGLTVALMI
Sbjct: 9 VAKRRGAAADEE---DPDRRPGAEAAAADKDGDDDEKAARRSRRVASLDVFRGLTVALMI 65
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRAL 131
LVD GG P I H+PW+G LADFVMPFFLFIVG+++ L+ K P + A R+ ++R L
Sbjct: 66 LVDGAGGEWPVIGHAPWDGCNLADFVMPFFLFIVGMAIPLSLKRIPDRGRAVRRVVIRTL 125
Query: 132 NLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHV-- 189
L GI LQGG+ H + L YGVD+ IRW G+LQRIA AYLV A+ EI K D ++
Sbjct: 126 KLLFWGILLQGGYSHAPDELAYGVDMKHIRWCGILQRIAFAYLVVAVIEIATK-DANIQD 184
Query: 190 --SSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGV 247
SS S+FR Y W+VA + +YL L+YG+YVPDW++ S + + VTCG
Sbjct: 185 QSSSGFSIFRMYFSQWIVACCILLIYLSLVYGIYVPDWEFRVRNVDSPNYGKVLTVTCGT 244
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP 307
RG P CNAVG IDRK+LGI HLY+KP + R + C+ +SP GP DAP+WC +PF+P
Sbjct: 245 RGKLSPPCNAVGYIDRKVLGINHLYQKPAWRRHRACTDDSPHEGPFKSDAPAWCASPFEP 304
Query: 308 EGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNK 366
EGLLSS A ++ +IG+H+GH++VH K H DR+ W+ + L+ LG+ L F + LNK
Sbjct: 305 EGLLSSFSAVLSTIIGVHYGHVLVHMKSHMDRLKQWVTMGIALLLLGIILHFSHAIPLNK 364
Query: 367 ALYSLSYTCLTAGASGVLLAGIYFMVRYIS 396
LY+ SY C+TAGA+G++ + +YF+V +S
Sbjct: 365 QLYTFSYICVTAGAAGIVFSMLYFLVDMVS 394
>gi|255581844|ref|XP_002531722.1| conserved hypothetical protein [Ricinus communis]
gi|223528625|gb|EEF30642.1| conserved hypothetical protein [Ricinus communis]
Length = 397
Score = 354 bits (909), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 176/327 (53%), Positives = 231/327 (70%), Gaps = 7/327 (2%)
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILR 129
M+LVD G I P I HSPWNGL LADFVMPFFLFI GVSLAL YK ++ AT KA+LR
Sbjct: 1 MMLVDYGGSIFPIIAHSPWNGLHLADFVMPFFLFIAGVSLALVYKKVTKRIDATWKAMLR 60
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHV 189
A+ LF LG+FLQGG+FHGIN+L YGVDI +IRW G+LQRI+I Y+VAALCEIWL
Sbjct: 61 AVKLFFLGVFLQGGYFHGINSLTYGVDIERIRWFGILQRISIGYIVAALCEIWLSRRTQS 120
Query: 190 SSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW----IFNVTC 245
++ F+ Y HWVVA L+ +YL LLYGLYVPDWQ+E SS+ P ++ V C
Sbjct: 121 QREIGFFKNYYWHWVVAFSLSAVYLGLLYGLYVPDWQFEMSNAASSALPINGSNVYMVKC 180
Query: 246 GVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPF 305
VRG GPACN+ GMIDR +LG HLY KP++ K+C++ + G + +PSWC APF
Sbjct: 181 SVRGDLGPACNSAGMIDRYVLGFDHLYTKPVHRNLKECNMTN---GQVSESSPSWCHAPF 237
Query: 306 DPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLN 365
DPEGLLSS+ A +TC+IGL GH++ H ++H+ R+ +W + S+ L+ LG L F+G+ +N
Sbjct: 238 DPEGLLSSLTAAITCIIGLQCGHVLAHIQEHKGRIESWSLFSASLLLLGSVLAFIGIPVN 297
Query: 366 KALYSLSYTCLTAGASGVLLAGIYFMV 392
K+LY++SY +T+ SG+ +Y +V
Sbjct: 298 KSLYTISYMLITSALSGITFCALYLLV 324
>gi|359481929|ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Vitis vinifera]
gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera]
Length = 489
Score = 352 bits (904), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 168/384 (43%), Positives = 238/384 (61%), Gaps = 12/384 (3%)
Query: 18 QISKSTSAPANEKLERD-----PLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMIL 72
+ K + +K+E+D P +Q + + +R+ +LD FRGLT+ LMIL
Sbjct: 14 HVHKEDISEKADKIEKDESSATPAQSVEQKGEEQPLIKQKSKRVATLDAFRGLTIVLMIL 73
Query: 73 VDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALN 132
VDD GG I+HSPWNG TLADFVMPFFLFIVGV++AL K P +A +K LR L
Sbjct: 74 VDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALALKKIPRISLAVKKISLRTLK 133
Query: 133 LFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHV--- 189
L GI LQGG+ H ++L YGVD+ IRW G+LQRIA+ Y V AL E
Sbjct: 134 LLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQRIAVVYFVVALIETLTTKRRPTVID 193
Query: 190 SSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRG 249
S S+ Y+ W+ V +Y++ Y LYVPDW + + + + V CG+RG
Sbjct: 194 SGHFSILSAYKWQWIGGFVAFLIYMITTYALYVPDWSFVIDQDHEAKR---YTVKCGMRG 250
Query: 250 STGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEG 309
GPACNAVG +DR++ GI HLY +P+++R K C+++SP+ GP DAPSWC APF+PEG
Sbjct: 251 HLGPACNAVGYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPFREDAPSWCYAPFEPEG 310
Query: 310 LLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKAL 368
LLS++ A ++ IG+H+GH+++HFK H +R+ W+ + L+ + + L F + +NK L
Sbjct: 311 LLSTISAILSGTIGIHYGHVLIHFKGHAERLKQWVSMGIVLLIVAIILHFTDAIPINKQL 370
Query: 369 YSLSYTCLTAGASGVLLAGIYFMV 392
YS SY C TAGA+G++L+ Y ++
Sbjct: 371 YSFSYVCFTAGAAGIVLSAFYLVI 394
>gi|222619812|gb|EEE55944.1| hypothetical protein OsJ_04649 [Oryza sativa Japonica Group]
Length = 846
Score = 352 bits (903), Expect = 2e-94, Method: Compositional matrix adjust.
Identities = 179/350 (51%), Positives = 235/350 (67%), Gaps = 4/350 (1%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112
RR+ SLDVFRGLTVALMILVD GG P I H+PWNG LADFVMPFFLFIVG+++ L+
Sbjct: 408 SRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLS 467
Query: 113 YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIA 172
K P + A R+ +LR L L GI LQGG+ H ++L YGVD+ +RW G+LQRIA+A
Sbjct: 468 LKRIPDRGRAVRRVVLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCGILQRIALA 527
Query: 173 YLVAALCEIWLKG---DGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
YLV A+ EI K SS S+FR Y W+VA + +YL L+YG+YVPDW +
Sbjct: 528 YLVVAVLEIVTKNAKVQDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIYVPDWDFRV 587
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
+ + I VTCG RG P CNAVG IDRK+LGI H+Y +P + R K C+ +SP
Sbjct: 588 SDVKNPNFGKILTVTCGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHKDCTDDSPH 647
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
GP D+P+WC APF+PEGLLSS+ A ++ +IG+H+GH++VH K H DR+ W I+
Sbjct: 648 EGPFKTDSPAWCYAPFEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLKQWSIMGIT 707
Query: 350 LIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSH 398
L+ LGL+L F + LNK LY+ SY C+TAGA+G++ YF+V ++ H
Sbjct: 708 LLILGLTLHFSHAIPLNKQLYTFSYICVTAGAAGIVFCMFYFLVDILNLH 757
>gi|212723192|ref|NP_001131974.1| uncharacterized protein LOC100193372 [Zea mays]
gi|194693076|gb|ACF80622.1| unknown [Zea mays]
gi|413951397|gb|AFW84046.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
Length = 484
Score = 352 bits (902), Expect = 3e-94, Method: Compositional matrix adjust.
Identities = 179/400 (44%), Positives = 254/400 (63%), Gaps = 17/400 (4%)
Query: 7 INNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLT 66
+++ +++++V +++ D +++N K RP RR+ SLDVFRGLT
Sbjct: 1 MDDGSSNKVVEDVAEEDPGRRGGPWRTDE----ADANEKAPRPS---RRVASLDVFRGLT 53
Query: 67 VALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKA 126
VALMILVD GG P I H+PWNG LADFVMPFFLFIVG+++ L K P + A R+
Sbjct: 54 VALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAVPLALKRIPDRGRAVRRV 113
Query: 127 ILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGD 186
++R L L GI LQGG+ H + L YGVD+ +RW G+LQRIA+AYLV A+ E+ K
Sbjct: 114 VVRTLKLLFWGILLQGGYSHAPDELAYGVDMRHVRWGGILQRIALAYLVVAVLEMVTKDG 173
Query: 187 GHV--------SSKLS-LFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
V S + S +FR Y W+VA + +YL L YG+YVPDW++ S
Sbjct: 174 AKVHQDQPPGSSGRFSRVFRMYLSQWIVACCILVVYLSLAYGVYVPDWEFRVRNADSPDY 233
Query: 238 PWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDA 297
+ V CG RG+ P CNAVG IDR++LGI H+Y+KP + R + C+ +SP GP DA
Sbjct: 234 GKVLTVRCGTRGALDPPCNAVGYIDRRVLGINHMYQKPAWRRHRACTDDSPHEGPFREDA 293
Query: 298 PSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL 357
P+WC APF+PEG+LSS+ A ++ ++G+H+GH++VH K H DR+ W+ + L+ LG+ L
Sbjct: 294 PAWCVAPFEPEGILSSLSAVLSTVVGVHYGHVLVHMKSHTDRLRQWVTMGVALLVLGIIL 353
Query: 358 DFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYIS 396
F + LNK LY+ SY C+TAGA+GV+ + +YF+V +S
Sbjct: 354 HFSHAIPLNKQLYTFSYICVTAGAAGVVFSALYFLVDVVS 393
>gi|115442029|ref|NP_001045294.1| Os01g0931100 [Oryza sativa Japonica Group]
gi|57899654|dbj|BAD87323.1| unknown protein [Oryza sativa Japonica Group]
gi|57900117|dbj|BAD88179.1| unknown protein [Oryza sativa Japonica Group]
gi|113534825|dbj|BAF07208.1| Os01g0931100 [Oryza sativa Japonica Group]
gi|215697092|dbj|BAG91086.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 488
Score = 349 bits (896), Expect = 1e-93, Method: Compositional matrix adjust.
Identities = 181/358 (50%), Positives = 238/358 (66%), Gaps = 4/358 (1%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFI 104
K P+ RR+ SLDVFRGLTVALMILVD GG P I H+PWNG LADFVMPFFLFI
Sbjct: 42 KAPPPRRTSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFI 101
Query: 105 VGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
VG+++ L+ K P + A R+ +LR L L GI LQGG+ H ++L YGVD+ +RW G
Sbjct: 102 VGMAIPLSLKRIPDRGRAVRRVVLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCG 161
Query: 165 VLQRIAIAYLVAALCEIWLKG---DGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLY 221
+LQRIA+AYLV A+ EI K SS S+FR Y W+VA + +YL L+YG+Y
Sbjct: 162 ILQRIALAYLVVAVLEIVTKNAKVQDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIY 221
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
VPDW + + + I VTCG RG P CNAVG IDRK+LGI H+Y +P + R K
Sbjct: 222 VPDWDFRVSDVKNPNFGKILTVTCGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHK 281
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML 341
C+ +SP GP D+P+WC APF+PEGLLSS+ A ++ +IG+H+GH++VH K H DR+
Sbjct: 282 DCTDDSPHEGPFKTDSPAWCYAPFEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLK 341
Query: 342 NWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSH 398
W I+ L+ LGL+L F + LNK LY+ SY C+TAGA+G++ YF+V ++ H
Sbjct: 342 QWSIMGITLLILGLTLHFSHAIPLNKQLYTFSYICVTAGAAGIVFCMFYFLVDILNLH 399
>gi|356572978|ref|XP_003554642.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 464
Score = 348 bits (892), Expect = 3e-93, Method: Compositional matrix adjust.
Identities = 182/370 (49%), Positives = 246/370 (66%), Gaps = 10/370 (2%)
Query: 27 ANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHS 86
L LP S+ N +T+ R+ SLD+FRGLTVALMILVDD GG P I H+
Sbjct: 6 GEHSLNVSEELPLSDKNLPKTK------RVASLDIFRGLTVALMILVDDAGGQWPMIGHA 59
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFH 146
PWNG LADFVMPFFLFIVG+++ L K P +++A +K I+R L L G+ LQGGF H
Sbjct: 60 PWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLLLQGGFSH 119
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIW---LKGDGHVSSKLSLFRKYRGHW 203
+NL YGVD+ IRW G+LQRIA+AYLV AL EI+ + + LS+F+ Y HW
Sbjct: 120 APDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSAQARDPEPTHLSIFKLYYWHW 179
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
+V + +YL LLYG++VPDWQ+ S + VTCGVRG P CNAVG IDR
Sbjct: 180 LVGACILAVYLALLYGIHVPDWQFTVHNPDSIYNGTTLTVTCGVRGKLDPPCNAVGYIDR 239
Query: 264 KILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
+++GI H+Y++P + R++ C+ NSP GP +APSWC APF+PEG+LSS+ A ++ +IG
Sbjct: 240 EVIGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISAILSTIIG 299
Query: 324 LHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASG 382
LHFGH+++H +DH R+ +W++L L+ GL L F + LNK LY+LSY C+T+GA+
Sbjct: 300 LHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVCVTSGAAA 359
Query: 383 VLLAGIYFMV 392
+L + Y MV
Sbjct: 360 LLFSAFYIMV 369
>gi|224125166|ref|XP_002319516.1| predicted protein [Populus trichocarpa]
gi|222857892|gb|EEE95439.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 348 bits (892), Expect = 4e-93, Method: Compositional matrix adjust.
Identities = 168/342 (49%), Positives = 229/342 (66%), Gaps = 4/342 (1%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
R+ SLD++RGLTVALMILVDD GG P I H+PWNG LADFVMPFFLFIVG+++ L +K
Sbjct: 32 RVASLDIYRGLTVALMILVDDAGGEWPKIGHAPWNGCNLADFVMPFFLFIVGMAIPLAFK 91
Query: 115 NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYL 174
+ A R+ I+R L L GI LQGGF H + L YGVD+ +IRW G+LQRIA AYL
Sbjct: 92 RITSRHHAVRRVIVRTLKLLFWGIMLQGGFSHAPDKLTYGVDMKKIRWCGILQRIAFAYL 151
Query: 175 VAALCEIWLKGDGHVS---SKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
V AL EI+ K LS+++ Y W++ + +YL ++YG YVP WQ+
Sbjct: 152 VVALMEIFTKKKQTRELPPGWLSIYKLYSSQWLMGACILVIYLAVIYGTYVPHWQFTVND 211
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
S+ +F V C VRG P CNAVG IDR+ILGI H+Y+ P + R++ C+ NSP G
Sbjct: 212 RDSADYGKVFTVECAVRGKLDPPCNAVGFIDREILGINHMYQHPAWKRSEACTENSPYEG 271
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
P APSWC+APF+PEG+LSS+ A ++ +IG+HFGH++V+ + H R+ +WI++ L+
Sbjct: 272 PFRTSAPSWCKAPFEPEGILSSISAVLSTIIGVHFGHVLVYMRGHAARLKHWIVMGFALL 331
Query: 352 GLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
LGL L F + LNK LY+ SY C+T+GA+ ++ + IY +V
Sbjct: 332 ILGLVLHFTHAIPLNKQLYTFSYVCVTSGAAALVFSSIYALV 373
>gi|18421151|ref|NP_568500.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
thaliana]
gi|14334592|gb|AAK59474.1| unknown protein [Arabidopsis thaliana]
gi|26983902|gb|AAN86203.1| unknown protein [Arabidopsis thaliana]
gi|332006336|gb|AED93719.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
thaliana]
Length = 472
Score = 347 bits (889), Expect = 7e-93, Method: Compositional matrix adjust.
Identities = 185/377 (49%), Positives = 247/377 (65%), Gaps = 12/377 (3%)
Query: 27 ANEKLERDP----LLPPSNSNSKQTRPQ--HQQRRLISLDVFRGLTVALMILVDDVGGIL 80
A K+ER L P +++S TR + RL SLD+FRGLTVALMILVDD GG
Sbjct: 2 AEIKVERSHDQHLLEPKEDTSSSYTRRSLAGNRPRLASLDIFRGLTVALMILVDDAGGDW 61
Query: 81 PAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFL 140
P I H+PWNG LADFVMPFFLFIVGVS+AL+ K K A +K R L G+ L
Sbjct: 62 PMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLKRISNKFEACKKVGFRTCKLLFWGLLL 121
Query: 141 QGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH----VSSKLSLF 196
QGGF H + L YGVD+ +R+ G+LQRIA++YLV AL EI+ K D H + + S+F
Sbjct: 122 QGGFSHAPDELTYGVDVTMMRFCGILQRIALSYLVVALVEIFTK-DSHEENLSTGRFSIF 180
Query: 197 RKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN 256
+ Y HW+VA + +YL LYG YVPDW++ + S I +V+CGVRG P CN
Sbjct: 181 KSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCN 240
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
AVG +DR++LGI H+Y P + R+K C+ +SP G + DAPSWC+APF+PEG+LSS+ A
Sbjct: 241 AVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISA 300
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTC 375
++ +IG+HFGH+I+H K H R+ +WI L+ LGL+L F M LNK LYS SY C
Sbjct: 301 ILSTIIGVHFGHIILHLKGHSARLKHWISTGLVLLALGLTLHFTHLMPLNKQLYSFSYIC 360
Query: 376 LTAGASGVLLAGIYFMV 392
+T+GA+ ++ + +Y +V
Sbjct: 361 VTSGAAALVFSSLYSLV 377
>gi|359487632|ref|XP_003633626.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
N-acetyltransferase-like [Vitis vinifera]
Length = 499
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 178/379 (46%), Positives = 244/379 (64%), Gaps = 13/379 (3%)
Query: 20 SKSTSAPANEK--LERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVG 77
S AP N+ + D PP + RPQ + +RL SLD+FRGLTVALMILVDD G
Sbjct: 33 SIKDDAPDNQHRLIISDSGFPP------EERPQ-KTKRLASLDIFRGLTVALMILVDDAG 85
Query: 78 GILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLG 137
G P I H+PWNG LADFVMPFFLFIVGV++AL K P +++A +K LR L L G
Sbjct: 86 GEWPMIGHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRLMAIKKVTLRTLKLLFWG 145
Query: 138 IFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK---GDGHVSSKLS 194
+ LQG F + L YGVD+ +IRW G+LQ IA+AYLV AL EI K + S
Sbjct: 146 LLLQGSFTQDPDKLTYGVDMKKIRWCGILQXIALAYLVVALLEITTKKAQAKDLSPGQFS 205
Query: 195 LFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPA 254
+F+ Y HW++ + +Y+ + YG YVPDW + S+ + V CG RG P
Sbjct: 206 IFKLYCWHWLMGACVLIVYMAVSYGTYVPDWHFTVHDRDSADYGKVLTVACGARGKLDPP 265
Query: 255 CNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSV 314
CN VG IDR+ILG+ H+Y+ P ++R+K C+ SPD GP DAPSWC APF+PEG+LSS+
Sbjct: 266 CNVVGYIDREILGMNHMYQHPAWTRSKACNEYSPDKGPFRKDAPSWCYAPFEPEGILSSI 325
Query: 315 MATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSY 373
A ++ +IG+HFGH+++H K H DR+ +W+++ L+ LG++L F G + LNK LY+ SY
Sbjct: 326 SAILSTIIGVHFGHVLMHLKGHSDRLKHWVVMGFALLVLGITLHFTGAIPLNKQLYTFSY 385
Query: 374 TCLTAGASGVLLAGIYFMV 392
C+T+GA+ ++ + Y +V
Sbjct: 386 VCVTSGAAALVFSFFYILV 404
>gi|356503734|ref|XP_003520659.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 508
Score = 345 bits (886), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 174/402 (43%), Positives = 252/402 (62%), Gaps = 16/402 (3%)
Query: 5 NPINNDAN-DQLVIQIS-KSTSAPANEKLERDPLLPPSNSN------SKQTRP--QHQQR 54
+ +N D N D L + + K+++ + + ++D + P ++Q +P + + +
Sbjct: 14 SALNGDGNKDDLKKRATIKTSNGGSIFEHDKDTMAKPVAEGESVQQIAEQEQPPVKQKTK 73
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
R+ +LD FRGLT+ LMILVDD G P I+HSPWNG TLADFVMPFFLFIVGV++AL K
Sbjct: 74 RVATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALALK 133
Query: 115 NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYL 174
+ +K ILR L L GI LQGG+ H ++L+YGV++ IRW G+LQRIA+ Y
Sbjct: 134 RISKIKHSVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRWCGILQRIALVYC 193
Query: 175 VAALCEIW---LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
V AL E + L+ S LS+F Y+ W V +Y++ + LYVPDW F
Sbjct: 194 VVALIETFTTKLRPTTLASGHLSIFAAYKWQWFGGFVAFLIYMITTFSLYVPDWS--FVD 251
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
+ P + V CG+RG GPACNAVG +DR++ G+ HLY +P++ R K C+ +SP G
Sbjct: 252 HFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRRLKACTFSSPGSG 311
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
P DAPSWC APF+PEGLLSS+ A ++ IG+H+GH+++HFK H +R+ W+ + L+
Sbjct: 312 PFRDDAPSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSERLKQWVSMGFVLL 371
Query: 352 GLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ + L F + +NK LYS SY C TAGA+G++ +G Y ++
Sbjct: 372 IIAIILHFTDALPINKQLYSFSYVCFTAGAAGIVFSGFYILI 413
>gi|255556868|ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis]
gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis]
Length = 519
Score = 344 bits (882), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 175/394 (44%), Positives = 241/394 (61%), Gaps = 16/394 (4%)
Query: 16 VIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDD 75
VI + TS+ + E + L P KQ + +R+ +LD FRGLTV LMILVD+
Sbjct: 52 VIPEKELTSSTVLVEQEGEQLQQPEQLPVKQ-----KTKRVATLDAFRGLTVVLMILVDN 106
Query: 76 VGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFL 135
G I+HSPWNG TLADFVMPFFLFIVGV++AL K P K A +K LR L L
Sbjct: 107 AGESYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPRKRDAVKKISLRTLKLLF 166
Query: 136 LGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHV---SSK 192
GI LQGG+ H +L YGVD+ IRW G+LQRIA+ Y+ AL E + +
Sbjct: 167 WGILLQGGYSHAPVDLSYGVDMKLIRWCGILQRIALVYMFVALIETLTIKERQTVLQPNH 226
Query: 193 LSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG 252
S+F YR W+ + +Y++ Y LYVPDW + + P + V CG+RG G
Sbjct: 227 FSIFTAYRWQWIGGFIAFLIYMITTYALYVPDWSF---TAYDDNRPTRYTVKCGMRGHLG 283
Query: 253 PACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLS 312
PACNAVG +DR++ GI HLY+ P++SR K C+ +SP GP+ DAPSWC APF+PEGLLS
Sbjct: 284 PACNAVGYVDREVWGINHLYQYPVWSRLKACTFSSPATGPLRADAPSWCLAPFEPEGLLS 343
Query: 313 SVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSL 371
++ A ++ IG+H+GH+++HFK H +R+ W+ + L + + L F + +NK LYS
Sbjct: 344 TISAILSGTIGIHYGHVLIHFKGHSERLKQWVSMGLGLFLIAIILHFTDAIPINKQLYSF 403
Query: 372 SYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
SY C TAGA+G++ +G Y ++ + L+ PF
Sbjct: 404 SYVCFTAGAAGIVFSGFYILIDVLG----LRIPF 433
>gi|297812935|ref|XP_002874351.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
lyrata]
gi|297320188|gb|EFH50610.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 342 bits (878), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 170/352 (48%), Positives = 235/352 (66%), Gaps = 4/352 (1%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFI 104
+++ + ++RL SLD+FRGLTVALMILVDD GG P I H+PWNG LADFVMPFFLFI
Sbjct: 7 ERSLAGNNRQRLASLDIFRGLTVALMILVDDAGGDWPMIAHAPWNGCNLADFVMPFFLFI 66
Query: 105 VGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
VGVS+AL+ K K A +K R L G+ LQGGF H + L YGVD+ +R+ G
Sbjct: 67 VGVSIALSLKRISNKFEACKKVCFRTCKLLFWGLLLQGGFSHAPDELSYGVDVTMMRFCG 126
Query: 165 VLQRIAIAYLVAALCEIWLKG---DGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLY 221
+LQRIA++YLV AL EI+ K + + +LS+F+ Y HW+V + + +YL LYG Y
Sbjct: 127 ILQRIALSYLVVALIEIFTKDLHEENLSTGRLSIFKSYYCHWIVGVSVLVIYLATLYGTY 186
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
VPDW++ + S I +V+CGVRG P CNAVG +DR++L I H+Y P + R+K
Sbjct: 187 VPDWEFVVNDKDSILYGKIQSVSCGVRGKLNPPCNAVGYVDRQVLVINHMYHHPAWRRSK 246
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML 341
+ +SP G + DAPSWC APF+PEG+LSS+ A ++ +IG+HFGH+I+H + H R+
Sbjct: 247 AFTDDSPYEGALRQDAPSWCHAPFEPEGILSSISAILSTIIGVHFGHIIIHLQGHLARLK 306
Query: 342 NWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+WI + LGL+L F M LNK LYS SY C+T+GA+ ++ + +Y +V
Sbjct: 307 HWISTGLVFLTLGLTLHFTHLMPLNKQLYSFSYICVTSGAAALVFSSLYSLV 358
>gi|413951398|gb|AFW84047.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
Length = 503
Score = 340 bits (873), Expect = 5e-91, Method: Compositional matrix adjust.
Identities = 179/419 (42%), Positives = 254/419 (60%), Gaps = 36/419 (8%)
Query: 7 INNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLT 66
+++ +++++V +++ D +++N K RP RR+ SLDVFRGLT
Sbjct: 1 MDDGSSNKVVEDVAEEDPGRRGGPWRTDE----ADANEKAPRPS---RRVASLDVFRGLT 53
Query: 67 VALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK------------ 114
VALMILVD GG P I H+PWNG LADFVMPFFLFIVG+++ L K
Sbjct: 54 VALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAVPLALKVRRRRRSSRPSV 113
Query: 115 -------NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
P + A R+ ++R L L GI LQGG+ H + L YGVD+ +RW G+LQ
Sbjct: 114 VHAMHAHRIPDRGRAVRRVVVRTLKLLFWGILLQGGYSHAPDELAYGVDMRHVRWGGILQ 173
Query: 168 RIAIAYLVAALCEIWLKGDGHV--------SSKLS-LFRKYRGHWVVALVLTTLYLLLLY 218
RIA+AYLV A+ E+ K V S + S +FR Y W+VA + +YL L Y
Sbjct: 174 RIALAYLVVAVLEMVTKDGAKVHQDQPPGSSGRFSRVFRMYLSQWIVACCILVVYLSLAY 233
Query: 219 GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYS 278
G+YVPDW++ S + V CG RG+ P CNAVG IDR++LGI H+Y+KP +
Sbjct: 234 GVYVPDWEFRVRNADSPDYGKVLTVRCGTRGALDPPCNAVGYIDRRVLGINHMYQKPAWR 293
Query: 279 RTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
R + C+ +SP GP DAP+WC APF+PEG+LSS+ A ++ ++G+H+GH++VH K H D
Sbjct: 294 RHRACTDDSPHEGPFREDAPAWCVAPFEPEGILSSLSAVLSTVVGVHYGHVLVHMKSHTD 353
Query: 339 RMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYIS 396
R+ W+ + L+ LG+ L F + LNK LY+ SY C+TAGA+GV+ + +YF+V +S
Sbjct: 354 RLRQWVTMGVALLVLGIILHFSHAIPLNKQLYTFSYICVTAGAAGVVFSALYFLVDVVS 412
>gi|186530239|ref|NP_001119393.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008205|gb|AED95588.1| uncharacterized protein [Arabidopsis thaliana]
Length = 292
Score = 340 bits (871), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 176/274 (64%), Positives = 209/274 (76%), Gaps = 15/274 (5%)
Query: 8 NNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
+ND ND Q + + ++ R LPP + RL+SLDVFRGLTV
Sbjct: 13 DNDDNDH---QWREKKDIESALQISRSSSLPP------------DKERLVSLDVFRGLTV 57
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI 127
A MILVDDVGGILP+INHSPW+G+TLADFVMPFFLFIVGVSLA YKN C+ VATRKA+
Sbjct: 58 AFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRKAL 117
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
+R+L L LLG+FLQGGF HG+NNL YG+D+ +IR MG+LQRIAIAYLV ALCEIWLKG+
Sbjct: 118 IRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCEIWLKGNH 177
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGV 247
+VSS+LS+ +KYR HWVVA V+TT+YL LLYGLYVPDW+Y+ E S+ F V CGV
Sbjct: 178 NVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFLVKCGV 237
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
RG TGP CNAVGM+DR LGIQHLYRKP+Y+RTK
Sbjct: 238 RGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 271
>gi|356570776|ref|XP_003553560.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
N-acetyltransferase-like [Glycine max]
Length = 509
Score = 339 bits (869), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 173/395 (43%), Positives = 244/395 (61%), Gaps = 12/395 (3%)
Query: 8 NNDANDQLVIQISKSTSAPANEK-LERDPLLPPSNSNSK---QTRP--QHQQRRLISLDV 61
+D ++ I+ S S ++K P S S K Q +P + + +R+ +LD
Sbjct: 22 KDDLKRRVTIKTSNGGSVIEHDKGTMAKPYGAESESVQKIAEQEQPVVKQKTKRIATLDA 81
Query: 62 FRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVV 121
FRGLT+ LMILVDD G P I+HSPWNG TLADFVMPFFLFIVG+++AL K
Sbjct: 82 FRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFLFIVGIAIALALKRIAKIKH 141
Query: 122 ATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEI 181
A +K ILR L L GI LQGG+ H ++L+YGV++ IRW G+LQRIA+ Y V AL E
Sbjct: 142 AVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRWCGILQRIALVYCVVALIET 201
Query: 182 W---LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSP 238
+ L+ S LS+F Y+ W V +Y++ + LYVP W F + P
Sbjct: 202 FTTKLRPTTLASGHLSIFTAYKWQWFGGFVAFIIYMITTFTLYVPHWS--FLDHFNGDEP 259
Query: 239 WIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAP 298
+ V CG+RG GPACNAVG +DR++ G+ HLY +P++ R K SP GP DAP
Sbjct: 260 KRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRRLKMTIDYSPASGPFRDDAP 319
Query: 299 SWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLD 358
SWC++PF+PEGLLSS+ A ++ IG+H+GH+++HFK H +R+ W+++ L+ + + L
Sbjct: 320 SWCRSPFEPEGLLSSISAILSGTIGIHYGHILIHFKGHSERLKQWVLMGFVLLIIAIILH 379
Query: 359 FV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
F + +NK LYS SY C TAGA+G++ +G+Y +V
Sbjct: 380 FTDALPINKQLYSFSYVCFTAGAAGIVFSGLYILV 414
>gi|356504028|ref|XP_003520801.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 465
Score = 338 bits (868), Expect = 2e-90, Method: Compositional matrix adjust.
Identities = 174/345 (50%), Positives = 235/345 (68%), Gaps = 4/345 (1%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
+ +R+ SLD+FRGLTVALMILVDD G P I H+PWNG LADFVMPFFLFIVG+++ L
Sbjct: 26 KTKRVASLDIFRGLTVALMILVDDAGEQWPMIGHAPWNGCNLADFVMPFFLFIVGMAIPL 85
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
K P +++A +K I+R L L G+ LQGGF H +NL YGVD+ IRW G+LQRIA+
Sbjct: 86 ALKRIPNRLLAVKKVIVRTLKLLFWGLLLQGGFSHAPDNLTYGVDMKHIRWCGILQRIAL 145
Query: 172 AYLVAALCEIW---LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYE 228
AYLV AL EI+ + + LS+F Y HW+V + +YL LLYG++VPDW +
Sbjct: 146 AYLVVALVEIFSRSTQARDPEPTHLSIFNLYYWHWLVGACILVVYLALLYGIHVPDWGFT 205
Query: 229 FPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSP 288
S + VTCGVRG P CNAVG IDR++LGI H+Y++P + R++ C+ NSP
Sbjct: 206 VHNPDSIYNGTTLTVTCGVRGKLDPPCNAVGYIDREVLGINHMYKRPAWRRSEACTENSP 265
Query: 289 DYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSS 348
GP +APSWC APF+PEG+LSS+ A ++ +IGLHFGH+++H +DH R+ +W++L
Sbjct: 266 YEGPFKKNAPSWCYAPFEPEGILSSISAILSTIIGLHFGHVLIHLQDHPSRLKHWLLLGL 325
Query: 349 CLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L+ GL L F + LNK LY+LSY C+T+GA+ +L + Y V
Sbjct: 326 ALLTSGLILHFTHAIPLNKQLYTLSYVCVTSGAAALLFSAFYITV 370
>gi|147817637|emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera]
Length = 511
Score = 338 bits (866), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 167/406 (41%), Positives = 237/406 (58%), Gaps = 34/406 (8%)
Query: 18 QISKSTSAPANEKLERD-----PLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMIL 72
+ K + +K+E+D P +Q + + +R+ +LD FRGLT+ LMIL
Sbjct: 14 HVHKEDISEKADKIEKDESSATPAQSVEQKGEEQPLIKQKSKRVATLDAFRGLTIVLMIL 73
Query: 73 VDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALN 132
VDD GG I+HSPWNG TLADFVMPFFLFIVGV++AL K P +A +K LR L
Sbjct: 74 VDDAGGSYARIDHSPWNGCTLADFVMPFFLFIVGVAVALALKKIPRISLAVKKISLRTLK 133
Query: 133 LFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ----------------------RIA 170
L GI LQGG+ H ++L YGVD+ IRW G+LQ RIA
Sbjct: 134 LLFWGILLQGGYSHAPDDLSYGVDMKHIRWFGILQVFPLPLFTGKSIPSSSLSGFLQRIA 193
Query: 171 IAYLVAALCEIWLKGDGHV---SSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
+ Y V AL E S S+ Y+ W+ V +Y++ Y LYVPDW +
Sbjct: 194 VVYFVVALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYALYVPDWSF 253
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
+ + + V CG+RG GPACNAVG +DR++ GI HLY +P+++R K C+++S
Sbjct: 254 VIDQDHEAKR---YTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTRLKACTLSS 310
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
P+ GP DAPSWC APF+PEGLLS++ A ++ IG+H+GH+++HFK H +R+ W+ +
Sbjct: 311 PNSGPFREDAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHAERLKQWVSMG 370
Query: 348 SCLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L+ + + L F + +NK LYS SY C TAGA+G++ + Y ++
Sbjct: 371 IVLLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVXSAFYLVI 416
>gi|238481503|ref|NP_001154766.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008207|gb|AED95590.1| uncharacterized protein [Arabidopsis thaliana]
Length = 295
Score = 336 bits (862), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 176/277 (63%), Positives = 209/277 (75%), Gaps = 18/277 (6%)
Query: 8 NNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
+ND ND Q + + ++ R LPP + RL+SLDVFRGLTV
Sbjct: 13 DNDDNDH---QWREKKDIESALQISRSSSLPP------------DKERLVSLDVFRGLTV 57
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI 127
A MILVDDVGGILP+INHSPW+G+TLADFVMPFFLFIVGVSLA YKN C+ VATRKA+
Sbjct: 58 AFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRKAL 117
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
+R+L L LLG+FLQGGF HG+NNL YG+D+ +IR MG+LQRIAIAYLV ALCEIWLKG+
Sbjct: 118 IRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAYLVVALCEIWLKGNH 177
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF---NVT 244
+VSS+LS+ +KYR HWVVA V+TT+YL LLYGLYVPDW+Y+ E S+ F V
Sbjct: 178 NVSSELSMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQILKEDQGSTLTTFLNLKVK 237
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
CGVRG TGP CNAVGM+DR LGIQHLYRKP+Y+RTK
Sbjct: 238 CGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 274
>gi|449454063|ref|XP_004144775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
gi|449490878|ref|XP_004158735.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 490
Score = 334 bits (857), Expect = 4e-89, Method: Compositional matrix adjust.
Identities = 180/389 (46%), Positives = 246/389 (63%), Gaps = 6/389 (1%)
Query: 8 NNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
NN N+ IS P + L+ +S++ +P + +RL SLD+FRGLTV
Sbjct: 9 NNSPNEISQPLISMEEIKPDSTSHHPHRLIS-VDSDALLPKPV-KSKRLASLDIFRGLTV 66
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI 127
ALMILVDD GG P I H+PW G LADFVMPFFLFIVG+++AL K P +++A K
Sbjct: 67 ALMILVDDAGGEWPMIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQLMAIEKVT 126
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIW-LKGD 186
LR L L G+ LQGG+ H + L YGVD+ +IR G+LQRIA+AYLV A E+ K
Sbjct: 127 LRTLKLLFWGLLLQGGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRKTQ 186
Query: 187 GHVS--SKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVT 244
+V + S+F+ Y +W+V + +Y LLYG+YVPDWQ+ S F V
Sbjct: 187 SNVQPFNHFSIFKSYFWNWLVGACILVVYFALLYGIYVPDWQFTVTDSESVYYGRNFTVA 246
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
CGVRG+ P CNAVG IDRK+LGI HLY P + R++ C+ NSP G +APSWC AP
Sbjct: 247 CGVRGNLDPPCNAVGYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAP 306
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMH 363
F+PEG+LSS+ A ++ +IG+HFGH+++HF+DH R+ W+ + L+ LGL L F +
Sbjct: 307 FEPEGILSSISAILSTIIGVHFGHVLIHFQDHSARLKQWVTMGFTLLILGLVLHFTHAIP 366
Query: 364 LNKALYSLSYTCLTAGASGVLLAGIYFMV 392
LNK LY+ SY C+T+GA+ ++ + Y +V
Sbjct: 367 LNKQLYTFSYVCVTSGAAALVFSVFYTLV 395
>gi|357511851|ref|XP_003626214.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
gi|355501229|gb|AES82432.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
Length = 483
Score = 333 bits (854), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 178/370 (48%), Positives = 242/370 (65%), Gaps = 22/370 (5%)
Query: 44 SKQTRPQHQQRRLISLDVFRGLTVA----------------LMILVDDVGGILPAINHSP 87
S + P+ + +R+ SLD+FRGLTVA LMILVDD GG PAI H+P
Sbjct: 20 SAKELPK-KVKRVASLDIFRGLTVADGDLTVFVAVKYRAKQLMILVDDAGGEWPAIGHAP 78
Query: 88 WNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHG 147
WNG LADFVMPFFLFIVG+++ L+ K P K++A +K I+R L L G+ LQGG+ H
Sbjct: 79 WNGCNLADFVMPFFLFIVGMAIPLSLKKIPNKLLAVKKVIVRTLKLLFWGLLLQGGYSHA 138
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG----DGHVSSKLSLFRKYRGHW 203
++L YGVD+ IRW G+LQRIA+AYLV AL EI + D + LS+F Y HW
Sbjct: 139 PDHLSYGVDMKHIRWCGILQRIALAYLVVALVEIISRSRQDRDDPEPTNLSIFTLYYWHW 198
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
+VA + +Y+ LLYG++VPDWQ+ S + F VTCGVRG P CNAVG IDR
Sbjct: 199 LVAACILVVYMPLLYGIHVPDWQFTVHNPDSIYNGTTFTVTCGVRGKLDPPCNAVGYIDR 258
Query: 264 KILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
++LGI H+Y+KP R++ C++ P GP AP+WC APF+PEG+LSS+ A ++ +IG
Sbjct: 259 EVLGINHVYKKPASRRSEACTVKPPYEGPFKKTAPAWCYAPFEPEGILSSISAILSTIIG 318
Query: 324 LHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASG 382
LH+GH+++H +DH R+ WI+L L+ LG L F + LNK LY+LSY C+T+GA+
Sbjct: 319 LHYGHVLIHLQDHLSRLKQWILLGLALLTLGFILHFSHVIPLNKQLYTLSYVCVTSGAAA 378
Query: 383 VLLAGIYFMV 392
++ + Y MV
Sbjct: 379 LVFSAFYVMV 388
>gi|395146531|gb|AFN53685.1| putative aquaporin PIP2-8 [Linum usitatissimum]
Length = 694
Score = 333 bits (853), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 174/372 (46%), Positives = 230/372 (61%), Gaps = 44/372 (11%)
Query: 29 EKLERDPLLPPSNSNSKQTR----PQHQQRRLISLDVFRGLTVALMILVDDVGGILPAIN 84
+K R PL PP S + + + + +RL SLD FRGL + LM+LVD G + P I
Sbjct: 17 QKSFRPPLPPPDFSGREDGQLLMLYRKKNKRLASLDAFRGLCIFLMMLVDYGGHVFPTIA 76
Query: 85 HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGF 144
HS WNG+ LADFVMPFFLFIVGVS+AL YK P +V ATRKA+L+++ LFL+GI LQ
Sbjct: 77 HSAWNGIHLADFVMPFFLFIVGVSIALVYKKAPNRVEATRKALLKSVKLFLVGILLQE-- 134
Query: 145 FHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL----KGDGHVSSKLSLFRKYR 200
QRI+I Y+V A+CEIWL KGD + + + Y
Sbjct: 135 ----------------------QRISIGYIVGAICEIWLSIRRKGD------VGIIKSYY 166
Query: 201 GHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGM 260
HW+ AL + +Y L YGLYVPDWQ+ P + +F V C V+G GPACN+ GM
Sbjct: 167 WHWIAALAIVAVYARLSYGLYVPDWQFSLPGDQHH----VFTVKCSVKGDVGPACNSAGM 222
Query: 261 IDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTC 320
IDR +LG+ HLY KP+Y K C+++S +P DAPSWC APFDPEGLLSS+ A VTC
Sbjct: 223 IDRYVLGLSHLYAKPVYKNLKVCNMSSNK--QVPEDAPSWCHAPFDPEGLLSSLTAAVTC 280
Query: 321 LIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGA 380
+IGL FGH++ H +DH+ R+ NW S + LGL L +G +NK LYS+SY +T+ +
Sbjct: 281 IIGLQFGHVLAHIQDHKGRLENWSGFSVFFLVLGLFLVRLGFPINKPLYSISYMLITSAS 340
Query: 381 SGVLLAGIYFMV 392
+G+ A +Y +V
Sbjct: 341 AGITFAALYLLV 352
>gi|356548323|ref|XP_003542552.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 419
Score = 332 bits (852), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 159/327 (48%), Positives = 221/327 (67%), Gaps = 7/327 (2%)
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILR 129
M+LVDD GG P I+HSPWNG TLADFVMPFFLFIVGV++AL K P A +K ILR
Sbjct: 1 MVLVDDAGGAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVKYAVKKIILR 60
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIW---LKGD 186
L L GI LQGG+ H ++L YGVD+ IRW G+LQRIA+ Y V AL E + L+
Sbjct: 61 TLKLLFWGILLQGGYSHAPDDLSYGVDMRFIRWCGILQRIALVYCVVALIETYTTKLRPS 120
Query: 187 GHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCG 246
LS+F YR W+ V +Y++ ++ LYVPDW + V+ +S P + V CG
Sbjct: 121 TLKPGHLSIFTAYRWQWLGGFVAFVIYMVTIFSLYVPDWSF---VDYNSDKPKRYTVECG 177
Query: 247 VRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFD 306
+RG GPACNAVG +DR++ G+ HLY +P+++R K C+++SP GP+ +AP+WC+APF+
Sbjct: 178 MRGHLGPACNAVGYVDRQVWGVNHLYSQPVWTRLKACTLSSPAEGPLRKNAPAWCRAPFE 237
Query: 307 PEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLN 365
PEG LSSV+A ++ IG+H+GH+++HFK H +R+ W+ + L+ LGL L F + +N
Sbjct: 238 PEGFLSSVLAILSGTIGIHYGHVLIHFKGHFERLKQWLSMGFVLLTLGLILHFTDAIPIN 297
Query: 366 KALYSLSYTCLTAGASGVLLAGIYFMV 392
K LYS SY C TAGA+G++ + Y ++
Sbjct: 298 KQLYSFSYVCFTAGAAGIVFSVFYLLI 324
>gi|449440411|ref|XP_004137978.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
gi|449517341|ref|XP_004165704.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 488
Score = 330 bits (847), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 170/379 (44%), Positives = 231/379 (60%), Gaps = 15/379 (3%)
Query: 32 ERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGL 91
E + + P+ Q R + +R+ +LD FRGLT+ LMILVDD GG I+HSPWNG
Sbjct: 34 EEEKEVAPTIVEEAQLR--QKTKRVATLDAFRGLTIVLMILVDDAGGAYSRIDHSPWNGC 91
Query: 92 TLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNL 151
TLADFVMPFFLFIVGV++AL +K K LR + L G+ LQGG+ H ++L
Sbjct: 92 TLADFVMPFFLFIVGVAIALAFKRIGSIKQGVMKISLRTIKLVFWGLILQGGYSHAPDDL 151
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK-GDGHV---SSKLSLFRKYRGHWVVAL 207
+YGVD+ IRW G+LQRIA+ Y V A+ E + G V S+F YR W+
Sbjct: 152 EYGVDMKHIRWCGILQRIALVYFVVAMIEAFTTIGKPRVVLDHGHFSIFTAYR--WIGGF 209
Query: 208 VLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILG 267
+Y++ Y LYVP+W F V + V CGVRG GPACNAVG +DR++ G
Sbjct: 210 AAFIIYIITTYALYVPNWS--FSVLEDDQLLHHYTVVCGVRGHLGPACNAVGHVDRQVWG 267
Query: 268 IQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG 327
I HLY P++ R K C+ ++PD GP+ DA SWC APF+PEGLLSSV A ++ IG+H+G
Sbjct: 268 INHLYSYPVWIRHKDCTFSAPDEGPLRDDAASWCLAPFEPEGLLSSVSAILSGTIGIHYG 327
Query: 328 HLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLA 386
H+++HFK H R+ W+ + +G+ L F + +NK LYSLSY C TAGA+G++ +
Sbjct: 328 HVLLHFKTHSQRLKQWVSMGFGFFIIGIILHFTNAIPINKQLYSLSYVCFTAGAAGIVFS 387
Query: 387 GIYFMVRYISSHLMLKKPF 405
G Y ++ +KPF
Sbjct: 388 GFYILIDVWG----FRKPF 402
>gi|224123608|ref|XP_002330163.1| predicted protein [Populus trichocarpa]
gi|222871619|gb|EEF08750.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 330 bits (845), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 173/361 (47%), Positives = 234/361 (64%), Gaps = 5/361 (1%)
Query: 36 LLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLAD 95
LL ++ +P +RR+ SLD+FRGLTVALMILVDD GG P + H+PW+G LAD
Sbjct: 14 LLIADGTHFSAPKPD-PERRVASLDIFRGLTVALMILVDDAGGEWPKMGHAPWHGSNLAD 72
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV 155
FVMPFFLFIVG+++ LT+K + A +K I+R L L GI LQGGF H + L YGV
Sbjct: 73 FVMPFFLFIVGMAIPLTFKGITSRDHAVKKMIVRTLKLLFWGIMLQGGFSHAPDKLSYGV 132
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVS---SKLSLFRKYRGHWVVALVLTTL 212
D+ +IRW G+LQRIA AYLV AL EI+ K D +LS+FR Y W+V + +
Sbjct: 133 DMKKIRWCGILQRIAFAYLVMALMEIFTKKDQTKDLPPGRLSIFRLYGSQWLVGACILVV 192
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
YL ++YG+YVP WQ+ E SS +F V C VRG PACNA+ IDRKILGI HLY
Sbjct: 193 YLAVIYGMYVPHWQFTVNDEESSDYGKVFTVECAVRGKLDPACNAIAYIDRKILGINHLY 252
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
+ P + R++ C+ S P AP+WC+APF+P+G+LSS+ + ++ + G HFGH+ VH
Sbjct: 253 QHPAWKRSEACTEASLYEAPFQTSAPTWCKAPFEPDGILSSISSVLSTITGAHFGHVHVH 312
Query: 333 FKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
K R+ +W ++ L+ LGL L F M LNK LY+ SY C+T+GA+ ++ + IY +
Sbjct: 313 LKGDTARLKHWTVMGLALLILGLVLHFTHAMPLNKQLYTFSYVCVTSGAAALVFSAIYIL 372
Query: 392 V 392
V
Sbjct: 373 V 373
>gi|413937083|gb|AFW71634.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
Length = 317
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 153/230 (66%), Positives = 187/230 (81%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
+Q+RL SLDVFRG+TV LMI+VDD GG LPA+NHSPW+G+T+ADF+MPFFLFIVGVSL L
Sbjct: 87 RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 146
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
YK P +V ATRKA+LRAL LF LG+ LQGGFFHG+++L +GVD+ +IR MG+LQRIAI
Sbjct: 147 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 206
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
AYL+AA+CEIWLKGD V S L R+YR V LVL+ Y +LLYG+YVPDW+Y+
Sbjct: 207 AYLLAAVCEIWLKGDDDVDSGYGLLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQIAG 266
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
SSS+ F+V CGVRG TGPACNAVGM+DR +LGI HLYR+P+Y+RTK
Sbjct: 267 PGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTK 316
>gi|296089693|emb|CBI39512.3| unnamed protein product [Vitis vinifera]
Length = 481
Score = 324 bits (831), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 167/376 (44%), Positives = 233/376 (61%), Gaps = 25/376 (6%)
Query: 20 SKSTSAPANEK--LERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVG 77
S AP N+ + D PP + RPQ + +RL SLD+FRGLTVALMILVDD G
Sbjct: 33 SIKDDAPDNQHRLIISDSGFPP------EERPQ-KTKRLASLDIFRGLTVALMILVDDAG 85
Query: 78 GILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLG 137
G P I H+PWNG LADFVMPFFLFIVGV++AL K P +++A +K LR L L G
Sbjct: 86 GEWPMIGHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRLMAIKKVTLRTLKLLFWG 145
Query: 138 IFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFR 197
+ LQG F + L YGVD+ +IRW G+LQ + + S+F+
Sbjct: 146 LLLQGSFTQDPDKLTYGVDMKKIRWCGILQA---------------QAKDLSPGQFSIFK 190
Query: 198 KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNA 257
Y HW++ + +Y+ + YG YVPDW + S+ + V CG RG P CN
Sbjct: 191 LYCWHWLMGACVLIVYMAVSYGTYVPDWHFTVHDRDSADYGKVLTVACGARGKLDPPCNV 250
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
VG IDR+ILG+ H+Y+ P ++R+K C+ SPD GP DAPSWC APF+PEG+LSS+ A
Sbjct: 251 VGYIDREILGMNHMYQHPAWTRSKACNEYSPDKGPFRKDAPSWCYAPFEPEGILSSISAI 310
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCL 376
++ +IG+HFGH+++H K H DR+ +W+++ L+ LG++L F G + LNK LY+ SY C+
Sbjct: 311 LSTIIGVHFGHVLMHLKGHSDRLKHWVVMGFALLVLGITLHFTGAIPLNKQLYTFSYVCV 370
Query: 377 TAGASGVLLAGIYFMV 392
T+GA+ ++ + Y +V
Sbjct: 371 TSGAAALVFSFFYILV 386
>gi|242067981|ref|XP_002449267.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
gi|241935110|gb|EES08255.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
Length = 512
Score = 322 bits (826), Expect = 2e-85, Method: Compositional matrix adjust.
Identities = 161/384 (41%), Positives = 236/384 (61%), Gaps = 28/384 (7%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIV 105
+ P+ + RR+ +LD FRGLT+ LMILVDD GG I+HSPWNG TLADFVMPFFLFIV
Sbjct: 47 EEVPKKKSRRVAALDAFRGLTIVLMILVDDAGGAYERIDHSPWNGCTLADFVMPFFLFIV 106
Query: 106 GVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
GV++A K P A ++ +R L + G+ LQGG+ H ++L YGVD+ +IRWMG+
Sbjct: 107 GVAIAFALKRVPNMGNAVKRITIRTLKMLFWGVLLQGGYSHAPDDLSYGVDMKKIRWMGI 166
Query: 166 LQ--------------------RIAIAYLVAALCE---IWLKGDGHVSSKLSLFRKYRGH 202
LQ RIA+ Y + AL E + ++ S ++F +R
Sbjct: 167 LQLYIYHGNNLDSFLFFTLGHQRIALVYFIVALIEAFTVKVRPTTVRSGPYAIFNAHRWQ 226
Query: 203 WVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMID 262
W+ + +Y++ + LYVPDW Y + + + F V CGVR S ACNAVG +D
Sbjct: 227 WLGGFIAFVIYMVTTFSLYVPDWSYVYHNDGDVNDGKQFTVKCGVRASLEQACNAVGYVD 286
Query: 263 RKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLI 322
R++ GI HLY +P++ R+K C+ +SP+ GP+ DAP WC APF+PEGLLSS+ + ++ I
Sbjct: 287 RQVWGINHLYTQPVWIRSKDCTSSSPNMGPLRADAPEWCLAPFEPEGLLSSISSVLSGTI 346
Query: 323 GLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGAS 381
G+H+GH+++HFK H++R+ +W++ L+ L + L F + +NK LYS SY C T GA+
Sbjct: 347 GIHYGHVLIHFKTHKERLKHWLVTGFSLLVLAIILHFTNAIPINKQLYSFSYVCFTGGAA 406
Query: 382 GVLLAGIYFMVRYISSHLMLKKPF 405
G++L+ Y ++ L+KPF
Sbjct: 407 GIVLSAFYILIDVWG----LRKPF 426
>gi|357152403|ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Brachypodium distachyon]
Length = 498
Score = 318 bits (815), Expect = 3e-84, Method: Compositional matrix adjust.
Identities = 159/367 (43%), Positives = 238/367 (64%), Gaps = 11/367 (2%)
Query: 37 LPPSNSNSKQTR------PQHQQR-RLISLDVFRGLTVALMILVDDVGGILPAINHSPWN 89
+ + +N+++ R PQ ++ R+ +LD FRGLT+ +MILVDD G ++HSPWN
Sbjct: 37 IDAAGANAEKERLAVVEEPQKKKSTRVAALDAFRGLTIVVMILVDDAGSSYERMDHSPWN 96
Query: 90 GLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGIN 149
G TLADFVMPFFLFIVGV++A K P A +K +R L + G+ LQGG+ H +
Sbjct: 97 GCTLADFVMPFFLFIVGVAIAFAMKRVPNMGAAVKKVSVRTLKMIFWGLLLQGGYSHAPD 156
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIW---LKGDGHVSSKLSLFRKYRGHWVVA 206
+L YGVD+ IRW G+LQRIA+ Y AL E++ ++ S ++F YR W+ A
Sbjct: 157 DLAYGVDMKMIRWCGILQRIALVYFAVALIEVFTTKVRPTTVRSGPYAIFDAYRWQWLGA 216
Query: 207 LVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
++ +Y++ + LYVPDW + + + + F V CGVRG PACNAVG IDR++
Sbjct: 217 FIVLVIYMITTFSLYVPDWSFVYHNDGDINDGKRFTVQCGVRGHLDPACNAVGFIDRQVW 276
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
GI HLY +P++ RTK C+ +SP+ G + DAP+WC PF+PEGLLSS+ + ++ IG+H+
Sbjct: 277 GINHLYSQPVWIRTKDCTFSSPETGKLRDDAPAWCLGPFEPEGLLSSISSIISGTIGIHY 336
Query: 327 GHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLL 385
GH+++HFK H++R+ +W+ + L+ LG+ L F + +NK LYS SY C T GA+G++L
Sbjct: 337 GHVLIHFKTHKERLTHWLSMGFALLLLGILLHFTNAIPINKQLYSFSYICFTGGAAGIVL 396
Query: 386 AGIYFMV 392
+ Y ++
Sbjct: 397 SAFYALI 403
>gi|413947252|gb|AFW79901.1| hypothetical protein ZEAMMB73_198786 [Zea mays]
Length = 505
Score = 292 bits (747), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 141/238 (59%), Positives = 174/238 (73%), Gaps = 3/238 (1%)
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
+D W V RIAIAYL+AA+CEIWLKGD V S L R+YR V LVL+ Y
Sbjct: 268 IDTFSQCWKKV--RIAIAYLLAAVCEIWLKGDDDVDSGYGLLRRYRYQLFVGLVLSIAYS 325
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRK 274
+LLYG+YVPDW+Y+ SSS+ F V CGVRG T PACNAVGM+DR ILGI HLYR+
Sbjct: 326 ILLYGIYVPDWEYQIAGPGSSSTKKSFFVKCGVRGDTRPACNAVGMVDRTILGIDHLYRR 385
Query: 275 PIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
P+Y RTK+CSI+ + GP+P DAPSWCQAPFDPEGLLS VMA VTCLIGL F H+I+HF+
Sbjct: 386 PVYVRTKECSIDYLENGPLPPDAPSWCQAPFDPEGLLSFVMAIVTCLIGLQFRHVIIHFE 445
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
HR R+ +W++ S ++ L +DFVGM +NK LY++SYT L AGA+G+L GIY +V
Sbjct: 446 KHRGRIASWLVPSFSMLALAFVMDFVGMRMNKPLYTMSYT-LAAGAAGLLFPGIYVLV 502
>gi|255642425|gb|ACU21476.1| unknown [Glycine max]
Length = 326
Score = 290 bits (743), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 146/280 (52%), Positives = 193/280 (68%), Gaps = 8/280 (2%)
Query: 117 PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVA 176
P + AT KA RALNLF LGI LQGG+FHG+ +L +GVDI +IRW+G+LQRI+I Y+VA
Sbjct: 44 PHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQRIRWLGILQRISIGYIVA 103
Query: 177 ALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSS 236
ALCEIWL +L + Y W VA++L LY LLYGLYVPDWQ++ TSS
Sbjct: 104 ALCEIWLPAPRW--KELGFVKSYYWQWFVAVILLALYSGLLYGLYVPDWQFDVSASTSSL 161
Query: 237 SPW----IFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGP 292
P I+ V C VRG GPACN+ GMIDR ILG+ HLYRKP+Y K C++++ G
Sbjct: 162 PPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHLYRKPVYRNLKGCNMSAK--GQ 219
Query: 293 MPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIG 352
+ +PSWC APFDPEG+LSS+ A V+C+IGL +GH++ H +DH+ R+ NW+ S +
Sbjct: 220 VSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLAHLQDHKGRLYNWMCFSLSFLA 279
Query: 353 LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
LGL L +G+ LNK+LY++SY LT+ ASG+ +YF+V
Sbjct: 280 LGLFLALIGIPLNKSLYTVSYMLLTSAASGLTFIALYFLV 319
>gi|395146473|gb|AFN53630.1| putative aquaporin PIP2-8 [Linum usitatissimum]
Length = 692
Score = 287 bits (734), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 144/298 (48%), Positives = 198/298 (66%), Gaps = 17/298 (5%)
Query: 114 KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAY 173
K P +V ATRKA L+++ LFL+GI LQGGFFHG+++L YGVDI +IR +G+LQRI+I Y
Sbjct: 96 KKVPNRVEATRKAFLKSVKLFLVGILLQGGFFHGLHSLTYGVDIERIRLLGILQRISIGY 155
Query: 174 LVAALCEIWL----KGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
+V A+CEIWL KGD + + + Y HWV AL + +Y L YGLYVPDWQ+
Sbjct: 156 IVGAICEIWLSVRRKGD------VGIIKSYYSHWVAALAIVVVYARLSYGLYVPDWQFAL 209
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
P + ++ V C V+G GPACN+ GM+DR +LG+ HLY KP+Y K C+++S
Sbjct: 210 PQDQHH----VYTVKCSVKGDVGPACNSAGMMDRYVLGLSHLYAKPVYKNLKICNMSSNK 265
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
+P DAPSWC APFDPEGLLSS+ A VTC+IGL FGH++ H +DH+ R+ NW S
Sbjct: 266 --QVPEDAPSWCHAPFDPEGLLSSLTAAVTCIIGLQFGHVLAHVQDHKGRLENWSGFSVF 323
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPFDY 407
+ LGL L +G +NK LYS+SY +T+ ++G+ A +Y +V + L PF++
Sbjct: 324 FLLLGLFLVLLGFPINKPLYSISYMLITSASAGITFAALYLLVD-VYGQRWLSLPFEW 380
>gi|326432441|gb|EGD78011.1| hypothetical protein PTSG_09649 [Salpingoeca sp. ATCC 50818]
Length = 1087
Score = 261 bits (667), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 153/373 (41%), Positives = 206/373 (55%), Gaps = 55/373 (14%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112
+ RL SLDVFRG TVALM+ VD+ G P I+HSPWNG+ LADFVMPFF FIVGVSLAL+
Sbjct: 635 KERLSSLDVFRGFTVALMVFVDETGAAFPPIDHSPWNGVRLADFVMPFFDFIVGVSLALS 694
Query: 113 YKNFPCKVVAT--------RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+K F + T RKA +R L LF+LG+ QG GI+ + Y D+A IR MG
Sbjct: 695 FKKFDLEDATTTPRVWPALRKATIRFLKLFILGMLTQG----GIDIMNY--DLAHIRIMG 748
Query: 165 VLQRIAIAYLVAALCEIWLKGD-------------GHVSSKLSLFRKYRGHWVVALVLTT 211
+LQR+A+ Y AL EI+L + G L + +Y+ HW A L
Sbjct: 749 ILQRVAVCYYAVALMEIFLPRNKKYRNYNETDTVTGWAVDVLHMLWRYKWHWFTAACLFA 808
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
+ ++YG+ VPD F CG RG PACNA IDR +L ++H+
Sbjct: 809 THTGIMYGVNVPD---------------AFGEECG-RGVLTPACNAATYIDRNVLTVEHM 852
Query: 272 Y----------RKPIYSRTKQCSINSPDYGPMPLDAPSWC-QAPFDPEGLLSSVMATVTC 320
Y + R +CS SP P DAP+WC PFDPEGL+SS+ A +
Sbjct: 853 YFPANGGDKSGNDVTFQRLPECSTCSPGKCVPPEDAPAWCLHGPFDPEGLVSSLNAIIAT 912
Query: 321 LIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMH-LNKALYSLSYTCLTAG 379
+IG+H+GH++ + + R+++W + +G +L F G +N LYS+SYT +TAG
Sbjct: 913 VIGIHYGHVLRRVQSPKARIVHWTAFGVVQLVIGFALHFSGAFVMNTDLYSISYTLVTAG 972
Query: 380 ASGVLLAGIYFMV 392
GVLLA Y +V
Sbjct: 973 TGGVLLALFYVIV 985
>gi|62701854|gb|AAX92927.1| hypothetical protein LOC_Os11g14080 [Oryza sativa Japonica Group]
gi|77549602|gb|ABA92399.1| D8Ertd354e protein, putative [Oryza sativa Japonica Group]
gi|125576749|gb|EAZ17971.1| hypothetical protein OsJ_33516 [Oryza sativa Japonica Group]
Length = 447
Score = 239 bits (610), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 137/359 (38%), Positives = 188/359 (52%), Gaps = 69/359 (19%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIV 105
+ P+ + +R+ +LD FRGLT+ LMILVDD GG ++HSPWNG TLADFVMPFFLFIV
Sbjct: 51 EEEPRKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIV 110
Query: 106 GVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
GV++A K P A +K +R L + G+ LQGG+ H ++L YGVD+ +IRW G+
Sbjct: 111 GVAIAFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGI 170
Query: 166 LQRIAIAYLVAALCEIW---LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYV 222
LQRIA+ Y V AL E + ++ S ++F YR W+ V +Y++ + LYV
Sbjct: 171 LQRIALVYFVVALIEAFTTKVRPTTVRSGPYAIFHAYRWQWLGGFVALFIYMVTTFSLYV 230
Query: 223 PDWQYEFPVETSSSSPWIF---------NVTCGVRGSTGPACNAVGMIDRKILGIQHLYR 273
PDW Y + + + F +V CGVRG PACNAVG +DR + GI HLY
Sbjct: 231 PDWSYVYHNDGDVNDGKQFTVLLAVFPDHVQCGVRGHLDPACNAVGYVDRVVWGINHLYT 290
Query: 274 KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHF 333
+P++ R+K I+S
Sbjct: 291 QPVWIRSKFNIIDSV--------------------------------------------- 305
Query: 334 KDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
RD NW L +C S F + +NK LYSLSY C TAGA+GV+L+ Y ++
Sbjct: 306 ---RD---NWDPLWTC------SRSFQAIPINKQLYSLSYVCFTAGAAGVVLSAFYILI 352
>gi|167522597|ref|XP_001745636.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775985|gb|EDQ89607.1| predicted protein [Monosiga brevicollis MX1]
Length = 1047
Score = 238 bits (607), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 157/411 (38%), Positives = 216/411 (52%), Gaps = 62/411 (15%)
Query: 32 ERDPLLPPSNSNSKQTR-----------PQHQQRRLISLDVFRGLTVALMILVDDVGGIL 80
R PLLP S ++ Q++ P+ + RL +LDV+RGLT+A+MILVD+ G
Sbjct: 572 NRTPLLPASTDSNIQSKSNIDLATDPVAPKPPRERLSALDVYRGLTIAVMILVDETGAAF 631
Query: 81 PAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRK---AILRALNLFLLG 137
P I+H+PWNGL LAD V+P F FIVGVS+AL +K F + A + A +A + FL
Sbjct: 632 PPIDHAPWNGLHLADTVVPSFDFIVGVSIALAFKRFDLEAGAQGQRWTAFKKATDRFL-- 689
Query: 138 IFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL--------KGDGHV 189
F GI + Y D+ IR G+LQR+A+ Y AL EI+L +G
Sbjct: 690 -----KLFGGITFMNY--DLTNIRIFGILQRVAVCYFAVALMEIFLPRLTGALPADNGTW 742
Query: 190 SSKLS----LFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTC 245
+ + LF +YR HW A +L ++ +LYG+ VPD F C
Sbjct: 743 ADWMRRTQHLFWRYRWHWFSAALLLAVHTSILYGVDVPD---------------AFGERC 787
Query: 246 GVRGSTGPACNAVGMIDRKILGIQHLY----------RKPIYSRTKQCSINSPDYGPMPL 295
G RG PACNA IDR IL + H+Y + R +CS SP P
Sbjct: 788 G-RGQLTPACNAATYIDRLILTVPHMYFPENGGDPAHADVTFKRLPECSSCSPGLCVAPA 846
Query: 296 DAPSWC-QAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLG 354
DAP+WC PFDPEGL+SS+ A VT +IG+H+GH++ K +R+ W + + LG
Sbjct: 847 DAPAWCLHGPFDPEGLVSSLTAIVTTIIGVHYGHVLRQIKSPMERIFQWSSFALLQLLLG 906
Query: 355 LSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
L L F G+ LN LYS+S+ +T G +G+LL Y +V Y + L PF
Sbjct: 907 LILHFSGIPLNINLYSVSFVLVTGGMTGLLLVLCYLIVDYRPTARWLWLPF 957
>gi|357510831|ref|XP_003625704.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
gi|355500719|gb|AES81922.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
Length = 444
Score = 237 bits (605), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 114/275 (41%), Positives = 171/275 (62%), Gaps = 8/275 (2%)
Query: 122 ATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEI 181
+K ILR L L GI LQGG+ H + L YGV++ IRW G+LQRIA+ Y + AL E
Sbjct: 79 TVKKIILRTLKLLFWGILLQGGYSHAPDELVYGVNMKFIRWCGILQRIALVYCIVALIET 138
Query: 182 W---LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSP 238
+ L+ ++++F Y+ W + +Y++ + LYVP+W F ++ P
Sbjct: 139 FTTKLRPTTLSPGRIAIFTAYK--WFGGFMAFLIYMITTFALYVPNWS--FVDHVNNDEP 194
Query: 239 WIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAP 298
+ V CG+RG GPACNAVG +DR+ G+ HLY +P++ R K C+ +SP GP DAP
Sbjct: 195 KRYTVICGMRGHLGPACNAVGYVDRQTWGVNHLYSQPVWRRLKACTFSSPSEGPFRDDAP 254
Query: 299 SWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLD 358
SWC APF+PEGLLSS+ A ++ IG+H+GH+++HFK H +R+ W + L+ + + L
Sbjct: 255 SWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKSHSERLKQWFSMGFVLLVVAIILH 314
Query: 359 FV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
F + +NK LYS SY C TAGA+G++ + +Y ++
Sbjct: 315 FTDAIPINKQLYSFSYVCFTAGAAGIIFSILYILI 349
>gi|255635187|gb|ACU17949.1| unknown [Glycine max]
Length = 217
Score = 223 bits (569), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 112/189 (59%), Positives = 132/189 (69%), Gaps = 2/189 (1%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
Q Q R+ SLDVFRGL+V LMI VD I P I H+PWNG LADFVMPFFLFI G+SL
Sbjct: 16 QFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGTHLADFVMPFFLFIAGISL 75
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
AL YK P + AT KA RALNLF LGI LQGG+FHG+ +L +GVDI +IRW+G+LQRI
Sbjct: 76 ALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQRIRWLGILQRI 135
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
+I Y+VAALCEIWL +L + Y W VA++L LY LLYGLYVPDWQ++
Sbjct: 136 SIGYIVAALCEIWLPAPRW--KELGFVKSYYWQWFVAVILLALYSGLLYGLYVPDWQFDV 193
Query: 230 PVETSSSSP 238
TSS P
Sbjct: 194 SASTSSLPP 202
>gi|413953638|gb|AFW86287.1| hypothetical protein ZEAMMB73_717084 [Zea mays]
Length = 357
Score = 223 bits (568), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 112/244 (45%), Positives = 147/244 (60%), Gaps = 40/244 (16%)
Query: 119 KVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
K AT+KA +RA LF+LG+ LQGG+ HG + L YGVD+ IRW+GVLQRIAI Y VAA+
Sbjct: 4 KTAATKKAAIRASKLFILGVILQGGYIHGRHKLTYGVDLDHIRWLGVLQRIAIGYFVAAM 63
Query: 179 CEIWLKGDGHVSSKLSLFRKYRGHWV---------------------------------- 204
EIWL + V S + +KY W+
Sbjct: 64 SEIWLVNNNLVDSPVPFVKKYFIEWIAIGYFVAAMSEIWLVNNNLVDSPVPFVKKYFIEW 123
Query: 205 -VALVLTTLYLLLLYGLYVPDWQYEFPVETS-----SSSPWIFNVTCGVRGSTGPACNAV 258
+A+ +T LY+ L++GLYV +W++E S S+S + CGVRGS GP CNAV
Sbjct: 124 FMAIAITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAV 183
Query: 259 GMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATV 318
G++DR +LG HLY+ P+Y RTK+CSINSPDYGP+P +AP WC APFDPEGLLS + TV
Sbjct: 184 GLVDRVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSKPLYTV 243
Query: 319 TCLI 322
++
Sbjct: 244 NYML 247
>gi|413918234|gb|AFW58166.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
Length = 202
Score = 220 bits (560), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 107/200 (53%), Positives = 143/200 (71%), Gaps = 1/200 (0%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
M +Y + +D + + T+AP + +R P + + +++QR L+SLD
Sbjct: 1 MGRYELVRSDDAPAVAADLEAGTAAPCDGYPKRRVFTPSPSPSPAPASTRYRQR-LVSLD 59
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
VFRG+TV LMI+VDD G +PA+NHSPW+G+T+ADFVMPFFLFIVGV+LAL YK P K+
Sbjct: 60 VFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKL 119
Query: 121 VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
A+RKA+LRAL LF LG+ LQGGFFHG+ +L +GVD+ +IR MGVLQRIAIAYL+ ALCE
Sbjct: 120 DASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAYLLTALCE 179
Query: 181 IWLKGDGHVSSKLSLFRKYR 200
IW++GD V L ++YR
Sbjct: 180 IWIRGDEDVDYGYDLLKRYR 199
>gi|125533951|gb|EAY80499.1| hypothetical protein OsI_35679 [Oryza sativa Indica Group]
Length = 444
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 132/359 (36%), Positives = 184/359 (51%), Gaps = 72/359 (20%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIV 105
+ P+ + +R+ +LD FRGLT+ LMILVDD GG ++HSPWNG TLADFVMPFFLFIV
Sbjct: 51 EEEPRKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIV 110
Query: 106 GVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
GV++A K P A +K +R L + G+ LQGG+ H ++L YGVD+ +IRW G+
Sbjct: 111 GVAIAFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGI 170
Query: 166 LQRIAIAYLVAALCEIWLKG--DGHVSSKLSLFRK-YRGHWVVALVLTTLYLLLLYGLYV 222
LQ + + + A L+G D + K S + R VAL + Y++ + LYV
Sbjct: 171 LQNLLVLFDNAEDSFGVLRGCSDRGIHHKSSAYDSAVRLGGFVALFI---YMVTTFSLYV 227
Query: 223 PDWQYEFPVETSSSSPWIF---------NVTCGVRGSTGPACNAVGMIDRKILGIQHLYR 273
PDW Y + + + F +V CGVRG PACNAVG +DR + GI HLY
Sbjct: 228 PDWSYIYHNDGDVNDGKQFTVLLAVFPDHVQCGVRGHLDPACNAVGYVDRVVWGINHLYT 287
Query: 274 KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHF 333
+P+ W ++ F+ ++ SV
Sbjct: 288 QPV-----------------------WIRSKFN---IVDSV------------------- 302
Query: 334 KDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
RD NW L + S F + +NK LYSLSY C TAGA+GV+L+ Y ++
Sbjct: 303 ---RD---NWDPLWT------RSRSFQAIPINKQLYSLSYVCFTAGAAGVVLSAFYILI 349
>gi|413920627|gb|AFW60559.1| hypothetical protein ZEAMMB73_831897 [Zea mays]
Length = 343
Score = 196 bits (497), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 96/230 (41%), Positives = 148/230 (64%), Gaps = 4/230 (1%)
Query: 167 QRIAIAYLVAALCE---IWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
QRIA+ Y AL E + ++ S ++F YR W+ LV +Y++ + LYVP
Sbjct: 19 QRIALVYFFVALIEALTVKVRPTTVRSGPYAIFDAYRWQWLGGLVAFVVYMVTTFSLYVP 78
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
DW + + E + F V CGVR S ACNAVG +DR++ GI HLY +P++ R+K C
Sbjct: 79 DWSFVYHNEGDVNDGKQFTVKCGVRASLEQACNAVGYVDRQVWGINHLYTQPVWIRSKDC 138
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
+ +SP+ GP+ DAP+WC APF+PEGLLSS+ + ++ IG+H+GH+++HFK H++R+ +W
Sbjct: 139 TSSSPNMGPLRSDAPAWCLAPFEPEGLLSSISSVLSGTIGIHYGHVLIHFKTHKERLKHW 198
Query: 344 IILSSCLIGLGLSLDFV-GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ L+ LG+ L F + +NK LYS SY C T GA+G++L+ Y ++
Sbjct: 199 LLTGFSLLVLGIILHFTNAIPINKQLYSFSYVCFTGGAAGIVLSAFYILI 248
>gi|10177926|dbj|BAB11337.1| unnamed protein product [Arabidopsis thaliana]
Length = 384
Score = 174 bits (442), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 74/94 (78%), Positives = 84/94 (89%)
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
V CGVRG TGP CNAVGM+DR LGIQHLYRKP+Y+RTKQCSIN P+ GP+P DAPSW
Sbjct: 259 LKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSW 318
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
CQAPFDPEGLLSS+MATVTCL+GLH+GH+I+HFK
Sbjct: 319 CQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFK 352
Score = 171 bits (433), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 96/164 (58%), Positives = 118/164 (71%), Gaps = 15/164 (9%)
Query: 8 NNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
+ND ND Q + + ++ R LPP + RL+SLDVFRGLTV
Sbjct: 113 DNDDNDH---QWREKKDIESALQISRSSSLPP------------DKERLVSLDVFRGLTV 157
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI 127
A MILVDDVGGILP+INHSPW+G+TLADFVMPFFLFIVGVSLA YKN C+ VATRKA+
Sbjct: 158 AFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAYKNLSCRFVATRKAL 217
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
+R+L L LLG+FLQGGF HG+NNL YG+D+ +IR MG+LQ + +
Sbjct: 218 IRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQNLKV 261
>gi|121489785|emb|CAK18864.1| hypothetical protein [Phillyrea latifolia]
Length = 129
Score = 163 bits (413), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 73/130 (56%), Positives = 98/130 (75%), Gaps = 2/130 (1%)
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP 307
G GPACN+ GMIDR +LGI HLY KP+Y K+C+I+S +G +P APSWC APFDP
Sbjct: 1 NGDLGPACNSAGMIDRNVLGIDHLYAKPVYRNLKECNISS--HGQVPETAPSWCHAPFDP 58
Query: 308 EGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKA 367
EG+LSS+ A V+C+IGL +GH++V +DH++R+ NW I S +GLGL L FVG+ LNK+
Sbjct: 59 EGILSSLTAAVSCIIGLQYGHILVRLQDHKERLCNWSIFSFAFLGLGLFLAFVGIPLNKS 118
Query: 368 LYSLSYTCLT 377
LY++SY +T
Sbjct: 119 LYTISYLLVT 128
>gi|391346547|ref|XP_003747534.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Metaseiulus occidentalis]
Length = 564
Score = 149 bits (375), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 124/386 (32%), Positives = 187/386 (48%), Gaps = 60/386 (15%)
Query: 38 PPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFV 97
PP + K +P R+ SLD FRG + LMI V+ GG L H PW+GLT AD +
Sbjct: 164 PPEDGIGKAGKP-----RIKSLDAFRGFCLFLMIFVNYGGGGLWLFEHIPWDGLTFADLL 218
Query: 98 MPFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFFHGINNLKY 153
P+F++I+GVS+A++ ++ K V K + R++ LFLLG+ L G NN
Sbjct: 219 FPWFVWIMGVSMAISLRSMRRKCVPLSEIFFKILSRSVKLFLLGLILNS---MGKNN--- 272
Query: 154 GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK--GDGHVSSKLSLFRK---YRGHWVVALV 208
DI+++R GVLQR A++Y V A ++ D ++K + R Y WV+ +
Sbjct: 273 --DISKLRIPGVLQRFAVSYFVVASMHMFFSRATDAAETAKWAKIRDVALYWQEWVMMIS 330
Query: 209 LTTLYLLLLYGLYVPDWQ--YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
L +++LL + L VPD Y P + FN T G A G IDR +L
Sbjct: 331 LVAIHVLLTFLLDVPDCPKGYLGPGGLHENGT-HFNCTGG----------AAGYIDRVVL 379
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G H+Y P + + S PFDPEG+L + + +GL
Sbjct: 380 GPNHMYGHPTTEKIYETS------------------QPFDPEGVLGCLTSIFLTFLGLQA 421
Query: 327 GHLIVHFKDHRDRMLNWI---ILSSCLIGL--GLSLDFVGMHLNKALYSLSYTCLTAGAS 381
G +++ F + R+ WI +L L G+ G S + + +NK L+SLSY TAG +
Sbjct: 422 GKILLTFNNPGRRLSRWICWGVLLGLLAGILCGFSKEDGWIPINKNLWSLSYVLCTAGLA 481
Query: 382 GVLLAGIYFMVRYISSHLMLKKPFDY 407
+LL+ Y ++ ++ L PF Y
Sbjct: 482 FLLLSVFYLIIDVLA--LWSAVPFIY 505
>gi|443694948|gb|ELT95966.1| hypothetical protein CAPTEDRAFT_92095 [Capitella teleta]
Length = 431
Score = 144 bits (364), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 121/432 (28%), Positives = 192/432 (44%), Gaps = 74/432 (17%)
Query: 10 DANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVAL 69
++N + + ST + EK E + P + ++ RL SLD FRGL + L
Sbjct: 2 ESNVSISSAKTDSTRRNSEEKDEGKLITPK----------EVKKERLRSLDAFRGLNILL 51
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKA--- 126
MI V+ GG +H+ WNGL + D + P+F+FI+G SL L + K V +
Sbjct: 52 MIFVNYGGGGYWYFSHAVWNGLYITDLIFPWFIFIMGTSLGLGISSLVKKEVDPVEGLWG 111
Query: 127 -ILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC------ 179
I R++ LF +GI + N K D+ IR GVLQR+A+ Y + A+
Sbjct: 112 IIWRSVKLFAVGI---------MYNTKSSNDLENIRMTGVLQRLAMVYFITAIVHYAGES 162
Query: 180 -EIWLKGDGHVSSK---LSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSS 235
+ ++ G VS LS Y G W+ LV+ +Y Y VP E
Sbjct: 163 LQCCMRSRGTVSRWRHILSDLAPYFGEWITMLVIIGIYCYFTYWFAVPG------CEAGY 216
Query: 236 SSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
P + G TG A ID K+ ++H+Y+ P Q
Sbjct: 217 VGPGGLHRDGAHAGCTG---GAALYIDLKVYTMRHIYQWPDIRTIYQ------------- 260
Query: 296 DAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGL 355
+ FDPEGLL ++ + C +GL G ++V K HR+R++ W+I + G+G
Sbjct: 261 -----TDSAFDPEGLLGTLTSIFLCFLGLQAGKILVCHKGHRERLVRWLIWAIITGGIGT 315
Query: 356 -----SLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPFDYS-- 408
L+ + +NK + S+S+ ++AG ++L+ +Y ++ S L +PF Y+
Sbjct: 316 LLCKAQLEDGWVPINKNIMSISFVLVSAGTGFIMLSVMYILID--SWKLWNGQPFTYAGM 373
Query: 409 -----YACKSML 415
Y C S+
Sbjct: 374 NSIVLYMCHSIF 385
>gi|443685781|gb|ELT89271.1| hypothetical protein CAPTEDRAFT_227545 [Capitella teleta]
Length = 605
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 121/389 (31%), Positives = 175/389 (44%), Gaps = 44/389 (11%)
Query: 18 QISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVG 77
Q+ + P E ++ + H++R L SLD FRG+++ +MI V+ G
Sbjct: 175 QLLEDLGDPEESDPEMQTESATDDAETTAVNKTHKER-LRSLDAFRGMSLTIMIFVNYGG 233
Query: 78 GILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT----YKNFPCKVVATRKAILRALNL 133
G +HS WNGLTLAD V P+F +I+G +LAL+ + K K I R L
Sbjct: 234 GGYWFFDHSYWNGLTLADLVFPWFTWIIGTALALSIQGQMRRGKTKFSIAAKIIRRTCVL 293
Query: 134 FLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IWLKGD-GHVSS 191
F LGI L G VD+ +R GVLQR+AI+YLV AL I+ K + H S
Sbjct: 294 FALGIVLGSG------GGSEPVDVQTLRIPGVLQRLAISYLVVALLHLIFAKANKDHQPS 347
Query: 192 KLSLFRKYRGHWV---VALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVR 248
+L + R HW + LV+ +L L + L + D + P T P +
Sbjct: 348 RLDMVRDITDHWPQWGIVLVMVACHLGLTFLLPISDVEGTCP--TGYLGPGGLHEGGKYE 405
Query: 249 GSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPE 308
TG A +IDR QH+Y+ P + P DPE
Sbjct: 406 NCTG---GAAAVIDRWFFSRQHVYQTPTCKEVYKTV------------------EPHDPE 444
Query: 309 GLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI---ILSSCLIGL--GLSLDFVGMH 363
G+L ++ + C +GL G ++ FK RM WI I+ + GL G D +
Sbjct: 445 GILGTLTSIFLCFLGLQAGVILTTFKQKSPRMRRWIVWGIILGLIAGLLCGFKQDGGWIP 504
Query: 364 LNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+NK L+SLS+ A + VLLA Y ++
Sbjct: 505 VNKNLWSLSFVLGLASMAFVLLAVFYLLI 533
>gi|449500329|ref|XP_004174928.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
N-acetyltransferase-like [Taeniopygia guttata]
Length = 789
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 116/384 (30%), Positives = 176/384 (45%), Gaps = 61/384 (15%)
Query: 4 YNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFR 63
Y +N A D++ I+ +P + + DP P ++ S+Q RL SLD FR
Sbjct: 360 YKKLNPRATDRM---INSELGSPNTDSINCDPTPSPWSTGSRQ--------RLRSLDTFR 408
Query: 64 GLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCK 119
GL++ +M+ V+ GG H WNGLT+AD V P+F+FI+G S+AL + K
Sbjct: 409 GLSLVIMVFVNYGGGKYWFFKHVSWNGLTVADLVFPWFVFIMGTSIALALGSMLRWGSSK 468
Query: 120 VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
RK I R+ L LLGI + N + +R GVLQR+ YLV A
Sbjct: 469 WKVLRKIIWRSFVLILLGIIVVNP-----NYCLGPLSWDNLRIPGVLQRLGFTYLVVAAL 523
Query: 180 EIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
E+ L Y W+ LVL T++L L + L VPD +
Sbjct: 524 ELLFTRADRRFPALQDILPYWPQWIFILVLETIWLCLTFLLPVPDCPRGY---------- 573
Query: 240 IFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
+ G G G P C A G IDR ILG +H+Y+ P T Q ++
Sbjct: 574 ---LGPGGIGDFGKYPNCTGGAAGYIDRLILGEKHMYQHPSSGVTYQSTM---------- 620
Query: 296 DAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-- 353
P+DPEG+L ++ + V +GL G + + +KDH ++++ I+ ++G+
Sbjct: 621 --------PYDPEGILGTINSIVMAFLGLQAGKITLFYKDHPKQIMSRFIIWGIVMGVIS 672
Query: 354 ----GLSLDFVGMHLNKALYSLSY 373
S + + +NK L+S+SY
Sbjct: 673 AILTKCSKEEGFIPVNKNLWSISY 696
>gi|326427923|gb|EGD73493.1| heparan-alpha-glucosaminide N-acetyltransferase [Salpingoeca sp.
ATCC 50818]
Length = 788
Score = 140 bits (353), Expect = 1e-30, Method: Compositional matrix adjust.
Identities = 122/412 (29%), Positives = 187/412 (45%), Gaps = 82/412 (19%)
Query: 28 NEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSP 87
+K RDPLL +S P+ + RL SLD FRG+ + +MI V+ GG +HS
Sbjct: 338 TQKYTRDPLL---SSTHAIGNPKRSKTRLQSLDSFRGMALTIMIFVNYGGGDYNFFDHSV 394
Query: 88 WNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGG 143
WNGLT+AD V P+F++I+G S+A+T+ + + R K R L LF +G+
Sbjct: 395 WNGLTVADLVFPWFIWIMGTSMAITFNSLFKRHTPLRTILYKVARRTLLLFGIGVI---- 450
Query: 144 FFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGD----GHVSSKLSLFRK- 198
F + +++L++ R GVLQR AIAYLV AL I++ +V L R+
Sbjct: 451 FINVVHDLRFA------RVPGVLQRFAIAYLVVALVIIFVPKAVSLLRNVDEVTPLIRRL 504
Query: 199 ---------------------------YRGHWVVALVLTTLYLLLLYGLYVPDWQ--YEF 229
Y G W+ +VL ++ + + L VP Y
Sbjct: 505 TPTVRNPASDLDPGGCGMLRHLPDVAPYVGEWIAIIVLVVIHTCITFLLPVPGCPTGYIG 564
Query: 230 P----VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
P E +P N +C V G+ A G +DR +L +H+Y P T Q
Sbjct: 565 PGGALAEFGQFAP--ANGSC-VNGTFCCEGGAAGHVDRWLLSWKHIYGSPTSQETYQ--- 618
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
+DPEG+L S+ + + C +GL G +IVH+K R R + W+
Sbjct: 619 ----------------TGAYDPEGILGSLTSILICYLGLQSGKIIVHYKAARARSVRWLA 662
Query: 346 LS--SCLIGLGL---SLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
C I GL S + + ++K L+SLS+ L +G + L Y+++
Sbjct: 663 WGVLCCAIATGLCGGSKNDGVIPVSKNLWSLSFVLLMSGFGFISLTAFYWLI 714
>gi|242062186|ref|XP_002452382.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
gi|241932213|gb|EES05358.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
Length = 108
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 62/96 (64%), Positives = 77/96 (80%)
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLS 356
APSWCQAPFDPEGLLSSVMA VTCLIGL FGH+I+HF+ HR R+ NW+ILS ++ L
Sbjct: 13 APSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLILSFSMLALAFL 72
Query: 357 LDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+DF GM ++K LY++SYT TAG +G+L AGIY +V
Sbjct: 73 MDFSGMRMSKPLYTMSYTLATAGTAGLLFAGIYALV 108
>gi|348529394|ref|XP_003452198.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Oreochromis niloticus]
Length = 600
Score = 139 bits (349), Expect = 3e-30, Method: Compositional matrix adjust.
Identities = 121/407 (29%), Positives = 191/407 (46%), Gaps = 62/407 (15%)
Query: 10 DANDQLVIQISKS--TSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
DA LV+++ S T N +L ++PP N P +RL SLD FRG+++
Sbjct: 158 DAVRGLVLRLGSSMETERLINSELGPTRVVPPVTDNILPP-PLTSSKRLRSLDTFRGISL 216
Query: 68 ALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVAT 123
+M+ V+ GG H WNGLT+AD V P+F+FI+G S+AL+ +
Sbjct: 217 VIMVFVNYGGGRYWFFRHESWNGLTVADLVFPWFVFIMGTSIALSINALLRAGATRCSLL 276
Query: 124 RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL 183
RKA+ R+L LF++G+ + N + + +R GVLQR+A +YLV A ++ L
Sbjct: 277 RKAVWRSLQLFIIGVLVINP-----NYCQGALAWENLRIPGVLQRLAWSYLVVACLDL-L 330
Query: 184 KGDGHVS--------SKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSS 235
G + S F Y W+ ++L L+L L + L VPD T
Sbjct: 331 VARGQLDVITVDAWWSPAIDFLLYWPAWLCVILLEVLWLFLTFLLPVPD------CPTGY 384
Query: 236 SSPWIFNVTCGVRGSTGPACN----AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
P G G G N A G IDR +LG +H+Y+ P SR +
Sbjct: 385 LGP-------GGIGDMGLYVNCTGGAAGFIDRLLLGEKHMYQNP-SSRVIYAT------- 429
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
+ P+DPEG+L S+ + + +GL G +I+H+++ +++ ++ +
Sbjct: 430 ----------RIPYDPEGVLGSINSILMAFLGLQAGKIILHYRERPKSIMSRFLIWGLSL 479
Query: 352 GL------GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+ S D + +NK L+SLSY A + VLL +Y++V
Sbjct: 480 GVISAFLTKCSTDRGFIPVNKNLWSLSYVTTLACFAFVLLMLVYYIV 526
>gi|390367684|ref|XP_789038.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Strongylocentrotus purpuratus]
Length = 624
Score = 138 bits (348), Expect = 4e-30, Method: Compositional matrix adjust.
Identities = 118/396 (29%), Positives = 183/396 (46%), Gaps = 70/396 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG 78
+S P+N E D SNS Q + + +RL SLD FRG+++ +MI V+ GG
Sbjct: 204 VSSDLGTPSNPVAEAD-------SNSIQRPSRDKPKRLKSLDAFRGMSLVIMIFVNYGGG 256
Query: 79 ILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLF 134
NHS WNGLT+AD V P+F++I+GVS+ +++ V+ R K I R + LF
Sbjct: 257 QYSFFNHSIWNGLTVADLVFPWFIWIMGVSITMSFYALVRHGVSRRVIFTKIIRRFVILF 316
Query: 135 LLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL--KGDGHVSSK 192
LGI L G G+D + R GVLQRIA +YLV A ++ D +
Sbjct: 317 GLGIILDG-----------GIDFSTFRVPGVLQRIAFSYLVVATVHLFAVKHKDEEYRIR 365
Query: 193 LSLFRKYRG------HWVVALVLTTLYLLLLYGLYVPDWQYEF-----PVETSSSSPWIF 241
++R+ R W++ + L++ L + L VP + P+ + S
Sbjct: 366 HVVYRELRDLLDYWYEWIIMISFLALHICLTFFLPVPGCPTGYLGPGGPLVGENESL--- 422
Query: 242 NVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWC 301
V C TG A N ID+ IL H Y + + Q ++
Sbjct: 423 -VNC-----TGGAAN---YIDKVILTYNHTYPRGTPRKIYQTTV---------------- 457
Query: 302 QAPFDPEGLLSSVMATVTCLIGLHFG---HLIVHFKDHRDRMLNWIILSSCLIG--LGLS 356
P DPEG+L ++ + +GL G HL + +D R L W +++ + G G S
Sbjct: 458 --PHDPEGILGTLTSIFMTFLGLQAGKIFHLFSYPRDRILRFLGWCVVTGVIAGALCGFS 515
Query: 357 LDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ + +NK L+S+S+ TA + LLA Y+++
Sbjct: 516 KEDGIIPVNKNLWSVSFILATASMAFFLLAIFYYLI 551
>gi|260816362|ref|XP_002602940.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
gi|229288254|gb|EEN58952.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
Length = 512
Score = 138 bits (347), Expect = 5e-30, Method: Compositional matrix adjust.
Identities = 117/387 (30%), Positives = 188/387 (48%), Gaps = 74/387 (19%)
Query: 38 PPSNSNSKQT------RPQHQQ-RRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNG 90
P S + QT +P Q RRL SLD FRGL++A+M+ V+ GG H+ WNG
Sbjct: 96 PSSTATEDQTVANSSTQPASQGIRRLRSLDTFRGLSLAVMVFVNYGGGGYWFFKHARWNG 155
Query: 91 LTLADFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFH 146
LT+AD V P+F+FI+G S+AL+++ K V+ K I R + LFLLG+F+
Sbjct: 156 LTVADLVFPWFVFIMGTSIALSFRRLLKKGVSRLSLLWKVIQRTVILFLLGLFI------ 209
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVS------SKLSLFRKY- 199
IN K + +R GVLQR+A+ Y + AL E W K G++S S+++ R
Sbjct: 210 -INTKKGHNSWSTLRIPGVLQRLALTYFIVALMESW-KPRGYLSLYLLQTSRIAPIRDIV 267
Query: 200 --RGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNA 257
G W+ +V+ TL+L+L++ L VP+ + S +N T G A
Sbjct: 268 NSWGQWLFMIVVVTLHLVLMFWLQVPNCPIGYLGPGGLSDIAHYNCTGG----------A 317
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
G IDR + H+Y+ P P+ + PF+PEGLL ++ +
Sbjct: 318 AGYIDRAVFTDDHIYQH-----------------PTPITVYE-TEVPFEPEGLLGTLTSA 359
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG------------MHLN 365
+ C +GL ++ ++ ++N+ + ++ + L +G + LN
Sbjct: 360 LLCFLGLQVKNMYMYI------IMNYCVFTNRSPAICFCLGAIGGLLCNGRQNEGWIPLN 413
Query: 366 KALYSLSYTCLTAGASGVLLAGIYFMV 392
K L+SLS+ + + + VLL+ Y +V
Sbjct: 414 KNLWSLSFVLVLSCFAFVLLSVCYIIV 440
>gi|281209034|gb|EFA83209.1| hypothetical protein PPL_03999 [Polysphondylium pallidum PN500]
Length = 1154
Score = 137 bits (346), Expect = 9e-30, Method: Compositional matrix adjust.
Identities = 115/380 (30%), Positives = 175/380 (46%), Gaps = 82/380 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
++ R+ SLDVFRGL++ +MI V+ GG NHS WNGLT+AD V P+F+FI+G ++ +
Sbjct: 222 KKDRMKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFVFIMGCAMPM 281
Query: 112 TYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++ + V + K + R++ LF LG+FL NN G D+ R +GVLQ
Sbjct: 282 SFNALESRGVPKKTIVIKLVRRSITLFALGMFL--------NN---GNDLQHWRILGVLQ 330
Query: 168 RIAIAYLVAAL----CEIW--------------LKGDGHVSSKL-----SLFRK---YRG 201
R I+YLV L +W L G G + ++ +F Y
Sbjct: 331 RFGISYLVTGLIMMFVPVWRYRQLDDLSEEQQPLYGGGSIQDRIRSRYPRMFADILPYWI 390
Query: 202 HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVG-- 259
WVVAL+L +++ L+ + L VP T P G GS G N G
Sbjct: 391 QWVVALMLLSVWFLVTFLLPVPG------CPTGYIGP-------GGIGSQGQYANCTGGA 437
Query: 260 --MIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
+D KI G H+Y+ P T Q N+ Y DPEG L + +
Sbjct: 438 ARYVDLKIFGENHIYQTP----TCQTIYNTGSY---------------DPEGTLGYITSI 478
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-----GLSLDFVGMHLNKALYSLS 372
C +G+ G I+ FK R++ W I L G+ G+S + + +NK L++ S
Sbjct: 479 FMCFLGVQCGRTILAFKKASCRLIRWSIWGVVLCGIAAGLCGMSQNNGWLPINKNLWTPS 538
Query: 373 YTCLTAGASGVLLAGIYFMV 392
+ L +G +L+ +Y +
Sbjct: 539 FVLLLSGFGFFVLSFMYIFI 558
>gi|432845830|ref|XP_004065874.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Oryzias latipes]
Length = 622
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 127/411 (30%), Positives = 194/411 (47%), Gaps = 70/411 (17%)
Query: 10 DANDQLVIQISKSTS----------APANEKLERDPLLPPSNSNSKQTRPQHQQRRLISL 59
DA +LV++I S +P E D +LPP PQ ++RL SL
Sbjct: 180 DAVKRLVVRIGGSMETERLINSELGSPQTEAPLTDNILPP---------PQSPRKRLRSL 230
Query: 60 DVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCK 119
D FRG+ + +M+ V+ GG H WNGLT+AD V P+F+F++G S+AL+ + +
Sbjct: 231 DTFRGIALVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPWFVFVMGTSIALSINSL-LR 289
Query: 120 VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
TR ++LR + + +FL G F N + G+ +R GVLQR+A +YLV A
Sbjct: 290 AGLTRGSLLRKIVWRSIQLFLIGVFIINPNYCQGGLSWENLRIPGVLQRLAFSYLVVASL 349
Query: 180 EIW--------LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
++ L+ D S L + Y WVV L+L L+L L + L VPD
Sbjct: 350 DLMVARGHLDVLQTDAWWSPFLDVLL-YWPAWVVVLLLEVLWLSLTFLLPVPD------C 402
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACN----AVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
T P G G G N A G +DR +LG +H+Y+ P SR +
Sbjct: 403 PTGYLGP-------GGIGDMGLYANCTGGAAGFLDRWLLGEKHIYQTP-SSRVLYLT--- 451
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
Q P+DPEG+L S+ + + +GL G +I+H++D +++ ++
Sbjct: 452 --------------QIPYDPEGVLGSINSVLMAFLGLQAGKIILHYRDLPTSIMSRFLIW 497
Query: 348 SCLIGL------GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+G+ S D + +NK L+SLSY A + VLL +Y+ V
Sbjct: 498 GLFMGVISAVLTRCSTDQGFIPVNKNLWSLSYVTTLACFAFVLLVLVYYTV 548
>gi|167538367|ref|XP_001750848.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770669|gb|EDQ84352.1| predicted protein [Monosiga brevicollis MX1]
Length = 779
Score = 134 bits (338), Expect = 7e-29, Method: Compositional matrix adjust.
Identities = 120/399 (30%), Positives = 178/399 (44%), Gaps = 85/399 (21%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ +RL SLD FRG + +MI V+ GG NHS WNGLT+AD V P+F++I+G S+
Sbjct: 335 KRSSQRLRSLDSFRGFALTIMIFVNFNGGFYWFFNHSAWNGLTVADLVFPWFIWIMGTSM 394
Query: 110 ALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
A+ + + + T K R L LF GIF+ G F ++L+ G R GV
Sbjct: 395 AIAFNSLLKRQTPTTTILYKIFRRMLILFAFGIFIIGNF----HDLRNG------RIPGV 444
Query: 166 LQRIAIAYLVAALC-------EIWLK-------------------GDGH---VSSKLSLF 196
LQR A++YLV AL E W G GH V++ ++
Sbjct: 445 LQRFAVSYLVVALVMLYAPKMESWCASVSTSDSPTPALVRGIAKPGSGHQLDVAADIAEM 504
Query: 197 R--------------KYRGHWVVALVLTTLYLLLLYGLYVPDWQ--YEFPVETSSSSPWI 240
+ Y WV V+ ++ + + L VP Y P +
Sbjct: 505 KPWVRTFLLHTRDLTPYIWEWVAMFVIIIIHTCITFLLPVPGCPTGYIGPGGALAEYGQF 564
Query: 241 FNVTCGVRGSTGPACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAP 298
V G + C A G IDR++ G +H+Y +P T Q P
Sbjct: 565 APPEGEVCGESTFCCEGGASGYIDRQVFGWRHIYDQP----TSQ---------------P 605
Query: 299 SWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW----IILSSCLIGL- 353
+ P+DPEGLL S+ + V C +GL G +IVH+K H R +W ++L GL
Sbjct: 606 IYETGPYDPEGLLGSLTSIVMCFLGLQSGKIIVHYKSHAQRSRHWLMWALVLGVIATGLC 665
Query: 354 GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G S + + ++K L+SLS+ L A + LL Y+++
Sbjct: 666 GASQNNGVIPVSKNLWSLSFIILLASFAFFLLTVFYWVI 704
>gi|312381520|gb|EFR27253.1| hypothetical protein AND_06166 [Anopheles darlingi]
Length = 782
Score = 133 bits (335), Expect = 2e-28, Method: Compositional matrix adjust.
Identities = 113/401 (28%), Positives = 182/401 (45%), Gaps = 71/401 (17%)
Query: 18 QISKSTSAPANEKLERD---------PLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVA 68
Q + ++SA A E+ + D PPS + + R +RL SLD RG+ +
Sbjct: 353 QPTANSSAVAEERTDGDDDKQEGFTASAAPPSVAPANIAR-----KRLQSLDTLRGIAIM 407
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI- 127
LMI V+ GG I H+ WNGL +AD V P+FLFI+GV + ++ + + V+ R+ +
Sbjct: 408 LMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLFIMGVCIPISLRGQLARNVSKRQIVS 467
Query: 128 ---LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+++LFL+G+ L N G ++A +R GVLQR +AY V +L ++
Sbjct: 468 SITTRSISLFLIGLCL---------NSMNGPNMANLRIFGVLQRFGVAYFVVSLVHLFCH 518
Query: 185 GDG------HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSP 238
+ V + + + R R W++ +L +YL ++ + P +
Sbjct: 519 REQIASQHRFVRANVDIIRLVR-QWIIVGLLVVIYLAVILLIPAPGCPRGY---FGPGGK 574
Query: 239 WIFNVTCGVRGSTGPACNA--VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
+FNV P C G IDR +LG+ HLY+ P
Sbjct: 575 HLFNVY--------PNCTGGITGYIDRVLLGMSHLYQHPTARYV---------------- 610
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL-----I 351
+ PFDPEG + + + +GL G I+ F HR R+ + + S L +
Sbjct: 611 ---YDGQPFDPEGPFACLPTILQVFLGLQCGSTILSFTGHRQRLQRFAVWSVALGLVAGV 667
Query: 352 GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G S + + +NK L+SLSY TA + +LL Y+ +
Sbjct: 668 LCGFSKNDGWLPVNKNLWSLSYVLATASLAYLLLLICYYTI 708
>gi|428306334|ref|YP_007143159.1| heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
epipsammum PCC 9333]
gi|428247869|gb|AFZ13649.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
epipsammum PCC 9333]
Length = 371
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 118/362 (32%), Positives = 171/362 (47%), Gaps = 82/362 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+ +A MILV+ G + PA+ H+ WNG T AD V PFFLFI+GV++A
Sbjct: 2 RLTSLDVFRGMAIAGMILVNKAGVADQVYPALAHADWNGWTFADLVFPFFLFIIGVAMAF 61
Query: 112 TYKNF------PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
++ + P K + R + R+ LF+LG+ L G + D + IR MGV
Sbjct: 62 SFAKYTEGDNKPTKQLYLR-ILRRSAILFILGLLLNG---------FWNYDFSTIRVMGV 111
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQRI++AYL+A+L + L+L +K G W +A VL Y L++ + VP +
Sbjct: 112 LQRISVAYLLASL------------AVLTLPKK--GQWALAAVLLIGYWLIMSFVPVPGY 157
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
GV G N IDR I+G HLY+ Y+
Sbjct: 158 G------------------AGVLTREG---NFGAYIDRLIIGAAHLYKGDNYNSLG---- 192
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
DPEGL SS+ A V+ LIG G + + +N +I
Sbjct: 193 --------------------DPEGLFSSLPAVVSVLIGYFTGEWLRKQPERSRTSINMLI 232
Query: 346 LS-SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKP 404
SCL+ +G +F +NK L++ SY TAG + +LLA Y ++ + +P
Sbjct: 233 AGLSCLV-VGEVWNF-WFPINKKLWTSSYVLFTAGIALILLAACYELID-VRKRREWGRP 289
Query: 405 FD 406
F+
Sbjct: 290 FE 291
>gi|443685179|gb|ELT88879.1| hypothetical protein CAPTEDRAFT_26311, partial [Capitella teleta]
Length = 361
Score = 132 bits (332), Expect = 3e-28, Method: Compositional matrix adjust.
Identities = 110/352 (31%), Positives = 159/352 (45%), Gaps = 51/352 (14%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
RL SLD FRG+++ +MI V+ GG HS WNGLTLAD V P+F+FI+G S+AL+++
Sbjct: 1 RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60
Query: 115 NFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ + K IL RA+ LF LG+ I+N K D+ +R GVLQR+A
Sbjct: 61 GALRRGIPRFKLILKVLKRAMILFALGVM--------ISNSKGAFDLRTLRVPGVLQRLA 112
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKY---RGHWVVALVLTTLYLLLLYGLYVPDWQY 227
+ YLV + E L SL R G W L+ ++ L + L VP
Sbjct: 113 LTYLVLGIMEAALAKSHDPHQWWSLVRDVVGNLGQWAAVLMFVAVHCCLTFLLPVPG--- 169
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
P TG A IDR I G +H+Y P C I
Sbjct: 170 ---CPKGYLGPGGLQHGGAYENCTG---GATAYIDRMIFGTEHMYGHP------TCMI-- 215
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI--- 344
P P DPEG+L ++ + C +GL G +I+ F+ + R+ W+
Sbjct: 216 ----------PYQTTVPLDPEGVLGTLTSIFLCFLGLQAGKVILIFQGWKSRVSRWMCWS 265
Query: 345 ----ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++ CL F+ +NK L+SLSY A + +LL+ + V
Sbjct: 266 LVTGLVAGCLCKFSAEDGFI--PINKNLWSLSYVMALASMAFLLLSVCFLAV 315
>gi|195447210|ref|XP_002071113.1| GK25317 [Drosophila willistoni]
gi|194167198|gb|EDW82099.1| GK25317 [Drosophila willistoni]
Length = 537
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 113/376 (30%), Positives = 170/376 (45%), Gaps = 56/376 (14%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTL 93
D ++S Q++RL SLD FRGL + LMI V+ GG +I+H WNGL L
Sbjct: 168 DEAAAAADSIGDAAAKATQRKRLRSLDTFRGLAIVLMIFVNSGGGGYDSIDHVAWNGLHL 227
Query: 94 ADFVMPFFLFIVGVSLALTYKN----FPCKVVATRKAILRALNLFLLGIFLQGGFFHGIN 149
AD V P FL+I+GV + L+ K+ K+ + I R+ LF +G+ L + IN
Sbjct: 228 ADLVFPCFLWIMGVCIPLSIKSQLGRGTSKIQICGRIIWRSFKLFAIGVCL-----NSIN 282
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK------YRGHW 203
G + Q+R MGVLQR +A+LV L H+S + + R + G +
Sbjct: 283 ----GPKLEQLRVMGVLQRFGVAFLVVGLLHTVCSRRDHISPQQAWQRSIYDICIFSGEF 338
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPAC--NAVGMI 261
V L L YL L YGL VP + S+ + P C A G I
Sbjct: 339 AVLLALIATYLGLTYGLKVPGCPRGYLGPGGKSN-----------NAANPHCIGGAAGYI 387
Query: 262 DRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCL 321
D+++LG H+Y+ P + FDPEGL +++ V L
Sbjct: 388 DQQVLGNAHIYQYPTAKYVYDAT-------------------AFDPEGLFGCLLSVVHVL 428
Query: 322 IGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMH-----LNKALYSLSYTCL 376
+G G ++ + RM W IL+ L +G +L +NK L+SLS+ +
Sbjct: 429 LGAFAGVTLLVHPTWQSRMKRWTILALLLGLMGGALCGFSKENGLIPVNKNLWSLSFVFV 488
Query: 377 TAGASGVLLAGIYFMV 392
T + +LL +Y+++
Sbjct: 489 TVTVALLLLDFLYYII 504
>gi|332028000|gb|EGI68051.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
echinatior]
Length = 557
Score = 132 bits (331), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 164/367 (44%), Gaps = 55/367 (14%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFL 102
K + +RR+ ++D FRG + MI V+D G + H+ W+G+ L D V P F+
Sbjct: 155 KDKSNNKKLAKRRVKAIDTFRGASTLFMIFVNDGSGSYTVLEHTIWDGMLLGDIVFPCFM 214
Query: 103 FIVGV----SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
+I+GV +L+ K K+ + + R+ LFL+G+ +N L +
Sbjct: 215 WIMGVCIPIALSAQLKRGVSKLQISYSILKRSFLLFLIGV--------SLNTLGTDSQVE 266
Query: 159 QIRWMGVLQRIAIAYLVAALCEIWLKGDG----HVSSKLSLFRKYRG------HWVVALV 208
IR GVLQR + YLV +L + +S + RK + HW V L+
Sbjct: 267 NIRIFGVLQRFGVTYLVVSLVYLCFPSQQSKILRNTSPTWIMRKMQDILSLLPHWFVMLI 326
Query: 209 LTTLYLLLLYGLYVPDW--QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
++ L + L VP Y P + FN T G A G ID+ +L
Sbjct: 327 FVIVHCALTFCLPVPGCPTGYLGPGGMHEDGKY-FNCTGG----------ATGYIDKTVL 375
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
+ H+Y+ P YG PFDPEG+L + A +G+H
Sbjct: 376 TLNHIYQYPTIKSV---------YG----------SGPFDPEGILGCLTAIFQVFLGVHA 416
Query: 327 GHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASGVLL 385
G +++ +KD +DR++ W++ + LG + F + +NK L+SLS+ +T S L
Sbjct: 417 GTILMLYKDWKDRVMRWLLWAVFYACLGCAFHFTNTIPVNKNLWSLSFVFVTTSFSLAFL 476
Query: 386 AGIYFMV 392
+G Y ++
Sbjct: 477 SGCYLLI 483
>gi|356536971|ref|XP_003537005.1| PREDICTED: uncharacterized protein LOC100781855 [Glycine max]
Length = 357
Score = 131 bits (330), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 72/175 (41%), Positives = 96/175 (54%), Gaps = 16/175 (9%)
Query: 8 NNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV 67
N+ +L +I ++ + + ++D + + +Q Q Q
Sbjct: 16 NDYHKGELKHEIERTNGNGDSIEHDKDARITQEGESVQQIVEQEQP-------------- 61
Query: 68 ALMILVDDV--GGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRK 125
LM+L DD GG P I+HSPWNG TLADFVMPFFLF+VGV++AL K P A +
Sbjct: 62 LLMVLEDDADAGGAYPRIDHSPWNGCTLADFVMPFFLFVVGVAIALALKRIPKVKYAVKN 121
Query: 126 AILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
ILR L L GI LQGG+ H ++L YGVD+ IRW G+LQRIA+ Y AL E
Sbjct: 122 IILRTLKLLFWGILLQGGYSHAPDDLSYGVDMRFIRWCGILQRIALVYCAVALIE 176
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 61/93 (65%), Gaps = 5/93 (5%)
Query: 314 VMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV-GMHLNKALYSLS 372
+ A+++ IG+H+GH+++HFK H +R+ W+++ L+ LGL L F + +NK LYS S
Sbjct: 183 ISASLSGTIGIHYGHVLIHFKGHSERLKQWLLMGFLLLTLGLMLHFTEAIPINKQLYSFS 242
Query: 373 YTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
Y C TAGA+G++ + Y ++ I ++K PF
Sbjct: 243 YVCFTAGAAGIVFSVFYLLLNGI----LIKYPF 271
>gi|66826507|ref|XP_646608.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
gi|60474509|gb|EAL72446.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
Length = 426
Score = 131 bits (329), Expect = 6e-28, Method: Compositional matrix adjust.
Identities = 115/381 (30%), Positives = 168/381 (44%), Gaps = 83/381 (21%)
Query: 32 ERDPLLPPSNS----NSKQTRPQ------HQQRRLISLDVFRGLTVALMILVDDVGG--I 79
E PL+ S+S + K T+ + QRR+ SLD RGLT+ MILVD+ G +
Sbjct: 12 ESKPLIEHSSSTHINHGKDTKTKIIEYLKPVQRRMGSLDAVRGLTIFGMILVDNQAGNDV 71
Query: 80 LPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIF 139
+ +N + WNGL+ AD + P F+FI G S+AL KN I R L LF + F
Sbjct: 72 IWPLNETEWNGLSTADLIFPSFIFISGFSIALALKNSKNTTSTWYGIIRRTLLLFFIQCF 131
Query: 140 LQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKY 199
L N + + R MGVLQRIAI Y + L + + L R +
Sbjct: 132 L--------NLMGDHFNFTTFRIMGVLQRIAICYFFSCL--------SFLCFPIFLQRLF 175
Query: 200 RGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVG 259
L +T Y+ ++Y L VP CG R + CNA
Sbjct: 176 ------LLSVTVTYISIMYALNVPK--------------------CG-RANLTQNCNAGA 208
Query: 260 MIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVT 319
ID K+ G+ + K+ ++N P Y DPEGL+S++ + +T
Sbjct: 209 YIDSKVFGLNIM---------KESNLNGPYYN--------------DPEGLISTMSSFIT 245
Query: 320 CLIGLHFGHLIVHF-KDH----RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYT 374
+GL FG + F K H D ++ WI+L + +SL M NK ++S S+
Sbjct: 246 AWMGLEFGRIFTRFYKKHDFGNTDIIVRWILLVILFMVPAISLGATVMPFNKKIWSFSFA 305
Query: 375 CLTAGASGVLLAGIYFMVRYI 395
T GASG L+ + ++ I
Sbjct: 306 LFTVGASGSLILIAFILIDVI 326
>gi|194888520|ref|XP_001976930.1| GG18736 [Drosophila erecta]
gi|190648579|gb|EDV45857.1| GG18736 [Drosophila erecta]
Length = 576
Score = 130 bits (328), Expect = 8e-28, Method: Compositional matrix adjust.
Identities = 113/386 (29%), Positives = 176/386 (45%), Gaps = 76/386 (19%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTL 93
D ++S + Q++RL SLD FRGL++ LMI V+ GG I H+ WNGL L
Sbjct: 164 DEATAAADSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHL 223
Query: 94 ADFVMPFFLFIVGVSLALTYKN----FPCKVVATRKAILRALNLFLLGIFLQGGFFHGIN 149
AD V P FL+I+GV + L+ K+ K + + R++ LF++G+ L
Sbjct: 224 ADVVFPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILWRSIKLFVIGLCL--------- 274
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK------YRGHW 203
N G ++ Q+R+MGVLQR +AYLV + +S + S R + G
Sbjct: 275 NSMSGPNLEQLRFMGVLQRFGVAYLVVGVLHTLCCRREPISPQRSWQRAVHDVCLFSGEL 334
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPA--------- 254
V L L YL L +GL VP P RG GP
Sbjct: 335 AVLLALVATYLGLTFGLRVP------------GCP---------RGYLGPGGKHDYNAHP 373
Query: 255 -C--NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLL 311
C A G D ++LG H+Y+ P T + +S FDPEG+
Sbjct: 374 HCIGGAAGYADLQVLGNAHIYQHP----TAKYVYDS---------------TAFDPEGVF 414
Query: 312 SSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-----GLSLDFVGMHLNK 366
+++ V L+G G ++ + + RM W++L+ L + G S + + +NK
Sbjct: 415 GCILSVVQALLGAFAGVTLLVHPNWQSRMRRWMLLAILLGLIGGALCGFSREGGAIPVNK 474
Query: 367 ALYSLSYTCLTAGASGVLLAGIYFMV 392
L+SLS+ C+T + V+L+ +Y+++
Sbjct: 475 NLWSLSFVCVTVSLALVILSLLYYII 500
>gi|195476975|ref|XP_002100049.1| GE16376 [Drosophila yakuba]
gi|194187573|gb|EDX01157.1| GE16376 [Drosophila yakuba]
Length = 576
Score = 130 bits (327), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 114/387 (29%), Positives = 175/387 (45%), Gaps = 78/387 (20%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTL 93
D +NS + Q++RL SLD FRGL++ LMI V+ GG I H+ WNGL L
Sbjct: 164 DEATAAANSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHL 223
Query: 94 ADFVMPFFLFIVGVSLALTYKN----FPCKVVATRKAILRALNLFLLGIFLQGGFFHGIN 149
AD V P FL+I+GV + L+ K+ K + + R++ LF++G+ L
Sbjct: 224 ADVVFPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILWRSIKLFVIGLCL--------- 274
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK------YRGHW 203
N G ++ Q+R+MGVLQR +AYLV + +S + R + G
Sbjct: 275 NSMSGPNLEQLRFMGVLQRFGVAYLVVGVLHTLCCRREPISPQRLWQRAVHDVCLFSGEL 334
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPA--------- 254
V L L YL L YGL VP P RG GP
Sbjct: 335 AVLLALVATYLGLTYGLRVP------------GCP---------RGYLGPGGKHDYNAHP 373
Query: 255 -C--NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLL 311
C A G +D ++LG H+Y+ P T + +S FDPEG+
Sbjct: 374 NCIGGAAGYVDLQVLGNAHIYQHP----TAKYVYDS---------------TAFDPEGIF 414
Query: 312 SSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL------GLSLDFVGMHLN 365
+++ V L+G G ++ + + R+ W L + L+GL G S + + +N
Sbjct: 415 GCILSVVQVLLGAFAGVTLLVHPNWQSRIRRWTFL-AILLGLIGGALCGFSREGGAIPVN 473
Query: 366 KALYSLSYTCLTAGASGVLLAGIYFMV 392
K L+SLS+ C+T + ++L+ +Y+ +
Sbjct: 474 KNLWSLSFVCVTVSLALLILSLLYYFI 500
>gi|321474731|gb|EFX85695.1| hypothetical protein DAPPUDRAFT_309035 [Daphnia pulex]
Length = 588
Score = 130 bits (326), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 116/394 (29%), Positives = 178/394 (45%), Gaps = 60/394 (15%)
Query: 20 SKSTSAPANEKLE--RDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVG 77
K TS NE P++P + + + P+ + RL SLD FRG+T+ LMI V+D
Sbjct: 160 QKETSVEGNENPGGLAAPIVP---AVADEITPKKKSSRLKSLDTFRGITIVLMIFVNDGA 216
Query: 78 GILPAINHSPWNGLTLADFVMPFFLFIVGV----SLALTYKNFPCKVVATRKAILRALNL 133
G H+ WNGL LAD + P+F++I+GV SL + + K+ + R+ L
Sbjct: 217 GQYFIFQHATWNGLQLADVIFPWFMWIMGVCMPISLRSSLRRKESKLTIFAGILRRSCLL 276
Query: 134 FLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK----GDGHV 189
F LGI N+L VD+ ++R GVLQR AI YL + L H
Sbjct: 277 FFLGIM--------NNSLGGPVDLGRLRVPGVLQRFAITYLAVGTAGLLLTPADLSAPHP 328
Query: 190 SSKL-SLFRK---YRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTC 245
SSK LF+ W++ L+L + + + L PVE ++
Sbjct: 329 SSKARKLFQDIVVLWPQWILFLLLVAAHCFITFFL---------PVEEGCPVGYLGPAGL 379
Query: 246 GVRGSTGPAC--NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQA 303
+ + C A G IDR +L +QH++ KP YG
Sbjct: 380 HLDNAYPGHCIGGAAGYIDRLMLSVQHIFNKPTTIGV---------YG----------SG 420
Query: 304 PFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS--SCLIGLGLSLDFVG 361
P+DPEG+L S++ T +G G ++ F + R++ W+ S + LIG L L
Sbjct: 421 PYDPEGILGSMLCTFQVFLGAQAGMTLLIFSGWKSRLIRWLAWSVLTGLIGALLCLASQN 480
Query: 362 ---MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ +NK L+SLS+ +T G + LL Y+++
Sbjct: 481 DGWIPVNKNLWSLSFVLVTTGLAFFLLGACYWLI 514
>gi|328869407|gb|EGG17785.1| transmembrane protein [Dictyostelium fasciculatum]
Length = 651
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 110/401 (27%), Positives = 174/401 (43%), Gaps = 101/401 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
Q ++ RL SLDVFRGL++ +MI V+ GG NHS WNGLT+AD V P+F+FI+G+++
Sbjct: 221 QPKKDRLKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFIFIMGIAM 280
Query: 110 ALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
L++ + V R K + R++ LF LG+F+ G NNL R +GV
Sbjct: 281 PLSFNAMEIRGVPKRTIFIKLVRRSVILFSLGLFINNG-----NNL------GHWRILGV 329
Query: 166 LQRIAIAYLVAALCEIWL------------------------------------------ 183
LQR ++Y V +++
Sbjct: 330 LQRFGVSYFVTGCIMMFVPLYRPNGGGGGNSHHQYNRFDGTGNDREREPSESDPLFQSSS 389
Query: 184 ---KGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
K H +S L+ F + W+ AL++ ++ L+ + L VP T P
Sbjct: 390 IQEKFKAHSASMLADFIPFWLQWLFALLILAVWFLVTFLLPVPG------CPTGYLGP-- 441
Query: 241 FNVTCGVRGSTGPACNAVG----MIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G G G N G ++D I H+++ P C
Sbjct: 442 -----GGLGDQGQHVNCTGGAAKIVDLHIFSNNHIFQTP------TCQ------------ 478
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL--- 353
P + +DPEG L + + C +G+H G I+ +K +R R++ W ILS L G+
Sbjct: 479 -PIYNTGAYDPEGTLGYLTSVFMCFLGVHAGRTIMTYKSNRSRLIRWTILSILLCGIAAG 537
Query: 354 --GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+S + + +NK L++ S+ L +G +LA Y +V
Sbjct: 538 LCGVSQNGGWIPINKNLWTPSFIFLLSGFGFFVLAIFYVVV 578
>gi|195340719|ref|XP_002036960.1| GM12376 [Drosophila sechellia]
gi|194131076|gb|EDW53119.1| GM12376 [Drosophila sechellia]
Length = 576
Score = 129 bits (324), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 111/383 (28%), Positives = 174/383 (45%), Gaps = 70/383 (18%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTL 93
D ++S + Q++RL SLD FRGL++ LMI V+ GG I H+ WNGL L
Sbjct: 164 DEATAAADSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHL 223
Query: 94 ADFVMPFFLFIVGVSLALTYKN----FPCKVVATRKAILRALNLFLLGIFLQGGFFHGIN 149
AD V P FL+I+GV + L+ K+ K + ++R++ LF++G+ L
Sbjct: 224 ADIVFPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILVRSIKLFVIGLCL--------- 274
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK------YRGHW 203
N G ++ Q+R MGVLQR +AYLV + +S + S R + G
Sbjct: 275 NSMSGPNLEQLRVMGVLQRFGVAYLVVGVLHTLCCRREPISPQRSWQRAVHDVCLFSGEL 334
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG-------PAC- 255
V L L YL L +GL VP G G G P C
Sbjct: 335 AVLLALVATYLGLTFGLRVP------------------GCPRGYLGPGGKHDYNAHPKCI 376
Query: 256 -NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSV 314
A G D ++LG H+Y+ P T + +S FDPEG+ +
Sbjct: 377 GGAAGYADLQVLGNAHIYQHP----TAKYVYDS---------------TAFDPEGIFGCI 417
Query: 315 MATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-----GLSLDFVGMHLNKALY 369
++ V L+G G ++ +++ R+ W +L+ L + G S + + +NK L+
Sbjct: 418 LSVVQVLLGAFAGVTLLVHPNYQSRIRRWTLLAILLGLIGGALCGFSSEGGAIPVNKNLW 477
Query: 370 SLSYTCLTAGASGVLLAGIYFMV 392
SLS+ C+T + V+L+ +Y+ +
Sbjct: 478 SLSFVCVTVSLALVILSLMYYFI 500
>gi|443731781|gb|ELU16770.1| hypothetical protein CAPTEDRAFT_135912, partial [Capitella teleta]
Length = 388
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 108/352 (30%), Positives = 158/352 (44%), Gaps = 50/352 (14%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
RL SLD FRG+++ +MI V+ GG HS WNGLTLAD V P+F+FI+G S+AL+++
Sbjct: 1 RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60
Query: 115 NFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ + K IL RA+ LF LG+ + N+ VD+ +R GVLQR+A
Sbjct: 61 GALRRGIPRFKLILKVLKRAMILFALGVMIS-------NSKGKAVDLRTLRVPGVLQRLA 113
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKY---RGHWVVALVLTTLYLLLLYGLYVPDWQY 227
+ YLV + E L S R G W L+ ++ L + L VP
Sbjct: 114 LTYLVLGIMEAALAKSHDPHQWWSSVRDVVGNLGQWAAVLMFVAVHCCLTFLLPVPG--- 170
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
P TG A IDR I G +H+Y P C I
Sbjct: 171 ---CPKGYLGPGGLQHGGAYENCTG---GATAYIDRMIFGTEHMYGHP------TCMI-- 216
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI--- 344
P P DPEG+L ++ + C +GL G +I+ F+ + R+ W+
Sbjct: 217 ----------PYQTTVPLDPEGVLGTLTSIFLCFLGLQAGKVILIFQGWKSRVSRWMCWS 266
Query: 345 ----ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++ CL F+ +NK L+SLSY A + +LL+ + V
Sbjct: 267 LVTGLVAGCLCKFSAEDGFI--PINKNLWSLSYVMALASMAFLLLSVCFLAV 316
>gi|328870644|gb|EGG19017.1| hypothetical protein DFA_02260 [Dictyostelium fasciculatum]
Length = 759
Score = 129 bits (323), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 110/405 (27%), Positives = 171/405 (42%), Gaps = 97/405 (23%)
Query: 27 ANEKLERDPLL--------PPSNSNSKQTRPQH--------QQRRLISLDVFRGLTVALM 70
++ E+ PLL P +N +S T + Q++R++SLD RGLT+ M
Sbjct: 7 SHNHTEKSPLLNEQQHVSLPINNDDSTATITKTPSATPTTTQRKRVLSLDTVRGLTIFGM 66
Query: 71 ILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
ILVD+ GG ++ + + WNGL+ AD + P FLFI G S+AL K+ + I
Sbjct: 67 ILVDNQGGPQVIWPLLETEWNGLSTADLIFPSFLFICGFSVALALKSAKNDIKTWYNIIR 126
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
R L LF + FL N + + R MGVLQRI+I Y A C
Sbjct: 127 RTLLLFFIQAFL--------NLMAHKFVFDSFRVMGVLQRISICYF-ACCCSF------- 170
Query: 189 VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVR 248
L G + + +YL ++YGL VP CG R
Sbjct: 171 ------LLLPLVGQRIFLVACAAIYLSVMYGLDVPG--------------------CG-R 203
Query: 249 GSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPE 308
G P+CNA ID +LG ++ P DPE
Sbjct: 204 GVLTPSCNAGSYIDNSVLGANMIH-------------------------------PNDPE 232
Query: 309 GLLSSVMATVTCLIGLHFGHLIVHFKD-----HRDRMLNWIILSSCLIGLGLSLDFVGMH 363
GLLS+ A +T +GL G + F H + ++ WI ++ G++L M
Sbjct: 233 GLLSTFSAFITTWMGLELGRIFTRFYRKHDYAHLNILIRWIGIAVVFGVTGIALGVTKMP 292
Query: 364 LNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPFDYS 408
+NK ++S S+ +T +L++ Y+++ + +K+ ++S
Sbjct: 293 VNKLIWSFSFALITVACGSLLISVAYYLLDVVEWSPTVKRHIEFS 337
>gi|432907420|ref|XP_004077635.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Oryzias latipes]
Length = 482
Score = 128 bits (322), Expect = 4e-27, Method: Compositional matrix adjust.
Identities = 109/378 (28%), Positives = 175/378 (46%), Gaps = 75/378 (19%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFF 101
++S+ TRP RL+SLD FRG + +M+ V+ GG H+PWNGLT+AD VMP+F
Sbjct: 79 ADSRGTRPA----RLLSLDTFRGFALTVMVFVNYGGGGYWFFEHAPWNGLTVADLVMPWF 134
Query: 102 LFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
+FI+G S+ L + + + V R K R + L LLG F + +
Sbjct: 135 VFIMGTSVVLAFSSMQRRGVGRRQLLGKITWRTVVLMLLGF-----CFLNYSPRDGPLSW 189
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWL------KGDGHVSSKLSLFRKYRGHWVVALVLTT 211
+ +R GVLQR+A Y V +L + + + + H + + + W++ +L T
Sbjct: 190 SWLRIPGVLQRLAFTYFVLSLLQTFWGRKAIPESENHWWNPVQDVVLFWPQWLLIFLLET 249
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG-------PACN--AVGMID 262
L+L + + + VP N G G+ G P C A G ID
Sbjct: 250 LWLCITFLMPVP------------------NCPTGYLGAGGIGDHGLYPNCTGGAAGSID 291
Query: 263 RKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLI 322
R + G ++YR P + + + P+DPEG+L ++ + V +
Sbjct: 292 RWMFG-DNMYRYPTCKKLYR------------------TEQPYDPEGVLGTINSIVMGFL 332
Query: 323 GLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG--------MHLNKALYSLSYT 374
G+ G +IV +K +L W L+ +I LG+S + + +NK L+SLSY
Sbjct: 333 GMQAGKIIVFYKRKSGHIL-WRYLTWAVI-LGISAAILSKCTRDGGFIPVNKNLWSLSYV 390
Query: 375 CLTAGASGVLLAGIYFMV 392
T S +LL G+YF++
Sbjct: 391 TCTGALSFLLLGGMYFLI 408
>gi|194762450|ref|XP_001963347.1| GF20351 [Drosophila ananassae]
gi|190629006|gb|EDV44423.1| GF20351 [Drosophila ananassae]
Length = 576
Score = 128 bits (322), Expect = 5e-27, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 171/358 (47%), Gaps = 56/358 (15%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
Q++RL SLD FRGL++ LMI V+ GG I+H+ WNGL LAD V P FL+I+GV + L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYTWIDHAAWNGLHLADLVFPSFLWIMGVCIPL 241
Query: 112 TYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ K + + + L R++ LF +G+ L N G + ++R MGVLQ
Sbjct: 242 SVKAQLSRGASKGRICLRILWRSIKLFAIGLCL---------NSMSGPGLEELRIMGVLQ 292
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRK------YRGHWVVALVLTTLYLLLLYGLY 221
R +A+LV + +S + S R + G V L L YL L +GL
Sbjct: 293 RFGVAFLVVGVLHTLCSRRDPISPQRSWQRAVHDICLFSGELAVLLALVATYLGLTFGLR 352
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPAC--NAVGMIDRKILGIQHLYRKPIYSR 279
VP + F+ ++ P C A G +D K+LG H+Y+ P
Sbjct: 353 VPGCPKGYLGPGGK-----FDY------ASNPNCIGGAAGYVDLKVLGNAHIYQHP---- 397
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
T + +S A FDPEG+ +++ V L+G G ++ + R
Sbjct: 398 TAKYVYDS---------------AAFDPEGIFGCILSVVQVLLGAFAGVTLLVHPTWQSR 442
Query: 340 MLNWIILSSC--LIG---LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ W+IL+ LIG G S + + +NK L+SLS+ C+T + ++L+ +Y+ +
Sbjct: 443 IRRWLILAVVLGLIGGALCGFSREGGAIPVNKNLWSLSFVCVTVSLALIILSLLYYFI 500
>gi|301608954|ref|XP_002934053.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Xenopus (Silurana) tropicalis]
Length = 633
Score = 128 bits (321), Expect = 6e-27, Method: Compositional matrix adjust.
Identities = 118/411 (28%), Positives = 183/411 (44%), Gaps = 71/411 (17%)
Query: 3 KYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVF 62
K NPI D ++ +P + + S+ P Q RL SLD F
Sbjct: 199 KMNPIETDR------LVNSELGSPNRADISSQ------ETYSRAWNPSVQ--RLRSLDTF 244
Query: 63 RGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVA 122
RGL + +M+ V+ GG H WNGLT+AD V P+F+FI+G S+ L+ + K +
Sbjct: 245 RGLALTIMVFVNYGGGGYWFFKHQSWNGLTVADLVFPWFVFIMGTSIYLSLNSMLSKGSS 304
Query: 123 TR----KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
K + R++ LFL+G+F+ +N + + ++IR MGVLQR+++ YL +
Sbjct: 305 RWNLLGKVLWRSVQLFLIGLFVI-----NVNYCRGPLSFSEIRIMGVLQRLSLTYLAVSA 359
Query: 179 CEIWLKG---DGHVSSKLS-LFRKYRGH---WVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
E+ D S+ L + H W+V L L ++L L L VPD +
Sbjct: 360 LELIFSKPTPDALTQSRTCFLLQDVLSHWPKWIVILALEAVWLCLTLLLQVPDCPLGY-- 417
Query: 232 ETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
+ G G G P C A G IDR ILG H+Y+ P + + ++
Sbjct: 418 -----------LGPGGIGDFGKFPNCTGGAAGYIDRMILGQGHIYQHPTSNVIYKSTM-- 464
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD----RMLNW 343
P+DPEGLL ++ V GL G ++V +K+ R +W
Sbjct: 465 ----------------PYDPEGLLGTINCVVMAFFGLQAGIILVLYKNQHKYVLVRFFSW 508
Query: 344 IILSSCLIGL--GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
I+ L + S + + +NK L+S+SY + + LL IYF+V
Sbjct: 509 AIIMGVLSAVLTKCSTNEGIIPVNKNLWSISYITTLSCFAYFLLMLIYFLV 559
>gi|158294726|ref|XP_315774.3| AGAP005761-PA [Anopheles gambiae str. PEST]
gi|157015699|gb|EAA10745.3| AGAP005761-PA [Anopheles gambiae str. PEST]
Length = 581
Score = 127 bits (320), Expect = 8e-27, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 169/369 (45%), Gaps = 72/369 (19%)
Query: 22 STSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILP 81
T+ +NE+ +R P + S Q P+ +RL SLD FRG+ + LMI V+ GG
Sbjct: 175 ETAHSSNEETDRKP--HQTGPGSGQVTPK---KRLQSLDTFRGIAIMLMIFVNSGGGHYW 229
Query: 82 AINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQ 141
I H+ WNGL +AD V P+FLFI+GV + ++ + R R++ LF++G+ L
Sbjct: 230 WIEHATWNGLHVADLVFPWFLFIMGVCVPISLRG-----QLNRN---RSVKLFIIGLCL- 280
Query: 142 GGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAA----LCEIWLKGDGHVSSKLSLFR 197
N G +A +R GVLQR IAYLV + LC + V S+ L R
Sbjct: 281 --------NSMNGPSMANLRIFGVLQRFGIAYLVVSTVHLLCH---EQQVQVQSQNRLLR 329
Query: 198 ------KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGST 251
+ + W+V +LT LYL++++ + P + ++N
Sbjct: 330 ASEDIVRLKKQWLVIGLLTVLYLVVMFFVPAPGCPSAY---FGPGGKHLYNAF------- 379
Query: 252 GPACNA--VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEG 309
P C G IDR +LGI HLY+ P Y M PFDPEG
Sbjct: 380 -PNCTGGITGYIDRALLGIAHLYQHPTARYV---------YDGM----------PFDPEG 419
Query: 310 LLSSVMATVTCLIGLHFGHLIVHFKDHRDRML---NWIILSSCLIG--LGLSLDFVGMHL 364
+ + +GL G I+ + +HR RM+ +W ++ G G + + + +
Sbjct: 420 PFGCLPTILQVFLGLQCGCTILAYTEHRQRMVRFASWSLVLGLAAGALCGFTKNDGWIPI 479
Query: 365 NKALYSLSY 373
NK L+SLSY
Sbjct: 480 NKNLWSLSY 488
>gi|434403337|ref|YP_007146222.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
gi|428257592|gb|AFZ23542.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
Length = 375
Score = 127 bits (320), Expect = 9e-27, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 168/364 (46%), Gaps = 82/364 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MILV+ G I P ++H+ WNG T D V PFFLFIVGV++
Sbjct: 2 RLTSLDVFRGMTIAAMILVNMAGVADEIYPLLDHAKWNGCTPTDLVFPFFLFIVGVAMTF 61
Query: 112 -----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
T N P K V R + RA LF LG+ L G + G+ + D++ IR+MGVL
Sbjct: 62 SLSKYTAANKPTKAVYLR-ILRRAAILFALGLLLNGFWNKGV----WTFDLSNIRFMGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI++ YL+A+L + L L RK G W++A+VL Y L + + VPD+
Sbjct: 117 QRISLTYLLASL------------AVLQLPRK--GQWILAVVLLVGYWLTMMYVPVPDYG 162
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
GV G N IDR I+ HLY+ ++
Sbjct: 163 ------------------AGVLTREG---NFGAFIDRLIIPKAHLYKGDGFNLLG----- 196
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH--FKDHRDRMLNWI 344
DPEGL S++ A V L G G I K L +
Sbjct: 197 -------------------DPEGLFSTIPAVVNVLAGYFAGEWIRSQPVKSRTSLGLVLV 237
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM--VRYISSHLMLK 402
++ +IG G F +NK +++ SY T+G + +LLA Y + VR I
Sbjct: 238 GVACLVIGWGWGWIF---PINKKIWTSSYVVFTSGWALLLLAACYELIEVRLIKRW---S 291
Query: 403 KPFD 406
KPF+
Sbjct: 292 KPFE 295
>gi|24639786|ref|NP_572198.1| CG6903 [Drosophila melanogaster]
gi|7290544|gb|AAF45996.1| CG6903 [Drosophila melanogaster]
gi|21483396|gb|AAM52673.1| LD22376p [Drosophila melanogaster]
Length = 576
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 110/383 (28%), Positives = 173/383 (45%), Gaps = 70/383 (18%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTL 93
D ++S + Q++RL SLD FRG+++ LMI V+ GG I H+ WNGL L
Sbjct: 164 DEATAAADSIGEAATKATQRKRLRSLDTFRGISIVLMIFVNSGGGGYAWIEHAAWNGLHL 223
Query: 94 ADFVMPFFLFIVGVSLALTYKN----FPCKVVATRKAILRALNLFLLGIFLQGGFFHGIN 149
AD V P FL+I+GV + L+ K+ K + + R++ LF++G+ L
Sbjct: 224 ADVVFPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILWRSIKLFVIGLCL--------- 274
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK------YRGHW 203
N G ++ Q+R MGVLQR +AYLV A+ +S + S R + G
Sbjct: 275 NSMSGPNLEQLRIMGVLQRFGVAYLVVAILHTLCCRREPISPQRSWQRAVHDVCLFSGEL 334
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG-------PAC- 255
V L L YL L +GL VP G G G P C
Sbjct: 335 AVLLALVATYLGLTFGLRVP------------------GCPRGYLGPGGKHDYNAHPKCI 376
Query: 256 -NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSV 314
A G D ++LG H+Y+ P T + +S FDPEG+ +
Sbjct: 377 GGAAGYADLQVLGNAHIYQHP----TAKYVYDS---------------TAFDPEGIFGCI 417
Query: 315 MATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-----GLSLDFVGMHLNKALY 369
++ V L+G G ++ + + R+ W +L+ L + G S + + +NK L+
Sbjct: 418 LSVVQVLLGAFAGVTLLVHPNFQSRIRRWTLLAILLGLIGGALCGFSREGGAIPMNKNLW 477
Query: 370 SLSYTCLTAGASGVLLAGIYFMV 392
SLS+ C+T + ++L+ +Y+ +
Sbjct: 478 SLSFVCVTVSLALLILSLMYYFI 500
>gi|345481194|ref|XP_001603332.2| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
N-acetyltransferase-like [Nasonia vitripennis]
Length = 570
Score = 125 bits (315), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 105/398 (26%), Positives = 188/398 (47%), Gaps = 54/398 (13%)
Query: 12 NDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMI 71
+L+I + ++ A E L S S + RL +LD FRG+ V LMI
Sbjct: 140 TSKLIINVVRTYLAKHAESHNDIDRLQESESTPEMVAVSKTAMRLQALDAFRGIAVLLMI 199
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK-NFPCKVVATRKAIL-- 128
V++ GG +NH+ WNGLT+AD V+P+F + +G ++ + + + V TR I+
Sbjct: 200 FVNNGGGEYVFLNHAAWNGLTVADLVLPWFAWAMGFTIVNSVRVHLRVSVSRTRLIIMQL 259
Query: 129 -RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGD- 186
R + L L G+F+ N ++ ++++R+ GVLQ +A+AY + ++ E L
Sbjct: 260 RRTVLLILFGLFI---------NSQHNSTLSELRFPGVLQLLAVAYFICSVIETCLASPQ 310
Query: 187 -----GHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF 241
G + ++ W+V LV+ ++ + + L+VP + + +
Sbjct: 311 RTFQFGRFVFLQDILERWT-QWMVVLVIILVHTCITFFLHVPGCPRGY-LGPGGYHHYGL 368
Query: 242 NVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWC 301
NV C TG A G IDR I G QH+Y+K + +P YGP
Sbjct: 369 NVNC-----TG---GAAGYIDRLIFG-QHMYQKTM----------NPVYGP--------- 400
Query: 302 QAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN---WIILSSCLIGL--GLS 356
P DPEGL++++ A + +G+ G + V + R++ W + + L G+ S
Sbjct: 401 TLPHDPEGLMNTISAVLIVFMGVQAGRIFVTYYQANSRIIRWFAWFVATGLLAGILCNFS 460
Query: 357 LDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+ + +NK + SLS+ T+ + +L + +Y+++ +
Sbjct: 461 QEKGWVPVNKNMMSLSFVLCTSSFAFLLFSILYYLIDH 498
>gi|260061394|ref|YP_003194474.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
HTCC2501]
gi|88785526|gb|EAR16695.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
HTCC2501]
Length = 382
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 165/364 (45%), Gaps = 89/364 (24%)
Query: 40 SNSNSKQTRPQHQ----QRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLT 92
+N ++T P+ Q + R++S+D+FRGLT+ALMILV+ G + H+ W+G T
Sbjct: 5 NNDQRERTNPEKQTKAMKERIVSVDIFRGLTIALMILVNTPGTWEAVYAPFRHAEWHGYT 64
Query: 93 LADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLK 152
D V PFFLFIVG S+ Y+N RK I+R L L LLGIFL G F
Sbjct: 65 PTDLVFPFFLFIVGTSIVFAYRNKQPDAATHRKIIVRTLKLILLGIFL-GAF---TVEPP 120
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
+ ++IR+ GVLQRI + + AAL + H + K L +
Sbjct: 121 FFEPFSEIRFPGVLQRIGVVFFAAALLFL------HTNYK---------------TLLAI 159
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPAC-----NAVGMIDRKILG 267
+++L G +V W P+ G P+ N ID + G
Sbjct: 160 TVVILLGYWV--WMAFIPL-----------------GGEPPSLERVPNNWANYIDVHVFG 200
Query: 268 IQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG 327
H Y+ PDY DPEGLLS++ A + L+G+ G
Sbjct: 201 -SHTYK--------------PDY---------------DPEGLLSTLPAIASALLGIFTG 230
Query: 328 HLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAG 387
++V R W++L+ + S+ + +NKAL++ S+ +TAG + +LLA
Sbjct: 231 RVLV---SDRANKTQWMLLAGAALLAAGSIWGLVFPVNKALWTSSFVLVTAGWANLLLAL 287
Query: 388 IYFM 391
IY++
Sbjct: 288 IYYL 291
>gi|307201549|gb|EFN81312.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
saltator]
Length = 564
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 105/368 (28%), Positives = 170/368 (46%), Gaps = 57/368 (15%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFL 102
+ T + +RR+ ++D FRG + MI V+D G + H+ WNG+ D V P F+
Sbjct: 162 KEEATNKEPTKRRVKAIDAFRGASTLFMIFVNDGSGSYSVLGHTTWNGMLPGDLVFPCFM 221
Query: 103 FIVGVSL-----ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
+I+GV + A + P K+ + R+ LFL+G+ L N L +
Sbjct: 222 WIMGVCVPIALSAQLRRGIP-KLEIAFTVLKRSFLLFLIGVSL--------NTLGTNAQL 272
Query: 158 AQIRWMGVLQRIAIAYLVAA---LC-EIWLKGDGHVSSKLSLFRKYRG------HWVVAL 207
+IR GVLQR + YLV + LC E L+ SS+ + R R +W L
Sbjct: 273 EKIRVFGVLQRFGVTYLVVSVMYLCLEPSLQLQDQDSSRNRVTRVLRDMQVLLPYWSFML 332
Query: 208 VLTTLYLLLLYGLYVPDW--QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKI 265
+L ++ L +GL VP+ Y P T ++ N T G A G IDR +
Sbjct: 333 ILVMVHCGLTFGLAVPNCPTGYLGPGGTHEDGYYM-NCTGG----------AAGYIDRVV 381
Query: 266 LGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLH 325
L I H++ P + YG PFDPEG+L + AT +G+H
Sbjct: 382 LTINHIFAGPTIASV---------YG----------SGPFDPEGILGCLTATFQVYLGVH 422
Query: 326 FGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASGVL 384
G +++ +K+ ++R++ W+ + LG L F + +NK L+SLS+ ++ +
Sbjct: 423 AGVILMMYKNWKERVVRWLSWAVLYGVLGCILHFCNVIPVNKNLWSLSFVFVSTSFALAF 482
Query: 385 LAGIYFMV 392
L+G Y ++
Sbjct: 483 LSGCYLLI 490
>gi|384249073|gb|EIE22555.1| hypothetical protein COCSUDRAFT_42235 [Coccomyxa subellipsoidea
C-169]
Length = 395
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 107/361 (29%), Positives = 168/361 (46%), Gaps = 63/361 (17%)
Query: 67 VALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY---KNFPCKVVAT 123
+ALM+ V+ G +P + H+ W+G+ LAD VMP FL +VGVS+AL+ + P + +
Sbjct: 1 MALMLFVNHAGHEVPWVAHAAWDGVHLADLVMPCFLLLVGVSVALSLGPRASGPRRPL-L 59
Query: 124 RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL 183
RK + R L LG+ +QGG G D++++R+ GVLQRIA+ + + +L ++L
Sbjct: 60 RKVLARTGKLAGLGLLIQGGVGAGAFP---AWDLSRLRYCGVLQRIALCFALVSLVVLYL 116
Query: 184 -------------KGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+GD +S ++ FR Y W++ L + +W F
Sbjct: 117 PQTPSPRLQSLLDRGD-ESASLMAPFRFYALWWILGTALFVAF----------NWMALF- 164
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
+ C R + CN +D ++LG HLY P R N P
Sbjct: 165 ---------LRPPGCLARPALTADCNVAAYVDARLLGRSHLYPWPSCRRA-----NPP-- 208
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVM-ATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
C+ DPEGL +++ A + +GL FG +++ + HR R+ +W S
Sbjct: 209 ----------CEY-LDPEGLFATLSGALASTFLGLWFGAVLLTLRGHRARLRSWAYASVL 257
Query: 350 LIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLM--LKKPFD 406
L LGL+L G + NK LYS S LTAG+SG L +Y + + + PF
Sbjct: 258 LTELGLALHVTGAVPFNKNLYSASSVLLTAGSSGAFLGLVYLFTEVAPTKIFERVAAPFM 317
Query: 407 Y 407
+
Sbjct: 318 W 318
>gi|307178470|gb|EFN67159.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
floridanus]
Length = 512
Score = 124 bits (312), Expect = 6e-26, Method: Compositional matrix adjust.
Identities = 103/378 (27%), Positives = 169/378 (44%), Gaps = 64/378 (16%)
Query: 32 ERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGL 91
ER P N+N K T+ RR+ ++D FRG+ MI V+D G + H+ WNG+
Sbjct: 108 ERTP-----NNNEKATK----HRRVKAIDTFRGVCTLFMIFVNDGSGSYTTLGHATWNGM 158
Query: 92 TLADFVMPFFLFIVGV----SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHG 147
L D V P F++I+GV +L+ K K+ + R+ LFL+GI
Sbjct: 159 LLGDLVFPCFMWIMGVCVPIALSAQLKRGLSKLEISFSIFKRSFLLFLIGI--------A 210
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK----------GDGHVSSKLSLFR 197
+N L + IR GVLQR I YL+ +L + ++ K+
Sbjct: 211 LNTLGTNAQLENIRIFGVLQRFGITYLIVSLLYLCFTPQQPKVAQNLSQTWMTHKMQDIL 270
Query: 198 KYRGHWVVALVLTTLYLLLLYGLYVPDW--QYEFPVETSSSSPWIFNVTCGVRGSTGPAC 255
HW + L L ++ + + L +P Y P + FN T G
Sbjct: 271 SLLPHWCIMLTLVMVHCAVTFCLPIPGCPTGYLGPGGRHEDGKY-FNCTGG--------- 320
Query: 256 NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVM 315
A G IDR +L + H+Y+ P +I+S YG PFDPEG+L +
Sbjct: 321 -ATGYIDRILLTLSHIYQWP--------TIDSI-YG----------SGPFDPEGILGCLT 360
Query: 316 ATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYT 374
+ +G+H G +++ +K ++R++ W++ + LG F + +NK L+SLS+
Sbjct: 361 SIFQVFLGVHTGVILMMYKGWKERIIRWLVWAVFYGCLGCIFHFTNIIPINKNLWSLSFV 420
Query: 375 CLTAGASGVLLAGIYFMV 392
++ + L+G Y ++
Sbjct: 421 LVSTCFALAFLSGCYLLI 438
>gi|193664422|ref|XP_001945789.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
isoform 1 [Acyrthosiphon pisum]
Length = 568
Score = 123 bits (309), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 106/387 (27%), Positives = 183/387 (47%), Gaps = 76/387 (19%)
Query: 17 IQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDV 76
+ I +NE + P +P T ++ R+ SLD FRG+++ LM+ V+
Sbjct: 151 LDIENDFGTGSNEPVIIHPQIP--------TPVKNNSYRITSLDTFRGISIILMVFVNLG 202
Query: 77 GGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRK-----AILRAL 131
GG H+PWNG+TLADF++P+F +++GVS+A++ ++ + RK I R++
Sbjct: 203 GGHYWFFEHAPWNGITLADFILPWFCWVMGVSIAISLRS-QLRSSTKRKYVFGRVIRRSI 261
Query: 132 NLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IWLKGDGH-V 189
L ++G+ L ++ R +GVLQR+A+ Y +AA E I++K +
Sbjct: 262 ALLIMGLVLN---------SVNNNNLRTFRPLGVLQRLALIYFIAATLETIFMKPQPYFT 312
Query: 190 SSKLSLFR---KYRGHWVVALVLTTLYLLLLYGLYVPDWQ--YEFPVETSSSSPWIFNVT 244
+++L + R + W + ++L ++ ++ + L VP Y P +SS N
Sbjct: 313 NTRLDVIRDIIESARQWFIVIILVAIHTVITFFLPVPGCPKGYLGPGGLYNSSS---NTN 369
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYR---KPIYSRTKQCSINSPDYGPMPLDAPSWC 301
C TG A G IDR + G H+Y KP+Y
Sbjct: 370 C-----TG---GAAGYIDRLVFGENHMYPGSPKPVYQ----------------------- 398
Query: 302 QAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI-------ILSSCLIGLG 354
PFDPEG+LS++ T+ +G+H G +I+ ++ +R+ WI ++ CL
Sbjct: 399 SIPFDPEGILSTLTNTLLVYMGVHAGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFS 458
Query: 355 LSLDFVGMHLNKALYSLSYTCLTAGAS 381
+ +NK L+SLSY +T ++
Sbjct: 459 KEDGLIP--INKNLFSLSYAFITGSSA 483
>gi|328696746|ref|XP_003240114.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
isoform 2 [Acyrthosiphon pisum]
Length = 591
Score = 123 bits (308), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 105/377 (27%), Positives = 181/377 (48%), Gaps = 76/377 (20%)
Query: 27 ANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHS 86
+NE + P +P T ++ R+ SLD FRG+++ LM+ V+ GG H+
Sbjct: 184 SNEPVIIHPQIP--------TPVKNNSYRITSLDTFRGISIILMVFVNLGGGHYWFFEHA 235
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRK-----AILRALNLFLLGIFLQ 141
PWNG+TLADF++P+F +++GVS+A++ ++ + RK I R++ L ++G+ L
Sbjct: 236 PWNGITLADFILPWFCWVMGVSIAISLRS-QLRSSTKRKYVFGRVIRRSIALLIMGLVLN 294
Query: 142 GGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IWLKGDGH-VSSKLSLFR-- 197
++ R +GVLQR+A+ Y +AA E I++K + +++L + R
Sbjct: 295 ---------SVNNNNLRTFRPLGVLQRLALIYFIAATLETIFMKPQPYFTNTRLDVIRDI 345
Query: 198 -KYRGHWVVALVLTTLYLLLLYGLYVPDWQ--YEFPVETSSSSPWIFNVTCGVRGSTGPA 254
+ W + ++L ++ ++ + L VP Y P +SS N C TG
Sbjct: 346 IESARQWFIVIILVAIHTVITFFLPVPGCPKGYLGPGGLYNSSS---NTNC-----TG-- 395
Query: 255 CNAVGMIDRKILGIQHLYR---KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLL 311
A G IDR + G H+Y KP+Y PFDPEG+L
Sbjct: 396 -GAAGYIDRLVFGENHMYPGSPKPVYQ-----------------------SIPFDPEGIL 431
Query: 312 SSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI-------ILSSCLIGLGLSLDFVGMHL 364
S++ T+ +G+H G +I+ ++ +R+ WI ++ CL + +
Sbjct: 432 STLTNTLLVYMGVHAGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFSKEDGLIP--I 489
Query: 365 NKALYSLSYTCLTAGAS 381
NK L+SLSY +T ++
Sbjct: 490 NKNLFSLSYAFITGSSA 506
>gi|198434539|ref|XP_002120178.1| PREDICTED: similar to heparan-alpha-glucosaminide
N-acetyltransferase [Ciona intestinalis]
Length = 624
Score = 122 bits (307), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 114/418 (27%), Positives = 187/418 (44%), Gaps = 69/418 (16%)
Query: 22 STSAPANEKLERDPLLPPSNSNS-KQTRP----QHQQRRLISLDVFRGLTVALMILVDDV 76
ST+ P ++ D L + NS ++ P + + R+ S+D FRGL + +M+ V+
Sbjct: 183 STALPHETQIHED--LGNTEPNSVQEANPTPLVREKSERIKSIDTFRGLCLVVMVFVNFR 240
Query: 77 GGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY-----KNFPCKVVATRKAILRAL 131
GG +HSPW+GLT+AD V P+F+FI+GV++ L+ KN P +A K I R +
Sbjct: 241 GGDYWFFHHSPWHGLTVADLVFPWFMFIMGVNITLSINSLITKNVPNSKIA-YKLIRRTV 299
Query: 132 NLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL---------CEIW 182
LF LG+F+ + + A R GVLQR AIAY + + EI
Sbjct: 300 LLFGLGMFV----------VNHSTSWAAFRVPGVLQRFAIAYFLPFVLQWAFHLTPIEIE 349
Query: 183 LKGDGHVSSKLSLFR------KYRGHWVVALVLTTLYLLLLYGLYVPDWQ--YEFPVETS 234
+ + +L + Y W++ L + L+L L + L +P Y P
Sbjct: 350 TRAKTN-EGELKWWHWCKDVVPYWLQWLIVLAMEALWLFLTFLLPIPGCPTGYLGPGGLD 408
Query: 235 SSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMP 294
+ +I N TC A G IDR I G H+Y P + S
Sbjct: 409 NDGKYI-NETC--------VGGAAGYIDRVIFGEAHIYGHPTCKNVYYPTYTSDQ----- 454
Query: 295 LDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL---- 350
+ P+DPEGLL S+ + + ++G G + +++K DR + WI+ L
Sbjct: 455 -------RVPYDPEGLLGSINSCIIVILGCQAGKIFLYYKHPLDRAMRWILWCFFLGVIS 507
Query: 351 -IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPFDY 407
I S + + +NK L++ ++ A + ++ IY++V + +P D+
Sbjct: 508 IILCKASANGGWIPVNKNLWTTTFVTTLACMAFFIIPVIYYLVD--VKKVWTGRPLDF 563
>gi|427385206|ref|ZP_18881711.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
12058]
gi|425727374|gb|EKU90234.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
12058]
Length = 368
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 111/359 (30%), Positives = 163/359 (45%), Gaps = 86/359 (23%)
Query: 39 PSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADF 96
PS NS+ ++RL+SLDV RG+TV MILV++ GG L ++ HS WNGLTL D
Sbjct: 6 PSTFNSQP------KKRLLSLDVLRGITVVGMILVNNSGGKLSYESLQHSAWNGLTLCDL 59
Query: 97 VMPFFLFIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG 154
V PFFLFI+G+S +AL+ +F RK + R L + +G + +F I + +
Sbjct: 60 VFPFFLFIMGISTSIALSKFHFQASGSVVRKILKRTLIILCIGWVIH--WFDFICDGDFS 117
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
A +R GVL RIA+ Y VA+ +++ KY G W++ ++L
Sbjct: 118 -PFAHLRLTGVLPRIALCYCVASFVALYVN------------HKYIG-WLIGILLAGYTF 163
Query: 215 LLLYGL-YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYR 273
LL G Y PD + N + +IDR +LG HLY
Sbjct: 164 LLCIGNGYAPD-----------------------------STNLLAIIDRNVLGADHLYH 194
Query: 274 KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHF 333
K +P DPEGL S+ A LIG G LI+
Sbjct: 195 K----------------------------SPIDPEGLTSTFSAIAHTLIGFCCGKLILAK 226
Query: 334 KDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA-GIYFM 391
K+ + L ++ L+ G L + LNK ++S ++ +T G + +L A IYF+
Sbjct: 227 KNLEQKTLKLFVVGFILMACGFCL-MEALPLNKRIWSPTFVLVTCGLAAMLQALLIYFI 284
>gi|242022263|ref|XP_002431560.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516863|gb|EEB18822.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 607
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/367 (26%), Positives = 166/367 (45%), Gaps = 53/367 (14%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPF 100
+ + Q + + R+ S+D FRGL V LMI V+D G + H+ WNG+ +ADFV P+
Sbjct: 185 DDRTTQASSKPARHRIKSIDTFRGLAVVLMIFVNDGAGHYWFLEHATWNGILVADFVFPW 244
Query: 101 FLFIVGVSLALTYKNFPCKVVATRK----AILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
FL+++G+ + ++ + + V+ K I R + LF LG+ L N + G D
Sbjct: 245 FLWVMGLCIPISIRTQLKRNVSRWKILGHVIKRGILLFGLGVLL--------NTVGIGSD 296
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKL---SLFRKYRG-------HWVVA 206
+ IR GVLQR +I YL+ A+ + + S FR+ W+V
Sbjct: 297 LETIRIPGVLQRFSIVYLIIAILGVCFTPRSISNENRFPGSSFRETFQDIIIIFPQWIVI 356
Query: 207 LVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
L + Y ++ VP + FN G G +D+ +L
Sbjct: 357 LSIVAAYCYFVFFSPVPGCPSGYLGPGGIQDGGRFNECTG---------GMTGYVDKVLL 407
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G++H+Y+ P S+ + PFDPEGLL + + G+
Sbjct: 408 GVEHIYKNPTSSKVYK-------------------SGPFDPEGLLGVMPSIFQAFFGVQA 448
Query: 327 GHLIVHFKDHRDRMLNWII--LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVL 384
G +++ + + +++ W + + ++ L LSL + + +NK L+SLSY T ++ ++
Sbjct: 449 GATLLYHPEWKAKLIRWFTWGILNGILALLLSLPGI-VPINKNLWSLSYVFTTTSSAFLI 507
Query: 385 LAGIYFM 391
L IYF
Sbjct: 508 LCVIYFF 514
>gi|383115204|ref|ZP_09935962.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
gi|313695379|gb|EFS32214.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
Length = 361
Score = 121 bits (304), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 157/333 (47%), Gaps = 78/333 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L NF C+ A K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYNFQCR-PAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL + + ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITALLAVTI--------------PHKRFMPLAIILLVVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G A N VGMID ILG H+Y +
Sbjct: 160 ---------------------NGFEKSADNIVGMIDSAILGSNHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I+ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNERRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ +G L + LNK L+S S+ LT G
Sbjct: 233 GTTLLFVGYLLSY-ACPLNKRLWSPSFVLLTCG 264
>gi|242007028|ref|XP_002424344.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507744|gb|EEB11606.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 497
Score = 121 bits (303), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 104/375 (27%), Positives = 176/375 (46%), Gaps = 68/375 (18%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFL 102
N + +R ++ R+ SLD FRGL + +MI V+ GG HSPWNG+T+ADFV P+F+
Sbjct: 99 NDRYSRIKNS--RIKSLDAFRGLAILIMIFVNYNGGDYSVFKHSPWNGITIADFVFPWFI 156
Query: 103 FIVGVSLALTYKNFPCKVVATRKAILRALN----LFLLGIFLQGGFFHGINNLKYGVDIA 158
+I+G S L+ N + + ++ R L L LGI L G +
Sbjct: 157 WIMGASTVLSIDNNFRRAQSKKEIFFRILKRSFYLIALGIVLNSG---------HRDSKG 207
Query: 159 QIRWMGVLQRIAIAYLVAALCEIW-LK-------GDGHVSSKLSLFRKYRGHWVVALVLT 210
+R GVLQRI + Y + A EI+ LK G + S + K W+V ++L
Sbjct: 208 FLRVCGVLQRIGLTYFIIASLEIFALKSLLNEHFGPWNFSRNII---KIWIQWLVPILLV 264
Query: 211 TLYLLLLYGLYVPD--WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGI 268
+++++ + L+VP Y P S+ S R TG A G IDR I+
Sbjct: 265 AIHVIITFTLHVPGCPLGYTGPGGLSNHS--------AFRNCTG---GAAGYIDRLIITD 313
Query: 269 QHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGH 328
H+Y + + + + S+ PFDPEGLL ++ + +G+
Sbjct: 314 NHMYHRGSFLKIFKPSV------------------PFDPEGLLGTLTSVFCAFLGVQSAR 355
Query: 329 LIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG--------MHLNKALYSLSYTCLTAGA 380
++++ ++ ++ +WI + I +GL F+ + +NK L+SLSY T+
Sbjct: 356 ILINHENSFSKIKSWIFWA---IVMGLISGFLCNWSQNSGIIPINKNLWSLSYVLATSSI 412
Query: 381 SGVLLAGIYFMVRYI 395
+ ++L Y ++ ++
Sbjct: 413 AFLILTTFYTLIDFL 427
>gi|160884063|ref|ZP_02065066.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
gi|423291476|ref|ZP_17270324.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
CL02T12C04]
gi|156110405|gb|EDO12150.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
gi|392663476|gb|EIY57026.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
CL02T12C04]
Length = 361
Score = 120 bits (302), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 157/333 (47%), Gaps = 78/333 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L NF C+ A K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYNFQCR-PAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL + + ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITALLAVTI--------------PHKRFMPLAIILLAVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G A N VGMID ILG H+Y +
Sbjct: 160 ---------------------NGFEKSADNIVGMIDSAILGSNHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I+ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ +G L + LNK L+S S+ LT G
Sbjct: 233 GTTLLFVGYLLSY-ACPLNKRLWSPSFVLLTCG 264
>gi|186683151|ref|YP_001866347.1| hypothetical protein Npun_R2871 [Nostoc punctiforme PCC 73102]
gi|186465603|gb|ACC81404.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 375
Score = 120 bits (302), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 115/362 (31%), Positives = 162/362 (44%), Gaps = 78/362 (21%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MILV+ G I P + H+ W+G T D V PFFLFIVGV++
Sbjct: 2 RLTSLDVFRGITIAAMILVNMAGVADNIYPPLAHADWHGCTPTDLVFPFFLFIVGVAMTF 61
Query: 112 -----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
T N P + R + RA LF LG+ L G + G+ + D++ IR MGVL
Sbjct: 62 SLSKYTEDNKPTSAIYWR-ILRRAAILFALGLLLNGFWNQGV----WTFDLSSIRIMGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI+I YL+A+L L+L RK G W++A V+ Y L++ L VPD+
Sbjct: 117 QRISITYLLASLIV------------LNLPRK--GQWILAAVILIGYWLMMMYLPVPDYG 162
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
GV G N IDR I+ HLY+ +
Sbjct: 163 ------------------AGVLTREG---NLGAYIDRMIIPKAHLYKGDGFKFMG----- 196
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH--FKDHRDRMLNWI 344
DPEGL S++ A V+ L G G I + L
Sbjct: 197 -------------------DPEGLFSTIPAIVSVLAGYFTGQWIRSQPVQSRTSIGLGLF 237
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKP 404
+ +IG F +NK L++ SY ++G + +LLA Y ++ + + KP
Sbjct: 238 GVGCLIIGWAWGWTF---PINKKLWTSSYVVFSSGWALILLAACYELIE-VRLNRRWSKP 293
Query: 405 FD 406
F+
Sbjct: 294 FE 295
>gi|326918494|ref|XP_003205523.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Meleagris gallopavo]
Length = 532
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 108/405 (26%), Positives = 184/405 (45%), Gaps = 67/405 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQ------QRRLISLDVFRGLTVALMIL 72
+ K + ++L L PS ++S T P + Q+RL SLD FRGL++ +M+
Sbjct: 90 VYKKLNPRETDRLINSELGSPSTTDSISTDPSPRLWRATSQQRLRSLDTFRGLSLIIMVF 149
Query: 73 VDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAIL 128
V+ GG H WNGLT+AD V P+F+FI+G S++L+ + K K +
Sbjct: 150 VNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSLSSMLRWGSSKQKVLGKILW 209
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
R+ L LLG+ + N + +R GVLQR+ + YLV A E+ +
Sbjct: 210 RSFLLTLLGVIVVNP-----NYCLGALSWENLRIPGVLQRLGLTYLVVAALELLFTRAVN 264
Query: 189 VSSKLSLFRK-----------YRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
+S L L ++ + W+ L L ++L L + L VP +
Sbjct: 265 ISPSLHLMQEMSYPALQDVLPFWPQWIFILTLEVIWLCLTFLLPVPGCPRGY-------- 316
Query: 238 PWIFNVTCGVRGSTGPACN----AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
+ G G G N A G IDR +LG +H+Y+ P + Q ++
Sbjct: 317 -----LGPGGIGDFGKYANCTGGAAGYIDRLVLGEKHIYQHPSCNVLYQTTV-------- 363
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
P+DPEG+L ++ + +GL G +I+ +KD ++++ ++ S ++G+
Sbjct: 364 ----------PYDPEGILGTINTILMAFLGLQAGKIILSYKDQHKQIMSRFLIWSLVMGI 413
Query: 354 ------GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
S + + +NK L+S SY + + +LL +Y++V
Sbjct: 414 ISAILTKCSKEEGFIPINKNLWSTSYVTTMSCFAFILLLLMYYLV 458
>gi|270004236|gb|EFA00684.1| hypothetical protein TcasGA2_TC003561 [Tribolium castaneum]
Length = 569
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 178/380 (46%), Gaps = 79/380 (20%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMP 99
+ S +K+ +P+ ++RL SLD FRG+++ +MI V+ G P ++H+ WNGL LAD V P
Sbjct: 165 AKSETKEKKPE-GKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFP 223
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGV 155
+F++I+G + ++ + K ++ + L R++ LF LG+FL G +
Sbjct: 224 WFMWIMGACMPISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNAGPY---------- 273
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG------HWVVALVL 209
+ +R GVLQR I YLV ++L SK + + + W+V L++
Sbjct: 274 -LECMRIFGVLQRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLII 332
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPA----------CN--A 257
++ + L+ L P RG GP C A
Sbjct: 333 FFVHCMFLFLL------------ADEGCP---------RGYLGPGGLHENGKHFNCTGGA 371
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
G ID ILG H Y+KP +K+ + + FDPEG+L + +
Sbjct: 372 TGYIDAVILG-NHRYQKP---TSKEIYLGT---------------QAFDPEGILGCLTSI 412
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWI---ILSSCLIG--LGLSLDFVGMHLNKALYSLS 372
V IG+ G ++ +K+H R++ W+ +L+ + G G S + + +NK L+S+S
Sbjct: 413 VHVFIGVQAGITLLVYKEHSARLIRWLSWSVLAGIVGGALCGFSKEDGLIPVNKNLWSIS 472
Query: 373 YTCLTAGASGVLLAGIYFMV 392
+ +T+ + +LL+ Y ++
Sbjct: 473 FVLVTSCFAFLLLSICYVLI 492
>gi|91079154|ref|XP_966977.1| PREDICTED: similar to CG6903 CG6903-PA [Tribolium castaneum]
Length = 533
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 102/380 (26%), Positives = 178/380 (46%), Gaps = 79/380 (20%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMP 99
+ S +K+ +P+ ++RL SLD FRG+++ +MI V+ G P ++H+ WNGL LAD V P
Sbjct: 165 AKSETKEKKPE-GKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFP 223
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGV 155
+F++I+G + ++ + K ++ + L R++ LF LG+FL G +
Sbjct: 224 WFMWIMGACMPISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNAGPY---------- 273
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG------HWVVALVL 209
+ +R GVLQR I YLV ++L SK + + + W+V L++
Sbjct: 274 -LECMRIFGVLQRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLII 332
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPA----------CN--A 257
++ + L+ L P RG GP C A
Sbjct: 333 FFVHCMFLFLL------------ADEGCP---------RGYLGPGGLHENGKHFNCTGGA 371
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
G ID ILG H Y+KP +K+ + + FDPEG+L + +
Sbjct: 372 TGYIDAVILG-NHRYQKP---TSKEIYLGT---------------QAFDPEGILGCLTSI 412
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWI---ILSSCLIG--LGLSLDFVGMHLNKALYSLS 372
V IG+ G ++ +K+H R++ W+ +L+ + G G S + + +NK L+S+S
Sbjct: 413 VHVFIGVQAGITLLVYKEHSARLIRWLSWSVLAGIVGGALCGFSKEDGLIPVNKNLWSIS 472
Query: 373 YTCLTAGASGVLLAGIYFMV 392
+ +T+ + +LL+ Y ++
Sbjct: 473 FVLVTSCFAFLLLSICYVLI 492
>gi|423214205|ref|ZP_17200733.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693150|gb|EIY86385.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
CL03T12C04]
Length = 361
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 101/333 (30%), Positives = 157/333 (47%), Gaps = 78/333 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L NF C+ A K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYNFQCR-PAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL + + ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITALLAVTI--------------PHKRFMPLAIILLAVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G A N VGMID ILG H+Y +
Sbjct: 160 ---------------------NGFEKSADNIVGMIDSAILGSNHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I++ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIINIKDNDRRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ G L + LNK L+S S+ LT G
Sbjct: 233 GTTLLFAGYLLSY-ACPLNKRLWSPSFVLLTCG 264
>gi|449283383|gb|EMC90042.1| Heparan-alpha-glucosaminide N-acetyltransferase [Columba livia]
Length = 560
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 110/396 (27%), Positives = 173/396 (43%), Gaps = 67/396 (16%)
Query: 4 YNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFR 63
Y +N D+L+ S+ S A + + DP + Q ++RL SLD FR
Sbjct: 123 YKKLNPRETDRLIN--SEPGSPNAADPISSDP--------APQLWSSAPRQRLRSLDTFR 172
Query: 64 GLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGV----SLALTYKNFPCK 119
GL++ +M+ V+ GG H WNGLT+AD V P+F+FI+G SL+ + K
Sbjct: 173 GLSLIIMVFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSLSSMLRQGSSK 232
Query: 120 VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
K + R+ L LLG+ + N + +R GVLQR+ +AYLV A
Sbjct: 233 WKVLGKILWRSFLLILLGVIVVNP-----NYCLGPLSWENLRIPGVLQRLGLAYLVVAAL 287
Query: 180 EIWLKGDGHVSSKLSL-------FRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVE 232
E+ G S L Y WV L+L L+L L + L VP +
Sbjct: 288 ELLFTRAGAESGTLETPCPALRDILPYWPQWVFVLMLEVLWLCLTFLLPVPGCPRGY--- 344
Query: 233 TSSSSPWIFNVTCGVRGSTGPACN----AVGMIDRKILGIQHLYRKPIYSRTKQCSINSP 288
+ G G G N A G IDR +LG +H+Y+ P + Q ++
Sbjct: 345 ----------LGPGGIGDFGNYANCTGGAAGYIDRLLLGDKHIYQHPSSNVIYQTTM--- 391
Query: 289 DYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSS 348
P+DPEG+L ++ +GL G +I+ +KD R+++ + +
Sbjct: 392 ---------------PYDPEGILGTINTIFMAFLGLQAGKIILFYKDQHKRIMSRFFIWT 436
Query: 349 CLIGL------GLSLDFVGMHLNKALYSLSYTCLTA 378
++G+ S + + +NK L+S+SY T+
Sbjct: 437 IVMGIISAILTKCSKEEGFIPINKNLWSVSYVTTTS 472
>gi|299144716|ref|ZP_07037784.1| putative membrane protein [Bacteroides sp. 3_1_23]
gi|298515207|gb|EFI39088.1| putative membrane protein [Bacteroides sp. 3_1_23]
Length = 361
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 102/333 (30%), Positives = 157/333 (47%), Gaps = 78/333 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L NF C+ A K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYNFQCR-PAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL L++ ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITAL--------------LAVTIPHKRFMPLAIILLIVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G A N VGMID ILG H+Y +
Sbjct: 160 ---------------------NGFEKSADNIVGMIDSAILGSNHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I+ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ G L + LNK L+S S+ LT G
Sbjct: 233 GTTLLFAGYLLSY-ACPLNKRLWSPSFVLLTCG 264
>gi|195041852|ref|XP_001991329.1| GH12115 [Drosophila grimshawi]
gi|193901087|gb|EDV99953.1| GH12115 [Drosophila grimshawi]
Length = 573
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 114/383 (29%), Positives = 171/383 (44%), Gaps = 56/383 (14%)
Query: 27 ANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHS 86
A ++ D ++S Q++RL SLD FRGL + LMI V+ GG I H+
Sbjct: 156 AWKRYRYDEAAAVADSIGDAAAKATQRKRLRSLDTFRGLCIVLMIFVNSGGGGYSWIEHA 215
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYK----NFPCKVVATRKAILRALNLFLLGIFLQG 142
WNGL LAD V P FL+I+GV + L+ K K + + RA LF +G+ L
Sbjct: 216 AWNGLHLADIVFPSFLWIMGVCIPLSIKAQLARGTSKTRICLRIVWRACKLFAIGLCL-- 273
Query: 143 GFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK---- 198
N G + Q+R MGVLQR IAYL+ A+ +S + + R
Sbjct: 274 -------NSVNGPQLEQLRLMGVLQRFGIAYLLVAILHTVCSRRDQLSPQRAWQRAIYDI 326
Query: 199 --YRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPAC- 255
+ G + V L L YL L +GL VP + + + P C
Sbjct: 327 CLFSGEFAVLLALIATYLGLTFGLRVPGCPVGYLGPGGKHN-----------NAAHPNCI 375
Query: 256 -NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSV 314
A G ID +LG H+Y+ P DA + FDPEG+ +
Sbjct: 376 GGAAGYIDLLVLGNAHIYQHPT--------------AKYVYDATA-----FDPEGIFGCL 416
Query: 315 MATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMH-----LNKALY 369
++ V L+G G ++ + R+ W+++++ L LG +L +NK L+
Sbjct: 417 LSVVQTLLGAFAGVTLLVHSTWQGRLKRWLLVATVLGLLGGALCGFSKENGIIPINKNLW 476
Query: 370 SLSYTCLTAGASGVLLAGIYFMV 392
SLS+ +TA + VLL+ IY++V
Sbjct: 477 SLSFVFVTAALALVLLSLIYYVV 499
>gi|237719042|ref|ZP_04549523.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|229451820|gb|EEO57611.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 361
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 100/333 (30%), Positives = 156/333 (46%), Gaps = 78/333 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L NF C+ A K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYNFQCR-PAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL + + ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITALLAVTI--------------PHKRFMPLAIILLVVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G N VGMID ILG H+Y +
Sbjct: 160 ---------------------NGFEKSVDNIVGMIDSAILGANHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I+ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ +G L + LNK L+S S+ LT G
Sbjct: 233 GTTLLFVGYLLSY-ACPLNKRLWSPSFVLLTCG 264
>gi|423293378|ref|ZP_17271505.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
CL03T12C18]
gi|392678321|gb|EIY71729.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
CL03T12C18]
Length = 361
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/333 (30%), Positives = 156/333 (46%), Gaps = 78/333 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L NF C+ A K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYNFQCR-PAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL + + ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITALLAVTI--------------PHKRFMPLAIILLVVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G N VGMID ILG H+Y +
Sbjct: 160 ---------------------NGFEKSVDNIVGMIDSAILGANHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I+ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ +G L + LNK L+S S+ LT G
Sbjct: 233 GTTLLFVGYLLSY-ACPLNKRLWSPSFVLLTCG 264
>gi|340371415|ref|XP_003384241.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Amphimedon queenslandica]
Length = 743
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 123/415 (29%), Positives = 188/415 (45%), Gaps = 83/415 (20%)
Query: 2 AKYNPINNDANDQLVI-QISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
Y +NN L+ Q K+T+ NE DPL TR ++ RL SLD
Sbjct: 321 VNYGALNNGETANLLDDQKEKATTDLLNE----DPL---------STR---KKERLRSLD 364
Query: 61 VFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV 120
FRG+++ +MI V+ GG NHS WNG+T+AD V P+F++I+GVS+ ++K
Sbjct: 365 TFRGMSLIIMIFVNYGGGGYWFFNHSIWNGITVADLVFPWFVWIMGVSIVYSFKGRKKDS 424
Query: 121 VATR--KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
R + + R++ L LG+FL G+ ++ R GVLQR AIAY V A+
Sbjct: 425 FKLRLYQVVRRSVILLGLGLFLNNGY-----------RLSHWRIPGVLQRFAIAYFVVAM 473
Query: 179 CEIW---------LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
E+ LK D V S L + W+V + L +L+L++ + L P +
Sbjct: 474 TELLAPMVYNKYKLKWD--VISVRDLTHNWV-QWLVIVFLESLWLIITFSLKAPGCPRGY 530
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAV----GMIDRKILGIQHLYRKPIYSRTKQCSI 285
+ G R G N G ID IL H+Y P C
Sbjct: 531 -------------LGPGGRADGGKYSNCTGGIAGYIDSWILTDNHIYGHP------TCK- 570
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
+ +DPEG+L S+ + V C G+ G +++H K R++ +++
Sbjct: 571 ------------AIYHTGSYDPEGILGSINSIVMCFFGVQAGRILIHHKQFGSRIVRFVV 618
Query: 346 LSSCLIGLGL-----SLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYI 395
+ GLG +L+ + LNK L+SLS+ + AG +LLA YF++ I
Sbjct: 619 WGLLMGGLGTILCEATLNKGVIPLNKNLWSLSFILVIAGLGYILLALFYFIIDVI 673
>gi|242062184|ref|XP_002452381.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
gi|241932212|gb|EES05357.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
Length = 96
Score = 119 bits (298), Expect = 3e-24, Method: Composition-based stats.
Identities = 58/97 (59%), Positives = 70/97 (72%), Gaps = 2/97 (2%)
Query: 163 MGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYV 222
MG+LQRIAIAYL+AA+CEIWLKGD V S L R+YR V LVL+ Y +LLYG+YV
Sbjct: 2 MGILQRIAIAYLLAAICEIWLKGDDDVDSGYGLLRRYRYQLFVGLVLSIAYTILLYGIYV 61
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVG 259
PDW+Y+ + S+ F+V CGVRG TG ACNAVG
Sbjct: 62 PDWEYK--ISGPGSTEKSFSVKCGVRGDTGLACNAVG 96
>gi|17229379|ref|NP_485927.1| hypothetical protein all1887 [Nostoc sp. PCC 7120]
gi|17130977|dbj|BAB73586.1| all1887 [Nostoc sp. PCC 7120]
Length = 375
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 123/364 (33%), Positives = 165/364 (45%), Gaps = 82/364 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MILV+ G + P + H+ W+G T D V PFFLFIVGV+++
Sbjct: 2 RLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMSF 61
Query: 112 -----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
T +N P VV R RA LF+LG+ L G + GI + D++ IR MGVL
Sbjct: 62 SLSKYTQENKPTSVVYWR-IFRRAAILFVLGLLLNGFWNKGI----WTFDLSNIRIMGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI+++YL A+L + L+L RK G W++A VL Y L + + VPD+
Sbjct: 117 QRISLSYLFASL------------TVLNLPRK--GQWILAGVLLVGYWLTMMYVPVPDYG 162
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
GV G N IDR I+ HLY +
Sbjct: 163 ------------------AGVLTREG---NFGAYIDRLIIPKSHLYAGDGFKNLG----- 196
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGL S++ A V+ L G G I L +
Sbjct: 197 -------------------DPEGLFSTIPAIVSVLAGYFTGEWIRKQPVQTRTSLGLALF 237
Query: 347 S-SCLI-GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM--VRYISSHLMLK 402
CLI G G F +NK L++ SY T+G + +LLA Y + VR I
Sbjct: 238 GIGCLIVGWGWGWVF---PINKKLWTSSYVVFTSGWALLLLAACYELIEVRLIKRW---G 291
Query: 403 KPFD 406
KPF+
Sbjct: 292 KPFE 295
>gi|383849627|ref|XP_003700446.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Megachile rotundata]
Length = 552
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 94/362 (25%), Positives = 159/362 (43%), Gaps = 50/362 (13%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIV 105
T Q +RR+ ++D RG + LMI V+D G + H+ WNGL D + P F++I+
Sbjct: 160 DTARQPVKRRVKAIDTVRGASTLLMIFVNDGSGGYKTLGHATWNGLLPGDLLFPCFIWIM 219
Query: 106 GV----SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIR 161
GV +L K K V + R++ LFL+G+ +N + G + IR
Sbjct: 220 GVCIPIALGSQLKRMVPKHVILYGILKRSVLLFLIGV--------SLNTVGTGPQLESIR 271
Query: 162 WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG------HWVVALVLTTLYLL 215
GVLQR + Y + A+ ++L + + R + W V L + + +
Sbjct: 272 IFGVLQRFGVTYFIVAVIYLFLISKRPTKVQSPMLRDVQDFLLLLPQWTVMLAIVAAHCI 331
Query: 216 LLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
+ + L VP T P + G A G ID+ +L QHL+
Sbjct: 332 ITFCLPVPG------CPTGYLGPGGLHDDAKYFDCVG---GAAGYIDKVVLKEQHLHHSM 382
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
++ APFDPEG+L + +T +GLH G +++ +KD
Sbjct: 383 TVYKS----------------------APFDPEGILGCLTSTFHVFLGLHAGIIMMTYKD 420
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
++R++ W+ ++ +G +L F + +NK L+SLS+ +T S L+ Y +V
Sbjct: 421 WKERVIRWLAWAAFFSCIGCALHFTNVIPVNKKLWSLSFVFVTTSFSLAFLSACYLLVDV 480
Query: 395 IS 396
+
Sbjct: 481 VK 482
>gi|118378164|ref|XP_001022258.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila]
gi|89304025|gb|EAS02013.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila
SB210]
Length = 827
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 100/358 (27%), Positives = 162/358 (45%), Gaps = 77/358 (21%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
++RL LD++RGLT+ MILVD++G ++ ++ + WNGL+ AD V P FLFI G+++
Sbjct: 460 KQRLECLDIYRGLTMVGMILVDNMGNSSVIWPLDETEWNGLSTADCVFPSFLFISGMAIT 519
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
L K+ K + + R + LF++G+ L N K Q R MGVLQRIA
Sbjct: 520 LAIKHNGNKKQQFFRILERFVKLFVIGVALNA----ACANYK-----QQFRIMGVLQRIA 570
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
I Y V + ++L+ + +V+ V +Y+ +Y VPD
Sbjct: 571 ICYFVTSTSYLFLQN-------------FAVQFVLNGVFLLIYIYFMYFFDVPD------ 611
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
CG T P CN +D +I + ++ +
Sbjct: 612 -------------GCGANNVT-PTCNFGRYLDMQIFTLNYMMK----------------- 640
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM-LNWIILSSC 349
P DPEGL +++ A VT IGL +G + FK + R+ W ++S
Sbjct: 641 -------------PSDPEGLFTTLGALVTTFIGLCYGLALQEFKSQKKRLSCIWFVMSLV 687
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPFDY 407
L+ +G F+ +NK ++S S+ + SG L + F+V I ++L L K ++
Sbjct: 688 LVFIGGICCFL-TPINKKVWSPSFVFIVGSMSGAFL-NLCFIVVDIYNNLKLNKALEF 743
>gi|270339962|ref|ZP_06203500.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270333113|gb|EFA43899.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 389
Score = 119 bits (297), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 111/336 (33%), Positives = 155/336 (46%), Gaps = 78/336 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGVS-- 108
Q+RLISLDV RGLTV LMI V++ G I + HS WNG+TL D V PFFLF+VGVS
Sbjct: 17 QQRLISLDVLRGLTVMLMIFVNNGAGTQIFSPLRHSRWNGMTLCDLVFPFFLFMVGVSTY 76
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQR 168
L+L NF RK R LFL+G+ + +F N +D+A +R MGV+QR
Sbjct: 77 LSLRKSNFAWSAKTLRKIARRTALLFLIGLTIN--WFDMACN-GSPLDLAHLRIMGVMQR 133
Query: 169 IAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYE 228
IA+ Y A I +SSK+ R + W++A++L LLL+ G + Y
Sbjct: 134 IALCYGATAFVAI-------LSSKVPQ-RLHLIPWLIAVLLIAYSLLLIIG---GGYDYS 182
Query: 229 FPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSP 288
A N + ++D ILG HLY +
Sbjct: 183 ------------------------SATNLLAIVDTHILGYDHLYHR-------------- 204
Query: 289 DYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV-----HFKDHRDRMLNW 343
+P DPEGLLS++ A LIG L + H + RM +
Sbjct: 205 --------------SPVDPEGLLSTLPAIAHTLIGFWVARLTIGKQGSHSTQNTVRM--F 248
Query: 344 IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
++ + L+ G+ L + + LNK ++S SY T G
Sbjct: 249 LLAGAMLVVSGIILTLL-LPLNKRIWSPSYVLTTCG 283
>gi|392968994|ref|ZP_10334410.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
BUZ 3]
gi|387843356|emb|CCH56464.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
BUZ 3]
Length = 390
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 114/375 (30%), Positives = 161/375 (42%), Gaps = 89/375 (23%)
Query: 21 KSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG-- 78
++T++PA+ + S+ T PQ RL+SLD FRGLTVA MILV++ G
Sbjct: 2 QTTTSPASVR-----------SSESLTNPQ----RLLSLDAFRGLTVAGMILVNNAGDWQ 46
Query: 79 -ILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLG 137
+ + H+ WNG T D + PFFLFIVGVS+ K + R+ LF+LG
Sbjct: 47 YVYAPLKHAAWNGWTPTDLIFPFFLFIVGVSITFALAGGQEHTNVIGKILKRSFTLFMLG 106
Query: 138 IFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFR 197
+FL FF DI +R GVLQRIA+ YL +L I+L+ + L L
Sbjct: 107 LFL--AFFPKF-------DITTVRIPGVLQRIALVYLACSL--IYLRTTTRQQTWL-LAA 154
Query: 198 KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNA 257
G+W+V V+ VP Y + P N
Sbjct: 155 LLVGYWLVMTVVP-----------VPGVGY---------------------ANLEPTTNL 182
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
+DR +L HLYR TK +DPEGLLS++ A
Sbjct: 183 AAWLDRTVLTTDHLYRS-----TKV----------------------WDPEGLLSTIPAI 215
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
T L G+ G + + W+ CL LG + +NKAL++ SY L
Sbjct: 216 GTGLAGVLVGTWLRRRDVADADKIAWLFTVGCLTTLGGLIWDAWFPINKALWTSSYVLLA 275
Query: 378 AGASGVLLAGIYFMV 392
AG + + LA Y+++
Sbjct: 276 AGLAMLGLALCYWLI 290
>gi|196002389|ref|XP_002111062.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
gi|190587013|gb|EDV27066.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
Length = 431
Score = 119 bits (297), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 139/316 (43%), Gaps = 49/316 (15%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFI 104
+Q + RR+ SLD++RGL +M D GG HS WNGLT+ D V P F+FI
Sbjct: 152 QQPHIDNLDRRIRSLDLYRGLCAIVMAFGDSGGGQYRFFKHSIWNGLTIVDVVFPGFIFI 211
Query: 105 VGVSLALT-----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQ 159
G SL+++ YK K++ I R+ LF LG+ + G I+
Sbjct: 212 SGFSLSISLVKRLYKMQTPKLILIVTTIRRSFYLFFLGLLING-----------PCQISN 260
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK--YRGHWVVALVLTTLYLLLL 217
R +GVLQRI++ +LV + +WL +K + ++ R W + +++ L+ +
Sbjct: 261 WRLLGVLQRISVTFLVVSCLAVWLYPTIKSFTKDQVLQEKVLRKMWPIMVLIVGLHTYVT 320
Query: 218 YGLYVPDW--QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
VPD Y P S + +N T G+ G IDR + G HLY +P
Sbjct: 321 LTAAVPDCPVGYSGPGGKSDDGKY-YNCTGGI----------AGFIDRFVFGSNHLYSRP 369
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
Q S PFDPEG+L ++ + C +GL G L F +
Sbjct: 370 TCKLLYQSS------------------QPFDPEGVLGTLTSIFLCFLGLQMGILHNIFSN 411
Query: 336 HRDRMLNWIILSSCLI 351
+ M WI+ L+
Sbjct: 412 NLRIMRTWILFGLLLV 427
>gi|428308802|ref|YP_007119779.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
gi|428250414|gb|AFZ16373.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
Length = 381
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 114/373 (30%), Positives = 166/373 (44%), Gaps = 86/373 (23%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVG-----GILPAINHSPWNGLTLADFVMP 99
+Q +P +RL SLDVFRG+T+A MILV+ +G + P + H+ WNG T D V P
Sbjct: 4 EQLKPS---KRLTSLDVFRGITIAGMILVNMIGVAGDKNVYPPLLHADWNGFTPTDLVFP 60
Query: 100 FFLFIVGVSLALTYKNF-----PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG 154
FFLFIVG ++A ++ + P V R I R+L LF LGI L G + +
Sbjct: 61 FFLFIVGAAMAFSFSKYKHGNKPTPTVYWR-IIRRSLILFALGILLNGFWEY-------- 111
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
+ + IR MGVLQRI++ YL+A+L L++ RK G W +A L Y
Sbjct: 112 -NWSSIRIMGVLQRISLTYLIASLIV------------LNVPRK--GQWAIAAFLLIGYW 156
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRK 274
+ + VPD+ G+ N DR I+ HLY+
Sbjct: 157 FAMSLIPVPDYG---------------------MGNLTREGNFGAYFDRLIIPTAHLYK- 194
Query: 275 PIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
D+ M DPEGL S++ A V+ L G G +
Sbjct: 195 ------------GDDFNGMG-----------DPEGLFSTLPAVVSVLFGYLTGDWLRQQP 231
Query: 335 DHRDRMLNWIILS-SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
++ ++L SCL+ +G DF +NK L++ SY T G + +LLA Y +
Sbjct: 232 IKSTTSMDLVLLGLSCLV-IGQVWDF-WFPINKKLWTSSYVLFTTGWALLLLAACYESIE 289
Query: 394 YISSHLMLKKPFD 406
+ KPF+
Sbjct: 290 -VRQRQRWAKPFE 301
>gi|281423205|ref|ZP_06254118.1| putative membrane protein [Prevotella oris F0302]
gi|281402541|gb|EFB33372.1| putative membrane protein [Prevotella oris F0302]
Length = 359
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 101/337 (29%), Positives = 151/337 (44%), Gaps = 78/337 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+++RL+SLDV RG TV LMILV++ G I + HS WNG+T D V PFFLFI+G+S
Sbjct: 2 EKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIST 61
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
L+L NF K + R + LFL+G+F+ F I+ +D + +R V+Q
Sbjct: 62 YLSLKKTNFTWSRQVAFKIVKRTVLLFLIGLFIN-WFDMAISG--NALDFSHLRIWAVMQ 118
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL-YVPDWQ 226
RIAI Y ++ + + + + L Y LLL++G Y D Q
Sbjct: 119 RIAICYFAVSIFALCCNHRHTIPAIVILLAAYN-------------LLLIWGNGYAYDSQ 165
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
N + ID ++ GI+HLY
Sbjct: 166 Q----------------------------NILAQIDIRLFGIEHLYH------------- 184
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+P DPEG SS+ A LIG + G + K +++L ++I
Sbjct: 185 ---------------NSPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKVLRFLIT 229
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
L+ +G + F G+ LNK ++S SY C+T G + V
Sbjct: 230 GGFLVIIGYIVSF-GLPLNKRIWSPSYVCMTCGLAAV 265
>gi|373460170|ref|ZP_09551926.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
gi|371956555|gb|EHO74341.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
Length = 359
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 107/354 (30%), Positives = 157/354 (44%), Gaps = 94/354 (26%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+++RL+SLDV RG+TV LMILV++ G I + HS WNG+T D V PFFLFI+G+S
Sbjct: 2 EKKRLLSLDVLRGMTVCLMILVNNGAGEHIYSTLQHSKWNGMTPCDLVFPFFLFIMGIST 61
Query: 109 -LALTYKNF------PCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDIAQ 159
L+L NF CK +A R +L A+ LF+ + LQG +D
Sbjct: 62 FLSLKQTNFAWNRQTACK-IAKRTVLLFAIGLFINWFDLLLQG----------RALDFEH 110
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSS-KLSLFRKYRGHWVVALVLTTLYLLLLY 218
+R GV+QRIAI Y G VS LS+ K + L++ L+L
Sbjct: 111 LRIWGVMQRIAICY-------------GAVSVFALSINHKRTLPLIATLLIAYAMFLMLG 157
Query: 219 GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYS 278
Y D Q N + ID + G HLY K
Sbjct: 158 NGYAYDSQQ----------------------------NLIAQIDIHLFGQAHLYHK---- 185
Query: 279 RTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
+P DPEGL SS+ A LIG + G L+ + +
Sbjct: 186 ------------------------SPVDPEGLASSLPAIAHTLIGFYCGRLMAMARTTEE 221
Query: 339 RMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++L ++++ L+ +G F G+ LNK ++S SY C+T G + L + +++
Sbjct: 222 KVLKFMLVGGVLVLIGYLASF-GLPLNKRIWSPSYVCITCGLAATCLGLLMYVI 274
>gi|423226736|ref|ZP_17213201.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392627009|gb|EIY21050.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 368
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 105/346 (30%), Positives = 157/346 (45%), Gaps = 80/346 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILP--AINHSPWNGLTLADFVMPFFLFIVGVS- 108
Q++RL+SLDV RG+TV MILV++ GG L ++ HS WNGLTL D V PFFLFI+GVS
Sbjct: 13 QKKRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCDLVFPFFLFIMGVST 72
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+AL+ +F RK + R L + +G + +FH I + + A +R GVL
Sbjct: 73 YIALSKFHFQASGSVVRKVLKRTLVILCIGWAIH--WFHFICDGDF-FPFAHLRLTGVLP 129
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL-YVPDWQ 226
RIA+ Y V + +++ KY G W++ ++ +LL G Y PD
Sbjct: 130 RIALCYCVVSFVALYVN------------HKYIG-WIIGCLIAGYAVLLCIGNGYAPD-- 174
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
N + +IDR +LG HLY K
Sbjct: 175 ---------------------------DTNLLAIIDRNVLGADHLYHK------------ 195
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+P DPEGL S++ A LIG G +I+ + + L +
Sbjct: 196 ----------------SPIDPEGLTSTLSAIAHTLIGFCCGKIILAKEALEQKTLKLFVA 239
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAG-IYFM 391
L+ G L + LNK ++S ++ +T G + +L + IYF+
Sbjct: 240 GFILMACGFVLT-EALPLNKRIWSPTFVLVTCGLAAMLQSVLIYFI 284
>gi|298480127|ref|ZP_06998326.1| membrane protein [Bacteroides sp. D22]
gi|336404355|ref|ZP_08585053.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
gi|295085510|emb|CBK67033.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
gi|298273936|gb|EFI15498.1| membrane protein [Bacteroides sp. D22]
gi|335943683|gb|EGN05522.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
Length = 361
Score = 118 bits (296), Expect = 5e-24, Method: Compositional matrix adjust.
Identities = 100/333 (30%), Positives = 156/333 (46%), Gaps = 78/333 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L +F C+ T K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYDFQCRPAIT-KIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL + + ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITALLAVAI--------------PHKRFMPLAIILLIVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G A N VGMID ILG H+Y +
Sbjct: 160 ---------------------NGFEKSADNIVGMIDSAILGANHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I+ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ G L + LNK L+S S+ LT G
Sbjct: 233 GTTLLFAGYLLSY-ACPLNKRLWSPSFVLLTCG 264
>gi|299140549|ref|ZP_07033687.1| membrane protein [Prevotella oris C735]
gi|298577515|gb|EFI49383.1| membrane protein [Prevotella oris C735]
Length = 359
Score = 118 bits (295), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 99/336 (29%), Positives = 151/336 (44%), Gaps = 76/336 (22%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+++RL+SLDV RG TV LMILV++ G I + HS WNG+T D V PFFLFI+G+S
Sbjct: 2 EKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIST 61
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
L+L NF K + R + LF +G+F+ F I+ +D++ +R V+Q
Sbjct: 62 YLSLEKTNFTWSRQVAFKIVKRTVLLFFIGLFIN-WFDMAISG--NALDLSHLRIWAVMQ 118
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIAI Y ++ +L +R ++L Y LLL W
Sbjct: 119 RIAICYFAVSI--------------FALCCNHRHTIPAIVILLAAYSLLLI------WGN 158
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
+ ++ N + ID ++ GI+HLY
Sbjct: 159 GYAYDSQQ--------------------NILAQIDIRLFGIEHLYH-------------- 184
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
+P DPEG SS+ A LIG + G + K +++L ++I
Sbjct: 185 --------------NSPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKVLRFLITG 230
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
L+ +G + F G+ LNK ++S SY C+T G + V
Sbjct: 231 GFLVIIGYIVSF-GLPLNKRIWSPSYVCMTCGLAAV 265
>gi|350423601|ref|XP_003493532.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Bombus impatiens]
Length = 565
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 161/360 (44%), Gaps = 54/360 (15%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
Q +RR+ ++D RG + LMI V+D G + H+ WNGL D + P F++I+GV +
Sbjct: 177 QPAKRRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLFPCFIWIMGVCI 236
Query: 110 ALTYKNFPCKVVATRK----AILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
+ + ++ R+ + R++ LFL+G+ L N + G + IR GV
Sbjct: 237 PIAMSSQMKRMTPKRQILYGIVKRSILLFLIGLSL--------NTVSTGGQLETIRIFGV 288
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG------HWVVALVLTTLYLLLLYG 219
LQR I Y V AL L + + R+ + W V LV+ ++ ++ +
Sbjct: 289 LQRFGITYFVVALLYFLLMSRRPRKIQSPMLREVQDFLLLLPQWCVMLVIVVVHCVITFC 348
Query: 220 LYVPDW--QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
L VP Y P + + F+ G A G IDR IL HL+
Sbjct: 349 LNVPGCPTGYLGPGGLHDDAKY-FDCVGG----------AAGYIDRVILKEAHLHHS--- 394
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
A + P+DPEG+L ++ A +GLH G +++ +KD +
Sbjct: 395 -------------------ATVYKSGPYDPEGILGTLTAAFQVFLGLHAGIIMMTYKDWK 435
Query: 338 DRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYIS 396
+R++ W+ ++ +G L F + +NK L+SLS+ +T S L+ Y +V +
Sbjct: 436 ERVIRWLAWAAFFGCVGCVLHFTNVIPVNKKLWSLSFVFVTTSFSLAFLSACYLLVDVVK 495
>gi|328780782|ref|XP_396570.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Apis mellifera]
Length = 569
Score = 118 bits (295), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 161/360 (44%), Gaps = 54/360 (15%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGV-- 107
Q +RR+ ++D RG + LMI V+D G + H+ WNGL D + P F++I+GV
Sbjct: 181 QPSKRRVKAIDTVRGASTLLMIFVNDGSGGYRILGHATWNGLLPGDLLFPCFIWIMGVCI 240
Query: 108 --SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
++A K K + + R++ +FL+G+ +N + G + IR GV
Sbjct: 241 PIAMAGQMKRMLPKHMIFYGIVKRSILMFLIGL--------SLNTVSTGPQLETIRIFGV 292
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG------HWVVALVLTTLYLLLLYG 219
LQR I Y + AL + L ++ + ++ + W V LV+ ++ + +
Sbjct: 293 LQRFGITYFIVALIYLCLMTRKPKKTQSPMLKEVQDFLLLLPQWCVMLVIVAVHCFITFC 352
Query: 220 LYVPDWQ--YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
L VP Y P + + F+ G A G IDR IL HL+
Sbjct: 353 LKVPGCPTGYLGPGGLHDDAKY-FDCVGG----------AAGYIDRMILKESHLHH---- 397
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
A + P+DPEG+L ++ T +GLH G +++ +KD +
Sbjct: 398 ------------------SATVYKSGPYDPEGILGTLTTTFQVFLGLHAGIIMMTYKDWK 439
Query: 338 DRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYIS 396
+R++ W+ ++ +G L F + +NK L+SLS+ +T S L+ Y +V I
Sbjct: 440 ERVIRWLTWAAFFSCIGCILHFTNIIPVNKKLWSLSFVFVTTSFSLAFLSACYLLVDVIK 499
>gi|262406057|ref|ZP_06082607.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294648122|ref|ZP_06725665.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
gi|294806856|ref|ZP_06765681.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|345510562|ref|ZP_08790129.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
gi|229443274|gb|EEO49065.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
gi|262356932|gb|EEZ06022.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292636506|gb|EFF54981.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
gi|294445885|gb|EFG14527.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
Length = 361
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 100/346 (28%), Positives = 163/346 (47%), Gaps = 78/346 (22%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L NF C+ A K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYNFQCR-PAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL + + ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITALLAVTI--------------PHKKFMPLAIILLVVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G N VG++D ILG H+Y +
Sbjct: 160 ---------------------NGFEKSVDNIVGIVDSAILGSNHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I+ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ L+ G L + LNK L+S S+ LT G + + LA + +++
Sbjct: 233 GTTLLFAGYLLSY-ACPLNKRLWSPSFVLLTCGIATLSLALLLYII 277
>gi|307178500|gb|EFN67189.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
floridanus]
Length = 466
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 111/410 (27%), Positives = 184/410 (44%), Gaps = 80/410 (19%)
Query: 14 QLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILV 73
+ VI++ K +P N + D L SN R+ S+D FRG+ + LMI V
Sbjct: 37 KFVIRVIKGKLSPNNVHDDLDRLQEAETSN-PIIGTNRSSTRIRSVDTFRGIAILLMIFV 95
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL----- 128
++ GG NHS WNGLT+AD V+P+F +I+G+S+ ++ K +V +R I+
Sbjct: 96 NNRGGEYVFFNHSAWNGLTVADLVLPWFAWIMGLSITIS-KRSELRVSNSRTKIILRCLQ 154
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IWLKGDG 187
RA L LLG+ L + + NL++ GVLQ +A++Y V A E I+++
Sbjct: 155 RAFILILLGLMLNSIRSNSLQNLRFP---------GVLQLLAVSYFVCATIETIFMR--M 203
Query: 188 HVSSKLSLFRKYR---------GHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSP 238
H L F ++ W++ L + + L+ + L VP
Sbjct: 204 HSQDDLLQFGRFTFLRDILNNWAQWLIILAIVVTHTLITFLLPVP--------------- 248
Query: 239 WIFNVTCGVRGSTG-------PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
N G G G P C A G IDR + G H+Y K +P
Sbjct: 249 ---NCPTGYLGPGGYSHFGNFPNCTGGAAGYIDRLVFG-SHMYNKT----------KNPV 294
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
YG + P DPEG++++V + +G+H G +++ + R++ W++ S
Sbjct: 295 YGTI---------LPHDPEGIMNTVSIILVVYLGVHAGKILLLYYQCNARVVRWLLWSGV 345
Query: 350 ---LIGLGLSLDFVG--MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+ GL + D G + ++K + SLS+ + + +L ++F V Y
Sbjct: 346 TGIIAGLLCNFDKEGGVIPVSKKMMSLSFVLTVSCFAFLLFTILHFFVDY 395
>gi|380028317|ref|XP_003697852.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Apis florea]
Length = 555
Score = 117 bits (294), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 161/356 (45%), Gaps = 51/356 (14%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
Q +RR+ ++D RG + LMI V+D G + H+ WNGL D + P F++I+GV +
Sbjct: 172 QPSKRRVKAIDTVRGASTLLMIFVNDGSGGYRILGHATWNGLLPGDLLFPCFIWIMGVCI 231
Query: 110 ALTYKNFPCKVVATRKAIL-----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ + K + R IL R++ +FL+G+ +N + G + IR G
Sbjct: 232 PIAMAS-QMKRMLPRHVILYGIVKRSILMFLIGL--------SLNTVSTGPQLETIRVFG 282
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSK-LSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
VLQR I YL+ AL L ++ + F W V LV+ ++ ++ + L VP
Sbjct: 283 VLQRFGITYLIVALIYFCLMARKPKKTQVMQDFLLLLPQWCVMLVIVAVHCVITFCLKVP 342
Query: 224 DWQ--YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
Y P + + F+ G A G IDR IL HL+
Sbjct: 343 GCPTGYLGPGGLHDDAKY-FDCVGG----------AAGYIDRMILKEPHLHH-------- 383
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML 341
A + P+DPEG+L ++ T +GLH G +++ +KD ++R++
Sbjct: 384 --------------SATVYKSGPYDPEGILGTLTTTFQVFLGLHAGIIMMTYKDWKERVI 429
Query: 342 NWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYIS 396
W+ ++ +G L F + +NK L+SLS+ +T S L+ Y +V I
Sbjct: 430 RWLAWAAFFSCIGCILHFSNIIPVNKKLWSLSFVFVTTSFSLAFLSACYLLVDVIK 485
>gi|196012186|ref|XP_002115956.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
gi|190581732|gb|EDV21808.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
Length = 580
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 112/407 (27%), Positives = 183/407 (44%), Gaps = 83/407 (20%)
Query: 10 DANDQLVIQISKSTSA-PANEKLERDPLLPPSNSNSKQTRPQHQQ-----RRLISLDVFR 63
+ D+ VI ++S SA + E L + L ++ + + R + +R+ ++D FR
Sbjct: 197 NTTDKSVICRNRSCSAYHSTESLITEDLGNINDIDRRLARVSEESIHPLAQRIYAVDAFR 256
Query: 64 GLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVAT 123
GL + +MI V+ GG +PWNGLT AD ++P+F+FIVG+ +AL++ N + A+
Sbjct: 257 GLCITIMIFVNSGGGGYWYFRSTPWNGLTFADLILPWFIFIVGICIALSFYNHR-YITAS 315
Query: 124 R--------KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLV 175
R K + R++ LFL+G+FL GV+++ R G LQ++AI+Y+V
Sbjct: 316 RLPPSSAVLKVLSRSVILFLIGLFLND-----------GVNLSTWRIPGNLQKVAISYIV 364
Query: 176 AALCEIWLKGDGHVSSKLSLFRKYR---GHWVVALVLTTLYLLLLYGLYVPDWQYEFPVE 232
+L ++L + L R+ W+ + L ++YL L++ L VP
Sbjct: 365 VSLSVLYLAKPPDTITNLRAIREIVCIWKIWIGMIGLLSIYLSLIFALPVPG------CP 418
Query: 233 TSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP---IYSRTKQCSINSPD 289
T P + TG A G IDR I G HL P + RT
Sbjct: 419 TGYFGPGGLSDDANHYNCTG---GATGYIDRFIFG-NHLDANPSCKVLYRT--------- 465
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
PFD EG LS++ + +TC +GL +
Sbjct: 466 ------------HMPFDSEGCLSTLTSILTCFMGLQ------------------VATGVA 495
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYIS 396
L G ++ F+ LN+ L+SLSY L G + +L +Y ++ +S
Sbjct: 496 LCGGKQNMGFI--PLNRNLWSLSYITLLGGLAYFVLMMLYLLIDVMS 540
>gi|293371912|ref|ZP_06618316.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|292633158|gb|EFF51735.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
Length = 361
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 99/333 (29%), Positives = 156/333 (46%), Gaps = 78/333 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L NF C+ A K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYNFQCR-PAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL + + ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITALLAVTI--------------PHKRFMPLAIILLIVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G A N VG++D ILG H+Y +
Sbjct: 160 ---------------------NGFEKSADNIVGIVDSAILGSNHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I+ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ G L + LNK L+S S+ LT G
Sbjct: 233 GTTLLFAGYLLSY-ACPLNKRLWSPSFVLLTCG 264
>gi|189465173|ref|ZP_03013958.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
17393]
gi|189437447|gb|EDV06432.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
17393]
Length = 369
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 104/345 (30%), Positives = 156/345 (45%), Gaps = 78/345 (22%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILP--AINHSPWNGLTLADFVMPFFLFIVGVS- 108
Q++RL+SLDV RG+TV MILV++ GG L ++ HS WNGLTL D V PFFLFI+G+S
Sbjct: 14 QKKRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCDLVFPFFLFIMGIST 73
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+AL +F RK + R L + +G + +FH I + +A +R GVL
Sbjct: 74 YIALNKFHFQASGPVIRKILKRTLVILCIGWAIH--WFHFICEGDF-FPLAHLRLTGVLP 130
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIA+ Y + +++K KY G W++ ++ +LL G
Sbjct: 131 RIALCYCAVSFVALYVK------------PKYIG-WMIGFLIIGYAVLLGIG-------- 169
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
G T + N + +IDR +LG HLY K
Sbjct: 170 --------------------NGYTLDSTNILAIIDRNVLGADHLYHK------------- 196
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
+P DPEGL S++ A LIG G +I+ + + L +
Sbjct: 197 ---------------SPIDPEGLTSTLAAIAHTLIGFCCGRIILAKEALEQKTLKLFVAG 241
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAG-IYFM 391
L+ G L + LNK ++S ++ +T G + +L A IYF+
Sbjct: 242 FILMACGFVLT-EALPLNKRIWSPTFVLVTCGLAAMLQAVLIYFI 285
>gi|224537467|ref|ZP_03678006.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520905|gb|EEF90010.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
DSM 14838]
Length = 368
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 105/346 (30%), Positives = 156/346 (45%), Gaps = 80/346 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILP--AINHSPWNGLTLADFVMPFFLFIVGVS- 108
Q++RL+SLDV RG+TV MILV++ GG L ++ HS WNGLTL D V PFFLFI+G+S
Sbjct: 13 QKKRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCDLVFPFFLFIMGIST 72
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+AL +F RK + R L + +G + +FH I + + A +R GVL
Sbjct: 73 YIALGKFHFQASGSVVRKILKRTLVILCIGWAIH--WFHFICDGDF-FPFAHLRLTGVLP 129
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL-YVPDWQ 226
RIA+ Y V + +++ KY G W++ ++ +LL G Y PD
Sbjct: 130 RIALCYCVVSFVALYVN------------HKYIG-WIIGCLIAGYAVLLCIGNGYAPD-- 174
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
N + +IDR ILG HLY K
Sbjct: 175 ---------------------------DTNLLAIIDRNILGADHLYHK------------ 195
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+P DPEGL S++ A LIG G +I+ + + L +
Sbjct: 196 ----------------SPIDPEGLTSTLSAIAHTLIGFCCGKIILAKEALEQKTLKLFVA 239
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAG-IYFM 391
L+ G L + LNK ++S ++ +T G + +L + IYF+
Sbjct: 240 GFILMACGFVLT-EALPLNKRIWSPTFVLVTCGLAAMLQSVLIYFI 284
>gi|428300562|ref|YP_007138868.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
gi|428237106|gb|AFZ02896.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
Length = 402
Score = 116 bits (291), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 123/392 (31%), Positives = 168/392 (42%), Gaps = 111/392 (28%)
Query: 55 RLISLDVFRGLTVALMILVDDVG----GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RL SLDVFRG T+A MILV+ V + PA+ H+ W+G TLAD V PFFLFIVGV+++
Sbjct: 2 RLTSLDVFRGATIAGMILVNMVSLAEPNVYPALLHADWHGCTLADLVFPFFLFIVGVAMS 61
Query: 111 LTYKNF---------------------------------PCKVVATRKAILRALNLFLLG 137
++ + P K + R AIL AL LFL
Sbjct: 62 FSFAKYTDVIPKVEKEKDAIGALQQFLAKESSAAGGAKPPYKKIFRRGAILFALGLFL-- 119
Query: 138 IFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFR 197
F NL Y D + +R MGVLQRIA+ YL A+L + L
Sbjct: 120 -----NLFWNSKNLPY-FDFSTLRIMGVLQRIALTYLFASLIVLKLPK------------ 161
Query: 198 KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGS-TGPACN 256
+ W+VA VL Y LL+ +YVP +Y G G+ TG N
Sbjct: 162 --KAQWIVAGVLLVGYWLLM--MYVPIPEYG----------------AGEIGTRTG---N 198
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
IDR I+ HLY+ ++ ++G DPEGL S++ A
Sbjct: 199 FAAYIDRFIIPKAHLYKGDGFN----------NFG--------------DPEGLFSTIPA 234
Query: 317 TVTCLIGLHFGHLIVH--FKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYT 374
V+ L G G I K L + +I +LDF +NK L++ SY
Sbjct: 235 IVSVLGGYFSGQWIRSQPVKTRTSLGLALAGIGCLIIAWLFALDF---PINKKLWTSSYV 291
Query: 375 CLTAGASGVLLAGIYFMVRYISSHLMLKKPFD 406
T G + +LLAG Y ++ + KPF+
Sbjct: 292 IFTTGWALLLLAGCYELIE-VQKIRSWAKPFE 322
>gi|320168011|gb|EFW44910.1| hgsnat protein [Capsaspora owczarzaki ATCC 30864]
Length = 800
Score = 116 bits (290), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 113/393 (28%), Positives = 176/393 (44%), Gaps = 61/393 (15%)
Query: 16 VIQISKSTSAPANEKLERDPL--LPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILV 73
VIQ S ++ E LE D L + + P ++ R+ SLD FRG+ +++M+ V
Sbjct: 379 VIQ-STICASCGRELLEADRFCGLCGTRRDKASAPPAEERVRVRSLDTFRGIALSIMLFV 437
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR-----KAIL 128
+ GG +HS WNGLT+AD V P+F+F++GVS++L+++ + A R K I
Sbjct: 438 NYGGGGYWFFDHSTWNGLTVADLVFPWFIFMMGVSMSLSFEKLR-RRGAPRGALFLKVIR 496
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
R++ LF LG+FL + + A R GVLQR A++YL A +++
Sbjct: 497 RSMTLFALGLFL----------VCRQIIFATWRMPGVLQRFAVSYLFVAAIVMFVPIFAT 546
Query: 189 VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVR 248
+ + WVV + T++ + + VP T P G
Sbjct: 547 LPGPFRDLTSHWLQWVVIGIFITIHTCITFLYDVPG------CGTGYIGP-------GGI 593
Query: 249 GSTGPACN----AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
G G N A G ID ++ G +H+Y+ P T Q +
Sbjct: 594 GDFGQYMNCTGGAAGYIDSQVFG-RHIYQAP----TAQA---------------YYLTGA 633
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMH- 363
+DPEGLL + + V +G G ++V F H R+ W L LGL L +
Sbjct: 634 YDPEGLLGCLTSVVITFLGYQAGRILVTFSTHSARLRRWAAWGVGLGLLGLILCKGTQND 693
Query: 364 ----LNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+NK L+SLS+ + AG ++LA Y +V
Sbjct: 694 GWIPINKNLWSLSFVLIMAGLGYLMLASCYIIV 726
>gi|75909960|ref|YP_324256.1| hypothetical protein Ava_3756 [Anabaena variabilis ATCC 29413]
gi|75703685|gb|ABA23361.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 375
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 114/349 (32%), Positives = 158/349 (45%), Gaps = 77/349 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MILV+ G + P + H+ W+G T D V PFFLFIVGV+++
Sbjct: 2 RLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMSF 61
Query: 112 -----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
T +N P V R RA LF+LG+ L G + GI + D++ IR MGVL
Sbjct: 62 SLSKYTQENKPTSAVYWR-IFRRAAILFVLGLLLNGFWNKGI----WTFDLSNIRIMGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI+++YL A+L + L+L RK G W++A VL Y L + + VPD+
Sbjct: 117 QRISLSYLFASL------------AVLNLPRK--GQWILAGVLLVGYWLTMMYVPVPDYG 162
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
GV G N +DR I+ HLY +
Sbjct: 163 ------------------AGVLTREG---NFGAYVDRLIIPQAHLYAGDGFKNLG----- 196
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGL S++ A V+ L G G I + +
Sbjct: 197 -------------------DPEGLFSTIPAIVSVLAGYFTGEWIRKQSVQTRTSVGLALF 237
Query: 347 S-SCLI-GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
CLI G G F +NK L++ SY T+G + +LLA Y ++
Sbjct: 238 GIGCLIVGWGWGWVF---PINKKLWTSSYVVFTSGWALLLLAACYELIE 283
>gi|307209305|gb|EFN86390.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
saltator]
Length = 552
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 105/404 (25%), Positives = 190/404 (47%), Gaps = 62/404 (15%)
Query: 11 ANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALM 70
A ++++++ K +P + + D L +SN R R+ S+D FRG+++ LM
Sbjct: 120 ATCKVIVRVVKGKLSPDSVHDDLDRLQEAESSN-PVIRTSRVNTRIRSVDTFRGISILLM 178
Query: 71 ILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL-- 128
I V++ GG NHS WNGLT+AD V+P+F +I+G+S+ ++ K +V +R I+
Sbjct: 179 IFVNNGGGKYVFFNHSVWNGLTVADLVLPWFAWIMGLSITIS-KRSELRVSNSRGKIILR 237
Query: 129 ---RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IWLK 184
RA L LLG+ L N + + +R+ G+LQ +A++Y V A E I+++
Sbjct: 238 CLQRAFILILLGLML---------NSIHTKSLKDLRFPGILQLLAVSYFVCATIETIFMR 288
Query: 185 GDGHVSSKLSLFRKYR---------GHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSS 235
H L F ++ W + + + T + L+ + L V D
Sbjct: 289 --AHSQDDLLQFGRFTVLRDILDSWAQWSIIVAIATTHTLITFLLPVLD------CPKGY 340
Query: 236 SSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
P +++ TG A G IDR + G H+Y K ++P YG +
Sbjct: 341 LGPGGYHLFGKNANCTG---GAAGYIDRLVFG-SHMYNKT----------HNPVYGTI-- 384
Query: 296 DAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC---LIG 352
P+DPEG+++++ + +G+H G +++ + R++ W++ S + G
Sbjct: 385 -------LPYDPEGIMNTISVILVVYMGVHAGKILLLYYQCNARIVRWLLWSGVTGLIAG 437
Query: 353 LGLSLDFVG--MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+ D + +NK + SLS+ + + +L A ++F V Y
Sbjct: 438 ILCHFDKESGVIPVNKRMMSLSFVLTVSCFAFLLYAILHFFVDY 481
>gi|333378336|ref|ZP_08470067.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
22836]
gi|332883312|gb|EGK03595.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
22836]
Length = 387
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 98/353 (27%), Positives = 162/353 (45%), Gaps = 89/353 (25%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
RL+SLDV RG+T+A MILV++ G I + H+ WNGLT D + PFF+FI+G+S
Sbjct: 6 SSRLLSLDVLRGITIAGMILVNNPGSWGHIYAPLRHAEWNGLTPTDLIFPFFMFIMGIST 65
Query: 110 ALTYK--NFPCKVVATRKAILRALNLFLLGIFLQG-----GFFHGI--NNLKY------- 153
++ + NF V RK + R +FL+G+ L G +H + +NL +
Sbjct: 66 FISLRKFNFEFSVPTLRKILKRTFVIFLIGLGLSWLGVSFGTYHALAADNLGFLERLGRS 125
Query: 154 GVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLY 213
+ +R +GV+QR+A+ Y + +L I++K KY +++ + L +
Sbjct: 126 VTNFEHLRILGVMQRLALTYGITSLIAIFIK------------HKYI-PYIIVVGLVGYF 172
Query: 214 LLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYR 273
LLLL+G G N + + D+ ILG+ H+Y
Sbjct: 173 LLLLFG----------------------------NGFATEGYNILAVTDQSILGLNHMY- 203
Query: 274 KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHF 333
++G DPEG+LS++ A LIG + G +++
Sbjct: 204 --------------TEFG-------------LDPEGILSTIPAVCHVLIGFYCGKILMET 236
Query: 334 KDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
KD++ RML+ I+ + L G L + G +NK ++S ++ T G L+
Sbjct: 237 KDNQQRMLHLFIIGAILTFSGFLLSY-GCPINKKIWSPTFELTTCGLGATFLS 288
>gi|298491757|ref|YP_003721934.1| hypothetical protein Aazo_3034 ['Nostoc azollae' 0708]
gi|298233675|gb|ADI64811.1| Protein of unknown function DUF2261, transmembrane ['Nostoc
azollae' 0708]
Length = 376
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 108/359 (30%), Positives = 163/359 (45%), Gaps = 71/359 (19%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MILV+ VG ++H+ WNG T D V PFFLFIVGV++
Sbjct: 2 RLTSLDVFRGITIAGMILVNMVGVADHKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMTF 61
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
+ + T+ LR L + L N + D++ IR+MGVLQRI++
Sbjct: 62 SLSKYTADNKPTKAVYLRILRRAAILFLLGLLLNGFWNKGVWTFDLSSIRFMGVLQRISL 121
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
+YL A+L + + G + WV+A VL Y L + + VPD+
Sbjct: 122 SYLFASLIVLKVPG--------------KNQWVLAGVLLIGYWLTMMYVPVPDYG----- 162
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
GV G N G IDR I+ HLY+ ++
Sbjct: 163 -------------AGVLTREG---NFGGFIDRLIIPKAHLYKGDGFNYLG---------- 196
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM--LNWIILSSC 349
DPEGL S++ A V+ L+G +F + + + H + +++++ C
Sbjct: 197 --------------DPEGLYSTIPAIVSVLVG-YFAGIRIKERKHLNSQTSMDFVLFGLC 241
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM--VRYISSHLMLKKPFD 406
+ + + D V +NK L++ SY T G + +LLA Y + VR I KPF+
Sbjct: 242 CLVVSIIWD-VAFPINKKLWTSSYVVFTTGWALMLLAACYELIEVRLIKRW---SKPFE 296
>gi|440684188|ref|YP_007158983.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
gi|428681307|gb|AFZ60073.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
Length = 376
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 117/363 (32%), Positives = 165/363 (45%), Gaps = 79/363 (21%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MILV+ VG ++H+ WNG T D V PFFLFIVGV++
Sbjct: 2 RLTSLDVFRGITIAGMILVNMVGVADNKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMTF 61
Query: 112 -----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
T N P K V R + RA LF+LG+ L G + G+ + D++ IR MGVL
Sbjct: 62 SLSKYTADNKPTKAVYLR-ILRRAAILFILGLLLNGFWNKGV----WTFDLSSIRLMGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI+++YL A+L L L RK ++A VL Y L + +Y+P +
Sbjct: 117 QRISLSYLFASLIV------------LKLPRK--SQLILAGVLLIGYWLTM--MYIPVPE 160
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
Y V T N IDR I+ LY+ ++
Sbjct: 161 YGAGVLTREG-------------------NFGAFIDRLIIPKAQLYKGDGFNFMG----- 196
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM-LNWII 345
DPEGL S++ A V+ L G G I K + ++ ++
Sbjct: 197 -------------------DPEGLYSTIPAIVSVLAGYFTGEWIKDKKQANSQTSMDLVL 237
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM--VRYISSHLMLKK 403
C + +G+ D V +NK L++ SY T G + +LLA Y + VR I K
Sbjct: 238 FGLCCLVIGIIWD-VAFPINKKLWTSSYVVFTTGWALMLLAACYELIEVRVIKRW---SK 293
Query: 404 PFD 406
PF+
Sbjct: 294 PFE 296
>gi|332027964|gb|EGI68015.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
echinatior]
Length = 569
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 112/415 (26%), Positives = 197/415 (47%), Gaps = 58/415 (13%)
Query: 11 ANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALM 70
A +++I++ K +P N + D L +N R R+ S+D FRG+++ LM
Sbjct: 137 ATVKVIIRVVKGKLSPDNVHDDLDELQEAETAN-IMIRTNRSSIRIRSVDTFRGISILLM 195
Query: 71 ILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR-KAILR 129
I V++ GG NHS WNGLT+AD V+P+F +I+G+S+ ++ K +V +R K I+R
Sbjct: 196 IFVNNGGGQYMFFNHSAWNGLTVADLVLPWFAWIMGLSITIS-KRSELRVSNSRGKIIVR 254
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IWLKGDGH 188
L ++ I L + N Y + +R+ GVLQ +A++Y + A E I++K H
Sbjct: 255 CLQRTIILILLG-----LMLNSIYAKSLDDLRFPGVLQLLAVSYFICATIETIFMK--TH 307
Query: 189 VSSKLSLFRKYR---------GHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
+ F ++ W++ L + T ++L+ + L VP+ T P
Sbjct: 308 PQDDVLQFGRFTVLRDILNNWAQWLIILAIMTTHILITFLLPVPN------CPTGYLGPG 361
Query: 240 IFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPS 299
++ TG A G IDR + G H+Y K +P YG +
Sbjct: 362 GYHHFGEFANCTG---GAAGYIDRLVFG-SHMYSKT----------QNPVYGTI------ 401
Query: 300 WCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL--GLSL 357
P DPEG+++++ + +G+H G +++ + +++ W +L S + GL G+
Sbjct: 402 ---LPHDPEGIMNTISIILVVYLGVHAGKILLLYYQCNAKVIRW-LLWSFVTGLIAGILC 457
Query: 358 DFVG----MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPFDYS 408
DF + ++K + SLS+ + + +L A +Y V Y PF+Y+
Sbjct: 458 DFDKEFGVIPVSKKMMSLSFVLTVSCFAFLLYAILYVFVDY--KQYWSGAPFNYA 510
>gi|340727662|ref|XP_003402158.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Bombus terrestris]
Length = 554
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 101/361 (27%), Positives = 162/361 (44%), Gaps = 56/361 (15%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
Q +RR+ ++D RG + LMI V+D G + H+ WNGL D + P F++I+GV +
Sbjct: 166 QPAKRRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLFPCFIWIMGVCI 225
Query: 110 ALTYKNFPCKVVATRKAIL-----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ + K + + IL R++ LFL+G+ L N + G + IR G
Sbjct: 226 PIAMSS-QMKRMTLKHQILYGIVKRSILLFLIGLSL--------NTVSTGGQLETIRIFG 276
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG------HWVVALVLTTLYLLLLY 218
VLQR I YLV AL L + + R+ + W V LV+ ++ + +
Sbjct: 277 VLQRFGITYLVVALLYFLLMSRRPSKIQSPMLREVQDFLLLLPQWCVMLVIVVVHCAITF 336
Query: 219 GLYVPDW--QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L VP Y P + + F+ G A G IDR IL HL+
Sbjct: 337 CLNVPGCPTGYLGPGGLHDDAKY-FDCVGG----------AAGYIDRMILKEAHLH---- 381
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
YS T S P+DPEG+L ++ +GLH G +++ +KD
Sbjct: 382 YSATVYKS------------------GPYDPEGILGTLTTAFQVFLGLHAGIIMMTYKDW 423
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYI 395
++R++ W+ ++ +G L F + +NK L+SLS+ +T S L+ Y +V +
Sbjct: 424 KERVIRWLAWAAFFGCVGCVLHFTNVIPVNKKLWSLSFVFVTTSFSLAFLSACYLLVDVV 483
Query: 396 S 396
Sbjct: 484 K 484
>gi|336412607|ref|ZP_08592960.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
3_8_47FAA]
gi|335942653|gb|EGN04495.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
3_8_47FAA]
Length = 361
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 98/333 (29%), Positives = 155/333 (46%), Gaps = 78/333 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RL+SLDV RG+TVA MILV++ G A H+ W+G + AD V P F+F++G+S
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
++L NF C+ A K I R+L L +G+ ++ F I++ Y D++Q+R MGV+Q
Sbjct: 64 YISLCKYNFQCR-PAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQ 120
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL-LYGLYVPDWQ 226
R+ I Y + AL + + ++ +A++L +Y + L+G
Sbjct: 121 RLGICYGITALLAVTI--------------PHKRFMPLAIILLIVYFIFQLFG------- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G N VG++D ILG H+Y +
Sbjct: 160 ---------------------NGFEKSVDNIVGIVDSAILGSNHMYLQG----------- 187
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+ DPEG+LS++ A +IG G +I+ KD+ RMLN ++
Sbjct: 188 ---------------RQFVDPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLFLI 232
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ G L + LNK L+S S+ LT G
Sbjct: 233 GTTLLFAGYLLSY-ACPLNKRLWSPSFVLLTCG 264
>gi|297172331|gb|ADI23307.1| uncharacterized conserved protein [uncultured nuHF2 cluster
bacterium HF0770_19K18]
Length = 373
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 159/363 (43%), Gaps = 81/363 (22%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+++ RL+SLD FRGLT+A MI+V+ G + + H+ W+G T D V PFFLFIVG
Sbjct: 4 KNKSDRLLSLDAFRGLTIAFMIIVNTPGNWSYVYGPLRHAEWHGCTPTDLVFPFFLFIVG 63
Query: 107 VSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
V++ ++ N+ +K R + +F G+ L F ++ D + +R MG
Sbjct: 64 VAMRFSFAQHNYQPSSDLLKKIFWRTVTIFSFGLLLNAYPF-----IRQNWDWSSLRIMG 118
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLY--LLLLYGLYV 222
VLQRI +AY +AA+ LSL+ + W+ ++ Y +LLL+G
Sbjct: 119 VLQRIGLAYGLAAI--------------LSLYLSEKKLWISCGIILIGYWLILLLFG--- 161
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
S P+ G + N ID ILG HL+R
Sbjct: 162 ------------GSDPF------------GLSSNIARTIDIAILGENHLWRG-------- 189
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
PFDPEGLLS++ A VT LIG G LI + +
Sbjct: 190 ------------------TGIPFDPEGLLSTIPAIVTVLIGFSIGQLIQENSNRISLVQT 231
Query: 343 WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLK 402
+I + + +G F+ +NK L++ +Y T G + LA +++ I + L
Sbjct: 232 ILIRGAGIAAVGWLWGFI-FPINKQLWTSTYVLYTGGLASFFLAAFIWLID-IRGYKKLS 289
Query: 403 KPF 405
PF
Sbjct: 290 WPF 292
>gi|282898832|ref|ZP_06306819.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281196359|gb|EFA71269.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 375
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 101/347 (29%), Positives = 151/347 (43%), Gaps = 73/347 (21%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RLISLDVFRGLT+A+MI+ + G + P ++H+PWNG T D V PFFLFIVGV+++
Sbjct: 2 RLISLDVFRGLTIAMMIIANMAGVVPDVYPFLSHAPWNGCTPTDLVFPFFLFIVGVAMSF 61
Query: 112 TYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ + + RA+ LF LG+ L G + G+ + D+ +R MGVLQ
Sbjct: 62 SLSKYSLESKLDNLVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVLQ 117
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RI +AYL A+L + L + W +A +L Y L + + VPD+
Sbjct: 118 RIGLAYLFASLIVLKLPE--------------KTQWALAGILLIFYWLTMMYIPVPDYG- 162
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
G N IDR I+ HLY ++
Sbjct: 163 --------------------AGMLTREGNFGAFIDRLIIAKPHLYAGDGFNFRG------ 196
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
DPEGL S++ A V L G G I + ++ ++
Sbjct: 197 ------------------DPEGLFSTIPAIVNVLFGYFAGQWIRKSTINSHTSMDLVLWG 238
Query: 348 SCLIGLGLSLDFVGMH-LNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
C + +G+ D G+ +NK L++ SY + G V LA Y ++
Sbjct: 239 LCSLVVGMIWD--GLFPINKKLWTSSYVLFSTGWGLVFLAACYDLIE 283
>gi|427728534|ref|YP_007074771.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
gi|427364453|gb|AFY47174.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
Length = 380
Score = 115 bits (288), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 165/371 (44%), Gaps = 78/371 (21%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFL 102
Q+ + RL SLDVFRG+T+A MILV+ G + + H+ W+G T D V PFFL
Sbjct: 7 QSLDKTLPMRLTSLDVFRGITIAAMILVNMAGVADDVYLPLTHADWHGCTPTDLVFPFFL 66
Query: 103 FIVGVSLAL-----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
FIVGV++ T N P + R + RA LF+LG+FL G + G+ + D
Sbjct: 67 FIVGVAMTFSLSKYTQDNKPTSAIYWR-ILRRAAILFILGLFLNGFWNQGV----WTFDF 121
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
IR MGVLQRI+++YL+A+L L L RK G W++A VL Y L +
Sbjct: 122 TSIRMMGVLQRISLSYLLASLIV------------LKLPRK--GQWLLAGVLLIGYWLAM 167
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
+ VPD+ GV G N +DR I+ HLY+ +
Sbjct: 168 MYIPVPDYG------------------AGVLTREG---NFGAYVDRLIIPKAHLYKGDGF 206
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
+ DPEGL S++ A V+ L G G I
Sbjct: 207 NFMG------------------------DPEGLFSTIPAIVSVLAGYFTGEWIRKQPVQT 242
Query: 338 DRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISS 397
+ + + +G + +V +NK L++ SY T+G + +LLA Y I
Sbjct: 243 RTSVGLALFGIGCLMIGWAWGWV-FPINKKLWTSSYVVFTSGWALLLLAACY---ELIEV 298
Query: 398 HLMLK--KPFD 406
LM + KPF+
Sbjct: 299 RLMRRWSKPFE 309
>gi|119512372|ref|ZP_01631456.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
gi|119462961|gb|EAW43914.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
Length = 369
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 112/344 (32%), Positives = 156/344 (45%), Gaps = 73/344 (21%)
Query: 55 RLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MILV+ G + P + H+ W+G T D V PFFLFIVGV+++
Sbjct: 2 RLSSLDVFRGITIAAMILVNMAGVAGEVYPPLAHADWHGCTPTDLVFPFFLFIVGVAMSF 61
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
+ + K + RA LF LG+ L G + GI + D+++IR MGVLQRI++
Sbjct: 62 SLSKYTEK--GYSRIFRRAAILFALGLLLNGFWNQGI----WTFDLSKIRIMGVLQRISL 115
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
AYL+A+L + L+L RK G W++A VL Y L + +YVP +Y V
Sbjct: 116 AYLLASL------------AVLNLPRK--GQWILAGVLLIGYWLTM--MYVPVPEYGAGV 159
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
T N IDR I+ HLY Y
Sbjct: 160 LTREG-------------------NFGAYIDRLIIPQVHLYAGDGY-------------- 186
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH--FKDHRDRMLNWIILSSC 349
Q DPEGL S++ A V L G G I + K L +
Sbjct: 187 ----------QNLGDPEGLFSTIPAVVNVLAGYFTGQWIRNQPVKTRTSIGLGLFGIGCL 236
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+IG F +NK L++ SY + G + +LLA Y ++
Sbjct: 237 IIGWAWGWIF---PINKKLWTSSYVVFSNGWALLLLAACYELIE 277
>gi|26330552|dbj|BAC29006.1| unnamed protein product [Mus musculus]
gi|74213594|dbj|BAE35603.1| unnamed protein product [Mus musculus]
gi|74225342|dbj|BAE31601.1| unnamed protein product [Mus musculus]
Length = 624
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 174/394 (44%), Gaps = 63/394 (15%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRR-----LISLDVFRGLTVALMILV 73
ISK+ ++ ++L L PS ++ Q + RR L +D FRGL + LM+ V
Sbjct: 187 ISKTIASRETDRLINSELGSPSRADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFV 246
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILR 129
+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K+ K + R
Sbjct: 247 NYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWR 306
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE--IWLKGDG 187
+ L +G+ + N + ++R GVLQR+ + Y V A+ E W
Sbjct: 307 SFLLICIGVIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPVPD 361
Query: 188 HVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFN 242
+ + S F W+ L L +++L L + L VP T P
Sbjct: 362 SCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVPG------CPTGYLGP---- 411
Query: 243 VTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAP 298
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 412 ---GGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEV------------- 455
Query: 299 SWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL----- 353
+DPEG+L ++ + V +G+ G ++V++KD +L C++GL
Sbjct: 456 -----AYDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVL 510
Query: 354 -GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L+ A +LL
Sbjct: 511 TKVSANEGFIPINKNLWSISYVTTLSCFAFFILL 544
>gi|332709783|ref|ZP_08429740.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
gi|332351381|gb|EGJ30964.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
Length = 366
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 112/351 (31%), Positives = 152/351 (43%), Gaps = 90/351 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+ +A MILV++ G + P + H+ W+G T D V P FLFI GV++A
Sbjct: 2 RLTSLDVFRGIAMASMILVNNPGSWSYVYPPLLHAKWHGFTPTDLVFPAFLFIAGVAMAF 61
Query: 112 T---YKNFPCKV------VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRW 162
+ Y N V + R AIL AL L L G Y D IR
Sbjct: 62 SLVKYTNNNQSVSQGYWRIGRRCAILFALGLLLNGF------------PTYNWD--TIRI 107
Query: 163 MGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYV 222
MGVLQRI++AY ++A+ + L+L R RG WV+ ++ Y + + V
Sbjct: 108 MGVLQRISLAYFLSAV------------AVLNLRR--RGLWVLTGIVLLGYWAAMSLVPV 153
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
PD+ G+ P N IDR +LG HLY+
Sbjct: 154 PDYG---------------------AGNLTPEGNFAAYIDRMVLGTNHLYK--------- 183
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
QA FDPEGL S+ A VT L G G + H L
Sbjct: 184 -------------------QAQFDPEGLFSTFPAVVTVLAGYFVGDWLRHQPIPSRTSLG 224
Query: 343 WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
++ +GLG DF +NK L++ SY +AG S +LLA Y ++
Sbjct: 225 LVLFGVGCLGLGWVWDF-WFPINKQLWTSSYVVFSAGWSMLLLAACYELIE 274
>gi|170027692|ref|XP_001841731.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167862301|gb|EDS25684.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 558
Score = 115 bits (287), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 97/314 (30%), Positives = 142/314 (45%), Gaps = 76/314 (24%)
Query: 16 VIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDD 75
++Q S++ S P L P S + S Q Q+ RL SLD FRG+ + LMI V+
Sbjct: 260 LVQRSRTPSEPQ--------LSPNSPTISVQATGVPQKTRLRSLDTFRGIAIMLMIFVNS 311
Query: 76 VGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY-----KNFPCKVVATRKAILRA 130
GG I H+ WNGL +AD V P+FLFI+GV + ++ +N P + + +R+
Sbjct: 312 GGGDYWWIEHATWNGLHVADLVFPWFLFIMGVCIPISLRSQLGRNVP-RYEILKNVAVRS 370
Query: 131 LNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH-- 188
L LFL+G+ L + IN G +A +R GVLQR +AY V + ++ +
Sbjct: 371 LKLFLIGLCL-----NSIN----GPTVADLRLFGVLQRFGVAYFVVSAIHLYCYQENDQL 421
Query: 189 ----VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVT 244
S + R ++ HWV+ + +YLL+++ + VP N
Sbjct: 422 QHPLARSHADILRLWK-HWVIVGTIVFVYLLVIFFVPVP------------------NCP 462
Query: 245 CGVRGSTG-------PACNA--VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
G G G P C G IDR++LGI+HLY+ P Y M
Sbjct: 463 SGYFGPGGKHLMLLYPNCTGGITGYIDRQVLGIRHLYQHPTARYM---------YDAM-- 511
Query: 296 DAPSWCQAPFDPEG 309
PFDPEG
Sbjct: 512 --------PFDPEG 517
>gi|195133238|ref|XP_002011046.1| GI16326 [Drosophila mojavensis]
gi|193907021|gb|EDW05888.1| GI16326 [Drosophila mojavensis]
Length = 570
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 113/383 (29%), Positives = 173/383 (45%), Gaps = 56/383 (14%)
Query: 27 ANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHS 86
A ++ D ++S Q++R+ SLD FRGL++ LMI V+ GG I H+
Sbjct: 153 AWKRYRYDEAAAVADSIGDAAAKATQRQRMRSLDTFRGLSIVLMIFVNSGGGGYSWIEHA 212
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQG 142
WNGL LAD V P FL+I+GV + L+ K + ++ + L RA LF +G+ L
Sbjct: 213 AWNGLHLADIVFPTFLWIMGVCIPLSIKAQLGRGISKPRICLRIVWRACKLFAIGLCL-- 270
Query: 143 GFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK---- 198
N G + Q+R MGVLQR IAYLVA + ++S + + R
Sbjct: 271 -------NSTNGPQLEQLRLMGVLQRFGIAYLVAGVLHTICSRRDYLSPQRAWQRAIYDI 323
Query: 199 --YRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPAC- 255
+ G V L L YL L +GL VP + + + P C
Sbjct: 324 CLFSGELAVLLALIAAYLGLTFGLRVPGCPRGYLGPGGKHN-----------NAADPNCI 372
Query: 256 -NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSV 314
A G +DR ILG H+Y+ P S FDPEG+ +
Sbjct: 373 GGAAGYVDRLILGNAHIYQHPTAKFVYDAS-------------------AFDPEGVFGCL 413
Query: 315 MATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL-DFVGMH----LNKALY 369
++ V ++G G ++ + R+ W++ ++ L LG +L F H +NK L+
Sbjct: 414 LSIVQAMLGCFAGVTLLVHVTWQARLRRWLLGATLLGVLGGALCGFSKEHGIIPVNKNLW 473
Query: 370 SLSYTCLTAGASGVLLAGIYFMV 392
SLS+ +T + VLL+ +Y++V
Sbjct: 474 SLSFVFVTVALALVLLSVLYYVV 496
>gi|23272280|gb|AAH24084.1| Hgsnat protein [Mus musculus]
gi|148700869|gb|EDL32816.1| DNA segment, Chr 8, ERATO Doi 354, expressed [Mus musculus]
Length = 624
Score = 114 bits (286), Expect = 6e-23, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 174/394 (44%), Gaps = 63/394 (15%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRR-----LISLDVFRGLTVALMILV 73
ISK+ ++ ++L L PS ++ Q + RR L +D FRGL + LM+ V
Sbjct: 187 ISKAIASRETDRLINSELGSPSRADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFV 246
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILR 129
+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K+ K + R
Sbjct: 247 NYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWR 306
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE--IWLKGDG 187
+ L +G+ + N + ++R GVLQR+ + Y V A+ E W
Sbjct: 307 SFLLICIGVIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPVPD 361
Query: 188 HVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFN 242
+ + S F W+ L L +++L L + L VP T P
Sbjct: 362 SCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVPG------CPTGYLGP---- 411
Query: 243 VTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAP 298
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 412 ---GGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEV------------- 455
Query: 299 SWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL----- 353
+DPEG+L ++ + V +G+ G ++V++KD +L C++GL
Sbjct: 456 -----AYDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVL 510
Query: 354 -GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L+ A +LL
Sbjct: 511 TKVSANEGFIPINKNLWSISYVTTLSCFAFFILL 544
>gi|115292433|ref|NP_084160.1| heparan-alpha-glucosaminide N-acetyltransferase [Mus musculus]
gi|341940800|sp|Q3UDW8.2|HGNAT_MOUSE RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
AltName: Full=Transmembrane protein 76
Length = 656
Score = 114 bits (286), Expect = 7e-23, Method: Compositional matrix adjust.
Identities = 112/397 (28%), Positives = 175/397 (44%), Gaps = 69/397 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRR-----LISLDVFRGLTVALMILV 73
ISK+ ++ ++L L PS ++ Q + RR L +D FRGL + LM+ V
Sbjct: 219 ISKTIASRETDRLINSELGSPSRADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFV 278
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILR 129
+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K+ K + R
Sbjct: 279 NYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWR 338
Query: 130 ALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCE--IWLK 184
+ L +G+ I N Y + ++R GVLQR+ + Y V A+ E W
Sbjct: 339 SFLLICIGVI--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKP 390
Query: 185 GDGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
+ + S F W+ L L +++L L + L VP T P
Sbjct: 391 VPDSCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVPG------CPTGYLGP- 443
Query: 240 IFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 444 ------GGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEV---------- 487
Query: 296 DAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-- 353
+DPEG+L ++ + V +G+ G ++V++KD +L C++GL
Sbjct: 488 --------AYDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCILGLIS 539
Query: 354 ----GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L+ A +LL
Sbjct: 540 IVLTKVSANEGFIPINKNLWSISYVTTLSCFAFFILL 576
>gi|74198170|dbj|BAE35261.1| unnamed protein product [Mus musculus]
Length = 624
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 110/394 (27%), Positives = 173/394 (43%), Gaps = 63/394 (15%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRR-----LISLDVFRGLTVALMILV 73
ISK+ ++ ++L L PS + Q + RR L +D FRGL + LM+ V
Sbjct: 187 ISKTIASRETDRLINSELGSPSRGDPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFV 246
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILR 129
+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K+ K + R
Sbjct: 247 NYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWR 306
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE--IWLKGDG 187
+ L +G+ + N + ++R GVLQR+ + Y V A+ E W
Sbjct: 307 SFLLICIGVIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPVPD 361
Query: 188 HVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFN 242
+ + S F W+ L L +++L L + L VP T P
Sbjct: 362 SCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVPG------CPTGYLGP---- 411
Query: 243 VTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAP 298
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 412 ---GGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEV------------- 455
Query: 299 SWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL----- 353
+DPEG+L ++ + V +G+ G ++V++KD +L C++GL
Sbjct: 456 -----AYDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCILGLISIVL 510
Query: 354 -GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L+ A +LL
Sbjct: 511 TKVSANEGFIPINKNLWSISYVTTLSCFAFFILL 544
>gi|74208071|dbj|BAE29143.1| unnamed protein product [Mus musculus]
Length = 656
Score = 114 bits (285), Expect = 9e-23, Method: Compositional matrix adjust.
Identities = 112/397 (28%), Positives = 174/397 (43%), Gaps = 69/397 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRR-----LISLDVFRGLTVALMILV 73
ISK+ ++ ++L L PS ++ Q + RR L +D FRGL + LM+ V
Sbjct: 219 ISKTIASRETDRLINSELGSPSRADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFV 278
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILR 129
+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K K + R
Sbjct: 279 NYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQRGCSKFKLLGKIVWR 338
Query: 130 ALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCE--IWLK 184
+ L +G+ I N Y + ++R GVLQR+ + Y V A+ E W
Sbjct: 339 SFLLICIGVI--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKP 390
Query: 185 GDGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
+ + S F W+ L L +++L L + L VP T P
Sbjct: 391 VPDSCTLESSCFSLRDITSSWPQWLTILTLESIWLALTFFLPVPG------CPTGYLGP- 443
Query: 240 IFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 444 ------GGIGDLGKYPHCTGGAAGYIDRLLLGDNHLYQHPSSTVLYHTEV---------- 487
Query: 296 DAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-- 353
+DPEG+L ++ + V +G+ G ++V++KD +L C++GL
Sbjct: 488 --------AYDPEGVLGTINSIVMAFLGVQAGKILVYYKDQTKAILTRFAAWCCILGLIS 539
Query: 354 ----GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L+ A +LL
Sbjct: 540 IVLTKVSANEGFIPINKNLWSISYVTTLSCFAFFILL 576
>gi|395501613|ref|XP_003755186.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Sarcophilus harrisii]
Length = 425
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 109/385 (28%), Positives = 168/385 (43%), Gaps = 64/385 (16%)
Query: 32 ERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGL 91
E P + S N+K + L SL GL++ LM+ V+ GG H+PWNGL
Sbjct: 7 ESRPRIDASVFNNKGKIINFRPWEL-SLVSKHGLSLTLMVFVNYGGGGYWFFEHAPWNGL 65
Query: 92 TLADFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHG 147
T+AD VMP+F+FI+G S+ LT+ N + V RK R L ++G+
Sbjct: 66 TVADLVMPWFVFILGTSVGLTFHNMQKRGVKNIQLLRKVAWRTGVLIIIGVLF------- 118
Query: 148 INNLKYGVDIAQIRWM-----GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSL----FRK 198
L YG + W GVLQR+ Y AL +I +++L FR
Sbjct: 119 ---LNYGPADGPLSWSWARLPGVLQRLGFTYFAVALMQIAFGVTDTQIYQVNLWWAPFRD 175
Query: 199 ---YRGHWVVALVLTTLYLLLLYGLYVPDW--QYEFPVETSSSSPWIFNVTCGVRGSTGP 253
Y W + + L L+L L + L VP Y P + FN T G
Sbjct: 176 VILYWKEWFIIISLEILWLCLTFLLPVPGCPRGYLGPGGIGDEGKY-FNCTGG------- 227
Query: 254 ACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSS 313
A ID+ ILG HLY+ P + + PFDPEG+L +
Sbjct: 228 ---AAAYIDKWILGENHLYQFPSCKELYKTT------------------QPFDPEGILGT 266
Query: 314 VMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMH------LNKA 367
+ + + GL G +I+ +++ +L ++ S L+G+ ++ +G +NK
Sbjct: 267 INSILMAFFGLQAGKIILMYRNKPRSILRRFLIWSILLGIISAILTMGTQNEGFIPVNKN 326
Query: 368 LYSLSYTCLTAGASGVLLAGIYFMV 392
L+SLS+ T+ S +LL +Y++
Sbjct: 327 LWSLSFVTTTSCFSFILLGLLYYIT 351
>gi|354472121|ref|XP_003498289.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cricetulus griseus]
Length = 782
Score = 114 bits (285), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 108/391 (27%), Positives = 169/391 (43%), Gaps = 68/391 (17%)
Query: 13 DQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQR-------RLISLDVFRGL 65
D ISK+ ++ ++L L PS ++ Q + R RL +D FRG+
Sbjct: 337 DDFSNWISKTITSRETDRLINSELGSPSRADPLSGDCQPETRHTSALPYRLRCVDTFRGI 396
Query: 66 TVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVV 121
+ LM+ V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K
Sbjct: 397 ALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLSMTSALHRGCSKFR 456
Query: 122 ATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEI 181
K R+ L +GI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 457 LLGKITWRSFLLICIGIIVVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLEL 511
Query: 182 WLKGDGHVSSKLSLFRKYRG---------HWVVALVLTTLYLLLLYGLYVPDWQYEFPVE 232
V + +L R Y W+V L+L +++L L + L VP
Sbjct: 512 IFSKP--VPDRCALERSYLSLRDITCSWPQWLVVLILESIWLALTFFLPVPG------CP 563
Query: 233 TSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSP 288
T P G G G P C A G ID +LG HLY+ P +
Sbjct: 564 TGYLGP-------GGIGDMGKYPHCTGGASGYIDHLLLGDNHLYQHPSSTVLYH------ 610
Query: 289 DYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSS 348
Q +DPEG+L ++ + V +G+ G +++++KD +L
Sbjct: 611 ------------TQVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKAILMRFTAWC 658
Query: 349 CLIGL------GLSLDFVGMHLNKALYSLSY 373
C++GL +S D + +NK L+S+SY
Sbjct: 659 CILGLISIALTKMSADEGFIPINKNLWSISY 689
>gi|372268395|ref|ZP_09504443.1| hypothetical protein AlS89_10850 [Alteromonas sp. S89]
Length = 395
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 104/356 (29%), Positives = 166/356 (46%), Gaps = 73/356 (20%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q RL+S+D+ RG+ +A M+LV++ G + + H+ W+G T D + P FLF+VG
Sbjct: 11 QVPHGRLMSVDLLRGIAIAAMVLVNNPGSWSFVYAPMAHAQWHGWTPTDVIFPLFLFVVG 70
Query: 107 VSLALTY---KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV---DIAQI 160
VS+ L+ +FP A RAL LF LG+FL FF+ + Y + I
Sbjct: 71 VSMVLSTGKRGDFPPVGWAQWS---RALKLFALGLFL-AIFFYNFRDASYNWIEDRLEGI 126
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
RWMGVLQRIA+ Y++ WL G +VA +L ++
Sbjct: 127 RWMGVLQRIALVYILCCYLVRWLPAKGL---------------LVAAILCSV-------- 163
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
VP W V S+S +F +G + +D+ +LG H+Y +
Sbjct: 164 -VP-WTLMLVVPYQSASGEVF------QGQLAFGNHFAAWLDQWLLGSAHVYYR------ 209
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM 340
D P FDPEG+L++ A TCL+G+ + +
Sbjct: 210 --------DAQPF----------AFDPEGVLTTFSAASTCLLGVLAALAWKSADSNGEAQ 251
Query: 341 L----NWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L NW++ + ++ +G + + + +NKAL+S S+ +TAG S +L+AG+Y++V
Sbjct: 252 LRLCRNWLVAGTLMVLVGQLMHSI-VPINKALWSPSFVLVTAGVSLLLMAGLYYLV 306
>gi|322790964|gb|EFZ15612.1| hypothetical protein SINV_04659 [Solenopsis invicta]
Length = 581
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 106/400 (26%), Positives = 191/400 (47%), Gaps = 53/400 (13%)
Query: 11 ANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALM 70
A ++++++ K +P N + D L + +++ R R+ S+D FRG+ + LM
Sbjct: 129 ATAKVILRVVKGKLSPDNIHDDLDEL-QEAETSTPIVRTSRSSTRIRSIDTFRGIALLLM 187
Query: 71 ILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR-KAILR 129
I VD+ GG NHS WNGLT+AD V+P+F +I+G+S+ ++ K +V +R K I R
Sbjct: 188 IFVDNGGGKYVFFNHSAWNGLTVADLVLPWFAWIMGLSITIS-KRSELRVSNSRMKIIFR 246
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IWLKGDGH 188
L L+ + L + N + +R+ GVLQ +A++Y V A E I++K
Sbjct: 247 CLQRALVLVLLG-----LMLNSMSMESLKHLRFPGVLQLLAVSYFVCATIETIFMKAHSQ 301
Query: 189 VS----SKLSLFRKYR---GHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF 241
+ S+ R W++ L + ++L+ + L VP+ + + S
Sbjct: 302 DDVLQFGRFSILRDILNNWAQWLIILAIMVTHILITFLLPVPNCPTGYLGPGGNYS---- 357
Query: 242 NVTCGVRGSTGPACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPS 299
R P C A G IDR + G +YS+T+ +P YG +
Sbjct: 358 ------RYGKFPNCTGGAAGYIDRLVFG------SHVYSKTQ-----NPVYGTI------ 394
Query: 300 WCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC---LIGLGLS 356
P DPEG+++++ + +G+H G +++ + R++ W++ SS + GL
Sbjct: 395 ---LPHDPEGIMNTMSIILVVYMGVHAGKILLLYYQCNGRVIRWLLWSSVTGLIAGLLCH 451
Query: 357 LDFVG--MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
D + ++K + SLS+ + + +L A ++F+V Y
Sbjct: 452 FDKESGVIPVSKKMMSLSFVLTVSCFAFLLYAILHFLVDY 491
>gi|333030942|ref|ZP_08459003.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
gi|332741539|gb|EGJ72021.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
Length = 385
Score = 113 bits (283), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 98/359 (27%), Positives = 169/359 (47%), Gaps = 78/359 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+RL+SLD+ RG T+ MILV++ G I + H+ W+GLT D + PFF+F++G+S
Sbjct: 2 SSKRLLSLDILRGGTIIGMILVNNPGSWEYIYSPLRHAEWHGLTPTDLIFPFFIFVMGIS 61
Query: 109 LALTYKNFP----CKVVATRKAILRALNLFLLG--IFLQGGFFHGINN-LKYG------V 155
++L++ F K + K I R+ LFLLG + GINN L+Y
Sbjct: 62 MSLSFSKFKNEEYNKTLFWEKVIKRSAKLFLLGLFLSWFSLLLEGINNRLEYESISEILF 121
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLL 215
QIR +GV+QR+A++YLV ++ + + H LV+T++ LL
Sbjct: 122 PFGQIRILGVMQRLALSYLVGSVFVMLIPKAKH------------------LVITSVILL 163
Query: 216 LLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
+ Y F + G + + N + ++D + G H+Y +
Sbjct: 164 IAY----------------------FILLSLGNGFSFSSDNIIAIVDNSLFGENHVYLEW 201
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
+ P + FDPEGLLS++ V ++G G +I KD
Sbjct: 202 L---------------------PDGERLRFDPEGLLSTIPCIVQVIMGYLCGEVIRKKKD 240
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
++M++ I+ L+ +GL L + G LNK ++S ++ +T+G + + L + +++ Y
Sbjct: 241 LLNKMMDLAIIGIVLLFIGLLLSY-GCPLNKKIWSPTFELVTSGFAVLALTLLIWIIDY 298
>gi|417411833|gb|JAA52338.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
[Desmodus rotundus]
Length = 595
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 109/397 (27%), Positives = 177/397 (44%), Gaps = 67/397 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPS-------NSNSKQTRPQHQQRRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ S+ RP RL +D FRGL + LM+
Sbjct: 156 ISKAMSSRETDRLINSELGSPSRAGQLGDDAQSEVWRPAAAPLRLRCVDTFRGLALILMV 215
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+ADFV P+F+FI+G S+ L+ + K K
Sbjct: 216 FVNYGGGQYWYFKHASWNGLTVADFVFPWFVFIMGSSIFLSMSSVLQRGCSKFRLLGKVA 275
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +G+ + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 276 WRSFLLICIGVIVVNP-----NYCLGPLSWDKVRLPGVLQRLGVTYFVVAVLELLFAKPV 330
Query: 186 ------DGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
+G SS + + W+ L+L +++L L + L VP T P
Sbjct: 331 PERGAWEGRCSSLQDIMSSWP-QWLFILMLESIWLALTFFLPVPG------CPTGYLGP- 382
Query: 240 IFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
G G G P C A G IDR +LG H+Y+ P + +
Sbjct: 383 ------GGIGDLGKYPNCTGGAAGYIDRLLLGEDHIYQHPSSTVLYHTRV---------- 426
Query: 296 DAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-- 353
+DPEG+L ++ + V +G+ G +++++K+ +L C +GL
Sbjct: 427 --------AYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILIRFTAWCCFLGLIS 478
Query: 354 ----GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L++ A +LL
Sbjct: 479 VGLTKISENEGFIPVNKNLWSISYVTTLSSFAFFILL 515
>gi|282896863|ref|ZP_06304869.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281198272|gb|EFA73162.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 375
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 100/347 (28%), Positives = 153/347 (44%), Gaps = 73/347 (21%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RLISLDVFRGLT+A+MI+ + G + P ++H+ WNG T D V PFFLFIVGV+++
Sbjct: 2 RLISLDVFRGLTIAMMIIANMAGVAPDVYPFLSHALWNGCTPTDLVYPFFLFIVGVAMSF 61
Query: 112 TYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ + + + RA+ LF LG+ L G + G+ + D+ +R MGVLQ
Sbjct: 62 SLSKYSLESKLDKFVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVLQ 117
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RI++AYLVA+L + L + W +A +L Y L + + VPD+
Sbjct: 118 RISLAYLVASL--------------IVLKFPEKTQWALAGILLIFYWLTMMYIPVPDYG- 162
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
G N IDR I+ HLY ++
Sbjct: 163 --------------------AGMLTREGNFGAFIDRLIIAKPHLYAGDGFNFRG------ 196
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
DPEGL S++ A V L G G + + ++ ++
Sbjct: 197 ------------------DPEGLFSTIPAIVNVLFGYFAGQWMRKSTINSHTSMDLVLWG 238
Query: 348 SCLIGLGLSLDFVGMH-LNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
C + +G+ D G+ +NK L++ SY + G V LA Y ++
Sbjct: 239 LCSLVVGMIWD--GLFPINKKLWTSSYVLFSTGWGLVFLAACYDLIE 283
>gi|153808903|ref|ZP_01961571.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
gi|423220258|ref|ZP_17206753.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
CL03T12C61]
gi|149128236|gb|EDM19455.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
gi|392623335|gb|EIY17438.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
CL03T12C61]
Length = 371
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 154/340 (45%), Gaps = 77/340 (22%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+RL++LDV RG+T+A MILV++ G + + H+ WNGLT D + PFF+FI+G+S
Sbjct: 5 SNKRLLALDVMRGITIAGMILVNNPGSWGYVYFPLKHAQWNGLTPTDLIFPFFMFIMGIS 64
Query: 109 --LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
++L NF A K I R + +FL+GI + +F + + AQIR +GV+
Sbjct: 65 TYISLRKYNFTFSTPAALKIIKRTIVIFLIGIAIN--WFALLCYYHDPLPFAQIRVLGVM 122
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QR+A+ Y +AL + +K KY + +VAL L ++LL+ G
Sbjct: 123 QRLALCYGASALIALLIK------------HKYIPYLIVAL-LVGYFILLITG------- 162
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G N + ++DR ILG H+Y+
Sbjct: 163 ---------------------NGFAYNETNILSIVDRSILGDAHMYQ------------- 188
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGLLS++ + LIG G L++ KD R+++ ++
Sbjct: 189 ---------------DNHIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIREKLERLFLI 233
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+ L G L + G NK ++S ++ +T G LA
Sbjct: 234 GTILTFAGFLLSY-GCPFNKKIWSPTFVLVTCGLGSSFLA 272
>gi|157112232|ref|XP_001657450.1| hypothetical protein AaeL_AAEL000933 [Aedes aegypti]
gi|108883723|gb|EAT47948.1| AAEL000933-PA [Aedes aegypti]
Length = 569
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 112/401 (27%), Positives = 178/401 (44%), Gaps = 75/401 (18%)
Query: 33 RDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLT 92
R+ L ++ +S + ++ RL SLD FRG+ + LMI V+ GG I H+ WNGL
Sbjct: 159 REAPLAAASPSSSGHPVEPRKTRLQSLDTFRGIAIMLMIFVNSGGGHYWWIEHATWNGLH 218
Query: 93 LADFVMPFFLFIVGVSLALTY-----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHG 147
+AD V P+FLFI+GV + ++ +N P K + A+ R+ LF +G+ L +
Sbjct: 219 VADLVFPWFLFIMGVCIPISLRSQVSRNIPRKTILANVAV-RSFKLFCIGLCL-----NS 272
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIW-----LKGDGHVSS-KLSLFRKYRG 201
IN G +A +R GVLQR +AY V + ++ ++ G ++ + + R ++
Sbjct: 273 IN----GPQVANLRLFGVLQRFGVAYFVVSAIHLYCYSESIEFQGRLARLNVDILRLWK- 327
Query: 202 HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG-------PA 254
HW++ + +YLL+++ + P G G G P
Sbjct: 328 HWIIMGAIVFIYLLIMFLVAAP------------------GCPSGYFGPGGKHLMAMYPN 369
Query: 255 CNA--VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLS 312
C G +DR ILG HLY+ P DA + FDPEG
Sbjct: 370 CTGGITGYLDRIILGNNHLYQHPTARYV--------------YDAQA-----FDPEGPFG 410
Query: 313 SVMATVTCLIGLHFGHLIVHFKD---HRDRMLNW--IILSSCLIGLGLSLDFVGMHLNKA 367
+ + +GL G LI+ + RM W ++ I G S + + +NK
Sbjct: 411 CLPTILQVFLGLQCGVLILTHTEVMARIRRMAAWGTVLGLLGGILCGFSKNDGWIPVNKN 470
Query: 368 LYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPFDYS 408
L+SLSY +TA + VLL + ++ L PF Y+
Sbjct: 471 LWSLSYVLVTASLAFVLLLICFVLID--VKRLWTGNPFLYA 509
>gi|297491309|ref|XP_002698775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
taurus]
gi|296472360|tpg|DAA14475.1| TPA: Heparan-alpha-glucosaminide N-acetyltransferase-like [Bos
taurus]
Length = 723
Score = 112 bits (281), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 110/399 (27%), Positives = 176/399 (44%), Gaps = 62/399 (15%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQR-----RLISLDVFRGLTVALMILV 73
ISK+ ++ ++L L PS ++ Q P+ +R RL +D FRG+ + LM+ V
Sbjct: 288 ISKAINSRETDRLINSELGSPSRASDPQ--PEAWRRSAAPLRLRCVDTFRGMALILMVFV 345
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILR 129
+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K K R
Sbjct: 346 NYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQRGCSKFRLLGKIAWR 405
Query: 130 ALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG- 185
+ L +GIF + N KY + + R GVLQR+ Y V A+ E+
Sbjct: 406 SFLLICIGIF--------VVNPKYCLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKP 457
Query: 186 -DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
+S+ S F W+ L+L ++L L + L VP T P
Sbjct: 458 VPETCASERSCFSLLDITASWPQWLFVLILEGVWLALTFFLPVPG------CPTGYLGPG 511
Query: 240 IFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPS 299
R TG A G +DR +LG QHLY+ P + +
Sbjct: 512 GIGDGGRYRNCTG---GAAGYVDRLLLGDQHLYQHPSSAVLYHTEV-------------- 554
Query: 300 WCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL------ 353
+DPEG+L ++ + V +G+ G +++++KD +L CL+GL
Sbjct: 555 ----AYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTRGILIRFAAWGCLLGLVSVALT 610
Query: 354 GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
S + + +NK L+S+SY + + ++L +Y +V
Sbjct: 611 KASENEGFIPVNKNLWSISYVTTLSSLAFLILLALYPVV 649
>gi|427709244|ref|YP_007051621.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
gi|427361749|gb|AFY44471.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
Length = 375
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 161/363 (44%), Gaps = 80/363 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MILV+ G + P + H+ W+G T D V PFFLFIVGV++
Sbjct: 2 RLTSLDVFRGITIAAMILVNMAGVADDVYPLLAHADWHGCTPTDLVFPFFLFIVGVAMTF 61
Query: 112 -----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
T N P V R RA LF LG+ L + G+ D + IR MGVL
Sbjct: 62 SLSKYTADNKPTSTVYLR-IFRRAAILFALGLLLNVFWNKGVGTF----DFSSIRIMGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI+++YL+A+L + L+L RK G W++A VL Y L + +YVP +
Sbjct: 117 QRISLSYLLASL------------AVLNLPRK--GQWILAAVLLIGYWLTM--MYVPVPE 160
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
Y V T N DR I+ HLY +
Sbjct: 161 YGAGVLTREG-------------------NFGAYFDRLIIPQTHLYAGDGFKSMG----- 196
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGL S++ A V+ L G G I + I+
Sbjct: 197 -------------------DPEGLFSTIPAVVSVLAGYFTGQWIRKQPVQTRTSVGLILF 237
Query: 347 S-SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM--VRYISSHLMLKK 403
CLI +G + +V +NK L++ SY T+G + +LLA Y + VR I K
Sbjct: 238 GMGCLI-IGWAWGWV-FPINKKLWTSSYVIFTSGWALLLLAACYELIEVRLIKRW---SK 292
Query: 404 PFD 406
PF+
Sbjct: 293 PFE 295
>gi|149057830|gb|EDM09073.1| rCG43316 [Rattus norvegicus]
Length = 626
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 110/399 (27%), Positives = 176/399 (44%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISL-------DVFRGLTVALMI 71
ISK+ ++ ++L L PS ++ Q + RR +L D FRG+ + LM+
Sbjct: 187 ISKAIASRETDRLINSELGSPSRADPLGADCQPETRRASALPHRLRCVDTFRGVALILMV 246
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K+ K
Sbjct: 247 FVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMTSLLQRGCSKIKLLGKIA 306
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
R+ L +GI + N + ++R GVLQR+ + Y V A+ E++
Sbjct: 307 WRSFLLICIGIIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAMLELFFWKP- 360
Query: 188 HVSSKLSLFRKYRG---------HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSP 238
V +L R W++ L+L +++L L + L VP T P
Sbjct: 361 -VPDSCTLERSCLSLRDITSSWPQWLIILILESIWLALTFFLPVPG------CPTGYLGP 413
Query: 239 WIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMP 294
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 414 -------GGIGDLGKYPHCTGGAAGYIDRLLLGDSHLYQHPSSAVLYHTEV--------- 457
Query: 295 LDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLG 354
+DPEG+L ++ + V +G+ G +++++KD +L C++GL
Sbjct: 458 ---------AYDPEGVLGTINSIVMAFLGVQAGKILLYYKDQTKAILIRFAAWCCILGL- 507
Query: 355 LSLDFVGMH-------LNKALYSLSY-TCLTAGASGVLL 385
+S+ M +NK L+S+SY T L+ A +LL
Sbjct: 508 ISIALTKMSANEGFIPINKNLWSISYVTTLSCFAFSILL 546
>gi|414077874|ref|YP_006997192.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
gi|413971290|gb|AFW95379.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
Length = 376
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 161/362 (44%), Gaps = 77/362 (21%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MIL + G + ++H+ W+G T D + P FLFIVGV++
Sbjct: 2 RLSSLDVFRGITIAAMILANMAGVADDVYRPLSHAQWHGCTPTDLIFPCFLFIVGVAMTF 61
Query: 112 TYKNFPCKVVATRKAILRALN----LFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ + + T+ LR L LF+LG+ L G + G+ + D++ IR MGVLQ
Sbjct: 62 SLAKYTAQNKPTKAVYLRILRRTAILFILGLVLNGFWNQGV----WTFDLSSIRLMGVLQ 117
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIA+ YL A+L L L RK W+VA L Y L + + VPD+
Sbjct: 118 RIALTYLFASLIV------------LKLPRK--SQWLVAGGLLIAYWLTMMYIPVPDYG- 162
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
GV G N IDR I+ HLY+ ++
Sbjct: 163 -----------------AGVLTREG---NFGAFIDRLIIPKAHLYKGDGFNFLG------ 196
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM-LNWIIL 346
DPEGL S++ A V+ L G G I K + ++ ++
Sbjct: 197 ------------------DPEGLFSTIPAIVSVLAGYFTGQWIKDKKQATSQTSMDLVLF 238
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM--VRYISSHLMLKKP 404
C + + + D V +NK +++ SY T G + +LLA Y + VR I KP
Sbjct: 239 GLCCLVIAIIWD-VSFPINKKIWTSSYVLFTTGWALMLLAACYELIEVRLIKRW---SKP 294
Query: 405 FD 406
F+
Sbjct: 295 FE 296
>gi|194679266|ref|XP_588978.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
taurus]
Length = 734
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 110/399 (27%), Positives = 176/399 (44%), Gaps = 62/399 (15%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQR-----RLISLDVFRGLTVALMILV 73
ISK+ ++ ++L L PS ++ Q P+ +R RL +D FRG+ + LM+ V
Sbjct: 299 ISKAINSRETDRLINSELGSPSRASDPQ--PEAWRRSAAPLRLRCVDTFRGMALILMVFV 356
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILR 129
+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K K R
Sbjct: 357 NYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGTSIFLSMTSILQRGCSKFRLLGKIAWR 416
Query: 130 ALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG- 185
+ L +GIF + N KY + + R GVLQR+ Y V A+ E+
Sbjct: 417 SFLLICIGIF--------VVNPKYCLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKP 468
Query: 186 -DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
+S+ S F W+ L+L ++L L + L VP T P
Sbjct: 469 VPETCASERSCFSLLDITASWPQWLFVLILEGVWLALTFFLPVPG------CPTGYLGPG 522
Query: 240 IFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPS 299
R TG A G +DR +LG QHLY+ P + +
Sbjct: 523 GIGDGGRYRNCTG---GAAGYVDRLLLGDQHLYQHPSSAVLYHTEV-------------- 565
Query: 300 WCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL------ 353
+DPEG+L ++ + V +G+ G +++++KD +L CL+GL
Sbjct: 566 ----AYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTRGILIRFAAWGCLLGLVSVALT 621
Query: 354 GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
S + + +NK L+S+SY + + ++L +Y +V
Sbjct: 622 KASENEGFIPVNKNLWSISYVTTLSSLAFLILLALYPVV 660
>gi|345482764|ref|XP_001600799.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Nasonia vitripennis]
Length = 569
Score = 112 bits (279), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 89/349 (25%), Positives = 163/349 (46%), Gaps = 51/349 (14%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMP 99
++ + + + +RR+ SLD RG+++ LMI V++ + H+ WNGL + D V P
Sbjct: 172 ADEDELEVGKKTAKRRVRSLDTVRGMSILLMIFVNNGAAGYALLEHATWNGLLVGDLVFP 231
Query: 100 FFLFIVGV----SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV 155
F++I+GV S++ ++ R + R++ LF +G+ L N L
Sbjct: 232 CFMWIMGVCIPLSISAQLSRGSSRLRLCRAIVKRSVYLFAIGLAL--------NTLGGRN 283
Query: 156 DIAQIRWMGVLQRIAIAYLVAALC-EIWLKGDGHVSSK-LSLFRKYRGHWVVALVLTTLY 213
+ +IR GVLQR +AYLVA + + + D S + L W+VAL++ +
Sbjct: 284 QLERIRIFGVLQRFGLAYLVAGIVYALAARPDDKQSKRMLGDVVALIPQWIVALLILAAH 343
Query: 214 LLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN----AVGMIDRKILGIQ 269
+++ L VP + + G R + G N A G +D+ +LG+
Sbjct: 344 CAVVFLLPVPGCPRGY-------------LGPGGRHADGKYWNCSGGATGYVDKVLLGVD 390
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ P + YG PFDPEG+L S+ + +G+ G +
Sbjct: 391 HIYQLPTANSV---------YG----------SGPFDPEGVLGSLTSIFQVFLGIQAGQI 431
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG-MHLNKALYSLSYTCLT 377
+ + + R++ W++ + L +G +L + + +NK L+S+S+ +T
Sbjct: 432 LRTYGSWKARLVRWLLWAVLLGAVGAALHYTNVVPVNKNLWSVSFVLVT 480
>gi|363733262|ref|XP_420455.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gallus
gallus]
Length = 581
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 101/369 (27%), Positives = 166/369 (44%), Gaps = 73/369 (19%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112
++RL SLD FRGL++ +M+ V+ GG H WNGLT+AD V P+F+FI+G S++L+
Sbjct: 183 RQRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLS 242
Query: 113 YKNF----PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQR 168
+ K K + R+ L LLG+ + N + +R GVLQR
Sbjct: 243 LSSTLRWGSSKQKVLWKILWRSFLLILLGVIVVNP-----NYCLGALSWENLRIPGVLQR 297
Query: 169 IAIAYLVAALCEIWLKGDGHVSSKLSL-------FRKYRGHWVVALVLTTLYLLLLYGLY 221
+ + YLV A E+ G S L + + W+ L+L ++L L + L
Sbjct: 298 LGLTYLVVAALELLFTRTGADSGTLEMSCPALQDILPFWPQWIFILMLEVIWLCLTFLLP 357
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPA----------CN--AVGMIDRKILGIQ 269
VP P RG GP C A G IDR +LG +
Sbjct: 358 VP------------GCP---------RGYLGPGGIGDFGNYLNCTGGAAGYIDRLVLGEK 396
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ P + Q ++ P+DPEG+L ++ + +GL G +
Sbjct: 397 HIYQHPSCNVLYQTTV------------------PYDPEGILGTINTILMAFLGLQAGKI 438
Query: 330 IVHFKDHRDRMLNWIILSSCLIGL------GLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
I+ +KD ++++ + S ++G+ S + + +NK L+S SY + + +
Sbjct: 439 ILSYKDQHKQIMSRFFIWSVVMGIISAILTKCSKEEGFIPINKNLWSTSYVTTMSCFAFI 498
Query: 384 LLAGIYFMV 392
LL +Y++V
Sbjct: 499 LLLLMYYLV 507
>gi|395213375|ref|ZP_10400182.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
gi|394456744|gb|EJF11001.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
Length = 391
Score = 111 bits (278), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 100/363 (27%), Positives = 167/363 (46%), Gaps = 75/363 (20%)
Query: 39 PSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDD---VGGILPAINHSPWNGLTLAD 95
P +++ RPQ +R L SLDV RGLT+ALM++V++ G I H+ W+G T+ D
Sbjct: 8 PPLTDAGLLRPQTYERYL-SLDVLRGLTIALMVVVNNPGSWGSIYAPFKHAAWHGFTVTD 66
Query: 96 FVMPFFLFIVGVSLALTYKNFPCK--VVATRKAILRALNLFLLGIFLQGGFFHGINNLKY 153
V P FLF+VG +++ + + F + V RK + R +FL+G+FL F + N +
Sbjct: 67 LVFPSFLFVVGNAMSFSMRKFETQPDSVFLRKVLKRTALIFLIGLFLNL-FPFVMRNPEG 125
Query: 154 GV---DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
+ D +R MGVLQRIA+ Y +A+L +LK ++G + +LV+
Sbjct: 126 AIVMKDFTAVRIMGVLQRIALCYFIASLAVHYLK--------------FKGAAIFSLVVL 171
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQH 270
Y ++Y ++ P+ A NA D I+ +
Sbjct: 172 LGYWAVMYFF------------GDAADPY------------SLAGNAARKFDDLIIPETN 207
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
LY+ +G PFDPEGLLS++ ATV + G G I
Sbjct: 208 LYK---------------GFG-----------IPFDPEGLLSTLPATVNVIAGYFAGLFI 241
Query: 331 VHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
++ + ++ + L+ + L D + +NK +++ SY + G S +L+AG+
Sbjct: 242 QKNGNNLSTVFKLMVAGAILVAMALVWD-IYFPINKPIWTSSYVLHSVGLSVMLIAGLML 300
Query: 391 MVR 393
++
Sbjct: 301 VIE 303
>gi|291409013|ref|XP_002720836.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Oryctolagus cuniculus]
Length = 613
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 113/406 (27%), Positives = 182/406 (44%), Gaps = 71/406 (17%)
Query: 13 DQLVIQISKSTSAPANEKLERDPLLPPSNSN--SKQTRPQH-----QQRRLISLDVFRGL 65
D ISK+ + ++L L PS ++ S +P+ + RL +D FRG+
Sbjct: 168 DDFNSWISKAIHSRETDRLINSELGSPSRADPLSGDLQPETWHLSAAKHRLRCVDTFRGI 227
Query: 66 TVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY----KNFPCKVV 121
+ LM+ V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K+
Sbjct: 228 ALVLMVFVNYGGGRYWYFRHSSWNGLTVADLVFPWFVFIMGSSIFLSMMSALQRGCSKLR 287
Query: 122 ATRKAILRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAAL 178
K R+ L ++GI I N Y + ++R GVLQR+ + Y V A+
Sbjct: 288 LLGKIAWRSFLLIMIGIV--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAV 339
Query: 179 CEIWLKG----DGHVSSKLSLFRKYRG---HWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
E+ + + S + R W++ L+L +++L L + L VP
Sbjct: 340 LELLFAKPVPENWVLESSCTCLRDVTSSWPQWLLILLLESIWLGLSFFLPVPG------C 393
Query: 232 ETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
T P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 394 PTGYLGP-------GGIGDWGKYPNCTGGAAGYIDRVLLGDDHLYKHPSSTVLYHTEV-- 444
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
+DPEG+L ++ + VT +G+ G +++ +KD +L
Sbjct: 445 ----------------AYDPEGILGTINSIVTAFLGVQAGKILLFYKDQTKSILIRFTAW 488
Query: 348 SCLIGL------GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLLA 386
SC++GL +S + + +NK L+S+SY T L++ A +LL
Sbjct: 489 SCILGLISVALTKISENEGFIPINKNLWSISYVTTLSSFAFFILLV 534
>gi|194226375|ref|XP_001488696.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Equus caballus]
Length = 663
Score = 111 bits (277), Expect = 7e-22, Method: Compositional matrix adjust.
Identities = 111/396 (28%), Positives = 175/396 (44%), Gaps = 65/396 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNS-------KQTRPQHQQRRLISLDVFRGLTVALMI 71
ISK+ ++ + L L PS +++ + P +RL +D FRG+ + +M+
Sbjct: 224 ISKAINSRETDHLINSELGSPSRADALGGDSQLEAWGPAAAPQRLRCVDTFRGIALIIMV 283
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL----TYKNFPCKVVATRKAI 127
V+ GG HS WNGLT+AD V P+F+FI+G S+ L T + K K
Sbjct: 284 FVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMTSTLQRGCSKFRLLGKIA 343
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK--- 184
R+ L LGI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 344 WRSFLLISLGIVVVNP-----NYCLGPLSWDKLRIPGVLQRLGVTYFVVAVLELLFAKPV 398
Query: 185 -GDGHVSSKLSLFRKYRG---HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
G G + S R W+ L+L +++L L + L VP T P
Sbjct: 399 PGSGASERRCSSLRDILSSWPQWLFILLLESIWLGLTFFLPVPG------CPTGYLGP-- 450
Query: 241 FNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 451 -----GGIGDLGRYPNCTGGAAGYIDRLLLGEDHLYQHPSSAVLYHTEV----------- 494
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL--- 353
+DPEG+L ++ + V +G+ G +++++KD +L SC +GL
Sbjct: 495 -------AYDPEGILGTINSIVMAFLGVQAGRILLYYKDQTKAILLRFTAWSCFLGLISV 547
Query: 354 ---GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L++ A +LL
Sbjct: 548 ALTKVSENEGSIPINKNLWSISYVTTLSSFAFFILL 583
>gi|330792857|ref|XP_003284503.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
gi|325085533|gb|EGC38938.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
Length = 373
Score = 111 bits (277), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 103/372 (27%), Positives = 156/372 (41%), Gaps = 94/372 (25%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
++R+ SLDV RG+T+ MILVD+ GG ++ + + WNGL+ AD + P FLFI G S+A
Sbjct: 1 KKRMSSLDVARGITIFGMILVDNQGGPDVIWPLKETEWNGLSTADLIFPSFLFICGFSVA 60
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
L K I R LF + FL N + + + R MGVLQRIA
Sbjct: 61 LALKTAKNTRSTWYNIIRRTFLLFFIQCFL--------NLMAHHFVFSSFRVMGVLQRIA 112
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ Y ++ + + + L R + L T Y+ ++Y L VP
Sbjct: 113 LCYFLSCV--------SFLCFPVFLQRLF------LLGTTVTYISVMYALPVPG------ 152
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
CG +G P CNA +D K+ G ++
Sbjct: 153 --------------CG-KGVLTPTCNAGAYLDFKVFGPNMIH------------------ 179
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHF-----KDHRDRMLNWII 345
P DPEGLLS++ A +T +GL FG + + + D ++ WI+
Sbjct: 180 -------------PNDPEGLLSTLSAFITTWMGLEFGRVFTTYYRKYDYSNVDLIVRWIV 226
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV----------RYI 395
+ + I + L M NK ++S S+ T G G L+ Y ++ R+I
Sbjct: 227 MIALFIVPAIGLGATVMPFNKLIWSFSFALFTVGTGGCLITVAYILIDVIEWGNKARRFI 286
Query: 396 SSHLMLKKPFDY 407
+L KPF Y
Sbjct: 287 D---LLSKPFIY 295
>gi|156401294|ref|XP_001639226.1| predicted protein [Nematostella vectensis]
gi|156226353|gb|EDO47163.1| predicted protein [Nematostella vectensis]
Length = 387
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 102/365 (27%), Positives = 169/365 (46%), Gaps = 77/365 (21%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
RL SLD FRG+++ +MI V+ GG HS WNGLT+AD V P+F++I+GVS+ L+++
Sbjct: 1 RLKSLDTFRGISLTVMIFVNFGGGGYYFFAHSIWNGLTVADLVFPWFMWIMGVSMVLSFR 60
Query: 115 NFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
K ++T + I+ R L LF LG+F NNL R GVLQR A
Sbjct: 61 VLRRKQISTYRIIIKITKRTLLLFALGLFTS-------NNL------TNYRIPGVLQRFA 107
Query: 171 IAYLVAALCEI--------------WLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
Y V A+ ++ W G + +SL+ + W++ +Y+++
Sbjct: 108 ACYFVVAVIQVLAGPSVEDSQPRGSWWDG---IRDVVSLWAQ----WLLMFAFLIIYVVV 160
Query: 217 LYG--LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRK 274
Y L+ Y P S +S FN T G+ +D +LG +H+Y++
Sbjct: 161 TYATELHGCPRGYTGPGGISDNSS-AFNCTGGMASH----------VDSWLLG-KHVYQR 208
Query: 275 PIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
+ + ++ DPEG++ ++ + +G+ GH + F
Sbjct: 209 GTFKDMYRTTVAH------------------DPEGVMGTLTSIFIVFLGVQAGHTLFTFS 250
Query: 335 DHRDRMLNWIILSSCL--IGLGLS----LDFVGMHLNKALYSLSYTCLTAGASGVLLAGI 388
HR R++ W + + L I +GLS D V + +NK L+S+S+ T + +LL+
Sbjct: 251 HHRQRLVRWFVWAVLLGVIAIGLSGGTQNDGV-IPINKNLWSISFVLATGSMAFLLLSFC 309
Query: 389 YFMVR 393
Y +
Sbjct: 310 YVTIE 314
>gi|395507548|ref|XP_003758085.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Sarcophilus harrisii]
Length = 634
Score = 110 bits (276), Expect = 9e-22, Method: Compositional matrix adjust.
Identities = 103/343 (30%), Positives = 161/343 (46%), Gaps = 63/343 (18%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
RL SLD FRG+ + +M+ V+ GG H WNGLTLAD V P+F+FI+G S+AL+
Sbjct: 238 RLRSLDTFRGIALIIMVFVNYGGGKYWFFKHESWNGLTLADLVFPWFVFIMGSSIALSLS 297
Query: 115 NF----PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQ 167
+ K K + R+L L ++GIF I N Y + ++R GVLQ
Sbjct: 298 SMLRRGCSKWKLLGKILWRSLLLCVIGIF--------IVNPNYCLGPLSWDKLRIPGVLQ 349
Query: 168 RIAIAYLVAALCEIWLKG----DGHVSSKLSLFR---KYRGHWVVALVLTTLYLLLLYGL 220
R+ + YLV A+ E+ + + S F+ Y W+ L+L ++ + + L
Sbjct: 350 RLGLTYLVVAVLELLFAKAVPENSAMERSCSSFQDIISYWPQWIFILMLEAAWVCVTFLL 409
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPI 276
VP T P G G G P C A G IDR +LG H+Y+ P
Sbjct: 410 PVPG------CPTGYLGP-------GGIGDFGKYPNCTGGAAGYIDRLLLGEDHIYQHP- 455
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
SP + + P+DPEGLL ++ + V +G+ G +++ +KD
Sbjct: 456 ----------SP-------NVLYHTKVPYDPEGLLGTINSIVMAFLGVQAGKILLFYKDQ 498
Query: 337 RDRMLNWIILSSCLIGL--GLSLDFVG----MHLNKALYSLSY 373
+++ +L S ++G+ G+ F + +NK L+S+SY
Sbjct: 499 PKQIMLRFLLWSAMLGIISGVLTKFSQNEGFIPVNKNLWSISY 541
>gi|223935576|ref|ZP_03627492.1| conserved hypothetical protein [bacterium Ellin514]
gi|223895584|gb|EEF62029.1| conserved hypothetical protein [bacterium Ellin514]
Length = 410
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 111/379 (29%), Positives = 165/379 (43%), Gaps = 87/379 (22%)
Query: 44 SKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPF 100
+K T P RL+SLDVFRG T+A M+LV++ G I ++H+ WNG T D + PF
Sbjct: 15 AKVTAPGSA--RLMSLDVFRGATIASMMLVNNPGSWDSIYRQLDHAEWNGWTFTDLIFPF 72
Query: 101 FLFIVGVSLAL-TYKNFPCKVVATR---KAILRALNLFLLGIFLQGGFFHGINNLKYG-- 154
FL+IVGV++ L T K T + RA +F LG+FL FF + N YG
Sbjct: 73 FLWIVGVAIPLSTQKRLDGGASRTNLWLHVVRRAAIIFGLGLFLA--FFSFLINGSYGRL 130
Query: 155 --------VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVA 206
IR GVLQRIA+ YL+A S + L K RG
Sbjct: 131 GGFGPWFNEICGTIRIPGVLQRIAVCYLIA--------------STIYLTTKLRGQIAWL 176
Query: 207 LVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
+ L Y +L+ + VP G P N +D +L
Sbjct: 177 IGLLAAYWVLMKCVPVPGHG---------------------AGVLTPEGNFSAYVDGNVL 215
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G +H +W AP+DPEG++S++ A TCL G+
Sbjct: 216 G-RH----------------------------TWHGAPWDPEGVISTIPAIATCLFGILT 246
Query: 327 GHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
G L++ K ++ W+ +S L+ L ++ + + +NK L++ SY+ AG + + A
Sbjct: 247 GQLLL-IKRSVEQKTTWVFVSGILLILAGAVMNIWLPINKNLWTSSYSVFMAGMAMNVFA 305
Query: 387 GIYFMVRYISSHLMLKKPF 405
Y++V + KPF
Sbjct: 306 VFYWLVD-VKGCQKWAKPF 323
>gi|417411831|gb|JAA52337.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
[Desmodus rotundus]
Length = 595
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/397 (27%), Positives = 176/397 (44%), Gaps = 67/397 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPS-------NSNSKQTRPQHQQRRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ S+ RP RL +D FRGL + LM+
Sbjct: 156 ISKAMSSRETDRLINSELGSPSRAGQLGDDAQSEVWRPAAAPLRLRCVDTFRGLALILMV 215
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 216 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMSSVLQRGCSKFRLLGKVA 275
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +G+ + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 276 WRSFLLICIGVIVVNP-----NYCLGPLSWDKVRLPGVLQRLGVTYFVVAVLELLFAKPV 330
Query: 186 ------DGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
+G SS + + W+ L+L +++L L + L VP T P
Sbjct: 331 PERGAWEGRCSSLQDIMSSWP-QWLFILMLESIWLALTFFLPVPG------CPTGYLGP- 382
Query: 240 IFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
G G G P C A G IDR +LG H+Y+ P + +
Sbjct: 383 ------GGIGDLGKYPNCTGGAAGYIDRLLLGEDHIYQHPSSTVLYHTRV---------- 426
Query: 296 DAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-- 353
+DPEG+L ++ + V +G+ G +++++K+ +L C +GL
Sbjct: 427 --------AYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILIRFTAWCCFLGLIS 478
Query: 354 ----GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L++ A +LL
Sbjct: 479 VGLTKISENEGFIPVNKNLWSISYVTTLSSFAFFILL 515
>gi|294463099|gb|ADE77087.1| unknown [Picea sitchensis]
Length = 218
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 53/88 (60%), Positives = 67/88 (76%)
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHL 364
FDPEGLLSS+MA VTC IGLHFGH++VHFK H +R+L II S LI G++L GM L
Sbjct: 44 FDPEGLLSSIMAVVTCFIGLHFGHILVHFKGHSERVLQCIIPSLGLIFFGIALGIFGMPL 103
Query: 365 NKALYSLSYTCLTAGASGVLLAGIYFMV 392
NK LY+ +Y C+T+GA+G+L GIY +V
Sbjct: 104 NKPLYTFNYMCVTSGAAGLLFVGIYLLV 131
>gi|427710153|ref|YP_007052530.1| heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
7107]
gi|427362658|gb|AFY45380.1| Heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
7107]
Length = 387
Score = 110 bits (276), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 107/349 (30%), Positives = 159/349 (45%), Gaps = 77/349 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL+SLDVFRG+ +A MILV++ G I P ++H+ W+G T D V PFFLFIVGV++
Sbjct: 18 RLVSLDVFRGIAIASMILVNNPGSWDYIYPPLDHAEWHGCTPTDLVFPFFLFIVGVAMPF 77
Query: 112 TY-----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV--DIAQIRWMG 164
++ +N P V R ILR + F ++ L G+ + + +R MG
Sbjct: 78 SFAKYTPENRPTATVYQR--ILRRGLILFALGLFLALFTLTLDWLIKGITPNFSTLRIMG 135
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQRI++AY++AAL + L+L R RG W++A V+ Y L + + VP
Sbjct: 136 VLQRISLAYVIAAL------------AVLNLSR--RGLWILAAVILIGYWLAMQFIPVPG 181
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
+ G+ P N G IDR ILG +H+YR
Sbjct: 182 FG---------------------AGNLTPEGNLGGYIDRIILG-KHIYR----------- 208
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
FDPEGL S++ A VT +G G + +N +
Sbjct: 209 -----------------SGSFDPEGLFSTLPAVVTVFLGYFTGDWLRTQPIKSRTSINLV 251
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
I + +G F+ +NK L++ SY TAG + + LA Y ++
Sbjct: 252 IFGLITLIIGQLWGFI-FPINKQLWTSSYVLFTAGWALLTLALCYELIE 299
>gi|426256612|ref|XP_004021932.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Ovis
aries]
Length = 641
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 108/398 (27%), Positives = 175/398 (43%), Gaps = 60/398 (15%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQR-----RLISLDVFRGLTVALMILV 73
ISK+ ++ ++L L PS ++ Q P+ +R RL +D FRG+ + LM+ V
Sbjct: 206 ISKAINSRETDRLINSELGSPSRASDPQ--PEAWRRSAAPLRLRCVDTFRGMALILMVFV 263
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILR 129
+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K+ K + R
Sbjct: 264 NYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGASIFLSMASILQRGCSKLRLLGKIVWR 323
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHV 189
+ L +GIF+ N + + R GVLQR+ Y V A+ E+ V
Sbjct: 324 SFLLICIGIFVVNP-----NYCLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKP--V 376
Query: 190 SSKLSLFRKYRG---------HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
+L R W+ L+L ++L L + L VP T P
Sbjct: 377 PETCALERSCFSLLDITASWPQWLFVLILEGVWLALTFFLPVPG------CPTGYLGPGG 430
Query: 241 FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
R TG A G +DR +LG QHLY+ P + +
Sbjct: 431 IGDGGRYRNCTG---GAAGYVDRLLLGDQHLYQHPSSAVLYHTEV--------------- 472
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL------G 354
+DPEG+L ++ + V +G+ G +++++KD +L CL+GL
Sbjct: 473 ---AYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTRGILIRFAAWGCLLGLVSVALTK 529
Query: 355 LSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
S + + +NK L+S+SY + + ++L +Y +V
Sbjct: 530 ASENEGFIPVNKNLWSVSYVTTLSSLAFLILLALYPVV 567
>gi|423215264|ref|ZP_17201791.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
CL03T12C04]
gi|392691832|gb|EIY85072.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
CL03T12C04]
Length = 371
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 100/348 (28%), Positives = 158/348 (45%), Gaps = 74/348 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLDV RG+T+ MILV++ G + + H+ WNGLT D V PFF+FI+GVS
Sbjct: 2 KSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGVS 61
Query: 109 LALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINNLKYGVD--IAQIRWMG 164
++ F + K + R + LFLLG+FL + + GV+ +QIR +G
Sbjct: 62 MSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-----WFSLVCAGVEQPFSQIRILG 116
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+A+AY +L +S+ R W+ A++L +LL G
Sbjct: 117 VLQRLALAYFFGSLL------------IMSVRRPANLAWISAIILIGYIVLLALG----- 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N + + DR + G HLYR+ +
Sbjct: 160 -----------------------NGFELSEQNIIAVTDRTLFGETHLYREWL-------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PD G + FDPEGLLS++ +IG G+++ + R+L
Sbjct: 189 ---PDGG----------RIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQIS 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
IL L+ G L + G LNK ++S ++ +T G + + L + +++
Sbjct: 236 ILGIALLFAGWLLSY-GCPLNKKVWSPTFVLVTCGFASLFLVFLTWLI 282
>gi|345781561|ref|XP_539948.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Canis
lupus familiaris]
Length = 638
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 112/399 (28%), Positives = 175/399 (43%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQ-------RRLISLDVFRGLTVALMI 71
ISK+ ++ ++L L P + S Q + RL S+D FRGL + LM+
Sbjct: 199 ISKAINSRETDRLINSELGSPGRAGSIGGEAQQEAWHPPSALPRLRSIDTFRGLALILMV 258
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 259 FVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMTSMLQRGCSKFRLLGKIA 318
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCE-IWL 183
R+ LF +G+ I N Y + ++R GVLQR+ + Y V A+ E I+
Sbjct: 319 WRSFLLFCIGVV--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELIFA 370
Query: 184 KGDGHVSSKLSLFRKYRG------HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
K + R W+ L+L +++L L + L VP T
Sbjct: 371 KPVPESCASERRCFSLRDIILSWPQWLFILLLESIWLGLTFFLPVPG------CPTGYLG 424
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG H+Y+ P + +
Sbjct: 425 P-------GGIGDLGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSAVLYHTKV-------- 469
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
P+DPEG+L ++ + V +G+ G +++++KD +L C +GL
Sbjct: 470 ----------PYDPEGILGTISSIVMAFLGIQAGKILLYYKDQTKDILIRFTAWCCFLGL 519
Query: 354 ------GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L++ A +LL
Sbjct: 520 ISVALTKISENEGFIPINKNLWSISYVTTLSSFAFFILL 558
>gi|281209662|gb|EFA83830.1| hypothetical protein PPL_02898 [Polysphondylium pallidum PN500]
Length = 409
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/382 (26%), Positives = 161/382 (42%), Gaps = 84/382 (21%)
Query: 17 IQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDV 76
+++ + T + D + ++P ++R++SLD RGLT+ MILVD+
Sbjct: 1 MKVDEETPLVKGSTITSDTSINVDVDKDTTSKPP-PKKRMLSLDTARGLTIFGMILVDNQ 59
Query: 77 GG--ILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLF 134
GG ++ + + WNG++ AD + P FLFI G S++L KN I R + LF
Sbjct: 60 GGPEVIWPLKETDWNGISTADLIFPSFLFICGFSISLALKNAKNDRPTWINIIRRTILLF 119
Query: 135 LLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLS 194
+ +FL N + + + R MGVLQRI++ Y +
Sbjct: 120 GIQLFL--------NLMAHKFVFSTFRVMGVLQRISLCYCFSC-------------CSFM 158
Query: 195 LFRKYRGHWVVALVLT-TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGP 253
L K+ VALV++ T+YL L+Y VP CG RG+
Sbjct: 159 LLPKWAQR--VALVISATIYLCLMYAYPVPG--------------------CG-RGNITR 195
Query: 254 ACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSS 313
+CNA G ID IL ++ P DPEG +S+
Sbjct: 196 SCNAAGYIDNLILRKNMIH-------------------------------PTDPEGFIST 224
Query: 314 VMATVTCLIGLHFGH-LIVHFKDH---RDRMLNWIILSSCLIGLGLSLDFVG-MHLNKAL 368
A +T +G+ G L H + +D ++ W+ + +GL LD + NK +
Sbjct: 225 FSAFITTWMGVELGRILTTHARSADGWKDILIRWLSIGMVCAMIGLFLDATNVIQFNKII 284
Query: 369 YSLSYTCLTAGASGVLLAGIYF 390
+S S+ LT + L+ +Y+
Sbjct: 285 WSFSFAMLTVACGALFLSALYY 306
>gi|262407085|ref|ZP_06083634.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294648023|ref|ZP_06725570.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
gi|294809833|ref|ZP_06768513.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
gi|345512215|ref|ZP_08791750.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
gi|229445856|gb|EEO51647.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
gi|262355788|gb|EEZ04879.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292636642|gb|EFF55113.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
gi|294442971|gb|EFG11758.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
Length = 371
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/348 (28%), Positives = 158/348 (45%), Gaps = 74/348 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLDV RG+T+ MILV++ G + + H+ WNGLT D V PFF+FI+GVS
Sbjct: 2 KSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGVS 61
Query: 109 LALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINNLKYGVD--IAQIRWMG 164
++ F + K + R + LFLLG+FL + + GV+ +QIR +G
Sbjct: 62 MSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-----WFSLVCAGVEQPFSQIRILG 116
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+A+AY +L +S+ R W+ A++L +LL G
Sbjct: 117 VLQRLALAYFFGSLL------------IMSVRRPANLAWISAIILIGYIVLLALG----- 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N + + DR + G HLYR+ +
Sbjct: 160 -----------------------NGFELSEQNIIAVTDRTLFGETHLYREWL-------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PD G + FDPEGLLS++ +IG G+++ + R+L
Sbjct: 189 ---PDGG----------RIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQIS 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
IL L+ G L + G LNK ++S ++ +T G + + L + +++
Sbjct: 236 ILGIVLLFAGWLLSY-GCPLNKKVWSPTFVLVTCGFASLFLVFLTWLI 282
>gi|427716050|ref|YP_007064044.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
gi|427348486|gb|AFY31210.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
Length = 375
Score = 110 bits (274), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 120/365 (32%), Positives = 164/365 (44%), Gaps = 84/365 (23%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MILV+ VG P ++H+ WNG T D V PFFLFIVGV++
Sbjct: 2 RLTSLDVFRGITIAAMILVNMVGVADDKYPLLDHAEWNGCTPTDLVFPFFLFIVGVAMTF 61
Query: 112 -----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
T N P V R ILR + L L GF+ N + D++ IR MGVL
Sbjct: 62 SLSKYTEGNKPNSSVYWR--ILRRAAILLALGLLLNGFW---NKGVWTFDLSSIRLMGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI+++YLVA++ + L+L RK G W++A VL Y L + L VP
Sbjct: 117 QRISLSYLVASV------------TVLNLPRK--GQWILAAVLLIGYWLTMMYLPVP--- 159
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
GV G N IDR I+ HLY+ ++
Sbjct: 160 ---------------GHGAGVLTREG---NLGAYIDRLIIPKAHLYKGDKFNFMG----- 196
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM---LNW 343
DPEGL S++ A V+ L G +F L + + R R L
Sbjct: 197 -------------------DPEGLFSTIPAIVSVLAG-YFAGLWIRSQPVRSRTSIGLAL 236
Query: 344 IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM--VRYISSHLML 401
+ +IG F +NK L++ SY ++G + +LLA Y + VR I
Sbjct: 237 FGIGCLIIGWAWGWTF---PINKKLWTSSYVVFSSGWALLLLAACYELIEVRLIRRW--- 290
Query: 402 KKPFD 406
KPF+
Sbjct: 291 SKPFE 295
>gi|387793162|ref|YP_006258227.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
gi|379655995|gb|AFD09051.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
Length = 393
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 163/363 (44%), Gaps = 81/363 (22%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
Q + RL+SLDVFRG TVA MILV++ G I + H+ WNG T D + PFFL
Sbjct: 2 QASASEPKPRLLSLDVFRGATVAAMILVNNPGSWSNIYAPLEHAKWNGCTPTDLIFPFFL 61
Query: 103 FIVGVSLALTY---KNFPCKVVATRKAI-LRALNLFLLGIFLQGGFFHGINNLKYG-VDI 157
FIVG+S+A K+ P + A K+I +R+L LF LG+ L F + K+G VD+
Sbjct: 62 FIVGISIAYALSGKKSRPEEHSAAIKSITIRSLKLFGLGLIL--ALFPIVYFDKFGEVDV 119
Query: 158 --------AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
+ +R MGVLQRI I + +A + I++K + W +L
Sbjct: 120 WDQIVMRFSGVRIMGVLQRIGIVFFIAGI--IFIKAKPKTIA-----------WTAGSLL 166
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
YLL+ + + VP Y + P N IDR IL
Sbjct: 167 VIYYLLMTF-VPVPGVGY---------------------ANLEPETNLGAWIDRLILTTD 204
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
HL+++ S+T +DPEGLL ++ A T L+G G
Sbjct: 205 HLWKQ---SKT------------------------WDPEGLLGTIPAVATGLLGTLCGTW 237
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
+ D+ ++ + ++ I GL D +NK+L++ S+ TAG A +Y
Sbjct: 238 MKKPMDNALKITWLFVFAAFGIAAGLVWDLF-FPINKSLWTSSFVLYTAGLGSAFFAVLY 296
Query: 390 FMV 392
+++
Sbjct: 297 WLI 299
>gi|333031144|ref|ZP_08459205.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
gi|332741741|gb|EGJ72223.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
Length = 363
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 153/335 (45%), Gaps = 76/335 (22%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGV 107
+RL+SLDV RG+TVA MILV++ G A+ H+ W+GL AD V P F+F++G+
Sbjct: 6 EDSKRLLSLDVLRGITVAGMILVNNTGSCGYNYTALRHASWDGLNFADLVFPMFMFMMGI 65
Query: 108 SLALTYKNFP-CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
S ++ + + K A K R L ++G+F++ I ++ G++++ +R MGV+
Sbjct: 66 STYISLRKYENNKKTAFYKIFKRTSLLIIIGLFMEC----IITWIEVGLNLSTLRLMGVM 121
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QR+ + Y + AL +++ + LS+ Y
Sbjct: 122 QRLGLCYGITALLSLYVPHKYLLKIALSVLLGY--------------------------- 154
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
F ++ S G A N +G++DR +LG+ H IY + KQ
Sbjct: 155 --FIIQIVGS------------GFDKSAENVIGVVDRSVLGVNH-----IYLQGKQFV-- 193
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEG+LS++ A +IG G I+ ++H+ +ML L
Sbjct: 194 -------------------DPEGVLSTLPAIAQVMIGFFCGRKILEKREHKQQMLILYRL 234
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGAS 381
S + +G +V +NK L+S ++ +T+G +
Sbjct: 235 GSLFLFVGFVFSYV-CPINKRLWSPTFVLVTSGVA 268
>gi|350412149|ref|XP_003489557.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Bombus impatiens]
Length = 571
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 107/427 (25%), Positives = 192/427 (44%), Gaps = 81/427 (18%)
Query: 11 ANDQLVIQISKSTSAPAN--EKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVA 68
+++I++ + +P N + L R L N+ R R+ S+D FRG+ +
Sbjct: 138 TTSKVIIRVIRGKLSPNNVYDDLNR---LQEENTTHPVIRVTKFSSRIQSVDAFRGIAIL 194
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
LMI V++ GG NHS W GL++AD ++P+F +I+G+S+ ++ K ++ +R I
Sbjct: 195 LMIFVNNGGGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITIS-KRAELRLTTSRVKIT 253
Query: 129 -----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IW 182
R+ L LLG+ L +N+L R+ G+LQ +A++Y V A+ E I+
Sbjct: 254 LCCLRRSAILILLGLMLNSIDSKSLNDL---------RFPGILQLLAVSYFVCAILETIF 304
Query: 183 LKGDGH----VSSKLSLFRKYRGHW----VVALVLTTLYLLLLY--------GLYVPDWQ 226
+K + ++FR W ++A ++TT L+ + G + P +
Sbjct: 305 MKPHSQDILLQFGRFAIFRDILDSWPQWLIMAGIMTTHTLITFFLHMPNCPTGYFGPGGK 364
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
Y + + N T G A G IDR I G H Y K I
Sbjct: 365 YHYRGK-------YMNCTAG----------AAGYIDRLIFG-NHTYSK----------IK 396
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI-- 344
YG + +DPEGL++++ A +G+H G +++ + R++ W
Sbjct: 397 DSIYGQI---------LRYDPEGLMNTISAIFIVYLGVHAGKILLLYYQGNARLIRWFLW 447
Query: 345 -ILSSCLIGLGLSLDFVG--MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLML 401
I + + G+ + + G + ++K + +LSY + + +L A +YF++ Y
Sbjct: 448 AIFTGIIAGILCNFENEGGVIPVSKRMMTLSYVLTCSSFAFLLYAILYFLIDY--KQFWS 505
Query: 402 KKPFDYS 408
PF Y+
Sbjct: 506 GAPFIYA 512
>gi|298479647|ref|ZP_06997847.1| membrane protein [Bacteroides sp. D22]
gi|336403243|ref|ZP_08583960.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
gi|295084924|emb|CBK66447.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
gi|298274037|gb|EFI15598.1| membrane protein [Bacteroides sp. D22]
gi|335946636|gb|EGN08437.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
Length = 371
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 100/342 (29%), Positives = 154/342 (45%), Gaps = 74/342 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLDV RG+T+ MILV++ G + + H+ WNGLT D V PFF+FI+GVS
Sbjct: 2 KSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGVS 61
Query: 109 LALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINNLKYGVD--IAQIRWMG 164
++ F + K + R + LFLLG+FL + + GV+ +QIR +G
Sbjct: 62 MSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-----WFSLVCAGVEQPFSQIRILG 116
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+A+AY +L +S+ R W+ A++L +LL G
Sbjct: 117 VLQRLALAYFFGSLL------------IMSVRRPANLAWISAIILIGYIVLLALG----- 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N + + DR + G HLYR+ +
Sbjct: 160 -----------------------NGFELSEQNIIAVTDRTLFGETHLYREWL-------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PD G + FDPEGLLS++ +IG G+++ + R+L
Sbjct: 189 ---PDGG----------RIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQIS 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
IL L+ G L + G LNK ++S ++ +T G + + L
Sbjct: 236 ILGIVLLFAGWLLSY-GCPLNKKVWSPTFVLVTCGFASLFLV 276
>gi|354568330|ref|ZP_08987495.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
gi|353540693|gb|EHC10166.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
Length = 384
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 110/359 (30%), Positives = 155/359 (43%), Gaps = 90/359 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDVG----GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RL SLDVFRG+T+A MILV+ + P + H+ W+G T D V PFFLFIVGV+++
Sbjct: 2 RLTSLDVFRGITIAGMILVNTASIAEPNVYPPLLHAEWHGCTPTDLVFPFFLFIVGVAMS 61
Query: 111 LTYKNFPCKVVATRKA-----------ILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQ 159
++ + + K I RA LF+LG+ L G + G+ + D
Sbjct: 62 FSFSKYTDSKLHGEKEKVFVSLPYWRIIRRAAILFVLGLLLNGFWNQGV----WTFDFNS 117
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
IR MGVLQRI++ YL+A+L ++ RK G W++A VL Y L +
Sbjct: 118 IRVMGVLQRISLTYLLASLVV------------FNIPRK--GQWILAGVLLIGYWLAM-- 161
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
+YVP Y V T N IDR I+ HLY+ Y+
Sbjct: 162 MYVPVSGYGAGVLTRDG-------------------NLGAYIDRLIIPKAHLYKGDNYNF 202
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
DPEGL S++ A V+ L G G I K +
Sbjct: 203 MG------------------------DPEGLFSTIPAIVSVLAGYFAGQWIRSQKQINSK 238
Query: 340 MLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
S L+ GLS +G +NK L++ SY T G + +LLA Y ++
Sbjct: 239 T------SMDLVLFGLSCLVIGGIWDLAFPINKKLWTSSYVVFTTGWALLLLAFCYELI 291
>gi|313242995|emb|CBY39713.1| unnamed protein product [Oikopleura dioica]
Length = 597
Score = 109 bits (273), Expect = 2e-21, Method: Compositional matrix adjust.
Identities = 107/405 (26%), Positives = 173/405 (42%), Gaps = 86/405 (21%)
Query: 27 ANEKLERDPLLPPSNSNSKQTRPQHQQR------------RLISLDVFRGLTVALMILVD 74
A E E P P ++ + + Q Q R SLD RGL++ +MI V+
Sbjct: 167 ALEPEEAKPENPFADDQTDKQEDQEVQEDEPAPPAPAKKKRYKSLDTLRGLSLIIMIFVN 226
Query: 75 DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY----KNFPCKVVATRKAILRA 130
GG + H WNGLT+AD VMP+FLF+ GVS+ + K K + + ++R+
Sbjct: 227 YGGGEYWFMEHVAWNGLTVADLVMPWFLFMSGVSIRIALQSRIKRGISKTEISYEILVRS 286
Query: 131 LNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWL--KGDGH 188
+ L LG+ GG R+ GVLQRI +Y V A+ + + D
Sbjct: 287 VKLIGLGMITIGG----------NESWEYFRFPGVLQRIGFSYFVVAIIHLLVIEHPDKE 336
Query: 189 VSSKLSLFRK----YRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVT 244
+ LF++ ++ H +++ + ++ L Y L +P
Sbjct: 337 PETNWGLFKEMSFNFKEH-LISWSILGAFICLTYLLPIP------------------GCP 377
Query: 245 CGVRGSTGPACN---------AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPL 295
G G G + N A G IDRK+LG +H+Y P
Sbjct: 378 TGYTGPGGLSENGEHYHCIGGAAGYIDRKLLGEKHIYNWPTAYH---------------- 421
Query: 296 DAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGL 355
D P+ PFDPEGLL ++ + +GL G FK + +L+ + L++ G+
Sbjct: 422 DEPN--GVPFDPEGLLGTLTSIFMVYLGLQAGKCFDIFKTPKSIILHLLGLAAVYGISGM 479
Query: 356 SLDFVG--------MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L +G + +NK L+S+S+ + + + LLA +YF++
Sbjct: 480 LLATIGFGKIAEAIIPINKNLWSVSFVFVLSSMAFTLLAFLYFII 524
>gi|433652541|ref|YP_007296395.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
gi|433303074|gb|AGB28889.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
Length = 394
Score = 109 bits (272), Expect = 3e-21, Method: Compositional matrix adjust.
Identities = 109/352 (30%), Positives = 154/352 (43%), Gaps = 87/352 (24%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMP 99
SN TR Q RLISLDV RGLTV LMI V++ G I + HS WNG+TL D V P
Sbjct: 2 SNPTITRMSSQ--RLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFP 59
Query: 100 FFLFIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
FFLFI+GVS L+L F RK R L LF++G+ + F + + D+
Sbjct: 60 FFLFIMGVSTYLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DL 116
Query: 158 AQIRWMGVLQRIAIAY----LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLY 213
+ +R MGV+QRIA+ Y L+A C+ WL FR + A++ L
Sbjct: 117 SHLRIMGVMQRIALCYGATALIAVGCQRWLHD----------FRA-----MPAIIAALLG 161
Query: 214 LLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYR 273
L + Y+ A N + +D+ +LG HLY
Sbjct: 162 AYGALLLMGQGYAYD------------------------AAINLLSRVDQAVLGHAHLYH 197
Query: 274 KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH- 332
K +P DPEGL+S++ A L G + H +
Sbjct: 198 K----------------------------SPVDPEGLVSTLAAVAHTLAGFYVAHWALGP 229
Query: 333 FKD-----HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+D R M+ + + L +GL+L + + LNK ++S SY CL+ G
Sbjct: 230 GRDGAGPAARRSMMRFAAAGTVLAVVGLALSPL-LPLNKRVWSPSYVCLSCG 280
>gi|409990365|ref|ZP_11273749.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
Paraca]
gi|291567406|dbj|BAI89678.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938771|gb|EKN80051.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
Paraca]
Length = 378
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 112/369 (30%), Positives = 167/369 (45%), Gaps = 88/369 (23%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGV 107
+ + RLISLDVFRG+ +A MILV++ G + P + H+ W+G T D V P FL IVGV
Sbjct: 5 NTKIRLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAQWHGCTPTDVVFPSFLLIVGV 64
Query: 108 SLALTYKNFPCK--------VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQ 159
++A + F + + I R L L + GF + D+A
Sbjct: 65 AIAFSLSKFSPEHRLGGDGVPPSVYSRIGRRCLLLFLLGLILNGFPN--------YDLAN 116
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
IR MGVLQRIAIAY ++A+ + L+L R R W++++ Y L +
Sbjct: 117 IRIMGVLQRIAIAYGLSAI------------AILNLSR--RQLWLISIFTLIGYWLAMTM 162
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
+ VP + SP G+ P N ID+ ILG HL+R
Sbjct: 163 IPVPGY-----------SP----------GNLSPEGNLGAFIDQTILGSHHLWRG----- 196
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
P+DPEGL S+ ATVT +IG G + +
Sbjct: 197 -----------------------GPYDPEGLFSTAPATVTVIIGYLTGEWLKSQPRNSLT 233
Query: 340 MLNWII--LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISS 397
++N ++ LSS ++G + F +NKAL++ SY +TAG +LLA Y ++ + +
Sbjct: 234 VINLVMFALSSLVVGYLWGIWF---PINKALWTSSYVLVTAGWGLLLLAFCYGVIE-VKN 289
Query: 398 HLMLKKPFD 406
KPF+
Sbjct: 290 WRRWGKPFE 298
>gi|91078976|ref|XP_974454.1| PREDICTED: similar to heparan-alpha-glucosaminide
N-acetyltransferase [Tribolium castaneum]
gi|270003686|gb|EFA00134.1| hypothetical protein TcasGA2_TC002950 [Tribolium castaneum]
Length = 566
Score = 108 bits (271), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 90/362 (24%), Positives = 163/362 (45%), Gaps = 63/362 (17%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R+ S+DVFRG + +MI V+ GG +HS WNGLT+AD V P+FL+++GVS
Sbjct: 176 RKHPHRIKSIDVFRGFCIMIMIFVNYGGGKYWFFSHSVWNGLTVADLVFPWFLWLMGVSF 235
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
A++ + + V R+ ++ + F+ IN+ + I +R+ GVLQRI
Sbjct: 236 AVSLQAKLRRAVPRRQLVIGVMRR----SFILILLGIIINSNQNLQTIGSLRFPGVLQRI 291
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG--HWVVALVLTTLYLLLLYGLYVPDWQY 227
+ Y + + EI V S ++ W+ VL ++ + + VP
Sbjct: 292 GVCYFIVGMLEIIFTKRSEVESVSCIYDVAVAWPQWLCVTVLVVIHTCVTFLGDVPG--- 348
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPA----------CNA--VGMIDRKILGIQHLYRKP 275
CG RG GP C G IDR++ G +H+++ P
Sbjct: 349 -----------------CG-RGYLGPGGLDDNGRFYNCTGGVAGYIDRQVFG-EHMHKNP 389
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
+ + + + FDPEG+L ++ + +T G+ G + +++
Sbjct: 390 VCKKLYEIDVY------------------FDPEGILGTLTSVLTVYFGVQAGRTLNTYQN 431
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVGMH-----LNKALYSLSYTCLTAGASGVLLAGIYF 390
+ +++ W++ S LG +L + LNK L+SLS+ + +G + ++ A ++
Sbjct: 432 VKAKVIRWVVWGSLAGLLGGALCEFKQNDGLIPLNKQLWSLSFALVLSGMAFIIQAFLFV 491
Query: 391 MV 392
+V
Sbjct: 492 LV 493
>gi|410926267|ref|XP_003976600.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
partial [Takifugu rubripes]
Length = 497
Score = 108 bits (270), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 93/267 (34%), Positives = 133/267 (49%), Gaps = 51/267 (19%)
Query: 24 SAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAI 83
+ P E + LLPP S+ +RL SLD FRG+++ +M+ V+ GG
Sbjct: 264 TVPVTENI----LLPPPTSS----------KRLQSLDTFRGISLVIMVFVNYGGGRYWFF 309
Query: 84 NHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL-----RALNLFLLGI 138
H WNGLT+AD V P+F+FI+G S+AL+ + +TR ++L R+L LFL+G+
Sbjct: 310 RHESWNGLTVADLVFPWFMFIMGTSIALSVHAL-LRTGSTRLSLLGKVAWRSLQLFLIGL 368
Query: 139 FLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH------VSSK 192
F+ N + + +R GVLQR+A+AYLV A ++ L H VSS
Sbjct: 369 FIINP-----NYCQGPLSWGTLRIPGVLQRLALAYLVVACLDL-LVARAHLEIYTTVSST 422
Query: 193 LSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG 252
L Y WV L+L +++L L + L VPD T P G G G
Sbjct: 423 DVLL--YWPAWVCVLLLESVWLFLTFLLPVPD------CPTGYLGP-------GGIGDMG 467
Query: 253 --PACN--AVGMIDRKILGIQHLYRKP 275
P C A G IDR +LG +H+Y+ P
Sbjct: 468 LFPNCTGGAAGFIDRWLLGEKHIYQNP 494
>gi|431902215|gb|ELK08716.1| Heparan-alpha-glucosaminide N-acetyltransferase [Pteropus alecto]
Length = 585
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 110/401 (27%), Positives = 176/401 (43%), Gaps = 75/401 (18%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISL-------DVFRGLTVALMI 71
ISK+ ++ ++L L PS + Q + RL ++ D FRG+ + LM+
Sbjct: 146 ISKAINSRETDRLINSELGSPSRAGQLGDDTQLEAWRLSAVPLRLRCVDTFRGIALILMV 205
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 206 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKVT 265
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYG---VDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GIF I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 266 WRSFLLICIGIF--------IVNPNYCLGPLSWDKLRIPGVLQRLGVTYFVVAVLELLFA 317
Query: 185 G--------DGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSS 236
+ SS + + W+ L+L +++L L + L VP T
Sbjct: 318 KPVPESCTVERRCSSLQDIISSWP-QWLFILMLESIWLALTFFLPVPG------CPTGYL 370
Query: 237 SPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGP 292
P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 371 GP-------GGIGDLGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTKV------- 416
Query: 293 MPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIG 352
+DPEG+L ++ + V +G+ G +++++KD +L C +G
Sbjct: 417 -----------AYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILIRFTAWCCFLG 465
Query: 353 LGLSLDFVGMH-------LNKALYSLSY-TCLTAGASGVLL 385
L +S+ F + +NK L+S+SY T L++ A +LL
Sbjct: 466 L-MSVAFTKISENEGFIPVNKNLWSISYVTTLSSFAFFILL 505
>gi|333378010|ref|ZP_08469743.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
22836]
gi|332884030|gb|EGK04310.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
22836]
Length = 389
Score = 108 bits (269), Expect = 6e-21, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 161/359 (44%), Gaps = 93/359 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q RL+SLD+ RG+T+A MI+V++ G + + H+ W+GLT D V PFF+FI+G
Sbjct: 3 QKPSSRLLSLDILRGITIAGMIMVNNPGSWSYVYAPLGHAAWHGLTPTDLVFPFFMFIMG 62
Query: 107 VSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGF----FHGINNLKYGVDIA-- 158
+S ++ + NF K + R + +FL+G+ L G+ F N+L G DI
Sbjct: 63 ISTYISLRKFNFEFNKPTLFKILKRTVVIFLIGLGL--GWLSLSFRTFNSLS-GEDIGFF 119
Query: 159 -----------QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVAL 207
IR +GV+QR+A+ Y AL I++K KY +++ +
Sbjct: 120 ERFITAITNFEHIRILGVMQRLALTYGATALIAIFVK------------HKYI-PYIIVV 166
Query: 208 VLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILG 267
L +LLLL+G G N + ++DR ILG
Sbjct: 167 TLIGYFLLLLFG----------------------------NGFDFSEDNIISVLDRAILG 198
Query: 268 IQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG 327
H+Y+ D G DPEGLLS++ A LIG G
Sbjct: 199 ADHMYK---------------DSG-----------LAIDPEGLLSTIPAICHVLIGFCCG 232
Query: 328 HLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+++ KD+ +R+ I+ + + LG L + G +NK ++S ++ T G + +LA
Sbjct: 233 EILLTTKDNNERIQRLFIIGAIMTFLGFLLSY-GCPINKKIWSPTFVLATCGLASTMLA 290
>gi|403303686|ref|XP_003942455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Saimiri boliviensis boliviensis]
Length = 631
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 112/397 (28%), Positives = 175/397 (44%), Gaps = 65/397 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSN-------SKQTRPQHQQRRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ S RP RL S+D FRG+ + LM+
Sbjct: 192 ISKAISSRETDRLINSELGSPSRTDPLDGDVQSAVWRPSSPPLRLRSVDTFRGIALILMV 251
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFP----CKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 252 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSIFLSMTSIMQRGCSKFRLLGKIA 311
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +GI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 312 WRSFLLICIGIIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPV 366
Query: 186 DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
H +S+ S W++ L L L+L L + L VP T P
Sbjct: 367 PEHCASERSCLSLQDITSSWPQWLLILALEGLWLGLTFLLPVPG------CPTGYLGP-- 418
Query: 241 FNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 419 -----GGIGDFGKYPNCTGGAAGYIDRLLLGDNHLYQHPSSAVLYHTEV----------- 462
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-GL 355
+DPEG+L ++ + + +G+ G +++++K +L C++GL +
Sbjct: 463 -------AYDPEGILGTINSILMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGLISV 515
Query: 356 SLDFVGMH-----LNKALYSLSY-TCLTAGASGVLLA 386
+L V + +NK L+SLSY T L++ A +LL
Sbjct: 516 ALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLV 552
>gi|305665862|ref|YP_003862149.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
gi|88710633|gb|EAR02865.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
Length = 362
Score = 107 bits (268), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 104/350 (29%), Positives = 158/350 (45%), Gaps = 89/350 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ R+IS+D+FRGLT+ LMILV+ G + H+ W+G T D V PFFLFIVG S
Sbjct: 1 MKNRVISVDIFRGLTIVLMILVNTPGTWSSVYTPFLHAEWHGYTPTDLVFPFFLFIVGTS 60
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQR 168
++ Y+ +K +R+L L LG+FL G F +K D A IR+ GVLQR
Sbjct: 61 ISFAYQKKKASTQTYKKIAVRSLKLIGLGLFL-GAFTLSFPFIK---DFADIRFPGVLQR 116
Query: 169 IAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYE 228
I + +L A+ + + + L G+W LL G YV
Sbjct: 117 IGVVFLFTAVLFVNFNWKTLLGICIVLL---VGYW------------LLMG-YV------ 154
Query: 229 FPVETSSSSPWIFNVTCGVRGSTGPACNAVG-MIDRKILGIQHLYRKPIYSRTKQCSINS 287
PVE G+ + A N + +D KI G H Y+
Sbjct: 155 -PVE-------------GIESTFDRAPNNLANYLDVKIFG-THNYK-------------- 185
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
PDY DPEG LS++ + + L G+ G ++ KD++ +
Sbjct: 186 PDY---------------DPEGFLSTLPSIASALTGVFTGLILTSKKDNK---------T 221
Query: 348 SCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
L+GLG+ + +G +NKAL+S S+ +T+G + + LA +Y++
Sbjct: 222 MVLVGLGVVMLALGYLWHTVFPINKALWSSSFVLVTSGWANIFLALVYYI 271
>gi|418746616|ref|ZP_13302939.1| PF07786 family protein [Leptospira santarosai str. CBC379]
gi|410792596|gb|EKR90528.1| PF07786 family protein [Leptospira santarosai str. CBC379]
Length = 375
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 114/369 (30%), Positives = 167/369 (45%), Gaps = 99/369 (26%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TV MILV++ G I + H+ WNG T D V PFFL
Sbjct: 2 EKQSTQNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFL 61
Query: 103 FIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
F VG S ++L KN + +R+ NL LLG+FL F G A++
Sbjct: 62 FAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLN---FFG------EWSFAEL 112
Query: 161 RWMGVLQRIAIAY-LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
R GVLQRI Y +VA+LC + K+ +F VA++L ++L
Sbjct: 113 RIPGVLQRIGFVYWVVASLCLV------FPGKKILVFS-------VAILLIHTWILT--- 156
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
Q P E+ S G A IDR I G +HL+R +S+
Sbjct: 157 ------QIALPGESVVSL------------EQGKDIGA--WIDRTIFGEKHLWR---FSK 193
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
T +DPEG LS V + VT L G+ G I+ ++ R++
Sbjct: 194 T------------------------WDPEGFLSGVASVVTTLFGVLCG-FILFLRERRNK 228
Query: 340 MLNWIILSSCLIGLGLSLDFVGMH------LNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+L GLG+ FVG+ +NK+L++ SY+ TAG S I+F
Sbjct: 229 IL----------GLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLS---FLSIWFF-E 274
Query: 394 YISSHLMLK 402
Y+SS ++ K
Sbjct: 275 YLSSLIISK 283
>gi|160883406|ref|ZP_02064409.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
gi|156111126|gb|EDO12871.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
Length = 371
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 158/348 (45%), Gaps = 74/348 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLD+ RG+T+ MILV++ G I + H+ WNGLT D V PFF+FI+GVS
Sbjct: 2 KSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGVS 61
Query: 109 LALTYKNFPCKVVA--TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD--IAQIRWMG 164
++ F K + R + LFLLG+FL + + GV+ + IR +G
Sbjct: 62 MSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-----WFSLVCTGVEQPFSHIRILG 116
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+A+AY +L + ++ +++ W+ ++L +LL G
Sbjct: 117 VLQRLALAYFFGSLLIVGVRRPANLA------------WISGIILAGYSILLALG----- 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N + + DR + G HLYR+ +
Sbjct: 160 -----------------------HGFELSEQNIIAVTDRTLFGEAHLYREWL-------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PD G + FDPEGLLS++ +IG G+++ + R+L
Sbjct: 189 ---PDGG----------RIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQIS 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
IL L+ G L + G LNK ++S ++ +T G + +LL + +++
Sbjct: 236 ILGIALLFAGWLLSY-GCPLNKKVWSPTFVLVTCGFASLLLVFLTWLI 282
>gi|423289836|ref|ZP_17268686.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
CL02T12C04]
gi|392666578|gb|EIY60091.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
CL02T12C04]
Length = 371
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 158/348 (45%), Gaps = 74/348 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLD+ RG+T+ MILV++ G I + H+ WNGLT D V PFF+FI+GVS
Sbjct: 2 KSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGVS 61
Query: 109 LALTYKNFPCKVVA--TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD--IAQIRWMG 164
++ F K + R + LFLLG+FL + + GV+ + IR +G
Sbjct: 62 MSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-----WFSLVCTGVEQPFSHIRILG 116
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+A+AY +L + ++ +++ W+ ++L +LL G
Sbjct: 117 VLQRLALAYFFGSLLIVGVRRPANLA------------WISGIILAGYSILLALG----- 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N + + DR + G HLYR+ +
Sbjct: 160 -----------------------HGFELSEQNIIAVTDRTLFGEAHLYREWL-------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PD G + FDPEGLLS++ +IG G+++ + R+L
Sbjct: 189 ---PDGG----------RIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQIS 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
IL L+ G L + G LNK ++S ++ +T G + +LL + +++
Sbjct: 236 ILGIALLFAGWLLSY-GCPLNKKVWSPTFVLVTCGFASLLLVFLTWLI 282
>gi|290999917|ref|XP_002682526.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
gi|284096153|gb|EFC49782.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
Length = 425
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/382 (24%), Positives = 158/382 (41%), Gaps = 113/382 (29%)
Query: 52 QQRRLISLDVFRGLTVALMILVDD--VGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++ R+++LD+ RG+T+ LMI+V++ +P ++H+ W G T D V PFFLF++G +
Sbjct: 62 RKERMVALDIMRGMTIMLMIIVNNQPARAFIP-LDHAEWFGFTPTDCVFPFFLFVMGYAA 120
Query: 110 ALTY-KNFPCKV-------------------------------------------VATRK 125
A+ Y + +P V + RK
Sbjct: 121 AIVYSREWPSDVYLYPPSHVKLSIQSYFRELCGKKQDLMDENEKKEEESIKFMYLIPMRK 180
Query: 126 AILRALNLFL------LGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
++ ++ ++ + +FL G F + +L + +R MGV QRIAI Y + +L
Sbjct: 181 SLYEFVSKWVKLFRRPILMFLIGFSFSVLAHL---FNFTHVRVMGVFQRIAICYFIVSLI 237
Query: 180 EIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
L + ++ ++ +Y+ + +GLYVP +E
Sbjct: 238 -------------LVMVPWTFVQILIVVLFQAIYITVTFGLYVP-------MEGEGDG-- 275
Query: 240 IFNVTCGVRGST-GPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAP 298
CG RG P C A G IDR IL H+Y
Sbjct: 276 -----CGTRGELYEPRCTAEGYIDRLILSRDHIY-------------------------- 304
Query: 299 SWCQAPFDPEGLLSSVMATVTCLIG-LHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL 357
Q +DPEG LSS+ A +G L F KD R+ W I+ S +I L++
Sbjct: 305 --LQDSYDPEGFLSSLSAVTNAFVGILAFKVARAAGKDAHKRLNYWFIMGSLMILAALAI 362
Query: 358 DFVGMHLNKALYSLSYTCLTAG 379
D+ G+ + K L++ S+ +T+G
Sbjct: 363 DYAGLPIGKKLWTTSFALITSG 384
>gi|383111974|ref|ZP_09932776.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
gi|423296747|ref|ZP_17274817.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
CL03T12C18]
gi|313696106|gb|EFS32941.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
gi|392669124|gb|EIY62615.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
CL03T12C18]
Length = 371
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 97/348 (27%), Positives = 158/348 (45%), Gaps = 74/348 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLD+ RG+T+ MILV++ G I + H+ WNGLT D V PFF+FI+GVS
Sbjct: 2 KSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGVS 61
Query: 109 LALTYKNFPCKVVA--TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD--IAQIRWMG 164
++ F K + R + LFLLG+FL + + GV+ + IR +G
Sbjct: 62 MSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-----WFSLVCTGVEQPFSHIRILG 116
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+A+AY +L + ++ +++ W+ ++L +LL G
Sbjct: 117 VLQRLALAYFFGSLLIVGVRRPANLA------------WISGIILAGYSILLALG----- 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N + + DR + G HLYR+ +
Sbjct: 160 -----------------------HGFELSEQNIIAVTDRTLFGEAHLYREWL-------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PD G + FDPEGLLS++ +IG G+++ + R+L
Sbjct: 189 ---PDGG----------RIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQIS 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
IL L+ G L + G LNK ++S ++ +T G + +LL + +++
Sbjct: 236 ILGIALLFAGWLLSY-GCPLNKKVWSPTFVLVTCGFASLLLVFLTWLI 282
>gi|345513910|ref|ZP_08793425.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
gi|229435722|gb|EEO45799.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
Length = 372
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/356 (26%), Positives = 155/356 (43%), Gaps = 78/356 (21%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVM 98
N+ + Q P ++RL+SLD RG+TVA MILV++ GG + + HS WNGLT D V
Sbjct: 7 NTETAQQAPP-IKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCDLVF 65
Query: 99 PFFLFIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
PFFLFI+G+S ++L NF + K + R + +G + G+F + + +
Sbjct: 66 PFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI--GWFDHVCEGDF-LP 122
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+R GVLQRIA+ Y V + +LF ++ + +L Y ++
Sbjct: 123 FVHLRIPGVLQRIALCYCVISFT--------------ALFMNHKFIPALTFILLVSYTVI 168
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L C G N + +IDR++ G HLY+K
Sbjct: 169 L---------------------------CMGNGYACDESNILSIIDRQLFGEAHLYQK-- 199
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
+P DPEG +S++ A IG +G I+
Sbjct: 200 --------------------------SPIDPEGFVSTLSAIAHTCIGFSYGKWIIQSHQT 233
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++L + LI +G L + LNK ++S ++ +T GA+ + LA + + +
Sbjct: 234 ENKVLRLFLTGFILISIGYLLA-DALPLNKRIWSPTFVLVTCGAASMSLATLMYYI 288
>gi|336416828|ref|ZP_08597160.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
3_8_47FAA]
gi|335937266|gb|EGM99170.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
3_8_47FAA]
Length = 371
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 98/348 (28%), Positives = 159/348 (45%), Gaps = 74/348 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLD+ RG+T+ MILV++ G I + H+ WNGLT D V PFF+FI+GVS
Sbjct: 2 KSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGVS 61
Query: 109 LALTYKNFPCKVVA--TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD--IAQIRWMG 164
++ F K + R + LFLLG+FL +F + GV+ + IR +G
Sbjct: 62 MSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS--WFSLVCT---GVEQPFSHIRILG 116
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+A+AY +L + ++ +++ W+ ++L +LL G
Sbjct: 117 VLQRLALAYFFGSLLIVGVRRPANLA------------WISGIILAGYSILLALG----- 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N + + DR + G HLYR+ +
Sbjct: 160 -----------------------HGFELSEQNIIAVTDRTLFGEAHLYREWL-------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PD G + FDPEGLLS++ +IG G+++ + R+L
Sbjct: 189 ---PDGG----------RIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQIS 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
IL L+ G L + G LNK ++S ++ +T G + +LL + +++
Sbjct: 236 ILGIALLFAGWLLSY-GCPLNKKVWSPTFVLVTCGFASLLLVFLTWLI 282
>gi|237711644|ref|ZP_04542125.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|423230937|ref|ZP_17217341.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
CL02T00C15]
gi|423244648|ref|ZP_17225723.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
CL02T12C06]
gi|229454339|gb|EEO60060.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|392630057|gb|EIY24059.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
CL02T00C15]
gi|392641497|gb|EIY35273.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
CL02T12C06]
Length = 372
Score = 107 bits (266), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 95/356 (26%), Positives = 155/356 (43%), Gaps = 78/356 (21%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVM 98
N+ + Q P ++RL+SLD RG+TVA MILV++ GG + + HS WNGLT D V
Sbjct: 7 NTETAQQAPP-IKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCDLVF 65
Query: 99 PFFLFIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
PFFLFI+G+S ++L NF + K + R + +G + G+F + + +
Sbjct: 66 PFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI--GWFDHVCEGDF-LP 122
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+R GVLQRIA+ Y V + +LF ++ + +L Y ++
Sbjct: 123 FVHLRIPGVLQRIALCYCVISFT--------------ALFMNHKFIPALTFILLVSYTVI 168
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L C G N + +IDR++ G HLY+K
Sbjct: 169 L---------------------------CMGNGYACDESNILSIIDRQLFGEAHLYQK-- 199
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
+P DPEG +S++ A IG +G I+
Sbjct: 200 --------------------------SPIDPEGFVSTLSAIAHTCIGFSYGKWIIQSHQT 233
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++L + LI +G L + LNK ++S ++ +T GA+ + LA + + +
Sbjct: 234 ENKVLRLFLTGFILISIGYLLA-DALPLNKRIWSPTFVLVTCGAASMSLATLMYYI 288
>gi|116789271|gb|ABK25182.1| unknown [Picea sitchensis]
Length = 124
Score = 107 bits (266), Expect = 1e-20, Method: Composition-based stats.
Identities = 55/98 (56%), Positives = 69/98 (70%), Gaps = 8/98 (8%)
Query: 18 QISKSTSAPANEKLERD-PLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDV 76
Q ST+ +N + +D +P +N +TR + +LDVFRGLT+A+MILVDD
Sbjct: 9 QKQPSTTESSNVIVIQDGQTIPAKPTNETKTR-------VATLDVFRGLTIAVMILVDDA 61
Query: 77 GGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
GG P INHSPWNG TLADFVMPFFLFIVGV++ALT+K
Sbjct: 62 GGKWPQINHSPWNGCTLADFVMPFFLFIVGVAVALTFK 99
>gi|340347656|ref|ZP_08670761.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
dentalis DSM 3688]
gi|339608850|gb|EGQ13733.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
dentalis DSM 3688]
Length = 386
Score = 107 bits (266), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 104/342 (30%), Positives = 150/342 (43%), Gaps = 85/342 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
+RLISLDV RGLTV LMI V++ G I + HS WNG+TL D V PFFLFI+GVS
Sbjct: 2 SSQRLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFPFFLFIMGVST 61
Query: 109 -LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
L+L F RK R L LF++G+ + F + + D++ +R MGV+Q
Sbjct: 62 YLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DLSHLRIMGVMQ 118
Query: 168 RIAIAY----LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
RIA+ Y L+A C+ WL FR + A++ L L
Sbjct: 119 RIALCYGATALIAVGCQRWLHD----------FRA-----MPAIIAALLGAYGALLLMGQ 163
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
+ Y+ A N + +D+ +LG HLY K
Sbjct: 164 GYAYD------------------------AAINLLSRVDQAVLGHAHLYHK--------- 190
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH-FKD-----HR 337
+P DPEGL+S++ A L G + H + +D R
Sbjct: 191 -------------------SPVDPEGLVSTLAAVAHTLAGFYVAHWALGPGRDGAGPAAR 231
Query: 338 DRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
M+ + + L +GL+L + + LNK ++S SY CL+ G
Sbjct: 232 RSMMRFAAAGTVLAVVGLALSPL-LPLNKRVWSPSYVCLSCG 272
>gi|383859754|ref|XP_003705357.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Megachile rotundata]
Length = 572
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 92/352 (26%), Positives = 162/352 (46%), Gaps = 59/352 (16%)
Query: 11 ANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALM 70
+++I++ + +P N + D L ++ R R+ S+D FRG+ + LM
Sbjct: 138 TTSKVMIRVIRGRLSPDNAPDDLDRLQETESTTHPVIRTTRASTRIRSVDTFRGIAILLM 197
Query: 71 ILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRA 130
I V++ GG NHS W GLT+AD V+P+F +I+G+++ ++ K +V +R I+
Sbjct: 198 IFVNNGGGKYVFFNHSAWYGLTVADLVLPWFAWIMGLTITIS-KRAELRVTVSRVKIMLH 256
Query: 131 LNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IWLKGDGHV 189
L + L G + I N + + +R+ GVLQ + ++Y V ++ E I++K
Sbjct: 257 CIRRSLVLILLGLMLNSIKNNSF----SDLRFPGVLQLLGVSYFVCSMLETIFMKPHSQD 312
Query: 190 S----SKLSLFRKYRGHW----VVALVLTTLYLLLLY--------GLYVPDWQYEFPVET 233
+ + + FR W V+A+++TT L+ G + P QYE+ +
Sbjct: 313 TLLQFGRFASFRDILDSWPQWLVMAVIMTTHTLITFLLPVPNCPKGYFGPGGQYEYRGK- 371
Query: 234 SSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
N T G A G IDR I G H+Y KP YG +
Sbjct: 372 ------YMNCTAG----------AAGYIDRLIFG-NHMYPKP----------KESIYGDI 404
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
+DPEGL++++ A +G+H G +++ + + R++ WI+
Sbjct: 405 ---------LRYDPEGLMNTISAIFIVYLGVHAGKILLLYYQYNSRVIRWIL 447
>gi|328790778|ref|XP_623715.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Apis mellifera]
Length = 572
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 101/408 (24%), Positives = 185/408 (45%), Gaps = 74/408 (18%)
Query: 14 QLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILV 73
+++I++ K +P N + D L N+ + R R+ S+D FRG+ + LMI V
Sbjct: 141 KVIIRVIKGKLSPNNIYDDLDRLQEAENATNLVVRTTKFSTRIHSVDTFRGIAILLMIFV 200
Query: 74 DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL----- 128
++ GG NHS W GL++AD V+P+F +I+G+ + ++ K ++ +R I
Sbjct: 201 NNGGGKYIFFNHSAWFGLSIADLVLPWFAWIMGLMITVS-KRTELRLTTSRIKITLYCLR 259
Query: 129 RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IWLKGDG 187
R+ L LG+ L N K + +R+ GVLQ + ++Y V A+ E I++K
Sbjct: 260 RSAILIFLGLML---------NSKDSESLHDLRFPGVLQLLGVSYFVCAILETIFMKPHS 310
Query: 188 ----HVSSKLSLFRKYRGHW----VVALVLTTLYLLLLY--------GLYVPDWQYEFPV 231
H + ++FR W ++A ++TT L+ G + P +Y F
Sbjct: 311 QDILHQFGRFAMFRDILESWPQWLIMAGIVTTHTLITFLLPISNCPKGYFGPGGEYHFRG 370
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
+ N T G A G IDR I G Y+ T+ YG
Sbjct: 371 K-------YINCTAG----------AAGYIDRLIFG------NHTYNHTENFL-----YG 402
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI---ILSS 348
+ +DPEGL++++ A +G+H G +++ + R++ W + +
Sbjct: 403 QI---------LRYDPEGLMNTISAIFIVYLGVHAGKILLLYYQCNSRVIRWFLWTVFTG 453
Query: 349 CLIGLGLSLDFVG--MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+ G+ + + G + ++K + +LSY + + + +L A +Y ++ Y
Sbjct: 454 IIAGILCNFETQGGIIPISKRMMTLSYVLICSSFAFLLYALLYVLIDY 501
>gi|374263976|ref|ZP_09622521.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
gi|363535543|gb|EHL28992.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
Length = 372
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 152/351 (43%), Gaps = 81/351 (23%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGV 107
+ +R++SLDVFRGLT+ALM+LV+ G I P ++H+ WNG TLAD V P FLFIVGV
Sbjct: 2 KEDSKRILSLDVFRGLTMALMVLVNSQGSRSIYPILDHAAWNGCTLADLVFPAFLFIVGV 61
Query: 108 SLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGGF-FHGINNLKYGVDIAQIRW 162
+ ++ A R K+IL+ L L F FH + + A +R
Sbjct: 62 TTVVSLNRQVTTNEAARLDIYKSILKRSILLFLFGLFLNAFPFH------FDLSFANLRI 115
Query: 163 MGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK-YRGHWVVALVLTTLYLLLLYGLY 221
G+LQRIAI Y + AL + + V +++ LF G+W
Sbjct: 116 YGILQRIAICYFICALIYL----NTTVKTQIILFWGILLGYWY----------------- 154
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
W + PV S G A N V +D+ I HL++
Sbjct: 155 ---WITQIPVPGFSG------------GQLSLANNWVAYVDKMIFSPVHLHKN------- 192
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML 341
FDPEGL+S++ A T L GL GH ++ + + L
Sbjct: 193 -----------------------FDPEGLISTISAVATTLAGLITGHFLLMQLSKKKKCL 229
Query: 342 NWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ + LG + + +NK L++ S+ T G S ++ A ++++
Sbjct: 230 LMFLVGMAFLVLGWAWGYY-FPINKNLWTSSFVLWTGGVSLIVFALCFYII 279
>gi|410096828|ref|ZP_11291813.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225445|gb|EKN18364.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
CL02T12C30]
Length = 367
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 96/341 (28%), Positives = 159/341 (46%), Gaps = 72/341 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q RL+SLDV RG+T+A MI+V++ G + + H+ WNGLT D V PFF+FI+GVS
Sbjct: 3 QSGRLLSLDVMRGITIAGMIMVNNPGSWGYVYAPLRHASWNGLTPTDLVFPFFMFIMGVS 62
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG--VDIAQIRWMGVL 166
+ + + + K+ +R+++ + L +L IFL G + +L Y + +R +GV+
Sbjct: 63 MFFSLRKYDFKL--SRESVTKVLKRTVL-IFLVGFALNLFGHLCYNGFSNFENLRILGVM 119
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QR+A+AY + +L LS+ KY ++ T +LL Y
Sbjct: 120 QRLALAYGIGSLI------------GLSVKHKY-------ILQTAAGILLFY-------- 152
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
WI G + T N + ++DR + G H+Y + T+
Sbjct: 153 ------------WILLAATGSQ--TLSENNIIAIVDRALFGNTHMYHDYLADGTR----- 193
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML-NWII 345
FDPEGLLS + + L+G + G +I+ K + + ++ N I
Sbjct: 194 ----------------IAFDPEGLLSCLGSIGHVLLGFYVGKMILDCKKNNELIIRNLFI 237
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+ ++ LG L + G +NK L+S ++ T G + LA
Sbjct: 238 FGTIILFLGFLLSY-GCPINKKLWSSTFVLTTCGFGSLFLA 277
>gi|270160204|ref|ZP_06188860.1| membrane protein, putative [Legionella longbeachae D-4968]
gi|289165026|ref|YP_003455164.1| hypothetical protein LLO_1691 [Legionella longbeachae NSW150]
gi|269988543|gb|EEZ94798.1| membrane protein, putative [Legionella longbeachae D-4968]
gi|288858199|emb|CBJ12067.1| putative membrane protein [Legionella longbeachae NSW150]
Length = 372
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 104/347 (29%), Positives = 160/347 (46%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVMPFFLFIVGVSL 109
Q R++SLDVFRGLT+ALM+LV+ +G + P + HS WNG TLAD V P FLFIVG++
Sbjct: 7 QNERILSLDVFRGLTMALMVLVNSLGTRISYPILLHSEWNGCTLADLVFPSFLFIVGMTT 66
Query: 110 ALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
++ K K R + LFLLGIFL N VDI+ IR G+L
Sbjct: 67 VISLKRHIKEESKTEIYYSIFKRTIILFLLGIFL--------NVFPKNVDISSIRIYGIL 118
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRIA+ YL+ A + H + + +F ++L +L G +
Sbjct: 119 QRIALCYLICAFIYL------HTTIRAQIF---------------IFLGILLGYWYFLAC 157
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
+ PV G+ T N VG ID+ + +HL +
Sbjct: 158 FHLPVS-------------GMNQLTITR-NWVGYIDQLLFSPKHLLFR------------ 191
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
FDPEGLLS++ + T L GL G+L++ + + + +++
Sbjct: 192 -----------------NFDPEGLLSTIPSIATTLSGLIAGNLLLAQIQKQKKCI--LMV 232
Query: 347 SSCLIGLGLS-LDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+S L+ L L+ L +NK L++ S+ +G S ++ + YF++
Sbjct: 233 ASGLVFLLLAWLWNYSFPINKNLWTSSFVLWCSGFSLIIFSFCYFII 279
>gi|226225918|ref|YP_002760024.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
gi|226089109|dbj|BAH37554.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
Length = 401
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 155/360 (43%), Gaps = 81/360 (22%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLDVFRG+TVA M+LV++ G I P + H+PW+G T D + PFFLFIVG++
Sbjct: 9 KAERLLSLDVFRGMTVAGMLLVNNPGTWSAIYPPLQHAPWHGWTPTDLIFPFFLFIVGIT 68
Query: 109 LALTYKNFPCK----VVATRKAILRALNLFLLGIFLQGGFFH----GINNLKYGVDI--- 157
L+ + + R+ + R +FL G+ L G F G+ +G +
Sbjct: 69 TELSLRARRARGDDEQAILRQILKRGALIFLFGLLLAGFPFFTWPPGLPGASFGERVIDR 128
Query: 158 -AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
R MGVLQRI +AYL AL L+ R V+ L Y L
Sbjct: 129 FEHWRIMGVLQRIGVAYLCGAL--------------LTWRTTVRQQGVILAALLFGYWAL 174
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGS---TGPACNAVGMIDRKILGIQHLYR 273
+ + VPD GV G P +DR +LG+ HL+
Sbjct: 175 MTLVPVPD--------------------TGVAGRFVLDKPDQLLSAWLDRTVLGVNHLWS 214
Query: 274 KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHF 333
A +W DPEGLLS++ A T + G G I
Sbjct: 215 ----------------------GAKTW-----DPEGLLSTIPAIGTMICGTFAGRWIARQ 247
Query: 334 K-DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ +R++ + + + +GL +V +NK++++ SY TAG V LA +++
Sbjct: 248 ELTLHERLVALFAVGALAMMVGLMWHWV-FPINKSIWTSSYVVFTAGTGAVTLATCMWLI 306
>gi|375149929|ref|YP_005012370.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361063975|gb|AEW02967.1| Protein of unknown function DUF2261, transmembrane [Niastella
koreensis GR20-10]
Length = 392
Score = 106 bits (264), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 105/373 (28%), Positives = 159/373 (42%), Gaps = 86/373 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q+R SLDVFRG TVA MILV++ G + + H+PW+G T D V PFFLF VG +
Sbjct: 1 MQKRFYSLDVFRGATVAFMILVNNPGSWSNLYAPLEHAPWHGCTPTDLVFPFFLFAVGNA 60
Query: 109 LALTYKNF--PCKVVATRKAILRALNLFLLGIFLQGGFFHGINN--LKYGV--------D 156
LA +K I R+ +FL+G FL F +N L + +
Sbjct: 61 LAFVMPRLQEAGTTAFLKKVITRSFLIFLIGFFLNWSPFIRWDNDHLTFKAWEYAGANGN 120
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLY--L 214
+ IR +GVLQRIA+ Y A+L + F K RG +V A VL Y L
Sbjct: 121 LIGIRILGVLQRIALCYFFASL--------------IIYFFKIRGAFVSAFVLLLGYWVL 166
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRK 274
+ +G +++ P+ N G+ +D+ ILG H+Y
Sbjct: 167 CMFFG--------------NAADPYSLNGYFGLG------------VDKAILGESHMYHG 200
Query: 275 PIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
FDPEG+ S++ A V + G G I K
Sbjct: 201 E--------------------------GVAFDPEGITSTLTAIVQVIFGYFVGFYIQQ-K 233
Query: 335 DHRDRMLNWIILSSC-LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
ML+ + ++ C LI G + D + +NK +++ SY T G + ++L+ F++
Sbjct: 234 GKNFEMLSHLFVAGCILIFTGYAWDMM-FPINKKIWTSSYVLYTTGLAILVLSLCIFLIE 292
Query: 394 YISSHLMLKKPFD 406
+ + + FD
Sbjct: 293 FKEAKGAWSRFFD 305
>gi|348507459|ref|XP_003441273.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Oreochromis niloticus]
Length = 460
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 102/401 (25%), Positives = 174/401 (43%), Gaps = 75/401 (18%)
Query: 12 NDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMI 71
++QL ++ S ++ K E S + + + RL+SLD FRG + +M+
Sbjct: 41 HNQLTEEVHASYTSDYCYKEESHA------SETAHGTVKAKPTRLLSLDTFRGFALTVMV 94
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAI 127
V+ GG H+PWNGLT+AD VMP+F+F++G S+ L + + + V+ RK
Sbjct: 95 FVNYGGGGYWFFQHAPWNGLTVADLVMPWFVFVIGTSVVLAFSSMQRRGVSRLQLLRKIT 154
Query: 128 LRALNLFLLGI-FLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGD 186
R + L LLG FL N + + R A A + LC +
Sbjct: 155 WRTVVLLLLGFCFL---------NYSPRDGPCSVLVLAEDPRSAAASGLHLLCSV---SP 202
Query: 187 GHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCG 246
+ + + Y W++ ++L TL+L L + + VP N G
Sbjct: 203 YNWWNPVQDILLYWPQWLIIILLETLWLCLTFLMPVP------------------NCPTG 244
Query: 247 VRGSTG-------PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDA 297
G+ G P C A G IDR + G ++YR P + +
Sbjct: 245 YLGAGGIGDNGLYPNCTGGAAGYIDRWMFG-DNMYRYPTCKEMYRTT------------- 290
Query: 298 PSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL 357
PFDPEG+L ++ + V +G+ G +++ +K +L ++ + ++G+ ++
Sbjct: 291 -----QPFDPEGVLGTINSIVIGFLGMQAGKILIFYKKMNIHILCRFLVWTVILGISAAI 345
Query: 358 ------DFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
D + +NK L+SLSY S +LL G++F+V
Sbjct: 346 LSKCTRDGGFIPVNKNLWSLSYVMCMGCLSFLLLGGMFFVV 386
>gi|212695334|ref|ZP_03303462.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
17855]
gi|212662113|gb|EEB22687.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
17855]
Length = 284
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 152/350 (43%), Gaps = 78/350 (22%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVM 98
N+ + Q P ++RL+SLD RG+TVA MILV++ GG + + HS WNGLT D V
Sbjct: 7 NTETAQQAPP-IKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCDLVF 65
Query: 99 PFFLFIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
PFFLFI+G+S ++L NF + K + R + +G + G+F + + +
Sbjct: 66 PFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI--GWFDHVCEGDF-LP 122
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+R GVLQRIA+ Y V + +LF ++ + +L Y ++
Sbjct: 123 FVHLRIPGVLQRIALCYCVISFT--------------ALFMNHKFIPALTFILLVSYTVI 168
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L C G N + +IDR++ G HLY+K
Sbjct: 169 L---------------------------CMGNGYACDESNILSIIDRQLFGEAHLYQK-- 199
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
+P DPEG +S++ A IG G I+
Sbjct: 200 --------------------------SPIDPEGFVSTLSAIAHTCIGFSCGKWIIQSHQT 233
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+++L + LI +G L V + LNK ++S ++ +T GA+ + LA
Sbjct: 234 ENKVLRLFLTGFILISIGYLLADV-LPLNKRIWSPTFVLVTCGAASMSLA 282
>gi|410956346|ref|XP_003984803.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Felis
catus]
Length = 629
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 110/399 (27%), Positives = 179/399 (44%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNS--KQTRP-----QHQQRRLISLDVFRGLTVALMI 71
ISK+ ++ ++L L PS ++S T+P RL +D FRG+ + LM+
Sbjct: 190 ISKAINSRETDRLINSELGSPSRADSLGGDTQPGGWCPPASPPRLRCVDTFRGIALILMV 249
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI---- 127
V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + + + K +
Sbjct: 250 FVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKLKLMGKIG 309
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYG---VDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +G+F I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 310 WRSFLLICIGMF--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAMLELIFA 361
Query: 185 G--DGHVSSKLSLFRKYR-----GHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
+S+ S F W+ L+L +++L L + L VP T
Sbjct: 362 KPVPESCASERSCFSLRDIIFSWPQWLFILMLESIWLGLTFFLPVPG------CPTGYLG 415
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG H+Y+ P + +
Sbjct: 416 P-------GGIGDLGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSAVLYHTEV-------- 460
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+DPEG+L ++ + V +G+ G +++++KD +L C +GL
Sbjct: 461 ----------AYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILIRFTAWCCFLGL 510
Query: 354 ------GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+S+SY T L++ A +LL
Sbjct: 511 ISIALTKISENEGFIPINKNLWSISYVTTLSSFAFFILL 549
>gi|265753064|ref|ZP_06088633.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236250|gb|EEZ21745.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 372
Score = 105 bits (263), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 155/356 (43%), Gaps = 78/356 (21%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVM 98
N+ + Q P ++RL+SLD RG+TVA MILV++ GG + + HS WNGLT D V
Sbjct: 7 NTETAQQAPP-IKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCDLVF 65
Query: 99 PFFLFIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
PFFLFI+G+S ++L NF + K + R + +G + G+F + + +
Sbjct: 66 PFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI--GWFDHVCEGDF-LP 122
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+R GVLQRIA+ Y V + +LF ++ + +L Y ++
Sbjct: 123 FVHLRIPGVLQRIALCYCVISFT--------------ALFMNHKFIPALTFILLVSYTVI 168
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L C G N + +IDR++ G HLY+K
Sbjct: 169 L---------------------------CMGNGYACDESNILSIIDRQLFGEAHLYQK-- 199
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
+P DPEG +S++ A IG G I+
Sbjct: 200 --------------------------SPIDPEGFVSTLSAIAHTCIGFSCGKWIIQSHQT 233
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++L + LI +G L V + LNK ++S ++ +T GA+ + LA + + +
Sbjct: 234 ENKVLRLFLTGFILISIGYLLADV-LPLNKRIWSPTFVLVTCGAASMSLATLMYYI 288
>gi|71278983|ref|YP_267171.1| hypothetical protein CPS_0413 [Colwellia psychrerythraea 34H]
gi|71144723|gb|AAZ25196.1| putative membrane protein [Colwellia psychrerythraea 34H]
Length = 358
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 93/355 (26%), Positives = 149/355 (41%), Gaps = 81/355 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R ++LD FRG+T+ALMILV+ G + + H+ W+G T D V PFFLFI+G ++
Sbjct: 3 RYLALDAFRGITIALMILVNTPGTWSHVYAPLLHAEWDGATPTDLVFPFFLFIIGSAMFF 62
Query: 112 TYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
++K NF RK I R +F +G L N + + V+ R MG+LQRI
Sbjct: 63 SFKKSNFSASPEQFRKIIKRGFIMFFIGFML--------NVIPFTVNAEDWRIMGILQRI 114
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
IAY VAA + L G + + + +LL Y
Sbjct: 115 GIAYTVAACLVLTLNRTG-------------------VFIASAVILLAY----------- 144
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
W ++ G G+ N + +D + G H+Y
Sbjct: 145 ---------WALLLSMG-EGALTIEGNIIRQLDLAVFGANHMYTMR-------------- 180
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
F+PEGLLS++ A V L+G + +D R ++ ++
Sbjct: 181 ------------GVAFEPEGLLSTIPAIVNMLLGFELTRYLTSIEDKRSSVIKLTLIGGL 228
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKP 404
+G G V + +NK+L++ SY + G + +LLA +++ I + L +P
Sbjct: 229 AVGFGALWGLV-LPINKSLWTPSYVIYSTGFACLLLAAFIWLID-IMKQVKLAEP 281
>gi|126304129|ref|XP_001381943.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Monodelphis domestica]
Length = 638
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 105/348 (30%), Positives = 166/348 (47%), Gaps = 54/348 (15%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFI 104
+ T P H RL SLD FRG+++ +MI V+ GG H WNGLT+AD V P+F+FI
Sbjct: 235 RLTPPVH---RLRSLDTFRGISLIIMIFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFI 291
Query: 105 VGVSLALTYKNFPCKVVATRKAILRAL-NLFLLGIFLQGGFFHGINNLKYGVDIAQIRWM 163
+G S+AL+ + + + K + + L FLL + G N + ++R
Sbjct: 292 MGSSIALSLSSMLRRGCSKWKLLGKILWRSFLLCVI--GVLIMNPNYCLGPLSWDKLRIP 349
Query: 164 GVLQRIAIAYLVAALCE-IWLKGDGHVSSKLSLFRKYRG------HWVVALVLTTLYLLL 216
GVLQR+ + YLV A+ E ++ K S+ SL ++ W+ L+L +++ +
Sbjct: 350 GVLQRLGLTYLVVAVLELLFAKAVPENSTMESLCASFQDIISYWPQWIFILMLEAVWVCV 409
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLY 272
+ L VP T P G G G P C A G IDR +LG H+Y
Sbjct: 410 TFLLPVPG------CPTGYLGP-------GGIGDFGKYPNCTGGAAGYIDRLLLGEDHIY 456
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
+ P SP + + +DPEGLL ++ + V +G+ G +++
Sbjct: 457 QHP-----------SP-------NVLYHTKVAYDPEGLLGTINSIVMAFLGVQAGKILLF 498
Query: 333 FKD-HRDRMLNWIILSSCL-IGLGLSLDFVG----MHLNKALYSLSYT 374
+KD H+ ML +++ S+ L I G+ F + +NK L+S+SY
Sbjct: 499 YKDQHKQIMLRFLLWSAMLAIISGVLTKFSQNEGFIPVNKNLWSISYV 546
>gi|422005552|ref|ZP_16352731.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
Shermani str. LT 821]
gi|417255773|gb|EKT85231.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
Shermani str. LT 821]
Length = 375
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 167/369 (45%), Gaps = 99/369 (26%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TV MILV++ G I + H+ WNG T D V PFFL
Sbjct: 2 EKQSTQNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFL 61
Query: 103 FIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
F VG S ++L KN + +R+ NL LLG+FL F G A++
Sbjct: 62 FAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLN---FFG------EWSFAEL 112
Query: 161 RWMGVLQRIAIAY-LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
R GVLQRI Y +VA+LC + K+ +F +V ++L ++L
Sbjct: 113 RIPGVLQRIGFVYWVVASLCLV------FPGKKILVF-------LVPILLIHTWILT--- 156
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
Q P E+ S G A IDR I G +HL+R +S+
Sbjct: 157 ------QIALPGESVVSL------------EQGKDIGA--WIDRTIFGEKHLWR---FSK 193
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
T +DPEG LS V + VT L G+ G I+ ++ R++
Sbjct: 194 T------------------------WDPEGFLSGVASVVTTLFGVLCG-FILFLRERRNK 228
Query: 340 MLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+L GLG+ FVG + +NK+L++ SY+ TAG S I+F
Sbjct: 229 IL----------GLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLS---FLSIWFF-E 274
Query: 394 YISSHLMLK 402
Y+SS ++ K
Sbjct: 275 YLSSLIISK 283
>gi|421110364|ref|ZP_15570862.1| PF07786 family protein [Leptospira santarosai str. JET]
gi|410804289|gb|EKS10409.1| PF07786 family protein [Leptospira santarosai str. JET]
Length = 375
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 113/369 (30%), Positives = 166/369 (44%), Gaps = 99/369 (26%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TV MILV++ G I + H+ WNG T D V PFFL
Sbjct: 2 EKQSTQNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFL 61
Query: 103 FIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
F VG S ++L KN + +R+ NL LLG+FL F G A++
Sbjct: 62 FAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLN---FFG------EWSFAEL 112
Query: 161 RWMGVLQRIAIAY-LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
R GVLQRI Y +VA+LC + K+ +F V ++L ++L
Sbjct: 113 RIPGVLQRIGFVYWVVASLCLV------FPGKKILVFS-------VPILLIHTWILT--- 156
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
Q P E+ S G A IDR I G +HL+R +S+
Sbjct: 157 ------QIALPGESVVSL------------EQGKDIGA--WIDRTIFGEKHLWR---FSK 193
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
T +DPEG LS V + VT L G+ G I+ ++ R++
Sbjct: 194 T------------------------WDPEGFLSGVASVVTTLFGVLCG-FILFLRERRNK 228
Query: 340 MLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+L GLG+ FVG + +NK+L++ SY+ TAG S I+F
Sbjct: 229 IL----------GLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLS---FLSIWFF-E 274
Query: 394 YISSHLMLK 402
Y+SS ++ K
Sbjct: 275 YLSSLIISK 283
>gi|372223192|ref|ZP_09501613.1| hypothetical protein MzeaS_12804 [Mesoflavibacter
zeaxanthinifaciens S86]
Length = 364
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 157/342 (45%), Gaps = 75/342 (21%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R++++D+FRG+T++LM+LV+ G + H+ W+G T D V PFFLFIVG S+
Sbjct: 4 NKRIVAVDIFRGMTISLMVLVNTPGTWSSVYSPFLHAQWHGYTPTDLVFPFFLFIVGTSI 63
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
YKN + +K +RAL L +LG+FL G F K D IR+ GVLQRI
Sbjct: 64 VFAYKNKKPSLKTYKKIGVRALKLIILGLFL-GAFTLSFPFFK---DFENIRFPGVLQRI 119
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
+ + ++S L L ++ V LVL Y L+ +
Sbjct: 120 GVVFF--------------ITSALFLNLNWKKLIFVVLVLLFGYWALMTLI--------- 156
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
PV+ S+ T N +D ++ G H Y+ PD
Sbjct: 157 PVDGQPST------------LTRAPNNLANWLDVQLFG-SHNYK--------------PD 189
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
Y DPEG LS++ A + L+G+ G ++V ++ + +L +
Sbjct: 190 Y---------------DPEGFLSTLPAIASALLGIFTGEILVGKQNKKTPLLFGLGGCLL 234
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
+IG + F +NKAL++ S+ +TAG + V+LA I+++
Sbjct: 235 IIGHLWDIVF---PINKALWTSSFVLVTAGWANVVLAIIHYL 273
>gi|334121382|ref|ZP_08495452.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
gi|333455096|gb|EGK83757.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
Length = 406
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 107/382 (28%), Positives = 148/382 (38%), Gaps = 119/382 (31%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R SLDVFRG+ +A MILV++ G + P ++H+ W+G T D + PFFLFIVG +++
Sbjct: 2 RFKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWHGCTPTDLIFPFFLFIVGCAMSF 61
Query: 112 TY----KNFPCKVVATRKAIL------------------------------RALNLFLLG 137
+ +N+P + T K RA LF+LG
Sbjct: 62 SLSKYTQNYPQTGIETSKITQTKEKSESAKNPLPSSLFLLPYSNIYWRIARRAAILFILG 121
Query: 138 IFLQGGFFHGINNLKYGVDI----------AQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
+ L N +D+ +IR MGVLQRI +AY ++A+ I L
Sbjct: 122 LLL--------NTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFISAIAIINLSPRN 173
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGV 247
KL G+W V F V
Sbjct: 174 Q---KLLAVAVLLGYWAALTV--------------------------------FAVGGYT 198
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP 307
G P N G +DR ILG QHLY+ PFDP
Sbjct: 199 AGELTPEGNLGGYVDRLILGSQHLYKG----------------------------GPFDP 230
Query: 308 EGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKA 367
EGLLS++ A VT LIG G + +N I + +G F+ +NK
Sbjct: 231 EGLLSTLPAVVTVLIGYFTGEWLRVQPIKTRTSINLAICGLSCVVIGRLWGFL-FPINKQ 289
Query: 368 LYSLSYTCLTAGASGVLLAGIY 389
L++ SY TAG + +LLA Y
Sbjct: 290 LWTSSYVVFTAGWALLLLATCY 311
>gi|284041413|ref|YP_003391343.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
DSM 74]
gi|283820706|gb|ADB42544.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
DSM 74]
Length = 364
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 104/352 (29%), Positives = 149/352 (42%), Gaps = 88/352 (25%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGV 107
Q RLISLD RG T+A MI+ + G + + HS WNGL+ D + PFFLFIVGV
Sbjct: 2 RQPHRLISLDAMRGFTIAAMIVANFPGSEEFVYFTLRHSRWNGLSFTDLIAPFFLFIVGV 61
Query: 108 SLALTYKNFPC----KVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWM 163
S+ L Y K +K +LR+L +F +G+FL NL D A +RW
Sbjct: 62 SIVLAYARKRADGSPKGPLIQKIVLRSLKIFAVGMFL---------NLLPDFDFATLRWT 112
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGH-WVVALVLTTLYLLLLYGLYV 222
G L RIAI +LV AL L L +R W+ L L Y L + +
Sbjct: 113 GTLHRIAIVFLVCAL--------------LFLTTSWRQQAWIATLTLLA-YWLAMTQIPT 157
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
PD G R P N +DR+ L
Sbjct: 158 PD--------------------VG-RVVLEPGQNLAAWLDRRYLP--------------- 181
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
G M Q +DPEG+LS+ + VT ++G+ G L+V ++
Sbjct: 182 --------GRM-------WQGTWDPEGILSTFPSIVTGILGMLAGRLMVSPASQTAKVSY 226
Query: 343 WIILSSCLIGLGLSLDFVGMH--LNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ LG F G+ +N+ L++ S+ +TAG + ++L +YF+V
Sbjct: 227 LMTAGVFTAALGY---FWGLTFPVNENLWTSSFVLVTAGFAALVLGAVYFLV 275
>gi|423241434|ref|ZP_17222547.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
CL03T12C01]
gi|392641327|gb|EIY35104.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
CL03T12C01]
Length = 372
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 95/356 (26%), Positives = 154/356 (43%), Gaps = 78/356 (21%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVM 98
N+ + Q P ++RL+SLD RG+TVA MILV++ GG + + HS WNGLT D V
Sbjct: 7 NTETAQQAPP-IKKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCDLVF 65
Query: 99 PFFLFIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
PFFLFI+G+S ++L NF + K + R + +G + G+F + + +
Sbjct: 66 PFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI--GWFDHVCEGDF-LP 122
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+R GVLQRIA+ Y V + +LF ++ + +L Y ++
Sbjct: 123 FVHLRIPGVLQRIALCYCVISFT--------------ALFMNHKFIPALTFILLVSYTVI 168
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L C G N + +IDR++ G HLY+K
Sbjct: 169 L---------------------------CMGNGYACDESNILSIIDRQLFGEAHLYQK-- 199
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
+P DPEG +S++ A IG G I+
Sbjct: 200 --------------------------SPIDPEGFVSTLSAIAHTCIGFSCGKWIIQSHQT 233
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++L + LI +G L + LNK ++S ++ +T GA+ + LA + + +
Sbjct: 234 ENKVLRLFLTGFILISIGYLLA-DALPLNKRIWSPTFVLVTCGAASMSLATLMYYI 288
>gi|297682811|ref|XP_002819101.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pongo
abelii]
Length = 645
Score = 105 bits (261), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 115/399 (28%), Positives = 175/399 (43%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISL-------DVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ Q RL +L D FRG+ + LM+
Sbjct: 206 ISKAVSSRETDRLINSELGSPSRTDPLDGDVQPAMWRLSALPPRLRSVDTFRGIALILMV 265
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 266 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 325
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 326 WRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 377
Query: 185 G--DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
H +S+ S W++ LVL L+L L + L VP T
Sbjct: 378 KPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPG------CPTGYLG 431
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 432 P-------GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEV-------- 476
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+DPEG+L +V + V +G+ G +++++K +L C++GL
Sbjct: 477 ----------AYDPEGILGTVNSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGL 526
Query: 354 -GLSLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
++L V + +NK L+SLSY T L++ A +LL
Sbjct: 527 ISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILL 565
>gi|410900570|ref|XP_003963769.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Takifugu rubripes]
Length = 581
Score = 105 bits (261), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 156/373 (41%), Gaps = 89/373 (23%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ + +RL+SLD FRG + +M+ V+ GG H+PWNGLT+AD VMP+F+FI+G S+
Sbjct: 194 KQRPKRLLSLDTFRGFALTVMVFVNYGGGGYWFFQHAPWNGLTVADLVMPWFVFIIGTSV 253
Query: 110 ALTYKNFPCKV---------VATRKAILRALNLFLLG-----------IFLQGGFFHGIN 149
L +++ + + R +L L L + Q + +
Sbjct: 254 VLAFRSMQRRRVRRLQLLRKITWRTGVLLMLGFCFLNYSPRDGPCSVLVLAQDSWSPAAS 313
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
L I RW +Q + + Y W++ ++L
Sbjct: 314 GLHLLHSITPHRWWSSVQDVVV---------------------------YWPQWLIIILL 346
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKI 265
TL+L + + + VPD + + G G G P C A G IDR +
Sbjct: 347 ETLWLCVTFLMPVPDCPTGY-------------LGAGGIGDHGLYPNCTGGAAGYIDRWM 393
Query: 266 LGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLH 325
G ++YR P Q + PFDPEG+L +V + V +G+
Sbjct: 394 FG-DNMYRYPTCKEMYQTT------------------QPFDPEGVLGTVNSIVMGFLGMQ 434
Query: 326 FGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL------DFVGMHLNKALYSLSYTCLTAG 379
G +++ ++ +L +L + ++G+ ++ D + +NK L+SLSY
Sbjct: 435 AGKILLFYRGADVHILCRFLLWAVILGISAAILTKCTRDEGFIPVNKNLWSLSYVTCMGC 494
Query: 380 ASGVLLAGIYFMV 392
S +LLAGIY +
Sbjct: 495 FSFLLLAGIYLVT 507
>gi|195399031|ref|XP_002058124.1| GJ15666 [Drosophila virilis]
gi|194150548|gb|EDW66232.1| GJ15666 [Drosophila virilis]
Length = 572
Score = 104 bits (260), Expect = 6e-20, Method: Compositional matrix adjust.
Identities = 92/304 (30%), Positives = 136/304 (44%), Gaps = 51/304 (16%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
Q++RL SLD FRGL++ LMI V+ GG I H+ WNGL LAD V P FL+I+GV + L
Sbjct: 180 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYSWIEHAAWNGLHLADLVFPSFLWIMGVCIPL 239
Query: 112 TYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ K+ + ++ + + R+ LF +G+ L N G + Q+R MGVLQ
Sbjct: 240 SIKSQLGRGISKSRICGRIVWRSCKLFAIGLCL---------NSTNGPQLEQLRLMGVLQ 290
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRK------YRGHWVVALVLTTLYLLLLYGLY 221
R IA+LV L +S + + R + G V L L YL L +GL
Sbjct: 291 RFGIAFLVVGLLHTVCSRRDQLSPQRAWQRAIYDICLFSGELAVLLALIAAYLGLTFGLP 350
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPAC--NAVGMIDRKILGIQHLYRKPIYSR 279
VP + + + P C A G IDR++LG H+Y+ P
Sbjct: 351 VPGCPRGYLGPGGKHN-----------NAANPNCIGGAAGYIDRQVLGNAHIYQHPT--- 396
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
DA + FDPEG+ +++ V L+G G ++ + R
Sbjct: 397 -----------AKYVYDATA-----FDPEGIFGCLLSVVQTLLGAFAGVTLLVHATWQAR 440
Query: 340 MLNW 343
+ W
Sbjct: 441 LKRW 444
>gi|113475212|ref|YP_721273.1| hypothetical protein Tery_1515 [Trichodesmium erythraeum IMS101]
gi|110166260|gb|ABG50800.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 366
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 111/345 (32%), Positives = 151/345 (43%), Gaps = 86/345 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+T+A MI+V++ G + P + H+ W+G T D + PFFLFI+GV++
Sbjct: 2 RLKSLDVFRGITIASMIIVNNPGSWNHVYPPLLHAKWHGCTPTDLIFPFFLFIMGVAMTF 61
Query: 112 -----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
T KN P + R I R + L L GF + ++A IR MGVL
Sbjct: 62 SLSKYTDKNQPIPHIYQR--IFRRCLIIFLFGLLLNGFPN--------YNLATIRVMGVL 111
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI++ YL+AA+ + L+L RK L L LL G ++
Sbjct: 112 QRISLVYLLAAI------------AILNLSRKQ---------LYGLATTLLIGYWIA--M 148
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
PV G+ P N IDR IL QHL
Sbjct: 149 QLIPVPGYG------------LGNLSPEGNFAAYIDRLILTQQHL--------------- 181
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII- 345
W +DPEGL S++ A VT LIG G + H + LN +I
Sbjct: 182 -------------WAGKQYDPEGLFSTLPAIVTVLIGYLTGEWLKHQSTNSRTTLNMVIS 228
Query: 346 -LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
LS ++G L F +NKAL++ SY TAG + + LA Y
Sbjct: 229 GLSCLVVGYLWGLFF---PINKALWTSSYVIFTAGWALLFLAICY 270
>gi|336415339|ref|ZP_08595679.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
3_8_47FAA]
gi|335940935|gb|EGN02797.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
3_8_47FAA]
Length = 371
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 146/331 (44%), Gaps = 77/331 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+RL++LDV RG+T+A MILV++ G + H+ WNGLT D V PFF+FI+G+S
Sbjct: 5 SNKRLLALDVIRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64
Query: 109 LALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
++ K NF A K I R + +FL+GI L +F + + QIR +GV+
Sbjct: 65 TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALN--WFALLCYTHNPLPFEQIRILGVM 122
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QR+A+ Y +AL + LK KY + +V L L +++L+ G
Sbjct: 123 QRLALCYGASALIALLLK------------HKYIPYLIVVL-LVGYFIILITG------- 162
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G N + ++DR ILG H+Y+
Sbjct: 163 ---------------------NGFAYNETNILSIVDRSILGDAHMYQ------------- 188
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGLLS++ + LIG G L++ KD +++ ++
Sbjct: 189 ---------------DNHIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEKLERLFLI 233
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
+ L G + G NK ++S ++ +T
Sbjct: 234 GTILTFAGFLFSY-GCPFNKKIWSPTFVLVT 263
>gi|51491261|emb|CAH18694.1| hypothetical protein [Homo sapiens]
Length = 459
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 112/397 (28%), Positives = 174/397 (43%), Gaps = 65/397 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISL-------DVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ Q RL +L D FRG+ + LM+
Sbjct: 20 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMV 79
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 80 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 139
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +GI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 140 WRSFLLICIGIIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPV 194
Query: 186 DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
H +S+ S W++ LVL L+L L + L VP T P
Sbjct: 195 PEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPG------CPTGYLGP-- 246
Query: 241 FNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G G G P C A G IDR +LG HLY+ P +
Sbjct: 247 -----GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYH-------------- 287
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-GL 355
+ +DPEG+L ++ + V +G+ G +++++K +L C++GL +
Sbjct: 288 ----TEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGLISV 343
Query: 356 SLDFVGMH-----LNKALYSLSY-TCLTAGASGVLLA 386
+L V + +NK L+SLSY T L++ A +LL
Sbjct: 344 ALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLV 380
>gi|441501363|ref|ZP_20983482.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
imtechensis AK7]
gi|441434899|gb|ELR68324.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
imtechensis AK7]
Length = 368
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 162/351 (46%), Gaps = 82/351 (23%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVG 106
++ +RL+SLDVFRG+T+A MI+V++ G + P + H+ W+G TL D V PFFLFIVG
Sbjct: 1 MNKNKRLLSLDVFRGITIAAMIVVNNPGSWAAVYPPLLHAGWHGCTLTDLVFPFFLFIVG 60
Query: 107 VSLALTYKNFPCKVVATRKAIL-----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIR 161
V++ L+ + K I+ R++ LFL+G+FL N Y D+ +R
Sbjct: 61 VAVCLSLSR-AVEDKGRHKQIIFTVLKRSVILFLIGLFL--------NAFPY-FDLYHLR 110
Query: 162 WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLY 221
GVLQRIA+ + + A F + W V + + + L++ + L+
Sbjct: 111 IPGVLQRIAVVFFICA------------------FLYLKTGWKVQVYIGSAILMVYWLLF 152
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
+ I + GS N +D ++L
Sbjct: 153 L-----------------IIPIPGAATGSLESGANLAAWVDSQLLT-------------- 181
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML 341
G M +W DPEG+LS++ A VT +IG+ G ++ + +++++
Sbjct: 182 ---------GHMWEVTKTW-----DPEGVLSTLPAIVTGIIGVLVGQWLMADRTEKEKVI 227
Query: 342 NWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ ++ LI GL D +NK+L++ SY TAG + LA +Y+++
Sbjct: 228 YLFVAANLLIVAGLFWDLF-FPINKSLWTSSYVLYTAGIAIHFLAFLYWLL 277
>gi|160885454|ref|ZP_02066457.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
gi|423290374|ref|ZP_17269223.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
CL02T12C04]
gi|156109076|gb|EDO10821.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
gi|392665761|gb|EIY59284.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
CL02T12C04]
Length = 371
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 146/331 (44%), Gaps = 77/331 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+RL++LDV RG+T+A MILV++ G + H+ WNGLT D V PFF+FI+G+S
Sbjct: 5 SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64
Query: 109 LALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
++ K NF A K I R + +FL+GI L +F + + QIR +GV+
Sbjct: 65 TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALN--WFALLCYTHNPLPFEQIRILGVM 122
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QR+A+ Y +AL + LK KY + +V L L +++L+ G
Sbjct: 123 QRLALCYGASALIALLLK------------HKYIPYLIVVL-LVGYFIILITG------- 162
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G N + ++DR ILG H+Y+
Sbjct: 163 ---------------------NGFAYNETNILSIVDRSILGDAHMYQ------------- 188
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGLLS++ + LIG G L++ KD +++ ++
Sbjct: 189 ---------------DNHIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEKLERLFLI 233
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
+ L G + G NK ++S ++ +T
Sbjct: 234 GTILTFAGFLFSY-GCPFNKKIWSPTFVLVT 263
>gi|397505549|ref|XP_003823319.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
1 [Pan paniscus]
Length = 585
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 112/396 (28%), Positives = 174/396 (43%), Gaps = 65/396 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISL-------DVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ Q RL +L D FRG+ + LM+
Sbjct: 146 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMV 205
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 206 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 265
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +GI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 266 WRSFLLICIGIIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPV 320
Query: 186 DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
H +S+ S W++ LVL L+L L + L VP T P
Sbjct: 321 PEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPG------CPTGYLGP-- 372
Query: 241 FNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 373 -----GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEV----------- 416
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-GL 355
+DPEG+L ++ + V +G+ G +++++K +L C++GL +
Sbjct: 417 -------AYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGLISV 469
Query: 356 SLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
+L V + +NK L+SLSY T L++ A +LL
Sbjct: 470 ALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILL 505
>gi|126272886|ref|XP_001369919.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Monodelphis domestica]
Length = 389
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 154/348 (44%), Gaps = 55/348 (15%)
Query: 65 LTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVAT- 123
L++ LMI V+ GG H+PWNGLT+AD VMP+F+FI+G S+ L + K V
Sbjct: 3 LSLTLMIFVNYGGGGYWFFEHAPWNGLTIADLVMPWFVFILGTSVGLAFHVMQRKGVKKF 62
Query: 124 ---RKAILRALNLFLLG-IFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
RK R L +G +FL G G + + R GVLQR+ Y + AL
Sbjct: 63 KLFRKVAWRTGVLIAIGALFLNYGPVDGPLSWSWA------RLPGVLQRLGFTYFIVALM 116
Query: 180 EIWLK----GDGHVSSKLSLFRK---YRGHWVVALVLTTLYLLLLYGLYVPDW--QYEFP 230
+I V + FR Y W++ + L ++L L + L VP Y P
Sbjct: 117 QIAFGVADMQKYQVGVWWAPFRDIVLYWQEWIIIIGLECIWLCLTFLLPVPGCPRGYLGP 176
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
+ FN T G A ID+ ILG HLYR P + +
Sbjct: 177 GGIGDEGKY-FNCTGG----------AAAYIDKWILGENHLYRFPSCKELYKTT------ 219
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
PFDPEG+L ++ + + GL G +I+ ++ +L ++ S L
Sbjct: 220 ------------QPFDPEGILGTINSIIMAFFGLQAGKIILMYRSKPRSILKRFLIWSVL 267
Query: 351 IGLGLSLDFVGMH------LNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+G+ ++ +G +NK L+SLS+ T+ S LL +Y+++
Sbjct: 268 LGIISAILTLGTQNEGFIPVNKNLWSLSFVTTTSCFSFFLLGLLYYII 315
>gi|325299496|ref|YP_004259413.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
gi|324319049|gb|ADY36940.1| hypothetical protein Bacsa_2393 [Bacteroides salanitronis DSM
18170]
Length = 356
Score = 104 bits (260), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 99/349 (28%), Positives = 152/349 (43%), Gaps = 87/349 (24%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILP--AINHSPWNGLTLADFVMPFFLFIVGVS-- 108
++RL+SLDV RG+TV MI+V++ GG ++ HS WNGLT D V PFFLFI+G+S
Sbjct: 2 KQRLLSLDVLRGITVFGMIVVNNAGGEYSYDSLRHSVWNGLTPCDLVFPFFLFIMGISTY 61
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD-----IAQIRWM 163
+AL F RK + R L +FL+G GI ++ + A IR +
Sbjct: 62 IALRKFQFQPSPAVLRKIVRRTLLIFLIG--------WGIYWFEFACEGDFLPFAHIRIL 113
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
GVL RIA+ Y + +L ++++ G W+ ++L LLL +G
Sbjct: 114 GVLPRIALCYGIVSLLALYVRPKGLA-------------WIAGILLLGYALLLQWG---- 156
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
G + N + + D K+LG +HLY K
Sbjct: 157 ------------------------NGYAMDSTNILAIWDTKVLGYEHLYHK--------- 183
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
+P DPEGL S++ A +IG G LI+ +D +++
Sbjct: 184 -------------------SPVDPEGLTSTLAAIAHTIIGFLCGRLIMETQDLGQKIVKL 224
Query: 344 IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ L G L + LNK ++S SY T G + +L A + + +
Sbjct: 225 FVAGFLLAAAGFLL-VEWLPLNKRIWSPSYALATCGMAAMLQATLLYFI 272
>gi|393783262|ref|ZP_10371437.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
CL02T12C01]
gi|392669541|gb|EIY63029.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
CL02T12C01]
Length = 365
Score = 104 bits (259), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 98/347 (28%), Positives = 158/347 (45%), Gaps = 84/347 (24%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDVG----GILPAINHSPWNGLTLADFVMPFFLFI 104
P +RL+SLDV RG+TVA MILV++ G G P + H+ W+G T AD V P F+F+
Sbjct: 4 PISTHQRLLSLDVLRGITVAGMILVNNAGACGYGYAP-LRHAKWDGFTPADLVFPMFMFL 62
Query: 105 VGVS--LALTYKNFPCKVVATRKAILRALNLFLLGI---FLQGGFFHGINNLKYGVDIAQ 159
+G+S ++L NF + + K I RA L L+GI +L F GI N D
Sbjct: 63 MGISTYISLRKYNFQWQ-LTIGKIIKRAFLLILIGIAMKWLIHSFETGIWN-----DWEH 116
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
+R +GV+QR+ I Y + A+ ++LF ++ +AL+L Y +L
Sbjct: 117 MRILGVMQRLGICYGITAV--------------MALFIPHKRFLPIALLLLIGYFIL--- 159
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
+ SP N + ++D +LG H+Y +
Sbjct: 160 --------QLAGNGFEKSP----------------DNIMAIVDSTVLGTSHMYLQG---- 191
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
+ +PEG+LS++ A +IG GH++++ KD+++R
Sbjct: 192 ----------------------RQFVEPEGILSTIPAVAQVMIGFVCGHMLINRKDNQER 229
Query: 340 MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
M + + L+ G L + LNK L+S S+ +T G + + LA
Sbjct: 230 MQQLFFMGTLLLFAGFLLSY-ACPLNKRLWSPSFVLVTCGIAALALA 275
>gi|212693969|ref|ZP_03302097.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
gi|265751179|ref|ZP_06087242.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|423228550|ref|ZP_17214956.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
CL02T00C15]
gi|423243815|ref|ZP_17224891.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
CL02T12C06]
gi|212663501|gb|EEB24075.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
gi|263238075|gb|EEZ23525.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|392635957|gb|EIY29845.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
CL02T00C15]
gi|392644181|gb|EIY37924.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
CL02T12C06]
Length = 363
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 152/342 (44%), Gaps = 84/342 (24%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS-- 108
+RL++LD+ RG+T+A MILV++ G + + H+ +NGLT D V PFF+FI+G+S
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFL----QGGFFHGINNLKYGVDIAQIRWMG 164
++L NF RK + R + +F +G+ L + F H +N + R++G
Sbjct: 67 ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLNLLAKSVFTHHLN-------FEEWRYLG 119
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
V+QR+AI Y V +L I +K KY ++ + L +LLL G
Sbjct: 120 VMQRLAIGYGVTSLVAITVK------------HKYFPA-IILVTLAAYFLLLATG----- 161
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N V D LG H+Y +
Sbjct: 162 -----------------------DGFNQSETNVVARFDAWALGTSHMYHE---------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
G M FDPEGLLS+V A ++G + G L++ KD+ +++
Sbjct: 189 ------GGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRLF 232
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ + L G L + G +NK ++S ++ +T G + LA
Sbjct: 233 LIGTILTFAGFLLSY-GCPINKKVWSPTFVIITCGLASSFLA 273
>gi|150378452|ref|NP_689632.2| heparan-alpha-glucosaminide N-acetyltransferase precursor [Homo
sapiens]
gi|332826066|ref|XP_519741.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pan
troglodytes]
gi|194385774|dbj|BAG65262.1| unnamed protein product [Homo sapiens]
gi|410222096|gb|JAA08267.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
gi|410256018|gb|JAA15976.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
gi|410299048|gb|JAA28124.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
Length = 635
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 114/399 (28%), Positives = 177/399 (44%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSN--SKQTRPQHQQ-----RRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ +P + RL S+D FRG+ + LM+
Sbjct: 196 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMV 255
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 256 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 315
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 316 WRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 367
Query: 185 G--DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
H +S+ S W++ LVL L+L L + L VP T
Sbjct: 368 KPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPG------CPTGYLG 421
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 422 P-------GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEV-------- 466
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+DPEG+L ++ + V +G+ G +++++K +L C++GL
Sbjct: 467 ----------AYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGL 516
Query: 354 -GLSLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
++L V + +NK L+SLSY T L++ A +LL
Sbjct: 517 ISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILL 555
>gi|345517324|ref|ZP_08796801.1| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
gi|345457717|gb|EET14395.2| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
Length = 365
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 155/356 (43%), Gaps = 79/356 (22%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVM 98
++Q P ++RL+SLD RG+TVA MILV++ GG + + HS WNGLT D V
Sbjct: 8 TETAQQALPI--KKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSAWNGLTPCDLVF 65
Query: 99 PFFLFIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
PFFLFI+G+S ++L NF + K + R + +G + G+F + + +
Sbjct: 66 PFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI--GWFDHVCEGDF-LP 122
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+R GVLQRIA+ Y V + +LF ++ + TL +L
Sbjct: 123 FVHLRIPGVLQRIALCYCVISFT--------------ALFMNHK-------FIPTLTFIL 161
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L V + C G T N + +IDR++ G HLY+K
Sbjct: 162 LVSYTV--------------------ILCMGNGYTCDESNILSIIDRQLFGEAHLYQK-- 199
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
+P DPEG +S++ A IG G I+
Sbjct: 200 --------------------------SPIDPEGFVSTLSAIAHTCIGFSCGKWIIQSHQT 233
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++L + L+ +G L + LNK ++S ++ +T GA+ + LA + + +
Sbjct: 234 ENKVLRLFLTGFILMSIGYLLA-DALPLNKRIWSPTFVLVTCGAASMSLATLMYYI 288
>gi|294777712|ref|ZP_06743163.1| putative membrane protein [Bacteroides vulgatus PC510]
gi|319640295|ref|ZP_07995020.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
gi|294448780|gb|EFG17329.1| putative membrane protein [Bacteroides vulgatus PC510]
gi|317388070|gb|EFV68924.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
Length = 372
Score = 104 bits (259), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 155/356 (43%), Gaps = 79/356 (22%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVM 98
++Q P ++RL+SLD RG+TVA MILV++ GG + + HS WNGLT D V
Sbjct: 8 TETAQQALPI--KKRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSAWNGLTPCDLVF 65
Query: 99 PFFLFIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
PFFLFI+G+S ++L NF + K + R + +G + G+F + + +
Sbjct: 66 PFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI--GWFDHVCEGDF-LP 122
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+R GVLQRIA+ Y V + +LF ++ + TL +L
Sbjct: 123 FVHLRIPGVLQRIALCYCVISFT--------------ALFMNHK-------FIPTLTFIL 161
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L V + C G T N + +IDR++ G HLY+K
Sbjct: 162 LVSYTV--------------------ILCMGNGYTCDESNILSIIDRQLFGEAHLYQK-- 199
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
+P DPEG +S++ A IG G I+
Sbjct: 200 --------------------------SPIDPEGFVSTLSAIAHTCIGFSCGKWIIQSHQT 233
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++L + L+ +G L + LNK ++S ++ +T GA+ + LA + + +
Sbjct: 234 ENKVLRLFLTGFILMSIGYLLA-DALPLNKRIWSPTFVLVTCGAASMSLATLMYYI 288
>gi|410332579|gb|JAA35236.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
Length = 635
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 114/399 (28%), Positives = 177/399 (44%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSN--SKQTRPQHQQ-----RRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ +P + RL S+D FRG+ + LM+
Sbjct: 196 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMV 255
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 256 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 315
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 316 WRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 367
Query: 185 G--DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
H +S+ S W++ LVL L+L L + L VP T
Sbjct: 368 KPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPG------CPTGYLG 421
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 422 P-------GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEV-------- 466
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+DPEG+L ++ + V +G+ G +++++K +L C++GL
Sbjct: 467 ----------AYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGL 516
Query: 354 -GLSLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
++L V + +NK L+SLSY T L++ A +LL
Sbjct: 517 ISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILL 555
>gi|150004749|ref|YP_001299493.1| transmembrane protein [Bacteroides vulgatus ATCC 8482]
gi|294775179|ref|ZP_06740705.1| putative membrane protein [Bacteroides vulgatus PC510]
gi|149933173|gb|ABR39871.1| putative transmembrane protein [Bacteroides vulgatus ATCC 8482]
gi|294450991|gb|EFG19465.1| putative membrane protein [Bacteroides vulgatus PC510]
Length = 363
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 152/342 (44%), Gaps = 84/342 (24%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS-- 108
+RL++LD+ RG+T+A MILV++ G + + H+ +NGLT D V PFF+FI+G+S
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFL----QGGFFHGINNLKYGVDIAQIRWMG 164
++L NF RK + R + +F +G+FL + F H +N ++R++G
Sbjct: 67 ISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLNLLAKSVFTHHLN-------FEELRYLG 119
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
V+QR+AI Y V +L I +K KY ++ + L +LLL G
Sbjct: 120 VMQRLAIGYGVTSLVAITVK------------HKYFPA-IILVTLAVYFLLLAMG----- 161
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G A N V D LG H+Y
Sbjct: 162 -----------------------DGFNLSATNIVARFDVWALGTSHMYHD---------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
G M FDPEGLLS++ A ++G + G L+ KD+ +++
Sbjct: 189 ------GGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQRLF 232
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ + L G L + G +NK ++S ++ T G + LA
Sbjct: 233 LVGTILTFAGFLLSY-GCPINKKVWSPTFVITTCGLASSFLA 273
>gi|319641808|ref|ZP_07996487.1| transmembrane protein [Bacteroides sp. 3_1_40A]
gi|345518546|ref|ZP_08797993.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
gi|254835931|gb|EET16240.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
gi|317386564|gb|EFV67464.1| transmembrane protein [Bacteroides sp. 3_1_40A]
Length = 363
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 152/342 (44%), Gaps = 84/342 (24%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS-- 108
+RL++LD+ RG+T+A MILV++ G + + H+ +NGLT D V PFF+FI+G+S
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGHVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFL----QGGFFHGINNLKYGVDIAQIRWMG 164
++L NF RK + R + +F +G+FL + F H +N ++R++G
Sbjct: 67 ISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLNLLAKSVFTHHLN-------FEELRYLG 119
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
V+QR+AI Y V +L I +K KY ++ + L +LLL G
Sbjct: 120 VMQRLAIGYGVTSLVAITVK------------HKYFPA-IILVTLAVYFLLLAMG----- 161
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G A N V D LG H+Y
Sbjct: 162 -----------------------DGFNLSATNIVARFDVWALGTSHMYHD---------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
G M FDPEGLLS++ A ++G + G L+ KD+ +++
Sbjct: 189 ------GGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQRLF 232
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ + L G L + G +NK ++S ++ T G + LA
Sbjct: 233 LVGTILTFAGFLLSY-GCPINKKVWSPTFVITTCGLASSFLA 273
>gi|29349027|ref|NP_812530.1| hypothetical protein BT_3619 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298386734|ref|ZP_06996289.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383124379|ref|ZP_09945043.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
gi|29340934|gb|AAO78724.1| putative transmembrane protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251839125|gb|EES67209.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
gi|298260408|gb|EFI03277.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 372
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/340 (28%), Positives = 152/340 (44%), Gaps = 77/340 (22%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVS 108
+RL++LDV RG+T+A MILV+ G A + H+ W GLT D V PFF+FI+G+S
Sbjct: 6 SNKRLLALDVMRGITIAGMILVNTPGSWQHAYAPLKHAEWIGLTPTDLVFPFFMFIMGIS 65
Query: 109 --LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
++L NF V A K + R + +FL+GI + + + +D QIR +GV+
Sbjct: 66 TYISLRKYNFTFSVPAGLKILKRTVIIFLIGIGISWLSILCFQHDPFPID--QIRILGVM 123
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QR+A+ Y V A+ + +K KY + + L+++ +L L YV D
Sbjct: 124 QRLALGYGVTAIVALLMK------------HKYIPYLIAVLLISYFAILALGNGYVYD-- 169
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
ET N + ++DR +LG H+Y I
Sbjct: 170 -----ET----------------------NILSIVDRAVLGQAHIYGGQI---------- 192
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGLLS++ A LIG G L++ KD +++ ++
Sbjct: 193 ------------------LDPEGLLSTISAIAHVLIGFCAGKLLMEVKDIHEKLERLFLI 234
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+ L G L + G + K ++S S+ +T G LA
Sbjct: 235 GTILTFAGFLLSY-GSPICKKVWSPSFVLVTCGLGSSFLA 273
>gi|124007195|sp|Q68CP4.2|HGNAT_HUMAN RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
AltName: Full=Transmembrane protein 76
Length = 663
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 114/399 (28%), Positives = 177/399 (44%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSN--SKQTRPQHQQ-----RRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ +P + RL S+D FRG+ + LM+
Sbjct: 224 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMV 283
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 284 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 343
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 344 WRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 395
Query: 185 G--DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
H +S+ S W++ LVL L+L L + L VP T
Sbjct: 396 KPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPG------CPTGYLG 449
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 450 P-------GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEV-------- 494
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+DPEG+L ++ + V +G+ G +++++K +L C++GL
Sbjct: 495 ----------AYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGL 544
Query: 354 -GLSLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
++L V + +NK L+SLSY T L++ A +LL
Sbjct: 545 ISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILL 583
>gi|237710367|ref|ZP_04540848.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|229455829|gb|EEO61550.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
Length = 366
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 92/342 (26%), Positives = 150/342 (43%), Gaps = 84/342 (24%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS-- 108
+RL++LD+ RG+T+A MILV++ G + + H+ +NGLT D V PFF+FI+G+S
Sbjct: 10 KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHASFNGLTPTDLVFPFFMFIMGISTY 69
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFL----QGGFFHGINNLKYGVDIAQIRWMG 164
++L NF RK + R + +F +G+ L + F H +N + R++G
Sbjct: 70 ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLNLLAKSVFTHHLN-------FEEWRYLG 122
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
V+QR+AI Y V +L I +K KY ++ + L +LLL G
Sbjct: 123 VMQRLAIGYGVTSLVAITVK------------HKYFPA-IILVTLAAYFLLLATG----- 164
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N V D LG H+Y +
Sbjct: 165 -----------------------DGFNQSETNVVARFDAWALGTSHMYHES--------- 192
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
FDPEGLLS+V A ++G + G L++ KD+ +++
Sbjct: 193 -----------------GMAFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRLF 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ + L G L + G +NK ++S ++ +T G + LA
Sbjct: 236 LIGTILTFAGFLLSY-GCPINKKVWSPTFVIITCGLASSFLA 276
>gi|345516841|ref|ZP_08796327.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
gi|229437727|gb|EEO47804.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
Length = 363
Score = 104 bits (259), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 152/342 (44%), Gaps = 84/342 (24%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS-- 108
+RL++LD+ RG+T+A MILV++ G + + H+ +NGLT D V PFF+FI+G+S
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFL----QGGFFHGINNLKYGVDIAQIRWMG 164
++L NF RK + R + +F +G+ L + F H +N + R++G
Sbjct: 67 ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLNLLAKSVFTHHLN-------FEEWRYLG 119
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
V+QR+AI Y V +L I +K KY ++ + L +LLL G
Sbjct: 120 VMQRLAIGYGVTSLVAITVK------------HKYFPA-IILVTLAAYFLLLATG----- 161
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N V D LG H+Y +
Sbjct: 162 -----------------------DGFNQSETNVVARFDAWALGTSHMYHE---------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
G M FDPEGLLS+V A ++G + G L++ KD+ +++
Sbjct: 189 ------GGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRLF 232
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ + L G L + G +NK ++S ++ +T G + LA
Sbjct: 233 LIGTILTFAGFLLSY-GCPINKKVWSPTFVIITCGLASSFLA 273
>gi|432099917|gb|ELK28811.1| Heparan-alpha-glucosaminide N-acetyltransferase [Myotis davidii]
Length = 586
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/402 (26%), Positives = 176/402 (43%), Gaps = 64/402 (15%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSN--SKQTRPQHQQR-----RLISLDVFRGLTVALMI 71
ISK+ ++ ++L L PS + +P+ + RL +D FRGL + LM+
Sbjct: 147 ISKAINSRETDRLINSELGSPSRAGLFGDDAQPEVWRLSAVPPRLRCVDTFRGLALILMV 206
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 207 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKVA 266
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +GI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 267 WRSFLLICIGIVIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPV 321
Query: 186 -DGHVSSK----LSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
+ VS + L W+V L+L +++L L + L VP T P
Sbjct: 322 PESCVSERRCSCLQDITSSWPQWLVILMLESVWLALTFFLPVPG------CPTGYLGP-- 373
Query: 241 FNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G G G P C A G IDR +LG H+Y+ P + ++
Sbjct: 374 -----GGIGDLGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSNVLYHTTV----------- 417
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL--- 353
+DPEG+L ++ + V +G+ G +++++KD +L C +GL
Sbjct: 418 -------AYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILIRFTAWWCFLGLISV 470
Query: 354 ---GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+S + + +NK L+S+SY + + LL +Y +V
Sbjct: 471 VLTKVSENEGFIPVNKNLWSISYVTTLSSFAFFLLLILYPIV 512
>gi|284040246|ref|YP_003390176.1| hypothetical protein Slin_5410 [Spirosoma linguale DSM 74]
gi|283819539|gb|ADB41377.1| Protein of unknown function DUF2261, transmembrane [Spirosoma
linguale DSM 74]
Length = 385
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 107/341 (31%), Positives = 154/341 (45%), Gaps = 75/341 (21%)
Query: 55 RLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL+SLD FRGLTVA MILV+ D G I + H+PW+G T D + PFFLFIVGVS+
Sbjct: 22 RLLSLDFFRGLTVAAMILVNNPGDWGHIYAPLEHAPWHGWTPTDLIFPFFLFIVGVSITF 81
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
+ K K + R++ LFLL N DI +R GVLQRIA+
Sbjct: 82 ALEGGKSKKGVVGKIVKRSVTLFLL---------GLFLNFFPKFDITLVRIPGVLQRIAV 132
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
YLV +L I+LK +S+ L+ + +++ Y LL+ + VP Y
Sbjct: 133 VYLVCSL--IFLK----TNSRQQLY--------ILVIVLIGYWLLMTVVPVPGVGY---- 174
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
+ PA N D IL H+Y+
Sbjct: 175 -----------------ANLEPATNLAAWFDYTILTPAHVYKP----------------- 200
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
A +W DPEG+LS++ A T LIG+ G + + D++ W+ + CL
Sbjct: 201 -----AKTW-----DPEGVLSTLPAVGTGLIGMLVGTWLRSSRPVADKV-AWLFATGCLA 249
Query: 352 GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
LG + +NKAL++ SY L G + + LA Y+++
Sbjct: 250 TLGGVMWDGFFPINKALWTSSYVLLAGGLAMLGLALCYWLI 290
>gi|395842491|ref|XP_003794051.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Otolemur garnettii]
Length = 677
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 110/399 (27%), Positives = 177/399 (44%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQ-------HQQRRLISLDVFRGLTVALMI 71
ISK+ ++ ++L L PS ++S + Q RL +D FRG+++ LM+
Sbjct: 238 ISKTINSREMDRLINSELGSPSRADSLGSDAQPATWHLSTHPPRLRCVDTFRGISLTLMV 297
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAI 127
V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + + + RK
Sbjct: 298 FVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLSMTSVLQRGCSKGRLLRKIA 357
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 358 WRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 409
Query: 185 G--DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
+ +S+ F W++ L L +++L L + L VP T
Sbjct: 410 KPVPENCASQRGCFSLGDVTSSWPQWLLILTLESVWLCLTFFLPVPG------CPTGYLG 463
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G ID +LG HLY P + +
Sbjct: 464 P-------GGIGDLGKYPNCTGGAAGYIDHLLLGENHLYHHPSSAVLYHTEV-------- 508
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+DPEG+L ++ + V +G+ G +++++KD +L C++GL
Sbjct: 509 ----------AYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILMRFAGWCCILGL 558
Query: 354 -GLSLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
++L V + +NK L+S+SY T L+ A +LL
Sbjct: 559 ISVALTKVSENEGFIPVNKNLWSISYVTTLSCFAFFILL 597
>gi|428319268|ref|YP_007117150.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
nigro-viridis PCC 7112]
gi|428242948|gb|AFZ08734.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
nigro-viridis PCC 7112]
Length = 406
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 106/372 (28%), Positives = 144/372 (38%), Gaps = 119/372 (31%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+ +A MILV++ G + P ++H+ WNG T D V PFFLFIVG +++
Sbjct: 2 RLKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWNGCTPTDLVFPFFLFIVGCAMSF 61
Query: 112 TY----KNFPCKVVATRKAI------------------------------LRALNLFLLG 137
+ +N+P + T K I RA LF+LG
Sbjct: 62 SLSKYIQNYPKTGIETSKIIQKNEKLESDKNPFPSSFFLLPASNIYWRIARRAAILFILG 121
Query: 138 IFLQGGFFHGINNLKYGVDI----------AQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
+ L N +D+ +IR MGVLQRI +AY + A+ + L
Sbjct: 122 LLL--------NTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFIGAIAILNLSPR- 172
Query: 188 HVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGV 247
+ KL G+W V F V
Sbjct: 173 --NQKLLAAAVLLGYWGALTV--------------------------------FAVGGYT 198
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP 307
G P N G +DR ILG QHLY+ PFDP
Sbjct: 199 AGELTPEGNLGGYVDRLILGSQHLYKG----------------------------GPFDP 230
Query: 308 EGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKA 367
EGLLS++ A VT LIG G + +N I + +G F+ +NK
Sbjct: 231 EGLLSTLPAVVTVLIGYFTGEWLRVQPIKTRTSINLAICGLSCVVIGHLWGFL-FPINKQ 289
Query: 368 LYSLSYTCLTAG 379
L++ SY TAG
Sbjct: 290 LWTSSYVVFTAG 301
>gi|281351504|gb|EFB27088.1| hypothetical protein PANDA_006846 [Ailuropoda melanoleuca]
Length = 557
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 110/401 (27%), Positives = 175/401 (43%), Gaps = 75/401 (18%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNS-------KQTRPQHQQRRLISLDVFRGLTVALMI 71
ISK+ ++ ++L L PS + S + P RL +D FRG+ + LM+
Sbjct: 118 ISKAINSRETDRLINSELGSPSRAGSIGGDAQLEAWHPPAASPRLRCVDTFRGIALILMV 177
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 178 FVNYGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLSMTSVLQRGCSKFKLLGKIA 237
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYG---VDIAQIRWMGVLQRIAIAYLVAALCEIWL- 183
R+ L +G+ I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 238 WRSFLLICIGVV--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELIFA 289
Query: 184 --------KGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSS 235
G S + +F W+ L+L +++L L + L VP T
Sbjct: 290 KPVPESCASERGCFSLRDIIFS--WPQWLFILMLESIWLGLTFFLPVPG------CPTGY 341
Query: 236 SSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
P G G G P C A G IDR +LG H+Y+ P + +
Sbjct: 342 LGP-------GGIGDWGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSAVLYHTEV------ 388
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
+DPEG+L ++ + V +G+ G +++++KD +L C +
Sbjct: 389 ------------AYDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDILIRFTAWCCFL 436
Query: 352 GL-GLSLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
GL ++L V + +NK L+S+SY T L++ A +LL
Sbjct: 437 GLISVALTKVSENEGFIPINKNLWSISYVTTLSSFAFFILL 477
>gi|383110831|ref|ZP_09931649.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
gi|313694406|gb|EFS31241.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
Length = 371
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 92/331 (27%), Positives = 145/331 (43%), Gaps = 77/331 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+RL++LDV RG+T+A MILV++ G + H+ WNGLT D V PFF+FI+G+S
Sbjct: 5 SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64
Query: 109 LALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
++ K NF A K I R + +FL+GI L +F + + QIR +GV+
Sbjct: 65 TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALN--WFALLCYTHNPLPFEQIRILGVM 122
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QR+A+ Y +AL + LK KY + +V L L +++L+ G
Sbjct: 123 QRLALCYGASALIALLLK------------HKYIPYLIVVL-LVGYFIILITG------- 162
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G N + ++DR ILG H+Y+
Sbjct: 163 ---------------------NGFAYNETNILSIVDRSILGDAHMYQ------------- 188
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGLLS++ + LIG G L++ KD +++ ++
Sbjct: 189 ---------------DNHIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEKLERLFLI 233
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
+ L G + G NK ++S ++ T
Sbjct: 234 GTILTFAGFLFSY-GCPFNKKIWSPTFVLAT 263
>gi|348577435|ref|XP_003474490.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cavia porcellus]
Length = 638
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 106/407 (26%), Positives = 176/407 (43%), Gaps = 70/407 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQR-------RLISLDVFRGLTVALMI 71
+SK+ ++ ++L L PS ++ PQ + R RL LD FRG+ + LM+
Sbjct: 195 VSKAINSQETDRLINSDLGSPSRADLLTGEPQPETRCLPAPGYRLRCLDTFRGIALILMV 254
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K+ K
Sbjct: 255 FVNYGGGRYWYFRHSSWNGLTVADLVFPWFVFIMGSSVFLSVTSVLQRGCSKLKLLGKIA 314
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYG---VDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 315 WRSFLLICIGIV--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFT 366
Query: 185 GDGHVS----SKLSLFRKYRGHW--------VVALVLTTLYLLLLYGL-YVPDWQYEFPV 231
H + + R W + +L L +LL + G YV + Y P
Sbjct: 367 KPVHENCVSDRRFPFLRDITCSWPQWLLILLLESLWLGLTFLLPVPGCPYVSEPGYLGPG 426
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
++ N T G A G ID +LG HLY+ P + +
Sbjct: 427 GIGDLGKYV-NCTGG----------AAGYIDHLLLGSDHLYQHPSSAVLYHTKV------ 469
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
+DPEG+L ++ + V +G+ G +++++KD +L C++
Sbjct: 470 ------------AYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILIRFTAWCCVL 517
Query: 352 GL------GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+ +S + + +NK L+S+SY + S +L +YF++
Sbjct: 518 GVISAALTKMSENEGFIPVNKNLWSISYVTTLSTFSFCILLVLYFII 564
>gi|351712254|gb|EHB15173.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
[Heterocephalus glaber]
Length = 537
Score = 103 bits (257), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/407 (27%), Positives = 173/407 (42%), Gaps = 74/407 (18%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQR-------RLISLDVFRGLTVALMI 71
+SK+ ++ + L L PS ++ PQ + + RL LD FRG+ + LM+
Sbjct: 157 VSKAINSQETDHLINSDLGSPSGADPLTGDPQPEAQCASASGHRLRCLDTFRGIALVLMV 216
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 217 FVNYGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLSMTSVLQRGCSKFKLLGKIA 276
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYG---VDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 277 WRSFLLICIGIV--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 328
Query: 185 GD-------GHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
L W++ L+L ++L L + L VP T
Sbjct: 329 KPIPENCVLERSCPSLRDITSSWSQWLLILLLEGIWLGLTFLLPVPG------CPTGYLG 382
Query: 238 PWIFNVTCGVRGSTGPACNAVG----MIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G N G ID +LG HLY+ P + +
Sbjct: 383 P-------GGIGDLGKYANCTGGAARYIDHLLLGSDHLYQHPSSTVLYHTEV-------- 427
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
P+DPEG+L ++ + V +G+ G +++ +K +L C++GL
Sbjct: 428 ----------PYDPEGILGTINSIVMAFLGVQAGKILLCYKGQTKDILIRFTAWCCVLGL 477
Query: 354 GLSLDFVGMH-------LNKALYSLSY-TCLTAGASGVLLAGIYFMV 392
+S+ M +NK L+S+SY T L+A A +LLA +Y +V
Sbjct: 478 -ISVALTKMSENKGFIPINKNLWSISYVTTLSAFAFFILLA-LYLVV 522
>gi|374385780|ref|ZP_09643283.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
12061]
gi|373225482|gb|EHP47816.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
12061]
Length = 382
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/356 (26%), Positives = 156/356 (43%), Gaps = 82/356 (23%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ RL++LDVFRG+T+A MILV+D G + + H+ WNGLT D V PFF+FI+G
Sbjct: 1 MKTENRLLALDVFRGITIAGMILVNDPGSWSAVYAPLCHASWNGLTPTDLVFPFFMFIMG 60
Query: 107 VSLALTYKNFPCKVV--ATRKAILRALNLFLLGI-----FLQGGFFHGINNLKYG----- 154
+S+ + + + A K RA+ +FL+G+ L G F + N ++
Sbjct: 61 ISMYFSLRRYNSLFSRGAVAKIFRRAVLIFLIGLGINWFALWFGTFMSMGNGEFTFWERF 120
Query: 155 ----VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
+A IR +GVLQR+A+AYL A+ + + + LF ++L
Sbjct: 121 TQNIFPVADIRILGVLQRLALAYLGGAILCL------GIRPRYQLF-------TAVMILV 167
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQH 270
+++L+ G G N + ++DR +LG++H
Sbjct: 168 GYFVILVVG----------------------------EGFIRSEHNILSVVDRAVLGVRH 199
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
LY S FDPEGLLS++ L G+ G ++
Sbjct: 200 LYGGGASSGAGM---------------------AFDPEGLLSTLPCFAHVLFGVCMGRML 238
Query: 331 VHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
K++ R+ I + L+ G + +NK ++S +Y ++ GA+ +L A
Sbjct: 239 GEVKENEIRIRQLFIFGTILLFAGYLWSY-ACPVNKRIWSPTYVLISCGAASLLFA 293
>gi|423239671|ref|ZP_17220787.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
CL03T12C01]
gi|392645711|gb|EIY39434.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
CL03T12C01]
Length = 363
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 151/342 (44%), Gaps = 84/342 (24%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS-- 108
+RL++LD+ RG+T+A MILV++ G + + H +NGLT D V PFF+FI+G+S
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHVAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFL----QGGFFHGINNLKYGVDIAQIRWMG 164
++L NF RK + R + +F +G+ L + F H +N + R++G
Sbjct: 67 ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLNLLAKSVFTHHLN-------FEEWRYLG 119
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
V+QR+AI Y V +L I +K KY ++ + L +LLL G
Sbjct: 120 VMQRLAIGYGVTSLVAITVK------------HKYFPA-IILVTLAAYFLLLATG----- 161
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N V D LG H+Y +
Sbjct: 162 -----------------------DGFNQSETNVVARFDAWALGTSHMYHE---------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
G M FDPEGLLS+V A ++G + G L++ KD+ +++
Sbjct: 189 ------GGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRLF 232
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ + L G L + G +NK ++S ++ +T G + LA
Sbjct: 233 LIGTILTFAGFLLSY-GCPINKKVWSPTFVIITCGLASSFLA 273
>gi|332241088|ref|XP_003269721.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Nomascus leucogenys]
Length = 654
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 111/396 (28%), Positives = 174/396 (43%), Gaps = 65/396 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQ-------QRRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS +S Q RL S+D FRG+ + LM+
Sbjct: 215 ISKAISSRETDRLINSELGSPSRIDSLDGDVQPALWRLSALPPRLRSVDTFRGIALILMV 274
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 275 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMASILQRGCSKFRLLGKIA 334
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +GI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 335 WRSFLLICIGIIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPV 389
Query: 186 DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
H +S+ S W++ LVL L+L L + L VP +
Sbjct: 390 PEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPGCPIGY----------- 438
Query: 241 FNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
+ G G G P C A G IDR +LG HLY+ P + +
Sbjct: 439 --LGPGGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEV----------- 485
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-GL 355
+DPEG+L ++ + V +G+ G +++++K +L C++GL +
Sbjct: 486 -------AYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGLISV 538
Query: 356 SLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
+L V + +NK L+SLSY T L++ A +LL
Sbjct: 539 ALTKVSENERFIPVNKNLWSLSYVTTLSSFAFFILL 574
>gi|375253854|ref|YP_005013021.1| hypothetical protein BFO_0041 [Tannerella forsythia ATCC 43037]
gi|363406758|gb|AEW20444.1| putative membrane protein [Tannerella forsythia ATCC 43037]
Length = 390
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 154/357 (43%), Gaps = 89/357 (24%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIV 105
P RRL++LD+ RG+T+A MI+V++ G + + H+ W+GLT D V PFF+FI+
Sbjct: 5 PDTSSRRLLALDILRGITIAGMIMVNNPGSWSFVYAPLGHAAWHGLTPTDLVFPFFMFIM 64
Query: 106 GVSLALTYK--NFPCKVVATRKAILRALNLFLLGIFL--------------QGGFFHGIN 149
G+S ++ K +F A RK I R +F +G+ L GG G
Sbjct: 65 GISTYISLKKYDFTFSYSAMRKIIRRTAVIFAIGLGLAWLGLTCRTWHGLADGGLSFGAR 124
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
+ + +R +GV+QR+A++Y AL + R +R +++ +L
Sbjct: 125 LWQSVSNFGHLRILGVMQRLALSYGATALIAL-------------AIRHHRIPYLIVALL 171
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+LLL G G N + ++DR +LG+
Sbjct: 172 GGYTVLLLAG----------------------------NGLAYNETNILSIVDRAVLGVN 203
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H Y+ D G +PEGLLS++ A LIG G
Sbjct: 204 HTYK---------------DMG-------------IEPEGLLSTLPAIAHVLIGFCCGRA 235
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ + RD+ML ++ + + G L + G +NK ++S ++ +T G + LLA
Sbjct: 236 MLGVTEVRDKMLRLFLIGTAMAFAGWLLSY-GCPINKKIWSPTFVLITCGMASGLLA 291
>gi|340727561|ref|XP_003402110.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Bombus terrestris]
Length = 571
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 101/390 (25%), Positives = 182/390 (46%), Gaps = 69/390 (17%)
Query: 11 ANDQLVIQISKSTSAPAN--EKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVA 68
+++I++ + +P N + L R L N+ R R+ S+D FRG+T+
Sbjct: 138 TTSKVIIRVIRGKLSPNNVYDDLNR---LQEENTTHPVIRVTKFSSRIQSVDAFRGITIL 194
Query: 69 LMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL 128
LMI V++ GG NHS W GL++AD ++P+F +I+G+S+ ++ K ++ +R I
Sbjct: 195 LMIFVNNGGGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITIS-KRAELRLTTSRVKIT 253
Query: 129 -----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE-IW 182
R+ L LLG+ L +N+L R+ GVLQ ++++Y V A+ E I+
Sbjct: 254 LCCLRRSAILILLGLMLNSIDSKSLNDL---------RFPGVLQLLSVSYFVCAILETIF 304
Query: 183 LKGDGH----VSSKLSLFRKYRGHW----VVALVLTTLYLLLLYGLYVPD--WQYEFPVE 232
+K + ++FR W ++A ++TT + L+ + L++P+ Y P
Sbjct: 305 MKPHSQDILLQFGRFAIFRDILDSWPQWLIMAGIMTT-HTLITFFLHIPNCPTGYFGPGG 363
Query: 233 TSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGP 292
N T G A G IDR I G H Y + I + YG
Sbjct: 364 KYHYRGKYMNCTAG----------AAGYIDRLIFG-NHTYSR----------ITNSIYGQ 402
Query: 293 MPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI---ILSSC 349
+ +DPEGL++++ A +G+H G +++ + R++ W + +
Sbjct: 403 I---------LRYDPEGLMNTISAIFIVYLGVHAGKILLLYYQGNARLIRWFLWAVFTGI 453
Query: 350 LIGLGLSLDFVG--MHLNKALYSLSY--TC 375
+ G+ + + G + ++K + +LSY TC
Sbjct: 454 IAGILCNFESEGGVIPVSKRMMTLSYVLTC 483
>gi|299140911|ref|ZP_07034049.1| membrane protein [Prevotella oris C735]
gi|298577877|gb|EFI49745.1| membrane protein [Prevotella oris C735]
Length = 370
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 100/332 (30%), Positives = 160/332 (48%), Gaps = 71/332 (21%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
Q+RLISLD+ RGL +A MILV++ G I + HS WNGLT D V PFF+F +G+++
Sbjct: 3 QQRLISLDMLRGLAMAGMILVNNPGSWSHIYVPLEHSVWNGLTPTDLVFPFFVFAMGMAM 62
Query: 110 ALTYKNFPC-KVVATRKAILRALNLFLLG-IFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ KN + RK + R++ LF++G + G + L + +Q+R MGVLQ
Sbjct: 63 GFSTKNLTALRASYLRKVMKRSVLLFVIGLLLTLLGRWLNTGELCF----SQLRVMGVLQ 118
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
R++++YLV AL +KG ++ +VV +L+ ++LLL G
Sbjct: 119 RLSLSYLVVALIVRRVKGVPTMT------------FVVVALLSGYWVLLLLG-------- 158
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
G A N V ++DR +LG HLY + +
Sbjct: 159 --------------------HGFDFSANNIVAVVDRWLLGESHLYIERL----------- 187
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
PD P+ FDPEGLLS++ L+G L+ ++ R+L ++
Sbjct: 188 PDGTPI----------AFDPEGLLSTIPCVAQVLLGYICSRLLCTSQELPQRILRLAVIG 237
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L+ GL L ++ LNK ++S ++ +T+G
Sbjct: 238 ALLLLSGLLLSYM-CPLNKKIWSSTFVMVTSG 268
>gi|119491291|ref|ZP_01623345.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
gi|119453455|gb|EAW34617.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
Length = 371
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/342 (29%), Positives = 154/342 (45%), Gaps = 67/342 (19%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+ +A MILV++ G + P + H+ W+G T D V P FLFIVGV++
Sbjct: 2 RLTSLDVFRGIAIASMILVNNPGSWNHVYPLLKHAEWHGYTPTDLVFPSFLFIVGVAMTF 61
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
+ + + + I + L +L L + N D+A IR MGVLQRI++
Sbjct: 62 SMSKYLPENRNLEENISPKIYLRILRRCLILFLLGLLLNGYPNYDLANIRIMGVLQRISL 121
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
AY ++A+ + H+S K W +++ L Y +++ + VP+
Sbjct: 122 AYGLSAITIL------HLSRKQI--------WGLSIGLLIGYAVVMQLIPVPN------- 160
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
GV T P N +DR ILG HL
Sbjct: 161 -------------SGVVNLT-PEGNFAAYLDRLILGEHHLL------------------- 187
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
+DPEGLLS++ A VT LIG G+ + + LN +I+ C I
Sbjct: 188 ---------GGGKYDPEGLLSTLPAVVTVLIGYLTGNWLKKQPINSQTSLNLVIIGLCNI 238
Query: 352 GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+G + +NK+L++ SY +TAG + VLLA Y ++
Sbjct: 239 IVGHLWGLI-FPINKSLWTSSYVLVTAGWALVLLAACYELIE 279
>gi|410638830|ref|ZP_11349383.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
lipolytica E3]
gi|410141358|dbj|GAC16588.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
lipolytica E3]
Length = 365
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 102/369 (27%), Positives = 160/369 (43%), Gaps = 100/369 (27%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R ISLD+FRG+T+A M+LV++ G + P + H+ W+GLT D + PFFLFIVG ++
Sbjct: 3 RQISLDIFRGITLAAMLLVNNPGSWSFVYPPLLHAKWHGLTPTDLIFPFFLFIVGAAMFH 62
Query: 112 TYKNF----------PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIR 161
+ + P + +A R +L A+ FLL IF G D R
Sbjct: 63 SMGRYLPKANQALQVPWQKIAKRTIVLFAIG-FLLNIFPFTG------------DPQNWR 109
Query: 162 WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLY 221
MGVLQRIAI Y +AA+ L +++ ++L G+W+
Sbjct: 110 IMGVLQRIAICYGIAAILICVLHQKQLIAACITLL---IGYWL----------------- 149
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
+ N+ V G N V +ID ++LG HLY+
Sbjct: 150 ------------------MLNL---VENPYGLETNLVRLIDIEVLGSAHLYQ-------- 180
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML 341
+G FDPEGLLS + A VT L G ++ + K + RM
Sbjct: 181 -------GFG-----------VAFDPEGLLSCIPAVVTVLAGFFTSKMLANAKTEQQRMK 222
Query: 342 NWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLML 401
++ S + ++ F +NK+L++ +Y T G + + LA I ++ H +
Sbjct: 223 TLLLWSLVTFVVCIAWQF-SFPVNKSLWTSTYVLATNGFAWLALAVIIYL------HDVK 275
Query: 402 KKPFDYSYA 410
K+ F + +A
Sbjct: 276 KQTFGFEWA 284
>gi|327275365|ref|XP_003222444.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Anolis carolinensis]
Length = 632
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 99/342 (28%), Positives = 151/342 (44%), Gaps = 57/342 (16%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGV----SL 109
RL SLD FRGL + +M+ V+ GG H WNGLT+AD V P+F+FI+G SL
Sbjct: 235 HRLRSLDTFRGLALIIMVFVNYGGGKYWFFKHQSWNGLTVADLVFPWFVFIMGTSISLSL 294
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
+ + K K + R+L LFL+G+ + N + +R GVLQR+
Sbjct: 295 SSMLRRGCSKWKLLGKILWRSLLLFLIGVIIVNP-----NYCLGPLSWENLRIPGVLQRL 349
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSL-------FRKYRGHWVVALVLTTLYLLLLYGLYV 222
+ Y V A+ E+ +S L + Y W+ + L T++L L + L V
Sbjct: 350 SCTYFVVAVLELLFAKPVPDNSTLEIPCPALRDILPYWPQWLFMMALETVWLCLTFLLNV 409
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYS 278
P P G G G P C A ID +LG +H+Y+ P +
Sbjct: 410 PG------CPNGYLGP-------GGIGDFGNYPNCTGGAAAYIDHVLLGEKHIYQHPSSN 456
Query: 279 RTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD-HR 337
Q ++ FDPEG+L ++ + + +GL G +++ +KD HR
Sbjct: 457 VLYQTTV------------------AFDPEGILGTINSVIMAFLGLQAGKILLFYKDQHR 498
Query: 338 DRMLNWIILSSCL-----IGLGLSLDFVGMHLNKALYSLSYT 374
ML + I S + + S D + +NK L+S+SY
Sbjct: 499 QIMLRFCIWSVVMGVISAVLTECSKDEGFIPVNKNLWSISYV 540
>gi|404485250|ref|ZP_11020448.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
YIT 11860]
gi|404338685|gb|EJZ65130.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
YIT 11860]
Length = 390
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 99/355 (27%), Positives = 159/355 (44%), Gaps = 91/355 (25%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+RL+SLD+ RG+T+A MI+V++ G I + H+ W GLT D V PFF+FI+G+S
Sbjct: 6 NKRLLSLDILRGITIAGMIMVNNPGSWGYIYAPLGHAEWIGLTPTDLVFPFFMFIMGIST 65
Query: 110 ALTYKNFPCKV--VATRKAILRALNLFLLGI-----------FLQGG-----FFHGINNL 151
++ + F ++ K I R + +F +G+ + Q G FF +
Sbjct: 66 YMSLRKFDFRLSGAVAWKIIRRTIVIFAIGLAIAWFGLTMRTYHQLGEESLSFFERLGRS 125
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
+ D IR +GV+ R+AI Y VAA + +K KY H +V++ L
Sbjct: 126 MWNFD--HIRILGVMPRLAICYGVAAFIALIVK------------HKYIPH-IVSVTLIA 170
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
+++L+ G +EF + N + ++DR ILG H+
Sbjct: 171 YFVILITGK-----GFEFSED-----------------------NIISVVDRAILGSNHM 202
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
Y D G DPEGLLS++ + L+G+ G LI+
Sbjct: 203 YH---------------DNG-----------LALDPEGLLSTIPSICHVLVGIFCGGLIM 236
Query: 332 HFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
KD+ RM N I + L GL L++ G ++K ++S ++ T G + LA
Sbjct: 237 RTKDNAVRMQNLFIAGTILTFAGLLLEY-GCPISKKIWSPTFVLTTCGLAASSLA 290
>gi|393785792|ref|ZP_10373938.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
CL02T12C05]
gi|392661411|gb|EIY54997.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
CL02T12C05]
Length = 361
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 93/334 (27%), Positives = 152/334 (45%), Gaps = 80/334 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG----GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
RL+SLDV RG TVA MILV++ G G P + H+ W+G T AD V P F+F++G+S
Sbjct: 4 NNRLLSLDVLRGFTVAGMILVNNAGACGYGYAP-LRHAKWDGFTPADLVFPMFMFLMGIS 62
Query: 109 LALTYKNFPCK-VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV--DIAQIRWMGV 165
++ + + + +A K I RAL L L+GI ++ IN+ + G+ D +R +GV
Sbjct: 63 TYISLRKYDFQWRLAIGKIIKRALLLILIGIAMK----WIINSSETGIWTDWEHMRLLGV 118
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
+QR+ I Y A+ ++LF ++ + VAL+L Y +L
Sbjct: 119 MQRLGICYGATAI--------------MALFIPHKRFFPVALLLLAGYFIL--------- 155
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
+ SP N + ++D +LG H+Y +
Sbjct: 156 --QLIGNGFEKSP----------------DNIIAIVDSTVLGTNHMYLQG---------- 187
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
+ +PEG+LS++ A +IG G +I++ KD+++RM
Sbjct: 188 ----------------RQFVEPEGILSTIPAIAQVMIGFVCGRMIINQKDNKERMQKLFF 231
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
L + ++ G + LNK L+S S+ LT G
Sbjct: 232 LGTLMLFAGFLFSY-ACPLNKRLWSPSFVLLTCG 264
>gi|355779672|gb|EHH64148.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
fascicularis]
Length = 596
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 172/397 (43%), Gaps = 65/397 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQ-------QRRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ Q RL S+D FRG+ + LM+
Sbjct: 157 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPAVWHLSVPPPRLRSVDTFRGIALILMV 216
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 217 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 276
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +GI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 277 WRSFLLICIGIIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPV 331
Query: 186 DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
H +S+ S W++ L L L+L L + L VP T P
Sbjct: 332 PEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVPG------CPTGYLGP-- 383
Query: 241 FNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 384 -----GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTEV----------- 427
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL--- 353
+DPEG+L ++ + V +G+ G +++++K +L C++GL
Sbjct: 428 -------AYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGLISV 480
Query: 354 ---GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLLA 386
+S + + +NK L+SLSY T L++ A +LL
Sbjct: 481 VLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLV 517
>gi|385809567|ref|YP_005845963.1| heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
album JCM 16511]
gi|383801615|gb|AFH48695.1| Heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
album JCM 16511]
Length = 378
Score = 102 bits (254), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 103/359 (28%), Positives = 161/359 (44%), Gaps = 74/359 (20%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RL+SLDVFRG+T+ MILV++ G + P + H+ W+G T D + PFFLFIVGV+++
Sbjct: 5 ERLVSLDVFRGITIMGMILVNNPGTWSAVYPQLLHAEWHGCTFTDLIFPFFLFIVGVAVS 64
Query: 111 LTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ + + + L R + LFLLGI L G F + ++ + +I GVL
Sbjct: 65 YSLTKRKAQGGSMKSLYLNIIRRTVILFLLGIILNGFPFGLLFGHQFSWETLRIP--GVL 122
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRIAI Y VAA + S+K +W A +L LY ++ + VP
Sbjct: 123 QRIAIVYFVAAFLFL------TTSTKFQ-------YWFTAAIL-ILYAAVMSFIPVPGIG 168
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
Y + P N ID+ ILG H+ +S TK
Sbjct: 169 Y---------------------ANFEPGKNLSAWIDQMILG-SHM-----WSGTKL---- 197
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+DPEG+LS++ A + ++G+ G+ + D +++++ I
Sbjct: 198 ------------------WDPEGILSTIPAIGSAMLGIFTGNWLRSENDQKEKVVYLFIW 239
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
++ L+ G F LNK L++ SY TAG + L Y+ + + KPF
Sbjct: 240 ANVLMLAGWIWSF-WFPLNKNLWTSSYVLWTAGLALHFLGFCYWFID-VKKITWWTKPF 296
>gi|209523049|ref|ZP_03271606.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376001698|ref|ZP_09779557.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
gi|423062475|ref|ZP_17051265.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
gi|209496636|gb|EDZ96934.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375329927|emb|CCE15310.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
gi|406716383|gb|EKD11534.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
Length = 378
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 109/357 (30%), Positives = 162/357 (45%), Gaps = 87/357 (24%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
++ + RLISLDVFRG+ +A MILV++ G + P + H+ W+G T D V P FL I+G
Sbjct: 4 KNTKMRLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAEWDGCTPTDVVFPSFLLIMG 63
Query: 107 VSLALTYKNF------PCKVV--ATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
V++A + F P + V + I R L L GF H D+A
Sbjct: 64 VAIAFSLSKFAREHRLPGEKVPPSVYSRIGRRCLLLFLLGLFLNGFPH--------YDLA 115
Query: 159 QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLY 218
IR MGVLQRIAIAY + A+ + L+L R R W+++++ Y + +
Sbjct: 116 NIRIMGVLQRIAIAYGLTAI------------AILNLSR--RQLWLISILTLIGYWVAMT 161
Query: 219 GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYS 278
+ VP S P G+ P N ID+ ILG HL+R
Sbjct: 162 IIPVP-----------SYGP----------GNLSPEGNLGAFIDQTILGSHHLWRG---- 196
Query: 279 RTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
P+DPEGL S+ ATVT ++G G + +
Sbjct: 197 ------------------------GPYDPEGLFSTAPATVTVILGYLTGEWLKSQPRNSF 232
Query: 339 RMLNWII--LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
++N ++ LSS ++G L V +NKAL++ S+ +TAG +LLA Y ++
Sbjct: 233 TVINLVMFALSSLVVGY---LWGVWFPINKALWTSSFVLVTAGWGLLLLAFCYGVIE 286
>gi|355697915|gb|EHH28463.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
mulatta]
Length = 596
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 110/397 (27%), Positives = 172/397 (43%), Gaps = 65/397 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQ-------QRRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ Q RL S+D FRG+ + LM+
Sbjct: 157 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPAVWHLSVPPPRLRSVDTFRGIALILMV 216
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 217 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSVLQRGCSKFRLLGKIA 276
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +GI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 277 WRSFLLICIGIIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPV 331
Query: 186 DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
H +S+ S W++ L L L+L L + L VP T P
Sbjct: 332 PEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVPG------CPTGYLGP-- 383
Query: 241 FNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 384 -----GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTEV----------- 427
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL--- 353
+DPEG+L ++ + V +G+ G +++++K +L C++GL
Sbjct: 428 -------AYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGLISV 480
Query: 354 ---GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLLA 386
+S + + +NK L+SLSY T L++ A +LL
Sbjct: 481 VLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLV 517
>gi|404487027|ref|ZP_11022214.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
YIT 11860]
gi|404335523|gb|EJZ61992.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
YIT 11860]
Length = 364
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 97/354 (27%), Positives = 157/354 (44%), Gaps = 90/354 (25%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGV 107
+RL+SLDV RG+TV MILV++ G A + H+ W+G T AD V P F+FI+GV
Sbjct: 5 RTSKRLVSLDVLRGITVCGMILVNNAGACGYAYAPLKHAKWDGFTPADLVFPAFMFIMGV 64
Query: 108 S--LALTYKNFPCKVVATRKAILRALNLFLLGI-------FLQGGFFHGINNLKYGVDIA 158
S L+L NF ++ R + R + +F+ G+ F+ G ++ + NL
Sbjct: 65 SIYLSLNKSNFDWRISIAR-ILRRTVLIFMSGVALKWILAFIATGEYNTLENL------- 116
Query: 159 QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLY 218
R MGVLQR+ I Y + AL V+ + LF ++A++L YLL L+
Sbjct: 117 --RIMGVLQRLGICYGIVALLA--------VTVRHRLFPT-----IIAVLLVGYYLLQLF 161
Query: 219 GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYS 278
G G A N V M+D +LG H+Y
Sbjct: 162 G----------------------------NGFEKCAGNIVSMVDYAVLGKSHMY------ 187
Query: 279 RTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
+ + DPEG+LS++ A +IG G +IV K+ R
Sbjct: 188 ------LGGAQF--------------VDPEGVLSTIPAIAQVMIGFLCGKVIVGEKEIRS 227
Query: 339 RMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++ + + + +G + LNK L+S S+ +T G + ++ A + +++
Sbjct: 228 QIVKLAVWGTSMFVIGYLWSYAA-PLNKRLWSPSFVLVTCGITSLIFATLIYII 280
>gi|223940501|ref|ZP_03632350.1| conserved hypothetical protein [bacterium Ellin514]
gi|223890825|gb|EEF57337.1| conserved hypothetical protein [bacterium Ellin514]
Length = 410
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 111/391 (28%), Positives = 162/391 (41%), Gaps = 100/391 (25%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDV-------------GGIL 80
D L PP S + T P +RL+SLD RG + ++ +++ G I
Sbjct: 3 DILSPPLQSKPQVTSPS-TTKRLLSLDALRGFDMFWIVGGEELVHALYNAWPNGPLGIIN 61
Query: 81 PAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK---NFPCKVVATRKAILRALNLFLLG 137
++H W G+ D + P F+FIVGVSL + K A ++ R+L L++ G
Sbjct: 62 SQMDHKVWQGVAFYDLIFPLFVFIVGVSLVFSLTKAIEVNGKAAALKRVFFRSLLLYVFG 121
Query: 138 IFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFR 197
+ + GG GI+ IRWMGVLQRIAI Y +L + F
Sbjct: 122 LLIYGGISKGIDG---------IRWMGVLQRIAICYFSTSL--------------VFCFF 158
Query: 198 KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW-------IFNV------- 243
K RG V A L Y L+ + PD V +S+SP NV
Sbjct: 159 KLRGMIVAAAALLLTYWALMTFVPFPD------VRPASASPQEITKHNGFTNVAQLNLSS 212
Query: 244 TCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQA 303
T + G P N +D+K L P Y W
Sbjct: 213 TTMLHGQFIPGVNLANYVDQKYL---------------------PGY--------KW-DG 242
Query: 304 PFDPEGLLSSVMATVTCLIGLHFGHLIV--HFKDHRDRMLNWIILSSCLIGLGLSLDFV- 360
+DPEGLLS++ A VTCL+G+ G L+ + D + +L L+ I G++L F+
Sbjct: 243 TYDPEGLLSTLPAIVTCLLGVFAGLLLRNPNVPDQKKVLL----LAGAGIA-GVALGFLW 297
Query: 361 GMHLN--KALYSLSYTCLTAGASGVLLAGIY 389
G+ K L++ SY + G + + LA Y
Sbjct: 298 GLEFPVIKKLWTSSYVLVAGGYACIFLAAFY 328
>gi|380789677|gb|AFE66714.1| heparan-alpha-glucosaminide N-acetyltransferase precursor [Macaca
mulatta]
gi|383410547|gb|AFH28487.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
gi|384945386|gb|AFI36298.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
Length = 635
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 112/399 (28%), Positives = 173/399 (43%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQ-------QRRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ Q RL S+D FRG+ + LM+
Sbjct: 196 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPAVWHLSVPPPRLRSVDTFRGIALILMV 255
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 256 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 315
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 316 WRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 367
Query: 185 G--DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
H +S+ S W++ L L L+L L + L VP T
Sbjct: 368 KPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVPG------CPTGYLG 421
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 422 P-------GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTEV-------- 466
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+DPEG+L ++ + V +G+ G +++++K +L C++GL
Sbjct: 467 ----------AYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGL 516
Query: 354 ------GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+S + + +NK L+SLSY T L++ A +LL
Sbjct: 517 ISVVLTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILL 555
>gi|347738959|ref|ZP_08870332.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
gi|346917867|gb|EGY00078.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
Length = 400
Score = 101 bits (252), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 106/329 (32%), Positives = 151/329 (45%), Gaps = 63/329 (19%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDV RGL VA MILV G A + H+PW+G TLAD V P FLF VG+++AL
Sbjct: 22 RLPSLDVLRGLAVAGMILVVSPGDWSKAYTPLKHAPWDGWTLADMVFPTFLFSVGLAIAL 81
Query: 112 TYKNFP-CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
++ + A K RAL L +LG+ L N L Y D+A +R G+LQRIA
Sbjct: 82 SFTKIAQNRRAAGVKIARRALALIVLGLVL--------NALPY-FDLAHLRLPGILQRIA 132
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ Y++A L + G S R +L + ++LL V W P
Sbjct: 133 LCYVLATLLCLVTARTGADGSPTVNQRA---------LLIAMAVVLLGYCAVLAW---VP 180
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
V + G P + IDR + + HL+
Sbjct: 181 VPGIGA------------GHLDPGGSLAAWIDRGVFTVPHLW------------------ 210
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
P DA +DPEGLLS++ ATV L+G+ G K R R LN ++ + L
Sbjct: 211 -PYGTDAAG--AVVYDPEGLLSTLPATVNVLVGVLAG---TALKASRSR-LNLLVAAVML 263
Query: 351 IGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ GL+LD V + +NK +++ S+ ++G
Sbjct: 264 MMAGLALDPV-LVINKRIWTSSFALFSSG 291
>gi|386819709|ref|ZP_10106925.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
gi|386424815|gb|EIJ38645.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
Length = 366
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 98/349 (28%), Positives = 147/349 (42%), Gaps = 77/349 (22%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ + R +SLDVFRGLT+ LMI+V+ G I ++H+PWNG TL D V P FLF+VG
Sbjct: 2 KQLKERYLSLDVFRGLTLFLMIIVNTPGSWSFIYKPLHHAPWNGFTLTDLVFPTFLFVVG 61
Query: 107 VSLALTYKNFP--CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+++ + K F +K R +FL+G L F LK I++ R G
Sbjct: 62 NAMSFSLKKFEEIGNTAFLKKVFKRTFLIFLIGFLLYWFPFFKDGALK---PISETRIFG 118
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQRIA+ Y AAL + K G + V+ALV + LLL
Sbjct: 119 VLQRIALCYCFAALILHYWKPKGALIFS-----------VIALVGYHIILLLF------- 160
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G NA D ++G H+Y+ +
Sbjct: 161 ------------------------GDLTMQGNAAIKADLWLIGSSHMYKGEGF------- 189
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PFDPEG+LS++ A V + G G + + +
Sbjct: 190 -------------------PFDPEGVLSTLPAIVNVIAGYFAGVFLQQKGKTYEAIAKLT 230
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
++ LI L L D + + +NK L++ SY LT G ++L+ + F++
Sbjct: 231 MVGGVLIFLALGWDLL-LPINKKLWTSSYVLLTVGIDVMVLSILVFVIE 278
>gi|423312333|ref|ZP_17290270.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
CL09T03C04]
gi|392688817|gb|EIY82101.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
CL09T03C04]
Length = 363
Score = 101 bits (252), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 94/342 (27%), Positives = 151/342 (44%), Gaps = 84/342 (24%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS-- 108
+RL++LD+ RG+T+A MILV++ G + + H+ +NGLT D V PFF+FI+G+S
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFL----QGGFFHGINNLKYGVDIAQIRWMG 164
++L NF RK + R + +F +G+FL + F H +N ++R++G
Sbjct: 67 ISLRKYNFTYSHAILRKIVKRTVVIFCIGLFLNLLAKSVFTHHLN-------FEELRYLG 119
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
V+QR+AI Y V +L I +K KY ++ + L +LLL G
Sbjct: 120 VMQRLAIGYGVTSLVAITVK------------HKYFPA-IILVTLAVYFLLLAMG----- 161
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N V D LG H+Y
Sbjct: 162 -----------------------DGFNLSVTNIVARFDVWALGTSHMYHD---------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
G M FDPEGLLS++ A ++G + G L+ KD+ +++
Sbjct: 189 ------GGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQRLF 232
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ + L G L + G +NK ++S ++ T G + LA
Sbjct: 233 LVGTILTFAGFLLSY-GCPINKKVWSPTFVITTCGLASSFLA 273
>gi|150009610|ref|YP_001304353.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
gi|262383102|ref|ZP_06076239.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|298374002|ref|ZP_06983960.1| membrane protein [Bacteroides sp. 3_1_19]
gi|149938034|gb|ABR44731.1| putative transmembrane protein [Parabacteroides distasonis ATCC
8503]
gi|262295980|gb|EEY83911.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|298268370|gb|EFI10025.1| membrane protein [Bacteroides sp. 3_1_19]
Length = 368
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 151/342 (44%), Gaps = 72/342 (21%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGV 107
Q RL+SLDV RG+T+A MILV++ G + + H+ WNGLT D V PFF+FI+GV
Sbjct: 3 QQSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGV 62
Query: 108 SLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
S+ + + NF + K + R + +FL+G+ L G D +R +GV
Sbjct: 63 SMFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN---LFGHVCYNGFTDFQNLRILGV 119
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
+QR+A+AY +L L++ KY V A +L + LL
Sbjct: 120 MQRLALAYGFGSLI------------GLAINHKYILQ-VAAGILIFYWALL--------- 157
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
+ +E S S + ++DR + G H+Y
Sbjct: 158 GFTHSMEMSEDS-------------------IIAIVDRTLFGTSHMYHD----------- 187
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML-NWI 344
D + FDPEGLLS + + L+G + G +I K + + ++ N
Sbjct: 188 ----------DMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIRNIF 237
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
I + ++ G L + G +NK ++S ++ +T G + + LA
Sbjct: 238 IFGTIILFAGFLLSY-GCPINKKIWSSTFVLVTCGFASLFLA 278
>gi|255013110|ref|ZP_05285236.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
gi|256838332|ref|ZP_05543842.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|410102572|ref|ZP_11297498.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
gi|423333958|ref|ZP_17311739.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
CL03T12C09]
gi|256739251|gb|EEU52575.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|409226793|gb|EKN19699.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
CL03T12C09]
gi|409238644|gb|EKN31435.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
Length = 368
Score = 101 bits (251), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 151/342 (44%), Gaps = 72/342 (21%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGV 107
Q RL+SLDV RG+T+A MILV++ G + + H+ WNGLT D V PFF+FI+GV
Sbjct: 3 QQSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGV 62
Query: 108 SLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
S+ + + NF + K + R + +FL+G+ L G D +R +GV
Sbjct: 63 SMFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN---LFGHVCYNGFTDFQNLRILGV 119
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
+QR+A+AY +L L++ KY V A +L + LL
Sbjct: 120 MQRLALAYGFGSLI------------GLAINHKYILQ-VAAGILIFYWALL--------- 157
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
+ +E S S + ++DR + G H+Y
Sbjct: 158 GFTHSMEMSEDS-------------------IIAIVDRTLFGTSHMYHD----------- 187
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML-NWI 344
D + FDPEGLLS + + L+G + G +I K + + ++ N
Sbjct: 188 ----------DMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIRNIF 237
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
I + ++ G L + G +NK ++S ++ +T G + + LA
Sbjct: 238 IFGTIILFAGFLLSY-GCPINKKIWSSTFVLVTCGFASLFLA 278
>gi|365877201|ref|ZP_09416706.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
gi|442587874|ref|ZP_21006688.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
gi|365755061|gb|EHM96995.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
gi|442562373|gb|ELR79594.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
Length = 400
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 113/382 (29%), Positives = 158/382 (41%), Gaps = 97/382 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ R SLDVFRG TVALMILV++ G I P + H+ W+G T D V PFFLF VG +
Sbjct: 2 KSARYYSLDVFRGATVALMILVNNPGTWSAIYPPLEHAKWHGCTPTDLVFPFFLFAVGNA 61
Query: 109 LALTYKNFP--CKVVATRKAILRALNLFLLGIFLQG-GFFH-GINNLKY----GVDIAQI 160
+ F V +K I R L +F +G+FL FF ++L + D +
Sbjct: 62 MTFVIPKFQQHNSSVFWKKVIKRTLLIFGIGLFLNWCPFFQWDHDSLSFISWESSDENGV 121
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R MGVLQRIAIAY A++ + K K+ L W+ +L +LL L+
Sbjct: 122 RIMGVLQRIAIAYFFASVIAYYFK------EKMVL-------WISGALLVIYWLLTLF-- 166
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
+ P+ GV ID ILGI H Y+
Sbjct: 167 ------------LGGTDPYSLEGFIGVP------------IDHSILGIAHEYKGE----- 197
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM 340
PFDPEGL S++ A L FG+LI ++ +
Sbjct: 198 ---------------------GVPFDPEGLFSTIPAISQVL----FGYLIGNYIQKKGN- 231
Query: 341 LNWIILS----------SCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVL 384
+ W S S L LG+ F+ NK ++S SYT LT G + ++
Sbjct: 232 IQWFGKSLKENSIYSMLSGLFILGIIALFISYVWQLDFPYNKKIWSSSYTLLTTGLAIMV 291
Query: 385 LAGIYFMVRYISSHLMLKKPFD 406
L + + + + L K FD
Sbjct: 292 LGVLIWFIEILEIRNGLMKFFD 313
>gi|158337501|ref|YP_001518676.1| hypothetical protein AM1_4380 [Acaryochloris marina MBIC11017]
gi|158307742|gb|ABW29359.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 383
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 105/363 (28%), Positives = 158/363 (43%), Gaps = 87/363 (23%)
Query: 59 LDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKN 115
LDVFRG+ +A M+LV+ G + P + H+ W+G TLAD V PFFLF++G S+A +
Sbjct: 13 LDVFRGIAIAGMLLVNKSGLVKDAYPQLQHADWHGWTLADLVFPFFLFVLGASMAFSMAR 72
Query: 116 FPCKVVATR-----KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ + K + R++ LF LG+FL G + +++ +R MG+LQRI+
Sbjct: 73 HTASLTQPKRRVYLKILRRSVVLFGLGLFLNG---------FWSYNLSTLRVMGILQRIS 123
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ YLV+AL L L RK W + +L Y L L + VP EF
Sbjct: 124 LTYLVSALV------------ILKLPRK--SQWGMTGLLLVGYWLALSFIPVP----EFG 165
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
G+ N +DR I+G HLY ++
Sbjct: 166 A-----------------GNLTRTGNFGAYVDRLIIGSSHLYVGDQFNSMG--------- 199
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI------VHFKDHRDRMLNWI 344
DPEGL S++ A T L+G G I + K R +
Sbjct: 200 ---------------DPEGLFSTLPAIATVLLGYFAGDWIRKRGSGLKIKTSRQSL---A 241
Query: 345 ILSSCLIGLGLSLDF-VGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKK 403
+ S LI GL L + + +NK L++ SY T G + +LLA Y ++ + + K
Sbjct: 242 LASYGLISTGLGLLWSIWFPINKKLWTSSYVLFTVGIALILLAVCYELIE-VRRIRLWSK 300
Query: 404 PFD 406
PF+
Sbjct: 301 PFE 303
>gi|393782159|ref|ZP_10370348.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
CL02T12C01]
gi|392674193|gb|EIY67642.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
CL02T12C01]
Length = 387
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 92/358 (25%), Positives = 157/358 (43%), Gaps = 93/358 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ +RL++LD+ RG+T+A MI+V++ G + + H+ WNGLT D V PFF+FI+G
Sbjct: 3 KTTSKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAQWNGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFLLGIFL----------------QGGFFHGI 148
+S ++L NF A K + R + +F +G+ L + FF +
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGLAWFSMFCRTWNSLSAEEISFFSRL 122
Query: 149 NNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALV 208
+ D IR +GV+QR+A+ Y A+ + +K H+ +++A +
Sbjct: 123 GQSIWTFD--HIRILGVMQRLALCYGATAIVALTMK-HKHIP------------YLIATL 167
Query: 209 LTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGI 268
L ++LL+ G G + N + ++DR +LG
Sbjct: 168 LIGYFILLVTG----------------------------NGFEYNSTNILSVVDRAVLGE 199
Query: 269 QHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGH 328
H+Y+ D G DPEGLLS++ A LIG G
Sbjct: 200 AHMYK---------------DNG-------------IDPEGLLSTIPAIAHVLIGFCVGK 231
Query: 329 LIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
L++ KD +++ ++ + L LG L + G +NK ++S ++ +T G LA
Sbjct: 232 LLMEVKDINEKLGRLFLIGTILTFLGFLLSY-GCPINKKIWSPTFAIVTCGLGSSFLA 288
>gi|344203119|ref|YP_004788262.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343955041|gb|AEM70840.1| hypothetical protein Murru_1800 [Muricauda ruestringensis DSM
13258]
Length = 375
Score = 100 bits (250), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 96/340 (28%), Positives = 142/340 (41%), Gaps = 77/340 (22%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFF 101
Q + R +SLDVFRGL VALMI+V+ G + H+ WNG TL D V P F
Sbjct: 6 NQPNMSKLKNRYLSLDVFRGLDVALMIIVNSPGNGSTTFSPLLHADWNGFTLTDLVFPTF 65
Query: 102 LFIVGVSLALTYKNFPC--KVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQ 159
LF+VG S++ + K + K +K + R +FLLG + F LK ++
Sbjct: 66 LFVVGNSMSFSMKKYESMGKPAFFKKVLKRTAIIFLLGFLMYWYPFFDDGQLK---PFSE 122
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
R GVLQRIA+ Y+ A++ L + W+ AL L +L+L
Sbjct: 123 TRVFGVLQRIALCYMFASII-------------LHFVKTKTAIWLSALFLVGYHLIL--- 166
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
+ G TG NAV +D ++G H+Y
Sbjct: 167 -----------------------IGFGDLTLTG---NAVLKLDEWLIGANHMYHGE---- 196
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
FDPEGLLS++ A V +IG G I + + +
Sbjct: 197 ----------------------GIAFDPEGLLSTLPAIVNVIIGYLAGRFIQNNGQNFET 234
Query: 340 MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ ++ L+ GL+ D V + +NK L++ S+ LT G
Sbjct: 235 VAKLMMFGFALVFAGLAWDLV-LPINKKLWTSSFVLLTCG 273
>gi|346224087|ref|ZP_08845229.1| hypothetical protein AtheD1_02868 [Anaerophaga thermohalophila DSM
12881]
Length = 369
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 157/349 (44%), Gaps = 81/349 (23%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ R ++LDV RG+T+ALMI V+ G + + HS W+G T D V PFFLF+VG
Sbjct: 1 MKKSERYLALDVLRGMTIALMITVNTPGSWQYVYAPLRHSSWHGCTPTDLVFPFFLFVVG 60
Query: 107 VSLALTYKNFPCKV--VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
VS+ ++ + + + + R L +F +G+FL + ++ D + +R MG
Sbjct: 61 VSMFFSFAKYGNTLNKASFNRLGRRTLLIFAIGLFLN-------SFPQWMTDYSSLRIMG 113
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQRIA+AY A+L LS+ RKY + +A++L +L +G P
Sbjct: 114 VLQRIALAYGFASLIV------------LSMKRKYIPYLGLAILLVYWGILAWFGGDDP- 160
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
+ +E + + P D ILG QHLY
Sbjct: 161 ----YSLEGNVTIP----------------------FDAAILGEQHLYTG---------- 184
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PFDPEGLLS++ A VT L+G G I + + ++ +
Sbjct: 185 ----------------FGIPFDPEGLLSTIPAVVTVLLGYLTGVFISN--TEKAKLPARL 226
Query: 345 ILSSCLIG-LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L L+ +G FV +NK L++ SY TAG + ++ A + F++
Sbjct: 227 ALYGVLVTIIGRLWGFV-FPINKPLWTSSYVLYTAGLAALVFALLIFII 274
>gi|301765942|ref|XP_002918389.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Ailuropoda melanoleuca]
Length = 851
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 109/399 (27%), Positives = 174/399 (43%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNS-------KQTRPQHQQRRLISLDVFRGLTVALMI 71
ISK+ ++ ++L L PS + S + P RL +D FRG+ + LM+
Sbjct: 412 ISKAINSRETDRLINSELGSPSRAGSIGGDAQLEAWHPPAASPRLRCVDTFRGIALILMV 471
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG HS WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 472 FVNYGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLSMTSVLQRGCSKFKLLGKIA 531
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +G+ I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 532 WRSFLLICIGVV--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELIFA 583
Query: 185 G--DGHVSSKLSLFRKYR-----GHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
+S+ F W+ L+L +++L L + L VP T
Sbjct: 584 KPVPESCASERGCFSLRDIIFSWPQWLFILMLESIWLGLTFFLPVPG------CPTGYLG 637
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG H+Y+ P + +
Sbjct: 638 P-------GGIGDWGKYPNCTGGAAGYIDRLLLGDDHIYQHPSSAVLYHTEV-------- 682
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+DPEG+L ++ + V +G+ G +++++KD +L C +GL
Sbjct: 683 ----------AYDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDILIRFTAWCCFLGL 732
Query: 354 -GLSLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
++L V + +NK L+S+SY T L++ A +LL
Sbjct: 733 ISVALTKVSENEGFIPINKNLWSISYVTTLSSFAFFILL 771
>gi|405978397|gb|EKC42791.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
[Crassostrea gigas]
Length = 549
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 90/300 (30%), Positives = 136/300 (45%), Gaps = 54/300 (18%)
Query: 7 INNDANDQLVIQISKSTSAPANEKLERDPLLPPSN---------------SNSKQTRPQH 51
++ N + I+ +AP N +++ L P N +NS Q H
Sbjct: 150 LDRKYNKSQSVNITDPENAPRNLRMKS--LRPTQNGFSDISEDSGTAHDRNNSPQQYSTH 207
Query: 52 QQR-RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
+R RL SLD FRGL++ +M+ V+ GG +H PWNG+T+AD V P+F+FI+G ++
Sbjct: 208 NKRERLKSLDTFRGLSLMIMVFVNYGGGGYWFFDHPPWNGITVADLVFPWFIFIMGTAMN 267
Query: 111 LTYKNF----PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG-VDIAQIRWMGV 165
+++ + K + RA+ LF +GI L N +G V++ IR GV
Sbjct: 268 YSFRGMMKRGTPRYRMLYKVLRRAILLFFIGIVL---------NTNWGPVNLKTIRIPGV 318
Query: 166 LQRIAIAYLVAALCEIWL-------KGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLY 218
LQR ++ YLV L E+ K G S L + W +AL + Y+ L +
Sbjct: 319 LQRFSLTYLVLGLFEVCFSRYDTPEKYQGRCWSSLRDILLFLPQWFLALGILAAYVCLTF 378
Query: 219 GLYV---PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
L + P Y P SS + +N T G A ID +LG H+Y KP
Sbjct: 379 LLPIGPCPT-GYIGPGGLHDSSKY-YNCTAG----------AAAYIDIMVLGKNHIYGKP 426
>gi|431796483|ref|YP_007223387.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
gi|430787248|gb|AGA77377.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
Length = 369
Score = 100 bits (249), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 102/349 (29%), Positives = 162/349 (46%), Gaps = 84/349 (24%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++R+++LDVFRG+T+ MILV++ G + + H+ W+G T D + PFFLFIVGV++
Sbjct: 3 KQRILALDVFRGITIFAMILVNNPGSWSHVYAPLLHAKWHGCTPTDLIFPFFLFIVGVAI 62
Query: 110 ALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
L+ K K RK+++RAL L L + + Y D+A +R+ GV
Sbjct: 63 ELSLGGQLKKGTPKGFLLRKSLIRALKLIGL--------GLLLTAIPY-FDLAHLRFPGV 113
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQRI + Y ++ + + + S K ALV ++ LL+ Y W
Sbjct: 114 LQRIGLVYFISTVMYL------YWSPK-------------ALVFSSGILLIGY------W 148
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
+ P + P N ID+++L H+ +S+TK
Sbjct: 149 LCMTFIPVPGIGP----------ANLEPGTNLAAWIDQQVL-TGHM-----WSQTKT--- 189
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
+DPEGL S++ A VTCL+G+ G ++ H+ R+ W I
Sbjct: 190 -------------------WDPEGLFSTLPAIVTCLLGVACGKILTGNSSHKARLTKWGI 230
Query: 346 LSSCLI--GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L+ GL SL F LNKAL++ S+ TAG + + LA Y+++
Sbjct: 231 AGVTLVFGGLAWSLFF---PLNKALWTSSFVLYTAGWAFLGLAACYWIL 276
>gi|380693406|ref|ZP_09858265.1| hypothetical protein BfaeM_05407 [Bacteroides faecis MAJ27]
Length = 371
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 150/340 (44%), Gaps = 77/340 (22%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+RL++LDV RG+T+A MILV+ G + H+ W GLT D V PFF+FI+G+S
Sbjct: 5 SNKRLLALDVMRGITIAGMILVNTPGSWQHTYAPLKHAEWIGLTPTDLVFPFFMFIMGIS 64
Query: 109 --LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
++L +F + A K + R + +FL+GI + + + +D QIR +GV+
Sbjct: 65 TYISLRKYDFTFSIPAGLKILKRTVIIFLIGIGISWLSILCFQHDPFPID--QIRILGVM 122
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QR+A+ Y + AL + +K KY + + L++ +L + YV D
Sbjct: 123 QRLALGYGITALAALLIK------------HKYIPYLITVLLIGYFMILAVGNGYVYD-- 168
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
ET N + ++DR +LG H+Y I
Sbjct: 169 -----ET----------------------NVLSIVDRAVLGQAHIYGGAI---------- 191
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGLLS++ A +IG G L++ KD +++ ++
Sbjct: 192 ------------------LDPEGLLSTISAVAHVMIGFCAGKLLMEVKDIHEKLERLFLI 233
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+ L G L + G + K ++S S+ +T G LA
Sbjct: 234 GTILTFAGFLLSY-GSPICKKIWSPSFVLITCGMGSSFLA 272
>gi|374312698|ref|YP_005059128.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358754708|gb|AEU38098.1| protein of unknown function DUF1624 [Granulicella mallensis
MP5ACTX8]
Length = 383
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 103/357 (28%), Positives = 154/357 (43%), Gaps = 83/357 (23%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
TRP RL+S+DV RGLTVA MILV++ G A+NHS WNG T D V P FL
Sbjct: 11 NTRPSS---RLLSIDVLRGLTVAFMILVNNNGNNDLAYRALNHSLWNGFTPTDLVFPTFL 67
Query: 103 FIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRW 162
FI+G+S+ L++ K + + F L IF F +N Y + +R
Sbjct: 68 FIMGISMVLSFSAHRAKATSNTVMLTSIGRRFALLIF----FGLVVNGFPY-FHLDTLRI 122
Query: 163 MGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYV 222
GVLQRIA+ YL+AAL + L D + F G+WV+ L + + G +
Sbjct: 123 YGVLQRIAVCYLLAALLQ--LVTDRIAPRVVLFFAAVIGYWVL------LRFVPVPGHGI 174
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
P +FP+ N V +DR H++ ++ +T+
Sbjct: 175 PG--RDFPLLDHD-------------------INLVAWLDR------HIFPHRLFEKTR- 206
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
DPEGLLS + A + ++GL G I + +++
Sbjct: 207 -----------------------DPEGLLSDIPAFASTILGLLAGAWIKQARPAGQKLMG 243
Query: 343 WIILSSCLIGLGLSLDFVGM------HLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
L G G++L G+ +NK L++ SY G S +LLA Y+++
Sbjct: 244 -------LFGAGIALAVAGLLWSQSFPINKKLWTSSYVLYAGGLSILLLAVAYYVIE 293
>gi|189502592|ref|YP_001958309.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
5a2]
gi|189498033|gb|ACE06580.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
5a2]
Length = 380
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 104/357 (29%), Positives = 162/357 (45%), Gaps = 86/357 (24%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
QH ++RL+SLD FRGLTVA MIL ++ G I + H+ W+G T D + PFFLFIVG
Sbjct: 2 QHLKQRLVSLDFFRGLTVAGMILANNPGSWGHIYAPLKHAEWHGCTPTDLIFPFFLFIVG 61
Query: 107 VSLAL---TYKNFP-CKVVATRKAILRALNLFLLGIFLQ---GGFFHGINNLKYGVDIAQ 159
VS+A + K P K++ R L LF LGIFL F I K
Sbjct: 62 VSIAFAIGSKKELPETHSQLILKSVRRMLTLFCLGIFLALYPKIFTSPIEAFK------T 115
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL--YLLLL 217
+R GVLQR AI Y ++ + +F K+ ++ ++L L Y +L+
Sbjct: 116 VRIPGVLQRTAIVYFISTI----------------IFLKFTPRTILKIMLGLLVSYWILM 159
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
+ VP Y + ++ + W IDR +L HL++
Sbjct: 160 TFVPVPGIGYANLEQETNLAAW---------------------IDRNLLTEPHLWKA--- 195
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
RT +DPEG+L ++ A T L G+ G +++ KD
Sbjct: 196 VRT------------------------WDPEGILGTISAIATGLSGILAG-ILLQRKDQS 230
Query: 338 DRMLNWIILSSCLIGL--GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
D + SS + + GL +F+ +NK+L++ S+ T+G + ++LA Y+++
Sbjct: 231 DTEKIARLFSSGTLAVITGLIWNFI-FPINKSLWTSSFVLYTSGLAYIILALCYWII 286
>gi|301307595|ref|ZP_07213552.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423337400|ref|ZP_17315144.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
CL09T03C24]
gi|300834269|gb|EFK64882.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409237229|gb|EKN30029.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
CL09T03C24]
Length = 368
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 92/342 (26%), Positives = 151/342 (44%), Gaps = 72/342 (21%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGV 107
Q RL+SLDV RG+T+A MILV++ G + + H+ WNGLT D V PFF+FI+GV
Sbjct: 3 QQSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGV 62
Query: 108 SLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
S+ + + NF + K + R + +FL+G+ L G D +R +GV
Sbjct: 63 SMFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN---LFGHVCYNGFTDFQNLRILGV 119
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
+QR+A+AY +L L++ KY V A +L + LL
Sbjct: 120 MQRLALAYGFGSLI------------GLAINHKYILQ-VAAGILIFYWALL--------- 157
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
+ +E S S + ++D+ + G H+Y
Sbjct: 158 GFTHSMEMSEDS-------------------IIAIVDKALFGTSHMYHD----------- 187
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML-NWI 344
D + FDPEGLLS + + L+G + G +I K + + ++ N
Sbjct: 188 ----------DMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVIQDCKKNNELIIRNIF 237
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
I + ++ G L + G +NK ++S ++ +T G + + LA
Sbjct: 238 IFGTIILFAGFLLSY-GCPINKKIWSSTFVLVTCGFASLFLA 278
>gi|371776142|ref|ZP_09482464.1| hypothetical protein AnHS1_01923 [Anaerophaga sp. HS1]
Length = 369
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 101/348 (29%), Positives = 154/348 (44%), Gaps = 79/348 (22%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ R ++LDV RG+T+ALMI V++ G I + HS W+G T D V PFFLF+VG
Sbjct: 1 MKKTERYLALDVLRGMTIALMITVNNPGSWKYIYAPLRHSSWHGCTPTDLVFPFFLFVVG 60
Query: 107 VSLALTYKNFPCKV--VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
VS+ ++ + + + ++ R L +F +G+FL + ++ D + +R MG
Sbjct: 61 VSMFFSFSKYGNTLNKESFKRLGRRTLLIFAIGLFLN-------SFPQWDRDYSTLRIMG 113
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQRIA+AY +L LS+ RKY + L +LLLY +
Sbjct: 114 VLQRIALAYGFGSLIV------------LSVPRKY-------IPLLGFSILLLYWGILGW 154
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
+ E P + NVT D ILG QHLYR
Sbjct: 155 FGGEDPYSLNG------NVTIP--------------FDAAILGEQHLYRG---------- 184
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PFDPEGLLS++ A VT L+G G I + ++ +
Sbjct: 185 ----------------FGIPFDPEGLLSTIPAVVTVLLGYLTGVFIK--TTEKVKIPGQL 226
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L ++ + L + +NK L++ SY TAG + + A + F++
Sbjct: 227 ALYGLIVAIAGRLWGLVFPINKPLWTGSYVLYTAGLAAMAFALLIFII 274
>gi|323343595|ref|ZP_08083822.1| transmembrane protein [Prevotella oralis ATCC 33269]
gi|323095414|gb|EFZ37988.1| transmembrane protein [Prevotella oralis ATCC 33269]
Length = 384
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 93/348 (26%), Positives = 158/348 (45%), Gaps = 88/348 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGV 107
Q++R++++D+ RG+T+A MILV++ G + + H+ W GLT D V PFF+FI+G+
Sbjct: 5 TQQKKRILAVDILRGMTIAGMILVNNPGTDTVYAPLEHAEWIGLTPTDLVFPFFMFIMGI 64
Query: 108 SLALTYKNFPCK--VVATRKAILRALNLFLLGI-------FLQGGFFHGINNLKYGVDI- 157
+ L+ K F K V RK RAL L+L+G+ F +G ++++ +G +
Sbjct: 65 TTYLSLKKFEFKWSVECGRKIAKRALLLWLIGLAISWLFMFCRGLLDPDMSSMPFGSRLW 124
Query: 158 ------AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
Q+R +GVL R+ I Y +AA+ LS+ KY W++A++
Sbjct: 125 ASVNTFDQLRLLGVLPRLGICYGLAAVVA------------LSVKHKYI-PWLIAIIFIG 171
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
Y+LL TC G A N + ++D +LG H+
Sbjct: 172 YYILL--------------------------ETC--NGYAHDASNILAIVDDAVLGHGHV 203
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
YR W ++P DPEGLLS+ A LIG G ++
Sbjct: 204 YR--------------------------W-ESP-DPEGLLSTFPALAHVLIGFCVGRTVM 235
Query: 332 HFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
++ D++ ++ + L G L + ++K L++ ++ +T G
Sbjct: 236 EMQNLNDKIERLFLIGALLTFAGFLLSY-ACPISKKLWTPTFAMVTCG 282
>gi|410448043|ref|ZP_11302130.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
gi|410018124|gb|EKO80169.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
gi|456875246|gb|EMF90470.1| putative membrane protein [Leptospira santarosai str. ST188]
Length = 363
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 112/358 (31%), Positives = 162/358 (45%), Gaps = 99/358 (27%)
Query: 57 ISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS--LAL 111
+SLD+FRG+TV MILV++ G I + H+ WNG T D V PFFLF VG S ++L
Sbjct: 1 MSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISL 60
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
KN + +R+ NL LLG+FL F G A++R GVLQRI
Sbjct: 61 YSKNGINRSDIWIGICIRSANLILLGLFLN---FFG------EWSFAELRIPGVLQRIGF 111
Query: 172 AY-LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
Y +VA+LC + K+ +F +V ++L ++L Q P
Sbjct: 112 VYWVVASLCLV------FPGKKILVF-------LVPILLIHTWILT---------QIALP 149
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
E+ S G A IDR I G +HL+R +S+T
Sbjct: 150 GESVVSL------------EQGKDIGA--WIDRTIFGEKHLWR---FSKT---------- 182
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
+DPEG LS V + VT L G+ G I+ ++ R+++L
Sbjct: 183 --------------WDPEGFLSGVASVVTTLFGVLCG-FILFLRERRNKIL--------- 218
Query: 351 IGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLK 402
GLG+ FVG + +NK+L++ SY+ TAG S I+F Y+SS ++ K
Sbjct: 219 -GLGILFSFVGLLWDRSLPMNKSLWTGSYSVYTAGLS---FLSIWFF-EYLSSLIISK 271
>gi|418752318|ref|ZP_13308585.1| putative membrane protein [Leptospira santarosai str. MOR084]
gi|409967313|gb|EKO35143.1| putative membrane protein [Leptospira santarosai str. MOR084]
Length = 363
Score = 99.8 bits (247), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 112/358 (31%), Positives = 161/358 (44%), Gaps = 99/358 (27%)
Query: 57 ISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS--LAL 111
+SLD+FRG+TV MILV++ G I + H+ WNG T D V PFFLF VG S ++L
Sbjct: 1 MSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISL 60
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
KN + +R+ NL LLG+FL F G A++R GVLQRI
Sbjct: 61 YSKNGINRSDIWIGICIRSANLILLGLFLN---FFG------EWSFAELRIPGVLQRIGF 111
Query: 172 AY-LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
Y +VA+LC + K+ +F V ++L ++L Q P
Sbjct: 112 VYWVVASLCLV------FSGKKILVFS-------VPILLIHTWILT---------QIALP 149
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
E+ S G A IDR I G +HL+R +S+T
Sbjct: 150 GESVVSL------------EQGKDIGA--WIDRTIFGEKHLWR---FSKT---------- 182
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
+DPEG LS V + VT L G+ G I+ ++ R+++L
Sbjct: 183 --------------WDPEGFLSGVASVVTTLFGVLCG-FILFLRERRNKIL--------- 218
Query: 351 IGLGLSLDFVGMH------LNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLK 402
GLG+ FVG+ +NK+L++ SY+ TAG S I+F Y+SS ++ K
Sbjct: 219 -GLGILFSFVGLLWDLSLPMNKSLWTGSYSVYTAGLS---FLSIWFF-EYLSSLIISK 271
>gi|255038072|ref|YP_003088693.1| hypothetical protein Dfer_4326 [Dyadobacter fermentans DSM 18053]
gi|254950828|gb|ACT95528.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 368
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 98/355 (27%), Positives = 154/355 (43%), Gaps = 88/355 (24%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFI 104
P RL+SLD RG T+A MI+V+ G + P + HS WNGLT D + P FLFI
Sbjct: 3 NPSVPSSRLLSLDAMRGFTIAAMIMVNFPGHEDYVFPTLRHSKWNGLTFTDLIAPTFLFI 62
Query: 105 VGVSLALTY-----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQ 159
VGVS+ L Y N P K RK ++R+L +F +G+FL N+ + +
Sbjct: 63 VGVSITLAYSKKRLSNAP-KSGLYRKIVIRSLKIFAVGMFL---------NMLPDFNFSD 112
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
+R+ G L RIAI +LV A+ L L ++ +A+ + LY L L G
Sbjct: 113 LRYTGTLHRIAIVFLVCAI--------------LFLNTSWKQQLGIAVGILVLYWLALTG 158
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
+ +P I V P N +D++ L
Sbjct: 159 I---------------PTPGIGKVML------EPGVNLAAWVDQQYLP------------ 185
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV--HFKDHR 337
G M W Q +DPEG+LS+ A T + G+ G L++ + +
Sbjct: 186 -----------GKM------W-QGNWDPEGILSTFPAIATTITGILAGRLMLLPFSPNEK 227
Query: 338 DRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L ++ +G +L F +N+ L++ S+ +T+G + +L +YF++
Sbjct: 228 SNFLLTAGFATAALGYFWNLIF---PVNENLWTSSFVLVTSGFASMLFGALYFLI 279
>gi|373953861|ref|ZP_09613821.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
gi|373890461|gb|EHQ26358.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
Length = 379
Score = 99.4 bits (246), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 110/374 (29%), Positives = 168/374 (44%), Gaps = 86/374 (22%)
Query: 43 NSKQTRPQHQ-QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVM 98
+++ PQ + +RL+SLDVFRGLTVA MILV++ G I + HS WNG T D +
Sbjct: 2 STETLNPQVKLNQRLLSLDVFRGLTVACMILVNNPGDWAHIYSPLEHSAWNGCTPTDLIF 61
Query: 99 PFFLFIVGVSLAL---TYKNFPCKVVATRKAIL-RALNLFLLGIFLQGGFFHGINNLKYG 154
PFFLFIVGVS+ T K P + IL R+L LF L +FL +L
Sbjct: 62 PFFLFIVGVSIVYSMGTKKTDPAQHGKLVLTILKRSLILFCLALFL---------SLYPK 112
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
+ +R GVLQRIA+ + + + I+LK + ++L LF W+ +V Y
Sbjct: 113 FNFHTLRIPGVLQRIAVVFGICGI--IFLKTER--KTQLILF------WLFLIV----YY 158
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRK 274
LL+ + VP Y + P N IDR ++G HL+++
Sbjct: 159 LLMTLVPVPGVGY---------------------ANLQPETNLGAWIDRTVIGNVHLWKE 197
Query: 275 PIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
+ +W DPEG+L ++ AT T L G+ G +
Sbjct: 198 SV----------------------TW-----DPEGILGTMPATSTGLFGILVGTWLKRKD 230
Query: 335 DHRDRMLNWII---LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
+ W+ + + ++GL L F +NKAL++ S+ L AG L +++
Sbjct: 231 VDESTKVAWLFCTGIGAVILGLLWDLFF---PINKALWTSSF-VLYAGGLATLGLTLFYW 286
Query: 392 VRYISSHLMLKKPF 405
+ + + KPF
Sbjct: 287 IIDVQGYKKFTKPF 300
>gi|393786264|ref|ZP_10374400.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
CL02T12C05]
gi|392659893|gb|EIY53510.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
CL02T12C05]
Length = 387
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 97/357 (27%), Positives = 156/357 (43%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ +RL++LD+ RG+T+A MI+V++ G + + H+ WNGLT D V PFF+FI+G
Sbjct: 3 KTTSKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAKWNGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDIA---- 158
+S ++L NF A K + R + +F LGI F N+L G +I+
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEEISFLSR 121
Query: 159 ---------QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
IR +GV+QR+A+ Y A+ + +K KY + +V L L
Sbjct: 122 LGQSVWTFDHIRILGVMQRLALCYGATAIIALTMK------------HKYIPYLIVTL-L 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
++LL+ G G N + ++DR +LG
Sbjct: 169 AGYFILLITG----------------------------NGFEYNDTNILSVVDRAVLGEA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEGLLS++ A LIG G L
Sbjct: 201 HMYK---------------DNG-------------IDPEGLLSTIPAIAHVLIGFCVGKL 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ KD +++ ++ + L LG L + G +NK ++S ++ +T G LA
Sbjct: 233 LMEVKDINEKLERLFLIGTILTFLGFLLSY-GCPINKKIWSPTFAIVTCGLGSSFLA 288
>gi|66808259|ref|XP_637852.1| transmembrane protein [Dictyostelium discoideum AX4]
gi|60466271|gb|EAL64333.1| transmembrane protein [Dictyostelium discoideum AX4]
Length = 675
Score = 99.0 bits (245), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 58/134 (43%), Positives = 83/134 (61%), Gaps = 15/134 (11%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGV 107
R ++ RL SLDVFRG ++ +MI V+ GG NHS WNGLT+AD V P+F+FI+G+
Sbjct: 199 RENRKKDRLRSLDVFRGFSITIMIFVNYGGGGYWFFNHSLWNGLTVADLVFPWFVFIMGI 258
Query: 108 SLALTY----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWM 163
++ L++ K K + +K + R++ LF LG+F INN GVD+ Q R +
Sbjct: 259 AMPLSFHAMEKRGTPKRIIFQKLLRRSIILFALGLF--------INN---GVDLQQWRIL 307
Query: 164 GVLQRIAIAYLVAA 177
GVLQR +I+YLV
Sbjct: 308 GVLQRFSISYLVVG 321
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 52/224 (23%), Positives = 94/224 (41%), Gaps = 51/224 (22%)
Query: 183 LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFN 242
++ + +S S Y WV AL++ + + LL++ + VP
Sbjct: 416 IQHETVISKYFSDIAPYWIQWVFALIIFSGWFLLMFLVPVP------------------G 457
Query: 243 VTCGVRGSTGPA-------CN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
G G+ G A C A +ID KI H+++ P + +P Y
Sbjct: 458 CPTGYLGAGGLADQGRYQHCTGGAARLIDLKIFTEAHIFQNPTCLEVYK----TPSY--- 510
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
DPEG + + + C IG+ G +I+ +K +R R++ W++ S L G+
Sbjct: 511 ------------DPEGTVGYLTSIFLCFIGVQAGRIILTYKSNRSRLIRWMVWSVVLCGI 558
Query: 354 -----GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
GL+ + + +NK L+S S+ L AG +L ++ ++
Sbjct: 559 AAGLCGLTQNQGWLPVNKNLWSPSFILLMAGFGFFVLTVMFILI 602
>gi|321463338|gb|EFX74354.1| hypothetical protein DAPPUDRAFT_129175 [Daphnia pulex]
Length = 409
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 93/363 (25%), Positives = 165/363 (45%), Gaps = 58/363 (15%)
Query: 63 RGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVA 122
R LT+ MI+V+ GG HSPWNG+T+AD + P F++I+G S L+ + + ++
Sbjct: 29 RRLTIVFMIIVNYGGGGYWFFEHSPWNGITIADVIFPCFVWILGASCVLSLNSQLRRALS 88
Query: 123 TRK----AILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
++ + R++ + ++G+ L N+L +I R GVLQR++ YL+ AL
Sbjct: 89 KQRILYSTVRRSVAMLVIGLVL--------NSLSNN-NIKTFRIPGVLQRMSFVYLIVAL 139
Query: 179 CEIWLKG-DGHVSSKLSLFRKYRG------HWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
E L G D + + + F R W++ V + LL+ + L VPD +
Sbjct: 140 IE--LTGFDPEDNQRYAWFAPIRDIVCSWRQWIIVTVFVSTQLLITFLLPVPDCPLGYTG 197
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
++ R TG A ++D + G H+Y++P ++
Sbjct: 198 AGGLEKNGLY------RNCTGGAAR---LVDVSLFGNDHIYQRPTPRAIYDATL------ 242
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
FDPEG L + + +G +++ F ++ R++ W +L + +
Sbjct: 243 ------------AFDPEGALGGLTCVLCAYLGAEAAKVLLVFPANKQRIVRW-MLWALVT 289
Query: 352 GL--GLSLDFV----GMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
GL G+ DF + +NK L+SLSY +T+ + +LL +Y + +S PF
Sbjct: 290 GLSGGILCDFKIDDGPIPINKNLWSLSYVLVTSSIAFILLTILYVFIDVLS--WWSGAPF 347
Query: 406 DYS 408
Y+
Sbjct: 348 RYA 350
>gi|338210835|ref|YP_004654884.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336304650|gb|AEI47752.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 363
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 150/349 (42%), Gaps = 88/349 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q RL+SLDVFRG+TVA MILV++ G + + H+ WNG T D + PFFLFIVGVS
Sbjct: 1 MQNRLLSLDVFRGMTVAAMILVNNPGDWDHVYAPLLHAHWNGCTPTDLIFPFFLFIVGVS 60
Query: 109 LALTY-KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+A KN P + +++ + L ++ + D A +R GVLQ
Sbjct: 61 VAFAMGKNPPSLLKIIKRSAILFGLGLFLNLYPK-------------FDFANVRIPGVLQ 107
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIA+ YLV +L I RK + V+TT+ LL+ Y
Sbjct: 108 RIALVYLVCSLIFI------------KTTRKTQ-------VITTVLLLIAY--------- 139
Query: 228 EFPVETSSSSPWIFNVTCGVRG----STGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
W+ V G + P N +DR +L HL++
Sbjct: 140 -----------WLLMTLVPVPGVGYANLEPTTNLGAWVDRGLLTTAHLWK---------- 178
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
A W DPEG+ S++ A T L+G+ G + K ++M W
Sbjct: 179 ------------SAKVW-----DPEGMFSTIPAIGTGLLGVLTGQWLRSEKPVAEKM-AW 220
Query: 344 IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ LS + LG + +NK+L++ S+ G + V LA Y+++
Sbjct: 221 LFLSGNALILGGLIWNEFFPINKSLWTSSFVLYAGGWAMVGLAACYWLI 269
>gi|125981811|ref|XP_001354909.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
gi|54643221|gb|EAL31965.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
Length = 574
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 93/301 (30%), Positives = 128/301 (42%), Gaps = 66/301 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
Q++RL SLD FRGL++ LMI V+ GG I H+ WNGL LAD V P FL+I+GV + L
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPSFLWIMGVCIPL 240
Query: 112 TYK----NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ K K + + R++ LF +G+ L N G + Q+R MGVLQ
Sbjct: 241 SVKAQLSRGNSKARICLRILWRSIKLFAIGLCL---------NSMSGPSLEQLRLMGVLQ 291
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRK------YRGHWVVALVLTTLYLLLLYGLY 221
R IA+LV + V + + R + G V L L YL L +GL
Sbjct: 292 RFGIAFLVVGILHTLCSRREQVQPQRAWHRAIYDVCLFSGELAVLLALIAAYLGLTFGLP 351
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTG-------PAC--NAVGMIDRKILGIQHLY 272
VP G G G P C A G +D ++LG H+Y
Sbjct: 352 VP------------------GCPRGYLGPGGKHDLAAHPNCIGGAAGYVDLQVLGNAHIY 393
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG-HLIV 331
+ P S FDPEG+ +++ V L+G G L+V
Sbjct: 394 QHPTAKYVYDSS-------------------AFDPEGVFGCLLSVVQVLLGAFAGLTLLV 434
Query: 332 H 332
H
Sbjct: 435 H 435
>gi|224496100|ref|NP_001139059.1| uncharacterized protein LOC565246 precursor [Danio rerio]
Length = 582
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 100/392 (25%), Positives = 159/392 (40%), Gaps = 108/392 (27%)
Query: 38 PPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFV 97
P + +++P+ RL SLD FRG ++ +M+ V+ GG H+PWNGLT+AD V
Sbjct: 188 PNTEQQIDESKPK--SSRLKSLDTFRGFSLTVMVFVNYGGGGYWFFQHAPWNGLTVADLV 245
Query: 98 MPFFLFIVGVSLALTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHGINNLKY 153
MP+F+FI+G S+ L + + K V+ RK R + L L+G
Sbjct: 246 MPWFVFIIGTSVMLAFTSMHRKGVSLLQLLRKVTWRTVVLMLIGFCF------------- 292
Query: 154 GVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK--------------- 198
M R I L A +S L LFR
Sbjct: 293 ---------MNYSPRDGILVLAA-------DTRSSPASGLHLFRSGTDHNWWNPIQDVIL 336
Query: 199 YRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG------ 252
Y W+ ++L TL+L L + L VP N G G+ G
Sbjct: 337 YWPEWLFIVLLETLWLCLTFLLPVP------------------NCPTGYLGAGGVGDAGL 378
Query: 253 -PACN--AVGMIDRKILGIQHLYR---KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFD 306
P C A ID+ G + K +Y T+ PFD
Sbjct: 379 YPNCTGGAAAHIDKWFFGDNMFWYPTCKVLYRTTE----------------------PFD 416
Query: 307 PEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL------DFV 360
PEG+L ++ + V +G+ G +++ F+ +L ++ + ++G+ ++ D
Sbjct: 417 PEGVLGTINSIVMGFLGMQAGKILLFFRQMNKGILARFLVWALILGISAAILSKCTRDEG 476
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ +NK L+SLS+ A S +LLA +YF++
Sbjct: 477 FIPVNKNLWSLSFVTCMACMSFLLLAVMYFII 508
>gi|255534024|ref|YP_003094396.1| hypothetical protein Phep_4143 [Pedobacter heparinus DSM 2366]
gi|255347008|gb|ACU06334.1| conserved hypothetical protein [Pedobacter heparinus DSM 2366]
Length = 384
Score = 97.8 bits (242), Expect = 8e-18, Method: Compositional matrix adjust.
Identities = 98/356 (27%), Positives = 160/356 (44%), Gaps = 69/356 (19%)
Query: 55 RLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL+SLD FRG TVA MILV+ D G I + H+ WNG T D + PFFLFIVGVS+A
Sbjct: 12 RLLSLDFFRGATVAAMILVNNPGDWGHIYAPLEHAEWNGCTPTDLIFPFFLFIVGVSIAY 71
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD-IAQIRWMGVLQRIA 170
+ K I++AL + L ++ Q+R GVLQRIA
Sbjct: 72 AMGGKKADPSSHGKTIVKALKRASILFGLGLFLSLFPKVFTAPLEAFQQVRIPGVLQRIA 131
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ +L++A+ I+LK +++ ++F+ + L + +Y L+ + VP Y
Sbjct: 132 VVFLISAI--IFLK-----NTEKNIFK-------ILLAILAVYWALMTFIPVPGVGYANL 177
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
+ ++ W +DR IL HL++
Sbjct: 178 EKETNLGAW---------------------LDRSILTEAHLWK----------------- 199
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC- 349
A +W DPEG+LS++ A T L G+ G + ++W+ + C
Sbjct: 200 -----SAKTW-----DPEGILSTLPAIATGLFGILVGVYLKRKDVDAATKISWLFCTGCA 249
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
+ LGL D + +NK+L++ S+ T G + ++L+ Y+++ + + KPF
Sbjct: 250 AVALGLLWD-LQFPINKSLWTSSFVLYTGGLATMILSLCYWIID-VQQYNRFTKPF 303
>gi|330805524|ref|XP_003290731.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
gi|325079117|gb|EGC32733.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
Length = 644
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 57/140 (40%), Positives = 85/140 (60%), Gaps = 15/140 (10%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFL 102
+S + ++ RL SLDVFRG ++ +MI V+ GG NHS WNGLT+AD V P+F+
Sbjct: 195 SSVERENNKKKDRLKSLDVFRGFSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFV 254
Query: 103 FIVGVSLALTY----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
FI+G+++ L++ + K+V +K + R+ LF LG+F INN GV++
Sbjct: 255 FIMGIAMPLSFNAMERRGTTKLVIVQKLVRRSAILFALGLF--------INN---GVNLQ 303
Query: 159 QIRWMGVLQRIAIAYLVAAL 178
R +GVLQR AI+YL+ L
Sbjct: 304 HWRILGVLQRFAISYLIVGL 323
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 40/147 (27%), Positives = 67/147 (45%), Gaps = 27/147 (18%)
Query: 253 PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGL 310
P C A +ID KI H+++ P + S +DPEG
Sbjct: 444 PNCTGGAARLIDMKIFTNNHIFQSPTCQDIYKTS-------------------SYDPEGT 484
Query: 311 LSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-----GLSLDFVGMHLN 365
+ + + C IG+ G +I+ +K +R R++ W++ S+ G+ GLS + + +N
Sbjct: 485 VGYLTSIFICFIGVQAGRIILIYKSNRSRLIRWMVWSAVCCGIAAGLCGLSQNDGVIPIN 544
Query: 366 KALYSLSYTCLTAGASGVLLAGIYFMV 392
K L+S S+ L AG G + I F+V
Sbjct: 545 KNLWSPSFVFLMAGF-GFFVLTIMFIV 570
>gi|260788586|ref|XP_002589330.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
gi|229274507|gb|EEN45341.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
Length = 382
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 98/353 (27%), Positives = 160/353 (45%), Gaps = 59/353 (16%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY 113
RRL SLD FRG+ + +M V+ GG ++HS WNG+T+AD V P+F++I+G S AL++
Sbjct: 1 RRLKSLDTFRGMCLCIMAFVNYGGGGYWFLDHSVWNGITVADLVFPWFMWIMGTSTALSF 60
Query: 114 KNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
+ K K + R + LFLLG+F+ +N D A IR GVLQR
Sbjct: 61 RGLQRKATPKLTIFGKIVRRTITLFLLGLFI-------VNAPD---DWATIRIPGVLQRF 110
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRK---YRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
A++Y + + + + +R Y W+ L L ++ L + + VP
Sbjct: 111 AVSYFAVSTMML-------LHMETEWYRDLVPYWKQWLFVLCLLAVHTCLTFLMPVPGCP 163
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY--RKPIYSRTKQCS 284
+ S N T G A G ID +L H+Y P Q
Sbjct: 164 TGYLGAGGLSDLDHTNCTGG----------AAGYIDNWLLTQDHIYGDETPKVRILYQIL 213
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+N +DPEG+L S+ + +GL G +++ ++DH R++ W+
Sbjct: 214 VN------------------YDPEGVLGSLTSIFMTFLGLQAGKILLSYEDHGSRLVRWL 255
Query: 345 ILSSCLIGLGLSL-----DFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ L L + L + + +NK L+S+++ A + +LL+ YF+V
Sbjct: 256 LWGIGLGLLAILLCEGRQNGGWVPINKNLWSVTFVLSLASMAFILLSVYYFLV 308
>gi|294674520|ref|YP_003575136.1| hypothetical protein PRU_1851 [Prevotella ruminicola 23]
gi|294472648|gb|ADE82037.1| putative membrane protein [Prevotella ruminicola 23]
Length = 357
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 153/347 (44%), Gaps = 81/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDD-VGGILPAINHSPWNGLTLADFVMPFFLFIVGVS-- 108
+ +RL+SLD+ RG+TVA MILV++ G + HS WNG+T D V PFFLFI+G+S
Sbjct: 2 ESKRLLSLDILRGITVAGMILVNNGWGESFEMLRHSKWNGMTPCDLVFPFFLFIMGISCY 61
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINN--LKYGVDIAQIRWMGVL 166
L+L F R+ + R + LF +G+F+ F H I L +G +R V+
Sbjct: 62 LSLVKSEFKPTPQVIRRIVKRTVLLFAIGLFIN-WFDHAIEGDLLCFG----HLRIWAVM 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRIA+ Y + +L ++ KY + L+ +L+L Y D
Sbjct: 117 QRIALCYGIVSLFALFCN------------HKYTLSVIGGLLAIYTAILILGNGYAED-- 162
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
N + D K+ G H+Y K
Sbjct: 163 --------------------------ANVNVLAQADLKLFGYDHIYHK------------ 184
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+P DPEGL+ ++ + L+G + G LI + +++ ++
Sbjct: 185 ----------------SPVDPEGLMGTISSVAHVLLGFYCGMLIRKRETVEQKVIALFVV 228
Query: 347 SS-CLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ C+IG G L + G+ LNK ++S SY +T G + ++ A + +++
Sbjct: 229 GAVCVIG-GYLLSY-GLPLNKRIWSPSYVLMTCGLASLMQALLMYVI 273
>gi|421093382|ref|ZP_15554106.1| putative membrane protein [Leptospira borgpetersenii str.
200801926]
gi|410363365|gb|EKP14394.1| putative membrane protein [Leptospira borgpetersenii str.
200801926]
Length = 383
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 162/370 (43%), Gaps = 94/370 (25%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TV MILV++ G + + H+ WNG T D V PFFL
Sbjct: 2 EKKSTQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFL 61
Query: 103 FIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
F VG S ++L KN ++ +R ++L LLG+FL F G +++
Sbjct: 62 FAVGASIPISLYSKNGINRIRVWIGICIRGISLILLGLFLN---FFG------EWTFSEL 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R GVLQRI Y V A +LF + G V+ ++ L
Sbjct: 113 RIPGVLQRIGFVYWVVA----------------TLFLIFPGKKVLVFLIPI--------L 148
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
V W + T + P V G A IDR+I G +HL++ +S+T
Sbjct: 149 LVHTW-----ILTHIAPP----GESMVSLEQGKDIGA--WIDRRIFGEKHLWK---FSKT 194
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDR 339
+DPEG LS + + T L G+ G ++ + ++R
Sbjct: 195 ------------------------WDPEGFLSGIASIATSLFGVICGFILFRREGRGKNR 230
Query: 340 MLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+L+ + GLG FVG + +NK+L++ SY T G S L G +
Sbjct: 231 VLS-------IFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLS-FLCIGFF---E 279
Query: 394 YISSHLMLKK 403
Y+ S ++LKK
Sbjct: 280 YLDSLILLKK 289
>gi|344281343|ref|XP_003412439.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Loxodonta africana]
Length = 782
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 112/403 (27%), Positives = 179/403 (44%), Gaps = 69/403 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNS-------KQTRPQHQQRRLISLDVFRGLTVALMI 71
ISK+ ++ ++L L PS ++ + RP RL +D FRGL + +M+
Sbjct: 333 ISKAINSRETDRLINSELGSPSRADPLGGDIQLEAWRPSAPPSRLRCVDTFRGLALIIMV 392
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIV-GVSLALTYKNFPCKVVATRKAIL-R 129
V+ GG H+ WNGLT+AD V P FL I+ G L T P ++ + +IL R
Sbjct: 393 FVNYGGGKYWYFKHASWNGLTVADLVFPCFLEILFGEDLLCTRD--PLEIFLSMTSILQR 450
Query: 130 ALNLF-LLGIFLQGGFFH-----GINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCE 180
+ F LLG F I N Y + ++R GVLQR+ + Y V A+ E
Sbjct: 451 GCSKFKLLGKIAWRSFLLICIGVVIVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLE 510
Query: 181 IWLKG--DGHVSSKLSLF--RKYRG---HWVVALVLTTLYLLLLYGLYVPDWQYEFPVET 233
+ + +S+ S F R W+ L L +++L L + L VP T
Sbjct: 511 LLFAKPVPENCASERSCFSLRDLTASWPQWLFILTLESIWLTLTFFLPVPG------CPT 564
Query: 234 SSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
P G G G P C A G +DR +LG +HLY+ P + +
Sbjct: 565 GYLGP-------GGIGDWGKYPNCTGGAAGYMDRVLLGDEHLYQHPSSAVLYHTEM---- 613
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
+DPEG+L ++ + V +G+ G +++++KD ++ C
Sbjct: 614 --------------AYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDIVIRFTAWCC 659
Query: 350 LIGL------GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
++GL +S + + +NK L+S+SY T L++ A +LL
Sbjct: 660 ILGLTSAALTKVSENEGFIPVNKNLWSISYVTTLSSFAFFILL 702
>gi|90022681|ref|YP_528508.1| hypothetical protein Sde_3039 [Saccharophagus degradans 2-40]
gi|89952281|gb|ABD82296.1| conserved hypothetical protein [Saccharophagus degradans 2-40]
Length = 363
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/335 (27%), Positives = 145/335 (43%), Gaps = 87/335 (25%)
Query: 54 RRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
+R ++LDV RG T+A+MILV+ D G + + H+ W+G+T+ DFV PFFLFI+G +L
Sbjct: 4 QRYLALDVMRGATLAMMILVNTPGDWGFVYAPLLHADWHGVTITDFVFPFFLFIIGSALF 63
Query: 111 LTYKNFP--CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQR 168
T ++ + +K I R LF +G+ L + ++++R +GVLQR
Sbjct: 64 FTSRSSGQLAPAIKAKKIIKRTALLFTIGLLLHA--------FPFTTALSELRILGVLQR 115
Query: 169 IAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYE 228
IA+AY +AA +WL + + L + Y W+V ++ + Y L
Sbjct: 116 IALAYGIAAFI-VWLPTTQRLMAALGILVAY---WLVFILTDSSYHL------------- 158
Query: 229 FPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSP 288
A N V ID ILG +HL++
Sbjct: 159 -------------------------ADNIVRHIDITILGAEHLWQGK------------- 180
Query: 289 DYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV-----HFKDHRDRMLNW 343
FDPEGLLS++ A V L G L+V + R
Sbjct: 181 -------------GLAFDPEGLLSTLPAAVNILAGFEATRLLVSQPAGEPNNATSRQFKL 227
Query: 344 IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTA 378
+ + C I + L + M +NK+L++ S+ LT+
Sbjct: 228 ALYAMCSITIAL-IWHRWMPINKSLWTSSFVLLTS 261
>gi|256425421|ref|YP_003126074.1| hypothetical protein Cpin_6469 [Chitinophaga pinensis DSM 2588]
gi|256040329|gb|ACU63873.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
Length = 358
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 101/359 (28%), Positives = 164/359 (45%), Gaps = 75/359 (20%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
++RL+SLD FRGLTVA MILV++ G + P + HS WNG T D V PFFLF+VGVS
Sbjct: 1 MKQRLLSLDFFRGLTVAAMILVNNPGSWSYVYPPLEHSKWNGCTPTDLVFPFFLFMVGVS 60
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQR 168
+ + V+ +++ + +F G F I + D +R +GVLQR
Sbjct: 61 VTFALSSRKAD-VSGHTSLIIHIIRRAAILFAIGLAFRLIPSF----DFHNLRILGVLQR 115
Query: 169 IAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYE 228
I+I +LV +L + +K ++ + + +Y LL+ + VP +
Sbjct: 116 ISIVFLVISLLYL------KTGTKPRIW--------LCISFLVIYWLLMTVVPVPGYG-- 159
Query: 229 FPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSP 288
P + + N IDR +LG QHL+++ +RT
Sbjct: 160 -PANLEAET------------------NLAAWIDRTVLGEQHLWKQ---ART-------- 189
Query: 289 DYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR-MLNWIILS 347
+DPEGLLS++ A T L+G+ G + KD D ++W+ +
Sbjct: 190 ----------------WDPEGLLSTLPAISTGLLGIMTGDWLRR-KDVADADKVSWLFAA 232
Query: 348 SCL-IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
L + GL D +NK+L++ S+ T G + + LA Y+++ + + + PF
Sbjct: 233 GFLSVIAGLIWDGF-FPINKSLWTSSFVLYTGGLAAMGLALSYWLID-VQQYKSITPPF 289
>gi|311747386|ref|ZP_07721171.1| membrane protein [Algoriphagus sp. PR1]
gi|126579104|gb|EAZ83268.1| membrane protein [Algoriphagus sp. PR1]
Length = 381
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 153/348 (43%), Gaps = 72/348 (20%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R ++LDV RGLT+A MI+V+ G + + H+ W+G T D V P FLF+VG ++
Sbjct: 11 KNRYLALDVLRGLTIAFMIVVNSAGDWSNLYAPLAHAKWHGFTPTDLVFPTFLFVVGNAM 70
Query: 110 ALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINNLKYG--VDIAQIRWMGV 165
+ + K + +K R L +FL+G L F+ I+ ++I ++R GV
Sbjct: 71 SFSMKKLQEMPTSAFFKKVGKRTLLIFLIGWLLNAFPFYDISETGNFSLINITEVRLFGV 130
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQRIA+ Y AA+ Y G + + + + LL +G+
Sbjct: 131 LQRIALCYFFAAII------------------LYIGGVRLGWIFSGIALLTYWGI----- 167
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
Y F SS P+ G TG NA +D ++G+ +Y
Sbjct: 168 MYVF---GDSSDPY---------GLTG---NAAIKLDLSLIGVDRMYGGE---------- 202
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
PFDPEGLLS++ + V + G G ++ + + + + +I
Sbjct: 203 ----------------GIPFDPEGLLSTLPSIVNVIAGYIIGKMVQKYGNTLESIKKLLI 246
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+ LI L D V +NK +++ SY LT G VLLA + +++
Sbjct: 247 GAVVLIVLAYIWDIV-FPINKKIWTSSYVLLTVGIDMVLLALLVYIIE 293
>gi|395803976|ref|ZP_10483217.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
gi|395433620|gb|EJF99572.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
Length = 423
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 107/388 (27%), Positives = 162/388 (41%), Gaps = 112/388 (28%)
Query: 53 QRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ RLISLDVFRGLT+ LM +V+ D G + P + H+ WNG T D V PFF+FI+GV++
Sbjct: 3 RERLISLDVFRGLTILLMTIVNNPGDWGNVYPPLLHAHWNGCTPTDLVFPFFIFIMGVAV 62
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIF------------------------------ 139
L K ++R+L +F LGIF
Sbjct: 63 PLAMPEKKYDETTFNKILIRSLRMFCLGIFFNFFGKIQLFGLDGIPLLLVRLIITFAVGY 122
Query: 140 -LQGGFFHGINN------------LKYG--VDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
L G F + + N L YG + A +R GVLQRIAI Y V +L ++LK
Sbjct: 123 ALMGNFSNKLKNIFAFSILAIYIILAYGGFENYADVRLPGVLQRIAIVYFVVSL--LYLK 180
Query: 185 GDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVT 244
S K LF ++LL+G + + T P I
Sbjct: 181 ----TSRKTQLFTG---------------IVLLFGYW--------AIMTLVPVPGIGEAN 213
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
RG+ N +D +L H+Y + +W
Sbjct: 214 LE-RGT-----NLAAWVDSVLLK-GHMYH----------------------ETNTW---- 240
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHL 364
DPEG+LS++ + V +IGL G +++ + ++ + LI GL D V +
Sbjct: 241 -DPEGILSTIPSIVNGIIGLFIGQILLLNITKIQKAQRMGMIGTSLIFFGLIWDLV-FPI 298
Query: 365 NKALYSLSYTCLTAGASGVLLAGIYFMV 392
NK++++ SY T G + V L +Y+++
Sbjct: 299 NKSIWTSSYVLYTTGLATVCLTVLYYII 326
>gi|323447301|gb|EGB03229.1| hypothetical protein AURANDRAFT_68196 [Aureococcus anophagefferens]
Length = 399
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 62/153 (40%), Positives = 86/153 (56%), Gaps = 9/153 (5%)
Query: 49 PQHQQR--RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVG 106
P+ +R R+ SLDV RG V LMI VDD G ++HSPW+GLT+AD VMPFF+F+VG
Sbjct: 6 PESARRPPRVRSLDVVRGFAVLLMIFVDDAGSAYAVLDHSPWDGLTIADVVMPFFIFMVG 65
Query: 107 VSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
VS AL R R L+++G+ +QGG +G D+ +RW G+L
Sbjct: 66 VSAALALGGKRTLAPVLR----RGATLWVVGVAVQGGGLPDPTTYAWGYDLGTVRWCGIL 121
Query: 167 QRIAIAYLVAA---LCEIWLKGDGHVSSKLSLF 196
QRIA Y+VA+ LC + G ++ +F
Sbjct: 122 QRIAACYVVASALVLCSPRARRGGLATADDGVF 154
Score = 37.7 bits (86), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 44/112 (39%), Gaps = 26/112 (23%)
Query: 300 WCQAP--FDPEGLLSSVMATVTCLIGLHFGHLIVHFK------DHRDR------------ 339
WC AP DPEG LS V+A T +IG F + + D DR
Sbjct: 167 WCAAPGVADPEGFLSGVLAVATAVIGALFARCLERCRAPRPGRDSDDRGGGYARLVDGDG 226
Query: 340 ------MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLL 385
ML+W + S L L G +NK L++ SY TA G L
Sbjct: 227 DRRGALMLHWALASLALAAAALVAVAAGSPVNKQLWTPSYCLATAALCGFAL 278
>gi|295132874|ref|YP_003583550.1| hypothetical protein ZPR_1009 [Zunongwangia profunda SM-A87]
gi|294980889|gb|ADF51354.1| membrane protein [Zunongwangia profunda SM-A87]
Length = 376
Score = 96.3 bits (238), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 98/351 (27%), Positives = 149/351 (42%), Gaps = 75/351 (21%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGV 107
R +SLDVFRGLT+ALMILV+ G + P + H+ W G TLAD V P FLF VG
Sbjct: 4 KTSERFLSLDVFRGLTIALMILVNTPGTGADLYPYLVHAQWFGFTLADLVFPSFLFAVGN 63
Query: 108 SLALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINNLKYGVD-IAQIRWMG 164
+++ + + F A +K + R +FLLG + F +N + ++ R MG
Sbjct: 64 AMSFSMRKFQEAAPADFWKKVLKRTAIIFLLGFLMYWFPFFRMNEGHLELSPFSETRIMG 123
Query: 165 VLQRIAIAYLVAALCEIW--LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYV 222
VLQRIA+ Y A+ + +K G + + + L +W + LYG
Sbjct: 124 VLQRIALCYFFGAVLVRYFSVKTIGFICAAILL-----AYWGI-----------LYGFGE 167
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
P + E A NA D ILG H+Y+K
Sbjct: 168 PGHELEM------------------------ATNAAAKFDYAILGEGHIYKK-------- 195
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
DA PFDPEG+LS++ + V L G + + + K +
Sbjct: 196 -------------DA-----IPFDPEGILSTLPSIVNVLAG-YLAGVFIRRKGKSYETIA 236
Query: 343 WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
++L+ L+ + L+K L++ + LT G + +LA + F V
Sbjct: 237 KLMLAGFLVFALAEWWALIFPLSKKLWTSPFAMLTIGLNLSMLAALIFAVE 287
>gi|224537871|ref|ZP_03678410.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
DSM 14838]
gi|423227284|ref|ZP_17213748.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
CL02T12C19]
gi|224520557|gb|EEF89662.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
DSM 14838]
gi|392624424|gb|EIY18516.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 373
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 92/334 (27%), Positives = 147/334 (44%), Gaps = 74/334 (22%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
RL++LDV RG+T+A MILV++ G + + H+ W+GLT D V PFF+FI+GV++
Sbjct: 5 NNRLLALDVIRGITIAGMILVNNPGSWQSVYAPLQHARWHGLTPTDLVYPFFMFIMGVAI 64
Query: 110 ALTYKNF-PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV-DIAQIRWMGVLQ 167
+ + F + K I R + LF +GI L + L YG +R +GV+Q
Sbjct: 65 HFSLRKFDKLNTTVSLKIIRRTVALFAVGIALD-----CFSKLCYGTFSWEHLRILGVMQ 119
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
R+A+AY A I + +KY YL G+
Sbjct: 120 RLALAYAGGAFLAILIP------------KKY-------------YLATAGGI------- 147
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
+ FN G A N + +ID K+LG HL ++ +
Sbjct: 148 ---LLLYLVLLQCFN------GYVHSADNLIAVIDVKLLGAGHLIKE------------T 186
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
+ G F+PEGLLS++ L G G LI ++++R+ +
Sbjct: 187 AEGGSF----------AFEPEGLLSTIPCWAHVLFGTFVGSLITGIAENKERIRQIALFG 236
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGAS 381
+ L+ G L ++ +NK L++ SYT +T+G +
Sbjct: 237 TVLLFAGFLLQYLD-PINKKLWTASYTLITSGTA 269
>gi|317478517|ref|ZP_07937676.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
gi|316905331|gb|EFV27126.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
Length = 394
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 163/360 (45%), Gaps = 97/360 (26%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V PFF+FI+G+S
Sbjct: 7 SKRILALDILRGVTIAGMIMVNNPGSWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIST 66
Query: 110 ALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NNLKYGVDI-- 157
++ K NF A K + R + +FL+G+ + G+F ++L +G +
Sbjct: 67 YISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI--GWFSRFCYYWASAPDDLSFGEKLWA 124
Query: 158 -----AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
+IR +GV+QR+A+ Y A++ + +K H+ +++A +LT
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMK-HKHIP------------YLIAGLLTGY 171
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
++LL+ CG G N + ++DR IL H+Y
Sbjct: 172 FILLM---------------------------CG-NGFAYNETNILSVVDRAILTPAHMY 203
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
+ D G DPEGLLS++ A L+G G L++
Sbjct: 204 K---------------DNG-------------IDPEGLLSTIPAIAHVLLGFCVGRLMLD 235
Query: 333 FKDHRDR--MLNWIILSSCLIGLGLSLDFV----GMHLNKALYSLSYTCLTAGASGVLLA 386
DR LN +++ L+G+ L+ G +NK ++S +Y +T G + LA
Sbjct: 236 GNKSEDRASFLNSQLITLLLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASSFLA 295
>gi|418737426|ref|ZP_13293823.1| putative membrane protein [Leptospira borgpetersenii serovar
Castellonis str. 200801910]
gi|410746620|gb|EKQ99526.1| putative membrane protein [Leptospira borgpetersenii serovar
Castellonis str. 200801910]
Length = 383
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 161/370 (43%), Gaps = 94/370 (25%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TV MILV++ G + + H+ WNG T D V PFFL
Sbjct: 2 EKKSTQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFL 61
Query: 103 FIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
F VG S ++L KN ++ +R ++L LLG+FL F G +++
Sbjct: 62 FAVGTSIPISLYSKNGINRIRVWIGICIRGISLILLGLFLN---FFG------EWTFSEL 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R GVLQRI Y V A +LF + G V+ ++ L
Sbjct: 113 RIPGVLQRIGFVYWVVA----------------TLFLIFPGKKVLVFLIPI--------L 148
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
V W + T + P V G A IDR I G +HL++ +S+T
Sbjct: 149 LVHTW-----ILTHIAPP----GESMVSLEQGKDIGA--WIDRTIFGEKHLWK---FSKT 194
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDR 339
+DPEG LS + + T L G+ G ++ + ++R
Sbjct: 195 ------------------------WDPEGFLSGIASIATSLFGVICGFILFRREGRGKNR 230
Query: 340 MLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+L+ + GLG FVG + +NK+L++ SY T G S L G +
Sbjct: 231 VLS-------IFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLS-FLCIGFF---E 279
Query: 394 YISSHLMLKK 403
Y+ S ++LKK
Sbjct: 280 YLDSLILLKK 289
>gi|225010297|ref|ZP_03700769.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
gi|225005776|gb|EEG43726.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
Length = 363
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 102/346 (29%), Positives = 163/346 (47%), Gaps = 76/346 (21%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R+ S+D+FRG T+ LMILV+ G + H+ W+G TL D V PFF+FIVGVS+
Sbjct: 4 NKRVPSVDIFRGATLLLMILVNTPGTWSAVYAPFLHASWHGYTLTDLVFPFFIFIVGVSI 63
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
+L+YK+ K R+L L LG+FL G F +K + IR+ GVLQRI
Sbjct: 64 SLSYKDKKLNGPVFFKLTKRSLKLIGLGLFL-GAFTISFPFIK---AVENIRFPGVLQRI 119
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
+ + A++ +W SK S AL++ +LL Y L W
Sbjct: 120 GLVFFFASIIYLW-------GSKRS----------TALIIGI--ILLAYWL----WMGFL 156
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
P++ + + A N ID K+LG H+++ PD
Sbjct: 157 PLDGVAPT------------YERAANNWANFIDFKVLG-SHMWK--------------PD 189
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
Y DPEG+LS++ A T L+G G ++ + + + ++ +++S
Sbjct: 190 Y---------------DPEGILSTLPAIATALLGTLAGDVL---RSNTYQKVSLLLISGL 231
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYI 395
++ L + +NKAL+S S+ +TAG + ++L GI F +R +
Sbjct: 232 VLLGLGHLLDLSFPINKALWSSSFVMVTAGWANLVL-GILFYLREV 276
>gi|418719584|ref|ZP_13278783.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
gi|410743627|gb|EKQ92369.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
Length = 383
Score = 95.9 bits (237), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 105/370 (28%), Positives = 161/370 (43%), Gaps = 94/370 (25%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TV MILV++ G + + H+ WNG T D V PFFL
Sbjct: 2 EKKSTQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFL 61
Query: 103 FIVGVS--LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
F VG S ++L KN ++ +R ++L LLG+FL F G +++
Sbjct: 62 FAVGASIPISLYSKNGINRIRVWIGICIRGISLILLGLFLN---FFG------EWTFSEL 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R GVLQRI Y V A +LF + G V+ ++ L
Sbjct: 113 RIPGVLQRIGFVYWVVA----------------TLFLIFPGKKVLVFLIPI--------L 148
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
V W + T + P V G A IDR I G +HL++ +S+T
Sbjct: 149 LVHTW-----ILTHIAPP----GESMVSLEQGKDIGA--WIDRTIFGEKHLWK---FSKT 194
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDR 339
+DPEG LS + + T L G+ G ++ + ++R
Sbjct: 195 ------------------------WDPEGFLSGIASIATSLFGVICGFILFRREGRGKNR 230
Query: 340 MLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+L+ + GLG FVG + +NK+L++ SY T G S L G +
Sbjct: 231 VLS-------IFGLGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGLS-FLCIGFF---E 279
Query: 394 YISSHLMLKK 403
Y+ S ++LKK
Sbjct: 280 YLDSLILLKK 289
>gi|313147781|ref|ZP_07809974.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|423280992|ref|ZP_17259903.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
610]
gi|313136548|gb|EFR53908.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|404583442|gb|EKA88121.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
610]
Length = 387
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 156/357 (43%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +RL++LDV RG+T+A MI+V++ G + + H+ W GLT D V PFF+FI+G
Sbjct: 3 QTANKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDI----- 157
+S ++L NF A K + R + +F LGI F H N+L G DI
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFSR 121
Query: 158 --------AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
IR +GV+QR+A+ Y A+ + +K KY + + AL L
Sbjct: 122 LGESVWTFGHIRILGVMQRLALCYGATAIIALIMK------------HKYIPYLIAAL-L 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+++L+ G G + N + ++DR +LG
Sbjct: 169 IGYFIILITG----------------------------NGFEYNSTNILAVVDRAVLGEA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEG+LS++ + LIG G L
Sbjct: 201 HMYK---------------DNG-------------IDPEGVLSTIPSIAHVLIGFCVGKL 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ KD +++ ++ + L G L + G ++K ++S ++ +T G + LA
Sbjct: 233 LMEVKDINEKIERLFLVGTILTFAGFLLSY-GCPISKKIWSPTFAIVTCGLASSFLA 288
>gi|424666001|ref|ZP_18103037.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
616]
gi|404574254|gb|EKA79005.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
616]
Length = 387
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 156/357 (43%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +RL++LDV RG+T+A MI+V++ G + + H+ W GLT D V PFF+FI+G
Sbjct: 3 QTANKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDI----- 157
+S ++L NF A K + R + +F LGI F H N+L G DI
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFSR 121
Query: 158 --------AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
IR +GV+QR+A+ Y A+ + +K KY + + AL L
Sbjct: 122 LGESVWTFGHIRILGVMQRLALCYGATAIIALIMK------------HKYIPYLIAAL-L 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+++L+ G G + N + ++DR +LG
Sbjct: 169 IGYFIILITG----------------------------NGFEYNSTNILAVVDRAVLGEA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEG+LS++ + LIG G L
Sbjct: 201 HMYK---------------DNG-------------IDPEGVLSTIPSIAHVLIGFCVGKL 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ KD +++ ++ + L G L + G ++K ++S ++ +T G + LA
Sbjct: 233 LMEVKDINEKIERLFLVGTILTFAGFLLSY-GCPISKKIWSPTFAIVTCGLASSFLA 288
>gi|160887858|ref|ZP_02068861.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
gi|156862688|gb|EDO56119.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
Length = 394
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 163/360 (45%), Gaps = 97/360 (26%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V PFF+FI+G+S
Sbjct: 7 SKRILALDILRGVTIAGMIMVNNPGSWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIST 66
Query: 110 ALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NNLKYGVDI-- 157
++ K NF A K + R + +FL+G+ + G+F ++L +G +
Sbjct: 67 YISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI--GWFSRFCYYWASAPDDLSFGEKLWA 124
Query: 158 -----AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
+IR +GV+QR+A+ Y A++ + +K H+ +++A +LT
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMK-HKHIP------------YLIAGLLTGY 171
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
++LL+ CG G N + ++DR IL H+Y
Sbjct: 172 FILLM---------------------------CG-NGFAYNETNILSVVDRAILTPAHMY 203
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
+ D G DPEGLLS++ A L+G G L++
Sbjct: 204 K---------------DNG-------------IDPEGLLSTIPAIAHVLLGFCVGRLMLD 235
Query: 333 FKDHRDR--MLNWIILSSCLIGLGLSLDFV----GMHLNKALYSLSYTCLTAGASGVLLA 386
DR LN +++ L+G+ L+ G +NK ++S +Y +T G + LA
Sbjct: 236 GNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASSFLA 295
>gi|270295536|ref|ZP_06201737.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274783|gb|EFA20644.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 394
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 163/360 (45%), Gaps = 97/360 (26%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V PFF+FI+G+S
Sbjct: 7 SKRILALDILRGVTIAGMIMVNNPGSWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIST 66
Query: 110 ALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NNLKYGVDI-- 157
++ K NF A K + R + +FL+G+ + G+F ++L +G +
Sbjct: 67 YISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI--GWFSRFCYYWASAPDDLSFGEKLWA 124
Query: 158 -----AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
+IR +GV+QR+A+ Y A++ + +K H+ +++A +LT
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMK-HKHIP------------YLIAGLLTGY 171
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
++LL+ CG G N + ++DR IL H+Y
Sbjct: 172 FILLM---------------------------CG-NGFAYNETNILSVVDRAILTPAHMY 203
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
+ D G DPEGLLS++ A L+G G L++
Sbjct: 204 K---------------DNG-------------IDPEGLLSTIPAIAHVLLGFCVGRLMLD 235
Query: 333 FKDHRDR--MLNWIILSSCLIGLGLSLDFV----GMHLNKALYSLSYTCLTAGASGVLLA 386
DR LN +++ L+G+ L+ G +NK ++S +Y +T G + LA
Sbjct: 236 GNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASSFLA 295
>gi|390956359|ref|YP_006420116.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
gi|390411277|gb|AFL86781.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
Length = 404
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 101/358 (28%), Positives = 151/358 (42%), Gaps = 85/358 (23%)
Query: 28 NEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG--ILPAINH 85
++E DP + S +S R + RL+S+DV RGLT+A MILV++ G + H
Sbjct: 4 GRQVELDPEIA-STVDSDVVRTAPKTERLLSVDVLRGLTIAFMILVNNQPGPGAFFELQH 62
Query: 86 SPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL-----RALNLFLLGIFL 140
+ WNG TL D V P FLF+VG+SL L+ K A+RK + R+ L L GI +
Sbjct: 63 AQWNGFTLTDLVFPTFLFLVGLSLVLSTAARLAK-GASRKTLFLHTLRRSAVLALFGIVV 121
Query: 141 QGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK-Y 199
F ++ +IR+ GVLQR A+ YLV S L L RK +
Sbjct: 122 NTFPFQHLD---------RIRFYGVLQRTALCYLVV--------------SGLCLLRKGW 158
Query: 200 RGHWVVALVLTTLYLLLLYGLYVPDW---QYEFPVETSSSSPWIFNVTCGVRGSTGPACN 256
+ +A+ +Y +L+ + VP + +E P+ P N
Sbjct: 159 KDKAAIAVACLVVYWVLMRFVPVPGFGTPTHEIPIND-------------------PNGN 199
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
+DR I QHLY Q DPEGLLS++ A
Sbjct: 200 LTAWLDRLIFAPQHLY-----------------------------QQVRDPEGLLSTLPA 230
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYT 374
T L G+ G + + + + + L + + L G LNK L++ S++
Sbjct: 231 ISTALYGVLAGTWLRTTRSTTAKAVG-LALGGVAMTVAGWLWSYGFPLNKKLWTSSFS 287
>gi|338212268|ref|YP_004656323.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336306089|gb|AEI49191.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 365
Score = 95.5 bits (236), Expect = 5e-17, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 145/349 (41%), Gaps = 84/349 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q RL+SLD RG T+A M++V+ G + + H+ WNGL+ D V P FLF+VGVS
Sbjct: 4 QTNRLVSLDALRGFTIAAMLMVNFPGSEEYVFFTLRHTKWNGLSFTDLVAPIFLFVVGVS 63
Query: 109 LALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ Y T RK I+R+L +F +G+FL NL D + IRW G
Sbjct: 64 IVFAYSKRKWDGRPTGELYRKIIIRSLKIFAVGMFL---------NLMPTFDFSDIRWTG 114
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGH-WVVALVLTTLYLLLLYGLYVP 223
L RIA +L A+ L L ++ WV A++L +L L
Sbjct: 115 TLHRIAFVFLGCAV--------------LYLNTNWKQQAWVGAVILVAYWLAL------- 153
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
T +P I V P N V D +
Sbjct: 154 ---------TLIPTPGIGKVML------EPGVNLVAWFDTQF------------------ 180
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
L W Q +DPE +LS+ + V+ + G+ G L+ +++ N+
Sbjct: 181 -----------LPGKMW-QGTWDPESILSTFPSIVSGITGMLAGQLLQSTFTPNEKV-NY 227
Query: 344 IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ + +G +N+ L++ S+ +T+G + +LL +YFMV
Sbjct: 228 LMTAGVFSAALGYFWGLGFPVNENLWTSSFVLVTSGFACLLLGALYFMV 276
>gi|300865789|ref|ZP_07110543.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300336202|emb|CBN55698.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 376
Score = 95.1 bits (235), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 109/364 (29%), Positives = 152/364 (41%), Gaps = 81/364 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+ +A MILV++ G + P ++H+ WNG T D + PFFLF VG +++
Sbjct: 2 RLTSLDVFRGIAIASMILVNNPGSWDYVYPPLDHAEWNGCTPTDLIFPFFLFAVGAAMSF 61
Query: 112 -----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI---AQIRWM 163
T +N P V R ILR L L L F ++ L G I +IR +
Sbjct: 62 SLSKYTEENPPISTVYWR--ILRRATLLFLLGLLLNSFSIFLDVLLNGSPIENFGKIRIL 119
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
GVLQRI++AY +AA+ + L +L Y G AL L
Sbjct: 120 GVLQRISLAYFLAAIAILNLSSRNLRILAATLLLGYWG----ALTL-------------- 161
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
PV ++ P N IDR ILG QHLYR
Sbjct: 162 -----IPVPGYGAN------------LLTPEGNLGAYIDRLILGTQHLYR---------- 194
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR-MLN 342
Q +DPE LL ++ A T L G +F + + + R N
Sbjct: 195 ------------------QGQYDPESLLGTLPAIATVLAG-YFTTQWLRVQPIKTRTTWN 235
Query: 343 WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLK 402
+I + +G F +NK L++ SY LTAG + +LLA Y +
Sbjct: 236 LVIFGLASLTIGQLWGF-WFPINKQLWTSSYVLLTAGWAILLLAICYETIE--VRRWQWG 292
Query: 403 KPFD 406
+PF+
Sbjct: 293 RPFE 296
>gi|149280688|ref|ZP_01886799.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
gi|149228553|gb|EDM33961.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
Length = 385
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 101/364 (27%), Positives = 157/364 (43%), Gaps = 77/364 (21%)
Query: 51 HQQRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGV 107
Q RL+SLD FRG TVA MILV+ D G I + H+ W+G T D V PFFLFIVGV
Sbjct: 9 EQPVRLLSLDFFRGATVAAMILVNNPGDWGHIYAPLEHADWHGCTPTDLVFPFFLFIVGV 68
Query: 108 SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD-IAQIRWMGVL 166
S+A + + K IL+AL L+ L N V+ Q+R GVL
Sbjct: 69 SIAYAMGSKKTDPSSHGKTILKALKRTLILFGLGLFLSLFPNVFSNPVEAFQQVRIPGVL 128
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLY----GLYV 222
QRIA+ + + ++ I+LK SS+ ++FR + ++L + ++ + G
Sbjct: 129 QRIAVVFFICSI--IFLK-----SSERTIFR------TMVIILAAYWAIMTFIPVPGTGF 175
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
P+ + E N IDR + HL++ S+T
Sbjct: 176 PNLEKE--------------------------TNLGAWIDRGVFTEAHLWKS---SKT-- 204
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
+DPEGLLS++ A T L G+ G + +
Sbjct: 205 ----------------------WDPEGLLSTLPAIATGLFGILVGSYLKRKDIEPATKIA 242
Query: 343 WIILS-SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLML 401
W+ + + LGL D + +NK L++ S+ T G + +L+ Y+++ + +
Sbjct: 243 WLFSTGAAATALGLLWD-LQFPINKQLWTSSFVLYTGGLATTILSLSYWIID-VQQYNRF 300
Query: 402 KKPF 405
KPF
Sbjct: 301 TKPF 304
>gi|265765098|ref|ZP_06093373.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|263254482|gb|EEZ25916.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
Length = 387
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 95/357 (26%), Positives = 158/357 (44%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +RL++LDV RG+T+A MI+V++ G I + H+ W GLT D V PFF+FI+G
Sbjct: 3 QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYIYAPLGHAAWIGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDIA---- 158
+S ++L NF A K + R + +F LGI F N+L G DI+
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121
Query: 159 ---------QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
IR +GV+QR+A+ Y A+ + +K KY + ++A++L
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMK------------HKYIPY-LIAILL 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+++L+ G G + N + ++DR +LG
Sbjct: 169 IGYFIILING----------------------------NGFEYNSSNILSIVDRTVLGEA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEGLLS++ + LIG G L
Sbjct: 201 HMYK---------------DNG-------------IDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ KD +++ ++ + L G L + G ++K ++S ++ +T G + LA
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY-GCPISKKIWSPTFAIVTCGLASSFLA 288
>gi|53711719|ref|YP_097711.1| hypothetical protein BF0428 [Bacteroides fragilis YCH46]
gi|52214584|dbj|BAD47177.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 387
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 158/357 (44%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +RL++LDV RG+T+A MI+V++ G + + H+ W GLT D V PFF+FI+G
Sbjct: 3 QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDIA---- 158
+S ++L NF A K + R + +F LGI F N+L G DI+
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121
Query: 159 ---------QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
IR +GV+QR+A+ Y A+ + +K KY + ++A++L
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMK------------HKYIPY-LIAILL 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+++L+ G G + N + ++DR +LG
Sbjct: 169 IGYFIILING----------------------------NGFEYNSSNILSIVDRTVLGEA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEGLLS++ + LIG G L
Sbjct: 201 HMYK---------------DNG-------------IDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ KD +++ ++ + L G L + G ++K ++S ++ +T G + LA
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY-GCPISKKIWSPTFAIITCGLASSFLA 288
>gi|427400072|ref|ZP_18891310.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
gi|425720812|gb|EKU83727.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
Length = 380
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 100/349 (28%), Positives = 158/349 (45%), Gaps = 87/349 (24%)
Query: 55 RLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLD FRG T+A M+LV+ D G + + H+ W+G T D + PFFLFI GV++AL
Sbjct: 7 RLTSLDAFRGFTIAAMVLVNNPGDWGHLHAQLAHAAWHGWTFTDTIFPFFLFIGGVAMAL 66
Query: 112 TYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ + +L RA +FL+G L NL D +R GVLQ
Sbjct: 67 SLGRLAAAGAHKPQLLLKLAKRAALIFLIGFLL---------NLIPRFDFDSVRIPGVLQ 117
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIA+ L+AA L ++ +RG + +L LY +L+ + VP
Sbjct: 118 RIALCTLLAA--------------PLVVYLTWRGQALAVFLLLALYSVLMLLVPVP---- 159
Query: 228 EFPVETSSSSPWIFNVTCGV-RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G+ G+ P +A IDR +
Sbjct: 160 ------------------GIGAGNLEPGRDAGAWIDRAL--------------------- 180
Query: 287 SPDYGPMPLDAPSWCQAP-FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
+D W QA +DPEGL+S++ A + L G+ G L++ ++++ W++
Sbjct: 181 --------MDGHLWAQAKTWDPEGLVSTLPAVCSLLFGVLAGRLLLSALPRVEQVV-WLM 231
Query: 346 LS--SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
LS +CL LG +LD V M +NK+L++ S+ L +G + + Y+++
Sbjct: 232 LSGLACL-ALGSTLDAVLMPINKSLWTPSFCLLMSGWALLAFGASYWLL 279
>gi|343082900|ref|YP_004772195.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342351434|gb|AEL23964.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 365
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 57/133 (42%), Positives = 80/133 (60%), Gaps = 18/133 (13%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RLISLD RG T+A MILV+ G + P + H+ W+G+T+ DF+ PFF+F+VGVS+A
Sbjct: 7 RLISLDALRGFTIAAMILVNYPGSWSHVYPPLLHAEWHGMTMTDFIFPFFIFMVGVSVAF 66
Query: 112 TY-----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
Y + P K +K I+RA+ LF+LGIFL NL D + +R GVL
Sbjct: 67 AYSKRLDEGVP-KAGMYKKIIIRAIKLFVLGIFL---------NLIPDFDFSHMRIAGVL 116
Query: 167 QRIAIAYLVAALC 179
QRI+I +L ++
Sbjct: 117 QRISIVFLASSFL 129
>gi|237720190|ref|ZP_04550671.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|293371122|ref|ZP_06617659.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|229450742|gb|EEO56533.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|292633780|gb|EFF52332.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
Length = 371
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/348 (27%), Positives = 156/348 (44%), Gaps = 74/348 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLD+ RG+T+ MILV++ G I + H+ WNGLT D V PFF+FI+GVS
Sbjct: 2 KSERLLSLDILRGITIVGMILVNNPGTWESIYAPLRHAEWNGLTPTDLVFPFFMFIMGVS 61
Query: 109 LALTYKNFPCKVVATRKAILRAL--NLFLLGIFLQGGFFHGINNLKYGVD--IAQIRWMG 164
++ F +R I++ + L L + L +F + GV+ + IR +G
Sbjct: 62 MSFALSRFDHHF--SRGFIIKLVRRTLILFLLGLFLSWFSLVCT---GVEQPFSHIRILG 116
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+A+AY +L + ++ +++ W+ ++L LL G
Sbjct: 117 VLQRLALAYFFGSLLIVGVRRPANLA------------WISGIILAGYSTLLALG----- 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G N + + DR + G HLYR+ +
Sbjct: 160 -----------------------HGFELSEQNIIAVTDRTLFGEAHLYREWL-------- 188
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PD G + FDPEGLLS++ +IG G+++ + R+L
Sbjct: 189 ---PDGG----------RIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRLLQIS 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
IL L+ G L + G LNK ++S ++ +T G + +LL + +++
Sbjct: 236 ILGIALLFAGWLLSY-GCPLNKKVWSPTFVLVTCGFASLLLVFLTWLI 282
>gi|374309722|ref|YP_005056152.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358751732|gb|AEU35122.1| hypothetical protein AciX8_0773 [Granulicella mallensis MP5ACTX8]
Length = 377
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 92/336 (27%), Positives = 147/336 (43%), Gaps = 61/336 (18%)
Query: 65 LTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVV 121
+ +A MILV D G + P + H+ WNG T D + P FL I+GV++ ++ + +
Sbjct: 1 MIIAGMILVTDPGTYSAVYPQLMHAQWNGATATDMIFPSFLVIIGVAMTFSFAS-RIERG 59
Query: 122 ATRKAIL-----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVA 176
A R+ IL R++ L LG+ + G F N + IR G+LQRIA+ Y
Sbjct: 60 ADRRQILWHVLTRSVLLIFLGLLVNG--FPEYN-------LHTIRIPGILQRIALCYFAG 110
Query: 177 ALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSS 236
+L + + G +++ R+ V L LY +LL G VP +
Sbjct: 111 SLLYLAVSGKKDANTESQRLRRGTVIGAVLAGLLVLYWVLLKGYPVPGFG---------- 160
Query: 237 SPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G N DRKI G+QHL+ + +P YG
Sbjct: 161 -----------SGRLDSLGNVAAYFDRKIFGVQHLWAYGL----------TPGYG----- 194
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLS 356
FDPEGLLS++ A T L G+ G + + + L + L+ +GL+
Sbjct: 195 ------VTFDPEGLLSTLPALATLLFGVLAGEWLRTNQARGRKALVLAVAGVALVLVGLA 248
Query: 357 LDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L + + LNK + + ++ + G + +L AG YF++
Sbjct: 249 LSPL-LPLNKKILTSTFAIFSGGVALLLFAGFYFVL 283
>gi|60679957|ref|YP_210101.1| hypothetical protein BF0369 [Bacteroides fragilis NCTC 9343]
gi|336407897|ref|ZP_08588393.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
gi|423248371|ref|ZP_17229387.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
CL03T00C08]
gi|423253319|ref|ZP_17234250.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
CL03T12C07]
gi|423269643|ref|ZP_17248615.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
CL05T00C42]
gi|423272798|ref|ZP_17251745.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
CL05T12C13]
gi|60491391|emb|CAH06139.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
gi|335944976|gb|EGN06793.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
gi|392657219|gb|EIY50856.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
CL03T12C07]
gi|392659584|gb|EIY53202.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
CL03T00C08]
gi|392700489|gb|EIY93651.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
CL05T00C42]
gi|392708362|gb|EIZ01469.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
CL05T12C13]
Length = 387
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 158/357 (44%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +RL++LDV RG+T+A MI+V++ G + + H+ W GLT D V PFF+FI+G
Sbjct: 3 QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDIA---- 158
+S ++L NF A K + R + +F LGI F N+L G DI+
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121
Query: 159 ---------QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
IR +GV+QR+A+ Y A+ + +K KY + ++A++L
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMK------------HKYIPY-LIAILL 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+++L+ G G + N + ++DR +LG
Sbjct: 169 IGYFIILING----------------------------NGFEYNSSNILSIVDRTVLGEA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEGLLS++ + LIG G L
Sbjct: 201 HMYK---------------DNG-------------IDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ KD +++ ++ + L G L + G ++K ++S ++ +T G + LA
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY-GCPISKKIWSPTFAIVTCGLASSFLA 288
>gi|333029673|ref|ZP_08457734.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
gi|332740270|gb|EGJ70752.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
Length = 387
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 154/357 (43%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDD---VGGILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ + +RL++LD+ RG+T+A MILV++ G I + H+ WNGLT D V PFF+FI+G
Sbjct: 3 KKENQRLLALDILRGITIAGMILVNNPGSWGSIYAPLGHAEWNGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDIA---- 158
+S +L F A K + R + +F LGI F N+L DI+
Sbjct: 63 ISTYFSLRKYKFEFSKEAALKILKRTIIIFAIGLGIAWFSLFLRTWNSLA-SADISIFER 121
Query: 159 ---------QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
+R +GV+ R+A+ Y A+ + +K KY +V +++
Sbjct: 122 LSQSIFVFENLRILGVMPRLALTYCATAIIALTIK------------HKYIPTLIVGILI 169
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
++L L + + +N T N + ++D+ ILG
Sbjct: 170 VYTFILFL------------------GNGFEYNET-----------NILSIVDKAILGEN 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEGL+S++ A L+G G +
Sbjct: 201 HMYK---------------DNG-------------IDPEGLVSTIPAIAHVLLGFFVGKI 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
KD ++ I+ S L +GL L + G +NK ++S ++ T G + LLA
Sbjct: 233 FTEKKDIHSKVEFLFIMGSILTFVGLLLSY-GCPINKKIWSPTFVLTTCGLASTLLA 288
>gi|156401292|ref|XP_001639225.1| predicted protein [Nematostella vectensis]
gi|156226352|gb|EDO47162.1| predicted protein [Nematostella vectensis]
Length = 376
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/342 (27%), Positives = 155/342 (45%), Gaps = 65/342 (19%)
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR----K 125
MI V+ GG H+ WNGL +AD V P+F++I+GVS+ L++K+ + V K
Sbjct: 1 MIFVNFGGGGYYFFGHAAWNGLLVADLVFPWFIWIMGVSITLSFKSLKRRKVKKWKICLK 60
Query: 126 AILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG 185
I R+L LF LG+F +N D+ R GVLQR A Y+V AL +++L G
Sbjct: 61 VIRRSLILFGLGLF--------TSNFN---DLETYRIPGVLQRFAACYIVIALMQLFL-G 108
Query: 186 DGHVSSKLSLFRKYRG----------HWVVALVLTTLYLLLLYGLYV---PDWQYEFPVE 232
+++ L+ K+ W+ L+L +Y+ + Y + + P Y P
Sbjct: 109 PSEEQTQV-LYPKWWDPIRDVVSIWKQWLAMLLLLAIYVTVTYAVKLDGCPR-GYTGPGG 166
Query: 233 TSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGP 292
P FN T GV IDRK G +H+Y+ P + + +
Sbjct: 167 IGRGYPEAFNCTGGV----------ANYIDRKFFG-KHIYQWPTVKQLYKTKL------- 208
Query: 293 MPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC--L 350
P +PEG L ++ + +G+ G ++ ++ +R+ W+ L
Sbjct: 209 -----------PHEPEGFLGTLTSIFLVFLGVQAGRILHTYRKSTERITRWLAWGVFLGL 257
Query: 351 IGLGL---SLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
IG+GL S + + +NK L+S+S+ +T +S LL Y
Sbjct: 258 IGVGLCKASENEGVVPINKNLWSVSFILVTGSSSFFLLTFCY 299
>gi|374580713|ref|ZP_09653807.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
DSM 17734]
gi|374416795|gb|EHQ89230.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
DSM 17734]
Length = 375
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 102/350 (29%), Positives = 158/350 (45%), Gaps = 78/350 (22%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSPWNGLTLADFVMPFFLFIVG 106
+ + R +D+FRGLT++LM++ + G I +PA + H+ WNG T+ DFV PFF+F +G
Sbjct: 2 EKGKLRFDCIDIFRGLTISLMLICSNPGNITNIPAQLRHADWNGATIGDFVFPFFIFSMG 61
Query: 107 VSLALTYKNFPCKVVATRKAILRALN----LFLLGIFLQGGFFHGINNLKYGVDIAQIRW 162
+ + + K ++ + I+ LN +FLLG+ L G D+A IR
Sbjct: 62 IVVPIAINRRLEKGISQMRIIINVLNRSIVMFLLGLILNG---------FPTFDLAIIRV 112
Query: 163 MGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYV 222
GVLQRIAI Y +AL I+L V L ++A++L +Y LL GL V
Sbjct: 113 PGVLQRIAIVYFCSAL--IYLLFKSIVKKDLVQIGILT---LIAVLLLAIYYWLLKGLQV 167
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
P G+ G G V ID K L HLY
Sbjct: 168 P----------------------GIEGLKG---GLVSYIDLKYLK-GHLY---------- 191
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
P+ FDPEG+LS++ A + +IG+ G + + +D R +
Sbjct: 192 --------------TPT-----FDPEGILSTIPALSSGIIGVVVGMIFLR-RDSRFVKMT 231
Query: 343 WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ S L+ + NK L+S S+ LT+G G+L+ I++++
Sbjct: 232 IFVCSGILLIIFAEWFNAYFPYNKQLWSSSFVLLTSGF-GILVLTIFYLL 280
>gi|195565141|ref|XP_002106164.1| GD16714 [Drosophila simulans]
gi|194203536|gb|EDX17112.1| GD16714 [Drosophila simulans]
Length = 318
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 56/147 (38%), Positives = 82/147 (55%), Gaps = 13/147 (8%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTL 93
D ++S + Q++RL SLD FRGL++ LMI V+ GG I H+ WNGL L
Sbjct: 164 DEATAAADSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHL 223
Query: 94 ADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILRALNLFLLGIFLQGGFFHGIN 149
AD V P FL+I+GV + L+ K+ K + ++R++ LF++G+ L
Sbjct: 224 ADIVFPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILVRSIKLFVIGLCL--------- 274
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVA 176
N G ++ Q+R MGVLQR +AYL A
Sbjct: 275 NSMSGPNLEQLRVMGVLQRFGVAYLGA 301
>gi|440747989|ref|ZP_20927244.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
gi|436483731|gb|ELP39771.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
Length = 382
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/351 (26%), Positives = 155/351 (44%), Gaps = 76/351 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q R ++LDV RGLT+ALMI+V+ D + + H+ W+G T+ D V P FLF+VG +
Sbjct: 11 MQERYLALDVLRGLTIALMIVVNTPGDWSNVFSPLLHADWHGFTITDLVFPTFLFVVGNA 70
Query: 109 LALTYKNFP--CKVVATRKAILRALNLFLLGIFLQGGFFHGINN--LKYGVDIAQIRWMG 164
++ + K + +K RA +FL+G L F N + +D + +R +G
Sbjct: 71 MSFSMKKMEKMSQGAFLKKVFKRAALIFLIGWGLNAFPFFETNETGVVSMIDWSAVRLLG 130
Query: 165 VLQRIAIAYLVAALCEIWLKGDGH-VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
VLQRIA+ YL+A+L ++ G + S LSL G+W V LY P
Sbjct: 131 VLQRIALCYLIASLVLYYIGKRGAIIFSILSLL----GYWGV-----------LYFFGNP 175
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
+ Y NA +D ++G ++LY
Sbjct: 176 EDPYSL------------------------EGNAALKLDLWLIGAKNLYTGE-------- 203
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
PFDPEG+LS+ + V + G G I +++ R +
Sbjct: 204 ------------------GIPFDPEGVLSTWPSVVNVIAGFLAGKFIQQIGNNK-RTVKA 244
Query: 344 IILSSCLIGLGLSLDF-VGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
++L+ LI +G S+ + + +NK +++ SY LT G ++L + ++
Sbjct: 245 LLLAG-LIAVGFSVIWELAFPINKKIWTSSYVLLTVGLDLIVLGFLVLIIE 294
>gi|423259248|ref|ZP_17240171.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
CL07T00C01]
gi|423263781|ref|ZP_17242784.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
CL07T12C05]
gi|387776828|gb|EIK38928.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
CL07T00C01]
gi|392706047|gb|EIY99170.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
CL07T12C05]
Length = 387
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 94/357 (26%), Positives = 158/357 (44%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +RL++LDV RG+T+A MI+V++ G + + H+ W GLT D V PFF+FI+G
Sbjct: 3 QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYVPLGHAAWIGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDIA---- 158
+S ++L NF A K + R + +F LGI F N+L G DI+
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121
Query: 159 ---------QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
IR +GV+QR+A+ Y A+ + +K KY + ++A++L
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMK------------HKYIPY-LIAILL 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+++L+ G G + N + ++DR +LG
Sbjct: 169 IGYFIILING----------------------------NGFEYNSSNILSIVDRTVLGEA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEGLLS++ + LIG G L
Sbjct: 201 HMYK---------------DNG-------------IDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ KD +++ ++ + L G L + G ++K ++S ++ +T G + LA
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY-GCPISKKIWSPTFAIVTCGLASSFLA 288
>gi|359460787|ref|ZP_09249350.1| hypothetical protein ACCM5_18818 [Acaryochloris sp. CCMEE 5410]
Length = 383
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 101/363 (27%), Positives = 154/363 (42%), Gaps = 87/363 (23%)
Query: 59 LDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKN 115
LDVFRG+ +A M+LV+ G + P + H+ W+G TLAD V PFFL ++G S+A +
Sbjct: 13 LDVFRGIAIAGMLLVNKSGLVKEAYPQLLHADWHGWTLADLVFPFFLVVLGASMAFSMAR 72
Query: 116 FPCKVVATRKAIL-----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ ++A+ R+ LF LG+FL G + + + +R MG+LQRI+
Sbjct: 73 HTASLTQPKRAVYLKILRRSAVLFGLGLFLNG---------FWSFNFSTLRVMGILQRIS 123
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ YL +A L L RK W + +L Y L L + VP++
Sbjct: 124 LTYLASAFV------------ILKLPRK--SQWGLTGLLLVGYWLALSFIPVPEF----- 164
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
P G+ N IDR I+G HLY ++
Sbjct: 165 ------GP----------GNLTRTGNFGAYIDRLIIGSSHLYVGDQFNSMG--------- 199
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI------VHFKDHRDRMLNWI 344
DPEGL S++ A T L+G G I + K R +
Sbjct: 200 ---------------DPEGLFSTLPAIATVLLGYFAGDWIRKRGSGLKIKTSRQSL---A 241
Query: 345 ILSSCLIGLGLSLDF-VGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKK 403
+ S L GL L + + +NK L++ SY T G + +LLA Y ++ + + K
Sbjct: 242 LASYGLFSTGLGLLWSIWFPINKKLWTSSYVLFTVGIALILLAVCYELIE-VRRIRLWSK 300
Query: 404 PFD 406
PF+
Sbjct: 301 PFE 303
>gi|423304873|ref|ZP_17282872.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
CL03T00C23]
gi|423310012|ref|ZP_17287996.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
CL03T12C37]
gi|392682836|gb|EIY76175.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
CL03T00C23]
gi|392683302|gb|EIY76639.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
CL03T12C37]
Length = 394
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 95/360 (26%), Positives = 162/360 (45%), Gaps = 97/360 (26%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V PFF+FI+G+S
Sbjct: 7 SKRILALDILRGVTIAGMIMVNNPGSWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIST 66
Query: 110 ALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NNLKYGVDI-- 157
++ K NF A K + R + +FL+G+ + G+F ++L +G +
Sbjct: 67 YISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI--GWFSRFCYYWASAPDDLSFGEKLWA 124
Query: 158 -----AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
+IR +GV+QR+A+ Y A++ + +K H+ +++A +LT
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMK-HKHIP------------YLIAGLLTGY 171
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
++LL+ CG G N + ++D IL H+Y
Sbjct: 172 FILLM---------------------------CG-NGFAYNETNILSVVDHAILTPAHMY 203
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
+ D G DPEGLLS++ A L+G G L++
Sbjct: 204 K---------------DNG-------------IDPEGLLSTIPAIAHVLLGFCVGRLMLD 235
Query: 333 FKDHRDR--MLNWIILSSCLIGLGLSLDFV----GMHLNKALYSLSYTCLTAGASGVLLA 386
DR LN +++ L+G+ L+ G +NK ++S +Y +T G + LA
Sbjct: 236 GNKSEDRASFLNSQLITLFLVGVILTFSGFLLSYGCPINKKIWSPTYVLVTCGLASSFLA 295
>gi|431798742|ref|YP_007225646.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
gi|430789507|gb|AGA79636.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
Length = 363
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 54/133 (40%), Positives = 77/133 (57%), Gaps = 14/133 (10%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGV 107
++ +RLISLDV RG+T+A MILV+ G + P ++H+ WNG+T DF+ PFFLFIVGV
Sbjct: 3 NKNKRLISLDVLRGMTIAAMILVNFPGSWEHVFPPLHHAQWNGITPTDFIFPFFLFIVGV 62
Query: 108 SLALTY--KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
S+ + Y K K + +K R +F LG+ L + D + IR GV
Sbjct: 63 SIVMAYAGKMEMDKTIVYKKLFFRGAKIFALGVLL---------GMIPEFDFSAIRVAGV 113
Query: 166 LQRIAIAYLVAAL 178
LQRIA+ ++ L
Sbjct: 114 LQRIALVFVACTL 126
>gi|325286182|ref|YP_004261972.1| heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
lytica DSM 7489]
gi|324321636|gb|ADY29101.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
lytica DSM 7489]
Length = 361
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 101/343 (29%), Positives = 153/343 (44%), Gaps = 75/343 (21%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ R++S+D+FRG+T+ LMILV++ G + H+ W+G T D V PFFLFIVG+S
Sbjct: 1 MKNRVVSVDIFRGITIVLMILVNNPGTWSSVYAPFLHADWHGYTPTDLVFPFFLFIVGIS 60
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQR 168
+ Y RK ++R+L L LG+FL G F K D IR+ GVLQR
Sbjct: 61 IVYAYHTKEVTGKTYRKIVIRSLKLIGLGLFL-GAFTLSFPFFK---DFNDIRFPGVLQR 116
Query: 169 IAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYE 228
I + + A+ I L ALV +L++Y L W
Sbjct: 117 IGLVFFFTAILFIKLNWK-------------------ALVAVCAAILIMYWL----WMGF 153
Query: 229 FPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSP 288
P+ ++ + P N ID K+LG H+++
Sbjct: 154 VPINGTAPT-----------FDRAP-NNWANFIDLKVLG-SHMWKT-------------- 186
Query: 289 DYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSS 348
DY DPEG+LS++ A T L+G+ G L+ + ++L + +S
Sbjct: 187 DY---------------DPEGVLSTLPAIATSLLGVFVGLLLKSAYKKKTQILLLLGVSL 231
Query: 349 CLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
G L F +NKAL+S S+ +TAG + ++LA IY+
Sbjct: 232 LTAGHIWDLFF---PINKALWSSSFVLVTAGWATIILAVIYYF 271
>gi|345880604|ref|ZP_08832150.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
gi|343922516|gb|EGV33216.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
Length = 383
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 96/360 (26%), Positives = 148/360 (41%), Gaps = 98/360 (27%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFIVGV 107
+ R+ ++D+ RG+T+A MILV++ GG + + H+ W GLT D V PFF+FI+G+
Sbjct: 3 TQKTSRIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWFGLTPTDLVFPFFMFIMGI 62
Query: 108 SLALTYKNF------PCKVVATRKAILRALNLFLLGIFLQG--GFFHGINNLKYGV---- 155
+ L+ + + PC +K I R + L+++GI + F G+ N Y
Sbjct: 63 TTYLSLRKYDFEWSWPC----AKKIIKRGMLLYVIGIAISWLMMFCRGLFNEDYAALPFF 118
Query: 156 --------DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG-HWVVA 206
IR +GV R+A Y+ A S ++L K+R W++A
Sbjct: 119 SHVFAAANVFDHIRLVGVFPRLAFCYVFA--------------SVVALSVKHRFIPWLIA 164
Query: 207 LVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
V F V C G A N +ID IL
Sbjct: 165 AVFIGY----------------------------FAVLCLGNGFAHDASNICNVIDEAIL 196
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G QHLY+ D P DPEGLLSS+ A LIG
Sbjct: 197 GRQHLYK---------------------WDIP-------DPEGLLSSLPALGHVLIGFCV 228
Query: 327 GHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
G +++ D++ I + L LG L + G ++K L++ ++ +T G + LA
Sbjct: 229 GRVVMSATSLNDKIEKLFIYGAVLTILGFLLSY-GCPISKKLWTPTFALVTCGLASTTLA 287
>gi|410631381|ref|ZP_11342056.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
BSs20135]
gi|410148827|dbj|GAC18923.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
BSs20135]
Length = 330
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 59/160 (36%), Positives = 88/160 (55%), Gaps = 11/160 (6%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVM 98
S+ + Q RL++LDVFRG+T+ MILV++ G I + H+ W+G TL D +
Sbjct: 9 SHCQAILGQQPGNRLLALDVFRGITITAMILVNNPGSWQHIYGPMRHAQWHGWTLTDLIF 68
Query: 99 PFFLFIVGVSLALTYKNF----PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG 154
PFF+FIVGVS+ L+ + + ++A+LR L LLG FL F++ Y
Sbjct: 69 PFFIFIVGVSIQLSGQRMLASGTSRSSIIKQALLRTFKLVLLGWFL-ALFYYDFGAEHYN 127
Query: 155 ---VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSS 191
+ IR+MGVLQRIA+ Y + L ++L G + S
Sbjct: 128 WVEQRLLNIRFMGVLQRIAVVYFICVLMWLFLSKRGLLVS 167
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 30/97 (30%), Positives = 57/97 (58%), Gaps = 2/97 (2%)
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFK-DHRDRMLNWIILSSCLIGLGLSLDFVGMH 363
FDPEG+LS++ A + + G+ G+L+ + R++ + ++L S + LG L
Sbjct: 226 FDPEGILSTLPAVASGISGMLVGYLLTQSSLNTRNKTIVLLVLGSIGVLLG-ELWHGYFP 284
Query: 364 LNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLM 400
+NKAL++ SY LT+ + ++LA + F++ +HL+
Sbjct: 285 INKALWTSSYVLLTSAYACLVLASLIFILDSKKNHLL 321
>gi|325103749|ref|YP_004273403.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324972597|gb|ADY51581.1| hypothetical protein Pedsa_1010 [Pedobacter saltans DSM 12145]
Length = 374
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/333 (25%), Positives = 145/333 (43%), Gaps = 75/333 (22%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R +SLDV RG TVA MI+V+ G + + H+PW+G T+ D V P FLF+VG ++
Sbjct: 9 KPRYLSLDVLRGATVAFMIIVNTPGSWSYVYAPLKHAPWHGFTVTDLVFPTFLFVVGNAM 68
Query: 110 ALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQ-GGFFHGINNLKYGVDIAQIRWMGVL 166
+ + + +K R L +FL+G+FL F ++++ D +IR GVL
Sbjct: 69 SFGMGKLKEQGNSAFLKKVFSRTLKIFLIGLFLNMFPFVKWVDDVLVMKDFTEIRIWGVL 128
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRIA+ Y +A+L + + L+++ LL +G V W
Sbjct: 129 QRIAVCYCIASLLVYYFRSK------------------TILLISAAMLLGYWG--VMSWF 168
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
++ +E NA +D ++ ++LY+
Sbjct: 169 GDYTLE----------------------GNAATKLDNFLMNAKNLYK------------- 193
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
YG PFDPEG+LS++ A V ++G G I +++ N +I
Sbjct: 194 --GYG-----------IPFDPEGVLSTIPAVVNVILGYFAGLFIQKKGNNKSTAFN-LIG 239
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ ++ L S + +NK +++ SY T G
Sbjct: 240 TGVILLLAASAWNLVFPINKPIWTSSYVLYTVG 272
>gi|383643230|ref|ZP_09955636.1| hypothetical protein SeloA3_10334 [Sphingomonas elodea ATCC 31461]
Length = 382
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 101/365 (27%), Positives = 152/365 (41%), Gaps = 84/365 (23%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADF 96
+ + + + R ++LDVFRG T+ LMILV+ G P + H+ W G TLAD
Sbjct: 2 TETQANTVAALAGKPRFLALDVFRGATIFLMILVNTSGPGAEPYPQLVHAKWIGFTLADL 61
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLG--IFLQGGFFHGINNLKYG 154
V P FLF +G +++ ++ K VAT + R LF G IF+ G + ++ G
Sbjct: 62 VFPTFLFAMGNAMSFAFR----KPVATGPFLAR---LFRRGAIIFVLGYLMYWFPFVEQG 114
Query: 155 VD------IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALV 208
D A R GVLQR+A+ Y++A L WLK + + +++ G+W + LV
Sbjct: 115 PDGWALKPFALTRVPGVLQRLALCYVLAGLMIRWLKPRQLLLAGIAML---LGYWTILLV 171
Query: 209 LTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGI 268
+ P + N ID +LG
Sbjct: 172 FS-------------------PAGMAFDK----------------YANIGTQIDLWLLGP 196
Query: 269 QHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGH 328
HLY+K A FDPEGLL ++ ATV + G G
Sbjct: 197 GHLYKK---------------------------DAGFDPEGLLGTLPATVNVIAGYLAGL 229
Query: 329 LIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGI 388
IV D R + ++ + L+ GL+ + K L++ SY LT G VLLAG+
Sbjct: 230 AIVQGGDLRRTVGRMALVGAALVLAGLAWS-PWFPIAKKLWTGSYVLLTVGIDLVLLAGV 288
Query: 389 YFMVR 393
++
Sbjct: 289 IGLIE 293
>gi|410721825|ref|ZP_11361152.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
MBC34]
gi|410598366|gb|EKQ52947.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
MBC34]
Length = 372
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 103/351 (29%), Positives = 155/351 (44%), Gaps = 93/351 (26%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILP----AINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLDVFRG T+A MI+V+ + G+ P + H+ W GL LAD V PFFLFIVGVS
Sbjct: 5 KDRLVSLDVFRGFTIAGMIMVN-ILGLYPDTPSLLQHASWIGLNLADLVFPFFLFIVGVS 63
Query: 109 LALTYKNFPCKVVATR--KAILRALNLFLLGIFLQGGFFHGINNLKYGV-DIAQIRWMGV 165
+ ++ + + + K + R L+L+G+ L G F YGV D + IR G+
Sbjct: 64 MNFSFASRSKQPSWKKWGKFLFRVAALYLIGVALVFGLFF------YGVPDFSTIRIPGI 117
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQ IA++ L AA + R W++ L++ L+V
Sbjct: 118 LQLIALSSLFAAPLA-----------------RLRTRWIILAASVILFVQAAILLWV--- 157
Query: 226 QYEFPVETSSSSPWIFNVTCGV-RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
S+P GV GS + N G ID ++ HL K
Sbjct: 158 ----------SAP-------GVPAGSLEMSNNIAGWIDSQVFTPAHLLDK---------- 190
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ FDPEG+++ + T LIGL G + HR NW
Sbjct: 191 -----------------EHVFDPEGMMAVINGTAMVLIGLACGR---TLRLHR----NWK 226
Query: 345 ILSSCLIG------LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
+ +IG +GL + V M + K L++ S+ + AG + ++LA +Y
Sbjct: 227 GVQYLIIGGLIALTIGLIISPV-MPIIKQLWTSSFILVNAGLAAIILALLY 276
>gi|374374997|ref|ZP_09632655.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
gi|373231837|gb|EHP51632.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
Length = 395
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 108/380 (28%), Positives = 153/380 (40%), Gaps = 94/380 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R SLDVFRG TV LMILV++ G I ++H+PW+GLT D V PFFLF VG +++
Sbjct: 4 RYRSLDVFRGATVCLMILVNNPGSWAHIYAPLDHAPWHGLTPTDLVFPFFLFAVGNAMSF 63
Query: 112 TYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINN-----LKYGVDIAQ---IR 161
A +K R L +F +GIFL F N + + D A+ IR
Sbjct: 64 VIPRLQEAGPAEFWKKITKRTLIIFGIGIFLNWSPFVRWNGDTLQAVTWVTDPAKNIGIR 123
Query: 162 WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLY 221
GVLQRIA Y A++ +LK A L+ + LL +GL
Sbjct: 124 IFGVLQRIAFCYFFASIIVYYLKPK------------------TAYFLSLVLLLAYWGLC 165
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
+ P + S W G+ ID+ IL I H+Y+
Sbjct: 166 I----LGNPADPYSLKGWF--------GTN---------IDKAILHIPHMYKGE------ 198
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH--------F 333
PFDPEG SS+ A V + G G I +
Sbjct: 199 --------------------GVPFDPEGFASSLGAIVQIVFGYFVGMYIKNSSAQIPKDL 238
Query: 334 KDHRD------RMLNWIILSSC-LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
D +D +ML + ++ L+ G D V +NK +++ SYT T G + + L
Sbjct: 239 TDKQDPRNPMFKMLTVLFVAGVGLLVTGFCWDMV-FPINKKIWTSSYTVYTTGLAIITLC 297
Query: 387 GIYFMVRYISSHLMLKKPFD 406
+ F + S L F+
Sbjct: 298 VMIFFIEIKGSKSFLATFFE 317
>gi|409199197|ref|ZP_11227860.1| hypothetical protein MsalJ2_19286 [Marinilabilia salmonicolor JCM
21150]
Length = 369
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 85/286 (29%), Positives = 125/286 (43%), Gaps = 77/286 (26%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +R ++LDV RG+T+ALMI V+ G I + H+ W+G T D V PFFLF+ G
Sbjct: 1 MKQSQRYLALDVLRGMTIALMITVNTPGSWQYIYAPLRHASWHGCTPTDLVFPFFLFVAG 60
Query: 107 VSLALTYKNFPCKV--VATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
VS+ ++ + + + ++ R L +F++G+FL + ++ D + +R MG
Sbjct: 61 VSMFFSFGKYGGALNSESLKRLGRRTLLIFVIGLFLN-------SFPQWSHDFSTLRIMG 113
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQRIA+AY + +L LS RKY ++L +L +G
Sbjct: 114 VLQRIALAYGIGSLIV------------LSAPRKYIPFIGGGILLIYWGILAWFG----- 156
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
+ P+ NAV D+ ILG QHLY
Sbjct: 157 ----------GAEPYSLE------------GNAVIPFDKAILGEQHLYTG---------- 184
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
PFDPEGLLS+V A VT L+G G +I
Sbjct: 185 ----------------FGIPFDPEGLLSTVPAIVTVLLGYLTGVII 214
>gi|295690502|ref|YP_003594195.1| hypothetical protein Cseg_3137 [Caulobacter segnis ATCC 21756]
gi|295432405|gb|ADG11577.1| Protein of unknown function DUF2261, transmembrane [Caulobacter
segnis ATCC 21756]
Length = 372
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 102/338 (30%), Positives = 136/338 (40%), Gaps = 70/338 (20%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R +SLDVFRGLTV LMI+V+ G A + H+PW G T AD V P FLF VG S+A
Sbjct: 7 RFLSLDVFRGLTVCLMIVVNTAGPGAKAYTQLVHAPWFGFTAADAVFPSFLFAVGCSMAF 66
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ K + RA +FLLG + F I+ + A R MGVLQRIA
Sbjct: 67 AFSRPIPTNEFLAKVLRRAALIFLLGFLMYWFPFVKKIDGHWALIPFADTRVMGVLQRIA 126
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ Y++AA WL V+ L G+W + + L P
Sbjct: 127 LCYMLAAFAVRWLSPRLIVALSAVLL---LGYWAILMTLGD------------------P 165
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
S NA +D ++G HLYRK D
Sbjct: 166 AAPLSK-----------------LGNAGTHLDLFLIGQNHLYRK--------------DG 194
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
G FDPEGLL ++ +TV L G + + M I L
Sbjct: 195 G-------------FDPEGLLGTLPSTVNVLAGYLAARFLKENPGSQSAMARMAIAGVVL 241
Query: 351 IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGI 388
I GL+ + + K L++ S+ LT G VLLA +
Sbjct: 242 ILAGLAWSPL-FPIAKKLWTGSFVLLTVGIDLVLLAAL 278
>gi|423282312|ref|ZP_17261197.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
615]
gi|404581880|gb|EKA86575.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
615]
Length = 387
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 157/357 (43%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +RL++LDV RG+T+A MI+V++ G + + H+ W GLT D V PFF+FI+G
Sbjct: 3 QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDIA---- 158
+S ++L NF A K + R + +F LGI F N+L DI+
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-SEDISFFSR 121
Query: 159 ---------QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
IR +GV+QR+A+ Y A+ + +K KY + ++A++L
Sbjct: 122 LYESIWTFGHIRILGVMQRLALCYGATAIIALIMK------------HKYIPY-LIAILL 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+++L+ G G + N + ++DR +LG
Sbjct: 169 IGYFIILING----------------------------NGFEYNSSNILSIVDRTVLGEA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEGLLS++ + LIG G L
Sbjct: 201 HMYK---------------DNG-------------IDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ KD +++ ++ + L G L + G ++K ++S ++ +T G + LA
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY-GCPISKKIWSPTFAIVTCGLASSFLA 288
>gi|218131911|ref|ZP_03460715.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
gi|217986214|gb|EEC52553.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
Length = 396
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 165/373 (44%), Gaps = 105/373 (28%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVM 98
SN+ +T +R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V
Sbjct: 2 SNAVKT-----NKRILALDILRGVTIAGMIMVNNPGTWAHIYAPLRHAEWNGLTPTDLVF 56
Query: 99 PFFLFIVGVSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI-------- 148
PFF+FI+G+S ++ K NF A K + R + +FL+G+ + G+F
Sbjct: 57 PFFMFIMGISTYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI--GWFSKFCYYWTSPT 114
Query: 149 NNLKYGVDIAQ-------IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG 201
+ +G + + IR +GV+QR+A+ Y A+ ++L K++
Sbjct: 115 EGISFGTQLWESVWTFDRIRILGVMQRLALCYGATAI--------------IALTMKHKN 160
Query: 202 -HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGM 260
+++A +LT ++LLL CG G N + +
Sbjct: 161 IPYLIATLLTGYFILLL---------------------------CG-NGFAYNDTNILSI 192
Query: 261 IDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTC 320
+DR IL H+Y+ D G DPEGLLS++ A
Sbjct: 193 VDRTILTPAHMYK---------------DNG-------------IDPEGLLSTIPAIAHV 224
Query: 321 LIGLHFGHLIVH---FKDHRDRMLNWIILSSCLIGLGLSLDFV----GMHLNKALYSLSY 373
L+G G +++ + R+ MLN ++ L+G L+ G +NK ++S ++
Sbjct: 225 LLGFCVGRMMLEGGKANEDRESMLNSHLIKLLLVGTILTFSGFLLSYGCPINKKIWSPTF 284
Query: 374 TCLTAGASGVLLA 386
T G + LA
Sbjct: 285 VLTTCGLASSFLA 297
>gi|428210738|ref|YP_007083882.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
6304]
gi|427999119|gb|AFY79962.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
6304]
Length = 398
Score = 91.3 bits (225), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 108/350 (30%), Positives = 156/350 (44%), Gaps = 74/350 (21%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLDVFRG+ +A MILV++ G + P + H+PW+G T D + P FLFI GV++A
Sbjct: 20 RLTSLDVFRGMAIASMILVNNPGSWQQVYPPLLHAPWHGFTPTDLIFPAFLFISGVAMAF 79
Query: 112 TYK------NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG--VDIAQIRWM 163
++ N P K + RAL LF LG+FL G + L G +D +R M
Sbjct: 80 SFAKYTNSPNSPPAASVYFKILRRALILFGLGLFLNGSTLV-LKTLLQGQPLDFGTLRIM 138
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
GVLQRI++AYL A ++ L+L R+ G +A++L Y L + VP
Sbjct: 139 GVLQRISLAYLFGA------------TAILNLSRRRLGFLCLAILLG--YWFALTQIPVP 184
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
+ P + S+ A V +DR IL H+
Sbjct: 185 GYG---PGDLSAKG----------------AGTLVAYLDRLILTPPHIL----------- 214
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
F+PEGLLS++ + VT L+G G + L
Sbjct: 215 -----------------GDGSFEPEGLLSTLPSVVTLLLGFFIGDWLQKQPVTSRTSLQM 257
Query: 344 IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
++ I G SL + +NK L++ SY L+AG S +LLA Y +V
Sbjct: 258 AGVAVVTIVTG-SLWGLVFPINKQLWTSSYVVLSAGWSLLLLAACYELVE 306
>gi|375356811|ref|YP_005109583.1| putative transmembrane protein [Bacteroides fragilis 638R]
gi|383116724|ref|ZP_09937472.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
gi|251947990|gb|EES88272.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
gi|301161492|emb|CBW21032.1| putative transmembrane protein [Bacteroides fragilis 638R]
Length = 387
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 93/357 (26%), Positives = 157/357 (43%), Gaps = 91/357 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +RL++LDV RG+T+A MI+V++ G + + H+ W GLT D V PFF+FI+G
Sbjct: 3 QTVNKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMG 62
Query: 107 VS--LALTYKNFPCKVVATRKAILRALNLFL--LGIFLQGGFFHGINNLKYGVDIA---- 158
+S ++L NF A K + R + +F LGI F N+L G DI+
Sbjct: 63 ISTYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSR 121
Query: 159 ---------QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
IR +GV+QR+A+ Y A+ + +K KY + ++A++L
Sbjct: 122 LYESVWTFGHIRILGVMQRLALCYGATAIIALIMK------------HKYIPY-LIAILL 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+++L+ G G + N + ++D +LG
Sbjct: 169 IGYFIILING----------------------------NGFEYNSSNILSIVDHTVLGEA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y+ D G DPEGLLS++ + LIG G L
Sbjct: 201 HMYK---------------DNG-------------IDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ KD +++ ++ + L G L + G ++K ++S ++ +T G + LA
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY-GCPISKKIWSPTFAIVTCGLASSFLA 288
>gi|317474486|ref|ZP_07933760.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
1_2_48FAA]
gi|316909167|gb|EFV30847.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
1_2_48FAA]
Length = 396
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 165/373 (44%), Gaps = 105/373 (28%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVM 98
SN+ +T +R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V
Sbjct: 2 SNAVKT-----NKRILALDILRGVTIAGMIMVNNPGTWAHIYAPLRHAEWNGLTPTDLVF 56
Query: 99 PFFLFIVGVSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI-------- 148
PFF+FI+G+S ++ K NF A K + R + +FL+G+ + G+F
Sbjct: 57 PFFMFIMGISTYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI--GWFSKFCYYWTSPT 114
Query: 149 NNLKYGVDIAQ-------IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG 201
+ +G + + IR +GV+QR+A+ Y A+ ++L K++
Sbjct: 115 EGISFGTQLWESVWTFDRIRILGVMQRLALCYGATAI--------------IALTMKHKN 160
Query: 202 -HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGM 260
+++A +LT ++LLL CG G N + +
Sbjct: 161 IPYLIATLLTGYFILLL---------------------------CG-NGFAYNDTNILSI 192
Query: 261 IDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTC 320
+DR IL H+Y+ D G DPEGLLS++ A
Sbjct: 193 VDRTILTPAHMYK---------------DNG-------------IDPEGLLSTIPAIAHV 224
Query: 321 LIGLHFGHLIVH---FKDHRDRMLNWIILSSCLIGLGLSLDFV----GMHLNKALYSLSY 373
L+G G +++ + R+ MLN ++ L+G L+ G +NK ++S ++
Sbjct: 225 LLGFCVGRMMLEGGKANEDRESMLNSHLIKLFLVGTILTFSGFLLSYGCPINKKIWSPTF 284
Query: 374 TCLTAGASGVLLA 386
T G + LA
Sbjct: 285 VLTTCGLASSFLA 297
>gi|332983392|ref|YP_004464833.1| heparan-alpha-glucosaminide N-acetyltransferase [Mahella
australiensis 50-1 BON]
gi|332701070|gb|AEE98011.1| Heparan-alpha-glucosaminide N-acetyltransferase [Mahella
australiensis 50-1 BON]
Length = 368
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 103/360 (28%), Positives = 166/360 (46%), Gaps = 83/360 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++R+ S+D RG+ + MI +++ G P + H+PWNG+TLAD P F+F++G+ +
Sbjct: 3 KKRIQSIDALRGICITAMIFMNNPGNSKYTSPLLLHAPWNGITLADLFFPCFIFVMGMVI 62
Query: 110 ALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
+++ K + I R+ LFL+G+FL N D+ +R +GV
Sbjct: 63 PVSFGKRMAKGQTKGQLIAHLLKRSAMLFLIGLFL---------NAFPCFDMQHVRILGV 113
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQRIA+ Y + L ++ SS +S+F + A +L YLLL +VP
Sbjct: 114 LQRIALVYFFSGLIFLF-------SSTMSMFI------ISAAILIGYYLLL---RFVPVP 157
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
Y GV TG N + ID K+L HLY
Sbjct: 158 GYG----------------AGVFERTG---NLIQYIDLKLLK-GHLY------------- 184
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
+PD+ DPEGLLS++ A + L+G+ G L+V K + ++ L ++
Sbjct: 185 -TPDW---------------DPEGLLSTLPAIASSLLGILTGCLLVSDKKNTNK-LYIML 227
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
+ S L + + LNK L+S S+ T G + +LL+ Y++ I++ L KPF
Sbjct: 228 VCSALAFISSIITQKWFPLNKNLWSSSFVLFTTGFALLLLSVCYWLAD-INNLATLIKPF 286
>gi|452822119|gb|EME29141.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 1
[Galdieria sulphuraria]
Length = 356
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 91/328 (27%), Positives = 130/328 (39%), Gaps = 88/328 (26%)
Query: 70 MILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVV----- 121
MIL D G + H W +AD + PFFLF+VG S+ ++ P +V
Sbjct: 1 MILGKDQGNFDKMYSVFKHESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEK 60
Query: 122 --ATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
A R R + LFL+G+ L N G +RWMG+LQRIAI Y A
Sbjct: 61 DKALRSVTSRTIKLFLVGVLL--------NVPLSGFRWETLRWMGILQRIAICYGCVAF- 111
Query: 180 EIWLKGDGHVSSKLSLFRKYRG-HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSP 238
L LF R + + VL L+ LLYGL VP+
Sbjct: 112 -------------LFLFVNSRVIQYALVSVLFLLHTSLLYGLIVPN-------------- 144
Query: 239 WIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAP 298
C + AC+A +D ILG +HLY
Sbjct: 145 ------CLISERLTRACSAQSYLDTMILGGKHLYF------------------------- 173
Query: 299 SWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL- 357
+DPEG+LS++MAT+ GL L + R+L ++ S +G+ + L
Sbjct: 174 ---HLEYDPEGILSTLMATINTFAGLEAARLTSSLRYVNQRILWCFLIGSSFVGIEILLV 230
Query: 358 ----DFVGMHLNKALYSLSYTCLTAGAS 381
D + ++K L++ S+ LT G S
Sbjct: 231 DCFPD--SVPISKPLWTASFLFLTVGCS 256
>gi|329956032|ref|ZP_08296803.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
12056]
gi|328524791|gb|EGF51845.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
12056]
Length = 396
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 94/369 (25%), Positives = 160/369 (43%), Gaps = 100/369 (27%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ P +R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V PFF+
Sbjct: 1 MSNPVKTNKRILALDILRGVTIAGMIMVNNPGTWAHIYAPLRHAEWNGLTPTDLVFPFFM 60
Query: 103 FIVGVSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NNLK 152
FI+G+S ++ K NF K + R + +FL+G+ + G+F +
Sbjct: 61 FIMGISTYISLKKYNFEFSRAVGMKILKRTILIFLIGMAI--GWFSKFCYYWTSPTEGIS 118
Query: 153 YGVDIAQ-------IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG-HWV 204
+G + + IR +GV+QR+A+ Y A+ ++L K+R ++
Sbjct: 119 FGAQLWESVWTFDRIRILGVMQRLALCYGATAI--------------IALTVKHRNIPYL 164
Query: 205 VALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRK 264
+A +LT ++LL+ CG G N + +IDR
Sbjct: 165 IATLLTGYFILLV---------------------------CG-NGFAYNDTNILSVIDRT 196
Query: 265 ILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGL 324
IL H+Y+ D G DPEGLLS++ + L+G
Sbjct: 197 ILTPAHMYK---------------DNG-------------IDPEGLLSTIPSIAHVLLGF 228
Query: 325 HFGHLIVH---FKDHRDRMLNWIILSSCLIGLGLSLDFV----GMHLNKALYSLSYTCLT 377
G +++ + R+ MLN ++ L G L+ G +NK ++S ++ T
Sbjct: 229 CVGRMMLENGKANEDRESMLNSHLIKLFLAGTILTFSGFLLSYGCPINKKIWSPTFVLTT 288
Query: 378 AGASGVLLA 386
G + LA
Sbjct: 289 CGLASSFLA 297
>gi|390344818|ref|XP_795043.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Strongylocentrotus purpuratus]
Length = 680
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 90/377 (23%), Positives = 157/377 (41%), Gaps = 58/377 (15%)
Query: 32 ERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGL 91
++ PL P++ K + RL S+D FRGL + ++L G NH+ W G+
Sbjct: 273 DQTPLTFPASDKPKSSL------RLRSVDTFRGLAITHLVLGASGDGHFWYSNHARWYGI 326
Query: 92 TLADFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHG 147
T+ADF+ P+F+FI+G S+ L++ K ++ +K + R+++LF++G+ +Q H
Sbjct: 327 TVADFMFPWFVFIMGTSIHLSFNILLSKGLSYCAIFKKIVFRSISLFIMGVCIQS---HN 383
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALC-----EIWLKGDGHVSSKLSLFRKYRGH 202
D+ +R GVLQR I Y + A + + +FR +
Sbjct: 384 --------DLRNLRIPGVLQRFGITYFIVASSYLLSRRLQARRAEKTGKCYMMFRDITDY 435
Query: 203 WVVALVLTTL--YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGM 260
+ L L +L L + L VP G +G GP G
Sbjct: 436 LELPLAACCLVVHLCLTFLLPVP------------------GCPLGYQGPGGPLVGENGE 477
Query: 261 IDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTC 320
+ G + ++ +N+ D + DPEG+L + + C
Sbjct: 478 LTNCTGGASGYIDRTFFTEAHLILVNTCDDVYRTI-------VRSDPEGILGTFTSIALC 530
Query: 321 LIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMH-----LNKALYSLSYTC 375
+ GL G ++ F R R++ ++ LI L M LNK L+S+S+
Sbjct: 531 VFGLQSGKILHLFTTVRGRLVRLLLWGLALISCSAVLCKCSMADGWIPLNKNLWSVSFIA 590
Query: 376 LTAGASGVLLAGIYFMV 392
LT G + ++ A + ++
Sbjct: 591 LTGGTAFIVQALFHVLI 607
>gi|329851960|ref|ZP_08266641.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
gi|328839809|gb|EGF89382.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
Length = 369
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 97/339 (28%), Positives = 136/339 (40%), Gaps = 70/339 (20%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG-GILP--AINHSPWNGLTLADFVMPFFLFIVGVSLA 110
+R SLDVFRGLTVA MI+V+ G G P ++H+ W GLTLAD V P FLF VG +++
Sbjct: 4 QRFTSLDVFRGLTVAFMIVVNTSGPGAAPFAQLSHATWFGLTLADLVFPAFLFAVGNAMS 63
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYGVDIAQIRWMGVLQRI 169
K + RA LFLLG + F H + + R GVLQRI
Sbjct: 64 FGDPKSGPTGRYLGKVVKRAAILFLLGYLMYWFPFVHATADGWALNPVEHTRIPGVLQRI 123
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
A+ +L AA+ WL + L Y G L+++G P Q
Sbjct: 124 ALCFLAAAIAVRWLDVPKLIGLSAVLLLGYWGA------------LMVFG--PPGEQLT- 168
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
P N +IDR + GI H+Y K
Sbjct: 169 -----------------------PLGNIGALIDRAVFGINHMYAK--------------- 190
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
+DPEGL S++ A V L G G I D R ++ + +
Sbjct: 191 ------------GKGYDPEGLFSTLPAIVNVLAGYLAGRYIRSQPDLRTVVIRLAVAAGL 238
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGI 388
++ L+ L+K L++ S+ + G LLAG+
Sbjct: 239 IVAAALAWSLT-FPLSKRLWTSSFALINIGIDLGLLAGL 276
>gi|16124796|ref|NP_419360.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
gi|221233512|ref|YP_002515948.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
gi|13421730|gb|AAK22528.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
gi|220962684|gb|ACL94040.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
Length = 372
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 105/344 (30%), Positives = 140/344 (40%), Gaps = 72/344 (20%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAIN---HSPWNGLTLADFVMPFFLFIVG 106
R +SLDVFRGLTV LMI+V+ G A + H+PW G T AD V P FLF VG
Sbjct: 2 SQPAARFLSLDVFRGLTVFLMIVVNTAGPGAKAYSQLVHAPWFGFTAADAVFPSFLFAVG 61
Query: 107 VSLALTY-KNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYGVDIAQIRWMG 164
S+A + K P T K + RA +FLLG + F ++ + + R MG
Sbjct: 62 CSMAFAFSKPIPLNDF-TVKVLRRAALIFLLGFLMYWFPFVRKVDGDWALIPFSDTRVMG 120
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQRIA+ YL+AA WL S +L + L +LL G +
Sbjct: 121 VLQRIALCYLLAAFAVRWL------SPRL---------------IVALCAVLLLGYWAIL 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
+ P S NA +D ++G HLYRK
Sbjct: 160 MAFGDPAAPLSK-----------------LGNAGTRLDLLLIGQNHLYRK---------- 192
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
D G FDPEGLL ++ +TV L G + M
Sbjct: 193 ----DGG-------------FDPEGLLGTLPSTVNVLAGYLAARFLKENPGSSQAMGRMA 235
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGI 388
I LI GL + + K L++ S+ LT G +LLAG+
Sbjct: 236 IAGLVLILAGLVWSPL-FPIAKKLWTSSFVLLTVGIDLILLAGL 278
>gi|315500593|ref|YP_004089395.1| hypothetical protein Astex_3616 [Asticcacaulis excentricus CB 48]
gi|315418605|gb|ADU15244.1| protein of unknown function DUF1624 [Asticcacaulis excentricus CB
48]
Length = 372
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 103/349 (29%), Positives = 147/349 (42%), Gaps = 90/349 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDV-GGILP--AINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R +LDVFRGLTV +MI+V+ G P + H+ W G TL D V P FLF +G S+
Sbjct: 5 RYTALDVFRGLTVCVMIVVNTSPAGAEPFAQLQHAQWFGFTLTDLVFPSFLFAIGNSMVF 64
Query: 112 TYKN-FPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYGV-DIAQIRWMGVLQR 168
++ P K K + R+ +FLLG + F H + + DI R MGVLQR
Sbjct: 65 AFRKPLPHKEFLL-KVLRRSALIFLLGYLMYWFPFVHQTTDGAWAFNDIGHTRIMGVLQR 123
Query: 169 IAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYE 228
IA+ YL A+L +L G L L LLL+G +
Sbjct: 124 IALCYLFASLAARYLSVRG---------------------LVILSALLLFGYW------- 155
Query: 229 FPVETSSSSPWIFNVTCGVRGSTGPACNAVGM-------IDRKILGIQHLYRKPIYSRTK 281
G+ + PA +A+ M ID+ +LG+ H+YR
Sbjct: 156 -----------------GLLYAFTPAADALTMTGNLGAKIDQFVLGLDHMYRGG------ 192
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML 341
++PEGLLS++ A V L G G LI+ D DR
Sbjct: 193 --------------------AKGYEPEGLLSTLPAIVNVLAGYLCGRLIL---DSEDRRR 229
Query: 342 NWIILSSCLIGLGLSLDF--VGMHLNKALYSLSYTCLTAGASGVLLAGI 388
+ILS + L + G +K L++ S+ LT G ++L+GI
Sbjct: 230 TVMILSGAGLALVAAALVWSFGFPFSKRLWTSSFAVLTIGLDCLILSGI 278
>gi|189464405|ref|ZP_03013190.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
17393]
gi|189438195|gb|EDV07180.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
17393]
Length = 395
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 168/369 (45%), Gaps = 100/369 (27%)
Query: 44 SKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPF 100
+ QT+ +R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V PF
Sbjct: 2 NSQTK---TNKRILALDILRGVTIAGMIMVNNPGTWGHIYAPLRHAEWNGLTPTDLVFPF 58
Query: 101 FLFIVGVSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NN 150
F+FI+G+S ++ K NF A K + R + +FL+G L G+F
Sbjct: 59 FMFIMGISTYISLKKYNFEFSHAAAMKILKRTIIIFLIG--LAIGWFSRFCYYWAGSHEG 116
Query: 151 LKYGVDI-------AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHW 203
L +G + +IR +GV+QR+A+ Y A+ + +K H+ +
Sbjct: 117 LSFGEQLWASVWTFDRIRILGVMQRLALCYGATAIIALTMK-HRHIP------------Y 163
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
++A +L ++LL+ G + +++N T N + ++DR
Sbjct: 164 LIATLLVGYFILLMCG-----------------NGFVYNET-----------NILSIVDR 195
Query: 264 KILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
IL H+Y+ D G DPEGLLS++ + L+G
Sbjct: 196 AILTPAHMYK---------------DNG-------------IDPEGLLSTIPSIAHVLLG 227
Query: 324 LHFGHLIV--HFKDHRDRMLNWIILSSCLIGLGLS----LDFVGMHLNKALYSLSYTCLT 377
G +++ + ++R+ +LN ++ LIG L+ L G +NK ++S ++ T
Sbjct: 228 FCVGRMMLDSNKTENRETLLNSHLIKLFLIGTILTFAGFLLSYGCPINKKIWSPTFVLTT 287
Query: 378 AGASGVLLA 386
G + LA
Sbjct: 288 CGLASSFLA 296
>gi|296222155|ref|XP_002757067.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
[Callithrix jacchus]
Length = 516
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 93/332 (28%), Positives = 139/332 (41%), Gaps = 64/332 (19%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQT-------RPQHQQRRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ RP RL S+D FRG+ + LM+
Sbjct: 224 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPAVWRPSALPLRLRSVDTFRGIALILMV 283
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 284 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSVFLSMTSILQRGCSKFRLLGKIA 343
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 344 WRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 395
Query: 185 G--DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
H +S+ S W++ L L L+L L + L VP T
Sbjct: 396 KPVPEHCTSERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVPG------CPTGYLG 449
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 450 P-------GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEV-------- 494
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLH 325
+DPEG+L ++ + + +G+
Sbjct: 495 ----------AYDPEGILGTINSILMAFLGVQ 516
>gi|397505551|ref|XP_003823320.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
2 [Pan paniscus]
Length = 622
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 92/329 (27%), Positives = 138/329 (41%), Gaps = 58/329 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISL-------DVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ Q RL +L D FRG+ + LM+
Sbjct: 146 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMV 205
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 206 FVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 265
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG-- 185
R+ L +GI + N + ++R GVLQR+ + Y V A+ E+
Sbjct: 266 WRSFLLICIGIIIVNP-----NYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPV 320
Query: 186 DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWI 240
H +S+ S W++ LVL L+L L + L VP T P
Sbjct: 321 PEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPG------CPTGYLGP-- 372
Query: 241 FNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLD 296
G G G P C A G IDR +LG HLY+ P + +
Sbjct: 373 -----GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEV----------- 416
Query: 297 APSWCQAPFDPEGLLSSVMATVTCLIGLH 325
+DPEG+L ++ + V +G+
Sbjct: 417 -------AYDPEGILGTINSIVMAFLGVQ 438
>gi|452822118|gb|EME29140.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 2
[Galdieria sulphuraria]
Length = 351
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 86/311 (27%), Positives = 124/311 (39%), Gaps = 85/311 (27%)
Query: 84 NHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVV-------ATRKAILRALNLFLL 136
H W +AD + PFFLF+VG S+ ++ P +V A R R + LFL+
Sbjct: 13 KHESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEKDKALRSVTSRTIKLFLV 72
Query: 137 GIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLF 196
G+ L N G +RWMG+LQRIAI Y A L LF
Sbjct: 73 GVLL--------NVPLSGFRWETLRWMGILQRIAICYGCVAF--------------LFLF 110
Query: 197 RKYRG-HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPAC 255
R + + VL L+ LLYGL VP+ C + AC
Sbjct: 111 VNSRVIQYALVSVLFLLHTSLLYGLIVPN--------------------CLISERLTRAC 150
Query: 256 NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVM 315
+A +D ILG +HLY +DPEG+LS++M
Sbjct: 151 SAQSYLDTMILGGKHLYF----------------------------HLEYDPEGILSTLM 182
Query: 316 ATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL-----DFVGMHLNKALYS 370
AT+ GL L + R+L ++ S +G+ + L D + ++K L++
Sbjct: 183 ATINTFAGLEAARLTSSLRYVNQRILWCFLIGSSFVGIEILLVDCFPD--SVPISKPLWT 240
Query: 371 LSYTCLTAGAS 381
S+ LT G S
Sbjct: 241 ASFLFLTVGCS 251
>gi|410631394|ref|ZP_11342069.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
BSs20135]
gi|410148840|dbj|GAC18936.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
BSs20135]
Length = 366
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 157/347 (45%), Gaps = 77/347 (22%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q Q R+ ++DV RGL +ALM+LV++ G + H+ W+GLT D V PFFLF++G
Sbjct: 3 QSIQTRIEAIDVLRGLALALMLLVNNPGSWSAVYAPFLHADWHGLTPTDLVFPFFLFVMG 62
Query: 107 VSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
S+A + + ++ A+ L L +F+ GF I D +I MGVL
Sbjct: 63 ASMACSLRG---QIQASGLPWLSIFKRSFLLVFI--GFLLQIIPFDQAPDTWRI--MGVL 115
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI + +L+ V+S L++ ++ W L+L+ + L++Y
Sbjct: 116 QRIGLCFLL-------------VASMLAIIKE---RW---LLLSAVVTLIVY-------- 148
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
W+ ++ G + N+V D ILG H+++
Sbjct: 149 ------------WLLLLSAG-QAPYSLENNSVRHFDMAILGSAHMWQGK----------- 184
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
PFDPEGLLS++ A +T L G ++ K+ + ++L +I+
Sbjct: 185 ---------------GLPFDPEGLLSTIGAAMTVLSGYLICVNVLQQKNQKQQILQLMIV 229
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVR 393
+ L+ LG V +NK+L++ SY +++ + + LA I + R
Sbjct: 230 GAILLALGFVWS-VWHPINKSLWTSSYVLVSSAFACLSLAVIILLWR 275
>gi|418709516|ref|ZP_13270303.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|421125723|ref|ZP_15585968.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|421135286|ref|ZP_15595410.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|410020544|gb|EKO87345.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|410436829|gb|EKP85940.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|410770179|gb|EKR45405.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|456966468|gb|EMG08068.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. LT2186]
Length = 381
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 144/339 (42%), Gaps = 88/339 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFFLF VG+S
Sbjct: 6 NQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVGIS 65
Query: 109 LALT-YKNFPCKVVATRKAI-LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ + Y + T I +R++ L L+G+FL F G +++R GVL
Sbjct: 66 IHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLN---FFG------EWSFSELRIPGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI Y V A + L + S W+ L++ T L+ Q
Sbjct: 117 QRIGFVYWVVASLYLILPKRAILIS-----------WIPILIVHTWILI----------Q 155
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
P E+ P + IDR + G HL++ +S+T
Sbjct: 156 LPPPGESIVYLE--------------PGKDIGAWIDRNVFGENHLWK---FSKT------ 192
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+DPEG S + + T L+G+ G ++ + + IL
Sbjct: 193 ------------------WDPEGFFSGISSITTSLLGVFCGSILSSKTNETKKQ----IL 230
Query: 347 SSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAG 379
S + G G+ VG + +NK+L++ SY TAG
Sbjct: 231 S--IFGFGILFVLVGLLWNQNLPMNKSLWTGSYVIYTAG 267
>gi|449664780|ref|XP_002169793.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Hydra magnipapillata]
Length = 369
Score = 88.6 bits (218), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 94/333 (28%), Positives = 146/333 (43%), Gaps = 46/333 (13%)
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILR 129
MI V+ GG +HS WNGLT+AD + P+F+FI+G S+ ++ + K ++ RK ++
Sbjct: 1 MIFVNYGGGGYYFFSHSSWNGLTVADLLFPWFIFIMGSSIYISMHSLR-KKLSKRKMTVK 59
Query: 130 ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHV 189
I+ L G D+A R GVLQR AI+Y V AL +W
Sbjct: 60 I-------IYRSFKLLLLGLFLNNGFDLANWRLPGVLQRFAISYFVVALVFLWFDSPNEE 112
Query: 190 SSKLSLFRKYRGHW-----VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVT 244
S S +R W +V L+L T+YLL++Y L VP + +
Sbjct: 113 SETNSWKNMFRDVWFPFQHIVMLLLLTIYLLIIYLLNVPGCPKGYFGPGGDGDHGAYEKC 172
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
G A G +DR + G+ H+Y+ P C
Sbjct: 173 TG---------GASGYVDRTVFGLNHIYKNPTCKSLYNC-------------------FT 204
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL----IGL-GLSLDF 359
+DPEGLL ++ + + +GL ++ +K ++ W I S L +GL G +L+
Sbjct: 205 YDPEGLLGTIPSILLTYLGLQAARTLLFYKSKNGHIIRWFIWSVLLGALAVGLCGGTLND 264
Query: 360 VGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ +NK L+SLSY + +LL Y +V
Sbjct: 265 GAIPINKNLWSLSYIFAMGSTAYLLLLICYVLV 297
>gi|301625227|ref|XP_002941812.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
partial [Xenopus (Silurana) tropicalis]
Length = 370
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 56/144 (38%), Positives = 75/144 (52%), Gaps = 11/144 (7%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMP 99
S N + + RRL SLD FRG ++ +M+ V+ GG H+PWNGLT+AD VMP
Sbjct: 190 SEDNCGEQSKVPESRRLYSLDTFRGFSLTIMVFVNYGGGGYWFFEHAPWNGLTVADLVMP 249
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQ 159
+F+FI+G S+AL + N K +R +L L +F G FF L YG
Sbjct: 250 WFVFIIGTSVALAF-NAMLKRGLSRCQLLYKLTWRTCILFAIGVFF-----LNYGPADGP 303
Query: 160 IRWM-----GVLQRIAIAYLVAAL 178
+ W GVLQR+ Y V AL
Sbjct: 304 LSWRWARIPGVLQRLGFTYFVIAL 327
>gi|167764222|ref|ZP_02436349.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
43183]
gi|167698338|gb|EDS14917.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
43183]
Length = 396
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 98/368 (26%), Positives = 159/368 (43%), Gaps = 110/368 (29%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V PFF+FI+G+S
Sbjct: 7 TNKRILALDILRGVTIAGMIMVNNPGTWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66
Query: 109 LALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NNLKYGVDIA 158
++ K NF A K + R + +FL+G+ + G+F + +G +
Sbjct: 67 TYISLKKYNFEFSRAAGMKILKRTILIFLIGMGI--GWFSRFCYYWTSPTEGIGFGAQLW 124
Query: 159 QIRW-------MGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG-HWVVALVLT 210
+ W +GV+QR+A+ Y A+ ++L K+R +++A +LT
Sbjct: 125 EAAWTFDRIRILGVMQRLALCYGATAI--------------IALTMKHRNIPYLIATLLT 170
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQH 270
++LL+ CG G N + ++DR IL H
Sbjct: 171 GYFILLV---------------------------CG-NGFAYNDTNILSVVDRAILTPAH 202
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
+Y+ D G DPEGLLS++ A L+G G ++
Sbjct: 203 MYK---------------DNG-------------IDPEGLLSTIPAIAHVLLGFCVGRMM 234
Query: 331 VH---FKDHRDRMLNWIILSSCLIGL---GLSLDFVGMHL------NKALYSLSYTCLTA 378
+ + R+ MLN S LI L G L F G L NK ++S ++ T
Sbjct: 235 LEGGKADESRESMLN-----SHLIKLFLAGTILTFAGFLLSYGCPINKKIWSPTFVLATC 289
Query: 379 GASGVLLA 386
G + LA
Sbjct: 290 GLASSFLA 297
>gi|426359530|ref|XP_004047024.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gorilla
gorilla gorilla]
Length = 635
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 108/399 (27%), Positives = 171/399 (42%), Gaps = 71/399 (17%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSN--SKQTRPQHQQ-----RRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ +P + RL S+D FRG+ + LM+
Sbjct: 196 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPATWRLSALPPRLRSVDTFRGIALILMV 255
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAI 127
V+ GG H+ WNG ++ F+FI+G S+ L+ + K K
Sbjct: 256 FVNYGGGKYWYFKHASWNGAEGCIEMIEMFVFIMGSSIFLSMTSILQRGCSKFRLLGKIA 315
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 316 WRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFA 367
Query: 185 G--DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSS 237
H +S+ S W++ LVL L+L L + L VP T
Sbjct: 368 KPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPG------CPTGYLG 421
Query: 238 PWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 422 P-------GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYHTEV-------- 466
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+DPEG+L ++ + V +G+ G +++++K +L C++GL
Sbjct: 467 ----------AYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCILGL 516
Query: 354 -GLSLDFVGMH-----LNKALYSLSY-TCLTAGASGVLL 385
++L V + +NK L+SLSY T L++ A +LL
Sbjct: 517 ISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILL 555
>gi|146302702|ref|YP_001197293.1| hypothetical protein Fjoh_4975 [Flavobacterium johnsoniae UW101]
gi|146157120|gb|ABQ07974.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
Length = 423
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 105/401 (26%), Positives = 160/401 (39%), Gaps = 113/401 (28%)
Query: 53 QRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ RLISLDVFRGLT+ LM +V+ D G + P + H+ W+G T D V PFF+FI+GV++
Sbjct: 3 KERLISLDVFRGLTILLMTIVNNPGDWGNVYPPLLHAEWHGCTPTDLVFPFFIFIMGVAV 62
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIF------------------------------ 139
L + K ++R+L + LGIF
Sbjct: 63 PLAMPDKFYDSTTFNKILVRSLRMLCLGIFFNFFGKIQLFGLEGIPLLIGRLAITIAVGY 122
Query: 140 -LQGGFFHGINN------------LKY-GVDIAQ-IRWMGVLQRIAIAYLVAALCEIWLK 184
L G F + N L Y G++ +R GVLQRIAI Y V +L +
Sbjct: 123 ALMGSFSSKVKNILAFSILFIYLFLAYSGIEAYHDVRLPGVLQRIAIVYFVVSLLYLKTS 182
Query: 185 GDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVT 244
+ + + L G+W + ++ PV + N+
Sbjct: 183 QRTQIITGIVLL---LGYWAIMTLI--------------------PVPGIGEA----NLE 215
Query: 245 CGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
G N +D +L H+YR I +W
Sbjct: 216 KG--------TNLASWLDSVLLK-GHMYRGTI----------------------TW---- 240
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHL 364
DPEG+LS++ + V +IGL G ++ + I + LI GL D V +
Sbjct: 241 -DPEGILSTLPSIVNGIIGLLIGQVLQRDTTKILKAQKMGIAGTILIFFGLMWDLV-FPI 298
Query: 365 NKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
NK+L++ SY T G + V L +Y+ + I+ + KPF
Sbjct: 299 NKSLWTSSYVLYTTGLATVFLTILYYTID-IADYKKGFKPF 338
>gi|427384458|ref|ZP_18880963.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
12058]
gi|425727719|gb|EKU90578.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
12058]
Length = 395
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 159/368 (43%), Gaps = 99/368 (26%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ +R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V PFF+
Sbjct: 1 MSSQMKTNKRILALDILRGVTIAGMIMVNNPGTWGHIYAPLRHAEWNGLTPTDLVFPFFM 60
Query: 103 FIVGVSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NNLK 152
FI+G+S ++ K NF A K + R + +FL+G L G+F +
Sbjct: 61 FIMGISTYISLKKYNFEFSHAAAMKILKRTIIIFLIG--LAIGWFSKFCYYWTNPSEGIG 118
Query: 153 YGVDIAQ-------IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVV 205
+G + + IR +GV+QR+A+ Y A+ + +K H+ +++
Sbjct: 119 FGAQLWESVWTFDRIRILGVMQRLALCYGATAIIALTMK-HRHIP------------YLI 165
Query: 206 ALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKI 265
A +L ++LL+ CG G N + ++DR I
Sbjct: 166 ATLLIGYFILLI---------------------------CG-NGFAYNETNVLSIVDRAI 197
Query: 266 LGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLH 325
L H+Y+ D G DPEGLLS++ + L+G
Sbjct: 198 LTPAHMYK---------------DNG-------------IDPEGLLSTIPSIAHVLLGFC 229
Query: 326 FGHLIV--HFKDHRDRMLNWIILSSCLIGL-----GLSLDFVGMHLNKALYSLSYTCLTA 378
G +++ + R+ +LN ++ LIG G L + G +NK ++S ++ T
Sbjct: 230 VGRMMLDGNKAQSREELLNSHLIKLFLIGTILTFAGFLLSY-GCPINKKIWSPTFVLTTC 288
Query: 379 GASGVLLA 386
G + LA
Sbjct: 289 GLASSFLA 296
>gi|294949094|ref|XP_002786049.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
gi|239900157|gb|EER17845.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
Length = 277
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 128/343 (37%), Gaps = 112/343 (32%)
Query: 25 APANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAIN 84
AP++ LLPP+ + RP R++++DV RG + + +VD G +P+I
Sbjct: 6 APSSSVDSSGLLLPPTRRTTTPDRP-----RIVAVDVMRGRSS--VQIVDVCGKTVPSIG 58
Query: 85 HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGF 144
H+PWNGL LAD VMP F+FI
Sbjct: 59 HAPWNGLHLADIVMPGFIFI---------------------------------------- 78
Query: 145 FHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWV 204
+ L G+D+ R G+LQRIA+ Y A L + S LS +G
Sbjct: 79 ----DTLTLGLDLYTFRAPGILQRIAVCYAAAVLLR-------KLVSDLSPNDTVKG--- 124
Query: 205 VALVLTTLYLLLLYGLYVPDWQYEF--PVETSSSSPWIFNVTCGVRGSTGPACNAVGMID 262
AL + LL+ + +W P S RGS P CN ID
Sbjct: 125 -ALKNNSRVLLMGLLCIIINWAIMLLGPQPEGCS-----------RGSLTPQCNVASNID 172
Query: 263 RKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLI 322
R + G +H+Y +P W DPEGLLS++ T +
Sbjct: 173 RMVFGPEHMY------------------------SPLW-----DPEGLLSTLPTLATVAL 203
Query: 323 GLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLN 365
GL G I H + + L+G GL L GM L
Sbjct: 204 GLACGKFIQSRPSHTELLR--------LVGCGLLLALSGMALG 238
>gi|410613391|ref|ZP_11324450.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
psychrophila 170]
gi|410167053|dbj|GAC38339.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
psychrophila 170]
Length = 400
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 99/358 (27%), Positives = 162/358 (45%), Gaps = 67/358 (18%)
Query: 46 QTRPQHQ-QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFF 101
Q+ QH RL++LDVFRG+T+ MILV++ G I + H+ W+G TL D + PFF
Sbjct: 10 QSIYQHVPANRLLALDVFRGMTITAMILVNNPGSWQYIYSPLAHAKWHGWTLTDLIFPFF 69
Query: 102 LFIVGVSLALTYKNFPCKVVATRKAILRA---LNLFLLGIFLQGGFFHGINNLKYGV--- 155
+FIVGVS++L+ + + + I A + LL F++ + Y
Sbjct: 70 IFIVGVSISLSGQRQKEQGLGHGHIIHHALLRMFKLLLLGCFLALFYYNFSAADYDWFTQ 129
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLL 215
+ Q+R+MGVLQRIA+ Y+ L ++L S+L LV+ L +L
Sbjct: 130 RLMQMRFMGVLQRIALVYMACVLLWLFL-------SRLQ------------LVICMLAIL 170
Query: 216 LLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
+ Y L + P + ++ G A N +D + HLY
Sbjct: 171 VAYWLAMA----FIPYHDDLGNQYV--------GLLEYANNLSAWLDNYLFAKTHLY--- 215
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
YS + + FDPEG+LS++ A + L G+ G + F
Sbjct: 216 -YSSAQPFA--------------------FDPEGVLSTLPAIASGLSGVLAGQWL-SFSH 253
Query: 336 HRDR-MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
H R W+ + + L + + +NKAL++ SY LT+G + ++LAG+ +++
Sbjct: 254 HSMRHKAKWLAICGVVALLLGQMWAHWLPINKALWTSSYVLLTSGYAALILAGLMYVL 311
>gi|225012704|ref|ZP_03703139.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
gi|225003237|gb|EEG41212.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
Length = 366
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 53/137 (38%), Positives = 78/137 (56%), Gaps = 16/137 (11%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ + +RL++LD+ RGLT+ LMI+V+D G + H+ WNGLT D+V P FLFIVG
Sbjct: 2 ETKNQRLLALDILRGLTIILMIVVNDPGSWSEVYAPFLHAEWNGLTPTDYVFPTFLFIVG 61
Query: 107 VSLALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRW 162
VS+ L+ + + +K + RAL ++L+GIFL L + IRW
Sbjct: 62 VSIVLSLSKQLEAGKTRSQIAKKVLWRALKIYLVGIFLW---------LWPSFNFEGIRW 112
Query: 163 MGVLQRIAIAYLVAALC 179
+GVL RIA+ +L L
Sbjct: 113 VGVLPRIALVFLACGLI 129
Score = 45.4 bits (106), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 30/91 (32%), Positives = 51/91 (56%), Gaps = 3/91 (3%)
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHL 364
+DPEG LS++ A V+ LIG+ G++++ ++ + R+ + L+ LG ++ +V
Sbjct: 191 WDPEGFLSTLPAIVSGLIGMWAGYVLMKKEELKTRLNQLFFIGFILLFLGDAMQWV-FPF 249
Query: 365 NKALYSLSYTCLTAGASGVLLAGI--YFMVR 393
NK L+S S+T G + LA YF VR
Sbjct: 250 NKNLWSSSFTLFMGGIGMLSLAAFSYYFDVR 280
>gi|345856403|ref|ZP_08808889.1| putative membrane protein [Desulfosporosinus sp. OT]
gi|344330527|gb|EGW41819.1| putative membrane protein [Desulfosporosinus sp. OT]
Length = 381
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 96/370 (25%), Positives = 154/370 (41%), Gaps = 81/370 (21%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGIL---PAINHSPWNGLTLADFVM 98
NS + + RL +DVFRG+ VA+M++V + G L P + H+ WNG T+AD
Sbjct: 2 ENSIAPEEKGKFGRLNCIDVFRGIAVAIMLIVTNPGNPLRNYPQLRHAAWNGYTVADLAF 61
Query: 99 PFFLFIVGVSLALTY----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG 154
PFF+ I+G+ + K + ++R++ LF +GI L G +
Sbjct: 62 PFFMLIMGMVIPYAVDKRIKEGKSNLSIFNHILIRSIGLFCIGILLNGFPVY-------- 113
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
D++ IR GVLQRIAIAYL + E+ +K S L + + +AL + ++Y
Sbjct: 114 -DLSIIRIPGVLQRIAIAYLCTGIIELIVKATVK-KSYLQIIVESS----LALSIISVYS 167
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRK 274
+LL PD++ N V ID L HLY
Sbjct: 168 VLLIKYSFPDYK-----------------------------NLVQTIDLYFLK-GHLY-- 195
Query: 275 PIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
P W DPEG+L++ + T + G G+++ +
Sbjct: 196 ----------------------TPDW-----DPEGILTTFSSIATAIFGSIAGNILFNRD 228
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+ + I + + + S+ NK L+S SY +TAG + L + F+V
Sbjct: 229 NKARKKFITIFIYGVVTIIVASIIQRWFPYNKNLWSSSYVLITAGIA-YLTISMLFLVID 287
Query: 395 ISSHLMLKKP 404
++ L KP
Sbjct: 288 VAGFKALFKP 297
>gi|116621919|ref|YP_824075.1| hypothetical protein Acid_2804 [Candidatus Solibacter usitatus
Ellin6076]
gi|116225081|gb|ABJ83790.1| conserved hypothetical protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 367
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 96/332 (28%), Positives = 140/332 (42%), Gaps = 80/332 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL+SLD FRG T+ALM+LV++ G L + + HSPW+G T+ D V P FL+IVGV++ L
Sbjct: 12 RLVSLDAFRGATIALMVLVNNAGSGLDSYRQLEHSPWHGWTITDTVFPSFLWIVGVAITL 71
Query: 112 TYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ + V + + RA LF+ G+F+ F H D+ R +GVLQ
Sbjct: 72 SLGKRVAEGVPRSHLLPQILRRAAILFVFGLFVY-AFPH--------FDLGTQRILGVLQ 122
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIAI YL A S + L+ RG +L + GL W
Sbjct: 123 RIAICYLAA--------------SVIFLYSGVRGQ-----------ILWILGLLAAYWMM 157
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
+ P G N ID LG +H Y +RT
Sbjct: 158 MTLIPVPGYGP----------GRLDVEGNFAHYIDHLALG-RHNYHS---TRT------- 196
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
+DPEGL+S++ A T L G+ GH++ + +R
Sbjct: 197 -----------------WDPEGLVSTLPAIATALFGVLAGHILRCRRTLAERTSWMFTAG 239
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
S L+ GL + + +NK L++ S+ AG
Sbjct: 240 SLLLAAGL-ICTAWLPINKKLWTDSFCLFMAG 270
>gi|312131163|ref|YP_003998503.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
17132]
gi|311907709|gb|ADQ18150.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
17132]
Length = 413
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 110/391 (28%), Positives = 164/391 (41%), Gaps = 123/391 (31%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++R++SLDVFRGLT+ LMI V++ G + + H+ W+G T D V PFF+F +G++L
Sbjct: 5 KQRIVSLDVFRGLTMILMITVNNPGDWSNVYAPLLHAEWHGWTPTDLVFPFFVFAMGMAL 64
Query: 110 ALTYK--------NFPCKVVATRKAILRALNLFL-------------------------- 135
+ K +F K++A R A L AL LFL
Sbjct: 65 PFSMKPGSGLSKDDF-LKILA-RSARLIALGLFLNFFSKIEFGNAQGITLLLFRLMITGF 122
Query: 136 LGIFLQGGFFHGINNLKYGV--------------DIAQIRWMGVLQRIAIAYLVAALCEI 181
+G L G F I AQ+R GVLQR+ Y AA+
Sbjct: 123 VGFLLMGNFPTKIKLYTALALLGLMLALAYSGLPHFAQVRIPGVLQRLGTVYFFAAI--- 179
Query: 182 WLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF 241
L L R W + L + +Y LLL + VP
Sbjct: 180 -----------LYLAFSLRVQWGIGLSVLVIYWLLLAYIPVPG----------------S 212
Query: 242 NVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWC 301
VT +G PA ID +LG H ++S +K
Sbjct: 213 GVTGFEKGENLPAW-----IDSIVLG-DH-----VWSSSK-------------------- 241
Query: 302 QAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG 361
P+DPEG+LS++ A ++CL+G G + +D + +L +IL C GL S F
Sbjct: 242 --PWDPEGVLSTLPAIISCLLGAWAGVFLR--EDKKKLLLTGVILLIC--GLAWSTFF-- 293
Query: 362 MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+NKAL++ S+ LTAG ++++ + F+V
Sbjct: 294 -PINKALWTSSFVLLTAGLGSIIVSLLGFVV 323
>gi|404404862|ref|ZP_10996446.1| transmembrane protein [Alistipes sp. JC136]
Length = 382
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 93/354 (26%), Positives = 147/354 (41%), Gaps = 82/354 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDD---VGGILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL+SLDV RG+T+A MILV++ G + H+ W+G+T D + PFF+FI+GVS
Sbjct: 2 KSERLLSLDVMRGMTIAAMILVNNPAVWGKAYAPLQHAFWHGMTPTDLIYPFFVFIMGVS 61
Query: 109 --LALTYKNFPCKVVATRKAILRALNLFLLGIFLQ--GGFFHGINNLKYGVDIAQIRW-- 162
+L+ + A + + R+ +F +G+ LQ F +G N G A W
Sbjct: 62 AFFSLSKRYEGAGREAFSRILRRSAVIFGVGLLLQEISYFGYGTANFLSGQTSADATWFE 121
Query: 163 ----------MGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
MGVLQ +A+ YL + + L+ FR H +VA
Sbjct: 122 TVFPFRTFRIMGVLQGLALVYLFGSAALLCLR-----------FR----HLIVA------ 160
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
+ G + A N + ++DR +LG HLY
Sbjct: 161 --------------------AGGLLILYLVLLQTGNGYSLSADNIIAVVDRAVLGESHLY 200
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
R+ + P + F+PEGLLS++ L+G G +++
Sbjct: 201 REWL---------------------PDGSRLAFEPEGLLSTLPRIAQFLLGCAAGRILLA 239
Query: 333 FKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+D R + L GL L + G LNK ++S S+ T+G + +LL
Sbjct: 240 NEDAPMRFGRLFAFGTALFFTGLLLQY-GDPLNKKIWSSSFALATSGFASLLLG 292
>gi|417770421|ref|ZP_12418329.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|418680131|ref|ZP_13241383.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|421117858|ref|ZP_15578212.1| putative membrane protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|400328139|gb|EJO80376.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|409947562|gb|EKN97558.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|410010535|gb|EKO68672.1| putative membrane protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|455668600|gb|EMF33807.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Fox 32256]
Length = 381
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 143/339 (42%), Gaps = 88/339 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFFLF VG+S
Sbjct: 6 NQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVGIS 65
Query: 109 LALT-YKNFPCKVVATRKAI-LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ + Y + T I +R++ L L+G+FL F G +++R GVL
Sbjct: 66 IHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLN---FFG------EWSFSELRIPGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI Y V A + L + S W+ L++ T L+ Q
Sbjct: 117 QRIGFVYWVVASLYLILPKRAILIS-----------WIPILIVHTWILI----------Q 155
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
P E+ P + IDR + G HL++ +S+T
Sbjct: 156 LPPPGESIVYLE--------------PGKDIGAWIDRNVFGENHLWK---FSKT------ 192
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+DPEG S + + T L+G+ G ++ + + IL
Sbjct: 193 ------------------WDPEGFFSGISSITTSLLGVFCGSILSSKTNETKKQ----IL 230
Query: 347 SSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAG 379
S + G G VG + +NK+L++ SY TAG
Sbjct: 231 S--IFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAG 267
>gi|24213473|ref|NP_710954.1| hypothetical protein LA_0773 [Leptospira interrogans serovar Lai
str. 56601]
gi|45658672|ref|YP_002758.1| hypothetical protein LIC12842 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|386073105|ref|YP_005987422.1| hypothetical protein LIF_A0631 [Leptospira interrogans serovar Lai
str. IPAV]
gi|417764272|ref|ZP_12412242.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
gi|417786789|ref|ZP_12434477.1| putative membrane protein [Leptospira interrogans str. C10069]
gi|418669621|ref|ZP_13231000.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|418701883|ref|ZP_13262801.1| putative membrane protein [Leptospira interrogans serovar Bataviae
str. L1111]
gi|418702896|ref|ZP_13263788.1| putative membrane protein [Leptospira interrogans serovar
Hebdomadis str. R499]
gi|418717763|ref|ZP_13277304.1| putative membrane protein [Leptospira interrogans str. UI 08452]
gi|418729566|ref|ZP_13288113.1| putative membrane protein [Leptospira interrogans str. UI 12758]
gi|421083731|ref|ZP_15544602.1| putative membrane protein [Leptospira santarosai str. HAI1594]
gi|421102101|ref|ZP_15562711.1| putative membrane protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|421121310|ref|ZP_15581607.1| putative membrane protein [Leptospira interrogans str. Brem 329]
gi|24194245|gb|AAN47972.1|AE011263_12 conserved hypothetical protein [Leptospira interrogans serovar Lai
str. 56601]
gi|45601916|gb|AAS71395.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|353456894|gb|AER01439.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
str. IPAV]
gi|400353508|gb|EJP05677.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
gi|409950064|gb|EKO04595.1| putative membrane protein [Leptospira interrogans str. C10069]
gi|410345744|gb|EKO96814.1| putative membrane protein [Leptospira interrogans str. Brem 329]
gi|410368246|gb|EKP23624.1| putative membrane protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410433648|gb|EKP77988.1| putative membrane protein [Leptospira santarosai str. HAI1594]
gi|410754552|gb|EKR16202.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|410759015|gb|EKR25234.1| putative membrane protein [Leptospira interrogans serovar Bataviae
str. L1111]
gi|410767440|gb|EKR38115.1| putative membrane protein [Leptospira interrogans serovar
Hebdomadis str. R499]
gi|410775744|gb|EKR55735.1| putative membrane protein [Leptospira interrogans str. UI 12758]
gi|410786933|gb|EKR80669.1| putative membrane protein [Leptospira interrogans str. UI 08452]
gi|456824782|gb|EMF73208.1| putative membrane protein [Leptospira interrogans serovar Canicola
str. LT1962]
Length = 381
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 143/339 (42%), Gaps = 88/339 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFFLF VG+S
Sbjct: 6 NQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVGIS 65
Query: 109 LALT-YKNFPCKVVATRKAI-LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ + Y + T I +R++ L L+G+FL F G +++R GVL
Sbjct: 66 IHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLN---FFG------EWSFSELRIPGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI Y V A + L + S W+ L++ T L+ Q
Sbjct: 117 QRIGFVYWVVASLYLILPKRAILIS-----------WIPILIVHTWILI----------Q 155
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
P E+ P + IDR + G HL++ +S+T
Sbjct: 156 LPPPGESIVYLE--------------PGKDIGAWIDRNVFGENHLWK---FSKT------ 192
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+DPEG S + + T L+G+ G ++ + + IL
Sbjct: 193 ------------------WDPEGFFSGISSITTSLLGVFCGSILSSKTNETKKQ----IL 230
Query: 347 SSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAG 379
S + G G VG + +NK+L++ SY TAG
Sbjct: 231 S--IFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAG 267
>gi|417760159|ref|ZP_12408187.1| putative membrane protein [Leptospira interrogans str. 2002000624]
gi|417775681|ref|ZP_12423532.1| putative membrane protein [Leptospira interrogans str. 2002000621]
gi|418673844|ref|ZP_13235155.1| putative membrane protein [Leptospira interrogans str. 2002000623]
gi|409944118|gb|EKN89707.1| putative membrane protein [Leptospira interrogans str. 2002000624]
gi|410574555|gb|EKQ37586.1| putative membrane protein [Leptospira interrogans str. 2002000621]
gi|410579122|gb|EKQ46972.1| putative membrane protein [Leptospira interrogans str. 2002000623]
Length = 381
Score = 87.4 bits (215), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 143/339 (42%), Gaps = 88/339 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFFLF VG+S
Sbjct: 6 NQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVGIS 65
Query: 109 LALT-YKNFPCKVVATRKAI-LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ + Y + T I +R++ L L+G+FL F G +++R GVL
Sbjct: 66 IHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLN---FFG------EWSFSELRIPGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI Y V A + L + S W+ L++ T L+ Q
Sbjct: 117 QRIGFVYWVVASLYLILPKRAILIS-----------WIPILIVHTWILI----------Q 155
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
P E+ P + IDR + G HL++ +S+T
Sbjct: 156 LPPPGESIVYLE--------------PGKDIGAWIDRNVFGENHLWK---FSKT------ 192
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+DPEG S + + T L+G+ G ++ + + IL
Sbjct: 193 ------------------WDPEGFFSGISSITTSLLGVFCGSILSSKTNETKKQ----IL 230
Query: 347 SSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAG 379
S + G G VG + +NK+L++ SY TAG
Sbjct: 231 S--IFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAG 267
>gi|398348299|ref|ZP_10533002.1| hypothetical protein Lbro5_13954 [Leptospira broomii str. 5399]
Length = 399
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 94/334 (28%), Positives = 143/334 (42%), Gaps = 82/334 (24%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++RL+S+D RG TVA MILV++ G I + H+ W G T D V PFFLF VGVS+
Sbjct: 32 RKRLLSIDALRGFTVAGMILVNNPGSWSAIYWPLKHAKWFGCTPTDLVFPFFLFSVGVSI 91
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
+ K + RA L L+G+FL + G ++ +D Q+R GVLQRI
Sbjct: 92 PFSSIGNGGTFF---KILKRASILILIGLFLH---WFG----EWSID--QLRIPGVLQRI 139
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
+ Y ++A+ YR A +L L +L Y + + EF
Sbjct: 140 GLVYFISAIA-------------------YRSSNFHARILICLSILFGYWILL-----EF 175
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
S S S P + +DR + G HL++ S+T
Sbjct: 176 VPPPGSDSV-----------SLSPGKDWGAWLDRIVFGENHLWKS---SKT--------- 212
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM----LNWII 345
+DPEGLLSS+ A T +G FG ++ D + + N I
Sbjct: 213 ---------------WDPEGLLSSLSAVATTFLGFFFGEVLKKDSDTKKNIQKTAFN-FI 256
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
L++ +I + + +NK+L++ S+ T G
Sbjct: 257 LAAIVIMVAGWIWHQFFPMNKSLWTSSFVLWTGG 290
>gi|418690664|ref|ZP_13251772.1| putative membrane protein [Leptospira interrogans str. FPW2026]
gi|418722429|ref|ZP_13281595.1| putative membrane protein [Leptospira interrogans str. UI 12621]
gi|400360164|gb|EJP16144.1| putative membrane protein [Leptospira interrogans str. FPW2026]
gi|409963797|gb|EKO27519.1| putative membrane protein [Leptospira interrogans str. UI 12621]
gi|455790461|gb|EMF42326.1| putative membrane protein [Leptospira interrogans serovar Lora str.
TE 1992]
Length = 381
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 95/339 (28%), Positives = 143/339 (42%), Gaps = 88/339 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFFLF VG+S
Sbjct: 6 NQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFAVGIS 65
Query: 109 LALT-YKNFPCKVVATRKAI-LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ + Y + T I +R++ L L+G+FL F G +++R GVL
Sbjct: 66 IHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLN---FFG------EWSFSELRIPGVL 116
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI Y V A + L + S W+ L++ T L+ Q
Sbjct: 117 QRIGFVYWVVASLYLILPKRAILIS-----------WIPILIVHTWILI----------Q 155
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
P E+ P + IDR + G HL++ +S+T
Sbjct: 156 LPPPGESIVYLE--------------PGKDIGAWIDRNVFGENHLWK---FSKT------ 192
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+DPEG S + + T L+G+ G ++ + + IL
Sbjct: 193 ------------------WDPEGFFSGISSITTSLLGVFCGSILSSKTNETKKQ----IL 230
Query: 347 SSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAG 379
S + G G VG + +NK+L++ SY TAG
Sbjct: 231 S--IFGFGTLFVLVGLLWNQNLPMNKSLWTGSYVIYTAG 267
>gi|336316712|ref|ZP_08571601.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
gi|335878877|gb|EGM76787.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
Length = 363
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 96/328 (29%), Positives = 140/328 (42%), Gaps = 78/328 (23%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R +LD RGL +ALMILV+ G + + H+PW+G T AD V P FLF+VG ++
Sbjct: 5 RFYALDALRGLAIALMILVNTPGSWQHVYTPLLHAPWDGFTFADIVFPTFLFVVGAAMFY 64
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
+ K + + RAL L +G+ L N + + VD+A++R GVLQRI +
Sbjct: 65 SLKTAVLSRQSLWRVSSRALKLIGIGVLL--------NYVPFTVDLAELRLPGVLQRIGL 116
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
AY +AAL L++ R Y VALVL L+L G +P + +
Sbjct: 117 AYWLAALL------------ILTVKRSYLPFVAVALVLLYWLALVLGGGELP-----YSL 159
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
E + W D I G HLY+ +G
Sbjct: 160 EHNLVRQW----------------------DLAIFGAAHLYQ---------------GFG 182
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
FDPEG LS++ V LIG + + H+ L +LS ++
Sbjct: 183 -----------VAFDPEGFLSTLPCVVAVLIGFGSASFLQGKQQHQALRLL--LLSGLVL 229
Query: 352 GLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ L + +NKAL+S SY L+ G
Sbjct: 230 VVLALLWHLLWPINKALWSGSYLALSNG 257
>gi|359438686|ref|ZP_09228688.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
gi|359445329|ref|ZP_09235071.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
gi|358026628|dbj|GAA64937.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
gi|358040838|dbj|GAA71320.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
Length = 359
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 146/337 (43%), Gaps = 79/337 (23%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R +LD RGLT+ALMILV+ G + + H+ W+G T D + PFF+FI+G ++
Sbjct: 3 RYKALDAMRGLTIALMILVNTPGSWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAMFF 62
Query: 112 TYKNFPCKVVATR--KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
++K A++ + + R +F +G+ +N + +I +R +GVLQRI
Sbjct: 63 SFKKTNSAASASQVLRLVKRGAIIFAIGL--------ALNIYPFTTNIENLRILGVLQRI 114
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
IAY++A++C LF RG +++++ Y LLL
Sbjct: 115 GIAYILASIC--------------VLFLNRRGVLTLSVIILVAYWLLL------------ 148
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
++ G + N V +D +LG HL++
Sbjct: 149 -------------LSVGAENAYTLEHNLVRAVDIAVLGESHLWQGK-------------- 181
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
FDPEGL+S++ A V+ L G L+ + + +++
Sbjct: 182 ------------GLAFDPEGLISTLPAVVSVLFGFEVTRLLTS-TSCQWASIKRLLVIGV 228
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+ + L + M +NK+L++ S+ T+G + ++LA
Sbjct: 229 VAVVIGQLTSMVMPINKSLWTSSFVIYTSGIACIVLA 265
>gi|163786877|ref|ZP_02181325.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
ALC-1]
gi|159878737|gb|EDP72793.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
ALC-1]
Length = 361
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/340 (27%), Positives = 149/340 (43%), Gaps = 76/340 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R+ S+D+ RGLT+ MILV+ G + + H+ W+GLT D + PFFLFIVG+S+
Sbjct: 4 RIESVDILRGLTILAMILVNTPGTWGHVYTPLRHAEWHGLTPTDLIFPFFLFIVGISIYF 63
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
YKN P + +K I+R+L L LG+FL H + D R GVLQRI +
Sbjct: 64 AYKNKPNTKLTYKKIIIRSLKLIGLGLFLNLFLPH----FPFFNDFETHRIPGVLQRIGL 119
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
+L SS L L ++ + + + Y L L ++P P
Sbjct: 120 VFL--------------FSSILYLNCSWKSLTAIGITIILGYWLCLG--FIPFQDGSLPT 163
Query: 232 ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
+ + W ID ILG +H+++ DY
Sbjct: 164 FDRAPNNW------------------ANYIDLNILG-EHMWKT--------------DY- 189
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
DPEGL+S++ A TC+ G+ G L+ D + I + +
Sbjct: 190 --------------DPEGLISTIPAIATCISGILIGKLL----DGLTHIKYLFITALGFL 231
Query: 352 GLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
G L+ + NKA++S S+ +T+G + ++LA IY++
Sbjct: 232 ISGYILN-IWFPTNKAIWSSSFVLVTSGWATLILAVIYYL 270
>gi|146302547|ref|YP_001197138.1| hypothetical protein Fjoh_4820 [Flavobacterium johnsoniae UW101]
gi|146156965|gb|ABQ07819.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
Length = 423
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 103/391 (26%), Positives = 172/391 (43%), Gaps = 116/391 (29%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL SLDVFRG T+ LM +V++ G I P + H+ W+G T D V PFF+FI+G +
Sbjct: 2 TKERLTSLDVFRGFTILLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGTA 61
Query: 109 L--ALTYKNFPCKVVATRKAILRALNLFLLGIFLQ------------------------- 141
+ A+ K+F V K ++R+L +F LG+FL
Sbjct: 62 IPFAMPVKHFDGAVF--NKILVRSLRIFCLGLFLSVFSRIHLFGLEGIPLLGVRLVIAFA 119
Query: 142 ------GGFFHGINN-LKYGVDI-------------AQIRWMGVLQRIAIAYLVAALCEI 181
G F + L G+ I R GVLQRIAI Y A++ +
Sbjct: 120 VAYALLGNFSMKVKTILAVGIFIILLSLAFSGLEHFEDTRIPGVLQRIAIVYFFASI--L 177
Query: 182 WLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF 241
+LK + + ++L WVVA +L YLL+ + V P F
Sbjct: 178 YLKTN--LKTQL---------WVVASILVIYYLLMAF------------VPVPGFGPANF 214
Query: 242 NVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWC 301
+ +G+ N +D +L HL+ S+T
Sbjct: 215 D-----KGT-----NLAAWLDNLVLN-GHLWS---VSKT--------------------- 239
Query: 302 QAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG 361
+DPEG+LS++ A T ++G++ G L+ + + +R +L ++ ++ +G L +
Sbjct: 240 ---WDPEGILSTLPAIGTGILGMYIGQLL-NLQTNRTEILKKTAVTGVILLIGGLLWNII 295
Query: 362 MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+NK+L++ SY TAG + + L+ +Y+++
Sbjct: 296 FPINKSLWTSSYVLYTAGIATLCLSLLYYII 326
>gi|423223641|ref|ZP_17210110.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638266|gb|EIY32113.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 395
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 95/370 (25%), Positives = 163/370 (44%), Gaps = 102/370 (27%)
Query: 44 SKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPF 100
+ QT+ +R+++LD+ RG+T+A MI+V++ G I + H+ WNGLT D V PF
Sbjct: 2 NSQTK---TNKRILALDILRGVTIAGMIMVNNPGTWGHIYAPLRHAEWNGLTPTDLVFPF 58
Query: 101 FLFIVGVSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NN 150
F+FI+G+S ++ K NF A K + R + +FL+G L G+F
Sbjct: 59 FMFIMGISTYISLKKYNFKFSHAAALKILKRTIIIFLIG--LAIGWFSRFCYYWAGSHEG 116
Query: 151 LKYGVDI-------AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHW 203
L +G + +IR +GV+QR+A+ Y A+ + +K H+ +
Sbjct: 117 LSFGEQLWASVWTFDRIRILGVMQRLALCYGATAIIALTMK-HRHIP------------Y 163
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
++A +L ++LL+ CG G N + ++DR
Sbjct: 164 LIATLLVGYFILLM---------------------------CG-NGFAYNETNILSIVDR 195
Query: 264 KILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
IL H+Y+ D G DPEGLLS++ + L+G
Sbjct: 196 AILTPAHMYK---------------DNG-------------IDPEGLLSTIPSIAHVLLG 227
Query: 324 LHFGHLIV--HFKDHRDRMLNWIILSSCLIG-----LGLSLDFVGMHLNKALYSLSYTCL 376
G +++ + + R+ +LN ++ L+G G L + G +NK ++S ++
Sbjct: 228 FCVGRMMLDSNKAESREALLNSHLIKLFLVGAILTFTGFLLSY-GCPINKKIWSPTFVLT 286
Query: 377 TAGASGVLLA 386
T G + LA
Sbjct: 287 TCGLASSFLA 296
>gi|392555555|ref|ZP_10302692.1| hypothetical protein PundN2_08983 [Pseudoalteromonas undina NCIMB
2128]
Length = 359
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/337 (24%), Positives = 146/337 (43%), Gaps = 79/337 (23%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R +LD RGLT+ALMILV+ G + + H+ W+G T D + PFF+FI+G ++
Sbjct: 3 RYKALDAMRGLTIALMILVNTPGSWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAMFF 62
Query: 112 TYKNFPCKVVATR--KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
++K A++ + + R +F +G+ +N + +I +R +GVLQRI
Sbjct: 63 SFKKTNSAASASQVLRLVKRGAIIFAIGL--------ALNIYPFTTNIENLRILGVLQRI 114
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
IAY++A++C LF RG +++++ Y LLL
Sbjct: 115 GIAYILASIC--------------VLFLNRRGVLTLSVIILVAYWLLL------------ 148
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
++ G + N V +D +LG HL++
Sbjct: 149 -------------LSVGAENAYTLEHNLVRAVDIAVLGESHLWQGK-------------- 181
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
FDPEGL+S++ A V+ L G L+ + + +++
Sbjct: 182 ------------GLAFDPEGLISTLPAVVSVLFGFEVTRLLTS-TSCQWASIKRLLVIGV 228
Query: 350 LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+ + L + M +NK+L++ S+ T+G + ++LA
Sbjct: 229 VAVVIGQLTSMVMPINKSLWTSSFVIYTSGIACIVLA 265
>gi|393725858|ref|ZP_10345785.1| hypothetical protein SPAM2_19574 [Sphingomonas sp. PAMC 26605]
Length = 400
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 105/346 (30%), Positives = 156/346 (45%), Gaps = 79/346 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL +LDV RGL VA MILV G A + H+ W+G TLAD V P FLF VG++L L
Sbjct: 7 RLEALDVLRGLAVAGMILVVSPGDWSMAYAQLQHAAWHGATLADMVFPTFLFSVGMALGL 66
Query: 112 TYKNFPCKVVATR----KAILRALNLFLLGIFLQ---------GGFFHGINNLKYGVDIA 158
++ R + I R++ L +LG+ ++ G + G L Y
Sbjct: 67 SFPRLMADTAQRRLFWMRLIRRSITLVVLGLVVEATYVWTISAGAPYPGHGGLSY----- 121
Query: 159 QIRWMGVLQRIAIAYLV-AALCEIWLK--GDGHVSSKLSLFRKYRGHWVVALVLTTLYLL 215
+R G+LQRI + YL+ AL + + DG ++ R + +A +L + L
Sbjct: 122 -VRIPGILQRIGLCYLLGGALIVVTSRTIADGRIA-----IAPQRVLFCIAAILIGYWAL 175
Query: 216 LLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
L + + VP F V G P + +DR + + HL+ P
Sbjct: 176 LRF-VPVPG----------------FGV-----GLLTPDGSLPAFVDRTLFTVPHLW--P 211
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV-HFK 334
+ S T Q GP A +DPEGLLS++ AT L FG L ++
Sbjct: 212 LGSATGQ--------GP----------ATYDPEGLLSTLPATANLL----FGALAAWAWR 249
Query: 335 DHRDR-MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ DR L+ I + LI GL+LD V +NK L++ S+ ++G
Sbjct: 250 QNSDRATLHVAIAGTMLIIAGLALDPV-FEINKRLWTSSFALFSSG 294
>gi|436833933|ref|YP_007319149.1| Protein of unknown function DUF2261,transmembrane [Fibrella
aestuarina BUZ 2]
gi|384065346|emb|CCG98556.1| Protein of unknown function DUF2261,transmembrane [Fibrella
aestuarina BUZ 2]
Length = 361
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 99/338 (29%), Positives = 132/338 (39%), Gaps = 89/338 (26%)
Query: 70 MILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKA 126
MILV++ G + + H+PWNG T D + PFFLFIVGVS+ ++ K
Sbjct: 1 MILVNNAGDWAHSYAPLKHAPWNGWTPTDLIFPFFLFIVGVSITFALSKRQTSLLEDEKT 60
Query: 127 -----ILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEI 181
I R + LF LG FL NL D A +R MGVLQRI I Y V AL
Sbjct: 61 QRLKIIRRGVTLFALGFFL---------NLFPRFDFANVRIMGVLQRIGIVYTVCAL--- 108
Query: 182 WLKGDGHVSSKLSLFRKYRGHWVVALVLTTL--YLLLLYGLYVPDWQYEFPVETSSSSPW 239
+F + V L+L L Y LL+ + VP Y
Sbjct: 109 -------------VFLRTSPRQQVNLILLILIGYFLLMTMVPVPGIGY------------ 143
Query: 240 IFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPS 299
+ P N IDR IL H YR +
Sbjct: 144 ---------ANLEPETNLAAWIDRTILTPAHCYR----------------------SSKV 172
Query: 300 WCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK-----DHRDRMLNWIILSSCLIGLG 354
W DPEGLLS+V A T L+GL G + + + L + + +G
Sbjct: 173 W-----DPEGLLSTVPAIATGLLGLLAGRWLRSTRYGTTVRESQKALFLFLAGLLMAFVG 227
Query: 355 LSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
D V +NKAL++ SY L G + LA Y+++
Sbjct: 228 TLFDTV-FPINKALWTSSYVLLAGGLAMCGLAIFYYLI 264
>gi|255656770|ref|ZP_05402179.1| hypothetical protein CdifQCD-2_14006 [Clostridium difficile
QCD-23m63]
Length = 481
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 84/347 (24%), Positives = 151/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 112 MNSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 171
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 172 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLNTVRILG 222
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G L + + ++ G Y+
Sbjct: 223 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATILTFLCISVFIIVGYYILA 270
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 271 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 295
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 296 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKI 338
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ GVLL+ YF+
Sbjct: 339 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGVLLSIFYFI 385
>gi|224025514|ref|ZP_03643880.1| hypothetical protein BACCOPRO_02254, partial [Bacteroides
coprophilus DSM 18228]
gi|224018750|gb|EEF76748.1| hypothetical protein BACCOPRO_02254 [Bacteroides coprophilus DSM
18228]
Length = 298
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 92/338 (27%), Positives = 147/338 (43%), Gaps = 86/338 (25%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVD-DVGGILPAINHSPWNGLTLADFVMPFFLFI 104
T Q +RL+S+D+FRG+T+ MILV+ GG + H P G +AD V P F+FI
Sbjct: 9 NTLNQKNMKRLLSIDIFRGITIFFMILVNTQAGGSFDFLIHIPGYGWRIADLVYPSFIFI 68
Query: 105 VGVSLALTYKNF-PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWM 163
+G S+ L+ + + K I R L IFL G F N + + ++ +R +
Sbjct: 69 MGASMYLSMRKYVEAPPTDLYKHIFRRTVL----IFLMGIIF---NWIPFDQNLLDVRIL 121
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
GVLQRIAI YL+ +L I ++ + + LS + L+L YLL + G
Sbjct: 122 GVLQRIAIVYLICSLLVIKVRS---IPTLLS---------ISGLILGFYYLLTMKG---- 165
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
YE V +D I+G +H+Y
Sbjct: 166 ---YEI----------------------------VDAVDLAIIGTKHMY----------- 183
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
+P + DPEGLLSS+ + V +IG + I+ + ++R++
Sbjct: 184 ---TPTH---------------DPEGLLSSIPSVVNAIIG-YVSARILTENELKERLIKM 224
Query: 344 IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGAS 381
++ +I L L + + + K +S S+ LT+G S
Sbjct: 225 STIAISMIALAYILHWTILPIYKTYWSSSFGLLTSGIS 262
>gi|294895713|ref|XP_002775269.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239881343|gb|EER07085.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 323
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 88/324 (27%), Positives = 123/324 (37%), Gaps = 107/324 (33%)
Query: 62 FRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVV 121
RG+ +++M++VD G +P+I H+PWNGL LAD VMP F+FI +++ L
Sbjct: 1 MRGVVMSIMLIVDVCGKAVPSIGHAPWNGLHLADIVMPGFIFIDTLTVGL---------- 50
Query: 122 ATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEI 181
D+ R G+LQRIA+ Y A L
Sbjct: 51 ----------------------------------DLYTFRAPGILQRIAVCYAAAVLLA- 75
Query: 182 WLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF 241
+ S LS +G AL + L++ V +W P
Sbjct: 76 ------KLVSDLSPNDTVKG----ALKNNSRVLVVGLLCIVINWAIMLLGPQPKGCP--- 122
Query: 242 NVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWC 301
RGS P CN IDR + G +H+Y P W
Sbjct: 123 ------RGSLTPQCNVASNIDRMVFGPEHMYN------------------------PLW- 151
Query: 302 QAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG 361
DPEGLLS++ + T +GL G I H + + L+G GL LD G
Sbjct: 152 ----DPEGLLSTLPSLATVALGLACGKFIQSRPSHTELLR--------LVGCGLLLDLCG 199
Query: 362 MHL------NKALYSLSYTCLTAG 379
M L +K L++ SY LT G
Sbjct: 200 MGLGIVIPVSKVLWTPSYCLLTGG 223
>gi|408382946|ref|ZP_11180487.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
formicicum DSM 3637]
gi|407814484|gb|EKF85111.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
formicicum DSM 3637]
Length = 382
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 101/360 (28%), Positives = 156/360 (43%), Gaps = 89/360 (24%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVG--GILPAI-NHSPWNGLTLADF 96
+++NS ++RR+ISLDVFRGL VA MI V+ + P I H+ WNGLT AD
Sbjct: 8 NSTNSTVKTNAVKKRRVISLDVFRGLAVAAMIFVNAMAFSEFTPGIFEHATWNGLTFADL 67
Query: 97 VMPFFLFIVGVSLALTY--KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG 154
V P FLFIVGVS+A ++ ++ K + R LF +G+ L +
Sbjct: 68 VFPSFLFIVGVSMAYSFAARSKNSKRDLWGHFLFRVGALFTIGVALN----------WFT 117
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWV--VALVLTTL 212
D + +R GVLQ IA+A L A S +++ W+ VA VL +
Sbjct: 118 SDFSMVRIPGVLQLIALASLFA-----------------SPMARFKPRWILLVAGVLLLI 160
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
+ +L G+ P P G+ N ID ++L + H
Sbjct: 161 HGFILLGVGAPG----IPA-----------------GTLEKGNNIDDWIDIQVLTVNH-- 197
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
+DA DPEG+LS + AT L+GL G +
Sbjct: 198 ---------------------TIDANG------DPEGILSIITATALVLLGLCVGRTL-Q 229
Query: 333 FKDHRDRMLNWIILSSCLIG--LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
+ H + + I+L+ I LGL+L + + +NK L++ S+ + AG + L +++
Sbjct: 230 LRKHNLKTIG-ILLAGGAISLLLGLALSQI-LPINKQLWTSSFILVCAGIGTLFLTILFW 287
>gi|54297581|ref|YP_123950.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
gi|53751366|emb|CAH12784.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
Length = 372
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 92/358 (25%), Positives = 147/358 (41%), Gaps = 88/358 (24%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMPF 100
N+ +PQ RL+SLDVFRG+T+ LMI+V+ I P H WNG TLAD V PF
Sbjct: 2 NTVPLKPQ----RLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPF 57
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
FLFIVG++ ++ KN RKA + + + +N + ++ I
Sbjct: 58 FLFIVGLTSVISLKN-----QMERKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSI 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R G+LQRIA+ YL++A + + S K F + LVL Y +++ +
Sbjct: 113 RIYGILQRIAVCYLISAFIYL------NTSIKTQFF--------IFLVLLLGYWIIMTQV 158
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
VP + + + W V D+ HLY K
Sbjct: 159 PVPGYGAN---QLTKDGSW------------------VSYFDQLFFSASHLYEK------ 191
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM 340
+DPEG +S+ + T L G+ G L+++ + +
Sbjct: 192 -----------------------TYDPEGFVSTFTSIATTLSGVLAGSLLINPCNQFKKF 228
Query: 341 LNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L G+G+ +G +NK L++ SY T+G + ++ A Y ++
Sbjct: 229 -------YLLAGVGMLFLLLGWLWNMSFPINKNLWTSSYVLWTSGLALLVFAFCYLLI 279
>gi|345320430|ref|XP_001516736.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
[Ornithorhynchus anatinus]
Length = 448
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/326 (27%), Positives = 141/326 (43%), Gaps = 57/326 (17%)
Query: 21 KSTSAPANEKLERDPLLPPSNSNSKQTRPQHQ------QRRLISLDVFRGLTVALMILVD 74
K + ++L L P+ ++S + Q + +RL SLD FRG ++ +M+ V+
Sbjct: 159 KKMNPRETDRLINSELGSPTRADSYNSDLQAEVWRSSSPQRLRSLDTFRGFSLIIMVFVN 218
Query: 75 DVGGILPAINHSPWNGLTLADFVMPFFLFIVG----VSLALTYKNFPCKVVATRKAILRA 130
GG H WNGLT+AD V P+F+FI+G +SL+ + K K + R+
Sbjct: 219 YGGGKYWFFKHEGWNGLTVADLVFPWFVFIMGSSISLSLSSMLRRGYSKWRLLWKILWRS 278
Query: 131 LNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVS 190
LFL+G+ + N + ++R GVLQR+ YLV A E+ S
Sbjct: 279 FLLFLIGVLIVNP-----NYCLGPLSWDKLRIPGVLQRLGFTYLVVATLELLFAKAVPES 333
Query: 191 SKL----SLFRK---YRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNV 243
+ L S ++ Y W+ L+L T +L L + L VP T P
Sbjct: 334 NSLERTCSFLQEIISYWPQWIFILMLETAWLCLTFLLPVPG------CPTGYLGP----- 382
Query: 244 TCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPS 299
G G G P C A G ID +LG H+Y+ P SP +
Sbjct: 383 --GGIGDFGKYPNCTGGAAGYIDHLLLGENHIYQHP-----------SP-------NVLY 422
Query: 300 WCQAPFDPEGLLSSVMATVTCLIGLH 325
+ +DPEG+L ++ + V +G+
Sbjct: 423 HTKVAYDPEGILGTINSIVMAFLGVQ 448
>gi|434387287|ref|YP_007097898.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
gi|428018277|gb|AFY94371.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
Length = 377
Score = 86.3 bits (212), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 104/371 (28%), Positives = 161/371 (43%), Gaps = 90/371 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPA-------INHSPWNGLTLADFVMPFFLFIVGV 107
RL SLDVFRGLT+A MILV+ LP ++H+PW+G T+AD V PFFL+I+GV
Sbjct: 2 RLTSLDVFRGLTMATMILVNMAS--LPNDDRKYAWLDHAPWHGYTIADLVFPFFLYIIGV 59
Query: 108 SLALTYKNFPCKVVATRKAIL-----RALNLFLLGIFLQGGFF-HGINNLKYGVDIAQIR 161
S+A + + V K + R+ LF LG+ L + + + K+ ++ ++R
Sbjct: 60 SMAFSLAKYTSGDVPLSKQVYWQILRRSAILFGLGLILNNLVWNYNLTEPKFFANLDKLR 119
Query: 162 WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLY 221
MGVLQRI IA+ A++ V+AL L+ +L G+
Sbjct: 120 IMGVLQRIGIAFFFASIA------------------------VLALAQRCLW-ILTGGIL 154
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
V W + +S GV G N +DR I+ HL++ +K
Sbjct: 155 VGYWLILAFIPALDNSD-------GVFSQFG---NFGAYVDRLIITPAHLHKG-----SK 199
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML 341
S DPEGL S++ A + L FG+L + +
Sbjct: 200 NLS---------------------DPEGLFSTLPAISSIL----FGYLTGTWLKRQPVAT 234
Query: 342 NWIILSSCLIGLGLSLDFVGM------HLNKALYSLSYTCLTAGASGVLLAGIYFMVRYI 395
S+ L+ GL+ +G+ +NK L++ S+ T G + LA Y +V +
Sbjct: 235 R---TSASLLMYGLAAVVIGLVWNSFFPINKKLWTSSFVLFTTGWGLISLAACYELVD-V 290
Query: 396 SSHLMLKKPFD 406
+ KPF+
Sbjct: 291 RKYRQWFKPFE 301
>gi|326801867|ref|YP_004319686.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326552631|gb|ADZ81016.1| hypothetical protein Sph21_4499 [Sphingobacterium sp. 21]
Length = 376
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/337 (27%), Positives = 145/337 (43%), Gaps = 77/337 (22%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVG-GILP--AINHSPWNGLTLADFVMPFFLFIVG 106
+ Q+ R +LDVFRG+T+ MI+V+ G G P +NH+ W+G T D V P FLF VG
Sbjct: 2 KEQKLRFTALDVFRGMTICFMIIVNSPGSGATPYWPLNHATWHGFTPTDLVFPSFLFAVG 61
Query: 107 VSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVD--IAQI 160
+L+ + + F + +++++ +L RA +FLLG + F I + + +
Sbjct: 62 NALSFSERKF--QYLSSKQVLLTIFKRAALIFLLGFLMYWFPFFKITEQHEIISFPLHET 119
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R GVLQRIA+ YL AL +++ RKY LV + LLL+Y
Sbjct: 120 RVFGVLQRIALCYLFTALAVYYVR------------RKY-------LVWLAIALLLIY-- 158
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
W+ + G NA+ +D +LG HLY
Sbjct: 159 ------------------WVILLIFGTDAPYSLEGNAIFKLDLWLLGESHLYH------- 193
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM 340
+G + FDPEGLLS++ A + G G + +
Sbjct: 194 --------SHGII-----------FDPEGLLSTIPAITNAIAGYLVGKYLQEKGGTVQSL 234
Query: 341 LNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
+I+ + + + L + V +NK ++S S+ LT
Sbjct: 235 GKLLIIGAIGVLIALVWNQV-FPINKKIWSSSFALLT 270
>gi|405957484|gb|EKC23691.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crassostrea gigas]
Length = 1901
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 95/376 (25%), Positives = 154/376 (40%), Gaps = 102/376 (27%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFF 101
++ + +RP+ + RL SLD FRG F
Sbjct: 1531 ADEQDSRPKKE--RLKSLDTFRG------------------------------------F 1552
Query: 102 LFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
++I+G ++A ++ +V K R+ LF+LG+ + G G + + +
Sbjct: 1553 IWIMGTAMAYSFTGMLRRVTPKHKIFWKIFKRSCILFVLGLLVNTG---GCDPTR----L 1605
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVS--SKLSLFRK---YRGHWVVALVLTTL 212
+R GVLQR A YLV A ++ VS + R + W++ +V T+
Sbjct: 1606 THLRIPGVLQRFAGTYLVVASIHMFFAKTVDVSMYTYWGFIRDIVDFWLEWILHIVFVTV 1665
Query: 213 YLLLLYGLYVPDWQYEF-------PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKI 265
++++ + L VP + S+ + N T G A G IDR++
Sbjct: 1666 HIIITFTLDVPGCGKGYIGPGGLHEAVNSTEASVYQNCTGG----------AAGYIDRQV 1715
Query: 266 LGIQHLYR----KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCL 321
G H+Y+ KPIY T P+DPEGLL ++ + C
Sbjct: 1716 FGDDHIYQSPTCKPIYKTT----------------------VPYDPEGLLGTLNSVFMCY 1753
Query: 322 IGLHFGHLIVHFKDHRDRMLNWII--LSSCLIG---LGLSLDFVGMHLNKALYSLSYTCL 376
+GL G +++ FK+ R+ ++I L CLI G D + LNK L+SLS+
Sbjct: 1754 LGLQAGKILMTFKEPSARVKRFLIWGLFLCLIAGALCGFKKDGGTIPLNKNLWSLSFVLC 1813
Query: 377 TAGASGVLLAGIYFMV 392
AG + VLLA Y +
Sbjct: 1814 MAGFAFVLLAFCYVTI 1829
>gi|296452402|ref|ZP_06894103.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
difficile NAP08]
gi|296258732|gb|EFH05626.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
difficile NAP08]
Length = 481
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 84/349 (24%), Positives = 150/349 (42%), Gaps = 80/349 (22%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +G
Sbjct: 110 KFVNSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLG 169
Query: 107 VSLALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRW 162
V++ ++ KN + R++ L L G FL N D+ +R
Sbjct: 170 VTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLNTVRI 220
Query: 163 MGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYV 222
+GVLQR+ + Y V +L + LK SS V + + ++ G Y+
Sbjct: 221 LGVLQRMGLVYFVTSLVYLLLKKLNVRSS------------VTIITFLCISAFIIVGYYI 268
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
Y F +E S + ++D H ++ +Y
Sbjct: 269 VAKPYGFELEGSLAQ----------------------LVD------LHFFKGHLY----- 295
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
PD F+P+G L+S++A + ++G G +++ +
Sbjct: 296 ----KPD---------------FEPDGFLTSIVAISSGMLGCMMGCVLLKENIGEYKKFF 336
Query: 343 WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
I++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 337 KILVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGILLSIFYFI 385
>gi|374311063|ref|YP_005057493.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358753073|gb|AEU36463.1| protein of unknown function DUF1624 [Granulicella mallensis
MP5ACTX8]
Length = 435
Score = 85.9 bits (211), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 96/369 (26%), Positives = 157/369 (42%), Gaps = 83/369 (22%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADF 96
S + S+ R + R++S+DV RG+T+ALMILV+D G I ++H+ WNG TL D
Sbjct: 34 SQTVSQTERTVSKPGRVLSVDVLRGITIALMILVNDPGDWDHIFGQLDHAAWNGWTLTDM 93
Query: 97 VMPFFLFIVGVSLALTYK----NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLK 152
V P FLF++G S+ + + CK RA +L ++ FF ++
Sbjct: 94 VFPAFLFLMGASIIFSLQARIARGNCKGTLAGHIFARAGK--ILALYWVLAFFPRMH--- 148
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
IRW GVL RIA+ YL+A+L L V ++ +VA +L
Sbjct: 149 -----WTIRWFGVLPRIALCYLLASLV---LLATRRVRVLIA---------IVAFLLVGY 191
Query: 213 YLLLLY----GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKI--L 266
++LL + GL P F + ++ + W IDR +
Sbjct: 192 WVLLRWVPVPGLGTPMRDIPFMDQNANLASW---------------------IDRGVSSW 230
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
++ L+ +Y +T+ DPEGLLS++ A T L+G
Sbjct: 231 SLRWLHTGTLYRKTR------------------------DPEGLLSTLPAVATTLLGALA 266
Query: 327 GHLIVHFK---DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
G +++ + + RM + L +NK L++ S+ L AG + +
Sbjct: 267 GMWMINGQRVVNGMRRMRIGLAAMGVAGVAAGVLWSRWFPINKNLWTSSFVLLMAGWTAL 326
Query: 384 LLAGIYFMV 392
LAG +++
Sbjct: 327 ALAGCSWLI 335
>gi|116327439|ref|YP_797159.1| hypothetical protein LBL_0655 [Leptospira borgpetersenii serovar
Hardjo-bovis str. L550]
gi|116120183|gb|ABJ78226.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
Hardjo-bovis str. L550]
Length = 369
Score = 85.5 bits (210), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 100/357 (28%), Positives = 153/357 (42%), Gaps = 94/357 (26%)
Query: 59 LDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS--LALTY 113
+D+FRG+TV MILV++ G + + H+ WNG T D V PFFLF VG S ++L
Sbjct: 1 MDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLYS 60
Query: 114 KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAY 173
KN ++ +R ++L LLG+FL N +++R GVLQRI Y
Sbjct: 61 KNGINRIRIWIGICIRGISLILLGLFL---------NFFGEWTFSELRIPGVLQRIGFVY 111
Query: 174 LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVET 233
V A +LF + G V+ ++ L V W + T
Sbjct: 112 WVVA----------------TLFLVFPGKKVLVFLIPI--------LLVHTW-----ILT 142
Query: 234 SSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
+ P V G A IDR I G +HL++ +S+T
Sbjct: 143 HIAPP----GESMVSLEQGKDIGA--WIDRTIFGEKHLWK---FSKT------------- 180
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDRMLNWIILSSCLIG 352
+DPEG LS + + T L G+ G ++ + ++R+L+ + G
Sbjct: 181 -----------WDPEGFLSGIASIATSLFGVICGFILFRREGRGKNRVLS-------IFG 222
Query: 353 LGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKK 403
LG FVG + +NK+L++ SY T G + L G + Y+ S ++LKK
Sbjct: 223 LGFLFTFVGLLWDRSLPMNKSLWTGSYAVYTTGFA-FLCIGFF---EYLDSLILLKK 275
>gi|410664067|ref|YP_006916438.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
21679]
gi|409026424|gb|AFU98708.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
21679]
Length = 390
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 101/346 (29%), Positives = 160/346 (46%), Gaps = 69/346 (19%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL+S+DV RGL +A M+LV++ G + + H+ W+G T D + P FL++VG+S+ L
Sbjct: 15 RLMSVDVLRGLAIAAMVLVNNPGSWSHVYAPLAHAEWHGWTPTDVIFPLFLYVVGLSIVL 74
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFF-HGINNLKYGVD-IAQIRWMGVLQRI 169
K + R LRA LF LG+FL +F + D + +R +GVLQRI
Sbjct: 75 AQKG-ETFALPGRSTWLRAAKLFGLGLFLALFYFPFAKPEFSWWRDQLLDVRILGVLQRI 133
Query: 170 AIAYL----VAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
A+ YL +A LC+ + W++ L T +++ L Y L +
Sbjct: 134 ALVYLACCYLAWLCQ-------------------KRQWLLWLA-TLVFMWLAYALML--- 170
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
S P+ + RG + +D+ +LG +HLY + T Q
Sbjct: 171 ----------SIPYADDTGEIYRGQLVFGNHFSAWLDQLLLGREHLYYQ-----TAQ--- 212
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
P FDPEGLL+++ A + L+G+ G L + H R+ W
Sbjct: 213 --------PFA--------FDPEGLLTTLPAISSGLLGVLAG-LQLKAAGHSSRLEIWFA 255
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ G L V +NKAL++ S+ +TAG S +LLA +Y++
Sbjct: 256 GGVLMLVAGQLLHPV-CPINKALWTPSFVLVTAGVSQLLLASLYWL 300
>gi|325922207|ref|ZP_08183994.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
gi|325547326|gb|EGD18393.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
Length = 390
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 99/324 (30%), Positives = 130/324 (40%), Gaps = 88/324 (27%)
Query: 22 STSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILP 81
S PA + P L P ++ R +SLDVFRGLT+ LMILV+ G
Sbjct: 2 SEQTPAAAAITASPTLTP------------KRERFLSLDVFRGLTIFLMILVNTAGPGAQ 49
Query: 82 A---INHSPWNGLTLADFVMPFFLFIVGVSLALTY-KNFPCKVVATRKAILRALNLFLLG 137
A + H+ W G TLAD V P FLF VG +++ + P + R RA +FL G
Sbjct: 50 AYAQLTHAAWFGFTLADLVFPSFLFAVGSAMSFALAADTPHRPFLGRVGK-RAALIFLCG 108
Query: 138 IFLQG-GFFHGINNLKYG-VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSL 195
+ + FFH + I Q+R GVLQRI + YL+AAL +L G + ++L
Sbjct: 109 VLMYWFPFFHLQPGGGWAFTAIDQLRLTGVLQRIGLCYLLAALLVRYLPPRGIAPACVAL 168
Query: 196 FRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPAC 255
G+W V LYV F TG
Sbjct: 169 L---LGYWAV--------------LYV------------------FGQPGAELSKTG--- 190
Query: 256 NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVM 315
NA +D + G HLYRK D G FDPEGLL ++
Sbjct: 191 NAGTRLDLWLYGRAHLYRK--------------DNG-------------FDPEGLLGTLP 223
Query: 316 ATVTCLIGLHFGHLIVHFKDHRDR 339
ATV L G+L F R +
Sbjct: 224 ATVNVLA----GYLTGRFLQRRGK 243
>gi|398341237|ref|ZP_10525940.1| hypothetical protein LkirsB1_18883 [Leptospira kirschneri serovar
Bim str. 1051]
Length = 383
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 147/340 (43%), Gaps = 78/340 (22%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFFL
Sbjct: 2 ENKSTQNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFL 61
Query: 103 FIVGVSLALTY--KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
F VG+S+ L+ KN K +R++ L L+G+FL F G +++
Sbjct: 62 FAVGISIQLSVYSKNKIHKSKIWFGICIRSITLILIGLFLN---FFG------EWSFSEL 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R GVLQRI Y + A + L R W+ L++ T L+
Sbjct: 113 RIPGVLQRIGFVYWIVASLHLILPK-----------RMILISWIPILLVHTWVLI----- 156
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
Q P E+ P + IDR + G HL++ +S+T
Sbjct: 157 -----QIPAPGESIV--------------YLEPGKDIGAWIDRNVFGENHLWK---FSKT 194
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDR 339
+DPEGL S + + T L+G+ G ++ + + +
Sbjct: 195 ------------------------WDPEGLFSGISSIATSLLGVFCGSILSSKTNEIKKQ 230
Query: 340 MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+L+ L+ +GL D + +NK+L++ SY TAG
Sbjct: 231 ILSIFGFGILLVLVGLLWD-QNLPMNKSLWTGSYVIYTAG 269
>gi|256420508|ref|YP_003121161.1| hypothetical protein Cpin_1463 [Chitinophaga pinensis DSM 2588]
gi|256035416|gb|ACU58960.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
Length = 374
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 97/355 (27%), Positives = 146/355 (41%), Gaps = 76/355 (21%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILP--AINHSPWNGLTLADFVMPFFLF 103
T PQ R + LDVFRGLTV MI+V+ G +NH+ WNG T D V P FLF
Sbjct: 3 NTTPQ----RFLPLDVFRGLTVCFMIIVNTPGWDTSYYILNHAQWNGCTPTDMVFPSFLF 58
Query: 104 IVGVSLALTYKNFPC--KVVATRKAILRALNLFLLGIFLQGGFF--HGINNLKYGVDIAQ 159
VG +++ + + F K R L +FLLG + F H + L++ + ++
Sbjct: 59 AVGNAMSFSMRKFQQLENTAVLSKIFRRTLLIFLLGFLMYWLPFVRHTESGLEF-IPLSD 117
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
R +GVLQRIA+ Y A+L +L + W V+ VL Y ++Y
Sbjct: 118 TRILGVLQRIALCYCFASLLIHYLPK--------------KAVWAVSAVLLLGYWAVMYA 163
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
P +Y TG NA D+ I+G HLY
Sbjct: 164 FGDPADRYSL---------------------TG---NAALHFDKLIMGDSHLYHGE---- 195
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
FDPEGLLS++ A V + G + G I
Sbjct: 196 ----------------------GIAFDPEGLLSTLPAIVNVIAGYYTGLFIQQEGKTGKG 233
Query: 340 MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+ + + + LI + L + +NK L++ SY T G ++L+ + F++ +
Sbjct: 234 LRKLLQMGALLILVALVWS-MAFPINKKLWTSSYVLYTVGIDLLILSLLIFVIDF 287
>gi|423089801|ref|ZP_17078150.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
difficile 70-100-2010]
gi|357557565|gb|EHJ39099.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
difficile 70-100-2010]
Length = 391
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 84/349 (24%), Positives = 152/349 (43%), Gaps = 84/349 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 22 MNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 81
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 82 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLDTVRILG 132
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL--LLLYGLYV 222
VLQR+ + Y V +L + LK S+ A ++T LY+ ++ G Y+
Sbjct: 133 VLQRMGLVYFVTSLAYLLLKKLNVGST--------------ATIITFLYIATFIIVGYYI 178
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
Y F +E S + ++D H ++ +Y
Sbjct: 179 LAKPYGFELEGSLAQ----------------------LVD------LHFFKGHLY----- 205
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
+ F+P+G L+S++A + ++G G +++ +
Sbjct: 206 -------------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFF 246
Query: 343 WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
I++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 247 KILVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGILLSIFYFI 295
>gi|404450663|ref|ZP_11015643.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
gi|403763718|gb|EJZ24662.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
Length = 381
Score = 85.1 bits (209), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 51/141 (36%), Positives = 82/141 (58%), Gaps = 6/141 (4%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R ++LDV RGLT+ALMI+V+ G + H+PW+G T+ D V P FLF+VG ++
Sbjct: 12 KERYLALDVLRGLTIALMIVVNTPGSWSHMYGPFMHAPWHGFTITDLVFPTFLFVVGNAM 71
Query: 110 ALTYKNFP--CKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYGVDIAQIRWMGVL 166
+ + K + + RK + R+ +F++G L FF N ++ ++R +GVL
Sbjct: 72 SFSMKKLEKMGQGLFLRKVLKRSFLIFIIGWGLNAFPFFDQTENGLAMINWGEVRLLGVL 131
Query: 167 QRIAIAYLVAALCEIWLKGDG 187
QRIA+ YL+A+L +L G
Sbjct: 132 QRIALCYLIASLVLYYLSRSG 152
>gi|192359631|ref|YP_001981658.1| hypothetical protein CJA_1162 [Cellvibrio japonicus Ueda107]
gi|190685796|gb|ACE83474.1| putative membrane protein [Cellvibrio japonicus Ueda107]
Length = 399
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 55/131 (41%), Positives = 77/131 (58%), Gaps = 13/131 (9%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++R ++LDV RGLT+ALMILV+ G + + H+ W+G+T DFV PFFLFIVG ++
Sbjct: 36 KQRFLALDVMRGLTLALMILVNTPGSWSHVYGPLLHADWHGVTPTDFVFPFFLFIVGSAM 95
Query: 110 ALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ + ++ RK R L LF++GI L + D+ R MGVLQ
Sbjct: 96 YFSVRGLAQLSLSQQLRKVGRRVLLLFVMGILLAA--------YPFTADVHDWRIMGVLQ 147
Query: 168 RIAIAYLVAAL 178
RIA+AY VAAL
Sbjct: 148 RIALAYGVAAL 158
>gi|421109691|ref|ZP_15570204.1| putative membrane protein [Leptospira kirschneri str. H2]
gi|410005185|gb|EKO58983.1| putative membrane protein [Leptospira kirschneri str. H2]
Length = 383
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 147/340 (43%), Gaps = 78/340 (22%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFFL
Sbjct: 2 ENKSTQNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFL 61
Query: 103 FIVGVSLALTY--KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
F VG+S+ L+ KN K +R++ L L+G+FL F G +++
Sbjct: 62 FAVGISIQLSVYSKNKIYKSKIWFGICIRSITLILIGLFLN---FFG------EWSFSEL 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R GVLQRI Y + A + L R W+ L++ T L+
Sbjct: 113 RIPGVLQRIGFVYWIVASLHLILPK-----------RMILISWIPILLVHTWVLI----- 156
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
Q P E+ P + IDR + G HL++ +S+T
Sbjct: 157 -----QIPPPGESIV--------------YLEPGKDIGAWIDRNVFGENHLWK---FSKT 194
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDR 339
+DPEGL S + + T L+G+ G ++ + + +
Sbjct: 195 ------------------------WDPEGLFSGISSIATSLLGVFCGSILSSKTNEIKKQ 230
Query: 340 MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+L+ L+ +GL D + +NK+L++ SY TAG
Sbjct: 231 ILSIFGFGILLVLVGLLWD-QNLPMNKSLWTGSYVIYTAG 269
>gi|418694540|ref|ZP_13255577.1| putative membrane protein [Leptospira kirschneri str. H1]
gi|409957715|gb|EKO16619.1| putative membrane protein [Leptospira kirschneri str. H1]
Length = 383
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 93/340 (27%), Positives = 147/340 (43%), Gaps = 78/340 (22%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFFL
Sbjct: 2 ENKSTQNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFL 61
Query: 103 FIVGVSLALTY--KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
F VG+S+ L+ KN K +R++ L L+G+FL F G +++
Sbjct: 62 FAVGISIQLSVYSKNKIYKSKIWFGICIRSITLILIGLFLN---FFG------EWSFSEL 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R GVLQRI Y + A + L R W+ L++ T L+
Sbjct: 113 RIPGVLQRIGFVYWIVASLHLILPK-----------RMILISWIPILLVHTWVLI----- 156
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
Q P E+ P + IDR + G HL++ +S+T
Sbjct: 157 -----QIPPPGESIV--------------YLEPGKDIGAWIDRNVFGENHLWK---FSKT 194
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDR 339
+DPEGL S + + T L+G+ G ++ + + +
Sbjct: 195 ------------------------WDPEGLFSGISSIATSLLGVFCGSILSSKTNEIKKQ 230
Query: 340 MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+L+ L+ +GL D + +NK+L++ SY TAG
Sbjct: 231 ILSIFGFGILLVLVGLLWD-QNLPMNKSLWTGSYVIYTAG 269
>gi|404256028|ref|ZP_10959996.1| hypothetical protein SPAM266_22806 [Sphingomonas sp. PAMC 26621]
Length = 392
Score = 84.7 bits (208), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 103/364 (28%), Positives = 162/364 (44%), Gaps = 66/364 (18%)
Query: 57 ISLDVFRGLTVALMILV---DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY 113
++LDV RGL VA MILV D G + H+ WNG TLAD V P FLF VG++L L+
Sbjct: 1 MALDVLRGLAVAGMILVVSPGDWGQAYVQLQHANWNGATLADMVFPTFLFSVGIALGLS- 59
Query: 114 KNFPCKVVAT-------RKAILRALNLFLLGIFLQGGFFHGINN---LKYGVDIAQIRWM 163
FP ++ + + R L LLG+ ++ + I G +A IR
Sbjct: 60 --FPRRLETAGDRRLFWTRLLRRTALLILLGLLVEATYVWTIAAGAPYPGGPGLAHIRIP 117
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
G+LQRI + Y +A + + L + + + ++L +LLL++
Sbjct: 118 GILQRIGLCYGLAGI--LLLATNRRDPDGMIRINPLAIVGCIVVILIGYWLLLIF----- 170
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
PV + G PA N G +DR + HL+ P+ S T
Sbjct: 171 -----VPVPGFGA------------GVLTPAGNLPGFVDRTLFTEPHLW--PLGSATAAR 211
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
A +DPEGLLS++ AT L G+ + + DR L +
Sbjct: 212 P------------------ATYDPEGLLSTLPATANVLFGILSAW---ALRRYPDRALGY 250
Query: 344 I-ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLK 402
I ++++ L GL+LD + + +NK +++ S+ L+ G S + L + ++R + LML
Sbjct: 251 IAVVAALLFSAGLALDPL-LVINKRIWTSSFAVLSGGVSALALTALMVVLRSRGAALML- 308
Query: 403 KPFD 406
PF
Sbjct: 309 TPFQ 312
>gi|320105553|ref|YP_004181143.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
gi|319924074|gb|ADV81149.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
Length = 412
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 93/349 (26%), Positives = 140/349 (40%), Gaps = 88/349 (25%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDD---VGGILPAINHSPWNGLTLADFVMPFFLFI 104
R H+ RL+SLDV RG+T+ MILV++ G P + H+ WNG T D V P FL +
Sbjct: 23 RTTHKPARLLSLDVLRGVTIGFMILVNNQTGEGAFFP-LQHAKWNGFTPTDLVFPTFLLL 81
Query: 105 VGVSLALTYKNFPCKVVATRKAILRALN----LFLLGIFLQGG-FFHGINNLKYGVDIAQ 159
VG+S L+ + + VA L L LFL G+ + FFH +
Sbjct: 82 VGLSTVLSTEARLARGVAKSTIFLHTLQRSAVLFLFGLIVNNAPFFH----------LQT 131
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKY--RGHWVVALVLTTLYLLLL 217
+R GVL RIA+ Y + + ++ + L+ G+W + + + +
Sbjct: 132 LRVYGVLPRIAVCYFIVGSLYLLVRDLKQRAFILAAAAAACLVGYWAL------MRFIPI 185
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
G P +E P+ P N V IDR I HLY K
Sbjct: 186 PGFGTP--THEIPIN-------------------DPDGNLVAYIDRHIFSASHLYEK--- 221
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
+R DPEGLLS++ A T L G+ G + + R
Sbjct: 222 TR--------------------------DPEGLLSTIPAVATALFGILAG---IWLRTSR 252
Query: 338 DRM-----LNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGAS 381
M + + +S ++G L F +NK L++ S++ G S
Sbjct: 253 STMQKAKGIEYAGISFLILGGAWHLAF---PINKKLWTSSFSLWAGGWS 298
>gi|388258355|ref|ZP_10135531.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
gi|387937867|gb|EIK44422.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
Length = 362
Score = 84.7 bits (208), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/334 (26%), Positives = 144/334 (43%), Gaps = 81/334 (24%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++R +LDV RGLT+ALMILV+ G + + H+ W+G T DFV PFF+FIVG S+
Sbjct: 3 KQRFQALDVMRGLTLALMILVNTPGSWSFVYGPLLHADWHGATATDFVFPFFMFIVGSSM 62
Query: 110 ALTYKNFP--CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ +K + R + LF++G+ L F NN I R MGVLQ
Sbjct: 63 YFAMRGLRQLAPAAQAQKILRRVVLLFVIGVLLSAYPFT--NN------IENWRVMGVLQ 114
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIAIAY AA ++ G V V++ + LL +GL
Sbjct: 115 RIAIAYGFAAFIILYFGFTGRV------------------VMSAILLLGYWGLL------ 150
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
++ P+ N V D +LG HL++
Sbjct: 151 -----NIAADPYSLE------------HNLVRQFDLAVLGANHLWQGK------------ 181
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
FDPEG+LS+V + V +IG +++ +D + + L+ + ++
Sbjct: 182 --------------GLAFDPEGILSTVPSIVNVIIGFEATRVLLASED-KAKALSQLFVA 226
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGAS 381
+ L+ + + +NK+L++ S+ LT G +
Sbjct: 227 ALLLIGLALVWNLLFPINKSLWTSSFVVLTCGVA 260
>gi|296877751|ref|ZP_06901777.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
difficile NAP07]
gi|296431202|gb|EFH17023.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
difficile NAP07]
Length = 370
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 84/347 (24%), Positives = 149/347 (42%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 1 MNSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 60
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 61 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLNTVRILG 111
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK SS V + + ++ G Y+
Sbjct: 112 VLQRMGLVYFVTSLVYLLLKKLNVRSS------------VTIITFLCISAFIIVGYYIVA 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 160 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 184
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
PD F+P+G L+S++A + ++G G +++ + I
Sbjct: 185 --KPD---------------FEPDGFLTSIVAISSGMLGCMMGCVLLKENIGEYKKFFKI 227
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 228 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGILLSIFYFI 274
>gi|300770061|ref|ZP_07079940.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300762537|gb|EFK59354.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 404
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 89/384 (23%), Positives = 154/384 (40%), Gaps = 98/384 (25%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++R SLDVFRG TVALMI+V++ G + + H+ W+G T D V PFFLF VG ++
Sbjct: 2 KQRYYSLDVFRGATVALMIMVNNPGSWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61
Query: 110 ALTYKNF--PCKVVATRKAILRALNLFLLGIFLQGGFFHG-------INNLKYGVD-IAQ 159
+ V +K + R + +FL+G+F+ F Y D +
Sbjct: 62 SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDSMRG 121
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
+R +GVLQRIA+AY A++ + FR+ W+ +L
Sbjct: 122 VRILGVLQRIALAYCFASIIAYY-------------FREKAIIWISTFILV--------- 159
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMI----DRKILGIQHLYRKP 275
++ C G+ G + G D +ILG+ H+Y+
Sbjct: 160 --------------------VYWAVCAFLGTPGDPYSLQGWFGTAYDIQILGVAHVYKGE 199
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH--- 332
PFDPEGL+S++ A V ++G G I
Sbjct: 200 --------------------------GVPFDPEGLMSTLPAIVQVVLGYLAGTYIKKQGQ 233
Query: 333 ----------FKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASG 382
++ ++L+ + ++ ++ + + +G +NK +++ SY T G
Sbjct: 234 VDWLWKKVPASQEPHFKLLSGLFVTGFILVVLAWVWSLGFPINKKIWTSSYVLYTTGLGV 293
Query: 383 VLLAGIYFMVRYISSHLMLKKPFD 406
+ + G+ + + L + FD
Sbjct: 294 MTIGGMIWFIEVQGVKNSLTRFFD 317
>gi|406662851|ref|ZP_11070935.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
gi|405553158|gb|EKB48438.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
Length = 382
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 92/340 (27%), Positives = 154/340 (45%), Gaps = 88/340 (25%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R ++LDV RGLT+ALM++V+ G + H+ W+G T+ D + P FLF+VG ++
Sbjct: 12 KERYLALDVLRGLTIALMVVVNTPGSWSHMYAPFMHADWHGFTITDLIFPTFLFVVGNAM 71
Query: 110 ALTYKNFPC--KVVATRKAILRALNLFLLGIFLQGGFFHGIN-NLKYG---VDIAQIRWM 163
+ + K + V +K R L +FL+G L F +N N + G ++ +++R +
Sbjct: 72 SFSMKKLESMGQQVFLKKVFKRTLLIFLIGWLLNA--FPFVNYNPESGYSMINWSEVRLL 129
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGH-VSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYV 222
GVLQRIA+ Y++AAL +L G + S +L G+W+V + L+G
Sbjct: 130 GVLQRIALCYMLAALILYYLGKRGALIYSAATLL----GYWLV---------MYLFG--- 173
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
++ P+ TG NA +D ++G ++LY
Sbjct: 174 -----------NADDPY---------SLTG---NAALKLDLWLIGAKNLYMGE------- 203
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR---DR 339
PFDPEG+LS+ + V + G G I ++ +
Sbjct: 204 -------------------GIPFDPEGILSTFPSVVNVIAGFLVGKFIQDSGNNTGTVKK 244
Query: 340 MLNW--IILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
M+ W I+L +CLI D + +NK +++ Y LT
Sbjct: 245 MVIWGIILLVACLI-----WD-MAFPINKKIWTSPYVLLT 278
>gi|148359197|ref|YP_001250404.1| hypothetical protein LPC_1091 [Legionella pneumophila str. Corby]
gi|296107241|ref|YP_003618941.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
Alcoy]
gi|148280970|gb|ABQ55058.1| conserved hypothetical protein [Legionella pneumophila str. Corby]
gi|295649142|gb|ADG24989.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
Alcoy]
Length = 372
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 142/356 (39%), Gaps = 84/356 (23%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMPF 100
N+ +PQ RL+SLDVFRG+T+ LMI+V+ I P H WNG TLAD V PF
Sbjct: 2 NTVPLKPQ----RLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPF 57
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
FLFIVG++ ++ KN RKA + + + +N + ++ I
Sbjct: 58 FLFIVGLTSVISLKN-----QMERKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSI 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R G+LQRIA+ YL++A + + S K F + LVL Y +++ +
Sbjct: 113 RIYGILQRIAVCYLISAFIYL------NTSIKTQFF--------IFLVLLLGYWIIMTQV 158
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
VP + + + W V D+ HLY K
Sbjct: 159 PVPGYGAN---QLTKDGSW------------------VSYFDQLFFSASHLYEK------ 191
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV----HFKDH 336
+DPEG LS+ + T L G+ G L++ FK
Sbjct: 192 -----------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF 228
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ +S +NK L++ SY T+G + + A Y ++
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFP-----INKNLWTSSYVLWTSGLALLAFAFCYLLI 279
>gi|282877735|ref|ZP_06286550.1| putative membrane protein [Prevotella buccalis ATCC 35310]
gi|281300307|gb|EFA92661.1| putative membrane protein [Prevotella buccalis ATCC 35310]
Length = 403
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 99/377 (26%), Positives = 156/377 (41%), Gaps = 110/377 (29%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
Q +R++++D+ RG+T+A MILV++ G I + H+ WNG+T D V PFF+F++G+
Sbjct: 8 QSKRILAIDILRGITIAGMILVNNPGSWAHIFAPLEHAEWNGMTPTDLVFPFFMFVMGMC 67
Query: 109 LALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHG--------INNLKYGVDIA 158
+ ++ + F C K I R L L+L+GIF+ G+F + + G I
Sbjct: 68 IFISMQKYQFACNRQTVYKIIRRTLLLYLVGIFV--GWFSRFCYRWAFPLEDATLGQQIW 125
Query: 159 Q-------IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
IR GVL R+AI Y + AL I R +++++V+T
Sbjct: 126 HTVWSFDTIRLSGVLARLAICYGITALLAI----------------TVRHRYLLSIVITL 169
Query: 212 L--YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
L Y +LL+ CG G N + ++DR +L
Sbjct: 170 LIGYTILLF--------------------------CG-NGFAYDETNILSIVDRAVLTDA 202
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y D G DPEGLLS+ A LIG G L
Sbjct: 203 HMYH---------------DNG-------------IDPEGLLSTFPAIAHTLIGFLIGKL 234
Query: 330 ------IVHFKDHRD--------RMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTC 375
V D ++ I + L GL L + G LNK ++S ++
Sbjct: 235 AFSKQKTVTATTQNDPKTGLILHNIVPLFIAGTILTFSGLLLAY-GCPLNKKIWSPTFVL 293
Query: 376 LTAGASGVLLAGIYFMV 392
+ G + LLA + +++
Sbjct: 294 TSCGLASTLLALLIWLI 310
>gi|227538516|ref|ZP_03968565.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
gi|227241435|gb|EEI91450.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
Length = 404
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 89/384 (23%), Positives = 154/384 (40%), Gaps = 98/384 (25%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++R SLDVFRG TVALMI+V++ G + + H+ W+G T D V PFFLF VG ++
Sbjct: 2 KQRYYSLDVFRGATVALMIMVNNPGSWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61
Query: 110 ALTYKNF--PCKVVATRKAILRALNLFLLGIFLQGGFFHG-------INNLKYGVD-IAQ 159
+ V +K + R + +FL+G+F+ F Y D +
Sbjct: 62 SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDPMRG 121
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
+R +GVLQRIA+AY A++ + FR+ W+ +L
Sbjct: 122 VRILGVLQRIALAYCFASIIAYY-------------FREKAIIWISTFILV--------- 159
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMI----DRKILGIQHLYRKP 275
++ C G+ G + G D +ILG+ H+Y+
Sbjct: 160 --------------------VYWAVCAFLGTPGDPYSLQGWFGTAYDIQILGVAHVYKGE 199
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH--- 332
PFDPEGL+S++ A V ++G G I
Sbjct: 200 --------------------------GVPFDPEGLMSTLPAIVQVVLGYLAGTYIKKQGE 233
Query: 333 ----------FKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASG 382
++ ++L+ + ++ ++ + + +G +NK +++ SY T G
Sbjct: 234 VDWLWKKVPASQEPHFKLLSGLFVTGFILVVLAWVWSLGFPINKKIWTSSYVLYTTGLGI 293
Query: 383 VLLAGIYFMVRYISSHLMLKKPFD 406
+ + G+ + + L + FD
Sbjct: 294 MTIGGMIWFIEVQGVKNSLTRFFD 317
>gi|119583586|gb|EAW63182.1| hCG1993224, isoform CRA_a [Homo sapiens]
Length = 382
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 97/342 (28%), Positives = 149/342 (43%), Gaps = 64/342 (18%)
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRK 125
M+ V+ GG H+ WNGLT+AD V P+F+FI+G S+ L+ + K K
Sbjct: 1 MVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGK 60
Query: 126 AILRALNLFLLGIFLQGGFFHGINNLKYG---VDIAQIRWMGVLQRIAIAYLVAALCEIW 182
R+ L +GI I N Y + ++R GVLQR+ + Y V A+ E+
Sbjct: 61 IAWRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELL 112
Query: 183 LKG--DGHVSSKLSLFR-----KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSS 235
H +S+ S W++ LVL L+L L + L VP T
Sbjct: 113 FAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPG------CPTGY 166
Query: 236 SSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYG 291
P G G G P C A G IDR +LG HLY+ P +
Sbjct: 167 LGP-------GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSAVLYH--------- 210
Query: 292 PMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLI 351
+ +DPEG+L ++ + V +G+ G +++++K +L C++
Sbjct: 211 ---------TEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAWCCIL 261
Query: 352 GL-GLSLDFVGMH-----LNKALYSLSY-TCLTAGASGVLLA 386
GL ++L V + +NK L+SLSY T L++ A +LL
Sbjct: 262 GLISVALTKVSENEGFIPVNKNLWSLSYVTTLSSFAFFILLV 303
>gi|445498183|ref|ZP_21465038.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
gi|444788178|gb|ELX09726.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
Length = 370
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/339 (27%), Positives = 137/339 (40%), Gaps = 83/339 (24%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSP-----WNGLTLADFVMPFFL 102
+ QHQ+ + ++DV RGLTVALMI+V+ G PA ++P W+GLTL D V P F+
Sbjct: 2 KTQHQRSQ--AIDVLRGLTVALMIMVNMPG--TPATTYAPFLHAEWHGLTLTDLVFPTFM 57
Query: 103 FIVGVSLALTYKNFP--CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV-DIAQ 159
F+VG +L+ T + + + +K R +FL G + F + V ++
Sbjct: 58 FVVGTALSFTLEKYEGMGEAAVLKKIFTRTALIFLCGFLMYWYPFFSTDGGSLTVLPLSG 117
Query: 160 IRWMGVLQRIAIAYLVAALC-EIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLY 218
R GVLQRIA+ Y +L W + V + L+L G+W V +Y
Sbjct: 118 TRIFGVLQRIALGYCAGSLILHYWREKGALVFAVLALL----GYWTV-----------MY 162
Query: 219 GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYS 278
G G A NA +D +LG H+Y
Sbjct: 163 GF----------------------------GDYTLAGNAQRKLDLLVLGEAHMYHGE--- 191
Query: 279 RTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
FDPEG+LS++ + V L G G L+ +
Sbjct: 192 -----------------------GIAFDPEGILSTLPSIVNVLAGYFAGRLVRRLGASYE 228
Query: 339 RMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
+ + + L L L V LNK L++ SYT +T
Sbjct: 229 TVAKLAMSGAVLTVLALCWSSV-FPLNKKLWTSSYTLIT 266
>gi|374373358|ref|ZP_09631018.1| membrane protein [Niabella soli DSM 19437]
gi|373234331|gb|EHP54124.1| membrane protein [Niabella soli DSM 19437]
Length = 375
Score = 84.3 bits (207), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 97/347 (27%), Positives = 144/347 (41%), Gaps = 77/347 (22%)
Query: 55 RLISLDVFRGLTVALMILVDDVG-GILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY 113
R ++LD+FRG+T+ MI+V+ G P + H+ WNG TL D V P FLF VG ++A +
Sbjct: 11 RFLALDIFRGMTICFMIIVNTGGPNPFPELRHAQWNGFTLTDLVFPSFLFAVGNAIAFSK 70
Query: 114 KNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGI---NNLKYGVDIAQIRWMGVLQ 167
+ K V T K I R LFL+G + F I NN++ I + R GVLQ
Sbjct: 71 SKWDQQSNKEVLT-KIIKRTCLLFLIGYLMYWLPFVKIDAQNNIR-PFPIGETRIFGVLQ 128
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIA+ Y G G + + + R +++T++ LLL Y
Sbjct: 129 RIALCY-----------GIGALIIRFASAR--------TIIITSICLLLGY--------- 160
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
W + G G A +D + HLY K
Sbjct: 161 -----------WFIMMAFGDYTVNGAAETK---LDILLFTRDHLYIKD------------ 194
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
P A FDPEG LS+ A V LIG G I + + +
Sbjct: 195 ------PARA-------FDPEGFLSTFPAIVNVLIGYLAGIHIRKNPKSYEMLARLAVAG 241
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
L+ LG + + +NK L++ S+ CLT G +++A I + + +
Sbjct: 242 FLLVALGYLWN-LAFPVNKKLWTSSFVCLTTGLDCLIIATILYFIDF 287
>gi|397667386|ref|YP_006508923.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
[Legionella pneumophila subsp. pneumophila]
gi|395130797|emb|CCD09044.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
[Legionella pneumophila subsp. pneumophila]
Length = 372
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 141/356 (39%), Gaps = 84/356 (23%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMPF 100
N+ +PQ RL+SLDVFRG+T+ LMI V+ I P H WNG TLAD V PF
Sbjct: 2 NTVPLKPQ----RLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPF 57
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
FLFIVG++ ++ KN RKA + + + +N + ++ I
Sbjct: 58 FLFIVGLTSVISLKN-----QMERKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSI 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R G+LQRIA+ YL++A + + S K F + LVL Y +++ +
Sbjct: 113 RIYGILQRIAVCYLISAFIYL------NTSIKTQFF--------IFLVLLLGYWIIMTQV 158
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
VP + + + W V D+ HLY K
Sbjct: 159 PVPGYGAN---QLTKDGSW------------------VSYFDQLFFSASHLYEK------ 191
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV----HFKDH 336
+DPEG LS+ + T L G+ G L++ FK
Sbjct: 192 -----------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF 228
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ +S +NK L++ SY T+G + + A Y ++
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFP-----INKNLWTSSYVLWTSGLALLAFAFCYLLI 279
>gi|397664114|ref|YP_006505652.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
[Legionella pneumophila subsp. pneumophila]
gi|395127525|emb|CCD05722.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
[Legionella pneumophila subsp. pneumophila]
Length = 372
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 141/356 (39%), Gaps = 84/356 (23%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMPF 100
N+ +PQ RL+SLDVFRG+T+ LMI V+ I P H WNG TLAD V PF
Sbjct: 2 NTVPLKPQ----RLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPF 57
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
FLFIVG++ ++ KN RKA + + + +N + ++ I
Sbjct: 58 FLFIVGLTSVISLKN-----QMERKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSI 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R G+LQRIA+ YL++A + + S K F + LVL Y +++ +
Sbjct: 113 RIYGILQRIAVCYLISAFIYL------NTSIKTQFF--------IFLVLLLGYWIIMTQV 158
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
VP + + + W V D+ HLY K
Sbjct: 159 PVPGYGAN---QLTKDGSW------------------VSYFDQLFFSASHLYEK------ 191
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV----HFKDH 336
+DPEG LS+ + T L G+ G L++ FK
Sbjct: 192 -----------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF 228
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ +S +NK L++ SY T+G + + A Y ++
Sbjct: 229 YLLAGGGLLFLLLGWLWNMSFP-----INKNLWTSSYVLWTSGLALLAFAFCYLLI 279
>gi|255307823|ref|ZP_05351994.1| hypothetical protein CdifA_14636 [Clostridium difficile ATCC 43255]
Length = 483
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 152/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 114 MNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 173
Query: 109 LALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ + +T IL R++ L L G FL N D+ +R +G
Sbjct: 174 IPISINSKIKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLDTVRILG 224
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 225 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCISTFIIVGYYILA 272
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 273 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 297
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 298 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKI 340
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 341 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGILLSIFYFI 387
>gi|255101955|ref|ZP_05330932.1| hypothetical protein CdifQCD-6_14161 [Clostridium difficile
QCD-63q42]
Length = 469
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 152/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 100 MNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 159
Query: 109 LALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ + +T IL R++ L L G FL N D+ +R +G
Sbjct: 160 IPISINSKIKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLDTVRILG 210
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 211 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCISTFIIVGYYILA 258
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 259 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 283
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 284 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENLGEYKKFFKI 326
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 327 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGILLSIFYFI 373
>gi|340618131|ref|YP_004736584.1| hypothetical protein zobellia_2146 [Zobellia galactanivorans]
gi|339732928|emb|CAZ96303.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
Length = 367
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/364 (25%), Positives = 148/364 (40%), Gaps = 82/364 (22%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R +LDVFRGLT+ LMI+V+ G + H+ W+G T D V P FLF VG +
Sbjct: 1 MKRFKALDVFRGLTICLMIIVNTPGDWDMTFSPLLHAKWHGFTPTDLVFPSFLFAVGNAF 60
Query: 110 ALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINNLK--YGVDIAQIRWMGV 165
A + K ++ +K R L +FLLG + F V ++ R +GV
Sbjct: 61 AFVKTKWADKPLSDIFKKIAKRTLIIFLLGYTMYWIPFFSWTETGDLAAVPFSETRILGV 120
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQRIA+ Y + A+ +L + + + Y G LL +G Y +
Sbjct: 121 LQRIALCYFIGAIMIYFLTNRQLIIASAVILLGYWG------------LLSAFGDYTLE- 167
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
N V IDR +LG HLY
Sbjct: 168 -----------------------------GNFVRTIDRMLLGDSHLYMGN---------- 188
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
PFDPEGLLS++ + L G G I+ +++ ++
Sbjct: 189 ----------------GIPFDPEGLLSTLPSICNVLGGYLVGKYIIDKGIDYEKLAKMLL 232
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
+ + L+ + D + +NK L++ S+ LT G V+L+ + + + + LK+P
Sbjct: 233 VGAGLLVVAYLWD-LTFPVNKKLWTSSFVVLTVGLDIVVLSVLIYTIEF------LKRPI 285
Query: 406 DYSY 409
+Y++
Sbjct: 286 NYNF 289
>gi|423081105|ref|ZP_17069717.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
002-P50-2011]
gi|423085023|ref|ZP_17073481.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
050-P50-2011]
gi|357550878|gb|EHJ32683.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
050-P50-2011]
gi|357551414|gb|EHJ33204.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
002-P50-2011]
Length = 427
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 152/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 58 MNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 117
Query: 109 LALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ + +T IL R++ L L G FL N D+ +R +G
Sbjct: 118 IPISINSKIKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLDTVRILG 168
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 169 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCISTFIIVGYYILA 216
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 217 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 241
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 242 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKEDIGEYKKFFKI 284
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 285 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGILLSIFYFI 331
>gi|289667572|ref|ZP_06488647.1| hypothetical protein XcampmN_03447, partial [Xanthomonas campestris
pv. musacearum NCPPB 4381]
Length = 298
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 121/287 (42%), Gaps = 76/287 (26%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVG-- 106
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG
Sbjct: 22 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81
Query: 107 VSLALTYKNFPCKVVA--TRKAILRALNLFLLGIFLQGGFFHGINNLKYG-VDIAQIRWM 163
+S AL P + + +++A+L L L+ F FFH + + + Q+R
Sbjct: 82 MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWF---PFFHLQPDGGWAFTTVDQLRLT 138
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
GVLQRI + YL AAL +L G + ++L G+W LLY P
Sbjct: 139 GVLQRIGLCYLAAALLVRYLPPRGIAPACVALL---LGYWA-----------LLYVFGQP 184
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
VE S + NA +D + G HLYRK
Sbjct: 185 G------VELSKTG------------------NAGTRLDLWLYGRAHLYRK--------- 211
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL ++ ATV L G G +
Sbjct: 212 ------------------DGGFDPEGLLGTLSATVNVLAGYLCGRFL 240
>gi|52841889|ref|YP_095688.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
pneumophila str. Philadelphia 1]
gi|378777523|ref|YP_005185961.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
pneumophila ATCC 43290]
gi|52629000|gb|AAU27741.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
pneumophila str. Philadelphia 1]
gi|364508338|gb|AEW51862.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
pneumophila ATCC 43290]
Length = 372
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 92/356 (25%), Positives = 141/356 (39%), Gaps = 84/356 (23%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMPF 100
N+ +PQ RL+SLDVFRG+T+ LMI V+ I P H WNG TLAD V PF
Sbjct: 2 NTVPLKPQ----RLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPF 57
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
FLFIVG++ ++ KN RKA + + + +N + ++ I
Sbjct: 58 FLFIVGLTSVISLKN-----QMERKAKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSI 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R G+LQRIA+ YL++A + + S K F + LVL Y +++ +
Sbjct: 113 RIYGILQRIAVCYLISAFIYL------NTSIKTQFF--------IFLVLLLGYWIIMTQV 158
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
VP + + + W V D+ HLY K
Sbjct: 159 PVPGYGAN---QLTKDGSW------------------VSYFDQLFFSASHLYEK------ 191
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV----HFKDH 336
+DPEG LS+ + T L G+ G L++ FK
Sbjct: 192 -----------------------TYDPEGFLSTFTSIATTLSGVLAGSLLISPCNQFKKF 228
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ +S +NK L++ SY T+G + + A Y ++
Sbjct: 229 YLLAGIGLLFLLLGWLWNMSFP-----INKNLWTSSYVLWTSGLALLAFAFCYLLI 279
>gi|427781073|gb|JAA55988.1| Putative heparan-alpha-glucosaminide n-acetyltransferase
[Rhipicephalus pulchellus]
Length = 337
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 84/301 (27%), Positives = 132/301 (43%), Gaps = 49/301 (16%)
Query: 105 VGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
+GVSLA+T ++ K V + L R+L LF LGI N L VD+ +
Sbjct: 1 MGVSLAMTIRSLLRKSVTRGRIFLQIVKRSLILFGLGIM--------TNTLSGDVDLNTL 52
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGH----VSSKLSLFRKYRGHWVVALVLTTLYLLL 216
R GVLQR+A +YLVAA + L H V + + Y W++A+ + L+L L
Sbjct: 53 RIPGVLQRLAFSYLVAATVHL-LFAKPHEGQLVWAPVRDVLAYWPEWLLAIPMLALHLAL 111
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
+ L VP+ P ++ TG A G IDR+I G H+Y+ P
Sbjct: 112 TFFLPVPN------CPQGYLGPGGLHLNSSFENCTG---GAAGFIDRRIFGNSHIYQTPD 162
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
+ P+DPEG L + + +GL G +++ F +
Sbjct: 163 MRHVYDTHL------------------PYDPEGTLGCLTSIFLVFLGLQAGKILLTFPEW 204
Query: 337 RDRMLNWIILS-SCLIGLGLSLDFVGMH----LNKALYSLSYTCLTAGASGVLLAGIYFM 391
+ R++ W I C I G+ +F +NK L+S+S+ TA + LL +Y++
Sbjct: 205 KARVIRWCIWGLLCGIIAGVLCNFSKEEGWIPINKNLWSVSFILSTASTAFFLLVVLYYL 264
Query: 392 V 392
+
Sbjct: 265 I 265
>gi|392390355|ref|YP_006426958.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
DSM 15997]
gi|390521433|gb|AFL97164.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
DSM 15997]
Length = 390
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 93/377 (24%), Positives = 150/377 (39%), Gaps = 96/377 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R SLDVFRG TVALMILV++ G + + H+ W G T D V PFFLF VG ++A
Sbjct: 4 RYYSLDVFRGATVALMILVNNPGSWSAMFKPLTHAEWAGCTPTDLVFPFFLFAVGNAMAF 63
Query: 112 TYKNF--PCKVVATRKAILRALNLFLLGIFLQGGFF----HGINNLKYGVDIAQIRWMGV 165
V RK + R +F++G+ L F GI K+ +R +GV
Sbjct: 64 VIPRMQKAGSQVFWRKVLKRTFLIFIIGLLLNWFPFVQWKDGILTFKHW---ENVRILGV 120
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQRIA AY AA+ + K +++ + LL++Y
Sbjct: 121 LQRIAFAYFFAAIIAYYFKEK-------------------KVLIISFLLLIVY------- 154
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
+ + + P+ G R +D ILG H+Y
Sbjct: 155 -WLLALLLGGADPYSMQGFWGTR------------VDLAILGESHMYHGE---------- 191
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI--------VHFKDHR 337
PFDPEG + ++ +T L+G G +I + + +
Sbjct: 192 ----------------GVPFDPEGFVGAISSTAQVLLGYLAGKIIMAQGEVNWLFVRAPK 235
Query: 338 DRMLNWIILSSCLIGLGL--------SLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
L++ +LS + G+ LDF + K ++S +Y T G + + ++ +
Sbjct: 236 TSELHYKVLSMLFVSAGILLVVAYVWQLDF---PIIKKIWSSTYVLYTTGLAIITISIMI 292
Query: 390 FMVRYISSHLMLKKPFD 406
+ + + + L + FD
Sbjct: 293 WFIEVLKAKNFLTQFFD 309
>gi|255093765|ref|ZP_05323243.1| hypothetical protein CdifC_14056 [Clostridium difficile CIP 107932]
Length = 505
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 150/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 136 TNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 195
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 196 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLNSVRILG 246
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 247 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCIATFIIVGYYILA 294
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 295 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 319
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 320 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKI 362
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 363 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGMLLSIFYFI 409
>gi|384362003|ref|YP_006199855.1| hypothetical protein CDBI1_13575 [Clostridium difficile BI1]
Length = 485
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 150/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 116 TNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 175
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 176 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLNSVRILG 226
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 227 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCIATFIIVGYYILA 274
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 275 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 299
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 300 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKI 342
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 343 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGMLLSIFYFI 389
>gi|255651295|ref|ZP_05398197.1| hypothetical protein CdifQCD_14003 [Clostridium difficile
QCD-37x79]
Length = 461
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 150/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 92 TNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 151
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 152 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLNSVRILG 202
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 203 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCIATFIIVGYYILA 250
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 251 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 275
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 276 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKI 318
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 319 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGMLLSIFYFI 365
>gi|255518179|ref|ZP_05385855.1| hypothetical protein CdifQCD-_13768 [Clostridium difficile
QCD-97b34]
Length = 469
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 150/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 100 TNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 159
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 160 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLNSVRILG 210
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 211 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCIATFIIVGYYILA 258
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 259 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 283
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 284 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKI 326
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 327 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGMLLSIFYFI 373
>gi|254976379|ref|ZP_05272851.1| hypothetical protein CdifQC_13741 [Clostridium difficile QCD-66c26]
Length = 459
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 150/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 90 TNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 149
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 150 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLNSVRILG 200
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 201 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCIATFIIVGYYILA 248
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 249 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 273
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 274 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKI 316
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 317 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGMLLSIFYFI 363
>gi|289663929|ref|ZP_06485510.1| hypothetical protein XcampvN_12875 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 392
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 90/287 (31%), Positives = 121/287 (42%), Gaps = 76/287 (26%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVG-- 106
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG
Sbjct: 22 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81
Query: 107 VSLALTYKNFPCKVVA--TRKAILRALNLFLLGIFLQGGFFHGINNLKYG-VDIAQIRWM 163
+S AL P + + +++A+L L L+ F FFH + + + Q+R
Sbjct: 82 MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWF---PFFHLQPDGGWAFTTVDQLRLT 138
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
GVLQRI + YL AAL +L G + ++L G+W LLY P
Sbjct: 139 GVLQRIGLCYLAAALLVRYLPPRGIAPACVALL---LGYWA-----------LLYVFGQP 184
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
VE S + NA +D + G HLYRK
Sbjct: 185 G------VELSKTG------------------NAGTRLDLWLYGRAHLYRK--------- 211
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL ++ ATV L G G +
Sbjct: 212 ------------------DGGFDPEGLLGTLSATVNVLAGYLCGRFL 240
>gi|319952891|ref|YP_004164158.1| heparan-alpha-glucosaminide n-acetyltransferase [Cellulophaga
algicola DSM 14237]
gi|319421551|gb|ADV48660.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
algicola DSM 14237]
Length = 363
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/346 (26%), Positives = 158/346 (45%), Gaps = 83/346 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ R++++D+FRG+T+ LMILV++ G + H+ W+G T D V PFFLFIVG S
Sbjct: 1 MKERIVAVDIFRGMTIVLMILVNNPGTWAAVYAPFLHADWHGYTPTDLVFPFFLFIVGTS 60
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQR 168
+ Y +K + R+L L LG+ L F + + D ++IR+ GVLQR
Sbjct: 61 IVFAYSTKKPTADTYKKIVSRSLKLIGLGLLLGA--FTLV--FPFVKDFSEIRFPGVLQR 116
Query: 169 IAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL---YVPDW 225
I + + + ++ LF + L+ T+++L+ Y L ++P
Sbjct: 117 IGVVFFITSI----------------LFLNFNWK---QLIGVTVFILIGYWLAMGFIP-- 155
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVG-MIDRKILGIQHLYRKPIYSRTKQCS 284
G+ + A N + +D ILG H+++
Sbjct: 156 ------------------VNGIASTFDRAPNNLANYVDLNILG-SHMWKD---------- 186
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
DY DPEGLLS++ A +CL+G+ G +++ + ++ ++ +
Sbjct: 187 ----DY---------------DPEGLLSTIPAIASCLLGVFTGKILLSKQQNKTTIVLGL 227
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
L ++G F +NKAL+S S+ +TAG + ++LA IY+
Sbjct: 228 GLILLIVGHAWHFIF---PINKALWSSSFVLVTAGWANLILALIYY 270
>gi|410028220|ref|ZP_11278056.1| hypothetical protein MaAK2_03415 [Marinilabilia sp. AK2]
Length = 382
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 91/371 (24%), Positives = 150/371 (40%), Gaps = 73/371 (19%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMP 99
N Q + R ++LDV RGLT+ALM++V+ G + H+ W+G T+ D + P
Sbjct: 2 NPAQNLGVPLKERYLALDVLRGLTIALMVVVNTPGSWSHMYAPFMHADWHGFTITDLIFP 61
Query: 100 FFLFIVGVSLALTYKNFPC--KVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYG-V 155
FLF+VG +++ + K + + +K R L +FL+G L FF+ Y +
Sbjct: 62 TFLFVVGNAMSFSMKRMESMGQSLFLKKVFKRTLLIFLIGWLLNAFPFFNYNAETGYSMI 121
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLL 215
+ +++R +GVLQRIA+ Y++AAL L G + + F G+W +
Sbjct: 122 NWSEVRLLGVLQRIALCYMLAALI---LYYFGKKGAFIYSFIALLGYWAI---------- 168
Query: 216 LLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
Y F P+ NA +D ++G +HLY
Sbjct: 169 ----------MYFF---GDGEDPY------------SLIGNAALKLDLWLIGAKHLYMGE 203
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
PFDPEG+LS+ + V + G G I +
Sbjct: 204 --------------------------GIPFDPEGVLSTFPSVVNVIAGFLVGKFIQESGN 237
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYI 395
+ + +I L+ L D V +NK +++ Y LT + L G+ +V +
Sbjct: 238 NTGTVKKLVIWGIILLVACLVWDMV-FPINKKIWTSPYVLLTISLD-LFLIGLLMLVIEV 295
Query: 396 SSHLMLKKPFD 406
PF+
Sbjct: 296 WQKRNWTYPFE 306
>gi|390989491|ref|ZP_10259788.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
str. LMG 859]
gi|372555760|emb|CCF66763.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
str. LMG 859]
Length = 388
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 86/284 (30%), Positives = 113/284 (39%), Gaps = 70/284 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVS 108
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG +
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYG-VDIAQIRWMGVL 166
++ + + RA + L G+ + FFH + + + Q+R GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI + YL AAL L + RG V L L Y LLY P
Sbjct: 138 QRIGLCYLAAAL--------------LVRYLPQRGIAPVCLALLLGYWALLYAFGQPG-- 181
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
E S + NA +D + G HLYRK
Sbjct: 182 ----AELSKTG------------------NAGTRLDLWLYGRDHLYRK------------ 207
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL ++ ATV L G G +
Sbjct: 208 ---------------DGGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|255315516|ref|ZP_05357099.1| hypothetical protein CdifQCD-7_14229, partial [Clostridium
difficile QCD-76w55]
Length = 381
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 150/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 12 TNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 71
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 72 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLNSVRILG 122
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 123 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCIATFIIVGYYILA 170
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 171 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 195
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 196 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKENIGEYKKFFKI 238
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 239 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGMLLSIFYFI 285
>gi|410941669|ref|ZP_11373463.1| putative membrane protein [Leptospira noguchii str. 2006001870]
gi|410783218|gb|EKR72215.1| putative membrane protein [Leptospira noguchii str. 2006001870]
Length = 381
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 94/344 (27%), Positives = 147/344 (42%), Gaps = 90/344 (26%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFI 104
+ Q R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFFLF+
Sbjct: 2 EKKSNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYTPLKHAKWNGCTPTDLVFPFFLFV 61
Query: 105 VGVSL--ALTYKNFPCKVVATRKAI---LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQ 159
VG S+ +L KN K+ ++ +R++ L L+G+FL F G ++
Sbjct: 62 VGTSIPFSLYSKN---KIYISKIWFGICIRSITLILIGLFLN---FFG------EWSFSK 109
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
+R G+LQRI Y WVVA +LYL+L
Sbjct: 110 LRIPGILQRIGFVY-----------------------------WVVA----SLYLMLPKR 136
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGST----GPACNAVGMIDRKILGIQHLYRKP 275
+ + W V T W+ + G + P + IDR + G HL++
Sbjct: 137 IILISWIPILIVHT-----WVL-IQIPPPGESIVYLEPGKDIGAWIDRNVFGENHLWK-- 188
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
+S+T +DPEG S + + T L+G+ G ++ +
Sbjct: 189 -FSKT------------------------WDPEGFFSGISSIATTLLGVFCGSILSSKTN 223
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
+ + I L+ L L + +NK+L++ SY TAG
Sbjct: 224 ETKKQILSIFGFGILLVLVGLLWDQNLPMNKSLWTGSYVIYTAG 267
>gi|126700401|ref|YP_001089298.1| membrane protein [Clostridium difficile 630]
gi|115251838|emb|CAJ69673.1| putative membrane protein [Clostridium difficile 630]
Length = 370
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/347 (23%), Positives = 150/347 (43%), Gaps = 80/347 (23%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R+ S+D+ RGL++ALMI+ ++ G + P + H+ W+G+TLADF PFF+ +GV+
Sbjct: 1 MNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVT 60
Query: 109 LALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+ ++ KN + R++ L L G FL N D+ +R +G
Sbjct: 61 IPISINSKLKNNKSTLSIILSIFKRSILLILFGFFL---------NYLGNPDLDTVRILG 111
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQR+ + Y V +L + LK KL++ G + + ++ G Y+
Sbjct: 112 VLQRMGLVYFVTSLVYLLLK-------KLNV-----GSTATIITFLCISTFIIVGYYILA 159
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
Y F +E S + ++D H ++ +Y
Sbjct: 160 KPYGFELEGSLAQ----------------------LVD------LHFFKGHLY------- 184
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWI 344
+ F+P+G L+S++A + ++G G +++ + I
Sbjct: 185 -----------------KPEFEPDGFLTSIVAISSGMLGCTMGCVLLKEDIGEYKKFFKI 227
Query: 345 ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ S ++ +G + NK L+S S+ L AG+ G+LL+ YF+
Sbjct: 228 LVMSIILLIGAFIFNQYFPFNKRLWSSSFVLLMAGSYGILLSIFYFI 274
>gi|418515336|ref|ZP_13081517.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|418520970|ref|ZP_13087016.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410702946|gb|EKQ61443.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410708055|gb|EKQ66504.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 388
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 114/284 (40%), Gaps = 70/284 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVS 108
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG +
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYG-VDIAQIRWMGVL 166
++ + + RA + L G+ + FFH + + + Q+R GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI + YL AAL +L G L+L G+W LLY P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCLALL---LGYWA-----------LLYAFGQPG-- 181
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
E S + NA +D + G HLYRK
Sbjct: 182 ----AELSKTG------------------NAGTRLDLWLYGRDHLYRK------------ 207
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL ++ ATV L G G +
Sbjct: 208 ---------------DGGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|381188372|ref|ZP_09895934.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
frigoris PS1]
gi|379650160|gb|EIA08733.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
frigoris PS1]
Length = 430
Score = 82.4 bits (202), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 97/401 (24%), Positives = 160/401 (39%), Gaps = 138/401 (34%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++R+ISLDV RG+T+ +M+LV++ G + + H+ WNG T D V PFF+F++G ++
Sbjct: 3 KQRIISLDVLRGITIMMMVLVNNPGSWDNVFAPLEHANWNGCTPTDLVFPFFIFVLGAAI 62
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGIN-------------------- 149
L + K + R+L + LG+FL GF+ I
Sbjct: 63 PLAILTKELNQQSFLKILTRSLRIISLGLFL--GFYGKIEIFNLVGYPLLISKLIITGIV 120
Query: 150 ------NLKYGVDIA---------------------QIRWMGVLQRIAIAYLVAALCEIW 182
N K + + ++R GVLQRI I Y +L ++
Sbjct: 121 AYILLGNFKQKIKFSLVLTLFFVFVFLAFSGITAYNEVRLPGVLQRIGIVYFFTSL--VY 178
Query: 183 LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFN 242
LK + +G ++ +L Y + + VPD+
Sbjct: 179 LKTE------------IKGQIIIIGLLLVGYWATMTLIPVPDF----------------- 209
Query: 243 VTCGVRGSTGPA-----CNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDA 297
GPA N G ID +L HL+ +S+T
Sbjct: 210 ---------GPANLNKGTNLAGWIDNLLLK-NHLWS---FSKT----------------- 239
Query: 298 PSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL 357
+DPEG+LS++ A + +IGL G L+ +++ L + G GL+L
Sbjct: 240 -------WDPEGILSTIPAIASGIIGLLVGQLLNSSLAKKEKGLK-------MFGAGLAL 285
Query: 358 DFVGM------HLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+ LNK+L++ S+ TAG + + LA Y+ +
Sbjct: 286 VISGLIWNEFFPLNKSLWTSSFVLYTAGFATLFLAAFYYAI 326
>gi|307610361|emb|CBW99930.1| hypothetical protein LPW_16871 [Legionella pneumophila 130b]
Length = 372
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 140/356 (39%), Gaps = 84/356 (23%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMPF 100
N+ +PQ RL+SLDVFRG+T+ LMI V+ I P H WNG TLAD V PF
Sbjct: 2 NTVPLKPQ----RLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPF 57
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
FLFIVG++ ++ KN RK + + + +N + ++ I
Sbjct: 58 FLFIVGLTSVISLKN-----QMERKEKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSI 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R G+LQRIA+ YL++A + + S K F + LVL Y +++ +
Sbjct: 113 RIYGILQRIAVCYLISAFIYL------NTSIKTQFF--------IFLVLLLGYWIIMTQV 158
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
VP + + + W V D+ HLY K
Sbjct: 159 PVPGYGAN---QLTKDGSW------------------VSYFDQLFFSASHLYEK------ 191
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV----HFKDH 336
+DPEG LS+ + T L G+ G L++ FK
Sbjct: 192 -----------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF 228
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ +S +NK L++ SY T+G + + A Y ++
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFP-----INKNLWTSSYVLWTSGLALLAFAFCYLLI 279
>gi|399028182|ref|ZP_10729485.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
gi|398074259|gb|EJL65410.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
Length = 423
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 96/354 (27%), Positives = 157/354 (44%), Gaps = 44/354 (12%)
Query: 53 QRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ RLISLDVFRGLT+ LM +V+ D G + P + H+ WNG T D V PFF+FI+GV++
Sbjct: 3 KERLISLDVFRGLTILLMTIVNNPGDWGHVFPPLLHAKWNGCTPTDLVFPFFIFIMGVAV 62
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
L + K ++R+L + LGIF FF I +G++ + ++ I
Sbjct: 63 PLAMPDKIYDDTTFNKILVRSLRMLCLGIFFN--FFEKIQ--LFGLEGIPLLIGRLIITI 118
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
A+ Y++ G+ SSKL + A + +YL L Y
Sbjct: 119 AVGYVLM----------GNFSSKLKN--------IFAFSILIIYLFLAYSEIEAYQDVRL 160
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILG---IQHLYRKP-IYSRTKQCSI 285
P + F V+ ++ G+ +LG + +L P I +
Sbjct: 161 PGVLQRIAVVYFVVSLLYLKTSQKTQIITGVF--LLLGYWAVMNLIPVPGIGEANLEKGT 218
Query: 286 NSPDY-------GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
N + G M + +W DPEG+LS++ + V +IGL G L+ +
Sbjct: 219 NLASWLDSILLKGHMYHETKTW-----DPEGILSTIPSIVNGIIGLLIGQLLFLKIAKIE 273
Query: 339 RMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ LI GL + V +NK++++ SY T G + L+ +YF++
Sbjct: 274 IAKKIALIGIALIITGLLWNIV-FPINKSIWTSSYVLYTTGLAATTLSVLYFII 326
>gi|398343267|ref|ZP_10527970.1| hypothetical protein LinasL1_09415 [Leptospira inadai serovar Lyme
str. 10]
Length = 399
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 89/336 (26%), Positives = 136/336 (40%), Gaps = 86/336 (25%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+RL+S+D RG TVA MILV++ G I + H+ W G T D V PFFLF VGVS+
Sbjct: 32 SKRLLSIDALRGFTVAGMILVNNPGSWSAIYSPLRHAKWFGCTPTDLVFPFFLFSVGVSI 91
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
P + + L + IF+ G F H ++ +D ++R GVLQRI
Sbjct: 92 -------PFSTIGNGGTFFKILKRASILIFI-GLFLHWFG--EWSMD--RLRIPGVLQRI 139
Query: 170 AIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEF 229
+ Y ++A+ YR ++ + +L Y + + EF
Sbjct: 140 GLVYFISAIA-------------------YRSASFRTRIMICISILFGYWILL-----EF 175
Query: 230 PVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
+ SP S P + +DR + G HL++ S+T
Sbjct: 176 APPPGAGSP-----------SLSPGKDWGAWLDRIVFGENHLWKS---SKT--------- 212
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
+DPEGLL S+ A T +G+ FG ++ K D N +
Sbjct: 213 ---------------WDPEGLLGSLSAVATTFLGIFFGEVL---KKDSDTKGNIQKTAFT 254
Query: 350 LIGLGLSLDFVG------MHLNKALYSLSYTCLTAG 379
I + L VG +NK+L++ S+ T G
Sbjct: 255 FILAAIVLMLVGWIWHQFFPMNKSLWTSSFVLWTGG 290
>gi|295136516|ref|YP_003587192.1| hypothetical protein ZPR_4697 [Zunongwangia profunda SM-A87]
gi|294984531|gb|ADF54996.1| conserved hypothetical protein [Zunongwangia profunda SM-A87]
Length = 371
Score = 82.0 bits (201), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 57/141 (40%), Positives = 84/141 (59%), Gaps = 12/141 (8%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R +SLD+ RG+TVALMILV++ G I H+ W+G TL D V P FLF+VG ++
Sbjct: 3 RSRYLSLDILRGMTVALMILVNNPGSWATIYAPFKHAAWHGFTLTDLVFPTFLFVVGNAM 62
Query: 110 ALTYKNF-----PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV-DIAQIRWM 163
+ ++K P + T K RA +FL+G+ L F + ++ + +I IR M
Sbjct: 63 SFSFKKMNSWSTPEFLTKTFK---RAAIIFLIGLGLSYYPFVRRTDGEFILKNILDIRIM 119
Query: 164 GVLQRIAIAYLVAALCEIWLK 184
GVLQRIA+ YL+AA+ +LK
Sbjct: 120 GVLQRIAVCYLLAAIAIRFLK 140
Score = 45.8 bits (107), Expect = 0.042, Method: Compositional matrix adjust.
Identities = 26/85 (30%), Positives = 44/85 (51%), Gaps = 1/85 (1%)
Query: 304 PFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMH 363
PFDPEGLLS + A V + G G I + + ++ +I +I L L D + +
Sbjct: 195 PFDPEGLLSCLPAVVNVIFGYLAGRFIQQSVNKKKLVIQLVIAGLAMISLALVWDVI-LP 253
Query: 364 LNKALYSLSYTCLTAGASGVLLAGI 388
+NK +++ SY L+ G ++L +
Sbjct: 254 VNKPIWTSSYVILSTGWDFIILGAL 278
>gi|225875032|ref|YP_002756491.1| hypothetical protein ACP_3497 [Acidobacterium capsulatum ATCC
51196]
gi|225792728|gb|ACO32818.1| putative membrane protein [Acidobacterium capsulatum ATCC 51196]
Length = 378
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/291 (29%), Positives = 129/291 (44%), Gaps = 88/291 (30%)
Query: 50 QHQQ-----RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFF 101
QHQ +R++S+D+ RG+T+A MILV++ G A+ H+ WNG T D V P F
Sbjct: 3 QHQDTVVNAKRMVSIDLLRGITIAFMILVNNNGDEAHAFWALKHAQWNGFTPTDLVFPTF 62
Query: 102 LFIVGVSLALT----YKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYGVD 156
+F+VG+SL + + +++ A+ R++ LFLLG+ + G +FH
Sbjct: 63 IFVVGISLVFSTEARLRRGQSRLLIAAHALRRSVILFLLGLVVNGFPYFH---------- 112
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL---Y 213
+R GVLQRIAI YL +L L+ R W+ AL+ TT Y
Sbjct: 113 FGTLRIYGVLQRIAICYLFGSL----------------LYLLSRRVWLQALLFTTALVGY 156
Query: 214 LLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYR 273
L+ + VP Y P P++ P N V +DR +L +
Sbjct: 157 WALMRWVPVPG--YGLP---GRDIPFL-----------DPNANLVAWLDRLLLPGR---- 196
Query: 274 KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGL 324
+Y+ T+ DPEGLLS++ A T L+G+
Sbjct: 197 --LYAGTR------------------------DPEGLLSTIPAMGTLLLGM 221
>gi|54294550|ref|YP_126965.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
gi|53754382|emb|CAH15866.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
Length = 372
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 91/356 (25%), Positives = 140/356 (39%), Gaps = 84/356 (23%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMPF 100
N+ +PQ RL+SLDVFRG+T+ LMI V+ I P H WNG TLAD V PF
Sbjct: 2 NTVPLKPQ----RLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPF 57
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
FLFIVG++ ++ KN RK + + + +N + ++ I
Sbjct: 58 FLFIVGLTSVISLKN-----QMERKEKTSLYSAIIERSVVLFLLGLFLNVFPHPIEFDSI 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R G+LQRIA+ YL++A + + S K F + LVL Y +++ +
Sbjct: 113 RIYGILQRIAVCYLISAFIYL------NTSIKTQFF--------IFLVLLLGYWIIMTQV 158
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
VP + + + W V D+ HLY K
Sbjct: 159 PVPGYGAN---QLTKDGSW------------------VSYFDQLFFSAPHLYEK------ 191
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV----HFKDH 336
+DPEG LS+ + T L G+ G L++ FK
Sbjct: 192 -----------------------TYDPEGFLSTFTSIATTLSGVLAGSLLINPCNQFKKF 228
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ +S +NK L++ SY T+G + + A Y ++
Sbjct: 229 YLLAGVGLLFLLLGWLWNMSFP-----INKNLWTSSYVLWTSGLALLAFAFCYLLI 279
>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 1105
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 56/140 (40%), Positives = 73/140 (52%), Gaps = 5/140 (3%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R +SLDVFRGLT+ LMILV+ G A + H+PW G T AD V P FLF VG ++
Sbjct: 736 RERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAM 795
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFF--HGINNLKYGVDIAQIRWMGVLQ 167
+ R+ R+ +FLLG + F HG + + I Q R GVLQ
Sbjct: 796 SFALDRGQPLGAFLRRIGKRSALIFLLGFLMYWFPFVHHGADGSWSFIAIDQTRVPGVLQ 855
Query: 168 RIAIAYLVAALCEIWLKGDG 187
RIA+ Y + AL WL G
Sbjct: 856 RIALCYALGALLCRWLPPRG 875
>gi|294627662|ref|ZP_06706244.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598014|gb|EFF42169.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 388
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 114/284 (40%), Gaps = 70/284 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVS 108
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG +
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYG-VDIAQIRWMGVL 166
++ + + RA + L G+ + FFH + + + Q+R GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSKRAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRLTGVL 137
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI + YL AAL +L G L+L G+W LLY P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCLALL---LGYWA-----------LLYAFGQPG-- 181
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
E S + NA +D + G HLYRK
Sbjct: 182 ----AELSKTG------------------NAGTRLDLWLYGRDHLYRK------------ 207
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL ++ ATV L G G +
Sbjct: 208 ---------------DGGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|372221469|ref|ZP_09499890.1| hypothetical protein MzeaS_04069 [Mesoflavibacter
zeaxanthinifaciens S86]
Length = 378
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 124/283 (43%), Gaps = 70/283 (24%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGV 107
++ R ++LDVFRGLT+ LMILV+ G I P ++H+ W G TLAD V P FLF +G
Sbjct: 6 NKTNRFLALDVFRGLTICLMILVNSPGTGADIYPYLSHANWFGFTLADLVFPSFLFAMGN 65
Query: 108 SLALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINNLKYGVD-IAQIRWMG 164
+++ + + +K + R L +FLLG + F + + + +A+ R MG
Sbjct: 66 AMSFSMAKLKTTPASKFYKKVVRRTLIIFLLGYLMYWFPFFEWSAQGFSLKPVAETRIMG 125
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPD 224
VLQRIA+ Y A++ H++SKL VAL+ + LL LY D
Sbjct: 126 VLQRIALCYFFASVLIY------HLNSKL-----------VALISALILLLYWCLLYWFD 168
Query: 225 WQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCS 284
G G NA+ +D +LG H+Y+K
Sbjct: 169 ---------------------GNGGVLEMGSNAITRLDLFLLGEGHVYKKD--------- 198
Query: 285 INSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG 327
PFDPEGLLS++ + V L G G
Sbjct: 199 -----------------SVPFDPEGLLSTLPSIVNVLGGYFAG 224
>gi|260790699|ref|XP_002590379.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
gi|229275571|gb|EEN46390.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
Length = 347
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 74/264 (28%), Positives = 119/264 (45%), Gaps = 58/264 (21%)
Query: 101 FLFIVGVSLALTYKNFPCKVVATR---KAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
F+FI+G S+AL+++ + R + I R+ LFL+G FL G HG N D+
Sbjct: 66 FVFIMGTSMALSFRGMRKRTSTRRVVFRVITRSAKLFLVGFFLNAG--HGRN------DL 117
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLS---------LFRKYR------GH 202
+R GVLQR++IAYLV+ E ++ + S + S + R
Sbjct: 118 GTVRVPGVLQRLSIAYLVSGFIECFVGKERKSSDERSRLTNPTLQKIHNALRDIVDNWAA 177
Query: 203 WVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMID 262
W++ L++ ++L++ + L VP P + TG A G ID
Sbjct: 178 WLLHLLILVIHLIITFLLPVPGC----PTGYLGPGGPLLGDGVEYLNCTG---GAAGYID 230
Query: 263 RKILGIQHLYRKP---IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVT 319
R ILG H+Y+ P ++ +TK FDPEG+L S+
Sbjct: 231 RLILG-SHMYQTPTVRVFYKTK---------------------VAFDPEGILGSLTTIFN 268
Query: 320 CLIGLHFGHLIVHFKDHRDRMLNW 343
C +GL G ++V++K+H R++ W
Sbjct: 269 CFLGLQAGKILVYYKEHSSRIIRW 292
>gi|294667090|ref|ZP_06732315.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292603100|gb|EFF46526.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 388
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 114/284 (40%), Gaps = 70/284 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVS 108
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG +
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYG-VDIAQIRWMGVL 166
++ + + RA + L G+ + FFH + + + Q+R GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSKRAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRLTGVL 137
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI + YL AAL +L G L+L G+W LLY P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCLALL---LGYWA-----------LLYAFGQPG-- 181
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
E S + NA +D + G HLYRK
Sbjct: 182 ----AELSKTG------------------NAGTRLDLWLYGRDHLYRK------------ 207
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL ++ ATV L G G +
Sbjct: 208 ---------------DGGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|239907232|ref|YP_002953973.1| hypothetical protein DMR_25960 [Desulfovibrio magneticus RS-1]
gi|239797098|dbj|BAH76087.1| hypothetical membrane protein [Desulfovibrio magneticus RS-1]
Length = 371
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 105/346 (30%), Positives = 148/346 (42%), Gaps = 84/346 (24%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIV 105
P RL S+D RGL +A MI+V++ G + P + H+ W+GLTLAD V P FLF+V
Sbjct: 2 PTPAIPRLSSVDTLRGLAIAAMIVVNNPGDRRFVYPQLLHAQWHGLTLADVVFPLFLFLV 61
Query: 106 GVSLALTY-----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
GV +AL ++ + RK + RA LF LG+ N + ++
Sbjct: 62 GVCVALAIDPDKPRDAEARARLWRKILPRAAVLFALGLG---------ENAYLRLSFDEL 112
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R GVLQRIA+ YL AA WL+ S+LS R VV V Y LLL +
Sbjct: 113 RLPGVLQRIAVVYLAAA----WLQ------SRLS----SRALAVVGAVTLLGYWLLLAAV 158
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
VP + S N G +D+ +LG +H+++
Sbjct: 159 PVPGHGHP---------------------SLAMEPNLQGWLDQLVLG-RHIWK------- 189
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM 340
+DPEG+LS+ A LIG+ G + D R
Sbjct: 190 --------------------FHTSWDPEGILSTFPAIALGLIGVLAGRWLRRGGDRPGRA 229
Query: 341 LNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+L LI +GL+ D V LNK+L + S+ LT G +LA
Sbjct: 230 ---GLLGLLLIAIGLAWDAV-FPLNKSLCTSSFVLLTGGLGLAMLA 271
>gi|84625357|ref|YP_452729.1| hypothetical protein XOO_3700 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188575197|ref|YP_001912126.1| hypothetical protein PXO_04319 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|84369297|dbj|BAE70455.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188519649|gb|ACD57594.1| membrane protein, putative [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 388
Score = 81.3 bits (199), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 89/289 (30%), Positives = 122/289 (42%), Gaps = 80/289 (27%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVS 108
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG +
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 109 LALT------YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG-VDIAQIR 161
++ + F +V +++A L AL L+ F FFH + + + Q+R
Sbjct: 78 MSFALATNMPHLQFLGRV--SKRAALIALCGVLMYWF---PFFHLQPDGGWAFTTVDQVR 132
Query: 162 WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLY 221
GVLQRI + YL AAL +L G + VAL+L LL ++G
Sbjct: 133 LTGVLQRIGLCYLAAALLVRYLPPRGIAPA------------CVALLLGYWALLHVFG-- 178
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
+ E S + NA +D + G +HLYRK
Sbjct: 179 ------QPGAELSKTG------------------NAGTCLDLWLYGREHLYRK------- 207
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL S+ ATV L G G +
Sbjct: 208 --------------------DGGFDPEGLLGSLSATVNVLAGYLCGRFL 236
>gi|170725675|ref|YP_001759701.1| hypothetical protein Swoo_1314 [Shewanella woodyi ATCC 51908]
gi|169811022|gb|ACA85606.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908]
Length = 378
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 156/366 (42%), Gaps = 92/366 (25%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDV-GGIL------------PAINHSPWNGL 91
+ T+ + +RRL+SLD RG + ++ + + G+L ++HS W+G
Sbjct: 2 EATQAKTPKRRLMSLDALRGFDMFWILGGEALFAGLLLWTGWHGWQWADAQMHHSQWHGF 61
Query: 92 TLADFVMPFFLFIVGVSLALTYKNFPCKVVATR-----KAILRALNLFLLGIFLQGGFFH 146
T D + P F+F+ GV+L L+ K +A R ++ R L L G+ G+
Sbjct: 62 TFYDLIFPLFIFLSGVALGLSPKRLDKLPMAQRMPLYKHSVKRLLLLLFFGVLYNHGWGT 121
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVA 206
G V I ++R+ VL RIA A+ AA+ +W H S FR
Sbjct: 122 GAP-----VAIDEVRYASVLGRIAFAWFFAAML-VW-----HTS-----FRTQ------- 158
Query: 207 LVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
VL TL +L+LYGL Q PV + G P + +D +L
Sbjct: 159 -VLVTLGILVLYGLA----QLLIPVPGYGA------------GIFSPEGSINAYVDTHLL 201
Query: 267 -GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLH 325
GI + R DPEG+LS++ A V + G+
Sbjct: 202 PGIAYQNRA------------------------------LDPEGILSTLPAIVNAMAGVF 231
Query: 326 FGHLIV--HFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
GH IV H K +++ +I + ++G G L+ V + +NK L++ S+ +T G S +
Sbjct: 232 VGHFIVKEHAKGEWYKVVTMLIAGALVLGCGWLLNLV-IPVNKDLWTSSFVLVTTGWSMI 290
Query: 384 LLAGIY 389
LLA Y
Sbjct: 291 LLAVFY 296
>gi|320106288|ref|YP_004181878.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
gi|319924809|gb|ADV81884.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
Length = 394
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 127/289 (43%), Gaps = 77/289 (26%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
+R++S+DV RGLTVA MILV+D G ++H+PWNG T D V P FLF+VG S+
Sbjct: 10 QRILSVDVLRGLTVAFMILVNDPGDGHVAYAPLDHAPWNGWTPTDMVFPTFLFLVGCSIV 69
Query: 111 LTYKNFPCKVVATRKAILRAL--NLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQR 168
+ + + + + L+ + ++LL I I YG ++R GVL R
Sbjct: 70 FSITSRLKRGDSKSRIALQVIRRTIYLLAINYA---IRLIPQFHYG----RMRLFGVLPR 122
Query: 169 IAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL--YLLLLYGLYVPDWQ 226
IAI YL+AAL +WL+ R W+ V+T L Y L+ + +P
Sbjct: 123 IAICYLIAALLFLWLQ---------------RARWIAVAVVTLLVGYWALMRFVPIPGAG 167
Query: 227 ---YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKI--LGIQHLYRKPIYSRTK 281
++FP+ FN N IDR + L+ +Y +T+
Sbjct: 168 MPVHDFPLMDQ------FN-------------NLPSYIDRGFNDFTQRFLHTGSLYEKTR 208
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
DPEG+LS++ A T L+G+ G L+
Sbjct: 209 ------------------------DPEGILSTLPAVGTALLGVLNGKLL 233
>gi|340786861|ref|YP_004752326.1| protein involved in N-acetyl-D-glucosamine utilization [Collimonas
fungivorans Ter331]
gi|340552128|gb|AEK61503.1| protein involved in N-Acetyl-D-glucosamine utilization [Collimonas
fungivorans Ter331]
Length = 373
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 97/353 (27%), Positives = 153/353 (43%), Gaps = 92/353 (26%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ RRL S+D RG TVA M+LV+D G + + HS W+G T D V PFFLF+VG
Sbjct: 19 RSMTRRLASVDALRGCTVAAMLLVNDPGDWSHVYAPLEHSAWHGCTPTDLVFPFFLFVVG 78
Query: 107 VSLALTYKNFPCK----VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV-DIAQIR 161
VS AL + + R A++RAL + LG+ IN L + + +R
Sbjct: 79 VSTALGIEPRLAQGANPSTLARAALIRALRIVALGLL--------INLLAWFIMPGVHLR 130
Query: 162 WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHW--VVALVLTTLYLLLLYG 219
GVLQRI + + ALC S++ + R W +VA++L LL L G
Sbjct: 131 LPGVLQRIGLCFAATALC--------------SIYTRPRTQWGLIVAILLGYWGLLTLGG 176
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
S PW+ N D + G +Y+
Sbjct: 177 ---------------SLEPWL---------------NLASRSDSALFG-HFVYQ------ 199
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
+DA S DPEGLL ++ + T L+GL G + +++R +
Sbjct: 200 ---------------IDAVS--GRGHDPEGLLGTLPSLATSLLGLCAGRWL---RENRLK 239
Query: 340 MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
L + +++ ++G +L + + LNK L++ S+ AG + + L +++V
Sbjct: 240 PLLFAAVAALVLG---TLWSLALPLNKNLWTSSFVLWCAGWATLALLLFHWLV 289
>gi|389809458|ref|ZP_10205319.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter thiooxydans LCS2]
gi|388441722|gb|EIL97972.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter thiooxydans LCS2]
Length = 353
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/139 (40%), Positives = 76/139 (54%), Gaps = 16/139 (11%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
+RL SLD RG TVA M+LV+D G + + H+ WNG T D V PFFLF+VGVS+A
Sbjct: 2 KRLASLDALRGCTVAAMLLVNDPGDWSHVYWPLAHAAWNGCTPTDLVFPFFLFVVGVSVA 61
Query: 111 LT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNL-KYGVDIAQIRWMGV 165
L + TR A+ RAL + LG+ IN L + + A +R+ GV
Sbjct: 62 LAILPRLEQGASPSALTRAAMWRALRILALGV--------AINLLAAWWLPQAHLRFPGV 113
Query: 166 LQRIAIAYLVAALCEIWLK 184
LQRIA+ + AL ++ K
Sbjct: 114 LQRIALCFAGVALFAVYTK 132
>gi|326800650|ref|YP_004318469.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326551414|gb|ADZ79799.1| hypothetical protein Sph21_3257 [Sphingobacterium sp. 21]
Length = 396
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 98/376 (26%), Positives = 158/376 (42%), Gaps = 90/376 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R SLDVFRG TVALMILV++ G + H+PW+G T D V PFFLF VG ++
Sbjct: 2 NQRYYSLDVFRGATVALMILVNNPGSWSYAFSPLKHAPWHGCTPTDLVFPFFLFAVGNAM 61
Query: 110 ALTYKNFPCKV--VATRKAILRALNLFLLGI------FLQGG-----FFHGINNLKYGVD 156
+ + V +K + R + +FL+G+ F+Q F + IN +K
Sbjct: 62 SFVIPRLRTQAGKVFWKKVLKRTILIFLIGLLLNWYPFVQWSNDTLLFKYWINPIK---S 118
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+ IR +GVLQRIA+ Y A++ L F K + +VL ++++LL
Sbjct: 119 DSGIRILGVLQRIALCYCFASI--------------LVYFFKTK-----TVVLISIFILL 159
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
Y L + S P+ G + ID IL I H+Y+
Sbjct: 160 SYWLIC--------ILLGDSDPYSLQGWFGTK------------IDVSILQIAHMYKGE- 198
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
PF+PEG+ S+ A + +IG G I
Sbjct: 199 -------------------------GVPFEPEGIASTFTAVIQVVIGFLVGQYIQTGTKA 233
Query: 337 RD------RMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
+ R ++ +++++ L+ LG + +NK +++ SY ++G + L + +
Sbjct: 234 IEGPMLIYRTVSTLMVTAALLTLGGMTWGLAFPINKKIWTSSYVLYSSGLAITALGVLIW 293
Query: 391 MVRYISSHLMLKKPFD 406
V + K FD
Sbjct: 294 FVEIKGHKNFVTKFFD 309
>gi|329964617|ref|ZP_08301671.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
12057]
gi|328525017|gb|EGF52069.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
12057]
Length = 396
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 161/368 (43%), Gaps = 105/368 (28%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q +R+++LD+ RG+T+A MI+V++ G I + H+ W GLT D V PFF+FI+G
Sbjct: 4 QKTSKRILALDILRGITIAGMIMVNNPGSWAHIYAPLAHAQWIGLTPTDLVFPFFMFIMG 63
Query: 107 VSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NNLKYGVD 156
+S ++ K NF A K + R + +FL+G+ + G+F +NL +G +
Sbjct: 64 ISTYISLKKYNFEFSHAAALKILKRTVIIFLIGMAI--GWFSRFCYYWSSAPDNLGFGEN 121
Query: 157 I-------AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG-HWVVALV 208
+ ++R +GV+QR+A+ Y +S ++L K++ +++A +
Sbjct: 122 LWASVWTFDRMRILGVMQRLALCY--------------GATSIIALTMKHKNIPYLIATL 167
Query: 209 LTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGI 268
L ++LLL G G N + ++DR +L
Sbjct: 168 LIGYFILLL----------------------------GGNGFAYNETNILSIVDRAVLTP 199
Query: 269 QHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGH 328
H+Y+ D G DPEG+LS++ A L+G G
Sbjct: 200 AHMYK---------------DNG-------------IDPEGILSTIPAIAHVLLGFCVGR 231
Query: 329 LIV-HFKDHRDRMLNWIILSSCLIGL---GLSLDFVGMHL------NKALYSLSYTCLTA 378
+++ + + N +L S LI L G L F G L +K ++S ++ +T
Sbjct: 232 MMLGDSQSAKGDRTN--VLDSHLIKLFLAGTILTFAGFLLSYGCPISKKIWSPTFVLVTC 289
Query: 379 GASGVLLA 386
G + LA
Sbjct: 290 GLASSFLA 297
>gi|198277541|ref|ZP_03210072.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
gi|198270039|gb|EDY94309.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
Length = 338
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 88/339 (25%), Positives = 138/339 (40%), Gaps = 101/339 (29%)
Query: 70 MILVDDVGGIL--PAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK--NFPCKVVATRK 125
MILV++ GG + + HS WNGLT D V PFFLF+VG+S ++ + NF RK
Sbjct: 1 MILVNNAGGPVSYAPLRHSVWNGLTPCDLVFPFFLFMVGISTYISLRKFNFGPTSEVIRK 60
Query: 126 AILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG 185
+ R + L+G+ + F + N + +D +I GVLQRI + Y + +L I++
Sbjct: 61 IVRRTFLIILIGLAID-WFGYACNGNFFPIDTLRIP--GVLQRIGLCYGIVSLMVIYIN- 116
Query: 186 DGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTC 245
KY PWI C
Sbjct: 117 -----------HKYL-------------------------------------PWI----C 124
Query: 246 G---------VRGSTGPAC---NAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
G + G AC N + +IDR I G HLY+K
Sbjct: 125 GGLLLAYSILLLTGNGYACDDTNLLAVIDRGIFGEAHLYKK------------------- 165
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
+P DPEGL ++ A LIG G L++ R++ I + ++ +
Sbjct: 166 ---------SPIDPEGLAGTLSAVAHTLIGFMCGRLLLEKISVNKRIVKLITAAVMMLII 216
Query: 354 GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G L + M +NK ++S ++ +T G +LLA + +++
Sbjct: 217 GYVLS-IWMPINKRVWSTTFVLVTCGWGSLLLALLMYVI 254
>gi|395761203|ref|ZP_10441872.1| hypothetical protein JPAM2_05565 [Janthinobacterium lividum PAMC
25724]
Length = 373
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 59/143 (41%), Positives = 78/143 (54%), Gaps = 9/143 (6%)
Query: 53 QRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R ++LDV RGLTVALMI+V+ D G + H+ W+G TL D V P FLF+VG +L
Sbjct: 5 SQRYLALDVLRGLTVALMIVVNTPGDWGSVYAPFLHAEWHGFTLTDLVFPSFLFVVGNAL 64
Query: 110 ALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV--DIAQIRWMG 164
A Y+N V + AL +FLLG L F I++ ++Q R G
Sbjct: 65 AFVLGKYENLAHGAVLAKLCKRSAL-IFLLGFLLYWFPFFKIDDAGQFAWSSLSQTRIPG 123
Query: 165 VLQRIAIAYLVAALCEIWLKGDG 187
VLQRIA+ YL AAL + K G
Sbjct: 124 VLQRIAVCYLAAALILHYWKTRG 146
>gi|375149723|ref|YP_005012164.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361063769|gb|AEW02761.1| Protein of unknown function DUF2261, transmembrane [Niastella
koreensis GR20-10]
Length = 368
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 92/351 (26%), Positives = 140/351 (39%), Gaps = 72/351 (20%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
+R ISLDVFRGL + LMI+V+ G + H+ WNG TLAD V P FLF VG +L
Sbjct: 4 NKRFISLDVFRGLIICLMIIVNTPGSHDTSFALLQHANWNGFTLADLVFPSFLFAVGNAL 63
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG----VDIAQIRWMGV 165
+ + + A + L L + +F G A R MGV
Sbjct: 64 FFSMQKWKTMTQGQVLAKIGKRTLLLFLLGYLLYWFPFFETNTQGHIVFKSFAGTRIMGV 123
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQRIA+ Y +A+L +LK G + +V+ ++ Y LL+ L P
Sbjct: 124 LQRIALCYGIASLLIYYLKPKGAL--------------IVSAIILVAYPGLLFWLGDPG- 168
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
NAV D +LG H+ +
Sbjct: 169 -----------------------NKLNMVGNAVTKFDLWLLGPDHMNHGEV--------- 196
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
PF+PEG+LS++ A + G G I + +L I
Sbjct: 197 -----------------VPFEPEGILSTLPAITNVVAGYLVGWYIQTAGKTKRMLLRLIA 239
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYIS 396
+ L LGL ++V + +NK L++ S+ + G +LLA I ++ +++
Sbjct: 240 TGAGLTFLGLCWNYV-IPINKNLWTSSFVVHSTGLDCLLLAAIIYIADFLN 289
>gi|288929890|ref|ZP_06423732.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
F0108]
gi|288328709|gb|EFC67298.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
F0108]
Length = 399
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 93/368 (25%), Positives = 154/368 (41%), Gaps = 98/368 (26%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFI 104
R++S+D+ RGLT+A MI V++ G + + H+ WNGLT D V PFF+ +
Sbjct: 2 EKNKTTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFMCV 61
Query: 105 VG--VSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQ--GGFFHGINNLKYGVD---- 156
+G + +A+ +F C K + R + ++L+G+ + F + N+ + G D
Sbjct: 62 MGMCIYIAMRKFDFACNRATVYKIVKRMVLIYLVGLAIGWFAKFCYRWNSPQEGADFFSQ 121
Query: 157 -------IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
+IR GVL R+AI Y + AL I ++ K+ + +V L+L
Sbjct: 122 LWYMVWSFDKIRLTGVLARLAICYGITALLAITVR------------HKHLPYIIVGLLL 169
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
T +++L+ G G N + + DR +L
Sbjct: 170 -TYFVILMAG----------------------------NGFAYDETNILSIADRAVLTDA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y D G DPEGLLS++ + L+G G L
Sbjct: 201 HMYH---------------DNG-------------IDPEGLLSTLPSIAHTLLGFIIGSL 232
Query: 330 IVHFKDHRDRMLNW---IILSSC--LIGLGLSLDFVGMHL------NKALYSLSYTCLTA 378
+ D ++ L+ I L+ L +G SL F G L NK ++S ++ +T
Sbjct: 233 LFRKADVGEQQLDARTNITLTKVVPLFVVGTSLLFAGYLLSYGCPINKKVWSPTFVLVTC 292
Query: 379 GASGVLLA 386
G + +LLA
Sbjct: 293 GLASMLLA 300
>gi|381169858|ref|ZP_09879020.1| conserved hypothetical protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689628|emb|CCG35507.1| conserved hypothetical protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 388
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 112/284 (39%), Gaps = 70/284 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVS 108
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG +
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYG-VDIAQIRWMGVL 166
++ + + RA + L G+ + FFH + + + Q+R GVL
Sbjct: 78 MSFALATNAPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI + YL AAL +L G L+L Y W
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCLALLLGY-------------------------WA 172
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
+ + +F TG NA +D + G HLYRK
Sbjct: 173 FLY----------VFGQPGAELSKTG---NAGTRLDLWLYGRDHLYRK------------ 207
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL ++ ATV L G G +
Sbjct: 208 ---------------DGGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|313203961|ref|YP_004042618.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
gi|312443277|gb|ADQ79633.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
Length = 382
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 95/349 (27%), Positives = 140/349 (40%), Gaps = 73/349 (20%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R +LD+FRG+TV MI+V+ G + H+ WNG T D V P FLF VG +L
Sbjct: 14 RFTALDIFRGMTVCFMIIVNTSGNGATTYWPLMHADWNGFTPTDLVFPSFLFAVGNALGF 73
Query: 112 TYKNFPC--KVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV--DIAQIRWMGVLQ 167
K + + K R +FL+G + F +N + + I+Q R MGVLQ
Sbjct: 74 AMKRWDTMKQSDVLLKIFKRTALIFLIGYLMYWFPFFRLNAESHLILSPISQTRIMGVLQ 133
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL-YGLYVPDWQ 226
RIA+ Y + AL +L R WV + L ++LLL +G
Sbjct: 134 RIALCYGITALLVYYLGTK-------------RTIWVGVVSLLAYWVLLLAFG------- 173
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
E E S + NAV +D +LG HLY +
Sbjct: 174 -EAGAEFSKTG------------------NAVLRLDIWLLGTHHLYGGEGF--------- 205
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
PFDPEG+LS++ A + G G + K L ++L
Sbjct: 206 -----------------PFDPEGVLSTLPALFNVIAGFAVGRYLQQQKGKSYESLAKLLL 248
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYI 395
+ + M +NK L++ SY LT G +LL+ I + ++
Sbjct: 249 VGIGLLVLAYCWNSWMPINKKLWTSSYAVLTVGLDCLLLSVIIYFTDFL 297
>gi|421097001|ref|ZP_15557700.1| putative membrane protein [Leptospira borgpetersenii str.
200901122]
gi|410800246|gb|EKS02307.1| putative membrane protein [Leptospira borgpetersenii str.
200901122]
Length = 383
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 92/349 (26%), Positives = 145/349 (41%), Gaps = 92/349 (26%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TV MILV++ G I + H+ WNG T D V PFFL
Sbjct: 2 EKKSTQNKERILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAEWNGCTPTDLVFPFFL 61
Query: 103 FIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQ---GGFFHGINNLKYGVDIAQ 159
F VGVS+ ++ + ++ I R+ +GI ++ + N A+
Sbjct: 62 FAVGVSIPIS--------LYSKNGINRS--KVWIGICIRSISLILLGLLLNFFGEWSFAE 111
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
+R GVLQRI Y A ++L G + W++ ++
Sbjct: 112 LRVPGVLQRIGFVYWTVA--SLYLIFPGKKVLIFLILVLLVHTWILTHIIP--------- 160
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
P E++ S G A IDR I+G +HL++ +S+
Sbjct: 161 ----------PGESTVSL------------EQGKDIGA--WIDRMIIGEKHLWK---FSK 193
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGH-LIVHFKDHRD 338
T +DPEGLLS + + T L G+ G L + ++
Sbjct: 194 T------------------------WDPEGLLSGIASIATSLFGVLCGFILFLREGVGKN 229
Query: 339 RMLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGAS 381
R+L + GLG FVG + +NK+L++ SY TAG S
Sbjct: 230 RVLG-------IFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLS 271
>gi|444731031|gb|ELW71398.1| Heparan-alpha-glucosaminide N-acetyltransferase [Tupaia chinensis]
Length = 732
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 100/402 (24%), Positives = 170/402 (42%), Gaps = 67/402 (16%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSN--SKQTRPQ-----HQQRRLISLDVFRGLTVALMI 71
ISK+ ++ ++L L PS ++ +TRP+ RL +D FRG+ + LM+
Sbjct: 223 ISKAMNSRETDRLINSELGSPSRADPLGGETRPEPWRPAASPHRLRCVDTFRGIALILMV 282
Query: 72 LVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVS------LALTYKNFPCKVVA--- 122
V+ GG H+ WN ++ +P L +GV+ L L + C
Sbjct: 283 FVNYGGGKYWYFKHASWN-VSWDKVRIPGVLQRLGVTFFVVAVLELLFAKPVCIYYGVFN 341
Query: 123 -TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEI 181
+ I A +F + I + NL + ++R GVLQR+ + + V A+ E+
Sbjct: 342 FSVNDIYAAAGMFKIQIARENCVEEFPVNLYRDLSWDKVRIPGVLQRLGVTFFVVAVLEL 401
Query: 182 WL-----KGDGHVSSKLSLFRKYRG--HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETS 234
+ S LSL W+V L+L +++L L + L VP +
Sbjct: 402 LFAKPVPENCASERSCLSLRDVTSSWPQWLVILMLESIWLGLTFFLPVPGCPKGY----- 456
Query: 235 SSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
+ G G G P C A G ID +LG HLY+ P + +
Sbjct: 457 --------LGPGGIGDFGKYPNCTGGAAGYIDHLLLGADHLYKHPSSTVLYHTEV----- 503
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
+DPEG+L ++ + V +G+ G +++++KD +L C+
Sbjct: 504 -------------AYDPEGILGTINSIVMAFLGVQAGKILLYYKDRTKDILIRFTAWCCI 550
Query: 351 IGL------GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+GL +S + + +NK L+S+SY T L++ A +LL
Sbjct: 551 LGLISIGLTKISENEGFIPVNKNLWSISYVTTLSSFAFFILL 592
>gi|424670170|ref|ZP_18107195.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
Ab55555]
gi|401070628|gb|EJP79142.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
Ab55555]
Length = 355
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 93/192 (48%), Gaps = 18/192 (9%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RRL S+D RG+TVA M+LV++ G + + HS W+G T D V PFFLF+VGVS+A
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ + A R A+ R + L I + G H + + + +D R GVLQRIA
Sbjct: 67 FSVAP-RAQDAAARPALARGVLERALRILMAGALLHLL--IWWALDTHHFRIWGVLQRIA 123
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ + + ++ + V + ++L Y LLL G P
Sbjct: 124 VCAALVGVLAVYARPRVQVGALIALLVGYT------------VLLLGIGDLAPWTNPASR 171
Query: 231 VETSSSSPWIFN 242
++T+ +PWI+
Sbjct: 172 LDTALFAPWIYQ 183
>gi|260910302|ref|ZP_05916976.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
gi|260635554|gb|EEX53570.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
Length = 399
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 91/368 (24%), Positives = 151/368 (41%), Gaps = 98/368 (26%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFI 104
R++S+D+ RGLT+A MI V++ G + + H+ WNGLT D V PFF+ +
Sbjct: 2 EKNKTTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFMCV 61
Query: 105 VG--VSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQ--GGFFHGINNLKYGVD---- 156
+G + +A++ NF C K + R + ++L+G+ + F + NN + G D
Sbjct: 62 MGMCIYIAMSKFNFACNRATVYKILKRMVLIYLVGLAIGWFAKFCYRWNNPQEGADFFSQ 121
Query: 157 -------IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
+IR GVL R+A+ Y + AL I + R ++V +L
Sbjct: 122 LWYMVWSFDKIRLTGVLARLAVCYGITALLAITV-------------RHKHLPYIVGGLL 168
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+++L+ G G N + ++DR +L
Sbjct: 169 LAYFVILMAG----------------------------NGFAYDETNILSIVDRAVLTDA 200
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
H+Y D G DPEGLLS++ + L+G G +
Sbjct: 201 HMYH---------------DNG-------------IDPEGLLSTLPSIAHTLLGFIIGGM 232
Query: 330 IVHFKDHRDRMLNW---IILSSC--LIGLGLSLDFVGMHL------NKALYSLSYTCLTA 378
+ D + L+ I L+ L +G SL F G L NK ++S ++ +T
Sbjct: 233 LFRKADAGVQQLDARANITLTKVVPLFVVGTSLLFAGYLLSYGCPINKKVWSPTFVLVTC 292
Query: 379 GASGVLLA 386
G + +LLA
Sbjct: 293 GLASMLLA 300
>gi|21241481|ref|NP_641063.1| hypothetical protein XAC0710 [Xanthomonas axonopodis pv. citri str.
306]
gi|21106824|gb|AAM35599.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 388
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 85/284 (29%), Positives = 112/284 (39%), Gaps = 70/284 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVS 108
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG +
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYG-VDIAQIRWMGVL 166
++ + + RA + L G+ + FFH + + + Q+R VL
Sbjct: 78 MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTCVL 137
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI + YL AAL L + RG V L L Y LLY P
Sbjct: 138 QRIGLCYLAAAL--------------LVRYLPPRGIAPVCLALLLGYWALLYAFGQPG-- 181
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
E S + NA +D + G HLYRK
Sbjct: 182 ----AELSKTG------------------NAGTRLDLWLYGRDHLYRK------------ 207
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL ++ ATV L G G +
Sbjct: 208 ---------------DGGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|383934719|ref|ZP_09988159.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
nanhaiensis E407-8]
gi|383704254|dbj|GAB58250.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
nanhaiensis E407-8]
Length = 397
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 89/340 (26%), Positives = 148/340 (43%), Gaps = 63/340 (18%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R+++LDV RGLT+ MILV++ G + + H+ W+G T+ D + PFF+ IVG+SL L
Sbjct: 20 RMLALDVLRGLTITAMILVNNPGSWNYVYSPLLHAQWHGWTITDLIFPFFIVIVGMSLQL 79
Query: 112 TYKNFPC--KVVATRKAILRALNLFLLGIFLQGGFFHGIN-NLKYGVD-IAQIRWMGVLQ 167
+ + K R+A+LR+ LF LG+ L +++ + Y D + +RW+GVLQ
Sbjct: 80 SLRQHSLNNKGPLIRQALLRSGKLFGLGLLLALFYYNFRDPEFNYVEDRLLTVRWLGVLQ 139
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RI + YL L + Y G L L L + L GL +
Sbjct: 140 RIGLVYLATVLIVL-----------------YFGQRGRLLWLLGLMAVYLAGL------W 176
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
P + + RG + V +D+ +LG H+Y +
Sbjct: 177 WLPYQDAQGHE--------FRGLLLFGNSFVAWLDQLVLGANHVYYR------------- 215
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
P FDPEGL S++ A +CL G+ + + ++ ++
Sbjct: 216 ---SATPF--------AFDPEGLWSTLPAIASCLTGVLMAQWLQSERSLAQKIRGLLLCG 264
Query: 348 SCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAG 387
+ L F + +NK+L++ S+ L++G + LA
Sbjct: 265 VVAVWLAELWHF-SLPVNKSLWTPSFVLLSSGYCAIALAA 303
>gi|456734835|gb|EMF59605.1| N-acetylglucosamine transporter NagX [Stenotrophomonas maltophilia
EPM1]
Length = 355
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 60/192 (31%), Positives = 93/192 (48%), Gaps = 18/192 (9%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RRL S+D RG+TVA M+LV++ G + + HS W+G T D V PFFLF+VGVS+A
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ + A R A+ R + L I + G H + + + +D R GVLQRIA
Sbjct: 67 FSVAP-RAQDAAARPALARGVLERALRILVAGALLHLL--IWWALDTHHFRIWGVLQRIA 123
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ + + ++ + V + ++L Y LLL G P
Sbjct: 124 VCAALVGVLAVYARPRVQVGALIALLVGYT------------VLLLGIGDLAPWTNPASR 171
Query: 231 VETSSSSPWIFN 242
++T+ +PWI+
Sbjct: 172 LDTALFAPWIYQ 183
>gi|343083133|ref|YP_004772428.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342351667|gb|AEL24197.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 381
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 97/361 (26%), Positives = 157/361 (43%), Gaps = 78/361 (21%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVM 98
+N+ + H + + SLDV RGLT+ALM++V+ D + H+ W+GLTL D V
Sbjct: 2 TNTLDSITPHTRYQ--SLDVLRGLTLALMVIVNTPGDGSTSFGPLTHADWHGLTLTDLVF 59
Query: 99 PFFLFIVGVSLALTYKNFPCK--VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV- 155
P FLF+VG +++ + F K K R +F++G+ L F +N+ GV
Sbjct: 60 PSFLFVVGNAMSFSLGKFKLKGGKAYFSKVFKRTALIFIIGLLLTAFPFFRVND--SGVV 117
Query: 156 --DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG-HVSSKLSLFRKYRGHWVVALVLTTL 212
D IR +GVLQRIA+ Y + A +L ++S L L +W+V
Sbjct: 118 PYDFTSIRILGVLQRIALCYGLGATLIYFLSPKKILITSGLILLS----YWLV------- 166
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
+ L+G+ D F +E NA G +D + +LY
Sbjct: 167 --MFLFGVPGMD---PFSLE----------------------GNASGRLDLWLFSPDNLY 199
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
+ +N + P EGLLS+ A V L+G G I
Sbjct: 200 K-----------MNGFAFDP---------------EGLLSTFPAMVNVLLGYWVGLQIQK 233
Query: 333 FKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ +L + + L+ +G D+ G +NK +++ S+T +T G S + LA + F++
Sbjct: 234 RGGDIETVLWLAMFAVILLVVGYLWDY-GFPINKKIWTSSFTLVTVGYSTLTLALLMFIL 292
Query: 393 R 393
Sbjct: 293 E 293
>gi|386392672|ref|ZP_10077453.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
gi|385733550|gb|EIG53748.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
Length = 370
Score = 78.6 bits (192), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 67/181 (37%), Positives = 92/181 (50%), Gaps = 34/181 (18%)
Query: 52 QQRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
++ RL S+D RGL VA MIL + + G + + H+ W+G T ADF+ P FLF+VGV
Sbjct: 5 RKTRLASVDGLRGLAVAGMILANNPGERGHVYRELQHAVWDGWTAADFIFPLFLFLVGVC 64
Query: 109 LALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+AL + R+ + RA+ LFLLG+ + N V +R G
Sbjct: 65 VALAVDRDTVRTGEAHRFWRRVLTRAIILFLLGL---------LENAYLRVSFENLRIPG 115
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL--YLLLLYGLYV 222
VLQRIA+ YL A WL HV RG +V+++L TL Y LLL G+ V
Sbjct: 116 VLQRIAVVYLATA----WL----HVRC------GNRG--IVSVILVTLLGYWLLLAGVPV 159
Query: 223 P 223
P
Sbjct: 160 P 160
>gi|359686994|ref|ZP_09256995.1| hypothetical protein LlicsVM_01380 [Leptospira licerasiae serovar
Varillal str. MMD0835]
gi|418756670|ref|ZP_13312858.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
VAR 010]
gi|384116341|gb|EIE02598.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
VAR 010]
Length = 391
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 84/339 (24%), Positives = 147/339 (43%), Gaps = 86/339 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ R++S+D+ RGLTVA MILV++ G + + H+ W+G T D V PFFLF VG
Sbjct: 20 SENRIRILSIDLLRGLTVAGMILVNNPGTWSNMYWPLKHAKWDGCTPTDLVFPFFLFAVG 79
Query: 107 VSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
S+ + N + K + R++ L LG+FL F G + +R+ GVL
Sbjct: 80 ASIPFSVSN---GIQEFPKILKRSVILIFLGLFLN---FFG------EWSFSNLRFPGVL 127
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI AY +A+ + ++ ++ LF + L+++ YL
Sbjct: 128 QRIGFAYFFSAIAY----REKNLKFRIILF--------LTLLISYWYL------------ 163
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
EF ++ P S + +DR++ G HL++
Sbjct: 164 QEFIPPPGAAEP-----------SMKEGKDWGAWLDREVFGQAHLWK------------- 199
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
+G + +DPEGLL+S + + G+ G + K H ++ + + +
Sbjct: 200 ---FGKV-----------WDPEGLLTSFTSIASVFCGIFAGEFL---KVHLEKKESPLSI 242
Query: 347 SSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAG 379
S ++ ++ VG +NK+L++ +Y+ TAG
Sbjct: 243 SGKIVLGAFAVLVVGGVWGIYYPINKSLWTGTYSLWTAG 281
>gi|319900285|ref|YP_004160013.1| hypothetical protein Bache_0400 [Bacteroides helcogenes P 36-108]
gi|319415316|gb|ADV42427.1| putative transmembrane protein [Bacteroides helcogenes P 36-108]
Length = 396
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 97/375 (25%), Positives = 165/375 (44%), Gaps = 111/375 (29%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+R+++LD+ RG+T+A MI+V++ G I + H+ W GLT D V PFF+FI+G+S
Sbjct: 6 TSKRILALDILRGITIAGMIMVNNPGNWGHIYAPLEHAEWIGLTPTDLVFPFFMFIMGIS 65
Query: 109 LALTYK--NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI--------NNLKYGVDI- 157
++ K +F A K + R +FL+G L G+F + L +G ++
Sbjct: 66 TYISLKKYDFEFSRSAALKILKRTAIIFLIG--LAIGWFARLCYYWAAAPGELSFGENLW 123
Query: 158 ------AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG-HWVVALVLT 210
++R +GV+QR+A+ Y +S ++L K+R +++A +L
Sbjct: 124 ASVWTFDRMRILGVMQRLALCY--------------GATSIIALTMKHRHIPYLIAGLLI 169
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQH 270
+ ++LL+ CG G N + ++DR +L H
Sbjct: 170 SYFILLM---------------------------CG-NGFAYNETNILSVVDRAVLTPAH 201
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
+Y+ D G DPEGLLS++ + L+G G ++
Sbjct: 202 MYK---------------DNG-------------IDPEGLLSTIPSIAHVLLGFCVGRMM 233
Query: 331 VHFKDHR----DRMLNWIILSSCLIGL---GLSLDFVGMHL------NKALYSLSYTCLT 377
+ D R DRM +L S LI L G L F G L +K ++S ++ +T
Sbjct: 234 L--GDDRNAKADRM---AVLDSHLIKLLLTGAILTFAGFLLSYGCPISKKIWSPTFVLVT 288
Query: 378 AGASGVLLAGIYFMV 392
G + LA + ++V
Sbjct: 289 CGMASSFLALLIWIV 303
>gi|384417772|ref|YP_005627132.1| membrane protein [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353460685|gb|AEQ94964.1| membrane protein, putative [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 388
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 83/284 (29%), Positives = 115/284 (40%), Gaps = 70/284 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVS 108
++ R +SLDVFRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG +
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQG-GFFHGINNLKYG-VDIAQIRWMGVL 166
++ + + RA + L G+ + FFH + + + Q+R GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSKRAALIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRI + YL AAL +L + ++L G+W LLY P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRSIAPACVALL---LGYWA-----------LLYVFGQPG-- 181
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
E S + NA +D + G +HLYRK
Sbjct: 182 ----AELSKTG------------------NAGTCLDLWLYGREHLYRK------------ 207
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL ++ ATV L G G +
Sbjct: 208 ---------------DGGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|352080530|ref|ZP_08951469.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
gi|351683811|gb|EHA66887.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
Length = 353
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 93/347 (26%), Positives = 136/347 (39%), Gaps = 88/347 (25%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
+RL SLD RG TVA M+LV+D G + + H+ W+G T D V PFFLF+VGVS+A
Sbjct: 2 KRLASLDALRGCTVAAMLLVNDPGDWGHVYWPLEHAAWHGCTPTDLVFPFFLFVVGVSVA 61
Query: 111 LT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNL-KYGVDIAQIRWMGV 165
L + TR A RAL + LG+ IN L + + A +R+ GV
Sbjct: 62 LAILPRLEQGAAPSALTRAATWRALRILALGV--------AINLLAAWLLPQAHLRFPGV 113
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQRIA+ + AL I H + W +A +L + LL G
Sbjct: 114 LQRIALCFAGVALFAI------HTKPRTQ-------WWAIAALLIGYWGLLRLG------ 154
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
GS P N +D + G P R
Sbjct: 155 -----------------------GSLEPWTNLASRVDSAVFGRFVYLIDPASGRGH---- 187
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
DPEGLLS++ + L+GL G + + R L
Sbjct: 188 --------------------DPEGLLSTLPSLAGTLLGLRMG---CWLRREQFRTLLLAG 224
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ L+G +L + NK L++ S+ T G + + L + ++
Sbjct: 225 IACLLLG---ALWSPWLPFNKNLWTPSFVLWTTGWATLALLAFHVLI 268
>gi|410463501|ref|ZP_11317013.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
Maddingley MBC34]
gi|409983383|gb|EKO39760.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
Maddingley MBC34]
Length = 371
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 55/140 (39%), Positives = 76/140 (54%), Gaps = 17/140 (12%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIV 105
P RL+S+D RGL +A MI+V++ G I P + H+ W+GLTLAD V P FLF+V
Sbjct: 2 PTPATSRLLSVDALRGLAIAAMIVVNNPGDRRFIYPQLLHAHWHGLTLADVVFPLFLFLV 61
Query: 106 GVSLALTY-----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
GV +AL ++ + RK + RA LF LG+ L+ D ++
Sbjct: 62 GVCVALAIDLDKARDAKGRARLWRKILPRAAVLFALGL-------GETAYLRLSFD--EL 112
Query: 161 RWMGVLQRIAIAYLVAALCE 180
R GVLQRIA+ YL AA +
Sbjct: 113 RIPGVLQRIAVVYLAAAWLQ 132
>gi|325954677|ref|YP_004238337.1| hypothetical protein [Weeksella virosa DSM 16922]
gi|323437295|gb|ADX67759.1| hypothetical protein Weevi_1050 [Weeksella virosa DSM 16922]
Length = 402
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 98/382 (25%), Positives = 149/382 (39%), Gaps = 95/382 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG-- 106
+ R SLDVFRG T+ALMILV++ G + + H+ W+G T D V PFFLF VG
Sbjct: 2 KTTRYYSLDVFRGATIALMILVNNPGSWSYMFSPLQHASWHGCTPTDLVFPFFLFAVGNA 61
Query: 107 VSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQ------I 160
+S +++ V +K I R + +F +G+F+ F N + + +
Sbjct: 62 MSFGMSHLKLQASNVFWKKIIKRTILIFAIGLFINWWPFLKWENNELVFRAWRESEENGV 121
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R MGVLQRIAIA A S L+ + + R V L ++ L LL + L
Sbjct: 122 RIMGVLQRIAIANFFA--------------STLAYYYRDR----VVLKISILILLFYWAL 163
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
P+ G + ID ++G+ H+Y+
Sbjct: 164 ---------TFFLGGVDPYSLEGFIGTK------------IDVHLIGLAHMYKGE----- 197
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR--- 337
PFDPEGL S++ A L+G G I D R
Sbjct: 198 ---------------------GVPFDPEGLYSTIPAISQILLGYLVGVYIQKQGDIRWFS 236
Query: 338 ----------DRMLNWIIL---SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVL 384
RML+ + + + ++G LDF NK ++S SY T +
Sbjct: 237 RSLPASNLPIYRMLSGLFVLGAFALIMGYIWQLDF---PYNKKIWSSSYVIHTTALAIFT 293
Query: 385 LAGIYFMVRYISSHLMLKKPFD 406
+ + + + H + FD
Sbjct: 294 IGCMVWFAEVLQMHPAWMRFFD 315
>gi|417780880|ref|ZP_12428636.1| PF07786 family protein [Leptospira weilii str. 2006001853]
gi|410778851|gb|EKR63473.1| PF07786 family protein [Leptospira weilii str. 2006001853]
Length = 383
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 101/352 (28%), Positives = 147/352 (41%), Gaps = 98/352 (27%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TV MILV++ G I + H+ WNG T D V PFFL
Sbjct: 2 EKKSTQNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAKWNGCTPTDLVFPFFL 61
Query: 103 FIVG--VSLALTYKNFPCK----VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
F VG + ++L KN + V ++++ L LL F + F
Sbjct: 62 FAVGGSIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSF------------ 109
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
A++R GVLQRI Y V A SL+ + G V+ ++
Sbjct: 110 -AELRIPGVLQRIGFVYWVVA----------------SLYLIFPGKKVLIFLIPV----- 147
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L V W + T + P V G A IDR I+G +HL++
Sbjct: 148 ---LLVHTW-----ILTHIAPP----GEAMVSLEQGKDIGA--WIDRVIIGEKHLWK--- 190
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGH-LIVHFKD 335
+S+T +DPEGLLS V + T L G+ G L +
Sbjct: 191 FSKT------------------------WDPEGLLSGVASIATSLFGVLCGFILFLREGG 226
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGAS 381
R R+L+ GLG FVG + +NK+L++ SY TAG S
Sbjct: 227 GRSRVLS-------TFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAGLS 271
>gi|194367192|ref|YP_002029802.1| hypothetical protein Smal_3420 [Stenotrophomonas maltophilia
R551-3]
gi|194349996|gb|ACF53119.1| conserved hypothetical protein [Stenotrophomonas maltophilia
R551-3]
Length = 355
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 61/192 (31%), Positives = 92/192 (47%), Gaps = 18/192 (9%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RRL S+D RG+TVA M+LV++ G + + HS W+G T D V PFFLF+VGVS+A
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ VA R A+ R + L I + G H + + + +D R GVLQRIA
Sbjct: 67 FSVAPRALD-VALRPALARGVLERALRILVAGALLHLL--IWWALDTHHFRIWGVLQRIA 123
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ + + ++ + V + + L Y LLL G P
Sbjct: 124 VCAALVGVLAVYARPRVQVGALIVLLVGYT------------VLLLGIGDLAPWTNPASR 171
Query: 231 VETSSSSPWIFN 242
++T+ +PWI+
Sbjct: 172 LDTALFAPWIYQ 183
>gi|196233857|ref|ZP_03132695.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
gi|196222051|gb|EDY16583.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
Length = 437
Score = 77.8 bits (190), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 105/405 (25%), Positives = 155/405 (38%), Gaps = 77/405 (19%)
Query: 12 NDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQ--RRLISLDVFRGLTVAL 69
+D I + A + LE +P N P+ Q RL+SLD RG +
Sbjct: 2 SDHARIFAKANLDAASGASLEEVAQIPSPNV---PVSPETGQPAGRLVSLDALRGFDMFW 58
Query: 70 MILVDDV-------------GGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF 116
++ V G+ H W GL D + P FLFI+G+S+ +
Sbjct: 59 IVGAGAVIQSLDKMCRTPFTAGLAWQFKHVHWKGLHCYDVIFPLFLFIIGISIVFSLDKA 118
Query: 117 PC---KVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAY 173
K + R++ LF LG+ GGF N++ G GVL RIA+ Y
Sbjct: 119 LATGGKAQVLTRVARRSVLLFALGVLYYGGFMKPWPNVQLG---------GVLPRIALCY 169
Query: 174 LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVET 233
L AAL + R RG A L Y LL + PD Q PV
Sbjct: 170 LAAALI-------------YTFIRSTRGLLAAAAALLIGYWALLAFVPFPDLQLRKPVVE 216
Query: 234 SSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
+ GS PA A + +R + G+ YR N D+ M
Sbjct: 217 EIAER---------IGSDSPAAIAAAVPER-VHGLYEEYRN---------LANYVDFLYM 257
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL 353
P Q F EGLLS++ + L G G L+ + K R + W L+G
Sbjct: 258 PGRK---AQFYFINEGLLSTIPSIALSLFGAVAGLLLKNQKVLPRRKIAW------LVGA 308
Query: 354 GLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G++ +G + L K +++ S+ + G S ++LA Y++V
Sbjct: 309 GVAFIVLGRVWAIDLPLIKRIWTSSFILVATGWSALMLALFYYIV 353
>gi|375146803|ref|YP_005009244.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361060849|gb|AEV99840.1| Protein of unknown function DUF2261, transmembrane [Niastella
koreensis GR20-10]
Length = 376
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 89/351 (25%), Positives = 142/351 (40%), Gaps = 77/351 (21%)
Query: 51 HQQRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGV 107
QQ+R ++LDVFRG+T+ MI+V+ D + H+ W+G T D V P FLF VG
Sbjct: 2 QQQKRFLALDVFRGMTICFMIIVNTSPDGSHTYAPLLHAQWHGFTPTDLVFPSFLFAVGN 61
Query: 108 SLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV----DIAQI 160
+++ ++N V K + R L +F+LG + F ++ + GV +
Sbjct: 62 AMSFVMPRWENASTGFV-LGKILKRTLLIFILGYLMYWFPFVRMDKVT-GVYAFYPFEKT 119
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGL 220
R GVLQRIA+AY A+L + F K+R ++ + LY +LY
Sbjct: 120 RVFGVLQRIALAYCFASL--------------MLYFLKFRATIIITAAILLLYWPVLYFF 165
Query: 221 YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRT 280
S+ P A NAV +D +LG HLY
Sbjct: 166 ------------GDSADP------------LSLAGNAVLKLDLWLLGPDHLYHGE----- 196
Query: 281 KQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM 340
PFDPEG LS+ A + G G + + +
Sbjct: 197 ---------------------GVPFDPEGFLSTFPAIANVVGGYWVGRFLQQKGGTYEAL 235
Query: 341 LNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFM 391
++ L+ L ++ +NK L++ S+ T G +++A I ++
Sbjct: 236 TKLMLAGFALLVLAHFWNY-SFPINKKLWTSSFVLHTVGIDCLIIAAIVYI 285
>gi|190575857|ref|YP_001973702.1| transmembrane protein [Stenotrophomonas maltophilia K279a]
gi|190013779|emb|CAQ47415.1| putative transmembrane protein [Stenotrophomonas maltophilia K279a]
Length = 355
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/130 (40%), Positives = 72/130 (55%), Gaps = 7/130 (5%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RRL S+D RG+TVA M+LV++ G + + HS W+G T D V PFFLF+VGVS+A
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ + A R A+ R + L I + G H + + + +D R GVLQRIA
Sbjct: 67 FSVAP-RAQDAAARPALARGVLERALRILVAGALLHLL--IWWALDTHHFRIWGVLQRIA 123
Query: 171 I-AYLVAALC 179
+ A LV L
Sbjct: 124 VCAALVGVLA 133
>gi|374310943|ref|YP_005057373.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358752953|gb|AEU36343.1| protein of unknown function DUF1624 [Granulicella mallensis
MP5ACTX8]
Length = 385
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 51/140 (36%), Positives = 75/140 (53%), Gaps = 16/140 (11%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVG 106
+ +R+ S+DV RG T+A MILV+ G A + H+ WNG T D V P FLF+ G
Sbjct: 12 ELTSKRIPSVDVLRGFTLAAMILVNAAGEWPHAYWPLKHAQWNGCTPTDLVFPTFLFLTG 71
Query: 107 VSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRW 162
SL ++++ + V R+ L R++ LF +G+ L N L Y I +R
Sbjct: 72 TSLVFSFRSRLARGVGKRELFLHTLKRSVILFFIGVLL--------NALPY-FHIGTLRI 122
Query: 163 MGVLQRIAIAYLVAALCEIW 182
GVLQRIA+ YL ++ +W
Sbjct: 123 YGVLQRIALCYLCVSVLYLW 142
>gi|389793498|ref|ZP_10196662.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter fulvus Jip2]
gi|388434056|gb|EIL91012.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter fulvus Jip2]
Length = 354
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 113/285 (39%), Gaps = 84/285 (29%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+RL SLD RG TVA M+LV+D G I + H+PW+G T D V PFFLF+VGVS
Sbjct: 1 MSKRLPSLDALRGCTVAAMLLVNDPGDWGHIYAPLEHAPWHGCTPTDLVFPFFLFVVGVS 60
Query: 109 LALTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHGINNL-KYGVDIAQIRWM 163
AL + VA + A+ RAL + LG+ IN L + + A +R+
Sbjct: 61 SALALLPRLEQGVAPGALVKAALWRALRILALGV--------AINLLAAWLLPHAHLRFP 112
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWV-VALVLTTLYLLLLYGLYV 222
GVLQRI I + AL ++ + R W+ + +L + LLL G
Sbjct: 113 GVLQRIGICFAAVAL--------------FAVHTRPRTQWIAIGGILLGYWGLLLAG--- 155
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
GS P N V D + G P R
Sbjct: 156 --------------------------GSVAPWVNIVSRTDSVVFGHFVYLIDPATGRGH- 188
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG 327
DPEGLL+++ + T L+GL G
Sbjct: 189 -----------------------DPEGLLATLPSLATTLLGLRVG 210
>gi|388456506|ref|ZP_10138801.1| hypothetical protein FdumT_08017 [Fluoribacter dumoffii Tex-KL]
Length = 372
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 149/362 (41%), Gaps = 96/362 (26%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMP 99
SNS +T+ R++SLDVFRGLT+ALM+LV+ +G + H WNG +LAD V P
Sbjct: 3 SNSTETK------RILSLDVFRGLTMALMVLVNSLGSRENYKILMHVEWNGCSLADLVFP 56
Query: 100 FFLFIVGVSLALTYK---NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
FLFIVG++ ++ + N K R + R L L G +N +D
Sbjct: 57 AFLFIVGITTVISLQRHLNDESKAQLYRSILTRTLLLMFFG--------LFLNIFPKQID 108
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
++ IR G+LQRIA YL+ C I L L +R ++ + + Y
Sbjct: 109 LSTIRIYGILQRIAWCYLI---CSI-----------LYLHTSFRTQIIILVGILIGYWFF 154
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L + VP ++ + S + W V I+ + +HL K
Sbjct: 155 LTQIPVPGPGFD---QLSMAKNW------------------VSYIETHLFSPRHLLFK-- 191
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
FDPEG LS++ A T L GL G ++
Sbjct: 192 ---------------------------NFDPEGFLSTIPAIATTLSGLLVGQYLLTPGSK 224
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLLAGIYF 390
+ S LI +G+ FV +NK L++ ++ ++G S ++ +F
Sbjct: 225 LKK-------SFTLITIGILCLFVAGFWNHYFPINKNLWTSTFVLWSSGFSLIVFGLCFF 277
Query: 391 MV 392
++
Sbjct: 278 VI 279
>gi|359728547|ref|ZP_09267243.1| hypothetical protein Lwei2_17159 [Leptospira weilii str.
2006001855]
Length = 383
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 100/350 (28%), Positives = 146/350 (41%), Gaps = 98/350 (28%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFL 102
+ + + R++SLD+FRG+TV MILV++ G I + H+ WNG T D V PFFL
Sbjct: 2 EKKSTQNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAKWNGCTPTDLVFPFFL 61
Query: 103 FIVG--VSLALTYKNFPCK----VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
F VG + ++L KN + V ++++ L LL F + F
Sbjct: 62 FAVGGSIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSF------------ 109
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
A++R GVLQRI Y V A SL+ + G V+ ++
Sbjct: 110 -AELRIPGVLQRIGFVYWVVA----------------SLYLIFPGKKVLIFLIPV----- 147
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L V W + T + P V G A IDR I+G +HL++
Sbjct: 148 ---LLVHTW-----ILTHIAPP----GEAMVSLEQGKDIGA--WIDRVIIGEKHLWK--- 190
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGH-LIVHFKD 335
+S+T +DPEGLLS V + T L G+ G L +
Sbjct: 191 FSKT------------------------WDPEGLLSGVASIATSLFGVLCGFILFLREGG 226
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAG 379
R R+L+ GLG FVG + +NK+L++ SY TAG
Sbjct: 227 GRSRVLS-------TFGLGFLFTFVGLLWDQSLPMNKSLWTGSYAAYTAG 269
>gi|114048505|ref|YP_739055.1| hypothetical protein Shewmr7_3014 [Shewanella sp. MR-7]
gi|113889947|gb|ABI43998.1| conserved hypothetical protein [Shewanella sp. MR-7]
Length = 395
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 99/372 (26%), Positives = 151/372 (40%), Gaps = 90/372 (24%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTV-----------ALMILVDDVGGIL--PAINHSP 87
N+ + + + + RL+SLD RG + AL+IL G ++HS
Sbjct: 15 NAQNAAAKKRQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWAGWQWGDTQMHHSE 74
Query: 88 WNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFFH 146
WNG D + P F+F+ GV+L L+ K + R + R + L + L + H
Sbjct: 75 WNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKRLFLLLLLGILYNH 134
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVA 206
G VD ++R+ VL RIA A+ AAL +W H S +
Sbjct: 135 GWGT-GAPVDPEKVRYASVLGRIAFAWFFAALL-VW-----HTSLRTQ------------ 175
Query: 207 LVLTTLYLLLLYG---LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
VL L +L+ YG L++P FP G G P + +D
Sbjct: 176 -VLVALGILVAYGAVQLWLP-----FP--------------GGQAGELSPTESINAYVDS 215
Query: 264 KIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLI 322
+L G+ + R P DPEG+LS++ A V L
Sbjct: 216 LLLPGVSYQGRTP------------------------------DPEGVLSTLPAVVNALA 245
Query: 323 GLHFGHLIV--HFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGA 380
G+ GH IV H K ++ + + LG LD V + +NK L++ S+ +T+G
Sbjct: 246 GVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLDGV-IPVNKELWTSSFVLVTSGW 304
Query: 381 SGVLLAGIYFMV 392
S +LLA Y +V
Sbjct: 305 SMLLLALFYAIV 316
>gi|429738942|ref|ZP_19272716.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
F0055]
gi|429158431|gb|EKY00988.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
F0055]
Length = 400
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 88/367 (23%), Positives = 148/367 (40%), Gaps = 104/367 (28%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG-- 106
+ R++++D+ RG+T+A MILV++ G I H+ WNG+T D V PFF+F++G
Sbjct: 7 KSSRILAIDILRGITIAGMILVNNPGNWGRIFAPFEHAEWNGMTPTDLVFPFFMFVMGMC 66
Query: 107 VSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHG--------INNLKYGVDI- 157
+ +A+ +F C K R + ++L+G L G+F + +G +
Sbjct: 67 IYIAMRKFDFTCNKSTVYKITKRMVLIYLVG--LGIGWFAKFCFRWASPLEEASFGEQLW 124
Query: 158 ------AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
IR GVL R+AI Y + AL + +K +++ +L
Sbjct: 125 YMVWPFDSIRLTGVLARLAICYGITALLAVTVKHKNL-------------PYIIVTLLVG 171
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
+++L+ G G N + + DR +L H+
Sbjct: 172 YFIILMAG----------------------------NGFAYDETNILSIADRAVLTDVHM 203
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
Y D G DPEGLLS++ + L+G G L+
Sbjct: 204 YH---------------DNG-------------IDPEGLLSTLPSIAHTLLGFMVGSLLF 235
Query: 332 HFKD----HRD--------RMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
D H D +++ ++ + LI G L + G LNK ++S +Y +T G
Sbjct: 236 KTTDEHSEHTDVRTGIILSKVVPLFVVGTILIFSGFLLSY-GCPLNKKVWSPTYVLVTCG 294
Query: 380 ASGVLLA 386
+ LLA
Sbjct: 295 LASTLLA 301
>gi|344208862|ref|YP_004794003.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
gi|343780224|gb|AEM52777.1| Protein of unknown function DUF2261, transmembrane
[Stenotrophomonas maltophilia JV3]
Length = 360
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 93/345 (26%), Positives = 139/345 (40%), Gaps = 98/345 (28%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RRL S+D RG+TVA M+LV++ G + + HS W+G T D V PFFLF+VGVS+A
Sbjct: 12 RRLASIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 71
Query: 111 LTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ A R + RAL + L G L + ++ + R GVL
Sbjct: 72 FSVAPRALDAAARPALARGVLERALRILLAGALLHLLIWWALHTHHF-------RIWGVL 124
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRIA+ AAL + L+++ + R V + L Y +LL G+
Sbjct: 125 QRIAV---CAALVGV-----------LAVYARPRAQAAVLVTLLVGYTVLLLGI------ 164
Query: 227 YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
G P N +D L+ IY +
Sbjct: 165 ----------------------GDLAPWTNPASRLDTT------LFAPWIYQWHADTGLG 196
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL 346
DPEGLLS++ A + ++GL G L+ ++ R
Sbjct: 197 H------------------DPEGLLSTLGALASTVLGLIAGGLL---RNGR--------- 226
Query: 347 SSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAGASGVLL 385
S+ L GLG++ +G + LNK L++ SY T G + + L
Sbjct: 227 SAALAGLGVATAVLGLLLATVLPLNKQLWTPSYVLWTGGLAALAL 271
>gi|404485011|ref|ZP_11020215.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
YIT 11860]
gi|404340016|gb|EJZ66447.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
YIT 11860]
Length = 440
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 98/383 (25%), Positives = 151/383 (39%), Gaps = 93/383 (24%)
Query: 32 ERDPLLPPSNSNS----KQTRPQHQQRRLISLDVFRGLTVALMILVDDV----------- 76
E LL +N NS T+P Q RL SLD+ RG + L++ + V
Sbjct: 51 EISVLLFYNNLNSCMLGNATQPVGQ--RLASLDILRGFDLFLLVFLQPVLVSLGACVDSS 108
Query: 77 --GGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRAL 131
+L +H W G D VMP FLF+ GVS+ + Y+ + +K + R +
Sbjct: 109 VMNAVLYQFDHEVWEGFRFWDLVMPLFLFMTGVSMPFSFSKYERVESRRFIYKKVLRRFV 168
Query: 132 NLFLLGIFLQGGFFHGINNLKYGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVS 190
LFLLG+ +QG G+D+ IR + LQ IA YL+AAL ++ G +
Sbjct: 169 ILFLLGMVVQGNLL--------GLDLKYIRLYSNTLQAIAAGYLIAALIQLNFSLKGQMG 220
Query: 191 SKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGS 250
+LLLL Y FP+ G
Sbjct: 221 MT--------------------FLLLL--------AYWFPMT--------------FFGD 238
Query: 251 TGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGL 310
P N +DR +LG +R +Y S DY +W +
Sbjct: 239 FTPEGNFAERVDRFVLG---RFRDGVYWNADGTWSFSSDY------HYTW---------I 280
Query: 311 LSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIIL-SSCLIGLGLSLDFVGMHLNKALY 369
LSS+ VT ++G+ G ++ HR ++ + L LI + F M + K ++
Sbjct: 281 LSSLTFGVTVMLGVFAGRMMKEGTFHRQKVARRLFLVGVALIVAAWAWSF-QMPVIKRIW 339
Query: 370 SLSYTCLTAGASGVLLAGIYFMV 392
+ S T L+ G +L+A Y+ +
Sbjct: 340 TCSMTLLSGGYCFLLMALFYYWI 362
>gi|329963071|ref|ZP_08300851.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
12057]
gi|328529112|gb|EGF56042.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
12057]
Length = 381
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 90/376 (23%), Positives = 153/376 (40%), Gaps = 93/376 (24%)
Query: 52 QQRRLISLDVFRGLTVALMILV---DDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+RL++LDV RGLT+A MILV + + + H+ WNGLT D + PFFLF++GVS
Sbjct: 3 NNQRLVALDVMRGLTIAGMILVNTPETWSYVYAPLQHARWNGLTPTDVIFPFFLFMMGVS 62
Query: 109 LALTYK--NFPCKVVATRKAILRALNLFLLG--IFLQGGF------------FHGINNLK 152
+ ++ K +F K I R+L LFL+G I+ F F G +
Sbjct: 63 MYISLKKCSFHLSSHLLMKIIRRSLILFLIGTAIYALATFLGTLRDACRQPGFEGNPWKE 122
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
+ R GVLQR+ + Y + ++ L+ +Y H + +L
Sbjct: 123 AFASLPGTRIPGVLQRLGVCYGIGSII------------VLTCRHRYIPH-LAGGILAGY 169
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
+L+LL+G G N + ++DR + G +
Sbjct: 170 FLILLFG----------------------------NGFVHSPENILSVVDRTLFGDNMIN 201
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
I DPEG LS++ + LIG G + +
Sbjct: 202 DGGI-----------------------------DPEGALSTLPSIAQVLIGFCIGKICIE 232
Query: 333 FKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
D R+++ + S ++ +G + G LNK +++ S+ +T G + +LL ++
Sbjct: 233 TPDMREKLNKIFLYGSLMLIVGWLFSY-GCPLNKRVWTPSFVLVTCGFACLLLG---ILI 288
Query: 393 RYISSHLMLKKPFDYS 408
YI + K+ + +
Sbjct: 289 WYIDLRKVYKQTWTFE 304
>gi|404486905|ref|ZP_11022093.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
YIT 11860]
gi|404335959|gb|EJZ62425.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
YIT 11860]
Length = 373
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 149/369 (40%), Gaps = 111/369 (30%)
Query: 52 QQRRLISLDVFRGLTVALMI----LVDDVGGILPA---------INHSPWNGLTLADFVM 98
+ RL+SLD RG + ++ LV + + P + H PW+G T D +
Sbjct: 5 KNTRLLSLDTLRGFDMLFIMGFAPLVVTLNALHPTAVGDVIAGHMRHVPWDGFTQHDMIF 64
Query: 99 PFFLFIVGVSLALTYKNFPCKVVATR-----------KAILRALNLFLLGIFLQGGFFHG 147
P FLFI G+S FP + R + R + L LLG FL GF
Sbjct: 65 PLFLFIAGIS-------FPFSLAKQRGSGSSDKHIYLRVFRRGVTLVLLG-FLYNGFLQ- 115
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVAL 207
++ +R VL RI +A++ A + LK K + ++
Sbjct: 116 -------LNFPDVRLASVLGRIGLAWMFGAFIYMSLK-------------KSVQYGLIVF 155
Query: 208 VLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILG 267
+L +LLL + + PD + +SP + G N VG IDR L
Sbjct: 156 ILVGYWLLLAF-VPAPD--------AAGASP------LSIEG------NLVGYIDRHCLP 194
Query: 268 IQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG 327
+ +Y FDPEGLLS++ A VT L+G++ G
Sbjct: 195 GKLIY------------------------------GNFDPEGLLSTLPAIVTALLGIYAG 224
Query: 328 HLIVHFK----DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
++ + + + +L+ I L+ +GL + V +NK L+S S+TC G S +
Sbjct: 225 EIVRSTRLGSGERKSLLLSGI--GVVLVAIGLVWNTV-FPINKMLWSSSFTCFVGGLSFL 281
Query: 384 LLAGIYFMV 392
L A Y++V
Sbjct: 282 LFALFYYIV 290
>gi|294139796|ref|YP_003555774.1| hypothetical protein SVI_1025 [Shewanella violacea DSS12]
gi|293326265|dbj|BAJ00996.1| conserved hypothetical protein [Shewanella violacea DSS12]
Length = 378
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 93/366 (25%), Positives = 154/366 (42%), Gaps = 102/366 (27%)
Query: 53 QRRLISLDVFRGLTVALMILVDDV-GGILP------------AINHSPWNGLTLADFVMP 99
+RRL+SLD RG + ++ + + G+L ++HS W+G T D + P
Sbjct: 10 KRRLMSLDALRGFDMFWILGGEALFAGLLAWSSWQGWQWADAQMHHSQWHGFTFYDLIFP 69
Query: 100 FFLFIVGVSLALTYKNFPCKVVATR-----KAILRALNLFLLGIFLQGGFFHGINNLKYG 154
F+F+ GV+L L+ K +A R ++ R L G+ G+ G
Sbjct: 70 LFIFLSGVALGLSPKRLDKLPIAQRMPLYKHSVKRLFLLLFFGVLYNHGWGTGAP----- 124
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
V I ++R+ VL RIA A+ AA+ +W H S + +F TL +
Sbjct: 125 VAIDEVRYASVLGRIAFAWFFAAML-VW-----HTSFRTQVF-------------VTLGI 165
Query: 215 LLLYG---LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQH 270
L+ YG L++P FP GV G P +D L GI +
Sbjct: 166 LIAYGLLQLFMP-----FP--------------GGVGGVFTPQGTINAYVDTHFLPGITY 206
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
R P DPEG+LS++ A + G+ GH I
Sbjct: 207 QNR------------------------------PLDPEGILSTIPAVANAMAGVFVGHFI 236
Query: 331 VHFKDHRD----RMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
V K+H+ +++ ++ + ++ LG ++ + + +NK L++ S+ +T G S +LLA
Sbjct: 237 V--KEHKQGEWAKVVCLLLSGAFILALGWWVNLI-IPVNKDLWTSSFVLVTTGWSIILLA 293
Query: 387 GIYFMV 392
Y +V
Sbjct: 294 IFYALV 299
>gi|421090259|ref|ZP_15551055.1| putative membrane protein [Leptospira kirschneri str. 200802841]
gi|421129053|ref|ZP_15589263.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
gi|410000994|gb|EKO51618.1| putative membrane protein [Leptospira kirschneri str. 200802841]
gi|410359757|gb|EKP06816.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
Length = 369
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 90/327 (27%), Positives = 140/327 (42%), Gaps = 78/327 (23%)
Query: 59 LDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY-- 113
+D+FRG+TVA MILV++ G I + H+ WNG T D V PFFLF VG+S+ L+
Sbjct: 1 MDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVYS 60
Query: 114 KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAY 173
KN K +R++ L L+G+FL F G +++R GVLQRI Y
Sbjct: 61 KNKIHKSKIWFGICIRSITLILIGLFLN---FFG------EWSFSELRIPGVLQRIGFVY 111
Query: 174 LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVET 233
+ A + L R W+ L++ T L+ Q P E+
Sbjct: 112 WIVASLHLILPK-----------RMILISWIPILLVHTWVLI----------QIPAPGES 150
Query: 234 SSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P + IDR + G HL++ +S+T
Sbjct: 151 IV--------------YLEPGKDIGAWIDRNVFGENHLWK---FSKT------------- 180
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDRMLNWIILSSCLIG 352
+DPEGL S + + T L+G+ G ++ + + ++L+ L+
Sbjct: 181 -----------WDPEGLFSGISSIATSLLGVFCGSILSSKTNEIKKQILSIFGFGILLVL 229
Query: 353 LGLSLDFVGMHLNKALYSLSYTCLTAG 379
+GL D + +NK+L++ SY TAG
Sbjct: 230 VGLLWD-QNLPMNKSLWTGSYVIYTAG 255
>gi|116331948|ref|YP_801666.1| hypothetical protein LBJ_2457 [Leptospira borgpetersenii serovar
Hardjo-bovis str. JB197]
gi|116125637|gb|ABJ76908.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
Hardjo-bovis str. JB197]
Length = 363
Score = 75.5 bits (184), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 96/351 (27%), Positives = 147/351 (41%), Gaps = 94/351 (26%)
Query: 65 LTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS--LALTYKNFPCK 119
+TV MILV++ G + + H+ WNG T D V PFFLF VG S ++L KN +
Sbjct: 1 MTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLYSKNGINR 60
Query: 120 VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
+ +R ++L LLG+FL N +++R GVLQRI Y V A
Sbjct: 61 IRIWIGICIRGISLILLGLFL---------NFFGEWTFSELRIPGVLQRIGFVYWVVA-- 109
Query: 180 EIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPW 239
+LF + G V+ ++ L V W + T + P
Sbjct: 110 --------------TLFLVFPGKKVLVFLIPI--------LLVHTW-----ILTHIAPP- 141
Query: 240 IFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPS 299
V G A IDR I G +HL++ +S+T
Sbjct: 142 ---GESMVSLEQGKDIGA--WIDRTIFGEKHLWK---FSKT------------------- 174
Query: 300 WCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDRMLNWIILSSCLIGLGLSLD 358
+DPEG LS + + T L G+ G ++ + ++R+L+ + GLG
Sbjct: 175 -----WDPEGFLSGIASIATSLFGVICGFILFRREGRGKNRVLS-------IFGLGFLFT 222
Query: 359 FVG------MHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKK 403
FVG + +NK+L++ SY T G + L G + Y+ S ++LKK
Sbjct: 223 FVGLLWDRSLPMNKSLWTGSYAVYTTGFA-FLCIGFF---EYLDSLILLKK 269
>gi|389799428|ref|ZP_10202419.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter sp. 116-2]
gi|388442725|gb|EIL98904.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter sp. 116-2]
Length = 353
Score = 75.1 bits (183), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 136/347 (39%), Gaps = 88/347 (25%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
+RL SLD RG TVA M+LV+D G + + H+ W+G T D V FFLF+VGVS+A
Sbjct: 2 KRLASLDALRGCTVAAMLLVNDPGDWGHVYWPLEHAAWHGCTPTDLVFSFFLFVVGVSVA 61
Query: 111 LT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNL-KYGVDIAQIRWMGV 165
L + TR A RAL + LG+ IN L + + A +R+ GV
Sbjct: 62 LAILPRLEQGAAPSALTRAATWRALRILALGV--------AINLLAAWLLPQAHLRFPGV 113
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDW 225
LQRIA+ + AL I H + W +A +L + LL G
Sbjct: 114 LQRIALCFAGVALFAI------HTKPRTQ-------WWAIAALLIGYWGLLRLG------ 154
Query: 226 QYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSI 285
GS P N +D + G P R
Sbjct: 155 -----------------------GSLEPWTNLASRVDSAVFGRFVYLIDPASGRGH---- 187
Query: 286 NSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII 345
DPEGLLS++ + L+GL G + + R L
Sbjct: 188 --------------------DPEGLLSTLPSLAGTLLGLRMG---CWLRREQFRTLLLAG 224
Query: 346 LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ L+G +L + NK L++ S+ T G + + L ++ ++
Sbjct: 225 IACLLLG---ALWSPWLPFNKNLWTPSFVLWTTGWATLALLALHVLI 268
>gi|88859970|ref|ZP_01134609.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
gi|88817964|gb|EAR27780.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
Length = 378
Score = 74.7 bits (182), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 102/381 (26%), Positives = 154/381 (40%), Gaps = 102/381 (26%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTV-----------ALMILVDDVGGILP--AINHSPWN 89
+ +T+PQ +RRL SLD RG+ + AL IL G + HS W+
Sbjct: 2 TNTETKPQ--KRRLASLDALRGMDMFWILGGEKIFAALFILTGWTGWQVAHGQTLHSNWH 59
Query: 90 GLTLADFVMPFFLFIVGVSLALTYKN-----FPCKVVATRKAILRALNLFLLGIFLQGGF 144
G T D + P F+F+ GV++ L+ K F + V KA+ R L G+ G+
Sbjct: 60 GFTFYDLIFPLFIFLAGVAMGLSPKRIDHLPFQERRVYYAKALKRLFLLAGFGVLYNHGW 119
Query: 145 FHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWV 204
GI ++ +IR+ VL RIAIA+ V + +W H S + + +
Sbjct: 120 GTGIP-----FNLEEIRYASVLGRIAIAWFVCVML-VW-----HTSLRTQI--------I 160
Query: 205 VALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRK 264
A+ L Y LLL +VP V G G A +D+
Sbjct: 161 TAVSLLIGYWLLL--CFVP-------------------VPGGQAGDLTMAGTWNAWVDQH 199
Query: 265 IL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
+L GI + R DPEGLLS+V A V L+G
Sbjct: 200 LLPGITYQNRAT------------------------------DPEGLLSNVPAVVNALMG 229
Query: 324 LHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMH------LNKALYSLSYTCLT 377
+ G LI ++ W + S L G GL+ +G +NK L++ S+ +T
Sbjct: 230 VFAGRLIA----KANQFGEWKTV-SYLFGAGLTSLALGWAWNMVFPVNKDLWTSSFVLVT 284
Query: 378 AGASGVLLAGIYFMVRYISSH 398
G S + LA Y +V +++
Sbjct: 285 VGWSLIFLASFYAVVDLLNTQ 305
>gi|408369302|ref|ZP_11167083.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
gi|407745048|gb|EKF56614.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
Length = 345
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 86/337 (25%), Positives = 135/337 (40%), Gaps = 79/337 (23%)
Query: 77 GGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFP--CKVVATRKAILRALNLF 134
G + ++H+ W+G TL D V P FLF+VG +++ + + F + +K I R +F
Sbjct: 11 GSVYRPLSHASWHGFTLTDLVFPTFLFVVGNAMSFSMRKFEQTSQAAFLKKVIKRTFVIF 70
Query: 135 LLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLS 194
+G L F LK + R GVLQRIA+ Y A+L H
Sbjct: 71 AIGFLLSWFPFFRDGQLK---PLEDARIFGVLQRIALCYFFASLVI-------H------ 114
Query: 195 LFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPA 254
+ K +G V ++V Y L++Y + G
Sbjct: 115 -YFKIKGALVFSMVALLGYHLIMYTM----------------------------GDYTLE 145
Query: 255 CNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSV 314
NA +D +LG HLY+ PFDPEGLLS++
Sbjct: 146 GNAALKLDLWLLGPNHLYQGE--------------------------GIPFDPEGLLSTL 179
Query: 315 MATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYT 374
ATV + G G + + + +I + L+ L L D + +NK +++ SY
Sbjct: 180 PATVNVIFGYFAGLFLQQSGKNFKTIALLMIGGATLVFLALGWDLL-FPINKKIWTSSYV 238
Query: 375 CLTAGASGVLLAGIYFMVRYISSHLMLKKPFDYSYAC 411
LT G ++LA F+V I ++ KK + Y +
Sbjct: 239 LLTVGIDVMVLA---FLVYGI--EILQKKKWTYFFEV 270
>gi|47213040|emb|CAF93449.1| unnamed protein product [Tetraodon nigroviridis]
Length = 297
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 81/268 (30%), Positives = 116/268 (43%), Gaps = 72/268 (26%)
Query: 155 VDIAQIRWMGVLQRIAIAYLVAAL----------CEIWLKGDGHVSSKLSLFRKYRGHWV 204
V +R GVLQR+A+AYLV A C + D S + + Y WV
Sbjct: 1 VSWDNLRIPGVLQRLALAYLVVACLDLLVARRFSCVFCVLQDAWWSQGIDILL-YWPAWV 59
Query: 205 VALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG--PACN--AVGM 260
L+L +++L + + L VPD T P G G G P C A G
Sbjct: 60 CVLLLESVWLFITFLLPVPD------CPTGYLGP-------GGIGDMGLYPNCTGGAAGF 106
Query: 261 IDRKILGIQHLYRKP----IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
IDR +LG +H+Y+ P IY+ P+DPEG+L S+ +
Sbjct: 107 IDRWLLGEKHIYQNPSSQGIYAT----------------------HLPYDPEGILGSINS 144
Query: 317 TVTCLIGLHFGHLIVHFKD-HR---DRMLNW--------IILSSCLIGLGLSLDFVGMHL 364
+ +GL G +I+H +D H+ R L W +L++C GL + +
Sbjct: 145 ILIAFLGLQAGKIILHHRDLHQGVISRFLIWGFLLGIISAVLTNCSTNQGL------IPI 198
Query: 365 NKALYSLSYTCLTAGASGVLLAGIYFMV 392
NK L+SLSY A + VLLA IY+ V
Sbjct: 199 NKNLWSLSYVTTLACFAYVLLALIYYTV 226
>gi|392548092|ref|ZP_10295229.1| hypothetical protein PrubA2_17028 [Pseudoalteromonas rubra ATCC
29570]
Length = 373
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 154/383 (40%), Gaps = 105/383 (27%)
Query: 50 QHQQRRLISLDVFRGLTV-----------ALMILVDDVG-GILPAIN-HSPWNGLTLADF 96
+ ++RL SLD RG+ + AL +L G I A HS W+G T D
Sbjct: 2 SNNKKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKIFEAQTLHSAWHGFTFYDL 61
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATR-----KAILRALNLFLLGIFLQGGFFHGINNL 151
+ P F+F+ GV++ L K +A R KAI R L L G+ G+ GI
Sbjct: 62 IFPLFIFLSGVAMGLRPKRIDHLPMAERKPIYIKAIKRLGLLCLFGVLYNHGWGTGIP-- 119
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
D +IR+ VL RIAIA+ A+ +W H S K + LT
Sbjct: 120 ---ADFGEIRYASVLGRIAIAWFFCAML-VW-----HCSLKTT-------------ALTG 157
Query: 212 LYLLLLYGL---YVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-G 267
+ +LL Y L ++P V G G PA + +D+ +L G
Sbjct: 158 VGILLAYWLLLCFIP-------------------VPGGSAGELTPAGSWNAWVDQALLPG 198
Query: 268 IQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG 327
I + R P DPEG+LSS A V + G+ G
Sbjct: 199 ITYQNR------------------------------PVDPEGILSSFPAIVNAIAGVFAG 228
Query: 328 HLIVHFKDHRDRMLNWII----LSSCLIGLGLSLDF-VGMHLNKALYSLSYTCLTAGASG 382
LI D++ W + ++ ++ L L + + +NK L++ S+ +T G S
Sbjct: 229 QLIA----QSDKLGQWQVAGRLFAAGIVSLALGWLWDLQFPVNKELWTSSFVLVTVGWSA 284
Query: 383 VLLAGIYFMVRYISSHLMLKKPF 405
+ LA ++F + I + L PF
Sbjct: 285 IFLA-VFFTLVDILNGQKLAYPF 306
>gi|333382729|ref|ZP_08474395.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828330|gb|EGK01039.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
BAA-286]
Length = 389
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/154 (33%), Positives = 81/154 (52%), Gaps = 19/154 (12%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
Q RL+SLDV RG+T+A MI+V++ G + H W+GLT D V PFF+FI+G
Sbjct: 3 QKPSGRLLSLDVLRGITIAGMIMVNNSGSGEYTYAPLKHVAWDGLTPTDLVFPFFMFIMG 62
Query: 107 VSLALTYK--NFPCKVVATRKAILRALNLFLLGIFLQ--GGFFHGI-----NNLKYG--- 154
+S ++ + NF K + R + +FL+G+ L G F +NL +
Sbjct: 63 ISTYISLRKFNFEFNTPTLLKILKRTIVIFLIGLGLSWLGLSFRTYHMLEPDNLGFWERF 122
Query: 155 ----VDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
D +R +GV+QR+A+ Y A++ I +K
Sbjct: 123 FRAITDFGHLRTLGVMQRLALTYGAASIIAITVK 156
Score = 46.6 bits (109), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 36/122 (29%), Positives = 56/122 (45%), Gaps = 27/122 (22%)
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
V ++D+ ILGI H+Y+ D G DPEGLLS++ A
Sbjct: 189 VSIVDQAILGINHMYK---------------DSG-----------LAIDPEGLLSTIPAI 222
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
LIG G LI++ KD+ R+ I+ + L G L + G +NK +++ ++ T
Sbjct: 223 AHVLIGFCCGALIMNTKDNDKRISQLFIVGTILTFAGFLLSY-GCPINKKIWTPTFVLAT 281
Query: 378 AG 379
G
Sbjct: 282 CG 283
>gi|375110537|ref|ZP_09756759.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
gi|374569481|gb|EHR40642.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
Length = 394
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 91/352 (25%), Positives = 147/352 (41%), Gaps = 63/352 (17%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVM 98
+ +Q + R+++LD RGL + MILV++ G + P + H+ W+G T D +
Sbjct: 4 QHCQQILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIF 63
Query: 99 PFFLFIVGVSL--ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGF--FHGINNLKYG 154
P FL +VG+++ +L + K R+ +RAL L+LLG+FL + F N
Sbjct: 64 PAFLVMVGMAIPYSLAGRQLLPKAEQIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQ 123
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
+ +RW GVLQRI I Y L ++ G + W+ L L LY
Sbjct: 124 QKLLTVRWSGVLQRIGIVYFCTLLIVLYSGTRGRIL------------WLSGLCL--LYF 169
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRK 274
LL+ P+ N G N +D ++LG H+Y +
Sbjct: 170 LLM-----------------QFVPYRDNYGHTFVGLWEHGNNLAAWLDHQLLGPNHVYFR 212
Query: 275 PIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
S FDPEG+LS++ A +CL G+ L+ K
Sbjct: 213 ------------------------SATPFAFDPEGILSTLPAIASCLSGVLMAQLL-QSK 247
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
L ++LS L + +NK L++ ++ L++G + V+LA
Sbjct: 248 AELAFKLRVLLLSGLAAVWIAELAHPALPINKMLWTPTFVLLSSGFTAVILA 299
>gi|113971267|ref|YP_735060.1| hypothetical protein Shewmr4_2932 [Shewanella sp. MR-4]
gi|113885951|gb|ABI40003.1| conserved hypothetical protein [Shewanella sp. MR-4]
Length = 395
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 102/376 (27%), Positives = 151/376 (40%), Gaps = 98/376 (26%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTV-----------ALMILVDDVGGIL--PAINHSP 87
N+ + + RL+SLD RG + AL+IL G ++HS
Sbjct: 15 NAQDAAAKKSQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWAGWQWGDTQMHHSE 74
Query: 88 WNGLTLADFVMPFFLFIVGVSLALTYKNFPC-----KVVATRKAILRALNLFLLGIFLQG 142
WNG D + P F+F+ GV+L L+ K ++ R I R L LLGI
Sbjct: 75 WNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKRLFLLLLLGILYNH 134
Query: 143 GFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGH 202
G+ G+ D ++R+ VL RIA A+ AAL +W H S +
Sbjct: 135 GWGTGVP-----ADPEKVRYASVLGRIAFAWFFAALL-VW-----HTSLRTQ-------- 175
Query: 203 WVVALVLTTLYLLLLYG---LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVG 259
VL L +L+ YG L++P FP G G P +
Sbjct: 176 -----VLVALGILVAYGAMQLWLP-----FP--------------GGQAGVLSPTESINA 211
Query: 260 MIDRKIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATV 318
+D +L G+ + R P DPEG+LS++ A V
Sbjct: 212 YVDSLLLPGVSYQGRTP------------------------------DPEGVLSTLPAVV 241
Query: 319 TCLIGLHFGHLIV--HFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCL 376
L G+ GH IV H K ++ + + LG LD V + +NK L++ S+ +
Sbjct: 242 NALAGVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLDGV-IPVNKELWTSSFVLV 300
Query: 377 TAGASGVLLAGIYFMV 392
T+G S +LLA Y +V
Sbjct: 301 TSGWSMLLLALFYALV 316
>gi|224064476|ref|XP_002301495.1| predicted protein [Populus trichocarpa]
gi|222843221|gb|EEE80768.1| predicted protein [Populus trichocarpa]
Length = 136
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 32/45 (71%), Positives = 39/45 (86%)
Query: 70 MILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
MILVDD GG P INHSPW G+TL+DFVMPFFLF+VG+S++L +K
Sbjct: 1 MILVDDAGGAFPCINHSPWFGVTLSDFVMPFFLFVVGLSISLVFK 45
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/65 (49%), Positives = 38/65 (58%), Gaps = 1/65 (1%)
Query: 217 LYGLYVPDWQYEFP-VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
LYGLY PDW++E P V CGV+GS P CNA G+IDR LG LY+ P
Sbjct: 68 LYGLYDPDWEFEVPSTHLFGYKSGTKTVNCGVKGSLEPPCNAAGLIDRFFLGEHPLYQHP 127
Query: 276 IYSRT 280
+Y RT
Sbjct: 128 VYRRT 132
>gi|322436289|ref|YP_004218501.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
gi|321164016|gb|ADW69721.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
Length = 389
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 95/345 (27%), Positives = 149/345 (43%), Gaps = 77/345 (22%)
Query: 51 HQQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGV 107
+ RL+S+D+ RGLT+ MILV+D G A + H+ WNG T D V P FLF+VG+
Sbjct: 10 QRPSRLLSIDLLRGLTIGFMILVNDAGSERDAYAPLQHAWWNGFTPTDLVFPTFLFLVGI 69
Query: 108 SLALTY-----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRW 162
+ L+ +N P ++ + RA+ ++++GI F +A +R+
Sbjct: 70 TTVLSLGSRMDRNVP-RMTLFWSVLRRAVLIYVVGILASTFPF---------THLAGMRF 119
Query: 163 MGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYV 222
+GVL RIA+ YL+ V S L + + ++ V+ Y LL +V
Sbjct: 120 VGVLPRIALCYLI-------------VGSLLLISKSWKDKVVILAACLIGYWALLR--FV 164
Query: 223 PDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQ 282
P Y P +V R N +DR + QHL Y RT+
Sbjct: 165 PVPGYGVPTH---------DVPLLDRDG-----NLAAWLDRWMFAPQHL-----YERTR- 204
Query: 283 CSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLN 342
DPEGLLS++ A T L+GL G + R +++
Sbjct: 205 -----------------------DPEGLLSTIPAVGTALLGLLTGLFLRSQHALRTKIMG 241
Query: 343 WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAG 387
++ I LGL + + + +NK +++ SY G S +LLA
Sbjct: 242 IAGAATVSILLGLLWN-ITLPINKKMWTSSYVLFAGGLSMLLLAA 285
>gi|442611023|ref|ZP_21025729.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
luteoviolacea B = ATCC 29581]
gi|441746951|emb|CCQ11791.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
luteoviolacea B = ATCC 29581]
Length = 373
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 104/363 (28%), Positives = 151/363 (41%), Gaps = 96/363 (26%)
Query: 53 QRRLISLDVFRGLTV-----------ALMILVDDVG--GILPAINHSPWNGLTLADFVMP 99
++RL SLD RG + AL +L G HS W+G T D + P
Sbjct: 5 KKRLASLDALRGFDMMWILGGQGIFAALFVLTGWTGWRTFEAHTVHSDWHGFTFYDLIFP 64
Query: 100 FFLFIVGVSLALTYK---NFPC--KVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG 154
F+F+ GV++ L+ K + P + RKA+ R + L LLGI G+ GI
Sbjct: 65 LFIFLSGVAMGLSPKRIDHLPMSERTPIYRKALKRFVLLCLLGILYNHGWGTGIP----- 119
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
D ++IR+ VL RIA A+L+ AL +W H S K V + L+ L
Sbjct: 120 ADFSEIRYSSVLGRIAFAWLICALL-VW-----HFSLKQ----------VAYIGLSILIT 163
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRG--STGPACNAVGMIDRKIL-GIQHL 271
++ Y+P V G G S G + NA ID+ L G+++
Sbjct: 164 YWIWLCYIP-------------------VPGGDAGDLSIGGSWNA--WIDQNFLPGVRYQ 202
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
R P+D PEG+LSS+ A V + GL G LI
Sbjct: 203 DR--------------------PVD----------PEGILSSLPAVVNAIAGLFAGQLIK 232
Query: 332 HFKDHRDRMLNWIILSS--CLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
+ + ++ S I LG D V +NK L++ S+T +T G S +LLA Y
Sbjct: 233 RAPEKGEWKCVALLFSGGVLFIALGWLWDLV-FPVNKELWTSSFTLVTIGWSAILLAVFY 291
Query: 390 FMV 392
+V
Sbjct: 292 VLV 294
>gi|291516094|emb|CBK65304.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
Length = 331
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 128/320 (40%), Gaps = 86/320 (26%)
Query: 83 INHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGI 138
+ H WNGLT D + P FLFI GVS + ++ R K I R + L +LG+
Sbjct: 9 MGHVSWNGLTQHDTIFPLFLFIAGVSFPFSLSKQRASGISERRILFKVIRRGMTLIVLGM 68
Query: 139 FLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRK 198
I N + D A +R VL RI +A++ A+L L ++ K
Sbjct: 69 ---------IYNGLFRFDFASLRVASVLGRIGLAWMFASL--------------LYMYCK 105
Query: 199 YRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAV 258
R V A V+ Y LL+Y + PD + P A N
Sbjct: 106 VRTRAVFAAVVLIGYSLLMYLVVAPD-------APDGTDP------------LSVAGNIA 146
Query: 259 GMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATV 318
G +DR+ L T C FDPEGLLS++ A V
Sbjct: 147 GWVDRQWL-----------PGTFAC-------------------GSFDPEGLLSTLPAIV 176
Query: 319 TCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLS 372
+ L G+ G ++ + L+ ++ C+ +++ +G + +NK L+S S
Sbjct: 177 SALFGMFTGEFLLRKRSS----LSGEQIALCMALAAVAITVIGIIWNCWIPINKKLWSSS 232
Query: 373 YTCLTAGASGVLLAGIYFMV 392
+TC+ G S L A Y+++
Sbjct: 233 FTCVVTGYSLGLFALFYYLI 252
>gi|315498708|ref|YP_004087512.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
gi|315416720|gb|ADU13361.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
Length = 378
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 55/151 (36%), Positives = 74/151 (49%), Gaps = 22/151 (14%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG--------ILPAINHSPWNGLTLADFV 97
+ P+ +RL SLDV RGLTV MILV+ G + P + H+ W GL +AD V
Sbjct: 1 MSEPKTATQRLPSLDVLRGLTVIGMILVNATAGMYYGLQAKVFPLLLHAHWEGLKIADVV 60
Query: 98 MPFFLFIVGVSLALTYKNFP----CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKY 153
P FL +VG+S+ + V RK R L LFL+G L +NL +
Sbjct: 61 FPAFLTMVGLSIPMALNRAKMTTGLDVAQARKIGGRVLRLFLIGWLL--------SNLGW 112
Query: 154 --GVDIAQIRWMGVLQRIAIAYLVAALCEIW 182
D R+ GVLQRI + Y AA+ +W
Sbjct: 113 LAHFDGEPWRFWGVLQRIGLVYGAAAVLFLW 143
>gi|418676277|ref|ZP_13237561.1| putative membrane protein [Leptospira kirschneri serovar
Grippotyphosa str. RM52]
gi|418684894|ref|ZP_13246077.1| putative membrane protein [Leptospira kirschneri serovar
Grippotyphosa str. Moskva]
gi|418743212|ref|ZP_13299580.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
str. 200702274]
gi|400323423|gb|EJO71273.1| putative membrane protein [Leptospira kirschneri serovar
Grippotyphosa str. RM52]
gi|410740642|gb|EKQ85357.1| putative membrane protein [Leptospira kirschneri serovar
Grippotyphosa str. Moskva]
gi|410749503|gb|EKR06488.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
str. 200702274]
Length = 369
Score = 72.8 bits (177), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 89/327 (27%), Positives = 139/327 (42%), Gaps = 78/327 (23%)
Query: 59 LDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY-- 113
+D+FRG+TVA MILV++ G I + H+ WNG T D V PFFLF VG+S+ L+
Sbjct: 1 MDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVYS 60
Query: 114 KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAY 173
KN K +R++ L L+G+FL F G +++R GVLQRI Y
Sbjct: 61 KNKIHKSKIWFGICIRSITLILIGLFLN---FFG------EWSFSELRIPGVLQRIGFVY 111
Query: 174 LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVET 233
+ A + L R W+ L++ T L+ Q P E+
Sbjct: 112 WIVASLHLILPK-----------RMILISWIPILLVHTWVLI----------QIPAPGES 150
Query: 234 SSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
P + IDR + G L++ +S+T
Sbjct: 151 IV--------------YLEPGKDIGAWIDRNVFGENRLWK---FSKT------------- 180
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH-RDRMLNWIILSSCLIG 352
+DPEGL S + + T L+G+ G ++ + + ++L+ L+
Sbjct: 181 -----------WDPEGLFSGISSIATSLLGVFCGSILSSKTNEIKKQILSIFGFGILLVL 229
Query: 353 LGLSLDFVGMHLNKALYSLSYTCLTAG 379
+GL D + +NK+L++ SY TAG
Sbjct: 230 VGLLWD-QNLPMNKSLWTGSYVIYTAG 255
>gi|428299602|ref|YP_007137908.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
gi|428236146|gb|AFZ01936.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
Length = 104
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 47/73 (64%), Gaps = 3/73 (4%)
Query: 47 TRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLF 103
T P RL+SLDVFRG+ +A MILV++ G I P + H+ W+G T D V PFFLF
Sbjct: 5 TLPTQNSNRLVSLDVFRGIAIASMILVNNPGSWDSIYPPLEHAEWHGCTPTDLVFPFFLF 64
Query: 104 IVGVSLALTYKNF 116
IVG+++ ++ +
Sbjct: 65 IVGMAMPFSFAKY 77
>gi|167623085|ref|YP_001673379.1| hypothetical protein Shal_1151 [Shewanella halifaxensis HAW-EB4]
gi|167353107|gb|ABZ75720.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4]
Length = 398
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 103/393 (26%), Positives = 155/393 (39%), Gaps = 97/393 (24%)
Query: 24 SAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV-----------ALMIL 72
+ P + +E L + + +P RL SLD RG + AL++L
Sbjct: 6 TKPQSSLIESHLKLQTQSIAKSEAKP-----RLKSLDALRGFDMFWILGGEAIFAALLVL 60
Query: 73 VDDVG--GILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFP-----CKVVATRK 125
G ++HS WNG T D + P F+F+ GV+L L+ K ++ +
Sbjct: 61 TGWAGFKWFDGQMHHSVWNGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPLPPRLPLYKH 120
Query: 126 AILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKG 185
AI R L L G+ G+ G + + IR+ VL RIA A+ AL +W
Sbjct: 121 AIKRLFLLLLFGVIYNHGWGTGAS-----FAVGDIRYASVLGRIAFAWFFCALL-VW--- 171
Query: 186 DGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTC 245
H S K + V A+V+ Y LL FP+ T SS
Sbjct: 172 --HTSLKTQI--------VTAIVVLVGYALL----------QAFPMMTLDSS-------- 203
Query: 246 GVRGSTGPACNAVGMIDRKIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP 304
G TG AV D IL GI + A
Sbjct: 204 GAFSQTGSINAAV---DSLILPGITY------------------------------QNAA 230
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIV--HFKDHRDRMLNWIILSSCLIGLGLSLDFVGM 362
DPEG+LS++ A V L G+ GH IV H K ++ L+ LG + + +
Sbjct: 231 VDPEGILSTIPAVVNGLFGVFVGHFIVKPHVKGEWFKLAVLAASGVALVALGWLISPL-I 289
Query: 363 HLNKALYSLSYTCLTAGASGVLLAGIYFMVRYI 395
+NK L++ S+T +++G S + LA Y ++ +
Sbjct: 290 PVNKTLWTSSFTLVSSGWSILFLALFYAVIDVV 322
>gi|413922900|gb|AFW62832.1| hypothetical protein ZEAMMB73_935848 [Zea mays]
Length = 1241
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 30/48 (62%), Positives = 36/48 (75%)
Query: 240 IFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
+ V CGVRG T CNAVGMIDRKILGIQHLY +P+Y+R+K N+
Sbjct: 966 VLQVKCGVRGDTSSGCNAVGMIDRKILGIQHLYGRPVYARSKNYRKNT 1013
Score = 45.1 bits (105), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 17/27 (62%), Positives = 23/27 (85%)
Query: 141 QGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+GGFFHG+ +L +GVD+ +IR MGVLQ
Sbjct: 942 KGGFFHGVRSLSFGVDLQEIRLMGVLQ 968
>gi|380512476|ref|ZP_09855883.1| hypothetical protein XsacN4_14717 [Xanthomonas sacchari NCPPB 4393]
Length = 384
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 99/331 (29%), Positives = 135/331 (40%), Gaps = 73/331 (22%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R +SLDVFRGLT+ LMILV+ G A + H+PW G T AD V P FLF VG ++
Sbjct: 15 RERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAM 74
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFF--HGINNLKYGVDIAQIRWMGVLQ 167
+ R+ R+ +FLLG + F G + I Q R GVLQ
Sbjct: 75 SFALDRGQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQGADGHWSLTAIDQTRVPGVLQ 134
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIA+ Y +AAL WL G + ++L Y G L L+G
Sbjct: 135 RIALCYALAALLCRWLSPRGLLGMCVALLLGYWGA------------LYLWG-------- 174
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
+ E S NA +D +L LYRK
Sbjct: 175 QPGAELSKLG------------------NAGTRLDLWLLDPAQLYRK------------- 203
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILS 347
D G FDPEGLL ++ ATV + G + L V + R + W+ L+
Sbjct: 204 -DGG-------------FDPEGLLGTLPATVNVIAG-YLTGLYVRRVGKQARTVRWLWLA 248
Query: 348 -SCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
L+ L L+ L K L++ S+ LT
Sbjct: 249 GGALVLLALAWQ-PWFPLAKKLWTGSFVLLT 278
>gi|397171248|ref|ZP_10494657.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
gi|396087147|gb|EJI84748.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
Length = 394
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 89/352 (25%), Positives = 152/352 (43%), Gaps = 65/352 (18%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMP 99
+ +Q Q R+++LD RGL + MILV++ G + P + H+ W+G T D + P
Sbjct: 5 HCQQILTQQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFP 64
Query: 100 FFLFIVGVSL--ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGF--FHGINNLKYGV 155
FL +VG+++ +L + K R+ +RAL L+LLG+FL + F N
Sbjct: 65 AFLVMVGMAIPYSLAGRQLLPKAELIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQ 124
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLL 215
+ +RW GVLQRI I Y + ++ G + W+ L L LY L
Sbjct: 125 KLLTVRWSGVLQRIGIVYFCTLVIVLYSGTRGRIL------------WLSGLCL--LYFL 170
Query: 216 LLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
L+ P+ N G N +D ++LG H+Y +
Sbjct: 171 LM-----------------QFVPYRDNYGHTFVGLWEHGNNLAAWLDHQLLGPNHVYFR- 212
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
S FDPEG+LS++ A +CL G+ L+ +
Sbjct: 213 -----------------------SATPFAFDPEGILSTLPAIASCLSGVLMAQLLQSQAE 249
Query: 336 HRDRMLNWIILSSCLIGLGLS-LDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
++ ++L S L + ++ L + +NK L++ ++ L++G + ++LA
Sbjct: 250 LAFKLR--VLLLSGLAAVWIAELAHPALPINKMLWTPTFVLLSSGFTALMLA 299
>gi|440804580|gb|ELR25457.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acanthamoeba
castellanii str. Neff]
Length = 446
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 89/363 (24%), Positives = 151/363 (41%), Gaps = 91/363 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGI-------------LPAINHSPWNGLTL---AD 95
++ RL SLDVFRG+T+ MILVD+ G PA P N + AD
Sbjct: 56 KKPRLQSLDVFRGVTMLGMILVDNQGNFDHVVRPLDESIVRHPAPPRPPTNARSWVDPAD 115
Query: 96 FVMPFFLFIVGVSLALTYKNFP---CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLK 152
+ F V +++ + P K+ A + + R LF++G+ L + +
Sbjct: 116 HCAQWDGFAVALAMNGFWDKIPDRRGKIKAWARVLQRIGTLFVVGLLLNAFGSNPWDKWP 175
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
+ R MG RIA+ Y + + + ++ ++ VV L T +
Sbjct: 176 HW----HFRIMGC--RIALCYGTVTVLFL---------ATSTIVQR-----VVMLCFTAI 215
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
Y+ L+YGL VP CG RG+ P CNA G IDR I G
Sbjct: 216 YVGLMYGLDVP--------------------KCG-RGNLTPGCNAGGFIDRSIFG----- 249
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
W P DPEGLLS++ AT+TC +GL FG ++
Sbjct: 250 --------------------------DWMIRPNDPEGLLSTLTATLTCYLGLEFGRILHK 283
Query: 333 FKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ ++ ++ ++ + + ++ M +NK ++S+ + + G G+++ Y++V
Sbjct: 284 YRANQLELVCRWVMLALGLIGLALFLWLWMPINKKMWSVPFALMMGGIGGLVIFICYYLV 343
Query: 393 RYI 395
+
Sbjct: 344 DMV 346
>gi|399088486|ref|ZP_10753550.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
gi|398030770|gb|EJL24173.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
Length = 398
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 96/335 (28%), Positives = 145/335 (43%), Gaps = 60/335 (17%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R+++LDV RGL VA MILV G A + H+ W G TLAD V P FLF VGV++ L
Sbjct: 10 RIVALDVLRGLAVAGMILVTSPGAWAHAYAPLKHAAWQGWTLADLVFPTFLFCVGVAIGL 69
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
+ A+ AL + + + N D+A +R GVLQRI +
Sbjct: 70 SVPRLRIGEGAS-----AALWIKVARRTALLILLGLVLNALPRFDLAHLRIPGVLQRIGL 124
Query: 172 AY-LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
Y L +A+C + + + +L++ + + L Y LL VP F
Sbjct: 125 CYALASAICILPARAEADGQLRLNVGGVV----LAVVGLLVGYWALLTFTPVPG----FG 176
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
V+ S +G+ PA IDR + I HL+ + T+ +
Sbjct: 177 VDRWDS-----------QGAL-PA-----FIDRAVFTIPHLWP---WGTTEGVGVT---- 212
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR----MLNWIIL 346
+DPEGLLS+ ATV L+G + D R + + L
Sbjct: 213 --------------YDPEGLLSTFPATVNVLLGAVAAAFLARTGDGRQGRGRVLAALLAL 258
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGAS 381
+ LI GL+LD + + +NK L++ S+ ++GAS
Sbjct: 259 GAALIVAGLALDPI-VPVNKRLWTPSFALFSSGAS 292
>gi|445497063|ref|ZP_21463918.1| putative transmembrane protein [Janthinobacterium sp. HH01]
gi|444787058|gb|ELX08606.1| putative transmembrane protein [Janthinobacterium sp. HH01]
Length = 353
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 50/138 (36%), Positives = 71/138 (51%), Gaps = 14/138 (10%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R+ S+D RGLTVA M+LV+D G + P + H+ W+G T DF+ P F+ IVG+S+ L
Sbjct: 2 RINSIDAVRGLTVAAMLLVNDAGDWSHVYPWLEHAEWHGCTPPDFIFPIFMLIVGISINL 61
Query: 112 TYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
AT R +LRA+ + LLG+ L +N + R GVLQ
Sbjct: 62 ALSPRLDAGAATAPLARSVLLRAVRIVLLGLALHVVAMLLLNGRGF-------RLFGVLQ 114
Query: 168 RIAIAYLVAALCEIWLKG 185
R I + A L I ++G
Sbjct: 115 RTGICFAAAGLLAIHVRG 132
>gi|117921549|ref|YP_870741.1| hypothetical protein Shewana3_3111 [Shewanella sp. ANA-3]
gi|117613881|gb|ABK49335.1| conserved hypothetical protein [Shewanella sp. ANA-3]
Length = 395
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 97/373 (26%), Positives = 150/373 (40%), Gaps = 92/373 (24%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTV-----------ALMILVDDVGGIL--PAINHSP 87
N+ + + + RL+SLD RG + AL++L G ++HS
Sbjct: 15 NAQEAAAKKRQSKPRLMSLDALRGFDMFWILGGEALFGALLMLTGWAGWQWGDTQMHHSE 74
Query: 88 WNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFFH 146
WNG D + P F+F+ GV+L L+ K + R + R + L + L + H
Sbjct: 75 WNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMQERMPVYRHGIKRLFLLLLLGILYNH 134
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVA 206
G D ++R+ VL RIA A+ AAL +W H S +
Sbjct: 135 GWGT-GAPADPEKVRYASVLGRIAFAWFFAALL-VW-----HTSLRTQ------------ 175
Query: 207 LVLTTLYLLLLYG---LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
VL L +L+ YG L++P FP G G P + +D
Sbjct: 176 -VLVALGILVAYGAVQLWLP-----FP--------------GGQAGVLSPTESINAYVDS 215
Query: 264 KIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLI 322
+L G+ + R P DPEG+LS++ A V L
Sbjct: 216 LLLPGVSYQGRTP------------------------------DPEGVLSTLPAVVNALA 245
Query: 323 GLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFV---GMHLNKALYSLSYTCLTAG 379
G+ GH IV K H + L S G+ L+L ++ + +NK L++ S+ +T+G
Sbjct: 246 GVFVGHFIV--KSHPKGEWAKVGLLSVAGGVCLALGWLLGGVIPVNKELWTSSFVLVTSG 303
Query: 380 ASGVLLAGIYFMV 392
S +LLA Y +V
Sbjct: 304 WSMLLLALFYALV 316
>gi|390946357|ref|YP_006410117.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
gi|390422926|gb|AFL77432.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
Length = 369
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 148/366 (40%), Gaps = 99/366 (27%)
Query: 50 QHQQRRLISLDVFRGLTVALMI----LVDDVGGILPA---------INHSPWNGLTLADF 96
+ RRL+SLD RG+ + ++ LV + + P + H+ WNGLT+ D
Sbjct: 1 MNPNRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQDT 60
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRKAIL-----RALNLFLLGIFLQGGFFHGINNL 151
+ P FLFI GV+ + + RK IL R L L LLG+ G F
Sbjct: 61 IFPLFLFIAGVAFPFSLAKQRARGFG-RKRILDRIFRRGLILALLGMVYNGLF------- 112
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
++ + +R VL RI +A++ AAL L ++ R VA ++
Sbjct: 113 --ELNFSSLRIASVLGRIGLAWMFAAL--------------LCVYCSVRTRIAVAGIILI 156
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
Y LLL + PD PV S V G C A G IDR+ L L
Sbjct: 157 GYSLLLGLVVAPD----APVGADPLS---------VEG-----CLA-GWIDRQYLPGHIL 197
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
Y FDPEG+LS++ A V+ L G+ G ++
Sbjct: 198 Y------------------------------GAFDPEGILSTLPAVVSALFGMFTGEFLL 227
Query: 332 HFKDHRDRMLNWIILSSCLIGL-----GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
R +W + L GL + + M +NK L+S S+TC+ +G S + A
Sbjct: 228 --DGRRGLSGSWKAFYMAVAALAITTAGLCWNLI-MPVNKNLWSSSFTCVVSGYSLGMTA 284
Query: 387 GIYFMV 392
Y+++
Sbjct: 285 LFYYLI 290
>gi|395804141|ref|ZP_10483382.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
gi|395433785|gb|EJF99737.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
Length = 423
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 40/95 (42%), Positives = 59/95 (62%), Gaps = 7/95 (7%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+ RL SLDVFRG T+ LM +V++ G I P + H+ W+G T D V PFF+FI+G +
Sbjct: 2 TKERLTSLDVFRGFTIFLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGTA 61
Query: 109 L--ALTYKNFPCKVVATRKAILRALNLFLLGIFLQ 141
+ A+ K+F V K ++R+L +F LG+FL
Sbjct: 62 IPFAMPVKHFDGSVF--NKILVRSLRIFCLGLFLS 94
Score = 41.2 bits (95), Expect = 0.99, Method: Compositional matrix adjust.
Identities = 30/101 (29%), Positives = 55/101 (54%), Gaps = 2/101 (1%)
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHL 364
+DPEG+LS++ A T ++G++ G L+ D + + I + L+ GL + + +
Sbjct: 240 WDPEGILSTLPAIGTGILGMYIGQLLNLSVDKMEIVKKTAIAGTALVIGGLIWN-IFFPI 298
Query: 365 NKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
NK+L++ SY TAG + + L +Y+++ I H K F
Sbjct: 299 NKSLWTSSYVLYTAGIATLCLTLLYYIID-IKGHKKWTKLF 338
>gi|217974365|ref|YP_002359116.1| hypothetical protein Sbal223_3208 [Shewanella baltica OS223]
gi|217499500|gb|ACK47693.1| conserved hypothetical protein [Shewanella baltica OS223]
Length = 384
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 146/358 (40%), Gaps = 90/358 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDV-GGIL------------PAINHSPWNGLTLADFVMPFF 101
RL+SLD RG + ++ + + GG+L ++HS W+G D + P F
Sbjct: 18 RLVSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYDLIFPLF 77
Query: 102 LFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
+F+ GV+L L+ K ++ R + R + L + L + HG D +I
Sbjct: 78 IFLSGVALGLSPKRLDKLPISERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG- 219
R+ VL RIA A+ AAL +W H S + ++ L +LL YG
Sbjct: 137 RYASVLGRIAFAWFFAALL-VW-----HTSLRTQ-------------IIVALGILLGYGA 177
Query: 220 --LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQHLYRKPI 276
L++P FP G G P + +D +L G+ + R P
Sbjct: 178 IQLWLP-----FP--------------GGQAGVLSPTESINAYVDSILLPGVSYQGRTP- 217
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV--HFK 334
DPEGLLS++ A V L G+ GH IV H K
Sbjct: 218 -----------------------------DPEGLLSTIPAIVNALTGVFVGHFIVKSHPK 248
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ L+ G LD V + +NK L++ S+ +T+G S +LLA Y +V
Sbjct: 249 GEWAKVGLLAAAGGILLAFGWLLDLV-IPVNKELWTSSFVLVTSGWSMILLAVFYALV 305
>gi|423280893|ref|ZP_17259805.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
610]
gi|404583534|gb|EKA88212.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
610]
Length = 375
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 89/357 (24%), Positives = 141/357 (39%), Gaps = 85/357 (23%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ ++ F P K RK I R + LF+ G+ +QG G+N KY
Sbjct: 71 LFMTGASMPFSFSKFKDDPDKGTIYRKIIRRFILLFIFGMIVQGNLL-GLNP-KY----- 123
Query: 159 QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLY 218
+ LQ IA YL+AA+ + H S FR+ L++T L LL+ +
Sbjct: 124 LYLYSNTLQAIATGYLIAAIILL------HCS-----FRR-------QLIVTALLLLIYW 165
Query: 219 GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYS 278
+P F G P N ID+ +LG +R +Y
Sbjct: 166 ------------------APMTF------LGDFTPEGNFAEKIDKLVLG---RFRDGVYW 198
Query: 279 RTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
SP Y + + SS+ T ++G G ++ KD+R
Sbjct: 199 NEDGTWNFSPHY---------------NYTWIWSSLTFGATVMLGTFAGKIMKAGKDNRR 243
Query: 339 RML-NWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+++ +++ L+ L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 244 KVVRTLLVIGIALVAFSLVWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYWIDY 299
>gi|334366956|ref|ZP_08515871.1| putative membrane protein [Alistipes sp. HGB5]
gi|313156833|gb|EFR56273.1| putative membrane protein [Alistipes sp. HGB5]
Length = 370
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 99/366 (27%), Positives = 148/366 (40%), Gaps = 99/366 (27%)
Query: 50 QHQQRRLISLDVFRGLTVALMI----LVDDVGGILPA---------INHSPWNGLTLADF 96
+ RRL+SLD RG+ + ++ LV + + P + H+ WNGLT+ D
Sbjct: 2 MNPNRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQDT 61
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRKAIL-----RALNLFLLGIFLQGGFFHGINNL 151
+ P FLFI GV+ + + RK IL R L L LLG+ G F
Sbjct: 62 IFPLFLFIAGVAFPFSLAKQRARGFG-RKRILDRIFRRGLILALLGMVYNGLF------- 113
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
++ + +R VL RI +A++ AAL L ++ R VA ++
Sbjct: 114 --ELNFSSLRIASVLGRIGLAWMFAAL--------------LCVYCSVRTRIAVAGIILI 157
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
Y LLL + PD PV S V G C A G IDR+ L L
Sbjct: 158 GYSLLLGLVVAPD----APVGADPLS---------VEG-----CLA-GWIDRQYLPGHIL 198
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
Y FDPEG+LS++ A V+ L G+ G ++
Sbjct: 199 Y------------------------------GAFDPEGILSTLPAVVSALFGMFTGEFLL 228
Query: 332 HFKDHRDRMLNWIILSSCLIGL-----GLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
R +W + L GL + + M +NK L+S S+TC+ +G S + A
Sbjct: 229 --DGRRGLSGSWKAFYMAVAALAITTAGLCWNLI-MPVNKNLWSSSFTCVVSGYSLGMTA 285
Query: 387 GIYFMV 392
Y+++
Sbjct: 286 LFYYLI 291
>gi|227875179|ref|ZP_03993321.1| possible heparan-alpha-glucosaminide N-acetyltransferase
[Mobiluncus mulieris ATCC 35243]
gi|227844084|gb|EEJ54251.1| possible heparan-alpha-glucosaminide N-acetyltransferase
[Mobiluncus mulieris ATCC 35243]
Length = 399
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 93/381 (24%), Positives = 152/381 (39%), Gaps = 93/381 (24%)
Query: 20 SKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI 79
S +T + A + +P +N +T+P + R+ SLDV RG L++ V +
Sbjct: 10 SAATQSEAATTRQTEP--NTGETNQAETKPA-KPGRITSLDVGRGWF--LIMSVTSAAWL 64
Query: 80 LPAIN---HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLL 136
LP + H+PW G+ D + P F+ + G+ LA Y N V R+ ++
Sbjct: 65 LPRPDWLIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYHNRVSFKVTLRR---------IV 115
Query: 137 GIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLF 196
+ + G ++G+++ ++ D A R+ G LQ A+ + A C ++ +
Sbjct: 116 VLVVVGLLYNGVSSGQW--DPATFRFTGPLQVYAVIVTIIATCHLFAR------------ 161
Query: 197 RKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN 256
+W+ +T +L GL W W GV P+CN
Sbjct: 162 -----NWMAWAGITAGVAVLQTGLLT--W-------------WAGTCPSGV---LSPSCN 198
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
GM DR +LG H+Y DPEGL++ A
Sbjct: 199 PSGMWDRALLG-AHMYYGGFLGH--------------------------DPEGLVAITGA 231
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLN------KALYS 370
+T G GHL + R L W L+ L ++ G+ LN K L++
Sbjct: 232 LLTAAAGTTAGHLALS-----SRRLGWKTGPVKLLALAAAMSVFGLILNIWVPAFKRLWT 286
Query: 371 LSYTCLTAGASGVLLAGIYFM 391
S++ L AGA GVL+ + F+
Sbjct: 287 PSFS-LIAGAVGVLIFAVAFL 306
>gi|408821750|ref|ZP_11206640.1| hypothetical protein PgenN_01470 [Pseudomonas geniculata N1]
Length = 355
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 61/196 (31%), Positives = 91/196 (46%), Gaps = 26/196 (13%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RRL S+D RG+TVA M+LV++ G + + HS W+G T D V PFFLF+VGVS+A
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66
Query: 111 LTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ V A R + RAL + L G L + + +D R GVL
Sbjct: 67 FSVAPRALDVSARPALARGVLERALRILLAGALLHLLIW-------WALDTHHFRIWGVL 119
Query: 167 QRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQ 226
QRIA+ + + ++ + V + ++L Y LLL G P
Sbjct: 120 QRIAVCAALVGVLAVYARPRMQVGALIALLVGYT------------VLLLGIGDLAPWTN 167
Query: 227 YEFPVETSSSSPWIFN 242
++T+ +PWI+
Sbjct: 168 PASRLDTALFAPWIYQ 183
>gi|306818439|ref|ZP_07452162.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
gi|304648612|gb|EFM45914.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
Length = 399
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 152/381 (39%), Gaps = 93/381 (24%)
Query: 20 SKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI 79
S +T + A + +P +N +T+P + R+ SLDV RG L++ V +
Sbjct: 10 SAATQSEAATTRQTEP--NTGETNQTETKPA-KPGRITSLDVGRGWF--LIMSVTSAAWL 64
Query: 80 LPAIN---HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLL 136
LP + H+PW G+ D + P F+ + G+ LA Y N V R+ ++
Sbjct: 65 LPRPDWLIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYHNRVSFKVTLRR---------IV 115
Query: 137 GIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLF 196
+ + G ++G+++ ++ D A R+ G LQ A+ + A C ++ +
Sbjct: 116 VLVVVGLLYNGVSSGQW--DPATFRFTGPLQVYAVIVAIIATCHLFAR------------ 161
Query: 197 RKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN 256
+W+V +T +L GL + W P G P+CN
Sbjct: 162 -----NWMVWAGITAGVAVLQTGL-LTWWAGTCP-----------------SGVLSPSCN 198
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
GM DR +LG H+Y DPEGL++ A
Sbjct: 199 PSGMWDRALLG-AHMYYGGFLGH--------------------------DPEGLVAITGA 231
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLN------KALYS 370
+T G GHL + R L W L+ L ++ G+ L K L++
Sbjct: 232 LLTAAAGTTAGHLALS-----SRRLGWKTGPVKLLALAAAMSVFGLILTIWVPAFKRLWT 286
Query: 371 LSYTCLTAGASGVLLAGIYFM 391
S++ L AGA GVL+ + F+
Sbjct: 287 PSFS-LIAGAVGVLIFAVAFL 306
>gi|313149262|ref|ZP_07811455.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|313138029|gb|EFR55389.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
Length = 375
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 90/357 (25%), Positives = 142/357 (39%), Gaps = 85/357 (23%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ ++ F P K RK I R + LF+ G+ +QG G+N KY
Sbjct: 71 LFMTGASMPFSFSKFKDDPDKGPIYRKIIRRFILLFIFGMIVQGNLL-GLNP-KY----- 123
Query: 159 QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLY 218
+ LQ IA YL+AA+ + H S FR+ L++T L LL+ +
Sbjct: 124 LYLYSNTLQAIATGYLIAAIILL------HCS-----FRR-------QLIVTALLLLIYW 165
Query: 219 GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYS 278
+P F G P N ID+ +LG H +R +Y
Sbjct: 166 ------------------APMTF------LGDFTPEGNFAEKIDKLVLG--H-FRDGVYW 198
Query: 279 RTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
SP Y + + SS+ T ++G G ++ KD+R
Sbjct: 199 NEDGTWNFSPHY---------------NYTWIWSSLTFGATVMLGTFAGKIMKAGKDNRR 243
Query: 339 RML-NWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+++ +++ L+ L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 244 KVVRTLLVIGIALVAFSLVWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYWIDY 299
>gi|152999681|ref|YP_001365362.1| hypothetical protein Shew185_1147 [Shewanella baltica OS185]
gi|151364299|gb|ABS07299.1| conserved hypothetical protein [Shewanella baltica OS185]
Length = 384
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 146/358 (40%), Gaps = 90/358 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDV-GGIL------------PAINHSPWNGLTLADFVMPFF 101
RL+SLD RG + ++ + + GG+L ++HS W+G D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYDLIFPLF 77
Query: 102 LFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
+F+ GV+L L+ K ++ R + R + L + L + HG D +I
Sbjct: 78 IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG- 219
R+ VL RIA A+ AAL +W H S + ++ L +LL YG
Sbjct: 137 RYASVLGRIAFAWFFAALL-VW-----HTSLRTQ-------------IIVALGILLGYGA 177
Query: 220 --LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQHLYRKPI 276
L++P FP G G P + +D +L G+ + R P
Sbjct: 178 MQLWLP-----FP--------------GGQAGVLSPTESINAYVDSILLPGVSYQGRTP- 217
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV--HFK 334
DPEGLLS++ A V L G+ GH IV H K
Sbjct: 218 -----------------------------DPEGLLSTIPAIVNALTGVFVGHFIVKSHPK 248
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ L+ G LD V + +NK L++ S+ +T+G S +LLA Y +V
Sbjct: 249 GEWAKVGLLAAAGGILLAFGWLLDLV-IPVNKELWTSSFVLVTSGWSMILLAVFYALV 305
>gi|402878146|ref|XP_003902762.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Papio
anubis]
Length = 708
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 88/321 (27%), Positives = 137/321 (42%), Gaps = 64/321 (19%)
Query: 90 GLTLADFVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILRALNLFLLGIFLQGGFF 145
GLT+AD V P+F+FI+G S+ L+ + K K R+ L +GI
Sbjct: 347 GLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGII------ 400
Query: 146 HGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGD--GHVSSKLSLFR--- 197
I N Y + ++R GVLQR+ + Y V A+ E+ H +S+ S
Sbjct: 401 --IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRD 458
Query: 198 --KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG--P 253
W++ L L L+L L + L VP T P G G G P
Sbjct: 459 ITSSWPQWLLILALEGLWLGLTFLLPVPG------CPTGYLGP-------GGIGDFGKYP 505
Query: 254 ACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLL 311
C A G IDR +LG HLY+ P + + +DPEG+L
Sbjct: 506 NCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTEV------------------AYDPEGIL 547
Query: 312 SSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL------GLSLDFVGMHLN 365
++ + V +G+ G +++++K +L C++GL +S + + +N
Sbjct: 548 GTINSIVMAFLGVQAGKILLYYKAQTKDILIRFTAWCCILGLISVVLTKVSENEGFIPVN 607
Query: 366 KALYSLSY-TCLTAGASGVLL 385
K L+SLSY T L++ A +LL
Sbjct: 608 KNLWSLSYVTTLSSFAFFILL 628
>gi|254446502|ref|ZP_05059978.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
DG1235]
gi|198260810|gb|EDY85118.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
DG1235]
Length = 394
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 99/354 (27%), Positives = 146/354 (41%), Gaps = 89/354 (25%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
++ RL++LD RG T+ MI+V+ G + + H+ W+G+T D V PFFLF VGVS
Sbjct: 2 KRERLLALDALRGFTIIGMIIVNSPGSWSHVYSPLLHASWHGVTPTDLVFPFFLFFVGVS 61
Query: 109 LALTYKNFPC----KVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMG 164
+AL Y +V RK R +F LG+FL NL ++R G
Sbjct: 62 IALAYSGKRGTKRERVGKYRKIFWRVAKIFALGLFL---------NLWPYFYFEEMRVAG 112
Query: 165 VLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGH-WVVALVLTTLYLLLLY----- 218
VLQRIA LV +C I L L +++ WV A +L + LL++
Sbjct: 113 VLQRIA---LVFGVCAI-----------LFLNTRWKQQLWVGASILLGYWALLVWVPVPL 158
Query: 219 ----------GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGI 268
G+ + E V + + G+ P N +DR +L
Sbjct: 159 DEVNAGALETGIVERSYGTEVAVSVEARG------ETSIAGNFEPGVNIAAWVDRVLL-- 210
Query: 269 QHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGH 328
++ RT +DPEGLLS+V A T + G+ G
Sbjct: 211 ----PGGMWERT------------------------WDPEGLLSTVPAVATGIFGMLVGA 242
Query: 329 LIVHFKDHRDRMLNWII---LSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
LI+ D R+ +W+ + + LIG S F NK L+S S+ G
Sbjct: 243 LILGVGDPYRRV-SWVFFVGVVALLIGSAWSWVF---PYNKNLWSSSFVLYAGG 292
>gi|29348589|ref|NP_812092.1| hypothetical protein BT_3180 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340494|gb|AAO78286.1| putative transmembrane protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 376
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/366 (22%), Positives = 146/366 (39%), Gaps = 90/366 (24%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLAD 95
+ RL SLD+ RG + L++ V IL +H W G D
Sbjct: 5 SEKNTTRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGFRFWD 64
Query: 96 FVMPFFLFIVGVSLALTYKNF----PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNL 151
VMP FLF+ G S+ + + + R+ + R LF+ G+ +QG
Sbjct: 65 LVMPLFLFMTGASMPFSLSKYVGMSGSYWLVYRRILRRVFLLFIFGMIVQGNLL------ 118
Query: 152 KYGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
G+D + I + LQ IA+ YL+AA+ ++ H S + W + +
Sbjct: 119 --GLDSSHIYLYSNTLQSIAVGYLIAAVIQL------HFSFR----------WQIGIT-- 158
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQH 270
LLLL+ ++P G PA N +DR +LG
Sbjct: 159 ---LLLLFIYWIP---------------------MTFLGDFTPAGNFAEQVDRCVLG--- 191
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP-FDPEGLLSSVMATVTCLIGLHFGHL 329
R + + D +W +P ++ + SS+ VT ++G G +
Sbjct: 192 --------RFRDGVFWNED--------GTWSFSPYYNYTWIWSSLTFGVTVMLGAFAGKI 235
Query: 330 IVHFKDHRDRMLNWI-ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGI 388
+ K +R +++ + ++ L+GL + L + M + K L++ S T L+ G +L+A
Sbjct: 236 MKEGKANRKKVVQTLSVIGVLLVGLAM-LWSLQMPIIKRLWTGSMTLLSGGYCFLLMALF 294
Query: 389 YFMVRY 394
Y+ + Y
Sbjct: 295 YYWIDY 300
>gi|336312505|ref|ZP_08567454.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
gi|335864011|gb|EGM69129.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
Length = 384
Score = 69.7 bits (169), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 101/379 (26%), Positives = 152/379 (40%), Gaps = 103/379 (27%)
Query: 38 PPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDV-GGIL------------PAIN 84
P + K T+P RL+SLD RG + ++ + + GG+L ++
Sbjct: 6 PNETATIKVTKP-----RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMH 60
Query: 85 HSPWNGLTLADFVMPFFLFIVGVSLALTYKN-----FPCKVVATRKAILRALNLFLLGIF 139
HS W+G D + P F+F+ GV+L L+ K ++ R I R L L LLGI
Sbjct: 61 HSQWHGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPLSERLPVYRHGIKRLLLLLLLGIL 120
Query: 140 LQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKY 199
G+ G D ++R+ VL RIA A+ AAL +W H S +
Sbjct: 121 YNHGWGTGAP-----ADPEKVRYASVLGRIAFAWFFAALL-VW-----HTSLRTQ----- 164
Query: 200 RGHWVVALVLTTLYLLLLYG---LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN 256
++ L +LL YG L++P + G G P +
Sbjct: 165 --------IIVALGILLGYGAIQLWLP-------------------FSGGQAGVLSPTES 197
Query: 257 AVGMIDRKIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVM 315
ID +L G+ + R DPEGLLS++
Sbjct: 198 INAYIDSILLPGVSYQGRT------------------------------LDPEGLLSTIP 227
Query: 316 ATVTCLIGLHFGHLIV--HFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSY 373
A V L G+ GH IV H + ++ + LG LD V + +NK L++ S+
Sbjct: 228 AVVNALAGVFVGHFIVKSHPQGEWAKVGLLAAAGGVCLALGWLLDGV-IPVNKELWTSSF 286
Query: 374 TCLTAGASGVLLAGIYFMV 392
+T+G S VLLA Y +V
Sbjct: 287 VLVTSGWSMVLLAVFYALV 305
>gi|58583544|ref|YP_202560.1| hypothetical protein XOO3921 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58428138|gb|AAW77175.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 362
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 84/280 (30%), Positives = 115/280 (41%), Gaps = 80/280 (28%)
Query: 61 VFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSLALT----- 112
+FRGLT+ LMILV+ G A + H+ W G TLAD V P FLF VG +++
Sbjct: 1 MFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSAMSFALATNM 60
Query: 113 -YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG-VDIAQIRWMGVLQRIA 170
+ F +V +++A L AL L+ F FFH + + + Q+R GVLQRI
Sbjct: 61 PHLQFLGRV--SKRAALIALCGVLMYWF---PFFHLQPDGGWAFTTVDQVRLTGVLQRIG 115
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
+ YL AAL +L G + VAL+L LL ++G +
Sbjct: 116 LCYLAAALLVRYLPPRGIAPA------------CVALLLGYWALLHVFG--------QPG 155
Query: 231 VETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
E S + NA +D + G +HLYRK
Sbjct: 156 AELSKTG------------------NAGTCLDLWLYGREHLYRK---------------- 181
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
FDPEGLL S+ ATV L G G +
Sbjct: 182 -----------DGGFDPEGLLGSLSATVNVLAGYLCGRFL 210
>gi|307700906|ref|ZP_07637931.1| putative membrane protein [Mobiluncus mulieris FB024-16]
gi|307613901|gb|EFN93145.1| putative membrane protein [Mobiluncus mulieris FB024-16]
Length = 442
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 92/381 (24%), Positives = 151/381 (39%), Gaps = 93/381 (24%)
Query: 20 SKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI 79
S +T + A + +P +N +T+P + R+ SLDV RG L++ V +
Sbjct: 53 SAATQSEAATTRQTEP--NTGETNQAETKPA-KPGRITSLDVGRGWF--LIMSVTSAAWL 107
Query: 80 LPAIN---HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLL 136
LP + H+PW G+ D + P F+ + G+ LA Y N V R+ ++
Sbjct: 108 LPRPDWLIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYHNRVSFKVTLRR---------IV 158
Query: 137 GIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLF 196
+ + G ++G+++ ++ D A R+ G LQ A+ + A C ++ +
Sbjct: 159 VLVVVGLLYNGVSSGQW--DPATFRFTGPLQVYAVIVAIIATCHLFAR------------ 204
Query: 197 RKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN 256
+W+ +T +L GL W W GV P+CN
Sbjct: 205 -----NWMAWAGITAGVAVLQTGLLT--W-------------WAGTCPSGV---LSPSCN 241
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
GM DR +LG H+Y DPEGL++ A
Sbjct: 242 PSGMWDRALLG-AHMYYGGFLGH--------------------------DPEGLVAITGA 274
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLN------KALYS 370
+T G GHL + R L W L+ L ++ G+ L K L++
Sbjct: 275 LLTAAAGTTAGHLAL-----SSRRLGWKTGPVKLLALAAAMSVFGLILTIWVPAFKRLWT 329
Query: 371 LSYTCLTAGASGVLLAGIYFM 391
S++ L AGA GVL+ + F+
Sbjct: 330 PSFS-LIAGAVGVLIFAVAFL 349
>gi|262381451|ref|ZP_06074589.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262296628|gb|EEY84558.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 372
Score = 69.3 bits (168), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 92/375 (24%), Positives = 138/375 (36%), Gaps = 114/375 (30%)
Query: 48 RPQHQQRRLISLDVFRG-----------LTVALMILVDD--VGGILPAINHSPWNGLTLA 94
R + Q +RL SLD RG L VAL L + I + H WNGL
Sbjct: 3 RQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAGQMEHVEWNGLAHH 62
Query: 95 DFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINN 150
D + P FLFI G+S + + K + +K + R + L LG+ + N
Sbjct: 63 DTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGL---------VYN 113
Query: 151 LKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
+ +R VL RI + ++ AAL + + AL+L
Sbjct: 114 GLLSFEFDHLRCASVLARIGLGWMFAALLFVRFGWKARAG-------------ITALILV 160
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGP---ACNAVGMIDRKIL- 266
+L + + + VPD G GP N VG IDR L
Sbjct: 161 GYWLAMAF-VPVPD-----------------------AGGAGPFTLEGNLVGYIDRLFLP 196
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G H + FDPEGL S+V A T ++G+
Sbjct: 197 GRLH-------------------------------ETVFDPEGLFSTVPAIATAMLGMFT 225
Query: 327 GHLIVHFKD---HRDRMLNWIILSSCLIGLGLSLDFVGM------HLNKALYSLSYTCLT 377
G I K+ R ++L CL+G G L VG+ +NK L++ S+ C+
Sbjct: 226 GEWIKLRKEGLTDRKKVL-------CLVGAGAVLLIVGLLWSLVFPINKKLWTSSFVCVV 278
Query: 378 AGASGVLLAGIYFMV 392
S + A ++++
Sbjct: 279 GAYSAWMFALFFYII 293
>gi|269978070|ref|ZP_06185020.1| putative membrane protein [Mobiluncus mulieris 28-1]
gi|269933579|gb|EEZ90163.1| putative membrane protein [Mobiluncus mulieris 28-1]
Length = 442
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 94/381 (24%), Positives = 153/381 (40%), Gaps = 93/381 (24%)
Query: 20 SKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI 79
S +T + A + +P +N +T+P + R+ SLDV RG L++ V +
Sbjct: 53 SAATQSEAATTRQTEP--NTGETNQTETKPA-KPGRITSLDVGRGWF--LIMSVTSAAWL 107
Query: 80 LPAIN---HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLL 136
LP + H+PW G+ D + P F+ + G+ LA Y N V R+ ++
Sbjct: 108 LPRPDWLIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYHNRVSFKVTLRR---------IV 158
Query: 137 GIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLF 196
+ + G ++G+++ ++ D A R+ G LQ A+ + A C ++ +
Sbjct: 159 VLVVVGLLYNGVSSGQW--DPATFRFTGPLQVYAVIVAIIATCHLFAR------------ 204
Query: 197 RKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN 256
+W+ +T +L GL W W GV P+CN
Sbjct: 205 -----NWMAWAGITAGVAVLQTGLLT--W-------------WAGTCPSGV---LSPSCN 241
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
GM DR +LG H+Y YG DPEGL++ A
Sbjct: 242 PSGMWDRALLG-AHMY-----------------YGGF---------LGHDPEGLVAITGA 274
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLN------KALYS 370
+T G GHL + R L W L+ L ++ G+ L K L++
Sbjct: 275 LLTAAAGTTAGHLALS-----SRRLGWKTGPVKLLALAAAMSVFGLILTIWVPAFKRLWT 329
Query: 371 LSYTCLTAGASGVLLAGIYFM 391
S++ L AGA GVL+ + F+
Sbjct: 330 PSFS-LIAGAVGVLIFAVAFL 349
>gi|254784997|ref|YP_003072425.1| hypothetical protein TERTU_0813 [Teredinibacter turnerae T7901]
gi|237684955|gb|ACR12219.1| putative membrane protein [Teredinibacter turnerae T7901]
Length = 354
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 70/136 (51%), Gaps = 12/136 (8%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
R ++LD RG+T+A+MILV+ G + P + H+ W+G+T DFV PFFLFIVG +
Sbjct: 1 MNERSLALDALRGITLAMMILVNTPGSWSHVYPPLLHANWHGVTPTDFVFPFFLFIVGCA 60
Query: 109 LALT-YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
L + KN + I R L LL + Y A+ R GVLQ
Sbjct: 61 LFFSNRKNHQLDIYTHALKIFRRTVLLLLAGLGLHAYL-------YSGTFAEFRLPGVLQ 113
Query: 168 RIAIAYLVAALCEIWL 183
RIA+AY AA +WL
Sbjct: 114 RIALAYGAAAFI-VWL 128
Score = 43.1 bits (100), Expect = 0.24, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 51/84 (60%), Gaps = 4/84 (4%)
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHL 364
FDPEGLLS++ A VT L G +IV + + ++L +I+++ IG+ L L + +
Sbjct: 183 FDPEGLLSTLPAIVTVLSGYEATRIIVE-RTTQQKVL--VIIAALAIGMALLLH-PWVPI 238
Query: 365 NKALYSLSYTCLTAGASGVLLAGI 388
NK L++ SY LT+G + ++L +
Sbjct: 239 NKYLWTSSYVLLTSGVAVLVLVAL 262
>gi|336411649|ref|ZP_08592112.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
gi|335941083|gb|EGN02943.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
Length = 375
Score = 68.9 bits (167), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 84/358 (23%), Positives = 138/358 (38%), Gaps = 87/358 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ ++ F P K RK I R + LF+ G+ +QG G+D
Sbjct: 71 LFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNLL--------GLDPK 122
Query: 159 QIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
+ + LQ IA YL+AA+ ++ H + + L + T LLL+
Sbjct: 123 HLYLYSNTLQAIATGYLIAAIIQL------HCNFRWQL-------------IVTALLLLI 163
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
Y ++P G P N +DR +LG H +
Sbjct: 164 Y--WIP---------------------MTFLGDFTPEGNFAEKVDRLVLG--HFRDGVFW 198
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
+ S + A ++ + SS+ T ++G G ++ KD+R
Sbjct: 199 NEDGSWSFS----------------AHYNYTWIWSSLTFGATVMLGAFAGKIMKAGKDNR 242
Query: 338 DRMLN-WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+++ +I+ LI L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 243 RKVVQTLLIIGLSLIAFSLIWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYRIDY 299
>gi|383119755|ref|ZP_09940493.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
gi|423252290|ref|ZP_17233284.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
CL03T00C08]
gi|423252861|ref|ZP_17233792.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
CL03T12C07]
gi|251944624|gb|EES85099.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
gi|392647563|gb|EIY41262.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
CL03T00C08]
gi|392659230|gb|EIY52856.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
CL03T12C07]
Length = 375
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 84/358 (23%), Positives = 138/358 (38%), Gaps = 87/358 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ ++ F P K RK I R + LF+ G+ +QG G+D
Sbjct: 71 LFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNLL--------GLDPK 122
Query: 159 QIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
+ + LQ IA YL+AA+ ++ H + + L + T LLL+
Sbjct: 123 HLYLYSNTLQAIATGYLIAAIIQL------HCNFRWQL-------------IVTALLLLI 163
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
Y ++P G P N +DR +LG H +
Sbjct: 164 Y--WIP---------------------MTFLGDFTPEGNFAEKVDRLVLG--HFRDGVFW 198
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
+ S + A ++ + SS+ T ++G G ++ KD+R
Sbjct: 199 NEDGSWSFS----------------AHYNYTWIWSSLTFGATVMLGAFAGKIMKAGKDNR 242
Query: 338 DRMLN-WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+++ +I+ LI L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 243 RKVVQTLLIIGISLIAFSLIWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYRIDY 299
>gi|418023168|ref|ZP_12662153.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
gi|353537051|gb|EHC06608.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
Length = 384
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 94/358 (26%), Positives = 146/358 (40%), Gaps = 90/358 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDV-GGIL------------PAINHSPWNGLTLADFVMPFF 101
RL+SLD RG + ++ + + GG+L ++HS W+G D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFNFYDLIFPLF 77
Query: 102 LFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
+F+ GV+L L+ K ++ R + R + L + L + HG D +I
Sbjct: 78 IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG- 219
R+ VL RIA A+ AAL +W H S + ++ L +LL YG
Sbjct: 137 RYASVLGRIAFAWFFAALL-VW-----HTSLRTQ-------------IIVALGILLGYGA 177
Query: 220 --LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQHLYRKPI 276
L++P FP G G P + +D +L G+ + R P
Sbjct: 178 MQLWLP-----FP--------------GGQAGVLSPTESINAYVDSILLPGVSYQGRTP- 217
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV--HFK 334
DPEGLLS++ A V L G+ GH +V H K
Sbjct: 218 -----------------------------DPEGLLSTIPAIVNALTGVFVGHFLVKSHPK 248
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ L+ G LD V + +NK L++ S+ +T+G S +LLA Y +V
Sbjct: 249 GEWAKVGLLAAAGGILLAFGWLLDLV-IPVNKELWTSSFVLVTSGWSMILLAVFYALV 305
>gi|60683670|ref|YP_213814.1| hypothetical protein BF4252 [Bacteroides fragilis NCTC 9343]
gi|423259842|ref|ZP_17240765.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
CL07T00C01]
gi|423267497|ref|ZP_17246478.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
CL07T12C05]
gi|60495104|emb|CAH09923.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
gi|387775880|gb|EIK37984.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
CL07T00C01]
gi|392696971|gb|EIY90158.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
CL07T12C05]
Length = 375
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 84/358 (23%), Positives = 138/358 (38%), Gaps = 87/358 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ ++ F P K RK I R + LF+ G+ +QG G+D
Sbjct: 71 LFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNLL--------GLDPK 122
Query: 159 QIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
+ + LQ IA YL+AA+ ++ H + + L + T LLL+
Sbjct: 123 HLYLYSNTLQAIATGYLIAAIIQL------HCNFRWQL-------------IVTALLLLI 163
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
Y ++P G P N +DR +LG H +
Sbjct: 164 Y--WIP---------------------MTFLGDFTPEGNFAEKVDRLVLG--HFRDGVFW 198
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
+ S + A ++ + SS+ T ++G G ++ KD+R
Sbjct: 199 NEDGSWSFS----------------AHYNYTWIWSSLTFGATVMLGAFAGKIMKAGKDNR 242
Query: 338 DRMLN-WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+++ +I+ LI L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 243 RKVVQTLLIIGISLIAFSLIWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYRIDY 299
>gi|119774084|ref|YP_926824.1| hypothetical protein Sama_0947 [Shewanella amazonensis SB2B]
gi|119766584|gb|ABL99154.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
Length = 378
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 102/383 (26%), Positives = 154/383 (40%), Gaps = 95/383 (24%)
Query: 45 KQTRPQHQQRRLISLDVFRG-----------LTVALMILVDDVGGILP--AINHSPWNGL 91
+ T+ + + RL+SLD RG L +AL L L ++HS W+G
Sbjct: 2 QATQTKAAKPRLMSLDALRGFDMFWILGGEKLFIALFALTGWSFWQLADAEMHHSEWHGF 61
Query: 92 TLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFFHGINN 150
T D + P F+F+ GV+L L+ K A R I R A+ L + L + HG
Sbjct: 62 TFYDLIFPLFIFLSGVALGLSPKRLDKLAPAERNPIYRHAVKRLFLLLALGVLYNHG--- 118
Query: 151 LKYGVDIA----QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVA 206
+G I ++R+ VL RIA A+ AAL +W H S + +A
Sbjct: 119 --WGTGIPAHSDEVRYASVLGRIAFAWFFAALL-VW-----HTSLRTQ----------IA 160
Query: 207 LVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
L L+ L++P V G G P+ + +D L
Sbjct: 161 TALAILFGYAAIQLWLP-------------------VPGGQAGVLTPSGSINAWVDTHFL 201
Query: 267 -GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLH 325
GI + +R P+DPEG+LS++ A V L+G+
Sbjct: 202 PGITYQHR------------------------------PYDPEGILSTLPAIVNALMGVF 231
Query: 326 FGHLIVHFKDHRDRMLNWIILSSC---LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASG 382
G IV D R IL+ + LG SLD V + +NK L++ S+ +T G +
Sbjct: 232 VGRFIVK-PDARGDWAKAGILTGAGGLSLVLGWSLDSV-LPVNKDLWTSSFVLVTTGWNL 289
Query: 383 VLLAGIYFMVRYISSHLMLKKPF 405
+ LA Y +V + + L PF
Sbjct: 290 LFLALFYVLVDVLGAK-RLAFPF 311
>gi|94985055|ref|YP_604419.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
gi|94555336|gb|ABF45250.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
Length = 573
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 42/110 (38%), Positives = 61/110 (55%), Gaps = 7/110 (6%)
Query: 38 PPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVG-GIL--PAINHSPWNGLTLA 94
P +++ P + RL +LD +RGLTV LM+LV++V G L P + H+P+ GLTL
Sbjct: 201 PAPTTSAGGAGPVQTRVRLTALDAWRGLTVLLMLLVNNVALGDLTPPQLQHAPFGGLTLT 260
Query: 95 DFVMPFFLFIVGVSLALTYKNFPCKVVA----TRKAILRALNLFLLGIFL 140
D V P+FLF G +L + V R+ + RA L+L+G FL
Sbjct: 261 DLVFPWFLFCAGAALPFSQAAMRRAGVTGWARVRRLVTRAALLYLVGAFL 310
>gi|383124758|ref|ZP_09945419.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
gi|251841090|gb|EES69171.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
Length = 376
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 146/365 (40%), Gaps = 90/365 (24%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADF 96
++ RL SLD+ RG + L++ V IL +H W G D
Sbjct: 6 ENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGFRFWDL 65
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHGINNLK 152
VMP FLF+ G S+ + + + R+ + R LF+ G+ +QG
Sbjct: 66 VMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNLL------- 118
Query: 153 YGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
G+D + I + LQ IA+ Y +AA+ ++ H S + W + +
Sbjct: 119 -GLDSSHIYLYSNTLQSIAVGYFIAAVIQL------HFSFR----------WQIGIT--- 158
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
LLLL+ ++P G PA N +DR +LG
Sbjct: 159 --LLLLFIYWIP---------------------MTFLGDFTPAGNFAEQVDRWVLG---- 191
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP-FDPEGLLSSVMATVTCLIGLHFGHLI 330
R + + D +W +P ++ + SS+ VT ++G G ++
Sbjct: 192 -------RFRDGVFWNED--------GTWSFSPYYNYTWIWSSLTFGVTVMLGAFAGKIM 236
Query: 331 VHFKDHRDRMLNWI-ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
K +R +++ + ++ L+GL + L + M + K L++ S T L+ G +L+A Y
Sbjct: 237 KEGKANRKKVVQTLSVIGVLLVGLAM-LWSLQMPIIKRLWTGSMTLLSGGYCFLLMALFY 295
Query: 390 FMVRY 394
+ + Y
Sbjct: 296 YWIDY 300
>gi|24375008|ref|NP_719051.1| N-acetylglucosamine locus membrane protein of unknown function
DUF1624 NagX [Shewanella oneidensis MR-1]
gi|24349746|gb|AAN56495.1| N-acetylglucosamine locus membrane protein of unknown function
DUF1624 NagX [Shewanella oneidensis MR-1]
Length = 395
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 104/394 (26%), Positives = 160/394 (40%), Gaps = 102/394 (25%)
Query: 22 STSAP---ANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG 78
ST+AP AN + + +N+ Q +P RL+SLD RG + ++ + + G
Sbjct: 2 STTAPELAANVSINAQV----ATANNSQPKP-----RLMSLDALRGFDMFWILGGEALFG 52
Query: 79 IL-------------PAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRK 125
L ++HS W+G L D + P F+F+ GV+L L+ K + R
Sbjct: 53 ALLIFTGWAGWQWGDTQMHHSEWHGFRLYDLIFPLFIFLSGVALGLSPKRLDKLPLHERL 112
Query: 126 AILR-ALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
+ R + L + L + HG VD +IR+ VL RIA A+ AAL +W
Sbjct: 113 PVYRHGVKRLFLLLLLGILYNHGWGT-GAPVDPDKIRYASVLGRIAFAWFFAALL-VW-- 168
Query: 185 GDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG---LYVPDWQYEFPVETSSSSPWIF 241
H S + VL + +L+ YG L++P FP
Sbjct: 169 ---HTSLRTQ-------------VLVAVGILVGYGAMQLWLP-----FP----------- 196
Query: 242 NVTCGVRGSTGPACNAVGMIDRKIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSW 300
G G P + +D +L G+ + R P
Sbjct: 197 ---GGQAGVLSPTVSINAYVDSLLLPGVSYQGRMP------------------------- 228
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIV--HFKDHRDRMLNWIILSSCLIGLGLSLD 358
DPEG+LS++ A V L G+ GH IV H K ++ + LG LD
Sbjct: 229 -----DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLGAAGGVCLALGWLLD 283
Query: 359 FVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
V + +NK L++ S+ +T+G S +LLA Y +V
Sbjct: 284 AV-IPVNKELWTSSFVLVTSGWSMLLLALFYALV 316
>gi|392551353|ref|ZP_10298490.1| hypothetical protein PspoU_08780 [Pseudoalteromonas spongiae
UST010723-006]
Length = 379
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 97/379 (25%), Positives = 155/379 (40%), Gaps = 94/379 (24%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTV-----------ALMILVDDVGGILPAIN--HS 86
S++N K ++RL SLD RG + AL +L G + HS
Sbjct: 2 SDTNKKT------KKRLASLDALRGFDMFWILGGEKIFAALFVLTGWAGWKVAEAQTLHS 55
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI-LRALNLFLLGIFLQGGFF 145
W+G T D + P F+F+ GV++ L+ K RK I ++A LL F +
Sbjct: 56 QWHGFTFYDLIFPLFIFLSGVAMGLSPKRIDHLPFVDRKPIYIKAFKRLLLLCFFGVLYN 115
Query: 146 HGINNLKYGVDI--AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHW 203
HG GV + ++R+ VL RIA+A+ VAA+ +W H S + +F
Sbjct: 116 HGWGT---GVPLNPEEVRYASVLGRIAVAWFVAAML-VW-----HTSFRTQMF------- 159
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRG--STGPACNAVGMI 261
+A +L ++ L ++P V GV G S G A +
Sbjct: 160 TLAGILIGYWVWL---CFIP-------------------VPGGVAGDLSAGGAGSWNAWF 197
Query: 262 DRKIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTC 320
D +L GI + KP+ DPEG+LSS+ A
Sbjct: 198 DTNLLPGITY-QNKPM-----------------------------DPEGILSSIPAIANA 227
Query: 321 LIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLS-LDFVGMHLNKALYSLSYTCLTAG 379
L G+ G I ++ + I+ ++ L+ L + L + +NK L++ S+ +T G
Sbjct: 228 LFGVIAGRYIKQAQERGEWKTAGILFAAGLVALAVGWLWNMVFPVNKDLWTSSFVMVTVG 287
Query: 380 ASGVLLAGIYFMVRYISSH 398
S +LLA Y +V ++
Sbjct: 288 WSFILLAVFYAVVDLLNQQ 306
>gi|375360501|ref|YP_005113273.1| putative transmembrane protein [Bacteroides fragilis 638R]
gi|301165182|emb|CBW24752.1| putative transmembrane protein [Bacteroides fragilis 638R]
Length = 373
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 84/356 (23%), Positives = 139/356 (39%), Gaps = 85/356 (23%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALT-YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
LF+ G S+ + +K+ P K RK I R + LF+ G+ +QG G+D +
Sbjct: 71 LFMTGASMPFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNLL--------GLDPKHL 122
Query: 161 R-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
+ LQ IA YL+AA+ ++ H + + L + T LLL+Y
Sbjct: 123 YLYSNTLQAIATGYLIAAIIQL------HCNFRWQL-------------IVTALLLLIY- 162
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
++P G P N +DR +LG H ++
Sbjct: 163 -WIP---------------------MTFLGDFTPEGNFAEKVDRLVLG--HFRDGVFWNE 198
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
S + A ++ + SS+ T ++G G ++ KD+R +
Sbjct: 199 DGSWSFS----------------AHYNYTWIWSSLTFGATVMLGAFAGKIMKAGKDNRRK 242
Query: 340 MLN-WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
++ +I+ LI L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 243 VVQTLLIIGISLIAFSLIWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYRIDY 297
>gi|423282787|ref|ZP_17261672.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
615]
gi|404581658|gb|EKA86354.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
615]
Length = 375
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 136/358 (37%), Gaps = 87/358 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ ++ F P K RK I R + LF+ G+ +QG G+D
Sbjct: 71 LFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNLL--------GLDPK 122
Query: 159 QIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
+ + LQ IA YL+AA+ ++ H + + L V AL+L
Sbjct: 123 HLYLYSNTLQAIATGYLIAAIIQL------HCNFRWQLI-------VTALLLV------- 162
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
I+ + G P N +DR +LG H +
Sbjct: 163 ----------------------IYWIPMTFLGDFTPEGNFAEKVDRLVLG--HFRDGVFW 198
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
+ S + A ++ + SS+ T ++G G ++ KD+R
Sbjct: 199 NEDGSWSFS----------------AHYNYTWIWSSLTFGATVMLGAFAGKIMKAGKDNR 242
Query: 338 DRMLN-WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+++ +I+ LI L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 243 RKVVQTLLIIGISLIAFSLIWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYRIDY 299
>gi|319785830|ref|YP_004145305.1| transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
gi|317464342|gb|ADV26074.1| putative transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
Length = 357
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 71/134 (52%), Gaps = 6/134 (4%)
Query: 54 RRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RRL S+D RGLTVA M+LV+ D G + + H+ W+G T AD V PFFL IVGVS+A
Sbjct: 7 RRLASVDALRGLTVAAMLLVNNPGDWGHVYAPLLHADWHGCTPADLVFPFFLAIVGVSIA 66
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
L + A R ++R + + L I G + +D R GVLQRI
Sbjct: 67 LGVVPR-IEAGADRAGLMRTVAVRPLRILAVGLLL--HLLAWWWLDQPHYRPWGVLQRIG 123
Query: 171 IAYLVAALCEIWLK 184
+ +L A ++L+
Sbjct: 124 LCFLGAGAAALYLR 137
>gi|373948546|ref|ZP_09608507.1| Protein of unknown function DUF2261, transmembrane [Shewanella
baltica OS183]
gi|386325609|ref|YP_006021726.1| hypothetical protein [Shewanella baltica BA175]
gi|333819754|gb|AEG12420.1| Protein of unknown function DUF2261, transmembrane [Shewanella
baltica BA175]
gi|373885146|gb|EHQ14038.1| Protein of unknown function DUF2261, transmembrane [Shewanella
baltica OS183]
Length = 384
Score = 68.2 bits (165), Expect = 8e-09, Method: Compositional matrix adjust.
Identities = 95/358 (26%), Positives = 146/358 (40%), Gaps = 90/358 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDV-GGIL------------PAINHSPWNGLTLADFVMPFF 101
RL+SLD RG + ++ + + GG+L ++HS W+G D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYDLIFPLF 77
Query: 102 LFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
+F+ GV+L L+ K ++ R + R + L + L + HG D +I
Sbjct: 78 IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG- 219
R+ VL RIA A+ AAL +W H S + ++ L +LL YG
Sbjct: 137 RYASVLGRIAFAWFFAALL-VW-----HTSLRTQ-------------IIVALGILLGYGA 177
Query: 220 --LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQHLYRKPI 276
L++P FP G G P + +D +L G+ + R P
Sbjct: 178 IQLWLP-----FP--------------GGQAGVLSPTESINAYVDSILLPGVSYQGRTP- 217
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV--HFK 334
DPEGLLS++ A V L G+ GH IV H K
Sbjct: 218 -----------------------------DPEGLLSTIPAIVNALAGVFVGHFIVKSHPK 248
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ + LG LD V + +NK L++ S+ +T+G S +LLA Y +V
Sbjct: 249 GEWAKVGVLAAAGGIFLVLGWLLDLV-IPVNKELWTSSFVLVTSGWSMILLAVFYALV 305
>gi|440791267|gb|ELR12512.1| transmembrane protein [Acanthamoeba castellanii str. Neff]
Length = 825
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 56/201 (27%), Positives = 84/201 (41%), Gaps = 50/201 (24%)
Query: 202 HWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN----- 256
HW+VA L +Y ++ + L VP CG RG GP +
Sbjct: 591 HWLVAFSLLFVYFMITFFLDVPG--------------------CG-RGYLGPDISTATGG 629
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
A G ID+KI H+Y +P C P + +DPEG L ++ +
Sbjct: 630 AAGYIDKKIFTEDHIYNQP------TCQ-------------PLYLTGSYDPEGTLGNLTS 670
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMH-----LNKALYSL 371
+GL G ++ +KDH+ R++ W I S L + L L + LNK L+S
Sbjct: 671 IFMVFLGLQSGRTLMAWKDHKHRVVRWYIWSIVLGFIALGLCEAKQNGGFFPLNKNLWSP 730
Query: 372 SYTCLTAGASGVLLAGIYFMV 392
S+ TA + LLA Y ++
Sbjct: 731 SFILATAAMAFFLLATFYLLI 751
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 52/82 (63%), Gaps = 3/82 (3%)
Query: 44 SKQTRPQHQQR---RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPF 100
+Q P+ R R+ SLD RGL +A+MI V+ GG NHS WNG+T+AD V P+
Sbjct: 384 DRQNGPKPAPRVSSRVNSLDAVRGLAIAIMIFVNYGGGGYWFFNHSAWNGITVADLVFPW 443
Query: 101 FLFIVGVSLALTYKNFPCKVVA 122
F++I+G S+A+++ + K++
Sbjct: 444 FIWIMGTSMAISFTSLEKKLLG 465
>gi|237722081|ref|ZP_04552562.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|293373568|ref|ZP_06619919.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|299145142|ref|ZP_07038210.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
gi|229448950|gb|EEO54741.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|292631466|gb|EFF50093.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|298515633|gb|EFI39514.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
Length = 377
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 86/368 (23%), Positives = 143/368 (38%), Gaps = 86/368 (23%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGL 91
K P++ RL SLD+ RG + L++ V IL +H W G
Sbjct: 2 KNKLPENNSIRLASLDILRGFDLFLLVFFQPVFVALARQMNMSFLDSILYQFDHEVWEGF 61
Query: 92 TLADFVMPFFLFIVGVSLALTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHG 147
D VMP FLF+ G S+ + + + R+ + R LF+ G+ +QG
Sbjct: 62 RFWDLVMPLFLFMTGASMPFSLSKYIGTTGSYWPVYRRILKRVFLLFIFGMIVQGNLL-- 119
Query: 148 INNLKYGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVA 206
G+D + + LQ IA+ YL+AA+ ++ H K W V
Sbjct: 120 ------GLDATHLYLYSNTLQSIAVGYLIAAVIQL------HFPFK----------WQVG 157
Query: 207 LVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
+ TL LLL+Y +VP G PA N +DR +L
Sbjct: 158 I---TLLLLLVY--WVP---------------------MTFLGDFTPAGNFAEQVDRCVL 191
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G +R ++ SP Y + + SS+ VT ++G
Sbjct: 192 G---RFRDGVFWNEDGTWSFSPSY---------------NYTWMWSSLTFGVTVMLGAFA 233
Query: 327 GHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
G ++ K +R R++ +++ + L + M + K +++ S T L+ G +L+A
Sbjct: 234 GKIMKEGKANRKRVVQTLLVVGLALVGLALLWSLQMPVIKRIWTGSMTLLSGGYCFLLMA 293
Query: 387 GIYFMVRY 394
Y+ + Y
Sbjct: 294 LFYYWIDY 301
>gi|53715734|ref|YP_101726.1| hypothetical protein BF4455 [Bacteroides fragilis YCH46]
gi|423271955|ref|ZP_17250924.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
CL05T00C42]
gi|423276040|ref|ZP_17254983.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
CL05T12C13]
gi|52218599|dbj|BAD51192.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|392696310|gb|EIY89506.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
CL05T00C42]
gi|392699545|gb|EIY92721.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
CL05T12C13]
Length = 375
Score = 67.8 bits (164), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 136/358 (37%), Gaps = 87/358 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ ++ F P K RK I R + LF+ G+ +QG G+D
Sbjct: 71 LFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNLL--------GLDPK 122
Query: 159 QIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
+ + LQ IA YL+AA+ ++ H + + L V AL+L
Sbjct: 123 HLYLYSNTLQAIATGYLIAAIIQL------HCNFRWQLI-------VTALLLV------- 162
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
I+ + G P N +DR +LG H +
Sbjct: 163 ----------------------IYWIPMTFLGDFTPEGNFAEKVDRLVLG--HFRDGVFW 198
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
+ S + A ++ + SS+ T ++G G ++ KD+R
Sbjct: 199 NEDGSWSFS----------------AHYNYTWIWSSLTFGATVMLGAFAGKIMKAGKDNR 242
Query: 338 DRMLN-WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+++ +I+ LI L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 243 RKVVQTLLIIGISLIAFSLIWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYRIDY 299
>gi|410663435|ref|YP_006915806.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
21679]
gi|409025792|gb|AFU98076.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
21679]
Length = 356
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 54/134 (40%), Positives = 75/134 (55%), Gaps = 13/134 (9%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSL 109
++R I+LD RGLT+ALMI+V+ G + + H+ W G T D V PFFLFIVG SL
Sbjct: 2 KQRYIALDALRGLTLALMIVVNTPGSWAHVYGPLLHADWMGWTFTDLVFPFFLFIVGASL 61
Query: 110 ALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ K A RK I R+L L +LG+ L+ + V + ++R GVLQ
Sbjct: 62 YFSQKGMASLTRADQLRKIIRRSLLLIVLGVLLE--------YYPFIVSLHELRLPGVLQ 113
Query: 168 RIAIAYLVAALCEI 181
RI +A+ VAAL +
Sbjct: 114 RIGLAFGVAALLVV 127
>gi|265767324|ref|ZP_06094990.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|263252629|gb|EEZ24141.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
Length = 375
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 83/358 (23%), Positives = 136/358 (37%), Gaps = 87/358 (24%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ ++ F P K RK I R + LF+ G+ +QG G+D
Sbjct: 71 LFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNLL--------GLDPK 122
Query: 159 QIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
+ + LQ IA YL+AA+ ++ H + + L V AL+L
Sbjct: 123 HLYLYSNTLQAIATGYLIAAIIQL------HCNFRWQLI-------VTALLLV------- 162
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
I+ + G P N +DR +LG H +
Sbjct: 163 ----------------------IYWIPMTFLGDFTPEGNFAEKVDRLVLG--HFRDGVFW 198
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
+ S + A ++ + SS+ T ++G G ++ KD+R
Sbjct: 199 NEDGSWSFS----------------AHYNYTWIWSSLTFGATVMLGAFAGKIMKAGKDNR 242
Query: 338 DRMLN-WIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+++ +I+ LI L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 243 RKVVQTLLIIGISLIAFSLIWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGAFYYRIDY 299
>gi|393763917|ref|ZP_10352530.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
gi|392605231|gb|EIW88129.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
Length = 394
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 90/369 (24%), Positives = 152/369 (41%), Gaps = 64/369 (17%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMP 99
+ +Q + R+++LD RGL + MILV++ G + P + H+ W+G T D + P
Sbjct: 5 HCQQILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFP 64
Query: 100 FFLFIVGVSL--ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGF--FHGINNLKYGV 155
FL +VG+++ +L + K R+ +RAL L+LLG+FL + F N
Sbjct: 65 AFLVMVGMAIPYSLAGRQMLPKAELLRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQ 124
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLL 215
+ +RW GVLQRI I Y L ++ G V W+ L L LY L
Sbjct: 125 KLLTVRWSGVLQRIGIVYFCTLLIVLYSGTRGRVL------------WLSGLCL--LYFL 170
Query: 216 LLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKP 275
L+ P+ N G N +D +LG H++ +
Sbjct: 171 LM-----------------QFVPYRDNYGHTFVGLWEHGNNLAAWLDHHVLGPNHVFFR- 212
Query: 276 IYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD 335
S FDPEG+LS++ A +CL G+ L+ K
Sbjct: 213 -----------------------SATPFAFDPEGILSTLPAIASCLSGVLMAQLL-QSKA 248
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYI 395
L + L+ L + +NK L++ ++ L++G + ++L G++ + +
Sbjct: 249 ELAFKLRVLFLAGLAAIWVAELAHPALPINKMLWTPTFVLLSSGFTALIL-GLFLWLTEM 307
Query: 396 SSHLMLKKP 404
+ + P
Sbjct: 308 KRYRLWTAP 316
>gi|433678126|ref|ZP_20510025.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
translucens pv. translucens DSM 18974]
gi|430816762|emb|CCP40477.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
translucens pv. translucens DSM 18974]
Length = 384
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 87/296 (29%), Positives = 119/296 (40%), Gaps = 71/296 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R +SLDVFRGL + LMIL + G A + H+PW G T AD P FLF+VG ++
Sbjct: 15 RERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAM 74
Query: 110 ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFF--HGINNLKYGVDIAQIRWMGVLQ 167
+ R+ R+ +FLLG + F G + + I Q R GVLQ
Sbjct: 75 SFALDRSQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQGADGSWSFIAIDQTRVPGVLQ 134
Query: 168 RIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQY 227
RIA+ Y +AAL WL G + + ++L GHW L L+G
Sbjct: 135 RIALCYALAALLCRWLPPRGLLGACVALL---LGHWAA---------LYLWG-------- 174
Query: 228 EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINS 287
+ E S NA +D +L LYRK
Sbjct: 175 QPGAELSKLG------------------NAATRLDLWLLNPAQLYRK------------- 203
Query: 288 PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
D G FDPEGLL ++ ATV + G + L V + R + W
Sbjct: 204 -DGG-------------FDPEGLLGTLPATVNVIAG-YLTGLYVRRAGKQARTVRW 244
>gi|329851798|ref|ZP_08266479.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
gi|328839647|gb|EGF89220.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
Length = 398
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/284 (27%), Positives = 114/284 (40%), Gaps = 58/284 (20%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIV 105
P+ R +LD+ RGL + M+L ++ G I ++H+ W+G TL D V P F+ V
Sbjct: 21 PKPASVRFEALDILRGLFIIGMLLANNAGDWSHIYTPLDHAEWHGFTLTDMVFPGFMTCV 80
Query: 106 GVSLALTY--KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWM 163
G+S+ L+ + A KA L +L I + G F NL D R
Sbjct: 81 GLSMTLSLGRRQKTLNSQAGGKAALLVHSLRRAAILVGIGLFL---NLLPQFDFEHWRLP 137
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
GVLQRI I Y +A+ + L + L L + W V ++ Y LLL + VP
Sbjct: 138 GVLQRIGICYAIASGLVV-LHSHQNQQGGLILHSRALALWGVGFLVA--YTLLLKYVPVP 194
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
D + S P +D ++LG+ H ++S K
Sbjct: 195 DGAGANQWDAIHSWP--------------------AWVDMQVLGVNH-----VWSGAKT- 228
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG 327
+DPEGLLSSV AT L G+ G
Sbjct: 229 ---------------------YDPEGLLSSVPATSNILFGILMG 251
>gi|298376668|ref|ZP_06986623.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
gi|298266546|gb|EFI08204.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
Length = 372
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 93/375 (24%), Positives = 140/375 (37%), Gaps = 115/375 (30%)
Query: 48 RPQHQQRRLISLDVFRG-----------LTVALMILVDD--VGGILPAINHSPWNGLTLA 94
+ + Q +RL SLD RG L VAL L + I + H WNGL
Sbjct: 3 KQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAGQMEHVEWNGLAHH 62
Query: 95 DFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINN 150
D + P FLFI G+S + + K + +K + R + L LG+ + N
Sbjct: 63 DTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGL---------VYN 113
Query: 151 LKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
+ +R VL RI + ++ AAL + + AL+L
Sbjct: 114 GLLSFEFDHLRCASVLARIGLGWMFAALLFVRFGWKARAG-------------ITALILV 160
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGP---ACNAVGMIDRKIL- 266
+L++ + + VPD G GP N VG IDR L
Sbjct: 161 GYWLVMAF-VPVPD-----------------------AGGAGPFTLEGNLVGYIDRLFLP 196
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G H + FDPEGL S+V A T ++G+
Sbjct: 197 GRLH-------------------------------ETVFDPEGLFSTVPAIATAMLGMFT 225
Query: 327 GHLIVHFKD---HRDRMLNWIILSSCLIGLGLSLDFVGM------HLNKALYSLSYTCLT 377
G I K+ R ++L CL+G G L VG+ +NK L++ S+ C+
Sbjct: 226 GEWIKLGKEGLTDRKKVL-------CLVGAGAVLLIVGLLWSLVFPVNKKLWTSSFVCVV 278
Query: 378 AGASGVLLAGIYFMV 392
GA V + ++F +
Sbjct: 279 -GAYSVWMFALFFYI 292
>gi|298386962|ref|ZP_06996516.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298260112|gb|EFI02982.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 376
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 83/365 (22%), Positives = 146/365 (40%), Gaps = 90/365 (24%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADF 96
++ RL SLD+ RG + L++ V IL +H W G D
Sbjct: 6 ENNTSRLASLDILRGFDLFLLVFFQPVFAALARQLNLPFLNDILYQFDHEVWEGFRFWDL 65
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHGINNLK 152
VMP FLF+ G S+ + + + R+ + R LF+ G+ +QG
Sbjct: 66 VMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNLL------- 118
Query: 153 YGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
G+D + I + LQ IA+ Y +AA+ ++ H S + W + +
Sbjct: 119 -GLDSSHIYLYSNTLQSIAVGYFIAAVIQL------HFSFR----------WQIGIT--- 158
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
LLLL+ ++P G PA N +DR +LG
Sbjct: 159 --LLLLFIYWIP---------------------MTFLGDFTPAGNFAEQVDRCVLG---- 191
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP-FDPEGLLSSVMATVTCLIGLHFGHLI 330
R + + D +W +P ++ + SS+ VT ++G G ++
Sbjct: 192 -------RFRDGVFWNED--------GTWSFSPYYNYTWIWSSLTFGVTVMLGAFAGKIM 236
Query: 331 VHFKDHRDRMLNWI-ILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
K +R +++ + ++ L+GL + L + M + K L++ S T L+ G +L+A Y
Sbjct: 237 KEGKANRKKVVQTLSVIGVLLVGLAM-LWSLQMPIIKRLWTGSMTLLSGGYCFLLMALFY 295
Query: 390 FMVRY 394
+ + Y
Sbjct: 296 YWIDY 300
>gi|218260820|ref|ZP_03475939.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
DSM 18315]
gi|423344001|ref|ZP_17321714.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
CL02T12C29]
gi|218224343|gb|EEC96993.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
DSM 18315]
gi|409213863|gb|EKN06876.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
CL02T12C29]
Length = 371
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 96/368 (26%), Positives = 148/368 (40%), Gaps = 101/368 (27%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMI----LVDDVGGILP---------AINHSPWNGLTLA 94
Q Q RRL+SLD RG + ++ L + + P ++H W+GLT
Sbjct: 2 EKQKQSRRLLSLDALRGFDMFFIMGGASLFVALATLFPNPFFQVIGDQMHHVKWDGLTHH 61
Query: 95 DFVMPFFLFIVGVS--LALTYKNFPCKVVAT--RKAILRALNLFLLGIFLQGGFFHGINN 150
D + P FLFI G+S +L + K A RK I R L L +LG ++G+ N
Sbjct: 62 DTIFPLFLFIAGISFPFSLEKQREQGKTDADIYRKIIRRGLTLVVLGF-----VYNGLLN 116
Query: 151 LKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
D R+ VL RI +A++ AL + + V W+ +L
Sbjct: 117 F----DFEHQRYASVLGRIGLAWMFGALIFVNTRTITRV-------------WITVAILV 159
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQ 269
+LLL + + PD GV G + VG +DR +L G
Sbjct: 160 GYWLLLAF-VPAPD-----------------GNGAGVFTMEG---SLVGYVDRLLLPGRL 198
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
HL DPEG+LS+V A T L+G+ G
Sbjct: 199 HL-------------------------------TVHDPEGILSTVPAVATALLGMFTGEF 227
Query: 330 IVHFKD---HRDRMLNWIILSSCLIGLGL--SLDFVGMHLNKALYSLSYTCLTAGASGVL 384
I ++ + ++ +I + L+ +GL SL F +NK L++ S+ C+ GA V
Sbjct: 228 IKMQREGLTDKKKVGGLVIAGAVLLAVGLLWSLFF---PINKNLWTSSFVCVV-GAYSVW 283
Query: 385 LAGIYFMV 392
+ ++F V
Sbjct: 284 MFALFFYV 291
>gi|380693009|ref|ZP_09857868.1| hypothetical protein BfaeM_03398 [Bacteroides faecis MAJ27]
Length = 376
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 87/370 (23%), Positives = 146/370 (39%), Gaps = 96/370 (25%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLA 94
+ ++ RL SLD+ RG + L++ V IL +H W G
Sbjct: 4 KSENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGFRFW 63
Query: 95 DFVMPFFLFIVGVSLALTYKNFPCKVVAT--------RKAILRALNLFLLGIFLQGGFFH 146
D VMP FLF+ G S+ + K + T R+ + R LF+ G+ +QG
Sbjct: 64 DLVMPLFLFMTGASMPFS----LSKYIGTSGSYRPVYRRILKRVFLLFVFGMIVQGNLL- 118
Query: 147 GINNLKYGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVV 205
G+D I + LQ IA+ YL+AA+ ++ H S K W +
Sbjct: 119 -------GLDGKHIYLYSNTLQSIAVGYLIAAVIQL------HFSFK----------WQI 155
Query: 206 ALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKI 265
+ LLLL+ ++P G PA N +DR +
Sbjct: 156 GIT-----LLLLFVYWIP---------------------MTFLGDFTPAGNFAEQVDRWV 189
Query: 266 LGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLH 325
LG +R +Y SP Y + + SS+ VT ++G
Sbjct: 190 LG---RFRDGVYWNEDGTWNFSPYY---------------NYTWIWSSLTFGVTVMLGAF 231
Query: 326 FGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDF-VGMHLNKALYSLSYTCLTAGASGVL 384
G ++ K R R++ ++ ++ +GL++ + + M + K L++ S T L+ G +L
Sbjct: 232 AGKIMKEGKADRKRVVQ-VLSVVGVLLIGLAMLWSLQMPVIKRLWTGSMTLLSGGYCFLL 290
Query: 385 LAGIYFMVRY 394
+A Y+ + Y
Sbjct: 291 MALFYYWIDY 300
>gi|423345097|ref|ZP_17322786.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
CL03T12C32]
gi|409222883|gb|EKN15820.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
CL03T12C32]
Length = 370
Score = 67.0 bits (162), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 143/371 (38%), Gaps = 107/371 (28%)
Query: 48 RPQHQQRRLISLDVFRG-----------LTVALMILVDD-----VGGILPAINHSPWNGL 91
Q Q RRL+SLD RG L VAL L + +GG ++H W+GL
Sbjct: 2 EKQKQSRRLLSLDALRGVDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGL 58
Query: 92 TLADFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHG 147
T D + P FLFI G+S + + + + ++ + R + L LLG
Sbjct: 59 THHDTIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGC--------- 109
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVAL 207
+ N D A +R VL RI + ++ AAL + H FR W+ +
Sbjct: 110 VYNGLLQFDFANLRCASVLARIGLGWMFAALLFV------H-------FRTSVRAWIAGV 156
Query: 208 VLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL- 266
+L ++ W PV + + P+ N VG +DR +L
Sbjct: 157 ILVGYWV----------WIAFIPVPGAEAGPFTLE------------GNWVGYVDRLLLP 194
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G H Q FDPEGLLS++ VT ++G+
Sbjct: 195 GRLH-------------------------------QGFFDPEGLLSTLPGIVTAMLGMFT 223
Query: 327 GHLIVHFKD-----HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGAS 381
G I K+ + L + L+G+ SL F +NK L+S S+ C+ S
Sbjct: 224 GEFIKLRKEGLTDKRKVAGLVLAGVLLLLVGVLWSLVF---PINKKLWSSSFVCVVGAYS 280
Query: 382 GVLLAGIYFMV 392
+ A Y++V
Sbjct: 281 VWMFALFYYIV 291
>gi|150007980|ref|YP_001302723.1| hypothetical protein BDI_1342 [Parabacteroides distasonis ATCC
8503]
gi|256840846|ref|ZP_05546354.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|423331513|ref|ZP_17309297.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
CL03T12C09]
gi|149936404|gb|ABR43101.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
gi|256738118|gb|EEU51444.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|409230083|gb|EKN22951.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
CL03T12C09]
Length = 372
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 141/379 (37%), Gaps = 123/379 (32%)
Query: 48 RPQHQQRRLISLDVFRG-----------LTVALMILVDD--VGGILPAINHSPWNGLTLA 94
+ + Q +RL SLD RG L VAL L + I + H WNGL
Sbjct: 3 KQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAGQMEHVEWNGLAHH 62
Query: 95 DFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINN 150
D + P FLFI G+S + + K + +K + R + L LG+ G
Sbjct: 63 DTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNG-------L 115
Query: 151 LKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
L + D +R VL RI + ++ AAL + R W V +T
Sbjct: 116 LSFEFD--HLRCASVLARIGLGWMFAALLFV------------------RFGWKVRAGIT 155
Query: 211 TL----YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGP---ACNAVGMIDR 263
L Y L + + VPD G GP N VG IDR
Sbjct: 156 VLILVGYWLAMAFVPVPD-----------------------AGGAGPFTLEGNLVGYIDR 192
Query: 264 KIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLI 322
L G H + FDPEGL S+V A T ++
Sbjct: 193 LFLPGRLH-------------------------------ETVFDPEGLFSTVPAIATAML 221
Query: 323 GLHFGHLIVHFKD---HRDRMLNWIILSSCLIGLGLSLDFVGM------HLNKALYSLSY 373
G+ G I K+ R+++L CL+G G L VG+ +NK L++ S+
Sbjct: 222 GMFTGEWIKLRKEGLTDRNKVL-------CLVGAGAVLLIVGLLWSLVFPINKKLWTSSF 274
Query: 374 TCLTAGASGVLLAGIYFMV 392
C+ GA V + ++F +
Sbjct: 275 VCVV-GAYSVWMFALFFYI 292
>gi|440731410|ref|ZP_20911431.1| membrane protein [Xanthomonas translucens DAR61454]
gi|440373102|gb|ELQ09871.1| membrane protein [Xanthomonas translucens DAR61454]
Length = 384
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 90/302 (29%), Positives = 122/302 (40%), Gaps = 83/302 (27%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNGLTLADFVMPFFLFIVGVSL 109
+ R +SLDVFRGL + LMIL + G A + H+PW G T AD P FLF+VG ++
Sbjct: 15 RERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAM 74
Query: 110 ALTYKN------FPCKVVATRKAILRALNLFLLGIFLQGGFF--HGINNLKYGVDIAQIR 161
+ F C+ V R A+ +FLLG + F G + + I Q R
Sbjct: 75 SFALDRSQPLGAFLCR-VGKRSAL-----IFLLGFLMYWFPFVHQGADGSWSFIAIDQTR 128
Query: 162 WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLY 221
GVLQRIA+ Y +AAL WL G + + ++L GHW L L+G
Sbjct: 129 VPGVLQRIALCYALAALLCRWLPPRGLLGACVALL---LGHWAA---------LYLWG-- 174
Query: 222 VPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTK 281
+ E S NA +D +L LYRK
Sbjct: 175 ------QPGAELSKLG------------------NAATRLDLWLLNPAQLYRK------- 203
Query: 282 QCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRML 341
D G FDPEGLL ++ ATV + G + L V + R +
Sbjct: 204 -------DGG-------------FDPEGLLGTLPATVNVIAG-YLTGLYVRRAGKQARTV 242
Query: 342 NW 343
W
Sbjct: 243 RW 244
>gi|189463416|ref|ZP_03012201.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM 17136]
gi|189429845|gb|EDU98829.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM 17136]
Length = 82
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 47/70 (67%), Gaps = 3/70 (4%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVG 106
+ +RLISLDV RG+T+A MI+V++ G + + H+ WNGLT D V PFF+FI+G
Sbjct: 1 MEKAQRLISLDVLRGITIAGMIIVNNPGSWKHVYTPLTHAVWNGLTPTDLVFPFFMFIMG 60
Query: 107 VSLALTYKNF 116
+S ++ K F
Sbjct: 61 ISTYISLKKF 70
>gi|154492358|ref|ZP_02031984.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
43184]
gi|423722056|ref|ZP_17696232.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
CL09T00C40]
gi|154087583|gb|EDN86628.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
43184]
gi|409242758|gb|EKN35518.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
CL09T00C40]
Length = 370
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 142/371 (38%), Gaps = 107/371 (28%)
Query: 48 RPQHQQRRLISLDVFRG-----------LTVALMILVDD-----VGGILPAINHSPWNGL 91
Q Q RRL+SLD RG L VAL L + +GG ++H W+GL
Sbjct: 2 EKQKQSRRLLSLDALRGFDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGL 58
Query: 92 TLADFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHG 147
T D + P FLFI G+S + + + + ++ + R + L LLG
Sbjct: 59 THHDTIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGC--------- 109
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVAL 207
+ N D A +R VL RI + ++ AAL + H FR W+
Sbjct: 110 VYNGLLQFDFANLRCASVLARIGLGWMFAALLFV------H-------FRTSVRAWIAGT 156
Query: 208 VLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL- 266
+L ++ W PV + + P+ N VG +DR +L
Sbjct: 157 ILVGYWV----------WIAFIPVPGAEAGPFTLE------------GNWVGYVDRLLLP 194
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
G H Q FDPEGLLS++ VT ++G+
Sbjct: 195 GRLH-------------------------------QGFFDPEGLLSTLPGIVTAMLGMFT 223
Query: 327 GHLIVHFKD-----HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGAS 381
G I K+ + L + L+G+ SL F +NK L+S S+ C+ S
Sbjct: 224 GEFIKLRKEGLTDKRKVAGLVLAGVLLLLVGVLWSLVF---PINKKLWSSSFVCVVGAYS 280
Query: 382 GVLLAGIYFMV 392
+ A Y++V
Sbjct: 281 VWMFALFYYIV 291
>gi|301309930|ref|ZP_07215869.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423340409|ref|ZP_17318148.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
CL09T03C24]
gi|300831504|gb|EFK62135.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409227844|gb|EKN20740.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
CL09T03C24]
Length = 372
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 98/379 (25%), Positives = 140/379 (36%), Gaps = 123/379 (32%)
Query: 48 RPQHQQRRLISLDVFRG-----------LTVALMILVDD--VGGILPAINHSPWNGLTLA 94
+ + Q +RL SLD RG L VAL L + I + H WNGL
Sbjct: 3 KQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAGQMEHVEWNGLAHH 62
Query: 95 DFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINN 150
D + P FLFI G+S + + K + +K + R + L LG+ G
Sbjct: 63 DTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNG-------L 115
Query: 151 LKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
L + D +R VL RI + ++ AAL + R W V +T
Sbjct: 116 LSFEFD--HLRCASVLARIGLGWMFAALLFV------------------RFGWKVRAGIT 155
Query: 211 TL----YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGP---ACNAVGMIDR 263
L Y L + + VPD G GP N VG IDR
Sbjct: 156 VLILVGYWLAMAFVPVPD-----------------------AGGAGPFTLEGNLVGYIDR 192
Query: 264 KIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLI 322
L G H + FDPEGL S+V A T ++
Sbjct: 193 LFLPGRLH-------------------------------ETVFDPEGLFSTVPAIATAML 221
Query: 323 GLHFGHLIVHFKD---HRDRMLNWIILSSCLIGLGLSLDFVGM------HLNKALYSLSY 373
G+ G I K+ R ++L CL+G G L VG+ +NK L++ S+
Sbjct: 222 GMFTGEWIKLRKEGLTDRKKVL-------CLVGAGAVLLIVGLLWSLVFPINKKLWTSSF 274
Query: 374 TCLTAGASGVLLAGIYFMV 392
C+ GA V + ++F +
Sbjct: 275 VCVV-GAYSVWMFALFFYI 292
>gi|357628855|gb|EHJ78009.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
plexippus]
Length = 275
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 54/168 (32%), Positives = 85/168 (50%), Gaps = 26/168 (15%)
Query: 43 NSKQTRPQHQQR-RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFF 101
++ + P+ R RL SLD+FRG+ +ALM +H+ WNGLT+AD V P+F
Sbjct: 69 TTEASVPRSPTRSRLRSLDIFRGIAIALMQANK--------FSHAVWNGLTVADLVFPWF 120
Query: 102 LFIVGVSLALTYK-----NFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
F +G ++ L+ + P +V A + R+L L L+GI L G N +
Sbjct: 121 AFTMGEAMVLSLNARLRTSLP-RVNALGQVARRSLLLSLIGICL------GSVNTNW--- 170
Query: 157 IAQIRWMGVLQRIAIAYLVAALCE-IWLKGDGHVSSKLSLFRKYRGHW 203
+ +R+ GVLQR+A YL+ E +++ ++ SLFR W
Sbjct: 171 -SYVRFPGVLQRLAAMYLIVGSLECAFMRTSQNIIPGRSLFRDIAAGW 217
>gi|297567057|ref|YP_003686029.1| hypothetical protein [Meiothermus silvanus DSM 9946]
gi|296851506|gb|ADH64521.1| Protein of unknown function DUF2261, transmembrane [Meiothermus
silvanus DSM 9946]
Length = 377
Score = 66.2 bits (160), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 56/152 (36%), Positives = 79/152 (51%), Gaps = 25/152 (16%)
Query: 38 PPSNSNSKQTR--PQHQQRRLISLDVFRGLTVALMILVDDVGGILPA-----INHSPWN- 89
PP+ +T + RL SLDVFRGLT+ LM+LV++V L A + H+PW
Sbjct: 8 PPTQDQQTETPFPSRKTAMRLGSLDVFRGLTILLMLLVNNVA--LDANTPYLLTHAPWKG 65
Query: 90 GLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFF 145
G+ LAD V P+FL VGV++ +F K + + K I R++ LF LG+ +
Sbjct: 66 GVYLADLVFPWFLLAVGVAIPFAAASFRKKNLPSWRYDLKIIQRSIVLFGLGLLIVSSI- 124
Query: 146 HGINNLKYGVDIAQIRWMGVLQRIAIAYLVAA 177
+ +D VLQ IA+AYLVAA
Sbjct: 125 --ARRPVFALD--------VLQLIAMAYLVAA 146
>gi|254524630|ref|ZP_05136685.1| putative heparan-alpha-glucosaminide N-acetyltransferase
(transmembrane protein 76) [Stenotrophomonas sp. SKA14]
gi|219722221|gb|EED40746.1| putative heparan-alpha-glucosaminide N-acetyltransferase
(transmembrane protein 76) [Stenotrophomonas sp. SKA14]
Length = 355
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 69/134 (51%), Gaps = 15/134 (11%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RRL S+D RG+TVA M+LV++ G + + HS W+G T D V PFFLF+VGVS+A
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 66
Query: 111 LTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ A R + RAL + L G L + ++ + + GVL
Sbjct: 67 FSVAPRALDAAARPALARGVLERALRILLAGALLHLLIWWALHTHHFRI-------WGVL 119
Query: 167 QRIAI-AYLVAALC 179
QRIA+ A LV L
Sbjct: 120 QRIAVCAALVGVLA 133
>gi|322436067|ref|YP_004218279.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
gi|321163794|gb|ADW69499.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
Length = 391
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 32/64 (50%), Positives = 43/64 (67%), Gaps = 3/64 (4%)
Query: 55 RLISLDVFRGLTVALMILVDD---VGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R++S+DV RGLT+ALMILV+D G + P + H+ WNG T AD V P FLF+ G SL
Sbjct: 16 RVLSIDVLRGLTIALMILVNDPGDAGCVYPQLQHAEWNGYTAADLVFPNFLFLGGASLVF 75
Query: 112 TYKN 115
+ +
Sbjct: 76 SLQG 79
>gi|255037019|ref|YP_003087640.1| hypothetical protein Dfer_3263 [Dyadobacter fermentans DSM 18053]
gi|254949775|gb|ACT94475.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 380
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 92/373 (24%), Positives = 138/373 (36%), Gaps = 94/373 (25%)
Query: 39 PSNSNSKQTRPQHQQRRLISLDVFRGLTVALMI-----LVDDVGG---------ILPAIN 84
PS K+ RP RL S+D RG + LMI + +GG +
Sbjct: 4 PSVVVKKEVRPSSSPGRLASIDALRGFDM-LMIAGGGQFIATLGGKTGISFIDAVAAQFE 62
Query: 85 HSPWNGLTLADFVMPFFLFIVGVSLALTY-----KNFPCKVVATRKAILRALNLFLLGIF 139
H WNG T DF+ P FLF+ G SLA + K P V+ K R L L LGI
Sbjct: 63 HPAWNGFTFYDFIFPLFLFLAGTSLAFSVTGGLAKGIPPSVI-RNKVFKRMLILIALGIL 121
Query: 140 LQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKY 199
+ N D A IR+ VL RI +A ++A+ + + R Y
Sbjct: 122 DK-------NAPMDIFDPAHIRYGSVLGRIGLATFISAILYMKFGTNQ---------RLY 165
Query: 200 RGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVG 259
G ++ TL L+ + G D +E N VG
Sbjct: 166 IGVGILVAYYLTLILVPVPGFGSGDLTFE--------------------------GNLVG 199
Query: 260 MIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVT 319
IDR + P L Q +D LL+ A
Sbjct: 200 WIDRNFM-------------------------PGILK-----QGTYDELALLTQFPALCL 229
Query: 320 CLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAG 379
L G G +++ ++ ++ +S I +G+ +F +NK L+S S+ LT+G
Sbjct: 230 TLFGTVAGDVLLRENRGNSKIGKLLLFASTGISIGILWNF-AFPINKHLWSSSFIMLTSG 288
Query: 380 ASGVLLAGIYFMV 392
+ +LA Y+++
Sbjct: 289 MAFAMLALFYWII 301
>gi|311031971|ref|ZP_07710061.1| hypothetical protein Bm3-1_15792 [Bacillus sp. m3-13]
Length = 370
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 85/339 (25%), Positives = 139/339 (41%), Gaps = 84/339 (24%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDV--GGILPAINHSPWNGLTLADFVMPFFLF 103
+T ++R S+DV RG+ V + + V + G + H+ W GLT+ D V P FL
Sbjct: 2 ETNNHITKKRYRSIDVTRGIVVLVSVFVSALPGGAEYDFLRHAYWYGLTITDLVFPAFLT 61
Query: 104 IVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWM 163
+ G+ LA+ Y+ K V + + R L L G+ F+ I + + D++ +R+
Sbjct: 62 VYGIGLAIVYR----KGVRWKDLLRRTFLLVLYGL-----LFNLIASWSF--DLSTLRFT 110
Query: 164 GVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVP 223
GVLQ AI L + KG ++ + +V+ T YL +L
Sbjct: 111 GVLQLFAITGLGVVVLSYLAKG-------------WKSMLALGMVIATAYLSILV----- 152
Query: 224 DWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQC 283
I +V C G CN G++D + G +H+Y +
Sbjct: 153 ----------------ISSVGCE-GGVPQRDCNPSGVVDVLVFGEKHMYAQG-------- 187
Query: 284 SINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW 343
+ FDPEG+LS A G G ++ + R ++L
Sbjct: 188 ------------------EKGFDPEGILSIFSALSNVAFGFAVGLVL----NGRKQILQR 225
Query: 344 IILSSCLIGLGLSLDFV---GMHLNKALYSLSYTCLTAG 379
+ S IGL +SL F+ + LNK L++ S+ L +G
Sbjct: 226 VFGIS--IGL-ISLAFIFNNFIELNKRLWTPSFAILASG 261
>gi|116748970|ref|YP_845657.1| hypothetical protein Sfum_1534 [Syntrophobacter fumaroxidans MPOB]
gi|116698034|gb|ABK17222.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB]
Length = 374
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 53/135 (39%), Positives = 69/135 (51%), Gaps = 24/135 (17%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGIL---PAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLD FRG +A MILV+ G + + H+ WNG T AD + P FLF+VGVS+
Sbjct: 9 RLASLDAFRGAVIAGMILVNSPGRWVYTYSQLKHAQWNGWTFADTIFPAFLFVVGVSMVF 68
Query: 112 TY-KNFPCKVVAT-------RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWM 163
++ + C+ A R+ L L LL + L FHG N +R
Sbjct: 69 SFSRRRECEEPAWRLVLQVFRRTSLIFLLGLLLNVMLD---FHGSN----------LRIP 115
Query: 164 GVLQRIAIAYLVAAL 178
GVLQRIA Y VA+L
Sbjct: 116 GVLQRIAACYFVASL 130
>gi|297300348|ref|XP_001115683.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Macaca mulatta]
Length = 547
Score = 65.1 bits (157), Expect = 6e-08, Method: Compositional matrix adjust.
Identities = 92/347 (26%), Positives = 136/347 (39%), Gaps = 77/347 (22%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQ-------QRRLISLDVFRGLTVALMI 71
ISK+ S+ ++L L PS ++ Q RL S+D FRG+ + LM+
Sbjct: 224 ISKAISSRETDRLINSELGSPSRTDPLDGDVQPAVWHLSVPPPRLRSVDTFRGIALILMV 283
Query: 72 LVDDVGGILPAINHSPWNGL-----TLADFVMPFFLFIVGVSLALTYKNF----PCKVVA 122
V+ GG H+ WNG + F M F+FI+G S+ L+ + K
Sbjct: 284 FVNYGGGKYWYFKHASWNGTLPMQCGICIFAM-MFVFIMGSSIFLSMTSILQRGCSKFRL 342
Query: 123 TRKAILRALNLFLLGIFLQGGFFHGINNLKY---GVDIAQIRWMGVLQRIAIAYLVAALC 179
K R+ L +GI I N Y + ++R GVLQR+ + Y V A+
Sbjct: 343 LGKIAWRSFLLICIGII--------IVNPNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVL 394
Query: 180 EIWLKGDGHVSSKLSLFRKYRG---------HWVVALVLTTLYLLLLYGLYVPDWQYEFP 230
E+ V +L R W++ L L L+L L + L VP
Sbjct: 395 ELLFAKP--VPEHCALERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVPG------ 446
Query: 231 VETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLYRKPIYSRTKQCSIN 286
T P G G G P C A G IDR +LG HLY+ P + +
Sbjct: 447 CPTGYLGP-------GGIGDFGKYPNCTGGAAGYIDRLLLGDDHLYQHPSSTVLYHTEV- 498
Query: 287 SPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHF 333
+DPEG+L ++ + V +G+ + VHF
Sbjct: 499 -----------------AYDPEGILGTINSIVMAFLGV---QVFVHF 525
>gi|386719962|ref|YP_006186288.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
maltophilia D457]
gi|384079524|emb|CCH14124.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
maltophilia D457]
Length = 352
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 47/125 (37%), Positives = 65/125 (52%), Gaps = 14/125 (11%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
RRL S+D RG+TVA M+LV++ G + + HS W+G T D V PFFLF+VGVS+A
Sbjct: 4 RRLGSIDALRGITVAAMLLVNNPGDWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSMA 63
Query: 111 LTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVL 166
+ A R + RAL + L G L + ++ + R GVL
Sbjct: 64 FSVAPRALDAAARPALARGVLERALRILLAGALLHLLIWWALHTHHF-------RIWGVL 116
Query: 167 QRIAI 171
QRIA+
Sbjct: 117 QRIAV 121
>gi|403236334|ref|ZP_10914920.1| hypothetical protein B1040_11244 [Bacillus sp. 10403023]
Length = 373
Score = 65.1 bits (157), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 141/336 (41%), Gaps = 77/336 (22%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG-GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
+ R+ SLD+ RG+ V + + + G H+ W GLTL DF++P F+ + GV +A+
Sbjct: 9 RSRIHSLDMARGIIVVFSVFLSSLPYGSYDFATHASWYGLTLVDFILPCFITVFGVGMAI 68
Query: 112 TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAI 171
Y+ K V ++ I R + L L G+ F+ I + + D++ +R+ GVLQ A+
Sbjct: 69 AYQ----KGVKWKRFISRTIKLILFGL-----LFNII--VAWSFDLSTLRFTGVLQMYAL 117
Query: 172 AYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPV 231
+ L ++K VS +V L++ ++Y +L LY+
Sbjct: 118 LGIGTVLITRFIKRPITVS-------------LVGLLIFSIYGAIL--LYMGQ------- 155
Query: 232 ETSSSSPWIFNVTCGVRGST-GPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
TC GS P CN ++D + G H+Y
Sbjct: 156 ------------TC--EGSLPQPGCNPSWLVDPVVFGESHIY------------------ 183
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
S + FDPEG+ + A L+G G +I+ +K+ + L
Sbjct: 184 --------SLGERGFDPEGIPAIFSALGNVLLGFAAGRIIILYKEKGAGWRLLLHGVLLL 235
Query: 351 IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
L F + K L++ S+ +TAGA+ +LLA
Sbjct: 236 ALAFLVEAF--LPFGKRLWTPSFGLVTAGATSLLLA 269
>gi|255013328|ref|ZP_05285454.1| hypothetical protein B2_05430 [Bacteroides sp. 2_1_7]
gi|410103821|ref|ZP_11298742.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
gi|409236550|gb|EKN29357.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
Length = 372
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 96/379 (25%), Positives = 138/379 (36%), Gaps = 123/379 (32%)
Query: 48 RPQHQQRRLISLDVFRG-----------LTVALMILVDD--VGGILPAINHSPWNGLTLA 94
+ + Q +RL SLD RG L VAL L + I + H WNGL
Sbjct: 3 KQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAGQMEHVEWNGLAHH 62
Query: 95 DFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINN 150
D + P FLFI G+S + + K + +K + R + L LG+ + N
Sbjct: 63 DTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGL---------VYN 113
Query: 151 LKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
+ +R VL RI + ++ AAL + R W V +T
Sbjct: 114 GLLSFEFDHLRCASVLARIGLGWMFAALLFV------------------RFGWKVRAGIT 155
Query: 211 TL----YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGP---ACNAVGMIDR 263
L Y L + + VPD G GP N VG IDR
Sbjct: 156 VLILVGYWLAMAFVPVPD-----------------------VGGAGPFTLEGNLVGYIDR 192
Query: 264 KIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLI 322
L G H + FDPEGL S+V A T ++
Sbjct: 193 LFLPGRLH-------------------------------ETVFDPEGLFSTVPAIATAML 221
Query: 323 GLHFGHLIVHFKD---HRDRMLNWIILSSCLIGLGLSLDFVGM------HLNKALYSLSY 373
G+ G I K+ R + L CL+G G L VG+ +NK L++ S+
Sbjct: 222 GMFTGEWIKLRKEGLTDRKKEL-------CLVGAGAVLLIVGLLWSLVFPINKKLWTSSF 274
Query: 374 TCLTAGASGVLLAGIYFMV 392
C+ GA V + ++F +
Sbjct: 275 VCVV-GAYSVWMFALFFYI 292
>gi|403717790|ref|ZP_10942873.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
100340]
gi|403208927|dbj|GAB97556.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
100340]
Length = 461
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 75/313 (23%), Positives = 115/313 (36%), Gaps = 78/313 (24%)
Query: 21 KSTSAPANEKLERDPLLPPSNSNSKQTRPQHQ--QRRLISLDVFRGLTVALMILVDDVGG 78
++ S PA + L P + S++K P RR ISLDV RGL + + + V+
Sbjct: 51 ETQSLPATQSL------PATQSSTKSPPPAQSFPSRRFISLDVARGLMLVVSVAVNAWIT 104
Query: 79 ILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGI 138
H+ W G+ D V P F+ + G LA Y + + R + R + L L G+
Sbjct: 105 APAWFEHAAWAGVHPVDLVFPTFVALSGAGLAFAY----ARRIPLRPLLSRVIVLALAGL 160
Query: 139 FLQGGFFHGINNLKYG-VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFR 197
+ L G +D A R GVLQ A LV AL + R
Sbjct: 161 ----AYNAHAQYLSTGQLDWATFRIPGVLQLYAAIVLVIALLHF-------------VLR 203
Query: 198 KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNA 257
++ + +V T + L L F C G+ P CN
Sbjct: 204 RWWAWPLFTIVAATCFALALNR---------------------FAAGC-PGGALTPECNP 241
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
G+ D + G++H+Y + + DPEG+++ AT
Sbjct: 242 SGLFDPALFGVEHIYHQGRFGH--------------------------DPEGIVALAGAT 275
Query: 318 VTCLIGLHFGHLI 330
+ G GHL+
Sbjct: 276 IVACAGASLGHLL 288
>gi|224027055|ref|ZP_03645421.1| hypothetical protein BACCOPRO_03816, partial [Bacteroides
coprophilus DSM 18228]
gi|224020291|gb|EEF78289.1| hypothetical protein BACCOPRO_03816 [Bacteroides coprophilus DSM
18228]
Length = 373
Score = 64.7 bits (156), Expect = 8e-08, Method: Compositional matrix adjust.
Identities = 96/374 (25%), Positives = 147/374 (39%), Gaps = 99/374 (26%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMI----LVDDVGGILP---------AINHSPWNGLT 92
+ + + + +RL SLDV RG + ++ L + P + H PW+G+
Sbjct: 5 RMKKEVKSQRLQSLDVLRGFDMFFIMGGGALFAGLATCCPIPFFQAIARQMEHVPWHGVA 64
Query: 93 LADFVMPFFLFIVGVSLALTY-KNFPCKVVAT---RKAILRALNLFLLGIFLQGGFFHGI 148
D + P FLFI G+S + K C + + RK I R L L LLG I
Sbjct: 65 FEDMIFPLFLFIAGISFPYSLEKQKACGMSSAAIYRKVIRRGLVLVLLGC---------I 115
Query: 149 NNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALV 208
N D A +R+ VL RI ++++ AAL + +V +++ + VVAL+
Sbjct: 116 YNGLLDFDFAHLRYASVLGRIGLSWMFAALLFL------NVRTRVRM-------GVVALL 162
Query: 209 LTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGI 268
+ LL FP + P+ + GS VG +DR L
Sbjct: 163 FIGYWALLAC----------FPAPDGNGDPF------SMEGSL------VGYVDRMFLPG 200
Query: 269 QHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGH 328
Q LY DPEGLL + A T +G+ G
Sbjct: 201 Q-LY-----------------------------LGIHDPEGLLGIIPAVGTASLGMLTGE 230
Query: 329 LIVHFKDHRDRMLNWIILSSC----LIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVL 384
I K L ++L +C L+ +G D V +NK L++ S+ CL G S +L
Sbjct: 231 WI---KREGLPELRKVVLLACAGVMLVVVGWIWDLV-FPINKNLWTSSFACLVGGISMLL 286
Query: 385 LAGIYFMVRYISSH 398
A Y+++ H
Sbjct: 287 FALFYYLIDVRHCH 300
>gi|383753678|ref|YP_005432581.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
lactilytica TAM6421]
gi|381365730|dbj|BAL82558.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
lactilytica TAM6421]
Length = 384
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 49/153 (32%), Positives = 80/153 (52%), Gaps = 16/153 (10%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGI---LPAINHSPWNGLTLADFVMPFFLFIVGVS 108
+RRL ++D+FRGL +A+M+LV+ + P + H+PW GLT+AD P F+FI+GVS
Sbjct: 7 NKRRLAAIDIFRGLAIAIMLLVNALPNFEQAWPLLVHAPWAGLTIADLAFPGFVFIMGVS 66
Query: 109 LALTY---------KNFPCKVVATRKAILRALNLFLLGIFLQGGF--FHGINNLKYGVDI 157
+L + + F + + IL L + LQ F G + +K V+
Sbjct: 67 ASLWFPKHEQDGSGEKFCIILKRSLLLILLGFFLCQFPLVLQHVFQPEPGGSLIKDIVEH 126
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVS 190
+I GVLQR+ + Y + WL+ + H++
Sbjct: 127 GRIP--GVLQRLGLVYFFGMIITWWLRSEKHIA 157
>gi|146292182|ref|YP_001182606.1| hypothetical protein Sputcn32_1079 [Shewanella putrefaciens CN-32]
gi|145563872|gb|ABP74807.1| conserved hypothetical protein [Shewanella putrefaciens CN-32]
Length = 384
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 96/377 (25%), Positives = 151/377 (40%), Gaps = 98/377 (25%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDV-GGIL------------PAINHS 86
+ S + + RL+SLD RG + ++ + + GG+L ++HS
Sbjct: 3 TTQTSAIVPGKANKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHS 62
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFF 145
W+G D + P F+F+ GV+L L+ K ++ R + R + L + L +
Sbjct: 63 QWHGFHFYDLIFPLFIFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYN 122
Query: 146 HGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVV 205
HG D +IR+ VL RIA A+ AAL +W H S +
Sbjct: 123 HGWGT-GAPADPEKIRYASVLGRIAFAWFFAALL-VW-----HTSLRTQ----------- 164
Query: 206 ALVLTTLYLLLLYG---LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMID 262
V+ L +LL YG L++P FP G G P + +D
Sbjct: 165 --VIVALGILLGYGAMQLWLP-----FP--------------SGQAGVFSPTQSINAYVD 203
Query: 263 RKIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCL 321
+L G+ + R P DP+GLLS++ A V L
Sbjct: 204 SILLPGVSYQGRTP------------------------------DPQGLLSTIPAVVNAL 233
Query: 322 IGLHFGHLIVHFKDHRDRMLNWIIL------SSCLIGLGLSLDFVGMHLNKALYSLSYTC 375
G+ G+ IV K H W+ + + LG LD V + +NK L++ S+
Sbjct: 234 AGVFVGYFIV--KSHPQG--EWVKVGLLATAGGAWLALGWLLDGV-IPVNKELWTSSFVL 288
Query: 376 LTAGASGVLLAGIYFMV 392
+T+G S +LLA Y +V
Sbjct: 289 VTSGWSMILLALFYALV 305
>gi|260684359|ref|YP_003215644.1| hypothetical protein CD196_2626 [Clostridium difficile CD196]
gi|260688018|ref|YP_003219152.1| hypothetical protein CDR20291_2673 [Clostridium difficile R20291]
gi|260210522|emb|CBA65033.1| putative membrane protein [Clostridium difficile CD196]
gi|260214035|emb|CBE06182.1| putative membrane protein [Clostridium difficile R20291]
Length = 352
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 71/315 (22%), Positives = 130/315 (41%), Gaps = 77/315 (24%)
Query: 81 PAINHSPWNGLTLADFVMPFFLFIVGVSLALT----YKNFPCKVVATRKAILRALNLFLL 136
P + H+ W+G+TLADF PFF+ +GV++ ++ KN + R++ L L
Sbjct: 15 PQLRHAVWHGVTLADFAFPFFVISLGVTIPISINSKLKNNKSTLSIILSIFKRSILLILF 74
Query: 137 GIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLF 196
G FL N D+ +R +GVLQR+ + Y V +L + LK KL++
Sbjct: 75 GFFL---------NYLGNPDLNSVRILGVLQRMGLVYFVTSLVYLLLK-------KLNV- 117
Query: 197 RKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN 256
G + + ++ G Y+ Y F +E S +
Sbjct: 118 ----GSTATIITFLCIATFIIVGYYILAKPYGFELEGSLAQ------------------- 154
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
++D H ++ +Y + F+P+G L+S++A
Sbjct: 155 ---LVDL------HFFKGHLY------------------------KPEFEPDGFLTSIVA 181
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCL 376
+ ++G G +++ + I++ S ++ +G + NK L+S S+ L
Sbjct: 182 ISSGMLGCTMGCVLLKENIGEYKKFFKILVMSIILLIGAFIFNQYFPFNKRLWSSSFVLL 241
Query: 377 TAGASGVLLAGIYFM 391
AG+ G+LL+ YF+
Sbjct: 242 MAGSYGMLLSIFYFI 256
>gi|359686399|ref|ZP_09256400.1| hypothetical protein Lsan2_17893 [Leptospira santarosai str.
2000030832]
Length = 329
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 91/316 (28%), Positives = 135/316 (42%), Gaps = 96/316 (30%)
Query: 96 FVMPFFLFIVGVSL--ALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKY 153
F +PFFLF VG S+ +L KN + +R+ NL LLG+FL F G
Sbjct: 9 FGVPFFLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLN---FFG------ 59
Query: 154 GVDIAQIRWMGVLQRIAIAY-LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
A++R GVLQRI Y +VA+LC + K+ +F V ++L
Sbjct: 60 EWSFAELRIPGVLQRIGFVYWVVASLCLV------FPGKKILVFS-------VPILLIHT 106
Query: 213 YLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLY 272
++L Q P E+ S G A IDR I G +HL+
Sbjct: 107 WILT---------QIALPGESVVSL------------EQGKDIGA--WIDRTIFGEKHLW 143
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
R +S+T +DPEG LS V + VT L G+ G I+
Sbjct: 144 R---FSKT------------------------WDPEGFLSGVASVVTTLFGVLCG-FILF 175
Query: 333 FKDHRDRMLNWIILSSCLIGLGLSLDFVGMH------LNKALYSLSYTCLTAGASGVLLA 386
++ ++++L GLG+ FVG+ +NK+L++ SY+ TAG S
Sbjct: 176 LRERKNKIL----------GLGILFSFVGLLWDLSLPMNKSLWTGSYSVYTAGLS---FL 222
Query: 387 GIYFMVRYISSHLMLK 402
I+F Y+SS ++ K
Sbjct: 223 SIWFF-EYLSSLIISK 237
>gi|386312853|ref|YP_006009018.1| N-acetylglucosamine related transporter, NagX [Shewanella
putrefaciens 200]
gi|319425478|gb|ADV53552.1| N-acetylglucosamine related transporter, NagX [Shewanella
putrefaciens 200]
Length = 384
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 101/366 (27%), Positives = 149/366 (40%), Gaps = 106/366 (28%)
Query: 55 RLISLDVFRGLTVALMILVDDV-GGIL------------PAINHSPWNGLTLADFVMPFF 101
RL+SLD RG + ++ + + GG+L ++HS W+G D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77
Query: 102 LFIVGVSLALTYK---NFPCK--VVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
+F+ GV+L L+ K P K + R I R L LLGI G+ G D
Sbjct: 78 IFLSGVALGLSPKRLDKLPMKDRLPVYRHGIKRLFLLLLLGILYNHGWGTGAP-----AD 132
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
++R+ VL RIA A+ AAL +W H S + V+ L +LL
Sbjct: 133 PEKVRYASVLGRIAFAWFFAALL-VW-----HTSLRTQ-------------VIVALGILL 173
Query: 217 LYG---LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQHLY 272
YG L++P FP G G P + +D +L G+ +
Sbjct: 174 GYGAMQLWLP-----FP--------------SGQAGVLSPTQSINAYVDSILLPGVSYQG 214
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
R P DPEGLLS++ A V L G+ G+ IV
Sbjct: 215 RTP------------------------------DPEGLLSTIPAVVNALAGVFVGYFIV- 243
Query: 333 FKDHRDRMLNWIIL------SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
K H W+ + + LG LD V + +NK L++ S+ +T+G S +LLA
Sbjct: 244 -KSHPQG--EWVKVGLLATAGGAWLALGWLLDGV-IPVNKELWTSSFVLVTSGWSMILLA 299
Query: 387 GIYFMV 392
Y +V
Sbjct: 300 LFYALV 305
>gi|409203840|ref|ZP_11232043.1| hypothetical protein PflaJ_21058 [Pseudoalteromonas flavipulchra
JG1]
Length = 377
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 92/367 (25%), Positives = 142/367 (38%), Gaps = 88/367 (23%)
Query: 50 QHQQRRLISLDVFRGLTV-----------ALMILVDDVG--GILPAINHSPWNGLTLADF 96
++ +RL SLD RG+ + AL +L G HSPW+G T D
Sbjct: 4 NNKPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKAFEAHTLHSPWHGFTFYDL 63
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFL-----LGIFLQGGFFHGINNL 151
+ P F+F+ GV++ L+ K RK+ L G+ G+ GI
Sbjct: 64 IFPLFIFLSGVAMGLSPKRIDHLPFNERKSFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
+D +R+ VL RIA A+ AL +W H S + + V + + T
Sbjct: 122 ---MDPDGVRYASVLGRIAFAWFFCALL-VW-----HTSLRTLAY--------VGVGILT 164
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGP-ACNAVGMIDRKILGIQH 270
Y LLL + VP G G + N VG + I
Sbjct: 165 FYWLLLCFIPVPG------------------------GQAGDLSANGVGSWNAFI----D 196
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
Y P S+ P DPEG+LSS+ A V + G+ G I
Sbjct: 197 TYLLP---------------------GTSYQNRPVDPEGVLSSLPAIVNAIAGVFAGRAI 235
Query: 331 VHFKDHRDRMLNWIILSSCLIGLGLSLDF-VGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
+ + + + I+ S ++ L L + + +NK L++ S+ +T G S +LLA Y
Sbjct: 236 ANAQTQGEWKIVGILAGSGVLALALGWLWDMQFPVNKELWTSSFVLVTVGWSAILLAVFY 295
Query: 390 FMVRYIS 396
+V +S
Sbjct: 296 AIVDVLS 302
>gi|126173329|ref|YP_001049478.1| hypothetical protein Sbal_1087 [Shewanella baltica OS155]
gi|386340088|ref|YP_006036454.1| hypothetical protein [Shewanella baltica OS117]
gi|125996534|gb|ABN60609.1| conserved hypothetical protein [Shewanella baltica OS155]
gi|334862489|gb|AEH12960.1| Protein of unknown function DUF2261, transmembrane [Shewanella
baltica OS117]
Length = 387
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 95/362 (26%), Positives = 148/362 (40%), Gaps = 95/362 (26%)
Query: 55 RLISLDVFRGLTVALMILVDDV-GGIL---------------PAINHSPWNGLTLADFVM 98
RL+SLD RG + ++ + + GG+L ++HS W+G D +
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWAGWQWGDEQMHHSQWHGFHFYDLIF 77
Query: 99 PFFLFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFFHGINNLKYGVDI 157
P F+F+ GV+L L+ K + R + R + L + L + HG D
Sbjct: 78 PLFIFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADP 136
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
+IR+ VL RIA A+ AAL +W H S + ++ L +LL
Sbjct: 137 EKIRYASVLGRIAFAWFFAALL-VW-----HTSLRTQ-------------IIVALGILLG 177
Query: 218 YG---LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQHLYR 273
YG L++P FP G G P + +D +L G+ + R
Sbjct: 178 YGAMQLWLP-----FP--------------GGQAGVLSPTESINAYVDSILLPGVSYQGR 218
Query: 274 KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV-- 331
P DPEGLLS++ A V L G+ GH IV
Sbjct: 219 TP------------------------------DPEGLLSTIPAIVNALAGVFVGHFIVKS 248
Query: 332 HFKDHRDRMLNWIILSSCL-IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
H K ++ + + C+ + G LD V + +NK L++ S+ +T+G S +LLA Y
Sbjct: 249 HPKGEWAKV-GLLAAAGCVCLTFGWLLDLV-IPVNKELWTSSFVLVTSGWSMILLALFYA 306
Query: 391 MV 392
+V
Sbjct: 307 LV 308
>gi|332664355|ref|YP_004447143.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332333169|gb|AEE50270.1| Protein of unknown function DUF2261, transmembrane
[Haliscomenobacter hydrossis DSM 1100]
Length = 438
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 43/95 (45%), Positives = 59/95 (62%), Gaps = 7/95 (7%)
Query: 53 QRRLISLDVFRGLTVALMILVD---DVGGILPAINHSPWNGLTLADFVMPFFLFIVGVS- 108
RL+SLDV RGLT+A MILV+ D G + + H+ W+G T D+V PFFLF+VGV+
Sbjct: 4 SNRLLSLDVMRGLTIAGMILVNNPGDWGNVYGPLLHADWHGCTPTDWVFPFFLFMVGVAI 63
Query: 109 -LALTYKNFPCKVVAT--RKAILRALNLFLLGIFL 140
LAL + + + RK I R+L + LG+FL
Sbjct: 64 PLALGKRKDEGEDLRKIYRKIISRSLIIIGLGLFL 98
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 47/88 (53%), Gaps = 1/88 (1%)
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHL 364
+DPEGLLS++ A T + G+ G + K +++ + + + L LGL + L
Sbjct: 258 WDPEGLLSTIPAIGTGIAGMLAGEWVRSEKSDYEKVSGLLAVGALLFALGLIWNEF-FPL 316
Query: 365 NKALYSLSYTCLTAGASGVLLAGIYFMV 392
NK +++ SY AG S + L IY++V
Sbjct: 317 NKKIWTSSYVVYMAGISLLFLGTIYWLV 344
>gi|284041428|ref|YP_003391358.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
DSM 74]
gi|283820721|gb|ADB42559.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
DSM 74]
Length = 381
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 90/357 (25%), Positives = 141/357 (39%), Gaps = 92/357 (25%)
Query: 54 RRLISLDVFRGLTVALMILVDDV-----------GGILPA--INHSPWNGLTLADFVMPF 100
+RL+SLD RG + ++ +++ G IL A +H WNG D + P
Sbjct: 19 KRLLSLDTLRGFDMFWIMGGEEIFHVLAKTTGWAGAILLADQFSHPAWNGFRAYDLIFPL 78
Query: 101 FLFIVGVSLALTY----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
F+F+ GVS + K RK I R L L +LGI G F+ +
Sbjct: 79 FMFMAGVSTPFSVGSRLDQGTDKAKIARKIISRGLILVVLGIIYNNGLFNRV-------- 130
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+R+ VL RI +A + A L ++ + + + W V L+L L++
Sbjct: 131 FEDMRFPSVLGRIGLAGMFAQLIYLYFR------------PRAQYIWFVGLLLGYWALMM 178
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
L + VP CG G CN IDR ++ HLY K I
Sbjct: 179 L--VPVPG--------------------CGA-GVLTMECNLASFIDRMLVP-GHLY-KTI 213
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI-VHFKD 335
+ DPEGL S++ A L+G+ G + H +
Sbjct: 214 H----------------------------DPEGLFSTLPAIDNTLLGIFAGTFLRTHGRT 245
Query: 336 HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ + + + LG DFV +NK L++ S+ +T G S +LLA Y+++
Sbjct: 246 GNQKTALLLGAGAAFVLLGWLWDFV-FPINKNLWTSSFVLVTGGLSLLLLAVFYWVI 301
>gi|160874301|ref|YP_001553617.1| hypothetical protein Sbal195_1181 [Shewanella baltica OS195]
gi|378707545|ref|YP_005272439.1| hypothetical protein [Shewanella baltica OS678]
gi|160859823|gb|ABX48357.1| conserved hypothetical protein [Shewanella baltica OS195]
gi|315266534|gb|ADT93387.1| hypothetical protein Sbal678_1209 [Shewanella baltica OS678]
Length = 384
Score = 63.5 bits (153), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 94/359 (26%), Positives = 147/359 (40%), Gaps = 92/359 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDV-GGIL------------PAINHSPWNGLTLADFVMPFF 101
RL+SLD RG + ++ + + GG+L ++HS W+G D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77
Query: 102 LFIVGVSLALTYKNFPCKVVATRKAILR-ALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
+F+ GV+L L+ K + R + R + L + L + HG D +I
Sbjct: 78 IFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGT-GAPADPEKI 136
Query: 161 RWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG- 219
R+ VL RIA A+ AAL +W H S + ++ L +LL YG
Sbjct: 137 RYASVLGRIAFAWFFAALL-VW-----HTSLRTQ-------------IIVALGILLGYGA 177
Query: 220 --LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQHLYRKPI 276
L++P FP G P + +D +L G+ + R P
Sbjct: 178 MQLWLP-----FP--------------GGQASVLSPTESINAYVDSILLPGVSYQGRTP- 217
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV--HFK 334
DPEGLLS++ A V L G+ GH IV H K
Sbjct: 218 -----------------------------DPEGLLSTIPAIVNALAGVFVGHFIVKSHPK 248
Query: 335 DHRDRMLNWIILSSCL-IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
++ + + C+ + G LD V + +NK L++ S+ +T+G S +LLA Y +V
Sbjct: 249 GEWAKV-GLLAAAGCVCLAFGWLLDLV-IPVNKELWTSSFVLVTSGWSMILLAVFYALV 305
>gi|291514403|emb|CBK63613.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
Length = 376
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 89/375 (23%), Positives = 137/375 (36%), Gaps = 108/375 (28%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAI----------------NHSPW 88
+ T+P QR L+SLD RG M+ + G++ A+ H+ W
Sbjct: 4 QPTQPAASQR-LLSLDALRGFD---MLFIMGFAGLVTALCKLCPGEFSDWMTAQMGHADW 59
Query: 89 NGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGGF 144
NG D + P FLFI G+S + K ++ R K I R L L LG F+ G
Sbjct: 60 NGFFHHDTIFPLFLFIAGISFPFSLAKQREKGMSERSIYLKVIRRGLTLVALG-FVYSGL 118
Query: 145 FHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKY--RGH 202
F +D A +R VL RI +A++ AAL LF + R
Sbjct: 119 FK--------LDFATLRLPSVLGRIGLAWMFAAL----------------LFVNFNVRTR 154
Query: 203 WVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPAC---NAVG 259
V+A + Y LLL + PD G GP N VG
Sbjct: 155 AVIAAAILLGYGLLLQFVAAPD-----------------------AGGAGPLTLEGNIVG 191
Query: 260 MIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVT 319
+DR ++ L + FDPEGLLS++ A VT
Sbjct: 192 YVDRIVMPSHLLGGR-----------------------------GFDPEGLLSTLPAIVT 222
Query: 320 CLIGLHFGHLIVHFKDHR--DRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLT 377
++G+ G + ++ R W+ + + +NK L++ ++
Sbjct: 223 AMLGMFTGEFVRRSEERTSGSRKTLWMAAGAVALLALALCLDPLQPINKKLWTPAFVFAA 282
Query: 378 AGASGVLLAGIYFMV 392
S + A Y+++
Sbjct: 283 GAYSLGMFALFYYII 297
>gi|171914858|ref|ZP_02930328.1| hypothetical protein VspiD_26815 [Verrucomicrobium spinosum DSM
4136]
Length = 379
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 78/323 (24%), Positives = 129/323 (39%), Gaps = 70/323 (21%)
Query: 83 INHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR---KAILRALNLFLLGIF 139
+ H W G D + P FLF+VGVS+ L+ +V +R + + R+ LF +G+F
Sbjct: 30 LQHVEWEGFRFYDAIFPLFLFLVGVSIVLSVDRMVARVGRSRALARIVRRSALLFAVGVF 89
Query: 140 LQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKY 199
GG +++ GVL RIA+ YLVAA + L G V + +
Sbjct: 90 YYGGIARPWPDVQLS---------GVLPRIALCYLVAATLYVLLPRKGIVIATAACL--- 137
Query: 200 RGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVG 259
G+W + L+VP FP +P G R
Sbjct: 138 LGYWALM-------------LFVP-----FP-NVDIKTP------AGARKQ--------- 163
Query: 260 MIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAP-------FDPEGLLS 312
++ R+ +++ + + G C P + EGLLS
Sbjct: 164 --------VEAKTREELFAGAASTTKGTFKEGLNLAHYVDACWLPGRKRNLYYSNEGLLS 215
Query: 313 SVMATVTCLIGLHFGHLIVHFKDHRDRMLNWII---LSSCLIGLGLSLDFVGMHLNKALY 369
++ A + L G+ G ++ H + + W++ L+ +IGL L+F + K L+
Sbjct: 216 TIPAVASTLFGVLAGWVLTHGRRSGRWKVGWLVGSGLAGVVIGLLWGLEF---PVIKRLW 272
Query: 370 SLSYTCLTAGASGVLLAGIYFMV 392
+ S+ + AG S VLL Y +V
Sbjct: 273 TSSFCMVAAGFSAVLLGLFYLVV 295
>gi|424665544|ref|ZP_18102580.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
616]
gi|404574617|gb|EKA79366.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
616]
Length = 375
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 83/357 (23%), Positives = 133/357 (37%), Gaps = 85/357 (23%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWALAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALTYKNF---PCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ ++ F P K RK + R + LF+ G+ +QG G+N KY +
Sbjct: 71 LFMTGASMPFSFSKFKDDPDKGPIYRKILKRFILLFIFGMIVQGNLL-GLNP-KYLYLYS 128
Query: 159 QIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLY 218
LQ IA YL+AA+ ++ H S +
Sbjct: 129 N-----TLQAIATGYLIAAIIQL------HCSFR-------------------------- 151
Query: 219 GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYS 278
WQ V + G P N +DR ILG +R +Y
Sbjct: 152 ------WQLIVTVLLLLI----YWAPMTFLGDFTPERNFAEKVDRLILG---RFRDGVYW 198
Query: 279 RTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
SP Y + + SS+ T ++G G ++ KD+R
Sbjct: 199 NEDGSWSFSPHY---------------NYTWIWSSLTFGATVMLGAFAGKIMKAGKDNRR 243
Query: 339 RMLNWIILSS-CLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
+++ ++++ L+ L + M + K L++ S T + G +L+ Y+ + Y
Sbjct: 244 QVVRTLLITGISLVAFALIWS-LQMPIIKRLWTSSMTLFSGGLCFLLMGTFYYWIDY 299
>gi|223936396|ref|ZP_03628308.1| conserved hypothetical protein [bacterium Ellin514]
gi|223894914|gb|EEF61363.1| conserved hypothetical protein [bacterium Ellin514]
Length = 383
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 48/166 (28%), Positives = 71/166 (42%), Gaps = 28/166 (16%)
Query: 24 SAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMI----------LV 73
SAP + P L + + S T Q R+IS+D RG + ++ +
Sbjct: 2 SAPTST--SEAPALS-NQAGSTATLNQKANTRIISIDALRGFDMFWIMGGDQLVRSFQKI 58
Query: 74 DDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF---PCKVVATRKAI 127
DD + + H W G D + P F+F+ GVS+ + +V A ++
Sbjct: 59 DDSAPTHALANQMEHCEWAGFHFYDLIFPLFVFLAGVSIVFSITRLIEHSGRVAAVKRIA 118
Query: 128 LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAY 173
R++ LFL GIF GG +G N I GVL RIA+AY
Sbjct: 119 FRSVILFLFGIFYMGGVSNGFKN---------IYLAGVLHRIAVAY 155
>gi|410099160|ref|ZP_11294132.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
CL02T12C30]
gi|409219182|gb|EKN12145.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
CL02T12C30]
Length = 371
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 87/366 (23%), Positives = 144/366 (39%), Gaps = 96/366 (26%)
Query: 48 RPQHQQRRLISLDVFRG-----------LTVALMILVDD--VGGILPAINHSPWN-GLTL 93
Q Q +RL+SLD RG L VAL LV I ++H+ W G T
Sbjct: 2 EKQKQSQRLLSLDALRGFDMFFIMGGGSLFVALATLVPTPFFESIAAQMSHAKWGAGFTF 61
Query: 94 ADFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGIN 149
D + P FLFI G+S + + + ++ +K I R + L +LG ++G+
Sbjct: 62 EDIIFPLFLFIAGISFPFSLEKQRERGMSEAAIYKKIIRRGITLVVLGF-----VYNGLL 116
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
NL + R+ VL RI + ++ AL + + V W+VA +L
Sbjct: 117 NLNFETQ----RYASVLARIGLGWMFGALIFVNTRTITRV-------------WIVAAIL 159
Query: 210 TTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQ 269
+LLL + PD E + N+ C IDR +L
Sbjct: 160 IGYWLLLF--IPAPDGN---GAELFTREG---NLAC--------------YIDRLLL--- 194
Query: 270 HLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHL 329
++ +Y P EG+LS++ A T L+G+ G
Sbjct: 195 ------------PGRLHGGNYDP---------------EGILSTLPAIGTALLGMFTGEF 227
Query: 330 IVHFKD---HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+ ++ +++ + + CL+ +GL +NK L++ S+ C G S +L A
Sbjct: 228 VKLRREGLTETKKVVYMLAVGGCLLVIGLLWGLF-FPINKYLWTSSFVCTVGGISAILFA 286
Query: 387 GIYFMV 392
Y++V
Sbjct: 287 VFYYIV 292
>gi|408673387|ref|YP_006873135.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
gi|387855011|gb|AFK03108.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
Length = 423
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 52/175 (29%), Positives = 78/175 (44%), Gaps = 48/175 (27%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVS 108
++RL S+DVFRG+T+ LM +V++ G I + H+ W+G T D V PFFLFIVG+S
Sbjct: 1 MKQRLTSIDVFRGMTIMLMTIVNNPGDWSHIYAPLEHAEWHGCTPTDLVFPFFLFIVGIS 60
Query: 109 LALT--YKNFPCKV---VATRK------------------AILRALNLFLLGIFLQG--- 142
L+ K F + TR L + L L+ + L G
Sbjct: 61 TVLSSPVKRFDSNTFERIITRALRIFLLGLFLNFFSKIHLGTLEGVPLMLIRLVLTGIAT 120
Query: 143 -----------GFFHGINNLKYGV--------DIAQIRWMGVLQRIAIAYLVAAL 178
F+ + + + D A +R GVLQRIA+ YL+ ++
Sbjct: 121 VLLLGDFDKKKQFYAAVGLFVFMISLCFSGIEDFASVRIPGVLQRIAMVYLIVSV 175
Score = 40.8 bits (94), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 58/105 (55%), Gaps = 10/105 (9%)
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVH-FKDHRDRMLNWIILSSCLIGLGLSL---DFV 360
+DPEG+LS++ A T L G+ G L+ + F ++ + +LS+ +IG+ + D+
Sbjct: 239 WDPEGILSTIPAIGTALAGVFTGKLLTNDFPKNKKAIY---LLSAGVIGVMIGFLWNDY- 294
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
+NKAL++ SY AG + ++L +YF++ + KPF
Sbjct: 295 -FPINKALWTSSYVLYVAGWALLVLGVLYFIIDVLGFE-KWTKPF 337
>gi|291514624|emb|CBK63834.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
Length = 352
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 130/320 (40%), Gaps = 86/320 (26%)
Query: 83 INHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL-----RALNLFLLG 137
+ H+ WNGLT+ D + P FLFI GV+ + + RK IL R L L LLG
Sbjct: 30 MQHAAWNGLTIQDTIFPLFLFIAGVAFPFSLAKQRARGFG-RKRILDRIFRRGLILALLG 88
Query: 138 IFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFR 197
+ + N + ++ + +R VL RI +A++ AAL L ++
Sbjct: 89 M---------VYNGLFELNFSSLRIASVLGRIGLAWMFAAL--------------LCVYC 125
Query: 198 KYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNA 257
R VA ++ Y LLL + PD PV S V G C A
Sbjct: 126 SVRTRIAVAGIILIGYSLLLGLVVAPD----APVGADPLS---------VEG-----CLA 167
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
G IDR+ L LY FDPEG+LS++ A
Sbjct: 168 -GWIDRQYLPGHILY------------------------------GAFDPEGILSTLPAV 196
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGL-----GLSLDFVGMHLNKALYSLS 372
V+ L G+ G ++ R +W + L GL + + M +NK L+S S
Sbjct: 197 VSALFGMFTGEFLL--DGRRGLSGSWKAFYMAVAALAITTAGLCWNLI-MPVNKNLWSSS 253
Query: 373 YTCLTAGASGVLLAGIYFMV 392
+TC+ +G S + A Y+++
Sbjct: 254 FTCVVSGYSLGMTALFYYLI 273
>gi|392308231|ref|ZP_10270765.1| hypothetical protein PcitN1_06167 [Pseudoalteromonas citrea NCIMB
1889]
Length = 375
Score = 62.8 bits (151), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 94/364 (25%), Positives = 142/364 (39%), Gaps = 92/364 (25%)
Query: 50 QHQQRRLISLDVFRGLTV-----------ALMILVDDVG--GILPAINHSPWNGLTLADF 96
+ ++RL SLD RG+ + AL +L G HS W+G T D
Sbjct: 4 KTDKKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKAFEAHTVHSAWHGFTFYDL 63
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATR-----KAILRALNLFLLGIFLQGGFFHGINNL 151
+ P F+F+ GV++ L+ K + R KA+ R L LG+ G+ GI
Sbjct: 64 IFPLFIFLSGVAMGLSPKRIDHLPFSERRGYYNKALKRLFLLSALGVLYNHGWGTGIP-- 121
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
V + +IR+ VL RIAIA+ L +W H S + F + VL
Sbjct: 122 ---VALGEIRYASVLGRIAIAWFFCMLL-VW-----HTSLRTLGF-------IGGGVLIG 165
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQH 270
+LLL ++P V G G A + +D +L GI +
Sbjct: 166 YWLLL---CFIP-------------------VPGGTAGDLSAAGSWNAWVDAMLLPGISY 203
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
R P+D PEG+LSS+ A V + G+ G LI
Sbjct: 204 QNR--------------------PVD----------PEGVLSSLPAIVNAIAGVFAGRLI 233
Query: 331 VHFK--DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGI 388
+ + + +GLG D + +NK L++ S+ +T G S +LLA
Sbjct: 234 AQAQHVGQWHTVARLFVFGFVALGLGWLWD-IHFPVNKELWTSSFVLVTVGWSAILLAAF 292
Query: 389 YFMV 392
Y +V
Sbjct: 293 YALV 296
>gi|392402534|ref|YP_006439146.1| Protein of unknown function DUF2261, transmembrane [Turneriella
parva DSM 21527]
gi|390610488|gb|AFM11640.1| Protein of unknown function DUF2261, transmembrane [Turneriella
parva DSM 21527]
Length = 396
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 70/146 (47%), Gaps = 24/146 (16%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
R SLD+ RGLT+ALMI+V++ G + + H+ W+G AD V P FLF+ G + AL
Sbjct: 2 RNFSLDLLRGLTIALMIIVNNPGDWKAMFAVLRHAEWHGFLGADIVFPLFLFVAGYAAAL 61
Query: 112 TYKNF---------PCKVVATRKA----------ILRALNLFLLGIFLQGGFFHGINNLK 152
C T + + RA LFL+G+FL + + +
Sbjct: 62 KIDRLYGPTTAGGPHCASALTLEERELPAYYLPLMRRAAILFLIGLFLNAWPLGLLPDTE 121
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAAL 178
+ +R +GVLQRIAI LV +
Sbjct: 122 F--SFGHLRVLGVLQRIAICVLVGGV 145
>gi|355694569|gb|AER99714.1| heparan-alpha-glucosaminide N-acetyltransferase [Mustela putorius
furo]
Length = 296
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 67/245 (27%), Positives = 106/245 (43%), Gaps = 49/245 (20%)
Query: 159 QIRWMGVLQRIAIAYLVAALCEIWLKGD--GHVSSKLSLFRKYR-----GHWVVALVLTT 211
++R GVLQR+ + Y V A+ E+ +S+ S F W+ L+L +
Sbjct: 4 KVRIPGVLQRLGVTYFVVAVLELIFAKPVPESCASERSCFSLRDIIFSWPQWLFILMLES 63
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILG 267
++L L + L VP T P G G G P C A G IDR +LG
Sbjct: 64 IWLALTFFLPVPG------CPTGYLGP-------GGIGDLGKYPNCTGGAAGYIDRLLLG 110
Query: 268 IQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFG 327
H+Y+ P + Q +DPEG+L S+ + V +G+ G
Sbjct: 111 DDHIYQHPSSAVLYHT------------------QVAYDPEGILGSINSIVMAFLGVQAG 152
Query: 328 HLIVHFKDHRDRMLNWIILSSCLIGL------GLSLDFVGMHLNKALYSLSY-TCLTAGA 380
+++++KD +L SC +GL S + + +NK L+S+SY T L++ A
Sbjct: 153 KILLYYKDQTKDILIRFTAWSCFLGLISVALTKFSENEGFIPINKNLWSVSYVTTLSSFA 212
Query: 381 SGVLL 385
+LL
Sbjct: 213 FFILL 217
>gi|393788826|ref|ZP_10376952.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
CL02T12C05]
gi|392653932|gb|EIY47582.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
CL02T12C05]
Length = 376
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 79/365 (21%), Positives = 138/365 (37%), Gaps = 88/365 (24%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLAD 95
P++ RL SLD+ RG + L++ V +L +H W G D
Sbjct: 5 PENDFTRLASLDILRGFDLFLLVFFQPVFVALARQLNLPFLDEVLYQFDHEVWEGFRFWD 64
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVV----ATRKAILRALNLFLLGIFLQGGFFHGINNL 151
VMP FLF+ G S+ + + V R+ + R + LF+ G+ +QG
Sbjct: 65 LVMPLFLFMTGASMPFSLSKYKTASVDYWPVYRRILKRVILLFIFGMIVQGNLL------ 118
Query: 152 KYGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
G D I + LQ IA+ Y +AA+ ++ H S
Sbjct: 119 --GFDSKHIYFYSNTLQSIAVGYFIAAVIQL------HFS-------------------- 150
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQH 270
+G WQ + + + + G PA N +DR +LG
Sbjct: 151 -------FG-----WQ----IIITLLLLLAYWIPMTFLGDFTPAGNFAEQVDRWVLG--- 191
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWC-QAPFDPEGLLSSVMATVTCLIGLHFGHL 329
R + + D +W + ++ + SS+ VT ++G G +
Sbjct: 192 --------RFRDGVFWNED--------GTWSFSSQYNYTWIWSSLTFGVTVMLGAFAGKI 235
Query: 330 IVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
+ K +R +++ ++L L+ L + M + K L++ S T L+ G +L+A Y
Sbjct: 236 MKEGKANRKKVVQILLLIGLLLIGLAELWSLQMPVIKRLWTASMTLLSGGYCFLLMALFY 295
Query: 390 FMVRY 394
+ + Y
Sbjct: 296 YWIDY 300
>gi|15806610|ref|NP_295325.1| hypothetical protein DR_1602 [Deinococcus radiodurans R1]
gi|6459373|gb|AAF11168.1|AE002004_7 hypothetical protein DR_1602 [Deinococcus radiodurans R1]
Length = 388
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 69/131 (52%), Gaps = 18/131 (13%)
Query: 55 RLISLDVFRGLTVALMILVDDV--GGILP-AINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL +LD +RGLTV LM+LV++V G P ++H+ + GLTL D V P+FLF G +L
Sbjct: 33 RLTALDAWRGLTVLLMLLVNNVALGDSTPRQLSHAHFGGLTLTDLVFPWFLFCAGAALPF 92
Query: 112 TYKNFPCKVVAT----RKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
+ V R+ + RA L+L+G F+ H L G +GVLQ
Sbjct: 93 SAAAMNKAGVTGWPLYRRLLERAALLYLMGAFVTSVTSH---RLTLG--------LGVLQ 141
Query: 168 RIAIAYLVAAL 178
IA+A AAL
Sbjct: 142 LIALASFFAAL 152
>gi|195167204|ref|XP_002024424.1| GL15027 [Drosophila persimilis]
gi|194107797|gb|EDW29840.1| GL15027 [Drosophila persimilis]
Length = 493
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 28/50 (56%), Positives = 34/50 (68%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFF 101
Q++RL SLD FRGL++ LMI V+ GG I H+ WNGL LAD V PF
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPFL 230
>gi|404404699|ref|ZP_10996283.1| hypothetical protein AJC13_04673 [Alistipes sp. JC136]
Length = 376
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 80/313 (25%), Positives = 118/313 (37%), Gaps = 99/313 (31%)
Query: 44 SKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAI----------------NHSP 87
+K T PQ RL+SLD RG M + G++ A+ H+
Sbjct: 6 TKPTAPQ----RLLSLDALRGFD---MFFIMGFAGLVVALCKLRPGSFADWMSAQMGHAA 58
Query: 88 WNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGG 143
W+G D + P FLFI G+S + K V R K I R L L LG+
Sbjct: 59 WDGFFHHDTIFPLFLFIAGISFPFSLAKQREKGVRERSIYTKVIRRGLTLVALGL----- 113
Query: 144 FFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHW 203
++G+ NL D A +R VL RI +A++ AA+ I + ++++L
Sbjct: 114 VYNGLFNL----DFATLRLPSVLGRIGLAWMFAAMLFIRFG----IRTRIAL-------- 157
Query: 204 VVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDR 263
A++L LLL + + G A N VG IDR
Sbjct: 158 -AAVILVGYGLLLQF---------------------VAAPDAAGAGPLTEAGNIVGYIDR 195
Query: 264 KILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
I+ HLY FDPEGLLS++ A VT ++G
Sbjct: 196 TIMP-AHLY----------------------------GNRGFDPEGLLSTLPAIVTAMLG 226
Query: 324 LHFGHLIVHFKDH 336
+ G + ++
Sbjct: 227 MFTGEFVRRSEEQ 239
>gi|373953356|ref|ZP_09613316.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
gi|373889956|gb|EHQ25853.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
Length = 404
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 78/319 (24%), Positives = 122/319 (38%), Gaps = 89/319 (27%)
Query: 83 INHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL-----RALNLFLLG 137
++HSPWNG T D + P F+FI G+S+ +Y + ++ K I R + L LLG
Sbjct: 80 LHHSPWNGFTFYDLIFPLFIFIAGISMPFSYNRQVAQSPSSNKQIYVRLIKRTVLLILLG 139
Query: 138 IFLQGGF-FHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLF 196
+ G F G Q R+ VL RIA+A AA+ +++S L
Sbjct: 140 TVVNGALHFAGYQ---------QTRFASVLGRIALACFFAAVI--------YLNSSL--- 179
Query: 197 RKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACN 256
+++ W ++L L+ L PV + G P N
Sbjct: 180 -RWQIIWFAVILLGYWLLMALV-----------PVPGHGA------------GVLTPGAN 215
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
ID+ L L+RK +DPEGLLS++ A
Sbjct: 216 LSAWIDQHFLP-GKLHRK-----------------------------VYDPEGLLSTIPA 245
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGM------HLNKALYS 370
T ++G+ GH + R L I + G+SL + + +NK +++
Sbjct: 246 IATAMMGIFTGHFLQWEPGERLSPLKKI---GIMAAAGISLILIALIWNMAFPINKNMWT 302
Query: 371 LSYTCLTAGASGVLLAGIY 389
S+T G S +L Y
Sbjct: 303 SSFTLYAGGWSLLLFTLFY 321
>gi|329851309|ref|ZP_08266066.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
gi|328840155|gb|EGF89727.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
Length = 384
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 95/369 (25%), Positives = 144/369 (39%), Gaps = 99/369 (26%)
Query: 52 QQRRLISLDVFRGLTVALMIL-------VDDVGGILPAINHSPWNGLTLADFVMPFFLFI 104
Q R ++LD+ RGL++ M+L + G +L H+ W G T D V P FLF
Sbjct: 7 QGNRWLALDILRGLSIIFMLLNLNPGSWSEQYGWVL----HAKWEGATFIDMVAPVFLFC 62
Query: 105 VGVSLALTYKNFPCKVVATR------KAIL-RALNLFLLGIFLQGGFFHGINNLKYGVDI 157
+GV++ L+ + ++ A K IL RA L LLG+FL N D
Sbjct: 63 IGVAIPLSLRR---RIEAGESNGQLAKHILNRAGILVLLGLFL---------NAYPAFDW 110
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFR-KYRGHWVVALVLTTLYLLL 216
A +R GVLQRI + Y AL + + + FR + W+ + + L
Sbjct: 111 AHMRIPGVLQRIGVCYGAVALFVL------FTARREGGFRLNAKAGWIAWTFVLLSWTAL 164
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVG----MIDRKILGIQHLY 272
L ++VP V G P + VG +DR +L H++
Sbjct: 165 L--MFVP-----------------------VPGFGAPRFDPVGSWPAYVDRLVLTTDHMF 199
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
P + P+D + FDP+GLLS+ L G GH
Sbjct: 200 --PWW----------------PVDG----KVVFDPDGLLSTWPVCANVLFGALVGH---- 233
Query: 333 FKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLN----KALYSLSYTCLTAGASGVLLAGI 388
R + IL + G L VG+H K +++ ++ T G S V L +
Sbjct: 234 ---ARLTGITAPILKMLVAGGLLMAAAVGLHTTIPIIKHIWTATFALFTIGFSLVSLGAL 290
Query: 389 YFMVRYISS 397
+V +S
Sbjct: 291 TLLVERWNS 299
>gi|153805867|ref|ZP_01958535.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
gi|149130544|gb|EDM21750.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
Length = 377
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 64/140 (45%), Gaps = 23/140 (16%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 13 RLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWDLVMPLF 72
Query: 102 LFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ + +KN P K RK I R + LF+ G+ +QG N L D
Sbjct: 73 LFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQG------NLLGLNPDSL 126
Query: 159 QIRWMGVLQRIAIAYLVAAL 178
+ + LQ IA YL+AA+
Sbjct: 127 YL-YSNTLQAIATGYLIAAI 145
>gi|423219681|ref|ZP_17206177.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
CL03T12C61]
gi|392624886|gb|EIY18964.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
CL03T12C61]
Length = 375
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/140 (33%), Positives = 64/140 (45%), Gaps = 23/140 (16%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RG + L++ V IL +H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWDLVMPLF 70
Query: 102 LFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
LF+ G S+ + +KN P K RK I R + LF+ G+ +QG N L D
Sbjct: 71 LFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQG------NLLGLNPDSL 124
Query: 159 QIRWMGVLQRIAIAYLVAAL 178
+ + LQ IA YL+AA+
Sbjct: 125 YL-YSNTLQAIATGYLIAAI 143
>gi|218260819|ref|ZP_03475938.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
DSM 18315]
gi|218224342|gb|EEC96992.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
DSM 18315]
Length = 376
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 75/165 (45%), Gaps = 25/165 (15%)
Query: 54 RRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPF 100
+RL SLDV RG + ++ ++ + L +H W G + D VMP
Sbjct: 6 KRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWDLVMPL 65
Query: 101 FLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
F+F+ GVS+ YK P K R+ I R L++ G+ QG G+D
Sbjct: 66 FMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNLL--------GLDP 117
Query: 158 AQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG 201
+I + LQ IA+ YL+AA+ + ++ + + ++L Y G
Sbjct: 118 GRIYLYSNTLQAIAMGYLIAAMLFLHVRLSVQIGTAVALLLVYWG 162
>gi|313225183|emb|CBY20977.1| unnamed protein product [Oikopleura dioica]
Length = 335
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 78/324 (24%), Positives = 133/324 (41%), Gaps = 64/324 (19%)
Query: 63 RGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVA 122
RG+ + +MI V+ GG +H+ W GLT+AD MP+F+F++GVSL ++ +
Sbjct: 2 RGIAIGIMIFVNYGGGGYWFFDHAVWFGLTVADLAMPWFMFMMGVSLTFSFNSM------ 55
Query: 123 TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIW 182
+K + + NL N +G + GVLQR AI Y V + ++
Sbjct: 56 VKKVLRLSYNLV---------------NPTFGT------FPGVLQRFAICYAVVSPLQL- 93
Query: 183 LKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFN 242
S+ L W+V L L L+L L +G+ E P +
Sbjct: 94 ----AQQSTSL--------RWLVILGLEGLWLGLTFGM------AEAGCPAGYLGPGGLH 135
Query: 243 VTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQ 302
R +G A + R+I H+Y T +Y L A
Sbjct: 136 RDGQFRNCSGGADKGCLIYCREINQKAHIYG----DSTSHPVFWHDEYWFGDLQA----- 186
Query: 303 APFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSL----- 357
+DPEG+L S+ + + +GL G + ++ + R + + + L +G +L
Sbjct: 187 --YDPEGILGSINSILIVFLGLQAGRIFNFYETFQQRAIRLSVWGTVLTAVGGALTGLNQ 244
Query: 358 --DFVGMHLNKALYSLSYTCLTAG 379
+ + + K L++LS+ + AG
Sbjct: 245 FQEGSNIPIAKNLWTLSFVLVMAG 268
>gi|423345098|ref|ZP_17322787.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
CL03T12C32]
gi|409222884|gb|EKN15821.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
CL03T12C32]
Length = 376
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 78/171 (45%), Gaps = 27/171 (15%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLA 94
+P +Q RL SLDV RG + ++ ++ V L +H W G +
Sbjct: 2 KPAYQ--RLESLDVLRGFDLFCLVALEGVLHSLGRAIDAPWYNDFLWGFSHVQWEGFSSW 59
Query: 95 DFVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNL 151
D VMP F+F+ GVS+ YK P K R+ + R L++ G+ QG
Sbjct: 60 DLVMPLFMFMAGVSMPFALSRYKAMPDKWAVYRRIVKRVALLWIFGMMCQGNLL------ 113
Query: 152 KYGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG 201
G+D +I + LQ IA+ YL++A+ + ++ + + ++L Y G
Sbjct: 114 --GLDPDRIYLYSNTLQAIAMGYLISAMLFLHVRLSVQIGTAVALLLAYWG 162
>gi|393784535|ref|ZP_10372698.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
CL02T12C01]
gi|392665516|gb|EIY59040.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
CL02T12C01]
Length = 378
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 84/364 (23%), Positives = 142/364 (39%), Gaps = 88/364 (24%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADF 96
++ RL SLD+ RG + L++ V ++ +H W G L D
Sbjct: 8 ENNSSRLASLDILRGFDLFLLVFFQPVFVALGQQLDLPFLNRLVYQFDHEAWVGFHLWDL 67
Query: 97 VMPFFLFIVGVSLALTYKNFP-----CKVVATRKAILRALNLFLLGIFLQGGFFHGINNL 151
VMP FLF+ G S+ + + C+ V R+ R + LFL G+ +QG
Sbjct: 68 VMPLFLFMTGASMPFSLSKYKISSAGCQFV-YRRIFRRVVLLFLFGMIVQGNLL------ 120
Query: 152 KYGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLT 210
G D I + LQ IA+ YL+AA+ ++ H S K + ++ L+L
Sbjct: 121 --GFDSQHIYLYSNTLQAIAVGYLIAAIIQL------HFSFKWQI--------IITLLLL 164
Query: 211 TLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQH 270
+Y WI C G P N +DR +LG
Sbjct: 165 LVY-------------------------WIPMTFC---GDFTPQGNFAEQVDRWVLG--- 193
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
+R +Y S Y + + SS+ VT ++G G L+
Sbjct: 194 RFRDGVYWNGDGTWSFSLQY---------------NYTWIWSSLTFGVTVMLGTFAGKLM 238
Query: 331 VHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
+ +R R++ ++++ ++ L + M + K L++ S T L+ G +L+A Y+
Sbjct: 239 KEGEANRKRVVRILLITGLILIGVAVLWSLQMPVIKRLWTGSMTLLSGGYCFLLMAFFYY 298
Query: 391 MVRY 394
+ Y
Sbjct: 299 WIDY 302
>gi|423722057|ref|ZP_17696233.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
CL09T00C40]
gi|409242759|gb|EKN35519.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
CL09T00C40]
Length = 376
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 78/171 (45%), Gaps = 27/171 (15%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLA 94
+P +Q RL SLDV RG + ++ ++ V L +H W G +
Sbjct: 2 KPSYQ--RLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWEGFSSW 59
Query: 95 DFVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNL 151
D VMP F+F+ GVS+ YK P K R+ + R L++ G+ QG
Sbjct: 60 DLVMPLFMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNLL------ 113
Query: 152 KYGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG 201
G+D +I + LQ IA+ YL++A+ + ++ + + ++L Y G
Sbjct: 114 --GLDPDRIYLYSNTLQAIAMGYLISAMLFLHVRLSVQIGTAVALLLAYWG 162
>gi|338211253|ref|YP_004655306.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336305072|gb|AEI48174.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 374
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 94/371 (25%), Positives = 150/371 (40%), Gaps = 102/371 (27%)
Query: 44 SKQTRPQHQQRRLISLDVFRGLTVALMILVDDV-----------GGILPA--INHSPWNG 90
S TRP RL+SLD RG + + +++ G I+ A ++H WNG
Sbjct: 5 STDTRPH----RLLSLDALRGFDMFWITGGEEIFHLLAKATGWTGAIIMAEQLSHPDWNG 60
Query: 91 LTLADFVMPFFLFIVGV----SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFH 146
D + P FLF+ GV SL + + + RK I R L L LLGI G
Sbjct: 61 FRAYDLIFPLFLFLSGVSAPYSLGVRLERGDDRGKMLRKVIQRGLTLVLLGIIYNNGL-- 118
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVA 206
+K + +R+ VL RI +A + A + + + S+++ W V+
Sbjct: 119 ---QIK---PLEDMRFPSVLGRIGLAGMFAQIIYL------YTSTRVQYI------WFVS 160
Query: 207 LVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
L+L ++L PV CG G CN V +DR I+
Sbjct: 161 LLLGYWAFVML-----------VPVP-----------GCGA-GLMTMECNPVSYLDRLII 197
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
HL+ K I+ DPEGL+S++ A T L+G+
Sbjct: 198 P-GHLH-KDIH----------------------------DPEGLVSTIPAIATGLLGIFA 227
Query: 327 GHLIVHFKDHR-----DRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGAS 381
G+L+ D R ++L + + +G D+V +NK L++ S+ G S
Sbjct: 228 GNLLR--ADERSTSRTQKVLVLFVAGILFLIIGKLWDYV-FPINKNLWTSSFVMTVGGWS 284
Query: 382 GVLLAGIYFMV 392
+LL+ Y+++
Sbjct: 285 LILLSLFYWII 295
>gi|423344000|ref|ZP_17321713.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
CL02T12C29]
gi|409213862|gb|EKN06875.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
CL02T12C29]
Length = 376
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 46/165 (27%), Positives = 75/165 (45%), Gaps = 25/165 (15%)
Query: 54 RRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPF 100
+RL SLDV RG + ++ ++ + L +H W G + D VMP
Sbjct: 6 KRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWDLVMPL 65
Query: 101 FLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
F+F+ GVS+ YK P K R+ I R L++ G+ QG G+D
Sbjct: 66 FMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNLL--------GLDP 117
Query: 158 AQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG 201
+I + LQ IA+ YL+AA+ + ++ + + ++L Y G
Sbjct: 118 DRIYLYSNTLQAIAMGYLIAAMLFLHVRLSVQIGTAVALLLVYWG 162
>gi|345322030|ref|XP_003430524.1| PREDICTED: hypothetical protein LOC100681967 [Ornithorhynchus
anatinus]
Length = 530
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 48/80 (60%), Gaps = 4/80 (5%)
Query: 65 LTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTY----KNFPCKV 120
L++ LM+ V+ GG H+PWNGLT+AD VMP+F+FI+G S+AL + + +V
Sbjct: 130 LSLTLMVFVNYGGGGYWFFEHAPWNGLTVADLVMPWFVFILGTSVALAFYAMRRRGVNRV 189
Query: 121 VATRKAILRALNLFLLGIFL 140
RK R L ++G+F
Sbjct: 190 QLLRKLTWRTAVLMIIGLFF 209
>gi|154492357|ref|ZP_02031983.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
43184]
gi|154087582|gb|EDN86627.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
43184]
Length = 376
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 47/171 (27%), Positives = 79/171 (46%), Gaps = 27/171 (15%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLA 94
+P +Q RL SLDV RG + ++ ++ V L +H W+G +
Sbjct: 2 KPSYQ--RLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWDGFSSW 59
Query: 95 DFVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNL 151
D VMP F+F+ GVS+ YK P K R+ + R L++ G+ QG
Sbjct: 60 DLVMPLFMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNLL------ 113
Query: 152 KYGVDIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG 201
G+D +I + LQ IA+ YL++A+ + ++ + + ++L Y G
Sbjct: 114 --GLDPDRIYLYSNTLQAIAMGYLISAMLFLHVRLSVQIGTVVALLLAYWG 162
>gi|212557932|gb|ACJ30386.1| Conserved hypothetical protein [Shewanella piezotolerans WP3]
Length = 387
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 101/374 (27%), Positives = 153/374 (40%), Gaps = 97/374 (25%)
Query: 40 SNSNSKQTR-PQHQQRRLISLDVFRGLTV----------ALMILVDDVGGIL---PAINH 85
N+ S+ P+ + RL SLD RG + A +I++ GG+ ++H
Sbjct: 5 KNTQSQTEHGPKKNKVRLKSLDALRGFDMFWILGGEAIFAALIVLTGWGGLHWLDKQMHH 64
Query: 86 SPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR-----KAILRALNLFLLGIFL 140
S W+G T D + P F+F+ GV+L L+ K + R A+ R L L LLG+
Sbjct: 65 SAWHGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPMVQRMPLYQHAVKRLLLLLLLGVIY 124
Query: 141 QGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYR 200
G+ G + + IR+ VL RIA A+ AL +W H S +
Sbjct: 125 NHGWGTGAP-----MALGDIRYASVLGRIAFAWFFCALL-VW-----HTSLRTQ------ 167
Query: 201 GHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGM 260
++ ++ +L+ Y L FP+ T S+ A A G
Sbjct: 168 -------IIASVSILVGYALL-----QMFPLITLDST---------------GAFTATGS 200
Query: 261 IDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTC 320
I+ I G M L + A DPEG+LS++ A V
Sbjct: 201 INAAIDG-------------------------MLLPGVIFQNAAVDPEGILSTIPAVVNG 235
Query: 321 LIGLHFGHLIVHFKDHRDRMLNWIILSS-CLIGLGLSLDFVGMH----LNKALYSLSYTC 375
L+G+ GH IV K H W L+ CL+GLGL + +NK L++ S+
Sbjct: 236 LLGVFVGHFIV--KTHAKG--EWYKLAVMCLLGLGLLALGWLISPIIPVNKTLWTSSFVL 291
Query: 376 LTAGASGVLLAGIY 389
+T+G S +LL Y
Sbjct: 292 VTSGWSVLLLTLFY 305
>gi|149176468|ref|ZP_01855081.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
gi|148844581|gb|EDL58931.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
Length = 518
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 88/440 (20%), Positives = 165/440 (37%), Gaps = 87/440 (19%)
Query: 5 NPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRG 64
N +N+D+ + I + ++ + ++E + + + K+ + +RL+SLD +RG
Sbjct: 22 NNLNSDSQNSETIPLQQTPGVEVSAEVEPAKAVTEKDVSLKEKKKPETNQRLVSLDAYRG 81
Query: 65 LTV--------ALMILVDDVGGILPAIN--------------------HSPWNGLTLADF 96
+ A+ +V + +L N H W G D
Sbjct: 82 FVMLAMASGGLAIASVVRNSPEVLDQYNGTQWESSWKTLWQTLSYQLSHVEWTGAGFWDL 141
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRK----AILRALNLFLLGIFLQGGFFHGINNLK 152
+ P F+F+VGVS+ + + K +T K AI RA+ L LG+FL F
Sbjct: 142 IQPSFMFMVGVSMPFSVRKRRQKGDSTFKIWMHAIFRAILLVALGVFLSSQFSPERGFTY 201
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTL 212
V + VL +I + YLV +V+ R + + + +
Sbjct: 202 EDVPQTNFTFANVLCQIGLGYLVVFF---------YVN------RSFATQMIGVVTILGG 246
Query: 213 YLLLLYGLYVPDWQ------YEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL 266
Y Y P+ + Y V+ + W + G+ + NA +DR++L
Sbjct: 247 YWFFFYQYMPPEDELAAVKTYLKEVQHKDEAEW--SQFSGIGSAWNKHTNAAAAVDRQLL 304
Query: 267 GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHF 326
+ Y P + PD G Q L+ + + T L GL
Sbjct: 305 NMFPRYDNP---------KDDPDQGDTFWVNKGGYQT-------LNFIPSIATMLFGLMA 348
Query: 327 GHLIVHFKDHRDRMLNWIILSSCLIGLGLSL--------------DFVGMHLNKALYSLS 372
G L++ + ++M +L + LI G+S+ ++ + + K ++S
Sbjct: 349 GQLLI--SNRLEKMKVKWLLQAGLICFGVSMLLDTSIWPVNINNWEWHLVPIVKRIWSPG 406
Query: 373 YTCLTAGASGVLLAGIYFMV 392
+ +AG + LA Y+++
Sbjct: 407 WAIFSAGWAFWFLAVFYWII 426
>gi|392965639|ref|ZP_10331058.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
gi|387844703|emb|CCH53104.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
Length = 411
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 39/105 (37%), Positives = 61/105 (58%), Gaps = 9/105 (8%)
Query: 44 SKQTRPQHQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPW--NGLTLADFVM 98
S +TR Q ++R+ S+D+FR LT+ MI V+D+ GI + HSP + L LAD V
Sbjct: 9 SPETRRQLPEKRVHSIDIFRALTMLFMIFVNDLWTLIGIPDWLEHSPADVDFLGLADVVF 68
Query: 99 PFFLFIVGVSLALTYKNFPCK----VVATRKAILRALNLFLLGIF 139
P FLFIVG+S+ + K + R ++R++ L ++G+F
Sbjct: 69 PCFLFIVGMSIPFAIQGRLAKGDSYGLIIRHIVVRSVALLIMGVF 113
>gi|150007979|ref|YP_001302722.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
gi|423331514|ref|ZP_17309298.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
CL03T12C09]
gi|149936403|gb|ABR43100.1| putative transmembrane protein [Parabacteroides distasonis ATCC
8503]
gi|409230084|gb|EKN22952.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
CL03T12C09]
Length = 378
Score = 59.3 bits (142), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 44/152 (28%), Positives = 73/152 (48%), Gaps = 23/152 (15%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLAD 95
+ + RL SLDV RG + +++++ V + +H W G + D
Sbjct: 2 EKKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWD 61
Query: 96 FVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLK 152
VMP FLF+ GVS+ + YK+ P K+ R+ R L L++ G+ QG N L
Sbjct: 62 LVMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQG------NLLA 115
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAALCEIWLK 184
D + + LQ IA+ YL+A+L ++++
Sbjct: 116 LDPDRVYL-YSNTLQSIAMGYLIASLLFLYVR 146
>gi|256840847|ref|ZP_05546355.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|298376669|ref|ZP_06986624.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
gi|256738119|gb|EEU51445.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|298266547|gb|EFI08205.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
Length = 378
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 69/146 (47%), Gaps = 23/146 (15%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLAD 95
+ + RL SLDV RG + +++++ V + +H W G + D
Sbjct: 2 EKKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWD 61
Query: 96 FVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLK 152
VMP FLF+ GVS+ + YK+ P K+ R+ R L L++ G+ QG N L
Sbjct: 62 LVMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQG------NLLA 115
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAAL 178
D + + LQ IA+ YL+A+L
Sbjct: 116 LDPDRVYL-YSNTLQSIAMGYLIASL 140
>gi|301309931|ref|ZP_07215870.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423340410|ref|ZP_17318149.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
CL09T03C24]
gi|300831505|gb|EFK62136.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409227845|gb|EKN20741.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
CL09T03C24]
Length = 378
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 69/146 (47%), Gaps = 23/146 (15%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLAD 95
+ + RL SLDV RG + +++++ V + +H W G + D
Sbjct: 2 EKKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWD 61
Query: 96 FVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLK 152
VMP FLF+ GVS+ + YK+ P K+ R+ R L L++ G+ QG N L
Sbjct: 62 LVMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQG------NLLA 115
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAAL 178
D + + LQ IA+ YL+A+L
Sbjct: 116 LDPDRVYL-YSNTLQSIAMGYLIASL 140
>gi|322785719|gb|EFZ12357.1| hypothetical protein SINV_16151 [Solenopsis invicta]
Length = 111
Score = 58.9 bits (141), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 43/72 (59%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMP 99
+N + + + ++ R+ ++D FRG++ MI V+D G + H+ WNGL L D V P
Sbjct: 32 NNVKDESSNKEPKKNRVKAIDTFRGISTLFMIFVNDGSGSYTVLEHATWNGLLLGDLVFP 91
Query: 100 FFLFIVGVSLAL 111
F++I+GV + +
Sbjct: 92 CFIWIMGVCVPI 103
>gi|16552925|dbj|BAB71412.1| unnamed protein product [Homo sapiens]
Length = 367
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 69/249 (27%), Positives = 108/249 (43%), Gaps = 49/249 (19%)
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGD--GHVSSKLSLFR-----KYRGHWVVAL 207
V ++R GVLQR+ + Y V A+ E+ H +S+ S W++ L
Sbjct: 70 VSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLIL 129
Query: 208 VLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDR 263
VL L+L L + L VP T P G G G P C A G IDR
Sbjct: 130 VLEGLWLGLTFLLPVPG------CPTGYLGP-------GGIGDFGKYPNCTGGAAGYIDR 176
Query: 264 KILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIG 323
+LG HLY+ P + + +DPEG+L ++ + V +G
Sbjct: 177 LLLGDDHLYQHPSSAVLYHTEV------------------AYDPEGILGTINSIVMAFLG 218
Query: 324 LHFGHLIVHFKDHRDRMLNWIILSSCLIGL-GLSLDFVGMH-----LNKALYSLSY-TCL 376
+ G +++++K +L C++GL ++L V + +NK L+SLSY T L
Sbjct: 219 VQAGKILLYYKARTKDILIRFTAWCCILGLISVALTKVSENEGFIPVNKNLWSLSYVTTL 278
Query: 377 TAGASGVLL 385
++ A +LL
Sbjct: 279 SSFAFFILL 287
>gi|255013329|ref|ZP_05285455.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
gi|410103820|ref|ZP_11298741.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
gi|409236549|gb|EKN29356.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
Length = 378
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 69/146 (47%), Gaps = 23/146 (15%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLAD 95
+ + RL SLDV RG + +++++ V + +H W G + D
Sbjct: 2 EKKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWD 61
Query: 96 FVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLK 152
VMP FLF+ GVS+ + YK+ P K+ R+ R + L++ G+ QG N L
Sbjct: 62 LVMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQG------NLLA 115
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAAL 178
D + + LQ IA+ YL+A+L
Sbjct: 116 LDPDRVYL-YSNTLQSIAMGYLIASL 140
>gi|343087500|ref|YP_004776795.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342356034|gb|AEL28564.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 386
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 68/144 (47%), Gaps = 26/144 (18%)
Query: 54 RRLISLDVFRGLTV-------ALMILVDDVGGILPAIN-------HSPWNGLTLADFVMP 99
+RL+S+D RG + A ++L+ GI PAI+ H WNG + DF+ P
Sbjct: 25 KRLLSIDALRGFDMLLIAGAGAFLVLLKGKTGI-PAIDWIAGQFYHPAWNGFSFYDFIFP 83
Query: 100 FFLFIVGVSLALTY---KNFP-CKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGV 155
FLFI GVSL + +N K +K R L L LLGI + N+
Sbjct: 84 LFLFIAGVSLTFSLNKGRNLGMSKPTLYKKTFSRMLVLILLGILYK-------NSPVPIF 136
Query: 156 DIAQIRWMGVLQRIAIAYLVAALC 179
+ +QIR+ VL RI IA V L
Sbjct: 137 EPSQIRYGSVLGRIGIATFVTTLV 160
>gi|357624248|gb|EHJ75102.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
plexippus]
Length = 340
Score = 57.8 bits (138), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 72/288 (25%), Positives = 123/288 (42%), Gaps = 53/288 (18%)
Query: 119 KVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
+V A + R+L L L+GI L G N + + +R+ GVLQR+A YL+
Sbjct: 19 RVNALGQVARRSLLLSLIGICL------GSVNTNW----SYVRFPGVLQRLAAMYLIVGS 68
Query: 179 CEI-WLKGDGHVSSKLSLFRKYRG---HWVVALVLTTLYLLLLYGLYVPDW--QYEFPV- 231
E +++ ++ SLFR W+ +++ + L + + P Y P
Sbjct: 69 LECAFMRTSQNIIPGRSLFRDIAAGWQQWLATVLMVAIQLCITLTVAAPGCPVGYSGPGG 128
Query: 232 --ETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPD 289
T++ + N T G+ G IDR ILG HLY+ + +
Sbjct: 129 LHRTATGDFSLQNCTGGI----------AGYIDRLILGPNHLYQHGTFKSIYRT------ 172
Query: 290 YGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSC 349
Q P DPEG+L + + G H +++ + R R++ W+ S
Sbjct: 173 ------------QLPHDPEGILGILSGVLVVQAGAHAARIMLVYNHARARIMRWVFW-SV 219
Query: 350 LIGL--GLSLDFVG---MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ G+ GL F + +NK L+S+SY +T+ + + A +YF+V
Sbjct: 220 MFGVVGGLLCKFSDGGYIPVNKNLWSVSYCLVTSSMAFFIQAILYFVV 267
>gi|262381452|ref|ZP_06074590.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262296629|gb|EEY84559.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 378
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 69/146 (47%), Gaps = 23/146 (15%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLAD 95
+ + RL SLDV RG + +++++ V + +H W G + D
Sbjct: 2 EKKKLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAIDMPWFNSFMWGFSHVEWEGFSTWD 61
Query: 96 FVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLK 152
VMP FLF+ GVS+ + YK+ P K+ R+ R + L++ G+ QG N L
Sbjct: 62 LVMPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQG------NLLA 115
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAAL 178
D + + LQ IA+ YL+A+L
Sbjct: 116 LDPDRVYL-YSNTLQSIAMGYLIASL 140
>gi|291295418|ref|YP_003506816.1| hypothetical protein [Meiothermus ruber DSM 1279]
gi|290470377|gb|ADD27796.1| conserved hypothetical protein [Meiothermus ruber DSM 1279]
Length = 399
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 49/149 (32%), Positives = 80/149 (53%), Gaps = 20/149 (13%)
Query: 39 PSNSNSKQTRPQHQQR--RLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTL 93
PS + + +++ + RL++LD RGLTV LM+LV+++ + H+P+ G+TL
Sbjct: 6 PSTAIAVESKNKATPAGARLLALDGLRGLTVFLMLLVNNLALQEATPDQLVHAPFGGVTL 65
Query: 94 ADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALN----LFLLGIFLQGGFFHGIN 149
AD V P+FLF +G ++ +F + + + +LR L +FLLG+FL
Sbjct: 66 ADLVFPWFLFCMGAAIPYAASSFDKQKLPLWRRLLRILRRTSLIFLLGLFLTSALAR--- 122
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
+ +GVLQ IA+AY +AAL
Sbjct: 123 --------TPVFALGVLQLIALAYCLAAL 143
>gi|311746093|ref|ZP_07719878.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
gi|126576311|gb|EAZ80589.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
Length = 367
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 52/108 (48%), Gaps = 19/108 (17%)
Query: 49 PQHQQRRLISLDVFRGLTVALMI---------LVDD------VGGILPAINHSPWNGLTL 93
P + RL+SLD FRGLT+ L+I L++ + H PWNGL
Sbjct: 4 PLSKSGRLVSLDTFRGLTMFLLIAEAAFVYESLLEAFPDPGILNSFFTQFTHHPWNGLRF 63
Query: 94 ADFVMPFFLFIVGVSLALT----YKNFPCKVVATRKAILRALNLFLLG 137
D + PFF+FIVGV++ + +N + T+ + R LFL G
Sbjct: 64 WDLIQPFFMFIVGVAMPFSLNKRLENQENRSEVTKHILKRCFYLFLFG 111
>gi|157374353|ref|YP_001472953.1| hypothetical protein Ssed_1214 [Shewanella sediminis HAW-EB3]
gi|157316727|gb|ABV35825.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3]
Length = 378
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/118 (33%), Positives = 66/118 (55%), Gaps = 6/118 (5%)
Query: 278 SRTKQCSINS-PDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV--HFK 334
S T + SIN+ D +P ++ P DPEG+LS++ A L G+ GH I+ H K
Sbjct: 185 SFTPEGSINAFVDTHFLP--GITYQNRPLDPEGILSTIPAVANALAGVFVGHFIIKPHPK 242
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++ ++ + +GLG LDF+ + +NK L++ S+T +T G S +LL Y +V
Sbjct: 243 GEWFKVVYMLVAGAAFLGLGWLLDFI-VPVNKELWTSSFTLVTIGWSLILLTVFYAIV 299
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 47/153 (30%), Positives = 73/153 (47%), Gaps = 25/153 (16%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDV--GGILP------------AINHSPWNG 90
+ + + +RRL+SLD RG + IL +V G+L ++HS W+G
Sbjct: 2 EVAQAKVSKRRLMSLDALRGFDM-FWILGGEVLFAGLLAWTGWQGWQWFDTQMHHSEWHG 60
Query: 91 LTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR-----KAILRALNLFLLGIFLQGGFF 145
T D + P F+F+ GV+L L+ K +A R A+ R L L GI G+
Sbjct: 61 FTFYDLIFPLFIFLSGVALGLSPKRLDKLPIAKRMPLYIHAVKRLLLLLFFGILYNHGWG 120
Query: 146 HGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
G+ V + ++R+ VL RIA A+ AA+
Sbjct: 121 TGVP-----VVLDEVRYASVLGRIAFAWFFAAI 148
>gi|392544017|ref|ZP_10291154.1| hypothetical protein PpisJ2_19642 [Pseudoalteromonas piscicida JCM
20779]
Length = 377
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 96/375 (25%), Positives = 142/375 (37%), Gaps = 104/375 (27%)
Query: 50 QHQQRRLISLDVFRGLTV-----------ALMILVDDVG--GILPAINHSPWNGLTLADF 96
++ +RL SLD RG+ + AL +L G HSPW+G T D
Sbjct: 4 NNKPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKAFEAHTLHSPWHGFTFYDL 63
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFL-----LGIFLQGGFFHGINNL 151
+ P F+F+ GV++ L+ K RK L G+ G+ GI
Sbjct: 64 IFPLFIFLSGVAMGLSPKRIDHLPFNERKPFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIW---LKGDGHVSSKLSLFRKYRGHWVVALV 208
+D IR+ VL RIA A+ AL +W L+ HV + LF
Sbjct: 122 ---MDPDGIRYASVLGRIAFAWFFCALL-VWHTSLRTLAHVGVGILLF------------ 165
Query: 209 LTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGP-ACNAVG----MIDR 263
Y LLL + VP G G + N VG ID
Sbjct: 166 ----YWLLLCFIPVPG------------------------GQAGDLSANGVGSWNAFIDT 197
Query: 264 KIL-GIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLI 322
+L GI + R P+D PEG+LSS+ A V +
Sbjct: 198 YLLPGISYQNR--------------------PVD----------PEGVLSSLPAIVNAIA 227
Query: 323 GLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDF-VGMHLNKALYSLSYTCLTAGAS 381
G+ G I + + + I+ S ++ L L + + +NK L++ S+ +T G S
Sbjct: 228 GVFAGRAIANAQTQGEWKTVGILAGSGVLVLALGWLWDMQFPVNKELWTSSFVLVTVGWS 287
Query: 382 GVLLAGIYFMVRYIS 396
+LLA Y +V +S
Sbjct: 288 AILLAVFYAIVDVLS 302
>gi|332666399|ref|YP_004449187.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332335213|gb|AEE52314.1| Protein of unknown function DUF2261, transmembrane
[Haliscomenobacter hydrossis DSM 1100]
Length = 369
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 94/363 (25%), Positives = 139/363 (38%), Gaps = 94/363 (25%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDVGGILPA-------------INHSPWNGLTLAD 95
P +RL SLD RG + ++ + V L ++H W+G L D
Sbjct: 3 PPSTGQRLYSLDALRGFDMFWIMGAEAVVHSLATATGSSVWEAAAHQLSHPDWHGFRLYD 62
Query: 96 FVMPFFLFIVGV----SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNL 151
+ P FLF+ GV S+ +N K + I R L L LLGI G L
Sbjct: 63 LIFPLFLFLAGVATPYSVGRDLENGKPKQQLLLRVIRRGLVLVLLGIIYNNGLV-----L 117
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
K +A+IR+ VL RI +AY+ A + ++ K G + W AL++
Sbjct: 118 K---PLAEIRFPSVLGRIGLAYMFANIIYLYTKQLGQII------------WFCALLIG- 161
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGV-RGSTGPACNAVGMIDRKILGIQH 270
Y LLL FN G G N +DR I+ H
Sbjct: 162 -YWLLLR----------------------FNAAPGFPMGDLSMEGNFASYLDRLIIP-GH 197
Query: 271 LYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLI 330
LY LD DPEGL S + A T L+G++ G+L+
Sbjct: 198 LY----------------------LDI-------HDPEGLTSCIPAIGTALLGIYAGNLL 228
Query: 331 VH-FKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
H + L I+ + L + V +NK L++ S+ T G S +LL+ Y
Sbjct: 229 KHGTMTPPKKALILAIMGVVALVLAQLWNLV-FPINKNLWTSSFVLQTGGCSLLLLSAFY 287
Query: 390 FMV 392
+++
Sbjct: 288 YVI 290
>gi|255037955|ref|YP_003088576.1| hypothetical protein Dfer_4208 [Dyadobacter fermentans DSM 18053]
gi|254950711|gb|ACT95411.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 371
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 85/362 (23%), Positives = 138/362 (38%), Gaps = 98/362 (27%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPAIN-------------HSPWNGLTLADFVMP 99
+RL+SLD RG + + +++ +L + H WNG D + P
Sbjct: 7 SQRLLSLDTLRGFDMFWISGGEEIFHVLAKVTGWSWAIVLAHQFTHPDWNGFRAYDLIFP 66
Query: 100 FFLFIVGVSLALTYKNFPCKVVA----TRKAILRALNLFLLGIFLQGGFFHGINNLKYGV 155
FLF+ GVS + + K V RK I R + L LGI G F
Sbjct: 67 TFLFMAGVSTPFSLGSRLEKGVPPSQLVRKVIQRGIILVFLGIIYNNGIFE--------T 118
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWV-VALVLTTLYL 214
+ +Q+R+ VL RI +A G + + L+ YR W+ +L YL
Sbjct: 119 EWSQMRYPSVLARIGLA--------------GMFAQIIYLYFGYRARWIWFGGLLIGYYL 164
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKIL-GIQHLYR 273
+++ +PV CG G CN D I+ G HL
Sbjct: 165 FMMF----------YPVP-----------GCGA-GLLTIDCNPASYFDSLIIPGRLHL-- 200
Query: 274 KPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHF 333
+I+ DPEGL+S++ A T L+G+ G L+
Sbjct: 201 ----------TIH-------------------DPEGLVSTIPAIATGLMGIFAGELLRTS 231
Query: 334 KD---HRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
+ + +++ + + + L D+ +NK L++ S+ G S +LLA Y+
Sbjct: 232 HEVLSQKSKVIYLVFAGVVSLLVCLVWDYF-FPINKNLWTSSFVLCAGGFSTLLLALFYW 290
Query: 391 MV 392
+V
Sbjct: 291 IV 292
>gi|339021122|ref|ZP_08645235.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
gi|338751776|dbj|GAA08539.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
Length = 377
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 88/359 (24%), Positives = 148/359 (41%), Gaps = 88/359 (24%)
Query: 63 RGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCK 119
RG T+ M++V++ G + ++H+ WNG T AD V PFFLF++G + + +
Sbjct: 2 RGATIVFMVIVNNPGDWNRVWSPLDHAAWNGCTPADLVFPFFLFLMGCVIPFAFDR-RLR 60
Query: 120 VVATRKAILRAL------NLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAY 173
A R ++ + + L + FFH + +R+ GVL RIA+ Y
Sbjct: 61 EGAQRSQLVSHIAWRGLALVGLKLLLSLYPFFH----------VTHLRFFGVLTRIALCY 110
Query: 174 LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVET 233
+ A + + G + S + L +W + LY L + GL P + F
Sbjct: 111 VAAVSLYLCSRKTGFLVSVIGLI--LLAYWAI------LYALPVPGLGWPGKDFAFLDLN 162
Query: 234 SSSSPWI---FNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
+ + W+ F+ C TG LY K
Sbjct: 163 RNMAAWLDRQFSAWCQTWLHTG-----------------ILYEK---------------- 189
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD----HRDRMLNWIIL 346
+W DPEGLLS++ A T L G+ G + KD R +
Sbjct: 190 --------TW-----DPEGLLSTLPAIATTLSGVLAGQ-VFRRKDISPARRPMLFMAAGG 235
Query: 347 SSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA---GIYFMVRYISSHLMLK 402
+S L+GL F LNK+L++ S+T +++GA+ LA GI+ ++R + + +++
Sbjct: 236 ASILVGLLWGQVF---PLNKSLWTSSFTLVSSGAALCSLAVCDGIFDVLRLQNKNTIIQ 291
>gi|399069322|ref|ZP_10749357.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
AP07]
gi|398045229|gb|EJL37978.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
AP07]
Length = 233
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 39/116 (33%), Positives = 57/116 (49%), Gaps = 22/116 (18%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVG--------GILPAINHSPWNGLTLADFV 97
+RP+ RL SLDV RGLT+ MI+V+ + + H+ W G T AD V
Sbjct: 2 DSRPKAA--RLASLDVLRGLTIVGMIVVNTASYLHYVSGYAVFAGLEHAEWRGFTAADAV 59
Query: 98 MPFFLFIVGVSL-----ALTYKNFPCKV-------VATRKAILRALNLFLLGIFLQ 141
P F+F+ GVS+ L + P + A R+ ++R+ LFLLG+ L
Sbjct: 60 FPAFVFMTGVSIPLALGPLALGDGPIERGMAGLDGAALRRLLVRSGRLFLLGLILS 115
>gi|399028715|ref|ZP_10729871.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
gi|398073551|gb|EJL64721.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
Length = 382
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 87/371 (23%), Positives = 146/371 (39%), Gaps = 98/371 (26%)
Query: 55 RLISLDVFRGLTVALMI----LVDDVGGILP---------AINHSPWNGLTLADFVMPFF 101
RL+SLD RG + ++ ++ + P ++H+ W G+T D + P F
Sbjct: 9 RLVSLDALRGFVMFWIMSGEHIIHALAKAAPIPVFVWMSSQLHHTEWEGITFYDMIFPIF 68
Query: 102 LFIVGVSLALTYK--------NFPCKVVA--TRKAILRALNLFLLGIFLQGGFFHGINNL 151
LF+ GVS+ +++ N P ++ A +K L L + IFL GF +N L
Sbjct: 69 LFVAGVSMPYSFEKKMSIAGVNTPMELPAKEKKKIYLSMLKRTCILIFL--GFI--VNGL 124
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTT 211
Q R+ VL RI +A+ A + L+ K + W + +++
Sbjct: 125 LRFDGYDQTRFASVLGRIGLAWFFAGIIY------------LNFNLKKQIIWFIGILVG- 171
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
Y L + + VPD+ GV G + G IDR L
Sbjct: 172 -YYLAMKLIPVPDFG------------------AGVLTKEG---SLEGYIDRMFL----- 204
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
++S Y P EGL S++ A T L+G+ G +
Sbjct: 205 ----------PGRLHSKVYDP---------------EGLFSTIPAVATALLGMFLGTFLK 239
Query: 332 ----HFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAG 387
HF ++ ++L + + LIG+GL ++ +NK L+S S+ C G S +
Sbjct: 240 IKANHFSTNK-KILIMVASAIVLIGIGLIWNY-DFPINKHLWSSSFVCFVGGCSILFFTF 297
Query: 388 IYFMVRYISSH 398
Y ++ + H
Sbjct: 298 FYLIIDVLGFH 308
>gi|319902718|ref|YP_004162446.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
gi|319417749|gb|ADV44860.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
Length = 380
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 47/151 (31%), Positives = 69/151 (45%), Gaps = 25/151 (16%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGL 91
+Q + RL SLDV RG + L++ V +L +H W G
Sbjct: 6 QQDSLKISSSRLASLDVLRGFDLFLLVFFQPVLMSLGQQLNLPFMDVVLYQFDHEVWEGF 65
Query: 92 TLADFVMPFFLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGI 148
D +MP FLF+ GVS+ + Y++ P K + RK R L LFLLG+ +QG
Sbjct: 66 RFWDLIMPLFLFMTGVSMPFSFAKYQSSPDKCIIYRKIFRRVLLLFLLGMVVQGNLL--- 122
Query: 149 NNLKYGVDIAQIR-WMGVLQRIAIAYLVAAL 178
G++ I + LQ IA+ YL+A +
Sbjct: 123 -----GLNPKHIYFYTNTLQAIAVGYLIAGM 148
>gi|375255119|ref|YP_005014286.1| hypothetical protein BFO_1396 [Tannerella forsythia ATCC 43037]
gi|363406141|gb|AEW19827.1| putative membrane protein [Tannerella forsythia ATCC 43037]
Length = 375
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 83/369 (22%), Positives = 128/369 (34%), Gaps = 105/369 (28%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVM 98
++RL+SLD+ RG + ++++ + + H W G+ D +M
Sbjct: 6 SEKRLVSLDLLRGFDLFCLLMLQPILMTWLEIADNPAWAPLARQFTHVEWRGVAFWDLIM 65
Query: 99 PFFLFIVGVSLALTYKNFPCKVVATRKAILRALN----LFLLGIFLQGGF-------FHG 147
P F+F+ G+++ + L+ L LF LG +QG FH
Sbjct: 66 PLFMFMSGITVPFALSKYKRGAKPGHSFYLKLLKRFVILFFLGWIVQGNLLALDPNRFHI 125
Query: 148 INNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVAL 207
N LQ IA+ Y+V A C + R + V L
Sbjct: 126 FAN--------------TLQAIAVGYVVTAFCYV------------------RFSFRVQL 153
Query: 208 VLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILG 267
T L+ + Y T W P N IDR +LG
Sbjct: 154 GATVLFFI----------AYLLVFATVGGMNW------------EPGTNIAEEIDRCVLG 191
Query: 268 IQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP----EGLLSSVMATVTCLIG 323
R + I D SW FDP +LSS+ VT + G
Sbjct: 192 -----------RFRDGIITEAD--------GSW---KFDPAYHYTWILSSLNFVVTVMTG 229
Query: 324 LHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGV 383
GH++ K R R++ +I L+ L +D V L K ++S S T G +
Sbjct: 230 SFAGHILRLRKTARQRLMRLLITGVSLVVAALLMDPV-FPLIKRIWSSSMTLFYGGVCFL 288
Query: 384 LLAGIYFMV 392
L+ Y++V
Sbjct: 289 LMGLFYYVV 297
>gi|445497064|ref|ZP_21463919.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
gi|444787059|gb|ELX08607.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
Length = 381
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 39/115 (33%), Positives = 65/115 (56%), Gaps = 16/115 (13%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGI--LPA-INHSP--WNGLTLADFVMPFFLFIVGVSL 109
R++++D FRG+T+ +MI V+ + G+ +PA + H+P + +T D V P FLFIVG+S+
Sbjct: 8 RVLAIDAFRGITILVMIFVNTLAGVRGMPAWMEHAPADADAMTFPDVVFPAFLFIVGMSI 67
Query: 110 --ALTYKNFPCKVVAT--RKAILRALNLFLLGIFL---QGGFFHGINNLKYGVDI 157
A+ + A R + RA L +LG+F+ +GG+ N G+ I
Sbjct: 68 PFAMAQRQAAGDTPAARWRHVLARAAGLLVLGVFMVNAEGGY----NEAAMGMSI 118
>gi|338212226|ref|YP_004656281.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336306047|gb|AEI49149.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 369
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 87/352 (24%), Positives = 136/352 (38%), Gaps = 90/352 (25%)
Query: 55 RLISLDVFRGLTVALMI-----LVDDVGG---------ILPAINHSPWNGLTLADFVMPF 100
RL SLD RG + LMI + +GG + +H W+G T DF+ P
Sbjct: 9 RLASLDALRGFDM-LMISGGGAFLSLMGGKTDSALLNAVAAQFHHPDWDGFTFYDFIFPL 67
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI--- 157
FLF+ GVSL+++ KN K + K + + L+ FL G+ + +DI
Sbjct: 68 FLFMAGVSLSISLKNGIAKGIPQYKLMEKVFKRMLILFFL------GLLDKNAPIDILDP 121
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
A IR+ VL RI IA + A+ L L ++ +VA + LY L
Sbjct: 122 AHIRYGTVLGRIGIATFLVAI--------------LYLNTGWKTQLIVAFTILGLYYAAL 167
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
+ V D+ G+ N VG IDR + P
Sbjct: 168 MLISVGDYG---------------------GGNLSFEGNLVGWIDRAFM--------P-- 196
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
R KQ + +D + + + AT + G G +++
Sbjct: 197 GRLKQTT--------------------YDELAMTTQLSATCLTIFGSLAGKILLDKTTAN 236
Query: 338 DRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIY 389
+++ + I GL+ V +NK L+S S+ LTAG + +L+A Y
Sbjct: 237 IKLIRLAGMGVIGIAAGLAWATV-FPINKHLWSSSFILLTAGMASLLVALFY 287
>gi|374312990|ref|YP_005059420.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358755000|gb|AEU38390.1| protein of unknown function DUF1624 [Granulicella mallensis
MP5ACTX8]
Length = 408
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 36/98 (36%), Positives = 57/98 (58%), Gaps = 9/98 (9%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGI--LPAINH---SPWNGLTLADFVMPFFLFIVG 106
+ R+ S+D+FRGLT+A+MI V+D+ G+ LP H + + +T D V PFFLFI+G
Sbjct: 14 RTTRVASIDIFRGLTMAIMIFVNDLDGVQGLPWWTHHAKANIDVMTYVDMVFPFFLFIIG 73
Query: 107 VSLALT----YKNFPCKVVATRKAILRALNLFLLGIFL 140
+S+ L K P ++R+++L LG+ L
Sbjct: 74 LSMPLAIRQRLKKNPSIPQLWLHVLIRSVSLVALGVIL 111
>gi|404403948|ref|ZP_10995532.1| hypothetical protein AJC13_00860 [Alistipes sp. JC136]
Length = 369
Score = 55.5 bits (132), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 28/160 (17%)
Query: 53 QRRLISLDVFRGLTVALMI----LVDDVGGILP---------AINHSPWNGLTLADFVMP 99
+RL+SLD RG + ++ LV + + P ++ H+ W+GLT D + P
Sbjct: 4 NQRLLSLDALRGFDMLFIMGFSGLVASLCALWPNPFTDAVAGSMGHAAWDGLTHHDTIFP 63
Query: 100 FFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFFHGINNLKYGV 155
FLFI GVS + + R K I R + L +LG+ G F +
Sbjct: 64 LFLFIAGVSFPFSLAKQRANGLGERAILGKVIRRGVTLVVLGLVYNGLF---------KL 114
Query: 156 DIAQIRWMGVLQRIAIAYLVAALCEIW--LKGDGHVSSKL 193
D A +R VL RI +A++ AA+ I+ +K G V++ +
Sbjct: 115 DFASLRVASVLGRIGLAWMFAAILYIYCGVKTRGIVAAAV 154
Score = 42.7 bits (99), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 48/93 (51%), Gaps = 8/93 (8%)
Query: 305 FDPEGLLSSVMATVTCLIGLHFGHLIVHFK-----DHRDRMLNWIILSSCLIGLGLSLDF 359
FDPEGLLS+V A V+ + G+ G + + D + + + L+G+ S
Sbjct: 201 FDPEGLLSTVPAIVSAMFGMFTGEFLRRERPGLTGDRKALYMALAAAAITLVGIAWSGV- 259
Query: 360 VGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
M +NK L+S S+TC+ G S + A Y+++
Sbjct: 260 --MPINKKLWSSSFTCVVTGYSLGMFALFYYLI 290
>gi|127512051|ref|YP_001093248.1| hypothetical protein Shew_1118 [Shewanella loihica PV-4]
gi|126637346|gb|ABO22989.1| conserved hypothetical protein [Shewanella loihica PV-4]
Length = 387
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 47/152 (30%), Positives = 73/152 (48%), Gaps = 24/152 (15%)
Query: 46 QTRPQHQQR-RLISLDVFRG-----------LTVALMILVDDVGGILP--AINHSPWNGL 91
T P+ + RL+SLD RG L AL++ G + ++HS W+G
Sbjct: 6 DTHPKPAAKPRLMSLDALRGFDMFWILGGEALFAALLVWTGWQGWRIADAQMHHSQWHGF 65
Query: 92 TLADFVMPFFLFIVGVSLALTYKN-----FPCKVVATRKAILRALNLFLLGIFLQGGFFH 146
T D + P F+F+ GV+L L+ K +P ++ R AI R + L L G+ G+
Sbjct: 66 TFYDLIFPLFIFLSGVALGLSPKRLDSLPWPERLPLYRHAIKRLMLLLLFGVLYNHGWGT 125
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
G+ + ++R+ VL RIA A+ AAL
Sbjct: 126 GMP-----MAADEVRYASVLGRIAFAWFFAAL 152
>gi|390958852|ref|YP_006422609.1| hypothetical protein Terro_3042 [Terriglobus roseus DSM 18391]
gi|390413770|gb|AFL89274.1| Protein of unknown function (DUF1624) [Terriglobus roseus DSM
18391]
Length = 406
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 17/112 (15%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPW---------NGLTLAD 95
+ TR + +R+ SLD+FRGL +ALMI V++ L I PW N +T D
Sbjct: 3 ETTRATVKPQRIQSLDIFRGLNIALMIFVNE----LHEIKGLPWWTYHAPGAANVMTYVD 58
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRK----AILRALNLFLLGIFLQGG 143
V P FL IVG+SL L + + T + +LR++ L +LG+ LQ
Sbjct: 59 MVFPAFLVIVGMSLPLALQARIRRGDETPQLIWYVVLRSVALIVLGLILQNA 110
>gi|343086706|ref|YP_004776001.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342355240|gb|AEL27770.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 368
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 47/96 (48%), Gaps = 15/96 (15%)
Query: 52 QQRRLISLDVFRGLTVALMI-----LVDDVGGILPAIN----------HSPWNGLTLADF 96
+RL+SLD +RG+T+ L++ L G+ P ++ H PWNGL D
Sbjct: 8 SNKRLVSLDAYRGITMFLLVAESARLYGAFEGLFPEVSGWQMFFTQFTHHPWNGLRFWDL 67
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALN 132
+ PFF+FIVGV++ + K RK L L
Sbjct: 68 IQPFFMFIVGVAMPFSLNKRLEKQGDRRKVTLHILK 103
>gi|315126665|ref|YP_004068668.1| hypothetical protein PSM_A1588 [Pseudoalteromonas sp. SM9913]
gi|315015179|gb|ADT68517.1| conserved hypothetical protein [Pseudoalteromonas sp. SM9913]
Length = 307
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 60/287 (20%), Positives = 112/287 (39%), Gaps = 76/287 (26%)
Query: 102 LFIVGVSLALTYKNFPCKVVATR--KAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQ 159
+FI+G ++ ++K A++ + I R +F +G+ +N + I
Sbjct: 1 MFIIGSAMFFSFKKTNSTANASQVLRLIKRGAIIFAIGL--------ALNMYPFTTSIEN 52
Query: 160 IRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYG 219
+R +GVLQRI IAY++A++C + L G +S + + Y
Sbjct: 53 LRILGVLQRIGIAYILASICVLLLNRRGVISLSVVILIAY-------------------- 92
Query: 220 LYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSR 279
W+ ++ G + N V +D +LG HL++
Sbjct: 93 -------------------WLLLLSVGPNNAYTLENNLVRTVDIAVLGESHLWQGK---- 129
Query: 280 TKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDR 339
FDPEGLLS++ + V+ L G L+
Sbjct: 130 ----------------------GLAFDPEGLLSTLPSVVSVLFGFEVTRLLTSSSSQWTS 167
Query: 340 MLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+ +++ I LG L + M +NK+L++ ++ T+G + ++LA
Sbjct: 168 IKRLLVIGVVGIALG-QLGSLIMPINKSLWTSTFVIYTSGIACIVLA 213
>gi|334364999|ref|ZP_08513969.1| putative membrane protein [Alistipes sp. HGB5]
gi|313158791|gb|EFR58176.1| putative membrane protein [Alistipes sp. HGB5]
Length = 383
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 47/161 (29%), Positives = 69/161 (42%), Gaps = 40/161 (24%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMI----LVDDVGGILPA---------INHSPW 88
S + R Q RL+SLD RG + ++ LV V G+ P+ ++H W
Sbjct: 10 SGFRAGRIMKQSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDAAAASMSHVAW 69
Query: 89 NGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR-----------KAILRALNLFLLG 137
+G D + P FLFI GVS FP V R K + R L L +LG
Sbjct: 70 DGFAHHDTIFPLFLFIAGVS-------FPYSVAKQRAGGMSEGRIYAKIVRRGLTLVVLG 122
Query: 138 IFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
+ + N + +D +R VL RI +A+ +AA+
Sbjct: 123 M---------VYNGLFKLDFENLRIASVLGRIGLAWSIAAV 154
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 43/90 (47%)
Query: 303 APFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGM 362
FDPEGLLS+V A VT ++G+ G + R W+ ++ + +
Sbjct: 215 GSFDPEGLLSTVPAVVTAMLGMFTGEFVRRSDIRGGRKTLWMAAAAAALLAAGLAFSGVL 274
Query: 363 HLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+NK L+S ++ C+ S + A Y+++
Sbjct: 275 PVNKKLWSSTFVCVVGAYSLGMFALFYYLI 304
>gi|311274235|ref|XP_003134250.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
partial [Sus scrofa]
Length = 297
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 64/240 (26%), Positives = 101/240 (42%), Gaps = 49/240 (20%)
Query: 164 GVLQRIAIAYLVAALCEIWLKGD--GHVSSKLSLFR-----KYRGHWVVALVLTTLYLLL 216
GVLQR+ + Y V A+ E+ +S+ S F W+ LVL ++L L
Sbjct: 9 GVLQRLGVTYFVVAVLELLFAKPVPESCASERSCFSLLDVTSSWPQWLFVLVLEGVWLAL 68
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTG--PACN--AVGMIDRKILGIQHLY 272
+ L VP T P G G G P C A G IDR +LG HLY
Sbjct: 69 TFFLPVPG------CPTGYLGP-------GGIGDLGKYPNCTGGAAGYIDRLLLGDDHLY 115
Query: 273 RKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVH 332
+ P + + +DPEG+L ++ + + +G+ G ++++
Sbjct: 116 QHPSPAVLYHTKVA------------------YDPEGILGTINSILMAYLGVQAGKILLY 157
Query: 333 FKDHRDRMLNWIILSSCLIGL------GLSLDFVGMHLNKALYSLSY-TCLTAGASGVLL 385
+KD +L + C +GL S + + +NK L+S SY T L++ A +LL
Sbjct: 158 YKDRTKGILIRFAVWGCFLGLISVALTKASENEGFIPVNKNLWSTSYVTTLSSSAFLILL 217
>gi|91794054|ref|YP_563705.1| hypothetical protein Sden_2703 [Shewanella denitrificans OS217]
gi|91716056|gb|ABE55982.1| conserved hypothetical protein [Shewanella denitrificans OS217]
Length = 400
Score = 54.7 bits (130), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 93/373 (24%), Positives = 149/373 (39%), Gaps = 90/373 (24%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRG-----------LTVALMILVDDVGGILPA--INHS 86
S + QT + RL SLD RG L AL L G + + + HS
Sbjct: 19 STEHKGQTSTSLNKPRLKSLDALRGFDMFWIIGGEGLFAALFTLTGWAGWNIASRQMQHS 78
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVAT-----RKAILRALNLFLLGIFLQ 141
W+G TL D + P F+F+ GV+L L+ K + A + A R + L LGI
Sbjct: 79 QWHGFTLYDLIFPLFIFLSGVALGLSPKRLDQQAFAVALPLYQHACKRLILLIALGILYN 138
Query: 142 GGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRG 201
G+ GI D+ +IR+ VL RI A+ AA+ +W ++LS+
Sbjct: 139 HGWGTGIP-----ADLDKIRYSSVLARIGFAWFFAAML-VW-------HTRLSI------ 179
Query: 202 HWVVALVLTTLYLLLLYGLYVPDWQY-EFPVETSSSSPWIFNVTCGVRGSTGPACNAVGM 260
+V++ + LY L L VP Q +F ++ S + V G P
Sbjct: 180 QVIVSVSIIGLYTLAQLYLPVPGGQAGQFTLDAS--------INTYVDGLLRPGIA---- 227
Query: 261 IDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTC 320
Y PLD PEG+LS+V A +
Sbjct: 228 -----------------------------YQDRPLD----------PEGILSTVPAVINA 248
Query: 321 LIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGM-HLNKALYSLSYTCLTAG 379
++G+ G I+ D ++++ ++ L L+ M +NK L++ S+ +T+G
Sbjct: 249 MVGVFAGQFIIRAHSRGDWAKVGVLIACGVLLLVLAWLLEPMIPVNKDLWTTSFVLVTSG 308
Query: 380 ASGVLLAGIYFMV 392
S + L+ Y ++
Sbjct: 309 WSLLFLSLFYVII 321
>gi|292609605|ref|XP_002660455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Danio
rerio]
Length = 292
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 6/94 (6%)
Query: 10 DANDQLVIQISKSTSAPANEKLERDPLLPPS---NSNSKQTRPQHQQRRLISLDVFRGLT 66
DA L+I++ + E+L L P+ +++++ P RRL SLD FRGL+
Sbjct: 202 DAVTNLLIRLGGTVET---ERLINSELGTPNRLVDASTESIIPATTGRRLRSLDTFRGLS 258
Query: 67 VALMILVDDVGGILPAINHSPWNGLTLADFVMPF 100
+ +M+ V+ GG H WNGLT+AD V P+
Sbjct: 259 LVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPW 292
>gi|134025078|gb|AAI35092.1| Unknown (protein for IMAGE:7224994) [Danio rerio]
Length = 291
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/94 (35%), Positives = 51/94 (54%), Gaps = 6/94 (6%)
Query: 10 DANDQLVIQISKSTSAPANEKLERDPLLPPS---NSNSKQTRPQHQQRRLISLDVFRGLT 66
DA L+I++ + E+L L P+ +++++ P RRL SLD FRGL+
Sbjct: 201 DAVTNLLIRLGGTVE---TERLINSELGTPNRLVDASTESIIPATTGRRLRSLDTFRGLS 257
Query: 67 VALMILVDDVGGILPAINHSPWNGLTLADFVMPF 100
+ +M+ V+ GG H WNGLT+AD V P+
Sbjct: 258 LVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPW 291
>gi|456985620|gb|EMG21387.1| hypothetical protein LEP1GSC150_0590 [Leptospira interrogans
serovar Copenhageni str. LT2050]
Length = 77
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/53 (50%), Positives = 35/53 (66%), Gaps = 3/53 (5%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFF 101
Q R++SLD+FRG+TVA MILV++ G I + H+ WNG T D V PFF
Sbjct: 6 NQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFF 58
>gi|320333679|ref|YP_004170390.1| hypothetical protein [Deinococcus maricopensis DSM 21211]
gi|319754968|gb|ADV66725.1| hypothetical protein Deima_1072 [Deinococcus maricopensis DSM
21211]
Length = 376
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 47/83 (56%), Gaps = 11/83 (13%)
Query: 55 RLISLDVFRGLTVALMILVDDVG---GILPAINHSPW-NGLTLADFVMPFFLFIVGVSLA 110
RL +LD +RGLTV LM+LV++V + H+PW G TLAD V P+FLF G +L
Sbjct: 27 RLAALDAWRGLTVLLMLLVNNVALDWRTPKELMHAPWGGGATLADLVFPWFLFCAGTAL- 85
Query: 111 LTYKNFPCKVVATRKAILRALNL 133
P + + R+A +R L
Sbjct: 86 ------PFSLASARRAGVRGWAL 102
>gi|393787642|ref|ZP_10375774.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
CL02T12C05]
gi|392658877|gb|EIY52507.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
CL02T12C05]
Length = 373
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 78/358 (21%), Positives = 131/358 (36%), Gaps = 86/358 (24%)
Query: 53 QRRLISLDVFRGLTVALMIL----------VDDVGGILPAIN---HSPWNGLTLADFVMP 99
+RL SLD+ RG + +++ +++ + P N H W G+ D +MP
Sbjct: 5 NKRLASLDLLRGFDLFCLLMLQPILMTWLEIENNPSLDPITNQFTHVEWQGVAFWDLIMP 64
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAILRALN----LFLLGIFLQGGFFHGINNLKYGV 155
F+F+ G+++ + R R LF LG +QG +
Sbjct: 65 LFMFMSGITIPFAMSKYKQGEKIDRHFYFRLFKRFFVLFFLGWVVQGNLL--------AL 116
Query: 156 DIAQIR-WMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
DI Q + LQ IA+ Y+VAAL +W +S + F Y LV T
Sbjct: 117 DIRQFHIFANTLQAIAVGYVVAALLYVWCSFRTQISFTVLCFITY------LLVFAT--- 167
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRK 274
+ D YE P N IDR +LG R
Sbjct: 168 -------IGDMNYE------------------------PGTNIAEEIDRCVLGSL---RD 193
Query: 275 PIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
+ S +S + +LSS+ VT ++G GH++ K
Sbjct: 194 GVTWTNGTWSFDS----------------SYHYTWILSSLNFIVTVMLGSFAGHILRLRK 237
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ R+ ++ L+ + L +D + + K ++S S T G +L+ Y+++
Sbjct: 238 EPVQRLKVLLLTGGFLVAVALLMDPL-FPIIKRIWSSSMTLFYGGVCFLLMGIFYYLI 294
>gi|375148919|ref|YP_005011360.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361062965|gb|AEW01957.1| Protein of unknown function DUF2261, transmembrane [Niastella
koreensis GR20-10]
Length = 397
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 40/101 (39%), Positives = 57/101 (56%), Gaps = 11/101 (10%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSPWN--GLTLADFVMPFFL 102
RP Q RL S+DVFR LT+ LMI V+D+G + +P + H+ N G+ LAD V P FL
Sbjct: 2 RPFTQ--RLASIDVFRALTMLLMIFVNDLGTLKNIPLWLEHTKANEDGMGLADTVFPAFL 59
Query: 103 FIVGVSLALTYKNFPCKVVATRK----AILRALNLFLLGIF 139
FIVG+S+ N K + ++R+ L ++G F
Sbjct: 60 FIVGLSIPFAIGNRWAKGASQSNILGHILIRSFALLVMGFF 100
>gi|345517559|ref|ZP_08797028.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
gi|254837353|gb|EET17662.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
Length = 359
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 65/138 (47%), Gaps = 19/138 (13%)
Query: 55 RLISLDVFRGLTVALMILVDDV-------------GGILPAINHSPWNGLTLADFVMPFF 101
RL SLD+ RGL + L++ V IL H W G D VMP F
Sbjct: 9 RLDSLDMLRGLDLFLLVFFQPVLMSFGQQTDFPWMTSILYQFEHEVWVGFRFWDLVMPLF 68
Query: 102 LFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIR 161
LF+ GVS+ ++ + + ++ R A+ R + L +FL G G NL G+D I
Sbjct: 69 LFMTGVSMPFSFAKY--RDISDRNAVYRKITRRFLLLFLLGMVVQG--NL-LGLDWEHIY 123
Query: 162 -WMGVLQRIAIAYLVAAL 178
+ LQ IA YL+AAL
Sbjct: 124 LYNNTLQAIAAGYLIAAL 141
>gi|343082821|ref|YP_004772116.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342351355|gb|AEL23885.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 367
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 46/154 (29%), Positives = 67/154 (43%), Gaps = 24/154 (15%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDV------GGILP-------AINHSPWNGLTLADFVM 98
+ R++S+D RG + +I D GG P +H W G T D +M
Sbjct: 7 KPNRILSIDALRGFDMLFIIFADRFFALLHKGGQTPFTGFLANQFSHPDWFGSTFYDIIM 66
Query: 99 PFFLFIVGV----SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYG 154
P FLF+VG SL+ + K +K R L LF LG +QG N L
Sbjct: 67 PLFLFMVGAVIPFSLSKRMQENTGKAQIYKKLFKRVLILFFLGWIVQG------NLLALD 120
Query: 155 VDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGH 188
++ +I + LQ IA+ Y + L I+L +G
Sbjct: 121 INTFKI-FSNTLQAIAVGYFFSCLAFIYLSRNGR 153
>gi|408673239|ref|YP_006872987.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
gi|387854863|gb|AFK02960.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
Length = 370
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 44/146 (30%), Positives = 64/146 (43%), Gaps = 25/146 (17%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGILPAINHSP-------------WNGLTLADFVMPF 100
+RL SLD RG + ++ +D L H P WNG D + P
Sbjct: 8 QRLYSLDALRGFDMFWIMGGEDFFHALSEATHHPAAIWIATQLSHVAWNGFRFYDLIFPL 67
Query: 101 FLFIVGV----SLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
FLFI GV S+ + K R+ I R L L LLG+ G +K +
Sbjct: 68 FLFISGVSTPYSVGREIEKGIDKQAILRRIIKRGLILVLLGVIYNNGL-----QIK---E 119
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIW 182
++QIR+ VL RI +AY+ A + ++
Sbjct: 120 LSQIRFPSVLGRIGLAYMFACIIYVY 145
>gi|374384982|ref|ZP_09642493.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
12061]
gi|373227040|gb|EHP49361.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
12061]
Length = 382
Score = 52.8 bits (125), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 84/357 (23%), Positives = 141/357 (39%), Gaps = 78/357 (21%)
Query: 54 RRLISLDVFRGLT----VALMILVDDVGGILPA---------INHSPWNGLTLADFVMPF 100
+RL SLD RG VAL L++ + A NH W G + D +MP
Sbjct: 12 KRLESLDALRGFDLFFLVALGPLMNSLARAADAEWFNNWMGIFNHVSWEGFSPWDLIMPL 71
Query: 101 FLFIVGVSLALT---YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDI 157
FLF+ G+S+ YK+ P K R+ R L L++ G+ QG G+N K +
Sbjct: 72 FLFMSGISMPFALARYKSMPDKRPLLRRLGKRILLLWIFGMICQGNLL-GLNPDKIYL-- 128
Query: 158 AQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLL 217
+ LQ IA YL+ AL L LF + R ++T + LLL+
Sbjct: 129 ----YSNTLQAIAAGYLITAL--------------LFLFTRRRTQ-----LITAVLLLLI 165
Query: 218 YGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIY 277
Y W ++ G+ P N ID+ +LG +
Sbjct: 166 Y------WAAMQFIQVDGYGG----------GNYTPQGNLAEWIDKVVLG--------RF 201
Query: 278 SRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHR 337
T Q +D + +LSS+ VT L GL G++ + +
Sbjct: 202 RDTAQL-----------VDGKVVVAEWYHYTWILSSLNFGVTVLTGLFAGYIAKDKIEEK 250
Query: 338 DRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRY 394
++ + + + ++ +G +F M + K +++ S +++G +L+ Y+ + Y
Sbjct: 251 RKLKLYFGIGATMVTIGWLWNF-QMPVIKTIWTSSMVLVSSGYCFLLMGLFYYWIDY 306
>gi|317503636|ref|ZP_07961655.1| conserved hypothetical protein, partial [Prevotella salivae DSM
15606]
gi|315665261|gb|EFV04909.1| conserved hypothetical protein [Prevotella salivae DSM 15606]
Length = 59
Score = 52.8 bits (125), Expect = 3e-04, Method: Composition-based stats.
Identities = 24/52 (46%), Positives = 36/52 (69%), Gaps = 2/52 (3%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMPFFLFI 104
R+ ++D+ RG+T+A MILV++ GG + + H+ W GLT D V PFF+FI
Sbjct: 8 RIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWLGLTPTDLVFPFFMFI 59
>gi|389866878|ref|YP_006369119.1| hypothetical protein MODMU_5285 [Modestobacter marinus]
gi|388489082|emb|CCH90660.1| conserved transmembrane protein of unknown function [Modestobacter
marinus]
Length = 327
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 27/63 (42%), Positives = 39/63 (61%), Gaps = 2/63 (3%)
Query: 54 RRLISLDVFRGLTVALMILVDDVG--GILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RRL +DV RGL V M++VD+ G I +H+ W+GL +AD V P FL + GVS+
Sbjct: 2 RRLHGVDVLRGLAVVGMLVVDNRGNASIATQWHHAAWDGLHVADVVFPAFLLVAGVSMPF 61
Query: 112 TYK 114
+ +
Sbjct: 62 SRR 64
>gi|456861512|gb|EMF80162.1| hypothetical protein LEP1GSC188_2620 [Leptospira weilii serovar
Topaz str. LT2116]
Length = 88
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/59 (42%), Positives = 36/59 (61%), Gaps = 3/59 (5%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFF 101
+ + + R++SLD+FRG+TV MILV++ G I + H+ WNG T D V PFF
Sbjct: 2 EKKSTQNKDRILSLDLFRGMTVIGMILVNNPGSWSYIYSPLKHAKWNGCTPTDLVFPFF 60
>gi|210622217|ref|ZP_03293007.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
gi|210154351|gb|EEA85357.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
Length = 483
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 56/243 (23%), Positives = 104/243 (42%), Gaps = 49/243 (20%)
Query: 12 NDQLVIQISKSTSAPANEK---LERDPLLPPSNSNSKQT--------RPQHQQ------- 53
D + + S+ ++P +K E+ L S+++QT +PQ ++
Sbjct: 53 EDAYINEESRKDTSPKQKKESKREKKVELSREESSTEQTVINRGEKEQPQAREVVTGDPL 112
Query: 54 -RRLISLDVFRGLTVALMILVDDVG--GILPA-INHSPWNGLTLADFVMPFFL----FIV 105
RR ++++ G+ V +I +G G +PA + S WNG++ D +P L F++
Sbjct: 113 KRRYTTVELIMGVAVIAIICSSGIGVLGEMPAFLAFSKWNGISFGDLGLPLLLASVCFMI 172
Query: 106 GVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGV 165
+ L K K + + LF++GI + NL + R MG+
Sbjct: 173 PTEVELDVKRKKSFKEICIKKVKVGIILFVIGILI---------NLIGAWNFNSFRIMGI 223
Query: 166 LQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYR-GHWVVALVLTTLYLLLLYGLYVPD 224
LQ IA+ Y++ +L + LFR++ V+A+ LT + ++ L G Y
Sbjct: 224 LQMIAVVYMLGSLLYV-------------LFRRFNFKSSVIAVFLTAIGVVGLAGYYFAS 270
Query: 225 WQY 227
+
Sbjct: 271 ANF 273
>gi|431799248|ref|YP_007226152.1| hypothetical protein Echvi_3932 [Echinicola vietnamensis DSM 17526]
gi|430790013|gb|AGA80142.1| Protein of unknown function (DUF1624) [Echinicola vietnamensis DSM
17526]
Length = 412
Score = 52.4 bits (124), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 37/112 (33%), Positives = 59/112 (52%), Gaps = 15/112 (13%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPWNG--------L 91
+ SK + P + +R ++DVFR +T+ LMI V+D+ + + W G L
Sbjct: 9 TQRPSKVSEPIIEAKRSYAIDVFRAVTMLLMIFVNDLWTLE---GYPDWLGHAAVGEDRL 65
Query: 92 TLADFVMPFFLFIVGVSLALTYKN-FPCKVVATRKA---ILRALNLFLLGIF 139
+D + P FLFIVG+S+ +N F ++ + A ILR L L ++GIF
Sbjct: 66 GFSDVIFPAFLFIVGLSIPFALQNRFRKRIPKIKLAEHIILRGLALLVMGIF 117
>gi|372268269|ref|ZP_09504317.1| hypothetical protein AlS89_10220 [Alteromonas sp. S89]
Length = 365
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 71/147 (48%), Gaps = 27/147 (18%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILP---------------AINHSPWNGLTLADF 96
+++RL S+D RG + +I + + LP + H+PW+G T D
Sbjct: 4 KKQRLASVDALRGFDMFWIIGGEAL--FLPLFALTGWSIFQFGHAQMQHTPWHGFTFYDL 61
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVAT-----RKAILRALNLFLLGIFLQGGFFHGINNL 151
+ P F+F+ GV+L L K+ V+ RKA R L L LLGI G+ GI
Sbjct: 62 IFPLFIFLSGVTLGLANKSLRGLPVSQRAPVYRKATKRLLLLVLLGILYNHGWGTGIP-- 119
Query: 152 KYGVDIAQIRWMGVLQRIAIAYLVAAL 178
D+++IR+ VL RI A+ AA+
Sbjct: 120 ---ADLSEIRYASVLARIGFAWFFAAM 143
>gi|288800484|ref|ZP_06405942.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
F0039]
gi|288332697|gb|EFC71177.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
F0039]
Length = 409
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 42/156 (26%), Positives = 69/156 (44%), Gaps = 30/156 (19%)
Query: 44 SKQTRPQHQQRRLISLDVFRGLTVALMILVDDV----------------GGILPAINHSP 87
+K T P RL+SLD+ RG +AL++L + + I H P
Sbjct: 31 NKATAPH--SGRLLSLDLLRGADLALLVLFQPIIYQWVEASEPTPGSFGEMVFGQITHVP 88
Query: 88 WNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL-----RALNLFLLGIFLQG 142
W G D +MP F+F+ G+++ + + V K L R + L++LG+ QG
Sbjct: 89 WEGFCFWDIIMPLFMFMSGITIPFSMGKYQQGKVKADKGFLWRLLKRFVVLWVLGMIAQG 148
Query: 143 GFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
N L + + + + LQ IA+ Y++ AL
Sbjct: 149 ------NLLLFDPRLIHL-YSNTLQSIAVGYVMVAL 177
>gi|223936398|ref|ZP_03628310.1| conserved hypothetical protein [bacterium Ellin514]
gi|223894916|gb|EEF61365.1| conserved hypothetical protein [bacterium Ellin514]
Length = 427
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 89/393 (22%), Positives = 154/393 (39%), Gaps = 77/393 (19%)
Query: 39 PSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILP-------------AINH 85
P+ + S + PQ RL+S+D RG + +I D + L ++H
Sbjct: 13 PAKAASTASIPQ----RLMSVDALRGFDMFWIIGADSLVYALHRLSQNRVTDFLGLQLDH 68
Query: 86 SPWNGLTLADFVMPFFLFIVGVSLALTYKNFP---CKVVATRKAILRALNLFLLGIFLQG 142
W G D + P F+FI+GVS+ + + A ++ R+ LF++ + G
Sbjct: 69 CDWAGFHFYDLIFPLFVFIMGVSVVFSLTKAIQQLGRAEAVKRVFRRSALLFVVALIYSG 128
Query: 143 GFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGH 202
G + IR +GVL RIA+ Y V L + K V+ +L G+
Sbjct: 129 GVRSAWPD---------IRLLGVLNRIALCYFVGGLIFCFFKPRAMVAIAAALL---IGY 176
Query: 203 WVVALVLTTLYLLLLY----GLYVPDWQYEFPVETSSSSPW--IFNVTCGVRGSTGPACN 256
W + + + + + V + + +T S P +N T V N
Sbjct: 177 WSIMTFVPIRDIRMAHYKEKHELVDNDVDKIMQDTGVSDPAKIFYNTTNWVTAKYDMGYN 236
Query: 257 AVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMA 316
+D K LG + +DPEGLLS++ A
Sbjct: 237 VANHLDFKYLGGRKY------------------------------DTYWDPEGLLSTIPA 266
Query: 317 TVTCLIGLHFGHLIVHFKDHRDRMLNWIILS----SCLIGLGLSLDFVGMHLNKALYSLS 372
TCL+G+ L++ ++ DR +LS ++G S+ F + K +++ S
Sbjct: 267 VATCLLGIL-AGLLLRSTNYCDRWKVIYLLSLGAAGVILGFLWSIQF---PVVKKIWTSS 322
Query: 373 YTCLTAGASGVLLAGIYFMVRYISSHLMLKKPF 405
+ + G S +LL GI++ V + + +PF
Sbjct: 323 FVLVAGGFSAILL-GIFYQVVDVWKYQKWCQPF 354
>gi|430745463|ref|YP_007204592.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
18658]
gi|430017183|gb|AGA28897.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
18658]
Length = 391
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 49/158 (31%), Positives = 69/158 (43%), Gaps = 20/158 (12%)
Query: 43 NSKQTRPQHQQRRLISLDVFRG-----------LTVALMILVDDVGG--ILPAINHSPWN 89
+S RL+S+D RG L AL D G + + H+ W+
Sbjct: 16 DSDSIAAPKPSERLLSIDALRGFDMLWIIGGERLAKALARWSDSSAGKVVQEQLEHAEWH 75
Query: 90 GLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGIN 149
G L D + P FLF+VG L + + R A R + L +F G +G+
Sbjct: 76 GFRLNDLIFPLFLFLVGTVLPFSLGKLQGQ---GRGAEYRRIARRTLLLFALGLLCNGV- 131
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDG 187
LK+ D A +R GVLQRIA+ Y +AAL +W G
Sbjct: 132 -LKF--DWANLRVAGVLQRIALCYGIAALISLWFSRRG 166
>gi|390946391|ref|YP_006410151.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
gi|390422960|gb|AFL77466.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
Length = 366
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 45/153 (29%), Positives = 66/153 (43%), Gaps = 40/153 (26%)
Query: 50 QHQQRRLISLDVFRGLTVALMI----LVDDVGGILPA---------INHSPWNGLTLADF 96
Q RL+SLD RG + ++ LV V G+ P+ ++H W+G D
Sbjct: 1 MKQSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDAAAASMSHVAWDGFAHHDT 60
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATR-----------KAILRALNLFLLGIFLQGGFF 145
+ P FLFI GVS FP V R K + R L L +LG+
Sbjct: 61 IFPLFLFIAGVS-------FPYSVAKQRAGGMSEGRIYAKIVRRGLTLVVLGM------- 106
Query: 146 HGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
+ N + +D +R VL RI +A+ +AA+
Sbjct: 107 --VYNGLFKLDFENLRIASVLGRIGLAWSIAAV 137
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 24/90 (26%), Positives = 43/90 (47%)
Query: 303 APFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGM 362
FDPEGLLS+V A VT ++G+ G + R W+ ++ + +
Sbjct: 198 GSFDPEGLLSTVPAVVTAMLGMFTGEFVRRGDIRGGRKTLWMAAAAAALLAAGLAFSGVL 257
Query: 363 HLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+NK L+S ++ C+ S + A Y+++
Sbjct: 258 PVNKKLWSSTFVCVVGAYSLGMFALFYYLI 287
>gi|410660784|ref|YP_006913155.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
gi|409023140|gb|AFV05170.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
Length = 368
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 84/336 (25%), Positives = 135/336 (40%), Gaps = 80/336 (23%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112
R+ +LD R L+V L+ L G L I H+PW G T DF P F+ + G S+A+
Sbjct: 7 RIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSGTSMAIV 66
Query: 113 YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIA 172
Y+ K V + I R L ++G+ + + + ++++R+ GVLQ +A
Sbjct: 67 YR----KHVPWVRLIRRFFVLLIIGLIFN-------SLVSWEFQLSELRFTGVLQVLAFT 115
Query: 173 YLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL-LLLYGLYVPDWQYEFPV 231
++ AL VS K+ + L++ YL +LLY
Sbjct: 116 GIMTALIT-------RVSG------KWFWPFTAGLLILAAYLGILLY------------- 149
Query: 232 ETSSSSPWIFNVTCGVRGST-GPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDY 290
TS S P GS P N GM D I HLY
Sbjct: 150 -TSQSFP----------GSLPSPDHNLSGMTDPFIFTKSHLYVH---------------- 182
Query: 291 GPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCL 350
A +DPEG+ + A + L G G + + RD + + + L
Sbjct: 183 ----------GDAGYDPEGICTLFSAIASTLFGYTAGLFLNNKNIGRDFLKILALAAVLL 232
Query: 351 IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
+ L +F+ + K L++ S+ L++GA+ ++LA
Sbjct: 233 LLTPLLSNFI--PIGKRLWTPSFVTLSSGATILILA 266
>gi|325106033|ref|YP_004275687.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324974881|gb|ADY53865.1| hypothetical protein Pedsa_3330 [Pedobacter saltans DSM 12145]
Length = 397
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 58/99 (58%), Gaps = 8/99 (8%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDV--GGILPAINHSPWN--GLTLADFVMPFFLFIV 105
+ R++S+D+ RGLT+ LM+ V+D+ G+ + H+ N G+ LAD+V P FLF+V
Sbjct: 3 EKLSVRILSIDIMRGLTLFLMLFVNDLYEPGVPKWLVHTKANVDGMGLADWVFPGFLFMV 62
Query: 106 GVSLALTYKNFPCKVVATRK----AILRALNLFLLGIFL 140
G+S+ K K + K +LRAL+L +GI +
Sbjct: 63 GLSIPYAVKARKAKGESGFKIFVHILLRALSLLFIGILM 101
>gi|406834557|ref|ZP_11094151.1| hypothetical protein SpalD1_23036 [Schlesneria paludicola DSM
18645]
Length = 534
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 58/119 (48%), Gaps = 12/119 (10%)
Query: 5 NPINNDANDQLVIQISKSTSAPAN------EKLERDPLLPPSNSNSKQTRPQHQQRRLIS 58
N N ++D+ ++S S + EK + L S ++ R RL+S
Sbjct: 21 NDFENPSDDRRARVDTRSDSGVTHSWTSRGEKSKGQTLTMNVTSTAEPPRTIAGTERLVS 80
Query: 59 LDVFRGLTVALMILVDDVGGILPAINHSPW----NGLTLADFVMPFFLFIVGVSLALTY 113
LD FRG TVA MILV+ +G A+ HS + N + AD +MP F F VG S LT+
Sbjct: 81 LDQFRGYTVAGMILVNFIGSF--AVVHSIFKHNNNYFSYADSIMPGFHFAVGYSYRLTF 137
>gi|345881756|ref|ZP_08833266.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
gi|343918415|gb|EGV29178.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
Length = 380
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 73/155 (47%), Gaps = 29/155 (18%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDV----------------GGILPAINHSPW 88
+ T Q +RL+SLD+ RG +A+++L+ + I+ ++H PW
Sbjct: 2 ETTTQSSQPQRLLSLDILRGADLAMLVLIQPILFRALKTAHPAEGTIGHFIMGQLSHLPW 61
Query: 89 NGLTLADFVMPFFLFIVGVSLALTYKNFP--CKVVAT--RKAILRALNLFLLGIFLQGGF 144
G D +MP F+F+ G+++ + ++ + I R + L++LG+ +QG
Sbjct: 62 EGFCFWDIIMPLFMFMSGITIPFAMSRYKRGARIDGQFYWRIIKRFVVLWVLGMVVQG-- 119
Query: 145 FHGINNLKYGVDIAQIR-WMGVLQRIAIAYLVAAL 178
N L + D+ Q+ + LQ IA+ Y+ A
Sbjct: 120 ----NLLAF--DLRQLHLFSNTLQSIAVGYVAVAF 148
>gi|403174292|ref|XP_003333277.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170913|gb|EFP88858.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 386
Score = 51.2 bits (121), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 71/141 (50%), Gaps = 17/141 (12%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGILPA-INH--SPWNGLTLADFVMPFFLFIVGVSLA 110
+R S+DV RGLT M+LV+ G + P+ ++H S +T AD + P F+F G++ A
Sbjct: 25 KRDRSIDVLRGLTCLAMVLVNTAGPVRPSWLSHPTSIHQSITFADTLFPCFVFTSGLASA 84
Query: 111 LTYKNFP-CKVVATRKAILRALNLFLLGIF-------LQGGFFHGINNLKYGVDIAQIRW 162
+ KN + + ++ ++RA+ L L+GI L G G+ NLK R
Sbjct: 85 QSKKNEQNGRNPSLKRTLIRAIKLNLIGIAYNNLIPRLAGLHGDGLLNLK------TYRI 138
Query: 163 MGVLQRIAIAYLVAALCEIWL 183
VL I I+ LV L W+
Sbjct: 139 PSVLGTIGISSLVCTLESYWI 159
>gi|187735009|ref|YP_001877121.1| transmembrane protein [Akkermansia muciniphila ATCC BAA-835]
gi|187425061|gb|ACD04340.1| putative transmembrane protein [Akkermansia muciniphila ATCC
BAA-835]
Length = 373
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 50/180 (27%), Positives = 75/180 (41%), Gaps = 39/180 (21%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAIN---------------HS 86
S+ TRPQ R+ ++D RG + L + ++ IN H
Sbjct: 2 SSLSDTRPQ----RIAAIDALRGFD--MFFLTGGLALVVAGINLFYDRSPEWLVKHSTHV 55
Query: 87 PWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR-------KAILRALNLFLLGIF 139
W G D VMP FLFIVG ++ ++ K + + K R + LFLLG+
Sbjct: 56 AWEGFAAWDLVMPLFLFIVGTAMPFSFS----KRIGSEPLWKIYLKVARRVVVLFLLGMV 111
Query: 140 LQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKY 199
+QG N L + + + LQ IA YL+AA+C + L V++ L Y
Sbjct: 112 VQG------NLLSFEPSRMSL-YCNTLQAIASGYLIAAICLLHLSIRWQVAATGGLLAVY 164
>gi|326799399|ref|YP_004317218.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326550163|gb|ADZ78548.1| hypothetical protein Sph21_1988 [Sphingobacterium sp. 21]
Length = 398
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 60/96 (62%), Gaps = 8/96 (8%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG--GILPAINHSPWN--GLTLADFVMPFFLFIVGVS 108
R++S+D+ RGLT+ LM+ V+D+ G+ + H+ + G+ LAD+V P FLFIVGVS
Sbjct: 8 SERILSVDIMRGLTLLLMLFVNDLFEPGVPAWLLHTKVDVDGMGLADWVFPGFLFIVGVS 67
Query: 109 LALTYKNFPCKVVATRKAI----LRALNLFLLGIFL 140
+ ++ K + R+ I +R L+L ++G+++
Sbjct: 68 VPYAIRSRLNKGESKRQIIGHIAVRTLSLLIIGVYI 103
>gi|146302719|ref|YP_001197310.1| hypothetical protein Fjoh_4992 [Flavobacterium johnsoniae UW101]
gi|146157137|gb|ABQ07991.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
Length = 395
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 59/105 (56%), Gaps = 11/105 (10%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSP--WNGLTLADFVMPF 100
+ + +R+IS+D RG+T+ +MI V+++ I +P + H P + +T D V P
Sbjct: 2 KIKENLYNQRIISIDALRGITIFVMIFVNELASIQNVPQWMKHMPADADAMTFVDLVFPA 61
Query: 101 FLFIVGVSLALTY-----KNFPCKVVATRKAILRALNLFLLGIFL 140
FLFIVG+S+ + K KV+ T + RAL L ++G+++
Sbjct: 62 FLFIVGMSVPFAFNARLIKGDSPKVIWTH-TLKRALALIIIGVYM 105
>gi|317505448|ref|ZP_07963366.1| ABC superfamily ATP binding cassette transporter permease subunit
[Prevotella salivae DSM 15606]
gi|315663361|gb|EFV03110.1| ABC superfamily ATP binding cassette transporter permease subunit
[Prevotella salivae DSM 15606]
Length = 380
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/162 (26%), Positives = 73/162 (45%), Gaps = 33/162 (20%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVD---------------DVGG-ILPAIN 84
SN+ +P RL+SLD+ RG +A+++LV VG I+ +
Sbjct: 2 ESNTTLQKPN----RLLSLDILRGADLAMLVLVQPILLKALETMQPAEGTVGHFIMGQLL 57
Query: 85 HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFP----CKVVATRKAILRALNLFLLGIFL 140
H PW G D +MP F+F+ G+++ + R+ + R + L++LG+
Sbjct: 58 HLPWEGFCFWDIIMPLFMFMSGITIPFAMARYKRGERIDGSFYRRILKRFVVLWILGMVC 117
Query: 141 QGGFFHGINNLKYGVDIAQIR-WMGVLQRIAIAYLVAALCEI 181
QG N L + D+ Q+ + LQ IA+ Y+ A +
Sbjct: 118 QG------NLLAF--DLQQLHLYSNTLQSIAVGYVAVAFLYV 151
>gi|196234160|ref|ZP_03132993.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
gi|196221811|gb|EDY16348.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
Length = 417
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 44/135 (32%), Positives = 57/135 (42%), Gaps = 19/135 (14%)
Query: 44 SKQTRPQHQQRRLISLDVFRGLTVALMIL----VDDVGGILP----------AINHSPWN 89
S T P RL S+D +RGL + L++ V LP H W
Sbjct: 21 SAPTTPAPASVRLGSIDAYRGLVMFLLLAEQFRTASVAKALPDSSFWRFLATQQEHVTWT 80
Query: 90 GLTLADFVMPFFLFIVGVSLALTYKNFPCK----VVATRKAILRALNLFLLGIFLQGGFF 145
G L D + P F F+VGV+L + N + T A LRAL L LLGIFL+
Sbjct: 81 GAVLHDMIQPSFSFLVGVALPFSIGNRRARGQSPEATTGHAFLRALILVLLGIFLRST-G 139
Query: 146 HGINNLKYGVDIAQI 160
H N + + QI
Sbjct: 140 HSQTNFTFEDTLTQI 154
>gi|408370371|ref|ZP_11168148.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
gi|407744129|gb|EKF55699.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
Length = 394
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/97 (35%), Positives = 58/97 (59%), Gaps = 8/97 (8%)
Query: 52 QQRRLISLDVFRGLTVALMILVDD--VGGILPAINHS-PW-NGLTLADFVMPFFLFIVGV 107
R++S+D+ RG+T+ LM+ V+D + G+ + H+ W +G+ LAD+V P FLF+VG+
Sbjct: 2 NSNRIMSIDIMRGITLFLMLFVNDLFIPGVPKWLVHTQEWEDGMGLADWVFPGFLFMVGL 61
Query: 108 SLALTYKNFPCKVVATRK----AILRALNLFLLGIFL 140
S+ K K + + I+R L+L L+GI +
Sbjct: 62 SIPYAMKARKNKGQSNLRLWSHVIMRTLSLLLIGILM 98
>gi|320105641|ref|YP_004181231.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
gi|319924162|gb|ADV81237.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
Length = 406
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/135 (31%), Positives = 64/135 (47%), Gaps = 28/135 (20%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPW---------NGLT 92
+ S P QR ++SLD+FRGL +ALMI V++ L I PW + +T
Sbjct: 2 ATSAGPHPLAPQR-ILSLDIFRGLNIALMIFVNE----LAEIKGLPWWTYHAPGKVDVMT 56
Query: 93 LADFVMPFFLFIVGVSLALTYK------NFPCKVVATRKAILRALNLFLLGIFLQGG--- 143
D V P FLFI+G+++ L + P ++ LR+ L +LGI L+ G
Sbjct: 57 YVDMVFPGFLFILGMAIPLALNARIRKGDSPATLLG--YIALRSAALLVLGIILENGGRG 114
Query: 144 ---FFHGINNLKYGV 155
HG+ +G+
Sbjct: 115 DPALMHGLGRYIWGL 129
>gi|284036950|ref|YP_003386880.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
gi|283816243|gb|ADB38081.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
Length = 404
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 41/137 (29%), Positives = 66/137 (48%), Gaps = 28/137 (20%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPW--NGLTLA 94
++ + R R+ ++D+ R +T+ LMI V+D+ + + H P +G+ LA
Sbjct: 5 ETADRQAYRSSSVSMRVDAIDILRAMTMILMIFVNDLWSLTAIPDWLEHVPHGVDGIGLA 64
Query: 95 DFVMPFFLFIVGVSLALTYKNFPCKVVATRKA-----------ILRALNLFLLGIFLQGG 143
D V P FLFIVG+SL P + A R+ I+R++ L ++G+FL G
Sbjct: 65 DVVFPGFLFIVGMSL-------PFAMNARRQKGDTNSALVSHIIMRSIALLVMGVFLVNG 117
Query: 144 F-----FHGINNLKYGV 155
GIN L + V
Sbjct: 118 ESIDQKATGINRLVWNV 134
>gi|410099161|ref|ZP_11294133.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
CL02T12C30]
gi|409219183|gb|EKN12146.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
CL02T12C30]
Length = 377
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 65/149 (43%), Gaps = 29/149 (19%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVD----------DVGGI---LPAINHSPWNGLTLAD 95
+ +RL SLD RG + ++ + DVG + + A NH W G + D
Sbjct: 2 EKTTYKRLESLDALRGFDLFFLVALGPLAHSLARAADVGWLNDCMWAFNHVQWEGFSPWD 61
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALN-----LFLLGIFLQGGFFHGINN 150
+MP FLF+ G S+ F K V+ +K + R L L++ G+ QG
Sbjct: 62 LIMPLFLFMSGASMPFALSRF--KGVSDKKTLFRRLGKRILLLWIFGMMCQGNLL----- 114
Query: 151 LKYGVDIAQIR-WMGVLQRIAIAYLVAAL 178
G D +I + LQ IA YL+ A+
Sbjct: 115 ---GFDPDRIYLYSNTLQSIAAGYLITAV 140
>gi|302759308|ref|XP_002963077.1| hypothetical protein SELMODRAFT_404643 [Selaginella moellendorffii]
gi|300169938|gb|EFJ36540.1| hypothetical protein SELMODRAFT_404643 [Selaginella moellendorffii]
Length = 293
Score = 50.4 bits (119), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 23/54 (42%), Positives = 36/54 (66%)
Query: 339 RMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
R++N+I+ + L+ LG L +G+ +NK LYS SY C TAGA+G + +Y +V
Sbjct: 9 RIINFIVPAVILLALGYVLHLLGIKMNKPLYSFSYMCFTAGAAGAVFCLLYILV 62
>gi|436836802|ref|YP_007322018.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
gi|384068215|emb|CCH01425.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
Length = 401
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 43/131 (32%), Positives = 71/131 (54%), Gaps = 13/131 (9%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSP--WNGLTLAD 95
++ S + P R+ S+DV R LT+ LMI V+D+ + +P + H P +G+ LAD
Sbjct: 3 STASATSSPFRVLTRVDSIDVLRALTMVLMIFVNDLWSLTAIPGWLEHVPEGADGIGLAD 62
Query: 96 FVMPFFLFIVGVSL--ALTYKNFPCKVVA--TRKAILRALNLFLLGIFLQGGFFHGINNL 151
V P FLFIVG+S+ A+ ++ + A + RA L ++G++L G + +N
Sbjct: 63 VVFPAFLFIVGLSIPFAIQHRRTRHETDAQIAGHVLTRAAALLVMGLWLVNGEY--LNEA 120
Query: 152 KYGVDIAQIRW 162
G IA++ W
Sbjct: 121 ATG--IARLGW 129
>gi|430747657|ref|YP_007206786.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
18658]
gi|430019377|gb|AGA31091.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
18658]
Length = 418
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 42/141 (29%), Positives = 63/141 (44%), Gaps = 22/141 (15%)
Query: 19 ISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMIL----VD 74
+S ++ +PA + PP +S S RRL S+D FRG + L++ +
Sbjct: 1 MSPNSPSPAAGDRPVNAPKPPESSGSGSA----PSRRLASIDAFRGFVMFLLLAEWLKLP 56
Query: 75 DVGGILPAIN----------HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR 124
V P H W G +L D + P F F+VGV+L + + + +T
Sbjct: 57 QVAKSFPKSELWALLSRHQQHVEWVGCSLHDLIQPSFSFLVGVALPFSIASRLARGQSTT 116
Query: 125 K----AILRALNLFLLGIFLQ 141
+ A RAL L LLGIFL+
Sbjct: 117 RMAGHAFWRALVLVLLGIFLR 137
>gi|410657728|ref|YP_006910099.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
DCA]
gi|409020083|gb|AFV02114.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
DCA]
Length = 370
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 84/338 (24%), Positives = 135/338 (39%), Gaps = 80/338 (23%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGIL--PAINHSPWNGLTLADFVMPFFLFIVGVSLA 110
R+ +LD R L+V L+ L G L I H+PW G T DF P F+ + G S+A
Sbjct: 5 TNRIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSGTSMA 64
Query: 111 LTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIA 170
+ Y+ K V + I R L ++G+ + + + ++Q+R+ GVLQ +A
Sbjct: 65 IAYR----KHVPWVRLIRRFFVLIIIGLIFN-------SLVSWEFHLSQLRFTGVLQVLA 113
Query: 171 IAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL-LLLYGLYVPDWQYEF 229
++ L VS K+ + L++ YL +LLY
Sbjct: 114 FTGIMTTLIT-------RVSG------KWFWPFTAGLLILAAYLGILLY----------- 149
Query: 230 PVETSSSSPWIFNVTCGVRGST-GPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSP 288
TS S P GS P N GMID I HLY
Sbjct: 150 ---TSQSFP----------GSLPSPDHNLSGMIDPFIFTKSHLYVH-------------- 182
Query: 289 DYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSS 348
A +DPEG+ + A + L G G + + R+ + + +
Sbjct: 183 ------------GDAGYDPEGICTLFSAIASTLFGYTAGLFLNNKNIGRNFLKILALAAV 230
Query: 349 CLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLA 386
L+ L +F+ + K L++ S+ L++GA+ ++LA
Sbjct: 231 LLLLTPLLSNFIP--IGKRLWTPSFVTLSSGATILVLA 266
>gi|300123408|emb|CBK24681.2| unnamed protein product [Blastocystis hominis]
Length = 213
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/89 (31%), Positives = 47/89 (52%), Gaps = 3/89 (3%)
Query: 15 LVIQISKSTSAPANEKLERDPLLPPSNSNSKQTR-PQHQQRRLISLDVFRGLTVALMILV 73
++ I K + A+ K E + P KQ+ Q + R+ S+DVFRG+T+ +MI
Sbjct: 124 MIENIRKGRNEEAS-KTEASSVNEPLIQKEKQSVVSQPMKSRVQSIDVFRGITICIMIFA 182
Query: 74 D-DVGGILPAINHSPWNGLTLADFVMPFF 101
+ G ++ H+ W+G+T ADF P +
Sbjct: 183 NYGAGQYSHSLMHAAWDGITFADFAFPLY 211
>gi|386821099|ref|ZP_10108315.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
gi|386426205|gb|EIJ40035.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
Length = 395
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/110 (32%), Positives = 63/110 (57%), Gaps = 10/110 (9%)
Query: 55 RLISLDVFRGLTVALMILVDDVG--GILPAINHSP--WNGLTLADFVMPFFLFIVGVSLA 110
R++S+D+ RGLT+ LM+ V+D+ G+ + HS + + LAD+V P FLF+VG+S+
Sbjct: 6 RILSIDIMRGLTLFLMLFVNDLFEPGVPKWLVHSKATEDAMGLADWVFPGFLFMVGLSIP 65
Query: 111 LTY----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
+ K + + ++R L+L L+G+F+ H IN G++
Sbjct: 66 FAFLSRRKKGEGDLEILKHILVRTLSLLLIGVFMIN--IHEINAELSGIN 113
>gi|409198223|ref|ZP_11226886.1| hypothetical protein MsalJ2_14356 [Marinilabilia salmonicolor JCM
21150]
Length = 394
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 37/119 (31%), Positives = 57/119 (47%), Gaps = 24/119 (20%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSP-W--------NGLTLADFVMPFFLFIV 105
R+ S+D+ R +TVALM+ V+D LP I P W +G+ LAD V P FLF V
Sbjct: 9 RIKSIDILRAITVALMVFVND----LPGIRDIPQWLGHASAGHDGMFLADIVFPLFLFWV 64
Query: 106 GVSLALTY----KNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQI 160
G+S+ L K + R + R +L +G+ + +N + V++ I
Sbjct: 65 GMSIPLAVDGRQKKGDSDLTIVRHILKRTFSLVFIGVLM-------VNTSRMSVEVTGI 116
>gi|149178821|ref|ZP_01857402.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
8797]
gi|148842362|gb|EDL56744.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
8797]
Length = 405
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 52/105 (49%), Gaps = 5/105 (4%)
Query: 41 NSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVM 98
N+ T + +R++SLD FRG TVA M LV+ +G + P + + AD +M
Sbjct: 2 TDNTSSTTAEKFNKRIVSLDQFRGYTVAGMFLVNYMGFFVVCPVVLKHHNTYCSYADTIM 61
Query: 99 PFFLFIVGVSLALTYK---NFPCKVVATRKAILRALNLFLLGIFL 140
P FLF VG + LT+ V A + + R L L L+ + +
Sbjct: 62 PHFLFAVGFAFRLTFGRRVQTAGAVSAYARVVRRLLGLVLVSLII 106
>gi|329849634|ref|ZP_08264480.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
gi|328841545|gb|EGF91115.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
Length = 410
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 35/113 (30%), Positives = 60/113 (53%), Gaps = 9/113 (7%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDV---GGILPAINH--SPWNGLTLA 94
+K ++Q R+ ++D+ R LT+ LMI V+D+ G+ + H S +G+ L+
Sbjct: 6 EKGRAKTMPNKNQFSRVGAIDLVRALTMVLMIFVNDLWSLKGVPVWLEHVASGVDGMGLS 65
Query: 95 DFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGG 143
D V P FLFIVG+SL + + + +L R++ L ++G+FL G
Sbjct: 66 DVVFPAFLFIVGLSLPFAVSSRQARGDSLGSTVLHILGRSVALLVMGVFLVNG 118
>gi|256424049|ref|YP_003124702.1| hypothetical protein Cpin_5069 [Chitinophaga pinensis DSM 2588]
gi|256038957|gb|ACU62501.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
Length = 390
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 35/96 (36%), Positives = 53/96 (55%), Gaps = 9/96 (9%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI--LPA-INH--SPWNGLTLADFVMPFFLFIVGV 107
+RL+S+D FR LT+ MI V+DV G+ +P I+H + +G+ AD V P FLFIVG+
Sbjct: 5 SQRLLSIDAFRALTMLTMIFVNDVSGVKNIPEWIDHVKAQDDGMGFADTVFPAFLFIVGL 64
Query: 108 SLALTYKNFPCK----VVATRKAILRALNLFLLGIF 139
S+ K +LR+L + ++G F
Sbjct: 65 SIPFAIGKRISKQDSFFSIESHILLRSLAMIVMGFF 100
>gi|440749360|ref|ZP_20928608.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
gi|436482365|gb|ELP38488.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
Length = 401
Score = 49.3 bits (116), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 56/110 (50%), Gaps = 18/110 (16%)
Query: 39 PSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDD---VGGILPAINH--SPWNGLTL 93
P N+N RR+ S+DVFR +T+ LMI V+D + GI + H + +G+ L
Sbjct: 5 PENANP---------RRVYSIDVFRAITMMLMIFVNDLWTLEGIPAWLGHVDAKEDGMGL 55
Query: 94 ADFVMPFFLFIVGVSLALTYKNFPCK---VVATRKAI-LRALNLFLLGIF 139
AD V P FL IVG+S+ K + T K I R L L +G+F
Sbjct: 56 ADVVFPAFLVIVGLSIPFALSKRIEKGERLAGTLKHIFFRTLALLTMGVF 105
>gi|430744438|ref|YP_007203567.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
18658]
gi|430016158|gb|AGA27872.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
18658]
Length = 454
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/110 (35%), Positives = 56/110 (50%), Gaps = 13/110 (11%)
Query: 40 SNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADF 96
SN + T+ R++S+D FRG TVA M +V+ +GG I P + H+ N + AD
Sbjct: 47 SNGEAAGTKTG----RIVSMDQFRGYTVAGMCVVNFLGGLQAIHPVLKHNN-NYFSYADT 101
Query: 97 VMPFFLFIVGVSLALT----YKNFPCKVVATRKAILRALNLFLLGIFLQG 142
+MP FLF G S LT F R+ + R+L L LL + + G
Sbjct: 102 IMPSFLFACGFSYRLTALKRLDQF-GPAAMYRRFVWRSLGLVLLSLMMYG 150
>gi|320104555|ref|YP_004180146.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
gi|319751837|gb|ADV63597.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
Length = 399
Score = 48.9 bits (115), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 37/99 (37%), Positives = 50/99 (50%), Gaps = 5/99 (5%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMPFFLFIVG 106
P R +LD FRG TVA MI+V+ VGG+ +PAI + AD +MP F VG
Sbjct: 10 PLSSPSRWDALDQFRGYTVAGMIVVNFVGGLAAVPAILKHHNTYCSYADTIMPQFFLAVG 69
Query: 107 VSLALTYKNFPCK---VVATRKAILRALNLFLLGIFLQG 142
+ T+ N + A R A+ R L L L+G + G
Sbjct: 70 FAYRWTFLNRLERGGWQAAVRHALGRNLGLLLVGFLMYG 108
>gi|350546587|ref|ZP_08915965.1| hypothetical protein GUU_02077 [Mycoplasma iowae 695]
gi|349503825|gb|EGZ31390.1| hypothetical protein GUU_02077 [Mycoplasma iowae 695]
Length = 427
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 43/117 (36%), Positives = 60/117 (51%), Gaps = 20/117 (17%)
Query: 55 RLISLDVFRGLTVALMILVDDVG-----GILPAI-NHSPWNG------LTLADFVMPFFL 102
RL S+D FRG+ V M++ + GIL I NHS G +TLADF+ P F+
Sbjct: 30 RLKSIDAFRGMCVFCMLIFQFLKNFPSLGILSRIANHSLEKGIVILPGMTLADFIAPAFI 89
Query: 103 FIVGVSLALTY---KNFPCKVVATRKAILRALNLFLLGIFLQ-----GGFFHGINNL 151
F +G++ +L++ KN + A AI RAL + +G FL FF G N L
Sbjct: 90 FAIGLTFSLSFINRKNRQGTLKAFIHAIERALTILGIGTFLDLCNKYLDFFGGKNQL 146
>gi|317503638|ref|ZP_07961656.1| transmembrane protein, partial [Prevotella salivae DSM 15606]
gi|315665239|gb|EFV04888.1| transmembrane protein [Prevotella salivae DSM 15606]
Length = 322
Score = 48.5 bits (114), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 69/298 (23%), Positives = 112/298 (37%), Gaps = 84/298 (28%)
Query: 109 LALTYKNFPCKVVATRKAILRALNLFLLGIFLQG--GFFHGINNLKYGV----------- 155
L+L NF TRK I R + L+++GI + F HG+ N +
Sbjct: 5 LSLRKYNFEWSWTCTRKIIKRGILLYVIGIGISWIMMFSHGLFNAENATLPLLSRIVVSA 64
Query: 156 -DIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYL 214
IR +GV R+A Y++A+ I +K KY +++ AL YL
Sbjct: 65 NVFDHIRLVGVFPRLAFCYMLASFVAISMK------------HKYIPYFIGAL-FVGYYL 111
Query: 215 LLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRK 274
+L YG G A N +ID + G QHLY+
Sbjct: 112 ILCYG----------------------------NGFAHDASNICNVIDEAVFGRQHLYQ- 142
Query: 275 PIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFK 334
D P DPEGLLS++ A LIG G ++
Sbjct: 143 --------------------WDIP-------DPEGLLSNLPALGHVLIGFCVGRSLMSAT 175
Query: 335 DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+++ + + L G L + G ++K L++ ++ +T G + LA + +++
Sbjct: 176 SINEKIEKLFLFGALLTFAGFLLSY-GCPISKKLWTPTFALVTCGLASTTLAMLSWII 232
>gi|373954275|ref|ZP_09614235.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
gi|373890875|gb|EHQ26772.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
Length = 397
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 36/95 (37%), Positives = 56/95 (58%), Gaps = 13/95 (13%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSP--WNGLTLADFVMPFFLFIVGVS 108
R+ S+D+FR +T+ LMI V+D+ G+ +P I H+ +GL LAD V P FLFIVG+S
Sbjct: 7 NRVHSIDIFRAVTMFLMIFVNDIDGVPGVPEWIKHAGERTDGLGLADIVFPAFLFIVGLS 66
Query: 109 L------ALTYKNFPCKVVATRKAILRALNLFLLG 137
+ ++ + K+ A ++RAL L +G
Sbjct: 67 IPHAIQSRISRGDSKTKIAA--YIVMRALALIFIG 99
>gi|380025576|ref|XP_003696546.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Apis florea]
Length = 298
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 61/276 (22%), Positives = 113/276 (40%), Gaps = 56/276 (20%)
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHW----VV 205
N K + +R+ GVLQ + ++Y V A+ E + ++FR W ++
Sbjct: 3 NSKDSESLHDLRFPGVLQLLGVSYFVCAILETIFMKPHSQFGRFAMFRDILESWPQWLIM 62
Query: 206 ALVLTTLYLLLLY--------GLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNA 257
A ++TT L+ G + P +Y F + N T G A
Sbjct: 63 AGIVTTHTLITFLLPISNCPKGYFGPGGEYHFRGKY-------MNCTAG----------A 105
Query: 258 VGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMAT 317
G IDR I G Y+ T+ YG + +DPEGL++++ A
Sbjct: 106 AGYIDRLIFG------NHTYNHTENFL-----YGQI---------LRYDPEGLMNTISAI 145
Query: 318 VTCLIGLHFGHLIVHFKDHRDRMLNWI---ILSSCLIGLGLSLDFVG--MHLNKALYSLS 372
+G+H G +++ + R++ W I + + G+ + + G + ++K + +LS
Sbjct: 146 FIVYLGVHAGKILLLYYQCNSRVIRWFLWTIFTGIVAGILCNFETQGGIIPISKRMMTLS 205
Query: 373 YTCLTAGASGVLLAGIYFMVRYISSHLMLKKPFDYS 408
Y + + + +L A +Y ++ Y PF Y+
Sbjct: 206 YVLICSSFAFLLYALLYVLIDY--KQFWNGAPFVYA 239
>gi|157960931|ref|YP_001500965.1| hypothetical protein Spea_1103 [Shewanella pealeana ATCC 700345]
gi|157845931|gb|ABV86430.1| conserved hypothetical protein [Shewanella pealeana ATCC 700345]
Length = 394
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 46/163 (28%), Positives = 71/163 (43%), Gaps = 23/163 (14%)
Query: 34 DPLLPPSNSNSKQTRPQHQQRRLISLDVFRGLTV-----------ALMILVDDVG--GIL 80
P + ++ ++ + RL SLD RG + AL++L G
Sbjct: 7 KPRVSIASVAAESVSKPAAKPRLKSLDALRGFDMFWILGGEAIFAALLLLTGWAGFNWFD 66
Query: 81 PAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKN-----FPCKVVATRKAILRALNLFL 135
++HS W+G T D + P F+F+ GV+L L+ K P ++ + AI R L L L
Sbjct: 67 SQMHHSTWHGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPLPQRMPLYQHAIKRLLLLLL 126
Query: 136 LGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAAL 178
G+ G+ G + IR+ VL RIA A+ AL
Sbjct: 127 FGVIYNHGWGTGAP-----FALGDIRYASVLGRIAFAWFFCAL 164
>gi|406980095|gb|EKE01754.1| hypothetical protein ACD_21C00059G0003 [uncultured bacterium]
Length = 295
Score = 48.1 bits (113), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 37/116 (31%), Positives = 57/116 (49%), Gaps = 9/116 (7%)
Query: 295 LDAPSWCQAP---FDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNW-IILSSCL 350
L +W AP FDPEG+LS++ A T L G+ G L+ K + W +I L
Sbjct: 99 LHGHTWVNAPAPGFDPEGILSTLPAIATMLFGVLTGQLL---KSSFTQKTVWMLIFGGAL 155
Query: 351 IGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLKKPFD 406
I LGL + + +NK L++ SY T+G + V+ Y+++ + H KP
Sbjct: 156 IFLGLVMSH-WLPINKNLWTSSYAVFTSGMASVVFGCCYWLID-VKKHQKWFKPLQ 209
>gi|305666718|ref|YP_003863005.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
gi|88708942|gb|EAR01176.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
Length = 395
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 54/100 (54%), Gaps = 9/100 (9%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPWNG--LTLADFVMPFFLFI 104
+ + R+ S+DV R LT+ LMI V+D + + H+ N L +D + P FLFI
Sbjct: 3 KSKTLRIASIDVLRALTMLLMIWVNDFWTLTQVPKWLTHAKPNEDYLGFSDIIFPLFLFI 62
Query: 105 VGVSLALTYKNFPCK----VVATRKAILRALNLFLLGIFL 140
VG+S+ N K + + ++R+++L ++G+F+
Sbjct: 63 VGLSIPFAINNRMAKGEPRSIMFKHIVIRSISLLIIGVFM 102
>gi|395803959|ref|ZP_10483200.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
gi|395433603|gb|EJF99555.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
Length = 396
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 32/98 (32%), Positives = 56/98 (57%), Gaps = 11/98 (11%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSP--WNGLTLADFVMPFFLFIVGV 107
+R++S+D RG+T+ +MI V+++ I +P + H P + +T D V P FLFIVG+
Sbjct: 9 NQRIVSIDSLRGITIFVMIFVNELASIQNVPQWMKHMPADADAMTFVDLVFPAFLFIVGM 68
Query: 108 SLALTY-----KNFPCKVVATRKAILRALNLFLLGIFL 140
S+ + K K + T + RA+ L ++G+F+
Sbjct: 69 SIPFAFNARLIKGDSPKTIWTH-TLKRAIALIIIGVFM 105
>gi|407789242|ref|ZP_11136344.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
gi|407207220|gb|EKE77163.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
Length = 364
Score = 47.8 bits (112), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 46/146 (31%), Positives = 67/146 (45%), Gaps = 17/146 (11%)
Query: 47 TRPQHQQRRLISLDVFRG----------LTVALMILVDDVG---GILPAINHSPWNGLTL 93
T PQ Q RL+ +D RG L A + G + + HS W+GLT
Sbjct: 2 TTPQPQ--RLLCVDALRGFDMFWILGGELLFAALFTWTGAGIWHSLAGQMAHSDWHGLTA 59
Query: 94 ADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFF-HGINNLK 152
D + P F+F+ GV+L L K R+A+ R+ LL + L G + HG
Sbjct: 60 YDGIFPLFIFLSGVTLGLADKRASALGGGARRALYRSALRRLLLLLLLGVLYNHGWGTGL 119
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAAL 178
G ++R+ VL RI +A+ VAA+
Sbjct: 120 PG-HWDEVRYASVLGRIGLAWFVAAM 144
Score = 38.1 bits (87), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
Query: 302 QAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG 361
Q P DPEGLLS++ A V L+G+ G L+ + + + L L+ LG V
Sbjct: 196 QMPADPEGLLSTLPAVVNALLGVVAGGLLRSPRQPWSKAVLLAALGLGLLALGYLWSLV- 254
Query: 362 MHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
LNK L++ S+ +T+G S +LLA Y ++
Sbjct: 255 FPLNKTLWTSSFVLVTSGWSALLLALFYVLI 285
>gi|149277363|ref|ZP_01883505.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
gi|149232240|gb|EDM37617.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
Length = 396
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 39/118 (33%), Positives = 59/118 (50%), Gaps = 16/118 (13%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPA---INHSPW--NGLTLADFVMPF 100
T ++ Q+RL+S+D R L + LMI V+D+ ++ + H+P N + LAD V P
Sbjct: 1 MTANKNVQQRLVSIDALRALVMLLMIFVNDLWSLIDIPGWLEHAPGDANYMGLADVVFPA 60
Query: 101 FLFIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFLQGGFFHGINNLKYG 154
FL IVG+S+ + K R L R + L ++ GFFH +N YG
Sbjct: 61 FLVIVGLSVPYAIDSRRRKGDGNRAIFLHIVYRTIALLVM------GFFH-VNMETYG 111
>gi|326663866|ref|XP_696425.5| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Danio rerio]
Length = 170
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 52/97 (53%), Gaps = 10/97 (10%)
Query: 304 PFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD-HRD---RMLNW----IILSSCLIGLGL 355
PFDPEG+L S+ + + +GL G +++H++D HR R L W I+S+ L
Sbjct: 2 PFDPEGVLGSINSILMAFLGLQAGKILLHYRDQHRQIITRFLMWGLILGIISAVLTKCSR 61
Query: 356 SLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ F+ +NK L+SLSY + + V L Y+ V
Sbjct: 62 NDGFIP--VNKNLWSLSYVTTLSCFAFVALVFFYYTV 96
>gi|255035026|ref|YP_003085647.1| hypothetical protein Dfer_1233 [Dyadobacter fermentans DSM 18053]
gi|254947782|gb|ACT92482.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 401
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 51/104 (49%), Gaps = 9/104 (8%)
Query: 46 QTRPQHQQRRLISLDVFRGLTVALMILVDD---VGGILPAINHS--PWNGLTLADFVMPF 100
+ RL S+DVFR +T+ LMI V+D + + + HS + + L+D V P
Sbjct: 1 MNKVASSSLRLDSIDVFRAVTMLLMIFVNDFWTLEAVPKWLEHSKAEEDAMGLSDVVFPA 60
Query: 101 FLFIVGVSLALTYKNFPCK----VVATRKAILRALNLFLLGIFL 140
FLFIVG+S+ N K + R R L L+GIF+
Sbjct: 61 FLFIVGLSIPFAISNRRKKGDGNALIIRHIAERTFALLLMGIFI 104
>gi|406834451|ref|ZP_11094045.1| hypothetical protein SpalD1_22506 [Schlesneria paludicola DSM
18645]
Length = 358
Score = 47.4 bits (111), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 35/103 (33%), Positives = 48/103 (46%), Gaps = 6/103 (5%)
Query: 79 ILPAINHSPWNGLTLADFVMPFFLFIVGVSL--ALTYKNFPCKVVATRKAILRALNLFLL 136
++ + H W+G D + P FLF+VGV L +LT ++V R + + L
Sbjct: 23 VVTQLEHVKWDGFHFYDLIFPLFLFLVGVVLPFSLTKYQTAGELVPNRSGVYARIIRRTL 82
Query: 137 GIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
+ G +GI L D RW GVLQRI I Y AAL
Sbjct: 83 LLIALGLIGNGILQL----DFTNFRWPGVLQRIGICYFFAALA 121
>gi|387789753|ref|YP_006254818.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
gi|379652586|gb|AFD05642.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
Length = 389
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 33/97 (34%), Positives = 53/97 (54%), Gaps = 15/97 (15%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPW--------NGLTLADFVMPFFLFIVG 106
RL S+DV R LT+ LMI V+D+ + +N W +G+ L+D + P FLFIVG
Sbjct: 7 RLGSIDVIRALTMFLMIFVNDLWSL---VNVPKWLEHVDVQTDGMGLSDVIFPAFLFIVG 63
Query: 107 VSLALTYKNFPCKVVAT----RKAILRALNLFLLGIF 139
+S+ + +N K +T + +R+ L ++G F
Sbjct: 64 LSIPFSVENRIKKGDSTIQLLKHIFIRSFALLVIGFF 100
>gi|146300862|ref|YP_001195453.1| hypothetical protein Fjoh_3117 [Flavobacterium johnsoniae UW101]
gi|146155280|gb|ABQ06134.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
Length = 380
Score = 47.0 bits (110), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 68/146 (46%), Gaps = 25/146 (17%)
Query: 55 RLISLDVFRGLTVALMI----LVDDVGGILP---------AINHSPWNGLTLADFVMPFF 101
RLISLDV RG + ++ ++ + P ++H+ WNG+T D + P F
Sbjct: 8 RLISLDVLRGFVMFWIMSGEHIIHALAKAAPIPIFIWMSSQLHHAEWNGITFYDMIFPVF 67
Query: 102 LFIVGVSLALTYKNFPCKVVATR------KAILRALNLFLLG---IFLQGGFFHGINNLK 152
LF+ GVS+ +++ K+ + KA R + L +L I L GF +N L
Sbjct: 68 LFVAGVSMPFSFEK-KMKLAGVKEPKDLPKAEKRKIYLSMLRRTCILLVLGFV--VNGLL 124
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAAL 178
Q R+ VL RI +A+ A +
Sbjct: 125 RFDGFDQTRFASVLGRIGLAWFFAGI 150
>gi|338209612|ref|YP_004653659.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336303425|gb|AEI46527.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 398
Score = 47.0 bits (110), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 34/99 (34%), Positives = 54/99 (54%), Gaps = 9/99 (9%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDD---VGGILPAINH--SPWNGLTLADFVMPFFLFI 104
Q +R+ S+D FR LT+ LMI V+D + GI + H + + L +D + P FLFI
Sbjct: 2 QITLKRVPSIDAFRALTMLLMIFVNDFWSLSGIPYWLEHAKAEEDFLGFSDIIFPCFLFI 61
Query: 105 VGVSLALTYKNFPCK----VVATRKAILRALNLFLLGIF 139
+G+++ +N K R ILR++ L ++GIF
Sbjct: 62 LGMAIPFAVQNRIAKGDTRWQIVRHIILRSVALIVMGIF 100
>gi|254445881|ref|ZP_05059357.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
DG1235]
gi|198260189|gb|EDY84497.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
DG1235]
Length = 394
Score = 46.6 bits (109), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 43/69 (62%), Gaps = 5/69 (7%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSP--WNGLTLADFVMPFFLFIVG 106
Q R+ S+D+FRGLT+ LMI V+D + +P + H+P + + +D + P FLFIVG
Sbjct: 1 MQSRIHSIDIFRGLTMLLMIWVNDFWSLTNVPTWLEHAPGDADAMGFSDIIFPAFLFIVG 60
Query: 107 VSLALTYKN 115
+S+ ++
Sbjct: 61 LSIPFALRS 69
>gi|305665830|ref|YP_003862117.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
gi|88710601|gb|EAR02833.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
Length = 346
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 38/74 (51%), Gaps = 3/74 (4%)
Query: 78 GILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKN---FPCKVVATRKAILRALNLF 134
G+ ++H PWNGL D + PFF+FIVGV++ + + + ATR + R LF
Sbjct: 26 GLAHQLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGSRKSATRHILKRCFLLF 85
Query: 135 LLGIFLQGGFFHGI 148
G L + H +
Sbjct: 86 AFGALLHCVYSHAL 99
>gi|115770385|ref|XP_001180412.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
partial [Strongylocentrotus purpuratus]
Length = 78
Score = 46.2 bits (108), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 21/63 (33%), Positives = 42/63 (66%), Gaps = 4/63 (6%)
Query: 83 INHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVA----TRKAILRALNLFLLGI 138
++H+ W+G+T+ADF+ P+F+FI+G S+ L+ K + +K + R++ LF++G+
Sbjct: 16 VSHAIWSGITVADFMFPWFVFIMGTSIHLSINILLSKGQSYPSIYKKLVSRSITLFIMGV 75
Query: 139 FLQ 141
+Q
Sbjct: 76 CIQ 78
>gi|456890764|gb|EMG01555.1| putative membrane protein [Leptospira borgpetersenii str.
200701203]
Length = 270
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 42/150 (28%), Positives = 69/150 (46%), Gaps = 45/150 (30%)
Query: 261 IDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTC 320
IDR+I G +HL++ +S+T +DPEG LS + + T
Sbjct: 78 IDRRIFGEKHLWK---FSKT------------------------WDPEGFLSGIASIATS 110
Query: 321 LIGLHFGHLIVHFKDH-RDRMLNWIILSSCLIGLGLSLDFVGMH------LNKALYSLSY 373
L G+ G ++ + ++R+L+ + GLG FVG+ +NK+L++ SY
Sbjct: 111 LFGVICGFILFRREGRGKNRVLS-------IFGLGFLFTFVGLLWDRSLPMNKSLWTGSY 163
Query: 374 TCLTAGASGVLLAGIYFMVRYISSHLMLKK 403
T G S L G + Y+ S ++LKK
Sbjct: 164 AVYTTGLS-FLCIGFF---EYLDSLILLKK 189
>gi|440747820|ref|ZP_20927075.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
gi|436483562|gb|ELP39602.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
Length = 372
Score = 46.2 bits (108), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 84/362 (23%), Positives = 133/362 (36%), Gaps = 93/362 (25%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDD-------------VGGILPAINHSPWNGLTLADF 96
+ +RL+S+D RG + ++ D V + H W G T DF
Sbjct: 6 KQPSKRLVSIDALRGFDMLMICGADAFFRSLEGKTSFAWVDVLARQFEHPEWIGFTFYDF 65
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQGGFFHGINNLK 152
+ P FLF+ GVS+ + + V+ R KA+ R L L LG+ + F
Sbjct: 66 IFPLFLFVAGVSIPFSLGKSLAENVSKREIYKKALSRTLLLIGLGMLDKNAPFPFF---- 121
Query: 153 YGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSL-FRKYRGHWVVALVLTT 211
D QIR VL RI IA G V+ L L F + +V LVL
Sbjct: 122 ---DWEQIRLGSVLGRIGIA--------------GFVTVFLFLNFPARQRLGIVGLVLIA 164
Query: 212 LYLLLLYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHL 271
Y + + PV + G+ N G IDR L
Sbjct: 165 YYAAV----------FLIPVPGFGA------------GNLSFEGNLAGWIDRTFLP---- 198
Query: 272 YRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIV 331
R Q S FD G+L++ A ++G G ++
Sbjct: 199 ------GRLLQGS--------------------FDELGILTTFPAICLTILGAQAGEILR 232
Query: 332 HFK-DHRDRMLNWIILSSCLIGLGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYF 390
+ + + +++ ++ IGL L + + K +++ S+ L AG + V LAG Y+
Sbjct: 233 NAQLSEQQKVVRTLLFGVVCIGLALIWH-LHFPIFKRMWTSSFILLNAGMAFVALAGFYW 291
Query: 391 MV 392
++
Sbjct: 292 LI 293
>gi|406835226|ref|ZP_11094820.1| hypothetical protein SpalD1_26403 [Schlesneria paludicola DSM
18645]
Length = 508
Score = 46.2 bits (108), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 29/62 (46%), Positives = 38/62 (61%), Gaps = 6/62 (9%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPW----NGLTLADFVMPFFLFIVGVSLA 110
R+IS+D FRG VA MI V+ VGG + HS + N L+ AD +M F+F+VG S
Sbjct: 14 RVISMDQFRGYAVAAMIFVNFVGGF--GVVHSVFKHNDNYLSYADTIMANFMFMVGFSFR 71
Query: 111 LT 112
LT
Sbjct: 72 LT 73
>gi|312131791|ref|YP_003999131.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
17132]
gi|311908337|gb|ADQ18778.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
17132]
Length = 361
Score = 46.2 bits (108), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 50/100 (50%), Gaps = 20/100 (20%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSPW--------NGLTLADFVMPFF 101
++ RL+S+D+FR LT+ MI V+D + N W +G+ +D + P F
Sbjct: 5 DQKKNRLLSIDIFRALTMFFMIFVND---LFTVKNVPKWMLHTEMHEDGMGFSDVIFPIF 61
Query: 102 LFIVGVSLALTYKNFPCKVVATRKAI-LRALNLFLLGIFL 140
L IVG+S+ ++ K I +R L ++G+FL
Sbjct: 62 LLIVGMSIPFAKADW--------KGIGMRTFALLVMGVFL 93
>gi|320450186|ref|YP_004202282.1| hypothetical protein TSC_c11130 [Thermus scotoductus SA-01]
gi|320150355|gb|ADW21733.1| putative membrane protein [Thermus scotoductus SA-01]
Length = 334
Score = 46.2 bits (108), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 25/60 (41%), Positives = 40/60 (66%), Gaps = 2/60 (3%)
Query: 55 RLISLDVFRGLTVALMILVDDV-GGILPAINHSPWNG-LTLADFVMPFFLFIVGVSLALT 112
R ++LD FRGLTVALM+ V+++ G P + H P+ G + LAD V P++L +G ++ +
Sbjct: 6 RSLALDAFRGLTVALMLFVNNLPPGAPPYLEHGPFGGSVYLADLVFPWYLLAMGAAIPFS 65
>gi|333379187|ref|ZP_08470911.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
22836]
gi|332885455|gb|EGK05704.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
22836]
Length = 395
Score = 45.8 bits (107), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 31/100 (31%), Positives = 53/100 (53%), Gaps = 9/100 (9%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDD---VGGILPAINHSPWNG--LTLADFVMPFFLF 103
P + +R+ S+D++R LT+ MI V+D V + + H+ N L +D V P FLF
Sbjct: 5 PTEKPKRIASIDIYRALTMFFMIFVNDLWSVSNVPHWLEHAAANEDMLGFSDIVFPSFLF 64
Query: 104 IVGVSLALTY----KNFPCKVVATRKAILRALNLFLLGIF 139
I+G+S+ L K + I+R++ L ++G+F
Sbjct: 65 ILGMSIPLAIEIRKKKGDSNSGILKHIIIRSIALLVMGLF 104
>gi|456890770|gb|EMG01561.1| hypothetical protein LEP1GSC123_2562 [Leptospira borgpetersenii
str. 200701203]
Length = 74
Score = 45.8 bits (107), Expect = 0.040, Method: Composition-based stats.
Identities = 21/46 (45%), Positives = 29/46 (63%), Gaps = 3/46 (6%)
Query: 59 LDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFF 101
+D+FRG+TV MILV++ G + + H+ WNG T D V PFF
Sbjct: 1 MDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 46
>gi|406832166|ref|ZP_11091760.1| hypothetical protein SpalD1_11017 [Schlesneria paludicola DSM
18645]
Length = 413
Score = 45.8 bits (107), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 50/105 (47%), Gaps = 18/105 (17%)
Query: 55 RLISLDVFRGLTVALMIL----VDDVGGILPAI----------NHSPWNGLTLADFVMPF 100
RL+S+D +RG + LM+ + DV LP +H W G L D + P
Sbjct: 28 RLVSVDAYRGWVMLLMMAEVLRLRDVAKALPESRLWAFLAQQQSHVTWVGCVLHDMIQPS 87
Query: 101 FLFIVGVSLALTYKNFPCK----VVATRKAILRALNLFLLGIFLQ 141
F F+VGV+L L+ + T A R+L L LLG+FL+
Sbjct: 88 FSFLVGVALPLSLRRRSLSGQPLWQRTAHAAWRSLVLILLGVFLR 132
>gi|296121958|ref|YP_003629736.1| hypothetical protein Plim_1707 [Planctomyces limnophilus DSM 3776]
gi|296014298|gb|ADG67537.1| conserved hypothetical protein [Planctomyces limnophilus DSM 3776]
Length = 378
Score = 45.4 bits (106), Expect = 0.048, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 50/96 (52%), Gaps = 14/96 (14%)
Query: 84 NHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIF 139
H W G +L D V P F+F+VGVS+ + K V+T +A+ R++ L +LGIF
Sbjct: 32 EHVAWRGCSLWDMVQPSFMFLVGVSIPWSLAAQKSKNVSTGQGWVRAVWRSVLLVVLGIF 91
Query: 140 LQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLV 175
L I+N K D + + VL +I + YLV
Sbjct: 92 L-------ISNNKPSTDFSFVN---VLTQIGLGYLV 117
>gi|429727718|ref|ZP_19262477.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
anaerobius VPI 4330]
gi|429151771|gb|EKX94625.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
anaerobius VPI 4330]
Length = 463
Score = 45.4 bits (106), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 34/140 (24%), Positives = 68/140 (48%), Gaps = 16/140 (11%)
Query: 49 PQHQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIV 105
+++ R+ S+D RGL V L + + + G I A +S WNG+TL D ++P FL ++
Sbjct: 42 AKYEDMRVQSIDYMRGLLVILSMFMINQGLENQISYAFQNSKWNGMTLLDILVPMFLLVI 101
Query: 106 GVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIR 161
G S+ K + R + ++++ +F++G+ ++ + +R
Sbjct: 102 GSSIPFYVKKHYEENEDLRHIVKMSFIKSIIVFVIGLIFSCIYYPAND---------YVR 152
Query: 162 WMGVLQRIAIAYLVAALCEI 181
G +Q +A Y+++ L I
Sbjct: 153 LTGPIQMMAFVYIMSLLLYI 172
>gi|182412825|ref|YP_001817891.1| hypothetical protein Oter_1003 [Opitutus terrae PB90-1]
gi|177840039|gb|ACB74291.1| conserved hypothetical protein [Opitutus terrae PB90-1]
Length = 411
Score = 45.4 bits (106), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 38/131 (29%), Positives = 54/131 (41%), Gaps = 25/131 (19%)
Query: 55 RLISLDVFRG-------------LTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFF 101
RL+S+D RG L + M L + + + H W G D + P F
Sbjct: 11 RLVSVDALRGFDMFWILGADALVLALGAMSLSPTLRALAGQLEHKDWAGFAFYDLIFPLF 70
Query: 102 LFIVGVSLALTYKNFPC---KVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
+FIVGVS + + + A ++ + R L L GIF GG H
Sbjct: 71 VFIVGVSTVFSLTSLVAREGRAAAVKRILRRTLLLLAFGIFYNGGLAH---------QWP 121
Query: 159 QIRWMGVLQRI 169
+R +GVLQRI
Sbjct: 122 DVRLVGVLQRI 132
>gi|255532593|ref|YP_003092965.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
gi|255345577|gb|ACU04903.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
Length = 390
Score = 45.1 bits (105), Expect = 0.060, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 54/94 (57%), Gaps = 9/94 (9%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGI--LP-AINH--SPWNGLTLADFVMPFFLFIVGVSL 109
R ++DV R LT+ LMI V+DVG + LP ++H + +G+ AD + P FLFIVG+SL
Sbjct: 8 RFQAVDVLRALTMFLMIFVNDVGSVKYLPHWVDHVEADVDGMGFADTIFPAFLFIVGLSL 67
Query: 110 ALTYKNFPCK----VVATRKAILRALNLFLLGIF 139
++ K + + +LR+ L ++G F
Sbjct: 68 PFALQSRMNKGKSFLSISLYIVLRSAALIVMGFF 101
>gi|326798253|ref|YP_004316072.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326549017|gb|ADZ77402.1| hypothetical protein Sph21_0826 [Sphingobacterium sp. 21]
Length = 368
Score = 45.1 bits (105), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 50/115 (43%), Gaps = 24/115 (20%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVG-----------------GILPAINHSPWNG 90
+ Q R++SLDV RGL +MIL+ GI+ H W+G
Sbjct: 2 EEKKVQTRILSLDVMRGL---IMILLAAESCELYTALSSTYSSGWPQGIIHHFFHHEWHG 58
Query: 91 LTLADFVMPFFLFIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLGIFLQ 141
L D V P F+FI G SL L+++ V+ + R+ LFL G+ L
Sbjct: 59 LYFWDLVQPAFMFIAGTSLYLSFQRKQAAGVSWSSHFKSVAWRSAKLFLCGVALH 113
>gi|313145390|ref|ZP_07807583.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|423279948|ref|ZP_17258861.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
610]
gi|424661980|ref|ZP_18099017.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
616]
gi|313134157|gb|EFR51517.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|404578291|gb|EKA83026.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
616]
gi|404584284|gb|EKA88949.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
610]
Length = 377
Score = 45.1 bits (105), Expect = 0.063, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 64/145 (44%), Gaps = 23/145 (15%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVD-------------DVGGILPAINHSPWNGLTLADF 96
Q ++RL SLD RGL + ++ + + G+ + H W G + D
Sbjct: 3 QPIRQRLESLDALRGLDLFFLVALGPLLRTLVRAIDSPHLDGVNWCLRHVDWIGFSPWDL 62
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATR---KAILRALNLFLLGIFLQGGFFHGINNLKY 153
+MP FLF+ G+S+ F + ++ + R L L++ G+ QG N L +
Sbjct: 63 IMPLFLFMSGISIPFALSRFKGEADKSKLIYRLCKRVLLLWIFGMMCQG------NLLSF 116
Query: 154 GVDIAQIRWMGVLQRIAIAYLVAAL 178
D + + LQ IA Y+ AAL
Sbjct: 117 DPDHLYL-YTNTLQSIATGYIAAAL 140
>gi|377572860|ref|ZP_09801940.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
104925]
gi|377538518|dbj|GAB47105.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
104925]
Length = 439
Score = 45.1 bits (105), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 69/287 (24%), Positives = 109/287 (37%), Gaps = 74/287 (25%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYK 114
RL SLDV RG+ + + ++V+ H+ W G+ D V P F+ + G +A+ +
Sbjct: 11 RLESLDVCRGVMLVVSVVVNAWFTAPEWFEHAAWTGVHPVDLVFPAFVTLSGAGMAIAF- 69
Query: 115 NFPCKVVATRKAILRALNLFLLGI-FLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAY 173
+ V + + R L L G+ F G G VD+A +R+ GVLQ A
Sbjct: 70 ---ARRVPVARQVRRVLVLTAAGLAFAVAGQVLGTG----AVDVATLRFTGVLQLYAFLV 122
Query: 174 LVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVET 233
L L + + R++ HW+ A + L +
Sbjct: 123 LALGLVAV-------------VVRRW-WHWLAAAAVVAGAQAWLLASW------------ 156
Query: 234 SSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPM 293
+SS P G+ ACN G++D + G H+Y + R
Sbjct: 157 ASSCP---------GGALTKACNPSGVVDAAVFG-PHMY---VMGRLGH----------- 192
Query: 294 PLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRM 340
DPEG +++ A VT L+G G L+ +HR M
Sbjct: 193 ------------DPEGFVAAAGALVTALVGAAAGRLMW---EHRRGM 224
>gi|225013130|ref|ZP_03703543.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
gi|225002750|gb|EEG40733.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
Length = 365
Score = 45.1 bits (105), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 39/85 (45%), Gaps = 20/85 (23%)
Query: 47 TRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAI----------------NHSPWNG 90
T P Q RL SLD FRG V + +LV + + +H W G
Sbjct: 4 TNPLSQ--RLRSLDFFRG--VVMFLLVAEFSHLFGVFMKTENETITAAADFLFHHVQWEG 59
Query: 91 LTLADFVMPFFLFIVGVSLALTYKN 115
L D + PFF+FIVGVS+ +Y N
Sbjct: 60 LHFWDLIQPFFMFIVGVSIPYSYAN 84
>gi|333382416|ref|ZP_08474086.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828727|gb|EGK01419.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
BAA-286]
Length = 394
Score = 45.1 bits (105), Expect = 0.066, Method: Compositional matrix adjust.
Identities = 34/95 (35%), Positives = 55/95 (57%), Gaps = 11/95 (11%)
Query: 55 RLISLDVFRGLTVALMILVDD---VGGILPAINHSP--WNGLTLADFVMPFFLFIVGVSL 109
R+ S+D+FR LT+ MI V+D V G+ + H+ + L +D V P FLFI+G+S+
Sbjct: 11 RVASIDIFRALTMFFMIFVNDFWSVSGVPHWLEHAAASEDMLGFSDVVFPSFLFILGMSI 70
Query: 110 ALTYKNFPCKVVATRKAIL-----RALNLFLLGIF 139
L ++ K T+K IL R++ L ++G+F
Sbjct: 71 PLAMES-RMKKGETKKQILWHIVVRSVALLVMGLF 104
>gi|406831132|ref|ZP_11090726.1| hypothetical protein SpalD1_05831 [Schlesneria paludicola DSM
18645]
Length = 508
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 41/120 (34%), Positives = 58/120 (48%), Gaps = 11/120 (9%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVM 98
S+S T P+ RL SLD FRG T+ M+LV+ +GG + P I + + AD +M
Sbjct: 2 SSSIPTAPK-PSLRLTSLDQFRGYTMLGMLLVNFIGGYKAVSPRILLHTHDYCSYADTIM 60
Query: 99 PFFLFIVGVSLALTY--KNFPCKVVATRKAILRALNLFLLGIFL-----QGGFFHGINNL 151
P FLF G +L L+ + + +AI R L L L+ I GG H N +
Sbjct: 61 PHFLFAAGFALRLSLGRRMEAGGKMPWGRAIRRILGLALVAIIWYGYCDWGGVVHKFNTM 120
>gi|256423178|ref|YP_003123831.1| hypothetical protein Cpin_4173 [Chitinophaga pinensis DSM 2588]
gi|256038086|gb|ACU61630.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
Length = 349
Score = 44.7 bits (104), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 53/99 (53%), Gaps = 18/99 (18%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAIN------------HSPWNGLTLADFVMPFFL 102
RL+SLDV RGL +++L + + +++ H PW+GL D V P F+
Sbjct: 6 RLLSLDVMRGLI--MILLAGESCRVYESLHEWHDNAFIRQFFHHPWHGLRFWDLVQPAFM 63
Query: 103 FIVGVSLALTYKNFPCKVVAT----RKAILRALNLFLLG 137
+ G ++ ++Y++ K V+ + ++R+L LFLLG
Sbjct: 64 LMAGTAMYISYQSKLRKGVSWSQNFKHILIRSLKLFLLG 102
>gi|358344082|ref|XP_003636122.1| Magnesium transporter NIPA2 [Medicago truncatula]
gi|355502057|gb|AES83260.1| Magnesium transporter NIPA2 [Medicago truncatula]
Length = 328
Score = 44.7 bits (104), Expect = 0.088, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 35/63 (55%), Gaps = 6/63 (9%)
Query: 116 FPCKVVATRKAIL------RALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRI 169
F C + R A+ A +++LG GG+ H +++L +GVD+ QIR MG+LQR
Sbjct: 78 FGCCAASARHAVFCLCWAGVACVVWVLGGVFTGGYVHRVSDLTFGVDLKQIRLMGILQRE 137
Query: 170 AIA 172
I
Sbjct: 138 NIV 140
>gi|289422375|ref|ZP_06424221.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
653-L]
gi|289157210|gb|EFD05829.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
653-L]
Length = 463
Score = 44.7 bits (104), Expect = 0.094, Method: Compositional matrix adjust.
Identities = 34/139 (24%), Positives = 67/139 (48%), Gaps = 16/139 (11%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVG---GILPAINHSPWNGLTLADFVMPFFLFIVG 106
+++ R+ S+D RGL V L + + + G I A +S WNG+TL D ++P FL ++G
Sbjct: 43 KYEYMRVQSIDYMRGLLVILSMFMINQGLENQISYAFQNSKWNGMTLNDILVPMFLLVIG 102
Query: 107 VSLALTYKNFPCKVVATRKAI----LRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRW 162
S+ K + R + ++++ +FL+G+ ++ + +R
Sbjct: 103 SSIPFYVKKHYEENEDIRHIVKMSFIKSIIVFLIGLIFSCIYYPAND---------YVRL 153
Query: 163 MGVLQRIAIAYLVAALCEI 181
G +Q + Y+++ L I
Sbjct: 154 TGPIQMMVFVYIMSLLLYI 172
>gi|414864592|tpg|DAA43149.1| TPA: hypothetical protein ZEAMMB73_072128 [Zea mays]
Length = 762
Score = 44.7 bits (104), Expect = 0.099, Method: Compositional matrix adjust.
Identities = 18/35 (51%), Positives = 23/35 (65%)
Query: 133 LFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQ 167
F GGFFHG+ +L +GVD+ +IR MGVLQ
Sbjct: 258 FFCSATRCAGGFFHGVRSLSFGVDLQEIRLMGVLQ 292
>gi|332668157|ref|YP_004450945.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332336971|gb|AEE54072.1| Protein of unknown function DUF2261, transmembrane
[Haliscomenobacter hydrossis DSM 1100]
Length = 387
Score = 44.3 bits (103), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 34/102 (33%), Positives = 54/102 (52%), Gaps = 17/102 (16%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHSP-W--------NGLTLADFVMPFFL 102
+ +RL S+D+ R +T+ LMI V+D L ++ H P W +G+ L+D V P FL
Sbjct: 2 KNQRLPSIDILRAVTMLLMIFVND----LWSLTHVPHWLLHTAAEEDGMGLSDVVFPAFL 57
Query: 103 FIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFL 140
FIVG+S+ K K + +L R L ++G+F+
Sbjct: 58 FIVGLSIPHALKARLEKGASKGSVMLHILSRTFALLVMGLFM 99
>gi|418750786|ref|ZP_13307072.1| putative membrane protein [Leptospira licerasiae str. MMD4847]
gi|404273389|gb|EJZ40709.1| putative membrane protein [Leptospira licerasiae str. MMD4847]
Length = 322
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 64/289 (22%), Positives = 117/289 (40%), Gaps = 83/289 (28%)
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINNLKYGVD 156
+ PFFLF VG S+ + N + K + R++ L LG+FL F G
Sbjct: 1 MFPFFLFAVGASIPFSVSN---GIQEFPKILKRSVILIFLGLFLN---FFG------EWS 48
Query: 157 IAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLL 216
+ +R+ GVLQRI AY +A+ + ++ ++ LF + L+++ YL
Sbjct: 49 FSNLRFPGVLQRIGFAYFFSAIA----YREKNLKFRIILF--------LTLLISYWYL-- 94
Query: 217 LYGLYVPDWQYEFPVETSSSSPWIFNVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPI 276
EF ++ P S + +DR++ G HL++
Sbjct: 95 ----------QEFIPPPGAAEP-----------SMKEGKDWGAWLDREVFGQAHLWK--- 130
Query: 277 YSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDH 336
+G + +DPEGLL+S + + G+ G + K H
Sbjct: 131 -------------FGKV-----------WDPEGLLTSFTSIASVFCGIFAGEFL---KVH 163
Query: 337 RDRMLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSYTCLTAG 379
++ + + +S ++ ++ VG +NK+L++ +Y+ TAG
Sbjct: 164 LEKKESPLSISGKIVLGAFAVLVVGGVWGIYYPINKSLWTGTYSLWTAG 212
>gi|340617673|ref|YP_004736126.1| hypothetical protein zobellia_1684 [Zobellia galactanivorans]
gi|339732470|emb|CAZ95738.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
Length = 346
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 24/74 (32%), Positives = 37/74 (50%), Gaps = 3/74 (4%)
Query: 78 GILPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPC---KVVATRKAILRALNLF 134
G+ ++H PWNGL D + PFF+FIVGV++ + + K T+ + R LF
Sbjct: 26 GLADQLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGDKKGVTKHILRRCFLLF 85
Query: 135 LLGIFLQGGFFHGI 148
G L + H +
Sbjct: 86 AFGALLHCVYSHAL 99
>gi|395804714|ref|ZP_10483949.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
gi|395433102|gb|EJF99060.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
Length = 380
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 41/155 (26%), Positives = 66/155 (42%), Gaps = 41/155 (26%)
Query: 55 RLISLDVFRGLTVALMI----LVDDVGGILP---------AINHSPWNGLTLADFVMPFF 101
RLISLD RG + ++ ++ + P ++H+ WNG+T D + P F
Sbjct: 8 RLISLDALRGFVMFWIMSGEHIIHALAKAAPIPIFLWMSSQLHHAEWNGITFYDMIFPVF 67
Query: 102 LFIVGVSLALTY------------KNFPCKVVATRKAIL----RALNLFLLGIFLQGGF- 144
LF+ GVS+ ++ + P K RK L R + L +LG + G
Sbjct: 68 LFVAGVSMPYSFEKKMNLAGVSTPQELPSK--EKRKIYLSMLRRTIILVVLGFVVNGLLR 125
Query: 145 FHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALC 179
F G ++ ++ VL RI IA+ A +
Sbjct: 126 FDGFDHTRFA---------SVLGRIGIAWFFAGMI 151
>gi|456861595|gb|EMF80245.1| hypothetical protein LEP1GSC188_2621 [Leptospira weilii serovar
Topaz str. LT2116]
Length = 179
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 38/126 (30%), Positives = 56/126 (44%), Gaps = 41/126 (32%)
Query: 261 IDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDPEGLLSSVMATVTC 320
IDR I+G +HL++ +S+T +DPEGLLS V + T
Sbjct: 83 IDRVIIGEKHLWK---FSKT------------------------WDPEGLLSGVASIATS 115
Query: 321 LIGLHFGH-LIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG------MHLNKALYSLSY 373
L G+ G L + R R+ + + GLG FVG + +NK+L++ SY
Sbjct: 116 LFGVLCGFILFLRESGGRSRVFS-------IFGLGFLFTFVGLLWDQSLPMNKSLWTGSY 168
Query: 374 TCLTAG 379
TAG
Sbjct: 169 AAYTAG 174
>gi|168705120|ref|ZP_02737397.1| hypothetical protein GobsU_36644 [Gemmata obscuriglobus UQM 2246]
Length = 387
Score = 43.9 bits (102), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 46/93 (49%), Gaps = 10/93 (10%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGI------LPAINHSPWNGLTLADFVMPFFLFIVGVS 108
RL SLD FRG TV M+LV+ VG +P + H + AD +MP FLF VG +
Sbjct: 20 RLASLDQFRGYTVLGMLLVNFVGSFAVIKADVPVLAHHH-TYCSYADTIMPQFLFAVGFA 78
Query: 109 LALTYKNFPCKV---VATRKAILRALNLFLLGI 138
LT+ V A A+ R L LFL+
Sbjct: 79 FRLTFARRRDAVGPRAAYGHAVRRNLALFLVAF 111
>gi|386857066|ref|YP_006261243.1| hypothetical protein DGo_CA1858 [Deinococcus gobiensis I-0]
gi|380000595|gb|AFD25785.1| hypothetical protein DGo_CA1858 [Deinococcus gobiensis I-0]
Length = 341
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 41/118 (34%), Positives = 58/118 (49%), Gaps = 18/118 (15%)
Query: 70 MILVDDV--GGILPA-INHSPWNGLTLADFVMPFFLFIVGVSL----ALTYKNFPCKVVA 122
M+LV++V G PA + H+ + GLTL D V P+FLF G +L A +
Sbjct: 1 MLLVNNVSLGNSTPAQLVHADFGGLTLTDLVFPWFLFCAGAALPFSNAAMTRAGVTGAAR 60
Query: 123 TRKAILRALNLFLLGIFLQGGFFHGINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCE 180
R+ + RA L+L+G FL H + G +GVLQ IA+A L A+L
Sbjct: 61 VRRLLTRAALLYLVGAFLTSVTLHAFS---LG--------LGVLQLIALATLGASLLA 107
>gi|189463412|ref|ZP_03012197.1| hypothetical protein BACCOP_04131 [Bacteroides coprocola DSM 17136]
gi|189429841|gb|EDU98825.1| hypothetical protein BACCOP_04131 [Bacteroides coprocola DSM 17136]
Length = 219
Score = 43.9 bits (102), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 60/260 (23%), Positives = 104/260 (40%), Gaps = 81/260 (31%)
Query: 124 RKAILRALNLFLLGIFLQGGFFHGINNLKYGV-DIAQIRWMGVLQRIAIAYLVAAL-CEI 181
+K I R + LFL+G+F IN G+ ++R GVLQR+A+ Y + C
Sbjct: 5 KKVIKRTILLFLVGLF--------INWCAKGMCSFQELRIPGVLQRLALCYFFTVIICT- 55
Query: 182 WLKGDGHVSSKLSLFRKYRGHWVVALVLTTLYLLLLYGLYVPDWQYEFPVETSSSSPWIF 241
++ KY + ++T L L+ Y+ + T + +
Sbjct: 56 ------------NIHEKY-----IPALITVLLLI-----------YQIILVTGNGFVY-- 85
Query: 242 NVTCGVRGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWC 301
GP N + +ID+ ILG HLY D+G
Sbjct: 86 ----------GPQ-NIIAVIDQYILGASHLYN---------------DHG---------- 109
Query: 302 QAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVG 361
DPEG+LS++ + LIG G + + ++ ++ ++ + L+ G + G
Sbjct: 110 ---IDPEGILSTIPSISHTLIGYCIGKICIEKENIHSKLEKLFLIGTVLLFAGYLFSY-G 165
Query: 362 MHLNKALYSLSYTCLTAGAS 381
+NK ++S +Y +T GA
Sbjct: 166 CPINKKIWSPTYVFMTCGAG 185
>gi|224536805|ref|ZP_03677344.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521571|gb|EEF90676.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
DSM 14838]
Length = 394
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 61/117 (52%), Gaps = 19/117 (16%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSPWNG--LTLADFVMP 99
K PQ R+ ++DVFR LT+ LM+ V+D+ G+ +P + H+ N L +D + P
Sbjct: 2 KNLTPQ----RVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFP 57
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIF------LQGGFFH 146
FLF +G+S++ +N K T + I R + L +G+F ++GG H
Sbjct: 58 AFLFCMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114
>gi|430744193|ref|YP_007203322.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
18658]
gi|430015913|gb|AGA27627.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
18658]
Length = 368
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 32/88 (36%), Positives = 47/88 (53%), Gaps = 5/88 (5%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGI--LPAINHSPWNGLTLADFVMPFFLFIVGVSLALT 112
R++SLD FRG TV M+ V+ +G LPA+ + + AD +MP F F VG + LT
Sbjct: 22 RIVSLDQFRGYTVVGMLFVNFLGNFDALPAVFKHHNSYCSYADTIMPQFFFAVGFAYRLT 81
Query: 113 Y-KNFPCKVV--ATRKAILRALNLFLLG 137
+ + + A + R+L L LLG
Sbjct: 82 FLRRLETSGIGGAVAAVLRRSLGLILLG 109
>gi|423223322|ref|ZP_17209791.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638858|gb|EIY32689.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 394
Score = 43.5 bits (101), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 61/117 (52%), Gaps = 19/117 (16%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSPWNG--LTLADFVMP 99
K PQ R+ ++DVFR LT+ LM+ V+D+ G+ +P + H+ N L +D + P
Sbjct: 2 KNLTPQ----RVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFP 57
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIF------LQGGFFH 146
FLF +G+S++ +N K T + I R + L +G+F ++GG H
Sbjct: 58 AFLFCMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114
>gi|329960675|ref|ZP_08299018.1| conserved domain protein [Bacteroides fluxus YIT 12057]
gi|328532548|gb|EGF59342.1| conserved domain protein [Bacteroides fluxus YIT 12057]
Length = 394
Score = 43.5 bits (101), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 37/121 (30%), Positives = 60/121 (49%), Gaps = 27/121 (22%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSP-W--------NGLTLAD 95
K PQ R+ ++DVFR LT+ LM+ V+D+ P + + P W + L +D
Sbjct: 2 KNLTPQ----RVAAVDVFRALTMFLMLFVNDI----PGLKNVPHWLMHAKIDEDMLGFSD 53
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIF------LQGGFF 145
+ P FLF +G+S++L +N K T + I R + L +G+F ++GG
Sbjct: 54 TIFPAFLFCMGMSVSLAIQNRYKKGNTTLQVISHIFWRTIALLAMGLFSLNSGGIEGGLS 113
Query: 146 H 146
H
Sbjct: 114 H 114
>gi|373850799|ref|ZP_09593600.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
bacterium TAV5]
gi|372476964|gb|EHP36973.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
bacterium TAV5]
Length = 401
Score = 43.1 bits (100), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 32/101 (31%), Positives = 52/101 (51%), Gaps = 15/101 (14%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPW--------NGLTLADFVMPFFLFIVG 106
R+ S+D+ R LT+ LMI+V+D + N W +G+ +AD V P FLF+VG
Sbjct: 12 RVASIDILRALTMVLMIIVND---LFTLKNTPAWLGHSASGVDGIGVADVVFPAFLFLVG 68
Query: 107 VSLALTYKNFPCKVVATRKAI----LRALNLFLLGIFLQGG 143
+SL + K + + +R+ L ++G+FL G
Sbjct: 69 LSLPHALEARRNKGDTGLRLVWHVAVRSFALIVMGVFLVNG 109
>gi|262381364|ref|ZP_06074502.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262296541|gb|EEY84471.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 410
Score = 43.1 bits (100), Expect = 0.28, Method: Compositional matrix adjust.
Identities = 31/102 (30%), Positives = 54/102 (52%), Gaps = 9/102 (8%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDD---VGGILPAINHSPW--NGLTLADFVMPFFL 102
+P +R I++D+ R +T+ +MI V+D V + + H+ + + + LAD V P FL
Sbjct: 4 QPIDFTQRNIAIDILRAVTMCVMIFVNDFWTVHDVPHYLEHAAYGEDFMGLADVVFPAFL 63
Query: 103 FIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIFL 140
F VG+S+ + K ++ IL R L L ++G F+
Sbjct: 64 FAVGMSIPFAIERRYAKGMSGESTILHILSRTLALLIMGAFI 105
>gi|427384705|ref|ZP_18881210.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
12058]
gi|425727966|gb|EKU90825.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
12058]
Length = 398
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 36/117 (30%), Positives = 61/117 (52%), Gaps = 19/117 (16%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSPWNG--LTLADFVMP 99
K PQ R+ ++DVFR LT+ LM+ V+D+ G+ +P + H+ N + +D + P
Sbjct: 6 KNLAPQ----RVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHAEMNEDMMGFSDTIFP 61
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIF------LQGGFFH 146
FLF +G+S++ +N K T + I R + L +G+F ++GG H
Sbjct: 62 AFLFCMGMSVSFAIQNRYRKGDTTLQVIAHVFWRTVALIAMGLFSLNSGGIEGGLSH 118
>gi|319900329|ref|YP_004160057.1| hypothetical protein Bache_0445 [Bacteroides helcogenes P 36-108]
gi|319415360|gb|ADV42471.1| hypothetical protein Bache_0445 [Bacteroides helcogenes P 36-108]
Length = 414
Score = 42.7 bits (99), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 34/112 (30%), Positives = 55/112 (49%), Gaps = 13/112 (11%)
Query: 39 PSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDD---VGGILPAINHSPW--NGLTL 93
P+N N+ T RR +++D+ R LT+ MI V+D V I + H+ + + + L
Sbjct: 4 PTNVNTSAT----YSRRNLAIDMLRALTMFTMIFVNDFWKVHDIPRWLEHAGYGEDFMGL 59
Query: 94 ADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIFLQ 141
AD V P FLF VG+S+ + K + + LR L ++G F+
Sbjct: 60 ADVVFPCFLFAVGMSIPYAIERRYAKGFSAESTLGHIFLRTFALLVMGAFIT 111
>gi|308050627|ref|YP_003914193.1| hypothetical protein Fbal_2917 [Ferrimonas balearica DSM 9799]
gi|307632817|gb|ADN77119.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799]
Length = 366
Score = 42.7 bits (99), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 40/143 (27%), Positives = 68/143 (47%), Gaps = 19/143 (13%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVGGILPAIN-------------HSPWNGLTLADFVMP 99
++RL +LD RG + +I + + L + HSPW+G T D + P
Sbjct: 5 KQRLQALDALRGFDMFWIIGGEKLFAALLLLTGWPLWQVAADQMLHSPWHGFTFYDLIFP 64
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAILRALNLFLLGIFLQGGFF-HGINNLKYGVDIA 158
F+F+ GV++ L ++ + R+ R LL + L G + HG G+ +A
Sbjct: 65 LFIFLSGVTIGLQRQSLIGIAWSDRQPHYRKALKRLLLLALLGVLYNHGWGT---GMPMA 121
Query: 159 --QIRWMGVLQRIAIAYLVAALC 179
+IR+ VL RI +A+ +AA+
Sbjct: 122 LDEIRYASVLGRIGMAWFLAAMI 144
Score = 41.6 bits (96), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 35/128 (27%), Positives = 58/128 (45%), Gaps = 17/128 (13%)
Query: 303 APFDPEGLLSSVMATVTCLIGLHFGHLIVHFKDHRDRMLNWIILSSCLIGLGLSLDFVGM 362
AP DPEGL+S + A + L+G+ G+ + + R + L + L L +
Sbjct: 199 APLDPEGLMSHLSAALNALVGVWAGYWLRQPRSVWQRAGV-LALVALASLLLGWLLHPLV 257
Query: 363 HLNKALYSLSYTCLTAGASGVLLAGIYFMVRYISSHLMLK----------------KPFD 406
+NK L++LS+ +T G SG+ LAG +V + H + K FD
Sbjct: 258 PVNKTLWTLSFVLVTVGWSGLFLAGFLAVVDGLGWHRLGKLFALIGVNSIFIYLASAWFD 317
Query: 407 YSYACKSM 414
+ Y +S+
Sbjct: 318 WGYTVESL 325
>gi|423281270|ref|ZP_17260181.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
610]
gi|404583178|gb|EKA87860.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
610]
Length = 385
Score = 42.7 bits (99), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 57/108 (52%), Gaps = 21/108 (19%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSP-WNG--------LTLAD 95
K + PQ R+ ++DVFR LT+ LM+ V+D+ P + + P W G L +D
Sbjct: 2 KTSSPQ----RVAAVDVFRALTMFLMLFVNDI----PGLRNIPHWLGHAAMTEDMLGFSD 53
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIF 139
+ P FLF +G+S++ +N K + + I+ R + L ++G+F
Sbjct: 54 TIFPAFLFCMGMSISFAVQNRYQKGDSPLQVIMHIFWRTVALIVMGLF 101
>gi|149437198|ref|XP_001516670.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
partial [Ornithorhynchus anatinus]
Length = 176
Score = 42.4 bits (98), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 51/100 (51%), Gaps = 10/100 (10%)
Query: 301 CQAPFDPEGLLSSVMATVTCLIGLHFGHLIVHFKD-HRD---RMLNWII----LSSCLIG 352
+ +DPEG+L ++ + V +G+ G +++ +K+ HR R L W + +S L
Sbjct: 5 TKVAYDPEGILGTINSIVMAFLGVQAGKILLFYKEQHRQIMLRFLTWSVVMGLISGVLTK 64
Query: 353 LGLSLDFVGMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
+ FV +NK L+S+SY + + V L IY+ V
Sbjct: 65 FSQNEGFVP--INKNLWSISYVTTLSCFAFVALLLIYYFV 102
>gi|260911058|ref|ZP_05917694.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
gi|260634862|gb|EEX52916.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
Length = 409
Score = 42.4 bits (98), Expect = 0.42, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 48/102 (47%), Gaps = 14/102 (13%)
Query: 83 INHSPWNGLTLADFVMPFFLFIVGVSLALTYKNF-----PCKVVATRKAILRALNLFLLG 137
I H PW G D +MP F+F+ G+++ + + V + + R + L++LG
Sbjct: 84 ITHVPWQGFCFWDIIMPLFMFMSGITIPFSMAKYQRGESKAGVGFLLRLLKRFVVLWVLG 143
Query: 138 IFLQGGFFHGINNLKYGVDIAQIR-WMGVLQRIAIAYLVAAL 178
+ +QG +D Q+ + LQ IA+ Y+V AL
Sbjct: 144 MVVQGNLL--------ALDARQLHLYSNTLQSIAVGYVVVAL 177
>gi|424665794|ref|ZP_18102830.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
616]
gi|404574047|gb|EKA78798.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
616]
Length = 385
Score = 42.4 bits (98), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 57/108 (52%), Gaps = 21/108 (19%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSP-WNG--------LTLAD 95
K + PQ R+ ++DVFR LT+ LM+ V+D+ P + + P W G L +D
Sbjct: 2 KTSSPQ----RVAAVDVFRALTMFLMLFVNDI----PGLRNIPHWLGHAAMTEDMLGFSD 53
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIF 139
+ P FLF +G+S++ +N K + + I+ R + L ++G+F
Sbjct: 54 TIFPAFLFCMGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGLF 101
>gi|406831131|ref|ZP_11090725.1| hypothetical protein SpalD1_05826 [Schlesneria paludicola DSM
18645]
Length = 520
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 40/122 (32%), Positives = 54/122 (44%), Gaps = 9/122 (7%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGG--ILPAINHSPWNGLTLADFVMP 99
S S + RL SLD FRG T+ M+LV+ +G + P I + + AD +MP
Sbjct: 2 STSPASSVPAPSARLTSLDQFRGYTMLGMLLVNYLGSYHVCPQILKHSHDYCSYADTIMP 61
Query: 100 FFLFIVGVSLALTY-KNFPCKVVAT-RKAILRALNLFLLGIFLQG-----GFFHGINNLK 152
FLF G ++ L+ K A +AI R L L L+ I G G N L
Sbjct: 62 QFLFAAGFAMRLSLGKRLAVGGFAPWGRAIRRILGLALVAILWYGYADYRGVVEKFNTLP 121
Query: 153 YG 154
G
Sbjct: 122 VG 123
>gi|167764058|ref|ZP_02436185.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
43183]
gi|167698174|gb|EDS14753.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
43183]
Length = 394
Score = 42.4 bits (98), Expect = 0.47, Method: Compositional matrix adjust.
Identities = 33/112 (29%), Positives = 57/112 (50%), Gaps = 23/112 (20%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGILPAINHSP-W--------NGLTLADFVMPFFLFI 104
+R+ ++DVFR LT+ LM+ V+D+ P + + P W + L +D + P FLF
Sbjct: 7 QRVAAVDVFRALTMFLMLFVNDI----PGLKNVPHWLMHAAADEDMLGFSDTIFPAFLFC 62
Query: 105 VGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIF------LQGGFFH 146
+G+S++ +N K T + I R + L +G+F ++GG H
Sbjct: 63 MGMSVSFAIQNRYKKGDTTTQVIAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114
>gi|313148038|ref|ZP_07810231.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|313136805|gb|EFR54165.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
Length = 385
Score = 42.4 bits (98), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 57/108 (52%), Gaps = 21/108 (19%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINHSP-WNG--------LTLAD 95
K + PQ R+ ++DVFR LT+ LM+ V+D+ P + + P W G L +D
Sbjct: 2 KTSSPQ----RVAAVDVFRALTMFLMLFVNDI----PGLRNIPHWLGHAAMTEDMLGFSD 53
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL----RALNLFLLGIF 139
+ P FLF +G+S++ +N K + + I+ R + L ++G+F
Sbjct: 54 TIFPAFLFCMGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGLF 101
>gi|189464971|ref|ZP_03013756.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
17393]
gi|189437245|gb|EDV06230.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
17393]
Length = 394
Score = 42.0 bits (97), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 62/117 (52%), Gaps = 15/117 (12%)
Query: 45 KQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSPWNG--LTLADFVMP 99
K PQ R+ ++DVFR LT+ LM+ V+D+ G+ +P + H+ N + +D + P
Sbjct: 2 KNLAPQ----RVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHADINEDMMGFSDTIFP 57
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIFL--QGGFFHGINN 150
FLF +G+S++ +N K T + I R + L +G+F GG GI++
Sbjct: 58 AFLFCMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIAGGISH 114
>gi|223937685|ref|ZP_03629587.1| conserved hypothetical protein [bacterium Ellin514]
gi|223893657|gb|EEF60116.1| conserved hypothetical protein [bacterium Ellin514]
Length = 413
Score = 41.6 bits (96), Expect = 0.67, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 52/122 (42%), Gaps = 19/122 (15%)
Query: 38 PPSNSNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAIN------------- 84
P S +T P+ + RLISLD +RG + LM + N
Sbjct: 13 PTQESRPARTVPE-KATRLISLDAYRGFVMLLMASEGFNMWRMAEQNPNSSFWQFLKYQT 71
Query: 85 -HSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRK----AILRALNLFLLGIF 139
H W G L D + P F+F+VGV++ + + K + + R++ L +GIF
Sbjct: 72 EHVDWRGCALWDLIQPSFMFMVGVAMPFSLASRRAKGQSFNTMLGHTLWRSIALVFIGIF 131
Query: 140 LQ 141
L+
Sbjct: 132 LR 133
>gi|374373619|ref|ZP_09631279.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
gi|373234592|gb|EHP54385.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
Length = 397
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 42/68 (61%), Gaps = 9/68 (13%)
Query: 48 RPQHQQRRLISLDVFRGLTVALMILVDDVGGILPAINH------SPWNGLTLADFVMPFF 101
+PQ R + S+D+ RG+T+ LM+ V+D+ P + H + + + LAD+V P F
Sbjct: 2 KPQFAGR-IRSIDIMRGITLCLMLFVNDL--YEPGVPHWLVHTKAETDSMGLADWVFPGF 58
Query: 102 LFIVGVSL 109
LF+VG+S+
Sbjct: 59 LFMVGLSI 66
>gi|325110973|ref|YP_004272041.1| hypothetical protein Plabr_4447 [Planctomyces brasiliensis DSM
5305]
gi|324971241|gb|ADY62019.1| hypothetical protein Plabr_4447 [Planctomyces brasiliensis DSM
5305]
Length = 459
Score = 41.2 bits (95), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 56/260 (21%), Positives = 97/260 (37%), Gaps = 59/260 (22%)
Query: 1 MAKYNPINNDANDQLVIQISKSTSAPANEKLERDPLLPPSNSNSKQTRPQHQQRRLISLD 60
+++ NP+ + + + + Q + S+ A+ + + P S++ R + R+ +D
Sbjct: 7 LSEENPMTSTCDVKPIGQAATGQSSAADVTIHAGSEISPVEKRSRRLR---KYERIAGVD 63
Query: 61 VFR---------------------GLTVALMILVDDVGGILPAINHSPWN-----GLTLA 94
+FR G + + + G+ + S W GL A
Sbjct: 64 LFRGILLLILGLGIGVLSVLELPSGDPIRQVFRYETARGLTNQVTGSAWRNAAGPGLVFA 123
Query: 95 -DFVMPFFLFIVGVSLAL----TYKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGIN 149
DF++ FLF GVSL L +N R I R L LLG+FLQ
Sbjct: 124 RDFLLSGFLFAAGVSLTLWRWKRRQNKSESASHARAIINRVGILILLGVFLQ-------- 175
Query: 150 NLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVALVL 209
G + ++ L +I +A L+A+ L YR WV +L
Sbjct: 176 --SVGSEQTRLVLTSPLAQIGLATLIASCF---------------LTISYRWGWVTVCIL 218
Query: 210 TTLYLLLLYGLYVPDWQYEF 229
+ + Y + VP++ E
Sbjct: 219 LIGHTVWFYTVPVPEYGSEL 238
>gi|187735023|ref|YP_001877135.1| hypothetical protein Amuc_0516 [Akkermansia muciniphila ATCC
BAA-835]
gi|187425075|gb|ACD04354.1| conserved hypothetical protein [Akkermansia muciniphila ATCC
BAA-835]
Length = 370
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 52/198 (26%), Positives = 80/198 (40%), Gaps = 38/198 (19%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILV----------DDVGGILPA----INHSPW 88
N+ + P + RL SLD RGL + +++ + + G L + H+ W
Sbjct: 2 NAASSVPT-RSPRLHSLDALRGLDMLIILGLDALILLLASRNPENGFLQEAARQMTHARW 60
Query: 89 NGLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAIL--RALNLFLLGIFLQGGFFH 146
GL L D V P F+FI G S++ + + +A L RA L LG+ + G
Sbjct: 61 EGLHLYDLVFPVFVFISGASMSFSLAKYTGTSIAPGLFHLWKRAFGLAFLGLLVNG---- 116
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYLVAALCEIWLKGDGHVSSKLSLFRKYRGHWVVA 206
L + +R+ VL I + +A C + + G V+S
Sbjct: 117 ---TLTW---TESMRYASVLGLIGFSCAIAGTCILLCRRTGAVASAAGFI---------- 160
Query: 207 LVLTTLYLLLLYGLYVPD 224
LVL TL L G + PD
Sbjct: 161 LVLITL-LQFTCGNFTPD 177
>gi|149276664|ref|ZP_01882807.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
gi|149232333|gb|EDM37709.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
Length = 359
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 67/148 (45%), Gaps = 32/148 (21%)
Query: 57 ISLDVFRGLTVALMILVD--------------DVGGILPAINHSPWNGLTLADFVMPFFL 102
+SLDV RGL + L+ G++ H PW+GL D V P F+
Sbjct: 1 MSLDVMRGLIMILLCAESCLLYVSLQHLNPAWPASGLVEQFFHHPWHGLRFWDLVQPAFM 60
Query: 103 FIVGVSLALTYKNFPCKVVATRK----AILRALNLFLLGIFLQGGFFHGINNLKYGVDIA 158
F+ G ++ ++Y K + + ++R+L LFL G+ G++ + G +
Sbjct: 61 FMAGAAMYISYSRKLEKGSSWSQNWNHILIRSLKLFLCGV--------GLHCVYAGKPVW 112
Query: 159 QIRWMGVLQRIA----IAYLVAALCEIW 182
++ VL ++A +AYL+ ++W
Sbjct: 113 ELW--NVLTQLAFTSILAYLIIQKSQVW 138
>gi|374372786|ref|ZP_09630447.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
gi|373234862|gb|EHP54654.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
Length = 357
Score = 40.4 bits (93), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 28/102 (27%), Positives = 48/102 (47%), Gaps = 20/102 (19%)
Query: 56 LISLDVFRGLTVALMILVDDVG---------------GILPAINHSPWNGLTLADFVMPF 100
++SLD RGL + L+ L + G GI+ H PW+GL D + P
Sbjct: 1 MLSLDFMRGLIMVLLAL-ESTGLYEHLSHASAGTWFEGIMQQFFHHPWHGLHFWDLIQPG 59
Query: 101 FLFIVGVSLALTYKNFPCKVV----ATRKAILRALNLFLLGI 138
F+F+ GV++A + + + + +K + R+ LF G+
Sbjct: 60 FMFMAGVAMAYSLQKQKQRDYTWNRSLKKTLRRSGWLFFWGV 101
>gi|300867270|ref|ZP_07111930.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300334747|emb|CBN57096.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 486
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 50/116 (43%), Gaps = 17/116 (14%)
Query: 43 NSKQTRPQHQQRRLISLDVFRGLTVALMILVDDVG-GILPA------------INHSPWN 89
N + +R +LD RG+ V M+L + LPA + +
Sbjct: 8 NPQDMSTPAVNKRADALDALRGIAVLAMVLSGTIARKTLPAWMYHAQEPPPSHLFNPKLA 67
Query: 90 GLTLADFVMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALNL-FLLG---IFLQ 141
GLT D V PFFLF +G ++ L K T+K IL L FLLG IFLQ
Sbjct: 68 GLTWVDLVFPFFLFAMGAAIPLALSRRIAKGWDTKKVILSILQRGFLLGSFAIFLQ 123
>gi|373954371|ref|ZP_09614331.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
gi|373890971|gb|EHQ26868.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
Length = 370
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/104 (28%), Positives = 51/104 (49%), Gaps = 18/104 (17%)
Query: 52 QQRRLISLDVFRGLTVALM-----ILVDDV---------GGILPAINHSPWNGLTLADFV 97
RLISLDV RGL + L+ +L +++ G + H+ W GL D V
Sbjct: 8 SSNRLISLDVMRGLIMILLAGESCLLYENLHALHLGGVADGFIEQFFHAQWRGLHFWDCV 67
Query: 98 MPFFLFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLG 137
P F+ + G ++ ++Y + K V ++ +R++ LF+LG
Sbjct: 68 QPGFMLMAGAAMYISYYSKLQKGVTWKQNFKHIAIRSVKLFVLG 111
>gi|224025513|ref|ZP_03643879.1| hypothetical protein BACCOPRO_02253, partial [Bacteroides
coprophilus DSM 18228]
gi|224018749|gb|EEF76747.1| hypothetical protein BACCOPRO_02253 [Bacteroides coprophilus DSM
18228]
Length = 377
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/111 (27%), Positives = 45/111 (40%), Gaps = 20/111 (18%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPA---------------INHSPWNGLTLADF 96
Q+ RL SLD+ RGL + +++ V L H W G L D
Sbjct: 7 QKERLESLDILRGLDLFILVGFQSVFMYLAQATGENNWIKTIFDVLFTHVEWEGFHLWDQ 66
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATR-----KAILRALNLFLLGIFLQG 142
VMP FLF+ G S+ + K + + R + L++ G +QG
Sbjct: 67 VMPLFLFMAGTSIPYAMARYKRKEEEISGKLFYRVLKRVVLLWIFGAIVQG 117
>gi|430744194|ref|YP_007203323.1| hypothetical protein Sinac_3364 [Singulisphaera acidiphila DSM
18658]
gi|430015914|gb|AGA27628.1| hypothetical protein Sinac_3364 [Singulisphaera acidiphila DSM
18658]
Length = 408
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 33/89 (37%), Positives = 49/89 (55%), Gaps = 5/89 (5%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAINHSPWNG--LTLADFVMPFFLFIVGVSLALT 112
R+ S+D FRG TVA M +V+ +GG++ + N + AD +MP F+F G S L+
Sbjct: 9 RINSMDQFRGYTVAGMFVVNFLGGMVAIHSVFKHNNSYFSYADSIMPSFMFACGFSYRLS 68
Query: 113 YKNFPCKV--VAT-RKAILRALNLFLLGI 138
+V AT R AI+R L L L+ +
Sbjct: 69 MLRRLSRVGLAATWRHAIIRGLALVLVSL 97
>gi|383112620|ref|ZP_09933412.1| hypothetical protein BSGG_0509 [Bacteroides sp. D2]
gi|313692974|gb|EFS29809.1| hypothetical protein BSGG_0509 [Bacteroides sp. D2]
Length = 472
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 44/110 (40%), Gaps = 18/110 (16%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVG-GILPA-------------INHSPWNGLTLAD 95
+ R SLD RG + M+L + G+LP + G+T D
Sbjct: 2 NNNNTRASSLDALRGYAILTMVLSGSIAWGVLPGWMYHAQVGPRSNFVFDGSIYGITWVD 61
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIFLQ 141
V PFFLF +G + + N K + RK I LR L IF+Q
Sbjct: 62 LVFPFFLFAMGAAFPFSIGNKYRKGSSRRKIIYDSFLRGFRLTFFAIFIQ 111
>gi|160883836|ref|ZP_02064839.1| hypothetical protein BACOVA_01809 [Bacteroides ovatus ATCC 8483]
gi|237717686|ref|ZP_04548167.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|156110921|gb|EDO12666.1| hypothetical protein BACOVA_01809 [Bacteroides ovatus ATCC 8483]
gi|229453005|gb|EEO58796.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 467
Score = 40.0 bits (92), Expect = 2.3, Method: Compositional matrix adjust.
Identities = 31/108 (28%), Positives = 45/108 (41%), Gaps = 18/108 (16%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG-GILPA-INHS------------PWNGLTLADFV 97
+R SLD RG + M+L + G+LP + H+ + G+T D V
Sbjct: 1 MNKRAFSLDALRGYAIITMVLSGTIASGVLPGWMYHAQVGPRSNFAFDPSFYGITWVDLV 60
Query: 98 MPFFLFIVGVSLALTYKNF----PCKVVATRKAILRALNLFLLGIFLQ 141
PFFLF +G + + N K+ ILR L IF+Q
Sbjct: 61 FPFFLFAMGAAFPFSIGNKLEKGESKLKIAWDCILRGFRLTFFAIFIQ 108
>gi|423299515|ref|ZP_17277540.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
CL09T03C10]
gi|408473324|gb|EKJ91846.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
CL09T03C10]
Length = 467
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 49/110 (44%), Gaps = 24/110 (21%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG-GILPA-------------INHSPWNGLTLADFVM 98
+R ISLD FRG + M+L + G+LP I G+T D V
Sbjct: 1 MKRAISLDAFRGYAIVTMVLSGTIASGVLPGWMYHAQMGPRSNYIFDPQLYGITWVDLVF 60
Query: 99 PFFLFIVGVSLALTY-------KNFPCKVVATRKAILRALNLFLLGIFLQ 141
PFFLF +G ++ + +N K++ + +LR + L IF+Q
Sbjct: 61 PFFLFAMGAAIPFSVGGKIEKGENL-WKIIG--ECVLRGIRLAFFAIFIQ 107
>gi|261880015|ref|ZP_06006442.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270333306|gb|EFA44092.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 477
Score = 40.0 bits (92), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 33/107 (30%), Positives = 46/107 (42%), Gaps = 18/107 (16%)
Query: 53 QRRLISLDVFRGLTVALMILVDD-VGGILPAINHSPWNG-------------LTLADFVM 98
++R +SLD FRG + MIL V G+LPA + G +T D V
Sbjct: 9 KQRALSLDAFRGYAILTMILSGSIVWGVLPAWMYHAQEGPRSGFHFDPTIYGITWVDLVF 68
Query: 99 PFFLFIVGVSLALTYKNFPCKVVATRK----AILRALNLFLLGIFLQ 141
PFFLF +G + + N R+ ILR L + I +Q
Sbjct: 69 PFFLFAMGAAFPFSIGNKITHGKTKRQIVGDIILRYFRLVVFAIAVQ 115
>gi|406831133|ref|ZP_11090727.1| hypothetical protein SpalD1_05836 [Schlesneria paludicola DSM
18645]
Length = 415
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/92 (35%), Positives = 47/92 (51%), Gaps = 11/92 (11%)
Query: 55 RLISLDVFRGLTVALMILVDDVGG---ILPAINHSPWNGLTLADFVMPFFLFIVGVSLAL 111
RL SLD FRG T+ MILV+ +G + P + + + AD +MP F F VG ++ L
Sbjct: 14 RLTSLDQFRGYTMVGMILVNYLGAYKEVTPRLFRHTNDYCSYADTIMPHFFFAVGFAMRL 73
Query: 112 TYKNFPCKVVATRK-----AILRALNLFLLGI 138
+ ++ A K AI R L L L+ I
Sbjct: 74 SLGK---RIEAGGKMPWGRAIRRILGLALVAI 102
>gi|323343607|ref|ZP_08083834.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
33269]
gi|323095426|gb|EFZ38000.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
33269]
Length = 468
Score = 39.7 bits (91), Expect = 2.8, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 58/136 (42%), Gaps = 21/136 (15%)
Query: 52 QQRRLISLDVFRGLTVALMIL-VDDVGGILPA------------INHSPWNGLTLADFVM 98
+Q R +LD RG + MIL + +LPA + + G+T D +
Sbjct: 2 KQERAHALDALRGYAIMTMILSATEAFRVLPAWMYHAQVPPPDHVFNPSIYGITWVDLIF 61
Query: 99 PFFLFIVGVSLALT----YKNFPCKVVATRKAILRALNLFLLGIFLQGGFFHGINN---- 150
PFFLF +G ++ L+ YK RK+ +R L L IF+ F +
Sbjct: 62 PFFLFSMGAAIPLSLGRQYKAGASLRKLCRKSAIRWLKLAFFAIFIYHTFPFMLGYRQEW 121
Query: 151 LKYGVDIAQIRWMGVL 166
L+Y V +A M VL
Sbjct: 122 LRYAVPLAGFALMFVL 137
>gi|298384751|ref|ZP_06994311.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383122973|ref|ZP_09943661.1| hypothetical protein BSIG_0281 [Bacteroides sp. 1_1_6]
gi|251841928|gb|EES70008.1| hypothetical protein BSIG_0281 [Bacteroides sp. 1_1_6]
gi|298263030|gb|EFI05894.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 472
Score = 39.7 bits (91), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 32/110 (29%), Positives = 44/110 (40%), Gaps = 18/110 (16%)
Query: 50 QHQQRRLISLDVFRGLTVALMILVDDVG-GILPA-------------INHSPWNGLTLAD 95
+ R SLD RG + M+L V G+LP + G+T D
Sbjct: 2 NNNSTRASSLDALRGYAILTMVLSGSVAWGVLPGWMYHAQVGPRSNFVFDGSIYGITWVD 61
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIFLQ 141
V PFFLF +G + + N K + R+ I LR L IF+Q
Sbjct: 62 LVFPFFLFAMGAAFPFSIGNKYRKGSSRRRIIYDSLLRGFRLTFFAIFIQ 111
>gi|332662942|ref|YP_004445730.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332331756|gb|AEE48857.1| Protein of unknown function DUF2261, transmembrane
[Haliscomenobacter hydrossis DSM 1100]
Length = 394
Score = 39.7 bits (91), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 37/69 (53%), Gaps = 14/69 (20%)
Query: 55 RLISLDVFRGLTVALMIL-VDDVGGILPAI-------------NHSPWNGLTLADFVMPF 100
RL S+DV+RGL + LM+ V + G + A +H PW G +L D + P
Sbjct: 9 RLGSVDVYRGLVMFLMMAEVLEFGHVAKAFPDSGFWAFLHFHQSHVPWVGCSLHDLIQPS 68
Query: 101 FLFIVGVSL 109
F F+VGV+L
Sbjct: 69 FSFLVGVAL 77
>gi|408674314|ref|YP_006874062.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
gi|387855938|gb|AFK04035.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
Length = 391
Score = 39.7 bits (91), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 42/96 (43%), Gaps = 14/96 (14%)
Query: 51 HQQRRLISLDVFRGLTVALMIL-VDDVGGILPAI-------------NHSPWNGLTLADF 96
+ Q RL S D++RG + LM+ V G + A+ +H W G +L D
Sbjct: 2 NSQNRLTSADIYRGFVMLLMMAEVLHFGKVSEALPESSFWAFLAFHQDHVEWVGCSLHDL 61
Query: 97 VMPFFLFIVGVSLALTYKNFPCKVVATRKAILRALN 132
+ P F F+VGV L + + T A L AL
Sbjct: 62 IQPSFSFLVGVVLPYSIARRLTQREGTNAAFLHALK 97
>gi|373954327|ref|ZP_09614287.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
18603]
gi|373890927|gb|EHQ26824.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
18603]
Length = 473
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 40/84 (47%), Gaps = 18/84 (21%)
Query: 42 SNSKQTRPQHQQRRLISLDVFRGLTVALMILVDDV--GGILP-----------AINHSP- 87
+ + QT PQ R SLD RG + LM+L + GGILP A P
Sbjct: 3 NTTIQTPPQ----RANSLDALRGTAILLMVLSGSIAFGGILPGWMYHAQVPPPAHQFKPD 58
Query: 88 WNGLTLADFVMPFFLFIVGVSLAL 111
G+T D V PFFLF +G ++ L
Sbjct: 59 LPGITWVDLVFPFFLFAMGAAIPL 82
>gi|317477968|ref|ZP_07937151.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
gi|316905882|gb|EFV27653.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
Length = 394
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 57/112 (50%), Gaps = 23/112 (20%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGILPAINHSP-W--------NGLTLADFVMPFFLFI 104
+R+ ++DVFR LT+ LM+ V+D+ P + + P W + + +D + P FLF
Sbjct: 7 QRIAAVDVFRALTMFLMLFVNDI----PGLKNVPHWLMHARMDEDMMGFSDTIFPAFLFC 62
Query: 105 VGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIF------LQGGFFH 146
+G+S++ +N K T + + R + L +G+F ++GG H
Sbjct: 63 MGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114
>gi|270294981|ref|ZP_06201182.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274228|gb|EFA20089.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 394
Score = 39.3 bits (90), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 57/112 (50%), Gaps = 23/112 (20%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGILPAINHSP-W--------NGLTLADFVMPFFLFI 104
+R+ ++DVFR LT+ LM+ V+D+ P + + P W + + +D + P FLF
Sbjct: 7 QRIAAVDVFRALTMFLMLFVNDI----PGLKNVPHWLMHARMDEDMMGFSDTIFPAFLFC 62
Query: 105 VGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIF------LQGGFFH 146
+G+S++ +N K T + + R + L +G+F ++GG H
Sbjct: 63 MGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114
>gi|262407719|ref|ZP_06084267.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294647258|ref|ZP_06724855.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
gi|294807839|ref|ZP_06766624.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
gi|336405627|ref|ZP_08586303.1| hypothetical protein HMPREF0127_03616 [Bacteroides sp. 1_1_30]
gi|345511807|ref|ZP_08791346.1| hypothetical protein BSAG_01256 [Bacteroides sp. D1]
gi|229443754|gb|EEO49545.1| hypothetical protein BSAG_01256 [Bacteroides sp. D1]
gi|262354527|gb|EEZ03619.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292637395|gb|EFF55816.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
gi|294444958|gb|EFG13640.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
gi|335937110|gb|EGM99016.1| hypothetical protein HMPREF0127_03616 [Bacteroides sp. 1_1_30]
Length = 467
Score = 39.3 bits (90), Expect = 4.0, Method: Compositional matrix adjust.
Identities = 30/107 (28%), Positives = 45/107 (42%), Gaps = 18/107 (16%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG-GILPA-INHS------------PWNGLTLADFVM 98
+R SLD RG + M+L + G+LP + H+ + G+T D V
Sbjct: 2 NKRAFSLDALRGYAIITMVLSGTIASGVLPGWMYHAQVGPRSNFAFDPSFYGITWVDLVF 61
Query: 99 PFFLFIVGVSLALTYKNF----PCKVVATRKAILRALNLFLLGIFLQ 141
PFFLF +G + + N K+ +LR L IF+Q
Sbjct: 62 PFFLFAMGAAFPFSIGNKLEKGESKLRIAWDCLLRGFRLTFFAIFIQ 108
>gi|374309893|ref|YP_005056323.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358751903|gb|AEU35293.1| hypothetical protein AciX8_0944 [Granulicella mallensis MP5ACTX8]
Length = 399
Score = 38.9 bits (89), Expect = 4.6, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 52/107 (48%), Gaps = 18/107 (16%)
Query: 53 QRRLISLDVFRGLTVALMI------------LVDDVGGILPAIN--HSPWNGLTLADFVM 98
Q+R ++D +RG +ALM+ D+ + A N H W G++L D +
Sbjct: 12 QQRNSAVDAYRGFVMALMLAEVFRFAFVAKSFPDNFLLHILAYNQSHVEWTGMSLHDMIQ 71
Query: 99 PFFLFIVGVSLALTYKNFPCKVVATR----KAILRALNLFLLGIFLQ 141
P F F+VGV+L + ++ K + + I R+ L LGIFL+
Sbjct: 72 PSFTFLVGVALPYSLRSRRRKGESFKYMLGHTIWRSFLLVALGIFLR 118
>gi|302796998|ref|XP_002980260.1| hypothetical protein SELMODRAFT_419933 [Selaginella moellendorffii]
gi|300151876|gb|EFJ18520.1| hypothetical protein SELMODRAFT_419933 [Selaginella moellendorffii]
Length = 312
Score = 38.9 bits (89), Expect = 4.7, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 22/32 (68%)
Query: 361 GMHLNKALYSLSYTCLTAGASGVLLAGIYFMV 392
G+ +NK LYS SY C TAGA+G + +Y +V
Sbjct: 39 GIKMNKPLYSFSYMCFTAGAAGAVFCLLYILV 70
>gi|428319838|ref|YP_007117720.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
7112]
gi|428243518|gb|AFZ09304.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
7112]
Length = 482
Score = 38.9 bits (89), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 37/106 (34%), Positives = 48/106 (45%), Gaps = 17/106 (16%)
Query: 53 QRRLISLDVFRGLTVALMILVDDVG-GILPA------------INHSPWNGLTLADFVMP 99
+R +LD RG+ V M+L + LPA I ++ GLT D V P
Sbjct: 14 SQRADALDALRGIAVLAMVLSGTIARKTLPAWMYHAQLPPPDHIFNNKLPGLTWVDLVFP 73
Query: 100 FFLFIVGVSLALTYKNFPCKVVATRKAILRALNL-FLL---GIFLQ 141
FFLF +G ++ L K T+K IL L FLL IFLQ
Sbjct: 74 FFLFAMGAAIPLALSRRIAKGWDTKKVILSILQRGFLLASFAIFLQ 119
>gi|423304305|ref|ZP_17282304.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
CL03T00C23]
gi|423310581|ref|ZP_17288565.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
CL03T12C37]
gi|392681752|gb|EIY75109.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
CL03T12C37]
gi|392684891|gb|EIY78211.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
CL03T00C23]
Length = 394
Score = 38.5 bits (88), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 57/112 (50%), Gaps = 23/112 (20%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGILPAINHSP-W--------NGLTLADFVMPFFLFI 104
+R+ ++DVFR LT+ LM+ V+D+ P + + P W + + +D + P FLF
Sbjct: 7 QRIAAVDVFRALTMFLMLFVNDI----PRLKNVPHWLMHARMDEDMMGFSDTIFPAFLFC 62
Query: 105 VGVSLALTYKNFPCKVVATRKAI----LRALNLFLLGIF------LQGGFFH 146
+G+S++ +N K T + + R + L +G+F ++GG H
Sbjct: 63 MGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114
>gi|283781521|ref|YP_003372276.1| hypothetical protein Psta_3761 [Pirellula staleyi DSM 6068]
gi|283439974|gb|ADB18416.1| conserved hypothetical protein [Pirellula staleyi DSM 6068]
Length = 417
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 37/148 (25%), Positives = 59/148 (39%), Gaps = 46/148 (31%)
Query: 55 RLISLDVFRGLTVALMILVDDVGGILPAI-------------------NHSPWNGLTLAD 95
RL+SLD +RG +M+ + G +P + +H W G D
Sbjct: 20 RLLSLDAYRGF---VMLAMASRGFGIPKVAALPQFASHPTWQFLAGQLDHVAWVGSCFWD 76
Query: 96 FVMPFFLFIVGVSLALTYKNFPCKVVATR---------KAILRALNLFLLGIFLQGGFFH 146
+ P F+F+VGV++A + C ++ AI RA+ L LG+FL+
Sbjct: 77 LIQPSFMFMVGVAMA-----YSCAARVSKGDPYWKMLLHAIFRAMVLIALGVFLRSN--- 128
Query: 147 GINNLKYGVDIAQIRWMGVLQRIAIAYL 174
D +M V +I + YL
Sbjct: 129 -------SSDQTNFTFMDVTSQIGLGYL 149
>gi|189468533|ref|ZP_03017318.1| hypothetical protein BACINT_04936 [Bacteroides intestinalis DSM
17393]
gi|189436797|gb|EDV05782.1| hypothetical protein BACINT_04936 [Bacteroides intestinalis DSM
17393]
Length = 393
Score = 38.5 bits (88), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 29/97 (29%), Positives = 52/97 (53%), Gaps = 14/97 (14%)
Query: 54 RRLISLDVFRGLTVALMILVDDVGGI--LPA-INHSPWNG--LTLADFVMPFFLFIVGVS 108
+R+ ++DVFR LT+ M+ V+D+ G+ +P + H+ N + +D + P FLF +G+S
Sbjct: 7 QRVAAVDVFRALTMFFMLFVNDIPGLKNVPHWLMHAEMNEDMMGFSDTIFPAFLFCMGMS 66
Query: 109 LALTYKNFPCK---------VVATRKAILRALNLFLL 136
+ +N K ++ R L A+ LF+L
Sbjct: 67 IPFAIQNRVKKGDTALQIISHISERTVALIAMGLFML 103
>gi|227537867|ref|ZP_03967916.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
gi|227242252|gb|EEI92267.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33300]
Length = 461
Score = 38.5 bits (88), Expect = 6.6, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 13/77 (16%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG-GILPA-INHSP-------WN----GLTLADFVM 98
Q +R SLD RG+ + LM+L + G++P + H+ +N G+T D V
Sbjct: 2 QVKRNNSLDTLRGIAILLMVLSASIAFGVMPGWMYHAQVPPPDHQFNPAIAGITWVDLVF 61
Query: 99 PFFLFIVGVSLALTYKN 115
PFFLF +G ++ L+ +N
Sbjct: 62 PFFLFSMGAAIPLSMQN 78
>gi|148378385|ref|YP_001252926.1| acyltransferase [Clostridium botulinum A str. ATCC 3502]
gi|421838144|ref|ZP_16272110.1| acyltransferase [Clostridium botulinum CFSAN001627]
gi|148287869|emb|CAL81935.1| putative membrane-associated acyltransferase [Clostridium botulinum
A str. ATCC 3502]
gi|409739522|gb|EKN40197.1| acyltransferase [Clostridium botulinum CFSAN001627]
Length = 343
Score = 38.5 bits (88), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 39/69 (56%), Gaps = 6/69 (8%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHS--PWNGL----TLADFVMPFFLFIV 105
+ +R++S+D+ R L+ +IL+ G IL N+ +N LA F +P F+F+
Sbjct: 8 KNKRIVSMDLLRALSTIAVILIHVTGTILYNSNNKSLTYNSSLVLNQLARFSVPAFIFLS 67
Query: 106 GVSLALTYK 114
G LAL+YK
Sbjct: 68 GFGLALSYK 76
>gi|72161944|ref|YP_289601.1| hypothetical protein Tfu_1542 [Thermobifida fusca YX]
gi|71915676|gb|AAZ55578.1| putative membrane protein [Thermobifida fusca YX]
Length = 445
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 37/127 (29%), Positives = 54/127 (42%), Gaps = 21/127 (16%)
Query: 35 PLLPPSNSNSKQTRPQHQQ---------------RRLISLDVFRGLTVALMILVDDVGGI 79
P+ PP +S +TR QH+ RL+ +DV RG+ + M +V G
Sbjct: 14 PMSPPETDSSVETRTQHRAPTPQEGSSPPVKPPVPRLLGVDVARGVAILGMFVVHVGVGW 73
Query: 80 LPAINHSPWNGLTLADFVMPFFLFIVGVSLALTYKNFPCKV-----VATRKAILRALNLF 134
A +P L A F + GVS+AL KV VA + ++RA+ L
Sbjct: 74 TLADGTNPLQPLA-AGRSAALFALLAGVSIALLSGGHDRKVNKDLGVALWRVVVRAVILL 132
Query: 135 LLGIFLQ 141
+LG L
Sbjct: 133 ILGTALT 139
>gi|226947615|ref|YP_002802706.1| acyltransferase family protein [Clostridium botulinum A2 str.
Kyoto]
gi|226843694|gb|ACO86360.1| acyltransferase family protein [Clostridium botulinum A2 str.
Kyoto]
Length = 343
Score = 38.5 bits (88), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 39/69 (56%), Gaps = 6/69 (8%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVGGILPAINHS--PWNGL----TLADFVMPFFLFIV 105
+ +R++S+D+ R L+ +IL+ G IL N+ +N LA F +P F+F+
Sbjct: 8 KDKRIVSMDLLRALSTIAVILIHVTGTILYNSNNKSLTYNSSLVLNQLARFSVPAFIFLS 67
Query: 106 GVSLALTYK 114
G LAL+YK
Sbjct: 68 GFGLALSYK 76
>gi|300772530|ref|ZP_07082400.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300760833|gb|EFK57659.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 461
Score = 38.5 bits (88), Expect = 7.0, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 13/77 (16%)
Query: 52 QQRRLISLDVFRGLTVALMILVDDVG-GILPA-INHSP-------WN----GLTLADFVM 98
Q +R SLD RG+ + LM+L + G++P + H+ +N G+T D V
Sbjct: 2 QVKRNNSLDTLRGIAILLMVLSASIAFGVMPGWMYHAQVPPPDHQFNPAIAGITWVDLVF 61
Query: 99 PFFLFIVGVSLALTYKN 115
PFFLF +G ++ L+ +N
Sbjct: 62 PFFLFSMGAAIPLSMQN 78
>gi|294949096|ref|XP_002786050.1| hypothetical protein Pmar_PMAR023776 [Perkinsus marinus ATCC 50983]
gi|239900158|gb|EER17846.1| hypothetical protein Pmar_PMAR023776 [Perkinsus marinus ATCC 50983]
Length = 124
Score = 38.1 bits (87), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 26/91 (28%), Positives = 33/91 (36%), Gaps = 29/91 (31%)
Query: 248 RGSTGPACNAVGMIDRKILGIQHLYRKPIYSRTKQCSINSPDYGPMPLDAPSWCQAPFDP 307
RGS P CN IDR + G +H+Y P W D
Sbjct: 24 RGSLTPQCNVASNIDRMVFGAEHMYN------------------------PLW-----DS 54
Query: 308 EGLLSSVMATVTCLIGLHFGHLIVHFKDHRD 338
EGLLS++ T +GL G I H +
Sbjct: 55 EGLLSTLPTLATVALGLACGKFIQSRPSHTE 85
>gi|338211620|ref|YP_004655673.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336305439|gb|AEI48541.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 393
Score = 38.1 bits (87), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 51/108 (47%), Gaps = 24/108 (22%)
Query: 55 RLISLDVFRGLTVALMIL-VDDVGGILPAI-------------NHSPWNGLTLADFVMPF 100
R+ S+D +RG + LM+ V + G I A+ +H W G +L D + P
Sbjct: 8 RISSVDAYRGFVMFLMMAEVLEFGHISKALPDSSFWAFLAYNQDHVEWVGCSLHDLIQPS 67
Query: 101 FLFIVGVSLALTY-------KNFPCKVVATRKAILRALNLFLLGIFLQ 141
F F+VGV+L + +NF T + R+L L LGIFL+
Sbjct: 68 FSFLVGVALPYSIASRMAKGQNFGSMFGHT---VQRSLILIFLGIFLR 112
>gi|163849279|ref|YP_001637323.1| hypothetical protein Caur_3756 [Chloroflexus aurantiacus J-10-fl]
gi|222527272|ref|YP_002571743.1| hypothetical protein Chy400_4057 [Chloroflexus sp. Y-400-fl]
gi|163670568|gb|ABY36934.1| membrane protein-like protein [Chloroflexus aurantiacus J-10-fl]
gi|222451151|gb|ACM55417.1| membrane protein-like protein [Chloroflexus sp. Y-400-fl]
Length = 387
Score = 37.7 bits (86), Expect = 9.7, Method: Compositional matrix adjust.
Identities = 35/127 (27%), Positives = 56/127 (44%), Gaps = 25/127 (19%)
Query: 44 SKQTRP----QHQQRRLISLDVFRGLTVALMIL------------VDDVGGILPAINHSP 87
+ +TRP Q RR++++D RG+ + LM L + GG P + S
Sbjct: 7 AAETRPIVVEQQASRRIVAIDALRGIALILMALDHSAFFVGAGLQAESYGG-QPVVLQSA 65
Query: 88 --WNGLTLADFVMPFFLFIVGVSLAL-----TYKNFPCKVVATRKAILRALNLFLLGIFL 140
W L + P F F+ G SLAL + + P TR ++RAL + +L + +
Sbjct: 66 AYWLSGLLTNLASPIFFFLGGYSLALYAAGQSRRGRPASAT-TRFILIRALIILVLDLTI 124
Query: 141 QGGFFHG 147
F+ G
Sbjct: 125 CAWFWRG 131
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.326 0.141 0.445
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 6,654,288,289
Number of Sequences: 23463169
Number of extensions: 282144351
Number of successful extensions: 888740
Number of sequences better than 100.0: 917
Number of HSP's better than 100.0 without gapping: 591
Number of HSP's successfully gapped in prelim test: 326
Number of HSP's that attempted gapping in prelim test: 885226
Number of HSP's gapped (non-prelim): 1692
length of query: 416
length of database: 8,064,228,071
effective HSP length: 145
effective length of query: 271
effective length of database: 8,957,035,862
effective search space: 2427356718602
effective search space used: 2427356718602
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.6 bits)
S2: 78 (34.7 bits)