BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 017365
(373 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|18421151|ref|NP_568500.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
thaliana]
gi|14334592|gb|AAK59474.1| unknown protein [Arabidopsis thaliana]
gi|26983902|gb|AAN86203.1| unknown protein [Arabidopsis thaliana]
gi|332006336|gb|AED93719.1| heparan-alpha-glucosaminide N-acetyltransferase [Arabidopsis
thaliana]
Length = 472
Score = 513 bits (1322), Expect = e-143, Method: Compositional matrix adjust.
Identities = 248/345 (71%), Positives = 290/345 (84%), Gaps = 2/345 (0%)
Query: 1 MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQR--LASLDIFRGLAVALMILVDHAGGD 58
M+EIK E +H L+ + D S + L R LASLDIFRGL VALMILVD AGGD
Sbjct: 1 MAEIKVERSHDQHLLEPKEDTSSSYTRRSLAGNRPRLASLDIFRGLTVALMILVDDAGGD 60
Query: 59 WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
WP I+HAPWNGCNLADFVMPFFLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+L
Sbjct: 61 WPMIAHAPWNGCNLADFVMPFFLFIVGVSIALSLKRISNKFEACKKVGFRTCKLLFWGLL 120
Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
LQGGFSHAPDELTYGVDV M+R CG+LQRIALSYL+V+LVEIFTKD +++ S GRFSIF
Sbjct: 121 LQGGFSHAPDELTYGVDVTMMRFCGILQRIALSYLVVALVEIFTKDSHEENLSTGRFSIF 180
Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
+ Y WHW++AA VLV+YLA LYGTYVPDW+F + +KDS YGK+ +V+CGVR KLNPPCN
Sbjct: 181 KSYYWHWIVAASVLVIYLATLYGTYVPDWEFVVYDKDSVLYGKILSVSCGVRGKLNPPCN 240
Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
AVGY+DR+VLGINHMYHHPAWRRSKACT DSP+EG +R+DAPSWC APFEPEG+LSS+S+
Sbjct: 241 AVGYVDRQVLGINHMYHHPAWRRSKACTDDSPYEGAIRQDAPSWCRAPFEPEGILSSISA 300
Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
ILSTIIGVHFGH+I+H KGH ARLK W++ G LL GLTLHFT+
Sbjct: 301 ILSTIIGVHFGHIILHLKGHSARLKHWISTGLVLLALGLTLHFTH 345
>gi|297812935|ref|XP_002874351.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
lyrata]
gi|297320188|gb|EFH50610.1| hypothetical protein ARALYDRAFT_489556 [Arabidopsis lyrata subsp.
lyrata]
Length = 453
Score = 504 bits (1297), Expect = e-140, Method: Compositional matrix adjust.
Identities = 233/324 (71%), Positives = 277/324 (85%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
++ ++ + QRLASLDIFRGL VALMILVD AGGDWP I+HAPWNGCNLADFVMPF
Sbjct: 3 EIKVERSLAGNNRQRLASLDIFRGLTVALMILVDDAGGDWPMIAHAPWNGCNLADFVMPF 62
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
FLFIVGV+IAL+LKRI ++ +A KKV FRT KLLFWG+LLQGGFSHAPDEL+YGVDV M+
Sbjct: 63 FLFIVGVSIALSLKRISNKFEACKKVCFRTCKLLFWGLLLQGGFSHAPDELSYGVDVTMM 122
Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
R CG+LQRIALSYL+V+L+EIFTKD+ +++ S GR SIF+ Y HW++ VLV+YLA L
Sbjct: 123 RFCGILQRIALSYLVVALIEIFTKDLHEENLSTGRLSIFKSYYCHWIVGVSVLVIYLATL 182
Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
YGTYVPDW+F + +KDS YGK+ +V+CGVR KLNPPCNAVGY+DR+VL INHMYHHPAW
Sbjct: 183 YGTYVPDWEFVVNDKDSILYGKIQSVSCGVRGKLNPPCNAVGYVDRQVLVINHMYHHPAW 242
Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
RRSKA T DSP+EG LR+DAPSWCHAPFEPEG+LSS+S+ILSTIIGVHFGH+IIH +GHL
Sbjct: 243 RRSKAFTDDSPYEGALRQDAPSWCHAPFEPEGILSSISAILSTIIGVHFGHIIIHLQGHL 302
Query: 320 ARLKQWVTMGFALLIFGLTLHFTN 343
ARLK W++ G L GLTLHFT+
Sbjct: 303 ARLKHWISTGLVFLTLGLTLHFTH 326
>gi|224125166|ref|XP_002319516.1| predicted protein [Populus trichocarpa]
gi|222857892|gb|EEE95439.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 503 bits (1296), Expect = e-140, Method: Compositional matrix adjust.
Identities = 238/360 (66%), Positives = 292/360 (81%), Gaps = 7/360 (1%)
Query: 1 MSEIKAETTHHHPLIISE-PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW 59
M+EIKA+ H L I+E D+S Q+ + R+ASLDI+RGL VALMILVD AGG+W
Sbjct: 1 MAEIKADIALDHRLTIAEVTDISAQKPDPKI---RVASLDIYRGLTVALMILVDDAGGEW 57
Query: 60 PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILL 119
P+I HAPWNGCNLADFVMPFFLFIVG+AI LA KRI R AV++VI RTLKLLFWGI+L
Sbjct: 58 PKIGHAPWNGCNLADFVMPFFLFIVGMAIPLAFKRITSRHHAVRRVIVRTLKLLFWGIML 117
Query: 120 QGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR 179
QGGFSHAPD+LTYGVD++ IR CG+LQRIA +YL+V+L+EIFTK Q ++ G SI++
Sbjct: 118 QGGFSHAPDKLTYGVDMKKIRWCGILQRIAFAYLVVALMEIFTKKKQTRELPPGWLSIYK 177
Query: 180 LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNA 239
LY WLM AC+LV+YLA++YGTYVP WQFT+ ++DSADYGKVF V C VR KL+PPCNA
Sbjct: 178 LYSSQWLMGACILVIYLAVIYGTYVPHWQFTVNDRDSADYGKVFTVECAVRGKLDPPCNA 237
Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
VG+IDR++LGINHMY HPAW+RS+ACT++SP+EGP R APSWC APFEPEG+LSS+S++
Sbjct: 238 VGFIDREILGINHMYQHPAWKRSEACTENSPYEGPFRTSAPSWCKAPFEPEGILSSISAV 297
Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGK---FSTTCV 356
LSTIIGVHFGHV+++ +GH ARLK W+ MGFALLI GL LHFT+ + + FS CV
Sbjct: 298 LSTIIGVHFGHVLVYMRGHAARLKHWIVMGFALLILGLVLHFTHAIPLNKQLYTFSYVCV 357
>gi|359487632|ref|XP_003633626.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
N-acetyltransferase-like [Vitis vinifera]
Length = 499
Score = 489 bits (1259), Expect = e-136, Method: Compositional matrix adjust.
Identities = 244/350 (69%), Positives = 285/350 (81%), Gaps = 5/350 (1%)
Query: 10 HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
+ H LIIS+ ++ KT+RLASLDIFRGL VALMILVD AGG+WP I HAPWNG
Sbjct: 41 NQHRLIISDSGFPPEERPQ--KTKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWNG 98
Query: 70 CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
CNLADFVMPFFLFIVGVAIALALKRIPDR A+KKV RTLKLLFWG+LLQG F+ PD+
Sbjct: 99 CNLADFVMPFFLFIVGVAIALALKRIPDRLMAIKKVTLRTLKLLFWGLLLQGSFTQDPDK 158
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
LTYGVD++ IR CG+LQ IAL+YL+V+L+EI TK Q KD S G+FSIF+LYCWHWLM A
Sbjct: 159 LTYGVDMKKIRWCGILQXIALAYLVVALLEITTKKAQAKDLSPGQFSIFKLYCWHWLMGA 218
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
CVL+VY+A+ YGTYVPDW FT+ ++DSADYGKV V CG R KL+PPCN VGYIDR++LG
Sbjct: 219 CVLIVYMAVSYGTYVPDWHFTVHDRDSADYGKVLTVACGARGKLDPPCNVVGYIDREILG 278
Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
+NHMY HPAW RSKAC + SP +GP RKDAPSWC+APFEPEG+LSS+S+ILSTIIGVHFG
Sbjct: 279 MNHMYQHPAWTRSKACNEYSPDKGPFRKDAPSWCYAPFEPEGILSSISAILSTIIGVHFG 338
Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGK---FSTTCV 356
HV++H KGH RLK WV MGFALL+ G+TLHFT + + FS CV
Sbjct: 339 HVLMHLKGHSDRLKHWVVMGFALLVLGITLHFTGAIPLNKQLYTFSYVCV 388
>gi|242059773|ref|XP_002459032.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
gi|241931007|gb|EES04152.1| hypothetical protein SORBIDRAFT_03g044830 [Sorghum bicolor]
Length = 481
Score = 483 bits (1243), Expect = e-134, Method: Compositional matrix adjust.
Identities = 221/325 (68%), Positives = 270/325 (83%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
D +D EK+ +++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPF
Sbjct: 31 DEADDNEKAPRRSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPF 90
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
FLFIVG+AI L+LKRIPDR AV++V+ RTLKLLFWGILLQG +SHAPDELTYGVD++ +
Sbjct: 91 FLFIVGMAIPLSLKRIPDRGRAVRRVVIRTLKLLFWGILLQGRYSHAPDELTYGVDMKHV 150
Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
R G+LQRIAL+YL+V+++EI TKD + +DQS FSIFR+Y W++A C+LV+YLAL+
Sbjct: 151 RWGGILQRIALAYLVVAVLEIVTKDAKIQDQSSSGFSIFRMYLSQWIVACCILVIYLALV 210
Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
YG YVPDW+F + N DS +YGKV VTCG R L+PPCNAVGYIDRKVLGINHMY PAW
Sbjct: 211 YGIYVPDWEFRVRNVDSPNYGKVLTVTCGTRGILDPPCNAVGYIDRKVLGINHMYQKPAW 270
Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
RR +ACT DSP EG R DAP+WC APFEPEG+LSS+S++LSTIIGVH+GHV++H K H
Sbjct: 271 RRHRACTDDSPHEGHFRNDAPAWCVAPFEPEGILSSLSAVLSTIIGVHYGHVLVHMKSHT 330
Query: 320 ARLKQWVTMGFALLIFGLTLHFTNG 344
RL+QWVTMG LL+ G+ LHF++
Sbjct: 331 DRLRQWVTMGICLLVLGIILHFSHA 355
>gi|449454063|ref|XP_004144775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
gi|449490878|ref|XP_004158735.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 490
Score = 479 bits (1234), Expect = e-133, Method: Compositional matrix adjust.
Identities = 241/359 (67%), Positives = 286/359 (79%), Gaps = 4/359 (1%)
Query: 1 MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
M EIK ++T HHP + D SD +K++RLASLDIFRGL VALMILVD AGG+WP
Sbjct: 22 MEEIKPDSTSHHPHRLISVD-SDALLPKPVKSKRLASLDIFRGLTVALMILVDDAGGEWP 80
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120
I HAPW GCNLADFVMPFFLFIVG+AIALALKRIP++ A++KV RTLKLLFWG+LLQ
Sbjct: 81 MIGHAPWYGCNLADFVMPFFLFIVGMAIALALKRIPNQLMAIEKVTLRTLKLLFWGLLLQ 140
Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
GG+SHAPD+LTYGVDVR IRL G+LQRIAL+YL+V+ VE+ ++ Q Q FSIF+
Sbjct: 141 GGYSHAPDKLTYGVDVRKIRLFGILQRIALAYLVVAFVEVLSRKTQSNVQPFNHFSIFKS 200
Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
Y W+WL+ AC+LVVY ALLYG YVPDWQFT+ + +S YG+ F V CGVR L+PPCNAV
Sbjct: 201 YFWNWLVGACILVVYFALLYGIYVPDWQFTVTDSESVYYGRNFTVACGVRGNLDPPCNAV 260
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
GYIDRKVLGINH+Y HPAWRRS+ACT++SP+ G R +APSWC APFEPEG+LSS+S+IL
Sbjct: 261 GYIDRKVLGINHLYAHPAWRRSEACTENSPYAGSFRDNAPSWCFAPFEPEGILSSISAIL 320
Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGK---FSTTCV 356
STIIGVHFGHV+IH + H ARLKQWVTMGF LLI GL LHFT+ + + FS CV
Sbjct: 321 STIIGVHFGHVLIHFQDHSARLKQWVTMGFTLLILGLVLHFTHAIPLNKQLYTFSYVCV 379
>gi|326493552|dbj|BAJ85237.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326511587|dbj|BAJ91938.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 486
Score = 477 bits (1228), Expect = e-132, Method: Compositional matrix adjust.
Identities = 231/374 (61%), Positives = 290/374 (77%), Gaps = 7/374 (1%)
Query: 7 ETTHHHPLIISEPDVSDQQEKS-----HLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
E H + D+ D EK ++R+ASLD+FRGL VALMILVD AGG+WP
Sbjct: 17 EDPDRHRTHEAADDLDDDGEKKASRPSSSSSRRVASLDVFRGLTVALMILVDGAGGEWPV 76
Query: 62 ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG 121
I HAPW+GCNLADFVMPFFLFIVG+AI L+LKRIPDR AV++V+ RTLKLLFWGILLQG
Sbjct: 77 IGHAPWHGCNLADFVMPFFLFIVGMAIPLSLKRIPDRGWAVRRVVIRTLKLLFWGILLQG 136
Query: 122 GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV-GRFSIFRL 180
G+SHAPDELTYGVD++ IR CG+LQRIAL+YL+V+++EI TKD + +DQS G FS+FRL
Sbjct: 137 GYSHAPDELTYGVDMKHIRWCGILQRIALAYLVVAVIEIATKDARVQDQSSSGFFSVFRL 196
Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
Y W++A C+L++YL+L+YG YVPDW+FT+ N DS +YGKV VTCG R L+PPCNAV
Sbjct: 197 YLSQWIVACCILLIYLSLVYGVYVPDWEFTVRNVDSPNYGKVLTVTCGTRGNLSPPCNAV 256
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
GYIDRKVLGINH+Y PAWRR + CT DSP EGP ++DAP+WC +PFEPEGLLSS S++L
Sbjct: 257 GYIDRKVLGINHLYQKPAWRRHRDCTDDSPHEGPFKRDAPAWCASPFEPEGLLSSFSAVL 316
Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGKFST-TCVCLF 359
STIIGVH+GHV++H K H+ RLKQWVTMG ALL+ G+ LHF++ + + T + +C+
Sbjct: 317 STIIGVHYGHVLVHMKSHMDRLKQWVTMGVALLLLGIILHFSHAIPLNKQLYTLSYICVT 376
Query: 360 IYSKVILFQWQPFL 373
+ I+F FL
Sbjct: 377 AGAAGIIFSMLYFL 390
>gi|357126662|ref|XP_003565006.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Brachypodium distachyon]
Length = 485
Score = 466 bits (1199), Expect = e-129, Method: Compositional matrix adjust.
Identities = 223/344 (64%), Positives = 279/344 (81%), Gaps = 1/344 (0%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+++R+ASLD+FRGL VALMILVD AGG+WP I HAPW+GCNLADFVMPFFLFIVG+AI L
Sbjct: 46 RSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWDGCNLADFVMPFFLFIVGMAIPL 105
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
+LKRIPDR AV++V+ RTLKLLFWGILLQGG+SHAPDEL YGVD++ IR CG+LQRIA
Sbjct: 106 SLKRIPDRGRAVRRVVIRTLKLLFWGILLQGGYSHAPDELAYGVDMKHIRWCGILQRIAF 165
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
+YL+V+++EI TKD +DQS FSIFR+Y W++A C+L++YL+L+YG YVPDW+F
Sbjct: 166 AYLVVAVIEIATKDANIQDQSSSGFSIFRMYFSQWIVACCILLIYLSLVYGIYVPDWEFR 225
Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
+ N DS +YGKV VTCG R KL+PPCNAVGYIDRKVLGINH+Y PAWRR +ACT DSP
Sbjct: 226 VRNVDSPNYGKVLTVTCGTRGKLSPPCNAVGYIDRKVLGINHLYQKPAWRRHRACTDDSP 285
Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
EGP + DAP+WC +PFEPEGLLSS S++LSTIIGVH+GHV++H K H+ RLKQWVTMG
Sbjct: 286 HEGPFKSDAPAWCASPFEPEGLLSSFSAVLSTIIGVHYGHVLVHMKSHMDRLKQWVTMGI 345
Query: 331 ALLIFGLTLHFTNGEHGSGKFST-TCVCLFIYSKVILFQWQPFL 373
ALL+ G+ LHF++ + + T + +C+ + I+F FL
Sbjct: 346 ALLLLGIILHFSHAIPLNKQLYTFSYICVTAGAAGIVFSMLYFL 389
>gi|212723192|ref|NP_001131974.1| uncharacterized protein LOC100193372 [Zea mays]
gi|194693076|gb|ACF80622.1| unknown [Zea mays]
gi|413951397|gb|AFW84046.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
Length = 484
Score = 460 bits (1184), Expect = e-127, Method: Compositional matrix adjust.
Identities = 221/331 (66%), Positives = 271/331 (81%), Gaps = 6/331 (1%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
D +D EK+ ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPF
Sbjct: 28 DEADANEKAPRPSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPF 87
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
FLFIVG+A+ LALKRIPDR AV++V+ RTLKLLFWGILLQGG+SHAPDEL YGVD+R +
Sbjct: 88 FLFIVGMAVPLALKRIPDRGRAVRRVVVRTLKLLFWGILLQGGYSHAPDELAYGVDMRHV 147
Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQD--KDQ---SVGRFS-IFRLYCWHWLMAACVLV 193
R G+LQRIAL+YL+V+++E+ TKD +DQ S GRFS +FR+Y W++A C+LV
Sbjct: 148 RWGGILQRIALAYLVVAVLEMVTKDGAKVHQDQPPGSSGRFSRVFRMYLSQWIVACCILV 207
Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
VYL+L YG YVPDW+F + N DS DYGKV V CG R L+PPCNAVGYIDR+VLGINHM
Sbjct: 208 VYLSLAYGVYVPDWEFRVRNADSPDYGKVLTVRCGTRGALDPPCNAVGYIDRRVLGINHM 267
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
Y PAWRR +ACT DSP EGP R+DAP+WC APFEPEG+LSS+S++LST++GVH+GHV++
Sbjct: 268 YQKPAWRRHRACTDDSPHEGPFREDAPAWCVAPFEPEGILSSLSAVLSTVVGVHYGHVLV 327
Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNG 344
H K H RL+QWVTMG ALL+ G+ LHF++
Sbjct: 328 HMKSHTDRLRQWVTMGVALLVLGIILHFSHA 358
>gi|296089693|emb|CBI39512.3| unnamed protein product [Vitis vinifera]
Length = 481
Score = 457 bits (1176), Expect = e-126, Method: Compositional matrix adjust.
Identities = 233/350 (66%), Positives = 270/350 (77%), Gaps = 23/350 (6%)
Query: 10 HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
+ H LIIS+ ++ KT+RLASLDIFRGL VALMILVD AGG+WP I HAPWNG
Sbjct: 41 NQHRLIISDSGFPPEERPQ--KTKRLASLDIFRGLTVALMILVDDAGGEWPMIGHAPWNG 98
Query: 70 CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
CNLADFVMPFFLFIVGVAIALALKRIPDR A+KKV RTLKLLFWG+LLQG F+ PD+
Sbjct: 99 CNLADFVMPFFLFIVGVAIALALKRIPDRLMAIKKVTLRTLKLLFWGLLLQGSFTQDPDK 158
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
LTYGVD++ IR CG+LQ Q KD S G+FSIF+LYCWHWLM A
Sbjct: 159 LTYGVDMKKIRWCGILQ------------------AQAKDLSPGQFSIFKLYCWHWLMGA 200
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
CVL+VY+A+ YGTYVPDW FT+ ++DSADYGKV V CG R KL+PPCN VGYIDR++LG
Sbjct: 201 CVLIVYMAVSYGTYVPDWHFTVHDRDSADYGKVLTVACGARGKLDPPCNVVGYIDREILG 260
Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
+NHMY HPAW RSKAC + SP +GP RKDAPSWC+APFEPEG+LSS+S+ILSTIIGVHFG
Sbjct: 261 MNHMYQHPAWTRSKACNEYSPDKGPFRKDAPSWCYAPFEPEGILSSISAILSTIIGVHFG 320
Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGK---FSTTCV 356
HV++H KGH RLK WV MGFALL+ G+TLHFT + + FS CV
Sbjct: 321 HVLMHLKGHSDRLKHWVVMGFALLVLGITLHFTGAIPLNKQLYTFSYVCV 370
>gi|356572978|ref|XP_003554642.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 464
Score = 457 bits (1175), Expect = e-126, Method: Compositional matrix adjust.
Identities = 242/369 (65%), Positives = 291/369 (78%), Gaps = 7/369 (1%)
Query: 1 MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
M+EIK E H L +SE +K+ KT+R+ASLDIFRGL VALMILVD AGG WP
Sbjct: 1 MAEIKGE----HSLNVSEE--LPLSDKNLPKTKRVASLDIFRGLTVALMILVDDAGGQWP 54
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120
I HAPWNGCNLADFVMPFFLFIVG+AI LALKRIP+R AVKKVI RTLKLLFWG+LLQ
Sbjct: 55 MIGHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLLLQ 114
Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
GGFSHAPD LTYGVD++ IR CG+LQRIAL+YL+V+LVEIF++ Q +D SIF+L
Sbjct: 115 GGFSHAPDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSAQARDPEPTHLSIFKL 174
Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
Y WHWL+ AC+L VYLALLYG +VPDWQFT+ N DS G VTCGVR KL+PPCNAV
Sbjct: 175 YYWHWLVGACILAVYLALLYGIHVPDWQFTVHNPDSIYNGTTLTVTCGVRGKLDPPCNAV 234
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
GYIDR+V+GINHMY PAWRRS+ACT++SP+EGP +K+APSWC+APFEPEG+LSS+S+IL
Sbjct: 235 GYIDREVIGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISAIL 294
Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGKFST-TCVCLF 359
STIIG+HFGHV+IH + H +RLK W+ +G ALL GL LHFT+ + + T + VC+
Sbjct: 295 STIIGLHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVCVT 354
Query: 360 IYSKVILFQ 368
+ +LF
Sbjct: 355 SGAAALLFS 363
>gi|356504028|ref|XP_003520801.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 465
Score = 456 bits (1174), Expect = e-126, Method: Compositional matrix adjust.
Identities = 245/371 (66%), Positives = 293/371 (78%), Gaps = 10/371 (2%)
Query: 1 MSEIKAETTHHHPLIISE--PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD 58
M+EIK E H L +S+ P+VSD K+ KT+R+ASLDIFRGL VALMILVD AG
Sbjct: 1 MAEIKGE----HSLNVSQELPEVSD---KNLPKTKRVASLDIFRGLTVALMILVDDAGEQ 53
Query: 59 WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGIL 118
WP I HAPWNGCNLADFVMPFFLFIVG+AI LALKRIP+R AVKKVI RTLKLLFWG+L
Sbjct: 54 WPMIGHAPWNGCNLADFVMPFFLFIVGMAIPLALKRIPNRLLAVKKVIVRTLKLLFWGLL 113
Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
LQGGFSHAPD LTYGVD++ IR CG+LQRIAL+YL+V+LVEIF++ Q +D SIF
Sbjct: 114 LQGGFSHAPDNLTYGVDMKHIRWCGILQRIALAYLVVALVEIFSRSTQARDPEPTHLSIF 173
Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
LY WHWL+ AC+LVVYLALLYG +VPDW FT+ N DS G VTCGVR KL+PPCN
Sbjct: 174 NLYYWHWLVGACILVVYLALLYGIHVPDWGFTVHNPDSIYNGTTLTVTCGVRGKLDPPCN 233
Query: 239 AVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
AVGYIDR+VLGINHMY PAWRRS+ACT++SP+EGP +K+APSWC+APFEPEG+LSS+S+
Sbjct: 234 AVGYIDREVLGINHMYKRPAWRRSEACTENSPYEGPFKKNAPSWCYAPFEPEGILSSISA 293
Query: 299 ILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGKFST-TCVC 357
ILSTIIG+HFGHV+IH + H +RLK W+ +G ALL GL LHFT+ + + T + VC
Sbjct: 294 ILSTIIGLHFGHVLIHLQDHPSRLKHWLLLGLALLTSGLILHFTHAIPLNKQLYTLSYVC 353
Query: 358 LFIYSKVILFQ 368
+ + +LF
Sbjct: 354 VTSGAAALLFS 364
>gi|413951398|gb|AFW84047.1| hypothetical protein ZEAMMB73_047978 [Zea mays]
Length = 503
Score = 449 bits (1156), Expect = e-124, Method: Compositional matrix adjust.
Identities = 221/350 (63%), Positives = 271/350 (77%), Gaps = 25/350 (7%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
D +D EK+ ++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPF
Sbjct: 28 DEADANEKAPRPSRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPF 87
Query: 80 FLFIVGVAIALALK-------------------RIPDRADAVKKVIFRTLKLLFWGILLQ 120
FLFIVG+A+ LALK RIPDR AV++V+ RTLKLLFWGILLQ
Sbjct: 88 FLFIVGMAVPLALKVRRRRRSSRPSVVHAMHAHRIPDRGRAVRRVVVRTLKLLFWGILLQ 147
Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD--KDQ---SVGRF 175
GG+SHAPDEL YGVD+R +R G+LQRIAL+YL+V+++E+ TKD +DQ S GRF
Sbjct: 148 GGYSHAPDELAYGVDMRHVRWGGILQRIALAYLVVAVLEMVTKDGAKVHQDQPPGSSGRF 207
Query: 176 S-IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
S +FR+Y W++A C+LVVYL+L YG YVPDW+F + N DS DYGKV V CG R L+
Sbjct: 208 SRVFRMYLSQWIVACCILVVYLSLAYGVYVPDWEFRVRNADSPDYGKVLTVRCGTRGALD 267
Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
PPCNAVGYIDR+VLGINHMY PAWRR +ACT DSP EGP R+DAP+WC APFEPEG+LS
Sbjct: 268 PPCNAVGYIDRRVLGINHMYQKPAWRRHRACTDDSPHEGPFREDAPAWCVAPFEPEGILS 327
Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNG 344
S+S++LST++GVH+GHV++H K H RL+QWVTMG ALL+ G+ LHF++
Sbjct: 328 SLSAVLSTVVGVHYGHVLVHMKSHTDRLRQWVTMGVALLVLGIILHFSHA 377
>gi|222619812|gb|EEE55944.1| hypothetical protein OsJ_04649 [Oryza sativa Japonica Group]
Length = 846
Score = 448 bits (1152), Expect = e-123, Method: Compositional matrix adjust.
Identities = 224/353 (63%), Positives = 279/353 (79%), Gaps = 12/353 (3%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI L+
Sbjct: 408 SRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLS 467
Query: 92 LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALS 151
LKRIPDR AV++V+ RTLKLLFWGILLQGG+SHAPD+L+YGVD++ +R CG+LQRIAL+
Sbjct: 468 LKRIPDRGRAVRRVVLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCGILQRIALA 527
Query: 152 YLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
YL+V+++EI TK+ + +DQS FSIFR+Y W++A C+LV+YL+L+YG YVPDW F +
Sbjct: 528 YLVVAVLEIVTKNAKVQDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIYVPDWDFRV 587
Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
+ + ++GK+ VTCG R KL+PPCNAVGYIDRKVLGINHMYH PAWRR K CT DSP
Sbjct: 588 SDVKNPNFGKILTVTCGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHKDCTDDSPH 647
Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
EGP + D+P+WC+APFEPEGLLSS+S++LSTIIGVH+GHV++H K H RLKQW MG
Sbjct: 648 EGPFKTDSPAWCYAPFEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLKQWSIMGIT 707
Query: 332 LLIFGLTLHFTNGEHGSGK---FSTTCV---------CLFIYSKVILFQWQPF 372
LLI GLTLHF++ + + FS CV C+F + IL PF
Sbjct: 708 LLILGLTLHFSHAIPLNKQLYTFSYICVTAGAAGIVFCMFYFLVDILNLHYPF 760
>gi|115442029|ref|NP_001045294.1| Os01g0931100 [Oryza sativa Japonica Group]
gi|57899654|dbj|BAD87323.1| unknown protein [Oryza sativa Japonica Group]
gi|57900117|dbj|BAD88179.1| unknown protein [Oryza sativa Japonica Group]
gi|113534825|dbj|BAF07208.1| Os01g0931100 [Oryza sativa Japonica Group]
gi|215697092|dbj|BAG91086.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 488
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 224/353 (63%), Positives = 279/353 (79%), Gaps = 12/353 (3%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
++R+ASLD+FRGL VALMILVD AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI L+
Sbjct: 50 SRRVASLDVFRGLTVALMILVDGAGGEWPVIGHAPWNGCNLADFVMPFFLFIVGMAIPLS 109
Query: 92 LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALS 151
LKRIPDR AV++V+ RTLKLLFWGILLQGG+SHAPD+L+YGVD++ +R CG+LQRIAL+
Sbjct: 110 LKRIPDRGRAVRRVVLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHVRWCGILQRIALA 169
Query: 152 YLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
YL+V+++EI TK+ + +DQS FSIFR+Y W++A C+LV+YL+L+YG YVPDW F +
Sbjct: 170 YLVVAVLEIVTKNAKVQDQSSSGFSIFRMYFSQWIVACCILVIYLSLVYGIYVPDWDFRV 229
Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
+ + ++GK+ VTCG R KL+PPCNAVGYIDRKVLGINHMYH PAWRR K CT DSP
Sbjct: 230 SDVKNPNFGKILTVTCGTRGKLSPPCNAVGYIDRKVLGINHMYHRPAWRRHKDCTDDSPH 289
Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
EGP + D+P+WC+APFEPEGLLSS+S++LSTIIGVH+GHV++H K H RLKQW MG
Sbjct: 290 EGPFKTDSPAWCYAPFEPEGLLSSLSAVLSTIIGVHYGHVLVHMKSHTDRLKQWSIMGIT 349
Query: 332 LLIFGLTLHFTNGEHGSGK---FSTTCV---------CLFIYSKVILFQWQPF 372
LLI GLTLHF++ + + FS CV C+F + IL PF
Sbjct: 350 LLILGLTLHFSHAIPLNKQLYTFSYICVTAGAAGIVFCMFYFLVDILNLHYPF 402
>gi|224123608|ref|XP_002330163.1| predicted protein [Populus trichocarpa]
gi|222871619|gb|EEF08750.1| predicted protein [Populus trichocarpa]
Length = 468
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 222/359 (61%), Positives = 275/359 (76%), Gaps = 5/359 (1%)
Query: 1 MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
M++IKA ++ L+I+ D + +R+ASLDIFRGL VALMILVD AGG+WP
Sbjct: 1 MADIKAYISYAKRLLIA--DGTHFSAPKPDPERRVASLDIFRGLTVALMILVDDAGGEWP 58
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120
++ HAPW+G NLADFVMPFFLFIVG+AI L K I R AVKK+I RTLKLLFWGI+LQ
Sbjct: 59 KMGHAPWHGSNLADFVMPFFLFIVGMAIPLTFKGITSRDHAVKKMIVRTLKLLFWGIMLQ 118
Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
GGFSHAPD+L+YGVD++ IR CG+LQRIA +YL+++L+EIFTK Q KD GR SIFRL
Sbjct: 119 GGFSHAPDKLSYGVDMKKIRWCGILQRIAFAYLVMALMEIFTKKDQTKDLPPGRLSIFRL 178
Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
Y WL+ AC+LVVYLA++YG YVP WQFT+ +++S+DYGKVF V C VR KL+P CNA+
Sbjct: 179 YGSQWLVGACILVVYLAVIYGMYVPHWQFTVNDEESSDYGKVFTVECAVRGKLDPACNAI 238
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
YIDRK+LGINH+Y HPAW+RS+ACT+ S +E P + AP+WC APFEP+G+LSS+SS+L
Sbjct: 239 AYIDRKILGINHLYQHPAWKRSEACTEASLYEAPFQTSAPTWCKAPFEPDGILSSISSVL 298
Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGK---FSTTCV 356
STI G HFGHV +H KG ARLK W MG ALLI GL LHFT+ + + FS CV
Sbjct: 299 STITGAHFGHVHVHLKGDTARLKHWTVMGLALLILGLVLHFTHAMPLNKQLYTFSYVCV 357
>gi|357511851|ref|XP_003626214.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
gi|355501229|gb|AES82432.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
Length = 483
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 226/360 (62%), Positives = 276/360 (76%), Gaps = 21/360 (5%)
Query: 1 MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVA------------- 47
M EI E + H ++SE + +E K +R+ASLDIFRGL VA
Sbjct: 1 MEEIIGEHSVH---VVSEVEPVSAKELPK-KVKRVASLDIFRGLTVADGDLTVFVAVKYR 56
Query: 48 ---LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKK 104
LMILVD AGG+WP I HAPWNGCNLADFVMPFFLFIVG+AI L+LK+IP++ AVKK
Sbjct: 57 AKQLMILVDDAGGEWPAIGHAPWNGCNLADFVMPFFLFIVGMAIPLSLKKIPNKLLAVKK 116
Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
VI RTLKLLFWG+LLQGG+SHAPD L+YGVD++ IR CG+LQRIAL+YL+V+LVEI ++
Sbjct: 117 VIVRTLKLLFWGLLLQGGYSHAPDHLSYGVDMKHIRWCGILQRIALAYLVVALVEIISRS 176
Query: 165 VQDKDQ-SVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF 223
QD+D SIF LY WHWL+AAC+LVVY+ LLYG +VPDWQFT+ N DS G F
Sbjct: 177 RQDRDDPEPTNLSIFTLYYWHWLVAACILVVYMPLLYGIHVPDWQFTVHNPDSIYNGTTF 236
Query: 224 NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWC 283
VTCGVR KL+PPCNAVGYIDR+VLGINH+Y PA RRS+ACT P+EGP +K AP+WC
Sbjct: 237 TVTCGVRGKLDPPCNAVGYIDREVLGINHVYKKPASRRSEACTVKPPYEGPFKKTAPAWC 296
Query: 284 HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
+APFEPEG+LSS+S+ILSTIIG+H+GHV+IH + HL+RLKQW+ +G ALL G LHF++
Sbjct: 297 YAPFEPEGILSSISAILSTIIGLHYGHVLIHLQDHLSRLKQWILLGLALLTLGFILHFSH 356
>gi|255556868|ref|XP_002519467.1| conserved hypothetical protein [Ricinus communis]
gi|223541330|gb|EEF42881.1| conserved hypothetical protein [Ricinus communis]
Length = 519
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 196/355 (55%), Positives = 243/355 (68%), Gaps = 18/355 (5%)
Query: 5 KAETTHHHPLIISEPDVSD-----QQEKSHL----------KTQRLASLDIFRGLAVALM 49
K + TH +I E +++ +QE L KT+R+A+LD FRGL V LM
Sbjct: 42 KLDKTHDGGGVIPEKELTSSTVLVEQEGEQLQQPEQLPVKQKTKRVATLDAFRGLTVVLM 101
Query: 50 ILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRT 109
ILVD+AG + I H+PWNGC LADFVMPFFLFIVGVAIALALKRIP + DAVKK+ RT
Sbjct: 102 ILVDNAGESYARIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPRKRDAVKKISLRT 161
Query: 110 LKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD 169
LKLLFWGILLQGG+SHAP +L+YGVD+++IR CG+LQRIAL Y+ V+L+E T +
Sbjct: 162 LKLLFWGILLQGGYSHAPVDLSYGVDMKLIRWCGILQRIALVYMFVALIETLTIKERQTV 221
Query: 170 QSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV 229
FSIF Y W W+ ++Y+ Y YVPDW FT + + + V CG+
Sbjct: 222 LQPNHFSIFTAYRWQWIGGFIAFLIYMITTYALYVPDWSFTAYDDNRPTR---YTVKCGM 278
Query: 230 RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEP 289
R L P CNAVGY+DR+V GINH+Y +P W R KACT SP GPLR DAPSWC APFEP
Sbjct: 279 RGHLGPACNAVGYVDREVWGINHLYQYPVWSRLKACTFSSPATGPLRADAPSWCLAPFEP 338
Query: 290 EGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNG 344
EGLLS++S+ILS IG+H+GHV+IH KGH RLKQWV+MG L + + LHFT+
Sbjct: 339 EGLLSTISAILSGTIGIHYGHVLIHFKGHSERLKQWVSMGLGLFLIAIILHFTDA 393
>gi|356503734|ref|XP_003520659.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 508
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 189/323 (58%), Positives = 233/323 (72%), Gaps = 2/323 (0%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
+Q KT+R+A+LD FRGL + LMILVD AG +P I H+PWNGC LADFVMPFFL
Sbjct: 62 EQEQPPVKQKTKRVATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFL 121
Query: 82 FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
FIVGVAIALALKRI +VKK+I RTLKLLFWGI+LQGG+SHAPD+L YGV+++ IR
Sbjct: 122 FIVGVAIALALKRISKIKHSVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRW 181
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
CG+LQRIAL Y +V+L+E FT ++ + G SIF Y W W ++Y+ +
Sbjct: 182 CGILQRIALVYCVVALIETFTTKLRPTTLASGHLSIFAAYKWQWFGGFVAFLIYMITTFS 241
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
YVPDW F ++ + D K + V CG+R L P CNAVG++DR+V G+NH+Y P WRR
Sbjct: 242 LYVPDWSF--VDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRR 299
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
KACT SP GP R DAPSWC APFEPEGLLSS+S+ILS IG+H+GHV+IH KGH R
Sbjct: 300 LKACTFSSPGSGPFRDDAPSWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHSER 359
Query: 322 LKQWVTMGFALLIFGLTLHFTNG 344
LKQWV+MGF LLI + LHFT+
Sbjct: 360 LKQWVSMGFVLLIIAIILHFTDA 382
>gi|224069583|ref|XP_002326379.1| predicted protein [Populus trichocarpa]
gi|222833572|gb|EEE72049.1| predicted protein [Populus trichocarpa]
Length = 496
Score = 400 bits (1027), Expect = e-109, Method: Compositional matrix adjust.
Identities = 197/350 (56%), Positives = 248/350 (70%), Gaps = 8/350 (2%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
D+Q K++R+A+LD FRGL + LMILVD AGG +P I H+PWNGC LADFVMPFFL
Sbjct: 51 GDRQPVVKQKSKRVATLDAFRGLTIVLMILVDDAGGVYPRIDHSPWNGCTLADFVMPFFL 110
Query: 82 FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
FIVGVAIALA KRIP R DAVKK+I RTLKLLFWG+LLQGG+SHAP +L YGVD+++IR
Sbjct: 111 FIVGVAIALAFKRIPKRRDAVKKIILRTLKLLFWGVLLQGGYSHAPSDLAYGVDMKLIRW 170
Query: 142 CGVL-QRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
G+L QRIAL Y++V+L+E + K+ Q + F+IF Y W W+ V+Y+
Sbjct: 171 FGILQQRIALVYMVVALIEALIPKNRQTIEPD--HFTIFTAYRWQWIAGFISFVIYMVTT 228
Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
+ YVPDW FT+ D + + V CG+R L P CNAVGY+DR+V GINH+Y +P W
Sbjct: 229 FALYVPDWSFTV---DEDHERRRYTVECGMRGHLGPACNAVGYVDREVWGINHLYQYPVW 285
Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
R KACT SP GP RKDAPSWC APFEPEGLLSS+S+ILS IG+H+GHV+IH KGH
Sbjct: 286 SRLKACTLSSPGSGPFRKDAPSWCRAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKGHA 345
Query: 320 ARLKQWVTMGFALLIFGLTLHFTNGEHGSGK-FSTTCVCLFIYSKVILFQ 368
RL+QWV+MG LLI + LHFT+ + + +S + VC + I+F
Sbjct: 346 ERLRQWVSMGVILLIVAIILHFTDAIPINKQLYSFSYVCFTAGAAGIVFS 395
>gi|359481929|ref|XP_002268831.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Vitis vinifera]
gi|297739972|emb|CBI30154.3| unnamed protein product [Vitis vinifera]
Length = 489
Score = 397 bits (1021), Expect = e-108, Method: Compositional matrix adjust.
Identities = 187/322 (58%), Positives = 231/322 (71%), Gaps = 3/322 (0%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
++Q K++R+A+LD FRGL + LMILVD AGG + I H+PWNGC LADFVMPFFLF
Sbjct: 45 EEQPLIKQKSKRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLF 104
Query: 83 IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
IVGVA+ALALK+IP + AVKK+ RTLKLLFWGILLQGG+SHAPD+L+YGVD++ IR
Sbjct: 105 IVGVAVALALKKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWF 164
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
G+LQRIA+ Y +V+L+E T + G FSI Y W W+ ++Y+ Y
Sbjct: 165 GILQRIAVVYFVVALIETLTTKRRPTVIDSGHFSILSAYKWQWIGGFVAFLIYMITTYAL 224
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
YVPDW F I D K + V CG+R L P CNAVGY+DR+V GINH+Y P W R
Sbjct: 225 YVPDWSFVI---DQDHEAKRYTVKCGMRGHLGPACNAVGYVDRQVWGINHLYSQPVWTRL 281
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
KACT SP GP R+DAPSWC+APFEPEGLLS++S+ILS IG+H+GHV+IH KGH RL
Sbjct: 282 KACTLSSPNSGPFREDAPSWCYAPFEPEGLLSTISAILSGTIGIHYGHVLIHFKGHAERL 341
Query: 323 KQWVTMGFALLIFGLTLHFTNG 344
KQWV+MG LLI + LHFT+
Sbjct: 342 KQWVSMGIVLLIVAIILHFTDA 363
>gi|356548323|ref|XP_003542552.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 419
Score = 393 bits (1009), Expect = e-107, Method: Compositional matrix adjust.
Identities = 184/296 (62%), Positives = 218/296 (73%), Gaps = 3/296 (1%)
Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
M+LVD AGG +P I H+PWNGC LADFVMPFFLFIVGVAIALALKRIP AVKK+I R
Sbjct: 1 MVLVDDAGGAYPRIDHSPWNGCTLADFVMPFFLFIVGVAIALALKRIPKVKYAVKKIILR 60
Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
TLKLLFWGILLQGG+SHAPD+L+YGVD+R IR CG+LQRIAL Y +V+L+E +T ++
Sbjct: 61 TLKLLFWGILLQGGYSHAPDDLSYGVDMRFIRWCGILQRIALVYCVVALIETYTTKLRPS 120
Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCG 228
G SIF Y W WL V+Y+ ++ YVPDW F N D K + V CG
Sbjct: 121 TLKPGHLSIFTAYRWQWLGGFVAFVIYMVTIFSLYVPDWSFVDYNSDKP---KRYTVECG 177
Query: 229 VRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFE 288
+R L P CNAVGY+DR+V G+NH+Y P W R KACT SP EGPLRK+AP+WC APFE
Sbjct: 178 MRGHLGPACNAVGYVDRQVWGVNHLYSQPVWTRLKACTLSSPAEGPLRKNAPAWCRAPFE 237
Query: 289 PEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNG 344
PEG LSSV +ILS IG+H+GHV+IH KGH RLKQW++MGF LL GL LHFT+
Sbjct: 238 PEGFLSSVLAILSGTIGIHYGHVLIHFKGHFERLKQWLSMGFVLLTLGLILHFTDA 293
>gi|356570776|ref|XP_003553560.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
N-acetyltransferase-like [Glycine max]
Length = 509
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 183/323 (56%), Positives = 228/323 (70%), Gaps = 2/323 (0%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
+Q KT+R+A+LD FRGL + LMILVD AG +P I H+PWNGC LADFVMPFFL
Sbjct: 63 EQEQPVVKQKTKRIATLDAFRGLTIVLMILVDDAGEAYPRIDHSPWNGCTLADFVMPFFL 122
Query: 82 FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
FIVG+AIALALKRI AVKK+I RTLKLLFWGI+LQGG+SHAPD+L YGV+++ IR
Sbjct: 123 FIVGIAIALALKRIAKIKHAVKKIILRTLKLLFWGIILQGGYSHAPDDLEYGVNMKFIRW 182
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
CG+LQRIAL Y +V+L+E FT ++ + G SIF Y W W ++Y+ +
Sbjct: 183 CGILQRIALVYCVVALIETFTTKLRPTTLASGHLSIFTAYKWQWFGGFVAFIIYMITTFT 242
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
YVP W F ++ + D K + V CG+R L P CNAVG++DR+V G+NH+Y P WRR
Sbjct: 243 LYVPHWSF--LDHFNGDEPKRYTVICGMRGHLGPACNAVGHVDRQVWGVNHLYSQPVWRR 300
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
K SP GP R DAPSWC +PFEPEGLLSS+S+ILS IG+H+GH++IH KGH R
Sbjct: 301 LKMTIDYSPASGPFRDDAPSWCRSPFEPEGLLSSISAILSGTIGIHYGHILIHFKGHSER 360
Query: 322 LKQWVTMGFALLIFGLTLHFTNG 344
LKQWV MGF LLI + LHFT+
Sbjct: 361 LKQWVLMGFVLLIIAIILHFTDA 383
>gi|147817637|emb|CAN64496.1| hypothetical protein VITISV_004036 [Vitis vinifera]
Length = 511
Score = 387 bits (993), Expect = e-105, Method: Compositional matrix adjust.
Identities = 188/344 (54%), Positives = 231/344 (67%), Gaps = 25/344 (7%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
++Q K++R+A+LD FRGL + LMILVD AGG + I H+PWNGC LADFVMPFFLF
Sbjct: 45 EEQPLIKQKSKRVATLDAFRGLTIVLMILVDDAGGSYARIDHSPWNGCTLADFVMPFFLF 104
Query: 83 IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR-- 140
IVGVA+ALALK+IP + AVKK+ RTLKLLFWGILLQGG+SHAPD+L+YGVD++ IR
Sbjct: 105 IVGVAVALALKKIPRISLAVKKISLRTLKLLFWGILLQGGYSHAPDDLSYGVDMKHIRWF 164
Query: 141 --------------------LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
L G LQRIA+ Y +V+L+E T + G FSI
Sbjct: 165 GILQVFPLPLFTGKSIPSSSLSGFLQRIAVVYFVVALIETLTTKRRPTVIDSGHFSILSA 224
Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
Y W W+ ++Y+ Y YVPDW F I D K + V CG+R L P CNAV
Sbjct: 225 YKWQWIGGFVAFLIYMITTYALYVPDWSFVI---DQDHEAKRYTVKCGMRGHLGPACNAV 281
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
GY+DR+V GINH+Y P W R KACT SP GP R+DAPSWC+APFEPEGLLS++S+IL
Sbjct: 282 GYVDRQVWGINHLYSQPVWTRLKACTLSSPNSGPFREDAPSWCYAPFEPEGLLSTISAIL 341
Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNG 344
S IG+H+GHV+IH KGH RLKQWV+MG LLI + LHFT+
Sbjct: 342 SGTIGIHYGHVLIHFKGHAERLKQWVSMGIVLLIVAIILHFTDA 385
>gi|302754694|ref|XP_002960771.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
gi|300171710|gb|EFJ38310.1| hypothetical protein SELMODRAFT_75452 [Selaginella moellendorffii]
Length = 493
Score = 382 bits (981), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/320 (58%), Positives = 237/320 (74%), Gaps = 6/320 (1%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
K R+A+LD+FRGL VALM+LVD AGG+WP I+H+PWNGC LAD VMPFFLFIVGVAIAL
Sbjct: 48 KPVRIATLDVFRGLTVALMVLVDDAGGEWPRINHSPWNGCTLADLVMPFFLFIVGVAIAL 107
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
ALKRIPD+ A +KV+ RTLKLLFWG+LLQGGFSHAPD+L+YGVD+R IR CG+LQRIA
Sbjct: 108 ALKRIPDQVAATQKVVIRTLKLLFWGLLLQGGFSHAPDDLSYGVDMRKIRWCGILQRIAF 167
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
YL+V+LVEI T + + G+F IF+LY WHW A V+++Y ++ YG YVPDW F
Sbjct: 168 GYLIVALVEIATTKSRSLELPKGQFGIFKLYKWHWACALAVVIIYHSVAYGLYVPDWHFI 227
Query: 211 ------IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+++ + NV CGVR + P CNAVG+IDR +LGINH+Y P W R+++
Sbjct: 228 DSGHRFVVSLAKFVFSSQINVQCGVRGDIGPACNAVGHIDRTILGINHLYQSPEWTRTQS 287
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
C DSP EG +AP+WC APFEPEG+LSS+S+ILS IIG+H+GHV+IH KGH+ R+
Sbjct: 288 CDLDSPAEGDPPANAPAWCKAPFEPEGILSSISAILSCIIGIHYGHVLIHFKGHMKRVLH 347
Query: 325 WVTMGFALLIFGLTLHFTNG 344
W ALL+ LHFT+
Sbjct: 348 WTIPAAALLVLATILHFTHA 367
>gi|302804288|ref|XP_002983896.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
gi|300148248|gb|EFJ14908.1| hypothetical protein SELMODRAFT_119497 [Selaginella moellendorffii]
Length = 493
Score = 381 bits (978), Expect = e-103, Method: Compositional matrix adjust.
Identities = 188/320 (58%), Positives = 236/320 (73%), Gaps = 6/320 (1%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
K R+A+LD+FRGL VALM+LVD AGG+WP I+H+PWNGC LAD VMPFFLFIVGVAIAL
Sbjct: 48 KPVRIATLDVFRGLTVALMVLVDDAGGEWPRINHSPWNGCTLADLVMPFFLFIVGVAIAL 107
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
ALKRIPD+ A +KV+ RTLKLLFWG+LLQGGFSHAPD+L+YGVD+R IR CG+LQRIA
Sbjct: 108 ALKRIPDQVAATQKVVIRTLKLLFWGLLLQGGFSHAPDDLSYGVDMRKIRWCGILQRIAF 167
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
YL+V+LVEI T + + G F IF+LY WHW A V+++Y ++ YG YVPDW F
Sbjct: 168 GYLIVALVEIATTKSRSLELPKGHFGIFKLYKWHWACALAVVIIYHSVAYGLYVPDWHFI 227
Query: 211 ------IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+++ + NV CGVR + P CNAVG+IDR +LGINH+Y P W R+++
Sbjct: 228 DSGHRFVVSLAKFVFSSQINVQCGVRGDIGPACNAVGHIDRTILGINHLYQSPEWTRTQS 287
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
C DSP EG +AP+WC APFEPEG+LSS+S+ILS IIG+H+GHV+IH KGH+ R+
Sbjct: 288 CDLDSPAEGDPPANAPAWCKAPFEPEGILSSISAILSCIIGIHYGHVLIHFKGHMKRVLH 347
Query: 325 WVTMGFALLIFGLTLHFTNG 344
W ALL+ LHFT+
Sbjct: 348 WTIPAAALLVLATILHFTHA 367
>gi|357152403|ref|XP_003576108.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Brachypodium distachyon]
Length = 498
Score = 376 bits (965), Expect = e-101, Method: Compositional matrix adjust.
Identities = 188/319 (58%), Positives = 233/319 (73%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
+E K+ R+A+LD FRGL + +MILVD AG + + H+PWNGC LADFVMPFFLFIV
Sbjct: 53 EEPQKKKSTRVAALDAFRGLTIVVMILVDDAGSSYERMDHSPWNGCTLADFVMPFFLFIV 112
Query: 85 GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
GVAIA A+KR+P+ AVKKV RTLK++FWG+LLQGG+SHAPD+L YGVD++MIR CG+
Sbjct: 113 GVAIAFAMKRVPNMGAAVKKVSVRTLKMIFWGLLLQGGYSHAPDDLAYGVDMKMIRWCGI 172
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIAL Y V+L+E+FT V+ G ++IF Y W WL A VLV+Y+ + YV
Sbjct: 173 LQRIALVYFAVALIEVFTTKVRPTTVRSGPYAIFDAYRWQWLGAFIVLVIYMITTFSLYV 232
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
PDW F N + GK F V CGVR L+P CNAVG+IDR+V GINH+Y P W R+K
Sbjct: 233 PDWSFVYHNDGDINDGKRFTVQCGVRGHLDPACNAVGFIDRQVWGINHLYSQPVWIRTKD 292
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
CT SP G LR DAP+WC PFEPEGLLSS+SSI+S IG+H+GHV+IH K H RL
Sbjct: 293 CTFSSPETGKLRDDAPAWCLGPFEPEGLLSSISSIISGTIGIHYGHVLIHFKTHKERLTH 352
Query: 325 WVTMGFALLIFGLTLHFTN 343
W++MGFALL+ G+ LHFTN
Sbjct: 353 WLSMGFALLLLGILLHFTN 371
>gi|242067981|ref|XP_002449267.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
gi|241935110|gb|EES08255.1| hypothetical protein SORBIDRAFT_05g006970 [Sorghum bicolor]
Length = 512
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 186/348 (53%), Positives = 234/348 (67%), Gaps = 20/348 (5%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
+ + V+ +E K++R+A+LD FRGL + LMILVD AGG + I H+PWNGC LADF
Sbjct: 38 VEKERVAVAEEVPKKKSRRVAALDAFRGLTIVLMILVDDAGGAYERIDHSPWNGCTLADF 97
Query: 76 VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
VMPFFLFIVGVAIA ALKR+P+ +AVK++ RTLK+LFWG+LLQGG+SHAPD+L+YGVD
Sbjct: 98 VMPFFLFIVGVAIAFALKRVPNMGNAVKRITIRTLKMLFWGVLLQGGYSHAPDDLSYGVD 157
Query: 136 VRMIRLCGVLQ--------------------RIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
++ IR G+LQ RIAL Y +V+L+E FT V+ G +
Sbjct: 158 MKKIRWMGILQLYIYHGNNLDSFLFFTLGHQRIALVYFIVALIEAFTVKVRPTTVRSGPY 217
Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
+IF + W WL V+Y+ + YVPDW + N + GK F V CGVRA L
Sbjct: 218 AIFNAHRWQWLGGFIAFVIYMVTTFSLYVPDWSYVYHNDGDVNDGKQFTVKCGVRASLEQ 277
Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
CNAVGY+DR+V GINH+Y P W RSK CT SP GPLR DAP WC APFEPEGLLSS
Sbjct: 278 ACNAVGYVDRQVWGINHLYTQPVWIRSKDCTSSSPNMGPLRADAPEWCLAPFEPEGLLSS 337
Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
+SS+LS IG+H+GHV+IH K H RLK W+ GF+LL+ + LHFTN
Sbjct: 338 ISSVLSGTIGIHYGHVLIHFKTHKERLKHWLVTGFSLLVLAIILHFTN 385
>gi|449440411|ref|XP_004137978.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
gi|449517341|ref|XP_004165704.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 488
Score = 364 bits (935), Expect = 4e-98, Method: Compositional matrix adjust.
Identities = 187/343 (54%), Positives = 233/343 (67%), Gaps = 16/343 (4%)
Query: 5 KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
K E P I+ E + KT+R+A+LD FRGL + LMILVD AGG + I H
Sbjct: 33 KEEEKEVAPTIVEEAQLRQ-------KTKRVATLDAFRGLTIVLMILVDDAGGAYSRIDH 85
Query: 65 APWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFS 124
+PWNGC LADFVMPFFLFIVGVAIALA KRI V K+ RT+KL+FWG++LQGG+S
Sbjct: 86 SPWNGCTLADFVMPFFLFIVGVAIALAFKRIGSIKQGVMKISLRTIKLVFWGLILQGGYS 145
Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV---GRFSIFRLY 181
HAPD+L YGVD++ IR CG+LQRIAL Y +V+++E FT K + V G FSIF Y
Sbjct: 146 HAPDDLEYGVDMKHIRWCGILQRIALVYFVVAMIEAFT--TIGKPRVVLDHGHFSIFTAY 203
Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
W+ ++Y+ Y YVP+W F+++ D + + V CGVR L P CNAVG
Sbjct: 204 --RWIGGFAAFIIYIITTYALYVPNWSFSVLEDDQLLHH--YTVVCGVRGHLGPACNAVG 259
Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
++DR+V GINH+Y +P W R K CT +P EGPLR DA SWC APFEPEGLLSSVS+ILS
Sbjct: 260 HVDRQVWGINHLYSYPVWIRHKDCTFSAPDEGPLRDDAASWCLAPFEPEGLLSSVSAILS 319
Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNG 344
IG+H+GHV++H K H RLKQWV+MGF I G+ LHFTN
Sbjct: 320 GTIGIHYGHVLLHFKTHSQRLKQWVSMGFGFFIIGIILHFTNA 362
>gi|242075654|ref|XP_002447763.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
gi|241938946|gb|EES12091.1| hypothetical protein SORBIDRAFT_06g015200 [Sorghum bicolor]
Length = 446
Score = 339 bits (870), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 161/311 (51%), Positives = 216/311 (69%), Gaps = 5/311 (1%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ QRL SLD+FRG+ V LMI+VD AG P ++H+PW+G +ADFVMPFFLFIVGVA+AL
Sbjct: 53 RPQRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALAL 112
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
A KR+PD+ DA KK + R LKL G++LQGGF H LT+GVD++ IRL G+LQRIA+
Sbjct: 113 AYKRVPDKLDATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAI 172
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
+YLL +L EI+ K +D D + + + Y + L+ A V + Y+ LLYGTYVPDW++
Sbjct: 173 AYLLTALCEIWLKGDEDVDYG---YDLLKRYRYQLLVGAVVAITYMCLLYGTYVPDWEYQ 229
Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
S + K F V CGVR +P CNAVG IDRK+LGI H+Y P + RSK C+ DSP
Sbjct: 230 TSGPGSIE--KSFFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSP 287
Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
GPL DAPSWC APF+PEGLLSSV +I++ +IG+ +GH+I+H + H R+ W+ F
Sbjct: 288 QNGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSF 347
Query: 331 ALLIFGLTLHF 341
++L+ + F
Sbjct: 348 SMLVLAFAMDF 358
>gi|168035930|ref|XP_001770461.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162678169|gb|EDQ64630.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 487
Score = 337 bits (865), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 165/308 (53%), Positives = 221/308 (71%), Gaps = 11/308 (3%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
K+ RLASLD+FRGL++A+MILVD+AGG WP I+H+PW G LADFVMPFFLFIVGVA+AL
Sbjct: 42 KSPRLASLDVFRGLSIAVMILVDNAGGVWPSINHSPWTGITLADFVMPFFLFIVGVALAL 101
Query: 91 ALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
KRI D+ A +K + RT KLL G+++QGG+ H + +YGVD+ IR CGVLQRIA
Sbjct: 102 TYKRITRDKKVASQKALGRTAKLLIVGLVIQGGYFHGLHDTSYGVDLERIRWCGVLQRIA 161
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
L+Y++V+L EI+ + +D S F+IF+ Y +HW +AA ++ YLALLYG YVPDW F
Sbjct: 162 LAYMVVALCEIWAPR-RRQDVSNDNFAIFKTYHFHWAVAAAIVATYLALLYGVYVPDWDF 220
Query: 210 ---TIINKDSADY------GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
T++N + G + V CGVR + P CNAVGY+DR +LG++H+Y P +R
Sbjct: 221 IPPTVLNSTALHVSVVRVNGSMSEVHCGVRGNIGPACNAVGYLDRTILGVSHLYQRPVFR 280
Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
R+ AC+ +SP GPL AP WC APF+PEGLLSS+S++ S +G+HFGHV++H K H+A
Sbjct: 281 RTPACSVNSPDYGPLPSGAPDWCKAPFDPEGLLSSLSAVGSCFLGLHFGHVLVHRKEHIA 340
Query: 321 RLKQWVTM 328
RL W+ M
Sbjct: 341 RLWDWMIM 348
>gi|219885579|gb|ACL53164.1| unknown [Zea mays]
gi|413937084|gb|AFW71635.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
Length = 482
Score = 336 bits (861), Expect = 1e-89, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 210/313 (67%), Gaps = 3/313 (0%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ QRLASLD+FRG+ V LMI+VD AGG P ++H+PW+G +ADF+MPFFLFIVGV++ L
Sbjct: 87 RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 146
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
A KR+PDR +A +K + R LKL G++LQGGF H LT+GVD+ IRL G+LQRIA+
Sbjct: 147 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 206
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
+YLL ++ EI+ K D D G + R Y + + + + Y LLYG YVPDW++
Sbjct: 207 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 263
Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
I S+ K F+V CGVR P CNAVG +DR VLGI+H+Y P + R+K C+ D P
Sbjct: 264 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYP 323
Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
GPL DAPSWC APF+PEGLLSSV +I++ +IG+ FGHVIIH + H R+ W+ F
Sbjct: 324 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSF 383
Query: 331 ALLIFGLTLHFTN 343
++L + F
Sbjct: 384 SMLALAFVMDFVG 396
>gi|413937082|gb|AFW71633.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
Length = 441
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 210/313 (67%), Gaps = 3/313 (0%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ QRLASLD+FRG+ V LMI+VD AGG P ++H+PW+G +ADF+MPFFLFIVGV++ L
Sbjct: 46 RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 105
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
A KR+PDR +A +K + R LKL G++LQGGF H LT+GVD+ IRL G+LQRIA+
Sbjct: 106 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
+YLL ++ EI+ K D D G + R Y + + + + Y LLYG YVPDW++
Sbjct: 166 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 222
Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
I S+ K F+V CGVR P CNAVG +DR VLGI+H+Y P + R+K C+ D P
Sbjct: 223 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYP 282
Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
GPL DAPSWC APF+PEGLLSSV +I++ +IG+ FGHVIIH + H R+ W+ F
Sbjct: 283 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRIASWLVPSF 342
Query: 331 ALLIFGLTLHFTN 343
++L + F
Sbjct: 343 SMLALAFVMDFVG 355
>gi|226509496|ref|NP_001144452.1| uncharacterized protein LOC100277415 [Zea mays]
gi|195642330|gb|ACG40633.1| hypothetical protein [Zea mays]
Length = 441
Score = 335 bits (860), Expect = 2e-89, Method: Compositional matrix adjust.
Identities = 157/313 (50%), Positives = 210/313 (67%), Gaps = 3/313 (0%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ QRLASLD+FRG+ V LMI+VD AGG P ++H+PW+G +ADF+MPFFLFIVGV++ L
Sbjct: 46 RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 105
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
A KR+PDR +A +K + R LKL G++LQGGF H LT+GVD+ IRL G+LQRIA+
Sbjct: 106 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
+YLL ++ EI+ K D D G + R Y + + + + Y LLYG YVPDW++
Sbjct: 166 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 222
Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
I S+ K F+V CGVR P CNAVG +DR VLGI+H+Y P + R+K C+ D P
Sbjct: 223 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTKECSIDYP 282
Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
GPL DAPSWC APF+PEGLLSSV +I++ +IG+ FGHVIIH + H R+ W+ F
Sbjct: 283 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHVIIHFEKHRGRITSWLVPSF 342
Query: 331 ALLIFGLTLHFTN 343
++L + F
Sbjct: 343 SMLALAFVMDFVG 355
>gi|224033113|gb|ACN35632.1| unknown [Zea mays]
gi|413918233|gb|AFW58165.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
Length = 444
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 158/309 (51%), Positives = 214/309 (69%), Gaps = 5/309 (1%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD+FRG+ V LMI+VD AG P ++H+PW+G +ADFVMPFFLFIVGVA+ALA
Sbjct: 53 QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 112
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
KR+PD+ DA +K + R LKL G++LQGGF H L++GVD++ IRL GVLQRIA++Y
Sbjct: 113 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 172
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
LL +L EI+ + +D D + + + Y + + A V + Y++LLYGTYVPDW++
Sbjct: 173 LLTALCEIWIRGDEDVDYG---YDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTS 229
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
S + K V CGVR +P CNAVG IDRK+LGI H+Y P + RSK C+ DSP
Sbjct: 230 APGSTE--KHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQN 287
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
GPL DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H R+ W+ F++
Sbjct: 288 GPLPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSM 347
Query: 333 LIFGLTLHF 341
L+ + F
Sbjct: 348 LVLAFAMDF 356
>gi|219886509|gb|ACL53629.1| unknown [Zea mays]
gi|414587417|tpg|DAA37988.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
Length = 438
Score = 332 bits (852), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 214/309 (69%), Gaps = 5/309 (1%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD+FRG+ V LMI+VD AG P ++H+PW+G +ADFVMPFFLFIVGVA+ALA
Sbjct: 47 QRLVSLDVFRGITVLLMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 106
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
KR+PD+ DA KK + R LKL G++LQGGF H LT+GVD++ IRL G+LQRIA++Y
Sbjct: 107 KRVPDKLDATKKAVLRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAY 166
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
LL +L EI+ K +D D + + + Y + + A V + Y++LLYGTYV DW++
Sbjct: 167 LLTALCEIWLKGDEDVDYG---YDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQTS 223
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
S + K F V CGVR +P CNAVG IDR++LGI H+Y P + RSK C+ DSP
Sbjct: 224 GPGSIE--KSFFVKCGVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQN 281
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
GPL DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H R+ W+ F++
Sbjct: 282 GPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSM 341
Query: 333 LIFGLTLHF 341
L+ + F
Sbjct: 342 LVLAFAMDF 350
>gi|255543288|ref|XP_002512707.1| conserved hypothetical protein [Ricinus communis]
gi|223548668|gb|EEF50159.1| conserved hypothetical protein [Ricinus communis]
Length = 426
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 161/309 (52%), Positives = 212/309 (68%), Gaps = 3/309 (0%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
RL SLD+FRGL VALMILVD+AGG P I+H+PWNG LAD VMPFFLFIVGV++ L
Sbjct: 50 HRLLSLDVFRGLTVALMILVDYAGGILPAINHSPWNGLTLADLVMPFFLFIVGVSLGLTY 109
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K++P +A A +K I RTLKLL G LQGG+ H ++LTYGV+V +RL G+LQRIA++Y
Sbjct: 110 KKLPCKAVATRKAILRTLKLLTLGFFLQGGYLHGLNDLTYGVNVEKLRLMGILQRIAIAY 169
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
L+ +L EI+ K D S+ R Y + W MA ++ YL+L+YG YVPDW++ I
Sbjct: 170 LVGALCEIWLKGDDHVDSCS---SLLRKYRFQWAMALVLISTYLSLIYGLYVPDWEYQIP 226
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
+ S+ K+F V CGVR P CNAVG IDR LGI H+Y P + R+K C+ +SP
Sbjct: 227 AEASSSPAKIFLVKCGVRGNTGPACNAVGLIDRTTLGIQHLYGKPVYARTKLCSINSPDY 286
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
GPL DAPSWC APF+PEG+LSSV ++++ +IG+H+GH+I+H K H R+ W+ L
Sbjct: 287 GPLPADAPSWCQAPFDPEGILSSVMAVVTCLIGLHYGHIIVHFKDHRNRMLHWMIPSICL 346
Query: 333 LIFGLTLHF 341
+ GL L F
Sbjct: 347 IGLGLALDF 355
>gi|226494648|ref|NP_001146383.1| uncharacterized protein LOC100279961 [Zea mays]
gi|219886923|gb|ACL53836.1| unknown [Zea mays]
gi|413918231|gb|AFW58163.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
Length = 469
Score = 332 bits (851), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 158/309 (51%), Positives = 214/309 (69%), Gaps = 5/309 (1%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD+FRG+ V LMI+VD AG P ++H+PW+G +ADFVMPFFLFIVGVA+ALA
Sbjct: 78 QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 137
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
KR+PD+ DA +K + R LKL G++LQGGF H L++GVD++ IRL GVLQRIA++Y
Sbjct: 138 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 197
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
LL +L EI+ + +D D + + + Y + + A V + Y++LLYGTYVPDW++
Sbjct: 198 LLTALCEIWIRGDEDVDYG---YDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTS 254
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
S + K V CGVR +P CNAVG IDRK+LGI H+Y P + RSK C+ DSP
Sbjct: 255 APGSTE--KHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQN 312
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
GPL DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H R+ W+ F++
Sbjct: 313 GPLPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSM 372
Query: 333 LIFGLTLHF 341
L+ + F
Sbjct: 373 LVLAFAMDF 381
>gi|359473865|ref|XP_002275105.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Vitis vinifera]
gi|296085565|emb|CBI29297.3| unnamed protein product [Vitis vinifera]
Length = 444
Score = 331 bits (849), Expect = 3e-88, Method: Compositional matrix adjust.
Identities = 162/335 (48%), Positives = 222/335 (66%), Gaps = 3/335 (0%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD+FRGL VA+MILVD AGG P I+H+PWNG LADFVMPFFLFIVGV++ALA
Sbjct: 51 RRLVSLDVFRGLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALAY 110
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K + A K + R LKLL +G+ LQGG+ H + LTYGVD+ IRL G+LQRIA++Y
Sbjct: 111 KNLSSGYLATKMAVVRALKLLVFGLFLQGGYFHGLNNLTYGVDIEQIRLAGILQRIAVAY 170
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
L ++ EI+ K D + G S+ + Y + W + + V Y +LLYG YVPDW+++I
Sbjct: 171 FLAAVCEIWLKG--DSNVKSGS-SLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSIP 227
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
++ S+ K+F V CGVR+ P CNAVG IDR VLGI H+Y P + R K C+ +SP
Sbjct: 228 SETSSSALKIFKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPDY 287
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
GPL +AP+WC APF+PEGLLSSV +I++ ++G+H+GH+I+H K H R+ W+ L
Sbjct: 288 GPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSCL 347
Query: 333 LIFGLTLHFTNGEHGSGKFSTTCVCLFIYSKVILF 367
L+ G L F ++ + +C+ + ILF
Sbjct: 348 LVLGFALDFFGMHVNKALYTLSYMCVTAGAAGILF 382
>gi|116309454|emb|CAH66526.1| H0502B11.6 [Oryza sativa Indica Group]
gi|218194797|gb|EEC77224.1| hypothetical protein OsI_15768 [Oryza sativa Indica Group]
Length = 448
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 212/309 (68%), Gaps = 5/309 (1%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD+FRG+ V LMILVD AG P I+H+PW+G LADFVMPFFLFIVGVA+ALA
Sbjct: 57 QRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALAY 116
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
KR+P++ +A +K I R LKL G++LQGGF H LT+G+D+ IRL G+LQRIA++Y
Sbjct: 117 KRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIAY 176
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
++ +L EI+ K D D F + + + + V++ Y+ LYGTYVPDW++ I
Sbjct: 177 IVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRIS 233
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
S + K F V C VR P CNAVG IDRK+LGI H+Y P + RSK C+ +SP
Sbjct: 234 VPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQN 291
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
GPLR DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H R+ +W+ F++
Sbjct: 292 GPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSM 351
Query: 333 LIFGLTLHF 341
LI +L F
Sbjct: 352 LILAFSLDF 360
>gi|115458212|ref|NP_001052706.1| Os04g0404900 [Oryza sativa Japonica Group]
gi|113564277|dbj|BAF14620.1| Os04g0404900 [Oryza sativa Japonica Group]
gi|222628804|gb|EEE60936.1| hypothetical protein OsJ_14685 [Oryza sativa Japonica Group]
Length = 447
Score = 330 bits (846), Expect = 6e-88, Method: Compositional matrix adjust.
Identities = 159/309 (51%), Positives = 212/309 (68%), Gaps = 5/309 (1%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD+FRG+ V LMILVD AG P I+H+PW+G LADFVMPFFLFIVGVA+ALA
Sbjct: 56 QRLVSLDVFRGITVLLMILVDDAGAFLPAINHSPWDGVTLADFVMPFFLFIVGVALALAY 115
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
KR+P++ +A +K I R LKL G++LQGGF H LT+G+D+ IRL G+LQRIA++Y
Sbjct: 116 KRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVRSLTFGIDMEKIRLMGILQRIAIAY 175
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
++ +L EI+ K D D F + + + + V++ Y+ LYGTYVPDW++ I
Sbjct: 176 IVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIGLIVMITYMGFLYGTYVPDWEYRIS 232
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
S + K F V C VR P CNAVG IDRK+LGI H+Y P + RSK C+ +SP
Sbjct: 233 VPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKILGIQHLYCRPVYARSKQCSINSPQN 290
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
GPLR DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H R+ +W+ F++
Sbjct: 291 GPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSM 350
Query: 333 LIFGLTLHF 341
LI +L F
Sbjct: 351 LILAFSLDF 359
>gi|115485801|ref|NP_001068044.1| Os11g0543500 [Oryza sativa Japonica Group]
gi|77551354|gb|ABA94151.1| expressed protein [Oryza sativa Japonica Group]
gi|113645266|dbj|BAF28407.1| Os11g0543500 [Oryza sativa Japonica Group]
gi|125577433|gb|EAZ18655.1| hypothetical protein OsJ_34172 [Oryza sativa Japonica Group]
gi|215701389|dbj|BAG92813.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 448
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 161/336 (47%), Positives = 216/336 (64%), Gaps = 2/336 (0%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
D + + QRL SLD+FRG+ VALMILVD GG P ISH+PW+G LADFV PFFLF
Sbjct: 44 DGEAAATTTRQRLVSLDVFRGITVALMILVDDVGGIVPAISHSPWDGVTLADFVFPFFLF 103
Query: 83 IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
IVGV++A A K++PD+ A KK + R +KL G++LQGGF H ELTYGVD+R IRL
Sbjct: 104 IVGVSLAFAYKKVPDKMLATKKAMLRAVKLFIVGLILQGGFFHGIHELTYGVDIRKIRLM 163
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQRIA++YL+V+L EI+ + V + Y + ++V YL +LYG
Sbjct: 164 GVLQRIAIAYLVVALCEIWLRRVSSGGNIGSGSMLITRYHHQMFVGLVLVVTYLVILYGL 223
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
+VPDW++ + + DS K F V CGV+ P CNAVG IDR VLGI H+Y HP + ++
Sbjct: 224 HVPDWEYEVTSPDSTV--KHFLVKCGVKGDTGPGCNAVGMIDRSVLGIQHLYAHPVYLKT 281
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
+ C+ SP GPL +APSWC APF+PEGLLSS+ +I++ +IG+ GHVI+H K H R+
Sbjct: 282 EQCSMASPRNGPLPPNAPSWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKKHNERI 341
Query: 323 KQWVTMGFALLIFGLTLHFTNGEHGSGKFSTTCVCL 358
K+W + LL G +LH +S + C+
Sbjct: 342 KRWSILSLCLLTLGFSLHLFGLHMNKSLYSLSYTCV 377
>gi|326505544|dbj|BAJ95443.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 429
Score = 329 bits (844), Expect = 1e-87, Method: Compositional matrix adjust.
Identities = 156/309 (50%), Positives = 208/309 (67%), Gaps = 5/309 (1%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD+FRG+ V LMI+VD AG P ++H+PW G +ADFVMPFFLFIVGVA+ALA
Sbjct: 38 QRLVSLDVFRGITVLLMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAY 97
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
KR+PD+ DA +K R LKL G++LQGGF H LT+GVD+ IRL G+LQRIA++Y
Sbjct: 98 KRVPDKLDATRKATLRALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAY 157
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
L+ +L +I+ K D D + + + Y + L + + Y+ALLYGTYVPDW++ I
Sbjct: 158 LVTALCQIWLKGDDDVDSGL---DLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYRIS 214
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
+ K F V CGVR P CNAVG IDRK+LGI H+Y P + RS+ C+ DSP
Sbjct: 215 GPGFTE--KTFTVRCGVRGDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQN 272
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
GPL DAPSWC APF+PEGLLSSV +I++ +IG+ +GH+I+H + H R+ W+ F +
Sbjct: 273 GPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGM 332
Query: 333 LIFGLTLHF 341
L+ + F
Sbjct: 333 LVLAFAMDF 341
>gi|302796996|ref|XP_002980259.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
gi|300151875|gb|EFJ18519.1| hypothetical protein SELMODRAFT_112263 [Selaginella moellendorffii]
Length = 401
Score = 329 bits (843), Expect = 2e-87, Method: Compositional matrix adjust.
Identities = 153/311 (49%), Positives = 210/311 (67%), Gaps = 9/311 (2%)
Query: 48 LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
+MILVD+AGG+WP I+H+PWNG LAD VMPFFLFIVGVA+AL K+IP + D+ +K I
Sbjct: 1 MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60
Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
R+LKL F G+ LQGG+ H ++L+YGVD+ +IR CG+LQRIA YL+V+L E++ VQ
Sbjct: 61 RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYLVVALCEVWLPRVQG 120
Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
F + Y +HW+ L VYL+LLYG VPDWQF + N + VTC
Sbjct: 121 S-----YFGFMQNYLFHWIFVVVTLTVYLSLLYGLKVPDWQFELPNNRNI----TMTVTC 171
Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
G R+ L+PPCNAVGY+DR++LG+NH+ P + R+++C+ +SP GPL DAP WCHAPF
Sbjct: 172 GTRSNLDPPCNAVGYVDRQILGVNHLDQRPVFIRTESCSINSPDYGPLPADAPVWCHAPF 231
Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHG 347
+PEG+LSSVS+I++ IG+H+GH I+ K H R+ ++ F LL G LH +
Sbjct: 232 DPEGILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAFILLALGYVLHLLGIKMN 291
Query: 348 SGKFSTTCVCL 358
+S + +C
Sbjct: 292 KPLYSFSYMCF 302
>gi|242065256|ref|XP_002453917.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
gi|241933748|gb|EES06893.1| hypothetical protein SORBIDRAFT_04g021400 [Sorghum bicolor]
Length = 439
Score = 326 bits (836), Expect = 9e-87, Method: Compositional matrix adjust.
Identities = 153/313 (48%), Positives = 209/313 (66%), Gaps = 5/313 (1%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ RL SLD+FRG+ V LMI+VD AGG P ++H+PW+G +ADFVMPFFLFIVGV++ L
Sbjct: 46 RQPRLVSLDVFRGITVLLMIIVDDAGGFLPSLNHSPWDGVTIADFVMPFFLFIVGVSLTL 105
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
A KR+PD+ +A KK + R LKL G++LQGGF H LT+GVD+ IRL G+LQRIA+
Sbjct: 106 AYKRVPDKLEATKKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 165
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
+YLL ++ EI+ K D D G + R Y + + + + Y LLYG YVPDW++
Sbjct: 166 AYLLAAICEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYTILLYGIYVPDWEYK 222
Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
I S + K F+V CGVR P CNAVG +DR +LGI+H+Y P + R+K C+ + P
Sbjct: 223 ISGPGSTE--KSFSVKCGVRGDTGPACNAVGMVDRTILGIDHLYRRPVYARTKECSINYP 280
Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
GPL DAPSWC APF+PEGLLSSV +I++ +IG+ FGH+IIH + H R+ W+ F
Sbjct: 281 ENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLIPSF 340
Query: 331 ALLIFGLTLHFTN 343
++L + F+
Sbjct: 341 SMLALAFLMDFSG 353
>gi|218185886|gb|EEC68313.1| hypothetical protein OsI_36402 [Oryza sativa Indica Group]
Length = 450
Score = 325 bits (834), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 163/336 (48%), Positives = 218/336 (64%), Gaps = 2/336 (0%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
D + + QRL SLD+FRG+ VALMILVD GG P ISH+PW+G LADFV PFFLF
Sbjct: 46 DGEAAATTTRQRLVSLDVFRGITVALMILVDDVGGIVPAISHSPWDGVTLADFVFPFFLF 105
Query: 83 IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
IVGV++A A K++PD+ A KK + R +KL G++LQGGF H ELTYGVD+R IRL
Sbjct: 106 IVGVSLAFAYKKVPDKMLATKKAMLRAVKLFIVGLILQGGFFHGIHELTYGVDIRKIRLM 165
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQRIA++YL+V+L EI+ + V + Y + ++V YL +LYG
Sbjct: 166 GVLQRIAIAYLVVALCEIWLRRVSSGGDIGSGSMLITRYHHQMFVGLVLVVTYLVILYGL 225
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
+VPDW++ + + DS K F V CGV+ P CNAVG IDR VLGI H+Y HP + ++
Sbjct: 226 HVPDWEYEVTSLDSTV--KHFLVKCGVKGDTGPGCNAVGMIDRSVLGIQHLYAHPVYLKT 283
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
+ C+ DSP GPL +APSWC APF+PEGLLSS+ +I++ +IG+ GHVI+H K H R+
Sbjct: 284 EQCSMDSPRNGPLPPNAPSWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKKHNERI 343
Query: 323 KQWVTMGFALLIFGLTLHFTNGEHGSGKFSTTCVCL 358
K+W T+ LL G +LH +S + C+
Sbjct: 344 KRWSTLSLCLLTLGFSLHLFGLHMNKSLYSLSYTCV 379
>gi|302759310|ref|XP_002963078.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
gi|300169939|gb|EFJ36541.1| hypothetical protein SELMODRAFT_78688 [Selaginella moellendorffii]
Length = 401
Score = 321 bits (822), Expect = 4e-85, Method: Compositional matrix adjust.
Identities = 150/311 (48%), Positives = 208/311 (66%), Gaps = 9/311 (2%)
Query: 48 LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
+MILVD+AGG+WP I+H+PWNG LAD VMPFFLFIVGVA+AL K+IP + D+ +K I
Sbjct: 1 MMILVDNAGGEWPAINHSPWNGVTLADLVMPFFLFIVGVALALVYKKIPSKLDSTRKAIL 60
Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
R+LKL F G+ LQGG+ H ++L+YGVD+ +IR CG+LQRIA Y++V+L E++ VQ
Sbjct: 61 RSLKLFFLGVFLQGGYFHGENDLSYGVDLTLIRWCGILQRIAFVYVIVALCEVWLPRVQG 120
Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
F I + Y +HW+ L VYL+LLYG VP WQF + N + VTC
Sbjct: 121 S-----YFGIMQNYLFHWIFVVVTLTVYLSLLYGLKVPHWQFELPNNRNIT----MTVTC 171
Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
G R+ L+P CNAVGY+DR++LG+NH+ P + R+++C+ +SP GPL DAP WCHAPF
Sbjct: 172 GTRSNLDPACNAVGYVDRQILGVNHLDQQPVFIRTESCSINSPDYGPLPADAPVWCHAPF 231
Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHG 347
+PEG+LSSVS+I++ IG+H+GH I+ K H R+ ++ LL G LH +
Sbjct: 232 DPEGILSSVSAIVTCFIGLHYGHFIVQCKEHKQRIINFIVPAVILLALGYVLHLLGIKMN 291
Query: 348 SGKFSTTCVCL 358
+S + +C
Sbjct: 292 KPLYSFSYMCF 302
>gi|224057870|ref|XP_002299365.1| predicted protein [Populus trichocarpa]
gi|222846623|gb|EEE84170.1| predicted protein [Populus trichocarpa]
Length = 386
Score = 313 bits (803), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 153/293 (52%), Positives = 199/293 (67%), Gaps = 3/293 (1%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL SLD+FRGL VALMILVD AGG P I+H+PWNG LAD VMPFFLFIVGV++ L K
Sbjct: 1 RLVSLDVFRGLTVALMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFIVGVSLGLTYK 60
Query: 94 RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
++ +A A +K I RTLKLL G+ LQGGF H ++LTYGVD+ IR G+LQRIA+ YL
Sbjct: 61 KLSCKAVATRKAILRTLKLLIIGLFLQGGFLHGLNDLTYGVDMTQIRWMGILQRIAIGYL 120
Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
+ ++ EI+ K + S+ R Y + W + +YL+LLYG +VPDW++ I
Sbjct: 121 VGAMCEIWLKG---GNHVTSGLSMLRKYQFQWAAVLMFVTIYLSLLYGLHVPDWEYQIPV 177
Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
SA K+F V CGVR P CNA G IDR +LGI H+Y P + R+K C+ +SP G
Sbjct: 178 AASASTPKIFPVKCGVRGHTGPACNAGGMIDRTILGIQHLYRKPIYARTKPCSINSPGYG 237
Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
PL DAPSWC APF+PEGLLSSV +I++ ++G+H+GH+I+H K H R W+
Sbjct: 238 PLPPDAPSWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRTLHWM 290
>gi|212723180|ref|NP_001132467.1| uncharacterized protein LOC100193923 [Zea mays]
gi|194694464|gb|ACF81316.1| unknown [Zea mays]
gi|414587418|tpg|DAA37989.1| TPA: hypothetical protein ZEAMMB73_167983 [Zea mays]
Length = 391
Score = 313 bits (801), Expect = 1e-82, Method: Compositional matrix adjust.
Identities = 150/295 (50%), Positives = 203/295 (68%), Gaps = 5/295 (1%)
Query: 47 ALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVI 106
ALMI+VD AG P ++H+PW+G +ADFVMPFFLFIVGVA+ALA KR+PD+ DA KK +
Sbjct: 14 ALMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKAV 73
Query: 107 FRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
R LKL G++LQGGF H LT+GVD++ IRL G+LQRIA++YLL +L EI+ K +
Sbjct: 74 LRALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCEIWLKGDE 133
Query: 167 DKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVT 226
D D + + + Y + + A V + Y++LLYGTYV DW++ S + K F V
Sbjct: 134 DVDYG---YDLLKRYRYQLFVGAIVGITYMSLLYGTYVRDWEYQTSGPGSIE--KSFFVK 188
Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
CGVR +P CNAVG IDR++LGI H+Y P + RSK C+ DSP GPL DAPSWC AP
Sbjct: 189 CGVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAP 248
Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
F+PEGLLSSV +I++ +IG+ +GHVI+H + H R+ W+ F++L+ + F
Sbjct: 249 FDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERIMNWLIPSFSMLVLAFAMDF 303
>gi|195642128|gb|ACG40532.1| hypothetical protein [Zea mays]
Length = 379
Score = 312 bits (799), Expect = 2e-82, Method: Compositional matrix adjust.
Identities = 149/294 (50%), Positives = 203/294 (69%), Gaps = 5/294 (1%)
Query: 48 LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
LMI+VD AG P ++H+PW+G +ADFVMPFFLFIVGVA+ALA KR+PD+ DA KK +
Sbjct: 3 LMIIVDDAGSFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAYKRVPDKLDATKKAVL 62
Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
R LKL G++LQGGF H LT+GVD++ IRL G+LQRIA++YLL +L EI+ K +D
Sbjct: 63 RALKLFCLGLVLQGGFFHGVRSLTFGVDLQEIRLMGILQRIAIAYLLTALCEIWLKGDED 122
Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
D + + + Y + L+ A V + Y++LLYGTYVPD ++ S + K F V C
Sbjct: 123 VDYG---YDLLKRYRYQLLVGAVVAITYMSLLYGTYVPDCEYQTSGPGSIE--KSFFVKC 177
Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
GVR +P CNAVG IDR++LGI H+Y P + RSK C+ DSP GPL DAPSWC APF
Sbjct: 178 GVRGDTSPGCNAVGMIDRRILGIQHLYGRPVYARSKQCSIDSPQNGPLPPDAPSWCQAPF 237
Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+PEGLLSSV +I++ +IG+ +GH+I+H + H R+ W+ F++L+ + F
Sbjct: 238 DPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHRERIMNWLIPSFSMLVLAFAMDF 291
>gi|38346153|emb|CAE02025.2| OSJNBb0118P14.13 [Oryza sativa Japonica Group]
Length = 415
Score = 310 bits (793), Expect = 9e-82, Method: Compositional matrix adjust.
Identities = 155/333 (46%), Positives = 216/333 (64%), Gaps = 12/333 (3%)
Query: 15 IISEPDVSDQQEKSHLKT------QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
+++ PD +Q++ + Q L + +F + LMILVD AG P I+H+PW+
Sbjct: 1 MVAWPDNRNQRQGIQVSVVDVAEGQWLTCVHLFMP-EMPLMILVDDAGAFLPAINHSPWD 59
Query: 69 GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
G LADFVMPFFLFIVGVA+ALA KR+P++ +A +K I R LKL G++LQGGF H
Sbjct: 60 GVTLADFVMPFFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVR 119
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
LT+G+D+ IRL G+LQRIA++Y++ +L EI+ K D D F + + + +
Sbjct: 120 SLTFGIDMEKIRLMGILQRIAIAYIVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIG 176
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
V++ Y+ LYGTYVPDW++ I S + K F V C VR P CNAVG IDRK+L
Sbjct: 177 LIVMITYMGFLYGTYVPDWEYRISVPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKIL 234
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
GI H+Y P + RSK C+ +SP GPLR DAPSWC APF+PEGLLSSV +I++ +IG+ +
Sbjct: 235 GIQHLYCRPVYARSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQY 294
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
GHVI+H + H R+ +W+ F++LI +L F
Sbjct: 295 GHVIVHFQKHKERIMKWLIPSFSMLILAFSLDF 327
>gi|326510109|dbj|BAJ87271.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 304
Score = 309 bits (791), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 146/294 (49%), Positives = 196/294 (66%), Gaps = 5/294 (1%)
Query: 48 LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
LMI+VD AG P ++H+PW G +ADFVMPFFLFIVGVA+ALA KR+PD+ DA +K
Sbjct: 8 LMIIVDDAGSFLPAMNHSPWEGVTIADFVMPFFLFIVGVALALAYKRVPDKLDATRKATL 67
Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
R LKL G++LQGGF H LT+GVD+ IRL G+LQRIA++YL+ +L +I+ K D
Sbjct: 68 RALKLFCVGLVLQGGFFHGVRSLTFGVDIAQIRLMGILQRIAIAYLVTALCQIWLKGDDD 127
Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
D + + + Y + L + + Y+ALLYGTYVPDW++ I + K F V C
Sbjct: 128 VDSGL---DLIKRYKYQLLAGLLITITYMALLYGTYVPDWEYRISGPGFTE--KTFTVRC 182
Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
GVR P CNAVG IDRK+LGI H+Y P + RS+ C+ DSP GPL DAPSWC APF
Sbjct: 183 GVRGDSGPGCNAVGMIDRKILGIQHLYGRPVYARSQQCSIDSPQNGPLPPDAPSWCQAPF 242
Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+PEGLLSSV +I++ +IG+ +GH+I+H + H R+ W+ F +L+ + F
Sbjct: 243 DPEGLLSSVMAIVTCLIGLQYGHIIVHFQKHKERIMHWLVPSFGMLVLAFAMDF 296
>gi|147844298|emb|CAN82113.1| hypothetical protein VITISV_031338 [Vitis vinifera]
Length = 401
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 155/336 (46%), Positives = 212/336 (63%), Gaps = 19/336 (5%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD+FRGL VA+MILVD AGG P I+H+PWNG LADFVMPFFLFIVGV++ALA
Sbjct: 51 RRLVSLDVFRGLTVAIMILVDDAGGILPAINHSPWNGLTLADFVMPFFLFIVGVSLALAY 110
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K + A K + GG+ H + LTYGVD+ IRL G+LQRIA++Y
Sbjct: 111 KNLSSGYLATK--------------MASGGYFHGLNNLTYGVDIEQIRLAGILQRIAVAY 156
Query: 153 LLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI 211
L ++ EI+ K D K S S+ + Y + W + + V Y +LLYG YVPDW+++I
Sbjct: 157 FLAAVCEIWLKGDXNVKSGS----SLLKKYQFQWAVVLVLTVAYCSLLYGLYVPDWEYSI 212
Query: 212 INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPF 271
++ S+ K+F V CGVR+ P CNAVG IDR VLGI H+Y P + R K C+ +SP
Sbjct: 213 PSETSSSALKIFKVKCGVRSDTGPACNAVGMIDRNVLGIQHLYKRPIYARMKQCSINSPD 272
Query: 272 EGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFA 331
GPL +AP+WC APF+PEGLLSSV +I++ ++G+H+GH+I+H K H R+ W+
Sbjct: 273 YGPLPPNAPTWCQAPFDPEGLLSSVMAIVTCLVGLHYGHIIVHFKDHKDRILHWIVPSSC 332
Query: 332 LLIFGLTLHFTNGEHGSGKFSTTCVCLFIYSKVILF 367
LL+ G L F ++ + +C+ + ILF
Sbjct: 333 LLVLGFALDFFGMHVNKALYTLSYMCVTAGAAGILF 368
>gi|255548527|ref|XP_002515320.1| conserved hypothetical protein [Ricinus communis]
gi|223545800|gb|EEF47304.1| conserved hypothetical protein [Ricinus communis]
Length = 460
Score = 306 bits (785), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 153/345 (44%), Positives = 217/345 (62%), Gaps = 8/345 (2%)
Query: 19 PDVSDQQEKSHLK----TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
P S E+ L QRL SLD+FRGL +ALMILVD AGG +P I+H+PW G LAD
Sbjct: 32 PSSSSSDEREALPPPTPNQRLMSLDVFRGLTIALMILVDDAGGAFPSINHSPWFGVTLAD 91
Query: 75 FVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
FVMPFFLF VGV+I+L K+I ++ A KKV+ RT+KL G+LLQGG+ H + LTYG+
Sbjct: 92 FVMPFFLFGVGVSISLVFKKISSKSVATKKVMLRTIKLFLLGVLLQGGYFHGRNHLTYGI 151
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
DV IR GVLQRI++ YL S+ EI+ + D + + + Y W+++ + +
Sbjct: 152 DVLKIRWLGVLQRISIGYLFASISEIWLVNHCIVDSPL---AFMKKYYAQWMVSLILCSL 208
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGK-VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
Y LLY +VP+W+F + + YG V CGVR L PPCNAVG IDR +LG +H+
Sbjct: 209 YTCLLYFLFVPNWEFEASSINLFGYGSGTQTVICGVRGSLEPPCNAVGLIDRFLLGEHHL 268
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
Y P +RR+K C+ +SP GPL ++P WC APF+PEG+LSS+ + ++ ++G+ FGHV++
Sbjct: 269 YQRPVYRRTKQCSVNSPDYGPLPPNSPPWCLAPFDPEGILSSLMAAVTCLLGLQFGHVLV 328
Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGKFSTTCVCL 358
H K H+ R+ W+ F+LL+ G L ++ + C+
Sbjct: 329 HLKDHMQRILVWLISSFSLLVTGFVLKLIGIPFSKPLYTLSYTCI 373
>gi|357149263|ref|XP_003575052.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Brachypodium distachyon]
Length = 432
Score = 306 bits (783), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 157/333 (47%), Positives = 216/333 (64%), Gaps = 13/333 (3%)
Query: 17 SEPDV-SDQQEKSHLKT-------QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
+ PD+ S + S L T QRL SLD+FRG+ V LMI+VD AGG P ++H+PW+
Sbjct: 17 TTPDLESGASKASPLPTPVSPAARQRLVSLDVFRGITVLLMIIVDDAGGFLPALNHSPWD 76
Query: 69 GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
G + DFVMPFFLFIVGV++ LA KR+P+R +A KK + R LKL G++LQGGF H
Sbjct: 77 GVTIGDFVMPFFLFIVGVSLTLAYKRVPERLEATKKAVLRALKLFCLGLVLQGGFFHGVR 136
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
LT+GVD+ IRL G+LQRIA++YL+ ++ EI+ K + D+ + + R Y + +
Sbjct: 137 SLTFGVDITEIRLMGILQRIAIAYLIAAICEIWLKGNDEVDRGL---DLLRRYRYQLFVG 193
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
+ V+Y LLYG YVPDW++ I S + K V CGVR P CNAVG +DR +L
Sbjct: 194 LLLSVMYTVLLYGIYVPDWEYQITGPGSTE--KSLLVKCGVRGDTGPGCNAVGMVDRTML 251
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
GI+H+Y P + R+K C+ D P GPL DAPSWC APF+PEGLLSSV +I++ ++G+ F
Sbjct: 252 GIDHLYRRPVYARTKECSIDYPENGPLPPDAPSWCQAPFDPEGLLSSVMAIVTCLMGLQF 311
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
GHVIIH + H R+ W+ F++L + F
Sbjct: 312 GHVIIHFEKHKERIINWLIPSFSMLALAFLMDF 344
>gi|326512130|dbj|BAJ96046.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 488
Score = 306 bits (783), Expect = 2e-80, Method: Compositional matrix adjust.
Identities = 161/379 (42%), Positives = 226/379 (59%), Gaps = 30/379 (7%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
+ +P +D + K QR+ASLD+FRGL VA+MILVD AGG WP I+HAPW G +ADF
Sbjct: 36 LPQPPGADAKPGQQ-KPQRVASLDVFRGLTVAMMILVDDAGGAWPGINHAPWLGVTVADF 94
Query: 76 VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
VMP FLFI+GV+ AL K+ P++ KK R +KL G++LQGG+ H +LTYGVD
Sbjct: 95 VMPAFLFIIGVSAALVFKKTPNKIATSKKAACRAIKLFILGVILQGGYIHGRHKLTYGVD 154
Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
+ IR GVLQRIA+ Y L ++ EI+ + D V S + Y W+MA + +Y
Sbjct: 155 LDQIRWLGVLQRIAIGYFLAAISEIWLVNNTSVDSPV---SFVKKYFMEWIMAIIISALY 211
Query: 196 LALLYGTYVPDWQFTI------INKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
+ L++G YVP+W+F + + S D G + CG+ L PPCNAVG++DR +LG
Sbjct: 212 IGLVFGLYVPNWEFKVQTSSSTFSNPSNDVG-FKTIQCGLTGSLGPPCNAVGFVDRVLLG 270
Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
+H+Y +P ++R+K C+ +SP GPL +AP WC APF+PEGLLS++ + +S +G+HFG
Sbjct: 271 ESHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAAVSCFVGLHFG 330
Query: 310 HVIIHTKGHLARLKQWV-------TMGFALLIFGLTLH----------FTNGEHGSGKFS 352
HV+IH K H R+ W+ GF L + G+ T G G
Sbjct: 331 HVLIHCKTHSQRMMSWLLASTVLTVSGFLLQLLGMPFSKPLYTVSYMLLTGGVSGFVLLL 390
Query: 353 TTCVCLFIYSK--VILFQW 369
C+ I+ K +ILFQW
Sbjct: 391 LYCIVDVIHIKKPLILFQW 409
>gi|388508176|gb|AFK42154.1| unknown [Lotus japonicus]
Length = 467
Score = 302 bits (774), Expect = 1e-79, Method: Compositional matrix adjust.
Identities = 167/336 (49%), Positives = 222/336 (66%), Gaps = 10/336 (2%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
K ++QRL S+D+FRGL VALMILVD AGG P ++H+PW+G +ADFVMP FLFIV
Sbjct: 67 NHKPQSQSQRLVSIDVFRGLTVALMILVDDAGGLLPALNHSPWDGLTIADFVMPLFLFIV 126
Query: 85 GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
G+++AL K++ A +K I R LKLL G+ LQGG+ H ++LT+GVD++ IRL G+
Sbjct: 127 GLSLALTYKKLSCPVIATRKAILRALKLLALGLFLQGGYFHRINDLTFGVDMKQIRLMGI 186
Query: 145 LQRIALSYLLVSLVEIFTK-DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
LQRIA++YLL +L EI+ K D K S S+ R Y + W +A + YL LLYG Y
Sbjct: 187 LQRIAIAYLLTALCEIWLKCDDIVKSGS----SLLRKYRYQWAVAFVLSGFYLCLLYGLY 242
Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
VPDW++ I DS+ K F+V CGV A P CN VG IDRK+LGI H+Y P + R
Sbjct: 243 VPDWEYQ-IPTDSSSVPKTFSVKCGVWADTGPACNVVGMIDRKILGIQHLYRRPIYARMP 301
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
C+ +SP GPL DAP+WC APF+PEGLLSSV +I++ +IG+H+GH+I+H K H R+
Sbjct: 302 ECSINSPDYGPLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYGHIIVHYKDHRVRII 361
Query: 324 QWVTMGFALLIFGLTLHFTNGEHGSG---KFSTTCV 356
W+ L++FG LH G H + FS TCV
Sbjct: 362 HWMIPTSCLIVFGFALHLF-GMHVNKVLYSFSYTCV 396
>gi|357134575|ref|XP_003568892.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Brachypodium distachyon]
Length = 495
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 151/343 (44%), Positives = 208/343 (60%), Gaps = 10/343 (2%)
Query: 6 AETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHA 65
A H +P S K K R+ASLD+FRGL VA+MILVD AGG WP I+HA
Sbjct: 22 ASEIHPYPESPSPRQPPGTDAKPERKPHRVASLDVFRGLTVAMMILVDDAGGAWPGINHA 81
Query: 66 PWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
PW G +ADFVMP FLFI+GV+ AL KR ++ KK +R KL G++LQGG+ H
Sbjct: 82 PWLGVTVADFVMPAFLFIIGVSAALVFKRTQNKIATSKKAAYRAFKLFILGVILQGGYIH 141
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
LTYGVD+ IR GVLQRIA+ Y L ++ EI+ + D V S + Y W
Sbjct: 142 GRHNLTYGVDLDHIRWLGVLQRIAIGYFLAAMSEIWLVNNISVDSPV---SFVKKYFMEW 198
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTI------INKDSADYGKVFNVTCGVRAKLNPPCNA 239
+MA + +Y++L++G YVP+W+F + + S + G V CG+R L PPCNA
Sbjct: 199 VMAIMISALYISLIFGLYVPNWEFKVQTSNLTFSNGSNEIG-FKTVQCGLRGSLGPPCNA 257
Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
VG++DR +LG NH+Y +P ++R+K C+ +SP G L +AP WC APF+PEGLLS++ +
Sbjct: 258 VGFVDRVLLGENHLYKNPVYKRTKECSVNSPDYGALPPNAPDWCLAPFDPEGLLSTLMAA 317
Query: 300 LSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFT 342
+S +G+HFGHV+IH + H R+ W+ L G L +
Sbjct: 318 VSCFVGLHFGHVLIHCQNHSQRMLSWLLASTVLTASGFLLQLS 360
>gi|224080634|ref|XP_002306188.1| predicted protein [Populus trichocarpa]
gi|222849152|gb|EEE86699.1| predicted protein [Populus trichocarpa]
Length = 422
Score = 301 bits (772), Expect = 2e-79, Method: Compositional matrix adjust.
Identities = 148/323 (45%), Positives = 209/323 (64%), Gaps = 19/323 (5%)
Query: 11 HHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGC 70
H PL+ D+ +Q S KT R+ASLD+FRGL V LM+LVD+ G P I+H+PWNG
Sbjct: 6 HKPLL----DIEEQPRTSK-KTPRVASLDVFRGLCVFLMMLVDYGGAIVPIIAHSPWNGL 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
+LADFVMPFFLFI GV++AL KR+P+R +A +K + R ++L G++LQGG+ H + L
Sbjct: 61 HLADFVMPFFLFIAGVSLALVYKRVPNRIEATRKAVLRAVELFLLGVILQGGYFHGINFL 120
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
TYGVD++ IR G+LQRI++ Y+ +L EI+ +D S + Y WHW A
Sbjct: 121 TYGVDMKRIRWLGILQRISIGYIFAALCEIWLSCRSRRD-----VSFLKSYYWHWGAAFS 175
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA----DYGKVF---NVTCGVRAKLNPPCNAVGYI 243
+ +YL LLYG YVPDWQF + N S+ ++ V+ V C VR L P CN+ G I
Sbjct: 176 LSAIYLGLLYGLYVPDWQFEMSNATSSVFPTNHSYVYMLTQVKCSVRGDLGPACNSAGMI 235
Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
DR VLGI+H+Y P +R K C + G + + APSWCHAPF+PEG+LSS+++ ++ I
Sbjct: 236 DRYVLGIDHLYKKPVYRNLKECNMST--NGQVPESAPSWCHAPFDPEGVLSSITAAVACI 293
Query: 304 IGVHFGHVIIHTKGHLARLKQWV 326
IG+ +GH + H + H R++ W+
Sbjct: 294 IGLQYGHSLAHLQDHKQRMQNWI 316
>gi|449446789|ref|XP_004141153.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 494
Score = 301 bits (772), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 158/331 (47%), Positives = 220/331 (66%), Gaps = 8/331 (2%)
Query: 16 ISEPDVSDQQE---KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNL 72
I EP S +S + RL SLD+FRG+ VALMI+VD+AGG P I+H+PW+G L
Sbjct: 79 IDEPQFSSSVRPILRSSDQCHRLVSLDVFRGITVALMIVVDYAGGVMPAINHSPWDGLTL 138
Query: 73 ADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
AD VMPFFLFIVGV++ALA K+IP R A +K + RTLKLLF G+ LQGGF H + LTY
Sbjct: 139 ADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLRTLKLLFLGLFLQGGFLHGVNNLTY 198
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
GVD++ IR G+LQRIA++Y L +L EI+ K D ++ R Y + A +
Sbjct: 199 GVDIQQIRWMGILQRIAIAYFLAALCEIWLK---GSDYVNSETALRRKYQLQLVAAVVLT 255
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYG--KVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
++YLAL YG YVPDW++ + + ++D K+F+V CG R P CNAVG IDRK+ GI
Sbjct: 256 MLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVKCGTRGDTGPACNAVGMIDRKIFGI 315
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
H+Y P + R++ C+ ++P GPL DAPSWC APF+PEGLLS+V ++++ ++G+H+GH
Sbjct: 316 QHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAPFDPEGLLSTVMAVVTCLVGLHYGH 375
Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+I+H K H R+ W+ L++ + L F
Sbjct: 376 IIVHFKDHRDRMLHWIIPSSCLIVLAIGLDF 406
>gi|186530230|ref|NP_199601.2| uncharacterized protein [Arabidopsis thaliana]
gi|332008203|gb|AED95586.1| uncharacterized protein [Arabidopsis thaliana]
Length = 440
Score = 301 bits (771), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 154/301 (51%), Positives = 205/301 (68%), Gaps = 6/301 (1%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD+FRGL VA MILVD GG P I+H+PW+G LADFVMPFFLFIVGV++A A
Sbjct: 44 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K + R A +K + R+LKLL G+ LQGGF H + LTYG+DV IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
L+V+L EI+ K + + S+ + Y +HW++A + +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220
Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
+D F V CGVR P CNAVG +DR LGI H+Y P + R+K C+ +
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
P GPL DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K H RL QW+
Sbjct: 281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWILRS 340
Query: 330 F 330
F
Sbjct: 341 F 341
>gi|242071239|ref|XP_002450896.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
gi|241936739|gb|EES09884.1| hypothetical protein SORBIDRAFT_05g020800 [Sorghum bicolor]
Length = 455
Score = 300 bits (767), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 159/337 (47%), Positives = 213/337 (63%), Gaps = 3/337 (0%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ QRLASLD+FRG+ V LMILVD GG P ISH+PW+G LADFV PFFLFIVGV++A
Sbjct: 60 RGQRLASLDVFRGITVVLMILVDDVGGLVPAISHSPWDGVTLADFVFPFFLFIVGVSLAF 119
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
A KR+P++ A KK + R KL G+LLQGG+ H +L+YGVD+ IRL G+LQRIA+
Sbjct: 120 AYKRVPNKTLATKKALIRASKLFLLGLLLQGGYFHTIHDLSYGVDLHKIRLMGILQRIAI 179
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
+Y V+L EI+ + D G + + R Y + + V Y LLYG YVPDW++
Sbjct: 180 AYFAVALCEIWLRG-GASDNGAGGYVLIRRYRHQLFVGLVLTVTYTVLLYGMYVPDWEYV 238
Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
+ + D+ K F V CGVR P CNAVG IDR VLGI H+Y HP + ++ C+ +SP
Sbjct: 239 VTSPDTTL--KNFMVKCGVRGDTGPGCNAVGMIDRCVLGIQHLYAHPVYLKTAQCSINSP 296
Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
GPL DAP+WC APF+PEGLLSS+ +I++ +IG+ GHVI+H K H R+ +W
Sbjct: 297 RNGPLPSDAPTWCEAPFDPEGLLSSLMAIVTCLIGLQIGHVIVHFKQHSKRIVRWSIPSL 356
Query: 331 ALLIFGLTLHFTNGEHGSGKFSTTCVCLFIYSKVILF 367
LLI G++L +S + C+ S + F
Sbjct: 357 ILLILGVSLDLFGMHMNKSLYSLSYTCVTTGSAGLFF 393
>gi|212724122|ref|NP_001131867.1| uncharacterized protein LOC100193245 [Zea mays]
gi|194692766|gb|ACF80467.1| unknown [Zea mays]
gi|413948803|gb|AFW81452.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
gi|413948804|gb|AFW81453.1| hypothetical protein ZEAMMB73_255914 [Zea mays]
Length = 492
Score = 299 bits (766), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 159/384 (41%), Positives = 223/384 (58%), Gaps = 32/384 (8%)
Query: 10 HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
H PL D + Q + K +R+ASLD+FRG VA+MILVD AGG WP I+HAPW G
Sbjct: 38 QHPPL-----DAAATQLEEQRKPERVASLDVFRGFTVAMMILVDDAGGAWPGINHAPWFG 92
Query: 70 CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
+ADFVMP FLFI+GV+ AL K++ ++ A KK R KL G++LQGG+ H +
Sbjct: 93 VTVADFVMPAFLFIIGVSAALVFKKMANKTAATKKAAIRASKLFILGVILQGGYIHGRHK 152
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
LTYGVD+ IR GVLQRIA+ Y + ++ EI+ + D V + Y W MA
Sbjct: 153 LTYGVDLDHIRWLGVLQRIAIGYFVAAMSEIWLVNNNLVDSPV---PFVKKYFIEWFMAI 209
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-----ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
+ V+Y+AL++G YV +W+F I +S ++ + + CGVR L PPCNAVG +D
Sbjct: 210 AITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAVGLVD 269
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R +LG NH+Y +P ++R+K C+ +SP GPL +AP WC APF+PEGLLS++ ++++ +
Sbjct: 270 RVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAVVTCFV 329
Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLI-----------FGLTLHFTNGEHGSGKFST 353
G+ FGHV+IH K H R+ W+ L I F L+ N +G S
Sbjct: 330 GLFFGHVLIHCKNHSQRMLIWLLASVVLTISAYLVLLLGMPFSKPLYTVNYMLLTGGVSG 389
Query: 354 TCVCLFIY--------SKVILFQW 369
+ L Y +LFQW
Sbjct: 390 FLLLLLYYIVDVIHIKKPFVLFQW 413
>gi|195652797|gb|ACG45866.1| hypothetical protein [Zea mays]
Length = 492
Score = 297 bits (760), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 158/384 (41%), Positives = 222/384 (57%), Gaps = 32/384 (8%)
Query: 10 HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
H PL D + Q + K +R+ASLD+FRG VA+ ILVD AGG WP I+HAPW G
Sbjct: 38 QHPPL-----DAAATQLEEQRKPERVASLDVFRGFTVAMXILVDDAGGAWPGINHAPWFG 92
Query: 70 CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
+ADFVMP FLFI+GV+ AL K++ ++ A KK R KL G++LQGG+ H +
Sbjct: 93 VTVADFVMPAFLFIIGVSAALVFKKMANKTAATKKAAIRASKLFILGVILQGGYIHGRHK 152
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
LTYGVD+ IR GVLQRIA+ Y + ++ EI+ + D V + Y W MA
Sbjct: 153 LTYGVDLDHIRWLGVLQRIAIGYFVAAMSEIWLVNNNLVDSPV---PFVKKYFIEWFMAI 209
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-----ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
+ V+Y+AL++G YV +W+F I +S ++ + + CGVR L PPCNAVG +D
Sbjct: 210 AITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNAVGLVD 269
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R +LG NH+Y +P ++R+K C+ +SP GPL +AP WC APF+PEGLLS++ ++++ +
Sbjct: 270 RVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSTLMAVVTCFV 329
Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLI-----------FGLTLHFTNGEHGSGKFST 353
G+ FGHV+IH K H R+ W+ L I F L+ N +G S
Sbjct: 330 GLFFGHVLIHCKNHSQRMLIWLLASVVLTISAYLVLLLGMPFSKPLYTVNYMLLTGGVSG 389
Query: 354 TCVCLFIY--------SKVILFQW 369
+ L Y +LFQW
Sbjct: 390 FLLLLLYYIVDVIHIKKPFVLFQW 413
>gi|125582342|gb|EAZ23273.1| hypothetical protein OsJ_06967 [Oryza sativa Japonica Group]
Length = 423
Score = 297 bits (760), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 139/294 (47%), Positives = 197/294 (67%), Gaps = 5/294 (1%)
Query: 48 LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
LMI+VD AG P ++H+PW+G +ADFVMPFFLF+VG+++ LA KR+PD+ +A KK +
Sbjct: 47 LMIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVL 106
Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
R LKL G++LQGGF H LT+GVD+ IRL G+LQRIA++YLL ++ EI+ K D
Sbjct: 107 RALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDD 166
Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
D + + R Y + ++A + +Y +L G YVPDW++ I S + K F+V C
Sbjct: 167 VDCGL---DVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTE--KSFSVRC 221
Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
GVR P CNAVG +DR +LGI+H+Y P + R+K C+ + P GPL DAPSWC APF
Sbjct: 222 GVRGDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPF 281
Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+PEGLLSSV +I++ +IG+ FGH+IIH + H R+ W+ F++L ++ F
Sbjct: 282 DPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDF 335
>gi|218190872|gb|EEC73299.1| hypothetical protein OsI_07466 [Oryza sativa Indica Group]
Length = 454
Score = 297 bits (760), Expect = 7e-78, Method: Compositional matrix adjust.
Identities = 139/294 (47%), Positives = 197/294 (67%), Gaps = 5/294 (1%)
Query: 48 LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
LMI+VD AG P ++H+PW+G +ADFVMPFFLF+VG+++ LA KR+PD+ +A KK +
Sbjct: 78 LMIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVL 137
Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
R LKL G++LQGGF H LT+GVD+ IRL G+LQRIA++YLL ++ EI+ K D
Sbjct: 138 RALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDD 197
Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
D + + R Y + ++A + +Y +L G YVPDW++ I S + K F+V C
Sbjct: 198 VDCGL---DVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTE--KSFSVRC 252
Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
GVR P CNAVG +DR +LGI+H+Y P + R+K C+ + P GPL DAPSWC APF
Sbjct: 253 GVRGDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPF 312
Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+PEGLLSSV +I++ +IG+ FGH+IIH + H R+ W+ F++L ++ F
Sbjct: 313 DPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDF 366
>gi|32487909|emb|CAE05368.1| OJ000315_02.13 [Oryza sativa Japonica Group]
Length = 452
Score = 296 bits (759), Expect = 9e-78, Method: Compositional matrix adjust.
Identities = 155/357 (43%), Positives = 216/357 (60%), Gaps = 36/357 (10%)
Query: 15 IISEPDVSDQQEKSHLKT------QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
+++ PD +Q++ + Q L + +F + LMILVD AG P I+H+PW+
Sbjct: 1 MVAWPDNRNQRQGIQVSVVDVAEGQWLTCVHLFMP-EMPLMILVDDAGAFLPAINHSPWD 59
Query: 69 GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD 128
G LADFVMPFFLFIVGVA+ALA KR+P++ +A +K I R LKL G++LQGGF H
Sbjct: 60 GVTLADFVMPFFLFIVGVALALAYKRVPNKLEATRKAILRALKLFCVGLVLQGGFFHGVR 119
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
LT+G+D+ IRL G+LQRIA++Y++ +L EI+ K D D F + + + +
Sbjct: 120 SLTFGIDMEKIRLMGILQRIAIAYIVTALCEIWLKGDDDVDSG---FDLLKRNRYQLFIG 176
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
V++ Y+ LYGTYVPDW++ I S + K F V C VR P CNAVG IDRK+L
Sbjct: 177 LIVMITYMGFLYGTYVPDWEYRISVPGSTE--KSFFVKCSVRGDTGPGCNAVGMIDRKIL 234
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL--------------- 293
GI H+Y P + RSK C+ +SP GPLR DAPSWC APF+PEGLL
Sbjct: 235 GIQHLYCRPVYARSKQCSINSPQNGPLRPDAPSWCQAPFDPEGLLRLQQYNISFANFAKF 294
Query: 294 ---------SSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
SSV +I++ +IG+ +GHVI+H + H R+ +W+ F++LI +L F
Sbjct: 295 SLFFLDSRISSVMAIVTCLIGLQYGHVIVHFQKHKERIMKWLIPSFSMLILAFSLDF 351
>gi|297791891|ref|XP_002863830.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
lyrata]
gi|297309665|gb|EFH40089.1| hypothetical protein ARALYDRAFT_494835 [Arabidopsis lyrata subsp.
lyrata]
Length = 432
Score = 296 bits (758), Expect = 1e-77, Method: Compositional matrix adjust.
Identities = 158/320 (49%), Positives = 208/320 (65%), Gaps = 13/320 (4%)
Query: 14 LIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
L IS P + +E RL SLD+FRGL VALMILVD G P I+H+PW+G LA
Sbjct: 24 LQISRPSLPPDKE-------RLVSLDVFRGLTVALMILVDDVGEILPSINHSPWDGVTLA 76
Query: 74 DFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
DFVMPFFLFIVGV++A A K + R A +K + R+LKLL G+ LQGGF H + LTYG
Sbjct: 77 DFVMPFFLFIVGVSLAFAYKNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYG 136
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
+DV IR G+LQRIA++YL+ +L EI+ K + + S+ + Y +HW++A +
Sbjct: 137 IDVEKIRFMGILQRIAIAYLVAALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITT 193
Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGI 250
+YL+LLYG YV DW++ I +D F V CGVR P CNAVG +DR LGI
Sbjct: 194 IYLSLLYGLYVSDWEYQISTEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGI 253
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
H+Y P + R+K C+ SP GPL DAPSWC APF+PEGLLSS+ +I++ ++G+H+GH
Sbjct: 254 QHLYRKPVYARTKQCSISSPNNGPLPPDAPSWCQAPFDPEGLLSSLMAIVTCLVGLHYGH 313
Query: 311 VIIHTKGHLARLKQWVTMGF 330
+IIH K H RL QW+ F
Sbjct: 314 IIIHFKDHKKRLNQWILRSF 333
>gi|224131042|ref|XP_002320987.1| predicted protein [Populus trichocarpa]
gi|222861760|gb|EEE99302.1| predicted protein [Populus trichocarpa]
Length = 481
Score = 296 bits (757), Expect = 2e-77, Method: Compositional matrix adjust.
Identities = 162/346 (46%), Positives = 216/346 (62%), Gaps = 14/346 (4%)
Query: 1 MSEIKAETTHHHP--------LIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILV 52
+ E + E H+P I+ + S TQRL SLD+FRGL VALMILV
Sbjct: 9 LDERQREPLLHNPRSLSNEEEEEITNTPSTSSSNASPPPTQRLLSLDVFRGLTVALMILV 68
Query: 53 DHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKL 112
D AGG +P I+H+PW G LADFVMPFFLF+VGV+I+L K++ + A KKVI RT+KL
Sbjct: 69 DDAGGAFPCINHSPWFGVTLADFVMPFFLFVVGVSISLVFKKVSSKPMATKKVIQRTIKL 128
Query: 113 LFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV 172
G+LLQGG+ H LTYGVDV IR GVLQRI++ YL ++ EI+ D D +
Sbjct: 129 FLLGLLLQGGYFHGRHNLTYGVDVGKIRWMGVLQRISIGYLFAAMSEIWLVDSITVDSPM 188
Query: 173 GRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTI--INKDSADYGKVFNVTCGVR 230
+ + Y W++A Y+ LLYG YVPDW+F + N ++G V CGVR
Sbjct: 189 ---AFVKKYYIQWMVAFLFCTFYMCLLYGLYVPDWEFEVPSTNLFEHEFGTKI-VNCGVR 244
Query: 231 AKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPE 290
L PPCNAVG IDR LG +H+Y HP +RR+K C+ +SP GPL ++P WC APF+PE
Sbjct: 245 GSLEPPCNAVGLIDRFFLGEHHLYQHPVYRRTKHCSVNSPDYGPLPPNSPGWCLAPFDPE 304
Query: 291 GLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFG 336
G+LSS+ + ++ +G+ FGH+++H KGH+ RL W F +LI G
Sbjct: 305 GILSSLMAAITCFLGLQFGHILVHFKGHMQRLCLWSVCSFIILITG 350
>gi|356527477|ref|XP_003532336.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 463
Score = 295 bits (755), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 171/371 (46%), Positives = 237/371 (63%), Gaps = 8/371 (2%)
Query: 1 MSEIKAETT----HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG 56
+S A+TT H + +I + ++ Q + K+ RL SLD+FRGL VALMILVD AG
Sbjct: 35 VSPTIAQTTPLHLHINNIIEEQHIIARHQPQPQPKSPRLVSLDVFRGLTVALMILVDDAG 94
Query: 57 GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWG 116
G P ++H+PWNG LAD+VMPFFLFIVGV++AL K++ DA +K R LKLL G
Sbjct: 95 GLIPALNHSPWNGLTLADYVMPFFLFIVGVSLALTYKKLSCGVDASRKASLRALKLLVLG 154
Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
+ LQGG+ H ++LTYGVD++ IR G+LQRI ++YL+ +L EI+ K D + G S
Sbjct: 155 LFLQGGYFHRVNDLTYGVDLKQIRWMGILQRIGVAYLVAALCEIWLKS--DDTVNSGP-S 211
Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
+ R Y + W +A + +YL LLYG YVPDW + I + S++ K F+V CGVR P
Sbjct: 212 LLRKYRYQWAVALILSFLYLCLLYGLYVPDWVYQIQTEPSSE-PKTFSVKCGVRGNTGPA 270
Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
CNAVG IDR +LGI+H+Y P + R C+ +SP GPL DAP+WC APF+PEGLLSSV
Sbjct: 271 CNAVGMIDRTILGIHHLYQRPIYARMPECSINSPNYGPLPPDAPAWCQAPFDPEGLLSSV 330
Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGKFSTTCV 356
+I++ +IG+H+GH+I+H K H R+ W+ L++FGL L +S +
Sbjct: 331 MAIVTCLIGLHYGHIIVHFKDHRVRIIYWMIPTSCLVVFGLALDLFGMHINKVLYSLSYT 390
Query: 357 CLFIYSKVILF 367
C+ + ILF
Sbjct: 391 CVTAGAAGILF 401
>gi|115446433|ref|NP_001046996.1| Os02g0526000 [Oryza sativa Japonica Group]
gi|49388281|dbj|BAD25399.1| unknown protein [Oryza sativa Japonica Group]
gi|49388287|dbj|BAD25402.1| unknown protein [Oryza sativa Japonica Group]
gi|113536527|dbj|BAF08910.1| Os02g0526000 [Oryza sativa Japonica Group]
Length = 376
Score = 295 bits (754), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 138/293 (47%), Positives = 196/293 (66%), Gaps = 5/293 (1%)
Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
MI+VD AG P ++H+PW+G +ADFVMPFFLF+VG+++ LA KR+PD+ +A KK + R
Sbjct: 1 MIIVDDAGAFLPALNHSPWDGVTIADFVMPFFLFMVGISLTLAYKRVPDKLEATKKAVLR 60
Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
LKL G++LQGGF H LT+GVD+ IRL G+LQRIA++YLL ++ EI+ K D
Sbjct: 61 ALKLFCLGLVLQGGFFHGVRSLTFGVDITKIRLMGILQRIAIAYLLAAICEIWLKGDDDV 120
Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCG 228
D + + R Y + ++A + +Y +L G YVPDW++ I S + K F+V CG
Sbjct: 121 DCGL---DVIRRYRYQLVVALLLSTMYTVILNGVYVPDWEYQISGPGSTE--KSFSVRCG 175
Query: 229 VRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFE 288
VR P CNAVG +DR +LGI+H+Y P + R+K C+ + P GPL DAPSWC APF+
Sbjct: 176 VRGDTGPACNAVGMLDRTILGIDHLYRRPVYARTKQCSINYPQNGPLPPDAPSWCQAPFD 235
Query: 289 PEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
PEGLLSSV +I++ +IG+ FGH+IIH + H R+ W+ F++L ++ F
Sbjct: 236 PEGLLSSVMAIVTCLIGLQFGHIIIHFEKHKGRIINWLIPSFSMLALAFSMDF 288
>gi|356516509|ref|XP_003526936.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 416
Score = 293 bits (751), Expect = 7e-77, Method: Compositional matrix adjust.
Identities = 141/313 (45%), Positives = 198/313 (63%), Gaps = 11/313 (3%)
Query: 18 EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
+P + + E + + R+ASLD+FRGL+V LMI VD+A +P I+HAPWNG +LADFVM
Sbjct: 5 QPLLLNDSEPTQFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGIHLADFVM 64
Query: 78 PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
PFFLFI G+++AL KR P R A K R L L GILLQGG+ H LT+GVD++
Sbjct: 65 PFFLFIAGISLALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQ 124
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
IR G+LQRI++ Y++ +L EI+ + K+ + Y W W +A +L +Y
Sbjct: 125 RIRWLGILQRISIGYIVAALCEIWLPAPRWKE-----LGFVKSYYWQWFVAVILLALYSG 179
Query: 198 LLYGTYVPDWQFTIINKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
LLYG YVPDWQF + S+ G ++ V C VR L P CN+ G IDR +LG++H+
Sbjct: 180 LLYGLYVPDWQFDVSASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHL 239
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
Y P +R K C + +G + +PSWCHAPF+PEG+LSS+++ +S IIG+ +GHV+
Sbjct: 240 YRKPVYRNLKGCNMSA--KGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLA 297
Query: 314 HTKGHLARLKQWV 326
H + H RL W+
Sbjct: 298 HLQDHKGRLYNWM 310
>gi|357510831|ref|XP_003625704.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
gi|355500719|gb|AES81922.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
Length = 444
Score = 293 bits (751), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 137/244 (56%), Positives = 172/244 (70%), Gaps = 4/244 (1%)
Query: 101 AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
VKK+I RTLKLLFWGILLQGG+SHAPDEL YGV+++ IR CG+LQRIAL Y +V+L+E
Sbjct: 79 TVKKIILRTLKLLFWGILLQGGYSHAPDELVYGVNMKFIRWCGILQRIALVYCIVALIET 138
Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
FT ++ S GR +IF Y W ++Y+ + YVP+W F ++ + D
Sbjct: 139 FTTKLRPTTLSPGRIAIFTAY--KWFGGFMAFLIYMITTFALYVPNWSF--VDHVNNDEP 194
Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
K + V CG+R L P CNAVGY+DR+ G+NH+Y P WRR KACT SP EGP R DAP
Sbjct: 195 KRYTVICGMRGHLGPACNAVGYVDRQTWGVNHLYSQPVWRRLKACTFSSPSEGPFRDDAP 254
Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
SWC APFEPEGLLSS+S+ILS IG+H+GHV+IH K H RLKQW +MGF LL+ + LH
Sbjct: 255 SWCLAPFEPEGLLSSISAILSGTIGIHYGHVLIHFKSHSERLKQWFSMGFVLLVVAIILH 314
Query: 341 FTNG 344
FT+
Sbjct: 315 FTDA 318
>gi|168007055|ref|XP_001756224.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162692734|gb|EDQ79090.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 411
Score = 293 bits (749), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 153/298 (51%), Positives = 199/298 (66%), Gaps = 17/298 (5%)
Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRAD-AVKKVIF 107
MILVD+AGG WP I+H+PW+G LADFV+PFFLFIVGVA+AL K+I + A +K I
Sbjct: 1 MILVDYAGGIWPAINHSPWDGVTLADFVLPFFLFIVGVALALTYKKIINEKQLASQKAIG 60
Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT----- 162
R+LKL+ G+ +QGG+ H +YGVD+ IR CGVLQRIAL+Y++V+L EI+
Sbjct: 61 RSLKLVIVGLFIQGGYFHGVHNTSYGVDLESIRWCGVLQRIALAYMVVALCEIWAPRGHY 120
Query: 163 KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV 222
+ +S RF FR +AA ++ +YL LLYG YVPDW+F SA V
Sbjct: 121 DSMNVYIKSTRRFGTFRA------VAAAIVAIYLVLLYGVYVPDWEFV-----SAADSTV 169
Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
F V CGVR + P CN VGY+DR +LG++H+Y +RR+ AC+ SP GPL AP W
Sbjct: 170 FQVKCGVRGDVGPSCNVVGYLDRTLLGLSHLYQKAVYRRAPACSVLSPDYGPLPAGAPVW 229
Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
C APF+PEGLLSS+S+I+S +G+HFGHV++H K H ARLK WV M LL+ G LH
Sbjct: 230 CKAPFDPEGLLSSMSAIVSCFLGLHFGHVLVHHKEHNARLKDWVLMSLTLLVTGALLH 287
>gi|356569086|ref|XP_003552737.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 461
Score = 292 bits (748), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 170/358 (47%), Positives = 232/358 (64%), Gaps = 6/358 (1%)
Query: 10 HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG 69
H H +I + +S Q + K+ RL SLD+FRGL VALMILVD AGG P ++H+PWNG
Sbjct: 48 HIHNIIEEQRIISRHQPQP--KSPRLVSLDVFRGLTVALMILVDDAGGLIPALNHSPWNG 105
Query: 70 CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
LAD+VMPFFLFIVGV++AL+ K++ DA +K R LKLL G+ LQGG+ H ++
Sbjct: 106 LTLADYVMPFFLFIVGVSLALSYKKLSCGVDASRKASLRALKLLALGLFLQGGYFHRVND 165
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
LT+GVD++ IR G+LQRIA++YL+V+L EI+ K D + G S+ R Y + W +A
Sbjct: 166 LTFGVDIKQIRWMGILQRIAVAYLVVALCEIWLKS--DDTVNSGP-SLLRKYRYQWAVAL 222
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
+ +YL LLYG YVPDW + I + SA+ K F+V CGVR P CN VG IDR +LG
Sbjct: 223 ILSFLYLCLLYGLYVPDWVYQIQTEPSAE-PKTFSVKCGVRGNTGPACNVVGMIDRMILG 281
Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
I H+Y P + R C+ +SP GPL DAP+WC APF+PEGLLSSV +I++ +IG+H+G
Sbjct: 282 IQHLYKRPIYARMPECSINSPNYGPLPPDAPAWCQAPFDPEGLLSSVMAIVTCLIGLHYG 341
Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGKFSTTCVCLFIYSKVILF 367
H+I+H K H R+ W+ LL+FGL L +S + C+ + +LF
Sbjct: 342 HIIVHFKDHRVRIIYWMIPTSCLLVFGLALDLFGMHINKVLYSLSYTCVTAGAAGVLF 399
>gi|224103167|ref|XP_002312951.1| predicted protein [Populus trichocarpa]
gi|222849359|gb|EEE86906.1| predicted protein [Populus trichocarpa]
Length = 419
Score = 290 bits (743), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 139/325 (42%), Positives = 208/325 (64%), Gaps = 16/325 (4%)
Query: 11 HHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGC 70
H PL+ D+ +Q S K R ASLD+FRGL V LM+LVD+ G P I+H+PWNG
Sbjct: 6 HKPLL----DIEEQLHTSK-KPPRAASLDVFRGLCVFLMMLVDYGGAIIPIIAHSPWNGL 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
+LAD VMPFFLFI GV++AL K++P+R +A K + + +KL G+++QGG+ H + L
Sbjct: 61 HLADSVMPFFLFIAGVSLALVYKKVPNRIEATWKAVLKAIKLFLLGVVIQGGYFHGINSL 120
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
TYGVD++ IR G+LQ+I++ Y++ +L EI+ + S + Y WHW +A
Sbjct: 121 TYGVDMKRIRWLGILQKISVGYIVAALCEIWLSCRTRRG-----VSFLKSYYWHWCVAFS 175
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
+ +YL LLYG YVPDWQF + N S+ ++ V+ V C +R L P CN+ G IDR
Sbjct: 176 LSAIYLGLLYGLYVPDWQFEMSNATSSVFPTNHSNVYMVKCSLRGDLGPACNSAGMIDRY 235
Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
+LGI+H+Y P +R K C + +G + ++ SWCHAPF+PEG+LSS+++ ++ IIG+
Sbjct: 236 ILGIDHLYKKPVYRNLKECNMST--DGQVPDNSASWCHAPFDPEGVLSSLTAAVTCIIGL 293
Query: 307 HFGHVIIHTKGHLARLKQWVTMGFA 331
+GH++ H + H R++ W F+
Sbjct: 294 QYGHLLAHLQDHKGRMENWTLFSFS 318
>gi|334188248|ref|NP_001190487.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008209|gb|AED95592.1| uncharacterized protein [Arabidopsis thaliana]
Length = 435
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 148/287 (51%), Positives = 198/287 (68%), Gaps = 6/287 (2%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD+FRGL VA MILVD GG P I+H+PW+G LADFVMPFFLFIVGV++A A
Sbjct: 44 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K + R A +K + R+LKLL G+ LQGGF H + LTYG+DV IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
L+V+L EI+ K + + S+ + Y +HW++A + +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220
Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
+D F V CGVR P CNAVG +DR LGI H+Y P + R+K C+ +
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
P GPL DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K
Sbjct: 281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFK 327
>gi|186530235|ref|NP_001119392.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008204|gb|AED95587.1| uncharacterized protein [Arabidopsis thaliana]
Length = 359
Score = 289 bits (739), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 148/291 (50%), Positives = 201/291 (69%), Gaps = 6/291 (2%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD+FRGL VA MILVD GG P I+H+PW+G LADFVMPFFLFIVGV++A A
Sbjct: 44 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K + R A +K + R+LKLL G+ LQGGF H + LTYG+DV IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
L+V+L EI+ K + + S+ + Y +HW++A + +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220
Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
+D F V CGVR P CNAVG +DR LGI H+Y P + R+K C+ +
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 280
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
P GPL DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K +++
Sbjct: 281 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKVNIS 331
>gi|238481501|ref|NP_001154765.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008206|gb|AED95589.1| uncharacterized protein [Arabidopsis thaliana]
Length = 435
Score = 288 bits (736), Expect = 4e-75, Method: Compositional matrix adjust.
Identities = 148/287 (51%), Positives = 198/287 (68%), Gaps = 6/287 (2%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD+FRGL VA MILVD GG P I+H+PW+G LADFVMPFFLFIVGV++A A
Sbjct: 38 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 97
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K + R A +K + R+LKLL G+ LQGGF H + LTYG+DV IRL G+LQRIA++Y
Sbjct: 98 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 157
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
L+V+L EI+ K + + S+ + Y +HW++A + +YL+LLYG YVPDW++ I+
Sbjct: 158 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 214
Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
+D F V CGVR P CNAVG +DR LGI H+Y P + R+K C+ +
Sbjct: 215 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINY 274
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
P GPL DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K
Sbjct: 275 PNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFK 321
>gi|62701854|gb|AAX92927.1| hypothetical protein LOC_Os11g14080 [Oryza sativa Japonica Group]
gi|77549602|gb|ABA92399.1| D8Ertd354e protein, putative [Oryza sativa Japonica Group]
gi|125576749|gb|EAZ17971.1| hypothetical protein OsJ_33516 [Oryza sativa Japonica Group]
Length = 447
Score = 287 bits (735), Expect = 5e-75, Method: Compositional matrix adjust.
Identities = 138/254 (54%), Positives = 173/254 (68%), Gaps = 9/254 (3%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
+E+ K++R+A+LD FRGL + LMILVD AGG + + H+PWNGC LADFVMPFFLFIV
Sbjct: 51 EEEPRKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIV 110
Query: 85 GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
GVAIA ALKR+P AVKK+ RTLK+LFWG+LLQGG+SHAPD+L+YGVD++ IR CG+
Sbjct: 111 GVAIAFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGI 170
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIAL Y +V+L+E FT V+ G ++IF Y W WL L +Y+ + YV
Sbjct: 171 LQRIALVYFVVALIEAFTTKVRPTTVRSGPYAIFHAYRWQWLGGFVALFIYMVTTFSLYV 230
Query: 205 PDWQFTIINKDSADYGKVF---------NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
PDW + N + GK F +V CGVR L+P CNAVGY+DR V GINH+Y
Sbjct: 231 PDWSYVYHNDGDVNDGKQFTVLLAVFPDHVQCGVRGHLDPACNAVGYVDRVVWGINHLYT 290
Query: 256 HPAWRRSKACTQDS 269
P W RSK DS
Sbjct: 291 QPVWIRSKFNIIDS 304
>gi|357467537|ref|XP_003604053.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
gi|355493101|gb|AES74304.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
Length = 421
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 141/316 (44%), Positives = 196/316 (62%), Gaps = 13/316 (4%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
S + R+AS+D+FRGL+V LMI VD+ G +P ISHAPWNG +LADFVMPFFLF+VG+
Sbjct: 12 NSETQFPRVASVDVFRGLSVFLMIFVDYGGSIFPIISHAPWNGLHLADFVMPFFLFLVGI 71
Query: 87 AIALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++AL K R + K + R+ +L GILLQGG+ H TYGVDV+ IR GV
Sbjct: 72 SLALVYKNKRSRPTQSSTWKPLLRSFQLFILGILLQGGYFHGIHSFTYGVDVQTIRFFGV 131
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRI++ Y++ +L +I + K S F+ Y HW +AA +L ++ LLYG +V
Sbjct: 132 LQRISIGYIVAALCQICLPTLPSKHT-----SFFKTYYSHWFVAAILLAIHSGLLYGLHV 186
Query: 205 PDWQFTIINKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
PDWQF S+ G V+ V C VR L P CN+ G IDR +LG++H+Y P +R
Sbjct: 187 PDWQFDASLSTSSLPPIQAGNVYTVNCSVRGDLGPACNSAGMIDRYILGLDHLYKKPVFR 246
Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
K C S G + +PSWCHAPF+PEG+LSS+++ +S IIG+ +GH++ + + H
Sbjct: 247 NLKECNMSS--TGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHILANLEDHKG 304
Query: 321 RLKQWVTMGFALLIFG 336
RL QW+ + L G
Sbjct: 305 RLNQWLGFSVSFLALG 320
>gi|115462187|ref|NP_001054693.1| Os05g0155700 [Oryza sativa Japonica Group]
gi|54291854|gb|AAV32222.1| unknown protein [Oryza sativa Japonica Group]
gi|113578244|dbj|BAF16607.1| Os05g0155700 [Oryza sativa Japonica Group]
gi|215694847|dbj|BAG90038.1| unnamed protein product [Oryza sativa Japonica Group]
gi|218196128|gb|EEC78555.1| hypothetical protein OsI_18526 [Oryza sativa Indica Group]
gi|222630256|gb|EEE62388.1| hypothetical protein OsJ_17178 [Oryza sativa Japonica Group]
Length = 491
Score = 283 bits (723), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 147/355 (41%), Positives = 214/355 (60%), Gaps = 22/355 (6%)
Query: 6 AETTHHHPLIISEPDVSD------------QQEKSHLKTQRLASLDIFRGLAVALMILVD 53
A+ H PL+ S D + + K +R+ASLD+FRGL VA+MILVD
Sbjct: 14 ADAGHRRPLLASADDDDEIRPYPASSPSPQHPAGAERKPRRVASLDVFRGLTVAMMILVD 73
Query: 54 HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLL 113
AGG WP ++H+PW G +ADFVMP FLFI+GV+ AL K+ P++ A KK R +KL
Sbjct: 74 DAGGAWPGMNHSPWLGVTVADFVMPAFLFIIGVSAALVFKKTPNKTVATKKAAIRAIKLF 133
Query: 114 FWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
G++LQGG+ H LTYG+D+ IR GVLQRIA+ Y L ++ EI+ + D ++
Sbjct: 134 ILGVILQGGYIHGRHNLTYGIDLDHIRWLGVLQRIAIGYFLAAISEIWLVNNISVDSAI- 192
Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-------ADYGKVFNVT 226
S + Y W++A + +Y+ LL G YV +W+F + +S + + +
Sbjct: 193 --SFVKKYFMEWIVAVMISALYVGLLLGLYVSNWEFKVQTSNSILTIPTPGNEIGMKMIQ 250
Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
CGVR L PPCNAVG++DR +LG NH+Y +P ++R+K C+ +SP GPL +AP WC AP
Sbjct: 251 CGVRGSLGPPCNAVGFVDRVLLGENHLYKNPVYKRTKECSVNSPDYGPLPPNAPDWCLAP 310
Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
F+PEGLLS++ + ++ +G+HFGHV++H K H R+ W+ L + G L
Sbjct: 311 FDPEGLLSTLMAAVTCFVGLHFGHVLVHCKDHSPRMLLWLLASTVLTVSGFLLQL 365
>gi|224072443|ref|XP_002303734.1| predicted protein [Populus trichocarpa]
gi|222841166|gb|EEE78713.1| predicted protein [Populus trichocarpa]
Length = 381
Score = 282 bits (722), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 150/320 (46%), Positives = 207/320 (64%), Gaps = 3/320 (0%)
Query: 48 LMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF 107
LMILVD AGG P I+H+PWNG LAD VMPFFLF+VGV++ L K++P +A A +K I
Sbjct: 3 LMILVDDAGGVLPAINHSPWNGLTLADVVMPFFLFMVGVSLGLTYKKLPSKAVATRKAIL 62
Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
R LKLL G+ LQGGF H ++LT+GVD+ IR G+LQRIA+ YL+ ++ EI+ K D
Sbjct: 63 RALKLLVIGLFLQGGFLHGLNDLTFGVDMVQIRWMGILQRIAIGYLIGAMCEIWLKG--D 120
Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
+ G S+ R Y W ++ +YL+LLYG YVPDW++ I S+ K+F V C
Sbjct: 121 NHVASG-LSMLRKYQLQWGAVVVLVSLYLSLLYGLYVPDWEYEIPVAASSSSPKIFRVKC 179
Query: 228 GVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPF 287
GVR CNAVG IDR VLGI H+Y P + R+KAC+ +SP GPL DAPSWC APF
Sbjct: 180 GVRGTTGSACNAVGMIDRTVLGIQHLYRKPIYARTKACSINSPDYGPLPPDAPSWCQAPF 239
Query: 288 EPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHG 347
+PEGLLSSV +I++ ++G+H+GH+I+H K H R+ W+ ++ GL L +
Sbjct: 240 DPEGLLSSVMAIVTCLVGLHYGHIIVHFKEHKDRILHWMVPSTCFVVLGLVLDLSGMHVN 299
Query: 348 SGKFSTTCVCLFIYSKVILF 367
++ + +C+ + I+F
Sbjct: 300 KALYTFSYMCVTAGAAGIVF 319
>gi|356534906|ref|XP_003535992.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Glycine max]
Length = 489
Score = 281 bits (720), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 159/350 (45%), Positives = 218/350 (62%), Gaps = 9/350 (2%)
Query: 13 PLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNL 72
PL S P +D S L QRL+SLD+FRGL VALMILVD+ G +P ++H+PW G L
Sbjct: 36 PLPQSNP--TDTSSLS-LPNQRLSSLDVFRGLTVALMILVDNVGRAFPSLNHSPWFGVTL 92
Query: 73 ADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
ADFVMPFFLF+VGV+I L K++ + +A KKVI RTLKL G+LLQGG+ H +LTY
Sbjct: 93 ADFVMPFFLFVVGVSIGLVFKKVSSKPNATKKVISRTLKLFLLGLLLQGGYFHGHGKLTY 152
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
GVD+ IR GVLQRI++ Y S+ EI+ + S F R Y W+ + +
Sbjct: 153 GVDLSKIRWLGVLQRISIGYFFASISEIWLVNHNILVDSPAGF--VRKYSIQWMFSILLC 210
Query: 193 VVYLALLYGTYVPDWQF----TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
VYL LLYG YVP+W+F + + DS+ + NV C VR L PPCN VG+IDR +L
Sbjct: 211 SVYLCLLYGLYVPNWKFKHSNLLSSSDSSHLSIIQNVHCEVRGSLEPPCNVVGFIDRLIL 270
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G +HMY P + R+K C+ +SP GPL D+P WC APF+PEG+LSS+ + ++ +G+ +
Sbjct: 271 GEDHMYQRPVYIRTKECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAITCFMGLQY 330
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGKFSTTCVCL 358
GH+I+H +GH R+ W F+LL+ G L ++ + C+
Sbjct: 331 GHIIVHLQGHKQRVLLWSVFSFSLLLIGYILEILGMPLSKALYTLSYTCI 380
>gi|357442361|ref|XP_003591458.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
gi|355480506|gb|AES61709.1| Heparan-alpha-glucosaminide N-acetyltransferase [Medicago
truncatula]
Length = 476
Score = 280 bits (717), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 155/340 (45%), Positives = 210/340 (61%), Gaps = 14/340 (4%)
Query: 2 SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
S I T H + L P VS + QRL SLD+FRGL VALMILVD G +P
Sbjct: 23 SSILTLTVHENEL----PPVS-------VPNQRLVSLDVFRGLTVALMILVDDVGRAFPS 71
Query: 62 ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG 121
++H+PW G LADFVMPFFLF VGV+IAL K++ + +A KK+I RT+KL G+LLQG
Sbjct: 72 LNHSPWFGVTLADFVMPFFLFGVGVSIALVFKKVSSKQNATKKIISRTIKLFLLGLLLQG 131
Query: 122 GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
G+ H LTYG+D+ +R GVLQRI++ Y L S+ EI+ + S F R Y
Sbjct: 132 GYFHGRGNLTYGLDLTKLRWFGVLQRISIGYFLASMSEIWLVNGNILVDSPAAF--VRKY 189
Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKD-SADYGKVFNVTCGVRAKLNPPCNAV 240
W+ + + VYL LLYG YVP+W+F N + NV C +R L+PPCNAV
Sbjct: 190 SIQWIFSILLCSVYLCLLYGLYVPNWEFEHSNLLWPGRVSTIQNVHCDMRGSLDPPCNAV 249
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
G+IDR +LG +HMY P +RR+K C+ +SP GPL D+P WC APF+PEG+LSS+ + +
Sbjct: 250 GFIDRLILGEDHMYQRPVYRRTKECSVNSPDYGPLPPDSPGWCLAPFDPEGILSSLMAAI 309
Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLH 340
+ +G+ FGH+++ + H R+ W F+LL+ G L
Sbjct: 310 TCFVGLQFGHILVIFQAHKQRVLLWSVFSFSLLVVGYVLE 349
>gi|449458622|ref|XP_004147046.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
gi|449489633|ref|XP_004158370.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 418
Score = 278 bits (711), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 145/331 (43%), Positives = 200/331 (60%), Gaps = 14/331 (4%)
Query: 17 SEPDVSDQQE--KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
S P + +QQE S K R+ SLD+FRGL+V +M+LVD+ G P ISH+PW G +LAD
Sbjct: 4 SRPLLKNQQELPASSGKAPRVVSLDVFRGLSVFMMMLVDYGGSFLPIISHSPWIGLHLAD 63
Query: 75 FVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
FVMP+FLFI GV++AL K + + A + R L L G+ LQGG+ H LTYGV
Sbjct: 64 FVMPWFLFIAGVSVALVYKEVESKVAAARNAACRGLYLFLLGVFLQGGYFHGITSLTYGV 123
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIF-TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
D+ IR G+LQRI++ YL+ +L EI+ T+ +++ Q FS WHW + +L
Sbjct: 124 DLESIRWLGILQRISIGYLIAALCEIWLTRCTREEAQHTKSFS------WHWCIIFFLLS 177
Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGK---VFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
+Y+ L YG YVPDW F I S+ V+ V C +R L P CN+ G IDR VLGI
Sbjct: 178 LYMGLSYGLYVPDWDFKISAPSSSLPLSGSYVYKVNCSLRGDLGPACNSAGMIDRYVLGI 237
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
+H+Y P +R K C S G + +PSWC APFEPEGLLSS+++ ++ IIG+ +GH
Sbjct: 238 HHLYTKPVYRNLKECNISS--SGQFPETSPSWCRAPFEPEGLLSSLTATVACIIGLQYGH 295
Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
++ + H R W + F +L FG+ L F
Sbjct: 296 ILARAQDHKTRTNGWFLLSFKILAFGIFLVF 326
>gi|449528551|ref|XP_004171267.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cucumis sativus]
Length = 380
Score = 276 bits (705), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 142/295 (48%), Positives = 200/295 (67%), Gaps = 5/295 (1%)
Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
MI+VD+AGG P I+H+PW+G LAD VMPFFLFIVGV++ALA K+IP R A +K + R
Sbjct: 1 MIVVDYAGGVMPAINHSPWDGLTLADLVMPFFLFIVGVSLALAYKKIPSRGIATQKAVLR 60
Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
TLKLLF G+ LQGGF H + LTYGVD++ IR G+LQRIA++Y L +L EI+ K
Sbjct: 61 TLKLLFLGLFLQGGFLHGVNNLTYGVDIQQIRWMGILQRIAIAYFLAALCEIWLK---GS 117
Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG--KVFNVT 226
D ++ R Y + A + ++YLAL YG YVPDW++ + + ++D K+F+V
Sbjct: 118 DYVNSETALRRKYQLQLVAAVVLTMLYLALSYGLYVPDWEYQVPSLTTSDVASPKIFSVK 177
Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
CG R P CNAVG IDRK+ GI H+Y P + R++ C+ ++P GPL DAPSWC AP
Sbjct: 178 CGTRGDTGPACNAVGMIDRKIFGIQHLYKRPIYARTEQCSINAPDYGPLPPDAPSWCQAP 237
Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
F+PEGLLS+V ++++ ++G+H+GH+I+H K H R+ W+ L++ + L F
Sbjct: 238 FDPEGLLSTVMAVVTCLVGLHYGHIIVHFKDHRDRMLHWIIPSSCLIVLAIGLDF 292
>gi|255581844|ref|XP_002531722.1| conserved hypothetical protein [Ricinus communis]
gi|223528625|gb|EEF30642.1| conserved hypothetical protein [Ricinus communis]
Length = 397
Score = 272 bits (696), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 131/279 (46%), Positives = 182/279 (65%), Gaps = 10/279 (3%)
Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
M+LVD+ G +P I+H+PWNG +LADFVMPFFLFI GV++AL K++ R DA K + R
Sbjct: 1 MMLVDYGGSIFPIIAHSPWNGLHLADFVMPFFLFIAGVSLALVYKKVTKRIDATWKAMLR 60
Query: 109 TLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
+KL F G+ LQGG+ H + LTYGVD+ IR G+LQRI++ Y++ +L EI+ + +
Sbjct: 61 AVKLFFLGVFLQGGYFHGINSLTYGVDIERIRWFGILQRISIGYIVAALCEIW---LSRR 117
Query: 169 DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA----DYGKVFN 224
QS F+ Y WHW++A + VYL LLYG YVPDWQF + N S+ + V+
Sbjct: 118 TQSQREIGFFKNYYWHWVVAFSLSAVYLGLLYGLYVPDWQFEMSNAASSALPINGSNVYM 177
Query: 225 VTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH 284
V C VR L P CN+ G IDR VLG +H+Y P R K C + G + + +PSWCH
Sbjct: 178 VKCSVRGDLGPACNSAGMIDRYVLGFDHLYTKPVHRNLKECNMTN---GQVSESSPSWCH 234
Query: 285 APFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
APF+PEGLLSS+++ ++ IIG+ GHV+ H + H R++
Sbjct: 235 APFDPEGLLSSLTAAITCIIGLQCGHVLAHIQEHKGRIE 273
>gi|395146531|gb|AFN53685.1| putative aquaporin PIP2-8 [Linum usitatissimum]
Length = 694
Score = 264 bits (674), Expect = 7e-68, Method: Compositional matrix adjust.
Identities = 142/342 (41%), Positives = 201/342 (58%), Gaps = 42/342 (12%)
Query: 3 EIKAETTHHHPLIISEPDVSDQQEKSHL-----KTQRLASLDIFRGLAVALMILVDHAGG 57
E + + + PL PD S +++ L K +RLASLD FRGL + LM+LVD+ G
Sbjct: 13 EDEMQKSFRPPL--PPPDFSGREDGQLLMLYRKKNKRLASLDAFRGLCIFLMMLVDYGGH 70
Query: 58 DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
+P I+H+ WNG +LADFVMPFFLFIVGV+IAL K+ P+R +A +K + +++KL GI
Sbjct: 71 VFPTIAHSAWNGIHLADFVMPFFLFIVGVSIALVYKKAPNRVEATRKALLKSVKLFLVGI 130
Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
LLQ QRI++ Y++ ++ EI+ ++ K G I
Sbjct: 131 LLQE------------------------QRISIGYIVGAICEIWL-SIRRK----GDVGI 161
Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC 237
+ Y WHW+ A ++ VY L YG YVPDWQF++ D VF V C V+ + P C
Sbjct: 162 IKSYYWHWIAALAIVAVYARLSYGLYVPDWQFSL----PGDQHHVFTVKCSVKGDVGPAC 217
Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
N+ G IDR VLG++H+Y P ++ K C S + P +DAPSWCHAPF+PEGLLSS++
Sbjct: 218 NSAGMIDRYVLGLSHLYAKPVYKNLKVCNMSSNKQVP--EDAPSWCHAPFDPEGLLSSLT 275
Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
+ ++ IIG+ FGHV+ H + H RL+ W L+ GL L
Sbjct: 276 AAVTCIIGLQFGHVLAHIQDHKGRLENWSGFSVFFLVLGLFL 317
>gi|413937083|gb|AFW71634.1| hypothetical protein ZEAMMB73_862609 [Zea mays]
Length = 317
Score = 250 bits (638), Expect = 9e-64, Method: Compositional matrix adjust.
Identities = 118/233 (50%), Positives = 158/233 (67%), Gaps = 3/233 (1%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ QRLASLD+FRG+ V LMI+VD AGG P ++H+PW+G +ADF+MPFFLFIVGV++ L
Sbjct: 87 RQQRLASLDVFRGITVLLMIIVDDAGGFLPALNHSPWDGVTVADFIMPFFLFIVGVSLTL 146
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
A KR+PDR +A +K + R LKL G++LQGGF H LT+GVD+ IRL G+LQRIA+
Sbjct: 147 AYKRVPDRVEATRKAVLRALKLFCLGLVLQGGFFHGVHSLTFGVDLTKIRLMGILQRIAI 206
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
+YLL ++ EI+ K D D G + R Y + + + + Y LLYG YVPDW++
Sbjct: 207 AYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGMYVPDWEYQ 263
Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
I S+ K F+V CGVR P CNAVG +DR VLGI+H+Y P + R+K
Sbjct: 264 IAGPGSSSTEKSFSVKCGVRGDTGPACNAVGMVDRTVLGIDHLYRRPVYARTK 316
>gi|413918232|gb|AFW58164.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
Length = 423
Score = 248 bits (634), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 117/248 (47%), Positives = 164/248 (66%), Gaps = 5/248 (2%)
Query: 94 RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
R+PD+ DA +K + R LKL G++LQGGF H L++GVD++ IRL GVLQRIA++YL
Sbjct: 93 RVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAYL 152
Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
L +L EI+ + +D D + + + Y + + A V + Y++LLYGTYVPDW++
Sbjct: 153 LTALCEIWIRGDEDVDYG---YDLLKRYRYQLFVGAVVAITYMSLLYGTYVPDWEYQTSA 209
Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
S + K V CGVR +P CNAVG IDRK+LGI H+Y P + RSK C+ DSP G
Sbjct: 210 PGSTE--KHLFVKCGVRGDTSPGCNAVGMIDRKILGIQHLYGRPVYARSKQCSIDSPQNG 267
Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
PL DAPSWC APF+PEGLLSSV +I++ +IG+ +GHVI+H + H R+ W+ F++L
Sbjct: 268 PLPSDAPSWCQAPFDPEGLLSSVMAIVTCLIGLQYGHVIVHFQKHRERMMNWLIPSFSML 327
Query: 334 IFGLTLHF 341
+ + F
Sbjct: 328 VLAFAMDF 335
>gi|125533951|gb|EAY80499.1| hypothetical protein OsI_35679 [Oryza sativa Indica Group]
Length = 444
Score = 241 bits (614), Expect = 6e-61, Method: Compositional matrix adjust.
Identities = 125/257 (48%), Positives = 160/257 (62%), Gaps = 18/257 (7%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
+E+ K++R+A+LD FRGL + LMILVD AGG + + H+PWNGC LADFVMPFFLFIV
Sbjct: 51 EEEPRKKSKRVAALDAFRGLTIVLMILVDDAGGAYERMDHSPWNGCTLADFVMPFFLFIV 110
Query: 85 GVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
GVAIA ALKR+P AVKK+ RTLK+LFWG+LLQGG+SHAPD+L+YGVD++ IR CG+
Sbjct: 111 GVAIAFALKRVPKLGAAVKKITIRTLKMLFWGLLLQGGYSHAPDDLSYGVDMKKIRWCGI 170
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKD---QSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
LQ + + + + D+ +S S R L L +Y+ +
Sbjct: 171 LQNLLVLFDNAEDSFGVLRGCSDRGIHHKSSAYDSAVR------LGGFVALFIYMVTTFS 224
Query: 202 TYVPDWQFTIINKDSADYGKVF---------NVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
YVPDW + N + GK F +V CGVR L+P CNAVGY+DR V GINH
Sbjct: 225 LYVPDWSYIYHNDGDVNDGKQFTVLLAVFPDHVQCGVRGHLDPACNAVGYVDRVVWGINH 284
Query: 253 MYHHPAWRRSKACTQDS 269
+Y P W RSK DS
Sbjct: 285 LYTQPVWIRSKFNIVDS 301
>gi|326432441|gb|EGD78011.1| hypothetical protein PTSG_09649 [Salpingoeca sp. ATCC 50818]
Length = 1087
Score = 240 bits (612), Expect = 8e-61, Method: Compositional matrix adjust.
Identities = 136/350 (38%), Positives = 189/350 (54%), Gaps = 51/350 (14%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
Q + L +RL+SLD+FRG VALM+ VD G +P I H+PWNG LADFVMPFF FI
Sbjct: 627 QPAQRSLPKERLSSLDVFRGFTVALMVFVDETGAAFPPIDHSPWNGVRLADFVMPFFDFI 686
Query: 84 VGVAIALALKRI--------PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
VGV++AL+ K+ P A++K R LKL G+L QGG D + Y D
Sbjct: 687 VGVSLALSFKKFDLEDATTTPRVWPALRKATIRFLKLFILGMLTQGGI----DIMNY--D 740
Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFT------KDVQDKDQSVG----RFSIFRLYCWHW 185
+ IR+ G+LQR+A+ Y V+L+EIF ++ + D G + Y WHW
Sbjct: 741 LAHIRIMGILQRVAVCYYAVALMEIFLPRNKKYRNYNETDTVTGWAVDVLHMLWRYKWHW 800
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
AAC+ + ++YG VPD F CG R L P CNA YIDR
Sbjct: 801 FTAACLFATHTGIMYGVNVPD---------------AFGEECG-RGVLTPACNAATYIDR 844
Query: 246 KVLGINHMY----------HHPAWRRSKACTQDSPFEGPLRKDAPSWC-HAPFEPEGLLS 294
VL + HMY + ++R C+ SP + +DAP+WC H PF+PEGL+S
Sbjct: 845 NVLTVEHMYFPANGGDKSGNDVTFQRLPECSTCSPGKCVPPEDAPAWCLHGPFDPEGLVS 904
Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNG 344
S+++I++T+IG+H+GHV+ + AR+ W G L+ G LHF+
Sbjct: 905 SLNAIIATVIGIHYGHVLRRVQSPKARIVHWTAFGVVQLVIGFALHFSGA 954
>gi|186530239|ref|NP_001119393.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008205|gb|AED95588.1| uncharacterized protein [Arabidopsis thaliana]
Length = 292
Score = 224 bits (571), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 119/231 (51%), Positives = 157/231 (67%), Gaps = 3/231 (1%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD+FRGL VA MILVD GG P I+H+PW+G LADFVMPFFLFIVGV++A A
Sbjct: 44 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K + R A +K + R+LKLL G+ LQGGF H + LTYG+DV IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
L+V+L EI+ K + + S+ + Y +HW++A + +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
+D F V CGVR P CNAVG +DR LGI H+Y P + R+K
Sbjct: 221 KEDQGSTLTTFLVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 271
>gi|238481503|ref|NP_001154766.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008207|gb|AED95590.1| uncharacterized protein [Arabidopsis thaliana]
Length = 295
Score = 220 bits (561), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 119/234 (50%), Positives = 157/234 (67%), Gaps = 6/234 (2%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD+FRGL VA MILVD GG P I+H+PW+G LADFVMPFFLFIVGV++A A
Sbjct: 44 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 103
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K + R A +K + R+LKLL G+ LQGGF H + LTYG+DV IRL G+LQRIA++Y
Sbjct: 104 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIAIAY 163
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
L+V+L EI+ K + + S+ + Y +HW++A + +YL+LLYG YVPDW++ I+
Sbjct: 164 LVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEYQIL 220
Query: 213 NKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
+D F V CGVR P CNAVG +DR LGI H+Y P + R+K
Sbjct: 221 KEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTK 274
>gi|238481505|ref|NP_001154767.1| uncharacterized protein [Arabidopsis thaliana]
gi|332008208|gb|AED95591.1| uncharacterized protein [Arabidopsis thaliana]
Length = 340
Score = 217 bits (553), Expect = 6e-54, Method: Compositional matrix adjust.
Identities = 112/244 (45%), Positives = 159/244 (65%), Gaps = 6/244 (2%)
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
++ +P + A +K + R+LKLL G+ LQGGF H + LTYG+DV IRL G+LQRIA
Sbjct: 1 MSFAVLPSQFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQRIA 60
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
++YL+V+L EI+ K + + S+ + Y +HW++A + +YL+LLYG YVPDW++
Sbjct: 61 IAYLVVALCEIWLKGNHNVSSEL---SMIKKYRFHWVVAFVITTIYLSLLYGLYVPDWEY 117
Query: 210 TIINKDSADYGKVF---NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
I+ +D F V CGVR P CNAVG +DR LGI H+Y P + R+K C+
Sbjct: 118 QILKEDQGSTLTTFLNLKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCS 177
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
+ P GPL DAPSWC APF+PEGLLSS+ + ++ ++G+H+GH+IIH K H RL QW+
Sbjct: 178 INYPNNGPLPPDAPSWCQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKDHKKRLNQWI 237
Query: 327 TMGF 330
F
Sbjct: 238 LRSF 241
>gi|413920627|gb|AFW60559.1| hypothetical protein ZEAMMB73_831897 [Zea mays]
Length = 343
Score = 215 bits (548), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 109/198 (55%), Positives = 131/198 (66%)
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRIAL Y V+L+E T V+ G ++IF Y W WL VVY+ + YVP
Sbjct: 19 QRIALVYFFVALIEALTVKVRPTTVRSGPYAIFDAYRWQWLGGLVAFVVYMVTTFSLYVP 78
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
DW F N+ + GK F V CGVRA L CNAVGY+DR+V GINH+Y P W RSK C
Sbjct: 79 DWSFVYHNEGDVNDGKQFTVKCGVRASLEQACNAVGYVDRQVWGINHLYTQPVWIRSKDC 138
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
T SP GPLR DAP+WC APFEPEGLLSS+SS+LS IG+H+GHV+IH K H RLK W
Sbjct: 139 TSSSPNMGPLRSDAPAWCLAPFEPEGLLSSISSVLSGTIGIHYGHVLIHFKTHKERLKHW 198
Query: 326 VTMGFALLIFGLTLHFTN 343
+ GF+LL+ G+ LHFTN
Sbjct: 199 LLTGFSLLVLGIILHFTN 216
>gi|395146473|gb|AFN53630.1| putative aquaporin PIP2-8 [Linum usitatissimum]
Length = 692
Score = 215 bits (548), Expect = 3e-53, Method: Compositional matrix adjust.
Identities = 104/233 (44%), Positives = 149/233 (63%), Gaps = 11/233 (4%)
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K++P+R +A +K +++KL GILLQGGF H LTYGVD+ IRL G+LQRI++ Y
Sbjct: 96 KKVPNRVEATRKAFLKSVKLFLVGILLQGGFFHGLHSLTYGVDIERIRLLGILQRISIGY 155
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
++ ++ EI+ V+ K G I + Y HW+ A ++VVY L YG YVPDWQF +
Sbjct: 156 IVGAICEIWL-SVRRK----GDVGIIKSYYSHWVAALAIVVVYARLSYGLYVPDWQFAL- 209
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
D V+ V C V+ + P CN+ G +DR VLG++H+Y P ++ K C S +
Sbjct: 210 ---PQDQHHVYTVKCSVKGDVGPACNSAGMMDRYVLGLSHLYAKPVYKNLKICNMSSNKQ 266
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
P +DAPSWCHAPF+PEGLLSS+++ ++ IIG+ FGHV+ H + H RL+ W
Sbjct: 267 VP--EDAPSWCHAPFDPEGLLSSLTAAVTCIIGLQFGHVLAHVQDHKGRLENW 317
>gi|167522597|ref|XP_001745636.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775985|gb|EDQ89607.1| predicted protein [Monosiga brevicollis MX1]
Length = 1047
Score = 210 bits (535), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 144/377 (38%), Positives = 198/377 (52%), Gaps = 68/377 (18%)
Query: 4 IKAETTHHHPLIISEPDVSDQQEKSHLK-----------TQRLASLDIFRGLAVALMILV 52
++ ++ PL+ + D S+ Q KS++ +RL++LD++RGL +A+MILV
Sbjct: 566 VRPRDSNRTPLLPASTD-SNIQSKSNIDLATDPVAPKPPRERLSALDVYRGLTIAVMILV 624
Query: 53 DHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-------DRADAVKKV 105
D G +P I HAPWNG +LAD V+P F FIVGV+IALA KR R A KK
Sbjct: 625 DETGAAFPPIDHAPWNGLHLADTVVPSFDFIVGVSIALAFKRFDLEAGAQGQRWTAFKKA 684
Query: 106 IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDV 165
R LK LF GI D+ IR+ G+LQR+A+ Y V+L+EIF +
Sbjct: 685 TDRFLK-LFGGITFM------------NYDLTNIRIFGILQRVAVCYFAVALMEIFLPRL 731
Query: 166 QD---------KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS 216
D +F Y WHW AA +L V+ ++LYG VPD
Sbjct: 732 TGALPADNGTWADWMRRTQHLFWRYRWHWFSAALLLAVHTSILYGVDVPD---------- 781
Query: 217 ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH-----HPA-----WRRSKACT 266
F CG R +L P CNA YIDR +L + HMY PA ++R C+
Sbjct: 782 -----AFGERCG-RGQLTPACNAATYIDRLILTVPHMYFPENGGDPAHADVTFKRLPECS 835
Query: 267 QDSPFEGPLRKDAPSWC-HAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
SP DAP+WC H PF+PEGL+SS+++I++TIIGVH+GHV+ K + R+ QW
Sbjct: 836 SCSPGLCVAPADAPAWCLHGPFDPEGLVSSLTAIVTTIIGVHYGHVLRQIKSPMERIFQW 895
Query: 326 VTMGFALLIFGLTLHFT 342
+ L+ GL LHF+
Sbjct: 896 SSFALLQLLLGLILHFS 912
>gi|255642425|gb|ACU21476.1| unknown [Glycine max]
Length = 326
Score = 206 bits (523), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 99/243 (40%), Positives = 143/243 (58%), Gaps = 11/243 (4%)
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
+R P R A K R L L GILLQGG+ H LT+GVD++ IR G+LQRI++ Y
Sbjct: 41 QRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQRIRWLGILQRISIGY 100
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
++ +L EI+ + K+ + Y W W +A +L +Y LLYG YVPDWQF +
Sbjct: 101 IVAALCEIWLPAPRWKE-----LGFVKSYYWQWFVAVILLALYSGLLYGLYVPDWQFDVS 155
Query: 213 NKDSA----DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
S+ G ++ V C VR L P CN+ G IDR +LG++H+Y P +R K C
Sbjct: 156 ASTSSLPPIGGGDIYMVNCSVRGDLGPACNSAGMIDRYILGLDHLYRKPVYRNLKGCNMS 215
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
+ +G + +PSWCHAPF+PEG+LSS+++ +S IIG+ +GHV+ H + H RL W+
Sbjct: 216 A--KGQVSDSSPSWCHAPFDPEGILSSITAAVSCIIGLQYGHVLAHLQDHKGRLYNWMCF 273
Query: 329 GFA 331
+
Sbjct: 274 SLS 276
>gi|255635187|gb|ACU17949.1| unknown [Glycine max]
Length = 217
Score = 187 bits (474), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 93/200 (46%), Positives = 127/200 (63%), Gaps = 5/200 (2%)
Query: 18 EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
+P + + E + + R+ASLD+FRGL+V LMI VD+A +P I+HAPWNG +LADFVM
Sbjct: 5 QPLLLNDSEPTQFQNTRIASLDVFRGLSVFLMIFVDYAASIFPIIAHAPWNGTHLADFVM 64
Query: 78 PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
PFFLFI G+++AL KR P R A K R L L GILLQGG+ H LT+GVD++
Sbjct: 65 PFFLFIAGISLALVYKRRPHRTQATWKAFARALNLFALGILLQGGYFHGVTSLTFGVDIQ 124
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
IR G+LQRI++ Y++ +L EI+ + K+ + Y W W +A +L +Y
Sbjct: 125 RIRWLGILQRISIGYIVAALCEIWLPAPRWKE-----LGFVKSYYWQWFVAVILLALYSG 179
Query: 198 LLYGTYVPDWQFTIINKDSA 217
LLYG YVPDWQF + S+
Sbjct: 180 LLYGLYVPDWQFDVSASTSS 199
>gi|413947252|gb|AFW79901.1| hypothetical protein ZEAMMB73_198786 [Zea mays]
Length = 505
Score = 179 bits (454), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 85/195 (43%), Positives = 117/195 (60%), Gaps = 3/195 (1%)
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
RIA++YLL ++ EI+ K D D G + R Y + + + + Y LLYG YVPD
Sbjct: 279 RIAIAYLLAAVCEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYSILLYGIYVPD 335
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
W++ I S+ K F V CGVR P CNAVG +DR +LGI+H+Y P + R+K C+
Sbjct: 336 WEYQIAGPGSSSTKKSFFVKCGVRGDTRPACNAVGMVDRTILGIDHLYRRPVYVRTKECS 395
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
D GPL DAPSWC APF+PEGLLS V +I++ +IG+ F HVIIH + H R+ W+
Sbjct: 396 IDYLENGPLPPDAPSWCQAPFDPEGLLSFVMAIVTCLIGLQFRHVIIHFEKHRGRIASWL 455
Query: 327 TMGFALLIFGLTLHF 341
F++L + F
Sbjct: 456 VPSFSMLALAFVMDF 470
>gi|356536971|ref|XP_003537005.1| PREDICTED: uncharacterized protein LOC100781855 [Glycine max]
Length = 357
Score = 176 bits (447), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 89/162 (54%), Positives = 109/162 (67%), Gaps = 2/162 (1%)
Query: 5 KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHA--GGDWPEI 62
K E H D + + + + + + I LM+L D A GG +P I
Sbjct: 20 KGELKHEIERTNGNGDSIEHDKDARITQEGESVQQIVEQEQPLLMVLEDDADAGGAYPRI 79
Query: 63 SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
H+PWNGC LADFVMPFFLF+VGVAIALALKRIP AVK +I RTLKLLFWGILLQGG
Sbjct: 80 DHSPWNGCTLADFVMPFFLFVVGVAIALALKRIPKVKYAVKNIILRTLKLLFWGILLQGG 139
Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
+SHAPD+L+YGVD+R IR CG+LQRIAL Y V+L+E +T +
Sbjct: 140 YSHAPDDLSYGVDMRFIRWCGILQRIALVYCAVALIETYTTN 181
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 34/72 (47%), Positives = 44/72 (61%)
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
G L++ A +C + +S+ LS IG+H+GHV+IH KGH RLKQW+ MGF L
Sbjct: 160 GILQRIALVYCAVALIETYTTNCISASLSGTIGIHYGHVLIHFKGHSERLKQWLLMGFLL 219
Query: 333 LIFGLTLHFTNG 344
L GL LHFT
Sbjct: 220 LTLGLMLHFTEA 231
>gi|413953638|gb|AFW86287.1| hypothetical protein ZEAMMB73_717084 [Zea mays]
Length = 357
Score = 168 bits (425), Expect = 4e-39, Method: Compositional matrix adjust.
Identities = 102/290 (35%), Positives = 154/290 (53%), Gaps = 50/290 (17%)
Query: 97 DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVS 156
++ A KK R KL G++LQGG+ H +LTYGVD+ IR GVLQRIA+ Y + +
Sbjct: 3 NKTAATKKAAIRASKLFILGVILQGGYIHGRHKLTYGVDLDHIRWLGVLQRIAIGYFVAA 62
Query: 157 LVEIFTKDVQDKDQ-------------SVGRF-------------------SIFRLYCWH 184
+ EI+ + D ++G F + Y
Sbjct: 63 MSEIWLVNNNLVDSPVPFVKKYFIEWIAIGYFVAAMSEIWLVNNNLVDSPVPFVKKYFIE 122
Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDS-----ADYGKVFNVTCGVRAKLNPPCNA 239
W MA + V+Y+AL++G YV +W+F I +S ++ + + CGVR L PPCNA
Sbjct: 123 WFMAIAITVLYVALVFGLYVANWEFEIQTSNSTLSIPSNSIETKMIQCGVRGSLGPPCNA 182
Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS----S 295
VG +DR +LG NH+Y +P ++R+K C+ +SP GPL +AP WC APF+PEGLLS +
Sbjct: 183 VGLVDRVLLGENHLYKNPVYKRTKECSINSPDYGPLPPNAPDWCLAPFDPEGLLSKPLYT 242
Query: 296 VSSILST-------IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
V+ +L T ++ +++ +IH K L QW+ M AL+++ L
Sbjct: 243 VNYMLLTGGVSGFLLLLLYYIVDVIHIKKPFV-LFQWMGMN-ALIVYVLA 290
>gi|413918234|gb|AFW58166.1| hypothetical protein ZEAMMB73_985435 [Zea mays]
Length = 202
Score = 166 bits (419), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 80/137 (58%), Positives = 105/137 (76%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD+FRG+ V LMI+VD AG P ++H+PW+G +ADFVMPFFLFIVGVA+ALA
Sbjct: 53 QRLVSLDVFRGITVLLMIIVDDAGAFIPAMNHSPWDGVTVADFVMPFFLFIVGVALALAY 112
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
KR+PD+ DA +K + R LKL G++LQGGF H L++GVD++ IRL GVLQRIA++Y
Sbjct: 113 KRVPDKLDASRKALLRALKLFCLGLVLQGGFFHGVRSLSFGVDLQEIRLMGVLQRIAIAY 172
Query: 153 LLVSLVEIFTKDVQDKD 169
LL +L EI+ + +D D
Sbjct: 173 LLTALCEIWIRGDEDVD 189
>gi|242007028|ref|XP_002424344.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212507744|gb|EEB11606.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 497
Score = 153 bits (387), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 165/334 (49%), Gaps = 48/334 (14%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
S +K R+ SLD FRGLA+ +MI V++ GGD+ H+PWNG +ADFV P+F++I
Sbjct: 99 NDRYSRIKNSRIKSLDAFRGLAILIMIFVNYNGGDYSVFKHSPWNGITIADFVFPWFIWI 158
Query: 84 VGVAIALALKRIPDRADAVKKVIFRTLKLLFW----GILLQGGFSHAPDELTYGVDVRMI 139
+G + L++ RA + K++ FR LK F+ GI+L G + +
Sbjct: 159 MGASTVLSIDNNFRRAQSKKEIFFRILKRSFYLIALGIVLNSGHRDSKG---------FL 209
Query: 140 RLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLYCW-HWLMAACVLVVYLA 197
R+CGVLQRI L+Y +++ +EIF K + ++ FS + W WL+ ++ +++
Sbjct: 210 RVCGVLQRIGLTYFIIASLEIFALKSLLNEHFGPWNFSRNIIKIWIQWLVPILLVAIHVI 269
Query: 198 LLYGTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
+ + +VP F N T G A GYIDR ++ NHMYH
Sbjct: 270 ITFTLHVPGCPLGYTGPGGLSNHSAFRNCTGG----------AAGYIDRLIITDNHMYHR 319
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
++ L+ PS PF+PEGLL +++S+ +GV ++I+ +
Sbjct: 320 GSF---------------LKIFKPS---VPFDPEGLLGTLTSVFCAFLGVQSARILINHE 361
Query: 317 GHLARLKQWVTMGFALLIFGLTLHF-TNGEHGSG 349
+++K W+ F ++ GL F N SG
Sbjct: 362 NSFSKIKSWI---FWAIVMGLISGFLCNWSQNSG 392
>gi|384249073|gb|EIE22555.1| hypothetical protein COCSUDRAFT_42235 [Coccomyxa subellipsoidea
C-169]
Length = 395
Score = 150 bits (379), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 103/310 (33%), Positives = 155/310 (50%), Gaps = 53/310 (17%)
Query: 46 VALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL--KRIPDRADAVK 103
+ALM+ V+HAG + P ++HA W+G +LAD VMP FL +VGV++AL+L + R ++
Sbjct: 1 MALMLFVNHAGHEVPWVAHAAWDGVHLADLVMPCFLLLVGVSVALSLGPRASGPRRPLLR 60
Query: 104 KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF-- 161
KV+ RT KL G+L+QGG D+ +R CGVLQRIAL + LVSLV ++
Sbjct: 61 KVLARTGKLAGLGLLIQGGVGAGAFP---AWDLSRLRYCGVLQRIALCFALVSLVVLYLP 117
Query: 162 ------TKDVQDK-DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINK 214
+ + D+ D+S + FR Y W++ + V + +W +
Sbjct: 118 QTPSPRLQSLLDRGDESASLMAPFRFYALWWILGTALFVAF----------NWMALFLRP 167
Query: 215 DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGP 274
C R L CN Y+D ++LG +H+Y P+ RR+ + P E
Sbjct: 168 PG----------CLARPALTADCNVAAYVDARLLGRSHLYPWPSCRRA-----NPPCE-- 210
Query: 275 LRKDAPSWCHAPFEPEGLLSSVSSIL-STIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
+PEGL +++S L ST +G+ FG V++ +GH ARL+ W L
Sbjct: 211 -----------YLDPEGLFATLSGALASTFLGLWFGAVLLTLRGHRARLRSWAYASVLLT 259
Query: 334 IFGLTLHFTN 343
GL LH T
Sbjct: 260 ELGLALHVTG 269
>gi|391346547|ref|XP_003747534.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Metaseiulus occidentalis]
Length = 564
Score = 142 bits (357), Expect = 4e-31, Method: Compositional matrix adjust.
Identities = 99/351 (28%), Positives = 166/351 (47%), Gaps = 47/351 (13%)
Query: 6 AETTHHHPLIISEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEI 62
AET+ PL S + + + R+ SLD FRG + LMI V++ GG
Sbjct: 145 AETSSLGPLEASSSTAGSRPPEDGIGKAGKPRIKSLDAFRGFCLFLMIFVNYGGGGLWLF 204
Query: 63 SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGIL 118
H PW+G AD + P+F++I+GV++A++L+ + + + ++ F R++KL G++
Sbjct: 205 EHIPWDGLTFADLLFPWFVWIMGVSMAISLRSMRRKCVPLSEIFFKILSRSVKLFLLGLI 264
Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV--GRFS 176
L + + D+ +R+ GVLQR A+SY +V+ + +F D ++ +
Sbjct: 265 L--------NSMGKNNDISKLRIPGVLQRFAVSYFVVASMHMFFSRATDAAETAKWAKIR 316
Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNP 235
LY W+M ++ +++ L + VPD + + G FN T G
Sbjct: 317 DVALYWQEWVMMISLVAIHVLLTFLLDVPDCPKGYLGPGGLHENGTHFNCTGG------- 369
Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
A GYIDR VLG NHMY HP + +Q PF+PEG+L
Sbjct: 370 ---AAGYIDRVVLGPNHMYGHPTTEKIYETSQ------------------PFDPEGVLGC 408
Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHFTNGE 345
++SI T +G+ G +++ RL +W+ G L L+ G+ F+ +
Sbjct: 409 LTSIFLTFLGLQAGKILLTFNNPGRRLSRWICWGVLLGLLAGILCGFSKED 459
>gi|312381520|gb|EFR27253.1| hypothetical protein AND_06166 [Anopheles darlingi]
Length = 782
Score = 135 bits (340), Expect = 4e-29, Method: Compositional matrix adjust.
Identities = 98/327 (29%), Positives = 155/327 (47%), Gaps = 48/327 (14%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+++ +RL SLD RG+A+ LMI V+ GG + I HA WNG ++AD V P+FLFI+GV
Sbjct: 388 ANIARKRLQSLDTLRGIAIMLMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLFIMGVC 447
Query: 88 IALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
I ++L+ R + V + R++ L G+ L G ++ +R+ G
Sbjct: 448 IPISLRGQLARNVSKRQIVSSITTRSISLFLIGLCLNS---------MNGPNMANLRIFG 498
Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQS---VGRFSIFRLYCWHWLMAACVLVVYLALLY 200
VLQR ++Y +VSLV +F Q Q I RL W++ ++V+YLA++
Sbjct: 499 VLQRFGVAYFVVSLVHLFCHREQIASQHRFVRANVDIIRL-VRQWIIVGLLVVIYLAVIL 557
Query: 201 GTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
P V+ N T G+ GYIDR +LG++H+Y HP
Sbjct: 558 LIPAPGCPRGYFGPGGKHLFNVYPNCTGGI----------TGYIDRVLLGMSHLYQHPTA 607
Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHL 319
R ++G PF+PEG + + +IL +G+ G I+ GH
Sbjct: 608 RYV--------YDG-----------QPFDPEGPFACLPTILQVFLGLQCGSTILSFTGHR 648
Query: 320 ARLKQWVTMGFAL-LIFGLTLHFTNGE 345
RL+++ AL L+ G+ F+ +
Sbjct: 649 QRLQRFAVWSVALGLVAGVLCGFSKND 675
>gi|449500329|ref|XP_004174928.1| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
N-acetyltransferase-like [Taeniopygia guttata]
Length = 789
Score = 134 bits (338), Expect = 6e-29, Method: Compositional matrix adjust.
Identities = 95/322 (29%), Positives = 156/322 (48%), Gaps = 44/322 (13%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD FRGL++ +M+ V++ GG + H WNG +AD V P+F+FI+G +IALAL
Sbjct: 399 QRLRSLDTFRGLSLVIMVFVNYGGGKYWFFKHVSWNGLTVADLVFPWFVFIMGTSIALAL 458
Query: 93 KRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+ + ++K+I+R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 459 GSMLRWGSSKWKVLRKIIWRSFVLILLGIIVVN-----PNYCLGPLSWDNLRIPGVLQRL 513
Query: 149 ALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
+YL+V+ +E +FT+ D+ Y W+ + ++L L + VPD
Sbjct: 514 GFTYLVVAALELLFTR----ADRRFPALQDILPYWPQWIFILVLETIWLCLTFLLPVPDC 569
Query: 208 QFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ D+GK N T G A GYIDR +LG HMY HP S T
Sbjct: 570 PRGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLILGEKHMYQHP----SSGVT 615
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR-LKQW 325
S P++PEG+L +++SI+ +G+ G + + K H + + ++
Sbjct: 616 YQSTM--------------PYDPEGILGTINSIVMAFLGLQAGKITLFYKDHPKQIMSRF 661
Query: 326 VTMGFALLIFGLTLHFTNGEHG 347
+ G + + L + E G
Sbjct: 662 IIWGIVMGVISAILTKCSKEEG 683
>gi|170027692|ref|XP_001841731.1| conserved hypothetical protein [Culex quinquefasciatus]
gi|167862301|gb|EDS25684.1| conserved hypothetical protein [Culex quinquefasciatus]
Length = 558
Score = 134 bits (337), Expect = 8e-29, Method: Compositional matrix adjust.
Identities = 96/279 (34%), Positives = 138/279 (49%), Gaps = 30/279 (10%)
Query: 5 KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
++ T L + P +S Q KT RL SLD FRG+A+ LMI V+ GGD+ I H
Sbjct: 263 RSRTPSEPQLSPNSPTISVQATGVPQKT-RLRSLDTFRGIAIMLMIFVNSGGGDYWWIEH 321
Query: 65 APWNGCNLADFVMPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQ 120
A WNG ++AD V P+FLFI+GV I ++L+ R R + +K V R+LKL G+ L
Sbjct: 322 ATWNGLHVADLVFPWFLFIMGVCIPISLRSQLGRNVPRYEILKNVAVRSLKLFLIGLCLN 381
Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ---SVGRFSI 177
G V +RL GVLQR ++Y +VS + ++ D+ Q + I
Sbjct: 382 S---------INGPTVADLRLFGVLQRFGVAYFVVSAIHLYCYQENDQLQHPLARSHADI 432
Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPP 236
RL+ HW++ ++ VYL +++ VP+ ++ N T G+
Sbjct: 433 LRLW-KHWVIVGTIVFVYLLVIFFVPVPNCPSGYFGPGGKHLMLLYPNCTGGI------- 484
Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL 275
GYIDR+VLGI H+Y HP R P EGP
Sbjct: 485 ---TGYIDRQVLGIRHLYQHPTARYMYDAMPFDP-EGPF 519
>gi|301608954|ref|XP_002934053.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Xenopus (Silurana) tropicalis]
Length = 633
Score = 134 bits (336), Expect = 9e-29, Method: Compositional matrix adjust.
Identities = 102/331 (30%), Positives = 162/331 (48%), Gaps = 52/331 (15%)
Query: 17 SEPDVSDQQEKSHL---KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
+ D+S Q+ S QRL SLD FRGLA+ +M+ V++ GG + H WNG +A
Sbjct: 217 NRADISSQETYSRAWNPSVQRLRSLDTFRGLALTIMVFVNYGGGGYWFFKHQSWNGLTVA 276
Query: 74 DFVMPFFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGI-LLQGGFSHAPD 128
D V P+F+FI+G +I L+L + R + + KV++R+++L G+ ++ + P
Sbjct: 277 DLVFPWFVFIMGTSIYLSLNSMLSKGSSRWNLLGKVLWRSVQLFLIGLFVINVNYCRGP- 335
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK-DQSVGRFSIFRLYC-W-H 184
+ IR+ GVLQR++L+YL VS +E IF+K D QS F + + W
Sbjct: 336 -----LSFSEIRIMGVLQRLSLTYLAVSALELIFSKPTPDALTQSRTCFLLQDVLSHWPK 390
Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYI 243
W++ + V+L L VPD + D+GK N T G A GYI
Sbjct: 391 WIVILALEAVWLCLTLLLQVPDCPLGYLGPGGIGDFGKFPNCTGG----------AAGYI 440
Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
DR +LG H+Y HP T + ++ + P++PEGLL +++ ++
Sbjct: 441 DRMILGQGHIYQHP--------TSNVIYKSTM----------PYDPEGLLGTINCVVMAF 482
Query: 304 IGVHFGHVIIHTKGH----LARLKQW-VTMG 329
G+ G +++ K L R W + MG
Sbjct: 483 FGLQAGIILVLYKNQHKYVLVRFFSWAIIMG 513
>gi|348529394|ref|XP_003452198.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Oreochromis niloticus]
Length = 600
Score = 131 bits (330), Expect = 4e-28, Method: Compositional matrix adjust.
Identities = 95/308 (30%), Positives = 151/308 (49%), Gaps = 43/308 (13%)
Query: 19 PDVSDQQEKSHL-KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
P V+D L ++RL SLD FRG+++ +M+ V++ GG + H WNG +AD V
Sbjct: 188 PPVTDNILPPPLTSSKRLRSLDTFRGISLVIMVFVNYGGGRYWFFRHESWNGLTVADLVF 247
Query: 78 PFFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
P+F+FI+G +IAL++ + R ++K ++R+L+L G+L+ P+
Sbjct: 248 PWFVFIMGTSIALSINALLRAGATRCSLLRKAVWRSLQLFIIGVLVIN-----PNYCQGA 302
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTK----DVQDKDQSVGRFSIFRLYCWHWLMAA 189
+ +R+ GVLQR+A SYL+V+ +++ DV D F LY WL
Sbjct: 303 LAWENLRIPGVLQRLAWSYLVVACLDLLVARGQLDVITVDAWWSPAIDFLLYWPAWLCVI 362
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
+ V++L L + VPD + D G N T G A G+IDR +L
Sbjct: 363 LLEVLWLFLTFLLPVPDCPTGYLGPGGIGDMGLYVNCTGG----------AAGFIDRLLL 412
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G HMY +P+ R A P++PEG+L S++SIL +G+
Sbjct: 413 GEKHMYQNPSSRVIYA------------------TRIPYDPEGVLGSINSILMAFLGLQA 454
Query: 309 GHVIIHTK 316
G +I+H +
Sbjct: 455 GKIILHYR 462
>gi|443685781|gb|ELT89271.1| hypothetical protein CAPTEDRAFT_227545 [Capitella teleta]
Length = 605
Score = 130 bits (326), Expect = 1e-27, Method: Compositional matrix adjust.
Identities = 106/345 (30%), Positives = 161/345 (46%), Gaps = 52/345 (15%)
Query: 13 PLIISEPDVSDQQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGC 70
P + +E D + + KT +RL SLD FRG+++ +MI V++ GG + H+ WNG
Sbjct: 188 PEMQTESATDDAETTAVNKTHKERLRSLDAFRGMSLTIMIFVNYGGGGYWFFDHSYWNGL 247
Query: 71 NLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHA 126
LAD V P+F +I+G A+AL+ ++R + K+I RT L GI+L G
Sbjct: 248 TLADLVFPWFTWIIGTALALSIQGQMRRGKTKFSIAAKIIRRTCVLFALGIVLGSGGGSE 307
Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHW 185
P VDV+ +R+ GVLQR+A+SYL+V+L+ IF K +KD R + R HW
Sbjct: 308 P------VDVQTLRIPGVLQRLAISYLVVALLHLIFAK--ANKDHQPSRLDMVRDITDHW 359
Query: 186 LMAACVLVV---YLALLYGTYVPDWQFTIINKDSA-----DYGKVFNVTCGVRAKLNPPC 237
VLV+ +L L + + D + T + GK N T G
Sbjct: 360 PQWGIVLVMVACHLGLTFLLPISDVEGTCPTGYLGPGGLHEGGKYENCTGG--------- 410
Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
A IDR H+Y P + + P +PEG+L +++
Sbjct: 411 -AAAVIDRWFFSRQHVYQTPTCKEVYKTVE------------------PHDPEGILGTLT 451
Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHF 341
SI +G+ G ++ K R+++W+ G L LI GL F
Sbjct: 452 SIFLCFLGLQAGVILTTFKQKSPRMRRWIVWGIILGLIAGLLCGF 496
>gi|167538367|ref|XP_001750848.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163770669|gb|EDQ84352.1| predicted protein [Monosiga brevicollis MX1]
Length = 779
Score = 129 bits (325), Expect = 2e-27, Method: Compositional matrix adjust.
Identities = 106/394 (26%), Positives = 169/394 (42%), Gaps = 82/394 (20%)
Query: 6 AETT----HHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
ETT H ++ S+ + Q+ +QRL SLD FRG A+ +MI V+ GG +
Sbjct: 309 GETTGLLLQHATVMPSDAGMHAIQDMKR-SSQRLRSLDSFRGFALTIMIFVNFNGGFYWF 367
Query: 62 ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV---IFRTLKLLF-WGI 117
+H+ WNG +AD V P+F++I+G ++A+A + R + IFR + +LF +GI
Sbjct: 368 FNHSAWNGLTVADLVFPWFIWIMGTSMAIAFNSLLKRQTPTTTILYKIFRRMLILFAFGI 427
Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR--- 174
+ G F D+R R+ GVLQR A+SYL+V+LV ++ ++ SV
Sbjct: 428 FIIGNFH----------DLRNGRIPGVLQRFAVSYLVVALVMLYAPKMESWCASVSTSDS 477
Query: 175 ------------------------------FSIFRL-------YCWHWLMAACVLVVYLA 197
F L Y W W+ +++++
Sbjct: 478 PTPALVRGIAKPGSGHQLDVAADIAEMKPWVRTFLLHTRDLTPYIWEWVAMFVIIIIHTC 537
Query: 198 LLYGTYVPDWQFTIINKDS--ADYGKVFNVTCGVRAKLNPPCN--AVGYIDRKVLGINHM 253
+ + VP I A+YG+ V + C A GYIDR+V G H+
Sbjct: 538 ITFLLPVPGCPTGYIGPGGALAEYGQFAPPEGEVCGESTFCCEGGASGYIDRQVFGWRHI 597
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
Y P P + P++PEGLL S++SI+ +G+ G +I+
Sbjct: 598 YDQP-------------------TSQPIYETGPYDPEGLLGSLTSIVMCFLGLQSGKIIV 638
Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHG 347
H K H R + W+ L + L + +G
Sbjct: 639 HYKSHAQRSRHWLMWALVLGVIATGLCGASQNNG 672
>gi|270004236|gb|EFA00684.1| hypothetical protein TcasGA2_TC003561 [Tribolium castaneum]
Length = 569
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 151/316 (47%), Gaps = 52/316 (16%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
S+ +EK +RL SLD FRG+++ +MI V++ G +P + HA WNG +LAD V P+F+
Sbjct: 167 SETKEKKPEGKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFPWFM 226
Query: 82 FIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
+I+G + ++L K+ D V+ R++KL G+ L G +
Sbjct: 227 WIMGACMPISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNA-----------GPYLE 275
Query: 138 MIRLCGVLQRIALSYLLVSLVEIF--TKDVQDKDQSVGRFSIFRLYCWH-WLMAACVLVV 194
+R+ GVLQR + YL+V+ + +F ++ + +G+F L W W++ + V
Sbjct: 276 CMRIFGVLQRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLIIFFV 335
Query: 195 ---YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
+L LL P + + + GK FN T G A GYID +LG N
Sbjct: 336 HCMFLFLLADEGCP--RGYLGPGGLHENGKHFNCTGG----------ATGYIDAVILG-N 382
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H Y P + TQ F+PEG+L ++SI+ IGV G
Sbjct: 383 HRYQKPTSKEIYLGTQ------------------AFDPEGILGCLTSIVHVFIGVQAGIT 424
Query: 312 IIHTKGHLARLKQWVT 327
++ K H ARL +W++
Sbjct: 425 LLVYKEHSARLIRWLS 440
>gi|91079154|ref|XP_966977.1| PREDICTED: similar to CG6903 CG6903-PA [Tribolium castaneum]
Length = 533
Score = 129 bits (323), Expect = 3e-27, Method: Compositional matrix adjust.
Identities = 95/316 (30%), Positives = 151/316 (47%), Gaps = 52/316 (16%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
S+ +EK +RL SLD FRG+++ +MI V++ G +P + HA WNG +LAD V P+F+
Sbjct: 167 SETKEKKPEGKKRLKSLDTFRGISIVIMIFVNYGSGGYPVLDHATWNGLHLADLVFPWFM 226
Query: 82 FIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
+I+G + ++L K+ D V+ R++KL G+ L G +
Sbjct: 227 WIMGACMPISLTSSFKKQISNKDIFLNVLKRSIKLFCLGVFLNA-----------GPYLE 275
Query: 138 MIRLCGVLQRIALSYLLVSLVEIF--TKDVQDKDQSVGRFSIFRLYCWH-WLMAACVLVV 194
+R+ GVLQR + YL+V+ + +F ++ + +G+F L W W++ + V
Sbjct: 276 CMRIFGVLQRFGICYLVVTTICLFLMKREFSESKHKIGKFFTDILVLWKGWIVVLIIFFV 335
Query: 195 ---YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
+L LL P + + + GK FN T G A GYID +LG N
Sbjct: 336 HCMFLFLLADEGCP--RGYLGPGGLHENGKHFNCTGG----------ATGYIDAVILG-N 382
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H Y P + TQ F+PEG+L ++SI+ IGV G
Sbjct: 383 HRYQKPTSKEIYLGTQ------------------AFDPEGILGCLTSIVHVFIGVQAGIT 424
Query: 312 IIHTKGHLARLKQWVT 327
++ K H ARL +W++
Sbjct: 425 LLVYKEHSARLIRWLS 440
>gi|395507548|ref|XP_003758085.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Sarcophilus harrisii]
Length = 634
Score = 127 bits (320), Expect = 7e-27, Method: Compositional matrix adjust.
Identities = 96/337 (28%), Positives = 161/337 (47%), Gaps = 44/337 (13%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
D+ + + L RL SLD FRG+A+ +M+ V++ GG + H WNG LAD V P+
Sbjct: 224 DLQVEAWRLTLPVYRLRSLDTFRGIALIIMVFVNYGGGKYWFFKHESWNGLTLADLVFPW 283
Query: 80 FLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
F+FI+G +IAL+L + R + K+++R+L L GI + P+ +
Sbjct: 284 FVFIMGSSIALSLSSMLRRGCSKWKLLGKILWRSLLLCVIGIFIVN-----PNYCLGPLS 338
Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHWLMAACV 191
+R+ GVLQR+ L+YL+V+++E +F K V + ++S F Y W+ +
Sbjct: 339 WDKLRIPGVLQRLGLTYLVVAVLELLFAKAVPENSAMERSCSSFQDIISYWPQWIFILML 398
Query: 192 LVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
++ + + VP + D+GK N T G A GYIDR +LG
Sbjct: 399 EAAWVCVTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGE 448
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
+H+Y HP+ + P++PEGLL +++SI+ +GV G
Sbjct: 449 DHIYQHPS------------------PNVLYHTKVPYDPEGLLGTINSIVMAFLGVQAGK 490
Query: 311 VIIHTKGHLARLKQWVTMGFALL--IFGLTLHFTNGE 345
+++ K ++ + A+L I G+ F+ E
Sbjct: 491 ILLFYKDQPKQIMLRFLLWSAMLGIISGVLTKFSQNE 527
>gi|157112232|ref|XP_001657450.1| hypothetical protein AaeL_AAEL000933 [Aedes aegypti]
gi|108883723|gb|EAT47948.1| AAEL000933-PA [Aedes aegypti]
Length = 569
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/329 (28%), Positives = 150/329 (45%), Gaps = 47/329 (14%)
Query: 8 TTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW 67
+ PL + P S + + RL SLD FRG+A+ LMI V+ GG + I HA W
Sbjct: 157 SAREAPLAAASPSSSGHPVEP--RKTRLQSLDTFRGIAIMLMIFVNSGGGHYWWIEHATW 214
Query: 68 NGCNLADFVMPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
NG ++AD V P+FLFI+GV I ++L+ R R + V R+ KL G+ L
Sbjct: 215 NGLHVADLVFPWFLFIMGVCIPISLRSQVSRNIPRKTILANVAVRSFKLFCIGLCLNS-- 272
Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS-VGRFSIFRLYC 182
G V +RL GVLQR ++Y +VS + ++ + Q + R ++ L
Sbjct: 273 -------INGPQVANLRLFGVLQRFGVAYFVVSAIHLYCYSESIEFQGRLARLNVDILRL 325
Query: 183 W-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAV 240
W HW++ ++ +YL +++ P ++ N T G+
Sbjct: 326 WKHWIIMGAIVFIYLLIMFLVAAPGCPSGYFGPGGKHLMAMYPNCTGGI----------T 375
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
GY+DR +LG NH+Y HP R DA + F+PEG + +IL
Sbjct: 376 GYLDRIILGNNHLYQHPTARYV--------------YDAQA-----FDPEGPFGCLPTIL 416
Query: 301 STIIGVHFGHVIIHTKGHLARLKQWVTMG 329
+G+ G +I+ +AR+++ G
Sbjct: 417 QVFLGLQCGVLILTHTEVMARIRRMAAWG 445
>gi|390367684|ref|XP_789038.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Strongylocentrotus purpuratus]
Length = 624
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 90/323 (27%), Positives = 151/323 (46%), Gaps = 50/323 (15%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
++E D + Q S K +RL SLD FRG+++ +MI V++ GG + +H+ WNG +AD
Sbjct: 215 VAEADSNSIQRPSRDKPKRLKSLDAFRGMSLVIMIFVNYGGGQYSFFNHSIWNGLTVADL 274
Query: 76 VMPFFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
V P+F++I+GV+I ++ + R K+I R + L GI+L G
Sbjct: 275 VFPWFIWIMGVSITMSFYALVRHGVSRRVIFTKIIRRFVILFGLGIILDG---------- 324
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL------YCWHW 185
G+D R+ GVLQRIA SYL+V+ V +F +D++ + R ++R Y + W
Sbjct: 325 -GIDFSTFRVPGVLQRIAFSYLVVATVHLFAVKHKDEEYRI-RHVVYRELRDLLDYWYEW 382
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
++ L +++ L + VP + G+ + +N A YID+
Sbjct: 383 IIMISFLALHICLTFFLPVPGCPTGYLGPGGPLVGE-------NESLVNCTGGAANYIDK 435
Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+L NH Y R+ T P +PEG+L +++SI T +G
Sbjct: 436 VILTYNHTYPRGTPRKIYQTT------------------VPHDPEGILGTLTSIFMTFLG 477
Query: 306 VHFG---HVIIHTKGHLARLKQW 325
+ G H+ + + + R W
Sbjct: 478 LQAGKIFHLFSYPRDRILRFLGW 500
>gi|198434539|ref|XP_002120178.1| PREDICTED: similar to heparan-alpha-glucosaminide
N-acetyltransferase [Ciona intestinalis]
Length = 624
Score = 127 bits (318), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 103/381 (27%), Positives = 169/381 (44%), Gaps = 60/381 (15%)
Query: 7 ETTHHHPLIISEPDVSDQQEKSHL---KTQRLASLDIFRGLAVALMILVDHAGGDWPEIS 63
ET H L +EP+ + + L K++R+ S+D FRGL + +M+ V+ GGD+
Sbjct: 189 ETQIHEDLGNTEPNSVQEANPTPLVREKSERIKSIDTFRGLCLVVMVFVNFRGGDYWFFH 248
Query: 64 HAPWNGCNLADFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLLFWGIL 118
H+PW+G +AD V P+F+FI+GV I L++ K +P+ A K+I RT+ L G+
Sbjct: 249 HSPWHGLTVADLVFPWFMFIMGVNITLSINSLITKNVPNSKIAY-KLIRRTVLLFGLGMF 307
Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
+ + + R+ GVLQR A++Y L +++ + ++ + +
Sbjct: 308 V----------VNHSTSWAAFRVPGVLQRFAIAYFLPFVLQWAFHLTPIEIETRAKTNEG 357
Query: 179 RLYCWH-----------WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADY-GKVFNVT 226
L WH WL+ + ++L L + +P + D GK N T
Sbjct: 358 ELKWWHWCKDVVPYWLQWLIVLAMEALWLFLTFLLPIPGCPTGYLGPGGLDNDGKYINET 417
Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
C A GYIDR + G H+Y HP + T S P
Sbjct: 418 C--------VGGAAGYIDRVIFGEAHIYGHPTCKNVYYPTYTSD------------QRVP 457
Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEH 346
++PEGLL S++S + I+G G + ++ K L R +W+ F L + + L +
Sbjct: 458 YDPEGLLGSINSCIIVILGCQAGKIFLYYKHPLDRAMRWILWCFFLGVISIILCKASANG 517
Query: 347 G---------SGKFSTTCVCL 358
G + F TT C+
Sbjct: 518 GWIPVNKNLWTTTFVTTLACM 538
>gi|307178470|gb|EFN67159.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
floridanus]
Length = 512
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 95/336 (28%), Positives = 153/336 (45%), Gaps = 43/336 (12%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
+E + + + K +R+ ++D FRG+ MI V+ G + + HA WNG L D V
Sbjct: 105 TEEERTPNNNEKATKHRRVKAIDTFRGVCTLFMIFVNDGSGSYTTLGHATWNGMLLGDLV 164
Query: 77 MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
P F++I+GV + +AL R + ++ F K F L+ A + L +
Sbjct: 165 FPCFMWIMGVCVPIALSAQLKRGLSKLEISFSIFKRSFLLFLI----GIALNTLGTNAQL 220
Query: 137 RMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRF-------SIFRLYCWHWLMA 188
IR+ GVLQR ++YL+VSL+ + FT Q++ + I L HW +
Sbjct: 221 ENIRIFGVLQRFGITYLIVSLLYLCFTPQQPKVAQNLSQTWMTHKMQDILSLLP-HWCIM 279
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
+++V+ A+ + +P + + GK FN T G A GYIDR +
Sbjct: 280 LTLVMVHCAVTFCLPIPGCPTGYLGPGGRHEDGKYFNCTGG----------ATGYIDRIL 329
Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
L ++H+Y P T DS + PF+PEG+L ++SI +GVH
Sbjct: 330 LTLSHIYQWP--------TIDSIYGS-----------GPFDPEGILGCLTSIFQVFLGVH 370
Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
G +++ KG R+ +W+ G HFTN
Sbjct: 371 TGVILMMYKGWKERIIRWLVWAVFYGCLGCIFHFTN 406
>gi|158294726|ref|XP_315774.3| AGAP005761-PA [Anopheles gambiae str. PEST]
gi|157015699|gb|EAA10745.3| AGAP005761-PA [Anopheles gambiae str. PEST]
Length = 581
Score = 126 bits (317), Expect = 1e-26, Method: Compositional matrix adjust.
Identities = 98/320 (30%), Positives = 146/320 (45%), Gaps = 54/320 (16%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD FRG+A+ LMI V+ GG + I HA WNG ++AD V P+FLFI+GV + ++L
Sbjct: 202 KRLQSLDTFRGIAIMLMIFVNSGGGHYWWIEHATWNGLHVADLVFPWFLFIMGVCVPISL 261
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
+ +R R++KL G+ L G + +R+ GVLQR ++Y
Sbjct: 262 RGQLNRN--------RSVKLFIIGLCLNS---------MNGPSMANLRIFGVLQRFGIAY 304
Query: 153 LLVSLVEIFTKDVQDKDQSVGRF-----SIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
L+VS V + + Q + QS R I RL WL+ + V+YL +++ P
Sbjct: 305 LVVSTVHLLCHEQQVQVQSQNRLLRASEDIVRLKK-QWLVIGLLTVLYLVVMFFVPAPGC 363
Query: 208 QFTIINKDSADYGKVF-NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
F N T G+ GYIDR +LGI H+Y HP R
Sbjct: 364 PSAYFGPGGKHLYNAFPNCTGGI----------TGYIDRALLGIAHLYQHPTARYV---- 409
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
++G PF+PEG + +IL +G+ G I+ H R+ ++
Sbjct: 410 ----YDG-----------MPFDPEGPFGCLPTILQVFLGLQCGCTILAYTEHRQRMVRFA 454
Query: 327 TMGFAL-LIFGLTLHFTNGE 345
+ L L G FT +
Sbjct: 455 SWSLVLGLAAGALCGFTKND 474
>gi|443731781|gb|ELU16770.1| hypothetical protein CAPTEDRAFT_135912, partial [Capitella teleta]
Length = 388
Score = 126 bits (317), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 88/297 (29%), Positives = 140/297 (47%), Gaps = 38/297 (12%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL--- 90
RL SLD FRG+++ +MI V++ GG + H+ WNG LAD V P+F+FI+G ++AL
Sbjct: 1 RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60
Query: 91 -ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
AL+R R + KV+ R + L G+++ A VD+R +R+ GVLQR+A
Sbjct: 61 GALRRGIPRFKLILKVLKRAMILFALGVMISNSKGKA-------VDLRTLRVPGVLQRLA 113
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
L+YL++ ++E D Q W + V+ L + VP
Sbjct: 114 LTYLVLGIMEAALAKSHDPHQWWSSVRDVVGNLGQWAAVLMFVAVHCCLTFLLPVPGCPK 173
Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
+ +G + G A YIDR + G HMY HP T
Sbjct: 174 GYLGPGGLQHGGAYENCTG---------GATAYIDRMIFGTEHMYGHP--------TCMI 216
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
P++ + P +PEG+L +++SI +G+ G VI+ +G +R+ +W+
Sbjct: 217 PYQTTV----------PLDPEGVLGTLTSIFLCFLGLQAGKVILIFQGWKSRVSRWM 263
>gi|417411833|gb|JAA52338.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
[Desmodus rotundus]
Length = 595
Score = 126 bits (316), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 96/318 (30%), Positives = 155/318 (48%), Gaps = 47/318 (14%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL +D FRGLA+ LM+ V++ GG + HA WNG +ADFV P+F+FI+G +I L++
Sbjct: 199 RLRCVDTFRGLALILMVFVNYGGGQYWYFKHASWNGLTVADFVFPWFVFIMGSSIFLSMS 258
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
+ R + + KV +R+ L+ G+++ P+ + +RL GVLQR+
Sbjct: 259 SVLQRGCSKFRLLGKVAWRSFLLICIGVIVVN-----PNYCLGPLSWDKVRLPGVLQRLG 313
Query: 150 LSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E+ F K V ++ GR S + + W WL + ++LAL + VP
Sbjct: 314 VTYFVVAVLELLFAKPVPERGAWEGRCSSLQDIMSSWPQWLFILMLESIWLALTFFLPVP 373
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D GK N T G A GYIDR +LG +H+Y HP+
Sbjct: 374 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGEDHIYQHPS------ 417
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 418 ------------STVLYHTRVAYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILI 465
Query: 321 RLKQWVTMGFALLIFGLT 338
R W L+ GLT
Sbjct: 466 RFTAWCCF-LGLISVGLT 482
>gi|443694948|gb|ELT95966.1| hypothetical protein CAPTEDRAFT_92095 [Capitella teleta]
Length = 431
Score = 125 bits (315), Expect = 2e-26, Method: Compositional matrix adjust.
Identities = 98/342 (28%), Positives = 157/342 (45%), Gaps = 60/342 (17%)
Query: 1 MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
+S K ++T + E D +K +RL SLD FRGL + LMI V++ GG +
Sbjct: 7 ISSAKTDSTRRNS---EEKDEGKLITPKEVKKERLRSLDAFRGLNILLMIFVNYGGGGYW 63
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWG 116
SHA WNG + D + P+F+FI+G ++ L + K+ D + + +I+R++KL G
Sbjct: 64 YFSHAVWNGLYITDLIFPWFIFIMGTSLGLGISSLVKKEVDPVEGLWGIIWRSVKLFAVG 123
Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
I+ S+ D+ IR+ GVLQR+A+ Y + ++V + +Q +S G S
Sbjct: 124 IMYNTKSSN---------DLENIRMTGVLQRLAMVYFITAIVHYAGESLQCCMRSRGTVS 174
Query: 177 IFR-------LYCWHWLMAACVLVVYLALLYGTYVPDWQFTII-----NKDSADYGKVFN 224
+R Y W+ ++ +Y Y VP + + ++D A G
Sbjct: 175 RWRHILSDLAPYFGEWITMLVIIGIYCYFTYWFAVPGCEAGYVGPGGLHRDGAHAG---- 230
Query: 225 VTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH 284
C A L YID KV + H+Y P R DS
Sbjct: 231 --CTGGAAL--------YIDLKVYTMRHIYQWPDIR--TIYQTDS--------------- 263
Query: 285 APFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
F+PEGLL +++SI +G+ G +++ KGH RL +W+
Sbjct: 264 -AFDPEGLLGTLTSIFLCFLGLQAGKILVCHKGHRERLVRWL 304
>gi|345781561|ref|XP_539948.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Canis
lupus familiaris]
Length = 638
Score = 125 bits (314), Expect = 3e-26, Method: Compositional matrix adjust.
Identities = 96/317 (30%), Positives = 154/317 (48%), Gaps = 48/317 (15%)
Query: 24 QQEKSHLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
QQE H + RL S+D FRGLA+ LM+ V++ GG + H+ WNG +AD V P+F+
Sbjct: 230 QQEAWHPPSALPRLRSIDTFRGLALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFV 289
Query: 82 FIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
FI+G +I L++ + R + + K+ +R+ L G+++ P+ +
Sbjct: 290 FIMGSSIFLSMTSMLQRGCSKFRLLGKIAWRSFLLFCIGVVIVN-----PNYCLGPLSWD 344
Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLV 193
+R+ GVLQR+ ++Y +V+++E IF K V + S R R + W WL +
Sbjct: 345 KVRIPGVLQRLGVTYFVVAVLELIFAKPVPESCASERRCFSLRDIILSWPQWLFILLLES 404
Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
++L L + VP + D GK N T G A GYIDR +LG +H
Sbjct: 405 IWLGLTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDH 454
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y HP S A + P++PEG+L ++SSI+ +G+ G ++
Sbjct: 455 IYQHP----SSAVLYHT--------------KVPYDPEGILGTISSIVMAFLGIQAGKIL 496
Query: 313 IH----TKGHLARLKQW 325
++ TK L R W
Sbjct: 497 LYYKDQTKDILIRFTAW 513
>gi|242022263|ref|XP_002431560.1| conserved hypothetical protein [Pediculus humanus corporis]
gi|212516863|gb|EEB18822.1| conserved hypothetical protein [Pediculus humanus corporis]
Length = 607
Score = 125 bits (314), Expect = 4e-26, Method: Compositional matrix adjust.
Identities = 101/367 (27%), Positives = 161/367 (43%), Gaps = 54/367 (14%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
D Q S R+ S+D FRGLAV LMI V+ G + + HA WNG +ADFV P+
Sbjct: 185 DDRTTQASSKPARHRIKSIDTFRGLAVVLMIFVNDGAGHYWFLEHATWNGILVADFVFPW 244
Query: 80 FLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
FL+++G+ I ++ LKR R + VI R + L G+LL + + G D
Sbjct: 245 FLWVMGLCIPISIRTQLKRNVSRWKILGHVIKRGILLFGLGVLL--------NTVGIGSD 296
Query: 136 VRMIRLCGVLQRIALSYLLVSLVEI-FT-KDVQDKDQSVGR-----FSIFRLYCWHWLMA 188
+ IR+ GVLQR ++ YL+++++ + FT + + ++++ G F + W++
Sbjct: 297 LETIRIPGVLQRFSIVYLIIAILGVCFTPRSISNENRFPGSSFRETFQDIIIIFPQWIVI 356
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
++ Y ++ + VP + G FN G GY+D+ +L
Sbjct: 357 LSIVAAYCYFVFFSPVPGCPSGYLGPGGIQDGGRFNECTG---------GMTGYVDKVLL 407
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G+ H+Y +P + + PF+PEGLL + SI GV
Sbjct: 408 GVEHIYKNPT-------------------SSKVYKSGPFDPEGLLGVMPSIFQAFFGVQA 448
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF-------TNGEHGSGKFSTTCVCLFIY 361
G +++ A+L +W T G I L L N S F+TT I
Sbjct: 449 GATLLYHPEWKAKLIRWFTWGILNGILALLLSLPGIVPINKNLWSLSYVFTTTSSAFLIL 508
Query: 362 SKVILFQ 368
+ FQ
Sbjct: 509 CVIYFFQ 515
>gi|332028000|gb|EGI68051.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
echinatior]
Length = 557
Score = 125 bits (313), Expect = 5e-26, Method: Compositional matrix adjust.
Identities = 98/350 (28%), Positives = 151/350 (43%), Gaps = 45/350 (12%)
Query: 4 IKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS 63
I A + H + V D+ L +R+ ++D FRG + MI V+ G + +
Sbjct: 137 ISAGRSLWHMITKCVTGVKDKSNNKKLAKRRVKAIDTFRGASTLFMIFVNDGSGSYTVLE 196
Query: 64 HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
H W+G L D V P F++I+GV I +AL R + ++ + LK F L+
Sbjct: 197 HTIWDGMLLGDIVFPCFMWIMGVCIPIALSAQLKRGVSKLQISYSILKRSFLLFLIGVSL 256
Query: 124 SHAPDELTYGVD--VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD-QSVGRFSIFRL 180
+ T G D V IR+ GVLQR ++YL+VSLV + Q K ++ I R
Sbjct: 257 N------TLGTDSQVENIRIFGVLQRFGVTYLVVSLVYLCFPSQQSKILRNTSPTWIMRK 310
Query: 181 Y------CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKL 233
HW + ++V+ AL + VP + + GK FN T G
Sbjct: 311 MQDILSLLPHWFVMLIFVIVHCALTFCLPVPGCPTGYLGPGGMHEDGKYFNCTGG----- 365
Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
A GYID+ VL +NH+Y +P + PF+PEG+L
Sbjct: 366 -----ATGYIDKTVLTLNHIYQYPTIKSVYG-------------------SGPFDPEGIL 401
Query: 294 SSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
+++I +GVH G +++ K R+ +W+ G HFTN
Sbjct: 402 GCLTAIFQVFLGVHAGTILMLYKDWKDRVMRWLLWAVFYACLGCAFHFTN 451
>gi|443685179|gb|ELT88879.1| hypothetical protein CAPTEDRAFT_26311, partial [Capitella teleta]
Length = 361
Score = 124 bits (311), Expect = 7e-26, Method: Compositional matrix adjust.
Identities = 91/302 (30%), Positives = 145/302 (48%), Gaps = 49/302 (16%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL--- 90
RL SLD FRG+++ +MI V++ GG + H+ WNG LAD V P+F+FI+G ++AL
Sbjct: 1 RLKSLDTFRGISLVIMIFVNYRGGGYWFFRHSAWNGLTLADLVFPWFVFIMGTSMALSFR 60
Query: 91 -ALKRIPDRADAVKKVIFRTLKLLFWGILL---QGGFSHAPDELTYGVDVRMIRLCGVLQ 146
AL+R R + KV+ R + L G+++ +G F D+R +R+ GVLQ
Sbjct: 61 GALRRGIPRFKLILKVLKRAMILFALGVMISNSKGAF-----------DLRTLRVPGVLQ 109
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQ--SVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
R+AL+YL++ ++E D Q S+ R + L W + V+ L + V
Sbjct: 110 RLALTYLVLGIMEAALAKSHDPHQWWSLVRDVVGNLG--QWAAVLMFVAVHCCLTFLLPV 167
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
P + +G + G A YIDR + G HMY HP
Sbjct: 168 PGCPKGYLGPGGLQHGGAYENCTG---------GATAYIDRMIFGTEHMYGHP------- 211
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
T P++ P +PEG+L +++SI +G+ G VI+ +G +R+ +
Sbjct: 212 -TCMIPYQ----------TTVPLDPEGVLGTLTSIFLCFLGLQAGKVILIFQGWKSRVSR 260
Query: 325 WV 326
W+
Sbjct: 261 WM 262
>gi|322790964|gb|EFZ15612.1| hypothetical protein SINV_04659 [Solenopsis invicta]
Length = 581
Score = 124 bits (311), Expect = 8e-26, Method: Compositional matrix adjust.
Identities = 103/336 (30%), Positives = 169/336 (50%), Gaps = 47/336 (13%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
+ E + S ++ + R+ S+D FRG+A+ LMI VD+ GG + +H+ WNG +AD
Sbjct: 154 LQEAETSTPIVRTSRSSTRIRSIDTFRGIALLLMIFVDNGGGKYVFFNHSAWNGLTVADL 213
Query: 76 VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
V+P+F +I+G++I ++ + +++ K+IFR L+ +LL + E
Sbjct: 214 VLPWFAWIMGLSITISKRSELRVSNSRMKIIFRCLQRALVLVLLGLMLNSMSME-----S 268
Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAAC 190
++ +R GVLQ +A+SY + + +E IF K QD GRFSI R L W WL+
Sbjct: 269 LKHLRFPGVLQLLAVSYFVCATIETIFMKAHSQDDVLQFGRFSILRDILNNWAQWLIILA 328
Query: 191 VLVVYLALLYGTYVPDWQFTIINK--DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
++V ++ + + VP+ + + + YGK N T G A GYIDR V
Sbjct: 329 IMVTHILITFLLPVPNCPTGYLGPGGNYSRYGKFPNCTGG----------AAGYIDRLVF 378
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G +H+Y TQ+ P G + P +PEG+++++S IL +GVH
Sbjct: 379 G-SHVYSK---------TQN-PVYGTI---------LPHDPEGIMNTMSIILVVYMGVHA 418
Query: 309 GHVII---HTKGHLARLKQWVTMGFALLIFGLTLHF 341
G +++ G + R W ++ LI GL HF
Sbjct: 419 GKILLLYYQCNGRVIRWLLWSSV--TGLIAGLLCHF 452
>gi|417411831|gb|JAA52337.1| Putative heparan-alpha-glucosaminide n-acetyltransferase, partial
[Desmodus rotundus]
Length = 595
Score = 124 bits (310), Expect = 9e-26, Method: Compositional matrix adjust.
Identities = 95/318 (29%), Positives = 154/318 (48%), Gaps = 47/318 (14%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL +D FRGLA+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 199 RLRCVDTFRGLALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMS 258
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
+ R + + KV +R+ L+ G+++ P+ + +RL GVLQR+
Sbjct: 259 SVLQRGCSKFRLLGKVAWRSFLLICIGVIVVN-----PNYCLGPLSWDKVRLPGVLQRLG 313
Query: 150 LSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E+ F K V ++ GR S + + W WL + ++LAL + VP
Sbjct: 314 VTYFVVAVLELLFAKPVPERGAWEGRCSSLQDIMSSWPQWLFILMLESIWLALTFFLPVP 373
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D GK N T G A GYIDR +LG +H+Y HP+
Sbjct: 374 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGEDHIYQHPS------ 417
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 418 ------------STVLYHTRVAYDPEGILGTINSIVMAFLGVQAGKILLYYKEQTKDILI 465
Query: 321 RLKQWVTMGFALLIFGLT 338
R W L+ GLT
Sbjct: 466 RFTAWCCF-LGLISVGLT 482
>gi|193664422|ref|XP_001945789.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
isoform 1 [Acyrthosiphon pisum]
Length = 568
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/340 (28%), Positives = 159/340 (46%), Gaps = 48/340 (14%)
Query: 17 SEPDVSDQQEKSHLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
+EP + Q + +K R+ SLD FRG+++ LM+ V+ GG + HAPWNG LAD
Sbjct: 162 NEPVIIHPQIPTPVKNNSYRITSLDTFRGISIILMVFVNLGGGHYWFFEHAPWNGITLAD 221
Query: 75 FVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
F++P+F +++GV+IA++ L+ R +VI R++ LL G++L
Sbjct: 222 FILPWFCWVMGVSIAISLRSQLRSSTKRKYVFGRVIRRSIALLIMGLVLN---------S 272
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR---LYCWHWLM 187
++R R GVLQR+AL Y + + +E Q + R + R W +
Sbjct: 273 VNNNNLRTFRPLGVLQRLALIYFIAATLETIFMKPQPYFTNT-RLDVIRDIIESARQWFI 331
Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
++ ++ + + VP K G ++N + + N A GYIDR V
Sbjct: 332 VIILVAIHTVITFFLPVPG-----CPKGYLGPGGLYNSS----SNTNCTGGAAGYIDRLV 382
Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
G NHMY SP P + PF+PEG+LS++++ L +GVH
Sbjct: 383 FGENHMY------------PGSP--------KPVYQSIPFDPEGILSTLTNTLLVYMGVH 422
Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHG 347
G +I+ + R+K+W+ L + G L + E G
Sbjct: 423 AGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFSKEDG 462
>gi|328696746|ref|XP_003240114.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
isoform 2 [Acyrthosiphon pisum]
Length = 591
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 98/340 (28%), Positives = 159/340 (46%), Gaps = 48/340 (14%)
Query: 17 SEPDVSDQQEKSHLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
+EP + Q + +K R+ SLD FRG+++ LM+ V+ GG + HAPWNG LAD
Sbjct: 185 NEPVIIHPQIPTPVKNNSYRITSLDTFRGISIILMVFVNLGGGHYWFFEHAPWNGITLAD 244
Query: 75 FVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
F++P+F +++GV+IA++ L+ R +VI R++ LL G++L
Sbjct: 245 FILPWFCWVMGVSIAISLRSQLRSSTKRKYVFGRVIRRSIALLIMGLVLN---------S 295
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR---LYCWHWLM 187
++R R GVLQR+AL Y + + +E Q + R + R W +
Sbjct: 296 VNNNNLRTFRPLGVLQRLALIYFIAATLETIFMKPQPYFTNT-RLDVIRDIIESARQWFI 354
Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
++ ++ + + VP K G ++N + + N A GYIDR V
Sbjct: 355 VIILVAIHTVITFFLPVPG-----CPKGYLGPGGLYNSS----SNTNCTGGAAGYIDRLV 405
Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
G NHMY SP P + PF+PEG+LS++++ L +GVH
Sbjct: 406 FGENHMY------------PGSP--------KPVYQSIPFDPEGILSTLTNTLLVYMGVH 445
Query: 308 FGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHG 347
G +I+ + R+K+W+ L + G L + E G
Sbjct: 446 AGRIILCYQYTNERIKRWIAWTIVLGLIGGCLCNFSKEDG 485
>gi|405978397|gb|EKC42791.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
[Crassostrea gigas]
Length = 549
Score = 122 bits (307), Expect = 2e-25, Method: Compositional matrix adjust.
Identities = 86/266 (32%), Positives = 132/266 (49%), Gaps = 32/266 (12%)
Query: 2 SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
S+I ++ H + + S QQ +H K +RL SLD FRGL++ +M+ V++ GG +
Sbjct: 185 SDISEDSGTAH-----DRNNSPQQYSTHNKRERLKSLDTFRGLSLMIMVFVNYGGGGYWF 239
Query: 62 ISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGI 117
H PWNG +AD V P+F+FI+G A+ + +KR R + KV+ R + L F GI
Sbjct: 240 FDHPPWNGITVADLVFPWFIFIMGTAMNYSFRGMMKRGTPRYRMLYKVLRRAILLFFIGI 299
Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR-FS 176
+L + V+++ IR+ GVLQR +L+YL++ L E+ ++ GR +S
Sbjct: 300 VLNTNWGP--------VNLKTIRIPGVLQRFSLTYLVLGLFEVCFSRYDTPEKYQGRCWS 351
Query: 177 IFR---LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAK 232
R L+ W +A +L Y+ L + + I D K +N T G
Sbjct: 352 SLRDILLFLPQWFLALGILAAYVCLTFLLPIGPCPTGYIGPGGLHDSSKYYNCTAG---- 407
Query: 233 LNPPCNAVGYIDRKVLGINHMYHHPA 258
A YID VLG NH+Y P
Sbjct: 408 ------AAAYIDIMVLGKNHIYGKPT 427
>gi|195399031|ref|XP_002058124.1| GJ15666 [Drosophila virilis]
gi|194150548|gb|EDW66232.1| GJ15666 [Drosophila virilis]
Length = 572
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 98/314 (31%), Positives = 151/314 (48%), Gaps = 49/314 (15%)
Query: 21 VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
+ D EK+ + +RL SLD FRGL++ LMI V+ GG + I HA WNG +LAD V P F
Sbjct: 171 IGDAAEKA-TQRKRLRSLDTFRGLSIVLMIFVNSGGGGYSWIEHAAWNGLHLADLVFPSF 229
Query: 81 LFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
L+I+GV I L++K R ++ ++++R+ KL G+ L T G +
Sbjct: 230 LWIMGVCIPLSIKSQLGRGISKSRICGRIVWRSCKLFAIGLCLNS---------TNGPQL 280
Query: 137 RMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVLV 193
+RL GVLQR +++L+V L+ + ++ Q Q + +I+ L+ + ++
Sbjct: 281 EQLRLMGVLQRFGIAFLVVGLLHTVCSRRDQLSPQRAWQRAIYDICLFSGELAVLLALIA 340
Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGIN 251
YL L +G VP + GK N NP C A GYIDR+VLG
Sbjct: 341 AYLGLTFGLPVPGCPRGYLGPG----GKHNNAA-------NPNCIGGAAGYIDRQVLGNA 389
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H+Y HP + T F+PEG+ + S++ T++G G
Sbjct: 390 HIYQHPTAKYVYDATA-------------------FDPEGIFGCLLSVVQTLLGAFAGVT 430
Query: 312 IIHTKGHLARLKQW 325
++ ARLK+W
Sbjct: 431 LLVHATWQARLKRW 444
>gi|10177926|dbj|BAB11337.1| unnamed protein product [Arabidopsis thaliana]
Length = 384
Score = 122 bits (306), Expect = 3e-25, Method: Compositional matrix adjust.
Identities = 69/118 (58%), Positives = 86/118 (72%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD+FRGL VA MILVD GG P I+H+PW+G LADFVMPFFLFIVGV++A A
Sbjct: 144 ERLVSLDVFRGLTVAFMILVDDVGGILPSINHSPWDGVTLADFVMPFFLFIVGVSLAFAY 203
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
K + R A +K + R+LKLL G+ LQGGF H + LTYG+DV IRL G+LQ + +
Sbjct: 204 KNLSCRFVATRKALIRSLKLLLLGLFLQGGFIHGLNNLTYGIDVEKIRLMGILQNLKV 261
Score = 120 bits (302), Expect = 8e-25, Method: Compositional matrix adjust.
Identities = 50/98 (51%), Positives = 69/98 (70%)
Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
V CGVR P CNAVG +DR LGI H+Y P + R+K C+ + P GPL DAPSW
Sbjct: 259 LKVKCGVRGHTGPGCNAVGMLDRMFLGIQHLYRKPVYARTKQCSINYPNNGPLPPDAPSW 318
Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
C APF+PEGLLSS+ + ++ ++G+H+GH+IIH K +++
Sbjct: 319 CQAPFDPEGLLSSLMATVTCLVGLHYGHIIIHFKVNIS 356
>gi|403303686|ref|XP_003942455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Saimiri boliviensis boliviensis]
Length = 631
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 149/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 235 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSIFLSMT 294
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 295 SIMQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 349
Query: 150 LSYLLVSLVE-IFTKDVQD---KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + ++S WL+ + ++L L + VP
Sbjct: 350 VTYFVVAVLELLFAKPVPEHCASERSCLSLQDITSSWPQWLLILALEGLWLGLTFLLPVP 409
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG NH+Y HP S A
Sbjct: 410 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDNHLYQHP----SSA 455
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
+ ++PEG+L +++SIL +GV G ++++ TK L
Sbjct: 456 VLYHT--------------EVAYDPEGILGTINSILMAFLGVQAGKILLYYKARTKDILI 501
Query: 321 RLKQW 325
R W
Sbjct: 502 RFTAW 506
>gi|395842491|ref|XP_003794051.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Otolemur garnettii]
Length = 677
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 97/320 (30%), Positives = 159/320 (49%), Gaps = 49/320 (15%)
Query: 22 SDQQEKS-HLKTQ--RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
SD Q + HL T RL +D FRG+++ LM+ V++ GG + H+ WNG +AD V P
Sbjct: 266 SDAQPATWHLSTHPPRLRCVDTFRGISLTLMVFVNYGGGKYWYFKHSSWNGLTVADLVFP 325
Query: 79 FFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
+F+FI+G ++ L+ L+R + ++K+ +R+ L+ GI++ P+ +
Sbjct: 326 WFVFIMGSSVFLSMTSVLQRGCSKGRLLRKIAWRSFLLICIGIIIVN-----PNYCLGPL 380
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSV-GRFSIFRLY-CW-HWLMAAC 190
+R+ GVLQR+ ++Y +V+++E +F K V + S G FS+ + W WL+
Sbjct: 381 SWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPENCASQRGCFSLGDVTSSWPQWLLILT 440
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
+ V+L L + VP + D GK N T G A GYID +LG
Sbjct: 441 LESVWLCLTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDHLLLG 490
Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
NH+YHHP S A + ++PEG+L +++SI+ +GV G
Sbjct: 491 ENHLYHHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAG 532
Query: 310 HVIIH----TKGHLARLKQW 325
++++ TK L R W
Sbjct: 533 KILLYYKDQTKDILMRFAGW 552
>gi|195447210|ref|XP_002071113.1| GK25317 [Drosophila willistoni]
gi|194167198|gb|EDW82099.1| GK25317 [Drosophila willistoni]
Length = 537
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 92/316 (29%), Positives = 146/316 (46%), Gaps = 49/316 (15%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
+ D K+ + +RL SLD FRGLA+ LMI V+ GG + I H WNG +LAD V P
Sbjct: 176 SIGDAAAKA-TQRKRLRSLDTFRGLAIVLMIFVNSGGGGYDSIDHVAWNGLHLADLVFPC 234
Query: 80 FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
FL+I+GV I L++K R + ++I+R+ KL G+ L G
Sbjct: 235 FLWIMGVCIPLSIKSQLGRGTSKIQICGRIIWRSFKLFAIGVCLNS---------INGPK 285
Query: 136 VRMIRLCGVLQRIALSYLLVSLV-EIFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVL 192
+ +R+ GVLQR +++L+V L+ + ++ Q + SI+ ++ + + ++
Sbjct: 286 LEQLRVMGVLQRFGVAFLVVGLLHTVCSRRDHISPQQAWQRSIYDICIFSGEFAVLLALI 345
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGI 250
YL L YG VP + GK N NP C A GYID++VLG
Sbjct: 346 ATYLGLTYGLKVPGCPRGYLGPG----GKSNNAA-------NPHCIGGAAGYIDQQVLGN 394
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
H+Y +P + T F+PEGL + S++ ++G G
Sbjct: 395 AHIYQYPTAKYVYDAT-------------------AFDPEGLFGCLLSVVHVLLGAFAGV 435
Query: 311 VIIHTKGHLARLKQWV 326
++ +R+K+W
Sbjct: 436 TLLVHPTWQSRMKRWT 451
>gi|432845830|ref|XP_004065874.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Oryzias latipes]
Length = 622
Score = 122 bits (305), Expect = 4e-25, Method: Compositional matrix adjust.
Identities = 88/293 (30%), Positives = 148/293 (50%), Gaps = 42/293 (14%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD FRG+A+ +M+ V++ GG + H WNG +AD V P+F+F++G +IAL++
Sbjct: 225 KRLRSLDTFRGIALVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPWFVFVMGTSIALSI 284
Query: 93 KRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+ R ++K+++R+++L G+ F P+ G+ +R+ GVLQR+
Sbjct: 285 NSLLRAGLTRGSLLRKIVWRSIQLFLIGV-----FIINPNYCQGGLSWENLRIPGVLQRL 339
Query: 149 ALSYLLVSLVEIFTK----DVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
A SYL+V+ +++ DV D F LY W++ + V++L+L + V
Sbjct: 340 AFSYLVVASLDLMVARGHLDVLQTDAWWSPFLDVLLYWPAWVVVLLLEVLWLSLTFLLPV 399
Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
PD + D G N T G A G++DR +LG H+Y P+ R
Sbjct: 400 PDCPTGYLGPGGIGDMGLYANCTGG----------AAGFLDRWLLGEKHIYQTPS-SRVL 448
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
TQ P++PEG+L S++S+L +G+ G +I+H +
Sbjct: 449 YLTQ-----------------IPYDPEGVLGSINSVLMAFLGLQAGKIILHYR 484
>gi|74208071|dbj|BAE29143.1| unnamed protein product [Mus musculus]
Length = 656
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/325 (28%), Positives = 152/325 (46%), Gaps = 51/325 (15%)
Query: 17 SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
++P +D Q E RL +D FRGLA+ LM+ V++ GG + H+ WNG +AD
Sbjct: 242 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 301
Query: 76 VMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
V P+F+FI+G +I L++ I R + K+++R+ L+ G+++ P+
Sbjct: 302 VFPWFVFIMGTSIFLSMTSILQRGCSKFKLLGKIVWRSFLLICIGVIIVN-----PNYCL 356
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
+ +R+ GVLQR+ ++Y +V+++E F K V D S F L W W
Sbjct: 357 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 414
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
L + ++LAL + VP + D GK + T G A GYID
Sbjct: 415 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 464
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R +LG NH+Y HP+ ++PEG+L +++SI+ +
Sbjct: 465 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 506
Query: 305 GVHFGHVIIH----TKGHLARLKQW 325
GV G ++++ TK L R W
Sbjct: 507 GVQAGKILVYYKDQTKAILTRFAAW 531
>gi|23272280|gb|AAH24084.1| Hgsnat protein [Mus musculus]
gi|148700869|gb|EDL32816.1| DNA segment, Chr 8, ERATO Doi 354, expressed [Mus musculus]
Length = 624
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/325 (28%), Positives = 152/325 (46%), Gaps = 51/325 (15%)
Query: 17 SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
++P +D Q E RL +D FRGLA+ LM+ V++ GG + H+ WNG +AD
Sbjct: 210 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 269
Query: 76 VMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
V P+F+FI+G +I L++ I R + K+++R+ L+ G+++ P+
Sbjct: 270 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 324
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
+ +R+ GVLQR+ ++Y +V+++E F K V D S F L W W
Sbjct: 325 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 382
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
L + ++LAL + VP + D GK + T G A GYID
Sbjct: 383 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 432
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R +LG NH+Y HP+ ++PEG+L +++SI+ +
Sbjct: 433 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 474
Query: 305 GVHFGHVIIH----TKGHLARLKQW 325
GV G ++++ TK L R W
Sbjct: 475 GVQAGKILVYYKDQTKAILTRFAAW 499
>gi|26330552|dbj|BAC29006.1| unnamed protein product [Mus musculus]
gi|74213594|dbj|BAE35603.1| unnamed protein product [Mus musculus]
gi|74225342|dbj|BAE31601.1| unnamed protein product [Mus musculus]
Length = 624
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/325 (28%), Positives = 152/325 (46%), Gaps = 51/325 (15%)
Query: 17 SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
++P +D Q E RL +D FRGLA+ LM+ V++ GG + H+ WNG +AD
Sbjct: 210 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 269
Query: 76 VMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
V P+F+FI+G +I L++ I R + K+++R+ L+ G+++ P+
Sbjct: 270 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 324
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
+ +R+ GVLQR+ ++Y +V+++E F K V D S F L W W
Sbjct: 325 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 382
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
L + ++LAL + VP + D GK + T G A GYID
Sbjct: 383 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 432
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R +LG NH+Y HP+ ++PEG+L +++SI+ +
Sbjct: 433 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 474
Query: 305 GVHFGHVIIH----TKGHLARLKQW 325
GV G ++++ TK L R W
Sbjct: 475 GVQAGKILVYYKDQTKAILTRFAAW 499
>gi|115292433|ref|NP_084160.1| heparan-alpha-glucosaminide N-acetyltransferase [Mus musculus]
gi|341940800|sp|Q3UDW8.2|HGNAT_MOUSE RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
AltName: Full=Transmembrane protein 76
Length = 656
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/325 (28%), Positives = 152/325 (46%), Gaps = 51/325 (15%)
Query: 17 SEPDVSD-QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
++P +D Q E RL +D FRGLA+ LM+ V++ GG + H+ WNG +AD
Sbjct: 242 ADPLSADYQPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADL 301
Query: 76 VMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
V P+F+FI+G +I L++ I R + K+++R+ L+ G+++ P+
Sbjct: 302 VFPWFVFIMGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCL 356
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HW 185
+ +R+ GVLQR+ ++Y +V+++E F K V D S F L W W
Sbjct: 357 GPLSWDKVRIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQW 414
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
L + ++LAL + VP + D GK + T G A GYID
Sbjct: 415 LTILTLESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYID 464
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R +LG NH+Y HP+ ++PEG+L +++SI+ +
Sbjct: 465 RLLLGDNHLYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFL 506
Query: 305 GVHFGHVIIH----TKGHLARLKQW 325
GV G ++++ TK L R W
Sbjct: 507 GVQAGKILVYYKDQTKAILTRFAAW 531
>gi|195041852|ref|XP_001991329.1| GH12115 [Drosophila grimshawi]
gi|193901087|gb|EDV99953.1| GH12115 [Drosophila grimshawi]
Length = 573
Score = 121 bits (303), Expect = 6e-25, Method: Compositional matrix adjust.
Identities = 94/316 (29%), Positives = 146/316 (46%), Gaps = 49/316 (15%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
+ D K+ + +RL SLD FRGL + LMI V+ GG + I HA WNG +LAD V P
Sbjct: 171 SIGDAAAKAT-QRKRLRSLDTFRGLCIVLMIFVNSGGGGYSWIEHAAWNGLHLADIVFPS 229
Query: 80 FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
FL+I+GV I L++K R + ++++R KL G+ L G
Sbjct: 230 FLWIMGVCIPLSIKAQLARGTSKTRICLRIVWRACKLFAIGLCLNS---------VNGPQ 280
Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVL 192
+ +RL GVLQR ++YLLV+++ + ++ Q Q + +I+ L+ + + ++
Sbjct: 281 LEQLRLMGVLQRFGIAYLLVAILHTVCSRRDQLSPQRAWQRAIYDICLFSGEFAVLLALI 340
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGI 250
YL L +G VP + GK N +P C A GYID VLG
Sbjct: 341 ATYLGLTFGLRVPGCPVGYLGPG----GKHNNAA-------HPNCIGGAAGYIDLLVLGN 389
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
H+Y HP + T F+PEG+ + S++ T++G G
Sbjct: 390 AHIYQHPTAKYVYDAT-------------------AFDPEGIFGCLLSVVQTLLGAFAGV 430
Query: 311 VIIHTKGHLARLKQWV 326
++ RLK+W+
Sbjct: 431 TLLVHSTWQGRLKRWL 446
>gi|74198170|dbj|BAE35261.1| unnamed protein product [Mus musculus]
Length = 624
Score = 120 bits (302), Expect = 7e-25, Method: Compositional matrix adjust.
Identities = 92/317 (29%), Positives = 147/317 (46%), Gaps = 50/317 (15%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
Q E RL +D FRGLA+ LM+ V++ GG + H+ WNG +AD V P+F+FI
Sbjct: 218 QPETRRSSANRLRCVDTFRGLALVLMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFI 277
Query: 84 VGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
+G +I L++ I R + K+++R+ L+ G+++ P+ + +
Sbjct: 278 MGTSIFLSMTSILQRGCSKLKLLGKIVWRSFLLICIGVIIVN-----PNYCLGPLSWDKV 332
Query: 140 RLCGVLQRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLY----CW-HWLMAACVLV 193
R+ GVLQR+ ++Y +V+++E F K V D S F L W WL +
Sbjct: 333 RIPGVLQRLGVTYFVVAVLEFFFWKPV--PDSCTLESSCFSLRDITSSWPQWLTILTLES 390
Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
++LAL + VP + D GK + T G A GYIDR +LG NH
Sbjct: 391 IWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYIDRLLLGDNH 440
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y HP+ ++PEG+L +++SI+ +GV G ++
Sbjct: 441 LYQHPS------------------STVLYHTEVAYDPEGVLGTINSIVMAFLGVQAGKIL 482
Query: 313 IH----TKGHLARLKQW 325
++ TK L R W
Sbjct: 483 VYYKDQTKAILTRFAAW 499
>gi|432099917|gb|ELK28811.1| Heparan-alpha-glucosaminide N-acetyltransferase [Myotis davidii]
Length = 586
Score = 120 bits (301), Expect = 9e-25, Method: Compositional matrix adjust.
Identities = 94/305 (30%), Positives = 149/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL +D FRGLA+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 190 RLRCVDTFRGLALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 249
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + KV +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 250 SILQRGCSKFRLLGKVAWRSFLLICIGIVIVN-----PNYCLGPLSWDKVRIPGVLQRLG 304
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R S + W WL+ + V+LAL + VP
Sbjct: 305 VTYFVVAVLELLFAKPVPESCVSERRCSCLQDITSSWPQWLVILMLESVWLALTFFLPVP 364
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D GK N T G A GYIDR +LG +H+Y HP+
Sbjct: 365 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHIYQHPSSNVLYH 414
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
T ++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 415 TT------------------VAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILI 456
Query: 321 RLKQW 325
R W
Sbjct: 457 RFTAW 461
>gi|194226375|ref|XP_001488696.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Equus caballus]
Length = 663
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 91/306 (29%), Positives = 149/306 (48%), Gaps = 46/306 (15%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA- 91
QRL +D FRG+A+ +M+ V++ GG + H+ WNG +AD V P+F+FI+G +I L+
Sbjct: 266 QRLRCVDTFRGIALIIMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSM 325
Query: 92 ---LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
L+R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 326 TSTLQRGCSKFRLLGKIAWRSFLLISLGIVVVN-----PNYCLGPLSWDKLRIPGVLQRL 380
Query: 149 ALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYV 204
++Y +V+++E +F K V S R S R L W WL + ++L L + V
Sbjct: 381 GVTYFVVAVLELLFAKPVPGSGASERRCSSLRDILSSWPQWLFILLLESIWLGLTFFLPV 440
Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
P + D G+ N T G A GYIDR +LG +H+Y HP S
Sbjct: 441 PGCPTGYLGPGGIGDLGRYPNCTGG----------AAGYIDRLLLGEDHLYQHP----SS 486
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHL 319
A + ++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 487 AVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGRILLYYKDQTKAIL 532
Query: 320 ARLKQW 325
R W
Sbjct: 533 LRFTAW 538
>gi|345482764|ref|XP_001600799.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Nasonia vitripennis]
Length = 569
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/335 (27%), Positives = 151/335 (45%), Gaps = 49/335 (14%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
+ D + + +R+ SLD RG+++ LMI V++ + + HA WNG + D V
Sbjct: 170 AAADEDELEVGKKTAKRRVRSLDTVRGMSILLMIFVNNGAAGYALLEHATWNGLLVGDLV 229
Query: 77 MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDEL 130
P F++I+GV I L+ L R R + ++ R++ L G+ L GG +
Sbjct: 230 FPCFMWIMGVCIPLSISAQLSRGSSRLRLCRAIVKRSVYLFAIGLALNTLGGRNQ----- 284
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
+ IR+ GVLQR L+YL+ +V DK + L W++A
Sbjct: 285 -----LERIRIFGVLQRFGLAYLVAGIVYALAARPDDKQSKRMLGDVVALIP-QWIVALL 338
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDS--ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
+L + A+++ VP + AD GK +N + G A GY+D+ +L
Sbjct: 339 ILAAHCAVVFLLPVPGCPRGYLGPGGRHAD-GKYWNCSGG----------ATGYVDKVLL 387
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G++H+Y P T +S + PF+PEG+L S++SI +G+
Sbjct: 388 GVDHIYQLP--------TANSVYG-----------SGPFDPEGVLGSLTSIFQVFLGIQA 428
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
G ++ ARL +W+ L G LH+TN
Sbjct: 429 GQILRTYGSWKARLVRWLLWAVLLGAVGAALHYTN 463
>gi|126304129|ref|XP_001381943.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Monodelphis domestica]
Length = 638
Score = 120 bits (301), Expect = 1e-24, Method: Compositional matrix adjust.
Identities = 92/327 (28%), Positives = 153/327 (46%), Gaps = 48/327 (14%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
RL SLD FRG+++ +MI V++ GG + H WNG +AD V P+F+FI+G +IAL+
Sbjct: 240 VHRLRSLDTFRGISLIIMIFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGSSIALS 299
Query: 92 LKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
L + R + K+++R+ L G+L+ P+ + +R+ GVLQR
Sbjct: 300 LSSMLRRGCSKWKLLGKILWRSFLLCVIGVLIMN-----PNYCLGPLSWDKLRIPGVLQR 354
Query: 148 IALSYLLVSLVEI-FTKDVQDK---DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
+ L+YL+V+++E+ F K V + + F Y W+ + V++ + +
Sbjct: 355 LGLTYLVVAVLELLFAKAVPENSTMESLCASFQDIISYWPQWIFILMLEAVWVCVTFLLP 414
Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
VP + D+GK N T G A GYIDR +LG +H+Y HP+
Sbjct: 415 VPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGEDHIYQHPS---- 460
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH---- 318
+ ++PEGLL +++SI+ +GV G +++ K
Sbjct: 461 --------------PNVLYHTKVAYDPEGLLGTINSIVMAFLGVQAGKILLFYKDQHKQI 506
Query: 319 LARLKQWVTMGFALLIFGLTLHFTNGE 345
+ R W M +I G+ F+ E
Sbjct: 507 MLRFLLWSAM--LAIISGVLTKFSQNE 531
>gi|307178500|gb|EFN67189.1| Heparan-alpha-glucosaminide N-acetyltransferase [Camponotus
floridanus]
Length = 466
Score = 119 bits (299), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 97/337 (28%), Positives = 168/337 (49%), Gaps = 50/337 (14%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
+ E + S+ ++ + R+ S+D FRG+A+ LMI V++ GG++ +H+ WNG +AD
Sbjct: 59 LQEAETSNPIIGTNRSSTRIRSVDTFRGIAILLMIFVNNRGGEYVFFNHSAWNGLTVADL 118
Query: 76 VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELT 131
V+P+F +I+G++I ++ + +++ K+I R L+ L+ G++L S++ L
Sbjct: 119 VLPWFAWIMGLSITISKRSELRVSNSRTKIILRCLQRAFILILLGLMLNSIRSNSLQNL- 177
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWL 186
R GVLQ +A+SY + + +E IF + QD GRF+ R L W WL
Sbjct: 178 --------RFPGVLQLLAVSYFVCATIETIFMRMHSQDDLLQFGRFTFLRDILNNWAQWL 229
Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
+ ++V + + + VP+ + + +G N T G A GYIDR
Sbjct: 230 IILAIVVTHTLITFLLPVPNCPTGYLGPGGYSHFGNFPNCTGG----------AAGYIDR 279
Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
V G +HMY+ +P G + P +PEG++++VS IL +G
Sbjct: 280 LVFG-SHMYNK----------TKNPVYGTI---------LPHDPEGIMNTVSIILVVYLG 319
Query: 306 VHFGHVIIHTKGHLARLKQWVT-MGFALLIFGLTLHF 341
VH G +++ AR+ +W+ G +I GL +F
Sbjct: 320 VHAGKILLLYYQCNARVVRWLLWSGVTGIIAGLLCNF 356
>gi|363733262|ref|XP_420455.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gallus
gallus]
Length = 581
Score = 119 bits (298), Expect = 2e-24, Method: Compositional matrix adjust.
Identities = 94/307 (30%), Positives = 148/307 (48%), Gaps = 39/307 (12%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD FRGL++ +M+ V++ GG + H WNG +AD V P+F+FI+G +I+L+L
Sbjct: 184 QRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSL 243
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
+ +KV+++ L F ILL G P+ + +R+ GVLQR+ L+Y
Sbjct: 244 SSTLRWGSSKQKVLWKILWRSFLLILL-GVIVVNPNYCLGALSWENLRIPGVLQRLGLTY 302
Query: 153 LLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
L+V+ +E +FT+ D + S + W+ + V++L L + VP
Sbjct: 303 LVVAALELLFTRTGADSGTLEMSCPALQDILPFWPQWIFILMLEVIWLCLTFLLPVPGCP 362
Query: 209 FTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
+ D+G N T G A GYIDR VLG H+Y HP+ T
Sbjct: 363 RGYLGPGGIGDFGNYLNCTGG----------AAGYIDRLVLGEKHIYQHPSCNVLYQTT- 411
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH----LARLK 323
P++PEG+L ++++IL +G+ G +I+ K ++R
Sbjct: 412 -----------------VPYDPEGILGTINTILMAFLGLQAGKIILSYKDQHKQIMSRFF 454
Query: 324 QW-VTMG 329
W V MG
Sbjct: 455 IWSVVMG 461
>gi|395501613|ref|XP_003755186.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Sarcophilus harrisii]
Length = 425
Score = 119 bits (298), Expect = 3e-24, Method: Compositional matrix adjust.
Identities = 91/309 (29%), Positives = 143/309 (46%), Gaps = 46/309 (14%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFR--GLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
D S K + R L + GL++ LM+ V++ GG + HAPWNG +AD VM
Sbjct: 13 DASVFNNKGKIINFRPWELSLVSKHGLSLTLMVFVNYGGGGYWFFEHAPWNGLTVADLVM 72
Query: 78 PFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGIL-LQGGFSHAPDELTY 132
P+F+FI+G ++ L + R ++KV +RT L+ G+L L G + P ++
Sbjct: 73 PWFVFILGTSVGLTFHNMQKRGVKNIQLLRKVAWRTGVLIIIGVLFLNYGPADGPLSWSW 132
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR-FSIFR---LYCWHWLMA 188
RL GVLQR+ +Y V+L++I + V ++ FR LY W +
Sbjct: 133 A------RLPGVLQRLGFTYFAVALMQIAFGVTDTQIYQVNLWWAPFRDVILYWKEWFII 186
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
+ +++L L + VP + D GK FN T G A YID+ +
Sbjct: 187 ISLEILWLCLTFLLPVPGCPRGYLGPGGIGDEGKYFNCTGG----------AAAYIDKWI 236
Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
LG NH+Y P+ + TQ PF+PEG+L +++SIL G+
Sbjct: 237 LGENHLYQFPSCKELYKTTQ------------------PFDPEGILGTINSILMAFFGLQ 278
Query: 308 FGHVIIHTK 316
G +I+ +
Sbjct: 279 AGKIILMYR 287
>gi|332241088|ref|XP_003269721.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Nomascus leucogenys]
Length = 654
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 149/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 258 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMA 317
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 318 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 372
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 373 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 432
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 433 GCPIGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 478
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
+ ++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 479 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 524
Query: 321 RLKQW 325
R W
Sbjct: 525 RFTAW 529
>gi|297682811|ref|XP_002819101.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pongo
abelii]
Length = 645
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 92/305 (30%), Positives = 149/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 249 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 308
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 309 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 363
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 364 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 423
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 424 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 469
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
+ ++PEG+L +V+SI+ +GV G ++++ TK L
Sbjct: 470 VLYHT--------------EVAYDPEGILGTVNSIVMAFLGVQAGKILLYYKARTKDILI 515
Query: 321 RLKQW 325
R W
Sbjct: 516 RFTAW 520
>gi|194762450|ref|XP_001963347.1| GF20351 [Drosophila ananassae]
gi|190629006|gb|EDV44423.1| GF20351 [Drosophila ananassae]
Length = 576
Score = 118 bits (296), Expect = 4e-24, Method: Compositional matrix adjust.
Identities = 95/327 (29%), Positives = 151/327 (46%), Gaps = 48/327 (14%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ +RL SLD FRGL++ LMI V+ GG + I HA WNG +LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYTWIDHAAWNGLHLADLVFPSFLWIMGVCIPL 241
Query: 91 ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
++K R + ++++R++KL G+ L +EL R+ GVLQ
Sbjct: 242 SVKAQLSRGASKGRICLRILWRSIKLFAIGLCLNSMSGPGLEEL---------RIMGVLQ 292
Query: 147 RIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAACVLVVYLALLYGTY 203
R +++L+V ++ +D +S R L+ + ++ YL L +G
Sbjct: 293 RFGVAFLVVGVLHTLCSRRDPISPQRSWQRAVHDICLFSGELAVLLALVATYLGLTFGLR 352
Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGINHMYHHPAWRR 261
VP K G F+ NP C A GY+D KVLG H+Y HP
Sbjct: 353 VPG-----CPKGYLGPGGKFDYAS------NPNCIGGAAGYVDLKVLGNAHIYQHP---- 397
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
+ DS A F+PEG+ + S++ ++G G ++ +R
Sbjct: 398 TAKYVYDS---------------AAFDPEGIFGCILSVVQVLLGAFAGVTLLVHPTWQSR 442
Query: 322 LKQWVTMGFALLIFGLTLHFTNGEHGS 348
+++W+ + L + G L + E G+
Sbjct: 443 IRRWLILAVVLGLIGGALCGFSREGGA 469
>gi|124007195|sp|Q68CP4.2|HGNAT_HUMAN RecName: Full=Heparan-alpha-glucosaminide N-acetyltransferase;
AltName: Full=Transmembrane protein 76
Length = 663
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 149/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 326
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 327 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 381
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 382 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 441
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 442 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 487
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
+ ++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 488 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 533
Query: 321 RLKQW 325
R W
Sbjct: 534 RFTAW 538
>gi|397505549|ref|XP_003823319.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
1 [Pan paniscus]
Length = 585
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 149/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 189 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 248
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 249 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 303
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 304 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 363
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 364 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 409
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
+ ++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 410 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 455
Query: 321 RLKQW 325
R W
Sbjct: 456 RFTAW 460
>gi|410332579|gb|JAA35236.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
Length = 635
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 149/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 298
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 413
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 459
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
+ ++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 460 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 505
Query: 321 RLKQW 325
R W
Sbjct: 506 RFTAW 510
>gi|281351504|gb|EFB27088.1| hypothetical protein PANDA_006846 [Ailuropoda melanoleuca]
Length = 557
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 152/307 (49%), Gaps = 46/307 (14%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
+ RL +D FRG+A+ LM+ V++ GG + H+ WNG +AD V P+F+FI+G ++ L+
Sbjct: 159 SPRLRCVDTFRGIALILMVFVNYGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLS 218
Query: 92 LKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
+ + R + K+ +R+ L+ G+++ P+ + +R+ GVLQR
Sbjct: 219 MTSVLQRGCSKFKLLGKIAWRSFLLICIGVVIVN-----PNYCLGPLSWDKVRIPGVLQR 273
Query: 148 IALSYLLVSLVE-IFTKDVQDKDQSV-GRFSIFR-LYCW-HWLMAACVLVVYLALLYGTY 203
+ ++Y +V+++E IF K V + S G FS+ ++ W WL + ++L L +
Sbjct: 274 LGVTYFVVAVLELIFAKPVPESCASERGCFSLRDIIFSWPQWLFILMLESIWLGLTFFLP 333
Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
VP + D+GK N T G A GYIDR +LG +H+Y HP S
Sbjct: 334 VPGCPTGYLGPGGIGDWGKYPNCTGG----------AAGYIDRLLLGDDHIYQHP----S 379
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGH 318
A + ++PEG+L +++SI+ +G+ G ++++ TK
Sbjct: 380 SAVLYHT--------------EVAYDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDI 425
Query: 319 LARLKQW 325
L R W
Sbjct: 426 LIRFTAW 432
>gi|150378452|ref|NP_689632.2| heparan-alpha-glucosaminide N-acetyltransferase precursor [Homo
sapiens]
gi|332826066|ref|XP_519741.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Pan
troglodytes]
gi|194385774|dbj|BAG65262.1| unnamed protein product [Homo sapiens]
gi|410222096|gb|JAA08267.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
gi|410256018|gb|JAA15976.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
gi|410299048|gb|JAA28124.1| heparan-alpha-glucosaminide N-acetyltransferase [Pan troglodytes]
Length = 635
Score = 117 bits (294), Expect = 6e-24, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 149/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 298
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 413
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 459
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
+ ++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 460 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 505
Query: 321 RLKQW 325
R W
Sbjct: 506 RFTAW 510
>gi|51491261|emb|CAH18694.1| hypothetical protein [Homo sapiens]
Length = 459
Score = 117 bits (294), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 93/321 (28%), Positives = 153/321 (47%), Gaps = 46/321 (14%)
Query: 18 EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
+ DV + RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V
Sbjct: 47 DGDVQPATWRLSALPPRLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVF 106
Query: 78 PFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
P+F+FI+G +I L++ I R + + K+ +R+ L+ GI++ P+
Sbjct: 107 PWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGP 161
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAA 189
+ +R+ GVLQR+ ++Y +V+++E +F K V + S R W WL+
Sbjct: 162 LSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLIL 221
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
+ ++L L + VP + D+GK N T G A GYIDR +L
Sbjct: 222 VLEGLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLL 271
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G +H+Y HP S A + ++PEG+L +++SI+ +GV
Sbjct: 272 GDDHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQA 313
Query: 309 GHVIIH----TKGHLARLKQW 325
G ++++ TK L R W
Sbjct: 314 GKILLYYKARTKDILIRFTAW 334
>gi|307201549|gb|EFN81312.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
saltator]
Length = 564
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 148/329 (44%), Gaps = 41/329 (12%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
++ +R+ ++D FRG + MI V+ G + + H WNG D V P F++
Sbjct: 163 EEATNKEPTKRRVKAIDAFRGASTLFMIFVNDGSGSYSVLGHTTWNGMLPGDLVFPCFMW 222
Query: 83 IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
I+GV + +AL R ++ F LK F +L G S + L + IR+
Sbjct: 223 IMGVCVPIALSAQLRRGIPKLEIAFTVLKRSF--LLFLIGVSL--NTLGTNAQLEKIRVF 278
Query: 143 GVLQRIALSYLLVSLVEIFTK---DVQDKDQSVGRFSI----FRLYCWHWLMAACVLVVY 195
GVLQR ++YL+VS++ + + +QD+D S R + ++ +W +++V+
Sbjct: 279 GVLQRFGVTYLVVSVMYLCLEPSLQLQDQDSSRNRVTRVLRDMQVLLPYWSFMLILVMVH 338
Query: 196 LALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
L +G VP+ + + G N T G A GYIDR VL INH+
Sbjct: 339 CGLTFGLAVPNCPTGYLGPGGTHEDGYYMNCTGG----------AAGYIDRVVLTINHI- 387
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
F GP A + PF+PEG+L +++ +GVH G +++
Sbjct: 388 ----------------FAGP--TIASVYGSGPFDPEGILGCLTATFQVYLGVHAGVILMM 429
Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTN 343
K R+ +W++ + G LHF N
Sbjct: 430 YKNWKERVVRWLSWAVLYGVLGCILHFCN 458
>gi|260816362|ref|XP_002602940.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
gi|229288254|gb|EEN58952.1| hypothetical protein BRAFLDRAFT_251788 [Branchiostoma floridae]
Length = 512
Score = 117 bits (294), Expect = 8e-24, Method: Compositional matrix adjust.
Identities = 92/302 (30%), Positives = 144/302 (47%), Gaps = 45/302 (14%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
S Q S +RL SLD FRGL++A+M+ V++ GG + HA WNG +AD V P+F+
Sbjct: 109 SSTQPASQ-GIRRLRSLDTFRGLSLAVMVFVNYGGGGYWFFKHARWNGLTVADLVFPWFV 167
Query: 82 FIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
FI+G +IAL+ +R+ R + KVI RT+ L G+ + T
Sbjct: 168 FIMGTSIALSFRRLLKKGVSRLSLLWKVIQRTVILFLLGLFIINTKKGHNSWST------ 221
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKD--VQDKDQSVGRFSIFR--LYCW-HWLMAACVL 192
+R+ GVLQR+AL+Y +V+L+E + + R + R + W WL V+
Sbjct: 222 -LRIPGVLQRLALTYFIVALMESWKPRGYLSLYLLQTSRIAPIRDIVNSWGQWLFMIVVV 280
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
++L L++ VP+ + +N T G A GYIDR V +H
Sbjct: 281 TLHLVLMFWLQVPNCPIGYLGPGGLSDIAHYNCTGG----------AAGYIDRAVFTDDH 330
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y HP T + +E PFEPEGLL +++S L +G+ ++
Sbjct: 331 IYQHP--------TPITVYE----------TEVPFEPEGLLGTLTSALLCFLGLQVKNMY 372
Query: 313 IH 314
++
Sbjct: 373 MY 374
>gi|350412149|ref|XP_003489557.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Bombus impatiens]
Length = 571
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 95/324 (29%), Positives = 159/324 (49%), Gaps = 51/324 (15%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
+ R+ S+D FRG+A+ LMI V++ GG + +H+ W G ++AD ++P+F +I+G++I ++
Sbjct: 179 SSRIQSVDAFRGIAILLMIFVNNGGGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITIS 238
Query: 92 ----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
L+ R + R+ L+ G++L S + ++L R G+LQ
Sbjct: 239 KRAELRLTTSRVKITLCCLRRSAILILLGLMLNSIDSKSLNDL---------RFPGILQL 289
Query: 148 IALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGT 202
+A+SY + +++E IF K QD GRF+IFR L W WL+ A ++ + + +
Sbjct: 290 LAVSYFVCAILETIFMKPHSQDILLQFGRFAIFRDILDSWPQWLIMAGIMTTHTLITFFL 349
Query: 203 YVPDWQFTIINKDSADY--GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
++P+ + GK N T G A GYIDR + G NH Y
Sbjct: 350 HMPNCPTGYFGPGGKYHYRGKYMNCTAG----------AAGYIDRLIFG-NHTYSK---- 394
Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
+DS + LR D PEGL++++S+I +GVH G +++ A
Sbjct: 395 -----IKDSIYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKILLLYYQGNA 439
Query: 321 RLKQWVTMG-FALLIFGLTLHFTN 343
RL +W F +I G+ +F N
Sbjct: 440 RLIRWFLWAIFTGIIAGILCNFEN 463
>gi|301765942|ref|XP_002918389.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Ailuropoda melanoleuca]
Length = 851
Score = 117 bits (293), Expect = 9e-24, Method: Compositional matrix adjust.
Identities = 88/307 (28%), Positives = 152/307 (49%), Gaps = 46/307 (14%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
+ RL +D FRG+A+ LM+ V++ GG + H+ WNG +AD V P+F+FI+G ++ L+
Sbjct: 453 SPRLRCVDTFRGIALILMVFVNYGGGRYWYFKHSSWNGLTVADLVFPWFVFIMGSSVFLS 512
Query: 92 LKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
+ + R + K+ +R+ L+ G+++ P+ + +R+ GVLQR
Sbjct: 513 MTSVLQRGCSKFKLLGKIAWRSFLLICIGVVIVN-----PNYCLGPLSWDKVRIPGVLQR 567
Query: 148 IALSYLLVSLVE-IFTKDVQDKDQSV-GRFSIFR-LYCW-HWLMAACVLVVYLALLYGTY 203
+ ++Y +V+++E IF K V + S G FS+ ++ W WL + ++L L +
Sbjct: 568 LGVTYFVVAVLELIFAKPVPESCASERGCFSLRDIIFSWPQWLFILMLESIWLGLTFFLP 627
Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
VP + D+GK N T G A GYIDR +LG +H+Y HP S
Sbjct: 628 VPGCPTGYLGPGGIGDWGKYPNCTGG----------AAGYIDRLLLGDDHIYQHP----S 673
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGH 318
A + ++PEG+L +++SI+ +G+ G ++++ TK
Sbjct: 674 SAVLYHT--------------EVAYDPEGILGTINSIVMAFLGIQAGKILLYYKDQTKDI 719
Query: 319 LARLKQW 325
L R W
Sbjct: 720 LIRFTAW 726
>gi|291409013|ref|XP_002720836.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase
[Oryctolagus cuniculus]
Length = 613
Score = 117 bits (293), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 150/317 (47%), Gaps = 48/317 (15%)
Query: 24 QQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
Q E HL RL +D FRG+A+ LM+ V++ GG + H+ WNG +AD V P+F+
Sbjct: 205 QPETWHLSAAKHRLRCVDTFRGIALVLMVFVNYGGGRYWYFRHSSWNGLTVADLVFPWFV 264
Query: 82 FIVGVAIAL----ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
FI+G +I L AL+R + + K+ +R+ L+ GI++ P+ +
Sbjct: 265 FIMGSSIFLSMMSALQRGCSKLRLLGKIAWRSFLLIMIGIVIVN-----PNYCLGPLSWD 319
Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLV 193
+R+ GVLQR+ ++Y +V+++E +F K V + + R W WL+ +
Sbjct: 320 KVRIPGVLQRLGVTYFVVAVLELLFAKPVPENWVLESSCTCLRDVTSSWPQWLLILLLES 379
Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
++L L + VP + D+GK N T G A GYIDR +LG +H
Sbjct: 380 IWLGLSFFLPVPGCPTGYLGPGGIGDWGKYPNCTGG----------AAGYIDRVLLGDDH 429
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y HP+ ++PEG+L +++SI++ +GV G ++
Sbjct: 430 LYKHPS------------------STVLYHTEVAYDPEGILGTINSIVTAFLGVQAGKIL 471
Query: 313 I----HTKGHLARLKQW 325
+ TK L R W
Sbjct: 472 LFYKDQTKSILIRFTAW 488
>gi|431902215|gb|ELK08716.1| Heparan-alpha-glucosaminide N-acetyltransferase [Pteropus alecto]
Length = 585
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 91/305 (29%), Positives = 146/305 (47%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL +D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 189 RLRCVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 248
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + KV +R+ L+ GI F P+ + +R+ GVLQR+
Sbjct: 249 SILQRGCSKFRLLGKVTWRSFLLICIGI-----FIVNPNYCLGPLSWDKLRIPGVLQRLG 303
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + R S + + W WL + ++LAL + VP
Sbjct: 304 VTYFVVAVLELLFAKPVPESCTVERRCSSLQDIISSWPQWLFILMLESIWLALTFFLPVP 363
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D GK N T G A GYIDR +LG +H+Y HP+
Sbjct: 364 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 407
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 408 ------------STVLYHTKVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILI 455
Query: 321 RLKQW 325
R W
Sbjct: 456 RFTAW 460
>gi|380789677|gb|AFE66714.1| heparan-alpha-glucosaminide N-acetyltransferase precursor [Macaca
mulatta]
gi|383410547|gb|AFH28487.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
gi|384945386|gb|AFI36298.1| heparan-alpha-glucosaminide N-acetyltransferase [Macaca mulatta]
Length = 635
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 147/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 298
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 413
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP+
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 457
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 458 ------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILI 505
Query: 321 RLKQW 325
R W
Sbjct: 506 RFTAW 510
>gi|195476975|ref|XP_002100049.1| GE16376 [Drosophila yakuba]
gi|194187573|gb|EDX01157.1| GE16376 [Drosophila yakuba]
Length = 576
Score = 117 bits (292), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 94/336 (27%), Positives = 151/336 (44%), Gaps = 66/336 (19%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ +RL SLD FRGL++ LMI V+ GG + I HA WNG +LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADVVFPSFLWIMGVCIPL 241
Query: 91 ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
++K R +A ++++R++KL G+ L G ++ +R GVLQ
Sbjct: 242 SVKSQLSRGSSKARICLRILWRSIKLFVIGLCLNS---------MSGPNLEQLRFMGVLQ 292
Query: 147 RIALSYLLVSLV-------------EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
R ++YL+V ++ ++ + V D G ++ L+A ++
Sbjct: 293 RFGVAYLVVGVLHTLCCRREPISPQRLWQRAVHDVCLFSGELAV--------LLA--LVA 342
Query: 194 VYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
YL L YG VP + DY N G A GY+D +VLG H
Sbjct: 343 TYLGLTYGLRVPGCPRGYLGPGGKHDYNAHPNCIGG----------AAGYVDLQVLGNAH 392
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y HP + DS F+PEG+ + S++ ++G G +
Sbjct: 393 IYQHP----TAKYVYDS---------------TAFDPEGIFGCILSVVQVLLGAFAGVTL 433
Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGS 348
+ +R+++W + L + G L + E G+
Sbjct: 434 LVHPNWQSRIRRWTFLAILLGLIGGALCGFSREGGA 469
>gi|355779672|gb|EHH64148.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
fascicularis]
Length = 596
Score = 116 bits (291), Expect = 1e-23, Method: Compositional matrix adjust.
Identities = 89/305 (29%), Positives = 147/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 200 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 259
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 260 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 314
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 315 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 374
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP+
Sbjct: 375 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 418
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 419 ------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILI 466
Query: 321 RLKQW 325
R W
Sbjct: 467 RFTAW 471
>gi|313242995|emb|CBY39713.1| unnamed protein product [Oikopleura dioica]
Length = 597
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 87/308 (28%), Positives = 141/308 (45%), Gaps = 56/308 (18%)
Query: 20 DVSDQQEKSHLKTQ-----------RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWN 68
D +D+QE ++ R SLD RGL++ +MI V++ GG++ + H WN
Sbjct: 182 DQTDKQEDQEVQEDEPAPPAPAKKKRYKSLDTLRGLSLIIMIFVNYGGGEYWFMEHVAWN 241
Query: 69 GCNLADFVMPFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFS 124
G +AD VMP+FLF+ GV+I +AL KR + + +++ R++KL+ G++ GG
Sbjct: 242 GLTVADLVMPWFLFMSGVSIRIALQSRIKRGISKTEISYEILVRSVKLIGLGMITIGG-- 299
Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV--GRFSIFRLYC 182
R GVLQRI SY +V+++ + + DK+ G F
Sbjct: 300 --------NESWEYFRFPGVLQRIGFSYFVVAIIHLLVIEHPDKEPETNWGLFKEMSFNF 351
Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVG 241
L++ +L ++ L Y +P ++ G+ ++ G A G
Sbjct: 352 KEHLISWSILGAFICLTYLLPIPGCPTGYTGPGGLSENGEHYHCIGG----------AAG 401
Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
YIDRK+LG H+Y+ P D P PF+PEGLL +++SI
Sbjct: 402 YIDRKLLGEKHIYNWP------TAYHDEP------------NGVPFDPEGLLGTLTSIFM 443
Query: 302 TIIGVHFG 309
+G+ G
Sbjct: 444 VYLGLQAG 451
>gi|355697915|gb|EHH28463.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial [Macaca
mulatta]
Length = 596
Score = 116 bits (290), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 88/305 (28%), Positives = 147/305 (48%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 200 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 259
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
+ R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 260 SVLQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 314
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 315 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 374
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP+
Sbjct: 375 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS------ 418
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 419 ------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQAGKILLYYKAQTKDILI 466
Query: 321 RLKQW 325
R W
Sbjct: 467 RFTAW 471
>gi|354472121|ref|XP_003498289.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cricetulus griseus]
Length = 782
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 95/330 (28%), Positives = 156/330 (47%), Gaps = 49/330 (14%)
Query: 24 QQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
Q E H RL +D FRG+A+ LM+ V++ GG + H+ WNG +AD V P+F+
Sbjct: 374 QPETRHTSALPYRLRCVDTFRGIALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFV 433
Query: 82 FIVGVAIAL----ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
FI+G ++ L AL R + + K+ +R+ L+ GI++ P+ +
Sbjct: 434 FIMGSSVFLSMTSALHRGCSKFRLLGKITWRSFLLICIGIIVVN-----PNYCLGPLSWD 488
Query: 138 MIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK-DQSVGRFSIFRLYC-W-HWLMAACVLV 193
+R+ GVLQR+ ++Y +V+++E IF+K V D+ S+ + C W WL+ +
Sbjct: 489 KVRIPGVLQRLGVTYFVVAVLELIFSKPVPDRCALERSYLSLRDITCSWPQWLVVLILES 548
Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
++LAL + VP + D GK + T G A GYID +LG NH
Sbjct: 549 IWLALTFFLPVPGCPTGYLGPGGIGDMGKYPHCTGG----------ASGYIDHLLLGDNH 598
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y HP+ ++PEG+L +++SI+ +GV G ++
Sbjct: 599 LYQHPS------------------STVLYHTQVAYDPEGILGTINSIVMAFLGVQAGKIL 640
Query: 313 IH----TKGHLARLKQWVTMGFALLIFGLT 338
++ TK L R W + L+ LT
Sbjct: 641 LYYKDQTKAILMRFTAWCCI-LGLISIALT 669
>gi|410956346|ref|XP_003984803.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Felis
catus]
Length = 629
Score = 115 bits (289), Expect = 2e-23, Method: Compositional matrix adjust.
Identities = 90/305 (29%), Positives = 146/305 (47%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL +D FRG+A+ LM+ V++ GG + H+ WNG +AD V P+F+FI+G +I L++
Sbjct: 233 RLRCVDTFRGIALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPWFVFIMGSSIFLSMT 292
Query: 94 RIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + K+ +R+ L+ G+ F P+ + +R+ GVLQR+
Sbjct: 293 SILQRGCSKLKLMGKIGWRSFLLICIGM-----FIVNPNYCLGPLSWDKVRIPGVLQRLG 347
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E IF K V + S R ++ W WL + ++L L + VP
Sbjct: 348 VTYFVVAMLELIFAKPVPESCASERSCFSLRDIIFSWPQWLFILMLESIWLGLTFFLPVP 407
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 408 GCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHIYQHP----SSA 453
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
+ ++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 454 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTKDILI 499
Query: 321 RLKQW 325
R W
Sbjct: 500 RFTAW 504
>gi|326918494|ref|XP_003205523.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Meleagris gallopavo]
Length = 532
Score = 115 bits (289), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 86/297 (28%), Positives = 144/297 (48%), Gaps = 46/297 (15%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD FRGL++ +M+ V++ GG + H WNG +AD V P+F+FI+G +I+L+L
Sbjct: 131 QRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVADLVFPWFVFIMGTSISLSL 190
Query: 93 KRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+ + + K+++R+ L G+++ P+ + +R+ GVLQR+
Sbjct: 191 SSMLRWGSSKQKVLGKILWRSFLLTLLGVIVVN-----PNYCLGALSWENLRIPGVLQRL 245
Query: 149 ALSYLLVSLVE-IFTKDVQ-------DKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
L+YL+V+ +E +FT+ V ++ S + W+ + V++L L +
Sbjct: 246 GLTYLVVAALELLFTRAVNISPSLHLMQEMSYPALQDVLPFWPQWIFILTLEVIWLCLTF 305
Query: 201 GTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
VP + D+GK N T G A GYIDR VLG H+Y HP+
Sbjct: 306 LLPVPGCPRGYLGPGGIGDFGKYANCTGG----------AAGYIDRLVLGEKHIYQHPSC 355
Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
T P++PEG+L ++++IL +G+ G +I+ K
Sbjct: 356 NVLYQTT------------------VPYDPEGILGTINTILMAFLGLQAGKIILSYK 394
>gi|449283383|gb|EMC90042.1| Heparan-alpha-glucosaminide N-acetyltransferase [Columba livia]
Length = 560
Score = 115 bits (288), Expect = 3e-23, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 152/317 (47%), Gaps = 44/317 (13%)
Query: 17 SEPDVSDQQEK--SHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLAD 74
++P SD + S QRL SLD FRGL++ +M+ V++ GG + H WNG +AD
Sbjct: 145 ADPISSDPAPQLWSSAPRQRLRSLDTFRGLSLIIMVFVNYGGGKYWFFKHESWNGLTVAD 204
Query: 75 FVMPFFLFIVGV----AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
V P+F+FI+G +++ L++ + + K+++R+ L+ G+++ P+
Sbjct: 205 LVFPWFVFIMGTSISLSLSSMLRQGSSKWKVLGKILWRSFLLILLGVIVVN-----PNYC 259
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWL 186
+ +R+ GVLQR+ L+YL+V+ +E +FT+ + R L W W+
Sbjct: 260 LGPLSWENLRIPGVLQRLGLAYLVVAALELLFTRAGAESGTLETPCPALRDILPYWPQWV 319
Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
+ V++L L + VP + D+G N T G A GYIDR
Sbjct: 320 FVLMLEVLWLCLTFLLPVPGCPRGYLGPGGIGDFGNYANCTGG----------AAGYIDR 369
Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+LG H+Y HP+ S Q + P++PEG+L ++++I +G
Sbjct: 370 LLLGDKHIYQHPS---SNVIYQTT---------------MPYDPEGILGTINTIFMAFLG 411
Query: 306 VHFGHVIIHTKGHLARL 322
+ G +I+ K R+
Sbjct: 412 LQAGKIILFYKDQHKRI 428
>gi|307209305|gb|EFN86390.1| Heparan-alpha-glucosaminide N-acetyltransferase [Harpegnathos
saltator]
Length = 552
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 94/346 (27%), Positives = 168/346 (48%), Gaps = 42/346 (12%)
Query: 3 EIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI 62
++ ++ H + E + S+ ++ R+ S+D FRG+++ LMI V++ GG +
Sbjct: 132 KLSPDSVHDDLDRLQEAESSNPVIRTSRVNTRIRSVDTFRGISILLMIFVNNGGGKYVFF 191
Query: 63 SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
+H+ WNG +AD V+P+F +I+G++I ++ + +++ K+I R L+ F ILL
Sbjct: 192 NHSVWNGLTVADLVLPWFAWIMGLSITISKRSELRVSNSRGKIILRCLQRAFILILLGLM 251
Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR- 179
+ + ++ +R G+LQ +A+SY + + +E IF + QD GRF++ R
Sbjct: 252 LNSIHTK-----SLKDLRFPGILQLLAVSYFVCATIETIFMRAHSQDDLLQFGRFTVLRD 306
Query: 180 -LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD-YGKVFNVTCGVRAKLNPP 236
L W W + + + + + V D + +GK N T G
Sbjct: 307 ILDSWAQWSIIVAIATTHTLITFLLPVLDCPKGYLGPGGYHLFGKNANCTGG-------- 358
Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
A GYIDR V G +HMY+ +P G + P++PEG+++++
Sbjct: 359 --AAGYIDRLVFG-SHMYNK----------THNPVYGTI---------LPYDPEGIMNTI 396
Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQWVT-MGFALLIFGLTLHF 341
S IL +GVH G +++ AR+ +W+ G LI G+ HF
Sbjct: 397 SVILVVYMGVHAGKILLLYYQCNARIVRWLLWSGVTGLIAGILCHF 442
>gi|281209034|gb|EFA83209.1| hypothetical protein PPL_03999 [Polysphondylium pallidum PN500]
Length = 1154
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 89/335 (26%), Positives = 143/335 (42%), Gaps = 68/335 (20%)
Query: 26 EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
E + K R+ SLD+FRGL++ +MI V++ GG + +H+ WNG +AD V P+F+FI+G
Sbjct: 217 ESNDPKKDRMKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFVFIMG 276
Query: 86 VAIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
A+ ++ + R K K++ R++ L G+ L G D++ R+
Sbjct: 277 CAMPMSFNALESRGVPKKTIVIKLVRRSITLFALGMFLNN-----------GNDLQHWRI 325
Query: 142 CGVLQRIALSYLLVSLVEIFTK---------------------DVQDKDQS--VGRFSIF 178
GVLQR +SYL+ L+ +F +QD+ +S F+
Sbjct: 326 LGVLQRFGISYLVTGLIMMFVPVWRYRQLDDLSEEQQPLYGGGSIQDRIRSRYPRMFADI 385
Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPC 237
Y W++A +L V+ + + VP I G+ N T G
Sbjct: 386 LPYWIQWVVALMLLSVWFLVTFLLPVPGCPTGYIGPGGIGSQGQYANCTGG--------- 436
Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
A Y+D K+ G NH+Y P + + ++PEG L ++
Sbjct: 437 -AARYVDLKIFGENHIYQTPTCQT-------------------IYNTGSYDPEGTLGYIT 476
Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
SI +GV G I+ K RL +W G L
Sbjct: 477 SIFMCFLGVQCGRTILAFKKASCRLIRWSIWGVVL 511
>gi|149057830|gb|EDM09073.1| rCG43316 [Rattus norvegicus]
Length = 626
Score = 115 bits (287), Expect = 4e-23, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 152/319 (47%), Gaps = 46/319 (14%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
D + ++ RL +D FRG+A+ LM+ V++ GG + H+ WNG +AD V P+
Sbjct: 216 DCQPETRRASALPHRLRCVDTFRGVALILMVFVNYGGGKYWYFKHSSWNGLTVADLVFPW 275
Query: 80 FLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
F+FI+G +I L+ L+R + + K+ +R+ L+ GI++ P+ +
Sbjct: 276 FVFIMGSSIFLSMTSLLQRGCSKIKLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLS 330
Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFT-KDVQDK---DQSVGRFSIFRLYCWHWLMAACV 191
+R+ GVLQR+ ++Y +V+++E+F K V D ++S WL+ +
Sbjct: 331 WDKVRIPGVLQRLGVTYFVVAMLELFFWKPVPDSCTLERSCLSLRDITSSWPQWLIILIL 390
Query: 192 LVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
++LAL + VP + D GK + T G A GYIDR +LG
Sbjct: 391 ESIWLALTFFLPVPGCPTGYLGPGGIGDLGKYPHCTGG----------AAGYIDRLLLGD 440
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
+H+Y HP S A + ++PEG+L +++SI+ +GV G
Sbjct: 441 SHLYQHP----SSAVLYHT--------------EVAYDPEGVLGTINSIVMAFLGVQAGK 482
Query: 311 VIIH----TKGHLARLKQW 325
++++ TK L R W
Sbjct: 483 ILLYYKDQTKAILIRFAAW 501
>gi|196012186|ref|XP_002115956.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
gi|190581732|gb|EDV21808.1| hypothetical protein TRIADDRAFT_59909 [Trichoplax adhaerens]
Length = 580
Score = 115 bits (287), Expect = 5e-23, Method: Compositional matrix adjust.
Identities = 87/295 (29%), Positives = 140/295 (47%), Gaps = 54/295 (18%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
+E H QR+ ++D FRGL + +MI V+ GG + PWNG AD ++P+F+FI
Sbjct: 238 SEESIHPLAQRIYAVDAFRGLCITIMIFVNSGGGGYWYFRSTPWNGLTFADLILPWFIFI 297
Query: 84 VGVAIALAL--------KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
VG+ IAL+ R+P + AV KV+ R++ L G+ L GV+
Sbjct: 298 VGICIALSFYNHRYITASRLPP-SSAVLKVLSRSVILFLIGLFLND-----------GVN 345
Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC-WH-WLMAACVLV 193
+ R+ G LQ++A+SY++VSL ++ + D +I + C W W+ +L
Sbjct: 346 LSTWRIPGNLQKVAISYIVVSLSVLYL--AKPPDTITNLRAIREIVCIWKIWIGMIGLLS 403
Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
+YL+L++ VP +D +N T G A GYIDR + G NH
Sbjct: 404 IYLSLIFALPVPGCPTGYFGPGGLSDDANHYNCTGG----------ATGYIDRFIFG-NH 452
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
+ +P+ + H PF+ EG LS+++SIL+ +G+
Sbjct: 453 LDANPSCKVLYR------------------THMPFDSEGCLSTLTSILTCFMGLQ 489
>gi|126272886|ref|XP_001369919.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Monodelphis domestica]
Length = 389
Score = 114 bits (285), Expect = 8e-23, Method: Compositional matrix adjust.
Identities = 86/283 (30%), Positives = 136/283 (48%), Gaps = 44/283 (15%)
Query: 44 LAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRA 99
L++ LMI V++ GG + HAPWNG +AD VMP+F+FI+G ++ LA ++ +
Sbjct: 3 LSLTLMIFVNYGGGGYWFFEHAPWNGLTIADLVMPWFVFILGTSVGLAFHVMQRKGVKKF 62
Query: 100 DAVKKVIFRTLKLLFWGIL-LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
+KV +RT L+ G L L G P ++ RL GVLQR+ +Y +V+L+
Sbjct: 63 KLFRKVAWRTGVLIAIGALFLNYGPVDGPLSWSWA------RLPGVLQRLGFTYFIVALM 116
Query: 159 EIFTKDVQDKDQSVG-RFSIFR---LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINK 214
+I + VG ++ FR LY W++ + ++L L + VP +
Sbjct: 117 QIAFGVADMQKYQVGVWWAPFRDIVLYWQEWIIIIGLECIWLCLTFLLPVPGCPRGYLGP 176
Query: 215 DS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
D GK FN T G A YID+ +LG NH+Y P+ + TQ
Sbjct: 177 GGIGDEGKYFNCTGG----------AAAYIDKWILGENHLYRFPSCKELYKTTQ------ 220
Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
PF+PEG+L +++SI+ G+ G +I+ +
Sbjct: 221 ------------PFDPEGILGTINSIIMAFFGLQAGKIILMYR 251
>gi|351712254|gb|EHB15173.1| Heparan-alpha-glucosaminide N-acetyltransferase, partial
[Heterocephalus glaber]
Length = 537
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 85/325 (26%), Positives = 150/325 (46%), Gaps = 49/325 (15%)
Query: 17 SEPDVSDQQEKSHLKT---QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
++P D Q ++ + RL LD FRG+A+ LM+ V++ GG + H+ WNG +A
Sbjct: 180 ADPLTGDPQPEAQCASASGHRLRCLDTFRGIALVLMVFVNYGGGRYWYFKHSSWNGLTVA 239
Query: 74 DFVMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
D V P+F+FI+G ++ L++ + R + K+ +R+ L+ GI++ P+
Sbjct: 240 DLVFPWFVFIMGSSVFLSMTSVLQRGCSKFKLLGKIAWRSFLLICIGIVIVN-----PNY 294
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHW 185
+ +R+ GVLQR+ ++Y +V+++E +F K + + ++S W
Sbjct: 295 CLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPIPENCVLERSCPSLRDITSSWSQW 354
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
L+ + ++L L + VP + D GK N T G A YID
Sbjct: 355 LLILLLEGIWLGLTFLLPVPGCPTGYLGPGGIGDLGKYANCTGG----------AARYID 404
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
+LG +H+Y HP+ P++PEG+L +++SI+ +
Sbjct: 405 HLLLGSDHLYQHPS------------------STVLYHTEVPYDPEGILGTINSIVMAFL 446
Query: 305 GVHFGHVIIHTKGH----LARLKQW 325
GV G +++ KG L R W
Sbjct: 447 GVQAGKILLCYKGQTKDILIRFTAW 471
>gi|432907420|ref|XP_004077635.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Oryzias latipes]
Length = 482
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 88/310 (28%), Positives = 150/310 (48%), Gaps = 46/310 (14%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ RL SLD FRG A+ +M+ V++ GG + HAPWNG +AD VMP+F+FI+G ++ L
Sbjct: 85 RPARLLSLDTFRGFALTVMVFVNYGGGGYWFFEHAPWNGLTVADLVMPWFVFIMGTSVVL 144
Query: 91 AL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
A +R R + K+ +RT+ L+ G +++P + + +R+ GVLQ
Sbjct: 145 AFSSMQRRGVGRRQLLGKITWRTVVLMLLGFCF---LNYSPRDGP--LSWSWLRIPGVLQ 199
Query: 147 RIALSYLLVSLVEIF--TKDV-QDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
R+A +Y ++SL++ F K + + ++ L+ WL+ + ++L + +
Sbjct: 200 RLAFTYFVLSLLQTFWGRKAIPESENHWWNPVQDVVLFWPQWLLIFLLETLWLCITFLMP 259
Query: 204 VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
VP+ + D+G N T G A G IDR + G N MY +P ++
Sbjct: 260 VPNCPTGYLGAGGIGDHGLYPNCTGG----------AAGSIDRWMFGDN-MYRYPTCKKL 308
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK---GH- 318
Q P++PEG+L +++SI+ +G+ G +I+ K GH
Sbjct: 309 YRTEQ------------------PYDPEGVLGTINSIVMGFLGMQAGKIIVFYKRKSGHI 350
Query: 319 LARLKQWVTM 328
L R W +
Sbjct: 351 LWRYLTWAVI 360
>gi|326427923|gb|EGD73493.1| heparan-alpha-glucosaminide N-acetyltransferase [Salpingoeca sp.
ATCC 50818]
Length = 788
Score = 114 bits (284), Expect = 1e-22, Method: Compositional matrix adjust.
Identities = 94/366 (25%), Positives = 162/366 (44%), Gaps = 71/366 (19%)
Query: 1 MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
M + + PL+ S + + + RL SLD FRG+A+ +MI V++ GGD+
Sbjct: 333 MLLLNTQKYTRDPLLSSTHAIGNPKRSK----TRLQSLDSFRGMALTIMIFVNYGGGDYN 388
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWG 116
H+ WNG +AD V P+F++I+G ++A+ + R ++ +++ RTL L G
Sbjct: 389 FFDHSVWNGLTVADLVFPWFIWIMGTSMAITFNSLFKRHTPLRTILYKVARRTLLLFGIG 448
Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF--------------- 161
++ + D+R R+ GVLQR A++YL+V+LV IF
Sbjct: 449 VIF----------INVVHDLRFARVPGVLQRFAIAYLVVALVIIFVPKAVSLLRNVDEVT 498
Query: 162 ------TKDVQD--KDQSVGRFSIFR------LYCWHWLMAACVLVVYLALLYGTYVPDW 207
T V++ D G + R Y W+ ++V++ + + VP
Sbjct: 499 PLIRRLTPTVRNPASDLDPGGCGMLRHLPDVAPYVGEWIAIIVLVVIHTCITFLLPVPGC 558
Query: 208 QFTIINKDSA--DYGKVF--NVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
I A ++G+ N +C V A G++DR +L H+Y P + +
Sbjct: 559 PTGYIGPGGALAEFGQFAPANGSC-VNGTFCCEGGAAGHVDRWLLSWKHIYGSPTSQET- 616
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
+ ++PEG+L S++SIL +G+ G +I+H K AR
Sbjct: 617 ------------------YQTGAYDPEGILGSLTSILICYLGLQSGKIIVHYKAARARSV 658
Query: 324 QWVTMG 329
+W+ G
Sbjct: 659 RWLAWG 664
>gi|320168011|gb|EFW44910.1| hgsnat protein [Capsaspora owczarzaki ATCC 30864]
Length = 800
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 80/297 (26%), Positives = 138/297 (46%), Gaps = 48/297 (16%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
R+ SLD FRG+A+++M+ V++ GG + H+ WNG +AD V P+F+F++GV+++L+ +
Sbjct: 419 RVRSLDTFRGIALSIMLFVNYGGGGYWFFDHSTWNGLTVADLVFPWFIFMMGVSMSLSFE 478
Query: 94 RI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
++ R KVI R++ L G+ L + + R+ GVLQR A
Sbjct: 479 KLRRRGAPRGALFLKVIRRSMTLFALGLFL----------VCRQIIFATWRMPGVLQRFA 528
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
+SYL V+ + +F G F + W++ + ++ + + VP
Sbjct: 529 VSYLFVAAIVMFVPIFATLP---GPFRDLTSHWLQWVVIGIFITIHTCITFLYDVPGCGT 585
Query: 210 TIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
I D+G+ N T G A GYID +V G H+Y P +
Sbjct: 586 GYIGPGGIGDFGQYMNCTGG----------AAGYIDSQVFG-RHIYQAPTAQ-------- 626
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
+ ++PEGLL ++S++ T +G G +++ H ARL++W
Sbjct: 627 -----------AYYLTGAYDPEGLLGCLTSVVITFLGYQAGRILVTFSTHSARLRRW 672
>gi|121489785|emb|CAK18864.1| hypothetical protein [Phillyrea latifolia]
Length = 129
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 52/112 (46%), Positives = 70/112 (62%), Gaps = 2/112 (1%)
Query: 230 RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEP 289
L P CN+ G IDR VLGI+H+Y P +R K C S G + + APSWCHAPF+P
Sbjct: 1 NGDLGPACNSAGMIDRNVLGIDHLYAKPVYRNLKECNISS--HGQVPETAPSWCHAPFDP 58
Query: 290 EGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
EG+LSS+++ +S IIG+ +GH+++ + H RL W FA L GL L F
Sbjct: 59 EGILSSLTAAVSCIIGLQYGHILVRLQDHKERLCNWSIFSFAFLGLGLFLAF 110
>gi|195133238|ref|XP_002011046.1| GI16326 [Drosophila mojavensis]
gi|193907021|gb|EDW05888.1| GI16326 [Drosophila mojavensis]
Length = 570
Score = 113 bits (282), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 99/337 (29%), Positives = 154/337 (45%), Gaps = 49/337 (14%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
+ D K+ + QR+ SLD FRGL++ LMI V+ GG + I HA WNG +LAD V P
Sbjct: 168 SIGDAAAKA-TQRQRMRSLDTFRGLSIVLMIFVNSGGGGYSWIEHAAWNGLHLADIVFPT 226
Query: 80 FLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
FL+I+GV I L++K R + ++++R KL G+ L T G
Sbjct: 227 FLWIMGVCIPLSIKAQLGRGISKPRICLRIVWRACKLFAIGLCLNS---------TNGPQ 277
Query: 136 VRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCWHWLMAACVL 192
+ +RL GVLQR ++YL+ ++ I ++ Q + +I+ L+ + ++
Sbjct: 278 LEQLRLMGVLQRFGIAYLVAGVLHTICSRRDYLSPQRAWQRAIYDICLFSGELAVLLALI 337
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC--NAVGYIDRKVLGI 250
YL L +G VP + GK N +P C A GY+DR +LG
Sbjct: 338 AAYLGLTFGLRVPGCPRGYLGPG----GKHNNAA-------DPNCIGGAAGYVDRLILGN 386
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
H+Y HP DA + F+PEG+ + SI+ ++G G
Sbjct: 387 AHIYQHPT--------------AKFVYDASA-----FDPEGVFGCLLSIVQAMLGCFAGV 427
Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHG 347
++ ARL++W+ L + G L + EHG
Sbjct: 428 TLLVHVTWQARLRRWLLGATLLGVLGGALCGFSKEHG 464
>gi|332027964|gb|EGI68015.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acromyrmex
echinatior]
Length = 569
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 96/350 (27%), Positives = 174/350 (49%), Gaps = 50/350 (14%)
Query: 3 EIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI 62
++ + H + E + ++ +++ + R+ S+D FRG+++ LMI V++ GG +
Sbjct: 149 KLSPDNVHDDLDELQEAETANIMIRTNRSSIRIRSVDTFRGISILLMIFVNNGGGQYMFF 208
Query: 63 SHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGIL 118
+H+ WNG +AD V+P+F +I+G++I ++ L+ R + + + RT+ L+ G++
Sbjct: 209 NHSAWNGLTVADLVLPWFAWIMGLSITISKRSELRVSNSRGKIIVRCLQRTIILILLGLM 268
Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKD-VQDKDQSVGRFS 176
L ++ + D+L R GVLQ +A+SY + + +E IF K QD GRF+
Sbjct: 269 LNSIYAKSLDDL---------RFPGVLQLLAVSYFICATIETIFMKTHPQDDVLQFGRFT 319
Query: 177 IFR--LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAK 232
+ R L W WL+ ++ ++ + + VP+ + +G+ N T G
Sbjct: 320 VLRDILNNWAQWLIILAIMTTHILITFLLPVPNCPTGYLGPGGYHHFGEFANCTGG---- 375
Query: 233 LNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGL 292
A GYIDR V G +HMY TQ+ P G + P +PEG+
Sbjct: 376 ------AAGYIDRLVFG-SHMYSK---------TQN-PVYGTI---------LPHDPEGI 409
Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIFGLTLHF 341
++++S IL +GVH G +++ A++ +W+ F LI G+ F
Sbjct: 410 MNTISIILVVYLGVHAGKILLLYYQCNAKVIRWLLWSFVTGLIAGILCDF 459
>gi|392968994|ref|ZP_10334410.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
BUZ 3]
gi|387843356|emb|CCH56464.1| Protein of unknown function DUF2261, transmembrane [Fibrisoma limi
BUZ 3]
Length = 390
Score = 112 bits (281), Expect = 2e-22, Method: Compositional matrix adjust.
Identities = 91/292 (31%), Positives = 130/292 (44%), Gaps = 79/292 (27%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVM 77
S + +S QRL SLD FRGL VA MILV++AG DW + HA WNG D +
Sbjct: 9 SVRSSESLTNPQRLLSLDAFRGLTVAGMILVNNAG-DWQYVYAPLKHAAWNGWTPTDLIF 67
Query: 78 PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
PFFLFIVGV+I AL + + + K++ R+ L G+ L + P D+
Sbjct: 68 PFFLFIVGVSITFALAGGQEHTNVIGKILKRSFTLFMLGLFL----AFFPK-----FDIT 118
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
+R+ GVLQRIAL YL SL+ + T Q WL+AA ++ +L
Sbjct: 119 TVRIPGVLQRIALVYLACSLIYLRTTTRQQT----------------WLLAALLVGYWLV 162
Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
+ T++ Y A L P N ++DR VL +H+Y
Sbjct: 163 M-----------TVVPVPGVGY-----------ANLEPTTNLAAWLDRTVLTTDHLY--- 197
Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
R +K ++PEGLLS++ +I + + GV G
Sbjct: 198 --RSTKV----------------------WDPEGLLSTIPAIGTGLAGVLVG 225
>gi|186683151|ref|YP_001866347.1| hypothetical protein Npun_R2871 [Nostoc punctiforme PCC 73102]
gi|186465603|gb|ACC81404.1| conserved hypothetical protein [Nostoc punctiforme PCC 73102]
Length = 375
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 134/287 (46%), Gaps = 73/287 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MILV+ AG +P ++HA W+GC D V PFFLFIVGVA+
Sbjct: 1 MRLTSLDVFRGITIAAMILVNMAGVADNIYPPLAHADWHGCTPTDLVFPFFLFIVGVAMT 60
Query: 90 LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + + + +R L+ L G+LL G ++ + D+ IR+ GVL
Sbjct: 61 FSLSKYTEDNKPTSAIYWRILRRAAILFALGLLLNGFWNQG----VWTFDLSSIRIMGVL 116
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI+++YLL SL+ + ++ K Q W++AA +L+ Y ++ VP
Sbjct: 117 QRISITYLLASLIVL---NLPRKGQ--------------WILAAVILIGYWLMMMYLPVP 159
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
D+ ++ ++ N YIDR ++ H+Y ++
Sbjct: 160 DYGAGVLTREG---------------------NLGAYIDRMIIPKAHLYKGDGFK----- 193
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
F G +PEGL S++ +I+S + G G I
Sbjct: 194 -----FMG--------------DPEGLFSTIPAIVSVLAGYFTGQWI 221
>gi|195340719|ref|XP_002036960.1| GM12376 [Drosophila sechellia]
gi|194131076|gb|EDW53119.1| GM12376 [Drosophila sechellia]
Length = 576
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 86/319 (26%), Positives = 142/319 (44%), Gaps = 50/319 (15%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
+ D + + +RL SLD FRGL++ LMI V+ GG + I HA WNG +LAD V
Sbjct: 168 AAADSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADIV 227
Query: 77 MPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
P FL+I+GV I L++K R +A +++ R++KL G+ L
Sbjct: 228 FPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILVRSIKLFVIGLCLNS---------MS 278
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAA 189
G ++ +R+ GVLQR ++YL+V ++ ++ +S R L+ +
Sbjct: 279 GPNLEQLRVMGVLQRFGVAYLVVGVLHTLCCRREPISPQRSWQRAVHDVCLFSGELAVLL 338
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPC--NAVGYIDRK 246
++ YL L +G VP + DY +P C A GY D +
Sbjct: 339 ALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNA------------HPKCIGGAAGYADLQ 386
Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
VLG H+Y HP + T F+PEG+ + S++ ++G
Sbjct: 387 VLGNAHIYQHPTAKYVYDSTA-------------------FDPEGIFGCILSVVQVLLGA 427
Query: 307 HFGHVIIHTKGHLARLKQW 325
G ++ + +R+++W
Sbjct: 428 FAGVTLLVHPNYQSRIRRW 446
>gi|91078976|ref|XP_974454.1| PREDICTED: similar to heparan-alpha-glucosaminide
N-acetyltransferase [Tribolium castaneum]
gi|270003686|gb|EFA00134.1| hypothetical protein TcasGA2_TC002950 [Tribolium castaneum]
Length = 566
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 87/313 (27%), Positives = 150/313 (47%), Gaps = 37/313 (11%)
Query: 19 PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
P V ++ R+ S+D+FRG + +MI V++ GG + SH+ WNG +AD V P
Sbjct: 166 PLVIERTPSIRKHPHRIKSIDVFRGFCIMIMIFVNYGGGKYWFFSHSVWNGLTVADLVFP 225
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
+FL+++GV+ A++L+ RA ++++ ++ F ILL + + T G
Sbjct: 226 WFLWLMGVSFAVSLQAKLRRAVPRRQLVIGVMRRSFILILLGIIINSNQNLQTIG----S 281
Query: 139 IRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
+R GVLQRI + Y +V ++E IFTK + + +SV + WL ++V++
Sbjct: 282 LRFPGVLQRIGVCYFIVGMLEIIFTK--RSEVESVSCIYDVAVAWPQWLCVTVLVVIHTC 339
Query: 198 LLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
+ + VP + D G+ +N T GV GYIDR+V G HM+ +
Sbjct: 340 VTFLGDVPGCGRGYLGPGGLDDNGRFYNCTGGV----------AGYIDRQVFG-EHMHKN 388
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
P ++ F+PEG+L +++S+L+ GV G + +
Sbjct: 389 PVCKKLYE------------------IDVYFDPEGILGTLTSVLTVYFGVQAGRTLNTYQ 430
Query: 317 GHLARLKQWVTMG 329
A++ +WV G
Sbjct: 431 NVKAKVIRWVVWG 443
>gi|66826507|ref|XP_646608.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
gi|60474509|gb|EAL72446.1| hypothetical protein DDB_G0270192 [Dictyostelium discoideum AX4]
Length = 426
Score = 112 bits (280), Expect = 3e-22, Method: Compositional matrix adjust.
Identities = 95/318 (29%), Positives = 137/318 (43%), Gaps = 78/318 (24%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+R+ SLD RGL + MILVD+ G+ WP ++ WNG + AD + P F+FI G +IA
Sbjct: 44 RRMGSLDAVRGLTIFGMILVDNQAGNDVIWP-LNETEWNGLSTADLIFPSFIFISGFSIA 102
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
LALK + +I RTL L F +Q + D + R+ GVLQRIA
Sbjct: 103 LALKNSKNTTSTWYGIIRRTLLLFF----IQCFLNLMGDHFNFTT----FRIMGVLQRIA 154
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
+ Y L S F IF L V V Y++++Y VP
Sbjct: 155 ICYFFSCL-------------SFLCFPIFL----QRLFLLSVTVTYISIMYALNVPK--- 194
Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
CG RA L CNA YID KV G+N M ++S
Sbjct: 195 -----------------CG-RANLTQNCNAGAYIDSKVFGLNIM-------------KES 223
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-----HTKGHLARLKQ 324
GP D PEGL+S++SS ++ +G+ FG + H G+ + +
Sbjct: 224 NLNGPYYND----------PEGLISTMSSFITAWMGLEFGRIFTRFYKKHDFGNTDIIVR 273
Query: 325 WVTMGFALLIFGLTLHFT 342
W+ + ++ ++L T
Sbjct: 274 WILLVILFMVPAISLGAT 291
>gi|296222155|ref|XP_002757067.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
[Callithrix jacchus]
Length = 516
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 139/283 (49%), Gaps = 42/283 (14%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G ++ L++
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGTSVFLSMT 326
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 327 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 381
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 382 VTYFVVAVLELLFAKPVPEHCTSERSCLSLRDITSSWPQWLLILALEGLWLGLTFLLPVP 441
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 442 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 487
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
+ ++PEG+L +++SIL +GV
Sbjct: 488 VLYHT--------------EVAYDPEGILGTINSILMAFLGVQ 516
>gi|340727561|ref|XP_003402110.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Bombus terrestris]
Length = 571
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 88/305 (28%), Positives = 149/305 (48%), Gaps = 50/305 (16%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
+ R+ S+D FRG+ + LMI V++ GG + +H+ W G ++AD ++P+F +I+G++I ++
Sbjct: 179 SSRIQSVDAFRGITILLMIFVNNGGGKYVFFNHSAWFGLSVADLILPWFAWIMGMSITIS 238
Query: 92 ----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
L+ R + R+ L+ G++L S + ++L R GVLQ
Sbjct: 239 KRAELRLTTSRVKITLCCLRRSAILILLGLMLNSIDSKSLNDL---------RFPGVLQL 289
Query: 148 IALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGT 202
+++SY + +++E IF K QD GRF+IFR L W WL+ A ++ + + +
Sbjct: 290 LSVSYFVCAILETIFMKPHSQDILLQFGRFAIFRDILDSWPQWLIMAGIMTTHTLITFFL 349
Query: 203 YVPDWQFTIINKDSADY--GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
++P+ + GK N T G A GYIDR + G NH Y
Sbjct: 350 HIPNCPTGYFGPGGKYHYRGKYMNCTAG----------AAGYIDRLIFG-NHTYSR---- 394
Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLA 320
+S + LR D PEGL++++S+I +GVH G +++ A
Sbjct: 395 -----ITNSIYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKILLLYYQGNA 439
Query: 321 RLKQW 325
RL +W
Sbjct: 440 RLIRW 444
>gi|348577435|ref|XP_003474490.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Cavia porcellus]
Length = 638
Score = 112 bits (279), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 92/309 (29%), Positives = 152/309 (49%), Gaps = 50/309 (16%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA-- 91
RL LD FRG+A+ LM+ V++ GG + H+ WNG +AD V P+F+FI+G ++ L+
Sbjct: 238 RLRCLDTFRGIALILMVFVNYGGGRYWYFRHSSWNGLTVADLVFPWFVFIMGSSVFLSVT 297
Query: 92 --LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
L+R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 298 SVLQRGCSKLKLLGKIAWRSFLLICIGIVIVN-----PNYCLGPLSWDKVRIPGVLQRLG 352
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR-LYC-W-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +FTK V + S RF R + C W WL+ + ++L L + VP
Sbjct: 353 VTYFVVAVLELLFTKPVHENCVSDRRFPFLRDITCSWPQWLLILLLESLWLGLTFLLPVP 412
Query: 206 DWQFT-----IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
+ + D GK N T G A GYID +LG +H+Y HP
Sbjct: 413 GCPYVSEPGYLGPGGIGDLGKYVNCTGG----------AAGYIDHLLLGSDHLYQHP--- 459
Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TK 316
S A + ++PEG+L +++SI+ +GV G ++++ TK
Sbjct: 460 -SSAVLYHT--------------KVAYDPEGILGTINSIVMAFLGVQAGKILLYYKDQTK 504
Query: 317 GHLARLKQW 325
L R W
Sbjct: 505 DILIRFTAW 513
>gi|426256612|ref|XP_004021932.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Ovis
aries]
Length = 641
Score = 111 bits (278), Expect = 4e-22, Method: Compositional matrix adjust.
Identities = 89/324 (27%), Positives = 152/324 (46%), Gaps = 47/324 (14%)
Query: 17 SEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
S SD Q ++ ++ RL +D FRG+A+ LM+ V++ GG + H+ WNG +A
Sbjct: 225 SPSRASDPQPEAWRRSAAPLRLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVA 284
Query: 74 DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
D V P+F+FI+G +I L++ I R + + K+++R+ L+ GI F P+
Sbjct: 285 DLVFPWFVFIMGASIFLSMASILQRGCSKLRLLGKIVWRSFLLICIGI-----FVVNPNY 339
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR--FSIFRLYC-W-HW 185
+ R+ GVLQR+ +Y +V+++E+ + ++ R FS+ + W W
Sbjct: 340 CLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPETCALERSCFSLLDITASWPQW 399
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
L + V+LAL + VP + G + G A GY+DR
Sbjct: 400 LFVLILEGVWLALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTG---------GAAGYVDR 450
Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+LG H+Y HP S A + ++PEG+L +++SI+ +G
Sbjct: 451 LLLGDQHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLG 492
Query: 306 VHFGHVIIH----TKGHLARLKQW 325
V G ++++ T+G L R W
Sbjct: 493 VQAGKILLYYKDQTRGILIRFAAW 516
>gi|118378164|ref|XP_001022258.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila]
gi|89304025|gb|EAS02013.1| hypothetical protein TTHERM_00500990 [Tetrahymena thermophila
SB210]
Length = 827
Score = 111 bits (278), Expect = 5e-22, Method: Compositional matrix adjust.
Identities = 83/319 (26%), Positives = 134/319 (42%), Gaps = 80/319 (25%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMP 78
D Q+ + QRL LDI+RGL + MILVD+ G WP + WNG + AD V P
Sbjct: 450 KDIQQPAAAPKQRLECLDIYRGLTMVGMILVDNMGNSSVIWP-LDETEWNGLSTADCVFP 508
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
FLFI G+AI LA+K ++ +++ R +KL G+ L ++ +
Sbjct: 509 SFLFISGMAITLAIKHNGNKKQQFFRILERFVKLFVIGVALNAACANYKQQF-------- 560
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
R+ GVLQRIA+ Y + S +F ++ + +++ L++Y+
Sbjct: 561 -RIMGVLQRIAICYFVTSTSYLFLQN----------------FAVQFVLNGVFLLIYIYF 603
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
+Y VPD CG + P CN Y+D ++ +N+M
Sbjct: 604 MYFFDVPD-------------------GCGAN-NVTPTCNFGRYLDMQIFTLNYM----- 638
Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
P +PEGL +++ ++++T IG+ +G + K
Sbjct: 639 -------------------------MKPSDPEGLFTTLGALVTTFIGLCYGLALQEFKSQ 673
Query: 319 LARLK-QWVTMGFALLIFG 336
RL W M L+ G
Sbjct: 674 KKRLSCIWFVMSLVLVFIG 692
>gi|24639786|ref|NP_572198.1| CG6903 [Drosophila melanogaster]
gi|7290544|gb|AAF45996.1| CG6903 [Drosophila melanogaster]
gi|21483396|gb|AAM52673.1| LD22376p [Drosophila melanogaster]
Length = 576
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 86/320 (26%), Positives = 144/320 (45%), Gaps = 50/320 (15%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
+ D + + +RL SLD FRG+++ LMI V+ GG + I HA WNG +LAD V
Sbjct: 168 AAADSIGEAATKATQRKRLRSLDTFRGISIVLMIFVNSGGGGYAWIEHAAWNGLHLADVV 227
Query: 77 MPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
P FL+I+GV I L++K R +A ++++R++KL G+ L
Sbjct: 228 FPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILWRSIKLFVIGLCLNS---------MS 278
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAA 189
G ++ +R+ GVLQR ++YL+V+++ ++ +S R L+ +
Sbjct: 279 GPNLEQLRIMGVLQRFGVAYLVVAILHTLCCRREPISPQRSWQRAVHDVCLFSGELAVLL 338
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPC--NAVGYIDRK 246
++ YL L +G VP + DY +P C A GY D +
Sbjct: 339 ALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNA------------HPKCIGGAAGYADLQ 386
Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
VLG H+Y HP + DS F+PEG+ + S++ ++G
Sbjct: 387 VLGNAHIYQHP----TAKYVYDS---------------TAFDPEGIFGCILSVVQVLLGA 427
Query: 307 HFGHVIIHTKGHLARLKQWV 326
G ++ +R+++W
Sbjct: 428 FAGVTLLVHPNFQSRIRRWT 447
>gi|297491309|ref|XP_002698775.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
taurus]
gi|296472360|tpg|DAA14475.1| TPA: Heparan-alpha-glucosaminide N-acetyltransferase-like [Bos
taurus]
Length = 723
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 151/324 (46%), Gaps = 47/324 (14%)
Query: 17 SEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
S SD Q ++ ++ RL +D FRG+A+ LM+ V++ GG + H+ WNG +A
Sbjct: 307 SPSRASDPQPEAWRRSAAPLRLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVA 366
Query: 74 DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
D V P+F+FI+G +I L++ I R + + K+ +R+ L+ GI F P
Sbjct: 367 DLVFPWFVFIMGTSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGI-----FVVNPKY 421
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR-FSIFRLYC-W-HW 185
+ R+ GVLQR+ +Y +V+++E +F K V + S FS+ + W W
Sbjct: 422 CLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPETCASERSCFSLLDITASWPQW 481
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
L + V+LAL + VP + G + G A GY+DR
Sbjct: 482 LFVLILEGVWLALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTG---------GAAGYVDR 532
Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+LG H+Y HP S A + ++PEG+L +++SI+ +G
Sbjct: 533 LLLGDQHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLG 574
Query: 306 VHFGHVIIH----TKGHLARLKQW 325
V G ++++ T+G L R W
Sbjct: 575 VQAGKILLYYKDQTRGILIRFAAW 598
>gi|328790778|ref|XP_623715.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Apis mellifera]
Length = 572
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 151/312 (48%), Gaps = 64/312 (20%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA 91
+ R+ S+D FRG+A+ LMI V++ GG + +H+ W G ++AD V+P+F +I+G+ I ++
Sbjct: 180 STRIHSVDTFRGIAILLMIFVNNGGGKYIFFNHSAWFGLSIADLVLPWFAWIMGLMITVS 239
Query: 92 ----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
L+ R + R+ L+F G++L S + +L R GVLQ
Sbjct: 240 KRTELRLTTSRIKITLYCLRRSAILIFLGLMLNSKDSESLHDL---------RFPGVLQL 290
Query: 148 IALSYLLVSLVE-IFTK-DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLY-- 200
+ +SY + +++E IF K QD GRF++FR L W WL+ A ++ + + +
Sbjct: 291 LGVSYFVCAILETIFMKPHSQDILHQFGRFAMFRDILESWPQWLIMAGIVTTHTLITFLL 350
Query: 201 -------GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
G + P ++ GK N T G A GYIDR + G NH
Sbjct: 351 PISNCPKGYFGPGGEYHF-------RGKYINCTAG----------AAGYIDRLIFG-NHT 392
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
Y+H T++ + LR D PEGL++++S+I +GVH G +++
Sbjct: 393 YNH---------TENFLYGQILRYD----------PEGLMNTISAIFIVYLGVHAGKILL 433
Query: 314 HTKGHLARLKQW 325
+R+ +W
Sbjct: 434 LYYQCNSRVIRW 445
>gi|194679266|ref|XP_588978.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Bos
taurus]
Length = 734
Score = 111 bits (277), Expect = 6e-22, Method: Compositional matrix adjust.
Identities = 92/324 (28%), Positives = 151/324 (46%), Gaps = 47/324 (14%)
Query: 17 SEPDVSDQQEKSHLKTQ---RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLA 73
S SD Q ++ ++ RL +D FRG+A+ LM+ V++ GG + H+ WNG +A
Sbjct: 318 SPSRASDPQPEAWRRSAAPLRLRCVDTFRGMALILMVFVNYGGGKYWYFKHSSWNGLTVA 377
Query: 74 DFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDE 129
D V P+F+FI+G +I L++ I R + + K+ +R+ L+ GI F P
Sbjct: 378 DLVFPWFVFIMGTSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGI-----FVVNPKY 432
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR-FSIFRLYC-W-HW 185
+ R+ GVLQR+ +Y +V+++E +F K V + S FS+ + W W
Sbjct: 433 CLGPLSWEKARIPGVLQRLGATYFVVAVLELLFAKPVPETCASERSCFSLLDITASWPQW 492
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
L + V+LAL + VP + G + G A GY+DR
Sbjct: 493 LFVLILEGVWLALTFFLPVPGCPTGYLGPGGIGDGGRYRNCTG---------GAAGYVDR 543
Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+LG H+Y HP S A + ++PEG+L +++SI+ +G
Sbjct: 544 LLLGDQHLYQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLG 585
Query: 306 VHFGHVIIH----TKGHLARLKQW 325
V G ++++ T+G L R W
Sbjct: 586 VQAGKILLYYKDQTRGILIRFAAW 609
>gi|397505551|ref|XP_003823320.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase isoform
2 [Pan paniscus]
Length = 622
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 139/283 (49%), Gaps = 42/283 (14%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG +AD V P+F+FI+G +I L++
Sbjct: 189 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMT 248
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 249 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 303
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 304 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 363
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 364 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 409
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
+ ++PEG+L +++SI+ +GV
Sbjct: 410 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQ 438
>gi|194888520|ref|XP_001976930.1| GG18736 [Drosophila erecta]
gi|190648579|gb|EDV45857.1| GG18736 [Drosophila erecta]
Length = 576
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 143/320 (44%), Gaps = 50/320 (15%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
+ D + + +RL SLD FRGL++ LMI V+ GG + I HA WNG +LAD V
Sbjct: 168 AAADSIGEAATKATQRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADVV 227
Query: 77 MPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
P FL+I+GV I L++K R +A ++++R++KL G+ L
Sbjct: 228 FPSFLWIMGVCIPLSVKSQLSRGSSKARICLRILWRSIKLFVIGLCLNS---------MS 278
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFT--KDVQDKDQSVGR-FSIFRLYCWHWLMAA 189
G ++ +R GVLQR ++YL+V ++ ++ +S R L+ +
Sbjct: 279 GPNLEQLRFMGVLQRFGVAYLVVGVLHTLCCRREPISPQRSWQRAVHDVCLFSGELAVLL 338
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPC--NAVGYIDRK 246
++ YL L +G VP + DY +P C A GY D +
Sbjct: 339 ALVATYLGLTFGLRVPGCPRGYLGPGGKHDYNA------------HPHCIGGAAGYADLQ 386
Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
VLG H+Y HP + DS F+PEG+ + S++ ++G
Sbjct: 387 VLGNAHIYQHP----TAKYVYDS---------------TAFDPEGVFGCILSVVQALLGA 427
Query: 307 HFGHVIIHTKGHLARLKQWV 326
G ++ +R+++W+
Sbjct: 428 FAGVTLLVHPNWQSRMRRWM 447
>gi|345481194|ref|XP_001603332.2| PREDICTED: LOW QUALITY PROTEIN: heparan-alpha-glucosaminide
N-acetyltransferase-like [Nasonia vitripennis]
Length = 570
Score = 110 bits (274), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 151/329 (45%), Gaps = 48/329 (14%)
Query: 5 KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
K +H+ + E + + + RL +LD FRG+AV LMI V++ GG++ ++H
Sbjct: 154 KHAESHNDIDRLQESESTPEMVAVSKTAMRLQALDAFRGIAVLLMIFVNNGGGEYVFLNH 213
Query: 65 APWNGCNLADFVMPFFLFIVGVAIALALK---RIP-DRADAVKKVIFRTLKLLFWGILLQ 120
A WNG +AD V+P+F + +G I +++ R+ R + + RT+ L+ +G+ +
Sbjct: 214 AAWNGLTVADLVLPWFAWAMGFTIVNSVRVHLRVSVSRTRLIIMQLRRTVLLILFGLFIN 273
Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR- 179
+ EL R GVLQ +A++Y + S++E Q Q GRF +
Sbjct: 274 SQHNSTLSEL---------RFPGVLQLLAVAYFICSVIETCLASPQRTFQ-FGRFVFLQD 323
Query: 180 -LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPP 236
L W W++ +++V+ + + +VP + YG N T G
Sbjct: 324 ILERWTQWMVVLVIILVHTCITFFLHVPGCPRGYLGPGGYHHYGLNVNCTGG-------- 375
Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
A GYIDR + G HMY +P GP P +PEGL++++
Sbjct: 376 --AAGYIDRLIFG-QHMYQKTM----------NPVYGPT---------LPHDPEGLMNTI 413
Query: 297 SSILSTIIGVHFGHVIIHTKGHLARLKQW 325
S++L +GV G + + +R+ +W
Sbjct: 414 SAVLIVFMGVQAGRIFVTYYQANSRIIRW 442
>gi|410926267|ref|XP_003976600.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
partial [Takifugu rubripes]
Length = 497
Score = 108 bits (270), Expect = 4e-21, Method: Compositional matrix adjust.
Identities = 74/236 (31%), Positives = 123/236 (52%), Gaps = 22/236 (9%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
++RL SLD FRG+++ +M+ V++ GG + H WNG +AD V P+F+FI+G +IAL
Sbjct: 278 SSKRLQSLDTFRGISLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPWFMFIMGTSIAL 337
Query: 91 A----LKRIPDRADAVKKVIFRTLKLLFWGI-LLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+ L+ R + KV +R+L+L G+ ++ + P L++G +R+ GVL
Sbjct: 338 SVHALLRTGSTRLSLLGKVAWRSLQLFLIGLFIINPNYCQGP--LSWGT----LRIPGVL 391
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QR+AL+YL+V+ +++ + + + LY W+ + V+L L + VP
Sbjct: 392 QRLALAYLVVACLDLLVARAHLEIYTTVSSTDVLLYWPAWVCVLLLESVWLFLTFLLPVP 451
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
D + D G N T G A G+IDR +LG H+Y +P+ R
Sbjct: 452 DCPTGYLGPGGIGDMGLFPNCTGG----------AAGFIDRWLLGEKHIYQNPSSR 497
>gi|383859754|ref|XP_003705357.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Megachile rotundata]
Length = 572
Score = 108 bits (269), Expect = 5e-21, Method: Compositional matrix adjust.
Identities = 96/331 (29%), Positives = 162/331 (48%), Gaps = 60/331 (18%)
Query: 17 SEPDVSDQQEKSHLKTQ----------RLASLDIFRGLAVALMILVDHAGGDWPEISHAP 66
+ PD D+ +++ T R+ S+D FRG+A+ LMI V++ GG + +H+
Sbjct: 155 NAPDDLDRLQETESTTHPVIRTTRASTRIRSVDTFRGIAILLMIFVNNGGGKYVFFNHSA 214
Query: 67 WNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
W G +AD V+P+F +I+G+ I ++ L+ R + I R+L L+ G++L
Sbjct: 215 WYGLTVADLVLPWFAWIMGLTITISKRAELRVTVSRVKIMLHCIRRSLVLILLGLMLNSI 274
Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKD-VQDKDQSVGRFSIFR- 179
+++ +L R GVLQ + +SY + S++E IF K QD GRF+ FR
Sbjct: 275 KNNSFSDL---------RFPGVLQLLGVSYFVCSMLETIFMKPHSQDTLLQFGRFASFRD 325
Query: 180 -LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DY-GKVFNVTCGVRAKLNP 235
L W WL+ A ++ + + + VP+ +Y GK N T G
Sbjct: 326 ILDSWPQWLVMAVIMTTHTLITFLLPVPNCPKGYFGPGGQYEYRGKYMNCTAG------- 378
Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
A GYIDR + G NHMY P ++S + LR ++PEGL+++
Sbjct: 379 ---AAGYIDRLIFG-NHMYPKP---------KESIYGDILR----------YDPEGLMNT 415
Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
+S+I +GVH G +++ + +R+ +W+
Sbjct: 416 ISAIFIVYLGVHAGKILLLYYQYNSRVIRWI 446
>gi|116789271|gb|ABK25182.1| unknown [Picea sitchensis]
Length = 124
Score = 107 bits (268), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 48/91 (52%), Positives = 66/91 (72%)
Query: 5 KAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISH 64
+ TT +I+ + + + ++ R+A+LD+FRGL +A+MILVD AGG WP+I+H
Sbjct: 11 QPSTTESSNVIVIQDGQTIPAKPTNETKTRVATLDVFRGLTIAVMILVDDAGGKWPQINH 70
Query: 65 APWNGCNLADFVMPFFLFIVGVAIALALKRI 95
+PWNGC LADFVMPFFLFIVGVA+AL K +
Sbjct: 71 SPWNGCTLADFVMPFFLFIVGVAVALTFKVV 101
>gi|327275365|ref|XP_003222444.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Anolis carolinensis]
Length = 632
Score = 107 bits (268), Expect = 8e-21, Method: Compositional matrix adjust.
Identities = 83/293 (28%), Positives = 139/293 (47%), Gaps = 42/293 (14%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGV----AI 88
RL SLD FRGLA+ +M+ V++ GG + H WNG +AD V P+F+FI+G ++
Sbjct: 235 HRLRSLDTFRGLALIIMVFVNYGGGKYWFFKHQSWNGLTVADLVFPWFVFIMGTSISLSL 294
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+ L+R + + K+++R+L L G+++ P+ + +R+ GVLQR+
Sbjct: 295 SSMLRRGCSKWKLLGKILWRSLLLFLIGVIIVN-----PNYCLGPLSWENLRIPGVLQRL 349
Query: 149 ALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYV 204
+ +Y +V+++E +F K V D R L W WL + V+L L + V
Sbjct: 350 SCTYFVVAVLELLFAKPVPDNSTLEIPCPALRDILPYWPQWLFMMALETVWLCLTFLLNV 409
Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
P + D+G N T G A YID +LG H+Y HP+ S
Sbjct: 410 PGCPNGYLGPGGIGDFGNYPNCTGG----------AAAYIDHVLLGEKHIYQHPS---SN 456
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
Q + F+PEG+L +++S++ +G+ G +++ K
Sbjct: 457 VLYQTT---------------VAFDPEGILGTINSVIMAFLGLQAGKILLFYK 494
>gi|282898832|ref|ZP_06306819.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
gi|281196359|gb|EFA71269.1| conserved hypothetical protein [Cylindrospermopsis raciborskii
CS-505]
Length = 375
Score = 107 bits (267), Expect = 9e-21, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 129/287 (44%), Gaps = 73/287 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRGL +A+MI+ + AG +P +SHAPWNGC D V PFFLFIVGVA++
Sbjct: 1 MRLISLDVFRGLTIAMMIIANMAGVVPDVYPFLSHAPWNGCTPTDLVFPFFLFIVGVAMS 60
Query: 90 LALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + + V F R + L G+LL G ++ D++ +R+ GVL
Sbjct: 61 FSLSKYSLESKLDNLVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVL 116
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI L+YL SL+ + + +K Q W +A +L+ Y + VP
Sbjct: 117 QRIGLAYLFASLIVL---KLPEKTQ--------------WALAGILLIFYWLTMMYIPVP 159
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
D+ ++ ++ N +IDR ++ H+Y +
Sbjct: 160 DYGAGMLTREG---------------------NFGAFIDRLIIAKPHLYAGDGFN----- 193
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
F G +PEGL S++ +I++ + G G I
Sbjct: 194 -----FRG--------------DPEGLFSTIPAIVNVLFGYFAGQWI 221
>gi|428306334|ref|YP_007143159.1| heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
epipsammum PCC 9333]
gi|428247869|gb|AFZ13649.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crinalium
epipsammum PCC 9333]
Length = 371
Score = 107 bits (267), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 84/286 (29%), Positives = 132/286 (46%), Gaps = 79/286 (27%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG-GD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+A+A MILV+ AG D +P ++HA WNG AD V PFFLFI+GVA+A
Sbjct: 1 MRLTSLDVFRGMAIAGMILVNKAGVADQVYPALAHADWNGWTFADLVFPFFLFIIGVAMA 60
Query: 90 LALKRIPDRADAVKKVIF-----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
+ + + + K ++ R+ L G+LL G +++ D IR+ GV
Sbjct: 61 FSFAKYTEGDNKPTKQLYLRILRRSAILFILGLLLNGFWNY---------DFSTIRVMGV 111
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRI+++YLL SL + + K Q W +AA +L+ Y ++ V
Sbjct: 112 LQRISVAYLLASLAVL---TLPKKGQ--------------WALAAVLLIGYWLIMSFVPV 154
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
P + ++ ++ N YIDR ++G H+Y +
Sbjct: 155 PGYGAGVLTREG---------------------NFGAYIDRLIIGAAHLYKGDNY----- 188
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++ +PEGL SS+ +++S +IG G
Sbjct: 189 -------------------NSLGDPEGLFSSLPAVVSVLIGYFTGE 215
>gi|17229379|ref|NP_485927.1| hypothetical protein all1887 [Nostoc sp. PCC 7120]
gi|17130977|dbj|BAB73586.1| all1887 [Nostoc sp. PCC 7120]
Length = 375
Score = 106 bits (265), Expect = 1e-20, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 130/287 (45%), Gaps = 73/287 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MILV+ AG +P ++HA W+GC D V PFFLFIVGVA++
Sbjct: 1 MRLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMS 60
Query: 90 LALKRIPDRADAVKKV---IFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + V IFR +LF G+LL G ++ + D+ IR+ GVL
Sbjct: 61 FSLSKYTQENKPTSVVYWRIFRRAAILFVLGLLLNGFWNKG----IWTFDLSNIRIMGVL 116
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI+LSYL SL + ++ K Q W++A +LV Y + VP
Sbjct: 117 QRISLSYLFASLTVL---NLPRKGQ--------------WILAGVLLVGYWLTMMYVPVP 159
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
D+ ++ ++ N YIDR ++ +H+Y ++
Sbjct: 160 DYGAGVLTREG---------------------NFGAYIDRLIIPKSHLYAGDGFKNLG-- 196
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+PEGL S++ +I+S + G G I
Sbjct: 197 ----------------------DPEGLFSTIPAIVSVLAGYFTGEWI 221
>gi|328870644|gb|EGG19017.1| hypothetical protein DFA_02260 [Dictyostelium fasciculatum]
Length = 759
Score = 106 bits (265), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 100/365 (27%), Positives = 143/365 (39%), Gaps = 101/365 (27%)
Query: 1 MSEIKAETTHHHPLI-----ISEPDVSDQQEKSHLKT----------QRLASLDIFRGLA 45
+ I T PL+ +S P +D + KT +R+ SLD RGL
Sbjct: 3 IENISHNHTEKSPLLNEQQHVSLPINNDDSTATITKTPSATPTTTQRKRVLSLDTVRGLT 62
Query: 46 VALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAV 102
+ MILVD+ GG WP + WNG + AD + P FLFI G ++ALALK +
Sbjct: 63 IFGMILVDNQGGPQVIWPLL-ETEWNGLSTADLIFPSFLFICGFSVALALKSAKNDIKTW 121
Query: 103 KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT 162
+I RTL L F L + + + R+ GVLQRI++ Y
Sbjct: 122 YNIIRRTLLLFFIQAFL--------NLMAHKFVFDSFRVMGVLQRISICYF--------- 164
Query: 163 KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV 222
F + L + AC +YL+++YG VP
Sbjct: 165 -------ACCCSFLLLPLVGQRIFLVACA-AIYLSVMYGLDVPG---------------- 200
Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
CG R L P CNA YID VLG N ++
Sbjct: 201 ----CG-RGVLTPSCNAGSYIDNSVLGANMIH---------------------------- 227
Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVII-----HTKGHLARLKQWVTMGFALLIFGL 337
P +PEGLLS+ S+ ++T +G+ G + H HL L +W+ + + G+
Sbjct: 228 ---PNDPEGLLSTFSAFITTWMGLELGRIFTRFYRKHDYAHLNILIRWIGIAVVFGVTGI 284
Query: 338 TLHFT 342
L T
Sbjct: 285 ALGVT 289
>gi|340371415|ref|XP_003384241.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Amphimedon queenslandica]
Length = 743
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 91/327 (27%), Positives = 153/327 (46%), Gaps = 48/327 (14%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
+ D+ ++ S K +RL SLD FRG+++ +MI V++ GG + +H+ WNG +AD V
Sbjct: 342 ATTDLLNEDPLSTRKKERLRSLDTFRGMSLIIMIFVNYGGGGYWFFNHSIWNGITVADLV 401
Query: 77 MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
P+F++I+GV+I + K + D+ K +++ ++ L
Sbjct: 402 FPWFVWIMGVSIVYSFK--GRKKDSFKLRLYQVVRR---------SVILLGLGLFLNNGY 450
Query: 137 RMI--RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACV 191
R+ R+ GVLQR A++Y +V++ E+ V +K + R + W WL+ +
Sbjct: 451 RLSHWRIPGVLQRFAIAYFVVAMTELLAPMVYNKYKLKWDVISVRDLTHNWVQWLVIVFL 510
Query: 192 LVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
++L + + P + AD GK N T G+ GYID +L
Sbjct: 511 ESLWLIITFSLKAPGCPRGYLGPGGRADGGKYSNCTGGI----------AGYIDSWILTD 560
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
NH+Y HP KA ++PEG+L S++SI+ GV G
Sbjct: 561 NHIYGHPT---CKAIYHT----------------GSYDPEGILGSINSIVMCFFGVQAGR 601
Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGL 337
++IH K +R+ ++V G LL+ GL
Sbjct: 602 ILIHHKQFGSRIVRFVVWG--LLMGGL 626
>gi|426359530|ref|XP_004047024.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Gorilla
gorilla gorilla]
Length = 635
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 145/305 (47%), Gaps = 46/305 (15%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL S+D FRG+A+ LM+ V++ GG + HA WNG ++ F+FI+G +I L++
Sbjct: 239 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGAEGCIEMIEMFVFIMGSSIFLSMT 298
Query: 94 RIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
I R + + K+ +R+ L+ GI++ P+ + +R+ GVLQR+
Sbjct: 299 SILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLG 353
Query: 150 LSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVP 205
++Y +V+++E +F K V + S R W WL+ + ++L L + VP
Sbjct: 354 VTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVP 413
Query: 206 DWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ D+GK N T G A GYIDR +LG +H+Y HP S A
Sbjct: 414 GCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSA 459
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLA 320
+ ++PEG+L +++SI+ +GV G ++++ TK L
Sbjct: 460 VLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILI 505
Query: 321 RLKQW 325
R W
Sbjct: 506 RFTAW 510
>gi|75909960|ref|YP_324256.1| hypothetical protein Ava_3756 [Anabaena variabilis ATCC 29413]
gi|75703685|gb|ABA23361.1| conserved hypothetical protein [Anabaena variabilis ATCC 29413]
Length = 375
Score = 106 bits (264), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 86/287 (29%), Positives = 130/287 (45%), Gaps = 73/287 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MILV+ AG +P ++HA W+GC D V PFFLFIVGVA++
Sbjct: 1 MRLTSLDVFRGITIAGMILVNMAGVADDVYPPLAHAEWHGCTPTDLVFPFFLFIVGVAMS 60
Query: 90 LALKRIPDR---ADAVKKVIFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + AV IFR +LF G+LL G ++ + D+ IR+ GVL
Sbjct: 61 FSLSKYTQENKPTSAVYWRIFRRAAILFVLGLLLNGFWNKG----IWTFDLSNIRIMGVL 116
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI+LSYL SL + ++ K Q W++A +LV Y + VP
Sbjct: 117 QRISLSYLFASLAVL---NLPRKGQ--------------WILAGVLLVGYWLTMMYVPVP 159
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
D+ ++ ++ N Y+DR ++ H+Y ++
Sbjct: 160 DYGAGVLTREG---------------------NFGAYVDRLIIPQAHLYAGDGFKNLG-- 196
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+PEGL S++ +I+S + G G I
Sbjct: 197 ----------------------DPEGLFSTIPAIVSVLAGYFTGEWI 221
>gi|196002389|ref|XP_002111062.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
gi|190587013|gb|EDV27066.1| hypothetical protein TRIADDRAFT_54611 [Trichoplax adhaerens]
Length = 431
Score = 105 bits (262), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 86/317 (27%), Positives = 142/317 (44%), Gaps = 66/317 (20%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+R+ SLD++RGL +M D GG + H+ WNG + D V P F+FI G +++++L
Sbjct: 161 RRIRSLDLYRGLCAIVMAFGDSGGGQYRFFKHSIWNGLTIVDVVFPGFIFISGFSLSISL 220
Query: 93 KRIPDRADAVKKV-----IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
+ + K + I R+ L F G+L+ G + RL GVLQR
Sbjct: 221 VKRLYKMQTPKLILIVTTIRRSFYLFFLGLLING-----------PCQISNWRLLGVLQR 269
Query: 148 IALSYLLVSLVEI--------FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
I++++L+VS + + FTKD +++ + + W ++ L Y+ L
Sbjct: 270 ISVTFLVVSCLAVWLYPTIKSFTKDQVLQEKVLRKM-------WPIMVLIVGLHTYVTLT 322
Query: 200 YGTYVPDWQFTIINK-DSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
VPD +D GK +N T G+ G+IDR V G NH+Y P
Sbjct: 323 AA--VPDCPVGYSGPGGKSDDGKYYNCTGGI----------AGFIDRFVFGSNHLYSRPT 370
Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT--K 316
+ +Q PF+PEG+L +++SI +G+ G I+H
Sbjct: 371 CKLLYQSSQ------------------PFDPEGVLGTLTSIFLCFLGLQMG--ILHNIFS 410
Query: 317 GHLARLKQWVTMGFALL 333
+L ++ W+ G L+
Sbjct: 411 NNLRIMRTWILFGLLLV 427
>gi|260788586|ref|XP_002589330.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
gi|229274507|gb|EEN45341.1| hypothetical protein BRAFLDRAFT_217958 [Branchiostoma floridae]
Length = 382
Score = 105 bits (262), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 137/294 (46%), Gaps = 47/294 (15%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+RL SLD FRG+ + +M V++ GG + + H+ WNG +AD V P+F++I+G + AL+
Sbjct: 1 RRLKSLDTFRGMCLCIMAFVNYGGGGYWFLDHSVWNGITVADLVFPWFMWIMGTSTALSF 60
Query: 93 KRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+ + +A K++ RT+ L G+ + +APD D IR+ GVLQR
Sbjct: 61 RGLQRKATPKLTIFGKIVRRTITLFLLGLFI----VNAPD------DWATIRIPGVLQRF 110
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
A+SY VS + + + + V Y WL C+L V+ L + VP
Sbjct: 111 AVSYFAVSTMMLLHMETEWYRDLVP-------YWKQWLFVLCLLAVHTCLTFLMPVPGCP 163
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
+ N T G A GYID +L +H+Y +
Sbjct: 164 TGYLGAGGLSDLDHTNCTGG----------AAGYIDNWLLTQDHIY-----------GDE 202
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
+P L + + ++PEG+L S++SI T +G+ G +++ + H +RL
Sbjct: 203 TPKVRILYQILVN-----YDPEGVLGSLTSIFMTFLGLQAGKILLSYEDHGSRL 251
>gi|423226736|ref|ZP_17213201.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392627009|gb|EIY21050.1| hypothetical protein HMPREF1062_05387 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 368
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 98/336 (29%), Positives = 144/336 (42%), Gaps = 102/336 (30%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
S P +S +K +RL SLD+ RG+ V MILV+++GG + + H+ WNG L D
Sbjct: 4 SHPPISTSPQK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCD 58
Query: 75 FVMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLF--WGI-----LLQGGFSH 125
V PFFLFI+GV+ +AL + +A V+KV+ RTL +L W I + G F
Sbjct: 59 LVFPFFLFIMGVSTYIALSKFHFQASGSVVRKVLKRTLVILCIGWAIHWFHFICDGDF-- 116
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
+RL GVL RIAL Y +VS V ++ + +G W
Sbjct: 117 --------FPFAHLRLTGVLPRIALCYCVVSFVALYV-----NHKYIG-----------W 152
Query: 186 LMAACVLVVYLALLY--GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
++ C++ Y LL Y PD D+ N + I
Sbjct: 153 II-GCLIAGYAVLLCIGNGYAPD--------DT---------------------NLLAII 182
Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
DR VLG +H+YH +P +PEGL S++S+I T+
Sbjct: 183 DRNVLGADHLYH----------------------------KSPIDPEGLTSTLSAIAHTL 214
Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
IG G +I+ + + + GF L+ G L
Sbjct: 215 IGFCCGKIILAKEALEQKTLKLFVAGFILMACGFVL 250
>gi|427385206|ref|ZP_18881711.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
12058]
gi|425727374|gb|EKU90234.1| hypothetical protein HMPREF9447_02744 [Bacteroides oleiciplenus YIT
12058]
Length = 368
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 94/335 (28%), Positives = 139/335 (41%), Gaps = 100/335 (29%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
S+P + Q K +RL SLD+ RG+ V MILV+++GG + + H+ WNG L D
Sbjct: 4 SQPSTFNSQPK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYESLQHSAWNGLTLCD 58
Query: 75 FVMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWG-------ILLQGGFSH 125
V PFFLFI+G++ ++AL + +A V+K++ RTL +L G + G FS
Sbjct: 59 LVFPFFLFIMGISTSIALSKFHFQASGSVVRKILKRTLIILCIGWVIHWFDFICDGDFS- 117
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
+RL GVL RIAL Y + S V ++ + +G W
Sbjct: 118 ---------PFAHLRLTGVLPRIALCYCVASFVALYV-----NHKYIG-----------W 152
Query: 186 LMAACVLVVYLALLYGT-YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
L+ + L G Y PD N + ID
Sbjct: 153 LIGILLAGYTFLLCIGNGYAPD-----------------------------STNLLAIID 183
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R VLG +H+YH +P +PEGL S+ S+I T+I
Sbjct: 184 RNVLGADHLYH----------------------------KSPIDPEGLTSTFSAIAHTLI 215
Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
G G +I+ K + + +GF L+ G L
Sbjct: 216 GFCCGKLILAKKNLEQKTLKLFVVGFILMACGFCL 250
>gi|348507459|ref|XP_003441273.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Oreochromis niloticus]
Length = 460
Score = 104 bits (260), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 90/312 (28%), Positives = 139/312 (44%), Gaps = 64/312 (20%)
Query: 20 DVSDQQEKSH----LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADF 75
+ S E +H K RL SLD FRG A+ +M+ V++ GG + HAPWNG +AD
Sbjct: 60 EESHASETAHGTVKAKPTRLLSLDTFRGFALTVMVFVNYGGGGYWFFQHAPWNGLTVADL 119
Query: 76 VMPFFLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE-- 129
VMP+F+F++G ++ LA +R R ++K+ +RT+ LL G +++P +
Sbjct: 120 VMPWFVFVIGTSVVLAFSSMQRRGVSRLQLLRKITWRTVVLLLLGFCF---LNYSPRDGP 176
Query: 130 ---LTYGVDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFRLYCWHW 185
L D R G+ +LL S+ + VQD LY W
Sbjct: 177 CSVLVLAEDPRSAAASGL-------HLLCSVSPYNWWNPVQD----------ILLYWPQW 219
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYID 244
L+ + ++L L + VP+ + D G N T G A GYID
Sbjct: 220 LIIILLETLWLCLTFLMPVPNCPTGYLGAGGIGDNGLYPNCTGG----------AAGYID 269
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R + G N MY +P + TQ PF+PEG+L +++SI+ +
Sbjct: 270 RWMFGDN-MYRYPTCKEMYRTTQ------------------PFDPEGVLGTINSIVIGFL 310
Query: 305 GVHFGHVIIHTK 316
G+ G ++I K
Sbjct: 311 GMQAGKILIFYK 322
>gi|119512372|ref|ZP_01631456.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
gi|119462961|gb|EAW43914.1| hypothetical protein N9414_19342 [Nodularia spumigena CCY9414]
Length = 369
Score = 104 bits (259), Expect = 7e-20, Method: Compositional matrix adjust.
Identities = 82/284 (28%), Positives = 134/284 (47%), Gaps = 73/284 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG--GD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL+SLD+FRG+ +A MILV+ AG G+ +P ++HA W+GC D V PFFLFIVGVA++
Sbjct: 1 MRLSSLDVFRGITIAAMILVNMAGVAGEVYPPLAHADWHGCTPTDLVFPFFLFIVGVAMS 60
Query: 90 LALKRIPDRADAVKKVIFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+L + ++ IFR +LF G+LL G ++ + D+ IR+ GVLQRI
Sbjct: 61 FSLSKYTEKG---YSRIFRRAAILFALGLLLNGFWNQG----IWTFDLSKIRIMGVLQRI 113
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
+L+YLL SL + ++ K Q W++A +L+ Y + VP++
Sbjct: 114 SLAYLLASLAVL---NLPRKGQ--------------WILAGVLLIGYWLTMMYVPVPEYG 156
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
++ ++ N YIDR ++ H+Y ++
Sbjct: 157 AGVLTREG---------------------NFGAYIDRLIIPQVHLYAGDGYQNLG----- 190
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+PEGL S++ ++++ + G G I
Sbjct: 191 -------------------DPEGLFSTIPAVVNVLAGYFTGQWI 215
>gi|282896863|ref|ZP_06304869.1| conserved hypothetical protein [Raphidiopsis brookii D9]
gi|281198272|gb|EFA73162.1| conserved hypothetical protein [Raphidiopsis brookii D9]
Length = 375
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 82/285 (28%), Positives = 129/285 (45%), Gaps = 73/285 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRGL +A+MI+ + AG +P +SHA WNGC D V PFFLFIVGVA++
Sbjct: 1 MRLISLDVFRGLTIAMMIIANMAGVAPDVYPFLSHALWNGCTPTDLVYPFFLFIVGVAMS 60
Query: 90 LALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + + K V F R + L G+LL G ++ D++ +R+ GVL
Sbjct: 61 FSLSKYSLESKLDKFVYFNLCRRAVILFTLGLLLNGFWNQGVGSF----DLQSLRVMGVL 116
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI+L+YL+ SL+ + +K Q W +A +L+ Y + VP
Sbjct: 117 QRISLAYLVASLIVL---KFPEKTQ--------------WALAGILLIFYWLTMMYIPVP 159
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
D+ ++ ++ N +IDR ++ H+Y +
Sbjct: 160 DYGAGMLTREG---------------------NFGAFIDRLIIAKPHLYAGDGFN----- 193
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
F G +PEGL S++ +I++ + G G
Sbjct: 194 -----FRG--------------DPEGLFSTIPAIVNVLFGYFAGQ 219
>gi|380028317|ref|XP_003697852.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Apis florea]
Length = 555
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 79/326 (24%), Positives = 141/326 (43%), Gaps = 47/326 (14%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
D+ +R+ ++D RG + LMI V+ G + + HA WNG D + P F++
Sbjct: 166 DETAMKQPSKRRVKAIDTVRGASTLLMIFVNDGSGGYRILGHATWNGLLPGDLLFPCFIW 225
Query: 83 IVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
I+GV I +A +KR+ R + ++ R++ + G+ L + ++ G +
Sbjct: 226 IMGVCIPIAMASQMKRMLPRHVILYGIVKRSILMFLIGLSL--------NTVSTGPQLET 277
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
IR+ GVLQR ++YL+V+L+ + K V F L W + ++ V+ +
Sbjct: 278 IRVFGVLQRFGITYLIVALIYFCLMARKPKKTQV--MQDFLLLLPQWCVMLVIVAVHCVI 335
Query: 199 LYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
+ VP + D K F+ G A GYIDR +L H++H
Sbjct: 336 TFCLKVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRMILKEPHLHHSA 385
Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
+S P++PEG+L ++++ +G+H G +++ K
Sbjct: 386 TVYKS----------------------GPYDPEGILGTLTTTFQVFLGLHAGIIMMTYKD 423
Query: 318 HLARLKQWVTMGFALLIFGLTLHFTN 343
R+ +W+ G LHF+N
Sbjct: 424 WKERVIRWLAWAAFFSCIGCILHFSN 449
>gi|347738959|ref|ZP_08870332.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
gi|346917867|gb|EGY00078.1| hypothetical protein AZA_89781 [Azospirillum amazonense Y2]
Length = 400
Score = 103 bits (258), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 97/308 (31%), Positives = 146/308 (47%), Gaps = 67/308 (21%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGV 86
+ RL SLD+ RGLAVA MILV G DW + + HAPW+G LAD V P FLF VG+
Sbjct: 19 TSPRLPSLDVLRGLAVAGMILVVSPG-DWSKAYTPLKHAPWDGWTLADMVFPTFLFSVGL 77
Query: 87 AIALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
AIAL+ +I +R A K+ R L L+ G++L + L Y D+ +RL G+L
Sbjct: 78 AIALSFTKIAQNRRAAGVKIARRALALIVLGLVL--------NALPY-FDLAHLRLPGIL 128
Query: 146 QRIALSYLLVSLVEIFT-KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
QRIAL Y+L +L+ + T + D +V + ++ + A VL+ Y A+L V
Sbjct: 129 QRIALCYVLATLLCLVTARTGADGSPTVNQRALL-------IAMAVVLLGYCAVLAWVPV 181
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAK-LNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
P G+ A L+P + +IDR V + H++
Sbjct: 182 P----------------------GIGAGHLDPGGSLAAWIDRGVFTVPHLW--------- 210
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
P DA ++PEGLLS++ + ++ ++GV G + ++ L L
Sbjct: 211 ----------PYGTDAAGAVV--YDPEGLLSTLPATVNVLVGVLAGTALKASRSRLNLLV 258
Query: 324 QWVTMGFA 331
V + A
Sbjct: 259 AAVMLMMA 266
>gi|298491757|ref|YP_003721934.1| hypothetical protein Aazo_3034 ['Nostoc azollae' 0708]
gi|298233675|gb|ADI64811.1| Protein of unknown function DUF2261, transmembrane ['Nostoc
azollae' 0708]
Length = 376
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 90/308 (29%), Positives = 137/308 (44%), Gaps = 71/308 (23%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MILV+ G + + HA WNGC D V PFFLFIVGVA+
Sbjct: 1 MRLTSLDVFRGITIAGMILVNMVGVADHKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMT 60
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
+L + K V R L+ LL + ++ + D+ IR GVLQRI+
Sbjct: 61 FSLSKYTADNKPTKAVYLRILRRAAILFLLGLLLNGFWNKGVWTFDLSSIRFMGVLQRIS 120
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
LSYL SL+ + V K+Q W++A +L+ Y + VPD+
Sbjct: 121 LSYLFASLIVL---KVPGKNQ--------------WVLAGVLLIGYWLTMMYVPVPDYGA 163
Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
++ ++ N G+IDR ++ H+Y
Sbjct: 164 GVLTREG---------------------NFGGFIDRLIIPKAHLY--------------- 187
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
+ D ++ +PEGL S++ +I+S ++G +F + I + H L +M
Sbjct: 188 ------KGDGFNYLG---DPEGLYSTIPAIVSVLVG-YFAGIRIKERKH---LNSQTSMD 234
Query: 330 FALLIFGL 337
F L FGL
Sbjct: 235 FVL--FGL 240
>gi|434403337|ref|YP_007146222.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
gi|428257592|gb|AFZ23542.1| hypothetical protein Cylst_1247 [Cylindrospermum stagnale PCC 7417]
Length = 375
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 84/287 (29%), Positives = 125/287 (43%), Gaps = 73/287 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG-GD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MILV+ AG D +P + HA WNGC D V PFFLFIVGVA+
Sbjct: 1 MRLTSLDVFRGMTIAAMILVNMAGVADEIYPLLDHAKWNGCTPTDLVFPFFLFIVGVAMT 60
Query: 90 LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + K V R L+ L G+LL G ++ + D+ IR GVL
Sbjct: 61 FSLSKYTAANKPTKAVYLRILRRAAILFALGLLLNGFWNKG----VWTFDLSNIRFMGVL 116
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI+L+YLL SL + + K Q W++A +LV Y + VP
Sbjct: 117 QRISLTYLLASLAVL---QLPRKGQ--------------WILAVVLLVGYWLTMMYVPVP 159
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
D+ ++ ++ N +IDR ++ H+Y +
Sbjct: 160 DYGAGVLTREG---------------------NFGAFIDRLIIPKAHLYKGDGFNLLG-- 196
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+PEGL S++ ++++ + G G I
Sbjct: 197 ----------------------DPEGLFSTIPAVVNVLAGYFAGEWI 221
>gi|345320430|ref|XP_001516736.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase, partial
[Ornithorhynchus anatinus]
Length = 448
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 80/284 (28%), Positives = 133/284 (46%), Gaps = 42/284 (14%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
QRL SLD FRG ++ +M+ V++ GG + H WNG +AD V P+F+FI+G +I+L+L
Sbjct: 198 QRLRSLDTFRGFSLIIMVFVNYGGGKYWFFKHEGWNGLTVADLVFPWFVFIMGSSISLSL 257
Query: 93 KRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+ R + + K+++R+ L G+L+ P+ + +R+ GVLQR+
Sbjct: 258 SSMLRRGYSKWRLLWKILWRSFLLFLIGVLIVN-----PNYCLGPLSWDKLRIPGVLQRL 312
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL----YCWHWLMAACVLVVYLALLYGTYV 204
+YL+V+ +E+ + S+ R F Y W+ + +L L + V
Sbjct: 313 GFTYLVVATLELLFAKAVPESNSLERTCSFLQEIISYWPQWIFILMLETAWLCLTFLLPV 372
Query: 205 PDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
P + D+GK N T G A GYID +LG NH+Y HP+
Sbjct: 373 PGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDHLLLGENHIYQHPS----- 417
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
+ ++PEG+L +++SI+ +GV
Sbjct: 418 -------------PNVLYHTKVAYDPEGILGTINSIVMAFLGVQ 448
>gi|390344818|ref|XP_795043.3| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Strongylocentrotus purpuratus]
Length = 680
Score = 103 bits (257), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 99/352 (28%), Positives = 159/352 (45%), Gaps = 68/352 (19%)
Query: 5 KAETTHHHPLIISEPDVSDQ---------QEKSHLKTQRLASLDIFRGLAVALMILVDHA 55
K+ +H+ I+S DQ + KS L RL S+D FRGLA+ ++L
Sbjct: 255 KSLPINHNGSILSNGSQDDQTPLTFPASDKPKSSL---RLRSVDTFRGLAITHLVLGASG 311
Query: 56 GGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLK 111
G + +HA W G +ADF+ P+F+FI+G +I L+ + + + KK++FR++
Sbjct: 312 DGHFWYSNHARWYGITVADFMFPWFVFIMGTSIHLSFNILLSKGLSYCAIFKKIVFRSIS 371
Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ-DKDQ 170
L G+ +Q SH D+R +R+ GVLQR ++Y +V+ + ++ +Q + +
Sbjct: 372 LFIMGVCIQ---SHN--------DLRNLRIPGVLQRFGITYFIVASSYLLSRRLQARRAE 420
Query: 171 SVGR-FSIFRLYC--WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS---ADYGKVFN 224
G+ + +FR +AAC LVV+L L + VP + G++ N
Sbjct: 421 KTGKCYMMFRDITDYLELPLAACCLVVHLCLTFLLPVPGCPLGYQGPGGPLVGENGELTN 480
Query: 225 VTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH 284
T G A GYIDR H+ T D + +R D
Sbjct: 481 CTGG----------ASGYIDRTFFTEAHLI--------LVNTCDDVYRTIVRSD------ 516
Query: 285 APFEPEGLLSSVSSILSTIIGVHFG---HVIIHTKGHLARLKQWVTMGFALL 333
PEG+L + +SI + G+ G H+ +G L RL W G AL+
Sbjct: 517 ----PEGILGTFTSIALCVFGLQSGKILHLFTTVRGRLVRLLLW---GLALI 561
>gi|427728534|ref|YP_007074771.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
gi|427364453|gb|AFY47174.1| hypothetical protein Nos7524_1293 [Nostoc sp. PCC 7524]
Length = 380
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 86/288 (29%), Positives = 130/288 (45%), Gaps = 75/288 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MILV+ AG + ++HA W+GC D V PFFLFIVGVA+
Sbjct: 15 MRLTSLDVFRGITIAAMILVNMAGVADDVYLPLTHADWHGCTPTDLVFPFFLFIVGVAMT 74
Query: 90 LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + + +R L+ L G+ L G ++ + D IR+ GVL
Sbjct: 75 FSLSKYTQDNKPTSAIYWRILRRAAILFILGLFLNGFWNQG----VWTFDFTSIRMMGVL 130
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-LALLYGTYV 204
QRI+LSYLL SL+ + + K Q WL+A +L+ Y LA++Y V
Sbjct: 131 QRISLSYLLASLIVL---KLPRKGQ--------------WLLAGVLLIGYWLAMMY-IPV 172
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
PD+ ++ ++ N Y+DR ++ H+Y +
Sbjct: 173 PDYGAGVLTREG---------------------NFGAYVDRLIIPKAHLYKGDGFN---- 207
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
F G +PEGL S++ +I+S + G G I
Sbjct: 208 ------FMG--------------DPEGLFSTIPAIVSVLAGYFTGEWI 235
>gi|224496100|ref|NP_001139059.1| uncharacterized protein LOC565246 precursor [Danio rerio]
Length = 582
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 89/336 (26%), Positives = 146/336 (43%), Gaps = 60/336 (17%)
Query: 3 EIKAETTHH---HPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW 59
+K HH + + + EP+ Q ++S K+ RL SLD FRG ++ +M+ V++ GG +
Sbjct: 169 RLKNRMCHHGSQNSMEMEEPNTEQQIDESKPKSSRLKSLDTFRGFSLTVMVFVNYGGGGY 228
Query: 60 PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRA----DAVKKVIFRTLKLLFW 115
HAPWNG +AD VMP+F+FI+G ++ LA + + ++KV +RT+ L+
Sbjct: 229 WFFQHAPWNGLTVADLVMPWFVFIIGTSVMLAFTSMHRKGVSLLQLLRKVTWRTVVLMLI 288
Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR 174
G +S L D R G+ +L S + + +QD
Sbjct: 289 GFCFM-NYSPRDGILVLAADTRSSPASGL-------HLFRSGTDHNWWNPIQD------- 333
Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKL 233
LY WL + ++L L + VP+ + D G N T G
Sbjct: 334 ---VILYWPEWLFIVLLETLWLCLTFLLPVPNCPTGYLGAGGVGDAGLYPNCTGG----- 385
Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
A +ID+ G N M+ +P + T+ PF+PEG+L
Sbjct: 386 -----AAAHIDKWFFGDN-MFWYPTCKVLYRTTE------------------PFDPEGVL 421
Query: 294 SSVSSILSTIIGVHFGHVIIH----TKGHLARLKQW 325
+++SI+ +G+ G +++ KG LAR W
Sbjct: 422 GTINSIVMGFLGMQAGKILLFFRQMNKGILARFLVW 457
>gi|340727662|ref|XP_003402158.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Bombus terrestris]
Length = 554
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 82/330 (24%), Positives = 145/330 (43%), Gaps = 48/330 (14%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
+D +R+ ++D RG + LMI V+ G + + HA WNG D + P F+
Sbjct: 159 ADDGAMKQPAKRRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLFPCFI 218
Query: 82 FIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
+I+GV I +A +KR+ + + ++ R++ L G+ L + ++ G +
Sbjct: 219 WIMGVCIPIAMSSQMKRMTLKHQILYGIVKRSILLFLIGLSL--------NTVSTGGQLE 270
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQ-DKDQS--VGRFSIFRLYCWHWLMAACVLVV 194
IR+ GVLQR ++YL+V+L+ + K QS + F L W + ++VV
Sbjct: 271 TIRIFGVLQRFGITYLVVALLYFLLMSRRPSKIQSPMLREVQDFLLLLPQWCVMLVIVVV 330
Query: 195 YLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
+ A+ + VP + D K F+ G A GYIDR +L H+
Sbjct: 331 HCAITFCLNVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRMILKEAHL 380
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
++ +S P++PEG+L ++++ +G+H G +++
Sbjct: 381 HYSATVYKS----------------------GPYDPEGILGTLTTAFQVFLGLHAGIIMM 418
Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
K R+ +W+ G LHFTN
Sbjct: 419 TYKDWKERVIRWLAWAAFFGCVGCVLHFTN 448
>gi|328780782|ref|XP_396570.4| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Apis mellifera]
Length = 569
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 78/332 (23%), Positives = 143/332 (43%), Gaps = 48/332 (14%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
+ D +R+ ++D RG + LMI V+ G + + HA WNG D + P
Sbjct: 172 QLDDTTAMKQPSKRRVKAIDTVRGASTLLMIFVNDGSGGYRILGHATWNGLLPGDLLFPC 231
Query: 80 FLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
F++I+GV I +A +KR+ + ++ R++ + G+ L + ++ G
Sbjct: 232 FIWIMGVCIPIAMAGQMKRMLPKHMIFYGIVKRSILMFLIGLSL--------NTVSTGPQ 283
Query: 136 VRMIRLCGVLQRIALSYLLVSLVEI---FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
+ IR+ GVLQR ++Y +V+L+ + K + + + F L W + ++
Sbjct: 284 LETIRIFGVLQRFGITYFIVALIYLCLMTRKPKKTQSPMLKEVQDFLLLLPQWCVMLVIV 343
Query: 193 VVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
V+ + + VP + D K F+ G A GYIDR +L +
Sbjct: 344 AVHCFITFCLKVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRMILKES 393
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H++H +S P++PEG+L ++++ +G+H G +
Sbjct: 394 HLHHSATVYKS----------------------GPYDPEGILGTLTTTFQVFLGLHAGII 431
Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
++ K R+ +W+T G LHFTN
Sbjct: 432 MMTYKDWKERVIRWLTWAAFFSCIGCILHFTN 463
>gi|428319268|ref|YP_007117150.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
nigro-viridis PCC 7112]
gi|428242948|gb|AFZ08734.1| Heparan-alpha-glucosaminide N-acetyltransferase [Oscillatoria
nigro-viridis PCC 7112]
Length = 406
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/317 (28%), Positives = 130/317 (41%), Gaps = 105/317 (33%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+A+A MILV++ G +P + HA WNGC D V PFFLFIVG A++
Sbjct: 1 MRLKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWNGCTPTDLVFPFFLFIVGCAMS 60
Query: 90 LALKR----IPDRADAVKKVIFRTLKL------------------LFWGI---------- 117
+L + P K+I + KL ++W I
Sbjct: 61 FSLSKYIQNYPKTGIETSKIIQKNEKLESDKNPFPSSFFLLPASNIYWRIARRAAILFIL 120
Query: 118 -LLQGGFSHAPDELTYGVDVR---MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
LL S A D L V IR+ GVLQRI L+Y + ++ + ++ ++Q
Sbjct: 121 GLLLNTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFIGAIAIL---NLSPRNQK-- 175
Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
L+AA VL+ Y L VF V +L
Sbjct: 176 ------------LLAAAVLLGYWGAL---------------------TVFAVGGYTAGEL 202
Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
P N GY+DR +LG H+Y PF+PEGLL
Sbjct: 203 TPEGNLGGYVDRLILGSQHLYK----------------------------GGPFDPEGLL 234
Query: 294 SSVSSILSTIIGVHFGH 310
S++ ++++ +IG G
Sbjct: 235 STLPAVVTVLIGYFTGE 251
>gi|125981811|ref|XP_001354909.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
gi|54643221|gb|EAL31965.1| GA19944 [Drosophila pseudoobscura pseudoobscura]
Length = 574
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 85/293 (29%), Positives = 139/293 (47%), Gaps = 45/293 (15%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ +RL SLD FRGL++ LMI V+ GG + I HA WNG +LAD V P FL+I+GV I L
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPSFLWIMGVCIPL 240
Query: 91 ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
++K R +A ++++R++KL G+ L G + +RL GVLQ
Sbjct: 241 SVKAQLSRGNSKARICLRILWRSIKLFAIGLCLNS---------MSGPSLEQLRLMGVLQ 291
Query: 147 RIALSYLLVSLVEIF-TKDVQDKDQSVGRFSIFR--LYCWHWLMAACVLVVYLALLYGTY 203
R +++L+V ++ ++ Q + Q +I+ L+ + ++ YL L +G
Sbjct: 292 RFGIAFLVVGILHTLCSRREQVQPQRAWHRAIYDVCLFSGELAVLLALIAAYLGLTFGLP 351
Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
VP + + A N A GY+D +VLG H+Y HP +
Sbjct: 352 VPGCPRGYLGPGGKH---------DLAAHPNCIGGAAGYVDLQVLGNAHIYQHP----TA 398
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG-HVIIHT 315
DS + F+PEG+ + S++ ++G G +++HT
Sbjct: 399 KYVYDS---------------SAFDPEGVFGCLLSVVQVLLGAFAGLTLLVHT 436
>gi|350423601|ref|XP_003493532.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Bombus impatiens]
Length = 565
Score = 102 bits (255), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 83/334 (24%), Positives = 146/334 (43%), Gaps = 49/334 (14%)
Query: 18 EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
+ V D K K +R+ ++D RG + LMI V+ G + + HA WNG D +
Sbjct: 167 KSQVDDGAMKQPAK-RRVKAIDTVRGASTLLMIFVNDGSGGYRTLGHATWNGLLPGDLLF 225
Query: 78 PFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
P F++I+GV I +A +KR+ + + ++ R++ L G+ L + ++ G
Sbjct: 226 PCFIWIMGVCIPIAMSSQMKRMTPKRQILYGIVKRSILLFLIGLSL--------NTVSTG 277
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD-QS--VGRFSIFRLYCWHWLMAAC 190
+ IR+ GVLQR ++Y +V+L+ + + QS + F L W +
Sbjct: 278 GQLETIRIFGVLQRFGITYFVVALLYFLLMSRRPRKIQSPMLREVQDFLLLLPQWCVMLV 337
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
++VV+ + + VP + D K F+ G A GYIDR +L
Sbjct: 338 IVVVHCVITFCLNVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYIDRVILK 387
Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
H++H +S P++PEG+L ++++ +G+H G
Sbjct: 388 EAHLHHSATVYKS----------------------GPYDPEGILGTLTAAFQVFLGLHAG 425
Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
+++ K R+ +W+ G LHFTN
Sbjct: 426 IIMMTYKDWKERVIRWLAWAAFFGCVGCVLHFTN 459
>gi|224537467|ref|ZP_03678006.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520905|gb|EEF90010.1| hypothetical protein BACCELL_02346 [Bacteroides cellulosilyticus
DSM 14838]
Length = 368
Score = 102 bits (254), Expect = 3e-19, Method: Compositional matrix adjust.
Identities = 95/336 (28%), Positives = 144/336 (42%), Gaps = 102/336 (30%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
S P +S +K +RL SLD+ RG+ V MILV+++GG + + H+ WNG L D
Sbjct: 4 SHPPISTSPQK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCD 58
Query: 75 FVMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLF--WGI-----LLQGGFSH 125
V PFFLFI+G++ +AL + +A V+K++ RTL +L W I + G F
Sbjct: 59 LVFPFFLFIMGISTYIALGKFHFQASGSVVRKILKRTLVILCIGWAIHWFHFICDGDF-- 116
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
+RL GVL RIAL Y +VS V ++ + +G W
Sbjct: 117 --------FPFAHLRLTGVLPRIALCYCVVSFVALYV-----NHKYIG-----------W 152
Query: 186 LMAACVLVVYLALLY--GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
++ C++ Y LL Y PD D+ N + I
Sbjct: 153 II-GCLIAGYAVLLCIGNGYAPD--------DT---------------------NLLAII 182
Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
DR +LG +H+YH +P +PEGL S++S+I T+
Sbjct: 183 DRNILGADHLYH----------------------------KSPIDPEGLTSTLSAIAHTL 214
Query: 304 IGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
IG G +I+ + + + GF L+ G L
Sbjct: 215 IGFCCGKIILAKEALEQKTLKLFVAGFILMACGFVL 250
>gi|414077874|ref|YP_006997192.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
gi|413971290|gb|AFW95379.1| hypothetical protein ANA_C12665 [Anabaena sp. 90]
Length = 376
Score = 102 bits (253), Expect = 4e-19, Method: Compositional matrix adjust.
Identities = 84/291 (28%), Positives = 128/291 (43%), Gaps = 73/291 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL+SLD+FRG+ +A MIL + AG + +SHA W+GC D + P FLFIVGVA+
Sbjct: 1 MRLSSLDVFRGITIAAMILANMAGVADDVYRPLSHAQWHGCTPTDLIFPCFLFIVGVAMT 60
Query: 90 LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + + K V R L+ L G++L G ++ + D+ IRL GVL
Sbjct: 61 FSLAKYTAQNKPTKAVYLRILRRTAILFILGLVLNGFWNQG----VWTFDLSSIRLMGVL 116
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRIAL+YL SL+ + + K Q WL+A +L+ Y + VP
Sbjct: 117 QRIALTYLFASLIVL---KLPRKSQ--------------WLVAGGLLIAYWLTMMYIPVP 159
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
D+ ++ ++ N +IDR ++ H+Y +
Sbjct: 160 DYGAGVLTREG---------------------NFGAFIDRLIIPKAHLYKGDGFN----- 193
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
F G +PEGL S++ +I+S + G G I K
Sbjct: 194 -----FLG--------------DPEGLFSTIPAIVSVLAGYFTGQWIKDKK 225
>gi|344281343|ref|XP_003412439.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Loxodonta africana]
Length = 782
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 91/342 (26%), Positives = 153/342 (44%), Gaps = 62/342 (18%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
D+ + + RL +D FRGLA+ +M+ V++ GG + HA WNG +AD V P
Sbjct: 362 DIQLEAWRPSAPPSRLRCVDTFRGLALIIMVFVNYGGGKYWYFKHASWNGLTVADLVFPC 421
Query: 80 FLFI--------------VGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
FL I + +++ L+R + + K+ +R+ L+ G+++
Sbjct: 422 FLEILFGEDLLCTRDPLEIFLSMTSILQRGCSKFKLLGKIAWRSFLLICIGVVIVN---- 477
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGR-FSIFRLYC- 182
P+ + +R+ GVLQR+ ++Y +V+++E +F K V + S FS+ L
Sbjct: 478 -PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPENCASERSCFSLRDLTAS 536
Query: 183 W-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAV 240
W WL + ++L L + VP + D+GK N T G A
Sbjct: 537 WPQWLFILTLESIWLTLTFFLPVPGCPTGYLGPGGIGDWGKYPNCTGG----------AA 586
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSIL 300
GY+DR +LG H+Y HP S A + ++PEG+L +++SI+
Sbjct: 587 GYMDRVLLGDEHLYQHP----SSAVLYHT--------------EMAYDPEGILGTINSIV 628
Query: 301 STIIGVHFGHVIIH----TKGHLARLKQWVTMGFALLIFGLT 338
+GV G ++++ TK + R W I GLT
Sbjct: 629 MAFLGVQAGKILLYYKDQTKDIVIRFTAWCC------ILGLT 664
>gi|321474731|gb|EFX85695.1| hypothetical protein DAPPUDRAFT_309035 [Daphnia pulex]
Length = 588
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 142/325 (43%), Gaps = 58/325 (17%)
Query: 19 PDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMP 78
P V+D+ K+ RL SLD FRG+ + LMI V+ G + HA WNG LAD + P
Sbjct: 180 PAVADEITPKK-KSSRLKSLDTFRGITIVLMIFVNDGAGQYFIFQHATWNGLQLADVIFP 238
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIF-----RTLKLLFWGILLQGGFSHAPDELTYG 133
+F++I+GV + ++L+ R ++ K IF R+ L F GI+ + L
Sbjct: 239 WFMWIMGVCMPISLRSSLRRKES-KLTIFAGILRRSCLLFFLGIM--------NNSLGGP 289
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIF--TKDVQDKDQSVGRFSIFR--LYCW-HWLM- 187
VD+ +R+ GVLQR A++YL V + D+ S +F+ + W W++
Sbjct: 290 VDLGRLRVPGVLQRFAITYLAVGTAGLLLTPADLSAPHPSSKARKLFQDIVVLWPQWILF 349
Query: 188 -----AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
A C + +L + G V ++ D+A G A GY
Sbjct: 350 LLLVAAHCFITFFLPVEEGCPVGYLGPAGLHLDNAYPGHCIG-------------GAAGY 396
Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
IDR +L + H+++ P + P++PEG+L S+
Sbjct: 397 IDRLMLSVQHIFNKPT-------------------TIGVYGSGPYDPEGILGSMLCTFQV 437
Query: 303 IIGVHFGHVIIHTKGHLARLKQWVT 327
+G G ++ G +RL +W+
Sbjct: 438 FLGAQAGMTLLIFSGWKSRLIRWLA 462
>gi|328869407|gb|EGG17785.1| transmembrane protein [Dictyostelium fasciculatum]
Length = 651
Score = 101 bits (252), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 83/355 (23%), Positives = 145/355 (40%), Gaps = 89/355 (25%)
Query: 26 EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
E + K RL SLD+FRGL++ +MI V++ GG + +H+ WNG +AD V P+F+FI+G
Sbjct: 218 ESNQPKKDRLKSLDVFRGLSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWFIFIMG 277
Query: 86 VAI-----ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
+A+ A+ ++ +P R + K++ R++ L G+ + G ++ R
Sbjct: 278 IAMPLSFNAMEIRGVPKRTIFI-KLVRRSVILFSLGLFINN-----------GNNLGHWR 325
Query: 141 LCGVLQRIALSYLLVSLVEIF--------------------------------------- 161
+ GVLQR +SY + + +F
Sbjct: 326 ILGVLQRFGVSYFVTGCIMMFVPLYRPNGGGGGNSHHQYNRFDGTGNDREREPSESDPLF 385
Query: 162 -TKDVQD--KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-A 217
+ +Q+ K S + F + WL A +L V+ + + VP +
Sbjct: 386 QSSSIQEKFKAHSASMLADFIPFWLQWLFALLILAVWFLVTFLLPVPGCPTGYLGPGGLG 445
Query: 218 DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRK 277
D G+ N T G A +D + NH++ P +
Sbjct: 446 DQGQHVNCTGG----------AAKIVDLHIFSNNHIFQTPTCQ----------------- 478
Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
P + ++PEG L ++S+ +GVH G I+ K + +RL +W + L
Sbjct: 479 --PIYNTGAYDPEGTLGYLTSVFMCFLGVHAGRTIMTYKSNRSRLIRWTILSILL 531
>gi|189465173|ref|ZP_03013958.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
17393]
gi|189437447|gb|EDV06432.1| hypothetical protein BACINT_01518 [Bacteroides intestinalis DSM
17393]
Length = 369
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 93/333 (27%), Positives = 146/333 (43%), Gaps = 98/333 (29%)
Query: 18 EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADF 75
+P +S +K +RL SLD+ RG+ V MILV+++GG + + H+ WNG L D
Sbjct: 6 QPPISGYPQK-----KRLLSLDVLRGITVVGMILVNNSGGKLSYDSLQHSAWNGLTLCDL 60
Query: 76 VMPFFLFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLF--WGI-----LLQGGFSHA 126
V PFFLFI+G++ +AL + +A ++K++ RTL +L W I + +G F
Sbjct: 61 VFPFFLFIMGISTYIALNKFHFQASGPVIRKILKRTLVILCIGWAIHWFHFICEGDF--- 117
Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
+ +RL GVL RIAL Y VS V ++ K + +G W+
Sbjct: 118 -------FPLAHLRLTGVLPRIALCYCAVSFVALYV-----KPKYIG-----------WM 154
Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
+ L++ A+L G I N + D N + IDR
Sbjct: 155 IG--FLIIGYAVLLG---------IGNGYTLD-----------------STNILAIIDRN 186
Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
VLG +H+YH +P +PEGL S++++I T+IG
Sbjct: 187 VLGADHLYH----------------------------KSPIDPEGLTSTLAAIAHTLIGF 218
Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
G +I+ + + + GF L+ G L
Sbjct: 219 CCGRIILAKEALEQKTLKLFVAGFILMACGFVL 251
>gi|156401294|ref|XP_001639226.1| predicted protein [Nematostella vectensis]
gi|156226353|gb|EDO47163.1| predicted protein [Nematostella vectensis]
Length = 387
Score = 101 bits (251), Expect = 7e-19, Method: Compositional matrix adjust.
Identities = 77/322 (23%), Positives = 140/322 (43%), Gaps = 47/322 (14%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
RL SLD FRG+++ +MI V+ GG + +H+ WNG +AD V P+F++I+GV++ L+ +
Sbjct: 1 RLKSLDTFRGISLTVMIFVNFGGGGYYFFAHSIWNGLTVADLVFPWFMWIMGVSMVLSFR 60
Query: 94 RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
+ + + ++I + K +L G + + Y R+ GVLQR A Y
Sbjct: 61 VLRRKQISTYRIIIKITKRTL--LLFALGLFTSNNLTNY-------RIPGVLQRFAACYF 111
Query: 154 LVSLVEIFTKDVQDKDQSVGRF-----SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
+V+++++ + Q G + + L+ WL+ L++Y+ + Y T +
Sbjct: 112 VVAVIQVLAGPSVEDSQPRGSWWDGIRDVVSLWA-QWLLMFAFLIIYVVVTYATELHGCP 170
Query: 209 FTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
+D FN T G+ + H +W K Q
Sbjct: 171 RGYTGPGGISDNSSAFNCTGGMAS-----------------------HVDSWLLGKHVYQ 207
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
F+ R +PEG++ +++SI +GV GH + H RL +W
Sbjct: 208 RGTFKDMYRTTVAH------DPEGVMGTLTSIFIVFLGVQAGHTLFTFSHHRQRLVRWFV 261
Query: 328 MGFALLIFGLTLHFTNGEHGSG 349
+A+L+ + + + G G
Sbjct: 262 --WAVLLGVIAIGLSGGTQNDG 281
>gi|270160204|ref|ZP_06188860.1| membrane protein, putative [Legionella longbeachae D-4968]
gi|289165026|ref|YP_003455164.1| hypothetical protein LLO_1691 [Legionella longbeachae NSW150]
gi|269988543|gb|EEZ94798.1| membrane protein, putative [Legionella longbeachae D-4968]
gi|288858199|emb|CBJ12067.1| putative membrane protein [Legionella longbeachae NSW150]
Length = 372
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 89/313 (28%), Positives = 139/313 (44%), Gaps = 83/313 (26%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
L+ +R+ SLD+FRGL +ALM+LV+ G +P + H+ WNGC LAD V P FLFIVG+
Sbjct: 6 LQNERILSLDVFRGLTMALMVLVNSLGTRISYPILLHSEWNGCTLADLVFPSFLFIVGMT 65
Query: 88 IALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++LKR + + + RT+ L GI L + VD+ IR+ G+
Sbjct: 66 TVISLKRHIKEESKTEIYYSIFKRTIILFLLGIFL--------NVFPKNVDISSIRIYGI 117
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIAL YL+ + + + T R IF ++L +L G
Sbjct: 118 LQRIALCYLICAFIYLHTTI---------RAQIF---------------IFLGILLGY-- 151
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
W F F++ +L N VGYID+ + H+
Sbjct: 152 --WYFL----------ACFHLPVSGMNQLTITRNWVGYIDQLLFSPKHLLFR-------- 191
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
F+PEGLLS++ SI +T+ G+ G++++ + + K+
Sbjct: 192 ---------------------NFDPEGLLSTIPSIATTLSGLIAGNLLL---AQIQKQKK 227
Query: 325 WVTMGFALLIFGL 337
+ M + L+F L
Sbjct: 228 CILMVASGLVFLL 240
>gi|383849627|ref|XP_003700446.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Megachile rotundata]
Length = 552
Score = 100 bits (250), Expect = 8e-19, Method: Compositional matrix adjust.
Identities = 85/339 (25%), Positives = 144/339 (42%), Gaps = 69/339 (20%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
D + +K +R+ ++D RG + LMI V+ G + + HA WNG D + P F++
Sbjct: 159 DDTARQPVK-RRVKAIDTVRGASTLLMIFVNDGSGGYKTLGHATWNGLLPGDLLFPCFIW 217
Query: 83 IVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
I+GV I +A LKR+ + + ++ R++ L G+ L + + G +
Sbjct: 218 IMGVCIPIALGSQLKRMVPKHVILYGILKRSVLLFLIGVSL--------NTVGTGPQLES 269
Query: 139 IRLCGVLQRIALSYLLVSLVEIF-------------TKDVQDKDQSVGRFSIFRLYCWHW 185
IR+ GVLQR ++Y +V+++ +F +DVQD F L W
Sbjct: 270 IRIFGVLQRFGVTYFIVAVIYLFLISKRPTKVQSPMLRDVQD----------FLLLLPQW 319
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSA-DYGKVFNVTCGVRAKLNPPCNAVGYID 244
+ ++ + + + VP + D K F+ G A GYID
Sbjct: 320 TVMLAIVAAHCIITFCLPVPGCPTGYLGPGGLHDDAKYFDCVGG----------AAGYID 369
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
+ VL H++H +S APF+PEG+L ++S +
Sbjct: 370 KVVLKEQHLHHSMTVYKS----------------------APFDPEGILGCLTSTFHVFL 407
Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
G+H G +++ K R+ +W+ G LHFTN
Sbjct: 408 GLHAGIIMMTYKDWKERVIRWLAWAAFFSCIGCALHFTN 446
>gi|297172331|gb|ADI23307.1| uncharacterized conserved protein [uncultured nuHF2 cluster
bacterium HF0770_19K18]
Length = 373
Score = 100 bits (250), Expect = 9e-19, Method: Compositional matrix adjust.
Identities = 82/299 (27%), Positives = 132/299 (44%), Gaps = 80/299 (26%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
K+ RL SLD FRGL +A MI+V+ G +W + HA W+GC D V PFFLFIVGV
Sbjct: 6 KSDRLLSLDAFRGLTIAFMIIVNTPG-NWSYVYGPLRHAEWHGCTPTDLVFPFFLFIVGV 64
Query: 87 AI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
A+ + A +D +KK+ +RT+ + +G+LL +A + D +R+ GV
Sbjct: 65 AMRFSFAQHNYQPSSDLLKKIFWRTVTIFSFGLLL-----NAYPFIRQNWDWSSLRIMGV 119
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRI L+Y L +++ ++ + + W+ +L+ Y
Sbjct: 120 LQRIGLAYGLAAILSLYLSEKK-----------------LWISCGIILIGY--------- 153
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
W ++ S +G N ID +LG NH+ WR +
Sbjct: 154 --WLILLLFGGSDPFGL--------------SSNIARTIDIAILGENHL-----WRGTG- 191
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
PF+PEGLLS++ +I++ +IG G +I ++ ++
Sbjct: 192 --------------------IPFDPEGLLSTIPAIVTVLIGFSIGQLIQENSNRISLVQ 230
>gi|119583586|gb|EAW63182.1| hCG1993224, isoform CRA_a [Homo sapiens]
Length = 382
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 82/290 (28%), Positives = 137/290 (47%), Gaps = 46/290 (15%)
Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKK 104
M+ V++ GG + HA WNG +AD V P+F+FI+G +I L++ I R + + K
Sbjct: 1 MVFVNYGGGKYWYFKHASWNGLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGK 60
Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTK 163
+ +R+ L+ GI++ P+ + +R+ GVLQR+ ++Y +V+++E +F K
Sbjct: 61 IAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAK 115
Query: 164 DVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADY 219
V + S R W WL+ + ++L L + VP + D+
Sbjct: 116 PVPEHCASERSCLSLRDITSSWPQWLLILVLEGLWLGLTFLLPVPGCPTGYLGPGGIGDF 175
Query: 220 GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDA 279
GK N T G A GYIDR +LG +H+Y HP S A +
Sbjct: 176 GKYPNCTGG----------AAGYIDRLLLGDDHLYQHP----SSAVLYHT---------- 211
Query: 280 PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH----TKGHLARLKQW 325
++PEG+L +++SI+ +GV G ++++ TK L R W
Sbjct: 212 ----EVAYDPEGILGTINSIVMAFLGVQAGKILLYYKARTKDILIRFTAW 257
>gi|385809567|ref|YP_005845963.1| heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
album JCM 16511]
gi|383801615|gb|AFH48695.1| Heparan-alpha-glucosaminide N-acetyltransferase [Ignavibacterium
album JCM 16511]
Length = 378
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 84/288 (29%), Positives = 137/288 (47%), Gaps = 79/288 (27%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
T+RL SLD+FRG+ + MILV++ G +P++ HA W+GC D + PFFLFIVGVA+
Sbjct: 4 TERLVSLDVFRGITIMGMILVNNPGTWSAVYPQLLHAEWHGCTFTDLIFPFFLFIVGVAV 63
Query: 89 ALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYG--VDVRMIRLC 142
+ +L + + ++K +I RT+ L GI+L G P L +G +R+
Sbjct: 64 SYSLTKRKAQGGSMKSLYLNIIRRTVILFLLGIILNG----FPFGLLFGHQFSWETLRIP 119
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQRIA+ Y + + + + T K Q +W AA +L++Y A++ +
Sbjct: 120 GVLQRIAIVYFVAAFLFLTT---STKFQ-------------YWFTAA-ILILYAAVM--S 160
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
++P V A P N +ID+ +LG +HM W +
Sbjct: 161 FIP-------------------VPGIGYANFEPGKNLSAWIDQMILG-SHM-----WSGT 195
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
K ++PEG+LS++ +I S ++G+ G+
Sbjct: 196 KL----------------------WDPEGILSTIPAIGSAMLGIFTGN 221
>gi|332709783|ref|ZP_08429740.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
gi|332351381|gb|EGJ30964.1| hypothetical protein LYNGBM3L_44860 [Moorea producens 3L]
Length = 366
Score = 100 bits (249), Expect = 1e-18, Method: Compositional matrix adjust.
Identities = 87/288 (30%), Positives = 127/288 (44%), Gaps = 80/288 (27%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+A+A MILV++ G +P + HA W+G D V P FLFI GVA+A
Sbjct: 1 MRLTSLDVFRGIAMASMILVNNPGSWSYVYPPLLHAKWHGFTPTDLVFPAFLFIAGVAMA 60
Query: 90 LALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
+L + + +V + +R + +LF LL GF TY D IR+ GVLQ
Sbjct: 61 FSLVKYTNNNQSVSQGYWRIGRRCAILFALGLLLNGFP------TYNWDT--IRIMGVLQ 112
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
RI+L+Y L ++ + +++ + W++ VL+ Y A + VP
Sbjct: 113 RISLAYFLSAVAVL---NLRRRGL--------------WVLTGIVLLGYWAAMSLVPVP- 154
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
DYG L P N YIDR VLG NH+Y
Sbjct: 155 -----------DYGA---------GNLTPEGNFAAYIDRMVLGTNHLYK----------- 183
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
A F+PEGL S+ ++++ + G G + H
Sbjct: 184 -----------------QAQFDPEGLFSTFPAVVTVLAGYFVGDWLRH 214
>gi|427709244|ref|YP_007051621.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
gi|427361749|gb|AFY44471.1| hypothetical protein Nos7107_3914 [Nostoc sp. PCC 7107]
Length = 375
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 126/287 (43%), Gaps = 73/287 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MILV+ AG +P ++HA W+GC D V PFFLFIVGVA+
Sbjct: 1 MRLTSLDVFRGITIAAMILVNMAGVADDVYPLLAHADWHGCTPTDLVFPFFLFIVGVAMT 60
Query: 90 LALKRIPDRADAVKKV---IFRTLKLLF-WGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + V IFR +LF G+LL ++ D IR+ GVL
Sbjct: 61 FSLSKYTADNKPTSTVYLRIFRRAAILFALGLLLNVFWNKGVGTF----DFSSIRIMGVL 116
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI+LSYLL SL + ++ K Q W++AA +L+ Y + VP
Sbjct: 117 QRISLSYLLASLAVL---NLPRKGQ--------------WILAAVLLIGYWLTMMYVPVP 159
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
++ ++ ++ N Y DR ++ H+Y ++
Sbjct: 160 EYGAGVLTREG---------------------NFGAYFDRLIIPQTHLYAGDGFKSMG-- 196
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+PEGL S++ +++S + G G I
Sbjct: 197 ----------------------DPEGLFSTIPAVVSVLAGYFTGQWI 221
>gi|270339962|ref|ZP_06203500.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
gi|270333113|gb|EFA43899.1| conserved hypothetical protein [Prevotella bergensis DSM 17361]
Length = 389
Score = 99.8 bits (247), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 83/289 (28%), Positives = 126/289 (43%), Gaps = 73/289 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
QRL SLD+ RGL V LMI V++ G + + H+ WNG L D V PFFLF+VGV+ L
Sbjct: 18 QRLISLDVLRGLTVMLMIFVNNGAGTQIFSPLRHSRWNGMTLCDLVFPFFLFMVGVSTYL 77
Query: 91 ALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+L++ A ++K+ RT L G+ + F A + +D+ +R+ GV+QRI
Sbjct: 78 SLRKSNFAWSAKTLRKIARRTALLFLIGLTIN-WFDMACNG--SPLDLAHLRIMGVMQRI 134
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
AL Y + V I + V RL+ WL+A ++ L L+ G
Sbjct: 135 ALCYGATAFVAILSSKVPQ-----------RLHLIPWLIAVLLIAYSLLLIIG------- 176
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
DY + N + +D +LG +H+YH
Sbjct: 177 ------GGYDY--------------SSATNLLAIVDTHILGYDHLYH------------- 203
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
+P +PEGLLS++ +I T+IG + I +G
Sbjct: 204 ---------------RSPVDPEGLLSTLPAIAHTLIGFWVARLTIGKQG 237
>gi|226225918|ref|YP_002760024.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
gi|226089109|dbj|BAH37554.1| hypothetical membrane protein [Gemmatimonas aurantiaca T-27]
Length = 401
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 97/331 (29%), Positives = 144/331 (43%), Gaps = 79/331 (23%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIV 84
K +RL SLD+FRG+ VA M+LV++ G +P + HAPW+G D + PFFLFIV
Sbjct: 6 GSFKAERLLSLDVFRGMTVAGMLLVNNPGTWSAIYPPLQHAPWHGWTPTDLIFPFFLFIV 65
Query: 85 GVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQG--GFSHAPDELTYGVDVRM 138
G+ L+L+ R D + ++ + LK + +G+LL G F+ P R+
Sbjct: 66 GITTELSLRARRARGDDEQAILRQILKRGALIFLFGLLLAGFPFFTWPPGLPGASFGERV 125
Query: 139 I------RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
I R+ GVLQRI ++YL +L+ T + Q V + A +L
Sbjct: 126 IDRFEHWRIMGVLQRIGVAYLCGALL---TWRTTVRQQGV--------------ILAALL 168
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV-GYIDRKVLGIN 251
Y AL+ VPD R L+ P + ++DR VLG+N
Sbjct: 169 FGYWALMTLVPVPD------------------TGVAGRFVLDKPDQLLSAWLDRTVLGVN 210
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H++ S A T D PEGLLS++ +I + I G G
Sbjct: 211 HLW-------SGAKTWD--------------------PEGLLSTIPAIGTMICGTFAGRW 243
Query: 312 IIHTKGHL-ARLKQWVTMGFALLIFGLTLHF 341
I + L RL +G ++ GL H+
Sbjct: 244 IARQELTLHERLVALFAVGALAMMVGLMWHW 274
>gi|374580713|ref|ZP_09653807.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
DSM 17734]
gi|374416795|gb|EHQ89230.1| hypothetical protein DesyoDRAFT_2145 [Desulfosporosinus youngiae
DSM 17734]
Length = 375
Score = 99.0 bits (245), Expect = 3e-18, Method: Compositional matrix adjust.
Identities = 87/320 (27%), Positives = 144/320 (45%), Gaps = 84/320 (26%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWP-EISHAPWNGCNLADFVMPFFL 81
EK L R +DIFRGL ++LM++ + G + P ++ HA WNG + DFV PFF+
Sbjct: 1 MEKGKL---RFDCIDIFRGLTISLMLICSNPGNITNIPAQLRHADWNGATIGDFVFPFFI 57
Query: 82 FIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
F +G+ + +A+ R ++ + ++I R++ + G++L G + D+
Sbjct: 58 FSMGIVVPIAINRRLEKGISQMRIIINVLNRSIVMFLLGLILNGFPTF---------DLA 108
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQ-SVGRFSIFRLYCWHWLMAACVLVVYL 196
+IR+ GVLQRIA+ Y +L+ + K + KD +G + L+A +L +Y
Sbjct: 109 IIRVPGVLQRIAIVYFCSALIYLLFKSIVKKDLVQIGILT---------LIAVLLLAIYY 159
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
LL G VP + G++ L V YID K L H+Y
Sbjct: 160 WLLKGLQVPGIE-------------------GLKGGL------VSYIDLKYLK-GHLY-- 191
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
P+ F+PEG+LS++ ++ S IIGV G + +
Sbjct: 192 ----------------------TPT-----FDPEGILSTIPALSSGIIGVVVGMIFLRRD 224
Query: 317 GHLARLKQWVTMGFALLIFG 336
++ +V G L+IF
Sbjct: 225 SRFVKMTIFVCSGILLIIFA 244
>gi|334121382|ref|ZP_08495452.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
gi|333455096|gb|EGK83757.1| hypothetical protein MicvaDRAFT_2176 [Microcoleus vaginatus FGP-2]
Length = 406
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 88/317 (27%), Positives = 132/317 (41%), Gaps = 105/317 (33%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
R SLD+FRG+A+A MILV++ G +P + HA W+GC D + PFFLFIVG A++
Sbjct: 1 MRFKSLDVFRGIAIASMILVNNPGSWEQVYPPLDHAEWHGCTPTDLIFPFFLFIVGCAMS 60
Query: 90 LALKR-----------------IPDRADAVKKVIFRTLKLL-----FWGI---------- 117
+L + +++++ K + +L LL +W I
Sbjct: 61 FSLSKYTQNYPQTGIETSKITQTKEKSESAKNPLPSSLFLLPYSNIYWRIARRAAILFIL 120
Query: 118 -LLQGGFSHAPDELTYGVDVR---MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVG 173
LL S A D L V IR+ GVLQRI L+Y + ++ I ++ ++Q
Sbjct: 121 GLLLNTSSIALDVLLNSAPVENFGKIRIMGVLQRIGLAYFISAIAII---NLSPRNQK-- 175
Query: 174 RFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKL 233
L+A VL+ Y A L VF V +L
Sbjct: 176 ------------LLAVAVLLGYWAAL---------------------TVFAVGGYTAGEL 202
Query: 234 NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLL 293
P N GY+DR +LG H+Y PF+PEGLL
Sbjct: 203 TPEGNLGGYVDRLILGSQHLYK----------------------------GGPFDPEGLL 234
Query: 294 SSVSSILSTIIGVHFGH 310
S++ ++++ +IG G
Sbjct: 235 STLPAVVTVLIGYFTGE 251
>gi|354568330|ref|ZP_08987495.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
gi|353540693|gb|EHC10166.1| hypothetical protein FJSC11DRAFT_3703 [Fischerella sp. JSC-11]
Length = 384
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 93/300 (31%), Positives = 132/300 (44%), Gaps = 83/300 (27%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG----GDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
RL SLD+FRG+ +A MILV+ A +P + HA W+GC D V PFFLFIVGVA+
Sbjct: 1 MRLTSLDVFRGITIAGMILVNTASIAEPNVYPPLLHAEWHGCTPTDLVFPFFLFIVGVAM 60
Query: 89 ALALKRIPDRADAVKK-----------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
+ + + D +K +I R L G+LL G ++ + D
Sbjct: 61 SFSFSKYTDSKLHGEKEKVFVSLPYWRIIRRAAILFVLGLLLNGFWNQG----VWTFDFN 116
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-L 196
IR+ GVLQRI+L+YLL SLV ++ K Q W++A +L+ Y L
Sbjct: 117 SIRVMGVLQRISLTYLLASLVVF---NIPRKGQ--------------WILAGVLLIGYWL 159
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
A++ YVP + YG GV L N YIDR ++ H+Y
Sbjct: 160 AMM---YVP----------VSGYG------AGV---LTRDGNLGAYIDRLIIPKAHLYKG 197
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
+ F G +PEGL S++ +I+S + G G I K
Sbjct: 198 ----------DNYNFMG--------------DPEGLFSTIPAIVSVLAGYFAGQWIRSQK 233
>gi|225012704|ref|ZP_03703139.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
gi|225003237|gb|EEG41212.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
Length = 366
Score = 98.6 bits (244), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 94/314 (29%), Positives = 134/314 (42%), Gaps = 85/314 (27%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGV 86
K QRL +LDI RGL + LMI+V+ G W E+ HA WNG D+V P FLFIVGV
Sbjct: 4 KNQRLLALDILRGLTIILMIVVNDPG-SWSEVYAPFLHAEWNGLTPTDYVFPTFLFIVGV 62
Query: 87 AIALALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
+I L+L + + R+ KKV++R LK+ GI L L + IR
Sbjct: 63 SIVLSLSKQLEAGKTRSQIAKKVLWRALKIYLVGIFLW---------LWPSFNFEGIRWV 113
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVL RIAL +L L+ ++T +G + +W+M A V
Sbjct: 114 GVLPRIALVFLACGLIFLYTTKKTQWYLGIG------ILLGYWIMMAYV----------- 156
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
P +G+ D V P ++
Sbjct: 157 ---------------------------------PVPGIGFPDLSV---------P--EKN 172
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
A DS F P R +W +PEG LS++ +I+S +IG+ G+V++ + RL
Sbjct: 173 WAHYLDS-FLIPGRLWKYTW-----DPEGFLSTLPAIVSGLIGMWAGYVLMKKEELKTRL 226
Query: 323 KQWVTMGFALLIFG 336
Q +GF LL G
Sbjct: 227 NQLFFIGFILLFLG 240
>gi|365877201|ref|ZP_09416706.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
gi|442587874|ref|ZP_21006688.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
gi|365755061|gb|EHM96995.1| hypothetical protein EAAG1_13068 [Elizabethkingia anophelis Ag1]
gi|442562373|gb|ELR79594.1| hypothetical protein D505_08600 [Elizabethkingia anophelis R26]
Length = 400
Score = 98.2 bits (243), Expect = 5e-18, Method: Compositional matrix adjust.
Identities = 89/307 (28%), Positives = 132/307 (42%), Gaps = 84/307 (27%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+K+ R SLD+FRG VALMILV++ G +P + HA W+GC D V PFFLF VG
Sbjct: 1 MKSARYYSLDVFRGATVALMILVNNPGTWSAIYPPLEHAKWHGCTPTDLVFPFFLFAVGN 60
Query: 87 AIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQ--GGFSHAPDELTY----GVDVRM 138
A+ + + +V KKVI RTL + G+ L F D L++ D
Sbjct: 61 AMTFVIPKFQQHNSSVFWKKVIKRTLLIFGIGLFLNWCPFFQWDHDSLSFISWESSDENG 120
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
+R+ GVLQRIA++Y S++ + K+ ++ W ++ +LV+Y
Sbjct: 121 VRIMGVLQRIAIAYFFASVIAYYFKE--------------KMVLW---ISGALLVIY--- 160
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
W T+ + Y + + P ID +LGI H Y
Sbjct: 161 --------WLLTLFLGGTDPY--------SLEGFIGVP------IDHSILGIAHEYKG-- 196
Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
EG PF+PEGL S++ +I + G G+ I KG+
Sbjct: 197 -------------EG-----------VPFDPEGLFSTIPAISQVLFGYLIGNY-IQKKGN 231
Query: 319 LARLKQW 325
+ QW
Sbjct: 232 I----QW 234
>gi|260061394|ref|YP_003194474.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
HTCC2501]
gi|88785526|gb|EAR16695.1| hypothetical protein RB2501_07335 [Robiginitalea biformata
HTCC2501]
Length = 382
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 89/315 (28%), Positives = 139/315 (44%), Gaps = 86/315 (27%)
Query: 22 SDQQEKSHLKTQ------RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCN 71
+DQ+E+++ + Q R+ S+DIFRGL +ALMILV+ G W + HA W+G
Sbjct: 6 NDQRERTNPEKQTKAMKERIVSVDIFRGLTIALMILVN-TPGTWEAVYAPFRHAEWHGYT 64
Query: 72 LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
D V PFFLFIVG +I A + A +K+I RTLKL+ GI L G F+ P
Sbjct: 65 PTDLVFPFFLFIVGTSIVFAYRNKQPDAATHRKIIVRTLKLILLGIFL-GAFTVEPP--- 120
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
+ IR GVLQRI + + +L+ + T + L+A V
Sbjct: 121 FFEPFSEIRFPGVLQRIGVVFFAAALLFLHTN-------------------YKTLLAITV 161
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
+++ ++ ++P + + +V P N YID V G +
Sbjct: 162 VILLGYWVWMAFIP------LGGEPPSLERV-------------PNNWANYIDVHVFG-S 201
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H Y + D ++PEGLLS++ +I S ++G+ G V
Sbjct: 202 HTY---------------------KPD--------YDPEGLLSTLPAIASALLGIFTGRV 232
Query: 312 IIHTKGHLARLKQWV 326
++ + A QW+
Sbjct: 233 LVSDR---ANKTQWM 244
>gi|440684188|ref|YP_007158983.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
gi|428681307|gb|AFZ60073.1| hypothetical protein Anacy_4727 [Anabaena cylindrica PCC 7122]
Length = 376
Score = 98.2 bits (243), Expect = 6e-18, Method: Compositional matrix adjust.
Identities = 58/133 (43%), Positives = 75/133 (56%), Gaps = 11/133 (8%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MILV+ G + + HA WNGC D V PFFLFIVGVA+
Sbjct: 1 MRLTSLDVFRGITIAGMILVNMVGVADNKYSLLDHAEWNGCTPTDLVFPFFLFIVGVAMT 60
Query: 90 LALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+L + K V R L+ L G+LL G ++ + D+ IRL GVL
Sbjct: 61 FSLSKYTADNKPTKAVYLRILRRAAILFILGLLLNGFWNKG----VWTFDLSSIRLMGVL 116
Query: 146 QRIALSYLLVSLV 158
QRI+LSYL SL+
Sbjct: 117 QRISLSYLFASLI 129
>gi|427716050|ref|YP_007064044.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
gi|427348486|gb|AFY31210.1| hypothetical protein Cal7507_0722 [Calothrix sp. PCC 7507]
Length = 375
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 136/307 (44%), Gaps = 70/307 (22%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MILV+ G +P + HA WNGC D V PFFLFIVGVA+
Sbjct: 1 MRLTSLDVFRGITIAAMILVNMVGVADDKYPLLDHAEWNGCTPTDLVFPFFLFIVGVAMT 60
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
+L + + V +R L+ + L + ++ + D+ IRL GVLQRI+
Sbjct: 61 FSLSKYTEGNKPNSSVYWRILRRAAILLALGLLLNGFWNKGVWTFDLSSIRLMGVLQRIS 120
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
LSYL+ S+ + ++ K Q W++AA +L+ Y + VP
Sbjct: 121 LSYLVASVTVL---NLPRKGQ--------------WILAAVLLIGYWLTMMYLPVPGHGA 163
Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
++ ++ N YIDR ++ H+Y +
Sbjct: 164 GVLTREG---------------------NLGAYIDRLIIPKAHLYKGDKFN--------- 193
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
F G +PEGL S++ +I+S + G +F + I ++ +R ++G
Sbjct: 194 -FMG--------------DPEGLFSTIPAIVSVLAG-YFAGLWIRSQPVRSR----TSIG 233
Query: 330 FALLIFG 336
AL G
Sbjct: 234 LALFGIG 240
>gi|330805524|ref|XP_003290731.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
gi|325079117|gb|EGC32733.1| hypothetical protein DICPUDRAFT_49381 [Dictyostelium purpureum]
Length = 644
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 50/145 (34%), Positives = 86/145 (59%), Gaps = 15/145 (10%)
Query: 21 VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
+S + +++ K RL SLD+FRG ++ +MI V++ GG + +H+ WNG +AD V P+F
Sbjct: 194 ISSVERENNKKKDRLKSLDVFRGFSITIMIFVNYGGGGYWFFNHSYWNGLTVADLVFPWF 253
Query: 81 LFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
+FI+G+A+ L+ + R V+K++ R+ L G+ + GV++
Sbjct: 254 VFIMGIAMPLSFNAMERRGTTKLVIVQKLVRRSAILFALGLFINN-----------GVNL 302
Query: 137 RMIRLCGVLQRIALSYLLVSLVEIF 161
+ R+ GVLQR A+SYL+V L+ +F
Sbjct: 303 QHWRILGVLQRFAISYLIVGLIMLF 327
>gi|387793162|ref|YP_006258227.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
gi|379655995|gb|AFD09051.1| hypothetical protein Solca_4061 [Solitalea canadensis DSM 3403]
Length = 393
Score = 97.8 bits (242), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 88/304 (28%), Positives = 136/304 (44%), Gaps = 87/304 (28%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
Q + RL SLD+FRG VA MILV++ G W I HA WNGC D + PF
Sbjct: 1 MQASASEPKPRLLSLDVFRGATVAAMILVNNPG-SWSNIYAPLEHAKWNGCTPTDLIFPF 59
Query: 80 FLFIVGVAIALAL----KRIPDRADAVKKVIFRTLKLLFWGILL---------QGGFSHA 126
FLFIVG++IA AL R + + A+K + R+LKL G++L + G
Sbjct: 60 FLFIVGISIAYALSGKKSRPEEHSAAIKSITIRSLKLFGLGLILALFPIVYFDKFGEVDV 119
Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
D++ + +R+ GVLQRI + + + ++ I + K +++ W
Sbjct: 120 WDQIV--MRFSGVRIMGVLQRIGIVFFIAGIIFI-----KAKPKTI---------AWT-- 161
Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGV-RAKLNPPCNAVGYIDR 245
A +LV+Y L+ T+VP GV A L P N +IDR
Sbjct: 162 -AGSLLVIYYLLM--TFVP--------------------VPGVGYANLEPETNLGAWIDR 198
Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+L +H+ W++SK +W +PEGLL ++ ++ + ++G
Sbjct: 199 LILTTDHL-----WKQSK-----------------TW-----DPEGLLGTIPAVATGLLG 231
Query: 306 VHFG 309
G
Sbjct: 232 TLCG 235
>gi|344203119|ref|YP_004788262.1| hypothetical protein [Muricauda ruestringensis DSM 13258]
gi|343955041|gb|AEM70840.1| hypothetical protein Murru_1800 [Muricauda ruestringensis DSM
13258]
Length = 375
Score = 97.1 bits (240), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 95/320 (29%), Positives = 134/320 (41%), Gaps = 80/320 (25%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS---HAPWNGCNLADFVMPF 79
+Q S LK + L SLD+FRGL VALMI+V+ G S HA WNG L D V P
Sbjct: 6 NQPNMSKLKNRYL-SLDVFRGLDVALMIIVNSPGNGSTTFSPLLHADWNGFTLTDLVFPT 64
Query: 80 FLFIVGVAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
FLF+VG +++ ++K+ KKV+ RT + G L+ +L
Sbjct: 65 FLFVVGNSMSFSMKKYESMGKPAFFKKVLKRTAIIFLLGFLMYWYPFFDDGQLK---PFS 121
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
R+ GVLQRIAL Y+ S++ F K + W ++A LV Y
Sbjct: 122 ETRVFGVLQRIALCYMFASIILHFVKT--------------KTAIW---LSALFLVGYHL 164
Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
+L G L NAV +D ++G NHMYH
Sbjct: 165 ILIG----------------------------FGDLTLTGNAVLKLDEWLIGANHMYHG- 195
Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
EG F+PEGLLS++ +I++ IIG G I +
Sbjct: 196 --------------EG-----------IAFDPEGLLSTLPAIVNVIIGYLAGRFIQNNGQ 230
Query: 318 HLARLKQWVTMGFALLIFGL 337
+ + + + GFAL+ GL
Sbjct: 231 NFETVAKLMMFGFALVFAGL 250
>gi|410900570|ref|XP_003963769.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Takifugu rubripes]
Length = 581
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 86/301 (28%), Positives = 138/301 (45%), Gaps = 66/301 (21%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ +RL SLD FRG A+ +M+ V++ GG + HAPWNG +AD VMP+F+FI+G ++ L
Sbjct: 196 RPKRLLSLDTFRGFALTVMVFVNYGGGGYWFFQHAPWNGLTVADLVMPWFVFIIGTSVVL 255
Query: 91 ALKRIP----DRADAVKKVIFRTLKLLFWGILLQGGF---SHAPDE-----LTYGVDVRM 138
A + + R ++K+ +RT G+LL GF +++P + L D
Sbjct: 256 AFRSMQRRRVRRLQLLRKITWRT------GVLLMLGFCFLNYSPRDGPCSVLVLAQDSWS 309
Query: 139 IRLCGVLQRIALSYLLVSLV-EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
G+ +LL S+ + VQD +Y WL+ + ++L
Sbjct: 310 PAASGL-------HLLHSITPHRWWSSVQD----------VVVYWPQWLIIILLETLWLC 352
Query: 198 LLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
+ + VPD + D+G N T G A GYIDR + G N MY +
Sbjct: 353 VTFLMPVPDCPTGYLGAGGIGDHGLYPNCTGG----------AAGYIDRWMFGDN-MYRY 401
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
P + TQ PF+PEG+L +V+SI+ +G+ G +++ +
Sbjct: 402 PTCKEMYQTTQ------------------PFDPEGVLGTVNSIVMGFLGMQAGKILLFYR 443
Query: 317 G 317
G
Sbjct: 444 G 444
>gi|297300348|ref|XP_001115683.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Macaca mulatta]
Length = 547
Score = 96.7 bits (239), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 83/294 (28%), Positives = 138/294 (46%), Gaps = 49/294 (16%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNG-----CNLADFVMPFFLFIVGVAI 88
RL S+D FRG+A+ LM+ V++ GG + HA WNG C + F M F+FI+G +I
Sbjct: 267 RLRSVDTFRGIALILMVFVNYGGGKYWYFKHASWNGTLPMQCGICIFAM-MFVFIMGSSI 325
Query: 89 ALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
L++ I R + + K+ +R+ L+ GI++ P+ + +R+ GV
Sbjct: 326 FLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVN-----PNYCLGPLSWDKVRIPGV 380
Query: 145 LQRIALSYLLVSLVE-IFTKDVQDK---DQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
LQR+ ++Y +V+++E +F K V + ++S WL+ + ++L L +
Sbjct: 381 LQRLGVTYFVVAVLELLFAKPVPEHCALERSCLSLRDITSSWPQWLLILALEGLWLGLTF 440
Query: 201 GTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAW 259
VP + D+GK N T G A GYIDR +LG +H+Y HP+
Sbjct: 441 LLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHLYQHPS- 489
Query: 260 RRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH-FGHVI 312
++PEG+L +++SI+ +GV F H I
Sbjct: 490 -----------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQVFVHFI 526
>gi|374263976|ref|ZP_09622521.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
gi|363535543|gb|EHL28992.1| hypothetical protein LDG_8987 [Legionella drancourtii LLAP12]
Length = 372
Score = 95.9 bits (237), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 81/308 (26%), Positives = 135/308 (43%), Gaps = 73/308 (23%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
++R+ SLD+FRGL +ALM+LV+ G +P + HA WNGC LAD V P FLFIVGV
Sbjct: 5 SKRILSLDVFRGLTMALMVLVNSQGSRSIYPILDHAAWNGCTLADLVFPAFLFIVGVTTV 64
Query: 90 LALKRIPDRADAVKKVIFRT-LKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
++L R +A + I+++ LK L + P + + +R+ G+LQRI
Sbjct: 65 VSLNRQVTTNEAARLDIYKSILKRSILLFLFGLFLNAFPFH--FDLSFANLRIYGILQRI 122
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
A+ Y + +L+ + + K Q + + I Y W+W+ T +P
Sbjct: 123 AICYFICALIYL---NTTVKTQIILFWGILLGY-WYWI---------------TQIPVPG 163
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
F+ +L+ N V Y+D+ +
Sbjct: 164 FS-------------------GGQLSLANNWVAYVDKMIF-------------------- 184
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
+P H F+PEGL+S++S++ +T+ G+ GH ++ + +
Sbjct: 185 ----------SPVHLHKNFDPEGLISTISAVATTLAGLITGHFLLMQLSKKKKCLLMFLV 234
Query: 329 GFALLIFG 336
G A L+ G
Sbjct: 235 GMAFLVLG 242
>gi|330792857|ref|XP_003284503.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
gi|325085533|gb|EGC38938.1| hypothetical protein DICPUDRAFT_18260 [Dictyostelium purpureum]
Length = 373
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 127/304 (41%), Gaps = 86/304 (28%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+R++SLD+ RG+ + MILVD+ GG WP + WNG + AD + P FLFI G ++A
Sbjct: 2 KRMSSLDVARGITIFGMILVDNQGGPDVIWP-LKETEWNGLSTADLIFPSFLFICGFSVA 60
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
LALK + +I RT L F L + + + R+ GVLQRIA
Sbjct: 61 LALKTAKNTRSTWYNIIRRTFLLFFIQCFL--------NLMAHHFVFSSFRVMGVLQRIA 112
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
L Y L S F F ++ + V Y++++Y VP
Sbjct: 113 LCYFL----------------SCVSFLCFPVFLQRLFLLGTT-VTYISVMYALPVPG--- 152
Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
CG + L P CNA Y+D KV G N ++
Sbjct: 153 -----------------CG-KGVLTPTCNAGAYLDFKVFGPNMIH--------------- 179
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-----HTKGHLARLKQ 324
P +PEGLLS++S+ ++T +G+ FG V + ++ + +
Sbjct: 180 ----------------PNDPEGLLSTLSAFITTWMGLEFGRVFTTYYRKYDYSNVDLIVR 223
Query: 325 WVTM 328
W+ M
Sbjct: 224 WIVM 227
>gi|295132874|ref|YP_003583550.1| hypothetical protein ZPR_1009 [Zunongwangia profunda SM-A87]
gi|294980889|gb|ADF51354.1| membrane protein [Zunongwangia profunda SM-A87]
Length = 376
Score = 95.5 bits (236), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 89/302 (29%), Positives = 129/302 (42%), Gaps = 92/302 (30%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
++R SLD+FRGL +ALMILV+ G +P + HA W G LAD V P FLF VG A+
Sbjct: 6 SERFLSLDVFRGLTIALMILVNTPGTGADLYPYLVHAQWFGFTLADLVFPSFLFAVGNAM 65
Query: 89 ALALKRIPDR--ADAVKKVIFRTLKLLFWGILL---------QGGFSHAPDELTYGVDVR 137
+ ++++ + AD KKV+ RT + G L+ +G +P T
Sbjct: 66 SFSMRKFQEAAPADFWKKVLKRTAIIFLLGFLMYWFPFFRMNEGHLELSPFSET------ 119
Query: 138 MIRLCGVLQRIALSYLL-VSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
R+ GVLQRIAL Y LV F+ +++G + A +L+ Y
Sbjct: 120 --RIMGVLQRIALCYFFGAVLVRYFSV------KTIG------------FICAAILLAYW 159
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
+LYG P +L NA D +LG H+Y
Sbjct: 160 GILYGFGEPG------------------------HELEMATNAAAKFDYAILGEGHIY-- 193
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
+KDA PF+PEG+LS++ SI++ + G + V I K
Sbjct: 194 -------------------KKDA-----IPFDPEGILSTLPSIVNVLAG-YLAGVFIRRK 228
Query: 317 GH 318
G
Sbjct: 229 GK 230
>gi|66808259|ref|XP_637852.1| transmembrane protein [Dictyostelium discoideum AX4]
gi|60466271|gb|EAL64333.1| transmembrane protein [Dictyostelium discoideum AX4]
Length = 675
Score = 95.5 bits (236), Expect = 4e-17, Method: Compositional matrix adjust.
Identities = 54/138 (39%), Positives = 86/138 (62%), Gaps = 11/138 (7%)
Query: 26 EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
E+ + K RL SLD+FRG ++ +MI V++ GG + +H+ WNG +AD V P+F+FI+G
Sbjct: 198 ERENRKKDRLRSLDVFRGFSITIMIFVNYGGGGYWFFNHSLWNGLTVADLVFPWFVFIMG 257
Query: 86 VAIALALKRIPDRADAVKKVIFRTLKLLFWGILL--QGGFSHAPDELTYGVDVRMIRLCG 143
+A+ L+ + R K++IF+ KLL I+L G F + GVD++ R+ G
Sbjct: 258 IAMPLSFHAMEKRGTP-KRIIFQ--KLLRRSIILFALGLF------INNGVDLQQWRILG 308
Query: 144 VLQRIALSYLLVSLVEIF 161
VLQR ++SYL+V + +F
Sbjct: 309 VLQRFSISYLVVGSIMLF 326
>gi|284040246|ref|YP_003390176.1| hypothetical protein Slin_5410 [Spirosoma linguale DSM 74]
gi|283819539|gb|ADB41377.1| Protein of unknown function DUF2261, transmembrane [Spirosoma
linguale DSM 74]
Length = 385
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 92/300 (30%), Positives = 134/300 (44%), Gaps = 82/300 (27%)
Query: 16 ISEP--DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNG 69
++EP SD K + T RL SLD FRGL VA MILV++ G DW I HAPW+G
Sbjct: 3 VNEPIQKASDYGLKP-VGTSRLLSLDFFRGLTVAAMILVNNPG-DWGHIYAPLEHAPWHG 60
Query: 70 CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
D + PFFLFIVGV+I AL+ + V K++ R++ L LL + P
Sbjct: 61 WTPTDLIFPFFLFIVGVSITFALEGGKSKKGVVGKIVKRSVTL----FLLGLFLNFFPK- 115
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
D+ ++R+ GVLQRIA+ YL+ SL+ + T Q +LY +
Sbjct: 116 ----FDITLVRIPGVLQRIAVVYLVCSLIFLKTNSRQ------------QLY-----ILV 154
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
VL+ Y L+ T++ Y A L P N + D +L
Sbjct: 155 IVLIGYWLLM----------TVVPVPGVGY-----------ANLEPATNLAAWFDYTILT 193
Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
H+Y K A +W +PEG+LS++ ++ + +IG+ G
Sbjct: 194 PAHVY----------------------KPAKTW-----DPEGVLSTLPAVGTGLIGMLVG 226
>gi|301625227|ref|XP_002941812.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
partial [Xenopus (Silurana) tropicalis]
Length = 370
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 56/155 (36%), Positives = 88/155 (56%), Gaps = 13/155 (8%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
SE + +Q + +++RL SLD FRG ++ +M+ V++ GG + HAPWNG +AD V
Sbjct: 190 SEDNCGEQSKVP--ESRRLYSLDTFRGFSLTIMVFVNYGGGGYWFFEHAPWNGLTVADLV 247
Query: 77 MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGI-LLQGGFSHAPDELT 131
MP+F+FI+G ++ALA LKR R + K+ +RT L G+ L G + P
Sbjct: 248 MPWFVFIIGTSVALAFNAMLKRGLSRCQLLYKLTWRTCILFAIGVFFLNYGPADGP---- 303
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ R R+ GVLQR+ +Y +++L+ VQ
Sbjct: 304 --LSWRWARIPGVLQRLGFTYFVIALLHTCFHKVQ 336
>gi|427710153|ref|YP_007052530.1| heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
7107]
gi|427362658|gb|AFY45380.1| Heparan-alpha-glucosaminide N-acetyltransferase [Nostoc sp. PCC
7107]
Length = 387
Score = 94.7 bits (234), Expect = 6e-17, Method: Compositional matrix adjust.
Identities = 84/295 (28%), Positives = 129/295 (43%), Gaps = 75/295 (25%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPF 79
+ E S + RL SLD+FRG+A+A MILV++ G +P + HA W+GC D V PF
Sbjct: 7 NVMENSSTPSTRLVSLDVFRGIAIASMILVNNPGSWDYIYPPLDHAEWHGCTPTDLVFPF 66
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGV-- 134
FLFIVGVA+ + + V R L+ +LF L F+ D L G+
Sbjct: 67 FLFIVGVAMPFSFAKYTPENRPTATVYQRILRRGLILFALGLFLALFTLTLDWLIKGITP 126
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
+ +R+ GVLQRI+L+Y++ +L + ++ R + W++AA +L+
Sbjct: 127 NFSTLRIMGVLQRISLAYVIAALAVL----------NLSRRGL-------WILAAVILIG 169
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
Y + VP +G L P N GYIDR +LG H+Y
Sbjct: 170 YWLAMQFIPVP------------GFGA---------GNLTPEGNLGGYIDRIILG-KHIY 207
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
F+PEGL S++ ++++ +G G
Sbjct: 208 R----------------------------SGSFDPEGLFSTLPAVVTVFLGYFTG 234
>gi|375149929|ref|YP_005012370.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361063975|gb|AEW02967.1| Protein of unknown function DUF2261, transmembrane [Niastella
koreensis GR20-10]
Length = 392
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 93/324 (28%), Positives = 129/324 (39%), Gaps = 92/324 (28%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+R SLD+FRG VA MILV++ G W + HAPW+GC D V PFFLF VG A+
Sbjct: 3 KRFYSLDVFRGATVAFMILVNNPG-SWSNLYAPLEHAPWHGCTPTDLVFPFFLFAVGNAL 61
Query: 89 ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGG--FSHAPDELTY------GVDVRM 138
A + R+ + +KKVI R+ + G L D LT+ G + +
Sbjct: 62 AFVMPRLQEAGTTAFLKKVITRSFLIFLIGFFLNWSPFIRWDNDHLTFKAWEYAGANGNL 121
Query: 139 --IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
IR+ GVLQRIAL Y SL+ F K + A V L
Sbjct: 122 IGIRILGVLQRIALCYFFASLIIYFFK----------------------IRGAFVSAFVL 159
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----IDRKVLGINH 252
L Y W + ++AD P + GY +D+ +LG +H
Sbjct: 160 LLGY------WVLCMFFGNAAD-----------------PYSLNGYFGLGVDKAILGESH 196
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
MYH EG F+PEG+ S++++I+ I G G I
Sbjct: 197 MYHG---------------EG-----------VAFDPEGITSTLTAIVQVIFGYFVGFYI 230
Query: 313 IHTKGHLARLKQWVTMGFALLIFG 336
+ L G L+ G
Sbjct: 231 QQKGKNFEMLSHLFVAGCILIFTG 254
>gi|383115204|ref|ZP_09935962.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
gi|313695379|gb|EFS32214.1| hypothetical protein BSGG_2914 [Bacteroides sp. D2]
Length = 361
Score = 94.7 bits (234), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 90/316 (28%), Positives = 144/316 (45%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F A D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + K RF +A +LVVY L+G
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLVVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ SAD N VG ID +LG NHMY
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNERRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFVGYLLSYA 246
>gi|323447301|gb|EGB03229.1| hypothetical protein AURANDRAFT_68196 [Aureococcus anophagefferens]
Length = 399
Score = 94.4 bits (233), Expect = 9e-17, Method: Compositional matrix adjust.
Identities = 57/137 (41%), Positives = 79/137 (57%), Gaps = 12/137 (8%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFI 83
+ +S + R+ SLD+ RG AV LMI VD AG + + H+PW+G +AD VMPFF+F+
Sbjct: 4 NEPESARRPPRVRSLDVVRGFAVLLMIFVDDAGSAYAVLDHSPWDGLTIADVVMPFFIFM 63
Query: 84 VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY--GVDVRMI 139
VGV+ ALAL KR + V+ R L W + + PD TY G D+ +
Sbjct: 64 VGVSAALALGGKRT------LAPVLRRGATL--WVVGVAVQGGGLPDPTTYAWGYDLGTV 115
Query: 140 RLCGVLQRIALSYLLVS 156
R CG+LQRIA Y++ S
Sbjct: 116 RWCGILQRIAACYVVAS 132
>gi|300865789|ref|ZP_07110543.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300336202|emb|CBN55698.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 376
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 76/235 (32%), Positives = 112/235 (47%), Gaps = 51/235 (21%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+A+A MILV++ G +P + HA WNGC D + PFFLF VG A++
Sbjct: 1 MRLTSLDVFRGIAIASMILVNNPGSWDYVYPPLDHAEWNGCTPTDLIFPFFLFAVGAAMS 60
Query: 90 LALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVR---MIRLCG 143
+L + + + V +R L+ LLF LL FS D L G + IR+ G
Sbjct: 61 FSLSKYTEENPPISTVYWRILRRATLLFLLGLLLNSFSIFLDVLLNGSPIENFGKIRILG 120
Query: 144 VLQRIALSYLL--VSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
VLQRI+L+Y L ++++ + +++++ ++AA +L+ Y L
Sbjct: 121 VLQRISLAYFLAAIAILNLSSRNLR-------------------ILAATLLLGYWGALTL 161
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
VP YG L P N YIDR +LG H+Y
Sbjct: 162 IPVP------------GYGANL---------LTPEGNLGAYIDRLILGTQHLYRQ 195
>gi|395803976|ref|ZP_10483217.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
gi|395433620|gb|EJF99572.1| hypothetical protein FF52_18915 [Flavobacterium sp. F52]
Length = 423
Score = 93.6 bits (231), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 96/356 (26%), Positives = 145/356 (40%), Gaps = 120/356 (33%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+FRGL + LM +V++ G DW P + HA WNGC D V PFF+FI+GVA+
Sbjct: 4 ERLISLDVFRGLTILLMTIVNNPG-DWGNVYPPLLHAHWNGCTPTDLVFPFFIFIMGVAV 62
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGIL------------------------------ 118
LA+ K++ R+L++ GI
Sbjct: 63 PLAMPEKKYDETTFNKILIRSLRMFCLGIFFNFFGKIQLFGLDGIPLLLVRLIITFAVGY 122
Query: 119 -LQGGFSHAPDE------------LTYG--VDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
L G FS+ L YG + +RL GVLQRIA+ Y +VSL+ + T
Sbjct: 123 ALMGNFSNKLKNIFAFSILAIYIILAYGGFENYADVRLPGVLQRIAIVYFVVSLLYLKT- 181
Query: 164 DVQDKDQSVGRFSIFRLYCWHWLMAACVLVV-YLALLYGTYVPDWQFTIINKDSADYGKV 222
K Q L VL+ Y A++ VP
Sbjct: 182 --SRKTQ---------------LFTGIVLLFGYWAIMTLVPVP----------------- 207
Query: 223 FNVTCGV-RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
G+ A L N ++D +L HMYH + +
Sbjct: 208 -----GIGEANLERGTNLAAWVDSVLLK-GHMYH----------------------ETNT 239
Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
W +PEG+LS++ SI++ IIG+ G +++ + + ++ +G +L+ FGL
Sbjct: 240 W-----DPEGILSTIPSIVNGIIGLFIGQILLLNITKIQKAQRMGMIGTSLIFFGL 290
>gi|428308802|ref|YP_007119779.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
gi|428250414|gb|AFZ16373.1| hypothetical protein Mic7113_0454 [Microcoleus sp. PCC 7113]
Length = 381
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/283 (30%), Positives = 124/283 (43%), Gaps = 80/283 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAG--GD---WPEISHAPWNGCNLADFVMPFFLFIVGV 86
++RL SLD+FRG+ +A MILV+ G GD +P + HA WNG D V PFFLFIVG
Sbjct: 9 SKRLTSLDVFRGITIAGMILVNMIGVAGDKNVYPPLLHADWNGFTPTDLVFPFFLFIVGA 68
Query: 87 AIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
A+A + + ++I R+L L GILL G + + + IR+
Sbjct: 69 AMAFSFSKYKHGNKPTPTVYWRIIRRSLILFALGILLNGFWEY---------NWSSIRIM 119
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQRI+L+YL+ SL+ + +V K Q W +AA +L+ Y +
Sbjct: 120 GVLQRISLTYLIASLIVL---NVPRKGQ--------------WAIAAFLLIGYWFAMSLI 162
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
VP DYG L N Y DR ++ H+Y
Sbjct: 163 PVP------------DYG---------MGNLTREGNFGAYFDRLIIPTAHLY-------- 193
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+ F G +PEGL S++ +++S + G
Sbjct: 194 ----KGDDFNG------------MGDPEGLFSTLPAVVSVLFG 220
>gi|299144716|ref|ZP_07037784.1| putative membrane protein [Bacteroides sp. 3_1_23]
gi|298515207|gb|EFI39088.1| putative membrane protein [Bacteroides sp. 3_1_23]
Length = 361
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 144/316 (45%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F A D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + K RF +A +L+VY L+G
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLIVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ SAD N VG ID +LG NHMY
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFAGYLLSYA 246
>gi|281209662|gb|EFA83830.1| hypothetical protein PPL_02898 [Polysphondylium pallidum PN500]
Length = 409
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 91/332 (27%), Positives = 130/332 (39%), Gaps = 87/332 (26%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFV 76
DV +R+ SLD RGL + MILVD+ GG WP + WNG + AD +
Sbjct: 24 DVDKDTTSKPPPKKRMLSLDTARGLTIFGMILVDNQGGPEVIWP-LKETDWNGISTADLI 82
Query: 77 MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
P FLFI G +I+LALK + +I RT+ LLF G + + +
Sbjct: 83 FPSFLFICGFSISLALKNAKNDRPTWINIIRRTI-LLF-------GIQLFLNLMAHKFVF 134
Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV-LVVY 195
R+ GVLQRI+L Y S L W +A + +Y
Sbjct: 135 STFRVMGVLQRISLCYCFSCC------------------SFMLLPKWAQRVALVISATIY 176
Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
L L+Y VP CG R + CNA GYID +L N ++
Sbjct: 177 LCLMYAYPVPG--------------------CG-RGNITRSCNAAGYIDNLILRKNMIH- 214
Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII-- 313
P +PEG +S+ S+ ++T +GV G ++
Sbjct: 215 ------------------------------PTDPEGFISTFSAFITTWMGVELGRILTTH 244
Query: 314 --HTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
G L +W+++G + GL L TN
Sbjct: 245 ARSADGWKDILIRWLSIGMVCAMIGLFLDATN 276
>gi|113475212|ref|YP_721273.1| hypothetical protein Tery_1515 [Trichodesmium erythraeum IMS101]
gi|110166260|gb|ABG50800.1| conserved hypothetical protein [Trichodesmium erythraeum IMS101]
Length = 366
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/310 (27%), Positives = 133/310 (42%), Gaps = 80/310 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+ +A MI+V++ G +P + HA W+GC D + PFFLFI+GVA+
Sbjct: 1 MRLKSLDVFRGITIASMIIVNNPGSWNHVYPPLLHAKWHGCTPTDLIFPFFLFIMGVAMT 60
Query: 90 LALKRIPDRADAVKKV---IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
+L + D+ + + IFR ++F LL GF + ++ IR+ GVLQ
Sbjct: 61 FSLSKYTDKNQPIPHIYQRIFRRCLIIFLFGLLLNGFPN--------YNLATIRVMGVLQ 112
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
RI+L YLL ++ ++ S +LY +A +L+ Y + VP
Sbjct: 113 RISLVYLLAAI-------------AILNLSRKQLYG----LATTLLIGYWIAMQLIPVP- 154
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
YG L+P N YIDR +L H+
Sbjct: 155 -----------GYG---------LGNLSPEGNFAAYIDRLILTQQHL------------- 181
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
W ++PEGL S++ +I++ +IG G + H + V
Sbjct: 182 ---------------WAGKQYDPEGLFSTLPAIVTVLIGYLTGEWLKHQSTNSRTTLNMV 226
Query: 327 TMGFALLIFG 336
G + L+ G
Sbjct: 227 ISGLSCLVVG 236
>gi|256425421|ref|YP_003126074.1| hypothetical protein Cpin_6469 [Chitinophaga pinensis DSM 2588]
gi|256040329|gb|ACU63873.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
Length = 358
Score = 93.2 bits (230), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 87/282 (30%), Positives = 123/282 (43%), Gaps = 77/282 (27%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
QRL SLD FRGL VA MILV++ G +P + H+ WNGC D V PFFLF+VGV++
Sbjct: 3 QRLLSLDFFRGLTVAAMILVNNPGSWSYVYPPLEHSKWNGCTPTDLVFPFFLFMVGVSVT 62
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
AL +AD + IL G + L D +R+ GVLQRI+
Sbjct: 63 FALSS--RKADVSGHTSLIIHIIRRAAILFAIGLAF---RLIPSFDFHNLRILGVLQRIS 117
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV--LVVYLALLYGTYVPDW 207
+ +L++SL+ + K + R WL C+ LV+Y L+ VP
Sbjct: 118 IVFLVISLLYL-------KTGTKPRI---------WL---CISFLVIYWLLMTVVPVP-- 156
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
YG A L N +IDR VLG H++
Sbjct: 157 ----------GYGP---------ANLEAETNLAAWIDRTVLGEQHLW------------- 184
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
K A +W +PEGLLS++ +I + ++G+ G
Sbjct: 185 ---------KQARTW-----DPEGLLSTLPAISTGLLGIMTG 212
>gi|333031144|ref|ZP_08459205.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
gi|332741741|gb|EGJ72223.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
Length = 363
Score = 92.8 bits (229), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 77/315 (24%), Positives = 138/315 (43%), Gaps = 78/315 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
++RL SLD+ RG+ VA MILV++ G ++ + HA W+G N AD V P F+F++G++
Sbjct: 8 SKRLLSLDVLRGITVAGMILVNNTGSCGYNYTALRHASWDGLNFADLVFPMFMFMMGIST 67
Query: 89 ALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L++ ++ A K+ RT L+ G+ ++ + + G+++ +RL GV+QR
Sbjct: 68 YISLRKYENNKKTAFYKIFKRTSLLIIIGLFMECIITW----IEVGLNLSTLRLMGVMQR 123
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
+ L Y G ++ LY H +L + L++L G ++
Sbjct: 124 LGLCY--------------------GITALLSLYVPH----KYLLKIALSVLLGYFIIQI 159
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
+ +K + N +G +DR VLG+NH+Y Q
Sbjct: 160 VGSGFDKSAE--------------------NVIGVVDRSVLGVNHIY-----------LQ 188
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
F +PEG+LS++ +I +IG G I+ + H ++
Sbjct: 189 GKQF---------------VDPEGVLSTLPAIAQVMIGFFCGRKILEKREHKQQMLILYR 233
Query: 328 MGFALLIFGLTLHFT 342
+G L G +
Sbjct: 234 LGSLFLFVGFVFSYV 248
>gi|345856403|ref|ZP_08808889.1| putative membrane protein [Desulfosporosinus sp. OT]
gi|344330527|gb|EGW41819.1| putative membrane protein [Desulfosporosinus sp. OT]
Length = 381
Score = 92.8 bits (229), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/313 (28%), Positives = 140/313 (44%), Gaps = 91/313 (29%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
+EK K RL +D+FRG+AVA+M++V + G ++P++ HA WNG +AD PFF+
Sbjct: 8 EEKG--KFGRLNCIDVFRGIAVAIMLIVTNPGNPLRNYPQLRHAAWNGYTVADLAFPFFM 65
Query: 82 FIVGVAIALAL-KRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
I+G+ I A+ KRI + + ++ R++ L GILL G + D+
Sbjct: 66 LIMGMVIPYAVDKRIKEGKSNLSIFNHILIRSIGLFCIGILLNGFPVY---------DLS 116
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKD--QSVGRFSIFRLYCWHWLMAACVLVVY 195
+IR+ GVLQRIA++YL ++E+ K K Q + S +A ++ VY
Sbjct: 117 IIRIPGVLQRIAIAYLCTGIIELIVKATVKKSYLQIIVESS----------LALSIISVY 166
Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
LL PD++ N V ID L H+Y
Sbjct: 167 SVLLIKYSFPDYK-----------------------------NLVQTIDLYFLK-GHLY- 195
Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
P W +PEG+L++ SSI + I G G+ I+
Sbjct: 196 -----------------------TPDW-----DPEGILTTFSSIATAIFGSIAGN-ILFN 226
Query: 316 KGHLARLKQWVTM 328
+ + AR K+++T+
Sbjct: 227 RDNKAR-KKFITI 238
>gi|160884063|ref|ZP_02065066.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
gi|423291476|ref|ZP_17270324.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
CL02T12C04]
gi|156110405|gb|EDO12150.1| hypothetical protein BACOVA_02039 [Bacteroides ovatus ATCC 8483]
gi|392663476|gb|EIY57026.1| hypothetical protein HMPREF1069_05367 [Bacteroides ovatus
CL02T12C04]
Length = 361
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 89/316 (28%), Positives = 143/316 (45%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F A D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + K RF +A +L VY L+G
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLAVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ SAD N VG ID +LG NHMY
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFVGYLLSYA 246
>gi|293371912|ref|ZP_06618316.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|292633158|gb|EFF51735.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
Length = 361
Score = 92.4 bits (228), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 144/316 (45%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F A D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + K RF +A +L+VY L+G
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLIVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ SAD N VG +D +LG NHMY
Sbjct: 163 -------EKSAD-------------------NIVGIVDSAILGSNHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFAGYLLSYA 246
>gi|305665862|ref|YP_003862149.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
gi|88710633|gb|EAR02865.1| hypothetical protein FB2170_06240 [Maribacter sp. HTCC2170]
Length = 362
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 91/314 (28%), Positives = 133/314 (42%), Gaps = 83/314 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
R+ S+DIFRGL + LMILV+ G W + HA W+G D V PFFLFIVG +I
Sbjct: 3 NRVISVDIFRGLTIVLMILVNTPG-TWSSVYTPFLHAEWHGYTPTDLVFPFFLFIVGTSI 61
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+ A ++ KK+ R+LKL+ G+ L G F+ + + D IR GVLQRI
Sbjct: 62 SFAYQKKKASTQTYKKIAVRSLKLIGLGLFL-GAFTLS---FPFIKDFADIRFPGVLQRI 117
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC-VLVVYLALLYGTYVPDW 207
+ +L ++ +F + W L+ C VL+V LL G YVP
Sbjct: 118 GVVFLFTAV-------------------LFVNFNWKTLLGICIVLLVGYWLLMG-YVP-- 155
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLN-PPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
G+ + + P N Y+D K+ G H Y
Sbjct: 156 ------------------VEGIESTFDRAPNNLANYLDVKIFG-THNY------------ 184
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
+ D ++PEG LS++ SI S + GV G ++ K + + V
Sbjct: 185 ---------KPD--------YDPEGFLSTLPSIASALTGVFTGLILTSKKDN--KTMVLV 225
Query: 327 TMGFALLIFGLTLH 340
+G +L G H
Sbjct: 226 GLGVVMLALGYLWH 239
>gi|441501363|ref|ZP_20983482.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
imtechensis AK7]
gi|441434899|gb|ELR68324.1| N-acetylglucosamine related transporter, NagX [Fulvivirga
imtechensis AK7]
Length = 368
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 86/286 (30%), Positives = 129/286 (45%), Gaps = 80/286 (27%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +RL SLD+FRG+ +A MI+V++ G +P + HA W+GC L D V PFFLFIVGVA
Sbjct: 3 KNKRLLSLDVFRGITIAAMIVVNNPGSWAAVYPPLLHAGWHGCTLTDLVFPFFLFIVGVA 62
Query: 88 IALALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
+ L+L R + K++IF LK +LF L G F +A D+ +R+ GV
Sbjct: 63 VCLSLSRAVEDKGRHKQIIFTVLKRSVILF----LIGLFLNAFPYF----DLYHLRIPGV 114
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIA+ + + + ++ K +G + +L+VY L
Sbjct: 115 LQRIAVVFFICAF--LYLKTGWKVQVYIG---------------SAILMVYWLL------ 151
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
F II A G L N ++D ++L HM W +K
Sbjct: 152 ----FLIIPIPGAATG-----------SLESGANLAAWVDSQLLT-GHM-----WEVTKT 190
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++PEG+LS++ +I++ IIGV G
Sbjct: 191 ----------------------WDPEGVLSTLPAIVTGIIGVLVGQ 214
>gi|298480127|ref|ZP_06998326.1| membrane protein [Bacteroides sp. D22]
gi|336404355|ref|ZP_08585053.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
gi|295085510|emb|CBK67033.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
gi|298273936|gb|EFI15498.1| membrane protein [Bacteroides sp. D22]
gi|335943683|gb|EGN05522.1| hypothetical protein HMPREF0127_02366 [Bacteroides sp. 1_1_30]
Length = 361
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 144/316 (45%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F + D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYDFQCRPAITKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + K RF +A +L+VY L+G
Sbjct: 122 LGICYGITALLAV---AIPHK-----RFMP---------LAIILLIVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ SAD N VG ID +LG NHMY
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGANHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFAGYLLSYA 246
>gi|321463338|gb|EFX74354.1| hypothetical protein DAPPUDRAFT_129175 [Daphnia pulex]
Length = 409
Score = 92.0 bits (227), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 67/291 (23%), Positives = 133/291 (45%), Gaps = 42/291 (14%)
Query: 42 RGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA 101
R L + MI+V++ GG + H+PWNG +AD + P F++I+G + L+L RA +
Sbjct: 29 RRLTIVFMIIVNYGGGGYWFFEHSPWNGITIADVIFPCFVWILGASCVLSLNSQLRRALS 88
Query: 102 VKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
+++++ R++ +L G++L ++ +++ R+ GVLQR++ YL+V+L
Sbjct: 89 KQRILYSTVRRSVAMLVIGLVLNSLSNN---------NIKTFRIPGVLQRMSFVYLIVAL 139
Query: 158 VEIFTKDVQDKDQSVGRFSIFRLYC-W-HWLMAACVLVVYLALLYGTYVPDWQFTIINKD 215
+E+ D +D + I + C W W++ + L + + VPD
Sbjct: 140 IELTGFDPEDNQRYAWFAPIRDIVCSWRQWIIVTVFVSTQLLITFLLPVPDCPLGYTGAG 199
Query: 216 SADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL 275
+ ++ G A+L +D + G +H+Y P T + ++ L
Sbjct: 200 GLEKNGLYRNCTGGAARL---------VDVSLFGNDHIYQRP--------TPRAIYDATL 242
Query: 276 RKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
F+PEG L ++ +L +G V++ + R+ +W+
Sbjct: 243 ----------AFDPEGALGGLTCVLCAYLGAEAAKVLLVFPANKQRIVRWM 283
>gi|423214205|ref|ZP_17200733.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693150|gb|EIY86385.1| hypothetical protein HMPREF1074_02265 [Bacteroides xylanisolvens
CL03T12C04]
Length = 361
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 144/316 (45%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F + D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + K RF +A +L VY L+G
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLAVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ SAD N VG ID +LG NHMY
Sbjct: 163 -------EKSAD-------------------NIVGMIDSAILGSNHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II+ K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIINIKDNDRRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFAGYLLSYA 246
>gi|255534024|ref|YP_003094396.1| hypothetical protein Phep_4143 [Pedobacter heparinus DSM 2366]
gi|255347008|gb|ACU06334.1| conserved hypothetical protein [Pedobacter heparinus DSM 2366]
Length = 384
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 87/285 (30%), Positives = 127/285 (44%), Gaps = 73/285 (25%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
K RL SLD FRG VA MILV++ G DW I HA WNGC D + PFFLFIVGV
Sbjct: 9 KPVRLLSLDFFRGATVAAMILVNNPG-DWGHIYAPLEHAEWNGCTPTDLIFPFFLFIVGV 67
Query: 87 AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV-RMIRLCGVL 145
+IA A+ + K I + LK L S P T ++ + +R+ GVL
Sbjct: 68 SIAYAMGGKKADPSSHGKTIVKALKRASILFGLGLFLSLFPKVFTAPLEAFQQVRIPGVL 127
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRIA+ +L+ ++ IF K+ + +IF++ +L VY AL+ T++P
Sbjct: 128 QRIAVVFLISAI--IFLKNTEK--------NIFKILL-------AILAVYWALM--TFIP 168
Query: 206 DWQFTIINKDSADYGKVFNVTCGV-RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
GV A L N ++DR +L H++
Sbjct: 169 --------------------VPGVGYANLEKETNLGAWLDRSILTEAHLW---------- 198
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
K A +W +PEG+LS++ +I + + G+ G
Sbjct: 199 ------------KSAKTW-----DPEGILSTLPAIATGLFGILVG 226
>gi|237719042|ref|ZP_04549523.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|229451820|gb|EEO57611.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
Length = 361
Score = 91.7 bits (226), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 143/316 (45%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F + D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + K RF +A +LVVY L+G
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLVVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ S D N VG ID +LG NHMY
Sbjct: 163 -------EKSVD-------------------NIVGMIDSAILGANHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFVGYLLSYA 246
>gi|311747386|ref|ZP_07721171.1| membrane protein [Algoriphagus sp. PR1]
gi|126579104|gb|EAZ83268.1| membrane protein [Algoriphagus sp. PR1]
Length = 381
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 86/307 (28%), Positives = 137/307 (44%), Gaps = 78/307 (25%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
K+ L R +LD+ RGL +A MI+V+ AG DW + +HA W+G D V P FLF
Sbjct: 6 KTDLLKNRYLALDVLRGLTIAFMIVVNSAG-DWSNLYAPLAHAKWHGFTPTDLVFPTFLF 64
Query: 83 IVGVAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRM 138
+VG A++ ++K++ + + KKV RTL + G LL F + + +++
Sbjct: 65 VVGNAMSFSMKKLQEMPTSAFFKKVGKRTLLIFLIGWLLNAFPFYDISETGNFSLINITE 124
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
+RL GVLQRIAL Y +++ ++ V RL W+ + L+ Y +
Sbjct: 125 VRLFGVLQRIALCYFFAAII-LYIGGV-------------RL---GWIFSGIALLTYWGI 167
Query: 199 LYGTYVPDWQFTIINKDSAD-YGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
+Y + DS+D YG NA +D ++G++ MY
Sbjct: 168 MY-----------VFGDSSDPYGLT--------------GNAAIKLDLSLIGVDRMYGG- 201
Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
EG PF+PEGLLS++ SI++ I G G ++
Sbjct: 202 --------------EG-----------IPFDPEGLLSTLPSIVNVIAGYIIGKMVQKYGN 236
Query: 318 HLARLKQ 324
L +K+
Sbjct: 237 TLESIKK 243
>gi|423293378|ref|ZP_17271505.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
CL03T12C18]
gi|392678321|gb|EIY71729.1| hypothetical protein HMPREF1070_00170 [Bacteroides ovatus
CL03T12C18]
Length = 361
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 143/316 (45%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F + D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + K RF +A +LVVY L+G
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLVVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ S D N VG ID +LG NHMY
Sbjct: 163 -------EKSVD-------------------NIVGMIDSAILGANHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFVGYLLSYA 246
>gi|255013110|ref|ZP_05285236.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
gi|256838332|ref|ZP_05543842.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|410102572|ref|ZP_11297498.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
gi|423333958|ref|ZP_17311739.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
CL03T12C09]
gi|256739251|gb|EEU52575.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|409226793|gb|EKN19699.1| hypothetical protein HMPREF1075_03390 [Parabacteroides distasonis
CL03T12C09]
gi|409238644|gb|EKN31435.1| hypothetical protein HMPREF0999_01270 [Parabacteroides sp. D25]
Length = 368
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 132/288 (45%), Gaps = 75/288 (26%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
++ RL SLD+ RG+ +A MILV++ G + + HA WNG D V PFF+FI+GV+
Sbjct: 4 QSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGVS 63
Query: 88 IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVDVRMIRLCGV 144
+ +L++ + ++V KV+ RT+ + G+ L F H + T D + +R+ GV
Sbjct: 64 MFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHVCYNGFT---DFQNLRILGV 119
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
+QR+AL+Y SL+ + +I Y +AA +L+ Y ALL T+
Sbjct: 120 MQRLALAYGFGSLIGL---------------AINHKYILQ--VAAGILIFYWALLGFTHS 162
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ +++DS + +DR + G +HMYH
Sbjct: 163 ME-----MSEDS----------------------IIAIVDRTLFGTSHMYH--------- 186
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
D F+PEGLLS + SI ++G + G VI
Sbjct: 187 ------------DDMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVI 222
>gi|150009610|ref|YP_001304353.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
gi|262383102|ref|ZP_06076239.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|298374002|ref|ZP_06983960.1| membrane protein [Bacteroides sp. 3_1_19]
gi|149938034|gb|ABR44731.1| putative transmembrane protein [Parabacteroides distasonis ATCC
8503]
gi|262295980|gb|EEY83911.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|298268370|gb|EFI10025.1| membrane protein [Bacteroides sp. 3_1_19]
Length = 368
Score = 91.3 bits (225), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 82/288 (28%), Positives = 132/288 (45%), Gaps = 75/288 (26%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
++ RL SLD+ RG+ +A MILV++ G + + HA WNG D V PFF+FI+GV+
Sbjct: 4 QSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGVS 63
Query: 88 IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVDVRMIRLCGV 144
+ +L++ + ++V KV+ RT+ + G+ L F H + T D + +R+ GV
Sbjct: 64 MFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHVCYNGFT---DFQNLRILGV 119
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
+QR+AL+Y SL+ + +I Y +AA +L+ Y ALL T+
Sbjct: 120 MQRLALAYGFGSLIGL---------------AINHKYILQ--VAAGILIFYWALLGFTHS 162
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ +++DS + +DR + G +HMYH
Sbjct: 163 ME-----MSEDS----------------------IIAIVDRTLFGTSHMYH--------- 186
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
D F+PEGLLS + SI ++G + G VI
Sbjct: 187 ------------DDMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVI 222
>gi|149280688|ref|ZP_01886799.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
gi|149228553|gb|EDM33961.1| hypothetical protein PBAL39_24475 [Pedobacter sp. BAL39]
Length = 385
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 85/281 (30%), Positives = 121/281 (43%), Gaps = 71/281 (25%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD FRG VA MILV++ G DW I HA W+GC D V PFFLFIVGV+IA
Sbjct: 13 RLLSLDFFRGATVAAMILVNNPG-DWGHIYAPLEHADWHGCTPTDLVFPFFLFIVGVSIA 71
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV-RMIRLCGVLQRI 148
A+ + K I + LK L S P+ + V+ + +R+ GVLQRI
Sbjct: 72 YAMGSKKTDPSSHGKTILKALKRTLILFGLGLFLSLFPNVFSNPVEAFQQVRIPGVLQRI 131
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
A+ + + S+ IF K + +IFR +L Y A++ VP
Sbjct: 132 AVVFFICSI--IFLKSSER--------TIFR-------TMVIILAAYWAIMTFIPVPGTG 174
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
F + K++ N +IDR V H+ W+ SK
Sbjct: 175 FPNLEKET---------------------NLGAWIDRGVFTEAHL-----WKSSKT---- 204
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
++PEGLLS++ +I + + G+ G
Sbjct: 205 ------------------WDPEGLLSTLPAIATGLFGILVG 227
>gi|256420508|ref|YP_003121161.1| hypothetical protein Cpin_1463 [Chitinophaga pinensis DSM 2588]
gi|256035416|gb|ACU58960.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
Length = 374
Score = 90.9 bits (224), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 87/317 (27%), Positives = 137/317 (43%), Gaps = 76/317 (23%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVG 85
++ QR LD+FRGL V MI+V+ G D + ++HA WNGC D V P FLF VG
Sbjct: 2 TNTTPQRFLPLDVFRGLTVCFMIIVNTPGWDTSYYILNHAQWNGCTPTDMVFPSFLFAVG 61
Query: 86 VAIALALKRIP--DRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRL 141
A++ ++++ + + K+ RTL + G L+ H L + + + R+
Sbjct: 62 NAMSFSMRKFQQLENTAVLSKIFRRTLLIFLLGFLMYWLPFVRHTESGLEF-IPLSDTRI 120
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQRIAL Y SL+ H+L V V LL G
Sbjct: 121 LGVLQRIALCYCFASLLI------------------------HYLPKKAVWAVSAVLLLG 156
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
+ + F D AD + + NA + D+ ++G +H+YH
Sbjct: 157 YWAVMYAF----GDPAD-------------RYSLTGNAALHFDKLIMGDSHLYHG----- 194
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
EG F+PEGLLS++ +I++ I G + G + I +G +
Sbjct: 195 ----------EG-----------IAFDPEGLLSTLPAIVNVIAGYYTG-LFIQQEGKTGK 232
Query: 322 -LKQWVTMGFALLIFGL 337
L++ + MG L++ L
Sbjct: 233 GLRKLLQMGALLILVAL 249
>gi|262406057|ref|ZP_06082607.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294648122|ref|ZP_06725665.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
gi|294806856|ref|ZP_06765681.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
gi|345510562|ref|ZP_08790129.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
gi|229443274|gb|EEO49065.1| hypothetical protein BSAG_00775 [Bacteroides sp. D1]
gi|262356932|gb|EEZ06022.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292636506|gb|EFF54981.1| conserved domain protein [Bacteroides ovatus SD CC 2a]
gi|294445885|gb|EFG14527.1| conserved domain protein [Bacteroides xylanisolvens SD CC 1b]
Length = 361
Score = 90.5 bits (223), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 85/316 (26%), Positives = 140/316 (44%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F A D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITAIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + +A +LVVY L+G
Sbjct: 122 LGICYGITALLAVTIPHKKFMP-----------------LAIILLVVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ S D N VG +D +LG NHMY
Sbjct: 163 -------EKSVD-------------------NIVGIVDSAILGSNHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFAGYLLSYA 246
>gi|300770061|ref|ZP_07079940.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
gi|300762537|gb|EFK59354.1| conserved hypothetical protein [Sphingobacterium spiritivorum ATCC
33861]
Length = 404
Score = 90.1 bits (222), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 129/298 (43%), Gaps = 90/298 (30%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
QR SLD+FRG VALMI+V++ G W + HA W+GC D V PFFLF VG A+
Sbjct: 3 QRYYSLDVFRGATVALMIMVNNPG-SWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61
Query: 89 ALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGG--FSHAPDEL-----TYGVD-VRM 138
+ + R+ + AV +KV+ RT+ + G+ + A D L +Y D +R
Sbjct: 62 SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDSMRG 121
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
+R+ GVLQRIAL+Y S++ + ++ + W ++ +LVVY A+
Sbjct: 122 VRILGVLQRIALAYCFASIIAYYFRE--------------KAIIW---ISTFILVVYWAV 164
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI----DRKVLGINHMY 254
C P + G+ D ++LG+ H+Y
Sbjct: 165 ----------------------------CAFLGTPGDPYSLQGWFGTAYDIQILGVAHVY 196
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
EG PF+PEGL+S++ +I+ ++G G I
Sbjct: 197 KG---------------EG-----------VPFDPEGLMSTLPAIVQVVLGYLAGTYI 228
>gi|423215264|ref|ZP_17201791.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
CL03T12C04]
gi|392691832|gb|EIY85072.1| hypothetical protein HMPREF1074_03323 [Bacteroides xylanisolvens
CL03T12C04]
Length = 371
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 89/319 (27%), Positives = 136/319 (42%), Gaps = 76/319 (23%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+K++RL SLD+ RG+ + MILV++ G + + HA WNG D V PFF+FI+GV
Sbjct: 1 MKSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60
Query: 87 AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRLC 142
+++ AL R + K++ RT+ L G+ L FS GV+ IR+
Sbjct: 61 SMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-WFSLV----CAGVEQPFSQIRIL 115
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQR+AL+Y SL+ + + + W ++A +L+ Y+ LL
Sbjct: 116 GVLQRLALAYFFGSLLIMSVRRPAN-------------LAW---ISAIILIGYIVLL--- 156
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
G F ++ N + DR + G H+Y
Sbjct: 157 ---------------ALGNGFELS---------EQNIIAVTDRTLFGETHLY-------- 184
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
R+ P F+PEGLLS++ I IIG G+++ RL
Sbjct: 185 -------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRL 231
Query: 323 KQWVTMGFALLIFGLTLHF 341
Q +G ALL G L +
Sbjct: 232 LQISILGIALLFAGWLLSY 250
>gi|227538516|ref|ZP_03968565.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
gi|227241435|gb|EEI91450.1| transmembrane protein [Sphingobacterium spiritivorum ATCC 33300]
Length = 404
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 129/298 (43%), Gaps = 90/298 (30%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
QR SLD+FRG VALMI+V++ G W + HA W+GC D V PFFLF VG A+
Sbjct: 3 QRYYSLDVFRGATVALMIMVNNPG-SWGHMFAPLKHAEWHGCTPTDLVFPFFLFAVGNAM 61
Query: 89 ALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGG--FSHAPDEL-----TYGVD-VRM 138
+ + R+ + AV +KV+ RT+ + G+ + A D L +Y D +R
Sbjct: 62 SFVIPRLQEAGPAVFWQKVLKRTVLIFLIGLFINWWPFVQWAQDTLVFKQWSYADDPMRG 121
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
+R+ GVLQRIAL+Y S++ + ++ + W ++ +LVVY A+
Sbjct: 122 VRILGVLQRIALAYCFASIIAYYFRE--------------KAIIW---ISTFILVVYWAV 164
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI----DRKVLGINHMY 254
C P + G+ D ++LG+ H+Y
Sbjct: 165 ----------------------------CAFLGTPGDPYSLQGWFGTAYDIQILGVAHVY 196
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
EG PF+PEGL+S++ +I+ ++G G I
Sbjct: 197 KG---------------EG-----------VPFDPEGLMSTLPAIVQVVLGYLAGTYI 228
>gi|223940501|ref|ZP_03632350.1| conserved hypothetical protein [bacterium Ellin514]
gi|223890825|gb|EEF57337.1| conserved hypothetical protein [bacterium Ellin514]
Length = 410
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 138/324 (42%), Gaps = 90/324 (27%)
Query: 15 IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMI----LVDHAGGDWP---------E 61
I+S P S Q S T+RL SLD RG + ++ LV WP +
Sbjct: 4 ILSPPLQSKPQVTSPSTTKRLLSLDALRGFDMFWIVGGEELVHALYNAWPNGPLGIINSQ 63
Query: 62 ISHAPWNGCNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGIL 118
+ H W G D + P F+FIVGV++ +L + + +A A+K+V FR+L L +G+L
Sbjct: 64 MDHKVWQGVAFYDLIFPLFVFIVGVSLVFSLTKAIEVNGKAAALKRVFFRSLLLYVFGLL 123
Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIF 178
+ GG S D IR GVLQRIA+ Y SLV F F
Sbjct: 124 IYGGISKGIDG---------IRWMGVLQRIAICYFSTSLV----------------FCFF 158
Query: 179 RLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV-------------FNV 225
+L + AA +L+ Y AL+ T+VP F + SA ++ +
Sbjct: 159 KLRG-MIVAAAALLLTYWALM--TFVP---FPDVRPASASPQEITKHNGFTNVAQLNLSS 212
Query: 226 TCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHA 285
T + + P N Y+D+K L P ++ W
Sbjct: 213 TTMLHGQFIPGVNLANYVDQKYL--------PGYK---------------------W-DG 242
Query: 286 PFEPEGLLSSVSSILSTIIGVHFG 309
++PEGLLS++ +I++ ++GV G
Sbjct: 243 TYDPEGLLSTLPAIVTCLLGVFAG 266
>gi|431798742|ref|YP_007225646.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
gi|430789507|gb|AGA79636.1| hypothetical protein Echvi_3416 [Echinicola vietnamensis DSM 17526]
Length = 363
Score = 90.1 bits (222), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 57/134 (42%), Positives = 76/134 (56%), Gaps = 16/134 (11%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGV 86
K +RL SLD+ RG+ +A MILV+ G W P + HA WNG DF+ PFFLFIVGV
Sbjct: 4 KNKRLISLDVLRGMTIAAMILVNFPG-SWEHVFPPLHHAQWNGITPTDFIFPFFLFIVGV 62
Query: 87 AIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
+I +A K D+ KK+ FR K+ G+LL P+ D IR+ GV
Sbjct: 63 SIVMAYAGKMEMDKTIVYKKLFFRGAKIFALGVLL----GMIPE-----FDFSAIRVAGV 113
Query: 145 LQRIALSYLLVSLV 158
LQRIAL ++ +L+
Sbjct: 114 LQRIALVFVACTLM 127
>gi|301307595|ref|ZP_07213552.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423337400|ref|ZP_17315144.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
CL09T03C24]
gi|300834269|gb|EFK64882.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409237229|gb|EKN30029.1| hypothetical protein HMPREF1059_01069 [Parabacteroides distasonis
CL09T03C24]
Length = 368
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 132/288 (45%), Gaps = 75/288 (26%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
++ RL SLD+ RG+ +A MILV++ G + + HA WNG D V PFF+FI+GV+
Sbjct: 4 QSGRLLSLDVMRGITIAGMILVNNPGSWKYVYTPLEHARWNGLTPTDLVFPFFMFIMGVS 63
Query: 88 IALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVDVRMIRLCGV 144
+ +L++ + ++V KV+ RT+ + G+ L F H + T D + +R+ GV
Sbjct: 64 MFFSLRKYNFKLSKESVTKVLRRTVLIFLVGLGLN-LFGHVCYNGFT---DFQNLRILGV 119
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
+QR+AL+Y SL+ + +I Y +AA +L+ Y ALL T+
Sbjct: 120 MQRLALAYGFGSLIGL---------------AINHKYILQ--VAAGILIFYWALLGFTHS 162
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ +++DS + +D+ + G +HMYH
Sbjct: 163 ME-----MSEDS----------------------IIAIVDKALFGTSHMYH--------- 186
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
D F+PEGLLS + SI ++G + G VI
Sbjct: 187 ------------DDMADGTRIAFDPEGLLSCIGSIAHVLLGFYVGKVI 222
>gi|338210835|ref|YP_004654884.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336304650|gb|AEI47752.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 363
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 90/309 (29%), Positives = 135/309 (43%), Gaps = 82/309 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
RL SLD+FRG+ VA MILV++ G DW + HA WNGC D + PFFLFIVGV++
Sbjct: 3 NRLLSLDVFRGMTVAAMILVNNPG-DWDHVYAPLLHAHWNGCTPTDLIFPFFLFIVGVSV 61
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
A A+ + P ++ K+I R+ L L + P D +R+ GVLQRI
Sbjct: 62 AFAMGKNP---PSLLKIIKRSAILF----GLGLFLNLYPK-----FDFANVRIPGVLQRI 109
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
AL YL+ SL IF K + K Q + +L+ Y L+
Sbjct: 110 ALVYLVCSL--IFIKTTR-KTQVI--------------TTVLLLIAYWLLM--------- 143
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
T++ Y A L P N ++DR +L H+ W+ +K
Sbjct: 144 -TLVPVPGVGY-----------ANLEPTTNLGAWVDRGLLTTAHL-----WKSAKV---- 182
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
++PEG+ S++ +I + ++GV G + K ++
Sbjct: 183 ------------------WDPEGMFSTIPAIGTGLLGVLTGQWLRSEKPVAEKMAWLFLS 224
Query: 329 GFALLIFGL 337
G AL++ GL
Sbjct: 225 GNALILGGL 233
>gi|326800650|ref|YP_004318469.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326551414|gb|ADZ79799.1| hypothetical protein Sph21_3257 [Sphingobacterium sp. 21]
Length = 396
Score = 89.7 bits (221), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 91/296 (30%), Positives = 128/296 (43%), Gaps = 85/296 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
QR SLD+FRG VALMILV++ G + + HAPW+GC D V PFFLF VG A+
Sbjct: 2 NQRYYSLDVFRGATVALMILVNNPGSWSYAFSPLKHAPWHGCTPTDLVFPFFLFAVGNAM 61
Query: 89 ALALKRIPDRADAV--KKVIFRTLKLLF------WGILLQGGFSHAPDELTYGVDV---- 136
+ + R+ +A V KKV+ RT+ + W +Q +S+ Y ++
Sbjct: 62 SFVIPRLRTQAGKVFWKKVLKRTILIFLIGLLLNWYPFVQ--WSNDTLLFKYWINPIKSD 119
Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
IR+ GVLQRIAL Y S++ F K ++V SIF L +WL+ C+L+
Sbjct: 120 SGIRILGVLQRIALCYCFASILVYFF-----KTKTVVLISIFILLS-YWLI--CILL--- 168
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
DS Y G + ID +L I HMY
Sbjct: 169 -----------------GDSDPYS--LQGWFGTK------------IDVSILQIAHMYKG 197
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
EG PFEPEG+ S+ ++++ +IG G I
Sbjct: 198 ---------------EG-----------VPFEPEGIASTFTAVIQVVIGFLVGQYI 227
>gi|428300562|ref|YP_007138868.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
gi|428237106|gb|AFZ02896.1| hypothetical protein Cal6303_3980 [Calothrix sp. PCC 6303]
Length = 402
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/311 (27%), Positives = 127/311 (40%), Gaps = 94/311 (30%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG----GDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
RL SLD+FRG +A MILV+ +P + HA W+GC LAD V PFFLFIVGVA+
Sbjct: 1 MRLTSLDVFRGATIAGMILVNMVSLAEPNVYPALLHADWHGCTLADLVFPFFLFIVGVAM 60
Query: 89 ALALKRIPD-------RADAVK--------------------KVIFRTLKLLFWGILLQG 121
+ + + D DA+ K IFR +LF L
Sbjct: 61 SFSFAKYTDVIPKVEKEKDAIGALQQFLAKESSAAGGAKPPYKKIFRRGAILFALGLFLN 120
Query: 122 GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
F ++ + L Y D +R+ GVLQRIAL+YL SL+ + + K Q
Sbjct: 121 LFWNSKN-LPY-FDFSTLRIMGVLQRIALTYLFASLIVL---KLPKKAQ----------- 164
Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
W++A +LV Y L+ +P++ I + ++
Sbjct: 165 ---WIVAGVLLVGYWLLMMYVPIPEYGAGEIGTRTGNFA--------------------A 201
Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
YIDR ++ H+Y + +PEGL S++ +I+S
Sbjct: 202 YIDRFIIPKAHLYKGDGFNNFG------------------------DPEGLFSTIPAIVS 237
Query: 302 TIIGVHFGHVI 312
+ G G I
Sbjct: 238 VLGGYFSGQWI 248
>gi|336412607|ref|ZP_08592960.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
3_8_47FAA]
gi|335942653|gb|EGN04495.1| hypothetical protein HMPREF1017_00068 [Bacteroides ovatus
3_8_47FAA]
Length = 361
Score = 89.4 bits (220), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/316 (27%), Positives = 143/316 (45%), Gaps = 78/316 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+ RG+ VA MILV++ G ++ +HA W+G + AD V P F+F++G++
Sbjct: 4 NKRLLSLDVLRGITVAGMILVNNTGKCGYNFAAFAHAKWDGFSPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L + + A+ K+I R+L L+F G++++ F + D Y D+ +RL GV+QR
Sbjct: 64 YISLCKYNFQCRPAIAKIIKRSLLLIFIGLVME-WFITSIDSGNY-FDLSQLRLMGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL-LYGTYVPD 206
+ + Y + +L+ + + K RF +A +L+VY L+G
Sbjct: 122 LGICYGITALLAV---TIPHK-----RFMP---------LAIILLIVYFIFQLFGNGF-- 162
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ S D N VG +D +LG NHMY
Sbjct: 163 -------EKSVD-------------------NIVGIVDSAILGSNHMY-----------L 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
Q F +PEG+LS++ ++ +IG G +II K + R+
Sbjct: 186 QGRQFV---------------DPEGILSTIPAVSQVMIGFVCGKIIIDIKDNDRRMLNLF 230
Query: 327 TMGFALLIFGLTLHFT 342
+G LL G L +
Sbjct: 231 LIGTTLLFAGYLLSYA 246
>gi|294777712|ref|ZP_06743163.1| putative membrane protein [Bacteroides vulgatus PC510]
gi|319640295|ref|ZP_07995020.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
gi|294448780|gb|EFG17329.1| putative membrane protein [Bacteroides vulgatus PC510]
gi|317388070|gb|EFV68924.1| hypothetical protein HMPREF9011_00617 [Bacteroides sp. 3_1_40A]
Length = 372
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 131/330 (39%), Gaps = 88/330 (26%)
Query: 18 EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADF 75
E D Q+ +K +RL SLD RG+ VA MILV++AGG + + H+ WNG D
Sbjct: 5 ELDTETAQQALPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSAWNGLTPCDL 63
Query: 76 VMPFFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLF--WGILLQGGFSHA--PDE 129
V PFFLFI+G++ ++L + V K++ RT +L W I G F H D
Sbjct: 64 VFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGDF 120
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
L + +R+ GVLQRIAL Y ++S +F
Sbjct: 121 LPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------- 151
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
++P F ++ + + C N + IDR++ G
Sbjct: 152 ------------KFIPTLTFILLVSYTV-------ILCMGNGYTCDESNILSIIDRQLFG 192
Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
H+Y +P +PEG +S++S+I T IG G
Sbjct: 193 EAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSCG 224
Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
II + ++ + GF L+ G L
Sbjct: 225 KWIIQSHQTENKVLRLFLTGFILMSIGYLL 254
>gi|195565141|ref|XP_002106164.1| GD16714 [Drosophila simulans]
gi|194203536|gb|EDX17112.1| GD16714 [Drosophila simulans]
Length = 318
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 50/127 (39%), Positives = 74/127 (58%), Gaps = 13/127 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+ +RL SLD FRGL++ LMI V+ GG + I HA WNG +LAD V P FL+I+GV I L
Sbjct: 182 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADIVFPSFLWIMGVCIPL 241
Query: 91 ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
++K R +A +++ R++KL G+ L G ++ +R+ GVLQ
Sbjct: 242 SVKSQLSRGSSKARICLRILVRSIKLFVIGLCLNS---------MSGPNLEQLRVMGVLQ 292
Query: 147 RIALSYL 153
R ++YL
Sbjct: 293 RFGVAYL 299
>gi|393725858|ref|ZP_10345785.1| hypothetical protein SPAM2_19574 [Sphingomonas sp. PAMC 26605]
Length = 400
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/291 (28%), Positives = 136/291 (46%), Gaps = 69/291 (23%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGV 86
+ RL +LD+ RGLAVA MILV G DW ++ HA W+G LAD V P FLF VG+
Sbjct: 4 RLPRLEALDVLRGLAVAGMILVVSPG-DWSMAYAQLQHAAWHGATLADMVFPTFLFSVGM 62
Query: 87 AIALALKRIPDRADAVKKVIF------RTLKLLFWGILLQGGF-----SHAPDELTYGVD 135
A+ L+ R+ AD ++ +F R++ L+ G++++ + + AP G+
Sbjct: 63 ALGLSFPRL--MADTAQRRLFWMRLIRRSITLVVLGLVVEATYVWTISAGAPYPGHGGLS 120
Query: 136 VRMIRLCGVLQRIALSYLL-VSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
+R+ G+LQRI L YLL +L+ + ++ + D ++ + L+C A +L+
Sbjct: 121 --YVRIPGILQRIGLCYLLGGALIVVTSRTIADGRIAIAPQRV--LFC-----IAAILIG 171
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
Y ALL VP + ++ D + ++DR + + H+
Sbjct: 172 YWALLRFVPVPGFGVGLLTPDG---------------------SLPAFVDRTLFTVPHL- 209
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
W A Q GP A ++PEGLLS++ + + + G
Sbjct: 210 ----WPLGSATGQ-----GP----------ATYDPEGLLSTLPATANLLFG 241
>gi|156401292|ref|XP_001639225.1| predicted protein [Nematostella vectensis]
gi|156226352|gb|EDO47162.1| predicted protein [Nematostella vectensis]
Length = 376
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 82/306 (26%), Positives = 137/306 (44%), Gaps = 59/306 (19%)
Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR 108
MI V+ GG + HA WNG +AD V P+F++I+GV+I L+ K + R K+ +
Sbjct: 1 MIFVNFGGGGYYFFGHAAWNGLLVADLVFPWFIWIMGVSITLSFKSLKRRKVKKWKICLK 60
Query: 109 TLK--LLFWGI-LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDV 165
++ L+ +G+ L F+ D+ R+ GVLQR A Y++++L+++F
Sbjct: 61 VIRRSLILFGLGLFTSNFN----------DLETYRIPGVLQRFAACYIVIALMQLFLGPS 110
Query: 166 QDKDQSV---------GRFSIFRLYCWHWLMAACVLVVYLALLYGTYV---PDWQFTIIN 213
+++ Q + SI++ WL +L +Y+ + Y + P +T
Sbjct: 111 EEQTQVLYPKWWDPIRDVVSIWK----QWLAMLLLLAIYVTVTYAVKLDGCPR-GYTGPG 165
Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
Y + FN T GV YIDRK G H+Y P T ++
Sbjct: 166 GIGRGYPEAFNCTGGV----------ANYIDRKFFG-KHIYQWP--------TVKQLYKT 206
Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
L P EPEG L +++SI +GV G ++ + R+ +W+ G L
Sbjct: 207 KL----------PHEPEGFLGTLTSIFLVFLGVQAGRILHTYRKSTERITRWLAWGVFLG 256
Query: 334 IFGLTL 339
+ G+ L
Sbjct: 257 LIGVGL 262
>gi|345517324|ref|ZP_08796801.1| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
gi|345457717|gb|EET14395.2| hypothetical protein BSFG_00542 [Bacteroides sp. 4_3_47FAA]
Length = 365
Score = 89.0 bits (219), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 131/330 (39%), Gaps = 88/330 (26%)
Query: 18 EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADF 75
E D Q+ +K +RL SLD RG+ VA MILV++AGG + + H+ WNG D
Sbjct: 5 ELDTETAQQALPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSAWNGLTPCDL 63
Query: 76 VMPFFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLF--WGILLQGGFSHA--PDE 129
V PFFLFI+G++ ++L + V K++ RT +L W I G F H D
Sbjct: 64 VFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGDF 120
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAA 189
L + +R+ GVLQRIAL Y ++S +F
Sbjct: 121 LPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------- 151
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLG 249
++P F ++ + + C N + IDR++ G
Sbjct: 152 ------------KFIPTLTFILLVSYTV-------ILCMGNGYTCDESNILSIIDRQLFG 192
Query: 250 INHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
H+Y +P +PEG +S++S+I T IG G
Sbjct: 193 EAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSCG 224
Query: 310 HVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
II + ++ + GF L+ G L
Sbjct: 225 KWIIQSHQTENKVLRLFLTGFILMSIGYLL 254
>gi|449664780|ref|XP_002169793.2| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Hydra magnipapillata]
Length = 369
Score = 89.0 bits (219), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/296 (28%), Positives = 138/296 (46%), Gaps = 51/296 (17%)
Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRADAVKK 104
MI V++ GG + SH+ WNG +AD + P+F+FI+G +I +++ K++ R V K
Sbjct: 1 MIFVNYGGGGYYFFSHSSWNGLTVADLLFPWFIFIMGSSIYISMHSLRKKLSKRKMTV-K 59
Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
+I+R+ KL L G D+ RL GVLQR A+SY +V+LV ++
Sbjct: 60 IIYRSFKL-----------LLLGLFLNNGFDLANWRLPGVLQRFAISYFVVALVFLWFDS 108
Query: 165 VQDKDQSVGRFSIFR--LYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGK 221
++ ++ ++FR + + ++ +L +YL ++Y VP D+G
Sbjct: 109 PNEESETNSWKNMFRDVWFPFQHIVMLLLLTIYLLIIYLLNVPGCPKGYFGPGGDGDHGA 168
Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
T G A GY+DR V G+NH+Y +P + C
Sbjct: 169 YEKCTGG----------ASGYVDRTVFGLNHIYKNPTCKSLYNCFT-------------- 204
Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK---GHLARLKQWVTMGFALLI 334
++PEGLL ++ SIL T +G+ ++ K GH+ R W + AL +
Sbjct: 205 -----YDPEGLLGTIPSILLTYLGLQAARTLLFYKSKNGHIIRWFIWSVLLGALAV 255
>gi|345513910|ref|ZP_08793425.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
gi|229435722|gb|EEO45799.1| hypothetical protein BSEG_01940 [Bacteroides dorei 5_1_36/D4]
Length = 372
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 132/331 (39%), Gaps = 88/331 (26%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
E + Q+ +K +RL SLD RG+ VA MILV++AGG + + H+ WNG D
Sbjct: 4 EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62
Query: 75 FVMPFFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
V PFFLFI+G++ ++L + V K++ RT +L W I G F H D
Sbjct: 63 LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
L + +R+ GVLQRIAL Y ++S +F
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
++P F ++ + + C N + IDR++
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G H+Y +P +PEG +S++S+I T IG +
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSY 223
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
G II + ++ + GF L+ G L
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL 254
>gi|374309722|ref|YP_005056152.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358751732|gb|AEU35122.1| hypothetical protein AciX8_0773 [Granulicella mallensis MP5ACTX8]
Length = 377
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 134/304 (44%), Gaps = 67/304 (22%)
Query: 46 VALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDR 98
+A MILV G +P++ HA WNG D + P FL I+GVA+ + ++R DR
Sbjct: 3 IAGMILVTDPGTYSAVYPQLMHAQWNGATATDMIFPSFLVIIGVAMTFSFASRIERGADR 62
Query: 99 ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
+ V+ R++ L+F G+L+ G P+ ++ IR+ G+LQRIAL Y SL+
Sbjct: 63 RQILWHVLTRSVLLIFLGLLVNG----FPEY-----NLHTIRIPGILQRIALCYFAGSLL 113
Query: 159 EIFT---KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKD 215
+ KD + Q + R ++ + A +LV+Y LL G VP +
Sbjct: 114 YLAVSGKKDANTESQRLRRGTVIG------AVLAGLLVLYWVLLKGYPVPGFG------- 160
Query: 216 SADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPL 275
+L+ N Y DRK+ G+ H++ + +P G
Sbjct: 161 --------------SGRLDSLGNVAAYFDRKIFGVQHLWAYGL----------TPGYG-- 194
Query: 276 RKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIF 335
F+PEGLLS++ ++ + + GV G + + + G AL++
Sbjct: 195 ---------VTFDPEGLLSTLPALATLLFGVLAGEWLRTNQARGRKALVLAVAGVALVLV 245
Query: 336 GLTL 339
GL L
Sbjct: 246 GLAL 249
>gi|298479647|ref|ZP_06997847.1| membrane protein [Bacteroides sp. D22]
gi|336403243|ref|ZP_08583960.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
gi|295084924|emb|CBK66447.1| Uncharacterized conserved protein [Bacteroides xylanisolvens XB1A]
gi|298274037|gb|EFI15598.1| membrane protein [Bacteroides sp. D22]
gi|335946636|gb|EGN08437.1| hypothetical protein HMPREF0127_01273 [Bacteroides sp. 1_1_30]
Length = 371
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 88/319 (27%), Positives = 135/319 (42%), Gaps = 76/319 (23%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+K++RL SLD+ RG+ + MILV++ G + + HA WNG D V PFF+FI+GV
Sbjct: 1 MKSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60
Query: 87 AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRLC 142
+++ AL R + K++ RT+ L G+ L FS GV+ IR+
Sbjct: 61 SMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-WFSLV----CAGVEQPFSQIRIL 115
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQR+AL+Y SL+ + + + W ++A +L+ Y+ LL
Sbjct: 116 GVLQRLALAYFFGSLLIMSVRRPAN-------------LAW---ISAIILIGYIVLL--- 156
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
G F ++ N + DR + G H+Y
Sbjct: 157 ---------------ALGNGFELS---------EQNIIAVTDRTLFGETHLY-------- 184
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
R+ P F+PEGLLS++ I IIG G+++ RL
Sbjct: 185 -------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRL 231
Query: 323 KQWVTMGFALLIFGLTLHF 341
Q +G LL G L +
Sbjct: 232 LQISILGIVLLFAGWLLSY 250
>gi|393783262|ref|ZP_10371437.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
CL02T12C01]
gi|392669541|gb|EIY63029.1| hypothetical protein HMPREF1071_02305 [Bacteroides salyersiae
CL02T12C01]
Length = 365
Score = 88.6 bits (218), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 83/314 (26%), Positives = 138/314 (43%), Gaps = 76/314 (24%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
QRL SLD+ RG+ VA MILV++AG + + HA W+G AD V P F+F++G++
Sbjct: 9 QRLLSLDVLRGITVAGMILVNNAGACGYGYAPLRHAKWDGFTPADLVFPMFMFLMGISTY 68
Query: 90 LALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
++L++ + + K+I R L+ GI ++ H+ + + D +R+ GV+QR+
Sbjct: 69 ISLRKYNFQWQLTIGKIIKRAFLLILIGIAMK-WLIHSFETGIWN-DWEHMRILGVMQRL 126
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
+ Y + +++ +F RF L + L LL G +
Sbjct: 127 GICYGITAVMALFIPH--------KRF----------------LPIALLLLIGYF----- 157
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
I+ + K P N + +D VLG +HMY Q
Sbjct: 158 --ILQLAGNGFEK-------------SPDNIMAIVDSTVLGTSHMY-----------LQG 191
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM 328
F EPEG+LS++ ++ +IG GH++I+ K + R++Q M
Sbjct: 192 RQF---------------VEPEGILSTIPAVAQVMIGFVCGHMLINRKDNQERMQQLFFM 236
Query: 329 GFALLIFGLTLHFT 342
G LL G L +
Sbjct: 237 GTLLLFAGFLLSYA 250
>gi|237711644|ref|ZP_04542125.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|423230937|ref|ZP_17217341.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
CL02T00C15]
gi|423244648|ref|ZP_17225723.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
CL02T12C06]
gi|229454339|gb|EEO60060.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|392630057|gb|EIY24059.1| hypothetical protein HMPREF1063_03161 [Bacteroides dorei
CL02T00C15]
gi|392641497|gb|EIY35273.1| hypothetical protein HMPREF1064_01929 [Bacteroides dorei
CL02T12C06]
Length = 372
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 132/331 (39%), Gaps = 88/331 (26%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
E + Q+ +K +RL SLD RG+ VA MILV++AGG + + H+ WNG D
Sbjct: 4 EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62
Query: 75 FVMPFFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
V PFFLFI+G++ ++L + V K++ RT +L W I G F H D
Sbjct: 63 LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
L + +R+ GVLQRIAL Y ++S +F
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
++P F ++ + + C N + IDR++
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G H+Y +P +PEG +S++S+I T IG +
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSY 223
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
G II + ++ + GF L+ G L
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL 254
>gi|262407085|ref|ZP_06083634.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|294648023|ref|ZP_06725570.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
gi|294809833|ref|ZP_06768513.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
gi|345512215|ref|ZP_08791750.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
gi|229445856|gb|EEO51647.1| hypothetical protein BSAG_03359 [Bacteroides sp. D1]
gi|262355788|gb|EEZ04879.1| conserved hypothetical protein [Bacteroides sp. 2_1_22]
gi|292636642|gb|EFF55113.1| putative membrane protein [Bacteroides ovatus SD CC 2a]
gi|294442971|gb|EFG11758.1| putative membrane protein [Bacteroides xylanisolvens SD CC 1b]
Length = 371
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 88/319 (27%), Positives = 135/319 (42%), Gaps = 76/319 (23%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+K++RL SLD+ RG+ + MILV++ G + + HA WNG D V PFF+FI+GV
Sbjct: 1 MKSERLLSLDVLRGITIVGMILVNNPGTWESVYAPLRHAEWNGLTPTDLVFPFFMFIMGV 60
Query: 87 AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRLC 142
+++ AL R + K++ RT+ L G+ L FS GV+ IR+
Sbjct: 61 SMSFALSRFDHHFSRSFITKLVRRTVILFLLGLFLS-WFSLV----CAGVEQPFSQIRIL 115
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQR+AL+Y SL+ + + + W ++A +L+ Y+ LL
Sbjct: 116 GVLQRLALAYFFGSLLIMSVRRPAN-------------LAW---ISAIILIGYIVLL--- 156
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
G F ++ N + DR + G H+Y
Sbjct: 157 ---------------ALGNGFELS---------EQNIIAVTDRTLFGETHLY-------- 184
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
R+ P F+PEGLLS++ I IIG G+++ RL
Sbjct: 185 -------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHRL 231
Query: 323 KQWVTMGFALLIFGLTLHF 341
Q +G LL G L +
Sbjct: 232 LQISILGIVLLFAGWLLSY 250
>gi|116621919|ref|YP_824075.1| hypothetical protein Acid_2804 [Candidatus Solibacter usitatus
Ellin6076]
gi|116225081|gb|ABJ83790.1| conserved hypothetical protein [Candidatus Solibacter usitatus
Ellin6076]
Length = 367
Score = 88.6 bits (218), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 99/348 (28%), Positives = 151/348 (43%), Gaps = 90/348 (25%)
Query: 34 RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
RL SLD FRG +ALM+LV++AG + ++ H+PW+G + D V P FL+IVGVAI L
Sbjct: 12 RLVSLDAFRGATIALMVLVNNAGSGLDSYRQLEHSPWHGWTITDTVFPSFLWIVGVAITL 71
Query: 91 AL-KRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
+L KR+ + R+ + +++ R L +G+ + F H D+ R+ GVLQ
Sbjct: 72 SLGKRVAEGVPRSHLLPQILRRAAILFVFGLFVY-AFPH--------FDLGTQRILGVLQ 122
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
RIA+ YL S++ +++ V+ + I L +W+M + V
Sbjct: 123 RIAICYLAASVIFLYS-GVRGQI-----LWILGLLAAYWMMMTLIPV------------- 163
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
YG +L+ N YID LG H YH
Sbjct: 164 ----------PGYGP---------GRLDVEGNFAHYIDHLALG-RHNYH----------- 192
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
+W +PEGL+S++ +I + + GV GH I+ + LA W+
Sbjct: 193 -----------STRTW-----DPEGLVSTLPAIATALFGVLAGH-ILRCRRTLAERTSWM 235
Query: 327 -TMGFALLIFGLTLHFTNGEHGSGKFSTTCVCLFI----YSKVILFQW 369
T G LL GL T + K T CLF+ ++ F W
Sbjct: 236 FTAGSLLLAAGLIC--TAWLPINKKLWTDSFCLFMAGLDFTVFAFFAW 281
>gi|284041413|ref|YP_003391343.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
DSM 74]
gi|283820706|gb|ADB42544.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
DSM 74]
Length = 364
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 82/308 (26%), Positives = 132/308 (42%), Gaps = 88/308 (28%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+ RL SLD RG +A MI+ + G + + + H+ WNG + D + PFFLFIVGV+
Sbjct: 3 QPHRLISLDAMRGFTIAAMIVANFPGSEEFVYFTLRHSRWNGLSFTDLIAPFFLFIVGVS 62
Query: 88 IALALKRIPDRADA------VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
I LA R RAD ++K++ R+LK+ G+ L + PD D +R
Sbjct: 63 IVLAYAR--KRADGSPKGPLIQKIVLRSLKIFAVGMFL----NLLPD-----FDFATLRW 111
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
G L RIA+ +L+ +L+ + T Q W+ +L +LA+
Sbjct: 112 TGTLHRIAIVFLVCALLFLTTSWRQQA----------------WIATLTLLAYWLAM--- 152
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
T +P + D G+V L P N ++DR+ L
Sbjct: 153 TQIP----------TPDVGRVV---------LEPGQNLAAWLDRRYL------------- 180
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
P R +W +PEG+LS+ SI++ I+G+ G +++ A+
Sbjct: 181 ------------PGRMWQGTW-----DPEGILSTFPSIVTGILGMLAGRLMVSPASQTAK 223
Query: 322 LKQWVTMG 329
+ +T G
Sbjct: 224 VSYLMTAG 231
>gi|294674520|ref|YP_003575136.1| hypothetical protein PRU_1851 [Prevotella ruminicola 23]
gi|294472648|gb|ADE82037.1| putative membrane protein [Prevotella ruminicola 23]
Length = 357
Score = 88.2 bits (217), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 78/288 (27%), Positives = 126/288 (43%), Gaps = 81/288 (28%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPE-ISHAPWNGCNLADFVMPFFLFIVGVAI 88
++++RL SLDI RG+ VA MILV++ G+ E + H+ WNG D V PFFLFI+G++
Sbjct: 1 MESKRLLSLDILRGITVAGMILVNNGWGESFEMLRHSKWNGMTPCDLVFPFFLFIMGISC 60
Query: 89 ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHA--PDELTYGVDVRMIRLCGV 144
L+L + +++++ RT+ L G+ + F HA D L +G +R+ V
Sbjct: 61 YLSLVKSEFKPTPQVIRRIVKRTVLLFAIGLFIN-WFDHAIEGDLLCFG----HLRIWAV 115
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
+QRIAL Y +VSL F L+C H
Sbjct: 116 MQRIALCYGIVSL--------------------FALFCNH-------------------- 135
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ ++I A Y + + G N N + D K+ G +H+YH
Sbjct: 136 -KYTLSVIGGLLAIYTAILILGNGYAEDAN--VNVLAQADLKLFGYDHIYH--------- 183
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+P +PEGL+ ++SS+ ++G + G +I
Sbjct: 184 -------------------KSPVDPEGLMGTISSVAHVLLGFYCGMLI 212
>gi|390956359|ref|YP_006420116.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
gi|390411277|gb|AFL86781.1| hypothetical protein Terro_0436 [Terriglobus roseus DSM 18391]
Length = 404
Score = 88.2 bits (217), Expect = 6e-15, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 136/330 (41%), Gaps = 82/330 (24%)
Query: 18 EPDVSDQQE----KSHLKTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCN 71
+P+++ + ++ KT+RL S+D+ RGL +A MILV++ G + E+ HA WNG
Sbjct: 10 DPEIASTVDSDVVRTAPKTERLLSVDVLRGLTIAFMILVNNQPGPGAFFELQHAQWNGFT 69
Query: 72 LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQG-GFSHA 126
L D V P FLF+VG+++ L+ + + K + TL+ L +GI++ F H
Sbjct: 70 LTDLVFPTFLFLVGLSLVLSTAARLAKGASRKTLFLHTLRRSAVLALFGIVVNTFPFQH- 128
Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
+ IR GVLQR AL YL+VS + + K +DK
Sbjct: 129 ---------LDRIRFYGVLQRTALCYLVVSGLCLLRKGWKDKAA---------------- 163
Query: 187 MAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
+A LVVY L+ VP F +P N ++DR
Sbjct: 164 IAVACLVVYWVLMRFVPVPG----------------FGTPTHEIPINDPNGNLTAWLDRL 207
Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
+ H+Y +PEGLLS++ +I + + GV
Sbjct: 208 IFAPQHLYQQVR-----------------------------DPEGLLSTLPAISTALYGV 238
Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFG 336
G + T+ A+ G A+ + G
Sbjct: 239 LAGTWLRTTRSTTAKAVGLALGGVAMTVAG 268
>gi|299140549|ref|ZP_07033687.1| membrane protein [Prevotella oris C735]
gi|298577515|gb|EFI49383.1| membrane protein [Prevotella oris C735]
Length = 359
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 136/317 (42%), Gaps = 80/317 (25%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
++ +RL SLD+ RG V LMILV++ G + + H+ WNG D V PFFLFI+G++
Sbjct: 1 MEKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIS 60
Query: 88 IALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
L+L++ R A K++ RT+ L F G+ + F A +D+ +R+ V
Sbjct: 61 TYLSLEKTNFTWSRQVAF-KIVKRTVLLFFIGLFIN-WFDMAISG--NALDLSHLRIWAV 116
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
+QRIA+ Y V SIF L C H ++++ A
Sbjct: 117 MQRIAICYFAV--------------------SIFALCCNHRHTIPAIVILLAA------- 149
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ +I + Y + N + ID ++ GI H+YH
Sbjct: 150 --YSLLLIWGNGYAY--------------DSQQNILAQIDIRLFGIEHLYH--------- 184
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
++P +PEG SS+S+I T+IG + G + K ++ +
Sbjct: 185 -------------------NSPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKVLR 225
Query: 325 WVTMGFALLIFGLTLHF 341
++ G L+I G + F
Sbjct: 226 FLITGGFLVIIGYIVSF 242
>gi|325286182|ref|YP_004261972.1| heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
lytica DSM 7489]
gi|324321636|gb|ADY29101.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
lytica DSM 7489]
Length = 361
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 76/274 (27%), Positives = 119/274 (43%), Gaps = 77/274 (28%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
R+ S+DIFRG+ + LMILV++ G W + HA W+G D V PFFLFIVG++I
Sbjct: 3 NRVVSVDIFRGITIVLMILVNNPG-TWSSVYAPFLHADWHGYTPTDLVFPFFLFIVGISI 61
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
A +K++ R+LKL+ G+ L G F+ + + D IR GVLQRI
Sbjct: 62 VYAYHTKEVTGKTYRKIVIRSLKLIGLGLFL-GAFTLS---FPFFKDFNDIRFPGVLQRI 117
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
L + ++ +F W L+A C ++ + L+ +VP
Sbjct: 118 GLVFFFTAI-------------------LFIKLNWKALVAVCAAILIMYWLWMGFVP--- 155
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
IN + + + P N +ID KVLG +HM W+
Sbjct: 156 ---INGTAPTFDR-------------APNNWANFIDLKVLG-SHM-----WKTD------ 187
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
++PEG+LS++ +I ++
Sbjct: 188 ------------------YDPEGVLSTLPAIATS 203
>gi|410664067|ref|YP_006916438.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
21679]
gi|409026424|gb|AFU98708.1| hypothetical protein M5M_07585 [Simiduia agarivorans SA1 = DSM
21679]
Length = 390
Score = 87.8 bits (216), Expect = 8e-15, Method: Compositional matrix adjust.
Identities = 93/318 (29%), Positives = 145/318 (45%), Gaps = 75/318 (23%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL S+D+ RGLA+A M+LV++ G W + +HA W+G D + P FL++VG++I
Sbjct: 15 RLMSVDVLRGLAIAAMVLVNNPG-SWSHVYAPLAHAEWHGWTPTDVIFPLFLYVVGLSIV 73
Query: 90 LALK----RIPDRAD---AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
LA K +P R+ A K LF+ + FS D+L +DVR++
Sbjct: 74 LAQKGETFALPGRSTWLRAAKLFGLGLFLALFYFPFAKPEFSWWRDQL---LDVRIL--- 127
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQRIAL YL + + Q WL+ L V++ L Y
Sbjct: 128 GVLQRIALVYLACCYLAWLCQKRQ------------------WLLWLATL-VFMWLAYAL 168
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
+I D D G+++ R +L + ++D+ +LG H+Y+ A
Sbjct: 169 -----MLSIPYAD--DTGEIY------RGQLVFGNHFSAWLDQLLLGREHLYYQTA---- 211
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
PF F+PEGLL+++ +I S ++GV G + + GH +RL
Sbjct: 212 ------QPFA--------------FDPEGLLTTLPAISSGLLGVLAG-LQLKAAGHSSRL 250
Query: 323 KQWVTMGFALLIFGLTLH 340
+ W G +L+ G LH
Sbjct: 251 EIWFAGGVLMLVAGQLLH 268
>gi|399028182|ref|ZP_10729485.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
gi|398074259|gb|EJL65410.1| hypothetical protein PMI10_01306 [Flavobacterium sp. CF136]
Length = 423
Score = 87.8 bits (216), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 84/305 (27%), Positives = 137/305 (44%), Gaps = 68/305 (22%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+FRGL + LM +V++ G DW P + HA WNGC D V PFF+FI+GVA+
Sbjct: 4 ERLISLDVFRGLTILLMTIVNNPG-DWGHVFPPLLHAKWNGCTPTDLVFPFFIFIMGVAV 62
Query: 89 ALALKRIPDRA---DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
LA+ PD+ K++ R+L++L GI F+ +G++ + + ++
Sbjct: 63 PLAM---PDKIYDDTTFNKILVRSLRMLCLGIF----FNFFEKIQLFGLEGIPLLIGRLI 115
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW---------------HWLMAAC 190
IA+ Y+L+ + K +++ FSI +Y + L
Sbjct: 116 ITIAVGYVLMG-------NFSSKLKNIFAFSILIIYLFLAYSEIEAYQDVRLPGVLQRIA 168
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNV--TCGV-RAKLNPPCNAVGYIDRKV 247
V+ ++LLY Q Y V N+ G+ A L N ++D +
Sbjct: 169 VVYFVVSLLYLKTSQKTQIITGVFLLLGYWAVMNLIPVPGIGEANLEKGTNLASWLDSIL 228
Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
L HMYH + +W +PEG+LS++ SI++ IIG+
Sbjct: 229 LK-GHMYH----------------------ETKTW-----DPEGILSTIPSIVNGIIGLL 260
Query: 308 FGHVI 312
G ++
Sbjct: 261 IGQLL 265
>gi|431796483|ref|YP_007223387.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
gi|430787248|gb|AGA77377.1| hypothetical protein Echvi_1106 [Echinicola vietnamensis DSM 17526]
Length = 369
Score = 87.0 bits (214), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 88/316 (27%), Positives = 137/316 (43%), Gaps = 84/316 (26%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
+ QR+ +LD+FRG+ + MILV++ G W + HA W+GC D + PFFLFIVG
Sbjct: 1 MPKQRILALDVFRGITIFAMILVNNPG-SWSHVYAPLLHAKWHGCTPTDLIFPFFLFIVG 59
Query: 86 VAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
VAI L+ LK+ + ++K + R LKL+ + Y D+ +R
Sbjct: 60 VAIELSLGGQLKKGTPKGFLLRKSLIRALKLIG--------LGLLLTAIPY-FDLAHLRF 110
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQRI L Y + +++ ++ FS L +WL C+
Sbjct: 111 PGVLQRIGLVYFISTVMYLYW------SPKALVFSSGILLIGYWL---CM---------- 151
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
T++P V A L P N +ID++VL HM W +
Sbjct: 152 TFIP-------------------VPGIGPANLEPGTNLAAWIDQQVL-TGHM-----WSQ 186
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
+K ++PEGL S++ +I++ ++GV G ++ H AR
Sbjct: 187 TKT----------------------WDPEGLFSTLPAIVTCLLGVACGKILTGNSSHKAR 224
Query: 322 LKQWVTMGFALLIFGL 337
L +W G L+ GL
Sbjct: 225 LTKWGIAGVTLVFGGL 240
>gi|375146803|ref|YP_005009244.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361060849|gb|AEV99840.1| Protein of unknown function DUF2261, transmembrane [Niastella
koreensis GR20-10]
Length = 376
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 90/329 (27%), Positives = 137/329 (41%), Gaps = 94/329 (28%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHA---GGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+ +R +LD+FRG+ + MI+V+ + + + HA W+G D V P FLF VG A
Sbjct: 3 QQKRFLALDVFRGMTICFMIIVNTSPDGSHTYAPLLHAQWHGFTPTDLVFPSFLFAVGNA 62
Query: 88 IALALKRIPDRADA--VKKVIFRTLKLLFWGILL-----------QGGFSHAPDELTYGV 134
++ + R + + + K++ RTL + G L+ G ++ P E T
Sbjct: 63 MSFVMPRWENASTGFVLGKILKRTLLIFILGYLMYWFPFVRMDKVTGVYAFYPFEKT--- 119
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
R+ GVLQRIAL+Y SL+ F K FR ++ A +L++
Sbjct: 120 -----RVFGVLQRIALAYCFASLMLYFLK--------------FRATI---IITAAILLL 157
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
Y +LY DSAD L+ NAV +D +LG +H+Y
Sbjct: 158 YWPVLY-----------FFGDSAD-------------PLSLAGNAVLKLDLWLLGPDHLY 193
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
H EG PF+PEG LS+ +I + + G G +
Sbjct: 194 HG---------------EG-----------VPFDPEGFLSTFPAIANVVGGYWVGRFLQQ 227
Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFTN 343
G L + + GFALL+ HF N
Sbjct: 228 KGGTYEALTKLMLAGFALLVLA---HFWN 253
>gi|423241434|ref|ZP_17222547.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
CL03T12C01]
gi|392641327|gb|EIY35104.1| hypothetical protein HMPREF1065_03170 [Bacteroides dorei
CL03T12C01]
Length = 372
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 131/331 (39%), Gaps = 88/331 (26%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
E + Q+ +K +RL SLD RG+ VA MILV++AGG + + H+ WNG D
Sbjct: 4 EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62
Query: 75 FVMPFFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
V PFFLFI+G++ ++L + V K++ RT +L W I G F H D
Sbjct: 63 LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
L + +R+ GVLQRIAL Y ++S +F
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
++P F ++ + + C N + IDR++
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G H+Y +P +PEG +S++S+I T IG
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
G II + ++ + GF L+ G L
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL 254
>gi|281423205|ref|ZP_06254118.1| putative membrane protein [Prevotella oris F0302]
gi|281402541|gb|EFB33372.1| putative membrane protein [Prevotella oris F0302]
Length = 359
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/317 (26%), Positives = 134/317 (42%), Gaps = 80/317 (25%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
++ +RL SLD+ RG V LMILV++ G + + H+ WNG D V PFFLFI+G++
Sbjct: 1 MEKKRLLSLDVLRGATVCLMILVNNGAGKHIYATLQHSKWNGMTPCDLVFPFFLFIMGIS 60
Query: 88 IALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
L+LK+ R A K++ RT+ L G+ + F A +D +R+ V
Sbjct: 61 TYLSLKKTNFTWSRQVAF-KIVKRTVLLFLIGLFIN-WFDMAISG--NALDFSHLRIWAV 116
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
+QRIA+ Y V SIF L C H ++++ A
Sbjct: 117 MQRIAICYFAV--------------------SIFALCCNHRHTIPAIVILLAA------- 149
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ +I + Y + N + ID ++ GI H+YH
Sbjct: 150 --YNLLLIWGNGYAY--------------DSQQNILAQIDIRLFGIEHLYH--------- 184
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
++P +PEG SS+S+I T+IG + G + K ++ +
Sbjct: 185 -------------------NSPVDPEGTGSSLSAIAHTLIGFYCGKRMSDAKSTEEKVLR 225
Query: 325 WVTMGFALLIFGLTLHF 341
++ G L+I G + F
Sbjct: 226 FLITGGFLVIIGYIVSF 242
>gi|158337501|ref|YP_001518676.1| hypothetical protein AM1_4380 [Acaryochloris marina MBIC11017]
gi|158307742|gb|ABW29359.1| conserved hypothetical protein [Acaryochloris marina MBIC11017]
Length = 383
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 78/295 (26%), Positives = 137/295 (46%), Gaps = 82/295 (27%)
Query: 38 LDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKR 94
LD+FRG+A+A M+LV+ +G +P++ HA W+G LAD V PFFLF++G ++A ++ R
Sbjct: 13 LDVFRGIAIAGMLLVNKSGLVKDAYPQLQHADWHGWTLADLVFPFFLFVLGASMAFSMAR 72
Query: 95 -----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
+ K++ R++ L G+ L G +S+ ++ +R+ G+LQRI+
Sbjct: 73 HTASLTQPKRRVYLKILRRSVVLFGLGLFLNGFWSY---------NLSTLRVMGILQRIS 123
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY-LALLYGTYVPDWQ 208
L+YL+ +LV + + K Q W M +LV Y LAL +++P +
Sbjct: 124 LTYLVSALVIL---KLPRKSQ--------------WGMTGLLLVGYWLAL---SFIPVPE 163
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
F N L N Y+DR ++G +H+Y +
Sbjct: 164 FGAGN-------------------LTRTGNFGAYVDRLIIGSSHLYVGDQF--------- 195
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
++ +PEGL S++ +I + ++G +F I +G ++K
Sbjct: 196 ---------------NSMGDPEGLFSTLPAIATVLLG-YFAGDWIRKRGSGLKIK 234
>gi|265753064|ref|ZP_06088633.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|263236250|gb|EEZ21745.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
Length = 372
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 131/331 (39%), Gaps = 88/331 (26%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
E + Q+ +K +RL SLD RG+ VA MILV++AGG + + H+ WNG D
Sbjct: 4 EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62
Query: 75 FVMPFFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
V PFFLFI+G++ ++L + V K++ RT +L W I G F H D
Sbjct: 63 LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
L + +R+ GVLQRIAL Y ++S +F
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
++P F ++ + + C N + IDR++
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G H+Y +P +PEG +S++S+I T IG
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
G II + ++ + GF L+ G L
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL 254
>gi|212695334|ref|ZP_03303462.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
17855]
gi|212662113|gb|EEB22687.1| hypothetical protein BACDOR_04881, partial [Bacteroides dorei DSM
17855]
Length = 284
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 85/331 (25%), Positives = 131/331 (39%), Gaps = 88/331 (26%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
E + Q+ +K +RL SLD RG+ VA MILV++AGG + + H+ WNG D
Sbjct: 4 EELNTETAQQAPPIK-KRLLSLDALRGITVAGMILVNNAGGKVSYAPLQHSVWNGLTPCD 62
Query: 75 FVMPFFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLF--WGILLQGGFSHA--PD 128
V PFFLFI+G++ ++L + V K++ RT +L W I G F H D
Sbjct: 63 LVFPFFLFIMGISTYISLNKFNFNVSLQVVTKILKRTFLILCIGWAI---GWFDHVCEGD 119
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
L + +R+ GVLQRIAL Y ++S +F
Sbjct: 120 FLPF----VHLRIPGVLQRIALCYCVISFTALFMNH------------------------ 151
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
++P F ++ + + C N + IDR++
Sbjct: 152 -------------KFIPALTFILLVSYTV-------ILCMGNGYACDESNILSIIDRQLF 191
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G H+Y +P +PEG +S++S+I T IG
Sbjct: 192 GEAHLYQ----------------------------KSPIDPEGFVSTLSAIAHTCIGFSC 223
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
G II + ++ + GF L+ G L
Sbjct: 224 GKWIIQSHQTENKVLRLFLTGFILISIGYLL 254
>gi|410631381|ref|ZP_11342056.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
BSs20135]
gi|410148827|dbj|GAC18923.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
BSs20135]
Length = 330
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 54/139 (38%), Positives = 79/139 (56%), Gaps = 11/139 (7%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
RL +LD+FRG+ + MILV++ G W I HA W+G L D + PFF+FIVGV+I
Sbjct: 21 NRLLALDVFRGITITAMILVNNPG-SWQHIYGPMRHAQWHGWTLTDLIFPFFIFIVGVSI 79
Query: 89 ALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM--IRLC 142
L+ +R+ R+ +K+ + RT KL+ G L + E V+ R+ IR
Sbjct: 80 QLSGQRMLASGTSRSSIIKQALLRTFKLVLLGWFLALFYYDFGAEHYNWVEQRLLNIRFM 139
Query: 143 GVLQRIALSYLLVSLVEIF 161
GVLQRIA+ Y + L+ +F
Sbjct: 140 GVLQRIAVVYFICVLMWLF 158
>gi|343082900|ref|YP_004772195.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342351434|gb|AEL23964.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 365
Score = 86.3 bits (212), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 52/133 (39%), Positives = 75/133 (56%), Gaps = 16/133 (12%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD RG +A MILV++ G +P + HA W+G + DF+ PFF+F+VGV++A
Sbjct: 6 TRLISLDALRGFTIAAMILVNYPGSWSHVYPPLLHAEWHGMTMTDFIFPFFIFMVGVSVA 65
Query: 90 LALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
A + D +A KK+I R +KL GI L + PD D +R+ GVL
Sbjct: 66 FAYSKRLDEGVPKAGMYKKIIIRAIKLFVLGIFL----NLIPD-----FDFSHMRIAGVL 116
Query: 146 QRIALSYLLVSLV 158
QRI++ +L S +
Sbjct: 117 QRISIVFLASSFL 129
>gi|189502592|ref|YP_001958309.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
5a2]
gi|189498033|gb|ACE06580.1| hypothetical protein Aasi_1258 [Candidatus Amoebophilus asiaticus
5a2]
Length = 380
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 56/138 (40%), Positives = 80/138 (57%), Gaps = 13/138 (9%)
Query: 29 HLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVG 85
HLK QRL SLD FRGL VA MIL ++ G + + HA W+GC D + PFFLFIVG
Sbjct: 3 HLK-QRLVSLDFFRGLTVAGMILANNPGSWGHIYAPLKHAEWHGCTPTDLIFPFFLFIVG 61
Query: 86 VAIALAL---KRIPDR-ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV-RMIR 140
V+IA A+ K +P+ + + K + R L L GI L + P T ++ + +R
Sbjct: 62 VSIAFAIGSKKELPETHSQLILKSVRRMLTLFCLGIFL----ALYPKIFTSPIEAFKTVR 117
Query: 141 LCGVLQRIALSYLLVSLV 158
+ GVLQR A+ Y + +++
Sbjct: 118 IPGVLQRTAIVYFISTII 135
>gi|54297581|ref|YP_123950.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
gi|53751366|emb|CAH12784.1| hypothetical protein lpp1632 [Legionella pneumophila str. Paris]
Length = 372
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 79/312 (25%), Positives = 127/312 (40%), Gaps = 80/312 (25%)
Query: 30 LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
LK QRL SLD+FRG+ + LMI+V+ A D +P H WNGC LAD V PFFLFIVG+
Sbjct: 6 LKPQRLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65
Query: 88 IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++LK +R +I R++ + + ++ IR+ G+
Sbjct: 66 SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIA+ YL+ + + + + K Q ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
Q + YG +L + V Y D+ +H+Y
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
++PEG +S+ +SI +T+ GV G ++I+ +
Sbjct: 192 ---------------------TYDPEGFVSTFTSIATTLSGVLAGSLLINPCNQFKKFYL 230
Query: 325 WVTMGFALLIFG 336
+G L+ G
Sbjct: 231 LAGVGMLFLLLG 242
>gi|393785792|ref|ZP_10373938.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
CL02T12C05]
gi|392661411|gb|EIY54997.1| hypothetical protein HMPREF1068_00218 [Bacteroides nordii
CL02T12C05]
Length = 361
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 83/315 (26%), Positives = 133/315 (42%), Gaps = 76/315 (24%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
RL SLD+ RG VA MILV++AG + + HA W+G AD V P F+F++G++
Sbjct: 4 NNRLLSLDVLRGFTVAGMILVNNAGACGYGYAPLRHAKWDGFTPADLVFPMFMFLMGIST 63
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L++ + A+ K+I R L L+ GI ++ + + E D +RL GV+QR
Sbjct: 64 YISLRKYDFQWRLAIGKIIKRALLLILIGIAMKWIINSS--ETGIWTDWEHMRLLGVMQR 121
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
+ + Y +++ +F RF V L LL G +
Sbjct: 122 LGICYGATAIMALFIPH--------KRF----------------FPVALLLLAGYF---- 153
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
I+ + K P N + +D VLG NHMY Q
Sbjct: 154 ---ILQLIGNGFEK-------------SPDNIIAIVDSTVLGTNHMY-----------LQ 186
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
F EPEG+LS++ +I +IG G +II+ K + R+++
Sbjct: 187 GRQFV---------------EPEGILSTIPAIAQVMIGFVCGRMIINQKDNKERMQKLFF 231
Query: 328 MGFALLIFGLTLHFT 342
+G +L G +
Sbjct: 232 LGTLMLFAGFLFSYA 246
>gi|307610361|emb|CBW99930.1| hypothetical protein LPW_16871 [Legionella pneumophila 130b]
Length = 372
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 121/290 (41%), Gaps = 80/290 (27%)
Query: 30 LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
LK QRL SLD+FRG+ + LMI V+ A D +P H WNGC LAD V PFFLFIVG+
Sbjct: 6 LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65
Query: 88 IALALKRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++LK +R + +I R++ + + ++ IR+ G+
Sbjct: 66 SVISLKNQMERKEKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIA+ YL+ + + + + K Q ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
Q + YG +L + V Y D+ +H+Y
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
++PEG LS+ +SI +T+ GV G ++I+
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLIN 220
>gi|392390355|ref|YP_006426958.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
DSM 15997]
gi|390521433|gb|AFL97164.1| hypothetical protein Ornrh_0972 [Ornithobacterium rhinotracheale
DSM 15997]
Length = 390
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 78/290 (26%), Positives = 119/290 (41%), Gaps = 82/290 (28%)
Query: 34 RLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
R SLD+FRG VALMILV++ G + ++HA W GC D V PFFLF VG A+A
Sbjct: 4 RYYSLDVFRGATVALMILVNNPGSWSAMFKPLTHAEWAGCTPTDLVFPFFLFAVGNAMAF 63
Query: 91 ALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQR 147
+ R+ V +KV+ RT + G+LL F D + +R+ GVLQR
Sbjct: 64 VIPRMQKAGSQVFWRKVLKRTFLIFIIGLLLNWFPFVQWKDGILTFKHWENVRILGVLQR 123
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
IA +Y +++ + K+ + +++ +L+VY L D
Sbjct: 124 IAFAYFFAAIIAYYFKEKKVL-----------------IISFLLLIVYWLLALLLGGAD- 165
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----IDRKVLGINHMYHHPAWRRSK 263
P + G+ +D +LG +HMYH
Sbjct: 166 ----------------------------PYSMQGFWGTRVDLAILGESHMYHG------- 190
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
EG PF+PEG + ++SS ++G G +I+
Sbjct: 191 --------EG-----------VPFDPEGFVGAISSTAQVLLGYLAGKIIM 221
>gi|333378336|ref|ZP_08470067.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
22836]
gi|332883312|gb|EGK03595.1| hypothetical protein HMPREF9456_01662 [Dysgonomonas mossii DSM
22836]
Length = 387
Score = 85.9 bits (211), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 137/330 (41%), Gaps = 93/330 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
+ RL SLD+ RG+ +A MILV++ G W I HA WNG D + PFF+FI+G++
Sbjct: 6 SSRLLSLDVLRGITIAGMILVNNPG-SWGHIYAPLRHAEWNGLTPTDLIFPFFMFIMGIS 64
Query: 88 IALALKRIPDR--ADAVKKVIFRTLKLLFWGILL-----QGGFSHAPDELTYGVDVRM-- 138
++L++ ++K++ RT + G+ L G HA G R+
Sbjct: 65 TFISLRKFNFEFSVPTLRKILKRTFVIFLIGLGLSWLGVSFGTYHALAADNLGFLERLGR 124
Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
+R+ GV+QR+AL+Y + SL+ IF K +++ +
Sbjct: 125 SVTNFEHLRILGVMQRLALTYGITSLIAIFIKHKYIP----------------YIIVVGL 168
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
+ +L LL+G N + + + VT D+ +LG+N
Sbjct: 169 VGYFLLLLFG-----------NGFATEGYNILAVT-----------------DQSILGLN 200
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
HMY +PEG+LS++ ++ +IG + G +
Sbjct: 201 HMY----------------------------TEFGLDPEGILSTIPAVCHVLIGFYCGKI 232
Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
++ TK + R+ +G L G L +
Sbjct: 233 LMETKDNQQRMLHLFIIGAILTFSGFLLSY 262
>gi|223935576|ref|ZP_03627492.1| conserved hypothetical protein [bacterium Ellin514]
gi|223895584|gb|EEF62029.1| conserved hypothetical protein [bacterium Ellin514]
Length = 410
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 88/320 (27%), Positives = 136/320 (42%), Gaps = 85/320 (26%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
+ RL SLD+FRG +A M+LV++ G W I HA WNG D + PFFL+IVGVA
Sbjct: 22 SARLMSLDVFRGATIASMMLVNNPG-SWDSIYRQLDHAEWNGWTFTDLIFPFFLWIVGVA 80
Query: 88 IALALKRIPD----RADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTYGVDV 136
I L+ ++ D R + V+ R L L F+ L+ G + + ++
Sbjct: 81 IPLSTQKRLDGGASRTNLWLHVVRRAAIIFGLGLFLAFFSFLINGSYGRLGGFGPWFNEI 140
Query: 137 -RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
IR+ GVLQRIA+ YL+ S + + TK +G + + W++ CV V
Sbjct: 141 CGTIRIPGVLQRIAVCYLIASTIYLTTKLRGQIAWLIGLLAAY------WVLMKCVPV-- 192
Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
+G GV L P N Y+D VLG H +H
Sbjct: 193 ---------------------PGHG------AGV---LTPEGNFSAYVDGNVLG-RHTWH 221
Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
AP++PEG++S++ +I + + G+ G +++
Sbjct: 222 ----------------------------GAPWDPEGVISTIPAIATCLFGILTGQLLL-I 252
Query: 316 KGHLARLKQWVTMGFALLIF 335
K + + WV + LLI
Sbjct: 253 KRSVEQKTTWVFVSGILLIL 272
>gi|388456506|ref|ZP_10138801.1| hypothetical protein FdumT_08017 [Fluoribacter dumoffii Tex-KL]
Length = 372
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 52/138 (37%), Positives = 82/138 (59%), Gaps = 13/138 (9%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+T+R+ SLD+FRGL +ALM+LV+ G ++ + H WNGC+LAD V P FLFIVG+
Sbjct: 7 ETKRILSLDVFRGLTMALMVLVNSLGSRENYKILMHVEWNGCSLADLVFPAFLFIVGITT 66
Query: 89 ALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
++L+R +A + ++ RTL L + + P + +D+ IR+ G+L
Sbjct: 67 VISLQRHLNDESKAQLYRSILTRTLLL----MFFGLFLNIFPKQ----IDLSTIRIYGIL 118
Query: 146 QRIALSYLLVSLVEIFTK 163
QRIA YL+ S++ + T
Sbjct: 119 QRIAWCYLICSILYLHTS 136
>gi|433652541|ref|YP_007296395.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
gi|433303074|gb|AGB28889.1| hypothetical protein Prede_1583 [Prevotella dentalis DSM 3688]
Length = 394
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 74/287 (25%), Positives = 121/287 (42%), Gaps = 73/287 (25%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVG 85
+ + +QRL SLD+ RGL V LMI V++ G+ + ++ H+ WNG L D V PFFLFI+G
Sbjct: 7 TRMSSQRLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFPFFLFIMG 66
Query: 86 VAIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
V+ L+L++ A +K+ RTL L G+ + F A + D+ +R+ G
Sbjct: 67 VSTYLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DLSHLRIMG 123
Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
V+QRIAL Y +L+ + C WL
Sbjct: 124 VMQRIALCYGATALIAVG--------------------CQRWLH---------------- 147
Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
D++ + + G + N + +D+ VLG H+YH
Sbjct: 148 --DFRAMPAIIAALLGAYGALLLMGQGYAYDAAINLLSRVDQAVLGHAHLYH-------- 197
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
+P +PEGL+S+++++ T+ G + H
Sbjct: 198 --------------------KSPVDPEGLVSTLAAVAHTLAGFYVAH 224
>gi|212693969|ref|ZP_03302097.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
gi|265751179|ref|ZP_06087242.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|423228550|ref|ZP_17214956.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
CL02T00C15]
gi|423243815|ref|ZP_17224891.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
CL02T12C06]
gi|212663501|gb|EEB24075.1| hypothetical protein BACDOR_03493 [Bacteroides dorei DSM 17855]
gi|263238075|gb|EEZ23525.1| conserved hypothetical protein [Bacteroides sp. 3_1_33FAA]
gi|392635957|gb|EIY29845.1| hypothetical protein HMPREF1063_00776 [Bacteroides dorei
CL02T00C15]
gi|392644181|gb|EIY37924.1| hypothetical protein HMPREF1064_01097 [Bacteroides dorei
CL02T12C06]
Length = 363
Score = 85.5 bits (210), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 134/316 (42%), Gaps = 82/316 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+RL +LDI RG+ +A MILV++ G + + HA +NG D V PFF+FI+G++
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 90 LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L++ ++K++ RT+ + G+LL A T+ ++ R GV+QR
Sbjct: 67 ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
+A+ Y + SLV I K H A +LV A LL T
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 160
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
G FN N V D LG +HMYH
Sbjct: 161 --------------GDGFN---------QSETNVVARFDAWALGTSHMYH---------- 187
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
EG + F+PEGLLS+V ++ ++G + G +++ K + ++++
Sbjct: 188 ------EGGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 231
Query: 326 VTMGFALLIFGLTLHF 341
+G L G L +
Sbjct: 232 FLIGTILTFAGFLLSY 247
>gi|336316712|ref|ZP_08571601.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
gi|335878877|gb|EGM76787.1| hypothetical protein Rhein_3024 [Rheinheimera sp. A13L]
Length = 363
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 55/138 (39%), Positives = 79/138 (57%), Gaps = 13/138 (9%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
+ + R +LD RGLA+ALMILV+ G W + HAPW+G AD V P FLF+VG
Sbjct: 1 MSSPRFYALDALRGLAIALMILVNTPG-SWQHVYTPLLHAPWDGFTFADIVFPTFLFVVG 59
Query: 86 VAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
A+ +LK ++ +V R LKL+ G+LL + + + VD+ +RL GVL
Sbjct: 60 AAMFYSLKTAVLSRQSLWRVSSRALKLIGIGVLL--------NYVPFTVDLAELRLPGVL 111
Query: 146 QRIALSYLLVSLVEIFTK 163
QRI L+Y L +L+ + K
Sbjct: 112 QRIGLAYWLAALLILTVK 129
>gi|345516841|ref|ZP_08796327.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
gi|229437727|gb|EEO47804.1| hypothetical protein BSEG_03945 [Bacteroides dorei 5_1_36/D4]
Length = 363
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 83/316 (26%), Positives = 134/316 (42%), Gaps = 82/316 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+RL +LDI RG+ +A MILV++ G + + HA +NG D V PFF+FI+G++
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 90 LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L++ ++K++ RT+ + G+LL A T+ ++ R GV+QR
Sbjct: 67 ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
+A+ Y + SLV I K H A +LV A LL T
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 160
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
G FN N V D LG +HMYH
Sbjct: 161 --------------GDGFN---------QSETNVVARFDAWALGTSHMYH---------- 187
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
EG + F+PEGLLS+V ++ ++G + G +++ K + ++++
Sbjct: 188 ------EGGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 231
Query: 326 VTMGFALLIFGLTLHF 341
+G L G L +
Sbjct: 232 FLIGTILTFAGFLLSY 247
>gi|290999917|ref|XP_002682526.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
gi|284096153|gb|EFC49782.1| hypothetical protein NAEGRDRAFT_78070 [Naegleria gruberi]
Length = 425
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 90/379 (23%), Positives = 148/379 (39%), Gaps = 118/379 (31%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ +R+ +LDI RG+ + LMI+V++ + + HA W G D V PFFLF++G A A
Sbjct: 62 RKERMVALDIMRGMTIMLMIIVNNQPARAFIPLDHAEWFGFTPTDCVFPFFLFVMGYAAA 121
Query: 90 LALKR----------------------------IPDRADAVKKVIFRTLKLLF------- 114
+ R D D +K ++K ++
Sbjct: 122 IVYSREWPSDVYLYPPSHVKLSIQSYFRELCGKKQDLMDENEKKEEESIKFMYLIPMRKS 181
Query: 115 -------WGILLQG-------GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
W L + GFS + L + + +R+ GV QRIA+ Y +VSL+ +
Sbjct: 182 LYEFVSKWVKLFRRPILMFLIGFSFSV--LAHLFNFTHVRVMGVFQRIAICYFIVSLILV 239
Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYVPDWQFTIINKDSA 217
W ++ V++ +Y+ + +G YVP +
Sbjct: 240 MVP-------------------WTFVQILIVVLFQAIYITVTFGLYVP-------MEGEG 273
Query: 218 DYGKVFNVTCGVRAKL-NPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLR 276
D CG R +L P C A GYIDR +L +H+Y QDS
Sbjct: 274 D-------GCGTRGELYEPRCTAEGYIDRLILSRDHIY-----------LQDS------- 308
Query: 277 KDAPSWCHAPFEPEGLLSSVSSILSTIIGV-HFGHVIIHTKGHLARLKQWVTMGFALLIF 335
++PEG LSS+S++ + +G+ F K RL W MG +++
Sbjct: 309 ----------YDPEGFLSSLSAVTNAFVGILAFKVARAAGKDAHKRLNYWFIMGSLMILA 358
Query: 336 GLTLHFTNGEHGSGKFSTT 354
L + + G ++T+
Sbjct: 359 ALAIDYAGLPIGKKLWTTS 377
>gi|148359197|ref|YP_001250404.1| hypothetical protein LPC_1091 [Legionella pneumophila str. Corby]
gi|296107241|ref|YP_003618941.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
Alcoy]
gi|148280970|gb|ABQ55058.1| conserved hypothetical protein [Legionella pneumophila str. Corby]
gi|295649142|gb|ADG24989.1| hypothetical protein lpa_02399 [Legionella pneumophila 2300/99
Alcoy]
Length = 372
Score = 85.1 bits (209), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 121/290 (41%), Gaps = 80/290 (27%)
Query: 30 LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
LK QRL SLD+FRG+ + LMI+V+ A D +P H WNGC LAD V PFFLFIVG+
Sbjct: 6 LKPQRLLSLDVFRGMTIVLMIIVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65
Query: 88 IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++LK +R +I R++ + + ++ IR+ G+
Sbjct: 66 SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIA+ YL+ + + + + K Q ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
Q + YG +L + V Y D+ +H+Y
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
++PEG LS+ +SI +T+ GV G ++I+
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLIN 220
>gi|340347656|ref|ZP_08670761.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
dentalis DSM 3688]
gi|339608850|gb|EGQ13733.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Prevotella
dentalis DSM 3688]
Length = 386
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 74/285 (25%), Positives = 120/285 (42%), Gaps = 73/285 (25%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+ +QRL SLD+ RGL V LMI V++ G+ + ++ H+ WNG L D V PFFLFI+GV+
Sbjct: 1 MSSQRLISLDVLRGLTVMLMIFVNNGAGEQIFAQLQHSRWNGMTLCDLVFPFFLFIMGVS 60
Query: 88 IALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
L+L++ A +K+ RTL L G+ + F A + D+ +R+ GV+
Sbjct: 61 TYLSLRKTQFVWSARLGRKIARRTLLLFVIGLAIN-WFDMACSGRPF--DLSHLRIMGVM 117
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRIAL Y +L+ + C WL
Sbjct: 118 QRIALCYGATALIAVG--------------------CQRWLH------------------ 139
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
D++ + + G + N + +D+ VLG H+YH
Sbjct: 140 DFRAMPAIIAALLGAYGALLLMGQGYAYDAAINLLSRVDQAVLGHAHLYH---------- 189
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
+P +PEGL+S+++++ T+ G + H
Sbjct: 190 ------------------KSPVDPEGLVSTLAAVAHTLAGFYVAH 216
>gi|146302702|ref|YP_001197293.1| hypothetical protein Fjoh_4975 [Flavobacterium johnsoniae UW101]
gi|146157120|gb|ABQ07974.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
Length = 423
Score = 84.7 bits (208), Expect = 6e-14, Method: Compositional matrix adjust.
Identities = 96/358 (26%), Positives = 145/358 (40%), Gaps = 124/358 (34%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLD+FRGL + LM +V++ G DW P + HA W+GC D V PFF+FI+GVA+
Sbjct: 4 ERLISLDVFRGLTILLMTIVNNPG-DWGNVYPPLLHAEWHGCTPTDLVFPFFIFIMGVAV 62
Query: 89 ALALKRIPDR---ADAVKKVIFRTLKLLFWGIL--------------------------- 118
LA+ PD+ + K++ R+L++L GI
Sbjct: 63 PLAM---PDKFYDSTTFNKILVRSLRMLCLGIFFNFFGKIQLFGLEGIPLLIGRLAITIA 119
Query: 119 ----LQGGFSHAPDE------------LTY-GVDVRM-IRLCGVLQRIALSYLLVSLVEI 160
L G FS L Y G++ +RL GVLQRIA+ Y +VSL +
Sbjct: 120 VGYALMGSFSSKVKNILAFSILFIYLFLAYSGIEAYHDVRLPGVLQRIAIVYFVVSL--L 177
Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
+ K Q G +L+ Y A++ VP
Sbjct: 178 YLKTSQRTQIITG---------------IVLLLGYWAIMTLIPVP--------------- 207
Query: 221 KVFNVTCGV-RAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDA 279
G+ A L N ++D +L HMY +
Sbjct: 208 -------GIGEANLEKGTNLASWLDSVLLK-GHMY----------------------RGT 237
Query: 280 PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
+W +PEG+LS++ SI++ IIG+ G V+ + + ++ G L+ FGL
Sbjct: 238 ITW-----DPEGILSTLPSIVNGIIGLLIGQVLQRDTTKILKAQKMGIAGTILIFFGL 290
>gi|336416828|ref|ZP_08597160.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
3_8_47FAA]
gi|335937266|gb|EGM99170.1| hypothetical protein HMPREF1017_04268 [Bacteroides ovatus
3_8_47FAA]
Length = 371
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 91/320 (28%), Positives = 133/320 (41%), Gaps = 78/320 (24%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
+K++RL SLDI RG+ + MILV++ G W I HA WNG D V PFF+FI+G
Sbjct: 1 MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59
Query: 86 VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
V+++ AL R + K++ RT+ L G+ L FS GV+ IR+
Sbjct: 60 VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-WFSLVCT----GVEQPFSHIRI 114
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQR+AL+Y SL+ + + + W ++ +L Y LL
Sbjct: 115 LGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILAGYSILL-- 156
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
G F ++ N + DR + G H+Y
Sbjct: 157 ----------------ALGHGFELS---------EQNIIAVTDRTLFGEAHLY------- 184
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
R+ P F+PEGLLS++ I IIG G+++ R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230
Query: 322 LKQWVTMGFALLIFGLTLHF 341
L Q +G ALL G L +
Sbjct: 231 LLQISILGIALLFAGWLLSY 250
>gi|397667386|ref|YP_006508923.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
[Legionella pneumophila subsp. pneumophila]
gi|395130797|emb|CCD09044.1| putative Heparan-alpha-glucosaminide N-acetyltransferase
[Legionella pneumophila subsp. pneumophila]
Length = 372
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 120/290 (41%), Gaps = 80/290 (27%)
Query: 30 LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
LK QRL SLD+FRG+ + LMI V+ A D +P H WNGC LAD V PFFLFIVG+
Sbjct: 6 LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65
Query: 88 IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++LK +R +I R++ + + ++ IR+ G+
Sbjct: 66 SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIA+ YL+ + + + + K Q ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
Q + YG +L + V Y D+ +H+Y
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
++PEG LS+ +SI +T+ GV G ++I+
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLIN 220
>gi|398341237|ref|ZP_10525940.1| hypothetical protein LkirsB1_18883 [Leptospira kirschneri serovar
Bim str. 1051]
Length = 383
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 80/303 (26%), Positives = 127/303 (41%), Gaps = 79/303 (26%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
+ KS R+ SLD+FRG+ VA MILV++ G + + HA WNGC D V PFF
Sbjct: 1 MENKSTQNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFF 60
Query: 81 LFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
LF VG++I L++ K ++ + R++ L+ G+ L + EL
Sbjct: 61 LFAVGISIQLSVYSKNKIHKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL-------- 112
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
R+ GVLQRI Y +V+ + + R+ W+ +L+V+ +
Sbjct: 113 -RIPGVLQRIGFVYWIVASLHLILPK--------------RMILISWI---PILLVHTWV 154
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
L P +S Y L P + +IDR V G NH+
Sbjct: 155 LIQIPAPG--------ESIVY-------------LEPGKDIGAWIDRNVFGENHL----- 188
Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
W+ SK ++PEGL S +SSI ++++GV G ++
Sbjct: 189 WKFSKT----------------------WDPEGLFSGISSIATSLLGVFCGSILSSKTNE 226
Query: 319 LAR 321
+ +
Sbjct: 227 IKK 229
>gi|423289836|ref|ZP_17268686.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
CL02T12C04]
gi|392666578|gb|EIY60091.1| hypothetical protein HMPREF1069_03729 [Bacteroides ovatus
CL02T12C04]
Length = 371
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 91/320 (28%), Positives = 133/320 (41%), Gaps = 78/320 (24%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
+K++RL SLDI RG+ + MILV++ G W I HA WNG D V PFF+FI+G
Sbjct: 1 MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59
Query: 86 VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
V+++ AL R + K++ RT+ L G+ L FS GV+ IR+
Sbjct: 60 VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-WFSLVCT----GVEQPFSHIRI 114
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQR+AL+Y SL+ + + + W ++ +L Y LL
Sbjct: 115 LGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILAGYSILL-- 156
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
G F ++ N + DR + G H+Y
Sbjct: 157 ----------------ALGHGFELS---------EQNIIAVTDRTLFGEAHLY------- 184
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
R+ P F+PEGLLS++ I IIG G+++ R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230
Query: 322 LKQWVTMGFALLIFGLTLHF 341
L Q +G ALL G L +
Sbjct: 231 LLQISILGIALLFAGWLLSY 250
>gi|397664114|ref|YP_006505652.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
[Legionella pneumophila subsp. pneumophila]
gi|395127525|emb|CCD05722.1| Putative heparan-alpha-glucosaminide N-acetyltransferase
[Legionella pneumophila subsp. pneumophila]
Length = 372
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 120/290 (41%), Gaps = 80/290 (27%)
Query: 30 LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
LK QRL SLD+FRG+ + LMI V+ A D +P H WNGC LAD V PFFLFIVG+
Sbjct: 6 LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65
Query: 88 IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++LK +R +I R++ + + ++ IR+ G+
Sbjct: 66 SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIA+ YL+ + + + + K Q ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
Q + YG +L + V Y D+ +H+Y
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
++PEG LS+ +SI +T+ GV G ++I+
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLIN 220
>gi|160883406|ref|ZP_02064409.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
gi|156111126|gb|EDO12871.1| hypothetical protein BACOVA_01375 [Bacteroides ovatus ATCC 8483]
Length = 371
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 91/320 (28%), Positives = 133/320 (41%), Gaps = 78/320 (24%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
+K++RL SLDI RG+ + MILV++ G W I HA WNG D V PFF+FI+G
Sbjct: 1 MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59
Query: 86 VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
V+++ AL R + K++ RT+ L G+ L FS GV+ IR+
Sbjct: 60 VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-WFSLVCT----GVEQPFSHIRI 114
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQR+AL+Y SL+ + + + W ++ +L Y LL
Sbjct: 115 LGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILAGYSILL-- 156
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
G F ++ N + DR + G H+Y
Sbjct: 157 ----------------ALGHGFELS---------EQNIIAVTDRTLFGEAHLY------- 184
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
R+ P F+PEGLLS++ I IIG G+++ R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230
Query: 322 LKQWVTMGFALLIFGLTLHF 341
L Q +G ALL G L +
Sbjct: 231 LLQISILGIALLFAGWLLSY 250
>gi|54294550|ref|YP_126965.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
gi|53754382|emb|CAH15866.1| hypothetical protein lpl1626 [Legionella pneumophila str. Lens]
Length = 372
Score = 84.7 bits (208), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 120/290 (41%), Gaps = 80/290 (27%)
Query: 30 LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
LK QRL SLD+FRG+ + LMI V+ A D +P H WNGC LAD V PFFLFIVG+
Sbjct: 6 LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65
Query: 88 IALALKRIPDRADAV---KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++LK +R + +I R++ + + ++ IR+ G+
Sbjct: 66 SVISLKNQMERKEKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIA+ YL+ + + + + K Q ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
Q + YG +L + V Y D+ H+Y
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSAPHLYEK-------- 191
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
++PEG LS+ +SI +T+ GV G ++I+
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLIN 220
>gi|237710367|ref|ZP_04540848.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
gi|229455829|gb|EEO61550.1| conserved hypothetical protein [Bacteroides sp. 9_1_42FAA]
Length = 366
Score = 84.3 bits (207), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 81/316 (25%), Positives = 131/316 (41%), Gaps = 82/316 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+RL +LDI RG+ +A MILV++ G + + HA +NG D V PFF+FI+G++
Sbjct: 10 KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHASFNGLTPTDLVFPFFMFIMGISTY 69
Query: 90 LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L++ ++K++ RT+ + G+LL A T+ ++ R GV+QR
Sbjct: 70 ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 126
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
+A+ Y + SLV I K H A +LV A LL T
Sbjct: 127 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 163
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
G FN N V D LG +HMYH
Sbjct: 164 --------------GDGFN---------QSETNVVARFDAWALGTSHMYHESG------- 193
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
F+PEGLLS+V ++ ++G + G +++ K + ++++
Sbjct: 194 -------------------MAFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 234
Query: 326 VTMGFALLIFGLTLHF 341
+G L G L +
Sbjct: 235 FLIGTILTFAGFLLSY 250
>gi|410096828|ref|ZP_11291813.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225445|gb|EKN18364.1| hypothetical protein HMPREF1076_00991 [Parabacteroides goldsteinii
CL02T12C30]
Length = 367
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 78/298 (26%), Positives = 126/298 (42%), Gaps = 87/298 (29%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
++ RL SLD+ RG+ +A MI+V++ G + + HA WNG D V PFF+FI+GV+
Sbjct: 3 QSGRLLSLDVMRGITIAGMIMVNNPGSWGYVYAPLRHASWNGLTPTDLVFPFFMFIMGVS 62
Query: 88 IALALKRIPDRA--DAVKKVIFRTLKLLFWGILLQ-------GGFSHAPDELTYGVDVRM 138
+ +L++ + ++V KV+ RT+ + G L GFS+ +
Sbjct: 63 MFFSLRKYDFKLSRESVTKVLKRTVLIFLVGFALNLFGHLCYNGFSNFEN---------- 112
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
+R+ GV+QR+AL+Y + SL+ + K AA +L+ Y L
Sbjct: 113 LRILGVMQRLALAYGIGSLIGLSVKHKYILQT-----------------AAGILLFYWIL 155
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
L T ++ N + +DR + G HMYH
Sbjct: 156 LAAT---------------------------GSQTLSENNIIAIVDRALFGNTHMYH--- 185
Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
D +G F+PEGLLS + SI ++G + G +I+ K
Sbjct: 186 ---------DYLADGT---------RIAFDPEGLLSCLGSIGHVLLGFYVGKMILDCK 225
>gi|386819709|ref|ZP_10106925.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
gi|386424815|gb|EIJ38645.1| hypothetical protein JoomaDRAFT_1633 [Joostella marina DSM 19592]
Length = 366
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 89/309 (28%), Positives = 129/309 (41%), Gaps = 82/309 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
+R SLD+FRGL + LMI+V+ G W I HAPWNG L D V P FLF+VG A+
Sbjct: 6 ERYLSLDVFRGLTLFLMIIVNTPG-SWSFIYKPLHHAPWNGFTLTDLVFPTFLFVVGNAM 64
Query: 89 ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
+ +LK+ + + +KKV RT + G LL F D + R+ GVLQ
Sbjct: 65 SFSLKKFEEIGNTAFLKKVFKRTFLIFLIGFLLY-WFPFFKDGALK--PISETRIFGVLQ 121
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
RIAL Y +L+ + K + FS+ L +H ++ LL+G
Sbjct: 122 RIALCYCFAALILHYW-----KPKGALIFSVIALVGYHIIL----------LLFG----- 161
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
L NA D ++G +HMY +
Sbjct: 162 -------------------------DLTMQGNAAIKADLWLIGSSHMYKGEGF------- 189
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWV 326
PF+PEG+LS++ +I++ I G +F V + KG +
Sbjct: 190 -------------------PFDPEGVLSTLPAIVNVIAG-YFAGVFLQQKGKTYEAIAKL 229
Query: 327 TMGFALLIF 335
TM +LIF
Sbjct: 230 TMVGGVLIF 238
>gi|383111974|ref|ZP_09932776.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
gi|423296747|ref|ZP_17274817.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
CL03T12C18]
gi|313696106|gb|EFS32941.1| hypothetical protein BSGG_3641 [Bacteroides sp. D2]
gi|392669124|gb|EIY62615.1| hypothetical protein HMPREF1070_03482 [Bacteroides ovatus
CL03T12C18]
Length = 371
Score = 84.3 bits (207), Expect = 8e-14, Method: Compositional matrix adjust.
Identities = 91/320 (28%), Positives = 133/320 (41%), Gaps = 78/320 (24%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
+K++RL SLDI RG+ + MILV++ G W I HA WNG D V PFF+FI+G
Sbjct: 1 MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59
Query: 86 VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD--VRMIRL 141
V+++ AL R + K++ RT+ L G+ L FS GV+ IR+
Sbjct: 60 VSMSFALSRFDHHFSRGFIIKLVRRTVILFLLGLFLS-WFSLVCT----GVEQPFSHIRI 114
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQR+AL+Y SL+ + + + W ++ +L Y LL
Sbjct: 115 LGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILAGYSILL-- 156
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
G F ++ N + DR + G H+Y
Sbjct: 157 ----------------ALGHGFELS---------EQNIIAVTDRTLFGEAHLY------- 184
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
R+ P F+PEGLLS++ I IIG G+++ R
Sbjct: 185 --------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILREKTEIHHR 230
Query: 322 LKQWVTMGFALLIFGLTLHF 341
L Q +G ALL G L +
Sbjct: 231 LLQISILGIALLFAGWLLSY 250
>gi|332664355|ref|YP_004447143.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332333169|gb|AEE50270.1| Protein of unknown function DUF2261, transmembrane
[Haliscomenobacter hydrossis DSM 1100]
Length = 438
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 90/351 (25%), Positives = 141/351 (40%), Gaps = 91/351 (25%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVA 87
+ RL SLD+ RGL +A MILV++ G DW + HA W+GC D+V PFFLF+VGVA
Sbjct: 4 SNRLLSLDVMRGLTIAGMILVNNPG-DWGNVYGPLLHADWHGCTPTDWVFPFFLFMVGVA 62
Query: 88 IALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
I LAL + D + + +K+I R+L ++ G+ L +H T +
Sbjct: 63 IPLALGKRKDEGEDLRKIYRKIISRSLIIIGLGLFLT---AHPTFYFTDKTSPWYV---- 115
Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH----------WLMAACVLV 193
+ L + ++V +FT++V ++ Q F+ W W AC++V
Sbjct: 116 ----VHLIIMATAMVAVFTREVLNQKQ-------FQTETWQQRRKWVSYLAWSAIACMVV 164
Query: 194 ----------------------VYLA--LLYGTYVPDWQFTIINKDSADYGKVFN---VT 226
VYLA L+ P Q Y + V
Sbjct: 165 LGIFYYDFSHMRFPGVLQRIGLVYLACGFLFLKASPRMQLLTGVGLLLLYWGLMTLVPVP 224
Query: 227 CGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
G+ L N ++DR + NH+ W K
Sbjct: 225 GGIAPNLEAETNLGAWLDRAIFSTNHL-----WAAVKT---------------------- 257
Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
++PEGLLS++ +I + I G+ G + K ++ + +G L GL
Sbjct: 258 WDPEGLLSTIPAIGTGIAGMLAGEWVRSEKSDYEKVSGLLAVGALLFALGL 308
>gi|440747989|ref|ZP_20927244.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
gi|436483731|gb|ELP39771.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
Length = 382
Score = 84.3 bits (207), Expect = 9e-14, Method: Compositional matrix adjust.
Identities = 52/134 (38%), Positives = 77/134 (57%), Gaps = 9/134 (6%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
+R +LD+ RGL +ALMI+V+ G DW + HA W+G + D V P FLF+VG A+
Sbjct: 13 ERYLALDVLRGLTIALMIVVNTPG-DWSNVFSPLLHADWHGFTITDLVFPTFLFVVGNAM 71
Query: 89 ALALKRIPDRAD-AVKKVIFRTLKLLF---WGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
+ ++K++ + A K +F+ L+F WG+ F + +D +RL GV
Sbjct: 72 SFSMKKMEKMSQGAFLKKVFKRAALIFLIGWGLNAFPFFETNETGVVSMIDWSAVRLLGV 131
Query: 145 LQRIALSYLLVSLV 158
LQRIAL YL+ SLV
Sbjct: 132 LQRIALCYLIASLV 145
>gi|374312698|ref|YP_005059128.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358754708|gb|AEU38098.1| protein of unknown function DUF1624 [Granulicella mallensis
MP5ACTX8]
Length = 383
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 84/315 (26%), Positives = 136/315 (43%), Gaps = 82/315 (26%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+ RL S+D+ RGL VA MILV++ G + + ++H+ WNG D V P FLFI+G+++
Sbjct: 15 SSRLLSIDVLRGLTVAFMILVNNNGNNDLAYRALNHSLWNGFTPTDLVFPTFLFIMGISM 74
Query: 89 ALALKRIPDRADAVKKVIF------RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
L+ RA A + R L+F+G+++ G D L R+
Sbjct: 75 VLSFS--AHRAKATSNTVMLTSIGRRFALLIFFGLVVNGFPYFHLDTL---------RIY 123
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQRIA+ YLL +L+++ T + + V F F +W++ V V G
Sbjct: 124 GVLQRIAVCYLLAALLQLVTDRIAPR---VVLF--FAAVIGYWVLLRFVPVP------GH 172
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
+P F +++ D N V ++DR H++ H + ++
Sbjct: 173 GIPGRDFPLLDHD---------------------INLVAWLDR------HIFPHRLFEKT 205
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
+ +PEGLLS + + STI+G+ G I + +L
Sbjct: 206 R------------------------DPEGLLSDIPAFASTILGLLAGAWIKQARPAGQKL 241
Query: 323 KQWVTMGFALLIFGL 337
G AL + GL
Sbjct: 242 MGLFGAGIALAVAGL 256
>gi|52841889|ref|YP_095688.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
pneumophila str. Philadelphia 1]
gi|378777523|ref|YP_005185961.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
pneumophila ATCC 43290]
gi|52629000|gb|AAU27741.1| hypothetical protein lpg1661 [Legionella pneumophila subsp.
pneumophila str. Philadelphia 1]
gi|364508338|gb|AEW51862.1| hypothetical protein lp12_1599 [Legionella pneumophila subsp.
pneumophila ATCC 43290]
Length = 372
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 77/289 (26%), Positives = 119/289 (41%), Gaps = 80/289 (27%)
Query: 30 LKTQRLASLDIFRGLAVALMILVD-HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
LK QRL SLD+FRG+ + LMI V+ A D +P H WNGC LAD V PFFLFIVG+
Sbjct: 6 LKPQRLLSLDVFRGMTIVLMIFVNGQAAIDPYPIFEHVDWNGCTLADLVFPFFLFIVGLT 65
Query: 88 IALALKRIPDR---ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++LK +R +I R++ + + ++ IR+ G+
Sbjct: 66 SVISLKNQMERKAKTSLYSAIIERSVV--------LFLLGLFLNVFPHPIEFDSIRIYGI 117
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIA+ YL+ + + + + K Q ++L LL G ++
Sbjct: 118 LQRIAVCYLISAFIYL---NTSIKTQ---------------------FFIFLVLLLGYWI 153
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
Q + YG +L + V Y D+ +H+Y
Sbjct: 154 IMTQVPV-----PGYGA---------NQLTKDGSWVSYFDQLFFSASHLYEK-------- 191
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
++PEG LS+ +SI +T+ GV G ++I
Sbjct: 192 ---------------------TYDPEGFLSTFTSIATTLSGVLAGSLLI 219
>gi|410631394|ref|ZP_11342069.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
BSs20135]
gi|410148840|dbj|GAC18936.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola arctica
BSs20135]
Length = 366
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/309 (26%), Positives = 129/309 (41%), Gaps = 85/309 (27%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
R+ ++D+ RGLA+ALM+LV++ G W + HA W+G D V PFFLF++G ++A
Sbjct: 8 RIEAIDVLRGLALALMLLVNNPG-SWSAVYAPFLHADWHGLTPTDLVFPFFLFVMGASMA 66
Query: 90 LALK-RIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVLQR 147
+L+ +I + R+ L+F G LLQ F APD R+ GVLQR
Sbjct: 67 CSLRGQIQASGLPWLSIFKRSFLLVFIGFLLQIIPFDQAPDTW---------RIMGVLQR 117
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
I L +LLV+ + K+ W L A L+VY LL
Sbjct: 118 IGLCFLLVASMLAIIKE-----------------RWLLLSAVVTLIVYWLLL-------- 152
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
++ A Y + N+V + D +LG HM+
Sbjct: 153 ----LSAGQAPY--------------SLENNSVRHFDMAILGSAHMWQGKG--------- 185
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
PF+PEGLLS++ + ++ + G ++ K ++ Q +
Sbjct: 186 -----------------LPFDPEGLLSTIGAAMTVLSGYLICVNVLQQKNQKQQILQLMI 228
Query: 328 MGFALLIFG 336
+G LL G
Sbjct: 229 VGAILLALG 237
>gi|255093765|ref|ZP_05323243.1| hypothetical protein CdifC_14056 [Clostridium difficile CIP 107932]
Length = 505
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 81/154 (52%), Gaps = 16/154 (10%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFV 76
+S+ S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF
Sbjct: 125 KISNNVVDSKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFA 184
Query: 77 MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
PFF+ +GV I ++ LK + + R++ L+ +G L + P
Sbjct: 185 FPFFVISLGVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP----- 237
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
D+ +R+ GVLQR+ L Y + SLV + K +
Sbjct: 238 --DLNSVRILGVLQRMGLVYFVTSLVYLLLKKLN 269
>gi|418694540|ref|ZP_13255577.1| putative membrane protein [Leptospira kirschneri str. H1]
gi|409957715|gb|EKO16619.1| putative membrane protein [Leptospira kirschneri str. H1]
Length = 383
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 127/313 (40%), Gaps = 99/313 (31%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
+ KS R+ SLD+FRG+ VA MILV++ G + + HA WNGC D V PFF
Sbjct: 1 MENKSTQNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFF 60
Query: 81 LFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
LF VG++I L++ K ++ + R++ L+ G+ L + EL
Sbjct: 61 LFAVGISIQLSVYSKNKIYKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL-------- 112
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
R+ GVLQRI Y W++A+ L++ +
Sbjct: 113 -RIPGVLQRIGFVY--------------------------------WIVASLHLILPKRM 139
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVL 248
+ +++P + V V ++ PP ++ Y IDR V
Sbjct: 140 ILISWIP----------------ILLVHTWVLIQIPPPGESIVYLEPGKDIGAWIDRNVF 183
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G NH+ W+ SK ++PEGL S +SSI ++++GV
Sbjct: 184 GENHL-----WKFSKT----------------------WDPEGLFSGISSIATSLLGVFC 216
Query: 309 GHVIIHTKGHLAR 321
G ++ + +
Sbjct: 217 GSILSSKTNEIKK 229
>gi|373460170|ref|ZP_09551926.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
gi|371956555|gb|EHO74341.1| hypothetical protein HMPREF9944_00190 [Prevotella maculosa OT 289]
Length = 359
Score = 84.0 bits (206), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 83/320 (25%), Positives = 136/320 (42%), Gaps = 86/320 (26%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
++ +RL SLD+ RG+ V LMILV++ G+ + + H+ WNG D V PFFLFI+G++
Sbjct: 1 MEKKRLLSLDVLRGMTVCLMILVNNGAGEHIYSTLQHSKWNGMTPCDLVFPFFLFIMGIS 60
Query: 88 IALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG--VDVRMIRLC 142
L+LK+ +R A K+ RT+ L G+ F + D L G +D +R+
Sbjct: 61 TFLSLKQTNFAWNRQTAC-KIAKRTVLLFAIGL-----FINWFDLLLQGRALDFEHLRIW 114
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GV+QRIA+ Y V +F + K L+A ++ + L+ G
Sbjct: 115 GVMQRIAICY---GAVSVFALSINHKRTLP-------------LIATLLIAYAMFLMLGN 158
Query: 203 -YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
Y D Q N + ID + G H+YH
Sbjct: 159 GYAYDSQ----------------------------QNLIAQIDIHLFGQAHLYH------ 184
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
+P +PEGL SS+ +I T+IG + G ++ + +
Sbjct: 185 ----------------------KSPVDPEGLASSLPAIAHTLIGFYCGRLMAMARTTEEK 222
Query: 322 LKQWVTMGFALLIFGLTLHF 341
+ +++ +G L++ G F
Sbjct: 223 VLKFMLVGGVLVLIGYLASF 242
>gi|255315516|ref|ZP_05357099.1| hypothetical protein CdifQCD-7_14229, partial [Clostridium
difficile QCD-76w55]
Length = 381
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/154 (32%), Positives = 81/154 (52%), Gaps = 16/154 (10%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFV 76
+S+ S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF
Sbjct: 1 KISNNVVDSKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFA 60
Query: 77 MPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
PFF+ +GV I ++ LK + + R++ L+ +G L + P
Sbjct: 61 FPFFVISLGVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP----- 113
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQ 166
D+ +R+ GVLQR+ L Y + SLV + K +
Sbjct: 114 --DLNSVRILGVLQRMGLVYFVTSLVYLLLKKLN 145
>gi|373953861|ref|ZP_09613821.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
gi|373890461|gb|EHQ26358.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
Length = 379
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 81/281 (28%), Positives = 121/281 (43%), Gaps = 75/281 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
QRL SLD+FRGL VA MILV++ G DW I H+ WNGC D + PFFLFIVGV+I
Sbjct: 14 QRLLSLDVFRGLTVACMILVNNPG-DWAHIYSPLEHSAWNGCTPTDLIFPFFLFIVGVSI 72
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
++ K++ LK L S P + +R+ GVLQRI
Sbjct: 73 VYSMGTKKTDPAQHGKLVLTILKRSLILFCLALFLSLYPK-----FNFHTLRIPGVLQRI 127
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
A+ + + + IF K + K Q + +F L+ L+VY L+
Sbjct: 128 AVVFGICGI--IFLKT-ERKTQLI----LFWLF----------LIVYYLLM--------- 161
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
T++ Y A L P N +IDR V+G H++
Sbjct: 162 -TLVPVPGVGY-----------ANLQPETNLGAWIDRTVIGNVHLW-------------- 195
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFG 309
K++ +W +PEG+L ++ + + + G+ G
Sbjct: 196 --------KESVTW-----DPEGILGTMPATSTGLFGILVG 223
>gi|325103749|ref|YP_004273403.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324972597|gb|ADY51581.1| hypothetical protein Pedsa_1010 [Pedobacter saltans DSM 12145]
Length = 374
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 50/141 (35%), Positives = 78/141 (55%), Gaps = 6/141 (4%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
+ + R SLD+ RG VA MI+V+ G + + HAPW+G + D V P F
Sbjct: 1 MANANSIPKPRYLSLDVLRGATVAFMIIVNTPGSWSYVYAPLKHAPWHGFTVTDLVFPTF 60
Query: 81 LFIVGVAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVR 137
LF+VG A++ + ++ ++ ++ +KKV RTLK+ G+ L F D++ D
Sbjct: 61 LFVVGNAMSFGMGKLKEQGNSAFLKKVFSRTLKIFLIGLFLNMFPFVKWVDDVLVMKDFT 120
Query: 138 MIRLCGVLQRIALSYLLVSLV 158
IR+ GVLQRIA+ Y + SL+
Sbjct: 121 EIRIWGVLQRIAVCYCIASLL 141
>gi|421109691|ref|ZP_15570204.1| putative membrane protein [Leptospira kirschneri str. H2]
gi|410005185|gb|EKO58983.1| putative membrane protein [Leptospira kirschneri str. H2]
Length = 383
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/313 (24%), Positives = 127/313 (40%), Gaps = 99/313 (31%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
+ KS R+ SLD+FRG+ VA MILV++ G + + HA WNGC D V PFF
Sbjct: 1 MENKSTQNKNRVLSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFF 60
Query: 81 LFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
LF VG++I L++ K ++ + R++ L+ G+ L + EL
Sbjct: 61 LFAVGISIQLSVYSKNKIYKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL-------- 112
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
R+ GVLQRI Y W++A+ L++ +
Sbjct: 113 -RIPGVLQRIGFVY--------------------------------WIVASLHLILPKRM 139
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVL 248
+ +++P + V V ++ PP ++ Y IDR V
Sbjct: 140 ILISWIP----------------ILLVHTWVLIQIPPPGESIVYLEPGKDIGAWIDRNVF 183
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G NH+ W+ SK ++PEGL S +SSI ++++GV
Sbjct: 184 GENHL-----WKFSKT----------------------WDPEGLFSGISSIATSLLGVFC 216
Query: 309 GHVIIHTKGHLAR 321
G ++ + +
Sbjct: 217 GSILSSKTNEIKK 229
>gi|333378010|ref|ZP_08469743.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
22836]
gi|332884030|gb|EGK04310.1| hypothetical protein HMPREF9456_01338 [Dysgonomonas mossii DSM
22836]
Length = 389
Score = 83.6 bits (205), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 141/329 (42%), Gaps = 89/329 (27%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+ RL SLDI RG+ +A MI+V++ G + + HA W+G D V PFF+FI+G++
Sbjct: 6 SSRLLSLDILRGITIAGMIMVNNPGSWSYVYAPLGHAAWHGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQ------GGFSHAPDE--------LTY 132
++L++ + + K++ RT+ + G+ L F+ E +T
Sbjct: 66 YISLRKFNFEFNKPTLFKILKRTVVIFLIGLGLGWLSLSFRTFNSLSGEDIGFFERFITA 125
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
+ IR+ GV+QR+AL+Y +L+ IF K +++ ++
Sbjct: 126 ITNFEHIRILGVMQRLALTYGATALIAIFVKHKYIP----------------YIIVVTLI 169
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
+L LL+G + D+ + N + +DR +LG +H
Sbjct: 170 GYFLLLLFG-------------NGFDFSE---------------DNIISVLDRAILGADH 201
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
MY +DS +PEGLLS++ +I +IG G ++
Sbjct: 202 MY------------KDSGLA--------------IDPEGLLSTIPAICHVLIGFCCGEIL 235
Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+ TK + R+++ +G + G L +
Sbjct: 236 LTTKDNNERIQRLFIIGAIMTFLGFLLSY 264
>gi|333030942|ref|ZP_08459003.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
gi|332741539|gb|EGJ72021.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
Length = 385
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 85/333 (25%), Positives = 135/333 (40%), Gaps = 90/333 (27%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+ ++RL SLDI RG + MILV++ G + + HA W+G D + PFF+F++G+
Sbjct: 1 MSSKRLLSLDILRGGTIIGMILVNNPGSWEYIYSPLRHAEWHGLTPTDLIFPFFIFVMGI 60
Query: 87 AIALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM---- 138
+++L+ + + +KVI R+ KL L G F L G++ R+
Sbjct: 61 SMSLSFSKFKNEEYNKTLFWEKVIKRSAKL-----FLLGLFLSWFSLLLEGINNRLEYES 115
Query: 139 ----------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
IR+ GV+QR+ALSYL+ S +F + V + +
Sbjct: 116 ISEILFPFGQIRILGVMQRLALSYLVGS---VFVMLIPKAKHLV-------------ITS 159
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
+L+ Y LL G F+ + N + +D +
Sbjct: 160 VILLIAYFILL------------------SLGNGFSFSSD---------NIIAIVDNSLF 192
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G NH+Y W P F+PEGLLS++ I+ I+G
Sbjct: 193 GENHVYLE--W-------------------LPDGERLRFDPEGLLSTIPCIVQVIMGYLC 231
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
G VI K L ++ +G LL GL L +
Sbjct: 232 GEVIRKKKDLLNKMMDLAIIGIVLLFIGLLLSY 264
>gi|389809458|ref|ZP_10205319.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter thiooxydans LCS2]
gi|388441722|gb|EIL97972.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter thiooxydans LCS2]
Length = 353
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 55/139 (39%), Positives = 78/139 (56%), Gaps = 16/139 (11%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+RLASLD RG VA M+LV+ G DW + +HA WNGC D V PFFLF+VGV++
Sbjct: 2 KRLASLDALRGCTVAAMLLVNDPG-DWSHVYWPLAHAAWNGCTPTDLVFPFFLFVVGVSV 60
Query: 89 ALA-LKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
ALA L R+ A + ++R L++L G+ + + + +R GV
Sbjct: 61 ALAILPRLEQGASPSALTRAAMWRALRILALGVAIN-------LLAAWWLPQAHLRFPGV 113
Query: 145 LQRIALSYLLVSLVEIFTK 163
LQRIAL + V+L ++TK
Sbjct: 114 LQRIALCFAGVALFAVYTK 132
>gi|320105553|ref|YP_004181143.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
gi|319924074|gb|ADV81149.1| hypothetical protein AciPR4_0312 [Terriglobus saanensis SP1PR4]
Length = 412
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/327 (24%), Positives = 135/327 (41%), Gaps = 74/327 (22%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPF 79
+D ++ K RL SLD+ RG+ + MILV++ G+ + + HA WNG D V P
Sbjct: 18 TDSAARTTHKPARLLSLDVLRGVTIGFMILVNNQTGEGAFFPLQHAKWNGFTPTDLVFPT 77
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGG-FSHAPDELTYGV 134
FL +VG++ L+ + R A + TL+ L +G+++ F H
Sbjct: 78 FLLLVGLSTVLSTEARLARGVAKSTIFLHTLQRSAVLFLFGLIVNNAPFFH--------- 128
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
++ +R+ GVL RIA+ Y +V + + +D++ R + AAC LV
Sbjct: 129 -LQTLRVYGVLPRIAVCYFIVGSLYLLVRDLKQ-----------RAFILAAAAAAC-LVG 175
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
Y AL+ +P F +P N V YIDR + +H+Y
Sbjct: 176 YWALMRFIPIPG----------------FGTPTHEIPINDPDGNLVAYIDRHIFSASHLY 219
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
T+D PEGLLS++ ++ + + G+ G +
Sbjct: 220 EK---------TRD--------------------PEGLLSTIPAVATALFGILAGIWLRT 250
Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHF 341
++ + + K G + LI G H
Sbjct: 251 SRSTMQKAKGIEYAGISFLILGGAWHL 277
>gi|423239671|ref|ZP_17220787.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
CL03T12C01]
gi|392645711|gb|EIY39434.1| hypothetical protein HMPREF1065_01410 [Bacteroides dorei
CL03T12C01]
Length = 363
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 82/316 (25%), Positives = 133/316 (42%), Gaps = 82/316 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+RL +LDI RG+ +A MILV++ G + + H +NG D V PFF+FI+G++
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYAPLEHVAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 90 LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L++ ++K++ RT+ + G+LL A T+ ++ R GV+QR
Sbjct: 67 ISLRKYNFTYSHATLRKIMKRTVIIFCIGLLLN---LLAKSVFTHHLNFEEWRYLGVMQR 123
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA--LLYGTYVP 205
+A+ Y + SLV I K H A +LV A LL T
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAAYFLLLAT--- 160
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
G FN N V D LG +HMYH
Sbjct: 161 --------------GDGFN---------QSETNVVARFDAWALGTSHMYH---------- 187
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
EG + F+PEGLLS+V ++ ++G + G +++ K + ++++
Sbjct: 188 ------EGGM----------AFDPEGLLSTVPAVCHVMVGFYCGKLLLSAKDNAEKIQRL 231
Query: 326 VTMGFALLIFGLTLHF 341
+G L G L +
Sbjct: 232 FLIGTILTFAGFLLSY 247
>gi|329851960|ref|ZP_08266641.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
gi|328839809|gb|EGF89382.1| hypothetical protein ABI_47300 [Asticcacaulis biprosthecum C19]
Length = 369
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 117/287 (40%), Gaps = 72/287 (25%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+ QR SLD+FRGL VA MI+V+ +G + ++SHA W G LAD V P FLF VG
Sbjct: 1 MAGQRFTSLDVFRGLTVAFMIVVNTSGPGAAPFAQLSHATWFGLTLADLVFPAFLFAVGN 60
Query: 87 AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVL 145
A++ + + KV+ R L G L+ F HA + V R+ GVL
Sbjct: 61 AMSFGDPKSGPTGRYLGKVVKRAAILFLLGYLMYWFPFVHATADGWALNPVEHTRIPGVL 120
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRIAL +L ++ + D + +G ++ L W LM + P
Sbjct: 121 QRIALCFLAAAIAVRWL----DVPKLIGLSAVLLLGYWGALM--------------VFGP 162
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
+ +L P N IDR V GINHMY + K
Sbjct: 163 PGE-----------------------QLTPLGNIGALIDRAVFGINHMYA-----KGKG- 193
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
++PEGL S++ +I++ + G G I
Sbjct: 194 ---------------------YDPEGLFSTLPAIVNVLAGYLAGRYI 219
>gi|440791267|gb|ELR12512.1| transmembrane protein [Acanthamoeba castellanii str. Neff]
Length = 825
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 83/368 (22%), Positives = 141/368 (38%), Gaps = 92/368 (25%)
Query: 18 EPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVM 77
+ D + + + + R+ SLD RGLA+A+MI V++ GG + +H+ WNG +AD V
Sbjct: 382 QADRQNGPKPAPRVSSRVNSLDAVRGLAIAIMIFVNYGGGGYWFFNHSAWNGITVADLVF 441
Query: 78 PFFLFIVGVAIALALKRIPDRADA-------------VKKVIFRTLKLLFWGILLQGGFS 124
P+F++I+G ++A++ + + + V+ R ++
Sbjct: 442 PWFIWIMGTSMAISFTSLEKKLLGLFQNNGYEWETWRIPGVLMRFAVAYLVVGVVVLFVP 501
Query: 125 HAPDELTYGVDVRMIR-------LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR--- 174
P LTY + R G I S F +D D R
Sbjct: 502 RWPWRLTYRIYRRFTSGGHHRHAADGAASPILERKRFASEAINFDEDTDKSDDGFSRNVF 561
Query: 175 -----------------FSIFR--LYCW-HWLMAACVLVVYLALLYGTYVPDWQFTIINK 214
+++F L W HWL+A +L VY + + VP
Sbjct: 562 AAEDDESCENMMKDKWVYTLFGDILPFWPHWLVAFSLLFVYFMITFFLDVPG-------- 613
Query: 215 DSADYGKVFNVTCGVRAKLNPPCN-----AVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
CG R L P + A GYID+K+ +H+Y+ P +
Sbjct: 614 ------------CG-RGYLGPDISTATGGAAGYIDKKIFTEDHIYNQPTCQ--------- 651
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW---- 325
P + ++PEG L +++SI +G+ G ++ K H R+ +W
Sbjct: 652 ----------PLYLTGSYDPEGTLGNLTSIFMVFLGLQSGRTLMAWKDHKHRVVRWYIWS 701
Query: 326 VTMGFALL 333
+ +GF L
Sbjct: 702 IVLGFIAL 709
>gi|409990365|ref|ZP_11273749.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
Paraca]
gi|291567406|dbj|BAI89678.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409938771|gb|EKN80051.1| hypothetical protein APPUASWS_05524 [Arthrospira platensis str.
Paraca]
Length = 378
Score = 83.2 bits (204), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/288 (28%), Positives = 119/288 (41%), Gaps = 85/288 (29%)
Query: 34 RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
RL SLD+FRG+A+A MILV++ G +P + HA W+GC D V P FL IVGVAIA
Sbjct: 9 RLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAQWHGCTPTDVVFPSFLLIVGVAIAF 68
Query: 91 ALKRIPDR----ADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
+L + D V ++ R LLF L+ GF + D+ IR+
Sbjct: 69 SLSKFSPEHRLGGDGVPPSVYSRIGRRCLLLFLLGLILNGFPN--------YDLANIRIM 120
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQRIA++Y L ++ + Q WL++ L+ Y +
Sbjct: 121 GVLQRIAIAYGLSAIAILNLSRRQ-----------------LWLISIFTLIGYWLAMTMI 163
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
VP + L+P N +ID+ +LG +H+
Sbjct: 164 PVPGYS---------------------PGNLSPEGNLGAFIDQTILGSHHL--------- 193
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
W P++PEGL S+ + ++ IIG G
Sbjct: 194 -------------------WRGGPYDPEGLFSTAPATVTVIIGYLTGE 222
>gi|163786877|ref|ZP_02181325.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
ALC-1]
gi|159878737|gb|EDP72793.1| hypothetical protein FBALC1_16867 [Flavobacteriales bacterium
ALC-1]
Length = 361
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 80/285 (28%), Positives = 118/285 (41%), Gaps = 76/285 (26%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+ R+ S+DI RGL + MILV+ G + + HA W+G D + PFFLFIVG++I
Sbjct: 2 SARIESVDILRGLTILAMILVNTPGTWGHVYTPLRHAEWHGLTPTDLIFPFFLFIVGISI 61
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
A K P+ KK+I R+LKL+ G+ L H P + D R+ GVLQRI
Sbjct: 62 YFAYKNKPNTKLTYKKIIIRSLKLIGLGLFLNLFLPHFP----FFNDFETHRIPGVLQRI 117
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC-WHWLMAACVLVVYLALLYGTYVPDW 207
L +L SI L C W L A + ++ L ++P
Sbjct: 118 GLVFLFS--------------------SILYLNCSWKSLTAIGITIILGYWLCLGFIPFQ 157
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
++ D A P N YID +LG HM W+
Sbjct: 158 DGSLPTFDRA------------------PNNWANYIDLNILG-EHM-----WKTD----- 188
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
++PEGL+S++ +I + I G+ G ++
Sbjct: 189 -------------------YDPEGLISTIPAIATCISGILIGKLL 214
>gi|325299496|ref|YP_004259413.1| hypothetical protein [Bacteroides salanitronis DSM 18170]
gi|324319049|gb|ADY36940.1| hypothetical protein Bacsa_2393 [Bacteroides salanitronis DSM
18170]
Length = 356
Score = 82.8 bits (203), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 56/138 (40%), Positives = 84/138 (60%), Gaps = 13/138 (9%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
QRL SLD+ RG+ V MI+V++AGG++ + H+ WNG D V PFFLFI+G++ +
Sbjct: 3 QRLLSLDVLRGITVFGMIVVNNAGGEYSYDSLRHSVWNGLTPCDLVFPFFLFIMGISTYI 62
Query: 91 ALKRI---PDRADAVKKVIFRTLK--LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
AL++ P A ++K++ RTL L+ WGI F+ D L + IR+ GVL
Sbjct: 63 ALRKFQFQPSPA-VLRKIVRRTLLIFLIGWGI-YWFEFACEGDFLPFA----HIRILGVL 116
Query: 146 QRIALSYLLVSLVEIFTK 163
RIAL Y +VSL+ ++ +
Sbjct: 117 PRIALCYGIVSLLALYVR 134
>gi|160885454|ref|ZP_02066457.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
gi|423290374|ref|ZP_17269223.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
CL02T12C04]
gi|156109076|gb|EDO10821.1| hypothetical protein BACOVA_03454 [Bacteroides ovatus ATCC 8483]
gi|392665761|gb|EIY59284.1| hypothetical protein HMPREF1069_04266 [Bacteroides ovatus
CL02T12C04]
Length = 371
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 133/320 (41%), Gaps = 87/320 (27%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+RL +LD+ RG+ +A MILV++ G + + HA WNG D V PFF+FI+G++
Sbjct: 5 SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64
Query: 88 IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQG----GFSHAPDELTYGVDVRMIRL 141
++LK+ A K+I RT+ + GI L ++H P + IR+
Sbjct: 65 TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GV+QR+AL Y +L+ + K H + ++V LL G
Sbjct: 119 LGVMQRLALCYGASALIALLLK--------------------HKYIPYLIVV----LLVG 154
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
++ +I + Y + N + +DR +LG HMY
Sbjct: 155 YFI-----ILITGNGFAYNET---------------NILSIVDRSILGDAHMY------- 187
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
QD+ +PEGLLS++ SI +IG G +++ K +
Sbjct: 188 -----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEK 226
Query: 322 LKQWVTMGFALLIFGLTLHF 341
L++ +G L G +
Sbjct: 227 LERLFLIGTILTFAGFLFSY 246
>gi|395213375|ref|ZP_10400182.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
gi|394456744|gb|EJF11001.1| hypothetical protein O71_05742 [Pontibacter sp. BAB1700]
Length = 391
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 51/149 (34%), Positives = 80/149 (53%), Gaps = 9/149 (6%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNL 72
+ P ++D +R SLD+ RGL +ALM++V++ G W I HA W+G +
Sbjct: 6 TAPPLTDAGLLRPQTYERYLSLDVLRGLTIALMVVVNNPG-SWGSIYAPFKHAAWHGFTV 64
Query: 73 ADFVMPFFLFIVGVAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQ--GGFSHAPD 128
D V P FLF+VG A++ ++++ + D+V +KV+ RT + G+ L P+
Sbjct: 65 TDLVFPSFLFVVGNAMSFSMRKFETQPDSVFLRKVLKRTALIFLIGLFLNLFPFVMRNPE 124
Query: 129 ELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
D +R+ GVLQRIAL Y + SL
Sbjct: 125 GAIVMKDFTAVRIMGVLQRIALCYFIASL 153
>gi|374310943|ref|YP_005057373.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358752953|gb|AEU36343.1| protein of unknown function DUF1624 [Granulicella mallensis
MP5ACTX8]
Length = 385
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 52/143 (36%), Positives = 79/143 (55%), Gaps = 18/143 (12%)
Query: 29 HLKTQRLASLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIV 84
L ++R+ S+D+ RG +A MILV+ A G+WP + HA WNGC D V P FLF+
Sbjct: 12 ELTSKRIPSVDVLRGFTLAAMILVN-AAGEWPHAYWPLKHAQWNGCTPTDLVFPTFLFLT 70
Query: 85 GVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGVDVRMIR 140
G ++ + + R +++ TLK L F G+LL + L Y + +R
Sbjct: 71 GTSLVFSFRSRLARGVGKRELFLHTLKRSVILFFIGVLL--------NALPY-FHIGTLR 121
Query: 141 LCGVLQRIALSYLLVSLVEIFTK 163
+ GVLQRIAL YL VS++ ++ +
Sbjct: 122 IYGVLQRIALCYLCVSVLYLWNR 144
>gi|383110831|ref|ZP_09931649.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
gi|313694406|gb|EFS31241.1| hypothetical protein BSGG_1941 [Bacteroides sp. D2]
Length = 371
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 133/320 (41%), Gaps = 87/320 (27%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+RL +LD+ RG+ +A MILV++ G + + HA WNG D V PFF+FI+G++
Sbjct: 5 SNKRLLALDVMRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64
Query: 88 IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQG----GFSHAPDELTYGVDVRMIRL 141
++LK+ A K+I RT+ + GI L ++H P + IR+
Sbjct: 65 TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GV+QR+AL Y +L+ + K H + ++V LL G
Sbjct: 119 LGVMQRLALCYGASALIALLLK--------------------HKYIPYLIVV----LLVG 154
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
++ +I + Y + N + +DR +LG HMY
Sbjct: 155 YFI-----ILITGNGFAYNET---------------NILSIVDRSILGDAHMY------- 187
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
QD+ +PEGLLS++ SI +IG G +++ K +
Sbjct: 188 -----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEK 226
Query: 322 LKQWVTMGFALLIFGLTLHF 341
L++ +G L G +
Sbjct: 227 LERLFLIGTILTFAGFLFSY 246
>gi|336415339|ref|ZP_08595679.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
3_8_47FAA]
gi|335940935|gb|EGN02797.1| hypothetical protein HMPREF1017_02787 [Bacteroides ovatus
3_8_47FAA]
Length = 371
Score = 82.4 bits (202), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 133/320 (41%), Gaps = 87/320 (27%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+RL +LD+ RG+ +A MILV++ G + + HA WNG D V PFF+FI+G++
Sbjct: 5 SNKRLLALDVIRGITIAGMILVNNPGSWGHAYAPLKHAQWNGLTPTDLVFPFFMFIMGIS 64
Query: 88 IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQG----GFSHAPDELTYGVDVRMIRL 141
++LK+ A K+I RT+ + GI L ++H P + IR+
Sbjct: 65 TYISLKKYNFTFSTPAALKIIKRTIVIFLIGIALNWFALLCYTHNP------LPFEQIRI 118
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GV+QR+AL Y +L+ + K H + ++V LL G
Sbjct: 119 LGVMQRLALCYGASALIALLLK--------------------HKYIPYLIVV----LLVG 154
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
++ +I + Y + N + +DR +LG HMY
Sbjct: 155 YFI-----ILITGNGFAYNET---------------NILSIVDRSILGDAHMY------- 187
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
QD+ +PEGLLS++ SI +IG G +++ K +
Sbjct: 188 -----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIHEK 226
Query: 322 LKQWVTMGFALLIFGLTLHF 341
L++ +G L G +
Sbjct: 227 LERLFLIGTILTFAGFLFSY 246
>gi|294627662|ref|ZP_06706244.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
gi|292598014|gb|EFF42169.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 11122]
Length = 388
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 126/312 (40%), Gaps = 73/312 (23%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R +L G+L+ F PD V +RL GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSKRAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRLTGVL 137
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI L YL +L+ + + +L+ Y ALLY P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
A+L+ NA +D + G +H+Y
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
RKD F+PEGLL ++S+ ++ + G G + +A +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRQGKTVASTRSL 249
Query: 326 VTMGFALLIFGL 337
+ G L++ L
Sbjct: 250 LLAGAGLVVLAL 261
>gi|404485250|ref|ZP_11020448.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
YIT 11860]
gi|404338685|gb|EJZ65130.1| hypothetical protein HMPREF9448_00860 [Barnesiella intestinihominis
YIT 11860]
Length = 390
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 137/329 (41%), Gaps = 89/329 (27%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL SLDI RG+ +A MI+V++ G + + HA W G D V PFF+FI+G++
Sbjct: 6 NKRLLSLDILRGITIAGMIMVNNPGSWGYIYAPLGHAEWIGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKRIPDR-ADAVK-KVIFRT-------LKLLFWGILLQGGFSHAPDELTYGV----- 134
++L++ R + AV K+I RT L + ++G+ ++ + L++
Sbjct: 66 YMSLRKFDFRLSGAVAWKIIRRTIVIFAIGLAIAWFGLTMRTYHQLGEESLSFFERLGRS 125
Query: 135 --DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
+ IR+ GV+ R+A+ Y + + + + K Y H + + L
Sbjct: 126 MWNFDHIRILGVMPRLAICYGVAAFIALIVK---------------HKYIPH--IVSVTL 168
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
+ Y +L GK F + N + +DR +LG NH
Sbjct: 169 IAYFVILIT------------------GKGFEFS---------EDNIISVVDRAILGSNH 201
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
MYH +PEGLLS++ SI ++G+ G +I
Sbjct: 202 MYHDNG--------------------------LALDPEGLLSTIPSICHVLVGIFCGGLI 235
Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+ TK + R++ G L GL L +
Sbjct: 236 MRTKDNAVRMQNLFIAGTILTFAGLLLEY 264
>gi|294895713|ref|XP_002775269.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
gi|239881343|gb|EER07085.1| conserved hypothetical protein [Perkinsus marinus ATCC 50983]
Length = 323
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/299 (26%), Positives = 109/299 (36%), Gaps = 97/299 (32%)
Query: 41 FRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRAD 100
RG+ +++M++VD G P I HAPWNG +LAD VMP F+FI
Sbjct: 1 MRGVVMSIMLIVDVCGKAVPSIGHAPWNGLHLADIVMPGFIFI----------------- 43
Query: 101 AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
D LT G+D+ R G+LQRIA+ Y L+
Sbjct: 44 ---------------------------DTLTVGLDLYTFRAPGILQRIAVCYAAAVLLAK 76
Query: 161 FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYG 220
D+ D G + VLVV L + V +W ++
Sbjct: 77 LVSDLSPNDTVKGALK----------NNSRVLVVGLLCI----VINWAIMLLGPQPKGCP 122
Query: 221 KVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAP 280
R L P CN IDR V G HMY +P W
Sbjct: 123 ---------RGSLTPQCNVASNIDRMVFGPEHMY-NPLW--------------------- 151
Query: 281 SWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
+PEGLLS++ S+ + +G+ G I H L + V G L + G+ L
Sbjct: 152 -------DPEGLLSTLPSLATVALGLACGKFIQSRPSH-TELLRLVGCGLLLDLCGMGL 202
>gi|319952891|ref|YP_004164158.1| heparan-alpha-glucosaminide n-acetyltransferase [Cellulophaga
algicola DSM 14237]
gi|319421551|gb|ADV48660.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cellulophaga
algicola DSM 14237]
Length = 363
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 79/288 (27%), Positives = 127/288 (44%), Gaps = 77/288 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
+R+ ++DIFRG+ + LMILV++ G W + HA W+G D V PFFLFIVG +I
Sbjct: 3 ERIVAVDIFRGMTIVLMILVNNPG-TWAAVYAPFLHADWHGYTPTDLVFPFFLFIVGTSI 61
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
A AD KK++ R+LKL+ G+LL P + D IR GVLQRI
Sbjct: 62 VFAYSTKKPTADTYKKIVSRSLKLIGLGLLLGAFTLVFP----FVKDFSEIRFPGVLQRI 117
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
+ + + S+ +F + W L+ V ++ L ++P
Sbjct: 118 GVVFFITSI-------------------LFLNFNWKQLIGVTVFILIGYWLAMGFIP--- 155
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
+N ++ + + P N Y+D +LG +HM+
Sbjct: 156 ---VNGIASTFDR-------------APNNLANYVDLNILG-SHMW-------------- 184
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
KD ++PEGLLS++ +I S ++GV G +++ +
Sbjct: 185 --------KD-------DYDPEGLLSTIPAIASCLLGVFTGKILLSKQ 217
>gi|418746616|ref|ZP_13302939.1| PF07786 family protein [Leptospira santarosai str. CBC379]
gi|410792596|gb|EKR90528.1| PF07786 family protein [Leptospira santarosai str. CBC379]
Length = 375
Score = 82.0 bits (201), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 82/298 (27%), Positives = 128/298 (42%), Gaps = 87/298 (29%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
+++S R+ SLD+FRG+ V MILV++ G W I HA WNGC D V PF
Sbjct: 1 MEKQSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59
Query: 80 FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
FLF VG +I ++L K +R+D + R+ L+ G+ L G +S A
Sbjct: 60 FLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-------- 111
Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
+R+ GVLQRI Y +V SL +F + + FS+ +L++
Sbjct: 112 ---LRIPGVLQRIGFVYWVVASLCLVF------PGKKILVFSV------------AILLI 150
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
+ +L +P + + + D G +IDR + G H+
Sbjct: 151 HTWILTQIALPG-ESVVSLEQGKDIG--------------------AWIDRTIFGEKHL- 188
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
WR SK ++PEG LS V+S+++T+ GV G ++
Sbjct: 189 ----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCGFIL 220
>gi|90022681|ref|YP_528508.1| hypothetical protein Sde_3039 [Saccharophagus degradans 2-40]
gi|89952281|gb|ABD82296.1| conserved hypothetical protein [Saccharophagus degradans 2-40]
Length = 363
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 55/136 (40%), Positives = 76/136 (55%), Gaps = 17/136 (12%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVG 85
+ TQR +LD+ RG +A+MILV+ GDW + HA W+G + DFV PFFLFI+G
Sbjct: 1 MATQRYLALDVMRGATLAMMILVN-TPGDWGFVYAPLLHADWHGVTITDFVFPFFLFIIG 59
Query: 86 VAIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLC 142
A+ + A A+ KK+I RT L G+LL F+ A EL R+
Sbjct: 60 SALFFTSRSSGQLAPAIKAKKIIKRTALLFTIGLLLHAFPFTTALSEL---------RIL 110
Query: 143 GVLQRIALSYLLVSLV 158
GVLQRIAL+Y + + +
Sbjct: 111 GVLQRIALAYGIAAFI 126
>gi|372223192|ref|ZP_09501613.1| hypothetical protein MzeaS_12804 [Mesoflavibacter
zeaxanthinifaciens S86]
Length = 364
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 49/129 (37%), Positives = 70/129 (54%), Gaps = 9/129 (6%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVA 87
+R+ ++DIFRG+ ++LM+LV+ G W + HA W+G D V PFFLFIVG +
Sbjct: 4 NKRIVAVDIFRGMTISLMVLVNTPG-TWSSVYSPFLHAQWHGYTPTDLVFPFFLFIVGTS 62
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
I A K KK+ R LKL+ G+ L G F+ + + D IR GVLQR
Sbjct: 63 IVFAYKNKKPSLKTYKKIGVRALKLIILGLFL-GAFTLS---FPFFKDFENIRFPGVLQR 118
Query: 148 IALSYLLVS 156
I + + + S
Sbjct: 119 IGVVFFITS 127
>gi|255038072|ref|YP_003088693.1| hypothetical protein Dfer_4326 [Dyadobacter fermentans DSM 18053]
gi|254950828|gb|ACT95528.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 368
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 82/319 (25%), Positives = 133/319 (41%), Gaps = 84/319 (26%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFL 81
E + + RL SLD RG +A MI+V+ G + +P + H+ WNG D + P FL
Sbjct: 1 MENPSVPSSRLLSLDAMRGFTIAAMIMVNFPGHEDYVFPTLRHSKWNGLTFTDLIAPTFL 60
Query: 82 FIVGVAIALAL--KRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
FIVGV+I LA KR+ + ++ +K++ R+LK+ G+ L + PD +
Sbjct: 61 FIVGVSITLAYSKKRLSNAPKSGLYRKIVIRSLKIFAVGMFL----NMLPD-----FNFS 111
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
+R G L RIA+ +L+ +++ + T Q +VG +LV+Y
Sbjct: 112 DLRYTGTLHRIAIVFLVCAILFLNTSWKQQLGIAVG-----------------ILVLYWL 154
Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
L G P GKV L P N ++D++ L
Sbjct: 155 ALTGIPTP------------GIGKVM---------LEPGVNLAAWVDQQYL--------- 184
Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
P + +W +PEG+LS+ +I +TI G+ G +++
Sbjct: 185 ----------------PGKMWQGNW-----DPEGILSTFPAIATTITGILAGRLMLLPFS 223
Query: 318 HLARLKQWVTMGFALLIFG 336
+ +T GFA G
Sbjct: 224 PNEKSNFLLTAGFATAALG 242
>gi|319641808|ref|ZP_07996487.1| transmembrane protein [Bacteroides sp. 3_1_40A]
gi|345518546|ref|ZP_08797993.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
gi|254835931|gb|EET16240.1| hypothetical protein BSFG_02387 [Bacteroides sp. 4_3_47FAA]
gi|317386564|gb|EFV67464.1| transmembrane protein [Bacteroides sp. 3_1_40A]
Length = 363
Score = 81.6 bits (200), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 135/317 (42%), Gaps = 84/317 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+RL +LDI RG+ +A MILV++ G + + HA +NG D V PFF+FI+G++
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGHVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 90 LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L++ ++K++ RT+ + G+ L A T+ ++ +R GV+QR
Sbjct: 67 ISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGVMQR 123
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYV 204
+A+ Y + SLV I K H A +LV VY LL
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAVYFLLL----- 158
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
G FN++ N V D LG +HMYH
Sbjct: 159 -------------AMGDGFNLSA---------TNIVARFDVWALGTSHMYH--------- 187
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
+G + F+PEGLLS++ ++ ++G + G ++ K + ++++
Sbjct: 188 -------DGGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQR 230
Query: 325 WVTMGFALLIFGLTLHF 341
+G L G L +
Sbjct: 231 LFLVGTILTFAGFLLSY 247
>gi|150004749|ref|YP_001299493.1| transmembrane protein [Bacteroides vulgatus ATCC 8482]
gi|294775179|ref|ZP_06740705.1| putative membrane protein [Bacteroides vulgatus PC510]
gi|149933173|gb|ABR39871.1| putative transmembrane protein [Bacteroides vulgatus ATCC 8482]
gi|294450991|gb|EFG19465.1| putative membrane protein [Bacteroides vulgatus PC510]
Length = 363
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 135/317 (42%), Gaps = 84/317 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+RL +LDI RG+ +A MILV++ G + + HA +NG D V PFF+FI+G++
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 90 LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L++ ++K++ RT+ + G+ L A T+ ++ +R GV+QR
Sbjct: 67 ISLRKYNFTYSHATLRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGVMQR 123
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYV 204
+A+ Y + SLV I K H A +LV VY LL
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAVYFLLL----- 158
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
G FN++ N V D LG +HMYH
Sbjct: 159 -------------AMGDGFNLSA---------TNIVARFDVWALGTSHMYH--------- 187
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
+G + F+PEGLLS++ ++ ++G + G ++ K + ++++
Sbjct: 188 -------DGGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQR 230
Query: 325 WVTMGFALLIFGLTLHF 341
+G L G L +
Sbjct: 231 LFLVGTILTFAGFLLSY 247
>gi|294667090|ref|ZP_06732315.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
gi|292603100|gb|EFF46526.1| conserved hypothetical protein [Xanthomonas fuscans subsp.
aurantifolii str. ICPB 10535]
Length = 388
Score = 81.6 bits (200), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 85/312 (27%), Positives = 126/312 (40%), Gaps = 73/312 (23%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R +L G+L+ F PD V +RL GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSKRAALILLCGVLMYWFPFFHLQPDGGWSFTTVDQLRLTGVL 137
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI L YL +L+ + + +L+ Y ALLY P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
A+L+ NA +D + G +H+Y
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
RKD F+PEGLL ++S+ ++ + G G + +A +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRHGKTVASTRSL 249
Query: 326 VTMGFALLIFGL 337
+ G L++ L
Sbjct: 250 LLAGAGLVVLAL 261
>gi|119491291|ref|ZP_01623345.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
gi|119453455|gb|EAW34617.1| hypothetical protein L8106_21879 [Lyngbya sp. PCC 8106]
Length = 371
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 78/285 (27%), Positives = 124/285 (43%), Gaps = 79/285 (27%)
Query: 34 RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
RL SLD+FRG+A+A MILV++ G +P + HA W+G D V P FLFIVGVA+
Sbjct: 2 RLTSLDVFRGIAIASMILVNNPGSWNHVYPLLKHAEWHGYTPTDLVFPSFLFIVGVAMTF 61
Query: 91 AL-KRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
++ K +P+ + + K+ R L+ LL + P+ D+ IR+ GVL
Sbjct: 62 SMSKYLPENRNLEENISPKIYLRILRRCLILFLLGLLLNGYPNY-----DLANIRIMGVL 116
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI+L+Y L ++ + Q S+G +L+ Y ++ VP
Sbjct: 117 QRISLAYGLSAITILHLSRKQIWGLSIG-----------------LLIGYAVVMQLIPVP 159
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
+ GV L P N Y+DR +LG +H+
Sbjct: 160 N--------------------SGV-VNLTPEGNFAAYLDRLILGEHHLL----------- 187
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++PEGLLS++ ++++ +IG G+
Sbjct: 188 -----------------GGGKYDPEGLLSTLPAVVTVLIGYLTGN 215
>gi|381169858|ref|ZP_09879020.1| conserved hypothetical protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
gi|380689628|emb|CCG35507.1| conserved hypothetical protein [Xanthomonas citri pv.
mangiferaeindicae LMG 941]
Length = 388
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 83/287 (28%), Positives = 118/287 (41%), Gaps = 73/287 (25%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R +L G+L+ F PD V +RL GVL
Sbjct: 78 MSFALATNAPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI L YL +L+ +L + V LALL G
Sbjct: 138 QRIGLCYLAAALLV------------------------RYLPPRGIAPVCLALLLGY--- 170
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
W F + A+L+ NA +D + G +H+Y
Sbjct: 171 -WAFLYVFGQPG-------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
RKD F+PEGLL ++S+ ++ + G G +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|384362003|ref|YP_006199855.1| hypothetical protein CDBI1_13575 [Clostridium difficile BI1]
Length = 485
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 113 SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 172
Query: 85 GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I ++ LK + + R++ L+ +G L + P D+ +R
Sbjct: 173 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 223
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ GVLQR+ L Y + SLV + K +
Sbjct: 224 ILGVLQRMGLVYFVTSLVYLLLKKLN 249
>gi|255651295|ref|ZP_05398197.1| hypothetical protein CdifQCD_14003 [Clostridium difficile
QCD-37x79]
Length = 461
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 89 SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 148
Query: 85 GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I ++ LK + + R++ L+ +G L + P D+ +R
Sbjct: 149 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 199
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ GVLQR+ L Y + SLV + K +
Sbjct: 200 ILGVLQRMGLVYFVTSLVYLLLKKLN 225
>gi|255518179|ref|ZP_05385855.1| hypothetical protein CdifQCD-_13768 [Clostridium difficile
QCD-97b34]
Length = 469
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 97 SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 156
Query: 85 GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I ++ LK + + R++ L+ +G L + P D+ +R
Sbjct: 157 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 207
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ GVLQR+ L Y + SLV + K +
Sbjct: 208 ILGVLQRMGLVYFVTSLVYLLLKKLN 233
>gi|254976379|ref|ZP_05272851.1| hypothetical protein CdifQC_13741 [Clostridium difficile QCD-66c26]
Length = 459
Score = 81.3 bits (199), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 87 SKLTNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 146
Query: 85 GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I ++ LK + + R++ L+ +G L + P D+ +R
Sbjct: 147 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNSVR 197
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ GVLQR+ L Y + SLV + K +
Sbjct: 198 ILGVLQRMGLVYFVTSLVYLLLKKLN 223
>gi|255656770|ref|ZP_05402179.1| hypothetical protein CdifQCD-2_14006 [Clostridium difficile
QCD-23m63]
Length = 481
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 78/146 (53%), Gaps = 16/146 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 109 SKLMNSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 168
Query: 85 GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I ++ LK + + R++ L+ +G L + P D+ +R
Sbjct: 169 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVR 219
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ GVLQR+ L Y + SLV + K +
Sbjct: 220 ILGVLQRMGLVYFVTSLVYLLLKKLN 245
>gi|374374997|ref|ZP_09632655.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
gi|373231837|gb|EHP51632.1| hypothetical protein NiasoDRAFT_0408 [Niabella soli DSM 19437]
Length = 395
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 87/306 (28%), Positives = 120/306 (39%), Gaps = 91/306 (29%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
R SLD+FRG V LMILV++ G W I HAPW+G D V PFFLF VG A++
Sbjct: 4 RYRSLDVFRGATVCLMILVNNPG-SWAHIYAPLDHAPWHGLTPTDLVFPFFLFAVGNAMS 62
Query: 90 LALKRIPDR--ADAVKKVIFRTLKLLFWGILL--------QGGFSHAPDELTYGVDVRMI 139
+ R+ + A+ KK+ RTL + GI L G A +T I
Sbjct: 63 FVIPRLQEAGPAEFWKKITKRTLIIFGIGIFLNWSPFVRWNGDTLQAVTWVTDPAKNIGI 122
Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
R+ GVLQRIA Y S++ + K A L + L L
Sbjct: 123 RIFGVLQRIAFCYFFASIIVYYLKP----------------------KTAYFLSLVLLLA 160
Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----IDRKVLGINHMYH 255
Y W I+ + AD P + G+ ID+ +L I HMY
Sbjct: 161 Y------WGLCILG-NPAD-----------------PYSLKGWFGTNIDKAILHIPHMYK 196
Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
EG PF+PEG SS+ +I+ + G G I ++
Sbjct: 197 G---------------EG-----------VPFDPEGFASSLGAIVQIVFGYFVGMYIKNS 230
Query: 316 KGHLAR 321
+ +
Sbjct: 231 SAQIPK 236
>gi|374385780|ref|ZP_09643283.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
12061]
gi|373225482|gb|EHP47816.1| hypothetical protein HMPREF9449_01669 [Odoribacter laneus YIT
12061]
Length = 382
Score = 81.3 bits (199), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 95/335 (28%), Positives = 140/335 (41%), Gaps = 94/335 (28%)
Query: 30 LKTQ-RLASLDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVG 85
+KT+ RL +LD+FRG+ +A MILV+ G + + HA WNG D V PFF+FI+G
Sbjct: 1 MKTENRLLALDVFRGITIAGMILVNDPGSWSAVYAPLCHASWNGLTPTDLVFPFFMFIMG 60
Query: 86 VAIALALKRIPD--RADAVKKVIFRTLKLLF-------WGILLQGGF-SHAPDELTYG-- 133
+++ +L+R AV K IFR L+F W L G F S E T+
Sbjct: 61 ISMYFSLRRYNSLFSRGAVAK-IFRRAVLIFLIGLGINWFALWFGTFMSMGNGEFTFWER 119
Query: 134 -----VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
V IR+ GVLQR+AL+YL +++ + + R+ +F A
Sbjct: 120 FTQNIFPVADIRILGVLQRLALAYLGGAILCLGIRP---------RYQLFT--------A 162
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
+LV Y +L G+ F + N + +DR VL
Sbjct: 163 VMILVGYFVIL------------------VVGEGF---------IRSEHNILSVVDRAVL 195
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G+ H+Y A S F+PEGLLS++ + GV
Sbjct: 196 GVRHLYG---------------------GGASSGAGMAFDPEGLLSTLPCFAHVLFGVCM 234
Query: 309 GHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
G ++ K + R++Q L IFG L F
Sbjct: 235 GRMLGEVKENEIRIRQ-------LFIFGTILLFAG 262
>gi|332983392|ref|YP_004464833.1| heparan-alpha-glucosaminide N-acetyltransferase [Mahella
australiensis 50-1 BON]
gi|332701070|gb|AEE98011.1| Heparan-alpha-glucosaminide N-acetyltransferase [Mahella
australiensis 50-1 BON]
Length = 368
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 74/296 (25%), Positives = 128/296 (43%), Gaps = 82/296 (27%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+R+ S+D RG+ + MI +++ G P + HAPWNG LAD P F+F++G+ I
Sbjct: 4 KRIQSIDALRGICITAMIFMNNPGNSKYTSPLLLHAPWNGITLADLFFPCFIFVMGMVIP 63
Query: 90 LALKRIPDRADAVKKVIFRTLK---LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
++ + + ++I LK +LF L G F +A D++ +R+ GVLQ
Sbjct: 64 VSFGKRMAKGQTKGQLIAHLLKRSAMLF----LIGLFLNAFPCF----DMQHVRILGVLQ 115
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
RIAL Y L+ +F+ + ++++A +L+ Y LL VP
Sbjct: 116 RIALVYFFSGLIFLFSSTMS-----------------MFIISAAILIGYYLLLRFVPVPG 158
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ + + N + YID K+L H+Y
Sbjct: 159 YGAGVFERTG---------------------NLIQYIDLKLLK-GHLY------------ 184
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
P W +PEGLLS++ +I S+++G+ G +++ K + +L
Sbjct: 185 ------------TPDW-----DPEGLLSTLPAIASSLLGILTGCLLVSDKKNTNKL 223
>gi|390989491|ref|ZP_10259788.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
str. LMG 859]
gi|372555760|emb|CCF66763.1| conserved hypothetical protein [Xanthomonas axonopodis pv. punicae
str. LMG 859]
Length = 388
Score = 80.9 bits (198), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 117/287 (40%), Gaps = 73/287 (25%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R +L G+L+ F PD V +RL GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI L YL +L+ + + +L+ Y ALLY P
Sbjct: 138 QRIGLCYLAAALLVRYLPQRGIAPVCL-----------------ALLLGYWALLYAFGQP 180
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
A+L+ NA +D + G +H+Y
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
RKD F+PEGLL ++S+ ++ + G G +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|299140911|ref|ZP_07034049.1| membrane protein [Prevotella oris C735]
gi|298577877|gb|EFI49745.1| membrane protein [Prevotella oris C735]
Length = 370
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/282 (29%), Positives = 126/282 (44%), Gaps = 75/282 (26%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
+ QRL SLD+ RGLA+A MILV++ G W I H+ WNG D V PFF+F +G
Sbjct: 1 MTQQRLISLDMLRGLAMAGMILVNNPG-SWSHIYVPLEHSVWNGLTPTDLVFPFFVFAMG 59
Query: 86 VAIALALKRIPD-RADAVKKVIFRTLKLLFWG-ILLQGGFSHAPDELTYGVDVRMIRLCG 143
+A+ + K + RA ++KV+ R++ L G +L G EL + +R+ G
Sbjct: 60 MAMGFSTKNLTALRASYLRKVMKRSVLLFVIGLLLTLLGRWLNTGELCFS----QLRVMG 115
Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
VLQR++LSYL+V+L+ K V F + L +W++ LL G
Sbjct: 116 VLQRLSLSYLVVALIVRRVKGVPTMT-----FVVVALLSGYWVL----------LLLG-- 158
Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
+G F+ N V +DR +LG +H+Y
Sbjct: 159 ---------------HGFDFSAN-----------NIVAVVDRWLLGESHLY--------- 183
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+ P P+ F+PEGLLS++ + ++G
Sbjct: 184 --IERLPDGTPI----------AFDPEGLLSTIPCVAQVLLG 213
>gi|421110364|ref|ZP_15570862.1| PF07786 family protein [Leptospira santarosai str. JET]
gi|410804289|gb|EKS10409.1| PF07786 family protein [Leptospira santarosai str. JET]
Length = 375
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 83/298 (27%), Positives = 125/298 (41%), Gaps = 87/298 (29%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
+++S R+ SLD+FRG+ V MILV++ G W I HA WNGC D V PF
Sbjct: 1 MEKQSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59
Query: 80 FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
FLF VG +I ++L K +R+D + R+ L+ G+ L G +S A
Sbjct: 60 FLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-------- 111
Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
+R+ GVLQRI Y +V SL +F + + FS+ L W++ L
Sbjct: 112 ---LRIPGVLQRIGFVYWVVASLCLVF------PGKKILVFSVPILLIHTWILTQIAL-- 160
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
P + + + D G +IDR + G H+
Sbjct: 161 ----------PG-ESVVSLEQGKDIG--------------------AWIDRTIFGEKHL- 188
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
WR SK ++PEG LS V+S+++T+ GV G ++
Sbjct: 189 ----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCGFIL 220
>gi|404450663|ref|ZP_11015643.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
gi|403763718|gb|EJZ24662.1| hypothetical protein A33Q_15100 [Indibacter alkaliphilus LW1]
Length = 381
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 50/134 (37%), Positives = 77/134 (57%), Gaps = 10/134 (7%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+R +LD+ RGL +ALMI+V+ G W + HAPW+G + D V P FLF+VG A+
Sbjct: 13 ERYLALDVLRGLTIALMIVVNTPG-SWSHMYGPFMHAPWHGFTITDLVFPTFLFVVGNAM 71
Query: 89 ALALKRIPDRADA--VKKVIFRTLKLLF--WGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
+ ++K++ ++KV+ R+ + WG+ F + L ++ +RL GV
Sbjct: 72 SFSMKKLEKMGQGLFLRKVLKRSFLIFIIGWGLNAFPFFDQTENGLAM-INWGEVRLLGV 130
Query: 145 LQRIALSYLLVSLV 158
LQRIAL YL+ SLV
Sbjct: 131 LQRIALCYLIASLV 144
>gi|410028220|ref|ZP_11278056.1| hypothetical protein MaAK2_03415 [Marinilabilia sp. AK2]
Length = 382
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/134 (38%), Positives = 78/134 (58%), Gaps = 9/134 (6%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+R +LD+ RGL +ALM++V+ G W + HA W+G + D + P FLF+VG A+
Sbjct: 13 ERYLALDVLRGLTIALMVVVNTPG-SWSHMYAPFMHADWHGFTITDLIFPTFLFVVGNAM 71
Query: 89 ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRLCGV 144
+ ++KR+ + +KKV RTL + G LL F + E Y ++ +RL GV
Sbjct: 72 SFSMKRMESMGQSLFLKKVFKRTLLIFLIGWLLNAFPFFNYNAETGYSMINWSEVRLLGV 131
Query: 145 LQRIALSYLLVSLV 158
LQRIAL Y+L +L+
Sbjct: 132 LQRIALCYMLAALI 145
>gi|325954677|ref|YP_004238337.1| hypothetical protein [Weeksella virosa DSM 16922]
gi|323437295|gb|ADX67759.1| hypothetical protein Weevi_1050 [Weeksella virosa DSM 16922]
Length = 402
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 51/146 (34%), Positives = 79/146 (54%), Gaps = 11/146 (7%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+KT R SLD+FRG +ALMILV++ G + + HA W+GC D V PFFLF VG
Sbjct: 1 MKTTRYYSLDVFRGATIALMILVNNPGSWSYMFSPLQHASWHGCTPTDLVFPFFLFAVGN 60
Query: 87 AIALALKRIPDRADAV--KKVIFRTLKLLFWGILLQGG--FSHAPDELTYGV----DVRM 138
A++ + + +A V KK+I RT+ + G+ + +EL + +
Sbjct: 61 AMSFGMSHLKLQASNVFWKKIIKRTILIFAIGLFINWWPFLKWENNELVFRAWRESEENG 120
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKD 164
+R+ GVLQRIA++ S + + +D
Sbjct: 121 VRIMGVLQRIAIANFFASTLAYYYRD 146
>gi|410941669|ref|ZP_11373463.1| putative membrane protein [Leptospira noguchii str. 2006001870]
gi|410783218|gb|EKR72215.1| putative membrane protein [Leptospira noguchii str. 2006001870]
Length = 381
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 80/309 (25%), Positives = 127/309 (41%), Gaps = 111/309 (35%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
++KS+ R+ SLD+FRG+ VA MILV++ G W I HA WNGC D V PF
Sbjct: 1 MEKKSN--QNRILSLDLFRGMTVAGMILVNNPG-SWSFIYTPLKHAKWNGCTPTDLVFPF 57
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ--GGFSHAPDELTYG 133
FLF+VG +I +L + K+ F R++ L+ G+ L G +S +
Sbjct: 58 FLFVVGTSIPFSLYS--KNKIYISKIWFGICIRSITLILIGLFLNFFGEWSFSK------ 109
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
+R+ G+LQRI Y W++A+ L+
Sbjct: 110 -----LRIPGILQRIGFVY--------------------------------WVVASLYLM 132
Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------I 243
+ ++ +++P + V V ++ PP ++ Y I
Sbjct: 133 LPKRIILISWIP----------------ILIVHTWVLIQIPPPGESIVYLEPGKDIGAWI 176
Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTI 303
DR V G NH+ W+ SK ++PEG S +SSI +T+
Sbjct: 177 DRNVFGENHL-----WKFSKT----------------------WDPEGFFSGISSIATTL 209
Query: 304 IGVHFGHVI 312
+GV G ++
Sbjct: 210 LGVFCGSIL 218
>gi|315500593|ref|YP_004089395.1| hypothetical protein Astex_3616 [Asticcacaulis excentricus CB 48]
gi|315418605|gb|ADU15244.1| protein of unknown function DUF1624 [Asticcacaulis excentricus CB
48]
Length = 372
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 82/292 (28%), Positives = 129/292 (44%), Gaps = 72/292 (24%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDH--AGGD-WPEISHAPWNGCNLADFVMPFFLFIVGV 86
+ R +LD+FRGL V +MI+V+ AG + + ++ HA W G L D V P FLF +G
Sbjct: 1 MSAARYTALDVFRGLTVCVMIVVNTSPAGAEPFAQLQHAQWFGFTLTDLVFPSFLFAIGN 60
Query: 87 AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGV-DVRMIRLCGV 144
++ A ++ + + KV+ R+ + G L+ F H + + D+ R+ GV
Sbjct: 61 SMVFAFRKPLPHKEFLLKVLRRSALIFLLGYLMYWFPFVHQTTDGAWAFNDIGHTRIMGV 120
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIAL YL SL + SV I ++A +L Y LLY +
Sbjct: 121 LQRIALCYLFASLAA--------RYLSVRGLVI---------LSALLLFGYWGLLY-AFT 162
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
P +AD +T + AK ID+ VLG++HMY A
Sbjct: 163 P----------AAD---ALTMTGNLGAK----------IDQFVLGLDHMYRGGA------ 193
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
+EPEGLLS++ +I++ + G G +I+ ++
Sbjct: 194 --------------------KGYEPEGLLSTLPAIVNVLAGYLCGRLILDSE 225
>gi|209523049|ref|ZP_03271606.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|376001698|ref|ZP_09779557.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
gi|423062475|ref|ZP_17051265.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
gi|209496636|gb|EDZ96934.1| conserved hypothetical protein [Arthrospira maxima CS-328]
gi|375329927|emb|CCE15310.1| conserved hypothetical protein (membrane) [Arthrospira sp. PCC
8005]
gi|406716383|gb|EKD11534.1| hypothetical protein SPLC1_S033650 [Arthrospira platensis C1]
Length = 378
Score = 80.9 bits (198), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 84/289 (29%), Positives = 120/289 (41%), Gaps = 85/289 (29%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG+A+A MILV++ G +P + HA W+GC D V P FL I+GVAIA
Sbjct: 8 MRLISLDVFRGIAIAAMILVNNPGSWGYMYPVLQHAEWDGCTPTDVVFPSFLLIMGVAIA 67
Query: 90 LALK------RIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
+L R+P +V I R LLF L GF H D+ IR+
Sbjct: 68 FSLSKFAREHRLPGEKVPPSVYSRIGRRCLLLFLLGLFLNGFPH--------YDLANIRI 119
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQRIA++Y L ++ + Q WL++ L+ Y +
Sbjct: 120 MGVLQRIAIAYGLTAIAILNLSRRQ-----------------LWLISILTLIGYWVAMTI 162
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
VP YG L+P N +ID+ +LG +H+
Sbjct: 163 IPVPS------------YGP---------GNLSPEGNLGAFIDQTILGSHHL-------- 193
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
W P++PEGL S+ + ++ I+G G
Sbjct: 194 --------------------WRGGPYDPEGLFSTAPATVTVILGYLTGE 222
>gi|423312333|ref|ZP_17290270.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
CL09T03C04]
gi|392688817|gb|EIY82101.1| hypothetical protein HMPREF1058_00882 [Bacteroides vulgatus
CL09T03C04]
Length = 363
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 80/317 (25%), Positives = 135/317 (42%), Gaps = 84/317 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+RL +LDI RG+ +A MILV++ G + + HA +NG D V PFF+FI+G++
Sbjct: 7 KRLLALDILRGITIAGMILVNNPGSWGYVYTPLEHAAFNGLTPTDLVFPFFMFIMGISTY 66
Query: 90 LALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
++L++ ++K++ RT+ + G+ L A T+ ++ +R GV+QR
Sbjct: 67 ISLRKYNFTYSHAILRKIVKRTVVIFCIGLFLN---LLAKSVFTHHLNFEELRYLGVMQR 123
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV---VYLALLYGTYV 204
+A+ Y + SLV I K H A +LV VY LL
Sbjct: 124 LAIGYGVTSLVAITVK--------------------HKYFPAIILVTLAVYFLLL----- 158
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
G FN++ N V D LG +HMYH
Sbjct: 159 -------------AMGDGFNLSV---------TNIVARFDVWALGTSHMYH--------- 187
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
+G + F+PEGLLS++ ++ ++G + G ++ K + ++++
Sbjct: 188 -------DGGM----------AFDPEGLLSTLPAVCHVMVGFYCGKLLFSAKDNDEKIQR 230
Query: 325 WVTMGFALLIFGLTLHF 341
+G L G L +
Sbjct: 231 LFLVGTILTFAGFLLSY 247
>gi|406662851|ref|ZP_11070935.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
gi|405553158|gb|EKB48438.1| hypothetical protein B879_02963 [Cecembia lonarensis LW9]
Length = 382
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 49/134 (36%), Positives = 77/134 (57%), Gaps = 9/134 (6%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+R +LD+ RGL +ALM++V+ G W + HA W+G + D + P FLF+VG A+
Sbjct: 13 ERYLALDVLRGLTIALMVVVNTPG-SWSHMYAPFMHADWHGFTITDLIFPTFLFVVGNAM 71
Query: 89 ALALKRIPDRADAV--KKVIFRTLKLLFWGILLQG--GFSHAPDELTYGVDVRMIRLCGV 144
+ ++K++ V KKV RTL + G LL ++ P+ ++ +RL GV
Sbjct: 72 SFSMKKLESMGQQVFLKKVFKRTLLIFLIGWLLNAFPFVNYNPESGYSMINWSEVRLLGV 131
Query: 145 LQRIALSYLLVSLV 158
LQRIAL Y+L +L+
Sbjct: 132 LQRIALCYMLAALI 145
>gi|409199197|ref|ZP_11227860.1| hypothetical protein MsalJ2_19286 [Marinilabilia salmonicolor JCM
21150]
Length = 369
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 79/294 (26%), Positives = 127/294 (43%), Gaps = 86/294 (29%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
++QR +LD+ RG+ +ALMI V+ G W I HA W+GC D V PFFLF+ GV
Sbjct: 3 QSQRYLALDVLRGMTIALMITVNTPG-SWQYIYAPLRHASWHGCTPTDLVFPFFLFVAGV 61
Query: 87 AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQG--GFSHAPDELTYGVDVRMIRLC 142
++ + + ++++K++ RTL + G+ L +SH D +R+
Sbjct: 62 SMFFSFGKYGGALNSESLKRLGRRTLLIFVIGLFLNSFPQWSH---------DFSTLRIM 112
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQRIAL+Y + SL+ + S R Y + +L++Y +L
Sbjct: 113 GVLQRIALAYGIGSLIVL---------------SAPRKYI--PFIGGGILLIYWGIL--- 152
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
W G + NAV D+ +LG H+Y
Sbjct: 153 ---AW-------------------FGGAEPYSLEGNAVIPFDKAILGEQHLY-------- 182
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
+ PF+PEGLLS+V +I++ ++G G +I +T+
Sbjct: 183 ------------------TGFGIPFDPEGLLSTVPAIVTVLLGYLTGVIIKNTE 218
>gi|418515336|ref|ZP_13081517.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
gi|418520970|ref|ZP_13087016.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410702946|gb|EKQ61443.1| hypothetical protein WS7_08123 [Xanthomonas axonopodis pv.
malvacearum str. GSPB2388]
gi|410708055|gb|EKQ66504.1| hypothetical protein MOU_00795 [Xanthomonas axonopodis pv.
malvacearum str. GSPB1386]
Length = 388
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 117/287 (40%), Gaps = 73/287 (25%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R +L G+L+ F PD V +RL GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI L YL +L+ + + +L+ Y ALLY P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
A+L+ NA +D + G +H+Y
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
RKD F+PEGLL ++S+ ++ + G G +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|402878146|ref|XP_003902762.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Papio
anubis]
Length = 708
Score = 80.5 bits (197), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 72/270 (26%), Positives = 123/270 (45%), Gaps = 46/270 (17%)
Query: 69 GCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFS 124
G +AD V P+F+FI+G +I L++ I R + + K+ +R+ L+ GI++
Sbjct: 347 GLTVADLVFPWFVFIMGSSIFLSMTSILQRGCSKFRLLGKIAWRSFLLICIGIIIVN--- 403
Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LY 181
P+ + +R+ GVLQR+ ++Y +V+++E+ F K V + S R
Sbjct: 404 --PNYCLGPLSWDKVRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITS 461
Query: 182 CW-HWLMAACVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNA 239
W WL+ + ++L L + VP + D+GK N T G A
Sbjct: 462 SWPQWLLILALEGLWLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------A 511
Query: 240 VGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
GYIDR +LG +H+Y HP+ ++PEG+L +++SI
Sbjct: 512 AGYIDRLLLGDDHLYQHPS------------------STVLYHTEVAYDPEGILGTINSI 553
Query: 300 LSTIIGVHFGHVIIH----TKGHLARLKQW 325
+ +GV G ++++ TK L R W
Sbjct: 554 VMAFLGVQAGKILLYYKAQTKDILIRFTAW 583
>gi|255307823|ref|ZP_05351994.1| hypothetical protein CdifA_14636 [Clostridium difficile ATCC 43255]
Length = 483
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 16/146 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 111 SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 170
Query: 85 GVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I +++ + +I R++ L+ +G L + P D+ +R
Sbjct: 171 GVTIPISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 221
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ GVLQR+ L Y + SLV + K +
Sbjct: 222 ILGVLQRMGLVYFVTSLVYLLLKKLN 247
>gi|255101955|ref|ZP_05330932.1| hypothetical protein CdifQCD-6_14161 [Clostridium difficile
QCD-63q42]
Length = 469
Score = 80.1 bits (196), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 16/146 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 97 SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 156
Query: 85 GVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I +++ + +I R++ L+ +G L + P D+ +R
Sbjct: 157 GVTIPISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 207
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ GVLQR+ L Y + SLV + K +
Sbjct: 208 ILGVLQRMGLVYFVTSLVYLLLKKLN 233
>gi|423081105|ref|ZP_17069717.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
002-P50-2011]
gi|423085023|ref|ZP_17073481.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
050-P50-2011]
gi|357550878|gb|EHJ32683.1| hypothetical protein HMPREF1123_00624 [Clostridium difficile
050-P50-2011]
gi|357551414|gb|EHJ33204.1| hypothetical protein HMPREF1122_00699 [Clostridium difficile
002-P50-2011]
Length = 427
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 78/146 (53%), Gaps = 16/146 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 55 SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 114
Query: 85 GVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I +++ + +I R++ L+ +G L + P D+ +R
Sbjct: 115 GVTIPISINSKIKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 165
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ GVLQR+ L Y + SLV + K +
Sbjct: 166 ILGVLQRMGLVYFVTSLVYLLLKKLN 191
>gi|398343267|ref|ZP_10527970.1| hypothetical protein LinasL1_09415 [Leptospira inadai serovar Lyme
str. 10]
Length = 399
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/308 (26%), Positives = 129/308 (41%), Gaps = 85/308 (27%)
Query: 21 VSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFV 76
V + ++RL S+D RG VA MILV++ G W I HA W GC D V
Sbjct: 21 VKQELLNDSFASKRLLSIDALRGFTVAGMILVNNPG-SWSAIYSPLRHAKWFGCTPTDLV 79
Query: 77 MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
PFFLF VGV+I + K++ R L+F G+ L + D L
Sbjct: 80 FPFFLFSVGVSIPFS---TIGNGGTFFKILKRASILIFIGLFLHWFGEWSMDRL------ 130
Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
R+ GVLQRI L Y + ++ R + FR +++ +
Sbjct: 131 ---RIPGVLQRIGLVYFISAIAY--------------RSASFR----------TRIMICI 163
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
++L+G + I+ + + G L+P + ++DR V G NH+
Sbjct: 164 SILFGYW-------ILLEFAPPPG-------AGSPSLSPGKDWGAWLDRIVFGENHL--- 206
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH-- 314
W+ SK ++PEGLL S+S++ +T +G+ FG V+
Sbjct: 207 --WKSSKT----------------------WDPEGLLGSLSAVATTFLGIFFGEVLKKDS 242
Query: 315 -TKGHLAR 321
TKG++ +
Sbjct: 243 DTKGNIQK 250
>gi|398348299|ref|ZP_10533002.1| hypothetical protein Lbro5_13954 [Leptospira broomii str. 5399]
Length = 399
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 80/284 (28%), Positives = 120/284 (42%), Gaps = 82/284 (28%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG----DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL S+D RG VA MILV++ G WP + HA W GC D V PFFLF VGV+
Sbjct: 33 KRLLSIDALRGFTVAGMILVNNPGSWSAIYWP-LKHAKWFGCTPTDLVFPFFLFSVGVS- 90
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
IP + F+ LK ++L G F H E + + +R+ GVLQRI
Sbjct: 91 ------IPFSSIGNGGTFFKILKRAS-ILILIGLFLHWFGEWS----IDQLRIPGVLQRI 139
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
L Y + ++ +R +H + C+ +++ + +VP
Sbjct: 140 GLVYFISAIA-------------------YRSSNFHARILICLSILFGYWILLEFVPP-- 178
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
DS L+P + ++DR V G NH+ W+ SK
Sbjct: 179 ---PGSDSVS--------------LSPGKDWGAWLDRIVFGENHL-----WKSSKT---- 212
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
++PEGLLSS+S++ +T +G FG V+
Sbjct: 213 ------------------WDPEGLLSSLSAVATTFLGFFFGEVL 238
>gi|372268395|ref|ZP_09504443.1| hypothetical protein AlS89_10850 [Alteromonas sp. S89]
Length = 395
Score = 80.1 bits (196), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 82/316 (25%), Positives = 138/316 (43%), Gaps = 67/316 (21%)
Query: 34 RLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
RL S+D+ RG+A+A M+LV++ G + ++HA W+G D + P FLF+VGV++ L
Sbjct: 16 RLMSVDLLRGIAIAAMVLVNNPGSWSFVYAPMAHAQWHGWTPTDVIFPLFLFVVGVSMVL 75
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM--IRLCGVLQRI 148
+ + D R LKL G+ L F + D ++ R+ IR GVLQRI
Sbjct: 76 STGKRGDFPPVGWAQWSRALKLFALGLFLAIFFYNFRDASYNWIEDRLEGIRWMGVLQRI 135
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
AL Y+L Y WL A +LV A + + VP W
Sbjct: 136 ALVYILCC------------------------YLVRWLPAKGLLV---AAILCSVVP-WT 167
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
++ + G+VF + +L + ++D+ +LG H+Y+ A
Sbjct: 168 LMLVVPYQSASGEVF------QGQLAFGNHFAAWLDQWLLGSAHVYYRDA---------- 211
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV----HFGHVIIHTKGHLARLKQ 324
PF F+PEG+L++ S+ + ++GV + + + L +
Sbjct: 212 QPFA--------------FDPEGVLTTFSAASTCLLGVLAALAWKSADSNGEAQLRLCRN 257
Query: 325 WVTMGFALLIFGLTLH 340
W+ G +++ G +H
Sbjct: 258 WLVAGTLMVLVGQLMH 273
>gi|192359631|ref|YP_001981658.1| hypothetical protein CJA_1162 [Cellvibrio japonicus Ueda107]
gi|190685796|gb|ACE83474.1| putative membrane protein [Cellvibrio japonicus Ueda107]
Length = 399
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 53/142 (37%), Positives = 78/142 (54%), Gaps = 15/142 (10%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLF 82
+ ++ QR +LD+ RGL +ALMILV+ G W + HA W+G DFV PFFLF
Sbjct: 31 EVYMVKQRFLALDVMRGLTLALMILVN-TPGSWSHVYGPLLHADWHGVTPTDFVFPFFLF 89
Query: 83 IVGVAIALALKRIP--DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
IVG A+ +++ + + ++KV R L L GILL + DV R
Sbjct: 90 IVGSAMYFSVRGLAQLSLSQQLRKVGRRVLLLFVMGILLAA--------YPFTADVHDWR 141
Query: 141 LCGVLQRIALSYLLVSLVEIFT 162
+ GVLQRIAL+Y + +L+ ++
Sbjct: 142 IMGVLQRIALAYGVAALIVLYA 163
>gi|295690502|ref|YP_003594195.1| hypothetical protein Cseg_3137 [Caulobacter segnis ATCC 21756]
gi|295432405|gb|ADG11577.1| Protein of unknown function DUF2261, transmembrane [Caulobacter
segnis ATCC 21756]
Length = 372
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 83/311 (26%), Positives = 123/311 (39%), Gaps = 72/311 (23%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
R SLD+FRGL V LMI+V+ AG + ++ HAPW G AD V P FLF VG ++
Sbjct: 5 AARFLSLDVFRGLTVCLMIVVNTAGPGAKAYTQLVHAPWFGFTAADAVFPSFLFAVGCSM 64
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVLQR 147
A A R + + KV+ R + G L+ F D + R+ GVLQR
Sbjct: 65 AFAFSRPIPTNEFLAKVLRRAALIFLLGFLMYWFPFVKKIDGHWALIPFADTRVMGVLQR 124
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
IAL Y+L + + WL ++ + LL G W
Sbjct: 125 IALCYMLAA------------------------FAVRWLSPRLIVALSAVLLLGY----W 156
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
+ D A A L+ NA ++D ++G NH+Y
Sbjct: 157 AILMTLGDPA-------------APLSKLGNAGTHLDLFLIGQNHLY------------- 190
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
RKD F+PEGLL ++ S ++ + G + G + + +
Sbjct: 191 --------RKD------GGFDPEGLLGTLPSTVNVLAGYLAARFLKENPGSQSAMARMAI 236
Query: 328 MGFALLIFGLT 338
G L++ GL
Sbjct: 237 AGVVLILAGLA 247
>gi|237720190|ref|ZP_04550671.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|293371122|ref|ZP_06617659.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|229450742|gb|EEO56533.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|292633780|gb|EFF52332.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
Length = 371
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 92/328 (28%), Positives = 132/328 (40%), Gaps = 94/328 (28%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
+K++RL SLDI RG+ + MILV++ G W I HA WNG D V PFF+FI+G
Sbjct: 1 MKSERLLSLDILRGITIVGMILVNNPG-TWESIYAPLRHAEWNGLTPTDLVFPFFMFIMG 59
Query: 86 VAIALALKRIPDRADA--VKKVIFRTLKLLF------WGILLQGG----FSHAPDELTYG 133
V+++ AL R + K++ RTL L W L+ G FSH
Sbjct: 60 VSMSFALSRFDHHFSRGFIIKLVRRTLILFLLGLFLSWFSLVCTGVEQPFSH-------- 111
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
IR+ GVLQR+AL+Y SL+ + + + W ++ +L
Sbjct: 112 -----IRILGVLQRLALAYFFGSLLIVGVRRPAN-------------LAW---ISGIILA 150
Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
Y LL G F ++ N + DR + G H+
Sbjct: 151 GYSTLL------------------ALGHGFELS---------EQNIIAVTDRTLFGEAHL 183
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
Y R+ P F+PEGLLS++ I IIG G+++
Sbjct: 184 Y---------------------REWLPDGGRIFFDPEGLLSTLPCIAQVIIGYFCGNILR 222
Query: 314 HTKGHLARLKQWVTMGFALLIFGLTLHF 341
RL Q +G ALL G L +
Sbjct: 223 EKTEIHHRLLQISILGIALLFAGWLLSY 250
>gi|225010297|ref|ZP_03700769.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
gi|225005776|gb|EEG43726.1| conserved hypothetical protein [Flavobacteria bacterium MS024-3C]
Length = 363
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 57/159 (35%), Positives = 82/159 (51%), Gaps = 13/159 (8%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVA 87
+R+ S+DIFRG + LMILV+ G W + HA W+G L D V PFF+FIVGV+
Sbjct: 4 NKRVPSVDIFRGATLLLMILVNTPG-TWSAVYAPFLHASWHGYTLTDLVFPFFIFIVGVS 62
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
I+L+ K K+ R+LKL+ G+ L G F+ + + V IR GVLQR
Sbjct: 63 ISLSYKDKKLNGPVFFKLTKRSLKLIGLGLFL-GAFTIS---FPFIKAVENIRFPGVLQR 118
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWL 186
I L + S++ ++ K + I L W W+
Sbjct: 119 IGLVFFFASIIYLW----GSKRSTALIIGIILLAYWLWM 153
>gi|388258355|ref|ZP_10135531.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
gi|387937867|gb|EIK44422.1| hypothetical protein O59_002752 [Cellvibrio sp. BR]
Length = 362
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 81/297 (27%), Positives = 126/297 (42%), Gaps = 83/297 (27%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
QR +LD+ RGL +ALMILV+ G + + HA W+G DFV PFF+FIVG ++
Sbjct: 4 QRFQALDVMRGLTLALMILVNTPGSWSFVYGPLLHADWHGATATDFVFPFFMFIVGSSMY 63
Query: 90 LALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
A++ + A A +K++ R + L G+LL S P + ++ R+ GVLQR
Sbjct: 64 FAMRGLRQLAPAAQAQKILRRVVLLFVIGVLL----SAYP----FTNNIENWRVMGVLQR 115
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
IA++Y + + ++ GR +M+A +L+ Y LL
Sbjct: 116 IAIAYGFAAFIILYFG-------FTGRV----------VMSAILLLGYWGLL-------- 150
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
N + Y + N V D VLG NH++
Sbjct: 151 -----NIAADPY--------------SLEHNLVRQFDLAVLGANHLWQGKG--------- 182
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
F+PEG+LS+V SI++ IIG V++ ++ L Q
Sbjct: 183 -----------------LAFDPEGILSTVPSIVNVIIGFEATRVLLASEDKAKALSQ 222
>gi|384417772|ref|YP_005627132.1| membrane protein [Xanthomonas oryzae pv. oryzicola BLS256]
gi|353460685|gb|AEQ94964.1| membrane protein, putative [Xanthomonas oryzae pv. oryzicola
BLS256]
Length = 388
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 84/313 (26%), Positives = 125/313 (39%), Gaps = 75/313 (23%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R ++ G+L+ F PD V +RL GVL
Sbjct: 78 MSFALATNTPHLQFLGRVSKRAALIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 137
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI L YL +L+ + V +L+ Y ALLY
Sbjct: 138 QRIGLCYLAAALLVRYLPPRSIAPACV-----------------ALLLGYWALLY----- 175
Query: 206 DWQFTIINKDSADYGKVFNV-TCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+ + A+ K N TC +D + G H+Y
Sbjct: 176 -----VFGQPGAELSKTGNAGTC---------------LDLWLYGREHLY---------- 205
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
RKD F+PEGLL ++S+ ++ + G G + A +
Sbjct: 206 -----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFLQRHGKTTASTRS 248
Query: 325 WVTMGFALLIFGL 337
+ G +++ L
Sbjct: 249 LLLAGVGMVLLAL 261
>gi|294949094|ref|XP_002786049.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
gi|239900157|gb|EER17845.1| hypothetical protein Pmar_PMAR023775 [Perkinsus marinus ATCC 50983]
Length = 277
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 75/306 (24%), Positives = 108/306 (35%), Gaps = 99/306 (32%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
R+ ++D+ RG + + +VD G P I HAPWNG +LAD VMP F+FI
Sbjct: 31 RIVAVDVMRGRSS--VQIVDVCGKTVPSIGHAPWNGLHLADIVMPGFIFI---------- 78
Query: 94 RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
D LT G+D+ R G+LQRIA+ Y
Sbjct: 79 ----------------------------------DTLTLGLDLYTFRAPGILQRIAVCYA 104
Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
L+ D+ D G L+ C+++ +W ++
Sbjct: 105 AAVLLRKLVSDLSPNDTVKGALKNNSRVLLMGLL--CIII------------NWAIMLLG 150
Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
R L P CN IDR V G HMY P W
Sbjct: 151 PQPEGCS---------RGSLTPQCNVASNIDRMVFGPEHMY-SPLW-------------- 186
Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALL 333
+PEGLLS++ ++ + +G+ G I H L + V G L
Sbjct: 187 --------------DPEGLLSTLPTLATVALGLACGKFIQSRPSH-TELLRLVGCGLLLA 231
Query: 334 IFGLTL 339
+ G+ L
Sbjct: 232 LSGMAL 237
>gi|371776142|ref|ZP_09482464.1| hypothetical protein AnHS1_01923 [Anaerophaga sp. HS1]
Length = 369
Score = 79.7 bits (195), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 56/157 (35%), Positives = 85/157 (54%), Gaps = 17/157 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
KT+R +LD+ RG+ +ALMI V++ G W I H+ W+GC D V PFFLF+VGV
Sbjct: 3 KTERYLALDVLRGMTIALMITVNNPG-SWKYIYAPLRHSSWHGCTPTDLVFPFFLFVVGV 61
Query: 87 AIALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++ + + + ++ K++ RTL + G+ L + P + D +R+ GV
Sbjct: 62 SMFFSFSKYGNTLNKESFKRLGRRTLLIFAIGLFL----NSFPQ---WDRDYSTLRIMGV 114
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
LQRIAL+Y SL+ + V K + FSI LY
Sbjct: 115 LQRIALAYGFGSLIVL---SVPRKYIPLLGFSILLLY 148
>gi|320106288|ref|YP_004181878.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
gi|319924809|gb|ADV81884.1| hypothetical protein AciPR4_1053 [Terriglobus saanensis SP1PR4]
Length = 394
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 49/144 (34%), Positives = 77/144 (53%), Gaps = 8/144 (5%)
Query: 26 EKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLF 82
E TQR+ S+D+ RGL VA MILV+ G + + HAPWNG D V P FLF
Sbjct: 3 ETKAAPTQRILSVDVLRGLTVAFMILVNDPGDGHVAYAPLDHAPWNGWTPTDMVFPTFLF 62
Query: 83 IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
+VG +I ++ R D+ ++ + ++ + + + P + YG RM RL
Sbjct: 63 LVGCSIVFSITSRLKRGDSKSRIALQVIRRTIYLLAINYAIRLIP-QFHYG---RM-RLF 117
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQ 166
GVL RIA+ YL+ +L+ ++ + +
Sbjct: 118 GVLPRIAICYLIAALLFLWLQRAR 141
>gi|340618131|ref|YP_004736584.1| hypothetical protein zobellia_2146 [Zobellia galactanivorans]
gi|339732928|emb|CAZ96303.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
Length = 367
Score = 79.3 bits (194), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/312 (27%), Positives = 123/312 (39%), Gaps = 84/312 (26%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+R +LD+FRGL + LMI+V+ G DW + HA W+G D V P FLF VG A
Sbjct: 2 KRFKALDVFRGLTICLMIIVNTPG-DWDMTFSPLLHAKWHGFTPTDLVFPSFLFAVGNAF 60
Query: 89 ALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGV 144
A + D+ +D KK+ RTL + G + FS V R+ GV
Sbjct: 61 AFVKTKWADKPLSDIFKKIAKRTLIIFLLGYTMYWIPFFSWTETGDLAAVPFSETRILGV 120
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL--YGT 202
LQRIAL Y + +++ F + Q S A +L+ Y LL +G
Sbjct: 121 LQRIALCYFIGAIMIYFLTNRQLIIAS-----------------AVILLGYWGLLSAFGD 163
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
Y + N V IDR +LG +H+Y
Sbjct: 164 YTLE------------------------------GNFVRTIDRMLLGDSHLYMGNG---- 189
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
PF+PEGLLS++ SI + + G G II +L
Sbjct: 190 ----------------------IPFDPEGLLSTLPSICNVLGGYLVGKYIIDKGIDYEKL 227
Query: 323 KQWVTMGFALLI 334
+ + +G LL+
Sbjct: 228 AKMLLVGAGLLV 239
>gi|296452402|ref|ZP_06894103.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
difficile NAP08]
gi|296258732|gb|EFH05626.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
difficile NAP08]
Length = 481
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 48/150 (32%), Positives = 78/150 (52%), Gaps = 16/150 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 109 SKFVNSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 168
Query: 85 GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I ++ LK + + R++ L+ +G L + P D+ +R
Sbjct: 169 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVR 219
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQ 170
+ GVLQR+ L Y + SLV + K + +
Sbjct: 220 ILGVLQRMGLVYFVTSLVYLLLKKLNVRSS 249
>gi|153808903|ref|ZP_01961571.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
gi|423220258|ref|ZP_17206753.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
CL03T12C61]
gi|149128236|gb|EDM19455.1| hypothetical protein BACCAC_03204 [Bacteroides caccae ATCC 43185]
gi|392623335|gb|EIY17438.1| hypothetical protein HMPREF1061_03526 [Bacteroides caccae
CL03T12C61]
Length = 371
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 80/319 (25%), Positives = 133/319 (41%), Gaps = 85/319 (26%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
+RL +LD+ RG+ +A MILV++ G W + HA WNG D + PFF+FI+G+
Sbjct: 5 SNKRLLALDVMRGITIAGMILVNNPG-SWGYVYFPLKHAQWNGLTPTDLIFPFFMFIMGI 63
Query: 87 AIALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLC 142
+ ++L++ A K+I RT+ + GI + + D L + IR+
Sbjct: 64 STYISLRKYNFTFSTPAALKIIKRTIVIFLIGIAINWFALLCYYHDPLPFA----QIRVL 119
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GV+QR+AL Y +L+ + K +L+ A ++ ++ L+ G
Sbjct: 120 GVMQRLALCYGASALIALLIKHKYIP----------------YLIVALLVGYFILLITGN 163
Query: 203 YVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRS 262
G +N T N + +DR +LG HMY
Sbjct: 164 -----------------GFAYNET-----------NILSIVDRSILGDAHMY-------- 187
Query: 263 KACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARL 322
QD+ +PEGLLS++ SI +IG G +++ K +L
Sbjct: 188 ----QDN----------------HIDPEGLLSTIPSIAHVLIGFCVGKLLMEVKDIREKL 227
Query: 323 KQWVTMGFALLIFGLTLHF 341
++ +G L G L +
Sbjct: 228 ERLFLIGTILTFAGFLLSY 246
>gi|289667572|ref|ZP_06488647.1| hypothetical protein XcampmN_03447, partial [Xanthomonas campestris
pv. musacearum NCPPB 4381]
Length = 298
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 73/133 (54%), Gaps = 5/133 (3%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 22 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R + ++ G+L+ F PD V +RL GVL
Sbjct: 82 MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 141
Query: 146 QRIALSYLLVSLV 158
QRI L YL +L+
Sbjct: 142 QRIGLCYLAAALL 154
>gi|242062184|ref|XP_002452381.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
gi|241932212|gb|EES05357.1| hypothetical protein SORBIDRAFT_04g024713 [Sorghum bicolor]
Length = 96
Score = 79.3 bits (194), Expect = 3e-12, Method: Composition-based stats.
Identities = 42/101 (41%), Positives = 57/101 (56%), Gaps = 5/101 (4%)
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
L G+LQRIA++YLL ++ EI+ K D D G + R Y + + + + Y LLY
Sbjct: 1 LMGILQRIAIAYLLAAICEIWLKGDDDVDSGYG---LLRRYRYQLFVGLVLSIAYTILLY 57
Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
G YVPDW++ I S + K F+V CGVR CNAVG
Sbjct: 58 GIYVPDWEYKISGPGSTE--KSFSVKCGVRGDTGLACNAVG 96
>gi|444731031|gb|ELW71398.1| Heparan-alpha-glucosaminide N-acetyltransferase [Tupaia chinensis]
Length = 732
Score = 79.3 bits (194), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/334 (24%), Positives = 138/334 (41%), Gaps = 67/334 (20%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
RL +D FRG+A+ LM+ V++ GG + HA WN + +P L +GV +
Sbjct: 265 HRLRCVDTFRGIALILMVFVNYGGGKYWYFKHASWN-VSWDKVRIPGVLQRLGVTFFVV- 322
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGIL-------------------LQGGFSHAPDELTYG 133
AV +++F +++G+ + P L
Sbjct: 323 --------AVLELLFAKPVCIYYGVFNFSVNDIYAAAGMFKIQIARENCVEEFPVNLYRD 374
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAA 189
+ +R+ GVLQR+ +++ +V+++E+ F K V + S R W WL+
Sbjct: 375 LSWDKVRIPGVLQRLGVTFFVVAVLELLFAKPVPENCASERSCLSLRDVTSSWPQWLVIL 434
Query: 190 CVLVVYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL 248
+ ++L L + VP + D+GK N T G A GYID +L
Sbjct: 435 MLESIWLGLTFFLPVPGCPKGYLGPGGIGDFGKYPNCTGG----------AAGYIDHLLL 484
Query: 249 GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHF 308
G +H+Y HP+ ++PEG+L +++SI+ +GV
Sbjct: 485 GADHLYKHPS------------------STVLYHTEVAYDPEGILGTINSIVMAFLGVQA 526
Query: 309 GHVIIH----TKGHLARLKQWVTMGFALLIFGLT 338
G ++++ TK L R W + L+ GLT
Sbjct: 527 GKILLYYKDRTKDILIRFTAWCCI-LGLISIGLT 559
>gi|71278983|ref|YP_267171.1| hypothetical protein CPS_0413 [Colwellia psychrerythraea 34H]
gi|71144723|gb|AAZ25196.1| putative membrane protein [Colwellia psychrerythraea 34H]
Length = 358
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 74/278 (26%), Positives = 120/278 (43%), Gaps = 84/278 (30%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
R +LD FRG+ +ALMILV+ G W + HA W+G D V PFFLFI+G A+
Sbjct: 3 RYLALDAFRGITIALMILVNTPG-TWSHVYAPLLHAEWDGATPTDLVFPFFLFIIGSAMF 61
Query: 90 LALKRIPDRA--DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
+ K+ A + +K+I R + F G +L + + + V+ R+ G+LQR
Sbjct: 62 FSFKKSNFSASPEQFRKIIKRGFIMFFIGFML--------NVIPFTVNAEDWRIMGILQR 113
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
I ++Y + + + + ++ R +F + +A +L+ Y ALL
Sbjct: 114 IGIAYTVAACLVL----------TLNRTGVF-------IASAVILLAYWALL-------- 148
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
++ G L N + +D V G NHMY
Sbjct: 149 -----------------LSMG-EGALTIEGNIIRQLDLAVFGANHMY------------- 177
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+R A FEPEGLLS++ +I++ ++G
Sbjct: 178 ------TMRGVA-------FEPEGLLSTIPAIVNMLLG 202
>gi|345880604|ref|ZP_08832150.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
gi|343922516|gb|EGV33216.1| hypothetical protein HMPREF9431_00814 [Prevotella oulorum F0390]
Length = 383
Score = 79.0 bits (193), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 134/329 (40%), Gaps = 90/329 (27%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
KT R+ ++DI RG+ +A MILV++ GG + + HA W G D V PFF+FI+G+
Sbjct: 5 KTSRIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWFGLTPTDLVFPFFMFIMGITT 64
Query: 89 ALALKRIPDRAD--AVKKVIFRTLKLLFWGILL-------QGGFSHAPDELTYGVDVRM- 138
L+L++ KK+I R + L GI + +G F+ L + V
Sbjct: 65 YLSLRKYDFEWSWPCAKKIIKRGMLLYVIGIAISWLMMFCRGLFNEDYAALPFFSHVFAA 124
Query: 139 ------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
IRL GV R+A Y+ S+V + K RF WL+AA V
Sbjct: 125 ANVFDHIRLVGVFPRLAFCYVFASVVALSVKH---------RFI-------PWLIAA-VF 167
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
+ Y A+L + N + D + NV ID +LG H
Sbjct: 168 IGYFAVL----------CLGNGFAHDASNICNV-----------------IDEAILGRQH 200
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y + D P +PEGLLSS+ ++ +IG G V+
Sbjct: 201 LY---------------------KWDIP-------DPEGLLSSLPALGHVLIGFCVGRVV 232
Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+ ++++ G L I G L +
Sbjct: 233 MSATSLNDKIEKLFIYGAVLTILGFLLSY 261
>gi|352080530|ref|ZP_08951469.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
gi|351683811|gb|EHA66887.1| putative transmembrane protein [Rhodanobacter sp. 2APBS1]
Length = 353
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 67/198 (33%), Positives = 92/198 (46%), Gaps = 34/198 (17%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+RLASLD RG VA M+LV+ G DW + HA W+GC D V PFFLF+VGV++
Sbjct: 2 KRLASLDALRGCTVAAMLLVNDPG-DWGHVYWPLEHAAWHGCTPTDLVFPFFLFVVGVSV 60
Query: 89 ALA-LKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
ALA L R+ A + +R L++L G+ + + + +R GV
Sbjct: 61 ALAILPRLEQGAAPSALTRAATWRALRILALGVAIN-------LLAAWLLPQAHLRFPGV 113
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL-YGTY 203
LQRIAL + V+L I TK W A +L+ Y LL G
Sbjct: 114 LQRIALCFAGVALFAIHTKPRT-----------------QWWAIAALLIGYWGLLRLGGS 156
Query: 204 VPDWQFTIINKDSADYGK 221
+ W DSA +G+
Sbjct: 157 LEPWTNLASRVDSAVFGR 174
>gi|434387287|ref|YP_007097898.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
gi|428018277|gb|AFY94371.1| hypothetical protein Cha6605_3369 [Chamaesiphon minutus PCC 6605]
Length = 377
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 46/136 (33%), Positives = 78/136 (57%), Gaps = 11/136 (8%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG-----GDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
RL SLD+FRGL +A MILV+ A + + HAPW+G +AD V PFFL+I+GV+
Sbjct: 1 MRLTSLDVFRGLTMATMILVNMASLPNDDRKYAWLDHAPWHGYTIADLVFPFFLYIIGVS 60
Query: 88 IALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRL 141
+A +L + +P +++ R+ L G++L +++ E + ++ +R+
Sbjct: 61 MAFSLAKYTSGDVPLSKQVYWQILRRSAILFGLGLILNNLVWNYNLTEPKFFANLDKLRI 120
Query: 142 CGVLQRIALSYLLVSL 157
GVLQRI +++ S+
Sbjct: 121 MGVLQRIGIAFFFASI 136
>gi|149176468|ref|ZP_01855081.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
gi|148844581|gb|EDL58931.1| hypothetical protein PM8797T_29827 [Planctomyces maris DSM 8797]
Length = 518
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 86/369 (23%), Positives = 152/369 (41%), Gaps = 66/369 (17%)
Query: 3 EIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALM------------- 49
E+ AE ++E DVS +++K QRL SLD +RG + M
Sbjct: 43 EVSAEVEPAKA--VTEKDVSLKEKKKPETNQRLVSLDAYRGFVMLAMASGGLAIASVVRN 100
Query: 50 ---ILVDHAGGDWP------------EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR 94
+L + G W ++SH W G D + P F+F+VGV++ ++++
Sbjct: 101 SPEVLDQYNGTQWESSWKTLWQTLSYQLSHVEWTGAGFWDLIQPSFMFMVGVSMPFSVRK 160
Query: 95 IPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIAL 150
+ D+ K+ IFR + L+ G+ L FS V VL +I L
Sbjct: 161 RRQKGDSTFKIWMHAIFRAILLVALGVFLSSQFSPERGFTYEDVPQTNFTFANVLCQIGL 220
Query: 151 SYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFT 210
YL+V F + Q +G +I Y W + LA + TY+ +
Sbjct: 221 GYLVV----FFYVNRSFATQMIGVVTILGGY-WFFFYQYMPPEDELAAV-KTYLKE---- 270
Query: 211 IINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSP 270
+ +KD A++ + G+ + N NA +DR++L + Y +P +D P
Sbjct: 271 VQHKDEAEWSQF----SGIGSAWNKHTNAAAAVDRQLLNMFPRYDNP---------KDDP 317
Query: 271 FEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGF 330
+G W + L+ + SI + + G+ G ++I + ++K + G
Sbjct: 318 DQGD-----TFWVNK--GGYQTLNFIPSIATMLFGLMAGQLLISNRLEKMKVKWLLQAG- 369
Query: 331 ALLIFGLTL 339
L+ FG+++
Sbjct: 370 -LICFGVSM 377
>gi|422005552|ref|ZP_16352731.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
Shermani str. LT 821]
gi|417255773|gb|EKT85231.1| hypothetical protein LSS_18678 [Leptospira santarosai serovar
Shermani str. LT 821]
Length = 375
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 85/298 (28%), Positives = 127/298 (42%), Gaps = 87/298 (29%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
+++S R+ SLD+FRG+ V MILV++ G W I HA WNGC D V PF
Sbjct: 1 MEKQSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59
Query: 80 FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
FLF VG +I ++L K +R+D + R+ L+ G+ L G +S A
Sbjct: 60 FLFAVGTSIPISLYSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-------- 111
Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
+R+ GVLQRI Y +V SL +F K V I ++ W ++
Sbjct: 112 ---LRIPGVLQRIGFVYWVVASLCLVF----PGKKILVFLVPILLIHTW--------ILT 156
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
+AL + V S + GK + +IDR + G H+
Sbjct: 157 QIALPGESVV-----------SLEQGK----------------DIGAWIDRTIFGEKHL- 188
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
WR SK ++PEG LS V+S+++T+ GV G ++
Sbjct: 189 ----WRFSKT----------------------WDPEGFLSGVASVVTTLFGVLCGFIL 220
>gi|404487027|ref|ZP_11022214.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
YIT 11860]
gi|404335523|gb|EJZ61992.1| hypothetical protein HMPREF9448_02670 [Barnesiella intestinihominis
YIT 11860]
Length = 364
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 82/309 (26%), Positives = 131/309 (42%), Gaps = 81/309 (26%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
++RL SLD+ RG+ V MILV++AG + + HA W+G AD V P F+FI+GV+I
Sbjct: 7 SKRLVSLDVLRGITVCGMILVNNAGACGYAYAPLKHAKWDGFTPADLVFPAFMFIMGVSI 66
Query: 89 ALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSH-APDELTYGVDVRMIRLCGVLQ 146
L+L + D ++ +++ RT+ + G+ L+ + A E + +R+ GVLQ
Sbjct: 67 YLSLNKSNFDWRISIARILRRTVLIFMSGVALKWILAFIATGEYN---TLENLRIMGVLQ 123
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
R+ + Y +V+L+ + + H L + V LL G Y+
Sbjct: 124 RLGICYGIVALLAVTVR--------------------HRLFPTIIAV----LLVGYYLLQ 159
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+G F G N V +D VLG +HMY A
Sbjct: 160 L-----------FGNGFEKCAG---------NIVSMVDYAVLGKSHMYLGGA-------- 191
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK---GHLARLK 323
+PEG+LS++ +I +IG G VI+ K + +L
Sbjct: 192 ------------------QFVDPEGVLSTIPAIAQVMIGFLCGKVIVGEKEIRSQIVKLA 233
Query: 324 QWVTMGFAL 332
W T F +
Sbjct: 234 VWGTSMFVI 242
>gi|383934719|ref|ZP_09988159.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
nanhaiensis E407-8]
gi|383704254|dbj|GAB58250.1| heparan-alpha-glucosaminide N-acetyltransferase [Rheinheimera
nanhaiensis E407-8]
Length = 397
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 79/286 (27%), Positives = 129/286 (45%), Gaps = 65/286 (22%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
+ L T R+ +LD+ RGL + MILV++ G + + HA W+G + D + PFF+ IV
Sbjct: 14 AKLNTNRMLALDVLRGLTITAMILVNNPGSWNYVYSPLLHAQWHGWTITDLIFPFFIVIV 73
Query: 85 GVAIALALKR--IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPD-ELTYGVD-VRMIR 140
G+++ L+L++ + ++ +++ + R+ KL G+LL + + D E Y D + +R
Sbjct: 74 GMSLQLSLRQHSLNNKGPLIRQALLRSGKLFGLGLLLALFYYNFRDPEFNYVEDRLLTVR 133
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY 200
GVLQRI L YL L+ ++ ++ VYLA L+
Sbjct: 134 WLGVLQRIGLVYLATVLIVLYFGQRGRLL-----------------WLLGLMAVYLAGLW 176
Query: 201 GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWR 260
D Q G F R L + V ++D+ VLG NH+Y+ A
Sbjct: 177 WLPYQDAQ-----------GHEF------RGLLLFGNSFVAWLDQLVLGANHVYYRSA-- 217
Query: 261 RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
+PF F+PEGL S++ +I S + GV
Sbjct: 218 --------TPFA--------------FDPEGLWSTLPAIASCLTGV 241
>gi|296877751|ref|ZP_06901777.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
difficile NAP07]
gi|296431202|gb|EFH17023.1| brp/Blh family beta-carotene 15,15'-monooxygenase [Clostridium
difficile NAP07]
Length = 370
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 47/146 (32%), Positives = 77/146 (52%), Gaps = 16/146 (10%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +GV I
Sbjct: 2 NSRVKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 61
Query: 89 ALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++ LK + + R++ L+ +G L + P D+ +R+ GV
Sbjct: 62 PISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLNTVRILGV 112
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQ 170
LQR+ L Y + SLV + K + +
Sbjct: 113 LQRMGLVYFVTSLVYLLLKKLNVRSS 138
>gi|423089801|ref|ZP_17078150.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
difficile 70-100-2010]
gi|357557565|gb|EHJ39099.1| hypothetical protein HMPREF9945_01335, partial [Clostridium
difficile 70-100-2010]
Length = 391
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 48/146 (32%), Positives = 77/146 (52%), Gaps = 16/146 (10%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIV 84
S L R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +
Sbjct: 19 SKLMNSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISL 78
Query: 85 GVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIR 140
GV I ++ LK + + R++ L+ +G L + P D+ +R
Sbjct: 79 GVTIPISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVR 129
Query: 141 LCGVLQRIALSYLLVSLVEIFTKDVQ 166
+ GVLQR+ L Y + SL + K +
Sbjct: 130 ILGVLQRMGLVYFVTSLAYLLLKKLN 155
>gi|410613391|ref|ZP_11324450.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
psychrophila 170]
gi|410167053|dbj|GAC38339.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
psychrophila 170]
Length = 400
Score = 78.6 bits (192), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 87/325 (26%), Positives = 140/325 (43%), Gaps = 72/325 (22%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
Q H+ RL +LD+FRG+ + MILV++ G W I +HA W+G L D + PF
Sbjct: 10 QSIYQHVPANRLLALDVFRGMTITAMILVNNPG-SWQYIYSPLAHAKWHGWTLTDLIFPF 68
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQG-------GFSHAPDELTY 132
F+FIVGV+I+L+ +R ++ +I L +F +LL FS A D +
Sbjct: 69 FIFIVGVSISLSGQRQKEQGLGHGHIIHHALLRMFKLLLLGCFLALFYYNFS-AADYDWF 127
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
+ +R GVLQRIAL Y+ L+ +F +Q + C M A ++
Sbjct: 128 TQRLMQMRFMGVLQRIALVYMACVLLWLFLSRLQ------------LVIC----MLAILV 171
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
+LA+ + Y D + N G+ N N ++D + H
Sbjct: 172 AYWLAMAFIPYHDD---------------LGNQYVGLLEYAN---NLSAWLDNYLFAKTH 213
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y+ A PF F+PEG+LS++ +I S + GV G +
Sbjct: 214 LYYSSA----------QPFA--------------FDPEGVLSTLPAIASGLSGVLAGQWL 249
Query: 313 IHTKGHLARLKQWVTM-GFALLIFG 336
+ + +W+ + G L+ G
Sbjct: 250 SFSHHSMRHKAKWLAICGVVALLLG 274
>gi|289663929|ref|ZP_06485510.1| hypothetical protein XcampvN_12875 [Xanthomonas campestris pv.
vasculorum NCPPB 702]
Length = 392
Score = 78.6 bits (192), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 73/133 (54%), Gaps = 5/133 (3%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 22 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 81
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R + ++ G+L+ F PD V +RL GVL
Sbjct: 82 MSFALATNTPPLQFLGRVSKRAVLIVLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTGVL 141
Query: 146 QRIALSYLLVSLV 158
QRI L YL +L+
Sbjct: 142 QRIGLCYLAAALL 154
>gi|359460787|ref|ZP_09249350.1| hypothetical protein ACCM5_18818 [Acaryochloris sp. CCMEE 5410]
Length = 383
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 77/295 (26%), Positives = 128/295 (43%), Gaps = 82/295 (27%)
Query: 38 LDIFRGLAVALMILVDHAG---GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKR 94
LD+FRG+A+A M+LV+ +G +P++ HA W+G LAD V PFFL ++G ++A ++ R
Sbjct: 13 LDVFRGIAIAGMLLVNKSGLVKEAYPQLLHADWHGWTLADLVFPFFLVVLGASMAFSMAR 72
Query: 95 ------IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
P RA + K++ R+ L G+ L G +S + +R+ G+LQRI
Sbjct: 73 HTASLTQPKRAVYL-KILRRSAVLFGLGLFLNGFWSF---------NFSTLRVMGILQRI 122
Query: 149 ALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQ 208
+L+YL + V + + K Q W + +LV Y L VP
Sbjct: 123 SLTYLASAFVIL---KLPRKSQ--------------WGLTGLLLVGYWLALSFIPVP--- 162
Query: 209 FTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQD 268
++G L N YIDR ++G +H+Y +
Sbjct: 163 ---------EFGP---------GNLTRTGNFGAYIDRLIIGSSHLYVGDQFNSMG----- 199
Query: 269 SPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
+PEGL S++ +I + ++G +F I +G ++K
Sbjct: 200 -------------------DPEGLFSTLPAIATVLLG-YFAGDWIRKRGSGLKIK 234
>gi|421093382|ref|ZP_15554106.1| putative membrane protein [Leptospira borgpetersenii str.
200801926]
gi|410363365|gb|EKP14394.1| putative membrane protein [Leptospira borgpetersenii str.
200801926]
Length = 383
Score = 78.2 bits (191), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 132/324 (40%), Gaps = 87/324 (26%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
++KS R+ SLD+FRG+ V MILV++ G + + HA WNGC D V PFF
Sbjct: 1 MEKKSTQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 60
Query: 81 LFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
LF VG +I ++L K +R + R + L +L G F + E T+
Sbjct: 61 LFAVGASIPISLYSKNGINRIRVWIGICIRGISL-----ILLGLFLNFFGEWTF----SE 111
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
+R+ GVLQRI Y +V+ + F IF VLV + +
Sbjct: 112 LRIPGVLQRIGFVYWVVATL----------------FLIFP--------GKKVLVFLIPI 147
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
L V W T I L + +IDR++ G H+
Sbjct: 148 L---LVHTWILTHIAPPGES-----------MVSLEQGKDIGAWIDRRIFGEKHL----- 188
Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
W+ SK ++PEG LS ++SI +++ GV G ++ +G
Sbjct: 189 WKFSKT----------------------WDPEGFLSGIASIATSLFGVICGFILFRREG- 225
Query: 319 LARLKQWVTMGFALLIFGLTLHFT 342
R K V L IFGL FT
Sbjct: 226 --RGKNRV-----LSIFGLGFLFT 242
>gi|418709516|ref|ZP_13270303.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|421125723|ref|ZP_15585968.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|421135286|ref|ZP_15595410.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|410020544|gb|EKO87345.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. Andaman]
gi|410436829|gb|EKP85940.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. 2006006986]
gi|410770179|gb|EKR45405.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. UI 08368]
gi|456966468|gb|EMG08068.1| putative membrane protein [Leptospira interrogans serovar
Grippotyphosa str. LT2186]
Length = 381
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 80/322 (24%), Positives = 130/322 (40%), Gaps = 100/322 (31%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
++ L R+ SLD+FRG+ VA MILV++ G + + HA WNGC D V PFFLF
Sbjct: 2 ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61
Query: 84 VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
VG++I ++ K + + R++ L+ G+ L + EL R+
Sbjct: 62 VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQRI Y W++A+ L++ +
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
+++P + V + +L PP ++ Y IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H+ W+ SK ++PEG S +SSI ++++GV G
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGS- 216
Query: 312 IIHTKGHLARLKQWVTMGFALL 333
I+ +K + + + GF +L
Sbjct: 217 ILSSKTNETKKQILSIFGFGIL 238
>gi|389793498|ref|ZP_10196662.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter fulvus Jip2]
gi|388434056|gb|EIL91012.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter fulvus Jip2]
Length = 354
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 65/198 (32%), Positives = 94/198 (47%), Gaps = 34/198 (17%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
++RL SLD RG VA M+LV+ G DW I HAPW+GC D V PFFLF+VGV+
Sbjct: 2 SKRLPSLDALRGCTVAAMLLVNDPG-DWGHIYAPLEHAPWHGCTPTDLVFPFFLFVVGVS 60
Query: 88 IALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
ALAL ++ A VK ++R L++L G+ + + + +R G
Sbjct: 61 SALALLPRLEQGVAPGALVKAALWRALRILALGVAIN-------LLAAWLLPHAHLRFPG 113
Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLY-GT 202
VLQRI + + V+L + T+ + Q W+ +L+ Y LL G
Sbjct: 114 VLQRIGICFAAVALFAVHTR---PRTQ--------------WIAIGGILLGYWGLLLAGG 156
Query: 203 YVPDWQFTIINKDSADYG 220
V W + DS +G
Sbjct: 157 SVAPWVNIVSRTDSVVFG 174
>gi|381188372|ref|ZP_09895934.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
frigoris PS1]
gi|379650160|gb|EIA08733.1| N-acetylglucosamine related transporter, NagX [Flavobacterium
frigoris PS1]
Length = 430
Score = 78.2 bits (191), Expect = 6e-12, Method: Compositional matrix adjust.
Identities = 95/363 (26%), Positives = 145/363 (39%), Gaps = 128/363 (35%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
+ QR+ SLD+ RG+ + +M+LV++ G W + HA WNGC D V PFF+F++G
Sbjct: 1 MTKQRIISLDVLRGITIMMMVLVNNPG-SWDNVFAPLEHANWNGCTPTDLVFPFFIFVLG 59
Query: 86 VAIALALKRIPDRADAVKKVIFRTLKLLFWGI---------------------------- 117
AI LA+ + K++ R+L+++ G+
Sbjct: 60 AAIPLAILTKELNQQSFLKILTRSLRIISLGLFLGFYGKIEIFNLVGYPLLISKLIITGI 119
Query: 118 ---LLQGGFSH-----------------APDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
+L G F A +T +VR L GVLQRI + Y SL
Sbjct: 120 VAYILLGNFKQKIKFSLVLTLFFVFVFLAFSGITAYNEVR---LPGVLQRIGIVYFFTSL 176
Query: 158 VEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSA 217
V + T + K Q + + +LV Y A + VP
Sbjct: 177 VYLKT---EIKGQII--------------IIGLLLVGYWATMTLIPVP------------ 207
Query: 218 DYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRK 277
D+G A LN N G+ID +L NH+ W SK
Sbjct: 208 DFGP---------ANLNKGTNLAGWIDNLLLK-NHL-----WSFSKT------------- 239
Query: 278 DAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTM---GFALLI 334
++PEG+LS++ +I S IIG+ G ++ LA+ ++ + M G AL+I
Sbjct: 240 ---------WDPEGILSTIPAIASGIIGLLVGQLL---NSSLAKKEKGLKMFGAGLALVI 287
Query: 335 FGL 337
GL
Sbjct: 288 SGL 290
>gi|325922207|ref|ZP_08183994.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
gi|325547326|gb|EGD18393.1| hypothetical protein XGA_3017 [Xanthomonas gardneri ATCC 19865]
Length = 390
Score = 78.2 bits (191), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 57/162 (35%), Positives = 80/162 (49%), Gaps = 20/162 (12%)
Query: 6 AETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEI 62
+E T I + P ++ ++E R SLD+FRGL + LMILV+ AG + ++
Sbjct: 2 SEQTPAAAAITASPTLTPKRE-------RFLSLDVFRGLTIFLMILVNTAGPGAQAYAQL 54
Query: 63 SHAPWNGCNLADFVMPFFLFIVGVAIALAL------KRIPDRADAVKKVIFRTLKLLFWG 116
+HA W G LAD V P FLF VG A++ AL + R +IF L++W
Sbjct: 55 THAAWFGFTLADLVFPSFLFAVGSAMSFALAADTPHRPFLGRVGKRAALIFLCGVLMYWF 114
Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
F P + +RL GVLQRI L YLL +L+
Sbjct: 115 PF----FHLQPGGGWAFTAIDQLRLTGVLQRIGLCYLLAALL 152
>gi|146302547|ref|YP_001197138.1| hypothetical protein Fjoh_4820 [Flavobacterium johnsoniae UW101]
gi|146156965|gb|ABQ07819.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
Length = 423
Score = 77.8 bits (190), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 87/356 (24%), Positives = 139/356 (39%), Gaps = 114/356 (32%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+ +RL SLD+FRG + LM +V++ G +P + HA W+GC D V PFF+FI+G
Sbjct: 1 MTKERLTSLDVFRGFTILLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGT 60
Query: 87 AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-------------------------- 120
AI A+ K++ R+L++ G+ L
Sbjct: 61 AIPFAMPVKHFDGAVFNKILVRSLRIFCLGLFLSVFSRIHLFGLEGIPLLGVRLVIAFAV 120
Query: 121 -----GGFS-HAPDELTYGVDVRMI-------------RLCGVLQRIALSYLLVSLVEIF 161
G FS L G+ + ++ R+ GVLQRIA+ Y S++ +
Sbjct: 121 AYALLGNFSMKVKTILAVGIFIILLSLAFSGLEHFEDTRIPGVLQRIAIVYFFASILYLK 180
Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGK 221
T K Q W++A+ +++ YL + +VP F N D
Sbjct: 181 T---NLKTQ-------------LWVVASILVIYYLLM---AFVPVPGFGPANFDKG---- 217
Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
N ++D VL H+ W SK +
Sbjct: 218 ---------------TNLAAWLDNLVLN-GHL-----WSVSK-----------------T 239
Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
W +PEG+LS++ +I + I+G++ G ++ LK+ G LLI GL
Sbjct: 240 W-----DPEGILSTLPAIGTGILGMYIGQLLNLQTNRTEILKKTAVTGVILLIGGL 290
>gi|410638830|ref|ZP_11349383.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
lipolytica E3]
gi|410141358|dbj|GAC16588.1| heparan-alpha-glucosaminide N-acetyltransferase [Glaciecola
lipolytica E3]
Length = 365
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 78/299 (26%), Positives = 123/299 (41%), Gaps = 86/299 (28%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
R SLDIFRG+ +A M+LV++ G +P + HA W+G D + PFFLFIVG A+
Sbjct: 2 NRQISLDIFRGITLAAMLLVNNPGSWSFVYPPLLHAKWHGLTPTDLIFPFFLFIVGAAMF 61
Query: 90 LALKRIPDRADAV-----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++ R +A+ +K+ RT+ L G LL + + D + R+ GV
Sbjct: 62 HSMGRYLPKANQALQVPWQKIAKRTIVLFAIGFLL--------NIFPFTGDPQNWRIMGV 113
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIA+ Y + +++ Q L+AAC+ LL G ++
Sbjct: 114 LQRIAICYGIAAILICVLHQKQ-------------------LIAACI-----TLLIGYWL 149
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
++N YG N V ID +VLG H+Y
Sbjct: 150 ------MLNLVENPYGL--------------ETNLVRLIDIEVLGSAHLYQG-------- 181
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
F+PEGLLS + ++++ + G ++ + K R+K
Sbjct: 182 ------------------FGVAFDPEGLLSCIPAVVTVLAGFFTSKMLANAKTEQQRMK 222
>gi|424670170|ref|ZP_18107195.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
Ab55555]
gi|401070628|gb|EJP79142.1| hypothetical protein A1OC_03788 [Stenotrophomonas maltophilia
Ab55555]
Length = 355
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL S+D RG+ VA M+LV++ G DW + H+ W+GC D V PFFLF+VGV++
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65
Query: 89 ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
A ++ R D R + V+ R L++L G LL + + +D R+ GV
Sbjct: 66 AFSVAPRAQDAAARPALARGVLERALRILMAGALLH-------LLIWWALDTHHFRIWGV 118
Query: 145 LQRIALSYLLVSLVEIFTK 163
LQRIA+ LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137
>gi|319785830|ref|YP_004145305.1| transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
gi|317464342|gb|ADV26074.1| putative transmembrane protein [Pseudoxanthomonas suwonensis 11-1]
Length = 357
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 58/141 (41%), Positives = 78/141 (55%), Gaps = 16/141 (11%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGV 86
+ +RLAS+D RGL VA M+LV++ G DW + HA W+GC AD V PFFL IVGV
Sbjct: 5 RFRRLASVDALRGLTVAAMLLVNNPG-DWGHVYAPLLHADWHGCTPADLVFPFFLAIVGV 63
Query: 87 AIALA-LKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
+IAL + RI DRA ++ V R L++L G+LL D+ Y R
Sbjct: 64 SIALGVVPRIEAGADRAGLMRTVAVRPLRILAVGLLLHLLAWWWLDQPHY-------RPW 116
Query: 143 GVLQRIALSYLLVSLVEIFTK 163
GVLQRI L +L ++ +
Sbjct: 117 GVLQRIGLCFLGAGAAALYLR 137
>gi|418690664|ref|ZP_13251772.1| putative membrane protein [Leptospira interrogans str. FPW2026]
gi|418722429|ref|ZP_13281595.1| putative membrane protein [Leptospira interrogans str. UI 12621]
gi|400360164|gb|EJP16144.1| putative membrane protein [Leptospira interrogans str. FPW2026]
gi|409963797|gb|EKO27519.1| putative membrane protein [Leptospira interrogans str. UI 12621]
gi|455790461|gb|EMF42326.1| putative membrane protein [Leptospira interrogans serovar Lora str.
TE 1992]
Length = 381
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 121/301 (40%), Gaps = 99/301 (32%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
++ L R+ SLD+FRG+ VA MILV++ G + + HA WNGC D V PFFLF
Sbjct: 2 ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61
Query: 84 VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
VG++I ++ K + + R++ L+ G+ L + EL R+
Sbjct: 62 VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQRI Y W++A+ L++ +
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
+++P + V + +L PP ++ Y IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H+ W+ SK ++PEG S +SSI ++++GV G +
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGSI 217
Query: 312 I 312
+
Sbjct: 218 L 218
>gi|417760159|ref|ZP_12408187.1| putative membrane protein [Leptospira interrogans str. 2002000624]
gi|417775681|ref|ZP_12423532.1| putative membrane protein [Leptospira interrogans str. 2002000621]
gi|418673844|ref|ZP_13235155.1| putative membrane protein [Leptospira interrogans str. 2002000623]
gi|409944118|gb|EKN89707.1| putative membrane protein [Leptospira interrogans str. 2002000624]
gi|410574555|gb|EKQ37586.1| putative membrane protein [Leptospira interrogans str. 2002000621]
gi|410579122|gb|EKQ46972.1| putative membrane protein [Leptospira interrogans str. 2002000623]
Length = 381
Score = 77.8 bits (190), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 121/301 (40%), Gaps = 99/301 (32%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
++ L R+ SLD+FRG+ VA MILV++ G + + HA WNGC D V PFFLF
Sbjct: 2 ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61
Query: 84 VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
VG++I ++ K + + R++ L+ G+ L + EL R+
Sbjct: 62 VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQRI Y W++A+ L++ +
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
+++P + V + +L PP ++ Y IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H+ W+ SK ++PEG S +SSI ++++GV G +
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGSI 217
Query: 312 I 312
+
Sbjct: 218 L 218
>gi|24213473|ref|NP_710954.1| hypothetical protein LA_0773 [Leptospira interrogans serovar Lai
str. 56601]
gi|45658672|ref|YP_002758.1| hypothetical protein LIC12842 [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|386073105|ref|YP_005987422.1| hypothetical protein LIF_A0631 [Leptospira interrogans serovar Lai
str. IPAV]
gi|417764272|ref|ZP_12412242.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
gi|417786789|ref|ZP_12434477.1| putative membrane protein [Leptospira interrogans str. C10069]
gi|418669621|ref|ZP_13231000.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|418701883|ref|ZP_13262801.1| putative membrane protein [Leptospira interrogans serovar Bataviae
str. L1111]
gi|418702896|ref|ZP_13263788.1| putative membrane protein [Leptospira interrogans serovar
Hebdomadis str. R499]
gi|418717763|ref|ZP_13277304.1| putative membrane protein [Leptospira interrogans str. UI 08452]
gi|418729566|ref|ZP_13288113.1| putative membrane protein [Leptospira interrogans str. UI 12758]
gi|421083731|ref|ZP_15544602.1| putative membrane protein [Leptospira santarosai str. HAI1594]
gi|421102101|ref|ZP_15562711.1| putative membrane protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|421121310|ref|ZP_15581607.1| putative membrane protein [Leptospira interrogans str. Brem 329]
gi|24194245|gb|AAN47972.1|AE011263_12 conserved hypothetical protein [Leptospira interrogans serovar Lai
str. 56601]
gi|45601916|gb|AAS71395.1| conserved hypothetical protein [Leptospira interrogans serovar
Copenhageni str. Fiocruz L1-130]
gi|353456894|gb|AER01439.1| conserved hypothetical protein [Leptospira interrogans serovar Lai
str. IPAV]
gi|400353508|gb|EJP05677.1| putative membrane protein [Leptospira interrogans serovar Bulgarica
str. Mallika]
gi|409950064|gb|EKO04595.1| putative membrane protein [Leptospira interrogans str. C10069]
gi|410345744|gb|EKO96814.1| putative membrane protein [Leptospira interrogans str. Brem 329]
gi|410368246|gb|EKP23624.1| putative membrane protein [Leptospira interrogans serovar
Icterohaemorrhagiae str. Verdun LP]
gi|410433648|gb|EKP77988.1| putative membrane protein [Leptospira santarosai str. HAI1594]
gi|410754552|gb|EKR16202.1| putative membrane protein [Leptospira interrogans serovar Pyrogenes
str. 2006006960]
gi|410759015|gb|EKR25234.1| putative membrane protein [Leptospira interrogans serovar Bataviae
str. L1111]
gi|410767440|gb|EKR38115.1| putative membrane protein [Leptospira interrogans serovar
Hebdomadis str. R499]
gi|410775744|gb|EKR55735.1| putative membrane protein [Leptospira interrogans str. UI 12758]
gi|410786933|gb|EKR80669.1| putative membrane protein [Leptospira interrogans str. UI 08452]
gi|456824782|gb|EMF73208.1| putative membrane protein [Leptospira interrogans serovar Canicola
str. LT1962]
Length = 381
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 121/301 (40%), Gaps = 99/301 (32%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
++ L R+ SLD+FRG+ VA MILV++ G + + HA WNGC D V PFFLF
Sbjct: 2 ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61
Query: 84 VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
VG++I ++ K + + R++ L+ G+ L + EL R+
Sbjct: 62 VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQRI Y W++A+ L++ +
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
+++P + V + +L PP ++ Y IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H+ W+ SK ++PEG S +SSI ++++GV G +
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGSI 217
Query: 312 I 312
+
Sbjct: 218 L 218
>gi|126700401|ref|YP_001089298.1| membrane protein [Clostridium difficile 630]
gi|115251838|emb|CAJ69673.1| putative membrane protein [Clostridium difficile 630]
Length = 370
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 47/142 (33%), Positives = 76/142 (53%), Gaps = 16/142 (11%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
R+ S+DI RGL++ALMI+ ++ G +P++ HA W+G LADF PFF+ +GV I
Sbjct: 2 NSRIKSIDIIRGLSIALMIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTI 61
Query: 89 ALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
++ LK + + R++ L+ +G L + P D+ +R+ GV
Sbjct: 62 PISINSKLKNNKSTLSIILSIFKRSILLILFGFFLN--YLGNP-------DLDTVRILGV 112
Query: 145 LQRIALSYLLVSLVEIFTKDVQ 166
LQR+ L Y + SLV + K +
Sbjct: 113 LQRMGLVYFVTSLVYLLLKKLN 134
>gi|417770421|ref|ZP_12418329.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|418680131|ref|ZP_13241383.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|421117858|ref|ZP_15578212.1| putative membrane protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|400328139|gb|EJO80376.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Kennewicki LC82-25]
gi|409947562|gb|EKN97558.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Pomona]
gi|410010535|gb|EKO68672.1| putative membrane protein [Leptospira interrogans serovar Canicola
str. Fiocruz LV133]
gi|455668600|gb|EMF33807.1| putative membrane protein [Leptospira interrogans serovar Pomona
str. Fox 32256]
Length = 381
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 75/301 (24%), Positives = 121/301 (40%), Gaps = 99/301 (32%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
++ L R+ SLD+FRG+ VA MILV++ G + + HA WNGC D V PFFLF
Sbjct: 2 ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFFLFA 61
Query: 84 VGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
VG++I ++ K + + R++ L+ G+ L + EL R+
Sbjct: 62 VGISIHFSVYSKNKIYLSKTWLGICIRSITLILIGLFLNFFGEWSFSEL---------RI 112
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQRI Y W++A+ L++ +
Sbjct: 113 PGVLQRIGFVY--------------------------------WVVASLYLILPKRAILI 140
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY----------IDRKVLGIN 251
+++P + V + +L PP ++ Y IDR V G N
Sbjct: 141 SWIP----------------ILIVHTWILIQLPPPGESIVYLEPGKDIGAWIDRNVFGEN 184
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H+ W+ SK ++PEG S +SSI ++++GV G +
Sbjct: 185 HL-----WKFSKT----------------------WDPEGFFSGISSITTSLLGVFCGSI 217
Query: 312 I 312
+
Sbjct: 218 L 218
>gi|428210738|ref|YP_007083882.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
6304]
gi|427999119|gb|AFY79962.1| hypothetical protein Oscil6304_0209 [Oscillatoria acuminata PCC
6304]
Length = 398
Score = 77.8 bits (190), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 80/141 (56%), Gaps = 10/141 (7%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPF 79
+ ++ L + RL SLD+FRG+A+A MILV++ G +P + HAPW+G D + P
Sbjct: 9 NPSVQNLLNSMRLTSLDVFRGMAIASMILVNNPGSWQQVYPPLLHAPWHGFTPTDLIFPA 68
Query: 80 FLFIVGVAIALALKRIPDRADA--VKKVIFRTLK---LLFWGILLQGGFSHAPDELTYG- 133
FLFI GVA+A + + + ++ V F+ L+ +LF L G + L G
Sbjct: 69 FLFISGVAMAFSFAKYTNSPNSPPAASVYFKILRRALILFGLGLFLNGSTLVLKTLLQGQ 128
Query: 134 -VDVRMIRLCGVLQRIALSYL 153
+D +R+ GVLQRI+L+YL
Sbjct: 129 PLDFGTLRIMGVLQRISLAYL 149
>gi|452822118|gb|EME29140.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 2
[Galdieria sulphuraria]
Length = 351
Score = 77.4 bits (189), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 107/251 (42%), Gaps = 79/251 (31%)
Query: 64 HAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-------DRADAVKKVIFRTLKLLFWG 116
H W ++AD + PFFLF+VG +I A +++P ++ A++ V RT+KL G
Sbjct: 14 HESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEKDKALRSVTSRTIKLFLVG 73
Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
+LL S G +R G+LQRIA+ Y V+ + +F + V +++
Sbjct: 74 VLLNVPLS--------GFRWETLRWMGILQRIAICYGCVAFLFLFV------NSRVIQYA 119
Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
+ + + +++ +LLYG VP+ C + +L
Sbjct: 120 ----------LVSVLFLLHTSLLYGLIVPN--------------------CLISERLTRA 149
Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
C+A Y+D +LG H+Y H ++PEG+LS++
Sbjct: 150 CSAQSYLDTMILGGKHLYF----------------------------HLEYDPEGILSTL 181
Query: 297 SSILSTIIGVH 307
+ ++T G+
Sbjct: 182 MATINTFAGLE 192
>gi|456734835|gb|EMF59605.1| N-acetylglucosamine transporter NagX [Stenotrophomonas maltophilia
EPM1]
Length = 355
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL S+D RG+ VA M+LV++ G DW + H+ W+GC D V PFFLF+VGV++
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65
Query: 89 ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
A ++ R D R + V+ R L++L G LL + + +D R+ GV
Sbjct: 66 AFSVAPRAQDAAARPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGV 118
Query: 145 LQRIALSYLLVSLVEIFTK 163
LQRIA+ LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137
>gi|190575857|ref|YP_001973702.1| transmembrane protein [Stenotrophomonas maltophilia K279a]
gi|190013779|emb|CAQ47415.1| putative transmembrane protein [Stenotrophomonas maltophilia K279a]
Length = 355
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL S+D RG+ VA M+LV++ G DW + H+ W+GC D V PFFLF+VGV++
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65
Query: 89 ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
A ++ R D R + V+ R L++L G LL + + +D R+ GV
Sbjct: 66 AFSVAPRAQDAAARPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGV 118
Query: 145 LQRIALSYLLVSLVEIFTK 163
LQRIA+ LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137
>gi|418737426|ref|ZP_13293823.1| putative membrane protein [Leptospira borgpetersenii serovar
Castellonis str. 200801910]
gi|410746620|gb|EKQ99526.1| putative membrane protein [Leptospira borgpetersenii serovar
Castellonis str. 200801910]
Length = 383
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 131/324 (40%), Gaps = 87/324 (26%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
++KS R+ SLD+FRG+ V MILV++ G + + HA WNGC D V PFF
Sbjct: 1 MEKKSTQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 60
Query: 81 LFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
LF VG +I ++L K +R + R + L +L G F + E T+
Sbjct: 61 LFAVGTSIPISLYSKNGINRIRVWIGICIRGISL-----ILLGLFLNFFGEWTF----SE 111
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
+R+ GVLQRI Y +V+ + F IF VLV + +
Sbjct: 112 LRIPGVLQRIGFVYWVVATL----------------FLIFP--------GKKVLVFLIPI 147
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
L V W T I L + +IDR + G H+
Sbjct: 148 L---LVHTWILTHIAPPGES-----------MVSLEQGKDIGAWIDRTIFGEKHL----- 188
Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
W+ SK ++PEG LS ++SI +++ GV G ++ +G
Sbjct: 189 WKFSKT----------------------WDPEGFLSGIASIATSLFGVICGFILFRREG- 225
Query: 319 LARLKQWVTMGFALLIFGLTLHFT 342
R K V L IFGL FT
Sbjct: 226 --RGKNRV-----LSIFGLGFLFT 242
>gi|84625357|ref|YP_452729.1| hypothetical protein XOO_3700 [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188575197|ref|YP_001912126.1| hypothetical protein PXO_04319 [Xanthomonas oryzae pv. oryzae
PXO99A]
gi|84369297|dbj|BAE70455.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae MAFF
311018]
gi|188519649|gb|ACD57594.1| membrane protein, putative [Xanthomonas oryzae pv. oryzae PXO99A]
Length = 388
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 53/133 (39%), Positives = 71/133 (53%), Gaps = 5/133 (3%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R + G+L+ F PD V +RL GVL
Sbjct: 78 MSFALATNMPHLQFLGRVSKRAALIALCGVLMYWFPFFHLQPDGGWAFTTVDQVRLTGVL 137
Query: 146 QRIALSYLLVSLV 158
QRI L YL +L+
Sbjct: 138 QRIGLCYLAAALL 150
>gi|452822119|gb|EME29141.1| heparan-alpha-glucosaminide N-acetyltransferase isoform 1
[Galdieria sulphuraria]
Length = 356
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 107/251 (42%), Gaps = 79/251 (31%)
Query: 64 HAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-------DRADAVKKVIFRTLKLLFWG 116
H W ++AD + PFFLF+VG +I A +++P ++ A++ V RT+KL G
Sbjct: 19 HESWFSWHMADLIFPFFLFMVGSSIYFAFRKVPREVENSEEKDKALRSVTSRTIKLFLVG 78
Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFS 176
+LL S G +R G+LQRIA+ Y V+ + +F + V +++
Sbjct: 79 VLLNVPLS--------GFRWETLRWMGILQRIAICYGCVAFLFLFV------NSRVIQYA 124
Query: 177 IFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPP 236
+ + + +++ +LLYG VP+ C + +L
Sbjct: 125 ----------LVSVLFLLHTSLLYGLIVPN--------------------CLISERLTRA 154
Query: 237 CNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSV 296
C+A Y+D +LG H+Y H ++PEG+LS++
Sbjct: 155 CSAQSYLDTMILGGKHLYF----------------------------HLEYDPEGILSTL 186
Query: 297 SSILSTIIGVH 307
+ ++T G+
Sbjct: 187 MATINTFAGLE 197
>gi|418719584|ref|ZP_13278783.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
gi|410743627|gb|EKQ92369.1| putative membrane protein [Leptospira borgpetersenii str. UI 09149]
Length = 383
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 91/324 (28%), Positives = 131/324 (40%), Gaps = 87/324 (26%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
++KS R+ SLD+FRG+ V MILV++ G + + HA WNGC D V PFF
Sbjct: 1 MEKKSTQNKDRILSLDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 60
Query: 81 LFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
LF VG +I ++L K +R + R + L +L G F + E T+
Sbjct: 61 LFAVGASIPISLYSKNGINRIRVWIGICIRGISL-----ILLGLFLNFFGEWTF----SE 111
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
+R+ GVLQRI Y +V+ + F IF VLV + +
Sbjct: 112 LRIPGVLQRIGFVYWVVATL----------------FLIFP--------GKKVLVFLIPI 147
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
L V W T I L + +IDR + G H+
Sbjct: 148 L---LVHTWILTHIAPPGES-----------MVSLEQGKDIGAWIDRTIFGEKHL----- 188
Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
W+ SK ++PEG LS ++SI +++ GV G ++ +G
Sbjct: 189 WKFSKT----------------------WDPEGFLSGIASIATSLFGVICGFILFRREG- 225
Query: 319 LARLKQWVTMGFALLIFGLTLHFT 342
R K V L IFGL FT
Sbjct: 226 --RGKNRV-----LSIFGLGFLFT 242
>gi|16124796|ref|NP_419360.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
gi|221233512|ref|YP_002515948.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
gi|13421730|gb|AAK22528.1| hypothetical protein CC_0541 [Caulobacter crescentus CB15]
gi|220962684|gb|ACL94040.1| hypothetical protein CCNA_00575 [Caulobacter crescentus NA1000]
Length = 372
Score = 77.4 bits (189), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 84/311 (27%), Positives = 120/311 (38%), Gaps = 72/311 (23%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
R SLD+FRGL V LMI+V+ AG + ++ HAPW G AD V P FLF VG ++
Sbjct: 5 AARFLSLDVFRGLTVFLMIVVNTAGPGAKAYSQLVHAPWFGFTAADAVFPSFLFAVGCSM 64
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-GGFSHAPDELTYGVDVRMIRLCGVLQR 147
A A + D KV+ R + G L+ F D + R+ GVLQR
Sbjct: 65 AFAFSKPIPLNDFTVKVLRRAALIFLLGFLMYWFPFVRKVDGDWALIPFSDTRVMGVLQR 124
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
IAL YLL + + WL ++ + LL G W
Sbjct: 125 IALCYLLAA------------------------FAVRWLSPRLIVALCAVLLLGY----W 156
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
+ D A A L+ NA +D ++G NH+Y
Sbjct: 157 AILMAFGDPA-------------APLSKLGNAGTRLDLLLIGQNHLY------------- 190
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
RKD F+PEGLL ++ S ++ + G + G + +
Sbjct: 191 --------RKD------GGFDPEGLLGTLPSTVNVLAGYLAARFLKENPGSSQAMGRMAI 236
Query: 328 MGFALLIFGLT 338
G L++ GL
Sbjct: 237 AGLVLILAGLV 247
>gi|436833933|ref|YP_007319149.1| Protein of unknown function DUF2261,transmembrane [Fibrella
aestuarina BUZ 2]
gi|384065346|emb|CCG98556.1| Protein of unknown function DUF2261,transmembrane [Fibrella
aestuarina BUZ 2]
Length = 361
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/286 (27%), Positives = 113/286 (39%), Gaps = 85/286 (29%)
Query: 49 MILVDHAGGDWP----EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR-----IPDRA 99
MILV++AG DW + HAPWNG D + PFFLFIVGV+I AL + + D
Sbjct: 1 MILVNNAG-DWAHSYAPLKHAPWNGWTPTDLIFPFFLFIVGVSITFALSKRQTSLLEDEK 59
Query: 100 DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE 159
K+I R + L G L L D +R+ GVLQRI + Y + +LV
Sbjct: 60 TQRLKIIRRGVTLFALGFFLN---------LFPRFDFANVRIMGVLQRIGIVYTVCALVF 110
Query: 160 IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADY 219
+ T Q + + +L+ Y L+ VP +
Sbjct: 111 LRTSPRQQVN-----------------LILLILIGYFLLMTMVPVPGIGY---------- 143
Query: 220 GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDA 279
A L P N +IDR +L H Y R SK
Sbjct: 144 -----------ANLEPETNLAAWIDRTILTPAHCY-----RSSKV--------------- 172
Query: 280 PSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK-GHLARLKQ 324
++PEGLLS+V +I + ++G+ G + T+ G R Q
Sbjct: 173 -------WDPEGLLSTVPAIATGLLGLLAGRWLRSTRYGTTVRESQ 211
>gi|21241481|ref|NP_641063.1| hypothetical protein XAC0710 [Xanthomonas axonopodis pv. citri str.
306]
gi|21106824|gb|AAM35599.1| conserved hypothetical protein [Xanthomonas axonopodis pv. citri
str. 306]
Length = 388
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 80/287 (27%), Positives = 116/287 (40%), Gaps = 73/287 (25%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K +R SLD+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A
Sbjct: 18 KRERFLSLDVFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSA 77
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
++ AL + +V R +L G+L+ F PD V +RL VL
Sbjct: 78 MSFALATNTPHLQFLGRVSRRAALILLCGVLMYWFPFFHLQPDGGWAFTTVDQLRLTCVL 137
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRI L YL +L+ + + +L+ Y ALLY P
Sbjct: 138 QRIGLCYLAAALLVRYLPPRGIAPVCL-----------------ALLLGYWALLYAFGQP 180
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
A+L+ NA +D + G +H+Y
Sbjct: 181 G------------------------AELSKTGNAGTRLDLWLYGRDHLY----------- 205
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
RKD F+PEGLL ++S+ ++ + G G +
Sbjct: 206 ----------RKD------GGFDPEGLLGTLSATVNVLAGYLCGRFL 236
>gi|242062186|ref|XP_002452382.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
gi|241932213|gb|EES05358.1| hypothetical protein SORBIDRAFT_04g024716 [Sorghum bicolor]
Length = 108
Score = 77.0 bits (188), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 46/65 (70%)
Query: 279 APSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
APSWC APF+PEGLLSSV +I++ +IG+ FGH+IIH + H R+ W+ + F++L
Sbjct: 13 APSWCQAPFDPEGLLSSVMAIVTCLIGLQFGHIIIHFEKHRGRITNWLILSFSMLALAFL 72
Query: 339 LHFTN 343
+ F+
Sbjct: 73 MDFSG 77
>gi|386392672|ref|ZP_10077453.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
gi|385733550|gb|EIG53748.1| hypothetical protein DesU5LDRAFT_2079 [Desulfovibrio sp. U5L]
Length = 370
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 78/287 (27%), Positives = 122/287 (42%), Gaps = 82/287 (28%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+ RLAS+D RGLAVA MIL ++ G + E+ HA W+G ADF+ P FLF+VGV
Sbjct: 5 RKTRLASVDGLRGLAVAGMILANNPGERGHVYRELQHAVWDGWTAADFIFPLFLFLVGVC 64
Query: 88 IALALKRIPDRADAV----KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
+ALA+ R R ++V+ R + L G+L + V +R+ G
Sbjct: 65 VALAVDRDTVRTGEAHRFWRRVLTRAIILFLLGLL---------ENAYLRVSFENLRIPG 115
Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
VLQRIA+ YL + + + + + G S+ + L+ Y LL G
Sbjct: 116 VLQRIAVVYLATAWLHV-------RCGNRGIVSVILV----------TLLGYWLLLAGVP 158
Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
VP +++D N G+ID+ +LG NH++ +
Sbjct: 159 VPGLGHPSLSRD---------------------VNWEGWIDQLLLG-NHIWKY------- 189
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++PEG+LS+ +I ++GV G
Sbjct: 190 --------------------ETTWDPEGVLSTFPAIALGLVGVLCGR 216
>gi|445498183|ref|ZP_21465038.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
gi|444788178|gb|ELX09726.1| putative membrane protein DUF1624 [Janthinobacterium sp. HH01]
Length = 370
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 81/287 (28%), Positives = 119/287 (41%), Gaps = 79/287 (27%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
QR ++D+ RGL VALMI+V+ G + HA W+G L D V P F+F+VG A++
Sbjct: 6 QRSQAIDVLRGLTVALMIMVNMPGTPATTYAPFLHAEWHGLTLTDLVFPTFMFVVGTALS 65
Query: 90 LALKRIPDRADA--VKKVIFRTLKLLFWGILLQGG--FSHAPDELTYGVDVRMIRLCGVL 145
L++ +A +KK+ RT + G L+ FS LT + + R+ GVL
Sbjct: 66 FTLEKYEGMGEAAVLKKIFTRTALIFLCGFLMYWYPFFSTDGGSLTV-LPLSGTRIFGVL 124
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRIAL Y SL+ + + +K V A L+ Y ++YG
Sbjct: 125 QRIALGYCAGSLILHYWR---EKGALV--------------FAVLALLGYWTVMYG---- 163
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
DY NA +D VLG HMYH
Sbjct: 164 ----------FGDY--------------TLAGNAQRKLDLLVLGEAHMYHG--------- 190
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
EG F+PEG+LS++ SI++ + G G ++
Sbjct: 191 ------EG-----------IAFDPEGILSTLPSIVNVLAGYFAGRLV 220
>gi|313203961|ref|YP_004042618.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
gi|312443277|gb|ADQ79633.1| hypothetical protein Palpr_1487 [Paludibacter propionicigenes WB4]
Length = 382
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/293 (25%), Positives = 118/293 (40%), Gaps = 76/293 (25%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGD----WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
R +LDIFRG+ V MI+V+ +G WP + HA WNG D V P FLF VG A+
Sbjct: 13 SRFTALDIFRGMTVCFMIIVNTSGNGATTYWP-LMHADWNGFTPTDLVFPSFLFAVGNAL 71
Query: 89 ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQG--GFSHAPDELTYGVDVRMIRLCGV 144
A+KR ++D + K+ RT + G L+ F + + R+ GV
Sbjct: 72 GFAMKRWDTMKQSDVLLKIFKRTALIFLIGYLMYWFPFFRLNAESHLILSPISQTRIMGV 131
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
LQRIAL Y + +L+ + + W+ +L ++ LL
Sbjct: 132 LQRIALCYGITALLVYYLGTKRTI----------------WVGVVSLLAYWVLLL----- 170
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
+G+ A+ + NAV +D +LG +H+Y +
Sbjct: 171 -------------AFGE-------AGAEFSKTGNAVLRLDIWLLGTHHLYGGEGF----- 205
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
PF+PEG+LS++ ++ + I G G + KG
Sbjct: 206 ---------------------PFDPEGVLSTLPALFNVIAGFAVGRYLQQQKG 237
>gi|418752318|ref|ZP_13308585.1| putative membrane protein [Leptospira santarosai str. MOR084]
gi|409967313|gb|EKO35143.1| putative membrane protein [Leptospira santarosai str. MOR084]
Length = 363
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 89/326 (27%), Positives = 135/326 (41%), Gaps = 88/326 (26%)
Query: 37 SLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
SLD+FRG+ V MILV++ G W I HA WNGC D V PFFLF VG +I ++L
Sbjct: 2 SLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISL 60
Query: 93 --KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQRI 148
K +R+D + R+ L+ G+ L G +S A +R+ GVLQRI
Sbjct: 61 YSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-----------LRIPGVLQRI 109
Query: 149 ALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
Y +V SL +F+ + + FS+ L W++ L P
Sbjct: 110 GFVYWVVASLCLVFS------GKKILVFSVPILLIHTWILTQIAL------------PG- 150
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
+ + + D G +IDR + G H+ WR SK
Sbjct: 151 ESVVSLEQGKDIG--------------------AWIDRTIFGEKHL-----WRFSKT--- 182
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVT 327
++PEG LS V+S+++T+ GV G ++ + L +
Sbjct: 183 -------------------WDPEGFLSGVASVVTTLFGVLCGFILFLRERRNKILGLGIL 223
Query: 328 MGFALLIFGLTLHFTNGEHGSGKFST 353
F L++ L+L N +G +S
Sbjct: 224 FSFVGLLWDLSLP-MNKSLWTGSYSV 248
>gi|317478517|ref|ZP_07937676.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
gi|316905331|gb|EFV27126.1| hypothetical protein HMPREF1007_00792 [Bacteroides sp. 4_1_36]
Length = 394
Score = 76.6 bits (187), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 84/152 (55%), Gaps = 22/152 (14%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
++R+ +LDI RG+ +A MI+V++ G W I HA WNG D V PFF+FI+G++
Sbjct: 7 SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65
Query: 88 IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
++LK+ A K++ RT+ + G+ + G FS APD+L++G +
Sbjct: 66 TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124
Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTK 163
IR+ GV+QR+AL Y S++ + K
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMK 156
>gi|160887858|ref|ZP_02068861.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
gi|156862688|gb|EDO56119.1| hypothetical protein BACUNI_00261 [Bacteroides uniformis ATCC 8492]
Length = 394
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 84/152 (55%), Gaps = 22/152 (14%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
++R+ +LDI RG+ +A MI+V++ G W I HA WNG D V PFF+FI+G++
Sbjct: 7 SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65
Query: 88 IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
++LK+ A K++ RT+ + G+ + G FS APD+L++G +
Sbjct: 66 TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124
Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTK 163
IR+ GV+QR+AL Y S++ + K
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMK 156
>gi|421090259|ref|ZP_15551055.1| putative membrane protein [Leptospira kirschneri str. 200802841]
gi|421129053|ref|ZP_15589263.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
gi|410000994|gb|EKO51618.1| putative membrane protein [Leptospira kirschneri str. 200802841]
gi|410359757|gb|EKP06816.1| putative membrane protein [Leptospira kirschneri str. 2008720114]
Length = 369
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 75/289 (25%), Positives = 121/289 (41%), Gaps = 79/289 (27%)
Query: 38 LDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL-- 92
+D+FRG+ VA MILV++ G + + HA WNGC D V PFFLF VG++I L++
Sbjct: 1 MDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVYS 60
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K ++ + R++ L+ G+ L + EL R+ GVLQRI Y
Sbjct: 61 KNKIHKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL---------RIPGVLQRIGFVY 111
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
+V+ + + R+ W+ +L+V+ +L P
Sbjct: 112 WIVASLHLILPK--------------RMILISWI---PILLVHTWVLIQIPAPG------ 148
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
+S Y L P + +IDR V G NH+ W+ SK
Sbjct: 149 --ESIVY-------------LEPGKDIGAWIDRNVFGENHL-----WKFSKT-------- 180
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
++PEGL S +SSI ++++GV G ++ + +
Sbjct: 181 --------------WDPEGLFSGISSIATSLLGVFCGSILSSKTNEIKK 215
>gi|423304873|ref|ZP_17282872.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
CL03T00C23]
gi|423310012|ref|ZP_17287996.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
CL03T12C37]
gi|392682836|gb|EIY76175.1| hypothetical protein HMPREF1072_01812 [Bacteroides uniformis
CL03T00C23]
gi|392683302|gb|EIY76639.1| hypothetical protein HMPREF1073_02746 [Bacteroides uniformis
CL03T12C37]
Length = 394
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 84/152 (55%), Gaps = 22/152 (14%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
++R+ +LDI RG+ +A MI+V++ G W I HA WNG D V PFF+FI+G++
Sbjct: 7 SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65
Query: 88 IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
++LK+ A K++ RT+ + G+ + G FS APD+L++G +
Sbjct: 66 TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124
Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTK 163
IR+ GV+QR+AL Y S++ + K
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMK 156
>gi|194367192|ref|YP_002029802.1| hypothetical protein Smal_3420 [Stenotrophomonas maltophilia
R551-3]
gi|194349996|gb|ACF53119.1| conserved hypothetical protein [Stenotrophomonas maltophilia
R551-3]
Length = 355
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 50/139 (35%), Positives = 78/139 (56%), Gaps = 16/139 (11%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL S+D RG+ VA M+LV++ G DW + H+ W+GC D V PFFLF+VGV++
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65
Query: 89 ALALK-RIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
A ++ R D R + V+ R L++L G LL + + +D R+ GV
Sbjct: 66 AFSVAPRALDVALRPALARGVLERALRILVAGALLH-------LLIWWALDTHHFRIWGV 118
Query: 145 LQRIALSYLLVSLVEIFTK 163
LQRIA+ LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137
>gi|346224087|ref|ZP_08845229.1| hypothetical protein AtheD1_02868 [Anaerophaga thermohalophila DSM
12881]
Length = 369
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 47/138 (34%), Positives = 77/138 (55%), Gaps = 12/138 (8%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K++R +LD+ RG+ +ALMI V+ G + + H+ W+GC D V PFFLF+VGV+
Sbjct: 3 KSERYLALDVLRGMTIALMITVNTPGSWQYVYAPLRHSSWHGCTPTDLVFPFFLFVVGVS 62
Query: 88 IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
+ + + + + + ++ RTL + G+ L + P +T D +R+ GVL
Sbjct: 63 MFFSFAKYGNTLNKASFNRLGRRTLLIFAIGLFL----NSFPQWMT---DYSSLRIMGVL 115
Query: 146 QRIALSYLLVSLVEIFTK 163
QRIAL+Y SL+ + K
Sbjct: 116 QRIALAYGFASLIVLSMK 133
>gi|270295536|ref|ZP_06201737.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274783|gb|EFA20644.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 394
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 52/152 (34%), Positives = 84/152 (55%), Gaps = 22/152 (14%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
++R+ +LDI RG+ +A MI+V++ G W I HA WNG D V PFF+FI+G++
Sbjct: 7 SKRILALDILRGVTIAGMIMVNNPG-SWGHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 65
Query: 88 IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV-- 136
++LK+ A K++ RT+ + G+ + G FS APD+L++G +
Sbjct: 66 TYISLKKYNFEFSHAAGMKILKRTIVIFLIGMAI-GWFSRFCYYWASAPDDLSFGEKLWA 124
Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTK 163
IR+ GV+QR+AL Y S++ + K
Sbjct: 125 SVWTFDRIRILGVMQRLALCYGAASIIALTMK 156
>gi|224025514|ref|ZP_03643880.1| hypothetical protein BACCOPRO_02254, partial [Bacteroides
coprophilus DSM 18228]
gi|224018750|gb|EEF76748.1| hypothetical protein BACCOPRO_02254 [Bacteroides coprophilus DSM
18228]
Length = 298
Score = 76.3 bits (186), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 48/138 (34%), Positives = 80/138 (57%), Gaps = 11/138 (7%)
Query: 31 KTQRLASLDIFRGLAVALMILVD-HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+RL S+DIFRG+ + MILV+ AGG + + H P G +AD V P F+FI+G ++
Sbjct: 15 NMKRLLSIDIFRGITIFFMILVNTQAGGSFDFLIHIPGYGWRIADLVYPSFIFIMGASMY 74
Query: 90 LALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
L++++ + D K + RT+ + GI+ F+ P + ++ +R+ GVLQR
Sbjct: 75 LSMRKYVEAPPTDLYKHIFRRTVLIFLMGII----FNWIP----FDQNLLDVRILGVLQR 126
Query: 148 IALSYLLVSLVEIFTKDV 165
IA+ YL+ SL+ I + +
Sbjct: 127 IAIVYLICSLLVIKVRSI 144
>gi|340786861|ref|YP_004752326.1| protein involved in N-acetyl-D-glucosamine utilization [Collimonas
fungivorans Ter331]
gi|340552128|gb|AEK61503.1| protein involved in N-Acetyl-D-glucosamine utilization [Collimonas
fungivorans Ter331]
Length = 373
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 62/199 (31%), Positives = 96/199 (48%), Gaps = 36/199 (18%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
T+RLAS+D RG VA M+LV+ G DW + H+ W+GC D V PFFLF+VGV+
Sbjct: 22 TRRLASVDALRGCTVAAMLLVNDPG-DWSHVYAPLEHSAWHGCTPTDLVFPFFLFVVGVS 80
Query: 88 IALALK-RIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM-IRLC 142
AL ++ R+ A+ + + R L+++ G+L+ + L + + + +RL
Sbjct: 81 TALGIEPRLAQGANPSTLARAALIRALRIVALGLLI--------NLLAWFIMPGVHLRLP 132
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL-YG 201
GVLQRI L + +L I+T+ W + +L+ Y LL G
Sbjct: 133 GVLQRIGLCFAATALCSIYTRPRT-----------------QWGLIVAILLGYWGLLTLG 175
Query: 202 TYVPDWQFTIINKDSADYG 220
+ W DSA +G
Sbjct: 176 GSLEPWLNLASRSDSALFG 194
>gi|239907232|ref|YP_002953973.1| hypothetical protein DMR_25960 [Desulfovibrio magneticus RS-1]
gi|239797098|dbj|BAH76087.1| hypothetical membrane protein [Desulfovibrio magneticus RS-1]
Length = 371
Score = 75.9 bits (185), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 54/130 (41%), Positives = 73/130 (56%), Gaps = 21/130 (16%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
RL+S+D RGLA+A MI+V++ G +P++ HA W+G LAD V P FLF+VGV +AL
Sbjct: 8 RLSSVDTLRGLAIAAMIVVNNPGDRRFVYPQLLHAQWHGLTLADVVFPLFLFLVGVCVAL 67
Query: 91 ALKRIPD-------RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
A+ PD RA +K++ R L G+ + DEL RL G
Sbjct: 68 AID--PDKPRDAEARARLWRKILPRAAVLFALGLGENAYLRLSFDEL---------RLPG 116
Query: 144 VLQRIALSYL 153
VLQRIA+ YL
Sbjct: 117 VLQRIAVVYL 126
>gi|374311063|ref|YP_005057493.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358753073|gb|AEU36463.1| protein of unknown function DUF1624 [Granulicella mallensis
MP5ACTX8]
Length = 435
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 87/305 (28%), Positives = 126/305 (41%), Gaps = 79/305 (25%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP----EISHAPWNGCNLADFVMP 78
Q E++ K R+ S+D+ RG+ +ALMILV+ G DW ++ HA WNG L D V P
Sbjct: 38 SQTERTVSKPGRVLSVDVLRGITIALMILVNDPG-DWDHIFGQLDHAAWNGWTLTDMVFP 96
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT-YGVDV- 136
FLF++G +I +L+ R + G L F+ A L Y V
Sbjct: 97 AFLFLMGASIIFSLQARIARGNCK-------------GTLAGHIFARAGKILALYWVLAF 143
Query: 137 --RM---IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
RM IR GVL RIAL YLL SLV + T+ V+ V A +
Sbjct: 144 FPRMHWTIRWFGVLPRIALCYLLASLVLLATRRVRVLIAIV----------------AFL 187
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
LV Y LL VP + D ++ N +IDR V +
Sbjct: 188 LVGYWVLLRWVPVPG-----LGTPMRDI-----------PFMDQNANLASWIDRGVSSWS 231
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
+ H G L + +PEGLLS++ ++ +T++G G
Sbjct: 232 LRWLH---------------TGTLYRKTR-------DPEGLLSTLPAVATTLLGALAGMW 269
Query: 312 IIHTK 316
+I+ +
Sbjct: 270 MINGQ 274
>gi|389799428|ref|ZP_10202419.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter sp. 116-2]
gi|388442725|gb|EIL98904.1| protein involved in N-acetyl-D-glucosamine utilization
[Rhodanobacter sp. 116-2]
Length = 353
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 66/198 (33%), Positives = 91/198 (45%), Gaps = 34/198 (17%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+RLASLD RG VA M+LV+ G DW + HA W+GC D V FFLF+VGV++
Sbjct: 2 KRLASLDALRGCTVAAMLLVNDPG-DWGHVYWPLEHAAWHGCTPTDLVFSFFLFVVGVSV 60
Query: 89 ALA-LKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
ALA L R+ A + +R L++L G+ + + + +R GV
Sbjct: 61 ALAILPRLEQGAAPSALTRAATWRALRILALGVAIN-------LLAAWLLPQAHLRFPGV 113
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL-YGTY 203
LQRIAL + V+L I TK W A +L+ Y LL G
Sbjct: 114 LQRIALCFAGVALFAIHTKPRT-----------------QWWAIAALLIGYWGLLRLGGS 156
Query: 204 VPDWQFTIINKDSADYGK 221
+ W DSA +G+
Sbjct: 157 LEPWTNLASRVDSAVFGR 174
>gi|29349027|ref|NP_812530.1| hypothetical protein BT_3619 [Bacteroides thetaiotaomicron
VPI-5482]
gi|298386734|ref|ZP_06996289.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|383124379|ref|ZP_09945043.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
gi|29340934|gb|AAO78724.1| putative transmembrane protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251839125|gb|EES67209.1| hypothetical protein BSIG_3594 [Bacteroides sp. 1_1_6]
gi|298260408|gb|EFI03277.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 372
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/320 (24%), Positives = 132/320 (41%), Gaps = 87/320 (27%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+RL +LD+ RG+ +A MILV+ G + + HA W G D V PFF+FI+G++
Sbjct: 6 SNKRLLALDVMRGITIAGMILVNTPGSWQHAYAPLKHAEWIGLTPTDLVFPFFMFIMGIS 65
Query: 88 IALALKR--IPDRADAVKKVIFRTLKLLFWGI----LLQGGFSHAPDELTYGVDVRMIRL 141
++L++ A K++ RT+ + GI L F H P + IR+
Sbjct: 66 TYISLRKYNFTFSVPAGLKILKRTVIIFLIGIGISWLSILCFQHDP------FPIDQIRI 119
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GV+QR+AL Y + ++V + K + + + A +L+ Y A+L
Sbjct: 120 LGVMQRLALGYGVTAIVALLMK-----------------HKYIPYLIAVLLISYFAIL-- 160
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
+ G V++ T N + +DR VLG H+Y
Sbjct: 161 --------------ALGNGYVYDET-----------NILSIVDRAVLGQAHIYGGQI--- 192
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
+PEGLLS++S+I +IG G +++ K +
Sbjct: 193 -------------------------LDPEGLLSTISAIAHVLIGFCAGKLLMEVKDIHEK 227
Query: 322 LKQWVTMGFALLIFGLTLHF 341
L++ +G L G L +
Sbjct: 228 LERLFLIGTILTFAGFLLSY 247
>gi|359438686|ref|ZP_09228688.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
gi|359445329|ref|ZP_09235071.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
gi|358026628|dbj|GAA64937.1| hypothetical protein P20311_2740 [Pseudoalteromonas sp. BSi20311]
gi|358040838|dbj|GAA71320.1| hypothetical protein P20439_1393 [Pseudoalteromonas sp. BSi20439]
Length = 359
Score = 75.5 bits (184), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 120/295 (40%), Gaps = 79/295 (26%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
R +LD RGL +ALMILV+ G W + HA W+GC D + PFF+FI+G A+
Sbjct: 3 RYKALDAMRGLTIALMILVNTPG-SWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAMF 61
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
+ K+ A A + L+L+ G ++ A + + ++ +R+ GVLQRI
Sbjct: 62 FSFKKTNSAASASQ-----VLRLVKRGAIIF-AIGLALNIYPFTTNIENLRILGVLQRIG 115
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
++Y+L S+ +F + R + L + +LV Y LL
Sbjct: 116 IAYILASICVLF----------LNRRGVLTL-------SVIILVAYWLLL---------- 148
Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
++ G N V +D VLG +H++
Sbjct: 149 ---------------LSVGAENAYTLEHNLVRAVDIAVLGESHLWQGKG----------- 182
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
F+PEGL+S++ +++S + G ++ T A +K+
Sbjct: 183 ---------------LAFDPEGLISTLPAVVSVLFGFEVTRLLTSTSCQWASIKR 222
>gi|392555555|ref|ZP_10302692.1| hypothetical protein PundN2_08983 [Pseudoalteromonas undina NCIMB
2128]
Length = 359
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 71/295 (24%), Positives = 120/295 (40%), Gaps = 79/295 (26%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAIA 89
R +LD RGL +ALMILV+ G W + HA W+GC D + PFF+FI+G A+
Sbjct: 3 RYKALDAMRGLTIALMILVNTPG-SWSHVYAPLLHADWHGCTPTDVIFPFFMFIIGSAMF 61
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIA 149
+ K+ A A + L+L+ G ++ A + + ++ +R+ GVLQRI
Sbjct: 62 FSFKKTNSAASASQ-----VLRLVKRGAIIF-AIGLALNIYPFTTNIENLRILGVLQRIG 115
Query: 150 LSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQF 209
++Y+L S+ +F + R + L + +LV Y LL
Sbjct: 116 IAYILASICVLF----------LNRRGVLTL-------SVIILVAYWLLL---------- 148
Query: 210 TIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDS 269
++ G N V +D VLG +H++
Sbjct: 149 ---------------LSVGAENAYTLEHNLVRAVDIAVLGESHLWQGKG----------- 182
Query: 270 PFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
F+PEGL+S++ +++S + G ++ T A +K+
Sbjct: 183 ---------------LAFDPEGLISTLPAVVSVLFGFEVTRLLTSTSCQWASIKR 222
>gi|395804141|ref|ZP_10483382.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
gi|395433785|gb|EJF99737.1| hypothetical protein FF52_19760 [Flavobacterium sp. F52]
Length = 423
Score = 75.1 bits (183), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 86/356 (24%), Positives = 136/356 (38%), Gaps = 114/356 (32%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+ +RL SLD+FRG + LM +V++ G +P + HA W+GC D V PFF+FI+G
Sbjct: 1 MTKERLTSLDVFRGFTIFLMTIVNNPGSWSSIYPPLEHAEWHGCTPTDLVFPFFVFIMGT 60
Query: 87 AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQ-------------------------- 120
AI A+ K++ R+L++ G+ L
Sbjct: 61 AIPFAMPVKHFDGSVFNKILVRSLRIFCLGLFLSVFSRIHLFGLEGIPLLGLRLIVAFAV 120
Query: 121 -----GGFS-HAPDELTYGVDVRMI-------------RLCGVLQRIALSYLLVSLVEIF 161
G FS L GV + +I R+ GVLQRIA+ Y S++ +
Sbjct: 121 AYALLGNFSMKVKTILAVGVFLILISLSFSGLEHFEDTRIPGVLQRIAIVYFFTSILYLK 180
Query: 162 TKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGK 221
T K Q + + Y WL+ A +VP F N D
Sbjct: 181 T---NLKTQLLVLAGLLVGY---WLLMA-------------FVPVPGFGPANFDKG---- 217
Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
N +ID D+ G L + +
Sbjct: 218 ---------------TNLAAWID-----------------------DTLLNGHLWASSKT 239
Query: 282 WCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGL 337
W +PEG+LS++ +I + I+G++ G ++ + + +K+ G AL+I GL
Sbjct: 240 W-----DPEGILSTLPAIGTGILGMYIGQLLNLSVDKMEIVKKTAIAGTALVIGGL 290
>gi|225875032|ref|YP_002756491.1| hypothetical protein ACP_3497 [Acidobacterium capsulatum ATCC
51196]
gi|225792728|gb|ACO32818.1| putative membrane protein [Acidobacterium capsulatum ATCC 51196]
Length = 378
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 75/292 (25%), Positives = 129/292 (44%), Gaps = 80/292 (27%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD----WPEISHAPWNGCNLADFVMP 78
Q + + + +R+ S+D+ RG+ +A MILV++ G + W + HA WNG D V P
Sbjct: 2 KQHQDTVVNAKRMVSIDLLRGITIAFMILVNNNGDEAHAFW-ALKHAQWNGFTPTDLVFP 60
Query: 79 FFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
F+F+VG+++ + L+R R + R++ L G+++ GF + +G
Sbjct: 61 TFIFVVGISLVFSTEARLRRGQSRLLIAAHALRRSVILFLLGLVVN-GFPY----FHFGT 115
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
+R+ GVLQRIA+ YL SL+ + ++ V + L+ LV
Sbjct: 116 ----LRIYGVLQRIAICYLFGSLLYLLSRRVWLQA----------------LLFTTALVG 155
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
Y AL+ VP + + +D L+P N V ++DR +L
Sbjct: 156 YWALMRWVPVPG--YGLPGRDIPF--------------LDPNANLVAWLDRLLL------ 193
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
P R A T+D PEGLLS++ ++ + ++G+
Sbjct: 194 --PG--RLYAGTRD--------------------PEGLLSTIPAMGTLLLGM 221
>gi|329851309|ref|ZP_08266066.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
gi|328840155|gb|EGF89727.1| hypothetical protein ABI_41500 [Asticcacaulis biprosthecum C19]
Length = 384
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 87/321 (27%), Positives = 131/321 (40%), Gaps = 73/321 (22%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGV 86
+ R +LDI RGL++ M+L + G W E + HA W G D V P FLF +GV
Sbjct: 7 QGNRWLALDILRGLSIIFMLL-NLNPGSWSEQYGWVLHAKWEGATFIDMVAPVFLFCIGV 65
Query: 87 AIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
AI L+L+R + ++ K ++ R L+ G+ L D +R+
Sbjct: 66 AIPLSLRRRIEAGESNGQLAKHILNRAGILVLLGLFLNA---------YPAFDWAHMRIP 116
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC-WHWLMAACVLVVYLALLYG 201
GVLQRI + Y V+L +FT + FRL W+ VL+ + ALL
Sbjct: 117 GVLQRIGVCYGAVALFVLFTARREGG---------FRLNAKAGWIAWTFVLLSWTALLMF 167
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
VP +G + +P + Y+DR VL +HM+ P W
Sbjct: 168 VPVP------------GFGA---------PRFDPVGSWPAYVDRLVLTTDHMF--PWW-- 202
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
P +G F+P+GLLS+ + + G GH G A
Sbjct: 203 --------PVDG----------KVVFDPDGLLSTWPVCANVLFGALVGHA--RLTGITAP 242
Query: 322 LKQWVTMGFALLIFGLTLHFT 342
+ + + G L+ + LH T
Sbjct: 243 ILKMLVAGGLLMAAAVGLHTT 263
>gi|445497063|ref|ZP_21463918.1| putative transmembrane protein [Janthinobacterium sp. HH01]
gi|444787058|gb|ELX08606.1| putative transmembrane protein [Janthinobacterium sp. HH01]
Length = 353
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 50/144 (34%), Positives = 75/144 (52%), Gaps = 16/144 (11%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAI 88
R+ S+D RGL VA M+LV+ AG DW P + HA W+GC DF+ P F+ IVG++I
Sbjct: 1 MRINSIDAVRGLTVAAMLLVNDAG-DWSHVYPWLEHAEWHGCTPPDFIFPIFMLIVGISI 59
Query: 89 ALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
LAL D A + V+ R ++++ G+ L H L ++ R RL GV
Sbjct: 60 NLALSPRLDAGAATAPLARSVLLRAVRIVLLGLAL-----HVVAMLL--LNGRGFRLFGV 112
Query: 145 LQRIALSYLLVSLVEIFTKDVQDK 168
LQR + + L+ I + + +
Sbjct: 113 LQRTGICFAAAGLLAIHVRGARAQ 136
>gi|410463501|ref|ZP_11317013.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
Maddingley MBC34]
gi|409983383|gb|EKO39760.1| hypothetical protein B193_1525 [Desulfovibrio magneticus str.
Maddingley MBC34]
Length = 371
Score = 74.7 bits (182), Expect = 6e-11, Method: Compositional matrix adjust.
Identities = 52/136 (38%), Positives = 75/136 (55%), Gaps = 17/136 (12%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
T RL S+D RGLA+A MI+V++ G +P++ HA W+G LAD V P FLF+VGV +
Sbjct: 6 TSRLLSVDALRGLAIAAMIVVNNPGDRRFIYPQLLHAHWHGLTLADVVFPLFLFLVGVCV 65
Query: 89 ALAL-----KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
ALA+ + RA +K++ R L G+ + DEL R+ G
Sbjct: 66 ALAIDLDKARDAKGRARLWRKILPRAAVLFALGLGETAYLRLSFDEL---------RIPG 116
Query: 144 VLQRIALSYLLVSLVE 159
VLQRIA+ YL + ++
Sbjct: 117 VLQRIAVVYLAAAWLQ 132
>gi|333029673|ref|ZP_08457734.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
gi|332740270|gb|EGJ70752.1| putative transmembrane protein [Bacteroides coprosuis DSM 18011]
Length = 387
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 78/331 (23%), Positives = 134/331 (40%), Gaps = 93/331 (28%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
+ QRL +LDI RG+ +A MILV++ G W I HA WNG D V PFF+FI+G+
Sbjct: 5 ENQRLLALDILRGITIAGMILVNNPG-SWGSIYAPLGHAEWNGLTPTDLVFPFFMFIMGI 63
Query: 87 AIALALK--RIPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAP------DELT 131
+ +L+ + +A K++ RT L + ++ + L+ S A + L+
Sbjct: 64 STYFSLRKYKFEFSKEAALKILKRTIIIFAIGLGIAWFSLFLRTWNSLASADISIFERLS 123
Query: 132 YGVDV-RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
+ V +R+ GV+ R+AL+Y +++ + K VG
Sbjct: 124 QSIFVFENLRILGVMPRLALTYCATAIIALTIKHKYIPTLIVG----------------- 166
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
+L+VY F + + +Y + N + +D+ +LG
Sbjct: 167 ILIVY------------TFILFLGNGFEYNE---------------TNILSIVDKAILGE 199
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
NHMY +PEGL+S++ +I ++G G
Sbjct: 200 NHMYKDNG----------------------------IDPEGLVSTIPAIAHVLLGFFVGK 231
Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+ K ++++ MG L GL L +
Sbjct: 232 IFTEKKDIHSKVEFLFIMGSILTFVGLLLSY 262
>gi|408382946|ref|ZP_11180487.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
formicicum DSM 3637]
gi|407814484|gb|EKF85111.1| heparan-alpha-glucosaminide N-acetyltransferase [Methanobacterium
formicicum DSM 3637]
Length = 382
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 88/329 (26%), Positives = 132/329 (40%), Gaps = 83/329 (25%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEI-SHAPWNGCNL 72
++ + + + +K +R+ SLD+FRGLAVA MI V+ P I HA WNG
Sbjct: 5 VNTNSTNSTVKTNAVKKRRVISLDVFRGLAVAAMIFVNAMAFSEFTPGIFEHATWNGLTF 64
Query: 73 ADFVMPFFLFIVGVAIA--LALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
AD V P FLFIVGV++A A + + D +FR L G+ L
Sbjct: 65 ADLVFPSFLFIVGVSMAYSFAARSKNSKRDLWGHFLFRVGALFTIGVALN---------- 114
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
+ D M+R+ GVLQ IAL+ L S + F W L+A
Sbjct: 115 WFTSDFSMVRIPGVLQLIALASLFASPMARFKPR------------------WILLVAGV 156
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
+L+++ +L G P + G K N N +ID +VL +
Sbjct: 157 LLLIHGFILLGVGAP------------------GIPAGTLEKGN---NIDDWIDIQVLTV 195
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
NH T D A +PEG+LS +++ ++G+ G
Sbjct: 196 NH-------------TID----------------ANGDPEGILSIITATALVLLGLCVGR 226
Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTL 339
+ K +L + + G L+ GL L
Sbjct: 227 TLQLRKHNLKTIGILLAGGAISLLLGLAL 255
>gi|405957484|gb|EKC23691.1| Heparan-alpha-glucosaminide N-acetyltransferase [Crassostrea gigas]
Length = 1901
Score = 74.7 bits (182), Expect = 7e-11, Method: Compositional matrix adjust.
Identities = 81/329 (24%), Positives = 135/329 (41%), Gaps = 76/329 (23%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
+D+Q+ S K +RL SLD FRG F+
Sbjct: 1531 ADEQD-SRPKKERLKSLDTFRG------------------------------------FI 1553
Query: 82 FIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
+I+G A+A + L+R+ + K+ R+ L G+L+ G P LT+
Sbjct: 1554 WIMGTAMAYSFTGMLRRVTPKHKIFWKIFKRSCILFVLGLLVNTGGCD-PTRLTH----- 1607
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSV--GRFSIFRLYCWHWLMAACVLVVY 195
+R+ GVLQR A +YL+V+ + +F D G + W++ + V+
Sbjct: 1608 -LRIPGVLQRFAGTYLVVASIHMFFAKTVDVSMYTYWGFIRDIVDFWLEWILHIVFVTVH 1666
Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN--AVGYIDRKVLGINHM 253
+ + + VP I G + A + C A GYIDR+V G +H+
Sbjct: 1667 IIITFTLDVPGCGKGYIGP-----GGLHEAVNSTEASVYQNCTGGAAGYIDRQVFGDDHI 1721
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
Y P + P+ K P++PEGLL +++S+ +G+ G +++
Sbjct: 1722 YQSPTCK-------------PIYKTT-----VPYDPEGLLGTLNSVFMCYLGLQAGKILM 1763
Query: 314 HTKGHLARLKQWVTMG-FALLIFGLTLHF 341
K AR+K+++ G F LI G F
Sbjct: 1764 TFKEPSARVKRFLIWGLFLCLIAGALCGF 1792
>gi|375149723|ref|YP_005012164.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361063769|gb|AEW02761.1| Protein of unknown function DUF2261, transmembrane [Niastella
koreensis GR20-10]
Length = 368
Score = 74.3 bits (181), Expect = 9e-11, Method: Compositional matrix adjust.
Identities = 89/324 (27%), Positives = 132/324 (40%), Gaps = 84/324 (25%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+ +R SLD+FRGL + LMI+V+ G + + HA WNG LAD V P FLF VG
Sbjct: 2 IPNKRFISLDVFRGLIICLMIIVNTPGSHDTSFALLQHANWNGFTLADLVFPSFLFAVGN 61
Query: 87 AIALALK--RIPDRADAVKKVIFRT------LKLLFWGILLQGGFSHAPDELTYGVDVRM 138
A+ +++ + + + K+ RT LL+W + ++ + +
Sbjct: 62 ALFFSMQKWKTMTQGQVLAKIGKRTLLLFLLGYLLYWFPFFE---TNTQGHIVFK-SFAG 117
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLAL 198
R+ GVLQRIAL Y + SL+ + K +++A +LV Y L
Sbjct: 118 TRIMGVLQRIALCYGIASLLIYYLKPKGAL-----------------IVSAIILVAYPGL 160
Query: 199 LYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPA 258
L+ W D G KLN NAV D +LG +HM H
Sbjct: 161 LF------WL--------GDPGN----------KLNMVGNAVTKFDLWLLGPDHMNHGEV 196
Query: 259 WRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGH 318
PFEPEG+LS++ +I + + G G I T G
Sbjct: 197 --------------------------VPFEPEGILSTLPAITNVVAGYLVGWY-IQTAGK 229
Query: 319 LAR-LKQWVTMGFALLIFGLTLHF 341
R L + + G L GL ++
Sbjct: 230 TKRMLLRLIATGAGLTFLGLCWNY 253
>gi|408673387|ref|YP_006873135.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
gi|387855011|gb|AFK03108.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
Length = 423
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 60/175 (34%), Positives = 82/175 (46%), Gaps = 50/175 (28%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
QRL S+D+FRG+ + LM +V++ G DW I HA W+GC D V PFFLFIVG++
Sbjct: 3 QRLTSIDVFRGMTIMLMTIVNNPG-DWSHIYAPLEHAEWHGCTPTDLVFPFFLFIVGIST 61
Query: 89 ALA----------LKRIPDRA------------------DAVKKVIFRTLKLLFWGI--- 117
L+ +RI RA ++ V ++L+ GI
Sbjct: 62 VLSSPVKRFDSNTFERIITRALRIFLLGLFLNFFSKIHLGTLEGVPLMLIRLVLTGIATV 121
Query: 118 LLQGGFSHAPD-ELTYGVDVRMIRLC-------------GVLQRIALSYLLVSLV 158
LL G F G+ V MI LC GVLQRIA+ YL+VS++
Sbjct: 122 LLLGDFDKKKQFYAAVGLFVFMISLCFSGIEDFASVRIPGVLQRIAMVYLIVSVL 176
>gi|329964617|ref|ZP_08301671.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
12057]
gi|328525017|gb|EGF52069.1| hypothetical protein HMPREF9446_03278 [Bacteroides fluxus YIT
12057]
Length = 396
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/158 (33%), Positives = 85/158 (53%), Gaps = 24/158 (15%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
S ++R+ +LDI RG+ +A MI+V++ G W I +HA W G D V PFF+F
Sbjct: 2 NSQKTSKRILALDILRGITIAGMIMVNNPG-SWAHIYAPLAHAQWIGLTPTDLVFPFFMF 60
Query: 83 IVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYG 133
I+G++ ++LK+ A K++ RT+ + G+ + G FS APD L +G
Sbjct: 61 IMGISTYISLKKYNFEFSHAAALKILKRTVIIFLIGMAI-GWFSRFCYYWSSAPDNLGFG 119
Query: 134 VDV--------RMIRLCGVLQRIALSYLLVSLVEIFTK 163
++ RM R+ GV+QR+AL Y S++ + K
Sbjct: 120 ENLWASVWTFDRM-RILGVMQRLALCYGATSIIALTMK 156
>gi|410721825|ref|ZP_11361152.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
MBC34]
gi|410598366|gb|EKQ52947.1| hypothetical protein B655_1618 [Methanobacterium sp. Maddingley
MBC34]
Length = 372
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 55/130 (42%), Positives = 69/130 (53%), Gaps = 12/130 (9%)
Query: 33 QRLASLDIFRGLAVALMILVDHAG--GDWPEI-SHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD+FRG +A MI+V+ G D P + HA W G NLAD V PFFLFIVGV++
Sbjct: 6 DRLVSLDVFRGFTIAGMIMVNILGLYPDTPSLLQHASWIGLNLADLVFPFFLFIVGVSMN 65
Query: 90 LALKRIPDRADAVK--KVIFRTLKLLFWGILLQGGFSHAPDELTYGV-DVRMIRLCGVLQ 146
+ + K K +FR L G+ L G YGV D IR+ G+LQ
Sbjct: 66 FSFASRSKQPSWKKWGKFLFRVAALYLIGVALVFGL------FFYGVPDFSTIRIPGILQ 119
Query: 147 RIALSYLLVS 156
IALS L +
Sbjct: 120 LIALSSLFAA 129
>gi|224537871|ref|ZP_03678410.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
DSM 14838]
gi|423227284|ref|ZP_17213748.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
CL02T12C19]
gi|224520557|gb|EEF89662.1| hypothetical protein BACCELL_02758 [Bacteroides cellulosilyticus
DSM 14838]
gi|392624424|gb|EIY18516.1| hypothetical protein HMPREF1062_05934 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 373
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 49/127 (38%), Positives = 70/127 (55%), Gaps = 12/127 (9%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
RL +LD+ RG+ +A MILV++ G W + HA W+G D V PFF+FI+GVA
Sbjct: 5 NNRLLALDVIRGITIAGMILVNNPG-SWQSVYAPLQHARWHGLTPTDLVYPFFMFIMGVA 63
Query: 88 IALALKRIPDRADAVK-KVIFRTLKLLFWGILLQGGFSHAPDELTYGV-DVRMIRLCGVL 145
I +L++ V K+I RT+ L GI L +L YG +R+ GV+
Sbjct: 64 IHFSLRKFDKLNTTVSLKIIRRTVALFAVGIALD-----CFSKLCYGTFSWEHLRILGVM 118
Query: 146 QRIALSY 152
QR+AL+Y
Sbjct: 119 QRLALAY 125
>gi|295136516|ref|YP_003587192.1| hypothetical protein ZPR_4697 [Zunongwangia profunda SM-A87]
gi|294984531|gb|ADF54996.1| conserved hypothetical protein [Zunongwangia profunda SM-A87]
Length = 371
Score = 73.9 bits (180), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 53/144 (36%), Positives = 77/144 (53%), Gaps = 12/144 (8%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
+ R SLDI RG+ VALMILV++ G W I HA W+G L D V P FLF+VG
Sbjct: 1 MARSRYLSLDILRGMTVALMILVNNPG-SWATIYAPFKHAAWHGFTLTDLVFPTFLFVVG 59
Query: 86 VAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAP-----DELTYGVDVRMIR 140
A++ + K++ + + + + +T K L+ G S+ P D ++ IR
Sbjct: 60 NAMSFSFKKM--NSWSTPEFLTKTFKRAAIIFLIGLGLSYYPFVRRTDGEFILKNILDIR 117
Query: 141 LCGVLQRIALSYLLVSLVEIFTKD 164
+ GVLQRIA+ YLL ++ F K
Sbjct: 118 IMGVLQRIAVCYLLAAIAIRFLKK 141
>gi|374373358|ref|ZP_09631018.1| membrane protein [Niabella soli DSM 19437]
gi|373234331|gb|EHP54124.1| membrane protein [Niabella soli DSM 19437]
Length = 375
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 52/141 (36%), Positives = 72/141 (51%), Gaps = 21/141 (14%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
K R +LDIFRG+ + MI+V+ G + +PE+ HA WNG L D V P FLF VG AIA
Sbjct: 8 KPGRFLALDIFRGMTICFMIIVNTGGPNPFPELRHAQWNGFTLTDLVFPSFLFAVGNAIA 67
Query: 90 LALKRIPDRA--DAVKKVIFRTLKLLFWGILL----------QGGFSHAPDELTYGVDVR 137
+ + ++ + + K+I RT L G L+ Q P T
Sbjct: 68 FSKSKWDQQSNKEVLTKIIKRTCLLFLIGYLMYWLPFVKIDAQNNIRPFPIGET------ 121
Query: 138 MIRLCGVLQRIALSYLLVSLV 158
R+ GVLQRIAL Y + +L+
Sbjct: 122 --RIFGVLQRIALCYGIGALI 140
>gi|410448043|ref|ZP_11302130.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
gi|410018124|gb|EKO80169.1| putative membrane protein [Leptospira sp. Fiocruz LV3954]
gi|456875246|gb|EMF90470.1| putative membrane protein [Leptospira santarosai str. ST188]
Length = 363
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 83/285 (29%), Positives = 121/285 (42%), Gaps = 87/285 (30%)
Query: 37 SLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
SLD+FRG+ V MILV++ G W I HA WNGC D V PFFLF VG +I ++L
Sbjct: 2 SLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPFFLFAVGTSIPISL 60
Query: 93 --KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQRI 148
K +R+D + R+ L+ G+ L G +S A +R+ GVLQRI
Sbjct: 61 YSKNGINRSDIWIGICIRSANLILLGLFLNFFGEWSFAE-----------LRIPGVLQRI 109
Query: 149 ALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
Y +V SL +F K V I ++ W ++ +AL + V
Sbjct: 110 GFVYWVVASLCLVF----PGKKILVFLVPILLIHTW--------ILTQIALPGESVV--- 154
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
S + GK + +IDR + G H+ WR SK
Sbjct: 155 --------SLEQGK----------------DIGAWIDRTIFGEKHL-----WRFSKT--- 182
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
++PEG LS V+S+++T+ GV G ++
Sbjct: 183 -------------------WDPEGFLSGVASVVTTLFGVLCGFIL 208
>gi|375110537|ref|ZP_09756759.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
gi|374569481|gb|EHR40642.1| hypothetical protein AJE_11264 [Alishewanella jeotgali KCTC 22429]
Length = 394
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 137/304 (45%), Gaps = 66/304 (21%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
QQ + R+ +LD RGLA+ MILV++ G +P + HA W+G D + P F
Sbjct: 7 QQILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAF 66
Query: 81 LFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH--APDELTYGVDV 136
L +VG+AI +LA +++ +A+ +++ R LKL G+ L + + P+ +
Sbjct: 67 LVMVGMAIPYSLAGRQLLPKAEQIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKL 126
Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
+R GVLQRI + Y L+ +++ + GR WL C+L L
Sbjct: 127 LTVRWSGVLQRIGIVYFCTLLIVLYSG-------TRGRIL--------WLSGLCLLYFLL 171
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
+VP +D +YG F G+ N N ++D ++LG NH+Y
Sbjct: 172 M----QFVP-------YRD--NYGHTF---VGLWEHGN---NLAAWLDHQLLGPNHVYFR 212
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
A +PF F+PEG+LS++ +I S + GV ++ +K
Sbjct: 213 SA----------TPFA--------------FDPEGILSTLPAIASCLSGVLMAQ-LLQSK 247
Query: 317 GHLA 320
LA
Sbjct: 248 AELA 251
>gi|418676277|ref|ZP_13237561.1| putative membrane protein [Leptospira kirschneri serovar
Grippotyphosa str. RM52]
gi|418684894|ref|ZP_13246077.1| putative membrane protein [Leptospira kirschneri serovar
Grippotyphosa str. Moskva]
gi|418743212|ref|ZP_13299580.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
str. 200702274]
gi|400323423|gb|EJO71273.1| putative membrane protein [Leptospira kirschneri serovar
Grippotyphosa str. RM52]
gi|410740642|gb|EKQ85357.1| putative membrane protein [Leptospira kirschneri serovar
Grippotyphosa str. Moskva]
gi|410749503|gb|EKR06488.1| putative membrane protein [Leptospira kirschneri serovar Valbuzzi
str. 200702274]
Length = 369
Score = 73.6 bits (179), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 74/289 (25%), Positives = 120/289 (41%), Gaps = 79/289 (27%)
Query: 38 LDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL-- 92
+D+FRG+ VA MILV++ G + + HA WNGC D V PFFLF VG++I L++
Sbjct: 1 MDLFRGMTVAGMILVNNPGSWSFIYSPLKHAKWNGCTPTDLVFPFFLFAVGISIQLSVYS 60
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K ++ + R++ L+ G+ L + EL R+ GVLQRI Y
Sbjct: 61 KNKIHKSKIWFGICIRSITLILIGLFLNFFGEWSFSEL---------RIPGVLQRIGFVY 111
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
+V+ + + R+ W+ +L+V+ +L P
Sbjct: 112 WIVASLHLILPK--------------RMILISWI---PILLVHTWVLIQIPAPG------ 148
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
+S Y L P + +IDR V G N + W+ SK
Sbjct: 149 --ESIVY-------------LEPGKDIGAWIDRNVFGENRL-----WKFSKT-------- 180
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
++PEGL S +SSI ++++GV G ++ + +
Sbjct: 181 --------------WDPEGLFSGISSIATSLLGVFCGSILSSKTNEIKK 215
>gi|322436289|ref|YP_004218501.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
gi|321164016|gb|ADW69721.1| hypothetical protein AciX9_2697 [Granulicella tundricola MP5ACTX9]
Length = 389
Score = 73.6 bits (179), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 54/159 (33%), Positives = 79/159 (49%), Gaps = 19/159 (11%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNL 72
++E + D Q S RL S+D+ RGL + MILV+ AG + + + HA WNG
Sbjct: 1 MTEQALGDIQRPS-----RLLSIDLLRGLTIGFMILVNDAGSERDAYAPLQHAWWNGFTP 55
Query: 73 ADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
D V P FLF+VG+ L+L DR V ++ LFW +L + + L
Sbjct: 56 TDLVFPTFLFLVGITTVLSLGSRMDR--NVPRMT------LFWSVLRRAVLIYVVGILAS 107
Query: 133 G---VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDK 168
+ +R GVL RIAL YL+V + + +K +DK
Sbjct: 108 TFPFTHLAGMRFVGVLPRIALCYLIVGSLLLISKSWKDK 146
>gi|427781073|gb|JAA55988.1| Putative heparan-alpha-glucosaminide n-acetyltransferase
[Rhipicephalus pulchellus]
Length = 337
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 61/263 (23%), Positives = 113/263 (42%), Gaps = 32/263 (12%)
Query: 84 VGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
+GV++A+ ++ + ++ ++ + +K L+ G + L+ VD+ +R+ G
Sbjct: 1 MGVSLAMTIRSLLRKSVTRGRIFLQIVK----RSLILFGLGIMTNTLSGDVDLNTLRIPG 56
Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
VLQR+A SYL+ + V + + Y WL+A +L ++LAL +
Sbjct: 57 VLQRLAFSYLVAATVHLLFAKPHEGQLVWAPVRDVLAYWPEWLLAIPMLALHLALTFFLP 116
Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
VP+ + F G A G+IDR++ G +H+Y P R
Sbjct: 117 VPNCPQGYLGPGGLHLNSSFENCTG---------GAAGFIDRRIFGNSHIYQTPDMRH-- 165
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLK 323
D+ H P++PEG L ++SI +G+ G +++ AR+
Sbjct: 166 --VYDT--------------HLPYDPEGTLGCLTSIFLVFLGLQAGKILLTFPEWKARVI 209
Query: 324 QWVTMGFAL-LIFGLTLHFTNGE 345
+W G +I G+ +F+ E
Sbjct: 210 RWCIWGLLCGIIAGVLCNFSKEE 232
>gi|404404862|ref|ZP_10996446.1| transmembrane protein [Alistipes sp. JC136]
Length = 382
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 77/319 (24%), Positives = 132/319 (41%), Gaps = 56/319 (17%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHA---GGDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+K++RL SLD+ RG+ +A MILV++ G + + HA W+G D + PFF+FI+GV
Sbjct: 1 MKSERLLSLDVMRGMTIAAMILVNNPAVWGKAYAPLQHAFWHGMTPTDLIYPFFVFIMGV 60
Query: 87 AIALALKRIPDRA--DAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
+ +L + + A +A +++ R+ + G+LL E++Y
Sbjct: 61 SAFFSLSKRYEGAGREAFSRILRRSAVIFGVGLLL--------QEISY------------ 100
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYV 204
Y + + T ++V F FR+ +A L ALL
Sbjct: 101 -----FGYGTANFLSGQTSADATWFETVFPFRTFRIMGVLQGLALVYLFGSAALL----C 151
Query: 205 PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKA 264
++ I+ + + G L+ N + +DR VLG +H+Y
Sbjct: 152 LRFRHLIVAAGGLLILYLVLLQTGNGYSLSAD-NIIAVVDRAVLGESHLY---------- 200
Query: 265 CTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQ 324
R+ P FEPEGLLS++ I ++G G +++ + R +
Sbjct: 201 -----------REWLPDGSRLAFEPEGLLSTLPRIAQFLLGCAAGRILLANEDAPMRFGR 249
Query: 325 WVTMGFALLIFGLTLHFTN 343
G AL GL L + +
Sbjct: 250 LFAFGTALFFTGLLLQYGD 268
>gi|343083133|ref|YP_004772428.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342351667|gb|AEL24197.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 381
Score = 73.2 bits (178), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 48/126 (38%), Positives = 66/126 (52%), Gaps = 6/126 (4%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
R SLD+ RGL +ALM++V+ G + ++HA W+G L D V P FLF+VG A++
Sbjct: 12 TRYQSLDVLRGLTLALMVIVNTPGDGSTSFGPLTHADWHGLTLTDLVFPSFLFVVGNAMS 71
Query: 90 LALKRIPDRADAV--KKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQ 146
+L + + KV RT + G+LL F D D IR+ GVLQ
Sbjct: 72 FSLGKFKLKGGKAYFSKVFKRTALIFIIGLLLTAFPFFRVNDSGVVPYDFTSIRILGVLQ 131
Query: 147 RIALSY 152
RIAL Y
Sbjct: 132 RIALCY 137
>gi|427400072|ref|ZP_18891310.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
gi|425720812|gb|EKU83727.1| hypothetical protein HMPREF9710_00906 [Massilia timonae CCUG 45783]
Length = 380
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 51/136 (37%), Positives = 73/136 (53%), Gaps = 18/136 (13%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDW----PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
RL SLD FRG +A M+LV++ G DW +++HA W+G D + PFFLFI GVA+A
Sbjct: 7 RLTSLDAFRGFTIAAMVLVNNPG-DWGHLHAQLAHAAWHGWTFTDTIFPFFLFIGGVAMA 65
Query: 90 LALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVL 145
L+L R+ + + K+ R + G LL L D +R+ GVL
Sbjct: 66 LSLGRLAAAGAHKPQLLLKLAKRAALIFLIGFLL---------NLIPRFDFDSVRIPGVL 116
Query: 146 QRIALSYLLVSLVEIF 161
QRIAL LL + + ++
Sbjct: 117 QRIALCTLLAAPLVVY 132
>gi|428299602|ref|YP_007137908.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
gi|428236146|gb|AFZ01936.1| hypothetical protein Cal6303_2987 [Calothrix sp. PCC 6303]
Length = 104
Score = 72.8 bits (177), Expect = 2e-10, Method: Compositional matrix adjust.
Identities = 34/81 (41%), Positives = 49/81 (60%), Gaps = 3/81 (3%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+ RL SLD+FRG+A+A MILV++ G +P + HA W+GC D V PFFLFIVG+A
Sbjct: 10 NSNRLVSLDVFRGIAIASMILVNNPGSWDSIYPPLEHAEWHGCTPTDLVFPFFLFIVGMA 69
Query: 88 IALALKRIPDRADAVKKVIFR 108
+ + + +V +R
Sbjct: 70 MPFSFAKYTKENRPTARVYWR 90
>gi|393782159|ref|ZP_10370348.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
CL02T12C01]
gi|392674193|gb|EIY67642.1| hypothetical protein HMPREF1071_01216 [Bacteroides salyersiae
CL02T12C01]
Length = 387
Score = 72.4 bits (176), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 78/329 (23%), Positives = 134/329 (40%), Gaps = 91/329 (27%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
++RL +LDI RG+ +A MI+V++ G + + HA WNG D V PFF+FI+G++
Sbjct: 6 SKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAQWNGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKR--IPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTYGVDVRM- 138
++L++ A K++ RT L L ++ + + S + +E+++ +
Sbjct: 66 YISLRKYNFEFSHSAALKILKRTIVIFAIGLGLAWFSMFCRTWNSLSAEEISFFSRLGQS 125
Query: 139 ------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
IR+ GV+QR+AL Y ++V + K + A +L
Sbjct: 126 IWTFDHIRILGVMQRLALCYGATAIVALTMKHKHIP-----------------YLIATLL 168
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
+ Y LL + + +Y N + +DR VLG H
Sbjct: 169 IGYFILL------------VTGNGFEYNS---------------TNILSVVDRAVLGEAH 201
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
MY KD +PEGLLS++ +I +IG G ++
Sbjct: 202 MY----------------------KD------NGIDPEGLLSTIPAIAHVLIGFCVGKLL 233
Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+ K +L + +G L G L +
Sbjct: 234 MEVKDINEKLGRLFLIGTILTFLGFLLSY 262
>gi|375253854|ref|YP_005013021.1| hypothetical protein BFO_0041 [Tannerella forsythia ATCC 43037]
gi|363406758|gb|AEW20444.1| putative membrane protein [Tannerella forsythia ATCC 43037]
Length = 390
Score = 72.4 bits (176), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/302 (25%), Positives = 125/302 (41%), Gaps = 91/302 (30%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
++RL +LDI RG+ +A MI+V++ G + + HA W+G D V PFF+FI+G++
Sbjct: 8 SSRRLLALDILRGITIAGMIMVNNPGSWSFVYAPLGHAAWHGLTPTDLVFPFFMFIMGIS 67
Query: 88 IALALKR--IPDRADAVKKVIFRT--------------LKLLFWGILLQGGFSHAPDELT 131
++LK+ A++K+I RT L W L GG S
Sbjct: 68 TYISLKKYDFTFSYSAMRKIIRRTAVIFAIGLGLAWLGLTCRTWHGLADGGLSFGARLWQ 127
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
+ +R+ GV+QR+ALSY +L+ + + + +
Sbjct: 128 SVSNFGHLRILGVMQRLALSYGATALIALAIRHHR------------------------I 163
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
+ +ALL G +T++ A G +N T N + +DR VLG+N
Sbjct: 164 PYLIVALLGG-------YTVLLL--AGNGLAYNET-----------NILSIVDRAVLGVN 203
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
H Y KD EPEGLLS++ +I +IG G
Sbjct: 204 HTY----------------------KD------MGIEPEGLLSTLPAIAHVLIGFCCGRA 235
Query: 312 II 313
++
Sbjct: 236 ML 237
>gi|421097001|ref|ZP_15557700.1| putative membrane protein [Leptospira borgpetersenii str.
200901122]
gi|410800246|gb|EKS02307.1| putative membrane protein [Leptospira borgpetersenii str.
200901122]
Length = 383
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 92/328 (28%), Positives = 138/328 (42%), Gaps = 95/328 (28%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
++KS +R+ SLD+FRG+ V MILV++ G W I HA WNGC D V PF
Sbjct: 1 MEKKSTQNKERILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAEWNGCTPTDLVFPF 59
Query: 80 FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVD 135
FLF VGV+I ++L K +R+ + R++ L+ G+LL G +S A
Sbjct: 60 FLFAVGVSIPISLYSKNGINRSKVWIGICIRSISLILLGLLLNFFGEWSFAE-------- 111
Query: 136 VRMIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
+R+ GVLQRI Y V SL IF K + + ++ W
Sbjct: 112 ---LRVPGVLQRIGFVYWTVASLYLIF----PGKKVLIFLILVLLVHTW----------- 153
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
+ +P + T+ + D G +IDR ++G H+
Sbjct: 154 ----ILTHIIPPGESTVSLEQGKDIG--------------------AWIDRMIIGEKHL- 188
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIH 314
W+ SK ++PEGLLS ++SI +++ GV G ++
Sbjct: 189 ----WKFSKT----------------------WDPEGLLSGIASIATSLFGVLCGFILF- 221
Query: 315 TKGHLARLKQWVTMGFALLIFGLTLHFT 342
L++ V L IFGL FT
Sbjct: 222 -------LREGVGKNRVLGIFGLGFLFT 242
>gi|404256028|ref|ZP_10959996.1| hypothetical protein SPAM266_22806 [Sphingomonas sp. PAMC 26621]
Length = 392
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 83/290 (28%), Positives = 120/290 (41%), Gaps = 77/290 (26%)
Query: 37 SLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
+LD+ RGLAVA MILV G DW + + HA WNG LAD V P FLF VG+A+ L+
Sbjct: 2 ALDVLRGLAVAGMILVVSPG-DWGQAYVQLQHANWNGATLADMVFPTFLFSVGIALGLSF 60
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQ----GGFSHAPDELTY------------GVDV 136
R + A + LFW LL+ E TY G +
Sbjct: 61 PRRLETAGD---------RRLFWTRLLRRTALLILLGLLVEATYVWTIAAGAPYPGGPGL 111
Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
IR+ G+LQRI L Y L ++ + T +D D G I L + C++V+ +
Sbjct: 112 AHIRIPGILQRIGLCYGLAGILLLATNR-RDPD---GMIRINPLA-----IVGCIVVILI 162
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
W I + V GV L P N G++DR + H++
Sbjct: 163 GY--------WLLLI-------FVPVPGFGAGV---LTPAGNLPGFVDRTLFTEPHLWPL 204
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
+ T P A ++PEGLLS++ + + + G+
Sbjct: 205 ------GSATAARP--------------ATYDPEGLLSTLPATANVLFGI 234
>gi|338212268|ref|YP_004656323.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336306089|gb|AEI49191.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 365
Score = 72.0 bits (175), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 78/292 (26%), Positives = 119/292 (40%), Gaps = 84/292 (28%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+T RL SLD RG +A M++V+ G + + + H WNG + D V P FLF+VGV+
Sbjct: 4 QTNRLVSLDALRGFTIAAMLMVNFPGSEEYVFFTLRHTKWNGLSFTDLVAPIFLFVVGVS 63
Query: 88 IALAL-KRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
I A KR D + +K+I R+LK+ G+ L L D IR G
Sbjct: 64 IVFAYSKRKWDGRPTGELYRKIIIRSLKIFAVGMFLN---------LMPTFDFSDIRWTG 114
Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
L RIA +L +++ + + K Q+ W+ A ++ +LAL T
Sbjct: 115 TLHRIAFVFLGCAVLYL---NTNWKQQA-------------WVGAVILVAYWLAL---TL 155
Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
+P + GKV L P N V + D + L
Sbjct: 156 IP----------TPGIGKV---------MLEPGVNLVAWFDTQFL--------------- 181
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
P + +W +PE +LS+ SI+S I G+ G ++ T
Sbjct: 182 ----------PGKMWQGTW-----DPESILSTFPSIVSGITGMLAGQLLQST 218
>gi|380693406|ref|ZP_09858265.1| hypothetical protein BfaeM_05407 [Bacteroides faecis MAJ27]
Length = 371
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 77/320 (24%), Positives = 130/320 (40%), Gaps = 87/320 (27%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+RL +LD+ RG+ +A MILV+ G + + HA W G D V PFF+FI+G++
Sbjct: 5 SNKRLLALDVMRGITIAGMILVNTPGSWQHTYAPLKHAEWIGLTPTDLVFPFFMFIMGIS 64
Query: 88 IALALKR--IPDRADAVKKVIFRTLKLLFWGI----LLQGGFSHAPDELTYGVDVRMIRL 141
++L++ A K++ RT+ + GI L F H P + IR+
Sbjct: 65 TYISLRKYDFTFSIPAGLKILKRTVIIFLIGIGISWLSILCFQHDP------FPIDQIRI 118
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GV+QR+AL Y + +L + K + + + +L+ Y +L
Sbjct: 119 LGVMQRLALGYGITALAALLIK-----------------HKYIPYLITVLLIGYFMIL-- 159
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
+ G V++ T N + +DR VLG H+Y
Sbjct: 160 --------------AVGNGYVYDET-----------NVLSIVDRAVLGQAHIYG------ 188
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
A +PEGLLS++S++ +IG G +++ K +
Sbjct: 189 ----------------------GAILDPEGLLSTISAVAHVMIGFCAGKLLMEVKDIHEK 226
Query: 322 LKQWVTMGFALLIFGLTLHF 341
L++ +G L G L +
Sbjct: 227 LERLFLIGTILTFAGFLLSY 246
>gi|408821750|ref|ZP_11206640.1| hypothetical protein PgenN_01470 [Pseudomonas geniculata N1]
Length = 355
Score = 72.0 bits (175), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 45/139 (32%), Positives = 75/139 (53%), Gaps = 16/139 (11%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL S+D RG+ VA M+LV++ G DW + H+ W+GC D V PFFLF+VGV++
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65
Query: 89 ALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
A ++ + R + V+ R L++L + + + +D R+ GV
Sbjct: 66 AFSVAPRALDVSARPALARGVLERALRILL-------AGALLHLLIWWALDTHHFRIWGV 118
Query: 145 LQRIALSYLLVSLVEIFTK 163
LQRIA+ LV ++ ++ +
Sbjct: 119 LQRIAVCAALVGVLAVYAR 137
>gi|393786264|ref|ZP_10374400.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
CL02T12C05]
gi|392659893|gb|EIY53510.1| hypothetical protein HMPREF1068_00680 [Bacteroides nordii
CL02T12C05]
Length = 387
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 75/329 (22%), Positives = 133/329 (40%), Gaps = 91/329 (27%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
++RL +LDI RG+ +A MI+V++ G + + HA WNG D V PFF+FI+G++
Sbjct: 6 SKRLLALDILRGITIAGMIMVNNPGSWSYVYAPLGHAKWNGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKR--IPDRADAVKKVIFRT-------LKLLFWGILLQGGFSHAPDELTY----GVD 135
++L++ A K++ RT L + ++ + + S + +E+++ G
Sbjct: 66 YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLSGEEISFLSRLGQS 125
Query: 136 VRM---IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
V IR+ GV+QR+AL Y +++ + K
Sbjct: 126 VWTFDHIRILGVMQRLALCYGATAIIALTMKH---------------------------- 157
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
Y+ L T + + +I + +Y N + +DR VLG H
Sbjct: 158 -KYIPYLIVTLLAGYFILLITGNGFEYND---------------TNILSVVDRAVLGEAH 201
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
MY +PEGLLS++ +I +IG G ++
Sbjct: 202 MYKDNG----------------------------IDPEGLLSTIPAIAHVLIGFCVGKLL 233
Query: 313 IHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+ K +L++ +G L G L +
Sbjct: 234 MEVKDINEKLERLFLIGTILTFLGFLLSY 262
>gi|282877735|ref|ZP_06286550.1| putative membrane protein [Prevotella buccalis ATCC 35310]
gi|281300307|gb|EFA92661.1| putative membrane protein [Prevotella buccalis ATCC 35310]
Length = 403
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 80/301 (26%), Positives = 125/301 (41%), Gaps = 94/301 (31%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
+++R+ ++DI RG+ +A MILV++ G W I HA WNG D V PFF+F++G+
Sbjct: 8 QSKRILAIDILRGITIAGMILVNNPG-SWAHIFAPLEHAEWNGMTPTDLVFPFFMFVMGM 66
Query: 87 AIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDELTYGVDV- 136
I +++++ + V K+I RTL L GI + G FS ++ T G +
Sbjct: 67 CIFISMQKYQFACNRQTVYKIIRRTLLLYLVGIFV-GWFSRFCYRWAFPLEDATLGQQIW 125
Query: 137 ------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
IRL GVL R+A+ Y + +L+ I + R+
Sbjct: 126 HTVWSFDTIRLSGVLARLAICYGITALLAITVRH---------RY--------------- 161
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
+L + + LL G +TI+ + CG N + +DR VL
Sbjct: 162 LLSIVITLLIG-------YTIL------------LFCG-NGFAYDETNILSIVDRAVLTD 201
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
HMYH +PEGLLS+ +I T+IG G
Sbjct: 202 AHMYHDNG----------------------------IDPEGLLSTFPAIAHTLIGFLIGK 233
Query: 311 V 311
+
Sbjct: 234 L 234
>gi|319900285|ref|YP_004160013.1| hypothetical protein Bache_0400 [Bacteroides helcogenes P 36-108]
gi|319415316|gb|ADV42427.1| putative transmembrane protein [Bacteroides helcogenes P 36-108]
Length = 396
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 76/302 (25%), Positives = 127/302 (42%), Gaps = 94/302 (31%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
++R+ +LDI RG+ +A MI+V++ G +W I HA W G D V PFF+FI+G++
Sbjct: 7 SKRILALDILRGITIAGMIMVNNPG-NWGHIYAPLEHAEWIGLTPTDLVFPFFMFIMGIS 65
Query: 88 IALALKRIP---DRADAVK-----KVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV--- 136
++LK+ R+ A+K +IF + W L ++ AP EL++G ++
Sbjct: 66 TYISLKKYDFEFSRSAALKILKRTAIIFLIGLAIGWFARLCYYWAAAPGELSFGENLWAS 125
Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
RM R+ GV+QR+AL Y S++ + K +L+A +
Sbjct: 126 VWTFDRM-RILGVMQRLALCYGATSIIALTMKHRHIP----------------YLIAGLL 168
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
+ ++ L+ G G +N T N + +DR VL
Sbjct: 169 ISYFILLMCGN-----------------GFAYNET-----------NILSVVDRAVLTPA 200
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
HMY +PEGLLS++ SI ++G G +
Sbjct: 201 HMYKDNG----------------------------IDPEGLLSTIPSIAHVLLGFCVGRM 232
Query: 312 II 313
++
Sbjct: 233 ML 234
>gi|424795356|ref|ZP_18221218.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
gi|422795515|gb|EKU24196.1| N-acetylglucosaminidase [Xanthomonas translucens pv. graminis
ART-Xtg29]
Length = 1105
Score = 71.6 bits (174), Expect = 5e-10, Method: Compositional matrix adjust.
Identities = 50/131 (38%), Positives = 71/131 (54%), Gaps = 5/131 (3%)
Query: 33 QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+R SLD+FRGL + LMILV+ AG D + ++ H PW G AD V P FLF VG A++
Sbjct: 737 ERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAMS 796
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQR 147
AL R ++++ R+ + G L+ H D + + R+ GVLQR
Sbjct: 797 FALDRGQPLGAFLRRIGKRSALIFLLGFLMYWFPFVHHGADGSWSFIAIDQTRVPGVLQR 856
Query: 148 IALSYLLVSLV 158
IAL Y L +L+
Sbjct: 857 IALCYALGALL 867
>gi|344208862|ref|YP_004794003.1| hypothetical protein [Stenotrophomonas maltophilia JV3]
gi|343780224|gb|AEM52777.1| Protein of unknown function DUF2261, transmembrane
[Stenotrophomonas maltophilia JV3]
Length = 360
Score = 71.6 bits (174), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 48/147 (32%), Positives = 78/147 (53%), Gaps = 20/147 (13%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLF 82
K + +RLAS+D RG+ VA M+LV++ G DW + H+ W+GC D V PFFLF
Sbjct: 6 KGSMPPRRLASIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLF 64
Query: 83 IVGVAIALALKRIPDRADAVKK------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
+VGV++A ++ P DA + V+ R L++L + + + +
Sbjct: 65 LVGVSMAFSVA--PRALDAAARPALARGVLERALRILL-------AGALLHLLIWWALHT 115
Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTK 163
R+ GVLQRIA+ LV ++ ++ +
Sbjct: 116 HHFRIWGVLQRIAVCAALVGVLAVYAR 142
>gi|397171248|ref|ZP_10494657.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
gi|396087147|gb|EJI84748.1| hypothetical protein AEST_24230 [Alishewanella aestuarii B11]
Length = 394
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 82/304 (26%), Positives = 137/304 (45%), Gaps = 66/304 (21%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
QQ + R+ +LD RGLA+ MILV++ G +P + HA W+G D + P F
Sbjct: 7 QQILTQQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAF 66
Query: 81 LFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH--APDELTYGVDV 136
L +VG+AI +LA +++ +A+ +++ R LKL G+ L + + P+ +
Sbjct: 67 LVMVGMAIPYSLAGRQLLPKAELIRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKL 126
Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
+R GVLQRI + Y ++ +++ + GR WL C+L L
Sbjct: 127 LTVRWSGVLQRIGIVYFCTLVIVLYSG-------TRGRIL--------WLSGLCLLYFLL 171
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
+VP +D +YG F G+ N N ++D ++LG NH+Y
Sbjct: 172 M----QFVP-------YRD--NYGHTF---VGLWEHGN---NLAAWLDHQLLGPNHVYFR 212
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
A +PF F+PEG+LS++ +I S + GV ++ ++
Sbjct: 213 SA----------TPFA--------------FDPEGILSTLPAIASCLSGVLMAQ-LLQSQ 247
Query: 317 GHLA 320
LA
Sbjct: 248 AELA 251
>gi|359728547|ref|ZP_09267243.1| hypothetical protein Lwei2_17159 [Leptospira weilii str.
2006001855]
Length = 383
Score = 71.2 bits (173), Expect = 7e-10, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 133/322 (41%), Gaps = 84/322 (26%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
++KS R+ SLD+FRG+ V MILV++ G W I HA WNGC D V PF
Sbjct: 1 MEKKSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAKWNGCTPTDLVFPF 59
Query: 80 FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
FLF VG +I ++L K DR+ + R++ L+ G+LL + EL
Sbjct: 60 FLFAVGGSIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSFAEL------- 112
Query: 138 MIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
R+ GVLQRI Y +V SL IF + V F I L W++
Sbjct: 113 --RIPGVLQRIGFVYWVVASLYLIF------PGKKVLIFLIPVLLVHTWILTHI------ 158
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
P + + + D G +IDR ++G H+
Sbjct: 159 -------APPGEAMVSLEQGKDIG--------------------AWIDRVIIGEKHL--- 188
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-IHT 315
W+ SK ++PEGLLS V+SI +++ GV G ++ +
Sbjct: 189 --WKFSKT----------------------WDPEGLLSGVASIATSLFGVLCGFILFLRE 224
Query: 316 KGHLARLKQWVTMGFALLIFGL 337
G +R+ +GF GL
Sbjct: 225 GGGRSRVLSTFGLGFLFTFVGL 246
>gi|326801867|ref|YP_004319686.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326552631|gb|ADZ81016.1| hypothetical protein Sph21_4499 [Sphingobacterium sp. 21]
Length = 376
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 77/304 (25%), Positives = 120/304 (39%), Gaps = 77/304 (25%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGD----WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
R +LD+FRG+ + MI+V+ G WP ++HA W+G D V P FLF VG A++
Sbjct: 7 RFTALDVFRGMTICFMIIVNSPGSGATPYWP-LNHATWHGFTPTDLVFPSFLFAVGNALS 65
Query: 90 LALKRIPD-RADAVKKVIFRTLKLLF-WGILLQ--GGFSHAPDELTYGVDVRMIRLCGVL 145
+ ++ + V IF+ L+F G L+ F + R+ GVL
Sbjct: 66 FSERKFQYLSSKQVLLTIFKRAALIFLLGFLMYWFPFFKITEQHEIISFPLHETRVFGVL 125
Query: 146 QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVP 205
QRIAL YL +L + + R Y WL A +L+ ++ LL
Sbjct: 126 QRIALCYLFTALAVYYVR---------------RKYL-VWLAIALLLIYWVILL------ 163
Query: 206 DWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKAC 265
G A + NA+ +D +LG +H+YH
Sbjct: 164 --------------------IFGTDAPYSLEGNAIFKLDLWLLGESHLYHSHG------- 196
Query: 266 TQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
F+PEGLLS++ +I + I G G + G + L +
Sbjct: 197 -------------------IIFDPEGLLSTIPAITNAIAGYLVGKYLQEKGGTVQSLGKL 237
Query: 326 VTMG 329
+ +G
Sbjct: 238 LIIG 241
>gi|417780880|ref|ZP_12428636.1| PF07786 family protein [Leptospira weilii str. 2006001853]
gi|410778851|gb|EKR63473.1| PF07786 family protein [Leptospira weilii str. 2006001853]
Length = 383
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 91/322 (28%), Positives = 133/322 (41%), Gaps = 84/322 (26%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPF 79
++KS R+ SLD+FRG+ V MILV++ G W I HA WNGC D V PF
Sbjct: 1 MEKKSTQNKDRILSLDLFRGMTVIGMILVNNPG-SWSYIYSPLKHAKWNGCTPTDLVFPF 59
Query: 80 FLFIVGVAIALAL--KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
FLF VG +I ++L K DR+ + R++ L+ G+LL + EL
Sbjct: 60 FLFAVGGSIPISLYSKNGIDRSRIWVGICKRSVNLILLGLLLNFFGEWSFAEL------- 112
Query: 138 MIRLCGVLQRIALSYLLV-SLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
R+ GVLQRI Y +V SL IF + V F I L W++
Sbjct: 113 --RIPGVLQRIGFVYWVVASLYLIF------PGKKVLIFLIPVLLVHTWILTHI------ 158
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
P + + + D G +IDR ++G H+
Sbjct: 159 -------APPGEAMVSLEQGKDIG--------------------AWIDRVIIGEKHL--- 188
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-IHT 315
W+ SK ++PEGLLS V+SI +++ GV G ++ +
Sbjct: 189 --WKFSKT----------------------WDPEGLLSGVASIATSLFGVLCGFILFLRE 224
Query: 316 KGHLARLKQWVTMGFALLIFGL 337
G +R+ +GF GL
Sbjct: 225 GGGRSRVLSTFGLGFLFTFVGL 246
>gi|383643230|ref|ZP_09955636.1| hypothetical protein SeloA3_10334 [Sphingomonas elodea ATCC 31461]
Length = 382
Score = 71.2 bits (173), Expect = 8e-10, Method: Compositional matrix adjust.
Identities = 84/317 (26%), Positives = 134/317 (42%), Gaps = 88/317 (27%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
R +LD+FRG + LMILV+ +G +P++ HA W G LAD V P FLF +G A++
Sbjct: 17 RFLALDVFRGATIFLMILVNTSGPGAEPYPQLVHAKWIGFTLADLVFPTFLFAMGNAMSF 76
Query: 91 ALKRI----PDRADAVKK--VIFRTLKLLFW-GILLQG--GFSHAPDELTYGVDVRMIRL 141
A ++ P A ++ +IF L++W + QG G++ P LT R+
Sbjct: 77 AFRKPVATGPFLARLFRRGAIIFVLGYLMYWFPFVEQGPDGWALKPFALT--------RV 128
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYG 201
GVLQR+AL Y+L L+ WL +L+ +A+L G
Sbjct: 129 PGVLQRLALCYVLAGLMI------------------------RWLKPRQLLLAGIAMLLG 164
Query: 202 TYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRR 261
W ++ + G F+ + + ID +LG H+Y
Sbjct: 165 Y----WTILLVFSPA---GMAFDKYANIGTQ----------IDLWLLGPGHLY------- 200
Query: 262 SKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
+KD A F+PEGLL ++ + ++ I G G I+
Sbjct: 201 --------------KKD------AGFDPEGLLGTLPATVNVIAGYLAGLAIVQGGDLRRT 240
Query: 322 LKQWVTMGFALLIFGLT 338
+ + +G AL++ GL
Sbjct: 241 VGRMALVGAALVLAGLA 257
>gi|395761203|ref|ZP_10441872.1| hypothetical protein JPAM2_05565 [Janthinobacterium lividum PAMC
25724]
Length = 373
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 51/137 (37%), Positives = 73/137 (53%), Gaps = 9/137 (6%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
+ +QR +LD+ RGL VALMI+V+ G DW + HA W+G L D V P FLF+VG
Sbjct: 3 IASQRYLALDVLRGLTVALMIVVNTPG-DWGSVYAPFLHAEWHGFTLTDLVFPSFLFVVG 61
Query: 86 VAIALALKRIPDRADA--VKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRL 141
A+A L + + A + K+ R+ + G LL F D + + R+
Sbjct: 62 NALAFVLGKYENLAHGAVLAKLCKRSALIFLLGFLLYWFPFFKIDDAGQFAWSSLSQTRI 121
Query: 142 CGVLQRIALSYLLVSLV 158
GVLQRIA+ YL +L+
Sbjct: 122 PGVLQRIAVCYLAAALI 138
>gi|297567057|ref|YP_003686029.1| hypothetical protein [Meiothermus silvanus DSM 9946]
gi|296851506|gb|ADH64521.1| Protein of unknown function DUF2261, transmembrane [Meiothermus
silvanus DSM 9946]
Length = 377
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 55/153 (35%), Positives = 80/153 (52%), Gaps = 27/153 (17%)
Query: 18 EPDVSDQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWPE---ISHAPWN-G 69
P DQQ ++ ++ RL SLD+FRGL + LM+LV++ D ++HAPW G
Sbjct: 7 NPPTQDQQTETPFPSRKTAMRLGSLDVFRGLTILLMLLVNNVALDANTPYLLTHAPWKGG 66
Query: 70 CNLADFVMPFFLFIVGVAIALAL-----KRIPD-RADAVKKVIFRTLKLLFWGILLQGGF 123
LAD V P+FL VGVAI A K +P R D K+I R++ L G+L+
Sbjct: 67 VYLADLVFPWFLLAVGVAIPFAAASFRKKNLPSWRYDL--KIIQRSIVLFGLGLLIVSSI 124
Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVS 156
+ P + +D VLQ IA++YL+ +
Sbjct: 125 ARRP---VFALD--------VLQLIAMAYLVAA 146
>gi|357628855|gb|EHJ78009.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
plexippus]
Length = 275
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 58/172 (33%), Positives = 86/172 (50%), Gaps = 34/172 (19%)
Query: 2 SEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE 61
+E+ +ET ++ +E V +S RL SLDIFRG+A+ALM +
Sbjct: 59 NELGSETR----VLTTEASVPRSPTRS-----RLRSLDIFRGIAIALM--------QANK 101
Query: 62 ISHAPWNGCNLADFVMPFFLFIVGVAIALALK-----RIPDRADAVKKVIFRTLKLLFWG 116
SHA WNG +AD V P+F F +G A+ L+L +P R +A+ +V R+L L G
Sbjct: 102 FSHAVWNGLTVADLVFPWFAFTMGEAMVLSLNARLRTSLP-RVNALGQVARRSLLLSLIG 160
Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVE-IFTKDVQD 167
I L + + +R GVLQR+A YL+V +E F + Q+
Sbjct: 161 ICLG----------SVNTNWSYVRFPGVLQRLAAMYLIVGSLECAFMRTSQN 202
>gi|393763917|ref|ZP_10352530.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
gi|392605231|gb|EIW88129.1| hypothetical protein AGRI_13026 [Alishewanella agri BL06]
Length = 394
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/304 (27%), Positives = 136/304 (44%), Gaps = 66/304 (21%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
QQ + R+ +LD RGLA+ MILV++ G +P + HA W+G D + P F
Sbjct: 7 QQILAKQPANRMLALDALRGLAILAMILVNNPGSWQYVYPPLLHAEWHGWTPTDLIFPAF 66
Query: 81 LFIVGVAI--ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH--APDELTYGVDV 136
L +VG+AI +LA +++ +A+ +++ R LKL G+ L + + P+ +
Sbjct: 67 LVMVGMAIPYSLAGRQMLPKAELLRQGAIRALKLYLLGLFLVLFYYNFRDPNYSYLQQKL 126
Query: 137 RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYL 196
+R GVLQRI + Y L+ +++ + GR WL C+L L
Sbjct: 127 LTVRWSGVLQRIGIVYFCTLLIVLYSG-------TRGRVL--------WLSGLCLLYFLL 171
Query: 197 ALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHH 256
+VP +D +YG F G+ N N ++D VLG NH++
Sbjct: 172 M----QFVP-------YRD--NYGHTF---VGLWEHGN---NLAAWLDHHVLGPNHVFFR 212
Query: 257 PAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
A +PF F+PEG+LS++ +I S + GV ++ +K
Sbjct: 213 SA----------TPFA--------------FDPEGILSTLPAIASCLSGVLMAQ-LLQSK 247
Query: 317 GHLA 320
LA
Sbjct: 248 AELA 251
>gi|423223641|ref|ZP_17210110.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638266|gb|EIY32113.1| hypothetical protein HMPREF1062_02296 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 395
Score = 70.5 bits (171), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 138/341 (40%), Gaps = 97/341 (28%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMP 78
+ Q K++ +R+ +LDI RG+ +A MI+V++ G W I HA WNG D V P
Sbjct: 2 NSQTKTN---KRILALDILRGVTIAGMIMVNNPG-TWGHIYAPLRHAEWNGLTPTDLVFP 57
Query: 79 FFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDE 129
FF+FI+G++ ++LK+ + A K++ RT+ + G+ + G FS + +
Sbjct: 58 FFMFIMGISTYISLKKYNFKFSHAAALKILKRTIIIFLIGLAI-GWFSRFCYYWAGSHEG 116
Query: 130 LTYGVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
L++G + IR+ GV+QR+AL Y +++ + K
Sbjct: 117 LSFGEQLWASVWTFDRIRILGVMQRLALCYGATAIIALTMKHRHIP-------------- 162
Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
+ A +LV Y LL CG N N +
Sbjct: 163 ---YLIATLLVGYFILL--------------------------MCGNGFAYNET-NILSI 192
Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
+DR +L HMY KD +PEGLLS++ SI
Sbjct: 193 VDRAILTPAHMY----------------------KD------NGIDPEGLLSTIPSIAHV 224
Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTN 343
++G G +++ + +R + L + G L FT
Sbjct: 225 LLGFCVGRMMLDSNKAESREALLNSHLIKLFLVGAILTFTG 265
>gi|399088486|ref|ZP_10753550.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
gi|398030770|gb|EJL24173.1| hypothetical protein PMI01_04686 [Caulobacter sp. AP07]
Length = 398
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 52/148 (35%), Positives = 71/148 (47%), Gaps = 8/148 (5%)
Query: 26 EKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLF 82
+ + R+ +LD+ RGLAVA MILV G + + HA W G LAD V P FLF
Sbjct: 2 DSAKAGGGRIVALDVLRGLAVAGMILVTSPGAWAHAYAPLKHAAWQGWTLADLVFPTFLF 61
Query: 83 IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
VGVAI L++ R+ A + + + ILL + P D+ +R+
Sbjct: 62 CVGVAIGLSVPRLRIGEGASAALWIKVARRTALLILLGLVLNALPR-----FDLAHLRIP 116
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQ 170
GVLQRI L Y L S + I + Q
Sbjct: 117 GVLQRIGLCYALASAICILPARAEADGQ 144
>gi|320333679|ref|YP_004170390.1| hypothetical protein [Deinococcus maricopensis DSM 21211]
gi|319754968|gb|ADV66725.1| hypothetical protein Deima_1072 [Deinococcus maricopensis DSM
21211]
Length = 376
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 73/215 (33%), Positives = 105/215 (48%), Gaps = 40/215 (18%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDW---PEISHAPW-NGCNLADFVMPFFLFIVGV 86
+ RLA+LD +RGL V LM+LV++ DW E+ HAPW G LAD V P+FLF G
Sbjct: 24 RGARLAALDAWRGLTVLLMLLVNNVALDWRTPKELMHAPWGGGATLADLVFPWFLFCAGT 83
Query: 87 AIALAL---KRIPDRADA-VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
A+ +L +R R A V+K++ RT+ L G++L +H LT+G+
Sbjct: 84 ALPFSLASARRAGVRGWALVRKLLTRTVLLYLVGVVLVSAVAH---RLTFGL-------- 132
Query: 143 GVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGT 202
GVLQ IAL+ LL VG + +AA +LV Y A+L T
Sbjct: 133 GVLQLIALASLL---------GAAGAQLRVGARMV---------LAAALLVGYAAVLLLT 174
Query: 203 YVPDWQFTII--NKDSADY-GKVFNVTCGVRAKLN 234
VP ++ +++ Y + F GVR L+
Sbjct: 175 PVPGVGAGVLEETRNAVQYLNQTFLAPLGVRGLLS 209
>gi|313147781|ref|ZP_07809974.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|423280992|ref|ZP_17259903.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
610]
gi|313136548|gb|EFR53908.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|404583442|gb|EKA88121.1| hypothetical protein HMPREF1203_04120 [Bacteroides fragilis HMW
610]
Length = 387
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 136/330 (41%), Gaps = 93/330 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL +LD+ RG+ +A MI+V++ G + + HA W G D V PFF+FI+G++
Sbjct: 6 NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
++L++ A K++ RT+ + G+ + F H + L+ G D+
Sbjct: 66 YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFSRLGE 124
Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
IR+ GV+QR+AL Y +++ + K +L+AA +
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKHKYIP----------------YLIAALL 168
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
+ ++ L+ G G +N T N + +DR VLG
Sbjct: 169 IGYFIILITGN-----------------GFEYNST-----------NILAVVDRAVLGEA 200
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
HMY KD +PEG+LS++ SI +IG G +
Sbjct: 201 HMY----------------------KD------NGIDPEGVLSTIPSIAHVLIGFCVGKL 232
Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
++ K ++++ +G L G L +
Sbjct: 233 LMEVKDINEKIERLFLVGTILTFAGFLLSY 262
>gi|424666001|ref|ZP_18103037.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
616]
gi|404574254|gb|EKA79005.1| hypothetical protein HMPREF1205_01876 [Bacteroides fragilis HMW
616]
Length = 387
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 78/330 (23%), Positives = 136/330 (41%), Gaps = 93/330 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL +LD+ RG+ +A MI+V++ G + + HA W G D V PFF+FI+G++
Sbjct: 6 NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
++L++ A K++ RT+ + G+ + F H + L+ G D+
Sbjct: 66 YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCHTWNSLS-GEDIPFFSRLGE 124
Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
IR+ GV+QR+AL Y +++ + K +L+AA +
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKHKYIP----------------YLIAALL 168
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
+ ++ L+ G G +N T N + +DR VLG
Sbjct: 169 IGYFIILITGN-----------------GFEYNST-----------NILAVVDRAVLGEA 200
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
HMY KD +PEG+LS++ SI +IG G +
Sbjct: 201 HMY----------------------KD------NGIDPEGVLSTIPSIAHVLIGFCVGKL 232
Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
++ K ++++ +G L G L +
Sbjct: 233 LMEVKDINEKIERLFLVGTILTFAGFLLSY 262
>gi|189464405|ref|ZP_03013190.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
17393]
gi|189438195|gb|EDV07180.1| hypothetical protein BACINT_00746 [Bacteroides intestinalis DSM
17393]
Length = 395
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 77/314 (24%), Positives = 133/314 (42%), Gaps = 97/314 (30%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMP 78
+ Q K++ +R+ +LDI RG+ +A MI+V++ G W I HA WNG D V P
Sbjct: 2 NSQTKTN---KRILALDILRGVTIAGMIMVNNPG-TWGHIYAPLRHAEWNGLTPTDLVFP 57
Query: 79 FFLFIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH-------APDE 129
FF+FI+G++ ++LK+ A K++ RT+ + G+ + G FS + +
Sbjct: 58 FFMFIMGISTYISLKKYNFEFSHAAAMKILKRTIIIFLIGLAI-GWFSRFCYYWAGSHEG 116
Query: 130 LTYGVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
L++G + IR+ GV+QR+AL Y +++ + K
Sbjct: 117 LSFGEQLWASVWTFDRIRILGVMQRLALCYGATAIIALTMKHRHIP-------------- 162
Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
+L+A ++ ++ L+ G G V+N T N +
Sbjct: 163 --YLIATLLVGYFILLMCGN-----------------GFVYNET-----------NILSI 192
Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
+DR +L HMY KD +PEGLLS++ SI
Sbjct: 193 VDRAILTPAHMY----------------------KD------NGIDPEGLLSTIPSIAHV 224
Query: 303 IIGVHFGHVIIHTK 316
++G G +++ +
Sbjct: 225 LLGFCVGRMMLDSN 238
>gi|410663435|ref|YP_006915806.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
21679]
gi|409025792|gb|AFU98076.1| hypothetical protein M5M_04345 [Simiduia agarivorans SA1 = DSM
21679]
Length = 356
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/136 (36%), Positives = 73/136 (53%), Gaps = 15/136 (11%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVGVAI 88
QR +LD RGL +ALMI+V+ G W + HA W G D V PFFLFIVG ++
Sbjct: 3 QRYIALDALRGLTLALMIVVNTPG-SWAHVYGPLLHADWMGWTFTDLVFPFFLFIVGASL 61
Query: 89 ALALKRIPD--RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
+ K + RAD ++K+I R+L L+ + + V + +RL GVLQ
Sbjct: 62 YFSQKGMASLTRADQLRKIIRRSLLLIV--------LGVLLEYYPFIVSLHELRLPGVLQ 113
Query: 147 RIALSYLLVSLVEIFT 162
RI L++ + +L+ +F
Sbjct: 114 RIGLAFGVAALLVVFV 129
>gi|359686994|ref|ZP_09256995.1| hypothetical protein LlicsVM_01380 [Leptospira licerasiae serovar
Varillal str. MMD0835]
gi|418756670|ref|ZP_13312858.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
VAR 010]
gi|384116341|gb|EIE02598.1| PF07786 family protein [Leptospira licerasiae serovar Varillal str.
VAR 010]
Length = 391
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 75/294 (25%), Positives = 118/294 (40%), Gaps = 89/294 (30%)
Query: 34 RLASLDIFRGLAVALMILVDHAGG----DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
R+ S+D+ RGL VA MILV++ G WP + HA W+GC D V PFFLF VG +I
Sbjct: 25 RILSIDLLRGLTVAGMILVNNPGTWSNMYWP-LKHAKWDGCTPTDLVFPFFLFAVGASIP 83
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQR 147
+ + + K++ R++ L+F G+ L G +S + +R GVLQR
Sbjct: 84 FS---VSNGIQEFPKILKRSVILIFLGLFLNFFGEWSFSN-----------LRFPGVLQR 129
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
I +Y ++ ++K+ L +W + + G P
Sbjct: 130 IGFAYFFSAIA------YREKNLKFRIILFLTLLISYWYLQEFIPPP------GAAEPS- 176
Query: 208 QFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQ 267
K+ D+G ++DR+V G H+ W+ K
Sbjct: 177 -----MKEGKDWG--------------------AWLDREVFGQAHL-----WKFGKV--- 203
Query: 268 DSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLAR 321
++PEGLL+S +SI S G+ G + K HL +
Sbjct: 204 -------------------WDPEGLLTSFTSIASVFCGIFAGEFL---KVHLEK 235
>gi|329956032|ref|ZP_08296803.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
12056]
gi|328524791|gb|EGF51845.1| hypothetical protein HMPREF9445_01662 [Bacteroides clarus YIT
12056]
Length = 396
Score = 69.7 bits (169), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 49/151 (32%), Positives = 81/151 (53%), Gaps = 20/151 (13%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
+R+ +LDI RG+ +A MI+V++ G W I HA WNG D V PFF+FI+G++
Sbjct: 8 NKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66
Query: 88 IALALKRIP-DRADAVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTYGVDV--- 136
++LK+ + + AV I + L+F + G FS +P E +++G +
Sbjct: 67 TYISLKKYNFEFSRAVGMKILKRTILIFLIGMAIGWFSKFCYYWTSPTEGISFGAQLWES 126
Query: 137 ----RMIRLCGVLQRIALSYLLVSLVEIFTK 163
IR+ GV+QR+AL Y +++ + K
Sbjct: 127 VWTFDRIRILGVMQRLALCYGATAIIALTVK 157
>gi|198277541|ref|ZP_03210072.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
gi|198270039|gb|EDY94309.1| hypothetical protein BACPLE_03763 [Bacteroides plebeius DSM 17135]
Length = 338
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 41/119 (34%), Positives = 72/119 (60%), Gaps = 7/119 (5%)
Query: 49 MILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP--DRADAVKK 104
MILV++AGG + + H+ WNG D V PFFLF+VG++ ++L++ ++ ++K
Sbjct: 1 MILVNNAGGPVSYAPLRHSVWNGLTPCDLVFPFFLFMVGISTYISLRKFNFGPTSEVIRK 60
Query: 105 VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
++ RT ++ G+ + F +A + + +D +R+ GVLQRI L Y +VSL+ I+
Sbjct: 61 IVRRTFLIILIGLAID-WFGYACNGNFFPIDT--LRIPGVLQRIGLCYGIVSLMVIYIN 116
>gi|427384458|ref|ZP_18880963.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
12058]
gi|425727719|gb|EKU90578.1| hypothetical protein HMPREF9447_01996 [Bacteroides oleiciplenus YIT
12058]
Length = 395
Score = 69.3 bits (168), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 52/158 (32%), Positives = 84/158 (53%), Gaps = 23/158 (14%)
Query: 27 KSHLKT-QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFL 81
S +KT +R+ +LDI RG+ +A MI+V++ G W I HA WNG D V PFF+
Sbjct: 2 SSQMKTNKRILALDILRGVTIAGMIMVNNPG-TWGHIYAPLRHAEWNGLTPTDLVFPFFM 60
Query: 82 FIVGVAIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTY 132
FI+G++ ++LK+ A K++ RT+ + G+ + G FS P E + +
Sbjct: 61 FIMGISTYISLKKYNFEFSHAAAMKILKRTIIIFLIGLAI-GWFSKFCYYWTNPSEGIGF 119
Query: 133 GVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTK 163
G + IR+ GV+QR+AL Y +++ + K
Sbjct: 120 GAQLWESVWTFDRIRILGVMQRLALCYGATAIIALTMK 157
>gi|265765098|ref|ZP_06093373.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|263254482|gb|EEZ25916.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
Length = 387
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 77/331 (23%), Positives = 130/331 (39%), Gaps = 95/331 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
+RL +LD+ RG+ +A MI+V++ G W I HA W G D V PFF+FI+G++
Sbjct: 6 NKRLLALDVLRGITIAGMIMVNNPG-SWSYIYAPLGHAAWIGLTPTDLVFPFFMFIMGIS 64
Query: 88 IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM----- 138
++L++ A K++ RT+ + G+ + F + L+ G D+
Sbjct: 65 TYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLY 123
Query: 139 --------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
IR+ GV+QR+AL Y +++ + K
Sbjct: 124 ESVWTFGHIRILGVMQRLALCYGATAIIALIMKH-------------------------- 157
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
Y+ L + + +IN + +Y N + +DR VLG
Sbjct: 158 ---KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGE 199
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
HMY KD +PEGLLS++ SI +IG G
Sbjct: 200 AHMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGK 231
Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+++ K ++++ +G L G L +
Sbjct: 232 LLMEVKDIHEKIERLFLIGTILTFAGFLLSY 262
>gi|333382729|ref|ZP_08474395.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828330|gb|EGK01039.1| hypothetical protein HMPREF9455_02561 [Dysgonomonas gadei ATCC
BAA-286]
Length = 389
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 46/151 (30%), Positives = 79/151 (52%), Gaps = 19/151 (12%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+ RL SLD+ RG+ +A MI+V+++G + + H W+G D V PFF+FI+G++
Sbjct: 6 SGRLLSLDVLRGITIAGMIMVNNSGSGEYTYAPLKHVAWDGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQG-GFSH------APDELTYG------ 133
++L++ + + K++ RT+ + G+ L G S PD L +
Sbjct: 66 YISLRKFNFEFNTPTLLKILKRTIVIFLIGLGLSWLGLSFRTYHMLEPDNLGFWERFFRA 125
Query: 134 -VDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
D +R GV+QR+AL+Y S++ I K
Sbjct: 126 ITDFGHLRTLGVMQRLALTYGAASIIAITVK 156
>gi|60679957|ref|YP_210101.1| hypothetical protein BF0369 [Bacteroides fragilis NCTC 9343]
gi|336407897|ref|ZP_08588393.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
gi|423248371|ref|ZP_17229387.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
CL03T00C08]
gi|423253319|ref|ZP_17234250.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
CL03T12C07]
gi|423269643|ref|ZP_17248615.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
CL05T00C42]
gi|423272798|ref|ZP_17251745.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
CL05T12C13]
gi|60491391|emb|CAH06139.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
gi|335944976|gb|EGN06793.1| hypothetical protein HMPREF1018_00408 [Bacteroides sp. 2_1_56FAA]
gi|392657219|gb|EIY50856.1| hypothetical protein HMPREF1067_00894 [Bacteroides fragilis
CL03T12C07]
gi|392659584|gb|EIY53202.1| hypothetical protein HMPREF1066_00397 [Bacteroides fragilis
CL03T00C08]
gi|392700489|gb|EIY93651.1| hypothetical protein HMPREF1079_01697 [Bacteroides fragilis
CL05T00C42]
gi|392708362|gb|EIZ01469.1| hypothetical protein HMPREF1080_00398 [Bacteroides fragilis
CL05T12C13]
Length = 387
Score = 68.9 bits (167), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/330 (22%), Positives = 130/330 (39%), Gaps = 93/330 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL +LD+ RG+ +A MI+V++ G + + HA W G D V PFF+FI+G++
Sbjct: 6 NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
++L++ A K++ RT+ + G+ + F + L+ G D+
Sbjct: 66 YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLYE 124
Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
IR+ GV+QR+AL Y +++ + K
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
Y+ L + + +IN + +Y N + +DR VLG
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGEA 200
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
HMY KD +PEGLLS++ SI +IG G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
++ K ++++ +G L G L +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY 262
>gi|53711719|ref|YP_097711.1| hypothetical protein BF0428 [Bacteroides fragilis YCH46]
gi|52214584|dbj|BAD47177.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
Length = 387
Score = 68.6 bits (166), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 75/330 (22%), Positives = 130/330 (39%), Gaps = 93/330 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL +LD+ RG+ +A MI+V++ G + + HA W G D V PFF+FI+G++
Sbjct: 6 NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
++L++ A K++ RT+ + G+ + F + L+ G D+
Sbjct: 66 YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLYE 124
Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
IR+ GV+QR+AL Y +++ + K
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
Y+ L + + +IN + +Y N + +DR VLG
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGEA 200
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
HMY KD +PEGLLS++ SI +IG G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
++ K ++++ +G L G L +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY 262
>gi|323343595|ref|ZP_08083822.1| transmembrane protein [Prevotella oralis ATCC 33269]
gi|323095414|gb|EFZ37988.1| transmembrane protein [Prevotella oralis ATCC 33269]
Length = 384
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 80/341 (23%), Positives = 133/341 (39%), Gaps = 96/341 (28%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVM 77
D + QQ+K R+ ++DI RG+ +A MILV++ G D + + HA W G D V
Sbjct: 2 DTATQQKK------RILAVDILRGMTIAGMILVNNPGTDTVYAPLEHAEWIGLTPTDLVF 55
Query: 78 PFFLFIVGVAIALALKRIPDR--ADAVKKVIFRTLKLLFWGILLQGGFSHA-----PDEL 130
PFF+FI+G+ L+LK+ + + +K+ R L L G+ + F PD
Sbjct: 56 PFFMFIMGITTYLSLKKFEFKWSVECGRKIAKRALLLWLIGLAISWLFMFCRGLLDPDMS 115
Query: 131 TYGVDVRM---------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
+ R+ +RL GVL R+ + Y L ++V + K
Sbjct: 116 SMPFGSRLWASVNTFDQLRLLGVLPRLGICYGLAAVVALSVKHKYIP------------- 162
Query: 182 CWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVG 241
WL+A + Y+ L TC A + N +
Sbjct: 163 ---WLIAIIFIGYYILL--------------------------ETCNGYA--HDASNILA 191
Query: 242 YIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILS 301
+D VLG H+Y R ++P +PEGLLS+ ++
Sbjct: 192 IVDDAVLGHGHVY---------------------RWESP-------DPEGLLSTFPALAH 223
Query: 302 TIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFT 342
+IG G ++ + ++++ +G L G L +
Sbjct: 224 VLIGFCVGRTVMEMQNLNDKIERLFLIGALLTFAGFLLSYA 264
>gi|116748970|ref|YP_845657.1| hypothetical protein Sfum_1534 [Syntrophobacter fumaroxidans MPOB]
gi|116698034|gb|ABK17222.1| conserved hypothetical protein [Syntrophobacter fumaroxidans MPOB]
Length = 374
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 117/292 (40%), Gaps = 74/292 (25%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFI 83
S RLASLD FRG +A MILV+ G + ++ HA WNG AD + P FLF+
Sbjct: 2 NSPTTNTRLASLDAFRGAVIAGMILVNSPGRWVYTYSQLKHAQWNGWTFADTIFPAFLFV 61
Query: 84 VGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
VGV++ + R + + +++ + + LL + D +G + +R+ G
Sbjct: 62 VGVSMVFSFSRRRECEEPAWRLVLQVFRRTSLIFLLGLLLNVMLD--FHGSN---LRIPG 116
Query: 144 VLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTY 203
VLQRIA Y + SL+ + T FR G
Sbjct: 117 VLQRIAACYFVASLIVLGTG--------------FR---------------------GQA 141
Query: 204 VPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSK 263
+ + ++ V + GV L P N Y+D +L HM+ H
Sbjct: 142 IWALGLLALYWLLMEFYPVPGIGAGV---LEPGRNFASYVDSLLLD-GHMWSH------- 190
Query: 264 ACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
+ ++PEG++S++ ++ ST+ GV GH + T
Sbjct: 191 --------------------YRTWDPEGIISTIPAVSSTLFGVLTGHFLRST 222
>gi|218131911|ref|ZP_03460715.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
gi|217986214|gb|EEC52553.1| hypothetical protein BACEGG_03534 [Bacteroides eggerthii DSM 20697]
Length = 396
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 82/152 (53%), Gaps = 22/152 (14%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
+R+ +LDI RG+ +A MI+V++ G W I HA WNG D V PFF+FI+G++
Sbjct: 8 NKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66
Query: 88 IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTYGVDV-- 136
++LK+ A K++ RT+ + G+ + G FS +P E +++G +
Sbjct: 67 TYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI-GWFSKFCYYWTSPTEGISFGTQLWE 125
Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTK 163
IR+ GV+QR+AL Y +++ + K
Sbjct: 126 SVWTFDRIRILGVMQRLALCYGATAIIALTMK 157
>gi|317474486|ref|ZP_07933760.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
1_2_48FAA]
gi|316909167|gb|EFV30847.1| hypothetical protein HMPREF1016_00739 [Bacteroides eggerthii
1_2_48FAA]
Length = 396
Score = 68.6 bits (166), Expect = 5e-09, Method: Compositional matrix adjust.
Identities = 49/152 (32%), Positives = 82/152 (53%), Gaps = 22/152 (14%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
+R+ +LDI RG+ +A MI+V++ G W I HA WNG D V PFF+FI+G++
Sbjct: 8 NKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFMFIMGIS 66
Query: 88 IALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------APDE-LTYGVDV-- 136
++LK+ A K++ RT+ + G+ + G FS +P E +++G +
Sbjct: 67 TYISLKKYNFEFSHAAGIKILKRTILIFLIGMAI-GWFSKFCYYWTSPTEGISFGTQLWE 125
Query: 137 -----RMIRLCGVLQRIALSYLLVSLVEIFTK 163
IR+ GV+QR+AL Y +++ + K
Sbjct: 126 SVWTFDRIRILGVMQRLALCYGATAIIALTMK 157
>gi|423259248|ref|ZP_17240171.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
CL07T00C01]
gi|423263781|ref|ZP_17242784.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
CL07T12C05]
gi|387776828|gb|EIK38928.1| hypothetical protein HMPREF1055_02448 [Bacteroides fragilis
CL07T00C01]
gi|392706047|gb|EIY99170.1| hypothetical protein HMPREF1056_00471 [Bacteroides fragilis
CL07T12C05]
Length = 387
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 76/331 (22%), Positives = 130/331 (39%), Gaps = 95/331 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVA 87
+RL +LD+ RG+ +A MI+V++ G W + HA W G D V PFF+FI+G++
Sbjct: 6 NKRLLALDVLRGITIAGMIMVNNPG-SWSYVYVPLGHAAWIGLTPTDLVFPFFMFIMGIS 64
Query: 88 IALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM----- 138
++L++ A K++ RT+ + G+ + F + L+ G D+
Sbjct: 65 TYISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLY 123
Query: 139 --------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
IR+ GV+QR+AL Y +++ + K
Sbjct: 124 ESVWTFGHIRILGVMQRLALCYGATAIIALIMKH-------------------------- 157
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
Y+ L + + +IN + +Y N + +DR VLG
Sbjct: 158 ---KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGE 199
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
HMY KD +PEGLLS++ SI +IG G
Sbjct: 200 AHMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGK 231
Query: 311 VIIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
+++ K ++++ +G L G L +
Sbjct: 232 LLMEVKDIHEKIERLFLIGTILTFAGFLLSY 262
>gi|254524630|ref|ZP_05136685.1| putative heparan-alpha-glucosaminide N-acetyltransferase
(transmembrane protein 76) [Stenotrophomonas sp. SKA14]
gi|219722221|gb|EED40746.1| putative heparan-alpha-glucosaminide N-acetyltransferase
(transmembrane protein 76) [Stenotrophomonas sp. SKA14]
Length = 355
Score = 68.2 bits (165), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 46/141 (32%), Positives = 75/141 (53%), Gaps = 20/141 (14%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL S+D RG+ VA M+LV++ G DW + H+ W+GC D V PFFLF+VGV++
Sbjct: 7 RRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVGVSM 65
Query: 89 ALALKRIPDRADAVKK------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
A ++ P DA + V+ R L++L + + + + R+
Sbjct: 66 AFSVA--PRALDAAARPALARGVLERALRILL-------AGALLHLLIWWALHTHHFRIW 116
Query: 143 GVLQRIALSYLLVSLVEIFTK 163
GVLQRIA+ LV ++ ++ +
Sbjct: 117 GVLQRIAVCAALVGVLAVYAR 137
>gi|167764222|ref|ZP_02436349.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
43183]
gi|167698338|gb|EDS14917.1| hypothetical protein BACSTE_02607 [Bacteroides stercoris ATCC
43183]
Length = 396
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 53/159 (33%), Positives = 87/159 (54%), Gaps = 25/159 (15%)
Query: 27 KSHLKT-QRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFL 81
S +KT +R+ +LDI RG+ +A MI+V++ G W I HA WNG D V PFF+
Sbjct: 2 SSTVKTNKRILALDILRGVTIAGMIMVNNPG-TWAHIYAPLRHAEWNGLTPTDLVFPFFM 60
Query: 82 FIVGVAIALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFSH------APDE-LT 131
FI+G++ ++LK+ RA + K++ RT+ + G+ + G FS +P E +
Sbjct: 61 FIMGISTYISLKKYNFEFSRAAGM-KILKRTILIFLIGMGI-GWFSRFCYYWTSPTEGIG 118
Query: 132 YGVDV-------RMIRLCGVLQRIALSYLLVSLVEIFTK 163
+G + IR+ GV+QR+AL Y +++ + K
Sbjct: 119 FGAQLWEAAWTFDRIRILGVMQRLALCYGATAIIALTMK 157
>gi|315498708|ref|YP_004087512.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
gi|315416720|gb|ADU13361.1| hypothetical protein Astex_1695 [Asticcacaulis excentricus CB 48]
Length = 378
Score = 68.2 bits (165), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 58/152 (38%), Positives = 73/152 (48%), Gaps = 33/152 (21%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW--------PEISHAPW 67
+SEP + TQRL SLD+ RGL V MILV+ G + P + HA W
Sbjct: 1 MSEPKTA---------TQRLPSLDVLRGLTVIGMILVNATAGMYYGLQAKVFPLLLHAHW 51
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFRTLKLLFWGILLQ-- 120
G +AD V P FL +VG++I +AL R D A A +K+ R L+L G LL
Sbjct: 52 EGLKIADVVFPAFLTMVGLSIPMALNRAKMTTGLDVAQA-RKIGGRVLRLFLIGWLLSNL 110
Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
G +H D R GVLQRI L Y
Sbjct: 111 GWLAH--------FDGEPWRFWGVLQRIGLVY 134
>gi|355694569|gb|AER99714.1| heparan-alpha-glucosaminide N-acetyltransferase [Mustela putorius
furo]
Length = 296
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 58/196 (29%), Positives = 90/196 (45%), Gaps = 37/196 (18%)
Query: 139 IRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVV 194
+R+ GVLQR+ ++Y +V+++E IF K V + S R ++ W WL + +
Sbjct: 5 VRIPGVLQRLGVTYFVVAVLELIFAKPVPESCASERSCFSLRDIIFSWPQWLFILMLESI 64
Query: 195 YLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
+LAL + VP + D GK N T G A GYIDR +LG +H+
Sbjct: 65 WLALTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDHI 114
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
Y HP S A + ++PEG+L S++SI+ +GV G +++
Sbjct: 115 YQHP----SSAVLYHT--------------QVAYDPEGILGSINSIVMAFLGVQAGKILL 156
Query: 314 H----TKGHLARLKQW 325
+ TK L R W
Sbjct: 157 YYKDQTKDILIRFTAW 172
>gi|260790699|ref|XP_002590379.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
gi|229275571|gb|EEN46390.1| hypothetical protein BRAFLDRAFT_76652 [Branchiostoma floridae]
Length = 347
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 79/330 (23%), Positives = 128/330 (38%), Gaps = 98/330 (29%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFV 76
S P+ S+Q +RL SLD FRG
Sbjct: 40 STPE-SEQGLTEKKARERLRSLDTFRG--------------------------------- 65
Query: 77 MPFFLFIVGVAIALALKRIPDRADA---VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
F+FI+G ++AL+ + + R V +VI R+ KL G L G H +
Sbjct: 66 ---FVFIMGTSMALSFRGMRKRTSTRRVVFRVITRSAKLFLVGFFLNAG--HGRN----- 115
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH--------- 184
D+ +R+ GVLQR++++YL+ +E F + R + L H
Sbjct: 116 -DLGTVRVPGVLQRLSIAYLVSGFIECFVGKERKSSDERSRLTNPTLQKIHNALRDIVDN 174
Query: 185 WLMAACVLVVYLALLYGTYV------PDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCN 238
W L++ + L T++ P D + N T G
Sbjct: 175 WAAWLLHLLILVIHLIITFLLPVPGCPTGYLGPGGPLLGDGVEYLNCTGG---------- 224
Query: 239 AVGYIDRKVLGINHMYHHPAWR---RSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
A GYIDR +LG +HMY P R ++K F+PEG+L S
Sbjct: 225 AAGYIDRLILG-SHMYQTPTVRVFYKTKVA---------------------FDPEGILGS 262
Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQW 325
+++I + +G+ G ++++ K H +R+ +W
Sbjct: 263 LTTIFNCFLGLQAGKILVYYKEHSSRIIRW 292
>gi|429738942|ref|ZP_19272716.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
F0055]
gi|429158431|gb|EKY00988.1| hypothetical protein HMPREF9151_01157 [Prevotella saccharolytica
F0055]
Length = 400
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/305 (25%), Positives = 127/305 (41%), Gaps = 94/305 (30%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGV 86
K+ R+ ++DI RG+ +A MILV++ G +W I HA WNG D V PFF+F++G+
Sbjct: 7 KSSRILAIDILRGITIAGMILVNNPG-NWGRIFAPFEHAEWNGMTPTDLVFPFFMFVMGM 65
Query: 87 AIALALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGGFSH------AP-DELTYGVDV- 136
I +A+++ + V K+ R + + G+ + G F+ +P +E ++G +
Sbjct: 66 CIYIAMRKFDFTCNKSTVYKITKRMVLIYLVGLGI-GWFAKFCFRWASPLEEASFGEQLW 124
Query: 137 ------RMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
IRL GVL R+A+ Y + +L+ + V+ K+ +
Sbjct: 125 YMVWPFDSIRLTGVLARLAICYGITALLAV---TVKHKNLP--------------YIIVT 167
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
+LV Y +L A G ++ T N + DR VL
Sbjct: 168 LLVGYFIIL----------------MAGNGFAYDET-----------NILSIADRAVLTD 200
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
HMYH +PEGLLS++ SI T++G G
Sbjct: 201 VHMYHDNG----------------------------IDPEGLLSTLPSIAHTLLGFMVGS 232
Query: 311 VIIHT 315
++ T
Sbjct: 233 LLFKT 237
>gi|116327439|ref|YP_797159.1| hypothetical protein LBL_0655 [Leptospira borgpetersenii serovar
Hardjo-bovis str. L550]
gi|116120183|gb|ABJ78226.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
Hardjo-bovis str. L550]
Length = 369
Score = 67.4 bits (163), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 84/310 (27%), Positives = 125/310 (40%), Gaps = 87/310 (28%)
Query: 38 LDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL-- 92
+D+FRG+ V MILV++ G + + HA WNGC D V PFFLF VG +I ++L
Sbjct: 1 MDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLYS 60
Query: 93 KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
K +R + R + L +L G F + E T+ +R+ GVLQRI Y
Sbjct: 61 KNGINRIRIWIGICIRGISL-----ILLGLFLNFFGEWTF----SELRIPGVLQRIGFVY 111
Query: 153 LLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTII 212
+V+ ++F ++ VLV + +L V W T I
Sbjct: 112 WVVA-------------------TLFLVFP-----GKKVLVFLIPIL---LVHTWILTHI 144
Query: 213 NKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFE 272
L + +IDR + G H+ W+ SK
Sbjct: 145 APPGES-----------MVSLEQGKDIGAWIDRTIFGEKHL-----WKFSKT-------- 180
Query: 273 GPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL 332
++PEG LS ++SI +++ GV G ++ +G R K V L
Sbjct: 181 --------------WDPEGFLSGIASIATSLFGVICGFILFRREG---RGKNRV-----L 218
Query: 333 LIFGLTLHFT 342
IFGL FT
Sbjct: 219 SIFGLGFLFT 228
>gi|58583544|ref|YP_202560.1| hypothetical protein XOO3921 [Xanthomonas oryzae pv. oryzae KACC
10331]
gi|58428138|gb|AAW77175.1| conserved hypothetical protein [Xanthomonas oryzae pv. oryzae KACC
10331]
Length = 362
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 48/124 (38%), Positives = 65/124 (52%), Gaps = 5/124 (4%)
Query: 40 IFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP 96
+FRGL + LMILV+ AG + +++HA W G LAD V P FLF VG A++ AL
Sbjct: 1 MFRGLTIFLMILVNTAGPGAQAYAQLTHAAWFGFTLADLVFPSFLFAVGSAMSFALATNM 60
Query: 97 DRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLL 154
+ +V R + G+L+ F PD V +RL GVLQRI L YL
Sbjct: 61 PHLQFLGRVSKRAALIALCGVLMYWFPFFHLQPDGGWAFTTVDQVRLTGVLQRIGLCYLA 120
Query: 155 VSLV 158
+L+
Sbjct: 121 AALL 124
>gi|322436067|ref|YP_004218279.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
gi|321163794|gb|ADW69499.1| hypothetical protein AciX9_2466 [Granulicella tundricola MP5ACTX9]
Length = 391
Score = 67.0 bits (162), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 77/290 (26%), Positives = 122/290 (42%), Gaps = 76/290 (26%)
Query: 34 RLASLDIFRGLAVALMILVDH---AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
R+ S+D+ RGL +ALMILV+ AG +P++ HA WNG AD V P FLF+ G ++
Sbjct: 16 RVLSIDVLRGLTIALMILVNDPGDAGCVYPQLQHAEWNGYTAADLVFPNFLFLGGASLVF 75
Query: 91 ALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQ 146
+L+ R DR + + + R + L+ ++L + +R IR+ GVL
Sbjct: 76 SLQGRIERGADRWELARGLGRRGVNLIALKLVL---------AMLPSFRLRRIRIFGVLF 126
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
R A+ + L+ + T ++ VG A+L G Y
Sbjct: 127 RTAVCSVAGGLILLGTLEIPMLVGIVG-----------------------AMLTGYY--- 160
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+G++ N L+P N +DRKV H+ H
Sbjct: 161 ------GALRISFGRM-NAPL-----LDPENNLAAALDRKV---AHLLH----------- 194
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTK 316
G L A + +PEGLLSSV ++ +T++G V+ H +
Sbjct: 195 ------GELHTGA--LYNVTHDPEGLLSSVPAVGTTLLGAVAALVMRHPR 236
>gi|311274235|ref|XP_003134250.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
partial [Sus scrofa]
Length = 297
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 61/197 (30%), Positives = 90/197 (45%), Gaps = 41/197 (20%)
Query: 140 RLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFRLY----CW-HWLMAACVLV 193
R+ GVLQR+ ++Y +V+++E+ F K V + S S F L W WL +
Sbjct: 6 RIPGVLQRLGVTYFVVAVLELLFAKPVPESCAS--ERSCFSLLDVTSSWPQWLFVLVLEG 63
Query: 194 VYLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
V+LAL + VP + D GK N T G A GYIDR +LG +H
Sbjct: 64 VWLALTFFLPVPGCPTGYLGPGGIGDLGKYPNCTGG----------AAGYIDRLLLGDDH 113
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y HP S A + ++PEG+L +++SIL +GV G ++
Sbjct: 114 LYQHP----SPAVLYHT--------------KVAYDPEGILGTINSILMAYLGVQAGKIL 155
Query: 313 IH----TKGHLARLKQW 325
++ TKG L R W
Sbjct: 156 LYYKDRTKGILIRFAVW 172
>gi|312131163|ref|YP_003998503.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
17132]
gi|311907709|gb|ADQ18150.1| hypothetical protein Lbys_2486 [Leadbetterella byssophila DSM
17132]
Length = 413
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 91/360 (25%), Positives = 136/360 (37%), Gaps = 125/360 (34%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
L QR+ SLD+FRGL + LMI V++ G DW + HA W+G D V PFF+F +G
Sbjct: 3 LAKQRIVSLDVFRGLTMILMITVNNPG-DWSNVYAPLLHAEWHGWTPTDLVFPFFVFAMG 61
Query: 86 VAIALALK-----------RIPDR---------------------ADAVKKVIFRTLKLL 113
+A+ ++K +I R A + ++FR +
Sbjct: 62 MALPFSMKPGSGLSKDDFLKILARSARLIALGLFLNFFSKIEFGNAQGITLLLFRLMITG 121
Query: 114 FWGILLQGGFSHAPDELTYGVDVRM--------------IRLCGVLQRIALSYLLVSLVE 159
F G LL G F T + + +R+ GVLQR+ Y +++
Sbjct: 122 FVGFLLMGNFPTKIKLYTALALLGLMLALAYSGLPHFAQVRIPGVLQRLGTVYFFAAILY 181
Query: 160 I-FTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD 218
+ F+ VQ W + VLV+Y LL VP T K
Sbjct: 182 LAFSLRVQ------------------WGIGLSVLVIYWLLLAYIPVPGSGVTGFEKGE-- 221
Query: 219 YGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKD 278
N +ID VLG +H+ W SK
Sbjct: 222 -------------------NLPAWIDSIVLG-DHV-----WSSSK--------------- 241
Query: 279 APSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
P++PEG+LS++ +I+S ++G G + K K+ + G LLI GL
Sbjct: 242 -------PWDPEGVLSTLPAIISCLLGAWAGVFLREDK------KKLLLTGVILLICGLA 288
>gi|423282312|ref|ZP_17261197.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
615]
gi|404581880|gb|EKA86575.1| hypothetical protein HMPREF1204_00735 [Bacteroides fragilis HMW
615]
Length = 387
Score = 66.6 bits (161), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 129/330 (39%), Gaps = 93/330 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL +LD+ RG+ +A MI+V++ G + + HA W G D V PFF+FI+G++
Sbjct: 6 NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
++L++ A K++ RT+ + G+ + F + L+ D+
Sbjct: 66 YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-SEDISFFSRLYE 124
Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
IR+ GV+QR+AL Y +++ + K
Sbjct: 125 SIWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
Y+ L + + +IN + +Y N + +DR VLG
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDRTVLGEA 200
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
HMY KD +PEGLLS++ SI +IG G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
++ K ++++ +G L G L +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY 262
>gi|375356811|ref|YP_005109583.1| putative transmembrane protein [Bacteroides fragilis 638R]
gi|383116724|ref|ZP_09937472.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
gi|251947990|gb|EES88272.1| hypothetical protein BSHG_1191 [Bacteroides sp. 3_2_5]
gi|301161492|emb|CBW21032.1| putative transmembrane protein [Bacteroides fragilis 638R]
Length = 387
Score = 66.2 bits (160), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 74/330 (22%), Positives = 129/330 (39%), Gaps = 93/330 (28%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+RL +LD+ RG+ +A MI+V++ G + + HA W G D V PFF+FI+G++
Sbjct: 6 NKRLLALDVLRGITIAGMIMVNNPGSWSYVYAPLGHAAWIGLTPTDLVFPFFMFIMGIST 65
Query: 89 ALALKR--IPDRADAVKKVIFRTLKLLFWGILLQ--GGFSHAPDELTYGVDVRM------ 138
++L++ A K++ RT+ + G+ + F + L+ G D+
Sbjct: 66 YISLRKYNFEFSHSAALKILKRTIVIFAIGLGIAWFSMFCRTWNSLS-GEDISFFSRLYE 124
Query: 139 -------IRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
IR+ GV+QR+AL Y +++ + K
Sbjct: 125 SVWTFGHIRILGVMQRLALCYGATAIIALIMKH--------------------------- 157
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGIN 251
Y+ L + + +IN + +Y N + +D VLG
Sbjct: 158 --KYIPYLIAILLIGYFIILINGNGFEYNS---------------SNILSIVDHTVLGEA 200
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
HMY KD +PEGLLS++ SI +IG G +
Sbjct: 201 HMY----------------------KD------NGIDPEGLLSTIPSIAHVLIGFCVGKL 232
Query: 312 IIHTKGHLARLKQWVTMGFALLIFGLTLHF 341
++ K ++++ +G L G L +
Sbjct: 233 LMEVKDIHEKIERLFLIGTILTFAGFLLSY 262
>gi|224064476|ref|XP_002301495.1| predicted protein [Populus trichocarpa]
gi|222843221|gb|EEE80768.1| predicted protein [Populus trichocarpa]
Length = 136
Score = 65.9 bits (159), Expect = 3e-08, Method: Compositional matrix adjust.
Identities = 30/65 (46%), Positives = 38/65 (58%), Gaps = 1/65 (1%)
Query: 199 LYGTYVPDWQFTIINKDSADY-GKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
LYG Y PDW+F + + Y V CGV+ L PPCNA G IDR LG + +Y HP
Sbjct: 68 LYGLYDPDWEFEVPSTHLFGYKSGTKTVNCGVKGSLEPPCNAAGLIDRFFLGEHPLYQHP 127
Query: 258 AWRRS 262
+RR+
Sbjct: 128 VYRRT 132
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 29/45 (64%), Positives = 37/45 (82%)
Query: 49 MILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALK 93
MILVD AGG +P I+H+PW G L+DFVMPFFLF+VG++I+L K
Sbjct: 1 MILVDDAGGAFPCINHSPWFGVTLSDFVMPFFLFVVGLSISLVFK 45
>gi|386719962|ref|YP_006186288.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
maltophilia D457]
gi|384079524|emb|CCH14124.1| N-acetylglucosamine related transporter, NagX [Stenotrophomonas
maltophilia D457]
Length = 352
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 75/144 (52%), Gaps = 20/144 (13%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVG 85
+ +RL S+D RG+ VA M+LV++ G DW + H+ W+GC D V PFFLF+VG
Sbjct: 1 MPPRRLGSIDALRGITVAAMLLVNNPG-DWSAVFAPLRHSEWHGCTPTDLVFPFFLFLVG 59
Query: 86 VAIALALKRIPDRADAVKK------VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
V++A ++ P DA + V+ R L++L + + + +
Sbjct: 60 VSMAFSVA--PRALDAAARPALARGVLERALRILL-------AGALLHLLIWWALHTHHF 110
Query: 140 RLCGVLQRIALSYLLVSLVEIFTK 163
R+ GVLQRIA+ V ++ ++ +
Sbjct: 111 RIWGVLQRIAVCAASVGVLAVYAR 134
>gi|345322030|ref|XP_003430524.1| PREDICTED: hypothetical protein LOC100681967 [Ornithorhynchus
anatinus]
Length = 530
Score = 65.5 bits (158), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/80 (38%), Positives = 51/80 (63%), Gaps = 4/80 (5%)
Query: 44 LAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL----KRIPDRA 99
L++ LM+ V++ GG + HAPWNG +AD VMP+F+FI+G ++ALA +R +R
Sbjct: 130 LSLTLMVFVNYGGGGYWFFEHAPWNGLTVADLVMPWFVFILGTSVALAFYAMRRRGVNRV 189
Query: 100 DAVKKVIFRTLKLLFWGILL 119
++K+ +RT L+ G+
Sbjct: 190 QLLRKLTWRTAVLMIIGLFF 209
>gi|408369302|ref|ZP_11167083.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
gi|407745048|gb|EKF56614.1| hypothetical protein I215_00330 [Galbibacter sp. ck-I2-15]
Length = 345
Score = 65.5 bits (158), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 79/293 (26%), Positives = 115/293 (39%), Gaps = 82/293 (27%)
Query: 49 MILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA--V 102
MI+V+ G W + SHA W+G L D V P FLF+VG A++ ++++ + A +
Sbjct: 1 MIIVNTPG-SWGSVYRPLSHASWHGFTLTDLVFPTFLFVVGNAMSFSMRKFEQTSQAAFL 59
Query: 103 KKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT 162
KKVI RT + G LL +L D R+ GVLQRIAL Y SLV
Sbjct: 60 KKVIKRTFVIFAIGFLLSWFPFFRDGQLKPLEDARIF---GVLQRIALCYFFASLVI--- 113
Query: 163 KDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV 222
H+ LV + L G + +I DY
Sbjct: 114 ---------------------HYFKIKGALVFSMVALLG-------YHLIMYTMGDY--- 142
Query: 223 FNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSW 282
NA +D +LG NH+Y EG
Sbjct: 143 -----------TLEGNAALKLDLWLLGPNHLYQG---------------EG--------- 167
Query: 283 CHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIF 335
PF+PEGLLS++ + ++ I G +F + + G + + +G A L+F
Sbjct: 168 --IPFDPEGLLSTLPATVNVIFG-YFAGLFLQQSGKNFKTIALLMIGGATLVF 217
>gi|47213040|emb|CAF93449.1| unnamed protein product [Tetraodon nigroviridis]
Length = 297
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 72/266 (27%), Positives = 114/266 (42%), Gaps = 64/266 (24%)
Query: 139 IRLCGVLQRIALSYLLVSLVEIFTKD--------VQDKDQSVGRFSIFRLYCWHWLMAAC 190
+R+ GVLQR+AL+YL+V+ +++ +QD S G + +W C
Sbjct: 6 LRIPGVLQRLALAYLVVACLDLLVARRFSCVFCVLQDAWWSQGIDILL-----YWPAWVC 60
Query: 191 VLVVYLALLYGTY---VPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRK 246
VL++ L+ T+ VPD + D G N T G A G+IDR
Sbjct: 61 VLLLESVWLFITFLLPVPDCPTGYLGPGGIGDMGLYPNCTGG----------AAGFIDRW 110
Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
+LG H+Y +P+ + A H P++PEG+L S++SIL +G+
Sbjct: 111 LLGEKHIYQNPSSQGIYAT------------------HLPYDPEGILGSINSILIAFLGL 152
Query: 307 HFGHVIIHTKG-HLARLKQWVTMGFALLIFGLTLHFTNGEHG---------SGKFSTTCV 356
G +I+H + H + +++ GF L I L + G S + TT
Sbjct: 153 QAGKIILHHRDLHQGVISRFLIWGFLLGIISAVLTNCSTNQGLIPINKNLWSLSYVTTLA 212
Query: 357 CL------FIYSKVILFQW---QPFL 373
C IY V + +W +PFL
Sbjct: 213 CFAYVLLALIYYTVDVKKWWSGRPFL 238
>gi|404404699|ref|ZP_10996283.1| hypothetical protein AJC13_04673 [Alistipes sp. JC136]
Length = 376
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 81/315 (25%), Positives = 123/315 (39%), Gaps = 96/315 (30%)
Query: 25 QEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGCN 71
Q QRL SLD RG L VAL L + DW ++ HA W+G
Sbjct: 4 QPTKPTAPQRLLSLDALRGFDMFFIMGFAGLVVALCKLRPGSFADWMSAQMGHAAWDGFF 63
Query: 72 LADFVMPFFLFIVGVAIALALKRIPDRADAVK------KVIFRTLKLLFWGILLQGGFSH 125
D + P FLFI G++ +L + R V+ KVI R L L+ G++ G F+
Sbjct: 64 HHDTIFPLFLFIAGISFPFSLAK--QREKGVRERSIYTKVIRRGLTLVALGLVYNGLFN- 120
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
+D +RL VL RI L+++ +++ I RF I
Sbjct: 121 --------LDFATLRLPSVLGRIGLAWMFAAMLFI-------------RFGIRTRIA--- 156
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
+AA +LV Y LL QF V L N VGYIDR
Sbjct: 157 -LAAVILVGYGLLL--------QF------------VAAPDAAGAGPLTEAGNIVGYIDR 195
Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
++ H+Y + F+PEGLLS++ +I++ ++G
Sbjct: 196 TIMP-AHLYGNRG----------------------------FDPEGLLSTLPAIVTAMLG 226
Query: 306 VHFGHVIIHTKGHLA 320
+ G + ++ ++
Sbjct: 227 MFTGEFVRRSEEQIS 241
>gi|329963071|ref|ZP_08300851.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
12057]
gi|328529112|gb|EGF56042.1| hypothetical protein HMPREF9446_02444 [Bacteroides fluxus YIT
12057]
Length = 381
Score = 65.1 bits (157), Expect = 5e-08, Method: Compositional matrix adjust.
Identities = 81/326 (24%), Positives = 123/326 (37%), Gaps = 94/326 (28%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHA---GGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
QRL +LD+ RGL +A MILV+ + + HA WNG D + PFFLF++GV+
Sbjct: 3 NNQRLVALDVMRGLTIAGMILVNTPETWSYVYAPLQHARWNGLTPTDVIFPFFLFMMGVS 62
Query: 88 IALALKRIPDRADA--VKKVIFRTLKLLFWGILL---------------QGGFSHAPDEL 130
+ ++LK+ + + K+I R+L L G + Q GF P +
Sbjct: 63 MYISLKKCSFHLSSHLLMKIIRRSLILFLIGTAIYALATFLGTLRDACRQPGFEGNPWKE 122
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
+ + R+ GVLQR+ + Y G SI L C H
Sbjct: 123 AFA-SLPGTRIPGVLQRLGVCY--------------------GIGSIIVLTCRH------ 155
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
Y+P I+ A Y F + ++ P N + +DR + G
Sbjct: 156 -----------RYIPHLAGGIL----AGY---FLILLFGNGFVHSPENILSVVDRTLFGD 197
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
N + D + PEG LS++ SI +IG G
Sbjct: 198 NMI-------------NDGGID----------------PEGALSTLPSIAQVLIGFCIGK 228
Query: 311 VIIHTKGHLARLKQWVTMGFALLIFG 336
+ I T +L + G +LI G
Sbjct: 229 ICIETPDMREKLNKIFLYGSLMLIVG 254
>gi|288929890|ref|ZP_06423732.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
F0108]
gi|288328709|gb|EFC67298.1| conserved hypothetical protein [Prevotella sp. oral taxon 317 str.
F0108]
Length = 399
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 52/156 (33%), Positives = 81/156 (51%), Gaps = 23/156 (14%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
EK+ T R+ S+DI RGL +A MI V++ G + + HA WNG D V PFF+
Sbjct: 1 MEKNK-TTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFM 59
Query: 82 FIVGVAIALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFS------HAPDE--- 129
++G+ I +A+++ +RA K I + + L++ L G F+ ++P E
Sbjct: 60 CVMGMCIYIAMRKFDFACNRATVYK--IVKRMVLIYLVGLAIGWFAKFCYRWNSPQEGAD 117
Query: 130 ----LTYGV-DVRMIRLCGVLQRIALSYLLVSLVEI 160
L Y V IRL GVL R+A+ Y + +L+ I
Sbjct: 118 FFSQLWYMVWSFDKIRLTGVLARLAICYGITALLAI 153
>gi|291514403|emb|CBK63613.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
Length = 376
Score = 64.7 bits (156), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 77/309 (24%), Positives = 119/309 (38%), Gaps = 92/309 (29%)
Query: 25 QEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGCN 71
Q +QRL SLD RG L AL L DW ++ HA WNG
Sbjct: 4 QPTQPAASQRLLSLDALRGFDMLFIMGFAGLVTALCKLCPGEFSDWMTAQMGHADWNGFF 63
Query: 72 LADFVMPFFLFIVGVAIALALKRIPDRADAVK----KVIFRTLKLLFWGILLQGGFSHAP 127
D + P FLFI G++ +L + ++ + + KVI R L L+ G + G F
Sbjct: 64 HHDTIFPLFLFIAGISFPFSLAKQREKGMSERSIYLKVIRRGLTLVALGFVYSGLFK--- 120
Query: 128 DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
+D +RL VL RI L+++ +L+ + + + ++V +
Sbjct: 121 ------LDFATLRLPSVLGRIGLAWMFAALLFV---NFNVRTRAV--------------I 157
Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
AA +L+ Y LL PD G L N VGY+DR V
Sbjct: 158 AAAILLGYGLLLQFVAAPD--------------------AGGAGPLTLEGNIVGYVDRIV 197
Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
M H R F+PEGLLS++ +I++ ++G+
Sbjct: 198 -----MPSHLLGGRG------------------------FDPEGLLSTLPAIVTAMLGMF 228
Query: 308 FGHVIIHTK 316
G + ++
Sbjct: 229 TGEFVRRSE 237
>gi|338211253|ref|YP_004655306.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336305072|gb|AEI48174.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 374
Score = 64.3 bits (155), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 79/331 (23%), Positives = 132/331 (39%), Gaps = 97/331 (29%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAV-------ALMILVDHAGGDW-------PEISHAPWNG 69
Q + + RL SLD RG + + L+ A G W ++SH WNG
Sbjct: 2 SQPSTDTRPHRLLSLDALRGFDMFWITGGEEIFHLLAKATG-WTGAIIMAEQLSHPDWNG 60
Query: 70 CNLADFVMPFFLFIVGV----AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
D + P FLF+ GV ++ + L+R DR ++KVI R L L+ GI+ G
Sbjct: 61 FRAYDLIFPLFLFLSGVSAPYSLGVRLERGDDRGKMLRKVIQRGLTLVLLGIIYNNGLQI 120
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD-VQDKDQSVGRFSIFRLYCWH 184
P E +R VL RI L+ + ++ ++T VQ Y W
Sbjct: 121 KPLE--------DMRFPSVLGRIGLAGMFAQIIYLYTSTRVQ--------------YIW- 157
Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
+++LL G W F ++ CG + CN V Y+D
Sbjct: 158 ----------FVSLLLGY----WAFVMLVPVPG---------CGA-GLMTMECNPVSYLD 193
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R ++ G L KD H +PEGL+S++ +I + ++
Sbjct: 194 RLII-----------------------PGHLHKD----IH---DPEGLVSTIPAIATGLL 223
Query: 305 GVHFGHVIIHTKGHLARLKQWVTMGFALLIF 335
G+ G+++ + +R ++ + + A ++F
Sbjct: 224 GIFAGNLLRADERSTSRTQKVLVLFVAGILF 254
>gi|189463416|ref|ZP_03012201.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM
17136]
gi|189429845|gb|EDU98829.1| hypothetical protein BACCOP_04135 [Bacteroides coprocola DSM
17136]
Length = 82
Score = 64.3 bits (155), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 45/67 (67%), Gaps = 3/67 (4%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
K QRL SLD+ RG+ +A MI+V++ G + ++HA WNG D V PFF+FI+G++
Sbjct: 3 KAQRLISLDVLRGITIAGMIIVNNPGSWKHVYTPLTHAVWNGLTPTDLVFPFFMFIMGIS 62
Query: 88 IALALKR 94
++LK+
Sbjct: 63 TYISLKK 69
>gi|404485011|ref|ZP_11020215.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
YIT 11860]
gi|404340016|gb|EJZ66447.1| hypothetical protein HMPREF9448_00625 [Barnesiella intestinihominis
YIT 11860]
Length = 440
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 47/145 (32%), Positives = 71/145 (48%), Gaps = 25/145 (17%)
Query: 33 QRLASLDIFRGLAVALMIL-----------VDHA--GGDWPEISHAPWNGCNLADFVMPF 79
QRLASLDI RG + L++ VD + + H W G D VMP
Sbjct: 75 QRLASLDILRGFDLFLLVFLQPVLVSLGACVDSSVMNAVLYQFDHEVWEGFRFWDLVMPL 134
Query: 80 FLFIVGVAIALAL---KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
FLF+ GV++ + +R+ R KKV+ R + L G+++QG G+D+
Sbjct: 135 FLFMTGVSMPFSFSKYERVESRRFIYKKVLRRFVILFLLGMVVQGNL--------LGLDL 186
Query: 137 RMIRL-CGVLQRIALSYLLVSLVEI 160
+ IRL LQ IA YL+ +L+++
Sbjct: 187 KYIRLYSNTLQAIAAGYLIAALIQL 211
>gi|322785719|gb|EFZ12357.1| hypothetical protein SINV_16151 [Solenopsis invicta]
Length = 111
Score = 63.9 bits (154), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 42/73 (57%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
+V D+ K R+ ++D FRG++ MI V+ G + + HA WNG L D V P
Sbjct: 33 NVKDESSNKEPKKNRVKAIDTFRGISTLFMIFVNDGSGSYTVLEHATWNGLLLGDLVFPC 92
Query: 80 FLFIVGVAIALAL 92
F++I+GV + +AL
Sbjct: 93 FIWIMGVCVPIAL 105
>gi|260910302|ref|ZP_05916976.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
gi|260635554|gb|EEX53570.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
Length = 399
Score = 63.5 bits (153), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 79/310 (25%), Positives = 125/310 (40%), Gaps = 95/310 (30%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFL 81
EK+ T R+ S+DI RGL +A MI V++ G + + HA WNG D V PFF+
Sbjct: 1 MEKNK-TTSRILSIDILRGLTIAGMITVNNPGSWSYMYAPLEHAEWNGLTPTDLVFPFFM 59
Query: 82 FIVGVAIALALKRIP---DRADAVKKVIFRTLKLLFWGILLQGGFS------HAPDE--- 129
++G+ I +A+ + +RA K I + + L++ L G F+ + P E
Sbjct: 60 CVMGMCIYIAMSKFNFACNRATVYK--ILKRMVLIYLVGLAIGWFAKFCYRWNNPQEGAD 117
Query: 130 ----LTYGV-DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
L Y V IRL GVL R+A+ Y + +L+ I V+ K
Sbjct: 118 FFSQLWYMVWSFDKIRLTGVLARLAVCYGITALLAI---TVRHKHLP------------- 161
Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
+++ +L ++ L+ G G ++ T N + +D
Sbjct: 162 YIVGGLLLAYFVILMAGN-----------------GFAYDET-----------NILSIVD 193
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
R VL HMYH +PEGLLS++ SI T++
Sbjct: 194 RAVLTDAHMYHDNG----------------------------IDPEGLLSTLPSIAHTLL 225
Query: 305 GVHFGHVIIH 314
G G ++
Sbjct: 226 GFIIGGMLFR 235
>gi|16552925|dbj|BAB71412.1| unnamed protein product [Homo sapiens]
Length = 367
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 55/196 (28%), Positives = 89/196 (45%), Gaps = 37/196 (18%)
Query: 139 IRLCGVLQRIALSYLLVSLVEI-FTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVV 194
+R+ GVLQR+ ++Y +V+++E+ F K V + S R W WL+ + +
Sbjct: 75 VRIPGVLQRLGVTYFVVAVLELLFAKPVPEHCASERSCLSLRDITSSWPQWLLILVLEGL 134
Query: 195 YLALLYGTYVPDWQFTIINKDS-ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
+L L + VP + D+GK N T G A GYIDR +LG +H+
Sbjct: 135 WLGLTFLLPVPGCPTGYLGPGGIGDFGKYPNCTGG----------AAGYIDRLLLGDDHL 184
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
Y HP S A + ++PEG+L +++SI+ +GV G +++
Sbjct: 185 YQHP----SSAVLYHT--------------EVAYDPEGILGTINSIVMAFLGVQAGKILL 226
Query: 314 H----TKGHLARLKQW 325
+ TK L R W
Sbjct: 227 YYKARTKDILIRFTAW 242
>gi|383753678|ref|YP_005432581.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
lactilytica TAM6421]
gi|381365730|dbj|BAL82558.1| hypothetical protein SELR_08500 [Selenomonas ruminantium subsp.
lactilytica TAM6421]
Length = 384
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 47/136 (34%), Positives = 67/136 (49%), Gaps = 12/136 (8%)
Query: 31 KTQRLASLDIFRGLAVALMILVD---HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+RLA++DIFRGLA+A+M+LV+ + WP + HAPW G +AD P F+FI+GV+
Sbjct: 7 NKRRLAAIDIFRGLAIAIMLLVNALPNFEQAWPLLVHAPWAGLTIADLAFPGFVFIMGVS 66
Query: 88 IALALKRIPDRADAVKKVIFRTLKLLF---------WGILLQGGFSHAPDELTYGVDVRM 138
+L + K I LL + ++LQ F P V
Sbjct: 67 ASLWFPKHEQDGSGEKFCIILKRSLLLILLGFFLCQFPLVLQHVFQPEPGGSLIKDIVEH 126
Query: 139 IRLCGVLQRIALSYLL 154
R+ GVLQR+ L Y
Sbjct: 127 GRIPGVLQRLGLVYFF 142
>gi|291516094|emb|CBK65304.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
Length = 331
Score = 63.2 bits (152), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 66/266 (24%), Positives = 106/266 (39%), Gaps = 83/266 (31%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVK------KVIFRTLKLLF 114
++ H WNG D + P FLFI GV+ +L + RA + KVI R + L+
Sbjct: 8 QMGHVSWNGLTQHDTIFPLFLFIAGVSFPFSLSK--QRASGISERRILFKVIRRGMTLIV 65
Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
G++ G F D +R+ VL RI L+++ SL+ ++ K V+ +
Sbjct: 66 LGMIYNGLFRF---------DFASLRVASVLGRIGLAWMFASLLYMYCK-VRTRA----- 110
Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
+ AA VL+ Y L+Y PD D D L+
Sbjct: 111 -----------VFAAVVLIGYSLLMYLVVAPD------APDGTD-------------PLS 140
Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
N G++DR+ L P + AC F+PEGLLS
Sbjct: 141 VAGNIAGWVDRQWL--------PG---TFAC-------------------GSFDPEGLLS 170
Query: 295 SVSSILSTIIGVHFGHVIIHTKGHLA 320
++ +I+S + G+ G ++ + L+
Sbjct: 171 TLPAIVSALFGMFTGEFLLRKRSSLS 196
>gi|223936398|ref|ZP_03628310.1| conserved hypothetical protein [bacterium Ellin514]
gi|223894916|gb|EEF61365.1| conserved hypothetical protein [bacterium Ellin514]
Length = 427
Score = 62.4 bits (150), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 74/253 (29%), Positives = 109/253 (43%), Gaps = 65/253 (25%)
Query: 33 QRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNGCNLADFVMPF 79
QRL S+D RG L AL L + D+ ++ H W G + D + P
Sbjct: 24 QRLMSVDALRGFDMFWIIGADSLVYALHRLSQNRVTDFLGLQLDHCDWAGFHFYDLIFPL 83
Query: 80 FLFIVGVAIALALKRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVD 135
F+FI+GV++ +L + RA+AVK+V R+ L ++ GG A PD
Sbjct: 84 FVFIMGVSVVFSLTKAIQQLGRAEAVKRVFRRSALLFVVALIYSGGVRSAWPD------- 136
Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
IRL GVL RIAL Y + L+ F K R + +AA +L+ Y
Sbjct: 137 ---IRLLGVLNRIALCYFVGGLIFCFFKP---------RAMV--------AIAAALLIGY 176
Query: 196 LALLYGTYVP----------------DWQFTIINKDS--ADYGKVF-NVTCGVRAKLNPP 236
+++ T+VP D I +D+ +D K+F N T V AK +
Sbjct: 177 WSIM--TFVPIRDIRMAHYKEKHELVDNDVDKIMQDTGVSDPAKIFYNTTNWVTAKYDMG 234
Query: 237 CNAVGYIDRKVLG 249
N ++D K LG
Sbjct: 235 YNVANHLDFKYLG 247
>gi|374384982|ref|ZP_09642493.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
12061]
gi|373227040|gb|EHP49361.1| hypothetical protein HMPREF9449_00879 [Odoribacter laneus YIT
12061]
Length = 382
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 44/158 (27%), Positives = 74/158 (46%), Gaps = 25/158 (15%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWPE-----ISHAPWNG 69
D + +H +RL SLD RG + ++ L A +W +H W G
Sbjct: 2 DTMKSTHSAAKRLESLDALRGFDLFFLVALGPLMNSLARAADAEWFNNWMGIFNHVSWEG 61
Query: 70 CNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSH 125
+ D +MP FLF+ G+++ AL R +PD+ ++++ R L L +G++ QG
Sbjct: 62 FSPWDLIMPLFLFMSGISMPFALARYKSMPDKRPLLRRLGKRILLLWIFGMICQGNLLGL 121
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
PD ++ LQ IA YL+ +L+ +FT+
Sbjct: 122 NPD--------KIYLYSNTLQAIAAGYLITALLFLFTR 151
>gi|380512476|ref|ZP_09855883.1| hypothetical protein XsacN4_14717 [Xanthomonas sacchari NCPPB 4393]
Length = 384
Score = 62.4 bits (150), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 49/126 (38%), Positives = 66/126 (52%), Gaps = 15/126 (11%)
Query: 33 QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+R SLD+FRGL + LMILV+ AG D + ++ H PW G AD V P FLF VG A++
Sbjct: 16 ERFLSLDVFRGLTIFLMILVNTPGAGADAFVQLRHTPWFGFTAADLVFPSFLFAVGNAMS 75
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILL-------QGGFSHAPDELTYGVDVRMIRLC 142
AL R +++V R+ + G L+ QG H LT + R+
Sbjct: 76 FALDRGQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQGADGHW--SLT---AIDQTRVP 130
Query: 143 GVLQRI 148
GVLQRI
Sbjct: 131 GVLQRI 136
>gi|329851798|ref|ZP_08266479.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
gi|328839647|gb|EGF89220.1| hypothetical protein ABI_45670 [Asticcacaulis biprosthecum C19]
Length = 398
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 74/295 (25%), Positives = 118/295 (40%), Gaps = 83/295 (28%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFFLFIVGVAIA 89
R +LDI RGL + M+L ++AG DW I HA W+G L D V P F+ VG+++
Sbjct: 27 RFEALDILRGLFIIGMLLANNAG-DWSHIYTPLDHAEWHGFTLTDMVFPGFMTCVGLSMT 85
Query: 90 LALKR----IPDRADAVKKVIFRTLKL--------LFWGILLQGGFSHAPDELTYGVDVR 137
L+L R + +A ++ +L+ LF +L Q F H
Sbjct: 86 LSLGRRQKTLNSQAGGKAALLVHSLRRAAILVGIGLFLNLLPQFDFEHW----------- 134
Query: 138 MIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLA 197
RL GVLQRI + Y + S + + + + L+ +L+A +L+ Y+
Sbjct: 135 --RLPGVLQRIGICYAIASGLVVLHSHQNQQGGLILHSRALALWGVGFLVAYTLLLKYVP 192
Query: 198 LLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
+ G W A + P ++D +VLG+NH+
Sbjct: 193 VPDGAGANQWD----------------------AIHSWPA----WVDMQVLGVNHV---- 222
Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
W +K ++PEGLLSSV + + + G+ G I
Sbjct: 223 -WSGAKT----------------------YDPEGLLSSVPATSNILFGILMGLYI 254
>gi|254784997|ref|YP_003072425.1| hypothetical protein TERTU_0813 [Teredinibacter turnerae T7901]
gi|237684955|gb|ACR12219.1| putative membrane protein [Teredinibacter turnerae T7901]
Length = 354
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 45/124 (36%), Positives = 62/124 (50%), Gaps = 9/124 (7%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+R +LD RG+ +A+MILV+ G +P + HA W+G DFV PFFLFIVG A+
Sbjct: 2 NERSLALDALRGITLAMMILVNTPGSWSHVYPPLLHANWHGVTPTDFVFPFFLFIVGCAL 61
Query: 89 ALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+ +R + + LK +F +L Y RL GVLQRI
Sbjct: 62 FFS-----NRKNHQLDIYTHALK-IFRRTVLLLLAGLGLHAYLYSGTFAEFRLPGVLQRI 115
Query: 149 ALSY 152
AL+Y
Sbjct: 116 ALAY 119
>gi|150007979|ref|YP_001302722.1| transmembrane protein [Parabacteroides distasonis ATCC 8503]
gi|423331514|ref|ZP_17309298.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
CL03T12C09]
gi|149936403|gb|ABR43100.1| putative transmembrane protein [Parabacteroides distasonis ATCC
8503]
gi|409230084|gb|EKN22952.1| hypothetical protein HMPREF1075_01311 [Parabacteroides distasonis
CL03T12C09]
Length = 378
Score = 62.0 bits (149), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 50/151 (33%), Positives = 74/151 (49%), Gaps = 27/151 (17%)
Query: 31 KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
K RL SLD+ RG L V M+L + HA D P SH W G + D V
Sbjct: 5 KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63
Query: 77 MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
MP FLF+ GV++ +L R +PD+ +++ R L L +G++ QG + PD
Sbjct: 64 MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQGNLLALDPD---- 119
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTK 163
R+ LQ IA+ YL+ SL+ ++ +
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLLFLYVR 146
>gi|440804580|gb|ELR25457.1| Heparan-alpha-glucosaminide N-acetyltransferase [Acanthamoeba
castellanii str. Neff]
Length = 446
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 82/325 (25%), Positives = 126/325 (38%), Gaps = 95/325 (29%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG-------------DWPEISHAPWN 68
+D S K RL SLD+FRG+ + MILVD+ G P P N
Sbjct: 47 TDASCPSAPKKPRLQSLDVFRGVTMLGMILVDNQGNFDHVVRPLDESIVRHPAPPRPPTN 106
Query: 69 G---CNLADFVMPFFLFIVGVAIALALKRIPDRADAVK---KVIFRTLKLLFWGILLQGG 122
+ AD + F V +A+ +IPDR +K +V+ R L G+LL
Sbjct: 107 ARSWVDPADHCAQWDGFAVALAMNGFWDKIPDRRGKIKAWARVLQRIGTLFVVGLLLNAF 166
Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
S+ D+ + R+ G RIAL Y V+++ + T + +
Sbjct: 167 GSNPWDKWPH----WHFRIMGC--RIALCYGTVTVLFLATSTIVQR-------------- 206
Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
++ C +Y+ L+YG VP CG R L P CNA G+
Sbjct: 207 ---VVMLCFTAIYVGLMYGLDVP--------------------KCG-RGNLTPGCNAGGF 242
Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
IDR + G W P +PEGLLS++++ L+
Sbjct: 243 IDRSIFG-------------------------------DWMIRPNDPEGLLSTLTATLTC 271
Query: 303 IIGVHFGHVI-IHTKGHLARLKQWV 326
+G+ FG ++ + L + +WV
Sbjct: 272 YLGLEFGRILHKYRANQLELVCRWV 296
>gi|433678126|ref|ZP_20510025.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
translucens pv. translucens DSM 18974]
gi|430816762|emb|CCP40477.1| Heparan-alpha-glucosaminide N-acetyltransferase [Xanthomonas
translucens pv. translucens DSM 18974]
Length = 384
Score = 61.6 bits (148), Expect = 6e-07, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 67/121 (55%), Gaps = 5/121 (4%)
Query: 33 QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+R SLD+FRGL + LMIL + AG D + ++ HAPW G AD P FLF+VG A++
Sbjct: 16 ERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAMS 75
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRLCGVLQR 147
AL R +++V R+ + G L+ F H + ++ + + R+ GVLQR
Sbjct: 76 FALDRSQPLGAFLRRVGKRSALIFLLGFLMYWFPFVHQGADGSWSFIAIDQTRVPGVLQR 135
Query: 148 I 148
I
Sbjct: 136 I 136
>gi|390958852|ref|YP_006422609.1| hypothetical protein Terro_3042 [Terriglobus roseus DSM 18391]
gi|390413770|gb|AFL89274.1| Protein of unknown function (DUF1624) [Terriglobus roseus DSM
18391]
Length = 406
Score = 61.2 bits (147), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 40/112 (35%), Positives = 58/112 (51%), Gaps = 17/112 (15%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW---------NGCNLAD 74
+ ++ +K QR+ SLDIFRGL +ALMI V+ EI PW N D
Sbjct: 3 ETTRATVKPQRIQSLDIFRGLNIALMIFVNELH----EIKGLPWWTYHAPGAANVMTYVD 58
Query: 75 FVMPFFLFIVGVAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGG 122
V P FL IVG+++ LAL+ R D + V+ R++ L+ G++LQ
Sbjct: 59 MVFPAFLVIVGMSLPLALQARIRRGDETPQLIWYVVLRSVALIVLGLILQNA 110
>gi|223936396|ref|ZP_03628308.1| conserved hypothetical protein [bacterium Ellin514]
gi|223894914|gb|EEF61363.1| conserved hypothetical protein [bacterium Ellin514]
Length = 383
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 46/161 (28%), Positives = 74/161 (45%), Gaps = 30/161 (18%)
Query: 13 PLIISE-PDVSDQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWPEI----- 62
P SE P +S+Q + Q R+ S+D RG + ++ D + +I
Sbjct: 4 PTSTSEAPALSNQAGSTATLNQKANTRIISIDALRGFDMFWIMGGDQLVRSFQKIDDSAP 63
Query: 63 --------SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPD---RADAVKKVIFRTLK 111
H W G + D + P F+F+ GV+I ++ R+ + R AVK++ FR++
Sbjct: 64 THALANQMEHCEWAGFHFYDLIFPLFVFLAGVSIVFSITRLIEHSGRVAAVKRIAFRSVI 123
Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSY 152
L +GI GG S+ + I L GVL RIA++Y
Sbjct: 124 LFLFGIFYMGGVSNG---------FKNIYLAGVLHRIAVAY 155
>gi|217974365|ref|YP_002359116.1| hypothetical protein Sbal223_3208 [Shewanella baltica OS223]
gi|217499500|gb|ACK47693.1| conserved hypothetical protein [Shewanella baltica OS223]
Length = 384
Score = 61.2 bits (147), Expect = 8e-07, Method: Compositional matrix adjust.
Identities = 95/384 (24%), Positives = 145/384 (37%), Gaps = 113/384 (29%)
Query: 22 SDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWN 68
++ E + RL SLD RG + L+IL AG W ++ H+ W+
Sbjct: 6 TNAIEPVKVNKPRLVSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWH 65
Query: 69 GCNLADFVMPFFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGF 123
G D + P F+F+ GVA+ L+ KR I +R + I R LL GIL G+
Sbjct: 66 GFRFYDLIFPLFIFLSGVALGLSPKRLDKLPISERLPVYRHGIKRLFLLLLLGILYNHGW 125
Query: 124 -SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
+ AP D IR VL RIA ++ +L+
Sbjct: 126 GTGAP------ADPEKIRYASVLGRIAFAWFFAALL-----------------------V 156
Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNA 239
WH + ++V L +L G YG + G L+P +
Sbjct: 157 WHTSLRTQIIVA-LGILLG-----------------YGAIQLWLPFPGGQAGVLSPTESI 198
Query: 240 VGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSS 298
Y+D +L G+++ P +PEGLLS++ +
Sbjct: 199 NAYVDSILLPGVSYQGRTP------------------------------DPEGLLSTIPA 228
Query: 299 ILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHF---TNGEHGSGKF-- 351
I++ + GV GH I+ H KG A++ G LL FG L N E + F
Sbjct: 229 IVNALTGVFVGHFIVKSHPKGEWAKVGLLAAAGGILLAFGWLLDLVIPVNKELWTSSFVL 288
Query: 352 -----STTCVCLFIYSKVILFQWQ 370
S + +F Y+ V + +WQ
Sbjct: 289 VTSGWSMILLAVF-YALVDVLKWQ 311
>gi|413922900|gb|AFW62832.1| hypothetical protein ZEAMMB73_935848 [Zea mays]
Length = 1241
Score = 60.8 bits (146), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/60 (45%), Positives = 36/60 (60%)
Query: 222 VFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPS 281
V V CGVR + CNAVG IDRK+LGI H+Y P + RSK +++ R+ AP+
Sbjct: 966 VLQVKCGVRGDTSSGCNAVGMIDRKILGIQHLYGRPVYARSKNYRKNTLAASSSRRKAPA 1025
>gi|196233857|ref|ZP_03132695.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
gi|196222051|gb|EDY16583.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
Length = 437
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 88/340 (25%), Positives = 129/340 (37%), Gaps = 69/340 (20%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGL-------AVALMILVDH-------AGGDWPE 61
I P+V E RL SLD RG A A++ +D AG W +
Sbjct: 27 IPSPNVPVSPETGQ-PAGRLVSLDALRGFDMFWIVGAGAVIQSLDKMCRTPFTAGLAW-Q 84
Query: 62 ISHAPWNGCNLADFVMPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGIL 118
H W G + D + P FLFI+G++I +L + +A + +V R++ L G+L
Sbjct: 85 FKHVHWKGLHCYDVIFPLFLFIIGISIVFSLDKALATGGKAQVLTRVARRSVLLFALGVL 144
Query: 119 LQGGFSHA-PDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
GGF P+ ++L GVL RIAL YL +L+ F + + + I
Sbjct: 145 YYGGFMKPWPN----------VQLGGVLPRIALCYLAAALIYTFIRSTRGLLAAAAALLI 194
Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC 237
Y ALL PD Q K R + P
Sbjct: 195 ----------------GYWALLAFVPFPDLQLR----------KPVVEEIAERIGSDSPA 228
Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
+ +V G+ Y R+ A D + P RK F EGLLS++
Sbjct: 229 AIAAAVPERVHGLYEEY------RNLANYVDFLYM-PGRK-----AQFYFINEGLLSTIP 276
Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQW-VTMGFALLIFG 336
SI ++ G G ++ + K R W V G A ++ G
Sbjct: 277 SIALSLFGAVAGLLLKNQKVLPRRKIAWLVGAGVAFIVLG 316
>gi|256840847|ref|ZP_05546355.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|298376669|ref|ZP_06986624.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
gi|256738119|gb|EEU51445.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|298266547|gb|EFI08205.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
Length = 378
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 71/146 (48%), Gaps = 27/146 (18%)
Query: 31 KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
K RL SLD+ RG L V M+L + HA D P SH W G + D V
Sbjct: 5 KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63
Query: 77 MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
MP FLF+ GV++ +L R +PD+ +++ R L L +G++ QG + PD
Sbjct: 64 MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQGNLLALDPD---- 119
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLV 158
R+ LQ IA+ YL+ SL+
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLL 141
>gi|301309931|ref|ZP_07215870.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423340410|ref|ZP_17318149.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
CL09T03C24]
gi|300831505|gb|EFK62136.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409227845|gb|EKN20741.1| hypothetical protein HMPREF1059_04074 [Parabacteroides distasonis
CL09T03C24]
Length = 378
Score = 60.5 bits (145), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 50/146 (34%), Positives = 71/146 (48%), Gaps = 27/146 (18%)
Query: 31 KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
K RL SLD+ RG L V M+L + HA D P SH W G + D V
Sbjct: 5 KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63
Query: 77 MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
MP FLF+ GV++ +L R +PD+ +++ R L L +G++ QG + PD
Sbjct: 64 MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVLLLWVFGMMCQGNLLALDPD---- 119
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLV 158
R+ LQ IA+ YL+ SL+
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLL 141
>gi|195167204|ref|XP_002024424.1| GL15027 [Drosophila persimilis]
gi|194107797|gb|EDW29840.1| GL15027 [Drosophila persimilis]
Length = 493
Score = 60.1 bits (144), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 27/50 (54%), Positives = 34/50 (68%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
+ +RL SLD FRGL++ LMI V+ GG + I HA WNG +LAD V PF
Sbjct: 181 QRKRLRSLDTFRGLSIVLMIFVNSGGGGYAWIEHAAWNGLHLADLVFPFL 230
>gi|343082821|ref|YP_004772116.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342351355|gb|AEL23885.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 367
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 61/216 (28%), Positives = 92/216 (42%), Gaps = 44/216 (20%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVD------HAGGDWP-------EISHAPWNGCNLA 73
+ +K R+ S+D RG + +I D H GG P + SH W G
Sbjct: 3 EQAIKPNRILSIDALRGFDMLFIIFADRFFALLHKGGQTPFTGFLANQFSHPDWFGSTFY 62
Query: 74 DFVMPFFLFIVGVAIALAL-KRIPD---RADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
D +MP FLF+VG I +L KR+ + +A KK+ R L L F G ++QG
Sbjct: 63 DIIMPLFLFMVGAVIPFSLSKRMQENTGKAQIYKKLFKRVLILFFLGWIVQGNL------ 116
Query: 130 LTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMA 188
+D+ ++ LQ IA+ Y L I+ GR+ +F A
Sbjct: 117 --LALDINTFKIFSNTLQAIAVGYFFSCLAFIYL-------SRNGRYIMF---------A 158
Query: 189 ACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFN 224
AC L++Y +L VP ++I D +Y F+
Sbjct: 159 AC-LIIYAMILTVPNVPGVGQSVILPDK-NYALYFD 192
>gi|404486905|ref|ZP_11022093.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
YIT 11860]
gi|404335959|gb|EJZ62425.1| hypothetical protein HMPREF9448_02547 [Barnesiella intestinihominis
YIT 11860]
Length = 373
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 74/304 (24%), Positives = 116/304 (38%), Gaps = 95/304 (31%)
Query: 31 KTQRLASLDIFRG-----------LAVALMILVDHAGGD--WPEISHAPWNGCNLADFVM 77
K RL SLD RG L V L L A GD + H PW+G D +
Sbjct: 5 KNTRLLSLDTLRGFDMLFIMGFAPLVVTLNALHPTAVGDVIAGHMRHVPWDGFTQHDMIF 64
Query: 78 PFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFS-HAPDELTY 132
P FLFI G++ +L + + K + R + L+ G L G + PD
Sbjct: 65 PLFLFIAGISFPFSLAKQRGSGSSDKHIYLRVFRRGVTLVLLGFLYNGFLQLNFPD---- 120
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
+RL VL RI L+++ + + + K +SV I + +WL+ A
Sbjct: 121 ------VRLASVLGRIGLAWMFGAFIYMSLK------KSVQYGLIVFILVGYWLLLA--- 165
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINH 252
+VP D+A + L+ N VGYIDR L
Sbjct: 166 ----------FVPA-------PDAAG-----------ASPLSIEGNLVGYIDRHCLPGKL 197
Query: 253 MYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI 312
+Y + F+PEGLLS++ +I++ ++G++ G ++
Sbjct: 198 IYGN------------------------------FDPEGLLSTLPAIVTALLGIYAGEIV 227
Query: 313 IHTK 316
T+
Sbjct: 228 RSTR 231
>gi|114048505|ref|YP_739055.1| hypothetical protein Shewmr7_3014 [Shewanella sp. MR-7]
gi|113889947|gb|ABI43998.1| conserved hypothetical protein [Shewanella sp. MR-7]
Length = 395
Score = 60.1 bits (144), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 95/396 (23%), Positives = 146/396 (36%), Gaps = 107/396 (27%)
Query: 7 ETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHA 55
TT + + + + K RL SLD RG + AL+IL A
Sbjct: 2 STTAPESVANTGVNAQNAAAKKRQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWA 61
Query: 56 GGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFR 108
G W ++ H+ WNG D + P F+F+ GVA+ L+ KR+ +R + I R
Sbjct: 62 GWQWGDTQMHHSEWNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKR 121
Query: 109 TLKLLFWGILLQGGF-SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
LL GIL G+ + AP VD +R VL RIA ++ +L+
Sbjct: 122 LFLLLLLGILYNHGWGTGAP------VDPEKVRYASVLGRIAFAWFFAALL--------- 166
Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
WH + VLV L+ V W
Sbjct: 167 --------------VWHTSLRTQVLVALGILVAYGAVQLW---------------LPFPG 197
Query: 228 GVRAKLNPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
G +L+P + Y+D +L G+++ P
Sbjct: 198 GQAGELSPTESINAYVDSLLLPGVSYQGRTP----------------------------- 228
Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHF--- 341
+PEG+LS++ ++++ + GV GH I+ H KG A++ G L G L
Sbjct: 229 -DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLDGVIP 287
Query: 342 TNGEHGSGKF-------STTCVCLFIYSKVILFQWQ 370
N E + F S + LF Y+ V + +WQ
Sbjct: 288 VNKELWTSSFVLVTSGWSMLLLALF-YAIVDVLKWQ 322
>gi|399069322|ref|ZP_10749357.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
AP07]
gi|398045229|gb|EJL37978.1| Protein of unknown function (DUF1624), partial [Caulobacter sp.
AP07]
Length = 233
Score = 59.7 bits (143), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 79/303 (26%), Positives = 122/303 (40%), Gaps = 92/303 (30%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAG--------GDWPEISHAPWNGCNLADFVMPF 79
S K RLASLD+ RGL + MI+V+ A + + HA W G AD V P
Sbjct: 3 SRPKAARLASLDVLRGLTIVGMIVVNTASYLHYVSGYAVFAGLEHAEWRGFTAADAVFPA 62
Query: 80 FLFIVGVAIALALKRIP------DR------ADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
F+F+ GV+I LAL + +R A+++++ R+ +L G++L + A
Sbjct: 63 FVFMTGVSIPLALGPLALGDGPIERGMAGLDGAALRRLLVRSGRLFLLGLILSNLYWMAT 122
Query: 128 DELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLM 187
E + R GVLQR+AL+ + K + + + + +
Sbjct: 123 PESV------LFRPMGVLQRLALA---FLAAAVLYKTLGPRARMI--------------L 159
Query: 188 AACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKV 247
A +L +Y W T++ G L P N VG+ DR V
Sbjct: 160 AVAILALY-----------WPLTLLPFPD-----------GTTDLLRPGANFVGWFDRAV 197
Query: 248 LGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVH 307
LG H Y H GPL ++PEGLLS++ ++ ++GV
Sbjct: 198 LG-AHTYVH----------------GPLG----------YDPEGLLSTLPAVAQALLGVA 230
Query: 308 FGH 310
G
Sbjct: 231 AGQ 233
>gi|408673239|ref|YP_006872987.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
gi|387854863|gb|AFK02960.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
Length = 370
Score = 59.7 bits (143), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/155 (27%), Positives = 67/155 (43%), Gaps = 25/155 (16%)
Query: 27 KSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLA 73
S TQRL SLD RG + AL H W ++SH WNG
Sbjct: 2 SSSSSTQRLYSLDALRGFDMFWIMGGEDFFHALSEATHHPAAIWIATQLSHVAWNGFRFY 61
Query: 74 DFVMPFFLFIVGV----AIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
D + P FLFI GV ++ +++ D+ ++++I R L L+ G++ G
Sbjct: 62 DLIFPLFLFISGVSTPYSVGREIEKGIDKQAILRRIIKRGLILVLLGVIYNNGLQIK--- 118
Query: 130 LTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKD 164
++ IR VL RI L+Y+ ++ ++
Sbjct: 119 -----ELSQIRFPSVLGRIGLAYMFACIIYVYASQ 148
>gi|167623085|ref|YP_001673379.1| hypothetical protein Shal_1151 [Shewanella halifaxensis HAW-EB4]
gi|167353107|gb|ABZ75720.1| conserved hypothetical protein [Shewanella halifaxensis HAW-EB4]
Length = 398
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 50/171 (29%), Positives = 74/171 (43%), Gaps = 32/171 (18%)
Query: 15 IISEPDVSDQQEKSHLKTQ---------RLASLDIFRG-----------LAVALMILVDH 54
II++P S + L+TQ RL SLD RG + AL++L
Sbjct: 4 IITKPQSSLIESHLKLQTQSIAKSEAKPRLKSLDALRGFDMFWILGGEAIFAALLVLTGW 63
Query: 55 AGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAV-----KKVIF 107
AG W ++ H+ WNG D + P F+F+ GVA+ L+ KR+ K I
Sbjct: 64 AGFKWFDGQMHHSVWNGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPLPPRLPLYKHAIK 123
Query: 108 RTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
R LL +G++ G+ V IR VL RIA ++ +L+
Sbjct: 124 RLFLLLLFGVIYNHGWGTGAS-----FAVGDIRYASVLGRIAFAWFFCALL 169
>gi|418023168|ref|ZP_12662153.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
gi|353537051|gb|EHC06608.1| hypothetical protein Sbal625DRAFT_1278 [Shewanella baltica OS625]
Length = 384
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 144/372 (38%), Gaps = 113/372 (30%)
Query: 34 RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
RL SLD RG + L+IL AG W ++ H+ W+G N D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFNFYDLIFPLF 77
Query: 81 LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSH-----APDELTYGV 134
+F+ GVA+ L+ KR+ + + ++R +K LF +LL ++H AP
Sbjct: 78 IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGTGAP------A 131
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
D IR VL RIA ++ +L+ WH + ++V
Sbjct: 132 DPEKIRYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQIIVA 168
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
L +L G YG + G L+P + Y+D +L G+
Sbjct: 169 -LGILLG-----------------YGAMQLWLPFPGGQAGVLSPTESINAYVDSILLPGV 210
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++ P +PEGLLS++ +I++ + GV GH
Sbjct: 211 SYQGRTP------------------------------DPEGLLSTIPAIVNALTGVFVGH 240
Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHF---TNGEHGSGKF-------STTCVCL 358
++ H KG A++ G LL FG L N E + F S + +
Sbjct: 241 FLVKSHPKGEWAKVGLLAAAGGILLAFGWLLDLVIPVNKELWTSSFVLVTSGWSMILLAV 300
Query: 359 FIYSKVILFQWQ 370
F Y+ V + +WQ
Sbjct: 301 F-YALVDVLKWQ 311
>gi|440731410|ref|ZP_20911431.1| membrane protein [Xanthomonas translucens DAR61454]
gi|440373102|gb|ELQ09871.1| membrane protein [Xanthomonas translucens DAR61454]
Length = 384
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 45/121 (37%), Positives = 66/121 (54%), Gaps = 5/121 (4%)
Query: 33 QRLASLDIFRGLAVALMILVD--HAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+R SLD+FRGL + LMIL + AG D + ++ HAPW G AD P FLF+VG A++
Sbjct: 16 ERFLSLDVFRGLMIFLMILGNTPGAGADAFVQLRHAPWLGFTAADVGFPSFLFVVGNAMS 75
Query: 90 LALKRIPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYG-VDVRMIRLCGVLQR 147
AL R + +V R+ + G L+ F H + ++ + + R+ GVLQR
Sbjct: 76 FALDRSQPLGAFLCRVGKRSALIFLLGFLMYWFPFVHQGADGSWSFIAIDQTRVPGVLQR 135
Query: 148 I 148
I
Sbjct: 136 I 136
>gi|265767324|ref|ZP_06094990.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
gi|263252629|gb|EEZ24141.1| conserved hypothetical protein [Bacteroides sp. 2_1_16]
Length = 375
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 71/154 (46%), Gaps = 25/154 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGC 70
++ S + RLASLDI RG + L++ L W + H W G
Sbjct: 1 MKKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
D VMP FLF+ G ++ + + PD+ +K+I R + L +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL---- 116
Query: 128 DELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D + + L LQ IA YL+ +++++
Sbjct: 117 ----LGLDPKHLYLYSNTLQAIATGYLIAAIIQL 146
>gi|375360501|ref|YP_005113273.1| putative transmembrane protein [Bacteroides fragilis 638R]
gi|301165182|emb|CBW24752.1| putative transmembrane protein [Bacteroides fragilis 638R]
Length = 373
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 71/152 (46%), Gaps = 23/152 (15%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGC 70
++ S + RLASLDI RG + L++ L W + H W G
Sbjct: 1 MKKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALA-LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDE 129
D VMP FLF+ G ++ + K PD+ +K+I R + L +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL------ 114
Query: 130 LTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D + + L LQ IA YL+ +++++
Sbjct: 115 --LGLDPKHLYLYSNTLQAIATGYLIAAIIQL 144
>gi|53715734|ref|YP_101726.1| hypothetical protein BF4455 [Bacteroides fragilis YCH46]
gi|423271955|ref|ZP_17250924.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
CL05T00C42]
gi|423276040|ref|ZP_17254983.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
CL05T12C13]
gi|52218599|dbj|BAD51192.1| conserved hypothetical protein [Bacteroides fragilis YCH46]
gi|392696310|gb|EIY89506.1| hypothetical protein HMPREF1079_04006 [Bacteroides fragilis
CL05T00C42]
gi|392699545|gb|EIY92721.1| hypothetical protein HMPREF1080_03636 [Bacteroides fragilis
CL05T12C13]
Length = 375
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 71/154 (46%), Gaps = 25/154 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGC 70
++ S + RLASLDI RG + L++ L W + H W G
Sbjct: 1 MKKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
D VMP FLF+ G ++ + + PD+ +K+I R + L +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL---- 116
Query: 128 DELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D + + L LQ IA YL+ +++++
Sbjct: 117 ----LGLDPKHLYLYSNTLQAIATGYLIAAIIQL 146
>gi|383119755|ref|ZP_09940493.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
gi|423252290|ref|ZP_17233284.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
CL03T00C08]
gi|423252861|ref|ZP_17233792.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
CL03T12C07]
gi|251944624|gb|EES85099.1| hypothetical protein BSHG_3425 [Bacteroides sp. 3_2_5]
gi|392647563|gb|EIY41262.1| hypothetical protein HMPREF1066_04294 [Bacteroides fragilis
CL03T00C08]
gi|392659230|gb|EIY52856.1| hypothetical protein HMPREF1067_00436 [Bacteroides fragilis
CL03T12C07]
Length = 375
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 71/154 (46%), Gaps = 25/154 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGC 70
++ S + RLASLDI RG + L++ L W + H W G
Sbjct: 1 MKKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
D VMP FLF+ G ++ + + PD+ +K+I R + L +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL---- 116
Query: 128 DELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D + + L LQ IA YL+ +++++
Sbjct: 117 ----LGLDPKHLYLYSNTLQAIATGYLIAAIIQL 146
>gi|336411649|ref|ZP_08592112.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
gi|335941083|gb|EGN02943.1| hypothetical protein HMPREF1018_04130 [Bacteroides sp. 2_1_56FAA]
Length = 375
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 71/154 (46%), Gaps = 25/154 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGC 70
++ S + RLASLDI RG + L++ L W + H W G
Sbjct: 1 MKKPSSTPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
D VMP FLF+ G ++ + + PD+ +K+I R + L +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL---- 116
Query: 128 DELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D + + L LQ IA YL+ +++++
Sbjct: 117 ----LGLDPKHLYLYSNTLQAIATGYLIAAIIQL 146
>gi|255013329|ref|ZP_05285455.1| putative transmembrane protein [Bacteroides sp. 2_1_7]
gi|410103820|ref|ZP_11298741.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
gi|409236549|gb|EKN29356.1| hypothetical protein HMPREF0999_02513 [Parabacteroides sp. D25]
Length = 378
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 71/146 (48%), Gaps = 27/146 (18%)
Query: 31 KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
K RL SLD+ RG L V M+L + HA D P SH W G + D V
Sbjct: 5 KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63
Query: 77 MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
MP FLF+ GV++ +L R +PD+ +++ R + L +G++ QG + PD
Sbjct: 64 MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQGNLLALDPD---- 119
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLV 158
R+ LQ IA+ YL+ SL+
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLL 141
>gi|288800484|ref|ZP_06405942.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
F0039]
gi|288332697|gb|EFC71177.1| conserved hypothetical protein [Prevotella sp. oral taxon 299 str.
F0039]
Length = 409
Score = 59.3 bits (142), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 48/169 (28%), Positives = 82/169 (48%), Gaps = 32/169 (18%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE--------------- 61
S ++ ++ H + RL SLD+ RG +AL++L W E
Sbjct: 25 SSTEIFNKATAPH--SGRLLSLDLLRGADLALLVLFQPIIYQWVEASEPTPGSFGEMVFG 82
Query: 62 -ISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPD-RADAVKKVIFRTLK--LLFW-- 115
I+H PW G D +MP F+F+ G+ I ++ + + A K ++R LK ++ W
Sbjct: 83 QITHVPWEGFCFWDIIMPLFMFMSGITIPFSMGKYQQGKVKADKGFLWRLLKRFVVLWVL 142
Query: 116 GILLQGGFSHAPDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTK 163
G++ QG + L + D R+I L LQ IA+ Y++V+L+ ++T
Sbjct: 143 GMIAQG------NLLLF--DPRLIHLYSNTLQSIAVGYVMVALLFVYTS 183
>gi|311031971|ref|ZP_07710061.1| hypothetical protein Bm3-1_15792 [Bacillus sp. m3-13]
Length = 370
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 71/326 (21%), Positives = 124/326 (38%), Gaps = 90/326 (27%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCNLADFVMPFFL 81
+ +H+ +R S+D+ RG+ V + + V G ++ + HA W G + D V P FL
Sbjct: 1 METNNHITKKRYRSIDVTRGIVVLVSVFVSALPGGAEYDFLRHAYWYGLTITDLVFPAFL 60
Query: 82 FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRL 141
+ G+ +A+ ++ D ++ RT L+ +G+L ++ D+ +R
Sbjct: 61 TVYGIGLAIVYRKGVRWKDLLR----RTFLLVLYGLLFN-------LIASWSFDLSTLRF 109
Query: 142 CGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV--VYLALL 199
GVLQ A++ L V ++ K W ++A +++ YL++L
Sbjct: 110 TGVLQLFAITGLGVVVLSYLAKG------------------WKSMLALGMVIATAYLSIL 151
Query: 200 YGTYVPDWQFTIINKDSADYGKVFNVTC--GVRAKLNPPCNAVGYIDRKVLGINHMYHHP 257
+ +V C GV + CN G +D V G HMY
Sbjct: 152 V---------------------ISSVGCEGGVPQR---DCNPSGVVDVLVFGEKHMYAQG 187
Query: 258 AWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKG 317
F+PEG+LS S++ + G G V+ K
Sbjct: 188 --------------------------EKGFDPEGILSIFSALSNVAFGFAVGLVLNGRKQ 221
Query: 318 HLARLKQWVTMGFALLIFGLTLHFTN 343
L R+ G ++ + L F N
Sbjct: 222 ILQRV-----FGISIGLISLAFIFNN 242
>gi|262381452|ref|ZP_06074590.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262296629|gb|EEY84559.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 378
Score = 58.9 bits (141), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 49/146 (33%), Positives = 71/146 (48%), Gaps = 27/146 (18%)
Query: 31 KTQRLASLDIFRG-----LAVALMIL--VDHAGGDWP-------EISHAPWNGCNLADFV 76
K RL SLD+ RG L V M+L + HA D P SH W G + D V
Sbjct: 5 KLNRLESLDVLRGFDLFCLVVLEMVLHPLAHAI-DMPWFNSFMWGFSHVEWEGFSTWDLV 63
Query: 77 MPFFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTY 132
MP FLF+ GV++ +L R +PD+ +++ R + L +G++ QG + PD
Sbjct: 64 MPLFLFMAGVSMPFSLSRYKDMPDKMAVYRRIGKRVVLLWVFGMMCQGNLLALDPD---- 119
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLV 158
R+ LQ IA+ YL+ SL+
Sbjct: 120 ----RVYLYSNTLQSIAMGYLIASLL 141
>gi|409198223|ref|ZP_11226886.1| hypothetical protein MsalJ2_14356 [Marinilabilia salmonicolor JCM
21150]
Length = 394
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 43/120 (35%), Positives = 64/120 (53%), Gaps = 11/120 (9%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
T R+ S+DI R + VALM+ V+ G W + A +G LAD V P FLF VG+
Sbjct: 7 THRIKSIDILRAITVALMVFVNDLPGIRDIPQWLGHASAGHDGMFLADIVFPLFLFWVGM 66
Query: 87 AIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
+I LA+ + D+ V+ ++ RT L+F G+L+ S E+T G+D + LC
Sbjct: 67 SIPLAVDGRQKKGDSDLTIVRHILKRTFSLVFIGVLMVNT-SRMSVEVT-GIDRNLWALC 124
>gi|60683670|ref|YP_213814.1| hypothetical protein BF4252 [Bacteroides fragilis NCTC 9343]
gi|423259842|ref|ZP_17240765.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
CL07T00C01]
gi|423267497|ref|ZP_17246478.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
CL07T12C05]
gi|60495104|emb|CAH09923.1| putative transmembrane protein [Bacteroides fragilis NCTC 9343]
gi|387775880|gb|EIK37984.1| hypothetical protein HMPREF1055_03042 [Bacteroides fragilis
CL07T00C01]
gi|392696971|gb|EIY90158.1| hypothetical protein HMPREF1056_04165 [Bacteroides fragilis
CL07T12C05]
Length = 375
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 70/154 (45%), Gaps = 25/154 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGC 70
++ S RLASLDI RG + L++ L W + H W G
Sbjct: 1 MKKPSSTPAPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
D VMP FLF+ G ++ + + PD+ +K+I R + L +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL---- 116
Query: 128 DELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D + + L LQ IA YL+ +++++
Sbjct: 117 ----LGLDPKHLYLYSNTLQAIATGYLIAAIIQL 146
>gi|403236334|ref|ZP_10914920.1| hypothetical protein B1040_11244 [Bacillus sp. 10403023]
Length = 373
Score = 58.9 bits (141), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 40/141 (28%), Positives = 69/141 (48%), Gaps = 12/141 (8%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAG-GDWPEISHAPWNGCNLADFVMPFFLF 82
+ ++ R+ SLD+ RG+ V + + G + +HA W G L DF++P F+
Sbjct: 1 MKNTNNSARSRIHSLDMARGIIVVFSVFLSSLPYGSYDFATHASWYGLTLVDFILPCFIT 60
Query: 83 IVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLC 142
+ GV +A+A + + K+ I RT+KL+ +G+L + + D+ +R
Sbjct: 61 VFGVGMAIAYQ----KGVKWKRFISRTIKLILFGLLFN-------IIVAWSFDLSTLRFT 109
Query: 143 GVLQRIALSYLLVSLVEIFTK 163
GVLQ AL + L+ F K
Sbjct: 110 GVLQMYALLGIGTVLITRFIK 130
>gi|298376668|ref|ZP_06986623.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
gi|298266546|gb|EFI08204.1| conserved hypothetical protein [Bacteroides sp. 3_1_19]
Length = 372
Score = 58.5 bits (140), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 86/338 (25%), Positives = 120/338 (35%), Gaps = 106/338 (31%)
Query: 25 QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
EK + QRL SLD RG L VAL L + AG ++ H W
Sbjct: 1 MEKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
NG D + P FLFI G++ +L++ + KK++ R + L+F G++ G
Sbjct: 57 NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116
Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
S D L R VL RI L ++ +L +F + +
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL--LFVRFGWKARAGI----------- 154
Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
A +LV Y ++ VPD G N VGYI
Sbjct: 155 ----TALILVGYWLVMAFVPVPD--------------------AGGAGPFTLEGNLVGYI 190
Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILST 302
DR L P H F+PEGL S+V +I +
Sbjct: 191 DRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATA 219
Query: 303 IIGVHFGHVIIHTKGHLARLKQ---WVTMGFALLIFGL 337
++G+ G I K L K+ V G LLI GL
Sbjct: 220 MLGMFTGEWIKLGKEGLTDRKKVLCLVGAGAVLLIVGL 257
>gi|423282787|ref|ZP_17261672.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
615]
gi|404581658|gb|EKA86354.1| hypothetical protein HMPREF1204_01210 [Bacteroides fragilis HMW
615]
Length = 375
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 42/154 (27%), Positives = 71/154 (46%), Gaps = 25/154 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGC 70
++ S + RLASLDI RG + L++ L W + H W G
Sbjct: 1 MKKPSSPPSPRLASLDILRGFDLFLLVFFQPVLWTLAHQLNLPWLNSILFQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAP 127
D VMP FLF+ G ++ + + PD+ +K+I R + L +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSFSKFKDNPDKGPVYRKIIKRFILLFIFGMIVQGNL---- 116
Query: 128 DELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D + + L LQ IA YL+ +++++
Sbjct: 117 ----LGLDPKHLYLYSNTLQAIATGYLIAAIIQL 146
>gi|260684359|ref|YP_003215644.1| hypothetical protein CD196_2626 [Clostridium difficile CD196]
gi|260688018|ref|YP_003219152.1| hypothetical protein CDR20291_2673 [Clostridium difficile R20291]
gi|260210522|emb|CBA65033.1| putative membrane protein [Clostridium difficile CD196]
gi|260214035|emb|CBE06182.1| putative membrane protein [Clostridium difficile R20291]
Length = 352
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 38/125 (30%), Positives = 63/125 (50%), Gaps = 16/125 (12%)
Query: 49 MILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADA 101
MI+ ++ G +P++ HA W+G LADF PFF+ +GV I ++ LK
Sbjct: 1 MIVCNNPGTWMRMYPQLRHAVWHGVTLADFAFPFFVISLGVTIPISINSKLKNNKSTLSI 60
Query: 102 VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
+ + R++ L+ +G L + P D+ +R+ GVLQR+ L Y + SLV +
Sbjct: 61 ILSIFKRSILLILFGFFLN--YLGNP-------DLNSVRILGVLQRMGLVYFVTSLVYLL 111
Query: 162 TKDVQ 166
K +
Sbjct: 112 LKKLN 116
>gi|392551353|ref|ZP_10298490.1| hypothetical protein PspoU_08780 [Pseudoalteromonas spongiae
UST010723-006]
Length = 379
Score = 58.5 bits (140), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 49/160 (30%), Positives = 74/160 (46%), Gaps = 34/160 (21%)
Query: 21 VSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS--HAPW 67
+SD +K+ +RLASLD RG + AL +L AG E H+ W
Sbjct: 1 MSDTNKKTK---KRLASLDALRGFDMFWILGGEKIFAALFVLTGWAGWKVAEAQTLHSQW 57
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIP-----DRADAVKKVIFRTLKLLFWGILLQGG 122
+G D + P F+F+ GVA+ L+ KRI DR K R L L F+G+L G
Sbjct: 58 HGFTFYDLIFPLFIFLSGVAMGLSPKRIDHLPFVDRKPIYIKAFKRLLLLCFFGVLYNHG 117
Query: 123 FSHA----PDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
+ P+E +R VL RIA+++ + +++
Sbjct: 118 WGTGVPLNPEE---------VRYASVLGRIAVAWFVAAML 148
>gi|339021122|ref|ZP_08645235.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
gi|338751776|dbj|GAA08539.1| hypothetical protein ATPR_1543 [Acetobacter tropicalis NBRC 101654]
Length = 377
Score = 58.2 bits (139), Expect = 6e-06, Method: Compositional matrix adjust.
Identities = 73/278 (26%), Positives = 110/278 (39%), Gaps = 72/278 (25%)
Query: 42 RGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKR 94
RG + M++V++ G W + HA WNGC AD V PFFLF++G I A L+
Sbjct: 2 RGATIVFMVIVNNPGDWNRVWSPLDHAAWNGCTPADLVFPFFLFLMGCVIPFAFDRRLRE 61
Query: 95 IPDRADAVKKVIFRTLKLLFWGILLQG-GFSHAPDELTYGVDVRMIRLCGVLQRIALSYL 153
R+ V + +R L L+ +LL F H V +R GVL RIAL Y+
Sbjct: 62 GAQRSQLVSHIAWRGLALVGLKLLLSLYPFFH----------VTHLRFFGVLTRIALCYV 111
Query: 154 LVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIIN 213
+ + ++ +G +L+ Y A+LY VP +
Sbjct: 112 AAVSLYLCSRKTGFLVSVIG----------------LILLAYWAILYALPVPGLGWP--- 152
Query: 214 KDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEG 273
GK F A L+ N ++DR+ + H K
Sbjct: 153 ------GKDF-------AFLDLNRNMAAWLDRQFSAWCQTWLHTGILYEKT--------- 190
Query: 274 PLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
++PEGLLS++ +I +T+ GV G V
Sbjct: 191 -------------WDPEGLLSTLPAIATTLSGVLAGQV 215
>gi|320105641|ref|YP_004181231.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
gi|319924162|gb|ADV81237.1| hypothetical protein AciPR4_0402 [Terriglobus saanensis SP1PR4]
Length = 406
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 41/106 (38%), Positives = 54/106 (50%), Gaps = 17/106 (16%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW---------NGCNLADFVMPFF 80
L QR+ SLDIFRGL +ALMI V+ EI PW + D V P F
Sbjct: 10 LAPQRILSLDIFRGLNIALMIFVNELA----EIKGLPWWTYHAPGKVDVMTYVDMVFPGF 65
Query: 81 LFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGG 122
LFI+G+AI LAL + D+ + + R+ LL GI+L+ G
Sbjct: 66 LFILGMAIPLALNARIRKGDSPATLLGYIALRSAALLVLGIILENG 111
>gi|380693009|ref|ZP_09857868.1| hypothetical protein BfaeM_03398 [Bacteroides faecis MAJ27]
Length = 376
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 72/155 (46%), Gaps = 26/155 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGC 70
+KS T RLASLDI RG + L++ + P + H W G
Sbjct: 1 MSKKSENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF----WGILLQGGFSHA 126
D VMP FLF+ G ++ +L + + + + V R LK +F +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSLSKYIGTSGSYRPVYRRILKRVFLLFVFGMIVQGNL--- 117
Query: 127 PDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D + I L LQ IA+ YL+ +++++
Sbjct: 118 -----LGLDGKHIYLYSNTLQSIAVGYLIAAVIQL 147
>gi|152999681|ref|YP_001365362.1| hypothetical protein Shew185_1147 [Shewanella baltica OS185]
gi|151364299|gb|ABS07299.1| conserved hypothetical protein [Shewanella baltica OS185]
Length = 384
Score = 58.2 bits (139), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 143/372 (38%), Gaps = 113/372 (30%)
Query: 34 RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
RL SLD RG + L+IL AG W ++ H+ W+G D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYDLIFPLF 77
Query: 81 LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSH-----APDELTYGV 134
+F+ GVA+ L+ KR+ + + ++R +K LF +LL ++H AP
Sbjct: 78 IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGTGAP------A 131
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
D IR VL RIA ++ +L+ WH + ++V
Sbjct: 132 DPEKIRYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQIIVA 168
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
L +L G YG + G L+P + Y+D +L G+
Sbjct: 169 -LGILLG-----------------YGAMQLWLPFPGGQAGVLSPTESINAYVDSILLPGV 210
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++ P +PEGLLS++ +I++ + GV GH
Sbjct: 211 SYQGRTP------------------------------DPEGLLSTIPAIVNALTGVFVGH 240
Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHF---TNGEHGSGKF-------STTCVCL 358
I+ H KG A++ G LL FG L N E + F S + +
Sbjct: 241 FIVKSHPKGEWAKVGLLAAAGGILLAFGWLLDLVIPVNKELWTSSFVLVTSGWSMILLAV 300
Query: 359 FIYSKVILFQWQ 370
F Y+ V + +WQ
Sbjct: 301 F-YALVDVLKWQ 311
>gi|254446502|ref|ZP_05059978.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
DG1235]
gi|198260810|gb|EDY85118.1| hypothetical protein VDG1235_4753 [Verrucomicrobiae bacterium
DG1235]
Length = 394
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 86/330 (26%), Positives = 135/330 (40%), Gaps = 85/330 (25%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS----HAPWNGCNLADFVMPFFLFIVG 85
+K +RL +LD RG + MI+V+ G W + HA W+G D V PFFLF VG
Sbjct: 1 MKRERLLALDALRGFTIIGMIIVNSPG-SWSHVYSPLLHASWHGVTPTDLVFPFFLFFVG 59
Query: 86 VAIALAL------KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMI 139
V+IALA KR +R +K+ +R K+ G+ L +E+
Sbjct: 60 VSIALAYSGKRGTKR--ERVGKYRKIFWRVAKIFALGLFLNLWPYFYFEEM--------- 108
Query: 140 RLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALL 199
R+ GVLQRIAL + + +++ + T+ Q W + A +L+ Y ALL
Sbjct: 109 RVAGVLQRIALVFGVCAILFLNTRWKQQ--------------LW---VGASILLGYWALL 151
Query: 200 YGTYVPDWQFTIINKDSAD-------YGKVFNVTCGVRAK------LNPPCNAVGYIDRK 246
VP +N + + YG V+ R + P N ++DR
Sbjct: 152 VWVPVP---LDEVNAGALETGIVERSYGTEVAVSVEARGETSIAGNFEPGVNIAAWVDRV 208
Query: 247 VLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
+L W R+ ++PEGLLS+V ++ + I G+
Sbjct: 209 LL------PGGMWERT------------------------WDPEGLLSTVPAVATGIFGM 238
Query: 307 HFGHVIIHTKGHLARLKQWVTMGFALLIFG 336
G +I+ R+ +G L+ G
Sbjct: 239 LVGALILGVGDPYRRVSWVFFVGVVALLIG 268
>gi|343086706|ref|YP_004776001.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342355240|gb|AEL27770.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 368
Score = 57.8 bits (138), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 33/96 (34%), Positives = 49/96 (51%), Gaps = 15/96 (15%)
Query: 31 KTQRLASLDIFRGLAVALMI-----LVDHAGGDWPEIS----------HAPWNGCNLADF 75
+RL SLD +RG+ + L++ L G +PE+S H PWNG D
Sbjct: 8 SNKRLVSLDAYRGITMFLLVAESARLYGAFEGLFPEVSGWQMFFTQFTHHPWNGLRFWDL 67
Query: 76 VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
+ PFF+FIVGVA+ +L + ++ +KV LK
Sbjct: 68 IQPFFMFIVGVAMPFSLNKRLEKQGDRRKVTLHILK 103
>gi|392402534|ref|YP_006439146.1| Protein of unknown function DUF2261, transmembrane [Turneriella
parva DSM 21527]
gi|390610488|gb|AFM11640.1| Protein of unknown function DUF2261, transmembrane [Turneriella
parva DSM 21527]
Length = 396
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 50/143 (34%), Positives = 70/143 (48%), Gaps = 30/143 (20%)
Query: 37 SLDIFRGLAVALMILVDHAGGDWPE----ISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
SLD+ RGL +ALMI+V++ G DW + HA W+G AD V P FLF+ G A AL +
Sbjct: 5 SLDLLRGLTIALMIIVNNPG-DWKAMFAVLRHAEWHGFLGADIVFPLFLFVAGYAAALKI 63
Query: 93 KRI--------PDRADAVK-----------KVIFRTLKLLFWGILLQG-GFSHAPD-ELT 131
R+ P A A+ ++ R L G+ L PD E +
Sbjct: 64 DRLYGPTTAGGPHCASALTLEERELPAYYLPLMRRAAILFLIGLFLNAWPLGLLPDTEFS 123
Query: 132 YGVDVRMIRLCGVLQRIALSYLL 154
+G +R+ GVLQRIA+ L+
Sbjct: 124 FG----HLRVLGVLQRIAICVLV 142
>gi|423280893|ref|ZP_17259805.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
610]
gi|404583534|gb|EKA88212.1| hypothetical protein HMPREF1203_04022 [Bacteroides fragilis HMW
610]
Length = 375
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 65/142 (45%), Gaps = 25/142 (17%)
Query: 34 RLASLDIFRGLAVALMI--------LVDHAGGDW-----PEISHAPWNGCNLADFVMPFF 80
RLASLDI RG + L++ L W + H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGFRFWDLVMPLF 70
Query: 81 LFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
LF+ G ++ + + PD+ +K+I R + L +G+++QG G++ +
Sbjct: 71 LFMTGASMPFSFSKFKDDPDKGTIYRKIIRRFILLFIFGMIVQGNL--------LGLNPK 122
Query: 138 MIRL-CGVLQRIALSYLLVSLV 158
+ L LQ IA YL+ +++
Sbjct: 123 YLYLYSNTLQAIATGYLIAAII 144
>gi|255013328|ref|ZP_05285454.1| hypothetical protein B2_05430 [Bacteroides sp. 2_1_7]
gi|410103821|ref|ZP_11298742.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
gi|409236550|gb|EKN29357.1| hypothetical protein HMPREF0999_02514 [Parabacteroides sp. D25]
Length = 372
Score = 57.8 bits (138), Expect = 9e-06, Method: Compositional matrix adjust.
Identities = 87/341 (25%), Positives = 124/341 (36%), Gaps = 112/341 (32%)
Query: 25 QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
EK + QRL SLD RG L VAL L + AG ++ H W
Sbjct: 1 MEKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
NG D + P FLFI G++ +L++ + KK++ R + L+F G++ G
Sbjct: 57 NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116
Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
S D L R VL RI L ++ +L +F + W
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGW 148
Query: 184 HWLMAACVLVV---YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
VL++ +LA+ + VPD G N V
Sbjct: 149 KVRAGITVLILVGYWLAMAF-VPVPD--------------------VGGAGPFTLEGNLV 187
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSI 299
GYIDR L P H F+PEGL S+V +I
Sbjct: 188 GYIDRLFL-------------------------------PGRLHETVFDPEGLFSTVPAI 216
Query: 300 LSTIIGVHFGHVI-IHTKGHLARLKQ--WVTMGFALLIFGL 337
+ ++G+ G I + +G R K+ V G LLI GL
Sbjct: 217 ATAMLGMFTGEWIKLRKEGLTDRKKELCLVGAGAVLLIVGL 257
>gi|301309930|ref|ZP_07215869.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|423340409|ref|ZP_17318148.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
CL09T03C24]
gi|300831504|gb|EFK62135.1| conserved hypothetical protein [Bacteroides sp. 20_3]
gi|409227844|gb|EKN20740.1| hypothetical protein HMPREF1059_04073 [Parabacteroides distasonis
CL09T03C24]
Length = 372
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 86/338 (25%), Positives = 120/338 (35%), Gaps = 106/338 (31%)
Query: 25 QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
EK + QRL SLD RG L VAL L + AG ++ H W
Sbjct: 1 MEKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
NG D + P FLFI G++ +L++ + KK++ R + L+F G++ G
Sbjct: 57 NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116
Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
S D L R VL RI L ++ +L +F + W
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGW 148
Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
VL++ L +VP D+ G N VGYI
Sbjct: 149 KVRAGITVLILVGYWLAMAFVP-------VPDAGGAG-----------PFTLEGNLVGYI 190
Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILST 302
DR L P H F+PEGL S+V +I +
Sbjct: 191 DRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATA 219
Query: 303 IIGVHFGHVIIHTKGHLARLKQ---WVTMGFALLIFGL 337
++G+ G I K L K+ V G LLI GL
Sbjct: 220 MLGMFTGEWIKLRKEGLTDRKKVLCLVGAGAVLLIVGL 257
>gi|313149262|ref|ZP_07811455.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|313138029|gb|EFR55389.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
Length = 375
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 40/142 (28%), Positives = 65/142 (45%), Gaps = 25/142 (17%)
Query: 34 RLASLDIFRGLAVALMI--------LVDHAGGDW-----PEISHAPWNGCNLADFVMPFF 80
RLASLDI RG + L++ L W + H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVLWALAHQLNAPWLNSILSQFDHEVWEGFRFWDLVMPLF 70
Query: 81 LFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVR 137
LF+ G ++ + + PD+ +K+I R + L +G+++QG G++ +
Sbjct: 71 LFMTGASMPFSFSKFKDDPDKGPIYRKIIRRFILLFIFGMIVQGNL--------LGLNPK 122
Query: 138 MIRL-CGVLQRIALSYLLVSLV 158
+ L LQ IA YL+ +++
Sbjct: 123 YLYLYSNTLQAIATGYLIAAII 144
>gi|113971267|ref|YP_735060.1| hypothetical protein Shewmr4_2932 [Shewanella sp. MR-4]
gi|113885951|gb|ABI40003.1| conserved hypothetical protein [Shewanella sp. MR-4]
Length = 395
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 92/378 (24%), Positives = 143/378 (37%), Gaps = 97/378 (25%)
Query: 20 DVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAP 66
+ D K RL SLD RG + AL+IL AG W ++ H+
Sbjct: 15 NAQDAAAKKSQSKPRLMSLDALRGFDMFWILGGEALFGALLILTGWAGWQWGDTQMHHSE 74
Query: 67 WNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSH 125
WNG D + P F+F+ GVA+ L+ KR+ + ++R +K LF +LL ++H
Sbjct: 75 WNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMHERMPVYRHGIKRLFLLLLLGILYNH 134
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
D +R VL RIA ++ +L+ T R
Sbjct: 135 GWGT-GVPADPEKVRYASVLGRIAFAWFFAALLVWHTS--------------LRTQV--- 176
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
L+A +LV Y A+ P Q GV L+P + Y+D
Sbjct: 177 LVALGILVAYGAMQLWLPFPGGQ------------------AGV---LSPTESINAYVDS 215
Query: 246 KVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTII 304
+L G+++ P +PEG+LS++ ++++ +
Sbjct: 216 LLLPGVSYQGRTP------------------------------DPEGVLSTLPAVVNALA 245
Query: 305 GVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHF---TNGEHGSGKF-------S 352
GV GH I+ H KG A++ G L G L N E + F S
Sbjct: 246 GVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLDGVIPVNKELWTSSFVLVTSGWS 305
Query: 353 TTCVCLFIYSKVILFQWQ 370
+ LF Y+ V + +WQ
Sbjct: 306 MLLLALF-YALVDVLKWQ 322
>gi|294463099|gb|ADE77087.1| unknown [Picea sitchensis]
Length = 218
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 24/53 (45%), Positives = 38/53 (71%)
Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTL 339
F+PEGLLSS+ ++++ IG+HFGH+++H KGH R+ Q + L+ FG+ L
Sbjct: 44 FDPEGLLSSIMAVVTCFIGLHFGHILVHFKGHSERVLQCIIPSLGLIFFGIAL 96
>gi|423345098|ref|ZP_17322787.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
CL03T12C32]
gi|409222884|gb|EKN15821.1| hypothetical protein HMPREF1060_00459 [Parabacteroides merdae
CL03T12C32]
Length = 376
Score = 57.4 bits (137), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 70/143 (48%), Gaps = 25/143 (17%)
Query: 33 QRLASLDIFRGLAVALMILVD---HAGG---DWP-------EISHAPWNGCNLADFVMPF 79
QRL SLD+ RG + ++ ++ H+ G D P SH W G + D VMP
Sbjct: 6 QRLESLDVLRGFDLFCLVALEGVLHSLGRAIDAPWYNDFLWGFSHVQWEGFSSWDLVMPL 65
Query: 80 FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
F+F+ GV++ AL R +PD+ ++++ R L +G++ QG PD
Sbjct: 66 FMFMAGVSMPFALSRYKAMPDKWAVYRRIVKRVALLWIFGMMCQGNLLGLDPD------- 118
Query: 136 VRMIRLCGVLQRIALSYLLVSLV 158
R+ LQ IA+ YL+ +++
Sbjct: 119 -RIYLYSNTLQAIAMGYLISAML 140
>gi|262381451|ref|ZP_06074589.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262296628|gb|EEY84558.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 372
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 85/338 (25%), Positives = 119/338 (35%), Gaps = 106/338 (31%)
Query: 25 QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
E+ + QRL SLD RG L VAL L + AG ++ H W
Sbjct: 1 MERQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
NG D + P FLFI G++ +L++ + KK++ R + L+F G++ G
Sbjct: 57 NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116
Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
S D L R VL RI L ++ +L +F + +
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL--LFVRFGWKARAGI----------- 154
Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
A +LV Y + VPD G N VGYI
Sbjct: 155 ----TALILVGYWLAMAFVPVPD--------------------AGGAGPFTLEGNLVGYI 190
Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILST 302
DR L P H F+PEGL S+V +I +
Sbjct: 191 DRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATA 219
Query: 303 IIGVHFGHVIIHTKGHLARLKQ---WVTMGFALLIFGL 337
++G+ G I K L K+ V G LLI GL
Sbjct: 220 MLGMFTGEWIKLRKEGLTDRKKVLCLVGAGAVLLIVGL 257
>gi|116331948|ref|YP_801666.1| hypothetical protein LBJ_2457 [Leptospira borgpetersenii serovar
Hardjo-bovis str. JB197]
gi|116125637|gb|ABJ76908.1| Conserved hypothetical protein [Leptospira borgpetersenii serovar
Hardjo-bovis str. JB197]
Length = 363
Score = 57.0 bits (136), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 80/304 (26%), Positives = 119/304 (39%), Gaps = 87/304 (28%)
Query: 44 LAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL--KRIPDR 98
+ V MILV++ G + + HA WNGC D V PFFLF VG +I ++L K +R
Sbjct: 1 MTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFFLFAVGASIPISLYSKNGINR 60
Query: 99 ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
+ R + L +L G F + E T+ +R+ GVLQRI Y +V+
Sbjct: 61 IRIWIGICIRGISL-----ILLGLFLNFFGEWTF----SELRIPGVLQRIGFVYWVVA-- 109
Query: 159 EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD 218
++F ++ VLV + +L V W T I
Sbjct: 110 -----------------TLFLVFP-----GKKVLVFLIPIL---LVHTWILTHIAPPGES 144
Query: 219 YGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKD 278
L + +IDR + G H+ W+ SK
Sbjct: 145 -----------MVSLEQGKDIGAWIDRTIFGEKHL-----WKFSKT-------------- 174
Query: 279 APSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
++PEG LS ++SI +++ GV G ++ +G R K V L IFGL
Sbjct: 175 --------WDPEGFLSGIASIATSLFGVICGFILFRREG---RGKNRV-----LSIFGLG 218
Query: 339 LHFT 342
FT
Sbjct: 219 FLFT 222
>gi|291295418|ref|YP_003506816.1| hypothetical protein [Meiothermus ruber DSM 1279]
gi|290470377|gb|ADD27796.1| conserved hypothetical protein [Meiothermus ruber DSM 1279]
Length = 399
Score = 57.0 bits (136), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 50/157 (31%), Positives = 75/157 (47%), Gaps = 17/157 (10%)
Query: 14 LIISEPDVS---DQQEKSHLKTQRLASLDIFRGLAVALMILVDH-----AGGDWPEISHA 65
++I++P + + + K+ RL +LD RGL V LM+LV++ A D ++ HA
Sbjct: 1 MMIAQPSTAIAVESKNKATPAGARLLALDGLRGLTVFLMLLVNNLALQEATPD--QLVHA 58
Query: 66 PWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
P+ G LAD V P+FLF +G AI A D+ + I L G F
Sbjct: 59 PFGGVTLADLVFPWFLFCMGAAIPYAASSF-DKQKLPLWRRLLRILRRTSLIFLLGLF-- 115
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFT 162
LT + + GVLQ IAL+Y L +L + +
Sbjct: 116 ----LTSALARTPVFALGVLQLIALAYCLAALFYLIS 148
>gi|117921549|ref|YP_870741.1| hypothetical protein Shewana3_3111 [Shewanella sp. ANA-3]
gi|117613881|gb|ABK49335.1| conserved hypothetical protein [Shewanella sp. ANA-3]
Length = 395
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 91/396 (22%), Positives = 145/396 (36%), Gaps = 107/396 (27%)
Query: 7 ETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHA 55
TT + + + + K RL SLD RG + AL++L A
Sbjct: 2 STTAPESITNTGVNAQEAAAKKRQSKPRLMSLDALRGFDMFWILGGEALFGALLMLTGWA 61
Query: 56 GGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKL 112
G W ++ H+ WNG D + P F+F+ GVA+ L+ KR+ + ++R +K
Sbjct: 62 GWQWGDTQMHHSEWNGFRFYDLIFPLFIFLSGVALGLSPKRLDKLPMQERMPVYRHGIKR 121
Query: 113 LFWGILLQGGFSH-----APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD 167
LF +LL ++H AP D +R VL RIA ++ +L+
Sbjct: 122 LFLLLLLGILYNHGWGTGAP------ADPEKVRYASVLGRIAFAWFFAALL--------- 166
Query: 168 KDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTC 227
WH + VLV L+ V W
Sbjct: 167 --------------VWHTSLRTQVLVALGILVAYGAVQLW---------------LPFPG 197
Query: 228 GVRAKLNPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP 286
G L+P + Y+D +L G+++ P
Sbjct: 198 GQAGVLSPTESINAYVDSLLLPGVSYQGRTP----------------------------- 228
Query: 287 FEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHF--- 341
+PEG+LS++ ++++ + GV GH I+ H KG A++ G L G L
Sbjct: 229 -DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLSVAGGVCLALGWLLGGVIP 287
Query: 342 TNGEHGSGKF-------STTCVCLFIYSKVILFQWQ 370
N E + F S + LF Y+ V + +WQ
Sbjct: 288 VNKELWTSSFVLVTSGWSMLLLALF-YALVDVLKWQ 322
>gi|313225183|emb|CBY20977.1| unnamed protein product [Oikopleura dioica]
Length = 335
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 36/119 (30%), Positives = 60/119 (50%), Gaps = 27/119 (22%)
Query: 42 RGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA 101
RG+A+ +MI V++ GG + HA W G +AD MP+F+F++GV++ + +
Sbjct: 2 RGIAIGIMIFVNYGGGGYWFFDHAVWFGLTVADLAMPWFMFMMGVSLTFSFNSM------ 55
Query: 102 VKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
VKKV+ + L+ T+G GVLQR A+ Y +VS +++
Sbjct: 56 VKKVLRLSYNLV---------------NPTFGT------FPGVLQRFAICYAVVSPLQL 93
>gi|311746093|ref|ZP_07719878.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
gi|126576311|gb|EAZ80589.1| hypothetical protein ALPR1_06685 [Algoriphagus sp. PR1]
Length = 367
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/105 (35%), Positives = 55/105 (52%), Gaps = 19/105 (18%)
Query: 31 KTQRLASLDIFRGLAVALMIL--------VDHAGGD-------WPEISHAPWNGCNLADF 75
K+ RL SLD FRGL + L+I + A D + + +H PWNG D
Sbjct: 7 KSGRLVSLDTFRGLTMFLLIAEAAFVYESLLEAFPDPGILNSFFTQFTHHPWNGLRFWDL 66
Query: 76 VMPFFLFIVGVAIALAL-KRI---PDRADAVKKVIFRTLKLLFWG 116
+ PFF+FIVGVA+ +L KR+ +R++ K ++ R L +G
Sbjct: 67 IQPFFMFIVGVAMPFSLNKRLENQENRSEVTKHILKRCFYLFLFG 111
>gi|380025576|ref|XP_003696546.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like
[Apis florea]
Length = 298
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 55/200 (27%), Positives = 90/200 (45%), Gaps = 54/200 (27%)
Query: 139 IRLCGVLQRIALSYLLVSLVE-IFTKDVQDKDQSVGRFSIFR--LYCW-HWLMAACVLVV 194
+R GVLQ + +SY + +++E IF K GRF++FR L W WL+ A ++
Sbjct: 13 LRFPGVLQLLGVSYFVCAILETIFMK----PHSQFGRFAMFRDILESWPQWLIMAGIVTT 68
Query: 195 YLALLY---------GTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
+ + + G + P ++ GK N T G A GYIDR
Sbjct: 69 HTLITFLLPISNCPKGYFGPGGEYHF-------RGKYMNCTAG----------AAGYIDR 111
Query: 246 KVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIG 305
+ G NH Y+H T++ + LR D PEGL++++S+I +G
Sbjct: 112 LIFG-NHTYNH---------TENFLYGQILRYD----------PEGLMNTISAIFIVYLG 151
Query: 306 VHFGHVIIHTKGHLARLKQW 325
VH G +++ +R+ +W
Sbjct: 152 VHAGKILLLYYQCNSRVIRW 171
>gi|154492357|ref|ZP_02031983.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
43184]
gi|154087582|gb|EDN86627.1| hypothetical protein PARMER_01991 [Parabacteroides merdae ATCC
43184]
Length = 376
Score = 56.6 bits (135), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 70/143 (48%), Gaps = 25/143 (17%)
Query: 33 QRLASLDIFRGLAVALMILVD---HAGG---DWPE-------ISHAPWNGCNLADFVMPF 79
QRL SLD+ RG + ++ ++ H G D P SH W+G + D VMP
Sbjct: 6 QRLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWDGFSSWDLVMPL 65
Query: 80 FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
F+F+ GV++ AL R +PD+ ++++ R L +G++ QG PD
Sbjct: 66 FMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNLLGLDPD------- 118
Query: 136 VRMIRLCGVLQRIALSYLLVSLV 158
R+ LQ IA+ YL+ +++
Sbjct: 119 -RIYLYSNTLQAIAMGYLISAML 140
>gi|157960931|ref|YP_001500965.1| hypothetical protein Spea_1103 [Shewanella pealeana ATCC 700345]
gi|157845931|gb|ABV86430.1| conserved hypothetical protein [Shewanella pealeana ATCC 700345]
Length = 394
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 51/168 (30%), Positives = 79/168 (47%), Gaps = 30/168 (17%)
Query: 15 IISEPDVSDQQEKSHLKTQ-----RLASLDIFRGLAV-----------ALMILVDHAGGD 58
II +P VS + ++ RL SLD RG + AL++L AG +
Sbjct: 4 IIDKPRVSIASVAAESVSKPAAKPRLKSLDALRGFDMFWILGGEAIFAALLLLTGWAGFN 63
Query: 59 W--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLK 111
W ++ H+ W+G D + P F+F+ GVA+ L+ KR +P R + I R L
Sbjct: 64 WFDSQMHHSTWHGFTFYDLIFPLFIFLSGVALGLSPKRLDKLPLPQRMPLYQHAIKRLLL 123
Query: 112 LLFWGILLQGGF-SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
LL +G++ G+ + AP L IR VL RIA ++ +L+
Sbjct: 124 LLLFGVIYNHGWGTGAPFAL------GDIRYASVLGRIAFAWFFCALL 165
>gi|424665544|ref|ZP_18102580.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
616]
gi|404574617|gb|EKA79366.1| hypothetical protein HMPREF1205_01419 [Bacteroides fragilis HMW
616]
Length = 375
Score = 56.2 bits (134), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 39/146 (26%), Positives = 68/146 (46%), Gaps = 25/146 (17%)
Query: 32 TQRLASLDIFRGLAVALMI--------LVDHAGGDWP-----EISHAPWNGCNLADFVMP 78
+ RLASLDI RG + L++ L W + H W G D VMP
Sbjct: 9 SPRLASLDILRGFDLFLLVFFQPVLWALAHQLNLPWLNSILFQFDHEVWEGFRFWDLVMP 68
Query: 79 FFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
FLF+ G ++ + + PD+ +K++ R + L +G+++QG G++
Sbjct: 69 LFLFMTGASMPFSFSKFKDDPDKGPIYRKILKRFILLFIFGMIVQGNL--------LGLN 120
Query: 136 VRMIRL-CGVLQRIALSYLLVSLVEI 160
+ + L LQ IA YL+ +++++
Sbjct: 121 PKYLYLYSNTLQAIATGYLIAAIIQL 146
>gi|345881756|ref|ZP_08833266.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
gi|343918415|gb|EGV29178.1| hypothetical protein HMPREF9431_01930 [Prevotella oulorum F0390]
Length = 380
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 49/176 (27%), Positives = 81/176 (46%), Gaps = 33/176 (18%)
Query: 31 KTQRLASLDIFRGLAVALMILVDH-----------AGGD-----WPEISHAPWNGCNLAD 74
+ QRL SLDI RG +A+++L+ A G ++SH PW G D
Sbjct: 9 QPQRLLSLDILRGADLAMLVLIQPILFRALKTAHPAEGTIGHFIMGQLSHLPWEGFCFWD 68
Query: 75 FVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK--LLFW--GILLQGGFSHAPDEL 130
+MP F+F+ G+ I A+ R A + +R +K ++ W G+++QG
Sbjct: 69 IIMPLFMFMSGITIPFAMSRYKRGARIDGQFYWRIIKRFVVLWVLGMVVQGNL------- 121
Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIF----TKDVQDKDQSVGRFSIFRLY 181
D+R + L LQ IA+ Y+ V+ + +F T+ V + +IF L+
Sbjct: 122 -LAFDLRQLHLFSNTLQSIAVGYVAVAFLFVFCSLRTQIVAVSLSFIAYIAIFALW 176
>gi|373948546|ref|ZP_09608507.1| Protein of unknown function DUF2261, transmembrane [Shewanella
baltica OS183]
gi|386325609|ref|YP_006021726.1| hypothetical protein [Shewanella baltica BA175]
gi|333819754|gb|AEG12420.1| Protein of unknown function DUF2261, transmembrane [Shewanella
baltica BA175]
gi|373885146|gb|EHQ14038.1| Protein of unknown function DUF2261, transmembrane [Shewanella
baltica OS183]
Length = 384
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 89/372 (23%), Positives = 142/372 (38%), Gaps = 113/372 (30%)
Query: 34 RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
RL SLD RG + L+IL AG W ++ H+ W+G D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFRFYDLIFPLF 77
Query: 81 LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSH-----APDELTYGV 134
+F+ GVA+ L+ KR+ + + ++R +K LF +LL ++H AP
Sbjct: 78 IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGTGAP------A 131
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
D IR VL RIA ++ +L+ WH + ++V
Sbjct: 132 DPEKIRYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQIIVA 168
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
L +L G YG + G L+P + Y+D +L G+
Sbjct: 169 -LGILLG-----------------YGAIQLWLPFPGGQAGVLSPTESINAYVDSILLPGV 210
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++ P +PEGLLS++ +I++ + GV GH
Sbjct: 211 SYQGRTP------------------------------DPEGLLSTIPAIVNALAGVFVGH 240
Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHF---TNGEHGSGKF-------STTCVCL 358
I+ H KG A++ G L+ G L N E + F S + +
Sbjct: 241 FIVKSHPKGEWAKVGVLAAAGGIFLVLGWLLDLVIPVNKELWTSSFVLVTSGWSMILLAV 300
Query: 359 FIYSKVILFQWQ 370
F Y+ V + +WQ
Sbjct: 301 F-YALVDVLKWQ 311
>gi|392965639|ref|ZP_10331058.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
gi|387844703|emb|CCH53104.1| hypothetical protein BN8_02168 [Fibrisoma limi BUZ 3]
Length = 411
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/112 (32%), Positives = 60/112 (53%), Gaps = 9/112 (8%)
Query: 15 IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDH-----AGGDWPEISHAPWNG 69
+I + V+ + + L +R+ S+DIFR L + MI V+ DW E S A +
Sbjct: 1 MIDQQVVASPETRRQLPEKRVHSIDIFRALTMLFMIFVNDLWTLIGIPDWLEHSPADVDF 60
Query: 70 CNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGI 117
LAD V P FLFIVG++I A++ + D+ ++ ++ R++ LL G+
Sbjct: 61 LGLADVVFPCFLFIVGMSIPFAIQGRLAKGDSYGLIIRHIVVRSVALLIMGV 112
>gi|284036950|ref|YP_003386880.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
gi|283816243|gb|ADB38081.1| hypothetical protein Slin_2036 [Spirosoma linguale DSM 74]
Length = 404
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 41/132 (31%), Positives = 69/132 (52%), Gaps = 16/132 (12%)
Query: 16 ISEPDVSDQQE-KSHLKTQRLASLDIFRGLAVALMILVDH-----AGGDWPEISHAP--W 67
+++ + +D+Q +S + R+ ++DI R + + LMI V+ A DW E H P
Sbjct: 1 MTQLETADRQAYRSSSVSMRVDAIDILRAMTMILMIFVNDLWSLTAIPDWLE--HVPHGV 58
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
+G LAD V P FLFIVG+++ A+ + D V +I R++ LL G+ L G
Sbjct: 59 DGIGLADVVFPGFLFIVGMSLPFAMNARRQKGDTNSALVSHIIMRSIALLVMGVFLVNG- 117
Query: 124 SHAPDELTYGVD 135
+ D+ G++
Sbjct: 118 -ESIDQKATGIN 128
>gi|343087500|ref|YP_004776795.1| hypothetical protein [Cyclobacterium marinum DSM 745]
gi|342356034|gb|AEL28564.1| Protein of unknown function DUF2261, transmembrane [Cyclobacterium
marinum DSM 745]
Length = 386
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 80/333 (24%), Positives = 128/333 (38%), Gaps = 94/333 (28%)
Query: 33 QRLASLDIFRGL-------AVALMILVDHAGG----DW--PEISHAPWNGCNLADFVMPF 79
+RL S+D RG A A ++L+ G DW + H WNG + DF+ P
Sbjct: 25 KRLLSIDALRGFDMLLIAGAGAFLVLLKGKTGIPAIDWIAGQFYHPAWNGFSFYDFIFPL 84
Query: 80 FLFIVGVAIALALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
FLFI GV++ +L + + + KK R L L+ GIL + ++P + +
Sbjct: 85 FLFIAGVSLTFSLNKGRNLGMSKPTLYKKTFSRMLVLILLGIL----YKNSPVPI---FE 137
Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVY 195
IR VL RI ++ + +LV + D + +G +A +LV+Y
Sbjct: 138 PSQIRYGSVLGRIGIATFVTTLVYL----NFDFYKRLG-------------IAMAILVLY 180
Query: 196 LALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYH 255
A L+ VP YG L+ N VG+ DR +
Sbjct: 181 YAALFLIPVP------------GYGA---------GDLSIEGNLVGWFDRTFM------- 212
Query: 256 HPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHT 315
G L+++ ++ GLL+ + ++ TI G G ++
Sbjct: 213 ----------------PGILKQEI-------YDELGLLTQIPALCLTIFGTLAGEILTKA 249
Query: 316 KGHLARLKQWVTMGFALLIFGLT--LHFTNGEH 346
++KQ G L GL LHF +H
Sbjct: 250 WLDTKKIKQLAIAGVISLTLGLIWDLHFPINKH 282
>gi|423722057|ref|ZP_17696233.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
CL09T00C40]
gi|409242759|gb|EKN35519.1| hypothetical protein HMPREF1078_00296 [Parabacteroides merdae
CL09T00C40]
Length = 376
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 69/143 (48%), Gaps = 25/143 (17%)
Query: 33 QRLASLDIFRGLAVALMILVD---HAGG---DWPE-------ISHAPWNGCNLADFVMPF 79
QRL SLD+ RG + ++ ++ H G D P SH W G + D VMP
Sbjct: 6 QRLESLDVLRGFDLFCLVALEGVLHPLGRAIDAPWYNDFLWCFSHVQWEGFSSWDLVMPL 65
Query: 80 FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
F+F+ GV++ AL R +PD+ ++++ R L +G++ QG PD
Sbjct: 66 FMFMAGVSMPFALSRYKVMPDKWAVYRRIVKRVALLWIFGMMCQGNLLGLDPD------- 118
Query: 136 VRMIRLCGVLQRIALSYLLVSLV 158
R+ LQ IA+ YL+ +++
Sbjct: 119 -RIYLYSNTLQAIAMGYLISAML 140
>gi|375148919|ref|YP_005011360.1| hypothetical protein [Niastella koreensis GR20-10]
gi|361062965|gb|AEW01957.1| Protein of unknown function DUF2261, transmembrane [Niastella
koreensis GR20-10]
Length = 397
Score = 56.2 bits (134), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/94 (38%), Positives = 53/94 (56%), Gaps = 9/94 (9%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGD-----WPEISHAPWNGCNLADFVMPFFLFIVGV 86
TQRLAS+D+FR L + LMI V+ G W E + A +G LAD V P FLFIVG+
Sbjct: 5 TQRLASIDVFRALTMLLMIFVNDLGTLKNIPLWLEHTKANEDGMGLADTVFPAFLFIVGL 64
Query: 87 AIALAL----KRIPDRADAVKKVIFRTLKLLFWG 116
+I A+ + +++ + ++ R+ LL G
Sbjct: 65 SIPFAIGNRWAKGASQSNILGHILIRSFALLVMG 98
>gi|332666399|ref|YP_004449187.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332335213|gb|AEE52314.1| Protein of unknown function DUF2261, transmembrane
[Haliscomenobacter hydrossis DSM 1100]
Length = 369
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 44/150 (29%), Positives = 67/150 (44%), Gaps = 25/150 (16%)
Query: 33 QRLASLDIFRGL---------AVALMILVDHAGGDWP----EISHAPWNGCNLADFVMPF 79
QRL SLD RG AV + W ++SH W+G L D + P
Sbjct: 8 QRLYSLDALRGFDMFWIMGAEAVVHSLATATGSSVWEAAAHQLSHPDWHGFRLYDLIFPL 67
Query: 80 FLFIVGVAIALALKRIPD----RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
FLF+ GVA ++ R + + + +VI R L L+ GI+ G P
Sbjct: 68 FLFLAGVATPYSVGRDLENGKPKQQLLLRVIRRGLVLVLLGIIYNNGLVLKP-------- 119
Query: 136 VRMIRLCGVLQRIALSYLLVSLVEIFTKDV 165
+ IR VL RI L+Y+ +++ ++TK +
Sbjct: 120 LAEIRFPSVLGRIGLAYMFANIIYLYTKQL 149
>gi|374312990|ref|YP_005059420.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358755000|gb|AEU38390.1| protein of unknown function DUF1624 [Granulicella mallensis
MP5ACTX8]
Length = 408
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 36/101 (35%), Positives = 55/101 (54%), Gaps = 9/101 (8%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-----WPEISHAPWNGCNLADFVMPFFLFIVG 85
+T R+AS+DIFRGL +A+MI V+ G W + A + D V PFFLFI+G
Sbjct: 14 RTTRVASIDIFRGLTMAIMIFVNDLDGVQGLPWWTHHAKANIDVMTYVDMVFPFFLFIIG 73
Query: 86 VAIALA----LKRIPDRADAVKKVIFRTLKLLFWGILLQGG 122
+++ LA LK+ P V+ R++ L+ G++L
Sbjct: 74 LSMPLAIRQRLKKNPSIPQLWLHVLIRSVSLVALGVILANA 114
>gi|313145390|ref|ZP_07807583.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|423279948|ref|ZP_17258861.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
610]
gi|424661980|ref|ZP_18099017.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
616]
gi|313134157|gb|EFR51517.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|404578291|gb|EKA83026.1| hypothetical protein HMPREF1205_02366 [Bacteroides fragilis HMW
616]
gi|404584284|gb|EKA88949.1| hypothetical protein HMPREF1203_03078 [Bacteroides fragilis HMW
610]
Length = 377
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 68/148 (45%), Gaps = 27/148 (18%)
Query: 33 QRLASLDIFRGLAVALMILVD-------------HAGGDWPEISHAPWNGCNLADFVMPF 79
QRL SLD RGL + ++ + H G + H W G + D +MP
Sbjct: 7 QRLESLDALRGLDLFFLVALGPLLRTLVRAIDSPHLDGVNWCLRHVDWIGFSPWDLIMPL 66
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLK--LLFW--GILLQGG-FSHAPDELTYGV 134
FLF+ G++I AL R AD K+I+R K LL W G++ QG S PD L
Sbjct: 67 FLFMSGISIPFALSRFKGEADK-SKLIYRLCKRVLLLWIFGMMCQGNLLSFDPDHLYLYT 125
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFT 162
+ LQ IA Y+ +L+ ++T
Sbjct: 126 N--------TLQSIATGYIAAALLFLYT 145
>gi|150007980|ref|YP_001302723.1| hypothetical protein BDI_1342 [Parabacteroides distasonis ATCC
8503]
gi|256840846|ref|ZP_05546354.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|423331513|ref|ZP_17309297.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
CL03T12C09]
gi|149936404|gb|ABR43101.1| conserved hypothetical protein [Parabacteroides distasonis ATCC
8503]
gi|256738118|gb|EEU51444.1| conserved hypothetical protein [Parabacteroides sp. D13]
gi|409230083|gb|EKN22951.1| hypothetical protein HMPREF1075_01310 [Parabacteroides distasonis
CL03T12C09]
Length = 372
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 86/338 (25%), Positives = 121/338 (35%), Gaps = 106/338 (31%)
Query: 25 QEKSHLKTQRLASLDIFRG-----------LAVALMILVDH------AGGDWPEISHAPW 67
EK + QRL SLD RG L VAL L + AG ++ H W
Sbjct: 1 MEKQKQQPQRLQSLDALRGFDMLFIMGGASLFVALATLFPNPFFQAIAG----QMEHVEW 56
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGF 123
NG D + P FLFI G++ +L++ + KK++ R + L+F G++ G
Sbjct: 57 NGLAHHDTIFPLFLFIAGISFPFSLEKQRGKGMTEGAIYKKIVRRGITLVFLGLVYNGLL 116
Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
S D L R VL RI L ++ +L +F + W
Sbjct: 117 SFEFDHL---------RCASVLARIGLGWMFAAL-------------------LFVRFGW 148
Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYI 243
VL++ L +VP D+ G N VGYI
Sbjct: 149 KVRAGITVLILVGYWLAMAFVP-------VPDAGGAG-----------PFTLEGNLVGYI 190
Query: 244 DRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILST 302
DR L P H F+PEGL S+V +I +
Sbjct: 191 DRLFL-------------------------------PGRLHETVFDPEGLFSTVPAIATA 219
Query: 303 IIGVHFGHVI-IHTKGHLARLKQ--WVTMGFALLIFGL 337
++G+ G I + +G R K V G LLI GL
Sbjct: 220 MLGMFTGEWIKLRKEGLTDRNKVLCLVGAGAVLLIVGL 257
>gi|423344000|ref|ZP_17321713.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
CL02T12C29]
gi|409213862|gb|EKN06875.1| hypothetical protein HMPREF1077_03143 [Parabacteroides johnsonii
CL02T12C29]
Length = 376
Score = 55.8 bits (133), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 42/143 (29%), Positives = 68/143 (47%), Gaps = 25/143 (17%)
Query: 33 QRLASLDIFRGLAVALMILVD---HAGG-----DWPE-----ISHAPWNGCNLADFVMPF 79
+RL SLD+ RG + ++ ++ H G W SH W G + D VMP
Sbjct: 6 KRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWDLVMPL 65
Query: 80 FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGVD 135
F+F+ GV++ AL R +PD+ +++I R L +G++ QG PD
Sbjct: 66 FMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNLLGLDPD------- 118
Query: 136 VRMIRLCGVLQRIALSYLLVSLV 158
R+ LQ IA+ YL+ +++
Sbjct: 119 -RIYLYSNTLQAIAMGYLIAAML 140
>gi|357624248|gb|EHJ75102.1| putative heparan-alpha-glucosaminide N-acetyltransferase [Danaus
plexippus]
Length = 340
Score = 55.8 bits (133), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 60/251 (23%), Positives = 96/251 (38%), Gaps = 35/251 (13%)
Query: 98 RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
R +A+ +V R+L L GI L + + +R GVLQR+A YL+V
Sbjct: 19 RVNALGQVARRSLLLSLIGICLG----------SVNTNWSYVRFPGVLQRLAAMYLIVGS 68
Query: 158 VE-IFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDS 216
+E F + Q+ F WL ++ + L + P
Sbjct: 69 LECAFMRTSQNIIPGRSLFRDIAAGWQQWLATVLMVAIQLCITLTVAAPGCPV-----GY 123
Query: 217 ADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLR 276
+ G + G + N GYIDR +LG NH+Y H F+ R
Sbjct: 124 SGPGGLHRTATGDFSLQNCTGGIAGYIDRLILGPNHLYQH------------GTFKSIYR 171
Query: 277 KDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFAL-LIF 335
P +PEG+L +S +L G H +++ AR+ +WV ++
Sbjct: 172 TQLPH------DPEGILGILSGVLVVQAGAHAARIMLVYNHARARIMRWVFWSVMFGVVG 225
Query: 336 GLTLHFTNGEH 346
GL F++G +
Sbjct: 226 GLLCKFSDGGY 236
>gi|336312505|ref|ZP_08567454.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
gi|335864011|gb|EGM69129.1| N-acetylglucosamine transporter, NagX [Shewanella sp. HN-41]
Length = 384
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 82/342 (23%), Positives = 130/342 (38%), Gaps = 102/342 (29%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNG 69
++ + RL SLD RG + L+IL AG W ++ H+ W+G
Sbjct: 7 NETATIKVTKPRLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHG 66
Query: 70 CNLADFVMPFFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGF- 123
D + P F+F+ GVA+ L+ KR + +R + I R L LL GIL G+
Sbjct: 67 FRFYDLIFPLFIFLSGVALGLSPKRLDKLPLSERLPVYRHGIKRLLLLLLLGILYNHGWG 126
Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCW 183
+ AP D +R VL RIA ++ +L+ W
Sbjct: 127 TGAP------ADPEKVRYASVLGRIAFAWFFAALL-----------------------VW 157
Query: 184 HWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAV 240
H + ++V L +L G YG + + G L+P +
Sbjct: 158 HTSLRTQIIVA-LGILLG-----------------YGAIQLWLPFSGGQAGVLSPTESIN 199
Query: 241 GYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSI 299
YID +L G+++ + T D PEGLLS++ ++
Sbjct: 200 AYIDSILLPGVSY----------QGRTLD--------------------PEGLLSTIPAV 229
Query: 300 LSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTL 339
++ + GV GH I+ H +G A++ G L G L
Sbjct: 230 VNALAGVFVGHFIVKSHPQGEWAKVGLLAAAGGVCLALGWLL 271
>gi|410099161|ref|ZP_11294133.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
CL02T12C30]
gi|409219183|gb|EKN12146.1| hypothetical protein HMPREF1076_03311 [Parabacteroides goldsteinii
CL02T12C30]
Length = 377
Score = 55.5 bits (132), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 67/244 (27%), Positives = 95/244 (38%), Gaps = 69/244 (28%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMI--------LVDHAGGDWPE-----ISHAPWNGCN 71
EK+ K RL SLD RG + ++ L A W +H W G +
Sbjct: 1 MEKTTYK--RLESLDALRGFDLFFLVALGPLAHSLARAADVGWLNDCMWAFNHVQWEGFS 58
Query: 72 LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK---LLFW--GILLQGG-FSH 125
D +MP FLF+ G ++ AL R +D KK +FR L LL W G++ QG
Sbjct: 59 PWDLIMPLFLFMSGASMPFALSRFKGVSD--KKTLFRRLGKRILLLWIFGMMCQGNLLGF 116
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHW 185
PD R+ LQ IA YL+ +++ ++T + +G
Sbjct: 117 DPD--------RIYLYSNTLQSIAAGYLITAVLFLYT----SRRTQIG------------ 152
Query: 186 LMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDR 245
+A +L+VY A + QF I YG P N +IDR
Sbjct: 153 -VAVALLLVYWAAM--------QFITI----GSYGG---------GNYTPEGNLAEWIDR 190
Query: 246 KVLG 249
VLG
Sbjct: 191 TVLG 194
>gi|390946357|ref|YP_006410117.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
gi|390422926|gb|AFL77432.1| hypothetical protein Alfi_1078 [Alistipes finegoldii DSM 17242]
Length = 369
Score = 55.1 bits (131), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 69/147 (46%), Gaps = 26/147 (17%)
Query: 32 TQRLASLDIFRGLAVALMI----LVDHAGGDWP---------EISHAPWNGCNLADFVMP 78
+RL SLD RG+ + ++ LV WP ++ HA WNG + D + P
Sbjct: 4 NRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQDTIFP 63
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKV---IFRT-LKLLFWGILLQGGFSHAPDELTYGV 134
FLFI GVA +L + R K++ IFR L L G++ G F +
Sbjct: 64 LFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLGMVYNGLFE---------L 114
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIF 161
+ +R+ VL RI L+++ +L+ ++
Sbjct: 115 NFSSLRIASVLGRIGLAWMFAALLCVY 141
>gi|334366956|ref|ZP_08515871.1| putative membrane protein [Alistipes sp. HGB5]
gi|313156833|gb|EFR56273.1| putative membrane protein [Alistipes sp. HGB5]
Length = 370
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 43/147 (29%), Positives = 69/147 (46%), Gaps = 26/147 (17%)
Query: 32 TQRLASLDIFRGLAVALMI----LVDHAGGDWP---------EISHAPWNGCNLADFVMP 78
+RL SLD RG+ + ++ LV WP ++ HA WNG + D + P
Sbjct: 5 NRRLLSLDTLRGVDMFFIMGFSGLVTSLCALWPGSFTDMLASQMQHAAWNGLTIQDTIFP 64
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKV---IFRT-LKLLFWGILLQGGFSHAPDELTYGV 134
FLFI GVA +L + R K++ IFR L L G++ G F +
Sbjct: 65 LFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLGMVYNGLFE---------L 115
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIF 161
+ +R+ VL RI L+++ +L+ ++
Sbjct: 116 NFSSLRIASVLGRIGLAWMFAALLCVY 142
>gi|393784535|ref|ZP_10372698.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
CL02T12C01]
gi|392665516|gb|EIY59040.1| hypothetical protein HMPREF1071_03566 [Bacteroides salyersiae
CL02T12C01]
Length = 378
Score = 55.1 bits (131), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 48/148 (32%), Positives = 74/148 (50%), Gaps = 26/148 (17%)
Query: 31 KTQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGCNLADFVM 77
+ RLASLDI RG + L++ V A G D P + H W G +L D VM
Sbjct: 10 NSSRLASLDILRGFDLFLLVFFQPVFVALGQQLDLPFLNRLVYQFDHEAWVGFHLWDLVM 69
Query: 78 PFFLFIVGVAIALALKRIPDRADA---VKKVIFRTLKLLF-WGILLQGGFSHAPDELTYG 133
P FLF+ G ++ +L + + V + IFR + LLF +G+++QG G
Sbjct: 70 PLFLFMTGASMPFSLSKYKISSAGCQFVYRRIFRRVVLLFLFGMIVQGNL--------LG 121
Query: 134 VDVRMIRL-CGVLQRIALSYLLVSLVEI 160
D + I L LQ IA+ YL+ +++++
Sbjct: 122 FDSQHIYLYSNTLQAIAVGYLIAAIIQL 149
>gi|338212226|ref|YP_004656281.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336306047|gb|AEI49149.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 369
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 53/168 (31%), Positives = 82/168 (48%), Gaps = 31/168 (18%)
Query: 33 QRLASLDIFRGLAVALMI-----LVDHAGGDW---------PEISHAPWNGCNLADFVMP 78
RLASLD RG + LMI + GG + H W+G DF+ P
Sbjct: 8 SRLASLDALRGFDM-LMISGGGAFLSLMGGKTDSALLNAVAAQFHHPDWDGFTFYDFIFP 66
Query: 79 FFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
FLF+ GV+++++LK IP + ++KV R L L F G+L + +AP ++
Sbjct: 67 LFLFMAGVSLSISLKNGIAKGIP-QYKLMEKVFKRMLILFFLGLLDK----NAPIDI--- 118
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLY 181
+D IR VL RI ++ LV+++ + + K Q + F+I LY
Sbjct: 119 LDPAHIRYGTVLGRIGIATFLVAILYL---NTGWKTQLIVAFTILGLY 163
>gi|218260819|ref|ZP_03475938.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
DSM 18315]
gi|218224342|gb|EEC96992.1| hypothetical protein PRABACTJOHN_01602 [Parabacteroides johnsonii
DSM 18315]
Length = 376
Score = 54.7 bits (130), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 43/143 (30%), Positives = 69/143 (48%), Gaps = 25/143 (17%)
Query: 33 QRLASLDIFRGLAVALMILVD---HAGG-----DWPE-----ISHAPWNGCNLADFVMPF 79
+RL SLD+ RG + ++ ++ H G W SH W G + D VMP
Sbjct: 6 KRLESLDVLRGFDLFCLVALEGILHPLGRAIDASWYNDFLWGFSHVQWEGFSSWDLVMPL 65
Query: 80 FLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
F+F+ GV++ AL R +PD+ +++I R L +G++ QG G+D
Sbjct: 66 FMFMAGVSMPFALSRYKAMPDKWAVYRRIIKRVALLWIFGMMCQGNL--------LGLDP 117
Query: 137 RMIRL-CGVLQRIALSYLLVSLV 158
I L LQ IA+ YL+ +++
Sbjct: 118 GRIYLYSNTLQAIAMGYLIAAML 140
>gi|392548092|ref|ZP_10295229.1| hypothetical protein PrubA2_17028 [Pseudoalteromonas rubra ATCC
29570]
Length = 373
Score = 54.7 bits (130), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 120/330 (36%), Gaps = 98/330 (29%)
Query: 31 KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS--HAPWNGCNLADFVM 77
+RLASLD RG+ + AL +L G E H+ W+G D +
Sbjct: 4 NKKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQGWKIFEAQTLHSAWHGFTFYDLIF 63
Query: 78 PFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
P F+F+ GVA+ L KRI +R K I R L +G+L G+
Sbjct: 64 PLFIFLSGVAMGLRPKRIDHLPMAERKPIYIKAIKRLGLLCLFGVLYNHGWGTGIPA--- 120
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVL 192
D IR VL RIA+++ +++ VG +L
Sbjct: 121 --DFGEIRYASVLGRIAIAWFFCAMLVWHCSLKTTALTGVG-----------------IL 161
Query: 193 VVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL-GIN 251
+ Y LL ++P V G +L P + ++D+ +L GI
Sbjct: 162 LAYWLLL--CFIP-------------------VPGGSAGELTPAGSWNAWVDQALLPGIT 200
Query: 252 HMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHV 311
+ + P +PEG+LSS +I++ I GV G +
Sbjct: 201 YQ------------------------------NRPVDPEGILSSFPAIVNAIAGVFAGQL 230
Query: 312 IIHTKGHLARLKQWVTMG--FALLIFGLTL 339
I + +L QW G FA I L L
Sbjct: 231 IAQSD----KLGQWQVAGRLFAAGIVSLAL 256
>gi|160874301|ref|YP_001553617.1| hypothetical protein Sbal195_1181 [Shewanella baltica OS195]
gi|378707545|ref|YP_005272439.1| hypothetical protein [Shewanella baltica OS678]
gi|160859823|gb|ABX48357.1| conserved hypothetical protein [Shewanella baltica OS195]
gi|315266534|gb|ADT93387.1| hypothetical protein Sbal678_1209 [Shewanella baltica OS678]
Length = 384
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 92/372 (24%), Positives = 141/372 (37%), Gaps = 113/372 (30%)
Query: 34 RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
RL SLD RG + L+IL AG W ++ H+ W+G + D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77
Query: 81 LFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYGV 134
+F+ GVA+ L+ KR+ +R + I R LL GIL G+ + AP
Sbjct: 78 IFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGTGAP------A 131
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
D IR VL RIA ++ + L WH + ++V
Sbjct: 132 DPEKIRYASVLGRIAFAWFFAA-----------------------LLVWHTSLRTQIIVA 168
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
L +L G YG + G + L+P + Y+D +L G+
Sbjct: 169 -LGILLG-----------------YGAMQLWLPFPGGQASVLSPTESINAYVDSILLPGV 210
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++ P +PEGLLS++ +I++ + GV GH
Sbjct: 211 SYQGRTP------------------------------DPEGLLSTIPAIVNALAGVFVGH 240
Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHF---TNGEHGSGKF-------STTCVCL 358
I+ H KG A++ G L FG L N E + F S + +
Sbjct: 241 FIVKSHPKGEWAKVGLLAAAGCVCLAFGWLLDLVIPVNKELWTSSFVLVTSGWSMILLAV 300
Query: 359 FIYSKVILFQWQ 370
F Y+ V + +WQ
Sbjct: 301 F-YALVDVLKWQ 311
>gi|403717790|ref|ZP_10942873.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
100340]
gi|403208927|dbj|GAB97556.1| hypothetical protein KILIM_074_00050 [Kineosphaera limosa NBRC
100340]
Length = 461
Score = 54.3 bits (129), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 60/246 (24%), Positives = 95/246 (38%), Gaps = 51/246 (20%)
Query: 14 LIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCN 71
L ++ ++R SLD+ RGL + + + V+ W E HA W G +
Sbjct: 61 LPATQSSTKSPPPAQSFPSRRFISLDVARGLMLVVSVAVNAWITAPAWFE--HAAWAGVH 118
Query: 72 LADFVMPFFLFIVGVAIALA-LKRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
D V P F+ + G +A A +RIP ++ ++ R + L G+ +HA
Sbjct: 119 PVDLVFPTFVALSGAGLAFAYARRIP-----LRPLLSRVIVLALAGLAYN---AHAQYLS 170
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
T +D R+ GVLQ A L+++L+ + R + W W +
Sbjct: 171 TGQLDWATFRIPGVLQLYAAIVLVIALLHF----------------VLRRW-WAWPLFTI 213
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
V AL +N+ F C A L P CN G D + G+
Sbjct: 214 VAATCFAL------------ALNR--------FAAGCPGGA-LTPECNPSGLFDPALFGV 252
Query: 251 NHMYHH 256
H+YH
Sbjct: 253 EHIYHQ 258
>gi|430745463|ref|YP_007204592.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
18658]
gi|430017183|gb|AGA28897.1| hypothetical protein Sinac_4725 [Singulisphaera acidiphila DSM
18658]
Length = 391
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 79/341 (23%), Positives = 132/341 (38%), Gaps = 90/341 (26%)
Query: 14 LIISEPDVSDQQEKSHLK-TQRLASLDIFRG-----------LAVALMILVDHAGGD--W 59
L+ +E + D + K ++RL S+D RG LA AL D + G
Sbjct: 7 LLTAETPLMDSDSIAAPKPSERLLSIDALRGFDMLWIIGGERLAKALARWSDSSAGKVVQ 66
Query: 60 PEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRI--PDRADAVKKVIFRTLKLLFWGI 117
++ HA W+G L D + P FLF+VG + +L ++ R +++ RTL L G+
Sbjct: 67 EQLEHAEWHGFRLNDLIFPLFLFLVGTVLPFSLGKLQGQGRGAEYRRIARRTLLLFALGL 126
Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSI 177
L G D +R+ GVLQRIAL Y + +L+ ++ + V
Sbjct: 127 LCNG---------VLKFDWANLRVAGVLQRIALCYGIAALISLWF-----SRRGVA---- 168
Query: 178 FRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPC 237
++ +LV Y AL+ P + DY +
Sbjct: 169 --------ILLVLILVGYWALMANVGAP-------GHTAGDY--------------SISG 199
Query: 238 NAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVS 297
N G+IDR+ L M + + + EGLL+++
Sbjct: 200 NLAGWIDRQFLPGKIMKSY---------------------------YGYGDNEGLLTTIP 232
Query: 298 SILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLT 338
++ + ++GV GH + +G ++ V G LI G+
Sbjct: 233 AVGTALLGVLAGHWLRSQRGPWQKVAGLVAAGVLSLIVGVA 273
>gi|224027055|ref|ZP_03645421.1| hypothetical protein BACCOPRO_03816, partial [Bacteroides
coprophilus DSM 18228]
gi|224020291|gb|EEF78289.1| hypothetical protein BACCOPRO_03816 [Bacteroides coprophilus DSM
18228]
Length = 373
Score = 54.3 bits (129), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 48/186 (25%), Positives = 82/186 (44%), Gaps = 38/186 (20%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP----------------EISHAPWN 68
+ K +K+QRL SLD+ RG M + G + ++ H PW+
Sbjct: 5 RMKKEVKSQRLQSLDVLRGFD---MFFIMGGGALFAGLATCCPIPFFQAIARQMEHVPWH 61
Query: 69 GCNLADFVMPFFLFIVGVAIALALKRIP----DRADAVKKVIFRTLKLLFWGILLQGGFS 124
G D + P FLFI G++ +L++ A +KVI R L L+ G + G
Sbjct: 62 GVAFEDMIFPLFLFIAGISFPYSLEKQKACGMSSAAIYRKVIRRGLVLVLLGCIYNGLLD 121
Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
D +R VL RI LS++ +L+ + +V+ + + +G ++ L+ +
Sbjct: 122 F---------DFAHLRYASVLGRIGLSWMFAALLFL---NVRTRVR-MGVVAL--LFIGY 166
Query: 185 WLMAAC 190
W + AC
Sbjct: 167 WALLAC 172
>gi|317503636|ref|ZP_07961655.1| conserved hypothetical protein, partial [Prevotella salivae DSM
15606]
gi|315665261|gb|EFV04909.1| conserved hypothetical protein [Prevotella salivae DSM 15606]
Length = 59
Score = 53.9 bits (128), Expect = 1e-04, Method: Composition-based stats.
Identities = 25/55 (45%), Positives = 35/55 (63%), Gaps = 2/55 (3%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFI 83
KT R+ ++DI RG+ +A MILV++ GG + + HA W G D V PFF+FI
Sbjct: 5 KTSRIEAVDILRGITIAGMILVNNPGGQPVYTPLEHAEWLGLTPTDLVFPFFMFI 59
>gi|393787642|ref|ZP_10375774.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
CL02T12C05]
gi|392658877|gb|EIY52507.1| hypothetical protein HMPREF1068_02054 [Bacteroides nordii
CL02T12C05]
Length = 373
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 38/147 (25%), Positives = 64/147 (43%), Gaps = 26/147 (17%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEI-------------SHAPWNGCNLADFVMP 78
+RLASLD+ RG + ++++ W EI +H W G D +MP
Sbjct: 5 NKRLASLDLLRGFDLFCLLMLQPILMTWLEIENNPSLDPITNQFTHVEWQGVAFWDLIMP 64
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGV 134
F+F+ G+ I A+ + + FR K L F G ++QG +
Sbjct: 65 LFMFMSGITIPFAMSKYKQGEKIDRHFYFRLFKRFFVLFFLGWVVQGNL--------LAL 116
Query: 135 DVRMIRL-CGVLQRIALSYLLVSLVEI 160
D+R + LQ IA+ Y++ +L+ +
Sbjct: 117 DIRQFHIFANTLQAIAVGYVVAALLYV 143
>gi|410099160|ref|ZP_11294132.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
CL02T12C30]
gi|409219182|gb|EKN12145.1| hypothetical protein HMPREF1076_03310 [Parabacteroides goldsteinii
CL02T12C30]
Length = 371
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 81/337 (24%), Positives = 132/337 (39%), Gaps = 104/337 (30%)
Query: 23 DQQEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGD--WPEISHAPWN- 68
++Q++S QRL SLD RG L VAL LV + ++SHA W
Sbjct: 2 EKQKQS----QRLLSLDALRGFDMFFIMGGGSLFVALATLVPTPFFESIAAQMSHAKWGA 57
Query: 69 GCNLADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFS 124
G D + P FLFI G++ +L++ +R A KK+I R + L+ G + G
Sbjct: 58 GFTFEDIIFPLFLFIAGISFPFSLEKQRERGMSEAAIYKKIIRRGITLVVLGFVYNG--- 114
Query: 125 HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWH 184
++ R VL RI L ++ +L+ + T+ +
Sbjct: 115 ------LLNLNFETQRYASVLARIGLGWMFGALIFVNTRTITRV---------------- 152
Query: 185 WLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYID 244
W++AA +L+ Y LL+ PD + + + N+ C YID
Sbjct: 153 WIVAA-ILIGYWLLLF-IPAPD------GNGAELFTREGNLAC--------------YID 190
Query: 245 RKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCH-APFEPEGLLSSVSSILSTI 303
R +L P H ++PEG+LS++ +I + +
Sbjct: 191 RLLL-------------------------------PGRLHGGNYDPEGILSTLPAIGTAL 219
Query: 304 IGVHFGHVIIHTKGHLARLKQWVTM---GFALLIFGL 337
+G+ G + + L K+ V M G LL+ GL
Sbjct: 220 LGMFTGEFVKLRREGLTETKKVVYMLAVGGCLLVIGL 256
>gi|386312853|ref|YP_006009018.1| N-acetylglucosamine related transporter, NagX [Shewanella
putrefaciens 200]
gi|319425478|gb|ADV53552.1| N-acetylglucosamine related transporter, NagX [Shewanella
putrefaciens 200]
Length = 384
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 91/372 (24%), Positives = 141/372 (37%), Gaps = 113/372 (30%)
Query: 34 RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
RL SLD RG + L+IL AG W ++ H+ W+G + D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77
Query: 81 LFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYGV 134
+F+ GVA+ L+ KR+ DR + I R LL GIL G+ + AP
Sbjct: 78 IFLSGVALGLSPKRLDKLPMKDRLPVYRHGIKRLFLLLLLGILYNHGWGTGAP------A 131
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
D +R VL RIA ++ +L+ WH + V+V
Sbjct: 132 DPEKVRYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQVIVA 168
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
L +L G YG + G L+P + Y+D +L G+
Sbjct: 169 -LGILLG-----------------YGAMQLWLPFPSGQAGVLSPTQSINAYVDSILLPGV 210
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++ P +PEGLLS++ ++++ + GV G+
Sbjct: 211 SYQGRTP------------------------------DPEGLLSTIPAVVNALAGVFVGY 240
Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHF---TNGEHGSGKF-------STTCVCL 358
I+ H +G ++ T G A L G L N E + F S + L
Sbjct: 241 FIVKSHPQGEWVKVGLLATAGGAWLALGWLLDGVIPVNKELWTSSFVLVTSGWSMILLAL 300
Query: 359 FIYSKVILFQWQ 370
F Y+ V + +WQ
Sbjct: 301 F-YALVDVLKWQ 311
>gi|456985620|gb|EMG21387.1| hypothetical protein LEP1GSC150_0590 [Leptospira interrogans
serovar Copenhageni str. LT2050]
Length = 77
Score = 53.9 bits (128), Expect = 1e-04, Method: Composition-based stats.
Identities = 26/57 (45%), Positives = 35/57 (61%), Gaps = 3/57 (5%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
++ L R+ SLD+FRG+ VA MILV++ G + + HA WNGC D V PFF
Sbjct: 2 ENKLNQNRILSLDLFRGMTVAGMILVNNPGSWSFIYSPLKHARWNGCTPTDLVFPFF 58
>gi|456861512|gb|EMF80162.1| hypothetical protein LEP1GSC188_2620 [Leptospira weilii serovar
Topaz str. LT2116]
Length = 88
Score = 53.9 bits (128), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 28/60 (46%), Positives = 35/60 (58%), Gaps = 5/60 (8%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEI----SHAPWNGCNLADFVMPFF 80
++KS R+ SLD+FRG+ V MILV++ G W I HA WNGC D V PFF
Sbjct: 2 EKKSTQNKDRILSLDLFRGMTVIGMILVNNPGS-WSYIYSPLKHAKWNGCTPTDLVFPFF 60
>gi|91794054|ref|YP_563705.1| hypothetical protein Sden_2703 [Shewanella denitrificans OS217]
gi|91716056|gb|ABE55982.1| conserved hypothetical protein [Shewanella denitrificans OS217]
Length = 400
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 119/314 (37%), Gaps = 84/314 (26%)
Query: 25 QEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNGCN 71
Q + L RL SLD RG L AL L AG + ++ H+ W+G
Sbjct: 25 QTSTSLNKPRLKSLDALRGFDMFWIIGGEGLFAALFTLTGWAGWNIASRQMQHSQWHGFT 84
Query: 72 LADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAPDEL 130
L D + P F+F+ GVA+ L+ KR+ +A AV +++ K L I L ++H
Sbjct: 85 LYDLIFPLFIFLSGVALGLSPKRLDQQAFAVALPLYQHACKRLILLIALGILYNHGWGT- 143
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
D+ IR VL RI ++ +++ T+ Q + SI LY
Sbjct: 144 GIPADLDKIRYSSVLARIGFAWFFAAMLVWHTR---LSIQVIVSVSIIGLYT-------- 192
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
LA LY V G + + Y+D +
Sbjct: 193 -----LAQLY----------------------LPVPGGQAGQFTLDASINTYVDGLL--- 222
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
R QD P +PEG+LS+V ++++ ++GV G
Sbjct: 223 ----------RPGIAYQDRP----------------LDPEGILSTVPAVINAMVGVFAGQ 256
Query: 311 VII--HTKGHLARL 322
II H++G A++
Sbjct: 257 FIIRAHSRGDWAKV 270
>gi|94985055|ref|YP_604419.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
gi|94555336|gb|ABF45250.1| hypothetical protein Dgeo_0949 [Deinococcus geothermalis DSM 11300]
Length = 573
Score = 53.5 bits (127), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 49/79 (62%), Gaps = 5/79 (6%)
Query: 15 IISEP-DVSDQQEKSHLKTQ-RLASLDIFRGLAVALMILVDH-AGGDW--PEISHAPWNG 69
++S+P + ++T+ RL +LD +RGL V LM+LV++ A GD P++ HAP+ G
Sbjct: 197 VLSDPAPTTSAGGAGPVQTRVRLTALDAWRGLTVLLMLLVNNVALGDLTPPQLQHAPFGG 256
Query: 70 CNLADFVMPFFLFIVGVAI 88
L D V P+FLF G A+
Sbjct: 257 LTLTDLVFPWFLFCAGAAL 275
>gi|171914858|ref|ZP_02930328.1| hypothetical protein VspiD_26815 [Verrucomicrobium spinosum DSM
4136]
Length = 379
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 33/100 (33%), Positives = 53/100 (53%), Gaps = 14/100 (14%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGI 117
++ H W G D + P FLF+VGV+I L++ R+ R+ A+ +++ R+ L G+
Sbjct: 29 QLQHVEWEGFRFYDAIFPLFLFLVGVSIVLSVDRMVARVGRSRALARIVRRSALLFAVGV 88
Query: 118 LLQGGFSHA-PDELTYGVDVRMIRLCGVLQRIALSYLLVS 156
GG + PD ++L GVL RIAL YL+ +
Sbjct: 89 FYYGGIARPWPD----------VQLSGVLPRIALCYLVAA 118
>gi|373954275|ref|ZP_09614235.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
gi|373890875|gb|EHQ26772.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
Length = 397
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 46/132 (34%), Positives = 67/132 (50%), Gaps = 17/132 (12%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
R+ S+DIFR + + LMI V+ G +W + + +G LAD V P FLFIVG++
Sbjct: 7 NRVHSIDIFRAVTMFLMIFVNDIDGVPGVPEWIKHAGERTDGLGLADIVFPAFLFIVGLS 66
Query: 88 IALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCG 143
I A++ R D+ K ++ R L L+F GF HA E TY D +I
Sbjct: 67 IPHAIQSRISRGDSKTKIAAYIVMRALALIF------IGFIHANME-TYS-DTAVIAQPW 118
Query: 144 VLQRIALSYLLV 155
+I LS+ L+
Sbjct: 119 WEIQITLSFFLI 130
>gi|423219681|ref|ZP_17206177.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
CL03T12C61]
gi|392624886|gb|EIY18964.1| hypothetical protein HMPREF1061_02950 [Bacteroides caccae
CL03T12C61]
Length = 375
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 25/144 (17%)
Query: 32 TQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGCNLADFVMP 78
+ RLASLDI RG + L++ V A G ++P + H W G D VMP
Sbjct: 9 SGRLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWDLVMP 68
Query: 79 FFLFIVGVAIALA---LKRIPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGV 134
FLF+ G ++ + K P++ +K+I R + L +G+++QG PD L
Sbjct: 69 LFLFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQGNLLGLNPDSLY--- 125
Query: 135 DVRMIRLCGVLQRIALSYLLVSLV 158
LQ IA YL+ +++
Sbjct: 126 -----LYSNTLQAIATGYLIAAII 144
>gi|153805867|ref|ZP_01958535.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
gi|149130544|gb|EDM21750.1| hypothetical protein BACCAC_00106 [Bacteroides caccae ATCC 43185]
Length = 377
Score = 53.1 bits (126), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 67/144 (46%), Gaps = 25/144 (17%)
Query: 32 TQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGCNLADFVMP 78
+ RLASLDI RG + L++ V A G ++P + H W G D VMP
Sbjct: 11 SGRLASLDILRGFDLFLLVFFQPVFVALGQRLNFPWLNDILYQFDHESWIGFRFWDLVMP 70
Query: 79 FFLFIVGVAIALA---LKRIPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAPDELTYGV 134
FLF+ G ++ + K P++ +K+I R + L +G+++QG PD L
Sbjct: 71 LFLFMTGASMPFSFSKFKNAPNKWHIYRKIIKRFVLLFIFGMIVQGNLLGLNPDSLY--- 127
Query: 135 DVRMIRLCGVLQRIALSYLLVSLV 158
LQ IA YL+ +++
Sbjct: 128 -----LYSNTLQAIATGYLIAAII 146
>gi|374372786|ref|ZP_09630447.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
gi|373234862|gb|EHP54654.1| hypothetical protein NiasoDRAFT_3432 [Niabella soli DSM 19437]
Length = 357
Score = 52.8 bits (125), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 35/102 (34%), Positives = 53/102 (51%), Gaps = 18/102 (17%)
Query: 35 LASLDIFRGLAVALMIL--------VDHA-GGDWPE-----ISHAPWNGCNLADFVMPFF 80
+ SLD RGL + L+ L + HA G W E H PW+G + D + P F
Sbjct: 1 MLSLDFMRGLIMVLLALESTGLYEHLSHASAGTWFEGIMQQFFHHPWHGLHFWDLIQPGF 60
Query: 81 LFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGIL 118
+F+ GVA+A +L++ R ++KK + R+ L FWG+L
Sbjct: 61 MFMAGVAMAYSLQKQKQRDYTWNRSLKKTLRRSGWLFFWGVL 102
>gi|29348589|ref|NP_812092.1| hypothetical protein BT_3180 [Bacteroides thetaiotaomicron
VPI-5482]
gi|29340494|gb|AAO78286.1| putative transmembrane protein [Bacteroides thetaiotaomicron
VPI-5482]
Length = 376
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 40/155 (25%), Positives = 69/155 (44%), Gaps = 26/155 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGC 70
+ S T RLASLDI RG + L++ + P + H W G
Sbjct: 1 MNKLSEKNTTRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHA 126
D VMP FLF+ G ++ +L + + + ++++ R L +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSLSKYVGMSGSYWLVYRRILRRVFLLFIFGMIVQGNL--- 117
Query: 127 PDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D I L LQ IA+ YL+ +++++
Sbjct: 118 -----LGLDSSHIYLYSNTLQSIAVGYLIAAVIQL 147
>gi|404403948|ref|ZP_10995532.1| hypothetical protein AJC13_00860 [Alistipes sp. JC136]
Length = 369
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 43/149 (28%), Positives = 68/149 (45%), Gaps = 30/149 (20%)
Query: 32 TQRLASLDIFRGLAVALMI----LVDHAGGDWPE---------ISHAPWNGCNLADFVMP 78
QRL SLD RG + ++ LV WP + HA W+G D + P
Sbjct: 4 NQRLLSLDALRGFDMLFIMGFSGLVASLCALWPNPFTDAVAGSMGHAAWDGLTHHDTIFP 63
Query: 79 FFLFIVGVAIALALKRIPDRADA------VKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
FLFI GV+ +L + RA+ + KVI R + L+ G++ G F
Sbjct: 64 LFLFIAGVSFPFSLAK--QRANGLGERAILGKVIRRGVTLVVLGLVYNGLFK-------- 113
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
+D +R+ VL RI L+++ +++ I+
Sbjct: 114 -LDFASLRVASVLGRIGLAWMFAAILYIY 141
>gi|24375008|ref|NP_719051.1| N-acetylglucosamine locus membrane protein of unknown function
DUF1624 NagX [Shewanella oneidensis MR-1]
gi|24349746|gb|AAN56495.1| N-acetylglucosamine locus membrane protein of unknown function
DUF1624 NagX [Shewanella oneidensis MR-1]
Length = 395
Score = 52.4 bits (124), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 94/408 (23%), Positives = 147/408 (36%), Gaps = 124/408 (30%)
Query: 1 MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAV-----------ALM 49
MS E + + ++ Q K RL SLD RG + AL+
Sbjct: 1 MSTTAPELAANVSINAQVATANNSQPKP-----RLMSLDALRGFDMFWILGGEALFGALL 55
Query: 50 ILVDHAGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIALALK---------RIPDR 98
I AG W ++ H+ W+G L D + P F+F+ GVA+ L+ K R+P
Sbjct: 56 IFTGWAGWQWGDTQMHHSEWHGFRLYDLIFPLFIFLSGVALGLSPKRLDKLPLHERLPVY 115
Query: 99 ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
VK++ L + + G + AP VD IR VL RIA ++ +L+
Sbjct: 116 RHGVKRLFLLLLLGILYN---HGWGTGAP------VDPDKIRYASVLGRIAFAWFFAALL 166
Query: 159 EIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSAD 218
WH + VLV + +L G
Sbjct: 167 -----------------------VWHTSLRTQVLVA-VGILVG----------------- 185
Query: 219 YGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GINHMYHHPAWRRSKACTQDSPFEGP 274
YG + G L+P + Y+D +L G+++ P
Sbjct: 186 YGAMQLWLPFPGGQAGVLSPTVSINAYVDSLLLPGVSYQGRMP----------------- 228
Query: 275 LRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFAL 332
+PEG+LS++ ++++ + GV GH I+ H KG A++ G
Sbjct: 229 -------------DPEGVLSTLPAVVNALAGVFVGHFIVKSHPKGEWAKVGLLGAAGGVC 275
Query: 333 LIFGLTLHF---TNGEHGSGKF-------STTCVCLFIYSKVILFQWQ 370
L G L N E + F S + LF Y+ V + +WQ
Sbjct: 276 LALGWLLDAVIPVNKELWTSSFVLVTSGWSMLLLALF-YALVDVLKWQ 322
>gi|442611023|ref|ZP_21025729.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
luteoviolacea B = ATCC 29581]
gi|441746951|emb|CCQ11791.1| N-acetylglucosamine related transporter, NagX [Pseudoalteromonas
luteoviolacea B = ATCC 29581]
Length = 373
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 45/144 (31%), Positives = 65/144 (45%), Gaps = 23/144 (15%)
Query: 33 QRLASLDIFRGLAV-----------ALMILVDHAGGDWPE--ISHAPWNGCNLADFVMPF 79
+RLASLD RG + AL +L G E H+ W+G D + P
Sbjct: 6 KRLASLDALRGFDMMWILGGQGIFAALFVLTGWTGWRTFEAHTVHSDWHGFTFYDLIFPL 65
Query: 80 FLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
F+F+ GVA+ L+ KRI +R +K + R + L GIL G+
Sbjct: 66 FIFLSGVAMGLSPKRIDHLPMSERTPIYRKALKRFVLLCLLGILYNHGWGTGIPA----- 120
Query: 135 DVRMIRLCGVLQRIALSYLLVSLV 158
D IR VL RIA ++L+ +L+
Sbjct: 121 DFSEIRYSSVLGRIAFAWLICALL 144
>gi|408370371|ref|ZP_11168148.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
gi|407744129|gb|EKF55699.1| hypothetical protein I215_05677 [Galbibacter sp. ck-I2-15]
Length = 394
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 38/110 (34%), Positives = 58/110 (52%), Gaps = 9/110 (8%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDH----AGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
+ + R+ S+DI RG+ + LM+ V+ W + +G LAD+V P FLF+VG
Sbjct: 1 MNSNRIMSIDIMRGITLFLMLFVNDLFIPGVPKWLVHTQEWEDGMGLADWVFPGFLFMVG 60
Query: 86 VAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELT 131
++I A+K ++ + + VI RTL LL GIL+ S ELT
Sbjct: 61 LSIPYAMKARKNKGQSNLRLWSHVIMRTLSLLLIGILMV-NISRVNPELT 109
>gi|224025513|ref|ZP_03643879.1| hypothetical protein BACCOPRO_02253, partial [Bacteroides
coprophilus DSM 18228]
gi|224018749|gb|EEF76747.1| hypothetical protein BACCOPRO_02253 [Bacteroides coprophilus DSM
18228]
Length = 377
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 37/117 (31%), Positives = 57/117 (48%), Gaps = 20/117 (17%)
Query: 25 QEKSHLKTQRLASLDIFRGLAV--------ALMILVDHAG-GDWPE------ISHAPWNG 69
+ ++ +RL SLDI RGL + M L G +W + +H W G
Sbjct: 1 KNMRKIQKERLESLDILRGLDLFILVGFQSVFMYLAQATGENNWIKTIFDVLFTHVEWEG 60
Query: 70 CNLADFVMPFFLFIVGVAIALALKRIPDRADAVK-KVIFRTLK--LLFW--GILLQG 121
+L D VMP FLF+ G +I A+ R + + + K+ +R LK +L W G ++QG
Sbjct: 61 FHLWDQVMPLFLFMAGTSIPYAMARYKRKEEEISGKLFYRVLKRVVLLWIFGAIVQG 117
>gi|319902718|ref|YP_004162446.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
gi|319417749|gb|ADV44860.1| hypothetical protein Bache_2925 [Bacteroides helcogenes P 36-108]
Length = 380
Score = 52.4 bits (124), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 41/152 (26%), Positives = 70/152 (46%), Gaps = 25/152 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMIL---VDHAGG---DWP-------EISHAPWNGC 70
QQ+ + + RLASLD+ RG + L++ V + G + P + H W G
Sbjct: 6 QQDSLKISSSRLASLDVLRGFDLFLLVFFQPVLMSLGQQLNLPFMDVVLYQFDHEVWEGF 65
Query: 71 NLADFVMPFFLFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGG-FSHA 126
D +MP FLF+ GV++ + + PD+ +K+ R L L G+++QG
Sbjct: 66 RFWDLIMPLFLFMTGVSMPFSFAKYQSSPDKCIIYRKIFRRVLLLFLLGMVVQGNLLGLN 125
Query: 127 PDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
P + + + LQ IA+ YL+ ++
Sbjct: 126 PKHIYFYTN--------TLQAIAVGYLIAGMI 149
>gi|255037955|ref|YP_003088576.1| hypothetical protein Dfer_4208 [Dyadobacter fermentans DSM 18053]
gi|254950711|gb|ACT95411.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 371
Score = 52.0 bits (123), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 64/146 (43%), Gaps = 20/146 (13%)
Query: 27 KSHLKTQRLASLDIFRGLAVALM--------ILVDHAGGDWP-----EISHAPWNGCNLA 73
K +QRL SLD RG + + +L G W + +H WNG
Sbjct: 2 KDSAPSQRLLSLDTLRGFDMFWISGGEEIFHVLAKVTGWSWAIVLAHQFTHPDWNGFRAY 61
Query: 74 DFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLLFWGILLQGG-FSHAP 127
D + P FLF+ GV+ +L K +P + V+KVI R + L+F GI+ G F
Sbjct: 62 DLIFPTFLFMAGVSTPFSLGSRLEKGVPP-SQLVRKVIQRGIILVFLGIIYNNGIFETEW 120
Query: 128 DELTYGVDVRMIRLCGVLQRIALSYL 153
++ Y + I L G+ +I Y
Sbjct: 121 SQMRYPSVLARIGLAGMFAQIIYLYF 146
>gi|317505448|ref|ZP_07963366.1| ABC superfamily ATP binding cassette transporter permease subunit
[Prevotella salivae DSM 15606]
gi|315663361|gb|EFV03110.1| ABC superfamily ATP binding cassette transporter permease subunit
[Prevotella salivae DSM 15606]
Length = 380
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 43/152 (28%), Positives = 70/152 (46%), Gaps = 29/152 (19%)
Query: 31 KTQRLASLDIFRGLAVALMILVDH-----------AGGD-----WPEISHAPWNGCNLAD 74
K RL SLDI RG +A+++LV A G ++ H PW G D
Sbjct: 9 KPNRLLSLDILRGADLAMLVLVQPILLKALETMQPAEGTVGHFIMGQLLHLPWEGFCFWD 68
Query: 75 FVMPFFLFIVGVAIALALKRIP--DRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
+MP F+F+ G+ I A+ R +R D ++++ R + L G++ QG
Sbjct: 69 IIMPLFMFMSGITIPFAMARYKRGERIDGSFYRRILKRFVVLWILGMVCQGNL------- 121
Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEIF 161
D++ + L LQ IA+ Y+ V+ + +F
Sbjct: 122 -LAFDLQQLHLYSNTLQSIAVGYVAVAFLYVF 152
>gi|126173329|ref|YP_001049478.1| hypothetical protein Sbal_1087 [Shewanella baltica OS155]
gi|386340088|ref|YP_006036454.1| hypothetical protein [Shewanella baltica OS117]
gi|125996534|gb|ABN60609.1| conserved hypothetical protein [Shewanella baltica OS155]
gi|334862489|gb|AEH12960.1| Protein of unknown function DUF2261, transmembrane [Shewanella
baltica OS117]
Length = 387
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 94/376 (25%), Positives = 141/376 (37%), Gaps = 118/376 (31%)
Query: 34 RLASLDIFRGLAV-----------ALMILVDHAG------GDWPEISHAPWNGCNLADFV 76
RL SLD RG + L+IL AG GD ++ H+ W+G + D +
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWAGWQWGD-EQMHHSQWHGFHFYDLI 76
Query: 77 MPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDEL 130
P F+F+ GVA+ L+ KR+ +R + I R LL GIL G+ + AP
Sbjct: 77 FPLFIFLSGVALGLSPKRLDKLPMKERLPVYRHGIKRLFLLLLLGILYNHGWGTGAP--- 133
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
D IR VL RIA ++ + L WH +
Sbjct: 134 ---ADPEKIRYASVLGRIAFAWFFAA-----------------------LLVWHTSLRTQ 167
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKV 247
++V L +L G YG + G L+P + Y+D +
Sbjct: 168 IIVA-LGILLG-----------------YGAMQLWLPFPGGQAGVLSPTESINAYVDSIL 209
Query: 248 L-GINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGV 306
L G+++ P +PEGLLS++ +I++ + GV
Sbjct: 210 LPGVSYQGRTP------------------------------DPEGLLSTIPAIVNALAGV 239
Query: 307 HFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHF---TNGEHGSGKF-------STT 354
GH I+ H KG A++ G L FG L N E + F S
Sbjct: 240 FVGHFIVKSHPKGEWAKVGLLAAAGCVCLTFGWLLDLVIPVNKELWTSSFVLVTSGWSMI 299
Query: 355 CVCLFIYSKVILFQWQ 370
+ LF Y+ V + +WQ
Sbjct: 300 LLALF-YALVDVLKWQ 314
>gi|340617673|ref|YP_004736126.1| hypothetical protein zobellia_1684 [Zobellia galactanivorans]
gi|339732470|emb|CAZ95738.1| Conserved hypothetical membrane protein [Zobellia galactanivorans]
Length = 346
Score = 51.6 bits (122), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 3/69 (4%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALAL-KRIP--DRADAVKKVIFRTLKLLFWGI 117
++ H PWNG D + PFF+FIVGVA+ +L KR+ D+ K ++ R L +G
Sbjct: 30 QLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGDKKGVTKHILRRCFLLFAFGA 89
Query: 118 LLQGGFSHA 126
LL +SHA
Sbjct: 90 LLHCVYSHA 98
>gi|255035026|ref|YP_003085647.1| hypothetical protein Dfer_1233 [Dyadobacter fermentans DSM 18053]
gi|254947782|gb|ACT92482.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 401
Score = 51.6 bits (122), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 35/104 (33%), Positives = 51/104 (49%), Gaps = 9/104 (8%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPF 79
K + RL S+D+FR + + LMI V+ A W E S A + L+D V P
Sbjct: 1 MNKVASSSLRLDSIDVFRAVTMLLMIFVNDFWTLEAVPKWLEHSKAEEDAMGLSDVVFPA 60
Query: 80 FLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILL 119
FLFIVG++I A+ + D ++ + RT LL GI +
Sbjct: 61 FLFIVGLSIPFAISNRRKKGDGNALIIRHIAERTFALLLMGIFI 104
>gi|305665830|ref|YP_003862117.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
gi|88710601|gb|EAR02833.1| hypothetical protein FB2170_06080 [Maribacter sp. HTCC2170]
Length = 346
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 40/69 (57%), Gaps = 3/69 (4%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALAL-KRIP--DRADAVKKVIFRTLKLLFWGI 117
++ H PWNG D + PFF+FIVGVA+ +L KR+ R A + ++ R L +G
Sbjct: 30 QLHHHPWNGLRFWDLIQPFFMFIVGVAMPFSLRKRLASGSRKSATRHILKRCFLLFAFGA 89
Query: 118 LLQGGFSHA 126
LL +SHA
Sbjct: 90 LLHCVYSHA 98
>gi|300123408|emb|CBK24681.2| unnamed protein product [Blastocystis hominis]
Length = 213
Score = 51.2 bits (121), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 7/79 (8%)
Query: 7 ETTHHHPLIISEPDVSDQQEKSHLKTQ----RLASLDIFRGLAVALMILVDHAGGDWP-E 61
E + ++EP + Q+EK + +Q R+ S+D+FRG+ + +MI ++ G +
Sbjct: 135 EASKTEASSVNEPLI--QKEKQSVVSQPMKSRVQSIDVFRGITICIMIFANYGAGQYSHS 192
Query: 62 ISHAPWNGCNLADFVMPFF 80
+ HA W+G ADF P +
Sbjct: 193 LMHAAWDGITFADFAFPLY 211
>gi|334364999|ref|ZP_08513969.1| putative membrane protein [Alistipes sp. HGB5]
gi|313158791|gb|EFR58176.1| putative membrane protein [Alistipes sp. HGB5]
Length = 383
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 71/313 (22%), Positives = 116/313 (37%), Gaps = 93/313 (29%)
Query: 31 KTQRLASLDIFRGLAVALMI----LVDHAGGDWPE---------ISHAPWNGCNLADFVM 77
+++RL SLD RG + ++ LV G WP +SH W+G D +
Sbjct: 20 QSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDAAAASMSHVAWDGFAHHDTIF 79
Query: 78 PFFLFIVGVAI--ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
P FLFI GV+ ++A +R ++ K++ R L L+ G++ G F
Sbjct: 80 PLFLFIAGVSFPYSVAKQRAGGMSEGRIYAKIVRRGLTLVVLGMVYNGLFK--------- 130
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
+D +R+ VL RI L++ S+ + + K ++ +A VL
Sbjct: 131 LDFENLRIASVLGRIGLAW---SIAAVLYLNFGVKTRAA--------------IAVAVLA 173
Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
Y AL V L N GYIDR+ L
Sbjct: 174 GYGAL--------------------SALVAAPDAAGAGPLTFEGNLAGYIDRQFL----- 208
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
G L + F+PEGLLS+V ++++ ++G+ G +
Sbjct: 209 ------------------PGKL-------IYGSFDPEGLLSTVPAVVTAMLGMFTGEFVR 243
Query: 314 HTKGHLARLKQWV 326
+ R W+
Sbjct: 244 RSDIRGGRKTLWM 256
>gi|15806610|ref|NP_295325.1| hypothetical protein DR_1602 [Deinococcus radiodurans R1]
gi|6459373|gb|AAF11168.1|AE002004_7 hypothetical protein DR_1602 [Deinococcus radiodurans R1]
Length = 388
Score = 51.2 bits (121), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 45/130 (34%), Positives = 66/130 (50%), Gaps = 14/130 (10%)
Query: 34 RLASLDIFRGLAVALMILVDHA--GGDWP-EISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
RL +LD +RGL V LM+LV++ G P ++SHA + G L D V P+FLF G A+
Sbjct: 33 RLTALDAWRGLTVLLMLLVNNVALGDSTPRQLSHAHFGGLTLTDLVFPWFLFCAGAALPF 92
Query: 91 ALKRIPDRADAVKKVIFRTLKLLFWGILLQGGF--SHAPDELTYGVDVRMIRLCGVLQRI 148
+ + ++A ++R L + L G F S LT G+ GVLQ I
Sbjct: 93 SAAAM-NKAGVTGWPLYRRLLERAALLYLMGAFVTSVTSHRLTLGL--------GVLQLI 143
Query: 149 ALSYLLVSLV 158
AL+ +L+
Sbjct: 144 ALASFFAALL 153
>gi|298386962|ref|ZP_06996516.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298260112|gb|EFI02982.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 376
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/151 (25%), Positives = 67/151 (44%), Gaps = 26/151 (17%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGCNLAD 74
S T RLASLDI RG + L++ + P + H W G D
Sbjct: 5 SENNTSRLASLDILRGFDLFLLVFFQPVFAALARQLNLPFLNDILYQFDHEVWEGFRFWD 64
Query: 75 FVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
VMP FLF+ G ++ +L + + + ++++ R L +G+++QG
Sbjct: 65 LVMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNL------- 117
Query: 131 TYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D I L LQ IA+ Y + +++++
Sbjct: 118 -LGLDSSHIYLYSNTLQSIAVGYFIAAVIQL 147
>gi|383124758|ref|ZP_09945419.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
gi|251841090|gb|EES69171.1| hypothetical protein BSIG_1496 [Bacteroides sp. 1_1_6]
Length = 376
Score = 51.2 bits (121), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 39/155 (25%), Positives = 68/155 (43%), Gaps = 26/155 (16%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG------DWP-------EISHAPWNGC 70
+ S T RLASLDI RG + L++ + P + H W G
Sbjct: 1 MSKLSENNTSRLASLDILRGFDLFLLVFFQPVFAALVRQLNLPFLNDILYQFDHEVWEGF 60
Query: 71 NLADFVMPFFLFIVGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHA 126
D VMP FLF+ G ++ +L + + + ++++ R L +G+++QG
Sbjct: 61 RFWDLVMPLFLFMTGASMPFSLSKYVGMSGSYWPVYRRILRRVFLLFIFGMIVQGNL--- 117
Query: 127 PDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEI 160
G+D I L LQ IA+ Y + +++++
Sbjct: 118 -----LGLDSSHIYLYSNTLQSIAVGYFIAAVIQL 147
>gi|373953356|ref|ZP_09613316.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
gi|373889956|gb|EHQ25853.1| Protein of unknown function DUF2261, transmembrane
[Mucilaginibacter paludis DSM 18603]
Length = 404
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 60/270 (22%), Positives = 104/270 (38%), Gaps = 82/270 (30%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF-----RTLKLLFW 115
++ H+PWNG D + P F+FI G+++ + R ++ + K I+ RT+ L+
Sbjct: 79 QLHHSPWNGFTFYDLIFPLFIFIAGISMPFSYNRQVAQSPSSNKQIYVRLIKRTVLLILL 138
Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
G ++ G A + T R VL RIAL+ +++ + +
Sbjct: 139 GTVVNGALHFAGYQQT--------RFASVLGRIALACFFAAVIYLNSS------------ 178
Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
W + A +L+ Y L+ VP +G GV L P
Sbjct: 179 -----LRWQIIWFAVILLGYWLLMALVPVP------------GHG------AGV---LTP 212
Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
N +ID+ L G L + ++PEGLLS+
Sbjct: 213 GANLSAWIDQHFL-----------------------PGKLHRKV-------YDPEGLLST 242
Query: 296 VSSILSTIIGVHFGHVIIHTKGH-LARLKQ 324
+ +I + ++G+ GH + G L+ LK+
Sbjct: 243 IPAIATAMMGIFTGHFLQWEPGERLSPLKK 272
>gi|146292182|ref|YP_001182606.1| hypothetical protein Sputcn32_1079 [Shewanella putrefaciens CN-32]
gi|145563872|gb|ABP74807.1| conserved hypothetical protein [Shewanella putrefaciens CN-32]
Length = 384
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 87/372 (23%), Positives = 142/372 (38%), Gaps = 113/372 (30%)
Query: 34 RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFF 80
RL SLD RG + L+IL AG W ++ H+ W+G + D + P F
Sbjct: 18 RLMSLDALRGFDMFWILGGEALFGGLLILTGWAGWQWGDEQMHHSQWHGFHFYDLIFPLF 77
Query: 81 LFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSH-----APDELTYGV 134
+F+ GVA+ L+ KR+ + + ++R +K LF +LL ++H AP
Sbjct: 78 IFLSGVALGLSPKRLDKLPMSERLPVYRHGIKRLFLLLLLGILYNHGWGTGAP------A 131
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
D IR VL RIA ++ +L+ WH + V+V
Sbjct: 132 DPEKIRYASVLGRIAFAWFFAALL-----------------------VWHTSLRTQVIVA 168
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKV---FNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
L +L G YG + G +P + Y+D +L G+
Sbjct: 169 -LGILLG-----------------YGAMQLWLPFPSGQAGVFSPTQSINAYVDSILLPGV 210
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
++ P +P+GLLS++ ++++ + GV G+
Sbjct: 211 SYQGRTP------------------------------DPQGLLSTIPAVVNALAGVFVGY 240
Query: 311 VII--HTKGHLARLKQWVTMGFALLIFGLTLHF---TNGEHGSGKF-------STTCVCL 358
I+ H +G ++ T G A L G L N E + F S + L
Sbjct: 241 FIVKSHPQGEWVKVGLLATAGGAWLALGWLLDGVIPVNKELWTSSFVLVTSGWSMILLAL 300
Query: 359 FIYSKVILFQWQ 370
F Y+ V + +WQ
Sbjct: 301 F-YALVDVLKWQ 311
>gi|325106033|ref|YP_004275687.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324974881|gb|ADY53865.1| hypothetical protein Pedsa_3330 [Pedobacter saltans DSM 12145]
Length = 397
Score = 50.8 bits (120), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 40/108 (37%), Positives = 60/108 (55%), Gaps = 9/108 (8%)
Query: 32 TQRLASLDIFRGLAVALMILVD--HAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+ R+ S+DI RGL + LM+ V+ + G W + A +G LAD+V P FLF+VG++
Sbjct: 6 SVRILSIDIMRGLTLFLMLFVNDLYEPGVPKWLVHTKANVDGMGLADWVFPGFLFMVGLS 65
Query: 88 IALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELT 131
I A+K + ++ K ++ R L LLF GIL+ P ELT
Sbjct: 66 IPYAVKARKAKGESGFKIFVHILLRALSLLFIGILMLNADRVNP-ELT 112
>gi|88859970|ref|ZP_01134609.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
gi|88817964|gb|EAR27780.1| hypothetical protein PTD2_18200 [Pseudoalteromonas tunicata D2]
Length = 378
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 43/144 (29%), Positives = 68/144 (47%), Gaps = 27/144 (18%)
Query: 31 KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWP----EISHAPWNGCNLADF 75
+ +RLASLD RG+ + AL IL G W + H+ W+G D
Sbjct: 9 QKRRLASLDALRGMDMFWILGGEKIFAALFILTGWTG--WQVAHGQTLHSNWHGFTFYDL 66
Query: 76 VMPFFLFIVGVAIALALKRIPDRADAVKKVIF-RTLKLLF----WGILLQGGFSHAPDEL 130
+ P F+F+ GVA+ L+ KRI ++V + + LK LF +G+L G+
Sbjct: 67 IFPLFIFLAGVAMGLSPKRIDHLPFQERRVYYAKALKRLFLLAGFGVLYNHGWGTGIP-- 124
Query: 131 TYGVDVRMIRLCGVLQRIALSYLL 154
++ IR VL RIA+++ +
Sbjct: 125 ---FNLEEIRYASVLGRIAIAWFV 145
>gi|284041428|ref|YP_003391358.1| heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
DSM 74]
gi|283820721|gb|ADB42559.1| Heparan-alpha-glucosaminide N-acetyltransferase [Spirosoma linguale
DSM 74]
Length = 381
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 62/274 (22%), Positives = 102/274 (37%), Gaps = 83/274 (30%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWG 116
+ SH WNG D + P F+F+ GV+ + L + D+A +K+I R L L+ G
Sbjct: 60 QFSHPAWNGFRAYDLIFPLFMFMAGVSTPFSVGSRLDQGTDKAKIARKIISRGLILVVLG 119
Query: 117 ILLQGG-FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRF 175
I+ G F+ +++ R VL RI L+ + L+ ++ +
Sbjct: 120 IIYNNGLFNRVFEDM---------RFPSVLGRIGLAGMFAQLIYLYFRPRAQ-------- 162
Query: 176 SIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNP 235
Y W +L+ Y AL+ VP CG L
Sbjct: 163 -----YIWF----VGLLLGYWALMMLVPVPG--------------------CGA-GVLTM 192
Query: 236 PCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSS 295
CN +IDR ++ H+Y H +PEGL S+
Sbjct: 193 ECNLASFIDRMLVP-GHLYKT--------------------------IH---DPEGLFST 222
Query: 296 VSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMG 329
+ +I +T++G+ F + T G K + +G
Sbjct: 223 LPAIDNTLLGI-FAGTFLRTHGRTGNQKTALLLG 255
>gi|218260820|ref|ZP_03475939.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
DSM 18315]
gi|423344001|ref|ZP_17321714.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
CL02T12C29]
gi|218224343|gb|EEC96993.1| hypothetical protein PRABACTJOHN_01603 [Parabacteroides johnsonii
DSM 18315]
gi|409213863|gb|EKN06876.1| hypothetical protein HMPREF1077_03144 [Parabacteroides johnsonii
CL02T12C29]
Length = 371
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 80/330 (24%), Positives = 127/330 (38%), Gaps = 102/330 (30%)
Query: 31 KTQRLASLDIFRG-----------LAVALMILVDH----AGGDWPEISHAPWNGCNLADF 75
+++RL SLD RG L VAL L + GD ++ H W+G D
Sbjct: 6 QSRRLLSLDALRGFDMFFIMGGASLFVALATLFPNPFFQVIGD--QMHHVKWDGLTHHDT 63
Query: 76 VMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHAPDELT 131
+ P FLFI G++ +L++ ++ AD +K+I R L L+ G + G +
Sbjct: 64 IFPLFLFIAGISFPFSLEKQREQGKTDADIYRKIIRRGLTLVVLGFVYNGLLNF------ 117
Query: 132 YGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACV 191
D R VL RI L+++ +L+ + T+ + W+ A +
Sbjct: 118 ---DFEHQRYASVLGRIGLAWMFGALIFVNTRTITRV----------------WITVA-I 157
Query: 192 LVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVL-GI 250
LV Y LL PD + VF + + VGY+DR +L G
Sbjct: 158 LVGYWLLLAFVPAPD----------GNGAGVFTM----------EGSLVGYVDRLLLPGR 197
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGH 310
H+ H +PEG+LS+V ++ + ++G+ G
Sbjct: 198 LHLTVH-------------------------------DPEGILSTVPAVATALLGMFTGE 226
Query: 311 VIIHTKGHLARLKQ---WVTMGFALLIFGL 337
I + L K+ V G LL GL
Sbjct: 227 FIKMQREGLTDKKKVGGLVIAGAVLLAVGL 256
>gi|377572860|ref|ZP_09801940.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
104925]
gi|377538518|dbj|GAB47105.1| hypothetical protein MOPEL_003_01300 [Mobilicoccus pelagius NBRC
104925]
Length = 439
Score = 50.4 bits (119), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 73/292 (25%), Positives = 114/292 (39%), Gaps = 81/292 (27%)
Query: 31 KTQRLASLDIFRGLAVALMILVDH--AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
+ RL SLD+ RG+ + + ++V+ +W E HA W G + D V P F+ + G +
Sbjct: 8 RGGRLESLDVCRGVMLVVSVVVNAWFTAPEWFE--HAAWTGVHPVDLVFPAFVTLSGAGM 65
Query: 89 ALAL-KRIPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG-VDVRMIRLCGVLQ 146
A+A +R+P V + + R L L G+ F+ A L G VDV +R GVLQ
Sbjct: 66 AIAFARRVP-----VARQVRRVLVLTAAGL----AFAVAGQVLGTGAVDVATLRFTGVLQ 116
Query: 147 RIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPD 206
A L + LV + + WHWL AA V+ A L ++
Sbjct: 117 LYAFLVLALGLVAVVVR-----------------RWWHWLAAAAVVAGAQAWLLASWASS 159
Query: 207 WQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMYHHPAWRRSKACT 266
+ K CN G +D V G HMY
Sbjct: 160 CPGGALTKA---------------------CNPSGVVDAAVFG-PHMY------------ 185
Query: 267 QDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVI-IHTKG 317
G L D PEG +++ ++++ ++G G ++ H +G
Sbjct: 186 ----VMGRLGHD----------PEGFVAAAGALVTALVGAAAGRLMWEHRRG 223
>gi|326799399|ref|YP_004317218.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326550163|gb|ADZ78548.1| hypothetical protein Sph21_1988 [Sphingobacterium sp. 21]
Length = 398
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 34/101 (33%), Positives = 56/101 (55%), Gaps = 16/101 (15%)
Query: 29 HLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW--------NGCNLADFVMPFF 80
+ ++R+ S+DI RGL + LM+ V+ D E W +G LAD+V P F
Sbjct: 5 KVASERILSVDIMRGLTLLLMLFVN----DLFEPGVPAWLLHTKVDVDGMGLADWVFPGF 60
Query: 81 LFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGI 117
LFIVGV++ A++ ++ ++ +++I RTL LL G+
Sbjct: 61 LFIVGVSVPYAIRSRLNKGESKRQIIGHIAVRTLSLLIIGV 101
>gi|390946391|ref|YP_006410151.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
gi|390422960|gb|AFL77466.1| hypothetical protein Alfi_1113 [Alistipes finegoldii DSM 17242]
Length = 366
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 71/313 (22%), Positives = 115/313 (36%), Gaps = 93/313 (29%)
Query: 31 KTQRLASLDIFRGLAVALMI----LVDHAGGDWPE---------ISHAPWNGCNLADFVM 77
+++RL SLD RG + ++ LV G WP +SH W+G D +
Sbjct: 3 QSERLMSLDALRGFDMLFIMGFASLVVAVCGLWPSAVTDAAAASMSHVAWDGFAHHDTIF 62
Query: 78 PFFLFIVGVAI--ALALKRIPDRADA--VKKVIFRTLKLLFWGILLQGGFSHAPDELTYG 133
P FLFI GV+ ++A +R ++ K++ R L L+ G++ G F
Sbjct: 63 PLFLFIAGVSFPYSVAKQRAGGMSEGRIYAKIVRRGLTLVVLGMVYNGLFK--------- 113
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLV 193
+D +R+ VL RI L++ S+ + + K ++ +A VL
Sbjct: 114 LDFENLRIASVLGRIGLAW---SIAAVLYLNFGVKTRAA--------------IAVAVLA 156
Query: 194 VYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHM 253
Y AL V L N GYIDR+ L
Sbjct: 157 GYGAL--------------------SALVAAPDAAGAGPLTFEGNLAGYIDRQFL----- 191
Query: 254 YHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILSTIIGVHFGHVII 313
G L + F+PEGLLS+V ++++ ++G+ G +
Sbjct: 192 ------------------PGKL-------IYGSFDPEGLLSTVPAVVTAMLGMFTGEFVR 226
Query: 314 HTKGHLARLKQWV 326
R W+
Sbjct: 227 RGDIRGGRKTLWM 239
>gi|374373619|ref|ZP_09631279.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
gi|373234592|gb|EHP54385.1| hypothetical protein NiasoDRAFT_2435 [Niabella soli DSM 19437]
Length = 397
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 55/188 (29%), Positives = 83/188 (44%), Gaps = 31/188 (16%)
Query: 34 RLASLDIFRGLAVALMILVD--HAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
R+ S+DI RG+ + LM+ V+ + G W + A + LAD+V P FLF+VG++I
Sbjct: 8 RIRSIDIMRGITLCLMLFVNDLYEPGVPHWLVHTKAETDSMGLADWVFPGFLFMVGLSIP 67
Query: 90 LALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL--------------- 130
A+ + D V ++FR++ LL G+L+ G P
Sbjct: 68 FAIDSRRRKGDEWPQLVLHILFRSVSLLIIGLLMLNGGRVNPQLTGMPVLLWKSLVYLCI 127
Query: 131 -----TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQD--KDQSVGRFSIFRLYCW 183
TY V+ RM +LQ + LL LV IF + K G + I L W
Sbjct: 128 FLVWNTYPVNKRMKPFFILLQLAGIGGLL-YLVWIFKAGIPGAIKWMETGWWGILGLIGW 186
Query: 184 HWLMAACV 191
+L AA +
Sbjct: 187 GYLTAALI 194
>gi|292609605|ref|XP_002660455.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase [Danio
rerio]
Length = 292
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 31/47 (65%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
+RL SLD FRGL++ +M+ V++ GG + H WNG +AD V P+
Sbjct: 246 RRLRSLDTFRGLSLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPW 292
>gi|187735009|ref|YP_001877121.1| transmembrane protein [Akkermansia muciniphila ATCC BAA-835]
gi|187425061|gb|ACD04340.1| putative transmembrane protein [Akkermansia muciniphila ATCC
BAA-835]
Length = 373
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/147 (31%), Positives = 63/147 (42%), Gaps = 25/147 (17%)
Query: 28 SHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGCNLAD 74
S + QR+A++D RG L VA + L +W +H W G D
Sbjct: 5 SDTRPQRIAAIDALRGFDMFFLTGGLALVVAGINLFYDRSPEWLVKHSTHVAWEGFAAWD 64
Query: 75 FVMPFFLFIVGVAIALAL-KRIPDRA--DAVKKVIFRTLKLLFWGILLQGG-FSHAPDEL 130
VMP FLFIVG A+ + KRI KV R + L G+++QG S P
Sbjct: 65 LVMPLFLFIVGTAMPFSFSKRIGSEPLWKIYLKVARRVVVLFLLGMVVQGNLLSFEPS-- 122
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSL 157
RM C LQ IA YL+ ++
Sbjct: 123 ------RMSLYCNTLQAIASGYLIAAI 143
>gi|134025078|gb|AAI35092.1| Unknown (protein for IMAGE:7224994) [Danio rerio]
Length = 291
Score = 50.1 bits (118), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 21/47 (44%), Positives = 31/47 (65%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPF 79
+RL SLD FRGL++ +M+ V++ GG + H WNG +AD V P+
Sbjct: 245 RRLRSLDTFRGLSLVIMVFVNYGGGRYWFFRHESWNGLTVADLVFPW 291
>gi|154492358|ref|ZP_02031984.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
43184]
gi|423722056|ref|ZP_17696232.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
CL09T00C40]
gi|154087583|gb|EDN86628.1| hypothetical protein PARMER_01992 [Parabacteroides merdae ATCC
43184]
gi|409242758|gb|EKN35518.1| hypothetical protein HMPREF1078_00295 [Parabacteroides merdae
CL09T00C40]
Length = 370
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 113/303 (37%), Gaps = 102/303 (33%)
Query: 31 KTQRLASLDIFRG-----------LAVALMILVDHA-----GGDWPEISHAPWNGCNLAD 74
+++RL SLD RG L VAL L ++ GG ++ H W+G D
Sbjct: 6 QSRRLLSLDALRGFDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGLTHHD 62
Query: 75 FVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
+ P FLFI G++ +L++ ++ +D K+++ R + L+ G + G
Sbjct: 63 TIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGCVYNGLLQF----- 117
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
D +R VL RI L ++ +L+ F FR W+ A
Sbjct: 118 ----DFANLRCASVLARIGLGWMFAALL----------------FVHFRTSVRAWI-AGT 156
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
+LV Y + VP A+ G N VGY+DR +L
Sbjct: 157 ILVGYWVWIAFIPVP----------GAEAG-----------PFTLEGNWVGYVDRLLL-- 193
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHFG 309
P H F+PEGLLS++ I++ ++G+ G
Sbjct: 194 -----------------------------PGRLHQGFFDPEGLLSTLPGIVTAMLGMFTG 224
Query: 310 HVI 312
I
Sbjct: 225 EFI 227
>gi|182412825|ref|YP_001817891.1| hypothetical protein Oter_1003 [Opitutus terrae PB90-1]
gi|177840039|gb|ACB74291.1| conserved hypothetical protein [Opitutus terrae PB90-1]
Length = 411
Score = 49.7 bits (117), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 47/132 (35%), Positives = 62/132 (46%), Gaps = 27/132 (20%)
Query: 34 RLASLDIFRG-------------LAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFF 80
RL S+D RG LA+ M L ++ H W G D + P F
Sbjct: 11 RLVSVDALRGFDMFWILGADALVLALGAMSLSPTLRALAGQLEHKDWAGFAFYDLIFPLF 70
Query: 81 LFIVGVAIALALKRI---PDRADAVKKVIFRTLKLLFWGILLQGGFSHA-PDELTYGVDV 136
+FIVGV+ +L + RA AVK+++ RTL LL +GI GG +H PD
Sbjct: 71 VFIVGVSTVFSLTSLVAREGRAAAVKRILRRTLLLLAFGIFYNGGLAHQWPD-------- 122
Query: 137 RMIRLCGVLQRI 148
+RL GVLQRI
Sbjct: 123 --VRLVGVLQRI 132
>gi|223937685|ref|ZP_03629587.1| conserved hypothetical protein [bacterium Ellin514]
gi|223893657|gb|EEF60116.1| conserved hypothetical protein [bacterium Ellin514]
Length = 413
Score = 49.7 bits (117), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 53/204 (25%), Positives = 80/204 (39%), Gaps = 40/204 (19%)
Query: 1 MSEIKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP 60
M+ +A+ T P S P + + K RL SLD +RG + LM W
Sbjct: 1 MNPEEAQATLASPTQESRPARTVPE-----KATRLISLDAYRGFVMLLM--ASEGFNMWR 53
Query: 61 ----------------EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKK 104
+ H W GC L D + P F+F+VGVA+ +L + +
Sbjct: 54 MAEQNPNSSFWQFLKYQTEHVDWRGCALWDLIQPSFMFMVGVAMPFSLASRRAKGQSFNT 113
Query: 105 V----IFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
+ ++R++ L+F GI L+ H TY VL +I L Y + L+
Sbjct: 114 MLGHTLWRSIALVFIGIFLRSVGRHQ----TY------FTFEDVLTQIGLGYTFLFLLAW 163
Query: 161 FTKDVQDKDQS---VGRFSIFRLY 181
VQ VG ++ F LY
Sbjct: 164 TKLRVQFTAAMLILVGYWAAFALY 187
>gi|423345097|ref|ZP_17322786.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
CL03T12C32]
gi|409222883|gb|EKN15820.1| hypothetical protein HMPREF1060_00458 [Parabacteroides merdae
CL03T12C32]
Length = 370
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 71/303 (23%), Positives = 113/303 (37%), Gaps = 102/303 (33%)
Query: 31 KTQRLASLDIFRG-----------LAVALMILVDHA-----GGDWPEISHAPWNGCNLAD 74
+++RL SLD RG L VAL L ++ GG ++ H W+G D
Sbjct: 6 QSRRLLSLDALRGVDMFFIMGGASLFVALATLFPNSFFQAIGG---QMDHVEWDGLTHHD 62
Query: 75 FVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
+ P FLFI G++ +L++ ++ +D K+++ R + L+ G + G
Sbjct: 63 TIFPLFLFIAGISFPFSLEKQREQGKSESDIYKRIVRRGITLVLLGCVYNGLLQF----- 117
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAAC 190
D +R VL RI L ++ +L+ F FR W+ A
Sbjct: 118 ----DFANLRCASVLARIGLGWMFAALL----------------FVHFRTSVRAWI-AGV 156
Query: 191 VLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGI 250
+LV Y + VP A+ G N VGY+DR +L
Sbjct: 157 ILVGYWVWIAFIPVP----------GAEAG-----------PFTLEGNWVGYVDRLLL-- 193
Query: 251 NHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHFG 309
P H F+PEGLLS++ I++ ++G+ G
Sbjct: 194 -----------------------------PGRLHQGFFDPEGLLSTLPGIVTAMLGMFTG 224
Query: 310 HVI 312
I
Sbjct: 225 EFI 227
>gi|440749360|ref|ZP_20928608.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
gi|436482365|gb|ELP38488.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
Length = 401
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 34/94 (36%), Positives = 50/94 (53%), Gaps = 9/94 (9%)
Query: 33 QRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+R+ S+D+FR + + LMI V+ W A +G LAD V P FL IVG++
Sbjct: 11 RRVYSIDVFRAITMMLMIFVNDLWTLEGIPAWLGHVDAKEDGMGLADVVFPAFLVIVGLS 70
Query: 88 IALALKRIPDR----ADAVKKVIFRTLKLLFWGI 117
I AL + ++ A +K + FRTL LL G+
Sbjct: 71 IPFALSKRIEKGERLAGTLKHIFFRTLALLTMGV 104
>gi|333379187|ref|ZP_08470911.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
22836]
gi|332885455|gb|EGK05704.1| hypothetical protein HMPREF9456_02506 [Dysgonomonas mossii DSM
22836]
Length = 395
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/96 (33%), Positives = 53/96 (55%), Gaps = 9/96 (9%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVG 85
K +R+AS+DI+R L + MI V+ W E + A + +D V P FLFI+G
Sbjct: 8 KPKRIASIDIYRALTMFFMIFVNDLWSVSNVPHWLEHAAANEDMLGFSDIVFPSFLFILG 67
Query: 86 VAIALALKRIPDRADA----VKKVIFRTLKLLFWGI 117
++I LA++ + D+ +K +I R++ LL G+
Sbjct: 68 MSIPLAIEIRKKKGDSNSGILKHIIIRSIALLVMGL 103
>gi|332668157|ref|YP_004450945.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332336971|gb|AEE54072.1| Protein of unknown function DUF2261, transmembrane
[Haliscomenobacter hydrossis DSM 1100]
Length = 387
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 36/103 (34%), Positives = 54/103 (52%), Gaps = 17/103 (16%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
+K QRL S+DI R + + LMI V+ D ++H P W +G L+D V P F
Sbjct: 1 MKNQRLPSIDILRAVTMLLMIFVN----DLWSLTHVPHWLLHTAAEEDGMGLSDVVFPAF 56
Query: 81 LFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILL 119
LFIVG++I ALK + + + ++ RT LL G+ +
Sbjct: 57 LFIVGLSIPHALKARLEKGASKGSVMLHILSRTFALLVMGLFM 99
>gi|167764058|ref|ZP_02436185.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
43183]
gi|167698174|gb|EDS14753.1| hypothetical protein BACSTE_02441 [Bacteroides stercoris ATCC
43183]
Length = 394
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 60/113 (53%), Gaps = 15/113 (13%)
Query: 29 HLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFI 83
+L QR+A++D+FR L + LM+ V+ G W + A + +D + P FLF
Sbjct: 3 NLTLQRVAAVDVFRALTMFLMLFVNDIPGLKNVPHWLMHAAADEDMLGFSDTIFPAFLFC 62
Query: 84 VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSHA 126
+G++++ A++ + D +VI +RT+ L+ G+ ++GG SH+
Sbjct: 63 MGMSVSFAIQNRYKKGDTTTQVIAHIFWRTVALIAMGLFSLNSGGIEGGLSHS 115
>gi|345517559|ref|ZP_08797028.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
gi|254837353|gb|EET17662.1| hypothetical protein BSFG_03809 [Bacteroides sp. 4_3_47FAA]
Length = 359
Score = 49.3 bits (116), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 44/144 (30%), Positives = 65/144 (45%), Gaps = 25/144 (17%)
Query: 32 TQRLASLDIFRGLAVALMILVD------HAGGDWP-------EISHAPWNGCNLADFVMP 78
+ RL SLD+ RGL + L++ D+P + H W G D VMP
Sbjct: 7 SSRLDSLDMLRGLDLFLLVFFQPVLMSFGQQTDFPWMTSILYQFEHEVWVGFRFWDLVMP 66
Query: 79 FFLFIVGVAIALALKR---IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
FLF+ GV++ + + I DR +K+ R L L G+++QG G+D
Sbjct: 67 LFLFMTGVSMPFSFAKYRDISDRNAVYRKITRRFLLLFLLGMVVQGNL--------LGLD 118
Query: 136 VRMIRLC-GVLQRIALSYLLVSLV 158
I L LQ IA YL+ +L+
Sbjct: 119 WEHIYLYNNTLQAIAAGYLIAALL 142
>gi|291514624|emb|CBK63834.1| Uncharacterized conserved protein [Alistipes shahii WAL 8301]
Length = 352
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 53/105 (50%), Gaps = 13/105 (12%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV---IFRT-LKLLFWG 116
++ HA WNG + D + P FLFI GVA +L + R K++ IFR L L G
Sbjct: 29 QMQHAAWNGLTIQDTIFPLFLFIAGVAFPFSLAKQRARGFGRKRILDRIFRRGLILALLG 88
Query: 117 ILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIF 161
++ G F ++ +R+ VL RI L+++ +L+ ++
Sbjct: 89 MVYNGLFE---------LNFSSLRIASVLGRIGLAWMFAALLCVY 124
>gi|127512051|ref|YP_001093248.1| hypothetical protein Shew_1118 [Shewanella loihica PV-4]
gi|126637346|gb|ABO22989.1| conserved hypothetical protein [Shewanella loihica PV-4]
Length = 387
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 72/163 (44%), Gaps = 33/163 (20%)
Query: 18 EPDVSDQQEKSHLKTQRLASLDIFRGL---------AVALMILVDHAGGDW----PEISH 64
EP +D K K RL SLD RG A+ +LV W ++ H
Sbjct: 2 EPK-TDTHPKPAAKP-RLMSLDALRGFDMFWILGGEALFAALLVWTGWQGWRIADAQMHH 59
Query: 65 APWNGCNLADFVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILL 119
+ W+G D + P F+F+ GVA+ L+ KR+ P+R + I R + LL +G+L
Sbjct: 60 SQWHGFTFYDLIFPLFIFLSGVALGLSPKRLDSLPWPERLPLYRHAIKRLMLLLLFGVLY 119
Query: 120 QGGFS----HAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
G+ A DE +R VL RIA ++ +L+
Sbjct: 120 NHGWGTGMPMAADE---------VRYASVLGRIAFAWFFAALL 153
>gi|427384705|ref|ZP_18881210.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
12058]
gi|425727966|gb|EKU90825.1| hypothetical protein HMPREF9447_02243 [Bacteroides oleiciplenus YIT
12058]
Length = 398
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 59/113 (52%), Gaps = 15/113 (13%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLF 82
+L QR+A++D+FR L + LM+ V+ G W E + + +D + P FLF
Sbjct: 6 KNLAPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHAEMNEDMMGFSDTIFPAFLF 65
Query: 83 IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
+G++++ A++ + D +VI +RT+ L+ G+ ++GG SH
Sbjct: 66 CMGMSVSFAIQNRYRKGDTTLQVIAHVFWRTVALIAMGLFSLNSGGIEGGLSH 118
>gi|393788826|ref|ZP_10376952.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
CL02T12C05]
gi|392653932|gb|EIY47582.1| hypothetical protein HMPREF1068_03232 [Bacteroides nordii
CL02T12C05]
Length = 376
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 37/145 (25%), Positives = 64/145 (44%), Gaps = 26/145 (17%)
Query: 34 RLASLDIFRGLAVALMILVDHA------GGDWP-------EISHAPWNGCNLADFVMPFF 80
RLASLDI RG + L++ + P + H W G D VMP F
Sbjct: 11 RLASLDILRGFDLFLLVFFQPVFVALARQLNLPFLDEVLYQFDHEVWEGFRFWDLVMPLF 70
Query: 81 LFIVGVAIALALKRIP----DRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
LF+ G ++ +L + D ++++ R + L +G+++QG G D
Sbjct: 71 LFMTGASMPFSLSKYKTASVDYWPVYRRILKRVILLFIFGMIVQGNL--------LGFDS 122
Query: 137 RMIRL-CGVLQRIALSYLLVSLVEI 160
+ I LQ IA+ Y + +++++
Sbjct: 123 KHIYFYSNTLQSIAVGYFIAAVIQL 147
>gi|237722081|ref|ZP_04552562.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|293373568|ref|ZP_06619919.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|299145142|ref|ZP_07038210.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
gi|229448950|gb|EEO54741.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|292631466|gb|EFF50093.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|298515633|gb|EFI39514.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
Length = 377
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 39/145 (26%), Positives = 64/145 (44%), Gaps = 26/145 (17%)
Query: 34 RLASLDIFRGLAVALMIL-------------VDHAGGDWPEISHAPWNGCNLADFVMPFF 80
RLASLDI RG + L++ + + H W G D VMP F
Sbjct: 12 RLASLDILRGFDLFLLVFFQPVFVALARQMNMSFLDSILYQFDHEVWEGFRFWDLVMPLF 71
Query: 81 LFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF----WGILLQGGFSHAPDELTYGVDV 136
LF+ G ++ +L + + V R LK +F +G+++QG G+D
Sbjct: 72 LFMTGASMPFSLSKYIGTTGSYWPVYRRILKRVFLLFIFGMIVQGNL--------LGLDA 123
Query: 137 RMIRL-CGVLQRIALSYLLVSLVEI 160
+ L LQ IA+ YL+ +++++
Sbjct: 124 THLYLYSNTLQSIAVGYLIAAVIQL 148
>gi|430744438|ref|YP_007203567.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
18658]
gi|430016158|gb|AGA27872.1| hypothetical protein Sinac_3623 [Singulisphaera acidiphila DSM
18658]
Length = 454
Score = 48.9 bits (115), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 36/105 (34%), Positives = 55/105 (52%), Gaps = 7/105 (6%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW---PEISHAPWNGCNLADFVMPF 79
E + KT R+ S+D FRG VA M +V+ GG P + H N + AD +MP
Sbjct: 47 SNGEAAGTKTGRIVSMDQFRGYTVAGMCVVNFLGGLQAIHPVLKHNN-NYFSYADTIMPS 105
Query: 80 FLFIVGVAIAL-ALKRIPDRADAV--KKVIFRTLKLLFWGILLQG 121
FLF G + L ALKR+ A ++ ++R+L L+ +++ G
Sbjct: 106 FLFACGFSYRLTALKRLDQFGPAAMYRRFVWRSLGLVLLSLMMYG 150
>gi|256424049|ref|YP_003124702.1| hypothetical protein Cpin_5069 [Chitinophaga pinensis DSM 2588]
gi|256038957|gb|ACU62501.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
Length = 390
Score = 48.5 bits (114), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 49/94 (52%), Gaps = 9/94 (9%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGV 86
+QRL S+D FR L + MI V+ G +W + A +G AD V P FLFIVG+
Sbjct: 5 SQRLLSIDAFRALTMLTMIFVNDVSGVKNIPEWIDHVKAQDDGMGFADTVFPAFLFIVGL 64
Query: 87 AIALALKRIPDRADAV----KKVIFRTLKLLFWG 116
+I A+ + + D+ ++ R+L ++ G
Sbjct: 65 SIPFAIGKRISKQDSFFSIESHILLRSLAMIVMG 98
>gi|329960675|ref|ZP_08299018.1| conserved domain protein [Bacteroides fluxus YIT 12057]
gi|328532548|gb|EGF59342.1| conserved domain protein [Bacteroides fluxus YIT 12057]
Length = 394
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 34/117 (29%), Positives = 63/117 (53%), Gaps = 23/117 (19%)
Query: 29 HLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPF 79
+L QR+A++D+FR L + LM+ V+ D P + + P W + +D + P
Sbjct: 3 NLTPQRVAAVDVFRALTMFLMLFVN----DIPGLKNVPHWLMHAKIDEDMLGFSDTIFPA 58
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSHA 126
FLF +G++++LA++ + + +VI +RT+ LL G+ ++GG SH+
Sbjct: 59 FLFCMGMSVSLAIQNRYKKGNTTLQVISHIFWRTIALLAMGLFSLNSGGIEGGLSHS 115
>gi|260911058|ref|ZP_05917694.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
gi|260634862|gb|EEX52916.1| conserved hypothetical protein [Prevotella sp. oral taxon 472 str.
F0295]
Length = 409
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 33/110 (30%), Positives = 58/110 (52%), Gaps = 16/110 (14%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF--RTLK--LLFW- 115
+I+H PW G D +MP F+F+ G+ I ++ + R ++ V F R LK ++ W
Sbjct: 83 QITHVPWQGFCFWDIIMPLFMFMSGITIPFSMAKY-QRGESKAGVGFLLRLLKRFVVLWV 141
Query: 116 -GILLQGGFSHAPDELTYGVDVRMIRL-CGVLQRIALSYLLVSLVEIFTK 163
G+++QG +D R + L LQ IA+ Y++V+L+ ++T
Sbjct: 142 LGMVVQGNL--------LALDARQLHLYSNTLQSIAVGYVVVALLFVYTS 183
>gi|170725675|ref|YP_001759701.1| hypothetical protein Swoo_1314 [Shewanella woodyi ATCC 51908]
gi|169811022|gb|ACA85606.1| conserved hypothetical protein [Shewanella woodyi ATCC 51908]
Length = 378
Score = 48.5 bits (114), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 46/155 (29%), Positives = 72/155 (46%), Gaps = 27/155 (17%)
Query: 25 QEKSHLKT--QRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNG 69
E + KT +RL SLD RG L L++ G W ++ H+ W+G
Sbjct: 1 MEATQAKTPKRRLMSLDALRGFDMFWILGGEALFAGLLLWTGWHGWQWADAQMHHSQWHG 60
Query: 70 CNLADFVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGF- 123
D + P F+F+ GVA+ L+ KR+ R K + R L LLF+G+L G+
Sbjct: 61 FTFYDLIFPLFIFLSGVALGLSPKRLDKLPMAQRMPLYKHSVKRLLLLLFFGVLYNHGWG 120
Query: 124 SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
+ AP V + +R VL RIA ++ +++
Sbjct: 121 TGAP------VAIDEVRYASVLGRIAFAWFFAAML 149
>gi|115770385|ref|XP_001180412.1| PREDICTED: heparan-alpha-glucosaminide N-acetyltransferase-like,
partial [Strongylocentrotus purpuratus]
Length = 78
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 26/78 (33%), Positives = 47/78 (60%), Gaps = 6/78 (7%)
Query: 49 MILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADA----V 102
M+L A GD + +SHA W+G +ADF+ P+F+FI+G +I L++ + + +
Sbjct: 1 MLLAGGAYGDGHYWFVSHAIWSGITVADFMFPWFVFIMGTSIHLSINILLSKGQSYPSIY 60
Query: 103 KKVIFRTLKLLFWGILLQ 120
KK++ R++ L G+ +Q
Sbjct: 61 KKLVSRSITLFIMGVCIQ 78
>gi|227875179|ref|ZP_03993321.1| possible heparan-alpha-glucosaminide N-acetyltransferase
[Mobiluncus mulieris ATCC 35243]
gi|227844084|gb|EEJ54251.1| possible heparan-alpha-glucosaminide N-acetyltransferase
[Mobiluncus mulieris ATCC 35243]
Length = 399
Score = 48.1 bits (113), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 83/348 (23%), Positives = 132/348 (37%), Gaps = 94/348 (27%)
Query: 2 SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
++ +A TT +EP+ ++Q E K R+ SLD+ RG L++ V A
Sbjct: 13 TQSEAATTRQ-----TEPNTGETNQAETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 65
Query: 57 --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
DW + HAPW G D + P F+ + G+ +A A +R KV R + +L
Sbjct: 66 PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 118
Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
LL G S D R G LQ A+ +++ +F ++
Sbjct: 119 VVGLLYNGVSSGQ------WDPATFRFTGPLQVYAVIVTIIATCHLFARN---------- 162
Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
MA + +A+L + W T GV L+
Sbjct: 163 -----------WMAWAGITAGVAVLQTGLLTWWAGT--------------CPSGV---LS 194
Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
P CN G DR +LG HMY + G L D PEGL++
Sbjct: 195 PSCNPSGMWDRALLG-AHMY----------------YGGFLGHD----------PEGLVA 227
Query: 295 SVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTLH 340
++L+ G GH+ + ++ G + + + A+ +FGL L+
Sbjct: 228 ITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLILN 275
>gi|212557932|gb|ACJ30386.1| Conserved hypothetical protein [Shewanella piezotolerans WP3]
Length = 387
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 47/160 (29%), Positives = 75/160 (46%), Gaps = 28/160 (17%)
Query: 20 DVSDQQEKSHLKTQ-RLASLDIFRGLAV-----------ALMILVDHAGGDW--PEISHA 65
+ Q E K + RL SLD RG + AL++L G W ++ H+
Sbjct: 6 NTQSQTEHGPKKNKVRLKSLDALRGFDMFWILGGEAIFAALIVLTGWGGLHWLDKQMHHS 65
Query: 66 PWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV------IFRTLKLLFWGILL 119
W+G D + P F+F+ GVA+ L+ KR+ D+ V+++ + R L LL G++
Sbjct: 66 AWHGFTFYDLIFPLFIFLSGVALGLSPKRL-DKLPMVQRMPLYQHAVKRLLLLLLLGVIY 124
Query: 120 QGGF-SHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
G+ + AP L IR VL RIA ++ +L+
Sbjct: 125 NHGWGTGAPMALGD------IRYASVLGRIAFAWFFCALL 158
>gi|307700906|ref|ZP_07637931.1| putative membrane protein [Mobiluncus mulieris FB024-16]
gi|307613901|gb|EFN93145.1| putative membrane protein [Mobiluncus mulieris FB024-16]
Length = 442
Score = 48.1 bits (113), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 131/347 (37%), Gaps = 94/347 (27%)
Query: 2 SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
++ +A TT +EP+ ++Q E K R+ SLD+ RG L++ V A
Sbjct: 56 TQSEAATTRQ-----TEPNTGETNQAETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 108
Query: 57 --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
DW + HAPW G D + P F+ + G+ +A A +R KV R + +L
Sbjct: 109 PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 161
Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
LL G S D R G LQ A+ +++ +F ++
Sbjct: 162 VVGLLYNGVSSGQ------WDPATFRFTGPLQVYAVIVAIIATCHLFARN---------- 205
Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
MA + +A+L + W T GV L+
Sbjct: 206 -----------WMAWAGITAGVAVLQTGLLTWWAGT--------------CPSGV---LS 237
Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
P CN G DR +LG HMY + G L D PEGL++
Sbjct: 238 PSCNPSGMWDRALLGA-HMY----------------YGGFLGHD----------PEGLVA 270
Query: 295 SVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTL 339
++L+ G GH+ + ++ G + + + A+ +FGL L
Sbjct: 271 ITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLIL 317
>gi|389866878|ref|YP_006369119.1| hypothetical protein MODMU_5285 [Modestobacter marinus]
gi|388489082|emb|CCH90660.1| conserved transmembrane protein of unknown function [Modestobacter
marinus]
Length = 327
Score = 47.8 bits (112), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 28/71 (39%), Positives = 41/71 (57%), Gaps = 4/71 (5%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
+RL +D+ RGLAV M++VD+ G + HA W+G ++AD V P FL + GV ++
Sbjct: 2 RRLHGVDVLRGLAVVGMLVVDNRGNASIATQWHHAAWDGLHVADVVFPAFLLVAGV--SM 59
Query: 91 ALKRIPDRADA 101
R DR A
Sbjct: 60 PFSRRADRPRA 70
>gi|333382416|ref|ZP_08474086.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
BAA-286]
gi|332828727|gb|EGK01419.1| hypothetical protein HMPREF9455_02252 [Dysgonomonas gadei ATCC
BAA-286]
Length = 394
Score = 47.8 bits (112), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 34/108 (31%), Positives = 55/108 (50%), Gaps = 9/108 (8%)
Query: 31 KTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
K R+AS+DIFR L + MI V+ W E + A + +D V P FLFI+G
Sbjct: 8 KPVRVASIDIFRALTMFFMIFVNDFWSVSGVPHWLEHAAASEDMLGFSDVVFPSFLFILG 67
Query: 86 VAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDE 129
++I LA++ + + K++++ R++ LL G+ S DE
Sbjct: 68 MSIPLAMESRMKKGETKKQILWHIVVRSVALLVMGLFTVNLESGVADE 115
>gi|269978070|ref|ZP_06185020.1| putative membrane protein [Mobiluncus mulieris 28-1]
gi|269933579|gb|EEZ90163.1| putative membrane protein [Mobiluncus mulieris 28-1]
Length = 442
Score = 47.8 bits (112), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 131/347 (37%), Gaps = 94/347 (27%)
Query: 2 SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
++ +A TT +EP+ ++Q E K R+ SLD+ RG L++ V A
Sbjct: 56 TQSEAATTRQ-----TEPNTGETNQTETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 108
Query: 57 --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
DW + HAPW G D + P F+ + G+ +A A +R KV R + +L
Sbjct: 109 PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 161
Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
LL G S D R G LQ A+ +++ +F ++
Sbjct: 162 VVGLLYNGVSSGQ------WDPATFRFTGPLQVYAVIVAIIATCHLFARN---------- 205
Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
MA + +A+L + W T GV L+
Sbjct: 206 -----------WMAWAGITAGVAVLQTGLLTWWAGT--------------CPSGV---LS 237
Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
P CN G DR +LG HMY + G L D PEGL++
Sbjct: 238 PSCNPSGMWDRALLGA-HMY----------------YGGFLGHD----------PEGLVA 270
Query: 295 SVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTL 339
++L+ G GH+ + ++ G + + + A+ +FGL L
Sbjct: 271 ITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLIL 317
>gi|392308231|ref|ZP_10270765.1| hypothetical protein PcitN1_06167 [Pseudoalteromonas citrea NCIMB
1889]
Length = 375
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 63/146 (43%), Gaps = 27/146 (18%)
Query: 33 QRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS----HAPWNGCNLADFVM 77
+RLASLD RG+ + AL +L G W H+ W+G D +
Sbjct: 8 KRLASLDALRGMDMFWILGGQSIFAALFVLTGWQG--WKAFEAHTVHSAWHGFTFYDLIF 65
Query: 78 PFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
P F+F+ GVA+ L+ KRI +R K + R L G+L G+
Sbjct: 66 PLFIFLSGVAMGLSPKRIDHLPFSERRGYYNKALKRLFLLSALGVLYNHGWGTGIP---- 121
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLV 158
V + IR VL RIA+++ L+
Sbjct: 122 -VALGEIRYASVLGRIAIAWFFCMLL 146
>gi|392544017|ref|ZP_10291154.1| hypothetical protein PpisJ2_19642 [Pseudoalteromonas piscicida JCM
20779]
Length = 377
Score = 47.4 bits (111), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 42/148 (28%), Positives = 63/148 (42%), Gaps = 27/148 (18%)
Query: 31 KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS----HAPWNGCNLADF 75
K +RLASLD RG+ + AL +L G W H+PW+G D
Sbjct: 6 KPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQG--WKAFEAHTLHSPWHGFTFYDL 63
Query: 76 VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF-----WGILLQGGFSHAPDEL 130
+ P F+F+ GVA+ L+ KRI +K + +G+L G+
Sbjct: 64 IFPLFIFLSGVAMGLSPKRIDHLPFNERKPFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLV 158
+D IR VL RIA ++ +L+
Sbjct: 122 ---MDPDGIRYASVLGRIAFAWFFCALL 146
>gi|386821099|ref|ZP_10108315.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
gi|386426205|gb|EIJ40035.1| hypothetical protein JoomaDRAFT_3082 [Joostella marina DSM 19592]
Length = 395
Score = 47.4 bits (111), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 33/95 (34%), Positives = 50/95 (52%), Gaps = 8/95 (8%)
Query: 33 QRLASLDIFRGLAVALMILVDH----AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
R+ S+DI RGL + LM+ V+ W S A + LAD+V P FLF+VG++I
Sbjct: 5 TRILSIDIMRGLTLFLMLFVNDLFEPGVPKWLVHSKATEDAMGLADWVFPGFLFMVGLSI 64
Query: 89 ALAL----KRIPDRADAVKKVIFRTLKLLFWGILL 119
A K+ + +K ++ RTL LL G+ +
Sbjct: 65 PFAFLSRRKKGEGDLEILKHILVRTLSLLLIGVFM 99
>gi|423223322|ref|ZP_17209791.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392638858|gb|EIY32689.1| hypothetical protein HMPREF1062_01977 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 394
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 61/113 (53%), Gaps = 15/113 (13%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPWNG--CNLADFVMPFFLF 82
+L QR+A++D+FR L + LM+ V+ G + P + HA N +D + P FLF
Sbjct: 2 KNLTPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFPAFLF 61
Query: 83 IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
+G++++ A++ + D +VI +RT+ L+ G+ ++GG SH
Sbjct: 62 CMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114
>gi|168705120|ref|ZP_02737397.1| hypothetical protein GobsU_36644 [Gemmata obscuriglobus UQM 2246]
Length = 387
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 36/92 (39%), Positives = 44/92 (47%), Gaps = 10/92 (10%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG------GDWPEISHAPWNGC 70
+EP + RLASLD FRG V M+LV+ G D P ++H C
Sbjct: 3 AEPPDPKGAAGAPASAPRLASLDQFRGYTVLGMLLVNFVGSFAVIKADVPVLAHHH-TYC 61
Query: 71 NLADFVMPFFLFIVGVAIALALKRIPDRADAV 102
+ AD +MP FLF VG A L R R DAV
Sbjct: 62 SYADTIMPQFLFAVGFAFRLTFAR---RRDAV 90
>gi|409203840|ref|ZP_11232043.1| hypothetical protein PflaJ_21058 [Pseudoalteromonas flavipulchra
JG1]
Length = 377
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 41/148 (27%), Positives = 63/148 (42%), Gaps = 27/148 (18%)
Query: 31 KTQRLASLDIFRGLAV-----------ALMILVDHAGGDWPEIS----HAPWNGCNLADF 75
K +RLASLD RG+ + AL +L G W H+PW+G D
Sbjct: 6 KPKRLASLDALRGMDMFWILGGQSIFAALFVLTGWQG--WKAFEAHTLHSPWHGFTFYDL 63
Query: 76 VMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF-----WGILLQGGFSHAPDEL 130
+ P F+F+ GVA+ L+ KRI +K + +G+L G+
Sbjct: 64 IFPLFIFLSGVAMGLSPKRIDHLPFNERKSFYLKALKRLLLLCAFGVLYNHGWGTGIP-- 121
Query: 131 TYGVDVRMIRLCGVLQRIALSYLLVSLV 158
+D +R VL RIA ++ +L+
Sbjct: 122 ---MDPDGVRYASVLGRIAFAWFFCALL 146
>gi|375255119|ref|YP_005014286.1| hypothetical protein BFO_1396 [Tannerella forsythia ATCC 43037]
gi|363406141|gb|AEW19827.1| putative membrane protein [Tannerella forsythia ATCC 43037]
Length = 375
Score = 47.4 bits (111), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 74/326 (22%), Positives = 121/326 (37%), Gaps = 89/326 (27%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEI-------------SHAPWNGCNLADFVMPF 79
+RL SLD+ RG + ++++ W EI +H W G D +MP
Sbjct: 8 KRLVSLDLLRGFDLFCLLMLQPILMTWLEIADNPAWAPLARQFTHVEWRGVAFWDLIMPL 67
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGG-FSHAPDELTYGV 134
F+F+ G+ + AL + A + LK L F G ++QG + P+
Sbjct: 68 FMFMSGITVPFALSKYKRGAKPGHSFYLKLLKRFVILFFLGWIVQGNLLALDPN------ 121
Query: 135 DVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVV 194
R LQ IA+ Y++ + + RFS FR+ + A VL
Sbjct: 122 --RFHIFANTLQAIAVGYVVTAFCYV-------------RFS-FRVQ-----LGATVLFF 160
Query: 195 YLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGYIDRKVLGINHMY 254
LL VF G+ P N IDR VLG
Sbjct: 161 IAYLL----------------------VFATVGGM--NWEPGTNIAEEIDRCVLG----- 191
Query: 255 HHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSILSTIIGVHFGHVII 313
R + +G + + SW P + +LSS++ +++ + G GH++
Sbjct: 192 ------RFR--------DGIITEADGSWKFDPAYHYTWILSSLNFVVTVMTGSFAGHILR 237
Query: 314 HTKGHLARLKQWVTMGFALLIFGLTL 339
K RL + + G +L++ L +
Sbjct: 238 LRKTARQRLMRLLITGVSLVVAALLM 263
>gi|294139796|ref|YP_003555774.1| hypothetical protein SVI_1025 [Shewanella violacea DSS12]
gi|293326265|dbj|BAJ00996.1| conserved hypothetical protein [Shewanella violacea DSS12]
Length = 378
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 43/145 (29%), Positives = 66/145 (45%), Gaps = 25/145 (17%)
Query: 33 QRLASLDIFRG-----------LAVALMILVDHAGGDWP--EISHAPWNGCNLADFVMPF 79
+RL SLD RG L L+ G W ++ H+ W+G D + P
Sbjct: 11 RRLMSLDALRGFDMFWILGGEALFAGLLAWSSWQGWQWADAQMHHSQWHGFTFYDLIFPL 70
Query: 80 FLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGF-SHAPDELTYG 133
F+F+ GVA+ L+ KR I R K + R LLF+G+L G+ + AP
Sbjct: 71 FIFLSGVALGLSPKRLDKLPIAQRMPLYKHSVKRLFLLLFFGVLYNHGWGTGAP------ 124
Query: 134 VDVRMIRLCGVLQRIALSYLLVSLV 158
V + +R VL RIA ++ +++
Sbjct: 125 VAIDEVRYASVLGRIAFAWFFAAML 149
>gi|224536805|ref|ZP_03677344.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
DSM 14838]
gi|224521571|gb|EEF90676.1| hypothetical protein BACCELL_01681 [Bacteroides cellulosilyticus
DSM 14838]
Length = 394
Score = 47.4 bits (111), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 61/113 (53%), Gaps = 15/113 (13%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPWNG--CNLADFVMPFFLF 82
+L QR+A++D+FR L + LM+ V+ G + P + HA N +D + P FLF
Sbjct: 2 KNLTPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLKHAEMNEDMLGFSDTIFPAFLF 61
Query: 83 IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
+G++++ A++ + D +VI +RT+ L+ G+ ++GG SH
Sbjct: 62 CMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIEGGISH 114
>gi|225013130|ref|ZP_03703543.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
gi|225002750|gb|EEG40733.1| conserved hypothetical protein [Flavobacteria bacterium MS024-2A]
Length = 365
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 27/88 (30%), Positives = 44/88 (50%), Gaps = 14/88 (15%)
Query: 32 TQRLASLDIFRGLAVALMIL-VDHAGGDWPEIS-------------HAPWNGCNLADFVM 77
+QRL SLD FRG+ + L++ H G + + H W G + D +
Sbjct: 8 SQRLRSLDFFRGVVMFLLVAEFSHLFGVFMKTENETITAAADFLFHHVQWEGLHFWDLIQ 67
Query: 78 PFFLFIVGVAIALALKRIPDRADAVKKV 105
PFF+FIVGV+I + ++ D+ K++
Sbjct: 68 PFFMFIVGVSIPYSYANRLEKGDSEKQI 95
>gi|305666718|ref|YP_003863005.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
gi|88708942|gb|EAR01176.1| hypothetical protein FB2170_10666 [Maribacter sp. HTCC2170]
Length = 395
Score = 47.0 bits (110), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 58/114 (50%), Gaps = 19/114 (16%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADF 75
EKS KT R+AS+D+ R L + LMI V+ D+ ++ P W + +D
Sbjct: 1 MEKS--KTLRIASIDVLRALTMLLMIWVN----DFWTLTQVPKWLTHAKPNEDYLGFSDI 54
Query: 76 VMPFFLFIVGVAIALALK----RIPDRADAVKKVIFRTLKLLFWGILLQGGFSH 125
+ P FLFIVG++I A+ + R+ K ++ R++ LL G+ + +H
Sbjct: 55 IFPLFLFIVGLSIPFAINNRMAKGEPRSIMFKHIVIRSISLLIIGVFMVNYETH 108
>gi|431799248|ref|YP_007226152.1| hypothetical protein Echvi_3932 [Echinicola vietnamensis DSM 17526]
gi|430790013|gb|AGA80142.1| Protein of unknown function (DUF1624) [Echinicola vietnamensis DSM
17526]
Length = 412
Score = 47.0 bits (110), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 37/124 (29%), Positives = 57/124 (45%), Gaps = 20/124 (16%)
Query: 4 IKAETTHHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGD 58
+ T P +SEP + + +R ++D+FR + + LMI V+ D
Sbjct: 3 VNNPTKTQRPSKVSEPII---------EAKRSYAIDVFRAVTMLLMIFVNDLWTLEGYPD 53
Query: 59 WPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLL 113
W + + +D + P FLFIVG++I AL KRIP + + +I R L LL
Sbjct: 54 WLGHAAVGEDRLGFSDVIFPAFLFIVGLSIPFALQNRFRKRIP-KIKLAEHIILRGLALL 112
Query: 114 FWGI 117
GI
Sbjct: 113 VMGI 116
>gi|270294981|ref|ZP_06201182.1| conserved hypothetical protein [Bacteroides sp. D20]
gi|270274228|gb|EFA20089.1| conserved hypothetical protein [Bacteroides sp. D20]
Length = 394
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 48/191 (25%), Positives = 88/191 (46%), Gaps = 34/191 (17%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
L QR+A++D+FR L + LM+ V+ D P + + P W + +D + P F
Sbjct: 4 LTLQRIAAVDVFRALTMFLMLFVN----DIPGLKNVPHWLMHARMDEDMMGFSDTIFPAF 59
Query: 81 LFIVGVAIALALKRIPDRAD----AVKKVIFRTLKLLFWGIL------LQGGFSHAPDE- 129
LF +G++++ A++ + D V + +RT+ L+ G+ ++GG SH
Sbjct: 60 LFCMGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSHPWFSI 119
Query: 130 -------LTYGVDVRMIRLCGVL--QRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
LT+GV + + VL LL++ + I+ KD+ K + + I L
Sbjct: 120 LMVIGFFLTWGVYPKAVGTKKVLFTAMKTAGVLLLAFLVIY-KDMNGKPFQISWWGILGL 178
Query: 181 YCWHWLMAACV 191
W + + A +
Sbjct: 179 IGWTYAVCAGI 189
>gi|395803959|ref|ZP_10483200.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
gi|395433603|gb|EJF99555.1| hypothetical protein FF52_18830 [Flavobacterium sp. F52]
Length = 396
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 29/93 (31%), Positives = 44/93 (47%), Gaps = 5/93 (5%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMP 78
+ K +L QR+ S+D RG+ + +MI V+ W + A + D V P
Sbjct: 1 MKIKENLFNQRIVSIDSLRGITIFVMIFVNELASIQNVPQWMKHMPADADAMTFVDLVFP 60
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
FLFIVG++I A + D+ K + TLK
Sbjct: 61 AFLFIVGMSIPFAFNARLIKGDSPKTIWTHTLK 93
>gi|445497064|ref|ZP_21463919.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
gi|444787059|gb|ELX08607.1| hypothetical protein Jab_2c06620 [Janthinobacterium sp. HH01]
Length = 381
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 33/108 (30%), Positives = 53/108 (49%), Gaps = 12/108 (11%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVG 85
K R+ ++D FRG+ + +MI V+ G W E + A + D V P FLFIVG
Sbjct: 5 KPARVLAIDAFRGITILVMIFVNTLAGVRGMPAWMEHAPADADAMTFPDVVFPAFLFIVG 64
Query: 86 VAIALALKRIPDRADAV----KKVIFRTLKLLFWGILL---QGGFSHA 126
++I A+ + D + V+ R LL G+ + +GG++ A
Sbjct: 65 MSIPFAMAQRQAAGDTPAARWRHVLARAAGLLVLGVFMVNAEGGYNEA 112
>gi|429727718|ref|ZP_19262477.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
anaerobius VPI 4330]
gi|429151771|gb|EKX94625.1| hypothetical protein HMPREF9998_00424 [Peptostreptococcus
anaerobius VPI 4330]
Length = 463
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 38/148 (25%), Positives = 71/148 (47%), Gaps = 18/148 (12%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHA----PWNGCNLADFVMPFFLFIVGVAI 88
R+ S+D RGL V L + + + G + +IS+A WNG L D ++P FL ++G +I
Sbjct: 47 MRVQSIDYMRGLLVILSMFMINQGLE-NQISYAFQNSKWNGMTLLDILVPMFLLVIGSSI 105
Query: 89 ALALKR----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
+K+ D VK +++ + G++ + A D +RL G
Sbjct: 106 PFYVKKHYEENEDLRHIVKMSFIKSIIVFVIGLIFSCIYYPANDY---------VRLTGP 156
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSV 172
+Q +A Y++ L+ I ++ K+ ++
Sbjct: 157 IQMMAFVYIMSLLLYIGFLKMRIKNNAL 184
>gi|430747657|ref|YP_007206786.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
18658]
gi|430019377|gb|AGA31091.1| hypothetical protein Sinac_7036 [Singulisphaera acidiphila DSM
18658]
Length = 418
Score = 47.0 bits (110), Expect = 0.015, Method: Compositional matrix adjust.
Identities = 44/189 (23%), Positives = 78/189 (41%), Gaps = 35/189 (18%)
Query: 16 ISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMIL----VDHAGGDWPEIS-------- 63
++ P + ++RLAS+D FRG + L++ + +P+
Sbjct: 15 VNAPKPPESSGSGSAPSRRLASIDAFRGFVMFLLLAEWLKLPQVAKSFPKSELWALLSRH 74
Query: 64 --HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGI 117
H W GC+L D + P F F+VGVA+ ++ R + ++ +R L L+ GI
Sbjct: 75 QQHVEWVGCSLHDLIQPSFSFLVGVALPFSIASRLARGQSTTRMAGHAFWRALVLVLLGI 134
Query: 118 LLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS-----V 172
L+ + G D L +I L Y + L+ + + +D+ + V
Sbjct: 135 FLR----------SMGKDRTNFTFEDTLTQIGLGYGFLFLLGL--RPARDQWIALVVILV 182
Query: 173 GRFSIFRLY 181
G + F LY
Sbjct: 183 GYWGAFALY 191
>gi|306818439|ref|ZP_07452162.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
gi|304648612|gb|EFM45914.1| conserved hypothetical protein [Mobiluncus mulieris ATCC 35239]
Length = 399
Score = 47.0 bits (110), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 83/347 (23%), Positives = 133/347 (38%), Gaps = 94/347 (27%)
Query: 2 SEIKAETTHHHPLIISEPDV--SDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAG--- 56
++ +A TT +EP+ ++Q E K R+ SLD+ RG L++ V A
Sbjct: 13 TQSEAATTRQ-----TEPNTGETNQTETKPAKPGRITSLDVGRGWF--LIMSVTSAAWLL 65
Query: 57 --GDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLF 114
DW + HAPW G D + P F+ + G+ +A A +R KV R + +L
Sbjct: 66 PRPDW--LIHAPWIGIRYYDMIFPLFVTLSGIGLAFAYH---NRVS--FKVTLRRIVVLV 118
Query: 115 WGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGR 174
LL G S D R G LQ A+ +++ +F ++
Sbjct: 119 VVGLLYNGVSSGQ------WDPATFRFTGPLQVYAVIVAIIATCHLFARN---------- 162
Query: 175 FSIFRLYCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLN 234
W++ A + +A+L + W T GV L+
Sbjct: 163 ----------WMVWAGI-TAGVAVLQTGLLTWWAGT--------------CPSGV---LS 194
Query: 235 PPCNAVGYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLS 294
P CN G DR +LG HMY + G L D PEGL++
Sbjct: 195 PSCNPSGMWDRALLG-AHMY----------------YGGFLGHD----------PEGLVA 227
Query: 295 SVSSILSTIIGVHFGHVIIHTK--GHLARLKQWVTMGFALLIFGLTL 339
++L+ G GH+ + ++ G + + + A+ +FGL L
Sbjct: 228 ITGALLTAAAGTTAGHLALSSRRLGWKTGPVKLLALAAAMSVFGLIL 274
>gi|332662942|ref|YP_004445730.1| hypothetical protein [Haliscomenobacter hydrossis DSM 1100]
gi|332331756|gb|AEE48857.1| Protein of unknown function DUF2261, transmembrane
[Haliscomenobacter hydrossis DSM 1100]
Length = 394
Score = 46.6 bits (109), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 39/73 (53%), Gaps = 14/73 (19%)
Query: 34 RLASLDIFRGLAVALMIL----VDHAGGDWPEI----------SHAPWNGCNLADFVMPF 79
RL S+D++RGL + LM+ H +P+ SH PW GC+L D + P
Sbjct: 9 RLGSVDVYRGLVMFLMMAEVLEFGHVAKAFPDSGFWAFLHFHQSHVPWVGCSLHDLIQPS 68
Query: 80 FLFIVGVAIALAL 92
F F+VGVA+ +L
Sbjct: 69 FSFLVGVALPYSL 81
>gi|289422375|ref|ZP_06424221.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
653-L]
gi|289157210|gb|EFD05829.1| hypothetical protein HMPREF0631_1471 [Peptostreptococcus anaerobius
653-L]
Length = 463
Score = 46.6 bits (109), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 34/148 (22%), Positives = 73/148 (49%), Gaps = 18/148 (12%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHA----PWNGCNLADFVMPFFLFIVGVAI 88
R+ S+D RGL V L + + + G + +IS+A WNG L D ++P FL ++G +I
Sbjct: 47 MRVQSIDYMRGLLVILSMFMINQGLE-NQISYAFQNSKWNGMTLNDILVPMFLLVIGSSI 105
Query: 89 ALALKRIPDRADAVKKVI----FRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGV 144
+K+ + + ++ ++ +++ + G++ + A D +RL G
Sbjct: 106 PFYVKKHYEENEDIRHIVKMSFIKSIIVFLIGLIFSCIYYPANDY---------VRLTGP 156
Query: 145 LQRIALSYLLVSLVEIFTKDVQDKDQSV 172
+Q + Y++ L+ I ++ K+ ++
Sbjct: 157 IQMMVFVYIMSLLLYIGFLKMRIKNNAL 184
>gi|372268269|ref|ZP_09504317.1| hypothetical protein AlS89_10220 [Alteromonas sp. S89]
Length = 365
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 43/146 (29%), Positives = 63/146 (43%), Gaps = 23/146 (15%)
Query: 31 KTQRLASLDIFRGLAVALMI------LVDHAGGDW-------PEISHAPWNGCNLADFVM 77
K QRLAS+D RG + +I L A W ++ H PW+G D +
Sbjct: 4 KKQRLASVDALRGFDMFWIIGGEALFLPLFALTGWSIFQFGHAQMQHTPWHGFTFYDLIF 63
Query: 78 PFFLFIVGVAIALALKR-----IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTY 132
P F+F+ GV + LA K + RA +K R L L+ GIL G+
Sbjct: 64 PLFIFLSGVTLGLANKSLRGLPVSQRAPVYRKATKRLLLLVLLGILYNHGWGTGIPA--- 120
Query: 133 GVDVRMIRLCGVLQRIALSYLLVSLV 158
D+ IR VL RI ++ +++
Sbjct: 121 --DLSEIRYASVLARIGFAWFFAAMI 144
>gi|423304305|ref|ZP_17282304.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
CL03T00C23]
gi|423310581|ref|ZP_17288565.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
CL03T12C37]
gi|392681752|gb|EIY75109.1| hypothetical protein HMPREF1073_03315 [Bacteroides uniformis
CL03T12C37]
gi|392684891|gb|EIY78211.1| hypothetical protein HMPREF1072_01244 [Bacteroides uniformis
CL03T00C23]
Length = 394
Score = 46.6 bits (109), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 59/115 (51%), Gaps = 23/115 (20%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
L QR+A++D+FR L + LM+ V+ D P + + P W + +D + P F
Sbjct: 4 LTLQRIAAVDVFRALTMFLMLFVN----DIPRLKNVPHWLMHARMDEDMMGFSDTIFPAF 59
Query: 81 LFIVGVAIALALKRIPDRAD----AVKKVIFRTLKLLFWGIL------LQGGFSH 125
LF +G++++ A++ + D V + +RT+ L+ G+ ++GG SH
Sbjct: 60 LFCMGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114
>gi|189464971|ref|ZP_03013756.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
17393]
gi|189437245|gb|EDV06230.1| hypothetical protein BACINT_01315 [Bacteroides intestinalis DSM
17393]
Length = 394
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 58/113 (51%), Gaps = 15/113 (13%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLF 82
+L QR+A++D+FR L + LM+ V+ G W E + + +D + P FLF
Sbjct: 2 KNLAPQRVAAVDVFRALTMFLMLFVNDIPGLKNIPHWLEHADINEDMMGFSDTIFPAFLF 61
Query: 83 IVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL------LQGGFSH 125
+G++++ A++ + D +VI +RT+ L+ G+ + GG SH
Sbjct: 62 CMGMSVSFAIQNRYRKGDTTLQVIAHIFWRTVALIAMGLFSLNSGGIAGGISH 114
>gi|146302719|ref|YP_001197310.1| hypothetical protein Fjoh_4992 [Flavobacterium johnsoniae UW101]
gi|146157137|gb|ABQ07991.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
Length = 395
Score = 46.6 bits (109), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 28/90 (31%), Positives = 43/90 (47%), Gaps = 5/90 (5%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFL 81
K +L QR+ S+D RG+ + +MI V+ W + A + D V P FL
Sbjct: 4 KENLYNQRIISIDALRGITIFVMIFVNELASIQNVPQWMKHMPADADAMTFVDLVFPAFL 63
Query: 82 FIVGVAIALALKRIPDRADAVKKVIFRTLK 111
FIVG+++ A + D+ K + TLK
Sbjct: 64 FIVGMSVPFAFNARLIKGDSPKVIWTHTLK 93
>gi|320450186|ref|YP_004202282.1| hypothetical protein TSC_c11130 [Thermus scotoductus SA-01]
gi|320150355|gb|ADW21733.1| putative membrane protein [Thermus scotoductus SA-01]
Length = 334
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 42/75 (56%), Gaps = 8/75 (10%)
Query: 32 TQRLASLDIFRGLAVALMILVDH-AGGDWPEISHAPWNG-CNLADFVMPFFLFIVGVAIA 89
+ R +LD FRGL VALM+ V++ G P + H P+ G LAD V P++L +G AI
Sbjct: 4 SARSLALDAFRGLTVALMLFVNNLPPGAPPYLEHGPFGGSVYLADLVFPWYLLAMGAAIP 63
Query: 90 LALKRIPDRADAVKK 104
RA A+++
Sbjct: 64 F------SRASALRR 72
>gi|254445881|ref|ZP_05059357.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
DG1235]
gi|198260189|gb|EDY84497.1| hypothetical protein VDG1235_4128 [Verrucomicrobiae bacterium
DG1235]
Length = 394
Score = 46.2 bits (108), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 31/96 (32%), Positives = 49/96 (51%), Gaps = 9/96 (9%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
R+ S+DIFRGL + LMI V+ W E + + +D + P FLFIVG++
Sbjct: 3 SRIHSIDIFRGLTMLLMIWVNDFWSLTNVPTWLEHAPGDADAMGFSDIIFPAFLFIVGLS 62
Query: 88 IALALKRIPDRADA----VKKVIFRTLKLLFWGILL 119
I AL+ + D+ + ++ R+ LL G L+
Sbjct: 63 IPFALRSRLAKGDSKPTIITHILARSFALLLMGFLM 98
>gi|255037019|ref|YP_003087640.1| hypothetical protein Dfer_3263 [Dyadobacter fermentans DSM 18053]
gi|254949775|gb|ACT94475.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 380
Score = 46.2 bits (108), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 48/163 (29%), Positives = 72/163 (44%), Gaps = 29/163 (17%)
Query: 16 ISEPDVSDQQE-KSHLKTQRLASLDIFRGLAVALMI-----LVDHAGGDW---------P 60
+ P V ++E + RLAS+D RG + LMI + GG
Sbjct: 1 MEAPSVVVKKEVRPSSSPGRLASIDALRGFDM-LMIAGGGQFIATLGGKTGISFIDAVAA 59
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALAL-----KRIPDRADAVKKVIFRTLKLLFW 115
+ H WNG DF+ P FLF+ G ++A ++ K IP KV R L L+
Sbjct: 60 QFEHPAWNGFTFYDFIFPLFLFLAGTSLAFSVTGGLAKGIPPSVIR-NKVFKRMLILIAL 118
Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
GIL + +AP ++ D IR VL RI L+ + +++
Sbjct: 119 GILDK----NAPMDI---FDPAHIRYGSVLGRIGLATFISAIL 154
>gi|406834451|ref|ZP_11094045.1| hypothetical protein SpalD1_22506 [Schlesneria paludicola DSM
18645]
Length = 358
Score = 46.2 bits (108), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 31/104 (29%), Positives = 51/104 (49%), Gaps = 16/104 (15%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKR-------IPDRADAVKKVIFRTLKLL 113
++ H W+G + D + P FLF+VGV + +L + +P+R+ ++I RTL L+
Sbjct: 26 QLEHVKWDGFHFYDLIFPLFLFLVGVVLPFSLTKYQTAGELVPNRSGVYARIIRRTLLLI 85
Query: 114 FWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSL 157
G++ G +D R GVLQRI + Y +L
Sbjct: 86 ALGLIGNG---------ILQLDFTNFRWPGVLQRIGICYFFAAL 120
>gi|456890770|gb|EMG01561.1| hypothetical protein LEP1GSC123_2562 [Leptospira borgpetersenii
str. 200701203]
Length = 74
Score = 45.8 bits (107), Expect = 0.030, Method: Composition-based stats.
Identities = 21/46 (45%), Positives = 28/46 (60%), Gaps = 3/46 (6%)
Query: 38 LDIFRGLAVALMILVDHAGG---DWPEISHAPWNGCNLADFVMPFF 80
+D+FRG+ V MILV++ G + + HA WNGC D V PFF
Sbjct: 1 MDLFRGMTVVGMILVNNPGSWSYVYSPLKHAEWNGCTPTDLVFPFF 46
>gi|317477968|ref|ZP_07937151.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
gi|316905882|gb|EFV27653.1| hypothetical protein HMPREF1007_00267 [Bacteroides sp. 4_1_36]
Length = 394
Score = 45.8 bits (107), Expect = 0.031, Method: Compositional matrix adjust.
Identities = 32/115 (27%), Positives = 59/115 (51%), Gaps = 23/115 (20%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFF 80
L QR+A++D+FR L + LM+ V+ D P + + P W + +D + P F
Sbjct: 4 LTLQRIAAVDVFRALTMFLMLFVN----DIPGLKNVPHWLMHARMDEDMMGFSDTIFPAF 59
Query: 81 LFIVGVAIALALKRIPDRAD----AVKKVIFRTLKLLFWGIL------LQGGFSH 125
LF +G++++ A++ + D V + +RT+ L+ G+ ++GG SH
Sbjct: 60 LFCMGMSVSFAIQNRYKKGDNTLQVVAHIFWRTVALIAMGLFSLNSGGIEGGLSH 114
>gi|410657728|ref|YP_006910099.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
DCA]
gi|409020083|gb|AFV02114.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp.
DCA]
Length = 370
Score = 45.8 bits (107), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 64/131 (48%), Gaps = 15/131 (11%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
T R+ +LD R L+V L+ L G I+HAPW G DF P F+ + G ++A
Sbjct: 5 TNRIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSGTSMA 64
Query: 90 LALKR-IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+A ++ +P ++I R L+ G++ +++ + +R GVLQ +
Sbjct: 65 IAYRKHVP-----WVRLIRRFFVLIIIGLIFN-------SLVSWEFHLSQLRFTGVLQVL 112
Query: 149 ALSYLLVSLVE 159
A + ++ +L+
Sbjct: 113 AFTGIMTTLIT 123
>gi|149277363|ref|ZP_01883505.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
gi|149232240|gb|EDM37617.1| hypothetical protein PBAL39_10746 [Pedobacter sp. BAL39]
Length = 396
Score = 45.8 bits (107), Expect = 0.034, Method: Compositional matrix adjust.
Identities = 40/110 (36%), Positives = 54/110 (49%), Gaps = 16/110 (14%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPW--NGCNLADFVMPFFLFIVGVA 87
QRL S+D R L + LMI V+ D P + HAP N LAD V P FL IVG++
Sbjct: 9 QRLVSIDALRALVMLLMIFVNDLWSLIDIPGWLEHAPGDANYMGLADVVFPAFLVIVGLS 68
Query: 88 IALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYG 133
+ A+ + D + +++RT+ LL GF H E TYG
Sbjct: 69 VPYAIDSRRRKGDGNRAIFLHIVYRTIALLV------MGFFHVNME-TYG 111
>gi|149178821|ref|ZP_01857402.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
8797]
gi|148842362|gb|EDL56744.1| phosphoribosylaminoimidazole synthetase [Planctomyces maris DSM
8797]
Length = 405
Score = 45.4 bits (106), Expect = 0.038, Method: Compositional matrix adjust.
Identities = 34/87 (39%), Positives = 44/87 (50%), Gaps = 5/87 (5%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+R+ SLD FRG VA M LV++ G P + C+ AD +MP FLF VG A
Sbjct: 14 NKRIVSLDQFRGYTVAGMFLVNYMGFFVVCPVVLKHHNTYCSYADTIMPHFLFAVGFAFR 73
Query: 90 LALKRIPDRADAVK---KVIFRTLKLL 113
L R A AV +V+ R L L+
Sbjct: 74 LTFGRRVQTAGAVSAYARVVRRLLGLV 100
>gi|423213223|ref|ZP_17199752.1| hypothetical protein HMPREF1074_01284 [Bacteroides xylanisolvens
CL03T12C04]
gi|392693683|gb|EIY86913.1| hypothetical protein HMPREF1074_01284 [Bacteroides xylanisolvens
CL03T12C04]
Length = 469
Score = 45.4 bits (106), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 30/105 (28%), Positives = 49/105 (46%), Gaps = 16/105 (15%)
Query: 32 TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAPWN--------GCNLADFVMPF 79
R +LD RG A+ M+L V H W + P + G D V PF
Sbjct: 2 NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPDHVFNPLLPGITWVDLVFPF 61
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
FLF +G A ++K+ ++ D+ K+++ R ++L F+ I +Q
Sbjct: 62 FLFAMGAAFPFSIKKRAEKGDSKLKLVYEAGKRGIQLTFFAIFIQ 106
>gi|157374353|ref|YP_001472953.1| hypothetical protein Ssed_1214 [Shewanella sediminis HAW-EB3]
gi|157316727|gb|ABV35825.1| conserved hypothetical protein [Shewanella sediminis HAW-EB3]
Length = 378
Score = 45.4 bits (106), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 44/156 (28%), Positives = 73/156 (46%), Gaps = 29/156 (18%)
Query: 24 QQEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW--PEISHAPWNGC 70
+ ++ + +RL SLD RG L L+ G W ++ H+ W+G
Sbjct: 2 EVAQAKVSKRRLMSLDALRGFDMFWILGGEVLFAGLLAWTGWQGWQWFDTQMHHSEWHGF 61
Query: 71 NLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV------IFRTLKLLFWGILLQGGFS 124
D + P F+F+ GVA+ L+ KR+ D+ K++ + R L LLF+GIL G+
Sbjct: 62 TFYDLIFPLFIFLSGVALGLSPKRL-DKLPIAKRMPLYIHAVKRLLLLLFFGILYNHGWG 120
Query: 125 HAPDELTYGVDVRM--IRLCGVLQRIALSYLLVSLV 158
GV V + +R VL RIA ++ +++
Sbjct: 121 T-------GVPVVLDEVRYASVLGRIAFAWFFAAIL 149
Score = 40.8 bits (94), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 29/96 (30%), Positives = 45/96 (46%), Gaps = 11/96 (11%)
Query: 286 PFEPEGLLSSVSSILSTIIGVHFGHVII--HTKGHLARLKQWVTMGFALLIFGLTLHF-- 341
P +PEG+LS++ ++ + + GV GH II H KG ++ + G A L G L F
Sbjct: 210 PLDPEGILSTIPAVANALAGVFVGHFIIKPHPKGEWFKVVYMLVAGAAFLGLGWLLDFIV 269
Query: 342 -TNGEHGSGKFSTTCV------CLFIYSKVILFQWQ 370
N E + F+ + Y+ V L +WQ
Sbjct: 270 PVNKELWTSSFTLVTIGWSLILLTVFYAIVDLLKWQ 305
>gi|392965134|ref|ZP_10330554.1| hypothetical protein BN8_01612 [Fibrisoma limi BUZ 3]
gi|387846517|emb|CCH52600.1| hypothetical protein BN8_01612 [Fibrisoma limi BUZ 3]
Length = 531
Score = 45.4 bits (106), Expect = 0.049, Method: Compositional matrix adjust.
Identities = 31/97 (31%), Positives = 47/97 (48%), Gaps = 14/97 (14%)
Query: 21 VSDQQEKSHLKTQRLASLDIFRGLAVALM---ILVDHAGGD-------WPEI----SHAP 66
V+ ++ L RL S+D +RG + LM IL H + W + SH
Sbjct: 133 VTAPRDSGGLAGDRLMSMDAYRGFVMLLMAAEILQFHRLHEAFPNSVLWGLLAYHQSHVE 192
Query: 67 WNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVK 103
W GC+L D + P F F+VGVA+ +L +R +V+
Sbjct: 193 WAGCSLHDLIQPSFSFLVGVALPYSLAARLNRGQSVR 229
>gi|149276664|ref|ZP_01882807.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
gi|149232333|gb|EDM37709.1| hypothetical protein PBAL39_14829 [Pedobacter sp. BAL39]
Length = 359
Score = 45.1 bits (105), Expect = 0.052, Method: Compositional matrix adjust.
Identities = 35/140 (25%), Positives = 63/140 (45%), Gaps = 28/140 (20%)
Query: 37 SLDIFRGLAVALM--------ILVDHAGGDWP------EISHAPWNGCNLADFVMPFFLF 82
SLD+ RGL + L+ + + H WP + H PW+G D V P F+F
Sbjct: 2 SLDVMRGLIMILLCAESCLLYVSLQHLNPAWPASGLVEQFFHHPWHGLRFWDLVQPAFMF 61
Query: 83 IVGVAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRM 138
+ G A+ ++ R ++ + + ++ R+LKL G+ L ++ P +
Sbjct: 62 MAGAAMYISYSRKLEKGSSWSQNWNHILIRSLKLFLCGVGLHCVYAGKP----------V 111
Query: 139 IRLCGVLQRIALSYLLVSLV 158
L VL ++A + +L L+
Sbjct: 112 WELWNVLTQLAFTSILAYLI 131
>gi|119774084|ref|YP_926824.1| hypothetical protein Sama_0947 [Shewanella amazonensis SB2B]
gi|119766584|gb|ABL99154.1| conserved hypothetical protein [Shewanella amazonensis SB2B]
Length = 378
Score = 45.1 bits (105), Expect = 0.055, Method: Compositional matrix adjust.
Identities = 44/155 (28%), Positives = 70/155 (45%), Gaps = 27/155 (17%)
Query: 24 QQEKSHLKTQRLASLDIFRG-----------LAVALMILVDHAGGDW----PEISHAPWN 68
Q ++ RL SLD RG L +AL L + W E+ H+ W+
Sbjct: 2 QATQTKAAKPRLMSLDALRGFDMFWILGGEKLFIALFALTGWS--FWQLADAEMHHSEWH 59
Query: 69 GCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFR-TLKLLFWGILLQGGFSHAP 127
G D + P F+F+ GVA+ L+ KR+ A A + I+R +K LF + L ++H
Sbjct: 60 GFTFYDLIFPLFIFLSGVALGLSPKRLDKLAPAERNPIYRHAVKRLFLLLALGVLYNHG- 118
Query: 128 DELTYGVDVRM----IRLCGVLQRIALSYLLVSLV 158
+G + +R VL RIA ++ +L+
Sbjct: 119 ----WGTGIPAHSDEVRYASVLGRIAFAWFFAALL 149
>gi|410660784|ref|YP_006913155.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
gi|409023140|gb|AFV05170.1| N-acetylglucosamine related transporter, NagX [Dehalobacter sp. CF]
Length = 368
Score = 45.1 bits (105), Expect = 0.056, Method: Compositional matrix adjust.
Identities = 35/131 (26%), Positives = 63/131 (48%), Gaps = 15/131 (11%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDW--PEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
T R+ +LD R L+V L+ L G I+HAPW G DF P F+ + G ++A
Sbjct: 5 TNRIKALDFARALSVLLLFLTFVPEGPLYGAYITHAPWFGYTAIDFAFPAFVTLSGTSMA 64
Query: 90 LALKR-IPDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRI 148
+ ++ +P ++I R LL G++ +++ + +R GVLQ +
Sbjct: 65 IVYRKHVP-----WVRLIRRFFVLLIIGLIFN-------SLVSWEFQLSELRFTGVLQVL 112
Query: 149 ALSYLLVSLVE 159
A + ++ +L+
Sbjct: 113 AFTGIMTALIT 123
>gi|374309893|ref|YP_005056323.1| hypothetical protein [Granulicella mallensis MP5ACTX8]
gi|358751903|gb|AEU35293.1| hypothetical protein AciX8_0944 [Granulicella mallensis MP5ACTX8]
Length = 399
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 18/114 (15%)
Query: 33 QRLASLDIFRGLAVALMIL----VDHAGGDWPE----------ISHAPWNGCNLADFVMP 78
QR +++D +RG +ALM+ +P+ SH W G +L D + P
Sbjct: 13 QRNSAVDAYRGFVMALMLAEVFRFAFVAKSFPDNFLLHILAYNQSHVEWTGMSLHDMIQP 72
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAPD 128
F F+VGVA+ +L+ + ++ K + I+R+ L+ GI L+ S A D
Sbjct: 73 SFTFLVGVALPYSLRSRRRKGESFKYMLGHTIWRSFLLVALGIFLRSIHSTATD 126
>gi|387789753|ref|YP_006254818.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
gi|379652586|gb|AFD05642.1| hypothetical protein Solca_0510 [Solitalea canadensis DSM 3403]
Length = 389
Score = 44.7 bits (104), Expect = 0.067, Method: Compositional matrix adjust.
Identities = 32/107 (29%), Positives = 50/107 (46%), Gaps = 9/107 (8%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
RL S+D+ R L + LMI V+ W E +G L+D + P FLFIVG++
Sbjct: 6 NRLGSIDVIRALTMFLMIFVNDLWSLVNVPKWLEHVDVQTDGMGLSDVIFPAFLFIVGLS 65
Query: 88 IALALKRIPDRADA----VKKVIFRTLKLLFWGILLQGGFSHAPDEL 130
I +++ + D+ +K + R+ LL G S+ P L
Sbjct: 66 IPFSVENRIKKGDSTIQLLKHIFIRSFALLVIGFFHVNLESYNPGAL 112
>gi|406831132|ref|ZP_11090726.1| hypothetical protein SpalD1_05831 [Schlesneria paludicola DSM
18645]
Length = 508
Score = 44.7 bits (104), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 36/108 (33%), Positives = 53/108 (49%), Gaps = 10/108 (9%)
Query: 34 RLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
RL SLD FRG + M+LV+ GG P I + C+ AD +MP FLF G A+ L
Sbjct: 14 RLTSLDQFRGYTMLGMLLVNFIGGYKAVSPRILLHTHDYCSYADTIMPHFLFAAGFALRL 73
Query: 91 ALKRIPDRAD------AVKKVIFRTLKLLFW-GILLQGGFSHAPDELT 131
+L R + A+++++ L + W G GG H + +T
Sbjct: 74 SLGRRMEAGGKMPWGRAIRRILGLALVAIIWYGYCDWGGVVHKFNTMT 121
>gi|423287389|ref|ZP_17266240.1| hypothetical protein HMPREF1069_01283 [Bacteroides ovatus
CL02T12C04]
gi|392672504|gb|EIY65971.1| hypothetical protein HMPREF1069_01283 [Bacteroides ovatus
CL02T12C04]
Length = 470
Score = 44.7 bits (104), Expect = 0.074, Method: Compositional matrix adjust.
Identities = 31/106 (29%), Positives = 50/106 (47%), Gaps = 17/106 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
R +LD RG A+ M+L V H W + P +N G D V P
Sbjct: 2 NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
FFLF +G A ++K+ ++ D+ K+++ R ++L F+ I +Q
Sbjct: 62 FFLFAMGAAFPFSIKKRAEKGDSKLKLVYEAGKRGIQLTFFAIFIQ 107
>gi|329849634|ref|ZP_08264480.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
gi|328841545|gb|EGF91115.1| hypothetical protein ABI_25290 [Asticcacaulis biprosthecum C19]
Length = 410
Score = 44.7 bits (104), Expect = 0.078, Method: Compositional matrix adjust.
Identities = 34/113 (30%), Positives = 53/113 (46%), Gaps = 16/113 (14%)
Query: 26 EKSHLKT-------QRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLA 73
EK KT R+ ++D+ R L + LMI V+ W E + +G L+
Sbjct: 6 EKGRAKTMPNKNQFSRVGAIDLVRALTMVLMIFVNDLWSLKGVPVWLEHVASGVDGMGLS 65
Query: 74 DFVMPFFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGG 122
D V P FLFIVG+++ A+ R D++ + R++ LL G+ L G
Sbjct: 66 DVVFPAFLFIVGLSLPFAVSSRQARGDSLGSTVLHILGRSVALLVMGVFLVNG 118
>gi|338209612|ref|YP_004653659.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336303425|gb|AEI46527.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 398
Score = 44.3 bits (103), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 30/98 (30%), Positives = 50/98 (51%), Gaps = 9/98 (9%)
Query: 29 HLKTQRLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFI 83
+ +R+ S+D FR L + LMI V+ W E + A + +D + P FLFI
Sbjct: 2 QITLKRVPSIDAFRALTMLLMIFVNDFWSLSGIPYWLEHAKAEEDFLGFSDIIFPCFLFI 61
Query: 84 VGVAIALALKRIPDRADA----VKKVIFRTLKLLFWGI 117
+G+AI A++ + D V+ +I R++ L+ GI
Sbjct: 62 LGMAIPFAVQNRIAKGDTRWQIVRHIILRSVALIVMGI 99
>gi|406831133|ref|ZP_11090727.1| hypothetical protein SpalD1_05836 [Schlesneria paludicola DSM
18645]
Length = 415
Score = 44.3 bits (103), Expect = 0.086, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 39/69 (56%), Gaps = 4/69 (5%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD---WPEISHAPWNGCNLADFVMPFFLFIVGVA 87
+ RL SLD FRG + MILV++ G P + + C+ AD +MP F F VG A
Sbjct: 11 SSPRLTSLDQFRGYTMVGMILVNYLGAYKEVTPRLFRHTNDYCSYADTIMPHFFFAVGFA 70
Query: 88 IALAL-KRI 95
+ L+L KRI
Sbjct: 71 MRLSLGKRI 79
>gi|424665794|ref|ZP_18102830.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
616]
gi|404574047|gb|EKA78798.1| hypothetical protein HMPREF1205_01669 [Bacteroides fragilis HMW
616]
Length = 385
Score = 44.3 bits (103), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 55/99 (55%), Gaps = 17/99 (17%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-WNG--------CNLADFVMPFFLFI 83
QR+A++D+FR L + LM+ V+ D P + + P W G +D + P FLF
Sbjct: 7 QRVAAVDVFRALTMFLMLFVN----DIPGLRNIPHWLGHAAMTEDMLGFSDTIFPAFLFC 62
Query: 84 VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL 118
+G++I+ A++ + D++ +VI +RT+ L+ G+
Sbjct: 63 MGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGLF 101
>gi|313148038|ref|ZP_07810231.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
gi|313136805|gb|EFR54165.1| conserved hypothetical protein [Bacteroides fragilis 3_1_12]
Length = 385
Score = 44.3 bits (103), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 55/99 (55%), Gaps = 17/99 (17%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-WNG--------CNLADFVMPFFLFI 83
QR+A++D+FR L + LM+ V+ D P + + P W G +D + P FLF
Sbjct: 7 QRVAAVDVFRALTMFLMLFVN----DIPGLRNIPHWLGHAAMTEDMLGFSDTIFPAFLFC 62
Query: 84 VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL 118
+G++I+ A++ + D++ +VI +RT+ L+ G+
Sbjct: 63 MGMSISFAVQNRYQKGDSLLQVIMHIFWRTVALVVMGLF 101
>gi|406835226|ref|ZP_11094820.1| hypothetical protein SpalD1_26403 [Schlesneria paludicola DSM
18645]
Length = 508
Score = 44.3 bits (103), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 38/120 (31%), Positives = 54/120 (45%), Gaps = 22/120 (18%)
Query: 9 THHHPLIISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW- 67
H PL PD R+ S+D FRG AVA MI V+ GG + H+ +
Sbjct: 2 AHDAPLTTDSPD-------------RVISMDQFRGYAVAAMIFVNFVGGF--GVVHSVFK 46
Query: 68 ---NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVK---KVIFRTLKLLFWGILLQG 121
N + AD +M F+F+VG + L + R R + + R+L L+F LL G
Sbjct: 47 HNDNYLSYADTIMANFMFMVGFSFRLTMLRRLKRMSWLATCWSYVRRSLLLVFVSTLLYG 106
>gi|440747820|ref|ZP_20927075.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
gi|436483562|gb|ELP39602.1| N-acetylglucosamine related transporter, NagX [Mariniradius
saccharolyticus AK6]
Length = 372
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/137 (28%), Positives = 61/137 (44%), Gaps = 24/137 (17%)
Query: 32 TQRLASLDIFRGLAVALMILVD--------HAGGDWPEI-----SHAPWNGCNLADFVMP 78
++RL S+D RG + ++ D W ++ H W G DF+ P
Sbjct: 9 SKRLVSIDALRGFDMLMICGADAFFRSLEGKTSFAWVDVLARQFEHPEWIGFTFYDFIFP 68
Query: 79 FFLFIVGVAIALALKRI----PDRADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGV 134
FLF+ GV+I +L + + + KK + RTL L+ G+L + +AP
Sbjct: 69 LFLFVAGVSIPFSLGKSLAENVSKREIYKKALSRTLLLIGLGMLDK----NAPFPF---F 121
Query: 135 DVRMIRLCGVLQRIALS 151
D IRL VL RI ++
Sbjct: 122 DWEQIRLGSVLGRIGIA 138
>gi|406832166|ref|ZP_11091760.1| hypothetical protein SpalD1_11017 [Schlesneria paludicola DSM
18645]
Length = 413
Score = 43.9 bits (102), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 39/133 (29%), Positives = 63/133 (47%), Gaps = 27/133 (20%)
Query: 14 LIISEPDVSDQQEKSHLK-------TQRLASLDIFRGLAVALMI-----LVDHAGGDWPE 61
+I++ P+ S+ + + L+ RL S+D +RG + LM+ L D A PE
Sbjct: 1 MIVTIPNKSEIEGPATLELPAGGAAPSRLVSVDAYRGWVMLLMMAEVLRLRDVAKAL-PE 59
Query: 62 I----------SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKK----VIF 107
SH W GC L D + P F F+VGVA+ L+L+R + + +
Sbjct: 60 SRLWAFLAQQQSHVTWVGCVLHDMIQPSFSFLVGVALPLSLRRRSLSGQPLWQRTAHAAW 119
Query: 108 RTLKLLFWGILLQ 120
R+L L+ G+ L+
Sbjct: 120 RSLVLILLGVFLR 132
>gi|336405631|ref|ZP_08586307.1| hypothetical protein HMPREF0127_03620 [Bacteroides sp. 1_1_30]
gi|335937114|gb|EGM99020.1| hypothetical protein HMPREF0127_03620 [Bacteroides sp. 1_1_30]
Length = 470
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 17/106 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
R +LD RG A+ M+L V H W + P +N G D V P
Sbjct: 2 NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
FFLF +G A ++++ ++ D+ K+++ R ++L F+ I +Q
Sbjct: 62 FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQ 107
>gi|299149192|ref|ZP_07042253.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
gi|298512859|gb|EFI36747.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
Length = 470
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 20/123 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
R +LD RG A+ M+L V H W + P +N G D V P
Sbjct: 2 NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGV 134
FFLF +G A ++++ ++ D+ K+++ R ++L F+ I +Q + P L+
Sbjct: 62 FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFY---PYMLSSPQ 118
Query: 135 DVR 137
D+R
Sbjct: 119 DIR 121
>gi|293369241|ref|ZP_06615831.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|336417197|ref|ZP_08597524.1| hypothetical protein HMPREF1017_04632 [Bacteroides ovatus
3_8_47FAA]
gi|423297813|ref|ZP_17275873.1| hypothetical protein HMPREF1070_04538 [Bacteroides ovatus
CL03T12C18]
gi|292635666|gb|EFF54168.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|335936517|gb|EGM98443.1| hypothetical protein HMPREF1017_04632 [Bacteroides ovatus
3_8_47FAA]
gi|392664450|gb|EIY57988.1| hypothetical protein HMPREF1070_04538 [Bacteroides ovatus
CL03T12C18]
Length = 470
Score = 43.9 bits (102), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 20/123 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
R +LD RG A+ M+L V H W + P +N G D V P
Sbjct: 2 NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGV 134
FFLF +G A ++++ ++ D+ K+++ R ++L F+ I +Q + P L+
Sbjct: 62 FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQHFY---PYMLSSPQ 118
Query: 135 DVR 137
D+R
Sbjct: 119 DIR 121
>gi|373850799|ref|ZP_09593600.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
bacterium TAV5]
gi|372476964|gb|EHP36973.1| Protein of unknown function DUF2261, transmembrane [Opitutaceae
bacterium TAV5]
Length = 401
Score = 43.9 bits (102), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 31/101 (30%), Positives = 53/101 (52%), Gaps = 9/101 (8%)
Query: 31 KTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFFLFIVG 85
R+AS+DI R L + LMI+V+ W S + +G +AD V P FLF+VG
Sbjct: 9 NAGRVASIDILRALTMVLMIIVNDLFTLKNTPAWLGHSASGVDGIGVADVVFPAFLFLVG 68
Query: 86 VAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGG 122
+++ AL+ ++ D ++++ R+ L+ G+ L G
Sbjct: 69 LSLPHALEARRNKGDTGLRLVWHVAVRSFALIVMGVFLVNG 109
>gi|295087641|emb|CBK69164.1| hypothetical protein [Bacteroides xylanisolvens XB1A]
Length = 470
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 17/106 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
R +LD RG A+ M+L V H W + P +N G D V P
Sbjct: 2 NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
FFLF +G A ++++ ++ D+ K+++ R ++L F+ I +Q
Sbjct: 62 FFLFAMGAAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQ 107
>gi|430744193|ref|YP_007203322.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
18658]
gi|430015913|gb|AGA27627.1| hypothetical protein Sinac_3363 [Singulisphaera acidiphila DSM
18658]
Length = 368
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 35/108 (32%), Positives = 51/108 (47%), Gaps = 8/108 (7%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLAD 74
S P ++ S R+ SLD FRG V M+ V+ G P + + C+ AD
Sbjct: 8 SAPTLTSTSAPSG---SRIVSLDQFRGYTVVGMLFVNFLGNFDALPAVFKHHNSYCSYAD 64
Query: 75 FVMPFFLFIVGVAIALA-LKRIPDR--ADAVKKVIFRTLKLLFWGILL 119
+MP F F VG A L L+R+ AV V+ R+L L+ G ++
Sbjct: 65 TIMPQFFFAVGFAYRLTFLRRLETSGIGGAVAAVLRRSLGLILLGFVI 112
>gi|403174292|ref|XP_003333277.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
gi|375170913|gb|EFP88858.2| hypothetical protein PGTG_14197 [Puccinia graminis f. sp. tritici
CRL 75-36-700-3]
Length = 386
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 39/136 (28%), Positives = 65/136 (47%), Gaps = 5/136 (3%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPE-ISHAPW--NGCNLADFVMPFFLFI 83
+S + +R S+D+ RGL M+LV+ AG P +SH AD + P F+F
Sbjct: 19 ESDILAKRDRSIDVLRGLTCLAMVLVNTAGPVRPSWLSHPTSIHQSITFADTLFPCFVFT 78
Query: 84 VGVAIALALKRIPD-RADAVKKVIFRTLKLLFWGILLQGGFSHAPDELTYG-VDVRMIRL 141
G+A A + K + R ++K+ + R +KL GI G ++++ R+
Sbjct: 79 SGLASAQSKKNEQNGRNPSLKRTLIRAIKLNLIGIAYNNLIPRLAGLHGDGLLNLKTYRI 138
Query: 142 CGVLQRIALSYLLVSL 157
VL I +S L+ +L
Sbjct: 139 PSVLGTIGISSLVCTL 154
>gi|160883830|ref|ZP_02064833.1| hypothetical protein BACOVA_01803 [Bacteroides ovatus ATCC 8483]
gi|156110915|gb|EDO12660.1| hypothetical protein BACOVA_01803 [Bacteroides ovatus ATCC 8483]
Length = 470
Score = 43.9 bits (102), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 30/106 (28%), Positives = 50/106 (47%), Gaps = 17/106 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL----VDHAGGDWPEISHAP-----WN----GCNLADFVMP 78
R +LD RG A+ M+L V H W + P +N G D V P
Sbjct: 2 NNRALALDALRGYAIITMVLSATIVTHVLPGWMSHAQTPPPDHVFNPLLPGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
FFLF +G A ++++ ++ D+ K+++ R ++L F+ I +Q
Sbjct: 62 FFLFAMGTAFPFSIRKRAEKGDSKLKLVYEAVKRGIQLTFFAIFIQ 107
>gi|423281270|ref|ZP_17260181.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
610]
gi|404583178|gb|EKA87860.1| hypothetical protein HMPREF1203_04398 [Bacteroides fragilis HMW
610]
Length = 385
Score = 43.5 bits (101), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 30/99 (30%), Positives = 54/99 (54%), Gaps = 17/99 (17%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-WNG--------CNLADFVMPFFLFI 83
QR+A++D+FR L + LM+ V+ D P + + P W G +D + P FLF
Sbjct: 7 QRVAAVDVFRALTMFLMLFVN----DIPGLRNIPHWLGHAAMTEDMLGFSDTIFPAFLFC 62
Query: 84 VGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGIL 118
+G++I+ A++ + D+ +VI +RT+ L+ G+
Sbjct: 63 MGMSISFAVQNRYQKGDSPLQVIMHIFWRTVALIVMGLF 101
>gi|336417194|ref|ZP_08597521.1| hypothetical protein HMPREF1017_04629 [Bacteroides ovatus
3_8_47FAA]
gi|423297816|ref|ZP_17275876.1| hypothetical protein HMPREF1070_04541 [Bacteroides ovatus
CL03T12C18]
gi|335936514|gb|EGM98440.1| hypothetical protein HMPREF1017_04629 [Bacteroides ovatus
3_8_47FAA]
gi|392664453|gb|EIY57991.1| hypothetical protein HMPREF1070_04541 [Bacteroides ovatus
CL03T12C18]
Length = 466
Score = 43.1 bits (100), Expect = 0.21, Method: Compositional matrix adjust.
Identities = 31/111 (27%), Positives = 53/111 (47%), Gaps = 17/111 (15%)
Query: 32 TQRLASLDIFRGLAVALMIL------------VDHAGGDWPEISH-APWNGCNLADFVMP 78
T+R +LD RG A+ M+L + HA P+ + A +G D V P
Sbjct: 2 TKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSH 125
FFLF +G A ++K+ ++ D ++++ +K L F+ I +Q + H
Sbjct: 62 FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGAQLTFFAIFIQHFYPH 112
>gi|436836802|ref|YP_007322018.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
gi|384068215|emb|CCH01425.1| hypothetical protein FAES_3417 [Fibrella aestuarina BUZ 2]
Length = 401
Score = 43.1 bits (100), Expect = 0.22, Method: Compositional matrix adjust.
Identities = 36/99 (36%), Positives = 48/99 (48%), Gaps = 9/99 (9%)
Query: 33 QRLASLDIFRGLAVALMILVDH-----AGGDWPEISHAPWNGCNLADFVMPFFLFIVGVA 87
R+ S+D+ R L + LMI V+ A W E +G LAD V P FLFIVG++
Sbjct: 16 TRVDSIDVLRALTMVLMIFVNDLWSLTAIPGWLEHVPEGADGIGLADVVFPAFLFIVGLS 75
Query: 88 IALAL--KRIPDRADA--VKKVIFRTLKLLFWGILLQGG 122
I A+ +R DA V+ R LL G+ L G
Sbjct: 76 IPFAIQHRRTRHETDAQIAGHVLTRAAALLVMGLWLVNG 114
>gi|428319838|ref|YP_007117720.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
7112]
gi|428243518|gb|AFZ09304.1| hypothetical protein Osc7112_5038 [Oscillatoria nigro-viridis PCC
7112]
Length = 482
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 37/104 (35%), Positives = 48/104 (46%), Gaps = 16/104 (15%)
Query: 24 QQEKS---HLKTQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPW 67
+QE S +QR +LD RG+AV M+L + HA P+ I +
Sbjct: 3 KQENSLAVAAVSQRADALDALRGIAVLAMVLSGTIARKTLPAWMYHAQLPPPDHIFNNKL 62
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
G D V PFFLF +G AI LAL R + KKVI L+
Sbjct: 63 PGLTWVDLVFPFFLFAMGAAIPLALSRRIAKGWDTKKVILSILQ 106
>gi|419719054|ref|ZP_14246346.1| acyltransferase [Lachnoanaerobaculum saburreum F0468]
gi|383304805|gb|EIC96198.1| acyltransferase [Lachnoanaerobaculum saburreum F0468]
Length = 555
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 9/98 (9%)
Query: 26 EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLAD----FVMPF 79
EK + R+ LD+ + L+ +++IL+ + + E+ W G + + F +P
Sbjct: 203 EKKRVIKDRIIGLDVLKILSASMIILIHSSANLYNNHEVGTLAWKGGLILNVIPRFAVPA 262
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
FL I G AL L R D+ A+KK ++ + L W I
Sbjct: 263 FLMISG---ALLLGRKTDQRKAIKKAVYAGIALAIWSI 297
>gi|338211620|ref|YP_004655673.1| hypothetical protein [Runella slithyformis DSM 19594]
gi|336305439|gb|AEI48541.1| Protein of unknown function DUF2261, transmembrane [Runella
slithyformis DSM 19594]
Length = 393
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 32/105 (30%), Positives = 49/105 (46%), Gaps = 18/105 (17%)
Query: 34 RLASLDIFRGLAVALMIL----VDHAGGDWPEIS----------HAPWNGCNLADFVMPF 79
R++S+D +RG + LM+ H P+ S H W GC+L D + P
Sbjct: 8 RISSVDAYRGFVMFLMMAEVLEFGHISKALPDSSFWAFLAYNQDHVEWVGCSLHDLIQPS 67
Query: 80 FLFIVGVA----IALALKRIPDRADAVKKVIFRTLKLLFWGILLQ 120
F F+VGVA IA + + + + R+L L+F GI L+
Sbjct: 68 FSFLVGVALPYSIASRMAKGQNFGSMFGHTVQRSLILIFLGIFLR 112
>gi|315650733|ref|ZP_07903787.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
3986]
gi|315487007|gb|EFU77335.1| conserved hypothetical protein [Lachnoanaerobaculum saburreum DSM
3986]
Length = 555
Score = 43.1 bits (100), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 28/98 (28%), Positives = 48/98 (48%), Gaps = 9/98 (9%)
Query: 26 EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP--EISHAPWNGCNLAD----FVMPF 79
EK + R+ LD+ + L+ +++IL+ + + E+ W G + + F +P
Sbjct: 203 EKKRVIKDRIIGLDVLKILSASMIILIHSSANLYNNHEVGTLAWKGGLILNVIPRFAVPA 262
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
FL I G AL L R D+ A+KK ++ + L W I
Sbjct: 263 FLMISG---ALLLGRKTDQQKAIKKAVYAGVALAIWSI 297
>gi|146300862|ref|YP_001195453.1| hypothetical protein Fjoh_3117 [Flavobacterium johnsoniae UW101]
gi|146155280|gb|ABQ06134.1| Uncharacterized protein [Flavobacterium johnsoniae UW101]
Length = 380
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 65/310 (20%), Positives = 110/310 (35%), Gaps = 105/310 (33%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDH------AGGDWP-------EISHAPWNGCNLAD 74
++ RL SLD+ RG + ++ +H P ++ HA WNG D
Sbjct: 2 NNTTNGRLISLDVLRGFVMFWIMSGEHIIHALAKAAPIPIFIWMSSQLHHAEWNGITFYD 61
Query: 75 FVMPFFLFIVGVAIALA---------LKRIPDRADAVKKVIF-----RTLKLLFWGILLQ 120
+ P FLF+ GV++ + +K D A K+ I+ RT LL G ++
Sbjct: 62 MIFPVFLFVAGVSMPFSFEKKMKLAGVKEPKDLPKAEKRKIYLSMLRRTCILLVLGFVVN 121
Query: 121 GGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRL 180
G + T R VL RI L++ ++ + + K Q + F I
Sbjct: 122 GLLRFDGFDQT--------RFASVLGRIGLAWFFAGIIYL---NFDFKKQLICFFGI--- 167
Query: 181 YCWHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAV 240
L+ Y A + VP++ ++ K+ +
Sbjct: 168 -----------LIGYYAAMKWIPVPNFGAGVLTKEG---------------------SLE 195
Query: 241 GYIDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAP-FEPEGLLSSVSSI 299
GYIDR L P H+ ++PEG+ S++ +I
Sbjct: 196 GYIDRLFL-------------------------------PGRLHSTVYDPEGIFSTIPAI 224
Query: 300 LSTIIGVHFG 309
+ ++GV G
Sbjct: 225 ATALLGVFIG 234
>gi|406831131|ref|ZP_11090725.1| hypothetical protein SpalD1_05826 [Schlesneria paludicola DSM
18645]
Length = 520
Score = 42.7 bits (99), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 2/65 (3%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGD--WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ RL SLD FRG + M+LV++ G P+I + C+ AD +MP FLF G A+
Sbjct: 13 SARLTSLDQFRGYTMLGMLLVNYLGSYHVCPQILKHSHDYCSYADTIMPQFLFAAGFAMR 72
Query: 90 LALKR 94
L+L +
Sbjct: 73 LSLGK 77
>gi|283781521|ref|YP_003372276.1| hypothetical protein Psta_3761 [Pirellula staleyi DSM 6068]
gi|283439974|gb|ADB18416.1| conserved hypothetical protein [Pirellula staleyi DSM 6068]
Length = 417
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 46/191 (24%), Positives = 80/191 (41%), Gaps = 40/191 (20%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEIS------------- 63
+ P ++ + L RL SLD +RG +M+ + G P+++
Sbjct: 4 AAPSLAASTPAATLPA-RLLSLDAYRGF---VMLAMASRGFGIPKVAALPQFASHPTWQF 59
Query: 64 ------HAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLL 113
H W G D + P F+F+VGVA+A + + D K+ IFR + L+
Sbjct: 60 LAGQLDHVAWVGSCFWDLIQPSFMFMVGVAMAYSCAARVSKGDPYWKMLLHAIFRAMVLI 119
Query: 114 FWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV---EIFTKDVQDKDQ 170
G+ L+ S++ D+ + V +I L YL + L+ + + +
Sbjct: 120 ALGVFLR---SNSSDQTNF-------TFMDVTSQIGLGYLPLFLLWGRKFWVQATAAIVI 169
Query: 171 SVGRFSIFRLY 181
VG F++F LY
Sbjct: 170 LVGYFALFALY 180
>gi|408674314|ref|YP_006874062.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
gi|387855938|gb|AFK04035.1| Protein of unknown function DUF2261, transmembrane [Emticicia
oligotrophica DSM 17448]
Length = 391
Score = 42.7 bits (99), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 41/93 (44%), Gaps = 14/93 (15%)
Query: 33 QRLASLDIFRGLAVALMIL-VDHAGG---DWPEIS----------HAPWNGCNLADFVMP 78
RL S DI+RG + LM+ V H G PE S H W GC+L D + P
Sbjct: 5 NRLTSADIYRGFVMLLMMAEVLHFGKVSEALPESSFWAFLAFHQDHVEWVGCSLHDLIQP 64
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
F F+VGV + ++ R + + LK
Sbjct: 65 SFSFLVGVVLPYSIARRLTQREGTNAAFLHALK 97
>gi|320104555|ref|YP_004180146.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
gi|319751837|gb|ADV63597.1| hypothetical protein Isop_3032 [Isosphaera pallida ATCC 43644]
Length = 399
Score = 42.4 bits (98), Expect = 0.33, Method: Compositional matrix adjust.
Identities = 57/202 (28%), Positives = 84/202 (41%), Gaps = 19/202 (9%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG--DWPEISHAPWNGCNLADFVMPFFLFIVGVAIAL 90
R +LD FRG VA MI+V+ GG P I C+ AD +MP F VG A
Sbjct: 15 SRWDALDQFRGYTVAGMIVVNFVGGLAAVPAILKHHNTYCSYADTIMPQFFLAVGFAYRW 74
Query: 91 ALKRIPDRAD---AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
+R AV+ + R L LL G L+ G A D++ + + GVL +
Sbjct: 75 TFLNRLERGGWQAAVRHALGRNLGLLLVGFLMYGLDGKAESW----SDLKALGIRGVLIQ 130
Query: 148 IALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYCWHWLMAACVLVVYLALLYGTYVPDW 207
+LV I + + R RL WL+ +C L V L+ ++ +W
Sbjct: 131 AFQRDFFQTLVHIAIASLWVL-PVINRSVWIRL---TWLLGSCTLHVVLS---SSFYYEW 183
Query: 208 QFTIINKDSADYGKVFNVTCGV 229
+ N+ D G + +T V
Sbjct: 184 ---VTNRPGIDGGPLGFLTWTV 202
>gi|300867270|ref|ZP_07111930.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
gi|300334747|emb|CBN57096.1| conserved membrane hypothetical protein [Oscillatoria sp. PCC 6506]
Length = 486
Score = 42.4 bits (98), Expect = 0.35, Method: Compositional matrix adjust.
Identities = 34/104 (32%), Positives = 46/104 (44%), Gaps = 13/104 (12%)
Query: 21 VSDQQEKSHLKTQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPW 67
V+ Q + +R +LD RG+AV M+L + HA P + +
Sbjct: 7 VNPQDMSTPAVNKRADALDALRGIAVLAMVLSGTIARKTLPAWMYHAQEPPPSHLFNPKL 66
Query: 68 NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
G D V PFFLF +G AI LAL R + KKVI L+
Sbjct: 67 AGLTWVDLVFPFFLFAMGAAIPLALSRRIAKGWDTKKVILSILQ 110
>gi|436833713|ref|YP_007318929.1| Protein of unknown function DUF2261,transmembrane [Fibrella
aestuarina BUZ 2]
gi|384065126|emb|CCG98336.1| Protein of unknown function DUF2261,transmembrane [Fibrella
aestuarina BUZ 2]
Length = 415
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 28/93 (30%), Positives = 48/93 (51%), Gaps = 16/93 (17%)
Query: 27 KSHLKTQRLASLDIFRGLAVALM----ILVDH------AGGDWPEI----SHAPWNGCNL 72
++ RL S+D +RG + LM + DH G W + SH W+GC+L
Sbjct: 23 QTEFPAGRLLSVDAYRGFVMLLMMGEVLHFDHLHEAFPGSGFWALLAYHQSHVDWSGCSL 82
Query: 73 ADFVMPFFLFIVGVAI--ALALKRIPDRADAVK 103
D + P F F+VGVA+ ++A +++ ++ V+
Sbjct: 83 HDLIQPSFSFLVGVALPYSIASRQLKGQSAGVQ 115
>gi|299742203|ref|XP_001832311.2| heparinase II/III family protein [Coprinopsis cinerea okayama7#130]
gi|298405078|gb|EAU89472.2| heparinase II/III family protein [Coprinopsis cinerea okayama7#130]
Length = 786
Score = 42.4 bits (98), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 7/68 (10%)
Query: 293 LSSVSSILSTIIGVHFGHVIIHTKGHLARLKQWVTMGFALLIFGLTLHFTNGEHGSGKFS 352
L+ ++S L + G H+G + +T W T F + +FG T F G+HG KFS
Sbjct: 398 LAEMASALLSATGSHYGLLTENTN-------YWRTGTFHMYVFGPTSLFDFGDHGPNKFS 450
Query: 353 TTCVCLFI 360
TT +F+
Sbjct: 451 TTANTMFL 458
>gi|404404857|ref|ZP_10996441.1| hypothetical protein AJC13_05463 [Alistipes sp. JC136]
Length = 392
Score = 42.4 bits (98), Expect = 0.37, Method: Compositional matrix adjust.
Identities = 35/115 (30%), Positives = 59/115 (51%), Gaps = 12/115 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDH--AGGDWPE-ISHAPW--NGCNLADFVMPFFLFIVG 85
+ R+AS+D+FRGL + M+ V+ + D P + HA + +D + P FLFI+G
Sbjct: 4 QRNRIASIDVFRGLTMFFMLWVNSFWSLSDVPHWLQHAARGEDMLGFSDTIFPAFLFIMG 63
Query: 86 VAIALALKRIPDRADAVKKVIF----RTLKLLFWGIL---LQGGFSHAPDELTYG 133
++ LA+ + D+ K+++ RT L+ G+L FS A L+ G
Sbjct: 64 ASVPLAVGSRRAKGDSTVKIVWHVFTRTFALVVMGLLTVNFGDAFSAAGTGLSRG 118
>gi|284035350|ref|YP_003385280.1| hypothetical protein Slin_0417 [Spirosoma linguale DSM 74]
gi|283814643|gb|ADB36481.1| Protein of unknown function DUF2261, transmembrane [Spirosoma
linguale DSM 74]
Length = 389
Score = 42.4 bits (98), Expect = 0.39, Method: Compositional matrix adjust.
Identities = 25/74 (33%), Positives = 37/74 (50%), Gaps = 14/74 (18%)
Query: 34 RLASLDIFRGLAVALM----ILVDHAGGDWPEI----------SHAPWNGCNLADFVMPF 79
RL S+D +RG + LM + DH +P+ SH W GC+L D + P
Sbjct: 4 RLMSMDAYRGFVMVLMAAEMLQFDHLHETFPDSAFWAFLAHHQSHVAWAGCSLHDLIQPS 63
Query: 80 FLFIVGVAIALALK 93
F F+VGVA+ ++
Sbjct: 64 FSFLVGVALLFSMA 77
>gi|237717694|ref|ZP_04548175.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|299149194|ref|ZP_07042255.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
gi|229453013|gb|EEO58804.1| conserved hypothetical protein [Bacteroides sp. 2_2_4]
gi|298512861|gb|EFI36749.1| conserved hypothetical protein [Bacteroides sp. 3_1_23]
Length = 466
Score = 42.4 bits (98), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 20/123 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL------------VDHAGGDWPEISH-APWNGCNLADFVMP 78
+R +LD RG A+ M+L + HA P+ + A +G D V P
Sbjct: 2 NKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGV 134
FFLF +G A ++K+ ++ D ++++ +K L F+ I +Q + H L+
Sbjct: 62 FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGVQLTFFAIFIQHFYPHV---LSNPQ 118
Query: 135 DVR 137
DVR
Sbjct: 119 DVR 121
>gi|312131791|ref|YP_003999131.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
17132]
gi|311908337|gb|ADQ18778.1| hypothetical protein Lbys_3117 [Leadbetterella byssophila DSM
17132]
Length = 361
Score = 42.0 bits (97), Expect = 0.48, Method: Compositional matrix adjust.
Identities = 34/98 (34%), Positives = 47/98 (47%), Gaps = 20/98 (20%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAP-W--------NGCNLADFVMPFFL 81
K RL S+DIFR L + MI V+ D + + P W +G +D + P FL
Sbjct: 7 KKNRLLSIDIFRALTMFFMIFVN----DLFTVKNVPKWMLHTEMHEDGMGFSDVIFPIFL 62
Query: 82 FIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILL 119
IVG++I A +AD K + RT LL G+ L
Sbjct: 63 LIVGMSIPFA------KADW-KGIGMRTFALLVMGVFL 93
>gi|326798253|ref|YP_004316072.1| hypothetical protein [Sphingobacterium sp. 21]
gi|326549017|gb|ADZ77402.1| hypothetical protein Sph21_0826 [Sphingobacterium sp. 21]
Length = 368
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 35/120 (29%), Positives = 54/120 (45%), Gaps = 23/120 (19%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMI---------LVDHAGGDWPE-----ISHAPWNGC 70
E+ ++T R+ SLD+ RGL + L+ L WP+ H W+G
Sbjct: 1 MEEKKVQT-RILSLDVMRGLIMILLAAESCELYTALSSTYSSGWPQGIIHHFFHHEWHGL 59
Query: 71 NLADFVMPFFLFIVGVAIALALKRIPDRADAV------KKVIFRTLKLLFWGILLQGGFS 124
D V P F+FI G ++ L+ +R +A V K V +R+ KL G+ L +S
Sbjct: 60 YFWDLVQPAFMFIAGTSLYLSFQR--KQAAGVSWSSHFKSVAWRSAKLFLCGVALHCVYS 117
>gi|186472139|ref|YP_001859481.1| hypothetical protein Bphy_3279 [Burkholderia phymatum STM815]
gi|184194471|gb|ACC72435.1| conserved hypothetical protein [Burkholderia phymatum STM815]
Length = 418
Score = 41.6 bits (96), Expect = 0.55, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 64/140 (45%), Gaps = 18/140 (12%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
+++ + RL LD FRGL V L I+VDH GG +S A + L D F+F+
Sbjct: 49 EKRMQIPKNRLIELDFFRGL-VLLFIVVDHIGGS--ILSRATLHAYALCD-AAEVFVFLG 104
Query: 85 GVAIALALKRIPDR---ADAVKKVIFRTLKL----LFWGILL------QGGFS-HAPDEL 130
G A A A + R ADA + R+L+L L +L+ FS AP+
Sbjct: 105 GFATATAYASLAKRHTEADARNRFFKRSLELYRAFLVTAVLMLVVSAVMSAFSIDAPNMA 164
Query: 131 TYGVDVRMIRLCGVLQRIAL 150
T +D M VL+ I L
Sbjct: 165 TTDLDDMMDTPTAVLRDILL 184
>gi|331002841|ref|ZP_08326355.1| hypothetical protein HMPREF0491_01217 [Lachnospiraceae oral taxon
107 str. F0167]
gi|330413330|gb|EGG92698.1| hypothetical protein HMPREF0491_01217 [Lachnospiraceae oral taxon
107 str. F0167]
Length = 557
Score = 41.6 bits (96), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 31/99 (31%), Positives = 52/99 (52%), Gaps = 11/99 (11%)
Query: 26 EKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWP---EISHAPWNGCNLAD----FVMP 78
E+ + T+RL LD+ + ++ ALMI++ HA + +I + W G + + F +P
Sbjct: 205 EEKKINTERLIGLDLLKIIS-ALMIILIHASANIYNNHDIGSSVWFGGLILNVIPRFAVP 263
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGI 117
FL I G AL L R + AV+K ++ L L+ W +
Sbjct: 264 TFLMISG---ALLLGRSTEPRKAVRKALYAGLALVVWSV 299
>gi|210622217|ref|ZP_03293007.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
gi|210154351|gb|EEA85357.1| hypothetical protein CLOHIR_00953 [Clostridium hiranonis DSM 13275]
Length = 483
Score = 41.6 bits (96), Expect = 0.64, Method: Compositional matrix adjust.
Identities = 40/184 (21%), Positives = 77/184 (41%), Gaps = 20/184 (10%)
Query: 3 EIKAETTHHHPLIISEPDVSDQQEKSHLK----TQRLASLDIFRGLAVALMILVDHAG-- 56
E+ E + +I+ + Q + + +R ++++ G+AV +I G
Sbjct: 80 ELSREESSTEQTVINRGEKEQPQAREVVTGDPLKRRYTTVELIMGVAVIAIICSSGIGVL 139
Query: 57 GDWPE-ISHAPWNGCNLADFVMPFFL----FIVGVAIALALKRIPDRADAVKKVIFRTLK 111
G+ P ++ + WNG + D +P L F++ + L +KR + K + +
Sbjct: 140 GEMPAFLAFSKWNGISFGDLGLPLLLASVCFMIPTEVELDVKRKKSFKEICIKKVKVGII 199
Query: 112 LLFWGILLQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQS 171
L GIL+ L + R+ G+LQ IA+ Y+L SL+ + + K
Sbjct: 200 LFVIGILIN---------LIGAWNFNSFRIMGILQMIAVVYMLGSLLYVLFRRFNFKSSV 250
Query: 172 VGRF 175
+ F
Sbjct: 251 IAVF 254
>gi|298384739|ref|ZP_06994299.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
gi|298263018|gb|EFI05882.1| conserved hypothetical protein [Bacteroides sp. 1_1_14]
Length = 473
Score = 41.2 bits (95), Expect = 0.86, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 17/106 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
R +LD RG A+ M+L + HA P+ I + G D V P
Sbjct: 2 NNRAYALDALRGYAIITMVLSATIVTQVLPGWMSHAQTPPPDHIFNPSLPGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
FFLF +G A ++ + ++ D+ K+++ R ++L F+ I +Q
Sbjct: 62 FFLFAMGAAFPFSIGKRAEKGDSKLKLVYEAVKRGVQLTFFAIFIQ 107
>gi|262381364|ref|ZP_06074502.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
gi|262296541|gb|EEY84471.1| conserved hypothetical protein [Bacteroides sp. 2_1_33B]
Length = 410
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 35/114 (30%), Positives = 56/114 (49%), Gaps = 9/114 (7%)
Query: 32 TQRLASLDIFRGLAVALMILVDH--AGGDWPE-ISHAPW--NGCNLADFVMPFFLFIVGV 86
TQR ++DI R + + +MI V+ D P + HA + + LAD V P FLF VG+
Sbjct: 9 TQRNIAIDILRAVTMCVMIFVNDFWTVHDVPHYLEHAAYGEDFMGLADVVFPAFLFAVGM 68
Query: 87 AIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGFSHAPDELTYGVDV 136
+I A++R + + + I RTL LL G + + D+ + + V
Sbjct: 69 SIPFAIERRYAKGMSGESTILHILSRTLALLIMGAFIVNSEAGMADDALFPIGV 122
>gi|291515652|emb|CBK64862.1| hypothetical protein AL1_27050 [Alistipes shahii WAL 8301]
Length = 466
Score = 41.2 bits (95), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 28/95 (29%), Positives = 43/95 (45%), Gaps = 16/95 (16%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEIS---------------HAPWNGCNLADFV 76
R ++DI RGLA+ M+L + + P++ A G D V
Sbjct: 2 NNRAYAVDILRGLAIVGMVLSGYIAWN-PDLPAWLFHAQLPPPSFVFDASVAGITWVDLV 60
Query: 77 MPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
PFFLF +G A L+L R +R D ++++ LK
Sbjct: 61 FPFFLFSMGAAFPLSLGRRLNRGDPPRRIVVSILK 95
>gi|308050627|ref|YP_003914193.1| hypothetical protein Fbal_2917 [Ferrimonas balearica DSM 9799]
gi|307632817|gb|ADN77119.1| conserved hypothetical protein [Ferrimonas balearica DSM 9799]
Length = 366
Score = 41.2 bits (95), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 44/153 (28%), Positives = 66/153 (43%), Gaps = 41/153 (26%)
Query: 33 QRLASLDIFRG-----------LAVALMILVDHAGGDWP-------EISHAPWNGCNLAD 74
QRL +LD RG L AL++L WP ++ H+PW+G D
Sbjct: 6 QRLQALDALRGFDMFWIIGGEKLFAALLLLTG-----WPLWQVAADQMLHSPWHGFTFYD 60
Query: 75 FVMPFFLFIVGVAIALALKRI-----PDRADAVKKVIFRTLKLLFWGILLQGGFS----H 125
+ P F+F+ GV I L + + DR +K + R L L G+L G+
Sbjct: 61 LIFPLFIFLSGVTIGLQRQSLIGIAWSDRQPHYRKALKRLLLLALLGVLYNHGWGTGMPM 120
Query: 126 APDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
A DE IR VL RI +++ L +++
Sbjct: 121 ALDE---------IRYASVLGRIGMAWFLAAMI 144
>gi|189463407|ref|ZP_03012192.1| hypothetical protein BACCOP_04126 [Bacteroides coprocola DSM 17136]
gi|189429836|gb|EDU98820.1| hypothetical protein BACCOP_04126 [Bacteroides coprocola DSM 17136]
Length = 467
Score = 41.2 bits (95), Expect = 0.92, Method: Compositional matrix adjust.
Identities = 35/123 (28%), Positives = 54/123 (43%), Gaps = 28/123 (22%)
Query: 32 TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
QR +LD RG A+ M+L + HA PE I + G D V P
Sbjct: 2 NQRALALDALRGYAIITMVLSATIISSILPGWMSHAQTPPPEHIFNPEIPGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRA--------DAVKKVIFRTLKLLFWGILLQGGFSH---AP 127
FFLF +G A ++ R ++ DA+K R ++L F+ I +Q + + +P
Sbjct: 62 FFLFAMGAAFPFSIGRHAEKGRSKLMLCYDAIK----RGIQLTFFAIFIQHFYPYVISSP 117
Query: 128 DEL 130
+L
Sbjct: 118 QDL 120
>gi|29345856|ref|NP_809359.1| hypothetical protein BT_0446 [Bacteroides thetaiotaomicron
VPI-5482]
gi|383122991|ref|ZP_09943678.1| hypothetical protein BSIG_0267 [Bacteroides sp. 1_1_6]
gi|29337749|gb|AAO75553.1| conserved hypothetical protein [Bacteroides thetaiotaomicron
VPI-5482]
gi|251841914|gb|EES69994.1| hypothetical protein BSIG_0267 [Bacteroides sp. 1_1_6]
Length = 469
Score = 41.2 bits (95), Expect = 0.93, Method: Compositional matrix adjust.
Identities = 29/106 (27%), Positives = 49/106 (46%), Gaps = 17/106 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFVMP 78
R +LD RG A+ M+L + HA P+ I + G D V P
Sbjct: 2 NNRAYALDALRGYAIITMVLSATIVTQVLPGWMSHAQTPPPDHIFNPSLPGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQ 120
FFLF +G A ++ + ++ D+ K+++ R ++L F+ I +Q
Sbjct: 62 FFLFAMGAAFPFSIGKRAEKGDSKLKLVYEAVKRGVQLTFFAIFIQ 107
>gi|255532593|ref|YP_003092965.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
gi|255345577|gb|ACU04903.1| hypothetical protein Phep_2702 [Pedobacter heparinus DSM 2366]
Length = 390
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 23/73 (31%), Positives = 38/73 (52%), Gaps = 5/73 (6%)
Query: 34 RLASLDIFRGLAVALMILVDHAGG-----DWPEISHAPWNGCNLADFVMPFFLFIVGVAI 88
R ++D+ R L + LMI V+ G W + A +G AD + P FLFIVG+++
Sbjct: 8 RFQAVDVLRALTMFLMIFVNDVGSVKYLPHWVDHVEADVDGMGFADTIFPAFLFIVGLSL 67
Query: 89 ALALKRIPDRADA 101
AL+ ++ +
Sbjct: 68 PFALQSRMNKGKS 80
>gi|293369243|ref|ZP_06615833.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
gi|292635668|gb|EFF54170.1| putative membrane protein [Bacteroides ovatus SD CMC 3f]
Length = 466
Score = 40.8 bits (94), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 34/123 (27%), Positives = 57/123 (46%), Gaps = 20/123 (16%)
Query: 32 TQRLASLDIFRGLAVALMIL------------VDHAGGDWPEISH-APWNGCNLADFVMP 78
+R +LD RG A+ M+L + HA P+ + A +G D V P
Sbjct: 2 NKRAYALDALRGYAIITMVLSATVAWNSLPGWMYHAQTPPPDRAFDASLSGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----LLFWGILLQGGFSHAPDELTYGV 134
FFLF +G A ++K+ ++ D ++++ +K L F+ I +Q + P L+
Sbjct: 62 FFLFAMGAAFPFSIKKRFEKGDTKLRLVYEAIKRGVQLTFFAIFIQHFY---PYVLSNPQ 118
Query: 135 DVR 137
DVR
Sbjct: 119 DVR 121
>gi|320107689|ref|YP_004183279.1| hypothetical protein AciPR4_2506 [Terriglobus saanensis SP1PR4]
gi|319926210|gb|ADV83285.1| hypothetical protein AciPR4_2506 [Terriglobus saanensis SP1PR4]
Length = 419
Score = 40.8 bits (94), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 32/113 (28%), Positives = 53/113 (46%), Gaps = 20/113 (17%)
Query: 33 QRLASLDIFRGLAVALMI----LVDHAGGDWPEI----------SHAPWNGCNLADFVMP 78
QR ++D +RGL + LM+ + +P SH W G L D + P
Sbjct: 33 QRNVAVDAYRGLVMLLMMGEVMQFEVVARSFPSSTIWRILSFNQSHVQWVGMGLHDMIQP 92
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAP 127
F F+VGVA+ +L+ + + +K+ I+R+ L+ GI L+ H+P
Sbjct: 93 SFTFLVGVALPYSLRSRQKKGQSFQKIVGHTIWRSFLLVALGIFLRS--IHSP 143
>gi|344238550|gb|EGV94653.1| Heparan-alpha-glucosaminide N-acetyltransferase [Cricetulus
griseus]
Length = 423
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 28/88 (31%), Positives = 41/88 (46%), Gaps = 10/88 (11%)
Query: 24 QQEKSHLKT--QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFL 81
Q E H RL +D FRG+A+ LM+ V++ GG + H+ WN + +P L
Sbjct: 188 QPETRHTSALPYRLRCVDTFRGIALILMVFVNYGGGKYWYFKHSSWN-VSWDKVRIPGVL 246
Query: 82 -------FIVGVAIALALKRIPDRADAV 102
F+V V + K +PDR V
Sbjct: 247 QRLGVTYFVVAVLELIFSKPVPDRCALV 274
>gi|406830436|ref|ZP_11090030.1| hypothetical protein SpalD1_02314 [Schlesneria paludicola DSM
18645]
Length = 380
Score = 40.4 bits (93), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 36/123 (29%), Positives = 57/123 (46%), Gaps = 18/123 (14%)
Query: 34 RLASLDIFRGLAVALMILVD---HAGGDWPE----------ISHAPWNGCNLADFVMPFF 80
R+ASLD RG A+A++++ A + P+ +SH G +L D P F
Sbjct: 27 RVASLDTLRGFAIAILLIATPLVSALKEVPQSATRDMLVWQLSHVKGEGISLFDVGWPAF 86
Query: 81 LFIVGVAIALALKRIPDRAD----AVKKVIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
L I GV++ +L R +R + A ++ ++L + + GGFS P Y DV
Sbjct: 87 LIIAGVSLNFSLARRLERGETRFAAWLDLVRKSLLCALFAFFVHGGFS-IPWAKVYFADV 145
Query: 137 RMI 139
I
Sbjct: 146 LFI 148
>gi|323343607|ref|ZP_08083834.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
33269]
gi|323095426|gb|EFZ38000.1| hypothetical protein HMPREF0663_10369 [Prevotella oralis ATCC
33269]
Length = 468
Score = 40.4 bits (93), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 51/111 (45%), Gaps = 17/111 (15%)
Query: 30 LKTQRLASLDIFRGLAVALMIL------------VDHAGGDWPE-ISHAPWNGCNLADFV 76
+K +R +LD RG A+ MIL + HA P+ + + G D +
Sbjct: 1 MKQERAHALDALRGYAIMTMILSATEAFRVLPAWMYHAQVPPPDHVFNPSIYGITWVDLI 60
Query: 77 MPFFLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGF 123
PFFLF +G AI L+L R +++K+ R LKL F+ I + F
Sbjct: 61 FPFFLFSMGAAIPLSLGRQYKAGASLRKLCRKSAIRWLKLAFFAIFIYHTF 111
>gi|399028715|ref|ZP_10729871.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
gi|398073551|gb|EJL64721.1| hypothetical protein PMI10_01698 [Flavobacterium sp. CF136]
Length = 382
Score = 40.0 bits (92), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 64/326 (19%), Positives = 113/326 (34%), Gaps = 103/326 (31%)
Query: 30 LKTQRLASLDIFRGLAVALMILVDH------AGGDWP-------EISHAPWNGCNLADFV 76
+ RL SLD RG + ++ +H P ++ H W G D +
Sbjct: 5 ITNGRLVSLDALRGFVMFWIMSGEHIIHALAKAAPIPVFVWMSSQLHHTEWEGITFYDMI 64
Query: 77 MPFFLFIVGVAIALALKR----------IPDRADAVKKVIFRTLK----LLFWGILLQGG 122
P FLF+ GV++ + ++ + A KK+ LK L+F G ++ G
Sbjct: 65 FPIFLFVAGVSMPYSFEKKMSIAGVNTPMELPAKEKKKIYLSMLKRTCILIFLGFIVNGL 124
Query: 123 FSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEIFTKDVQDKDQSVGRFSIFRLYC 182
+ T R VL RI L++ ++ + + K Q +
Sbjct: 125 LRFDGYDQT--------RFASVLGRIGLAWFFAGIIYL---NFNLKKQII---------- 163
Query: 183 WHWLMAACVLVVYLALLYGTYVPDWQFTIINKDSADYGKVFNVTCGVRAKLNPPCNAVGY 242
W + +LV Y + VPD+ ++ K+ + GY
Sbjct: 164 --WFIG--ILVGYYLAMKLIPVPDFGAGVLTKEGS---------------------LEGY 198
Query: 243 IDRKVLGINHMYHHPAWRRSKACTQDSPFEGPLRKDAPSWCHAPFEPEGLLSSVSSILST 302
IDR L P SK ++PEGL S++ ++ +
Sbjct: 199 IDRMFL--------PGRLHSKV----------------------YDPEGLFSTIPAVATA 228
Query: 303 IIGVHFGHVIIHTKGHLARLKQWVTM 328
++G+ G + H + K+ + M
Sbjct: 229 LLGMFLGTFLKIKANHFSTNKKILIM 254
>gi|387791847|ref|YP_006256912.1| hypothetical protein Solca_2702 [Solitalea canadensis DSM 3403]
gi|379654680|gb|AFD07736.1| hypothetical protein Solca_2702 [Solitalea canadensis DSM 3403]
Length = 487
Score = 40.0 bits (92), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 40/130 (30%), Positives = 57/130 (43%), Gaps = 25/130 (19%)
Query: 16 ISEPDVSDQQEKSHLKTQ-----RLASLDIFRGLAVALMIL-------------VDHAGG 57
I+E + S + E S +K Q R +LD RGLA+ M+ + H
Sbjct: 4 ITEIEAS-KPEASFVKAQIDKAPRSFALDALRGLAIIGMVFSGVFPHEALWPGYMFHGQV 62
Query: 58 DWPEISHAPW-NGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK----L 112
P+ + P G D V PFFLF +G A LA+ + D K VI LK L
Sbjct: 63 GPPDFKYTPEVPGITWVDLVFPFFLFSMGAAFPLAMNKKIQEGDQ-KGVILNVLKRFALL 121
Query: 113 LFWGILLQGG 122
+F+ I+L+
Sbjct: 122 VFFAIVLRNA 131
>gi|345856865|ref|ZP_08809325.1| hypothetical protein DOT_0678 [Desulfosporosinus sp. OT]
gi|344330006|gb|EGW41324.1| hypothetical protein DOT_0678 [Desulfosporosinus sp. OT]
Length = 237
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 30/94 (31%), Positives = 48/94 (51%), Gaps = 9/94 (9%)
Query: 27 KSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGV 86
S + +QR +D+ R LA+ LM+L H D E + G N+ D+ P + F++G
Sbjct: 8 NSRISSQRYEEIDVLRALAIGLMVLF-HLAYDLKEFA-----GVNI-DYQAPLW-FVIGK 59
Query: 87 AIALALKRIPDRADAV-KKVIFRTLKLLFWGILL 119
AL I + K + R LK+LFWG+++
Sbjct: 60 TSALLFIFISGLSSGFSKSSVRRGLKVLFWGMVV 93
>gi|325102778|ref|YP_004272432.1| hypothetical protein [Pedobacter saltans DSM 12145]
gi|324971626|gb|ADY50610.1| hypothetical protein Pedsa_0021 [Pedobacter saltans DSM 12145]
Length = 466
Score = 40.0 bits (92), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 34/111 (30%), Positives = 47/111 (42%), Gaps = 30/111 (27%)
Query: 29 HLKTQRLASLDIFRGLAVALMILVDHAGGD----W--------------PEISHAPWNGC 70
L +R +SLD RG+A+ LM+L W PEI W
Sbjct: 2 KLTVKRDSSLDSLRGIAIILMVLSGSIAFSILPGWMYHAQVPPPGHKFMPEIPGITW--- 58
Query: 71 NLADFVMPFFLFIVGVAIALALKRIPDRADA-------VKKVIFRTLKLLF 114
D V PFFLF +G AI LA+K+ + + + VK+ + T LF
Sbjct: 59 --VDLVFPFFLFSMGAAIPLAMKKKIENSSSLNIFISIVKRFVLLTFFALF 107
>gi|296121958|ref|YP_003629736.1| hypothetical protein Plim_1707 [Planctomyces limnophilus DSM 3776]
gi|296014298|gb|ADG67537.1| conserved hypothetical protein [Planctomyces limnophilus DSM 3776]
Length = 378
Score = 40.0 bits (92), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 29/100 (29%), Positives = 44/100 (44%), Gaps = 14/100 (14%)
Query: 63 SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDR----ADAVKKVIFRTLKLLFWGIL 118
H W GC+L D V P F+F+VGV+I +L + + ++R++ L+ GI
Sbjct: 32 EHVAWRGCSLWDMVQPSFMFLVGVSIPWSLAAQKSKNVSTGQGWVRAVWRSVLLVVLGIF 91
Query: 119 LQGGFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLV 158
L + D VL +I L YL+V V
Sbjct: 92 LISNNKPSTD----------FSFVNVLTQIGLGYLVVYAV 121
>gi|160891390|ref|ZP_02072393.1| hypothetical protein BACUNI_03840 [Bacteroides uniformis ATCC 8492]
gi|156858797|gb|EDO52228.1| hypothetical protein BACUNI_03840 [Bacteroides uniformis ATCC 8492]
Length = 421
Score = 39.7 bits (91), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 33/114 (28%), Positives = 55/114 (48%), Gaps = 10/114 (8%)
Query: 15 IISEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNG 69
I+ + D+SD+ S +R ++D+ R L + MI V+ H W E + +
Sbjct: 8 ILKQDDMSDKTVYS----RRNPAIDMLRALTMFTMIFVNDFWKVHDVPHWLEHAVYGEDF 63
Query: 70 CNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLKLLFWGILLQGGF 123
LAD V P FLF VG++I A++R + + + + L F +L+ G F
Sbjct: 64 MGLADIVFPCFLFAVGMSIPYAIERRYAKGFSAESTLGHILSRTF-ALLVMGAF 116
>gi|423299515|ref|ZP_17277540.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
CL09T03C10]
gi|408473324|gb|EKJ91846.1| hypothetical protein HMPREF1057_00681 [Bacteroides finegoldii
CL09T03C10]
Length = 467
Score = 39.7 bits (91), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 33/106 (31%), Positives = 49/106 (46%), Gaps = 18/106 (16%)
Query: 33 QRLASLDIFRGLAVALMILVDH-AGGDWPE-ISHAPWN------------GCNLADFVMP 78
+R SLD FRG A+ M+L A G P + HA G D V P
Sbjct: 2 KRAISLDAFRGYAIVTMVLSGTIASGVLPGWMYHAQMGPRSNYIFDPQLYGITWVDLVFP 61
Query: 79 FFLFIVGVAIALALKRIPDRADAVKKVI----FRTLKLLFWGILLQ 120
FFLF +G AI ++ ++ + + K+I R ++L F+ I +Q
Sbjct: 62 FFLFAMGAAIPFSVGGKIEKGENLWKIIGECVLRGIRLAFFAIFIQ 107
>gi|420256474|ref|ZP_14759317.1| hypothetical protein PMI06_09785 [Burkholderia sp. BT03]
gi|398043145|gb|EJL36078.1| hypothetical protein PMI06_09785 [Burkholderia sp. BT03]
Length = 373
Score = 39.7 bits (91), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 46/140 (32%), Positives = 62/140 (44%), Gaps = 18/140 (12%)
Query: 25 QEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIV 84
Q+ RL LD FRGL V + I+VDH GG +S A + L D F+F+
Sbjct: 4 QKGMQTSKNRLIELDFFRGL-VLIFIVVDHIGGS--ILSRATLHAYALCD-AAEVFVFLG 59
Query: 85 GVAIALALKRIPDR---ADAVKKVIFRTLKL----LFWGILL------QGGFS-HAPDEL 130
G A A A + R ADA + R+L+L L +L+ FS AP+
Sbjct: 60 GFATATAYASLAKRHTEADARNRFFKRSLELYRAFLITAVLMLLVSAVMSAFSIDAPNMA 119
Query: 131 TYGVDVRMIRLCGVLQRIAL 150
T +D M VL+ I L
Sbjct: 120 TTDLDDMMDTPTAVLRDILL 139
>gi|255036257|ref|YP_003086878.1| hypothetical protein Dfer_2495 [Dyadobacter fermentans DSM 18053]
gi|254949013|gb|ACT93713.1| conserved hypothetical protein [Dyadobacter fermentans DSM 18053]
Length = 379
Score = 39.7 bits (91), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 17/49 (34%), Positives = 28/49 (57%)
Query: 63 SHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRTLK 111
SH PW GC+L D + P F F+VGVA+ ++ + +V + T++
Sbjct: 32 SHVPWVGCSLHDLIQPSFSFLVGVALPYSMASRASKDQSVATMWAHTIR 80
>gi|189468533|ref|ZP_03017318.1| hypothetical protein BACINT_04936 [Bacteroides intestinalis DSM
17393]
gi|189436797|gb|EDV05782.1| hypothetical protein BACINT_04936 [Bacteroides intestinalis DSM
17393]
Length = 393
Score = 39.3 bits (90), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 28/96 (29%), Positives = 51/96 (53%), Gaps = 9/96 (9%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGG--DWPE-ISHAPWNG--CNLADFVMPFFLFIVGVA 87
QR+A++D+FR L + M+ V+ G + P + HA N +D + P FLF +G++
Sbjct: 7 QRVAAVDVFRALTMFFMLFVNDIPGLKNVPHWLMHAEMNEDMMGFSDTIFPAFLFCMGMS 66
Query: 88 IALALKRIPDRADAVKKVIF----RTLKLLFWGILL 119
I A++ + D ++I RT+ L+ G+ +
Sbjct: 67 IPFAIQNRVKKGDTALQIISHISERTVALIAMGLFM 102
>gi|256423178|ref|YP_003123831.1| hypothetical protein Cpin_4173 [Chitinophaga pinensis DSM 2588]
gi|256038086|gb|ACU61630.1| conserved hypothetical protein [Chitinophaga pinensis DSM 2588]
Length = 349
Score = 39.3 bits (90), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 28/100 (28%), Positives = 47/100 (47%), Gaps = 14/100 (14%)
Query: 31 KTQRLASLDIFRGLAVALMI----LVDHAGGDW------PEISHAPWNGCNLADFVMPFF 80
+ RL SLD+ RGL + L+ V + +W + H PW+G D V P F
Sbjct: 3 NSGRLLSLDVMRGLIMILLAGESCRVYESLHEWHDNAFIRQFFHHPWHGLRFWDLVQPAF 62
Query: 81 LFIVGVAIALA----LKRIPDRADAVKKVIFRTLKLLFWG 116
+ + G A+ ++ L++ + K ++ R+LKL G
Sbjct: 63 MLMAGTAMYISYQSKLRKGVSWSQNFKHILIRSLKLFLLG 102
>gi|377812665|ref|YP_005041914.1| hypothetical protein BYI23_B004200 [Burkholderia sp. YI23]
gi|357937469|gb|AET91027.1| hypothetical protein BYI23_B004200 [Burkholderia sp. YI23]
Length = 389
Score = 39.3 bits (90), Expect = 3.2, Method: Compositional matrix adjust.
Identities = 38/127 (29%), Positives = 60/127 (47%), Gaps = 18/127 (14%)
Query: 23 DQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLF 82
+ + K+QRL LD FRGL V L+I++DH GG +S + L D F+F
Sbjct: 6 NTPSRMQQKSQRLVELDFFRGL-VLLIIVIDHIGGSM--LSRFTLHSFALND-AAEVFVF 61
Query: 83 IVGVAIALALKRIPD-RADAVKKVIF--RTLKL-----------LFWGILLQGGFSHAPD 128
+ G A A A + + R+++ +V F R +L L +L+ F HAP+
Sbjct: 62 LGGFATATAYVSLAERRSESAARVRFLKRAFELYRAFVVTAVLMLVASFVLRPLFGHAPN 121
Query: 129 ELTYGVD 135
+ +D
Sbjct: 122 LALHDLD 128
>gi|416973167|ref|ZP_11937347.1| OpgC protein, partial [Burkholderia sp. TJI49]
gi|325520603|gb|EGC99672.1| OpgC protein [Burkholderia sp. TJI49]
Length = 141
Score = 39.3 bits (90), Expect = 3.5, Method: Compositional matrix adjust.
Identities = 30/95 (31%), Positives = 48/95 (50%), Gaps = 8/95 (8%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD--AVKKVIFRTLKLLFWGILLQGG 122
+A + +R D A ++ R ++ L+ G
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEIYRAFLVTAG 95
>gi|312796297|ref|YP_004029219.1| OpgC protein [Burkholderia rhizoxinica HKI 454]
gi|312168072|emb|CBW75075.1| OpgC [Burkholderia rhizoxinica HKI 454]
Length = 330
Score = 38.9 bits (89), Expect = 4.1, Method: Compositional matrix adjust.
Identities = 31/88 (35%), Positives = 44/88 (50%), Gaps = 9/88 (10%)
Query: 17 SEPDVSDQQEKSHL---KTQRLASLDIFRGLAVALMILVDHAGGDW-PEISHAPWNGCNL 72
S P + S+L T RLA LD FRGL V L+I+VDH GG ++ + C+
Sbjct: 3 SRPALPFHPSPSYLMQPSTARLAELDFFRGL-VLLIIVVDHIGGSMLSRVTLHTYALCDA 61
Query: 73 ADFVMPFFLFIVGVAIALALKRIPDRAD 100
A+ F+F+ G A A+ + R D
Sbjct: 62 AE----VFVFLGGYATAIGWTTLAARCD 85
>gi|407789242|ref|ZP_11136344.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
gi|407207220|gb|EKE77163.1| hypothetical protein B3C1_03120 [Gallaecimonas xiamenensis 3-C-1]
Length = 364
Score = 38.9 bits (89), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 14/49 (28%), Positives = 28/49 (57%)
Query: 61 EISHAPWNGCNLADFVMPFFLFIVGVAIALALKRIPDRADAVKKVIFRT 109
+++H+ W+G D + P F+F+ GV + LA KR ++ ++R+
Sbjct: 48 QMAHSDWHGLTAYDGIFPLFIFLSGVTLGLADKRASALGGGARRALYRS 96
>gi|406834557|ref|ZP_11094151.1| hypothetical protein SpalD1_23036 [Schlesneria paludicola DSM
18645]
Length = 534
Score = 38.5 bits (88), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%), Gaps = 6/67 (8%)
Query: 32 TQRLASLDIFRGLAVALMILVDHAGGDWPEISHAPW----NGCNLADFVMPFFLFIVGVA 87
T+RL SLD FRG VA MILV+ G + H+ + N + AD +MP F F VG +
Sbjct: 75 TERLVSLDQFRGYTVAGMILVNFIGSF--AVVHSIFKHNNNYFSYADSIMPGFHFAVGYS 132
Query: 88 IALALKR 94
L R
Sbjct: 133 YRLTFLR 139
>gi|395804714|ref|ZP_10483949.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
gi|395433102|gb|EJF99060.1| hypothetical protein FF52_22629 [Flavobacterium sp. F52]
Length = 380
Score = 38.5 bits (88), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 38/165 (23%), Positives = 65/165 (39%), Gaps = 45/165 (27%)
Query: 28 SHLKTQRLASLDIFRGLAVALMILVDH------AGGDWP-------EISHAPWNGCNLAD 74
S+ RL SLD RG + ++ +H P ++ HA WNG D
Sbjct: 2 SNPTNGRLISLDALRGFVMFWIMSGEHIIHALAKAAPIPIFLWMSSQLHHAEWNGITFYD 61
Query: 75 FVMPFFLFIVGVAIALALKRIPDRADAV---------KKVIF-----RTLKLLFWGILLQ 120
+ P FLF+ GV++ + ++ + A K+ I+ RT+ L+ G ++
Sbjct: 62 MIFPVFLFVAGVSMPYSFEKKMNLAGVSTPQELPSKEKRKIYLSMLRRTIILVVLGFVVN 121
Query: 121 G-----GFSHAPDELTYGVDVRMIRLCGVLQRIALSYLLVSLVEI 160
G GF H R VL RI +++ ++ +
Sbjct: 122 GLLRFDGFDHT-------------RFASVLGRIGIAWFFAGMIYL 153
>gi|333380436|ref|ZP_08472127.1| hypothetical protein HMPREF9455_00293 [Dysgonomonas gadei ATCC
BAA-286]
gi|332826431|gb|EGJ99260.1| hypothetical protein HMPREF9455_00293 [Dysgonomonas gadei ATCC
BAA-286]
Length = 469
Score = 38.5 bits (88), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 36/134 (26%), Positives = 61/134 (45%), Gaps = 25/134 (18%)
Query: 33 QRLASLDIFRGLAVALMIL------------VDHAGGDWPEISHAPWN-GCNLADFVMPF 79
R +LD RG A+ M+L + HA P P G D V PF
Sbjct: 3 DRSCALDALRGYAIVTMVLSGAVVYGVLPGWMYHAQVPPPTHVFNPAAPGITWVDLVFPF 62
Query: 80 FLFIVGVAIALALKRIPDRADAVKKVIF----RTLKLLFWGILLQGGF----SHAPDELT 131
FLF +G A ++++ +R ++ K+I+ R+++L F+ I ++ + S+ D
Sbjct: 63 FLFAMGSAFPFSIRKRLERGESKLKLIYDALKRSIQLTFFAIFIRHFYPYVLSNPEDARA 122
Query: 132 YGVDVRMIRLCGVL 145
+G+ + LC VL
Sbjct: 123 WGLSL----LCFVL 132
>gi|421867873|ref|ZP_16299526.1| OpgC protein [Burkholderia cenocepacia H111]
gi|358072286|emb|CCE50404.1| OpgC protein [Burkholderia cenocepacia H111]
Length = 382
Score = 38.5 bits (88), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 30/100 (30%), Positives = 49/100 (49%), Gaps = 9/100 (9%)
Query: 17 SEPDVSDQQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADF 75
+ P + + R A LD FRGL V L+I+VDH GG ++ + C+ A+
Sbjct: 9 TRPAPEGAPMTAPARAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE- 66
Query: 76 VMPFFLFIVGVAIALALKRIPDRAD---AVKKVIFRTLKL 112
F+F+ G A A+A + +R D A ++ I R ++
Sbjct: 67 ---VFVFLGGFATAIAYNSLAERHDEAAARQRFIRRAFEI 103
>gi|319900329|ref|YP_004160057.1| hypothetical protein Bache_0445 [Bacteroides helcogenes P 36-108]
gi|319415360|gb|ADV42471.1| hypothetical protein Bache_0445 [Bacteroides helcogenes P 36-108]
Length = 414
Score = 38.1 bits (87), Expect = 6.1, Method: Compositional matrix adjust.
Identities = 32/120 (26%), Positives = 52/120 (43%), Gaps = 9/120 (7%)
Query: 26 EKSHLKTQRLASLDIFRGLAVALMILVD-----HAGGDWPEISHAPWNGCNLADFVMPFF 80
S ++R ++D+ R L + MI V+ H W E + + LAD V P F
Sbjct: 8 NTSATYSRRNLAIDMLRALTMFTMIFVNDFWKVHDIPRWLEHAGYGEDFMGLADVVFPCF 67
Query: 81 LFIVGVAIALALKRIPDRADAVKK----VIFRTLKLLFWGILLQGGFSHAPDELTYGVDV 136
LF VG++I A++R + + + + RT LL G + E+ Y + V
Sbjct: 68 LFAVGMSIPYAIERRYAKGFSAESTLGHIFLRTFALLVMGAFITNSEYRLSPEVPYPIGV 127
>gi|373954327|ref|ZP_09614287.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
18603]
gi|373890927|gb|EHQ26824.1| hypothetical protein Mucpa_2712 [Mucilaginibacter paludis DSM
18603]
Length = 473
Score = 38.1 bits (87), Expect = 6.2, Method: Compositional matrix adjust.
Identities = 29/76 (38%), Positives = 34/76 (44%), Gaps = 14/76 (18%)
Query: 33 QRLASLDIFRGLAVALMIL-------------VDHAGGDWPEISHAP-WNGCNLADFVMP 78
QR SLD RG A+ LM+L + HA P P G D V P
Sbjct: 11 QRANSLDALRGTAILLMVLSGSIAFGGILPGWMYHAQVPPPAHQFKPDLPGITWVDLVFP 70
Query: 79 FFLFIVGVAIALALKR 94
FFLF +G AI LAL +
Sbjct: 71 FFLFAMGAAIPLALVK 86
>gi|402568588|ref|YP_006617932.1| hypothetical protein GEM_3848 [Burkholderia cepacia GG4]
gi|402249785|gb|AFQ50238.1| hypothetical protein GEM_3848 [Burkholderia cepacia GG4]
Length = 365
Score = 38.1 bits (87), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|358344082|ref|XP_003636122.1| Magnesium transporter NIPA2 [Medicago truncatula]
gi|355502057|gb|AES83260.1| Magnesium transporter NIPA2 [Medicago truncatula]
Length = 328
Score = 38.1 bits (87), Expect = 6.9, Method: Compositional matrix adjust.
Identities = 16/32 (50%), Positives = 23/32 (71%)
Query: 116 GILLQGGFSHAPDELTYGVDVRMIRLCGVLQR 147
G + GG+ H +LT+GVD++ IRL G+LQR
Sbjct: 105 GGVFTGGYVHRVSDLTFGVDLKQIRLMGILQR 136
>gi|115358787|ref|YP_775925.1| hypothetical protein Bamb_4038 [Burkholderia ambifaria AMMD]
gi|115284075|gb|ABI89591.1| conserved hypothetical protein [Burkholderia ambifaria AMMD]
Length = 384
Score = 38.1 bits (87), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 23 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 77
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 78 IAYNSLAERHDEAAARQRFIRRAFEI 103
>gi|170703183|ref|ZP_02893992.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
gi|170131915|gb|EDT00434.1| conserved hypothetical protein [Burkholderia ambifaria IOP40-10]
Length = 367
Score = 38.1 bits (87), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|134293811|ref|YP_001117547.1| hypothetical protein Bcep1808_5131 [Burkholderia vietnamiensis G4]
gi|387904835|ref|YP_006335173.1| OpgC protein [Burkholderia sp. KJ006]
gi|134136968|gb|ABO58082.1| conserved hypothetical protein [Burkholderia vietnamiensis G4]
gi|387579727|gb|AFJ88442.1| OpgC protein [Burkholderia sp. KJ006]
Length = 367
Score = 38.1 bits (87), Expect = 7.4, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|172063517|ref|YP_001811168.1| hypothetical protein BamMC406_4496 [Burkholderia ambifaria MC40-6]
gi|171996034|gb|ACB66952.1| conserved hypothetical protein [Burkholderia ambifaria MC40-6]
Length = 367
Score = 38.1 bits (87), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|171315634|ref|ZP_02904868.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5]
gi|171099166|gb|EDT43939.1| conserved hypothetical protein [Burkholderia ambifaria MEX-5]
Length = 367
Score = 38.1 bits (87), Expect = 7.8, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|196234160|ref|ZP_03132993.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
gi|196221811|gb|EDY16348.1| conserved hypothetical protein [Chthoniobacter flavus Ellin428]
Length = 417
Score = 37.7 bits (86), Expect = 7.9, Method: Compositional matrix adjust.
Identities = 41/146 (28%), Positives = 62/146 (42%), Gaps = 20/146 (13%)
Query: 34 RLASLDIFRGLAVALMIL-------VDHAGGD---W----PEISHAPWNGCNLADFVMPF 79
RL S+D +RGL + L++ V A D W + H W G L D + P
Sbjct: 32 RLGSIDAYRGLVMFLLLAEQFRTASVAKALPDSSFWRFLATQQEHVTWTGAVLHDMIQPS 91
Query: 80 FLFIVGVAIALALKRIPDRADAVKKV----IFRTLKLLFWGILLQGGFSHAPDELTYGVD 135
F F+VGVA+ ++ R + + R L L+ GI L+ H+ T+
Sbjct: 92 FSFLVGVALPFSIGNRRARGQSPEATTGHAFLRALILVLLGIFLRST-GHSQTNFTFEDT 150
Query: 136 VRMIRLC-GVLQRIALSYLLVSLVEI 160
+ I L G L IAL + V + +
Sbjct: 151 LTQIGLGYGFLYLIALRSVRVQWIAL 176
>gi|206563639|ref|YP_002234402.1| hypothetical protein BCAM1788 [Burkholderia cenocepacia J2315]
gi|444363993|ref|ZP_21164352.1| OpgC protein [Burkholderia cenocepacia BC7]
gi|444373170|ref|ZP_21172575.1| OpgC protein [Burkholderia cenocepacia K56-2Valvano]
gi|198039679|emb|CAR55648.1| putative membrane protein [Burkholderia cenocepacia J2315]
gi|443592216|gb|ELT61038.1| OpgC protein [Burkholderia cenocepacia K56-2Valvano]
gi|443593862|gb|ELT62568.1| OpgC protein [Burkholderia cenocepacia BC7]
Length = 365
Score = 37.7 bits (86), Expect = 8.3, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|421468975|ref|ZP_15917475.1| OpgC protein [Burkholderia multivorans ATCC BAA-247]
gi|400230841|gb|EJO60585.1| OpgC protein [Burkholderia multivorans ATCC BAA-247]
Length = 365
Score = 37.7 bits (86), Expect = 8.5, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDESAARQRFIRRAFEI 86
>gi|161520584|ref|YP_001584011.1| hypothetical protein Bmul_4038 [Burkholderia multivorans ATCC
17616]
gi|189353228|ref|YP_001948855.1| OpgC protein [Burkholderia multivorans ATCC 17616]
gi|221209834|ref|ZP_03582815.1| putative membrane protein [Burkholderia multivorans CGD1]
gi|421474568|ref|ZP_15922594.1| OpgC protein [Burkholderia multivorans CF2]
gi|160344634|gb|ABX17719.1| conserved hypothetical protein [Burkholderia multivorans ATCC
17616]
gi|189337250|dbj|BAG46319.1| OpgC protein [Burkholderia multivorans ATCC 17616]
gi|221170522|gb|EEE02988.1| putative membrane protein [Burkholderia multivorans CGD1]
gi|400231859|gb|EJO61520.1| OpgC protein [Burkholderia multivorans CF2]
Length = 365
Score = 37.7 bits (86), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDESAARQRFIRRAFEI 86
>gi|78062083|ref|YP_371991.1| hypothetical protein Bcep18194_B1233 [Burkholderia sp. 383]
gi|77969968|gb|ABB11347.1| conserved hypothetical protein [Burkholderia sp. 383]
Length = 365
Score = 37.7 bits (86), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|221196140|ref|ZP_03569187.1| putative membrane protein [Burkholderia multivorans CGD2M]
gi|221202813|ref|ZP_03575832.1| putative membrane protein [Burkholderia multivorans CGD2]
gi|221176747|gb|EEE09175.1| putative membrane protein [Burkholderia multivorans CGD2]
gi|221182694|gb|EEE15094.1| putative membrane protein [Burkholderia multivorans CGD2M]
Length = 365
Score = 37.7 bits (86), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDESAARQRFIRRAFEI 86
>gi|170738048|ref|YP_001779308.1| hypothetical protein Bcenmc03_5696 [Burkholderia cenocepacia MC0-3]
gi|254248202|ref|ZP_04941522.1| hypothetical protein BCPG_03029 [Burkholderia cenocepacia PC184]
gi|124874703|gb|EAY64693.1| hypothetical protein BCPG_03029 [Burkholderia cenocepacia PC184]
gi|169820236|gb|ACA94818.1| conserved hypothetical protein [Burkholderia cenocepacia MC0-3]
Length = 365
Score = 37.7 bits (86), Expect = 8.7, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|107026114|ref|YP_623625.1| hypothetical protein Bcen_3760 [Burkholderia cenocepacia AU 1054]
gi|116692702|ref|YP_838235.1| hypothetical protein Bcen2424_4608 [Burkholderia cenocepacia
HI2424]
gi|105895488|gb|ABF78652.1| conserved hypothetical protein [Burkholderia cenocepacia AU 1054]
gi|116650702|gb|ABK11342.1| conserved hypothetical protein [Burkholderia cenocepacia HI2424]
Length = 365
Score = 37.7 bits (86), Expect = 8.8, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
+ R A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 RAGRYAELDFFRGL-VLLVIVVDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|83716709|ref|YP_439651.1| opgC protein [Burkholderia thailandensis E264]
gi|83650534|gb|ABC34598.1| opgC protein, putative [Burkholderia thailandensis E264]
Length = 431
Score = 37.7 bits (86), Expect = 9.0, Method: Compositional matrix adjust.
Identities = 30/93 (32%), Positives = 48/93 (51%), Gaps = 9/93 (9%)
Query: 24 QQEKSHLKTQRLASLDIFRGLAVALMILVDHAGGDW-PEISHAPWNGCNLADFVMPFFLF 82
++ + QR A LD FRGL V L+I+VDH GG ++ + C+ A+ F+F
Sbjct: 45 RRRMNAAPAQRYAELDFFRGL-VLLVIVVDHIGGSMLSRVTLHAYALCDAAE----VFVF 99
Query: 83 IVGVAIALALKRIPDR---ADAVKKVIFRTLKL 112
+ G A A+A + R A A ++ I R ++
Sbjct: 100 LGGFATAIAYNSLAARHTEAAARQRFIKRAFEI 132
>gi|167584370|ref|ZP_02376758.1| hypothetical protein BuboB_03474 [Burkholderia ubonensis Bu]
Length = 365
Score = 37.7 bits (86), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 29/86 (33%), Positives = 46/86 (53%), Gaps = 9/86 (10%)
Query: 31 KTQRLASLDIFRGLAVALMILVDHAGGD-WPEISHAPWNGCNLADFVMPFFLFIVGVAIA 89
K R A LD FRGL V L+I++DH GG ++ + C+ A+ F+F+ G A A
Sbjct: 6 KPGRYAELDFFRGL-VLLVIVIDHIGGSILSRVTLHAYALCDAAE----VFVFLGGFATA 60
Query: 90 LALKRIPDRAD---AVKKVIFRTLKL 112
+A + +R D A ++ I R ++
Sbjct: 61 IAYNSLAERHDEAAARQRFIRRAFEI 86
>gi|390575261|ref|ZP_10255366.1| hypothetical protein WQE_42474 [Burkholderia terrae BS001]
gi|389932764|gb|EIM94787.1| hypothetical protein WQE_42474 [Burkholderia terrae BS001]
Length = 367
Score = 37.7 bits (86), Expect = 9.8, Method: Compositional matrix adjust.
Identities = 45/132 (34%), Positives = 60/132 (45%), Gaps = 18/132 (13%)
Query: 33 QRLASLDIFRGLAVALMILVDHAGGDWPEISHAPWNGCNLADFVMPFFLFIVGVAIALAL 92
RL LD FRGL V + I+VDH GG +S A + L D F+F+ G A A A
Sbjct: 6 NRLIELDFFRGL-VLIFIVVDHIGGS--ILSRATLHAYALCD-AAEVFVFLGGFATATAY 61
Query: 93 KRIPDR---ADAVKKVIFRTLKL----LFWGILL------QGGFS-HAPDELTYGVDVRM 138
+ R ADA + R+L+L L +L+ FS AP+ T +D M
Sbjct: 62 ASLAKRHTEADARNRFFKRSLELYRAFLITAVLMLLVSAVMSAFSIDAPNMATTDLDDMM 121
Query: 139 IRLCGVLQRIAL 150
VL+ I L
Sbjct: 122 DTPTAVLRDILL 133
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.328 0.142 0.463
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 5,883,638,159
Number of Sequences: 23463169
Number of extensions: 239187238
Number of successful extensions: 700432
Number of sequences better than 100.0: 927
Number of HSP's better than 100.0 without gapping: 605
Number of HSP's successfully gapped in prelim test: 322
Number of HSP's that attempted gapping in prelim test: 697568
Number of HSP's gapped (non-prelim): 1560
length of query: 373
length of database: 8,064,228,071
effective HSP length: 144
effective length of query: 229
effective length of database: 8,980,499,031
effective search space: 2056534278099
effective search space used: 2056534278099
T: 11
A: 40
X1: 15 ( 7.1 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 40 (21.7 bits)
S2: 77 (34.3 bits)