BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= 012283
(467 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|255543315|ref|XP_002512720.1| conserved hypothetical protein [Ricinus communis]
gi|223547731|gb|EEF49223.1| conserved hypothetical protein [Ricinus communis]
Length = 466
Score = 737 bits (1903), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 373/469 (79%), Positives = 412/469 (87%), Gaps = 5/469 (1%)
Query: 1 MATTSLLPKSTSPFLSNPLSSLPSTHFIKPSPLFNP--PDNHSLLITTKRLSFQTHAKKK 58
MA+TSLLPK T PFL+NP SSL STHF KPS NP PD+HS + HA KK
Sbjct: 1 MASTSLLPKIT-PFLTNP-SSLSSTHFTKPS-FLNPLSPDHHSACARKPTTTTVLHANKK 57
Query: 59 NPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKI 118
NPWLDPFDDGEDP+M+YGSLFADGKQ+EDPRPPD+PNNPYGFLKFP G++ EIASLPLKI
Sbjct: 58 NPWLDPFDDGEDPEMEYGSLFADGKQEEDPRPPDNPNNPYGFLKFPKGYSVEIASLPLKI 117
Query: 119 RGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANV 178
RGDVRRCCC+ISGGVYENLLFFPAIQL+KDRYPGV IDV+AS RGKQT+ELNKNVRWAN
Sbjct: 118 RGDVRRCCCVISGGVYENLLFFPAIQLIKDRYPGVQIDVLASPRGKQTYELNKNVRWAND 177
Query: 179 YDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
YD DDDWPEPAEY D+LG++KNRYYDMVLSTKLAGLGHA+FLFMT+ARDRVSYI+PNVNA
Sbjct: 178 YDPDDDWPEPAEYVDMLGILKNRYYDMVLSTKLAGLGHASFLFMTSARDRVSYIHPNVNA 237
Query: 239 AGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
AGAGLLLSETFT + NL+E GY+MY QM+DWLGRPFRSVPRH VPPLRVSISR+LKE V
Sbjct: 238 AGAGLLLSETFTPDITNLAEGGYHMYHQMLDWLGRPFRSVPRHTVPPLRVSISRKLKEFV 297
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIP 358
KY+ AGAE+GKYIVIHGIESDSKASMQSRGDTDSLLP+QVWAEIA +R F+P+FVIP
Sbjct: 298 EAKYRAAGAEKGKYIVIHGIESDSKASMQSRGDTDSLLPLQVWAEIAKDIRAFKPVFVIP 357
Query: 359 HEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
HEKERE VE++VGDD SIV ITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF
Sbjct: 358 HEKERENVEEIVGDDTSIVMITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFG 417
Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
S KGRLF+PNAEEKKC+++SS+TGKL D V V AMQI + SLALA
Sbjct: 418 SIEKGRLFIPNAEEKKCSIVSSKTGKLKDIDVGDVKQAMQILDLSLALA 466
>gi|341833960|gb|AEK94315.1| hypothetical protein [Pyrus x bretschneideri]
Length = 463
Score = 721 bits (1861), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 345/464 (74%), Positives = 395/464 (85%), Gaps = 5/464 (1%)
Query: 3 TTSLLPKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKNPWL 62
TTSLLPKS PFL+N S PST F K S FNP D+ TKR + QT+AKKKNPWL
Sbjct: 4 TTSLLPKSFPPFLTN-APSFPSTLFTKLSS-FNPQDHPH---CTKRPNLQTNAKKKNPWL 58
Query: 63 DPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDV 122
DPFDDGEDPDM+YGSLFADGKQDED RPP++P+NPYGFLKFPMG++ E+ASL LK+RGDV
Sbjct: 59 DPFDDGEDPDMEYGSLFADGKQDEDDRPPENPDNPYGFLKFPMGYSVEVASLALKVRGDV 118
Query: 123 RRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD 182
RRCCC++SGGVYENLLFFP IQLLKDRYPGV +D +ASARGKQ +ELNKNVRWA+VYD D
Sbjct: 119 RRCCCVVSGGVYENLLFFPVIQLLKDRYPGVQVDAVASARGKQAYELNKNVRWADVYDPD 178
Query: 183 DDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
D +PEPAEYTD++G++K+RYYDM+LSTKLAGLGH FLFMTTAR+RVSYIYPNVNAAGAG
Sbjct: 179 DHFPEPAEYTDMVGLLKSRYYDMILSTKLAGLGHGIFLFMTTARERVSYIYPNVNAAGAG 238
Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
L LSETF MNL+E GYNMY QMVDWLGRP R VPR +PPL++SIS++LK+ V KY
Sbjct: 239 LFLSETFRPNGMNLAEGGYNMYHQMVDWLGRPVRDVPRQALPPLKISISKKLKQTVESKY 298
Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKE 362
K AG ++GKYIVIHG+ESDSKASMQS+GD+DSLLPI+ WA I G+REFRP+FVIPHEKE
Sbjct: 299 KKAGVDKGKYIVIHGLESDSKASMQSKGDSDSLLPIETWAAIVRGIREFRPVFVIPHEKE 358
Query: 363 REGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELK 422
RE VE++VG+D SIVFITTPGQLAALINDSAGVI TNTAAIQLANAR+KP IALF S K
Sbjct: 359 RENVEEIVGEDTSIVFITTPGQLAALINDSAGVITTNTAAIQLANARQKPCIALFGSAEK 418
Query: 423 GRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLAL 466
G+LFVPNAEEK C ++SS+T L D EAV NA+ IFN SLAL
Sbjct: 419 GKLFVPNAEEKNCVIVSSKTKMLKDIDTEAVKNALPIFNVSLAL 462
>gi|357467927|ref|XP_003604248.1| hypothetical protein MTR_4g007080 [Medicago truncatula]
gi|355505303|gb|AES86445.1| hypothetical protein MTR_4g007080 [Medicago truncatula]
Length = 472
Score = 709 bits (1831), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 338/463 (73%), Positives = 392/463 (84%), Gaps = 13/463 (2%)
Query: 15 LSNPLSSLPSTHFIKPSPLFNPPDNHSLLIT----------TKRLSFQTHAKKKNPWLDP 64
+S P S + F PS P N SLL + T+RL+ + +AKK NPWLDP
Sbjct: 13 ISKPFSHFTTNTFSIPSTY---PTNVSLLTSLDHPSHCYNSTRRLNLRVNAKKNNPWLDP 69
Query: 65 FDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRR 124
FDDGEDP+++YGSLF+DGKQDEDPRPPD+P NPYGFLKFP G+ EIA L LK+RGDVRR
Sbjct: 70 FDDGEDPNIEYGSLFSDGKQDEDPRPPDNPKNPYGFLKFPAGYMVEIAPLGLKVRGDVRR 129
Query: 125 CCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDD 184
CCC++SGGVYENLLFFP IQL+KDRYPGV +DV+ S RGKQ +ELNKNVRWANV+D DD+
Sbjct: 130 CCCVVSGGVYENLLFFPMIQLIKDRYPGVQVDVVGSERGKQCYELNKNVRWANVFDPDDE 189
Query: 185 WPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLL 244
+PEPAEYTD +GV+K+RYYDMVLSTKLAG+GHAA+LFMTTARDRVSY+YPNVN AGAGL
Sbjct: 190 FPEPAEYTDFVGVLKSRYYDMVLSTKLAGVGHAAYLFMTTARDRVSYVYPNVNGAGAGLF 249
Query: 245 LSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKN 304
LSETFT +S NLSE GY+MY QM DWLGRPFRSVPR VPPL++S+SR+LKEVV EKY
Sbjct: 250 LSETFTPDSTNLSEGGYHMYHQMEDWLGRPFRSVPRQVVPPLKISLSRKLKEVVEEKYTK 309
Query: 305 AGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKERE 364
AG ++G+YIVIHGI+SDSKA+MQSRGD DSLLP++VWAEIA+ +REF PLFVIPHEKERE
Sbjct: 310 AGVKKGRYIVIHGIQSDSKATMQSRGDPDSLLPLEVWAEIADAVREFTPLFVIPHEKERE 369
Query: 365 GVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGR 424
VE++VG+D+SI+FITTPGQLAALINDSAGVIATNTAAIQLANAREKPS+ALFSS+ KG
Sbjct: 370 NVEEIVGEDSSIIFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSVALFSSKEKGN 429
Query: 425 LFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
FVP AEEKKC VISS+TGKLI+ VEAV NA+Q FN SLA A
Sbjct: 430 KFVPRAEEKKCIVISSKTGKLINIDVEAVTNAIQTFNVSLAFA 472
>gi|224115860|ref|XP_002332075.1| predicted protein [Populus trichocarpa]
gi|222831961|gb|EEE70438.1| predicted protein [Populus trichocarpa]
Length = 465
Score = 708 bits (1827), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/468 (73%), Positives = 397/468 (84%), Gaps = 4/468 (0%)
Query: 1 MATTSLLPKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKNP 60
MA+TSLLPK+ SPFL+NP LPSTHF FNP +H L T K +AKKKNP
Sbjct: 1 MASTSLLPKTISPFLTNP-PPLPSTHFTSKPSFFNPSADHHLPCTRKPSILTPYAKKKNP 59
Query: 61 WLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRG 120
W+D FDDGED DM+YGSLF DGKQDED RP D+PNNPYGFLKFP G+N E+A L LKIRG
Sbjct: 60 WIDIFDDGEDLDMEYGSLFVDGKQDEDTRPVDNPNNPYGFLKFPKGYNVEVAQLGLKIRG 119
Query: 121 DVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYD 180
DVRRCCC+ISGGVYENLLFFP IQ+LKDRYPG+L+DV+AS RGKQ +ELNKNVRWANVYD
Sbjct: 120 DVRRCCCMISGGVYENLLFFPVIQMLKDRYPGILVDVLASDRGKQCYELNKNVRWANVYD 179
Query: 181 LDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
D D PEPA YTD++GV+KNRYYD++LSTKLAGLGHA+F+FM++ARD+VSYIYPNVNAAG
Sbjct: 180 PDGD-PEPAIYTDMIGVLKNRYYDLILSTKLAGLGHASFMFMSSARDKVSYIYPNVNAAG 238
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
AGLLL+ETFT +S NLSE GY+MY QM+DWLGRP +VPR PVPPLRVS+SR+LK+ V
Sbjct: 239 AGLLLTETFTPDSANLSEGGYHMYHQMLDWLGRPIYNVPRQPVPPLRVSLSRKLKQYVEA 298
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHE 360
KY+ AGAE+GKYIVIHGIESDSKASMQSRGDTDSLLP++VW +IA+ + F+P+FVIPHE
Sbjct: 299 KYRAAGAEKGKYIVIHGIESDSKASMQSRGDTDSLLPLEVWDQIADAISGFKPVFVIPHE 358
Query: 361 KEREGVEDVV-GDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSS 419
KERE VE+++ +D I+FITTPGQ A LINDSAGVIATNTAAIQLANAREKPSIALF S
Sbjct: 359 KERENVEEIIYNEDIGILFITTPGQ-ATLINDSAGVIATNTAAIQLANAREKPSIALFGS 417
Query: 420 ELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
E KG++FVPNAEEKKC ++SS+TGKL D V AV AMQIF+ SLAL
Sbjct: 418 EEKGKVFVPNAEEKKCIIVSSKTGKLKDIDVGAVKQAMQIFDMSLALV 465
>gi|363808214|ref|NP_001242232.1| uncharacterized protein LOC100802136 [Glycine max]
gi|255635425|gb|ACU18065.1| unknown [Glycine max]
Length = 471
Score = 694 bits (1792), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 355/471 (75%), Positives = 399/471 (84%), Gaps = 6/471 (1%)
Query: 1 MATTSLL----PKSTSPFLSNPLSSLPSTHFIKP-SPLFNPPDNHSLLITTKRLSFQTHA 55
MA T L PK+ SPFL NP S+PS H + L + P + S + +A
Sbjct: 1 MAATYYLLPTSPKTFSPFLRNP-PSIPSGHHVSLLGSLLDYPFHCSSRRGNNSVEANCNA 59
Query: 56 KKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLP 115
KKKNPWLDPFDDGEDP+M+YGSLFADGKQ+EDPRPPDDP+NPYGFLKFP G++ EIASL
Sbjct: 60 KKKNPWLDPFDDGEDPEMEYGSLFADGKQEEDPRPPDDPDNPYGFLKFPSGYSVEIASLA 119
Query: 116 LKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRW 175
LK+RGDV RCCC+ISGGVYENLLFFPAIQL+KDRYPGV IDV+AS RGKQT+ELNKNVRW
Sbjct: 120 LKVRGDVGRCCCVISGGVYENLLFFPAIQLIKDRYPGVQIDVVASERGKQTYELNKNVRW 179
Query: 176 ANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPN 235
AN YD DD++PEPAEYTD++GV+KNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPN
Sbjct: 180 ANAYDPDDEFPEPAEYTDMVGVLKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPN 239
Query: 236 VNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLK 295
VNAAGAGLLLSETF +S NLS+ GYNMY QMVDWLG+PFR VPR PVPPLRVSIS++LK
Sbjct: 240 VNAAGAGLLLSETFVPDSQNLSDGGYNMYHQMVDWLGKPFREVPRQPVPPLRVSISKKLK 299
Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF 355
EVV KY+ AGA++GKY+VIHGI+SDSKASMQSRGD DSLLPI+VWAEIA+ +R+ PLF
Sbjct: 300 EVVEAKYEKAGAKKGKYVVIHGIKSDSKASMQSRGDPDSLLPIEVWAEIADVIRDVTPLF 359
Query: 356 VIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
VIPHEKERE VE++ +DASIVFITTPGQLAALINDSAGVIATNT A+QLANAREKP IA
Sbjct: 360 VIPHEKERENVEEIFSEDASIVFITTPGQLAALINDSAGVIATNTTAVQLANAREKPCIA 419
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLAL 466
LF SE KG FVP AEEKKC +ISS+TGKLID VEAV NA+Q FN S AL
Sbjct: 420 LFCSEEKGNKFVPRAEEKKCIIISSKTGKLIDIDVEAVKNAVQTFNLSPAL 470
>gi|356523261|ref|XP_003530259.1| PREDICTED: uncharacterized protein LOC100803710 [Glycine max]
Length = 467
Score = 694 bits (1790), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 354/469 (75%), Positives = 397/469 (84%), Gaps = 6/469 (1%)
Query: 1 MATTSLL---PKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKK 57
MA T L PK+ SP L NPLS +PS H L P H + +AKK
Sbjct: 1 MAATYYLLPSPKTFSPLLRNPLS-IPSGHVSLLGSLDYP--FHCSSRRGNSVEANCNAKK 57
Query: 58 KNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLK 117
KNPWLDPFDDGEDP+M+YGSLFADGKQ+EDPRPPDDP+NPYGFLKFP G+ EIASL LK
Sbjct: 58 KNPWLDPFDDGEDPEMEYGSLFADGKQEEDPRPPDDPDNPYGFLKFPSGYAVEIASLALK 117
Query: 118 IRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWAN 177
+RGDVRRCCC+ISGGVYENLLFFPAIQL+KDRYPGV IDV+AS RGKQT+ELNKNVRWAN
Sbjct: 118 VRGDVRRCCCVISGGVYENLLFFPAIQLIKDRYPGVQIDVVASERGKQTYELNKNVRWAN 177
Query: 178 VYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN 237
YD DD++PEPAEYTD++GV+KNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN
Sbjct: 178 AYDPDDEFPEPAEYTDMVGVLKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN 237
Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
AAGAGLLLS+TF +S+NLS+ GYNMY QMVDWLGRPFR VPR PVPPLRVSISR+LKEV
Sbjct: 238 AAGAGLLLSDTFVPDSLNLSDGGYNMYHQMVDWLGRPFREVPRQPVPPLRVSISRKLKEV 297
Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI 357
V KY+ AGA++GKY+VIHGI+SDSKASMQSRGD DSLLPI+VWAEIA+ +R+ PLFVI
Sbjct: 298 VEAKYEKAGAKKGKYVVIHGIKSDSKASMQSRGDPDSLLPIEVWAEIADAIRDVTPLFVI 357
Query: 358 PHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
PHEKERE VE++ +DA +VFITTPGQLAALINDSAGVIATNTAA+QLANAR+KP IALF
Sbjct: 358 PHEKERENVEEIFSEDAFVVFITTPGQLAALINDSAGVIATNTAAVQLANARQKPCIALF 417
Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLAL 466
SE KG FVP A+EKKC +ISS+TGKLID VEAV NA+Q FN S AL
Sbjct: 418 CSEEKGNKFVPQAKEKKCIIISSKTGKLIDIDVEAVKNAVQTFNLSPAL 466
>gi|359483160|ref|XP_002264393.2| PREDICTED: uncharacterized protein LOC100241596 [Vitis vinifera]
gi|298204672|emb|CBI25170.3| unnamed protein product [Vitis vinifera]
Length = 466
Score = 687 bits (1774), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 358/475 (75%), Positives = 404/475 (85%), Gaps = 17/475 (3%)
Query: 1 MATTSLLPKSTSPFLSNPLSSLP-----STHFIKPSPLFNPPDNHSLLITTK---RLSFQ 52
MAT LP S SPFL+NP SSLP S+ F P PL PP TTK R+
Sbjct: 1 MATPYFLPNSISPFLTNPSSSLPIHFTKSSFFTSPQPL--PP-------TTKPGCRIQLN 51
Query: 53 THAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIA 112
KKKNPWLDPFDDGEDP+M+YGSLFADGKQ+EDPRPPD+P+NPYGFLKFPMG++ EIA
Sbjct: 52 ATPKKKNPWLDPFDDGEDPNMEYGSLFADGKQEEDPRPPDNPDNPYGFLKFPMGYSVEIA 111
Query: 113 SLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKN 172
SL LK+RGDVRRCCC+ISGGVYENLLFFPAIQL+KDRYPGV +D++AS RGKQT+ELNKN
Sbjct: 112 SLALKVRGDVRRCCCVISGGVYENLLFFPAIQLIKDRYPGVQVDIVASPRGKQTYELNKN 171
Query: 173 VRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYI 232
VRWAN YD D D+PEPAEYTD++G++K RYYDM+LSTKLAGLGHAAFLFM+TARDRVSYI
Sbjct: 172 VRWANAYDPDVDFPEPAEYTDMIGILKARYYDMILSTKLAGLGHAAFLFMSTARDRVSYI 231
Query: 233 YPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISR 292
YPNVNAAGAGLLLS+TFT + MNLSE GYNMY QMV+WLGRP R+VPRH VPPL+VSISR
Sbjct: 232 YPNVNAAGAGLLLSQTFTPDGMNLSEAGYNMYHQMVEWLGRPARNVPRHHVPPLKVSISR 291
Query: 293 RLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR 352
+LKEVV KYK AGAE+GKYIVIHGIE DSKASMQS+GDTDSLLPIQ WAEIA+ +R R
Sbjct: 292 KLKEVVEGKYKAAGAEKGKYIVIHGIECDSKASMQSKGDTDSLLPIQKWAEIADTIRGIR 351
Query: 353 PLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKP 412
P+FVIPHEK RE VE+ VG+DASIVFITTPGQLAALINDS GVI TNTAA+QLA AREKP
Sbjct: 352 PIFVIPHEKVREEVEEEVGEDASIVFITTPGQLAALINDSLGVITTNTAAVQLATAREKP 411
Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
IALFSS+ KG LFVPNA+EKKC ++SS+TGKL+D VEAV A +IF+ SLALA
Sbjct: 412 CIALFSSQQKGELFVPNAQEKKCLIVSSKTGKLVDINVEAVKKATEIFDVSLALA 466
>gi|388501644|gb|AFK38888.1| unknown [Medicago truncatula]
Length = 460
Score = 675 bits (1741), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 329/463 (71%), Positives = 381/463 (82%), Gaps = 25/463 (5%)
Query: 15 LSNPLSSLPSTHFIKPSPLFNPPDNHSLLIT----------TKRLSFQTHAKKKNPWLDP 64
+S P S + F PS P N SLL + T+RL+ + +AKK NPWLDP
Sbjct: 13 ISKPFSHFTTNTFSIPSTY---PTNVSLLTSLDHPSHCYNSTRRLNLRVNAKKNNPWLDP 69
Query: 65 FDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRR 124
FDDGEDP+++YGSLF+DGKQDEDPRPPD+P NPYGFLKFP G+ EIA L LK+RG
Sbjct: 70 FDDGEDPNIEYGSLFSDGKQDEDPRPPDNPKNPYGFLKFPAGYMVEIAPLGLKVRG---- 125
Query: 125 CCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDD 184
VYENLLFFP IQL+KDRYPGV +DV+ S RGKQ +ELNKNVRWANV+D DD+
Sbjct: 126 --------VYENLLFFPMIQLIKDRYPGVQVDVVGSERGKQCYELNKNVRWANVFDPDDE 177
Query: 185 WPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLL 244
+PEPAEYTD +GV+K+RYYDMVLSTKLAG+GHAA+LFMTTARDRVSY+YPNVN AGAGL
Sbjct: 178 FPEPAEYTDFVGVLKSRYYDMVLSTKLAGVGHAAYLFMTTARDRVSYVYPNVNGAGAGLF 237
Query: 245 LSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKN 304
LSETFT +S NLSE GY+MY QM DWLGRPFRSVPR VPPL++S+SR+LKEVV EKY
Sbjct: 238 LSETFTPDSTNLSEGGYHMYHQMEDWLGRPFRSVPRQVVPPLKISLSRKLKEVVEEKYTK 297
Query: 305 AGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKERE 364
AG ++G+YIVIHGI+SDSKA+MQSRGD DSLLP++VWAEIA+ +REF PLFVIPHEKERE
Sbjct: 298 AGVKKGRYIVIHGIQSDSKATMQSRGDPDSLLPLEVWAEIADAVREFTPLFVIPHEKERE 357
Query: 365 GVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGR 424
VE++VG+D+SI+FITTPGQLAALINDSAGVIATNTAAIQLANAREKPS+ALFSS+ KG
Sbjct: 358 NVEEIVGEDSSIIFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSVALFSSKEKGN 417
Query: 425 LFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
FVP AEEKKC VISS+TGKLI+ VEAV NA+QIFN SLA A
Sbjct: 418 KFVPRAEEKKCIVISSKTGKLINIDVEAVTNAIQIFNVSLAFA 460
>gi|297844518|ref|XP_002890140.1| hypothetical protein ARALYDRAFT_471797 [Arabidopsis lyrata subsp.
lyrata]
gi|297335982|gb|EFH66399.1| hypothetical protein ARALYDRAFT_471797 [Arabidopsis lyrata subsp.
lyrata]
Length = 460
Score = 674 bits (1739), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 343/470 (72%), Positives = 390/470 (82%), Gaps = 13/470 (2%)
Query: 1 MATTSLLPKSTSPFLSNP-LSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKN 59
MA+ LLPK SPF P S+ F+ N +T++ + KKKN
Sbjct: 1 MASLPLLPKPISPFFKTPPFSTSKQLVFL----------NFQTRLTSRSSDVSVNLKKKN 50
Query: 60 -PWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKI 118
PWLDPFD GEDP+ +YGSLF DGKQDEDPRPPD+P+NPYGFLKFP G+ E+ASLPLKI
Sbjct: 51 NPWLDPFDSGEDPENEYGSLFVDGKQDEDPRPPDNPDNPYGFLKFPKGYTVELASLPLKI 110
Query: 119 RGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANV 178
RGDVRRCCC+ISGGVYENLLFFP IQL+KDRYPGV +D++ + RGKQT+ELNKNVRWANV
Sbjct: 111 RGDVRRCCCVISGGVYENLLFFPTIQLIKDRYPGVQVDILTTDRGKQTYELNKNVRWANV 170
Query: 179 YDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
YD DD WPEPAEYTD++G++K RYYDM+LSTKLAGLGHAAFLFMTTARDRVSYIYPNVN+
Sbjct: 171 YDPDDHWPEPAEYTDMIGILKGRYYDMILSTKLAGLGHAAFLFMTTARDRVSYIYPNVNS 230
Query: 239 AGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
AGAGL+LSETFTAE+ NLSE GY+MY+QM DWLGRPFRSVPR PV PLRVSISR+LK VV
Sbjct: 231 AGAGLMLSETFTAENANLSELGYSMYDQMEDWLGRPFRSVPRTPVLPLRVSISRKLKVVV 290
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIP 358
AEKY+NAGA GK+IVIHGIESDSKASMQS+GD+DSLL ++ WA+I G+R F+P+FVIP
Sbjct: 291 AEKYRNAGAATGKFIVIHGIESDSKASMQSKGDSDSLLSLEKWAKIIKGIRGFKPVFVIP 350
Query: 359 HEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
HEKERE VED VGDD SIVFITTPGQLAALINDSAGVIATNTAAIQLANAR+KPSI LFS
Sbjct: 351 HEKEREKVEDFVGDDTSIVFITTPGQLAALINDSAGVIATNTAAIQLANARDKPSIGLFS 410
Query: 419 SELKGRLFVPNAEEK-KCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
SE KG+LFVP AEEK C +I+S+TGKL D + V NAMQIF SLAL
Sbjct: 411 SEEKGKLFVPYAEEKSNCVIIASKTGKLADIDIGTVKNAMQIFEGSLALV 460
>gi|18394307|ref|NP_563986.1| NDH-dependent cyclic electron flow 1 [Arabidopsis thaliana]
gi|6587804|gb|AAF18495.1|AC010924_8 ESTs gb|T43280, gb|AA395195 come from this gene [Arabidopsis
thaliana]
gi|16323198|gb|AAL15333.1| At1g15980/T24D18_8 [Arabidopsis thaliana]
gi|21537186|gb|AAM61527.1| unknown [Arabidopsis thaliana]
gi|21700921|gb|AAM70584.1| At1g15980/T24D18_8 [Arabidopsis thaliana]
gi|332191271|gb|AEE29392.1| NDH-dependent cyclic electron flow 1 [Arabidopsis thaliana]
Length = 461
Score = 668 bits (1723), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 341/463 (73%), Positives = 384/463 (82%), Gaps = 11/463 (2%)
Query: 6 LLPKSTSPFLSNPLSSLPSTHFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKKN-PWLDP 64
LLPK SPF P F PL N +T++ + KKKN PWLDP
Sbjct: 7 LLPKPISPFFKTP-------PFSTSKPLVFL--NFQTRLTSRSSDVSVNLKKKNNPWLDP 57
Query: 65 FDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRR 124
FD GEDPD +YGSLFADGKQDEDPRPPD+P+NPYGFLKFP G+ E+ASLPLKIRGDVRR
Sbjct: 58 FDSGEDPDNEYGSLFADGKQDEDPRPPDNPDNPYGFLKFPKGYTVELASLPLKIRGDVRR 117
Query: 125 CCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDD 184
CCC+ISGGVYENLLFFP IQL+KDRYPGV +D++ + RGKQT+ELNKNVRWANVYD DD
Sbjct: 118 CCCVISGGVYENLLFFPTIQLIKDRYPGVQVDILTTERGKQTYELNKNVRWANVYDPDDH 177
Query: 185 WPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLL 244
WPEPAEYTD++G++K RYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN+AGAGL+
Sbjct: 178 WPEPAEYTDMIGLLKGRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNSAGAGLM 237
Query: 245 LSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKN 304
LSETFTAE+ NLSE GY+MY QM DWLGRPFRSVPR P+ PLRVSISR++KEVVA KY+N
Sbjct: 238 LSETFTAENTNLSELGYSMYTQMEDWLGRPFRSVPRTPLLPLRVSISRKVKEVVAAKYRN 297
Query: 305 AGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKERE 364
AGA GK+IVIHGIESDSKASMQS+GD DSLL ++ WA+I G+R F+P+FVIPHEKERE
Sbjct: 298 AGAVTGKFIVIHGIESDSKASMQSKGDADSLLSLEKWAKIIKGVRGFKPVFVIPHEKERE 357
Query: 365 GVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGR 424
VED VGDD SIVFITTPGQLAALINDSAGVIATNTAAIQLANAR+KP I LFSSE KG+
Sbjct: 358 NVEDFVGDDTSIVFITTPGQLAALINDSAGVIATNTAAIQLANARDKPCIGLFSSEEKGK 417
Query: 425 LFVPNAEEK-KCTVISSRTGKLIDTPVEAVLNAMQIFNESLAL 466
LFVP AEEK C +I+S+TGKL D + V NAMQ+F SLAL
Sbjct: 418 LFVPYAEEKSNCVIIASKTGKLADIDIGTVKNAMQVFEGSLAL 460
>gi|449436635|ref|XP_004136098.1| PREDICTED: uncharacterized protein LOC101205787 [Cucumis sativus]
Length = 466
Score = 665 bits (1717), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 348/469 (74%), Positives = 393/469 (83%), Gaps = 7/469 (1%)
Query: 1 MATTSLLPKSTSPFLSNPLSSLPST--HFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKK 58
MAT+SL KS PFL S HF KP F P SL ++K FQ KK
Sbjct: 1 MATSSLPTKSLPPFLPTSSFPPSSPSTHFTKP--FFLSPPLISLSTSSK---FQIPCSKK 55
Query: 59 NPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKI 118
NPW DPFDDGEDP+M+YGSL++DGKQ+EDPRPPD+PNNPYGFLKFP G+ E+ASL LK+
Sbjct: 56 NPWFDPFDDGEDPEMEYGSLYSDGKQEEDPRPPDNPNNPYGFLKFPQGYMVEVASLGLKL 115
Query: 119 RGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANV 178
RGDVRRCCC+ISGGVYENLLFFP IQLLKDRYPGV IDV+ SARGKQTFELNKNVRW+NV
Sbjct: 116 RGDVRRCCCVISGGVYENLLFFPTIQLLKDRYPGVQIDVVTSARGKQTFELNKNVRWSNV 175
Query: 179 YDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
YD DD +PEPAEYTD++G++K RYYDMVLSTKLAGLGHA FLFMTTARDRVSYIYP+VNA
Sbjct: 176 YDPDDYFPEPAEYTDMVGLLKGRYYDMVLSTKLAGLGHAVFLFMTTARDRVSYIYPDVNA 235
Query: 239 AGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
AGAGL LSETF ES NLSE G+NMY QMVDWLGRPFRSVPRHPVPPL+VSI+R+LKEV+
Sbjct: 236 AGAGLFLSETFKPESSNLSEGGFNMYHQMVDWLGRPFRSVPRHPVPPLKVSIARKLKEVI 295
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIP 358
AEKY+ AGAE+GKYIVIHGIESDSKASMQS+GDTDSLLPI+ WAEIA +R F+P+FVIP
Sbjct: 296 AEKYRKAGAEKGKYIVIHGIESDSKASMQSKGDTDSLLPIRAWAEIAKCIRGFKPVFVIP 355
Query: 359 HEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
HEKERE VE+ VG++ASIVFITTPGQLAALINDSAG+IATNTAA+QLA AREKPSIALF
Sbjct: 356 HEKERESVEEEVGNNASIVFITTPGQLAALINDSAGLIATNTAAVQLAIAREKPSIALFC 415
Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQIFNESLALA 467
SE K +LFVPNAE K+C +SS TG+LID + + A+Q F LALA
Sbjct: 416 SEEKAKLFVPNAEAKRCITVSSATGRLIDIDMGTIKKAIQSFEVPLALA 464
>gi|357145443|ref|XP_003573644.1| PREDICTED: uncharacterized protein LOC100839724 [Brachypodium
distachyon]
Length = 470
Score = 602 bits (1552), Expect = e-169, Method: Compositional matrix adjust.
Identities = 278/419 (66%), Positives = 345/419 (82%), Gaps = 7/419 (1%)
Query: 56 KKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLP 115
KKKNPWLDPFDDG D D Y +++ GKQDEDPRPP+DP+NPYGFL+FPMG+NPE+ SL
Sbjct: 50 KKKNPWLDPFDDGPDEDFDYTGVYSGGKQDEDPRPPEDPSNPYGFLRFPMGYNPELDSLA 109
Query: 116 LKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRW 175
K+RGDVRR CC++SGGVYEN+LFFP +QLLKDRYPGVLIDV+ASARGKQ +E+ KNVR+
Sbjct: 110 SKVRGDVRRACCVVSGGVYENVLFFPVVQLLKDRYPGVLIDVVASARGKQVYEMCKNVRY 169
Query: 176 ANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPN 235
ANVYD DDDWPEPAEYT LGV+KNRYYDM+LSTKLAG GHA FLFM++AR++V Y+YPN
Sbjct: 170 ANVYDPDDDWPEPAEYTHQLGVLKNRYYDMILSTKLAGTGHALFLFMSSAREKVGYVYPN 229
Query: 236 VNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLK 295
VN AGAGL L+E F + NLS+ GYNMY++M++WLGRP + VP+ P+PPLRVSIS++L+
Sbjct: 230 VNGAGAGLFLTEMFKPPTTNLSDGGYNMYQEMLEWLGRPGKGVPQQPIPPLRVSISKKLR 289
Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE----F 351
+V +KY AG E+GK++V+HGI SDS A+M SRGD D LLP++ WAEIA +
Sbjct: 290 GIVEDKYSRAGVEKGKFVVVHGIASDSVANMTSRGDDDCLLPLEQWAEIAKAISSQGNGL 349
Query: 352 RPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREK 411
RPLFV+PH+K RE +ED+VG + S +FITTPGQL LINDSAGV+ATNTAA+QLANAR++
Sbjct: 350 RPLFVMPHQKHREEIEDIVGRETSYLFITTPGQLTCLINDSAGVVATNTAAVQLANARDR 409
Query: 412 PSIALFSSELKGRLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQIFN--ESLALA 467
P +ALFSS+ K +LF+P AE+KK CTV++S TGKL D +EAV NA++ F S ALA
Sbjct: 410 PCVALFSSKEKAKLFLPYAEDKKSCTVVASATGKLADIDIEAVKNAVKEFEPAPSFALA 468
>gi|115475672|ref|NP_001061432.1| Os08g0276100 [Oryza sativa Japonica Group]
gi|37806059|dbj|BAC99486.1| unknown protein [Oryza sativa Japonica Group]
gi|113623401|dbj|BAF23346.1| Os08g0276100 [Oryza sativa Japonica Group]
gi|215737469|dbj|BAG96599.1| unnamed protein product [Oryza sativa Japonica Group]
Length = 469
Score = 592 bits (1526), Expect = e-166, Method: Compositional matrix adjust.
Identities = 277/426 (65%), Positives = 345/426 (80%), Gaps = 13/426 (3%)
Query: 52 QTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEI 111
+ AKKKNPWLDPFDDG D + Y F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+
Sbjct: 45 RASAKKKNPWLDPFDDGPDDEFDYRGAFSGGKQEEDPRPPEDPANPYGFLRFPAGYNPEL 104
Query: 112 ASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNK 171
SL K+R DVRR CC++SGGVYEN+LFFP +QLL+DRYPGV++DV+ASARGKQ +E+ K
Sbjct: 105 DSLASKVRRDVRRACCVVSGGVYENVLFFPVVQLLRDRYPGVVVDVVASARGKQVYEMCK 164
Query: 172 NVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY 231
NVR+A+VYD DDDWPEPAEYT LGV+KNRYYD+++STKLAG+GHA FLFM++ARD+V Y
Sbjct: 165 NVRYADVYDPDDDWPEPAEYTHQLGVLKNRYYDLIISTKLAGIGHALFLFMSSARDKVGY 224
Query: 232 IYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSIS 291
+YPNVN+AGAGL L+E F + NL++ GYNMY+ M++W+GRP + VP HPVPPLRVSIS
Sbjct: 225 VYPNVNSAGAGLFLTEMFRPPTTNLADSGYNMYQDMLEWIGRPAKGVPEHPVPPLRVSIS 284
Query: 292 RRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA------ 345
++L+ V +KY AG E+GK++V+HGI SDS A+M+SRGD D LLP++ WAEIA
Sbjct: 285 KKLRAFVEDKYSRAGVEKGKFVVVHGIASDSFANMRSRGDDDCLLPLEHWAEIAKEISSE 344
Query: 346 -NGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQ 404
NGL +PLFVIPH+K RE VE+ VG D +I+FITTPGQL LINDS GV+ATNTAA+Q
Sbjct: 345 DNGL---KPLFVIPHQKHREEVEETVGKDTNILFITTPGQLTCLINDSVGVVATNTAAVQ 401
Query: 405 LANAREKPSIALFSSELKGRLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQIFNE- 462
LANAR+KP +ALFSS+ K RLF+P EEKK CTV++S TGKLID VEAV A++ F
Sbjct: 402 LANARDKPCVALFSSKEKARLFLPYLEEKKGCTVVASETGKLIDIDVEAVKKAVKEFKAA 461
Query: 463 -SLALA 467
S+ALA
Sbjct: 462 PSVALA 467
>gi|226508728|ref|NP_001142788.1| uncharacterized protein LOC100275158 [Zea mays]
gi|195609704|gb|ACG26682.1| hypothetical protein [Zea mays]
gi|238010026|gb|ACR36048.1| unknown [Zea mays]
gi|413916872|gb|AFW56804.1| hypothetical protein ZEAMMB73_257177 [Zea mays]
Length = 466
Score = 587 bits (1513), Expect = e-165, Method: Compositional matrix adjust.
Identities = 269/409 (65%), Positives = 336/409 (82%), Gaps = 5/409 (1%)
Query: 55 AKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASL 114
AKKKNPWLDPFDDG D + Y +F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+ SL
Sbjct: 45 AKKKNPWLDPFDDGPDEEFDYQGMFSGGKQEEDPRPPEDPENPYGFLRFPQGYNPELDSL 104
Query: 115 PLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR 174
K+RGDVRR CC++SGGVYEN+LFFP +Q+LKDRYPGVL+DV+ASARGKQ +E+ KNVR
Sbjct: 105 ASKVRGDVRRACCVVSGGVYENVLFFPVVQMLKDRYPGVLVDVVASARGKQVYEMCKNVR 164
Query: 175 WANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYP 234
+ANVYD DDDWPEPAEYT +LGV+KNRYYD+++ST+LAG+GHA FLFM++ARD+V Y+YP
Sbjct: 165 YANVYDPDDDWPEPAEYTHMLGVLKNRYYDLIISTRLAGIGHALFLFMSSARDKVGYVYP 224
Query: 235 NVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRL 294
NVN+ GAGL L+E F A + NL++ GY+MY+ M++W+GRP ++VP+ P PLRVSIS++L
Sbjct: 225 NVNSVGAGLFLNEMFKAPTNNLADGGYHMYKDMLEWIGRPAKNVPQQPTQPLRVSISKKL 284
Query: 295 KEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLRE 350
+ V +KY AG E+GKY+VIHGI SDS A+M+SRGD D LLP+ WA+IA +G R
Sbjct: 285 RAYVEDKYSRAGVEKGKYVVIHGIASDSVANMKSRGDDDCLLPLGHWAQIAKEISSGDRG 344
Query: 351 FRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANARE 410
+PLFVIPHEK R+ +E+ VGDD SI+FITTPGQL LINDSAGV+ATNTAA+QLANAR+
Sbjct: 345 LKPLFVIPHEKHRDEIEEEVGDDTSILFITTPGQLTCLINDSAGVVATNTAAVQLANARD 404
Query: 411 KPSIALFSSELKGRLFVPNAEEK-KCTVISSRTGKLIDTPVEAVLNAMQ 458
KP +ALFSS K RLF+P E K CTVISS TG LID + AV N ++
Sbjct: 405 KPCVALFSSAEKARLFLPYVEGKSSCTVISSATGNLIDIDLNAVKNVVK 453
>gi|413921876|gb|AFW61808.1| hypothetical protein ZEAMMB73_450954 [Zea mays]
Length = 471
Score = 579 bits (1492), Expect = e-162, Method: Compositional matrix adjust.
Identities = 269/409 (65%), Positives = 337/409 (82%), Gaps = 5/409 (1%)
Query: 55 AKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASL 114
AKKKNPWLDPFDDG D + Y +F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+ SL
Sbjct: 50 AKKKNPWLDPFDDGPDEEFDYQGMFSGGKQEEDPRPPEDPANPYGFLRFPQGYNPELDSL 109
Query: 115 PLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR 174
K+RGDVRR CC++SGGVYEN+LF P +Q+LKDRYPGVL+DV+AS RGKQ +E+ KNVR
Sbjct: 110 ASKVRGDVRRACCVVSGGVYENVLFLPVVQMLKDRYPGVLVDVVASPRGKQVYEMCKNVR 169
Query: 175 WANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYP 234
+ANVYD DDDWPEPAEYT +LGV+KNRYYD+++ST+LAG+GHA FLFM++ARD+V Y+YP
Sbjct: 170 YANVYDPDDDWPEPAEYTHMLGVLKNRYYDLIVSTRLAGIGHALFLFMSSARDKVGYVYP 229
Query: 235 NVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRL 294
NVN+ GAGL L+E F A + NL++ GY+MY+ M++W+GRP ++VPR P PLRVSIS++L
Sbjct: 230 NVNSVGAGLFLNEMFKAPTNNLADGGYHMYKDMLEWIGRPAKNVPRQPTQPLRVSISKKL 289
Query: 295 KEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLRE 350
+ V +KY AG E+GKY+V+HGI SDS ASM SRGD D LLP++ WA+IA +G +
Sbjct: 290 RAYVEDKYSRAGVEKGKYVVVHGIASDSVASMTSRGDDDCLLPLEHWAQIAKEISSGDKG 349
Query: 351 FRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANARE 410
+PLFVIPH+K RE VE+ VGDD +I+FITTPGQL LINDSAGV+ATNTAA+QLANAR+
Sbjct: 350 LKPLFVIPHDKHREEVEEEVGDDTNILFITTPGQLTCLINDSAGVVATNTAAVQLANARD 409
Query: 411 KPSIALFSSELKGRLFVPNAEEK-KCTVISSRTGKLIDTPVEAVLNAMQ 458
KP +ALFSS K RLF+P E+K CTVISS TGKLID + AV NA++
Sbjct: 410 KPCVALFSSAEKARLFLPYVEDKSSCTVISSMTGKLIDIDINAVKNAVK 458
>gi|326487296|dbj|BAJ89632.1| predicted protein [Hordeum vulgare subsp. vulgare]
gi|326496056|dbj|BAJ90649.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 469
Score = 573 bits (1476), Expect = e-161, Method: Compositional matrix adjust.
Identities = 277/456 (60%), Positives = 361/456 (79%), Gaps = 11/456 (2%)
Query: 12 SPFLSNPLSSLPSTHFIK--PSPLFNPPDNHSLLITTKRLS--FQTHAKKKNPWLDPFDD 67
+P + P+++ P++ P+P P H L+ +R++ + +KK+NPWLDPFDD
Sbjct: 3 TPTMPTPMAARPTSTMATKLPAPSATPRQCH--LLPGRRVAGLARASSKKRNPWLDPFDD 60
Query: 68 GEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRRCCC 127
G D + Y +++ GKQ+EDPRPP+D NPYGFL+FPMG+ PE+ SL K+RGDVRR CC
Sbjct: 61 GPDEEFDYTGVYSGGKQEEDPRPPEDAENPYGFLRFPMGYMPELDSLASKVRGDVRRVCC 120
Query: 128 IISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPE 187
++SGGVYEN+LFFP +Q+LKDRYPGVL+DV+ASARGKQ +E+ KNVR+ANVYD DD+WPE
Sbjct: 121 VVSGGVYENVLFFPVVQMLKDRYPGVLVDVVASARGKQVYEMCKNVRYANVYDPDDEWPE 180
Query: 188 PAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSE 247
PAEYT LGVMKNRYYDMVLSTKLAG GHA FLFM++AR++V Y+YPNVN AGAGL L+E
Sbjct: 181 PAEYTHQLGVMKNRYYDMVLSTKLAGTGHALFLFMSSAREKVGYVYPNVNGAGAGLFLTE 240
Query: 248 TFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGA 307
F + NL++ GYNMY+ M++WLGRP + VP+ P+PPLRVSIS++L+ VV +KY AG
Sbjct: 241 MFKPATTNLADGGYNMYQDMLEWLGRPAKGVPQQPIPPLRVSISKKLRAVVEDKYNRAGV 300
Query: 308 EQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLREFRPLFVIPHEKER 363
E+GKY+VIHGIESDS A+M+SRGD D LLP+++WAEIA +G RPLFV+PHE+ R
Sbjct: 301 EKGKYVVIHGIESDSVANMKSRGDDDCLLPLELWAEIAKEISSGGNGLRPLFVMPHERHR 360
Query: 364 EGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKG 423
E +E++VG++ + +FITTPGQL LINDSAGV+ATNTAA+QLANAR+KP +ALFSS+ K
Sbjct: 361 EEIEEIVGEETAYLFITTPGQLTCLINDSAGVVATNTAAVQLANARDKPCVALFSSKEKA 420
Query: 424 RLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQ 458
RLF+P EE+K CTV++S TGKL +EAV A++
Sbjct: 421 RLFLPYVEERKSCTVVASATGKLAGIDIEAVKKAVK 456
>gi|326525951|dbj|BAJ93152.1| predicted protein [Hordeum vulgare subsp. vulgare]
Length = 469
Score = 571 bits (1471), Expect = e-160, Method: Compositional matrix adjust.
Identities = 276/456 (60%), Positives = 361/456 (79%), Gaps = 11/456 (2%)
Query: 12 SPFLSNPLSSLPSTHFIK--PSPLFNPPDNHSLLITTKRLS--FQTHAKKKNPWLDPFDD 67
+P + P+++ P++ P+P P H L+ +R++ + +KK+NPWLDPFDD
Sbjct: 3 TPTMPTPMAARPTSTMATKLPAPSATPRQCH--LLPGRRVAGLARASSKKRNPWLDPFDD 60
Query: 68 GEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKIRGDVRRCCC 127
G D + Y +++ GKQ+EDPRPP+D NPYGFL+FPMG+ PE+ SL K+RGDVRR CC
Sbjct: 61 GPDEEFDYTGVYSGGKQEEDPRPPEDAENPYGFLRFPMGYMPELDSLASKVRGDVRRVCC 120
Query: 128 IISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPE 187
++SGGVYEN+LFFP +Q+LKDRYPGVL+DV+ASARG+Q +E+ KNVR+ANVYD DD+WPE
Sbjct: 121 VVSGGVYENVLFFPVVQMLKDRYPGVLVDVVASARGEQVYEMCKNVRYANVYDPDDEWPE 180
Query: 188 PAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSE 247
PAEYT LGVMKNRYYDMVLSTKLAG GHA FLFM++AR++V Y+YPNVN AGAGL L+E
Sbjct: 181 PAEYTHQLGVMKNRYYDMVLSTKLAGTGHALFLFMSSAREKVGYVYPNVNGAGAGLFLTE 240
Query: 248 TFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGA 307
F + NL++ GYNMY+ M++WLGRP + VP+ P+PPLRVSIS++L+ VV +KY AG
Sbjct: 241 MFKPATTNLADGGYNMYQDMLEWLGRPAKGVPQQPIPPLRVSISKKLRAVVEDKYNRAGV 300
Query: 308 EQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLREFRPLFVIPHEKER 363
E+GKY+VIHGIESDS A+M+SRGD D LLP+++WAEIA +G RPLFV+PHE+ R
Sbjct: 301 EKGKYVVIHGIESDSVANMKSRGDDDCLLPLELWAEIAKEISSGGNGLRPLFVMPHERHR 360
Query: 364 EGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKG 423
E +E++VG++ + +FITTPGQL LINDSAGV+ATNTAA+QLANAR+KP +ALFSS+ K
Sbjct: 361 EEIEEIVGEETAYLFITTPGQLTCLINDSAGVVATNTAAVQLANARDKPCVALFSSKEKA 420
Query: 424 RLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQ 458
RLF+P EE+K CTV++S TGKL +EAV A++
Sbjct: 421 RLFLPYVEERKSCTVVASATGKLAGIDIEAVKKAVK 456
>gi|218200819|gb|EEC83246.1| hypothetical protein OsI_28562 [Oryza sativa Indica Group]
Length = 462
Score = 568 bits (1465), Expect = e-159, Method: Compositional matrix adjust.
Identities = 270/426 (63%), Positives = 336/426 (78%), Gaps = 15/426 (3%)
Query: 52 QTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEI 111
+ AKKKNPWLDPFDDG D + Y F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+
Sbjct: 40 RASAKKKNPWLDPFDDGPDDEFDYRGAFSGGKQEEDPRPPEDPANPYGFLRFPAGYNPEL 99
Query: 112 ASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNK 171
SL K+R DVRR CC++SGGVYEN+LFFP +QLL+DRYPGV++DV+ASARGKQ +E+ K
Sbjct: 100 DSLASKVRRDVRRACCVVSGGVYENVLFFPVVQLLRDRYPGVVVDVVASARGKQVYEMCK 159
Query: 172 NVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY 231
NVR+A+VYD DDDWPEPAE +RYYD++LST+LAG GH FLFM++ARD+V Y
Sbjct: 160 NVRYADVYDPDDDWPEPAETPTSSAC--SRYYDLILSTRLAGFGHGLFLFMSSARDKVGY 217
Query: 232 IYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSIS 291
+YPNVN+AGAGL L+E F + NL++ GYNMY+ M++W+GRP + VP HPVPPLRVSIS
Sbjct: 218 VYPNVNSAGAGLFLTEMFRPPTTNLADGGYNMYQDMLEWIGRPAKGVPEHPVPPLRVSIS 277
Query: 292 RRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA------ 345
++L+ V +KY AG E+GK++V+HGI SDS A+M+SRGD D LLP++ WAEIA
Sbjct: 278 KKLRAFVEDKYSRAGVEKGKFVVVHGIASDSFANMRSRGDDDCLLPLEHWAEIAKEISSE 337
Query: 346 -NGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQ 404
NGL +PLFVIPH+K RE VE+ VG D +I+FITTPGQL LINDS GV+ATNTAA+Q
Sbjct: 338 DNGL---KPLFVIPHQKHREEVEETVGKDTNILFITTPGQLTCLINDSVGVVATNTAAVQ 394
Query: 405 LANAREKPSIALFSSELKGRLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQIFNE- 462
LANAR+KP +ALFSSE K RLF+P EEKK CTV++S TGKLID VEAV A++ F
Sbjct: 395 LANARDKPCVALFSSEEKARLFLPYLEEKKGCTVVASETGKLIDIDVEAVKKAVKEFKAA 454
Query: 463 -SLALA 467
S+ALA
Sbjct: 455 PSVALA 460
>gi|222640246|gb|EEE68378.1| hypothetical protein OsJ_26704 [Oryza sativa Japonica Group]
Length = 426
Score = 472 bits (1214), Expect = e-130, Method: Compositional matrix adjust.
Identities = 245/426 (57%), Positives = 304/426 (71%), Gaps = 51/426 (11%)
Query: 52 QTHAKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEI 111
+ AKKKNPWLDPFDDG D + Y F+ GKQ+EDPRPP P P P G
Sbjct: 40 RASAKKKNPWLDPFDDGPDDEFDYRGAFSGGKQEEDPRPPGGPGEPVRVPAVPGG----- 94
Query: 112 ASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNK 171
LL+DRYPGV++DV+ASARGKQ +E+ K
Sbjct: 95 ---------------------------------LLRDRYPGVVVDVVASARGKQVYEMCK 121
Query: 172 NVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY 231
NVR+A+VYD DDDWPEPAEYT LGV+KNRYYD+++STKLAG+GHA FLFM++ARD+V Y
Sbjct: 122 NVRYADVYDPDDDWPEPAEYTHQLGVLKNRYYDLIISTKLAGIGHALFLFMSSARDKVGY 181
Query: 232 IYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSIS 291
+YPNVN+AGAGL L+E F + NL++ GYNMY+ M++W+GRP + VP HPVPPLRVSIS
Sbjct: 182 VYPNVNSAGAGLFLTEMFRPPTTNLADSGYNMYQDMLEWIGRPAKGVPEHPVPPLRVSIS 241
Query: 292 RRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA------ 345
++L+ V +KY AG E+GK++V+HGI SDS A+M+SRGD D LLP++ WAEIA
Sbjct: 242 KKLRAFVEDKYSRAGVEKGKFVVVHGIASDSFANMRSRGDDDCLLPLEHWAEIAKEISSE 301
Query: 346 -NGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQ 404
NGL +PLFVIPH+K RE VE+ VG D +I+FITTPGQL LINDS GV+ATNTAA+Q
Sbjct: 302 DNGL---KPLFVIPHQKHREEVEETVGKDTNILFITTPGQLTCLINDSVGVVATNTAAVQ 358
Query: 405 LANAREKPSIALFSSELKGRLFVPNAEEKK-CTVISSRTGKLIDTPVEAVLNAMQIFNE- 462
LANAR+KP +ALFSS+ K RLF+P EEKK CTV++S TGKLID VEAV A++ F
Sbjct: 359 LANARDKPCVALFSSKEKARLFLPYLEEKKGCTVVASETGKLIDIDVEAVKKAVKEFKAA 418
Query: 463 -SLALA 467
S+ALA
Sbjct: 419 PSVALA 424
>gi|302757169|ref|XP_002962008.1| hypothetical protein SELMODRAFT_64169 [Selaginella moellendorffii]
gi|300170667|gb|EFJ37268.1| hypothetical protein SELMODRAFT_64169 [Selaginella moellendorffii]
Length = 399
Score = 443 bits (1139), Expect = e-121, Method: Compositional matrix adjust.
Identities = 219/400 (54%), Positives = 291/400 (72%), Gaps = 4/400 (1%)
Query: 61 WLDPF-DDGEDPDMQYGSLFADGKQ--DEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLK 117
W DPF + EDPD+ +G+LF +G + +++ PP DP + GF K P G+ EI L K
Sbjct: 1 WKDPFLFNEEDPDLVWGNLFTEGGRQPEQEYIPPLDPKSKSGFRKVPKGYTAEIRPLAAK 60
Query: 118 IRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWAN 177
+R +VR+C C+++GGVYENLLFFP I+LLKDRYPGV IDVIAS RGKQ +E+NKNV A
Sbjct: 61 VRDEVRKCLCLVAGGVYENLLFFPVIELLKDRYPGVEIDVIASERGKQVYEMNKNVSRAW 120
Query: 178 VYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN 237
V+D+D + +PA Y ++L ++KN YYDM++STKLAG+GHA F+++T + YIYP+VN
Sbjct: 121 VFDIDQQFIKPALYMEMLTMVKNEYYDMIVSTKLAGIGHALFMWLTDCTKNIGYIYPDVN 180
Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
AGAG L A +NL+E GY+MY ++ + LG+P + +PR P L V IS RL+ V
Sbjct: 181 GAGAGPFLYAAVDAPRLNLAEGGYHMYHELNEELGKPAKGIPRIYPPQLTVGISSRLRGV 240
Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI 357
+ +K+ A +G++IVIHGIESDS A+M+SRGD+DSLLP+ W+ IA L P+FVI
Sbjct: 241 ILKKWTEASLRRGEFIVIHGIESDSAATMKSRGDSDSLLPVSRWSLIAKSLSSKVPVFVI 300
Query: 358 PHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
PHEKER VE+V+G DA I+FITTPGQLAALINDS GVI TNTAA+QLA + +KP++ALF
Sbjct: 301 PHEKERSKVEEVIGPDAHIIFITTPGQLAALINDSFGVITTNTAALQLAISVKKPTVALF 360
Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
SS+ K +LFVPN C I+S TGKL+D +E V NA+
Sbjct: 361 SSDEKAKLFVPN-HATSCDCIASDTGKLLDVSMEKVKNAI 399
>gi|302775324|ref|XP_002971079.1| hypothetical protein SELMODRAFT_64160 [Selaginella moellendorffii]
gi|300161061|gb|EFJ27677.1| hypothetical protein SELMODRAFT_64160 [Selaginella moellendorffii]
Length = 399
Score = 441 bits (1134), Expect = e-121, Method: Compositional matrix adjust.
Identities = 217/400 (54%), Positives = 291/400 (72%), Gaps = 4/400 (1%)
Query: 61 WLDPF-DDGEDPDMQYGSLFADGKQ--DEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLK 117
W DPF + EDPD+ +G+LF +G + +++ PP DP + GF K P G+ EI L K
Sbjct: 1 WKDPFLFNEEDPDLVWGNLFTEGGRQPEQEYIPPLDPKSKSGFRKVPKGYTAEIRPLAAK 60
Query: 118 IRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWAN 177
+R +VR+C C+++GGVYENLLFFP I+LLKDRYPGV IDVIAS RGKQ +E+NKNV A
Sbjct: 61 VRDEVRKCLCLVAGGVYENLLFFPVIELLKDRYPGVEIDVIASERGKQVYEMNKNVSRAW 120
Query: 178 VYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVN 237
V+D+D + +PA Y ++L ++KN YYDM++STKLAG+GHA F+++T + YIYP+VN
Sbjct: 121 VFDIDQQFIKPALYMEMLTMVKNEYYDMIVSTKLAGIGHALFMWLTDCTKNIGYIYPDVN 180
Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
AGAG L A +NL+E GY+MY ++ + LG+P + +PR P L V IS RL+ V
Sbjct: 181 GAGAGPFLYAAVDAPRLNLAEGGYHMYHELNEELGKPAKGIPRIYPPQLTVGISSRLRGV 240
Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI 357
+ +K+ A +G++IVIHGIESDS A+M+SRGD+DSLLP+ W+ IA L P+FVI
Sbjct: 241 ILKKWTEASLRRGEFIVIHGIESDSAATMKSRGDSDSLLPLSKWSLIAKSLSSKVPVFVI 300
Query: 358 PHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
PHEKER V++V+G DA I+FITTPGQ+AALINDS GVI TNTAA+QLA + +KP++ALF
Sbjct: 301 PHEKERSKVQEVIGPDAHIIFITTPGQIAALINDSFGVITTNTAAVQLAISVKKPTVALF 360
Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
SS+ K +LFVPN C I+S TGKL+D +E V NA+
Sbjct: 361 SSDEKAKLFVPN-HATSCDCIASDTGKLLDVSMEKVKNAI 399
>gi|168029037|ref|XP_001767033.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162681775|gb|EDQ68199.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 473
Score = 436 bits (1122), Expect = e-120, Method: Compositional matrix adjust.
Identities = 217/421 (51%), Positives = 295/421 (70%), Gaps = 4/421 (0%)
Query: 50 SFQTHAKKKNPWLDPFDDGEDPDMQ-YGSLFADGKQD-EDPRPPDDPNNPYGFLKFPMGF 107
S AKKK + DPFD G D + G L + G Q+ EDPRP DP++ G+L FP GF
Sbjct: 54 SVVASAKKKGWFDDPFDYGADEEEDTMGELMSQGPQEAEDPRPSRDPDSESGYLDFPAGF 113
Query: 108 NPEIASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTF 167
PE+ASL IR DVRRC C++SGGVYENLLFFP IQ+LK+RYPGV IDV+A+ RGKQT+
Sbjct: 114 MPEVASLGTLIRNDVRRCLCMVSGGVYENLLFFPVIQMLKNRYPGVRIDVMATPRGKQTY 173
Query: 168 ELNKNVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARD 227
E+NKNVR A V+ +DD + P ++T+ +G +K YYD+V+STKLAGLG + F ++ + R+
Sbjct: 174 EMNKNVRMAWVHPVDDQFLRPVDFTETIGKIKGEYYDLVVSTKLAGLGQSIFFWLASVRN 233
Query: 228 RVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLR 287
+VSY +P+VNAAGA L A + L+E G+NMY +M++ L + +++P+ PPL
Sbjct: 234 KVSYTFPDVNAAGATKFLDVAIKAPQLELAESGFNMYAEMIEELSQMGKNLPKTEAPPLE 293
Query: 288 VSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANG 347
V I R++K V KY AG +G+++V HGI+SDS ASM SRGD DSLLPI++WAEIA
Sbjct: 294 VGIGRKVKAYVENKYLEAGLSEGEFLVFHGIKSDSSASMTSRGDKDSLLPIEMWAEIAKS 353
Query: 348 LREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLAN 407
E + +FVIP+EK R V++ G++ IVFITTP QL ALIN S GV+ TNTAA+QLA
Sbjct: 354 TSE-KVVFVIPNEKLRSKVKESCGENTHIVFITTPSQLGALINASTGVVTTNTAALQLAI 412
Query: 408 AREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQ-IFNESLAL 466
A +KP+++LF+S+ K +LF+P C V++S+TGKL + V+AV A+ I E+L
Sbjct: 413 ALKKPTVSLFASQEKAKLFIPEYARDSCAVVASKTGKLSNLDVKAVTMALSTIAKEALVA 472
Query: 467 A 467
A
Sbjct: 473 A 473
>gi|168023336|ref|XP_001764194.1| predicted protein [Physcomitrella patens subsp. patens]
gi|162684634|gb|EDQ71035.1| predicted protein [Physcomitrella patens subsp. patens]
Length = 475
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 210/414 (50%), Positives = 290/414 (70%), Gaps = 5/414 (1%)
Query: 58 KNPWLD-PFDDG-EDPDMQYGSLFADGKQD-EDPRPPDDPNNPYGFLKFPMGFNPEIASL 114
K W D PFD G +D + G L + G Q EDPRP DP++ G+L FP GF PE+ASL
Sbjct: 63 KKGWFDDPFDYGADDEEDTMGELMSQGPQGAEDPRPARDPDSESGYLDFPAGFMPEVASL 122
Query: 115 PLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR 174
+ IR DVRRC C+ISGGVYENLLFFP IQLLK+RYPGV IDV+A+ RGKQ +E+NKNVR
Sbjct: 123 GILIRNDVRRCLCMISGGVYENLLFFPVIQLLKNRYPGVRIDVMATPRGKQAYEMNKNVR 182
Query: 175 WANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYP 234
A V+ +DD + P ++T+ +G +K YYD+++STKLAGLG + F ++ + R++VSY YP
Sbjct: 183 KAWVHPVDDQFLRPVDFTETVGKIKGEYYDLLVSTKLAGLGQSIFFWLASVRNKVSYTYP 242
Query: 235 NVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRL 294
+VNAAGA L A + L+E G+NMY +M++ L + ++VP+ VPPL V I ++
Sbjct: 243 DVNAAGAAKFLDIAIKAPQLELAESGFNMYAEMIEELSQMGKNVPKTEVPPLEVGIGSKV 302
Query: 295 KEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPL 354
K V KY+ AG +G+++V HGIE DS ASM S+GD D LLP+ +WAEIA + + +
Sbjct: 303 KAYVEAKYREAGVREGEFLVFHGIECDSSASMTSKGDKDCLLPLSMWAEIAKSTSD-KVV 361
Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
FVIP+EK R V+++ G++A IVFITTPGQL ALIN S GV+ TNTAA+Q+A A +K ++
Sbjct: 362 FVIPNEKWRRKVKEICGENAHIVFITTPGQLGALINASKGVVTTNTAALQIAIALKKSTV 421
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQ-IFNESLALA 467
ALF+S+ K LF+P+ + C +++S+TGKL ++A A+ I E+L A
Sbjct: 422 ALFASQEKANLFIPDYAKDACAMVASKTGKLCGLDLKAATMAVSTIAKEALVAA 475
>gi|398313921|emb|CCI55387.1| NDH subunit PnsB1 [Marchantia polymorpha]
Length = 480
Score = 407 bits (1045), Expect = e-111, Method: Compositional matrix adjust.
Identities = 201/403 (49%), Positives = 274/403 (67%), Gaps = 4/403 (0%)
Query: 58 KNPWLDPFDDGEDPDMQYGSLFADGKQD-EDPRPPDDPNNPYGFLKFPMGFNPEIASLPL 116
KN DPFD GED D++YG L ++GKQ E PRP D N+ YGFL FP N EIASL L
Sbjct: 69 KNMLDDPFDFGEDEDLEYGDLMSEGKQGVEPPRPRKDKNSKYGFLDFPGYHNLEIASLGL 128
Query: 117 KIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWA 176
+R DVRRC C ++GGVYENLLFFP I+LLK RYPGV +D++ SARGKQ +E+NK V A
Sbjct: 129 YVRKDVRRCLCWVAGGVYENLLFFPVIKLLKARYPGVKVDIVTSARGKQVYEMNKYVNKA 188
Query: 177 NVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARD-RVSYIYPN 235
+D+D +W EP E + +GV++++YYDMVLSTK AG H L+M R R+SY+ P
Sbjct: 189 YAFDVDAEWVEPVELNEFIGVLRDQYYDMVLSTKYAGFKHCTTLYMIGGRTRRISYVLPY 248
Query: 236 VNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLK 295
+ + L+ + NL++ GY+MY++++D+L +P VP VP + V + +R++
Sbjct: 249 HSEWVSNSFLTTSLMPPRENLADGGYHMYKELIDYLAQPGNGVPEQAVPMMEVGVPKRVR 308
Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF 355
V KY AG E GK++V HG+ESDS ASMQ++GD+DSLLP + WA++ L
Sbjct: 309 AVATSKYTEAGVEAGKFVVFHGVESDSGASMQTKGDSDSLLPPEFWAKLKASAGT-SVLV 367
Query: 356 VIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
VIP+ K+++ V GDD +VFITTPGQL A+I DSAGV++TNTAA+Q+A A KPS+A
Sbjct: 368 VIPNMKDKKKVIAACGDDVHVVFITTPGQLGAIIRDSAGVVSTNTAAVQIAIAFNKPSVA 427
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQ 458
+F S+ K LFVP KKC V++S+TGKL D V+ ++A +
Sbjct: 428 IFGSDEKASLFVPPL-AKKCKVVTSKTGKLADVEVDEAISAWK 469
>gi|449491201|ref|XP_004158827.1| PREDICTED: uncharacterized protein LOC101226812 [Cucumis sativus]
Length = 260
Score = 362 bits (928), Expect = 3e-97, Method: Compositional matrix adjust.
Identities = 194/265 (73%), Positives = 215/265 (81%), Gaps = 7/265 (2%)
Query: 1 MATTSLLPKSTSPFLSNPLSSLPST--HFIKPSPLFNPPDNHSLLITTKRLSFQTHAKKK 58
MAT+SL KS PFL S HF KP F P SL ++K FQ KK
Sbjct: 1 MATSSLPTKSLPPFLPTSSFPPSSPSTHFTKP--FFLSPPLISLSTSSK---FQIPCSKK 55
Query: 59 NPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASLPLKI 118
NPW DPFDDGEDP+M+YGSL++DGKQ+EDPRPPD+PNNPYGFLKFP G+ E+ASL LKI
Sbjct: 56 NPWFDPFDDGEDPEMEYGSLYSDGKQEEDPRPPDNPNNPYGFLKFPQGYMVEVASLGLKI 115
Query: 119 RGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANV 178
RGDVRRCCC+ISGGVYENLLFFP IQLLKDRYPGV IDV+ SARGKQTFELNKNVRW+NV
Sbjct: 116 RGDVRRCCCVISGGVYENLLFFPTIQLLKDRYPGVQIDVVTSARGKQTFELNKNVRWSNV 175
Query: 179 YDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
YD DD +PEPAEYTD++G++K RYYDMVLSTKLAGLGHA FLFMTTARDRVSYIYP+VNA
Sbjct: 176 YDPDDYFPEPAEYTDMVGLLKGRYYDMVLSTKLAGLGHAVFLFMTTARDRVSYIYPDVNA 235
Query: 239 AGAGLLLSETFTAESMNLSERGYNM 263
AGAGL LSETF ES NLSE G+NM
Sbjct: 236 AGAGLFLSETFKPESSNLSEGGFNM 260
>gi|413921875|gb|AFW61807.1| hypothetical protein ZEAMMB73_450954 [Zea mays]
Length = 284
Score = 345 bits (885), Expect = 3e-92, Method: Compositional matrix adjust.
Identities = 166/270 (61%), Positives = 216/270 (80%), Gaps = 5/270 (1%)
Query: 194 ILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAES 253
+L ++NRYYD+++ST+LAG+GHA FLFM++ARD+V Y+YPNVN+ GAGL L+E F A +
Sbjct: 2 LLQTLQNRYYDLIVSTRLAGIGHALFLFMSSARDKVGYVYPNVNSVGAGLFLNEMFKAPT 61
Query: 254 MNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYI 313
NL++ GY+MY+ M++W+GRP ++VPR P PLRVSIS++L+ V +KY AG E+GKY+
Sbjct: 62 NNLADGGYHMYKDMLEWIGRPAKNVPRQPTQPLRVSISKKLRAYVEDKYSRAGVEKGKYV 121
Query: 314 VIHGIESDSKASMQSRGDTDSLLPIQVWAEIA----NGLREFRPLFVIPHEKEREGVEDV 369
V+HGI SDS ASM SRGD D LLP++ WA+IA +G + +PLFVIPH+K RE VE+
Sbjct: 122 VVHGIASDSVASMTSRGDDDCLLPLEHWAQIAKEISSGDKGLKPLFVIPHDKHREEVEEE 181
Query: 370 VGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPN 429
VGDD +I+FITTPGQL LINDSAGV+ATNTAA+QLANAR+KP +ALFSS K RLF+P
Sbjct: 182 VGDDTNILFITTPGQLTCLINDSAGVVATNTAAVQLANARDKPCVALFSSAEKARLFLPY 241
Query: 430 AEEK-KCTVISSRTGKLIDTPVEAVLNAMQ 458
E+K CTVISS TGKLID + AV NA++
Sbjct: 242 VEDKSSCTVISSMTGKLIDIDINAVKNAVK 271
>gi|413916873|gb|AFW56805.1| hypothetical protein ZEAMMB73_257177 [Zea mays]
Length = 253
Score = 337 bits (865), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 146/209 (69%), Positives = 182/209 (87%)
Query: 55 AKKKNPWLDPFDDGEDPDMQYGSLFADGKQDEDPRPPDDPNNPYGFLKFPMGFNPEIASL 114
AKKKNPWLDPFDDG D + Y +F+ GKQ+EDPRPP+DP NPYGFL+FP G+NPE+ SL
Sbjct: 45 AKKKNPWLDPFDDGPDEEFDYQGMFSGGKQEEDPRPPEDPENPYGFLRFPQGYNPELDSL 104
Query: 115 PLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR 174
K+RGDVRR CC++SGGVYEN+LFFP +Q+LKDRYPGVL+DV+ASARGKQ +E+ KNVR
Sbjct: 105 ASKVRGDVRRACCVVSGGVYENVLFFPVVQMLKDRYPGVLVDVVASARGKQVYEMCKNVR 164
Query: 175 WANVYDLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYP 234
+ANVYD DDDWPEPAEYT +LGV+KNRYYD+++ST+LAG+GHA FLFM++ARD+V Y+YP
Sbjct: 165 YANVYDPDDDWPEPAEYTHMLGVLKNRYYDLIISTRLAGIGHALFLFMSSARDKVGYVYP 224
Query: 235 NVNAAGAGLLLSETFTAESMNLSERGYNM 263
NVN+ GAGL L+E F A + NL++ GY+M
Sbjct: 225 NVNSVGAGLFLNEMFKAPTNNLADGGYHM 253
>gi|449528831|ref|XP_004171406.1| PREDICTED: uncharacterized LOC101205787, partial [Cucumis sativus]
Length = 206
Score = 307 bits (787), Expect = 7e-81, Method: Compositional matrix adjust.
Identities = 156/204 (76%), Positives = 178/204 (87%)
Query: 264 YEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSK 323
Y QMVDWLGRPFRSVPRHPVPPL+VSI+R+LKEV+AEKYK AGAE+GKYIVIHGIESDSK
Sbjct: 1 YHQMVDWLGRPFRSVPRHPVPPLKVSIARKLKEVIAEKYKKAGAEKGKYIVIHGIESDSK 60
Query: 324 ASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVEDVVGDDASIVFITTPG 383
ASMQS+GDTDSLLPI+ WAEIA +R F+P+FVIPHEKERE VE+ VG++ASIVFITTPG
Sbjct: 61 ASMQSKGDTDSLLPIRAWAEIAKCIRGFKPVFVIPHEKERESVEEEVGNNASIVFITTPG 120
Query: 384 QLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTG 443
QLAALINDSAG+IATNTAA+QLA AREKPSIALF SE K +LFVPNAE K+C +SS TG
Sbjct: 121 QLAALINDSAGLIATNTAAVQLAIAREKPSIALFCSEEKAKLFVPNAEAKRCITVSSATG 180
Query: 444 KLIDTPVEAVLNAMQIFNESLALA 467
+LID + + A+Q F LALA
Sbjct: 181 RLIDIDMGTIKKAIQSFEVPLALA 204
>gi|253761918|ref|XP_002489333.1| hypothetical protein SORBIDRAFT_0010s015060 [Sorghum bicolor]
gi|241946981|gb|EES20126.1| hypothetical protein SORBIDRAFT_0010s015060 [Sorghum bicolor]
Length = 116
Score = 123 bits (308), Expect = 2e-25, Method: Composition-based stats.
Identities = 53/80 (66%), Positives = 66/80 (82%)
Query: 129 ISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEP 188
+SGGVYEN+LFFP Q+LKDRYPGVL DV+ASA+GKQ +E+ NV +ANVYD DDDWPEP
Sbjct: 23 LSGGVYENVLFFPVAQMLKDRYPGVLFDVVASAKGKQVYEMCMNVTYANVYDPDDDWPEP 82
Query: 189 AEYTDILGVMKNRYYDMVLS 208
EYT +LGV+K D+V++
Sbjct: 83 TEYTHMLGVLKGDTVDLVVA 102
>gi|434384628|ref|YP_007095239.1| ADP-heptose:LPS heptosyltransferase [Chamaesiphon minutus PCC 6605]
gi|428015618|gb|AFY91712.1| ADP-heptose:LPS heptosyltransferase [Chamaesiphon minutus PCC 6605]
Length = 320
Score = 110 bits (275), Expect = 1e-21, Method: Compositional matrix adjust.
Identities = 87/339 (25%), Positives = 159/339 (46%), Gaps = 28/339 (8%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP I LK RYP IDV+ R + ++K+V +D
Sbjct: 2 RILALVPGGIGDQILFFPTIDNLKGRYPDAEIDVVVEPRSTSAYRVSKSVHKVIPFDF-K 60
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
D PA++ ++LG++++ YD+ +S G L++T R+ + + + L
Sbjct: 61 DVNGPADWGNLLGIIRDSEYDVAMSLGRR-FGVGLLLWLTGIPVRIGF-----DVGASKL 114
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
LS S+NL++ M+ +V L P P + +++ + + + +
Sbjct: 115 FLSNPV---SLNLNQYAAQMFHDLVRGLNI------TDPCPAVGINVPKADIDWAEAEQQ 165
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIPHE 360
G ++ Y++IHG E+ ++ D + P++ W +I L+ + P+ VI
Sbjct: 166 RLGIKESGYVLIHGGENKPDRTL----GVDRIYPVKQWQQIVKDLQTRQPNIPVAVIEGP 221
Query: 361 KEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ E +E ++ ++ I+ P G+LAA I + ++ TN+ + LA A +IALF
Sbjct: 222 DDGEFIEALLTLCPALKVISPPDIGKLAATIAGANLMLCTNSGPMHLAVAVGTYTIALFG 281
Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
+L E + I SRTGKL D P +L +
Sbjct: 282 PSDPAKLL---PENGRSIAIKSRTGKLADIPPSDILEKI 317
>gi|282896719|ref|ZP_06304727.1| Glycosyl transferase, family 9 [Raphidiopsis brookii D9]
gi|281198437|gb|EFA73325.1| Glycosyl transferase, family 9 [Raphidiopsis brookii D9]
Length = 317
Score = 105 bits (262), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 91/342 (26%), Positives = 159/342 (46%), Gaps = 36/342 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ G + + +LFFP + L+ YP IDVI R K +++NK+V +D D
Sbjct: 2 RVVALVPGSIADQILFFPTLDSLQRSYPDAWIDVIVEPRSKVAYQVNKSVHEVFSFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A+++++LG +++R YD+ ++ + GL FL+++ R+ + +G
Sbjct: 62 R-NSLADWSNLLGTIRDREYDVAITARESWFVGL----FLWLSGIPTRIGF-----EGSG 111
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
A L S N S+ ++Y ++ LG PPL +++ + E +
Sbjct: 112 ANFLTHVI----SPNTSQYISHIYHDLLRPLGI------NTSCPPLAINLVKPDVEWAKQ 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ K G + YI+I+G + S D D++ PI W +I ++ + P+ V+
Sbjct: 162 QQKRLGIGETGYILIYG----GYGELSSEPD-DNIYPIDSWQQIIGECQQKQPDLPILVV 216
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
+ + V +V I I P G+LAA+I ++ ++ N + +QLA A + +IA
Sbjct: 217 KEQGDDYFVPSLVESFPDIKVIPVPDIGKLAAIIGSASLMVTVNGSPLQLAIAIQTYTIA 276
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L SS +L N K I S TGK D P VL +
Sbjct: 277 LLSSADSNKLLPVN---DKFLAIKSPTGKTADIPPATVLKTI 315
>gi|298489841|ref|YP_003720018.1| family 9 glycosyl transferase ['Nostoc azollae' 0708]
gi|298231759|gb|ADI62895.1| glycosyl transferase family 9 ['Nostoc azollae' 0708]
Length = 319
Score = 104 bits (260), Expect = 9e-20, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 161/342 (47%), Gaps = 35/342 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R + G + + +LFFP + LK YP IDVI ++ K + L+K+V +D D
Sbjct: 2 RVVAFVPGSISDQILFFPTLDDLKRYYPDAQIDVIVESQSKAAYRLSKSVHEVLTFDYSD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ +++G +++R YD+ ++T GL FL++T R+ Y G
Sbjct: 62 H-NSLADWGNLVGTIRDREYDVAMTTGESWFIGL----FLWLTGIPTRIGY-----KGKG 111
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
AG L + N S+ MY ++ +G + P P L ++ + + +
Sbjct: 112 AGFLTNTI----RRNTSQYVAAMYHDLLKPVGI------KTPCPELTANVPKPDIQWAQQ 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIP 358
+ K G + YI+I+G S S Q G D+ PI W +I + +P V+
Sbjct: 162 EQKRLGIHETGYILIYG---GSSHSSQIDG-ADTSYPIASWQQIIQDCQHKQPELPIVLI 217
Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + E V+ + S + IT+P G+L A+I+ ++ +++ +QLA A + +IA
Sbjct: 218 KEADDENFVRVLLESCSNLKITSPDDIGKLTAIISGASLMLSIENGPLQLAVAVQTYTIA 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L SS +L +P ++ K I S +GK +D P VL +
Sbjct: 278 LLSSTDADKL-LPKSD--KFLAIKSHSGKTVDIPPPTVLEKI 316
>gi|282900108|ref|ZP_06308065.1| Glycosyl transferase, family 9 [Cylindrospermopsis raciborskii
CS-505]
gi|281194990|gb|EFA69930.1| Glycosyl transferase, family 9 [Cylindrospermopsis raciborskii
CS-505]
Length = 317
Score = 103 bits (256), Expect = 2e-19, Method: Compositional matrix adjust.
Identities = 91/342 (26%), Positives = 159/342 (46%), Gaps = 37/342 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ G + + +LFFP + L+ YP IDVI R K +++NK V ++D D
Sbjct: 2 RVVALVPGSIADQILFFPTLDSLQRSYPDAWIDVIVEPRSKVAYQVNKYVHEVFLFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS---TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A+++++LG +++R YD+ ++ + GL FL+++ R+ + +G
Sbjct: 62 R-NSLADWSNLLGTIRDREYDLAITAGESWFVGL----FLWLSGIPTRIGF-----QGSG 111
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
+ L S N S+ ++Y ++ LG PPL +++ + E +
Sbjct: 112 SNFLTHVI----SPNTSQYIPHIYHDLLQPLGI------NTACPPLAINLLKPDVEWAKQ 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ K G + YI+I+G D + D D++ PI W +I ++ + P+ V+
Sbjct: 162 QQKLLGIGETGYILIYGGYGDLSS------DPDNIYPIDSWRQIIGECQQKQADLPILVV 215
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
+ + V +V I I P G+LAA+I ++ +I N + +QLA A + +IA
Sbjct: 216 KEQGDDYFVPSLVESFPDIKAIPVPDIGKLAAIIGGASLMITVNGSPLQLAIAIQTYTIA 275
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L SS +L N K I S TGK D P VL +
Sbjct: 276 LLSSADSTKLLPVN---YKFLAIKSPTGKTADIPPATVLKTI 314
>gi|428780992|ref|YP_007172778.1| ADP-heptose--LPS heptosyltransferase [Dactylococcopsis salina PCC
8305]
gi|428695271|gb|AFZ51421.1| ADP-heptose:LPS heptosyltransferase [Dactylococcopsis salina PCC
8305]
Length = 320
Score = 102 bits (253), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 159/341 (46%), Gaps = 32/341 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R +I GG+ + +LFFP + LK+ YP IDV+ R K + + +V ++D +
Sbjct: 2 RILTLIPGGIGDQILFFPTLADLKNTYPEAKIDVLVEPRSKSAYRVCSHVDEVLLFDYKN 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFL-FMTTARDRVSY-IYPNVNAAGA 241
A+Y ++LG++++R Y++ L+ L FL ++ RV Y P+ +
Sbjct: 62 R-NSLADYLNLLGIIRDREYEVALT--LGRRWTVGFLLWLNGIPLRVGYETSPSFFFSQT 118
Query: 242 GLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
L +E + A+ + +G N+ P PPL V++ + + ++
Sbjct: 119 VPLKTEQYAAQMYHDLLQGLNI----------------NTPCPPLSVNVPKSDIQWGEQE 162
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
K + G YIVIHG S Q +G D + P+ W EI + + P+ ++
Sbjct: 163 QKRLSIQDGNYIVIHG---GSSQLAQEKG-IDKVYPVSKWQEIIADFQSKQPDLPIVLLQ 218
Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
++ + VE ++ + ++ + P G+LA +I + ++ T++A + LA A +IAL
Sbjct: 219 GPEDEQWVEQLLEGNPNLKITSPPDIGKLAPMIAGGSLMLCTDSAPMHLAIAVGTYTIAL 278
Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
F +L P E K C + S TG + D V +VL +
Sbjct: 279 FGPTDSKKLLPP--ENKACLGVKSPTGAIADIEVNSVLEKI 317
>gi|428773015|ref|YP_007164803.1| glycosyl transferase family protein [Cyanobacterium stanieri PCC
7202]
gi|428687294|gb|AFZ47154.1| glycosyl transferase family 9 [Cyanobacterium stanieri PCC 7202]
Length = 320
Score = 100 bits (248), Expect = 2e-18, Method: Compositional matrix adjust.
Identities = 82/341 (24%), Positives = 158/341 (46%), Gaps = 32/341 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R +I GG+ + +LFFP ++ LKD+YP +IDVI R K + + +V+ V+D D
Sbjct: 2 RILALIPGGIGDQILFFPTLKTLKDQYPKAVIDVIVEPRSKNAYRVCPHVKEVLVFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A+Y ++LG++++R Y++ ++ L++ +RV Y P
Sbjct: 62 K-NGLADYLNLLGIIRDREYELAVTLG-RNWAVGFLLWLNGIPNRVGYKGP--------- 110
Query: 244 LLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
F + L Y MY +V L +P PPL +++ + + +
Sbjct: 111 --KSWFINNPVELKTEQYAAYMYHDLVHGLNI------TNPCPPLSINVPKEDIQWAESE 162
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
+ ++ YI+IHG S A +++G D + P+ W I ++ + P+ ++
Sbjct: 163 QRRLDIKESGYILIHG---GSSALAKTKG-IDKIYPVAKWQRIVEDVQRKQPNLPIVLLN 218
Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
++E +++ ++ I+ P G+L+A I + ++ T++A + L+ A +IAL
Sbjct: 219 GPDDQEWTAEMLQLCNNLKVISPPDIGKLSAFIAGANLMLCTDSAPMHLSVAVGTYTIAL 278
Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
F +L PN++ + I S + + D E +L M
Sbjct: 279 FGPTQADKLLPPNSD--RFIGIQSLSKNIADISTEKILEKM 317
>gi|411118898|ref|ZP_11391278.1| ADP-heptose:LPS heptosyltransferase [Oscillatoriales cyanobacterium
JSC-12]
gi|410710761|gb|EKQ68268.1| ADP-heptose:LPS heptosyltransferase [Oscillatoriales cyanobacterium
JSC-12]
Length = 320
Score = 99.4 bits (246), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 89/341 (26%), Positives = 154/341 (45%), Gaps = 32/341 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDV+ R + ++K+VR +D
Sbjct: 2 RIVALVPGGIGDQILFFPTLDDLKKTYPESQIDVVVEPRAAAAYRVSKSVRDVIPFDFKG 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ ++LGV+++R YD+ LS L G FL T I V AG G
Sbjct: 62 R-NSLADFGNLLGVIRDREYDVALS--LGQRGAVGFLLWLTG------IPIRVGFAGGG- 111
Query: 244 LLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
E F + L + Y MY ++ LG P P + VS+ + + +
Sbjct: 112 --GERFLTNPVPLKKDQYAACMYHDLLKGLGI------DSPCPDIAVSVPAKDLDWADAE 163
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
K G G Y++IHG + + D + P+ W I + + P+ V+
Sbjct: 164 RKRLGIGTGGYVIIHG----GSSQLAKEKGIDKIYPVDNWKGIIQDFNQRQPDLPIVVVQ 219
Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
++E V+ ++ + ++ + P G+L A+I ++ ++ T++A + LA A + ++AL
Sbjct: 220 GPDDQEFVQALLKVNPNLKITSPPDIGKLTAMIAGASLMLCTDSAPMHLAVAVKTYTLAL 279
Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
F +L PN + + I S TGK+ D VL +
Sbjct: 280 FGPTDPAKLL-PN--DSRFVGIKSMTGKMEDIVPATVLEKV 317
>gi|428206460|ref|YP_007090813.1| glycosyl transferase family protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428008381|gb|AFY86944.1| glycosyl transferase family 9 [Chroococcidiopsis thermalis PCC
7203]
Length = 320
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 93/344 (27%), Positives = 162/344 (47%), Gaps = 44/344 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD- 182
R ++ GG+ + +LFFP + LK YP IDV+A + + K+V +NV D
Sbjct: 2 RIVALVPGGIGDQILFFPTLDDLKRNYPDAEIDVVAEPGSLGAYRICKSV--SNVLKFDF 59
Query: 183 DDWPEPAEYTDILGVMKNRYYDM-VLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGA 241
D E+ D++G +++R YD+ + + K +G A L+++ RVSY G
Sbjct: 60 KDRNSLTEWVDLIGNIRDREYDIAICAGKRPFVGMA--LWLSGVAVRVSY-------QGG 110
Query: 242 GLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
G L F S+ L Y MY ++ LG P P L +++ + E
Sbjct: 111 GNL----FLTNSVPLKTEQYVAAMYHDLLKGLGI------STPCPDLTINVPKSDLEWAD 160
Query: 300 EKYKNAGAEQGKYIVIHGIESD---SKASMQSRGDTDSLLPIQVWAEIANGLREFR---P 353
++ + G + YI+IHG S +K S+Q+ P+Q W ++ L++ + P
Sbjct: 161 KEQQRLGIKDSGYILIHGGSSQLAAAKTSIQT-------YPVQSWRQVIQNLQQRQPDLP 213
Query: 354 LFVIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
+ ++ E E + + + + +T P G+ AA++ + ++ T++A + LA A +
Sbjct: 214 IAILVQTAEDEQLVRSLQEAFPDLKVTNPPDIGKTAAIVAAANLLLCTDSAPMHLAVAVQ 273
Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
+IALF + +L P+ +C I S TGKL D + VL
Sbjct: 274 TYTIALFGATDPKKLIPPS---DRCIAIKSPTGKLSDIAPQTVL 314
>gi|443320916|ref|ZP_21049989.1| ADP-heptose:LPS heptosyltransferase [Gloeocapsa sp. PCC 73106]
gi|442789380|gb|ELR99040.1| ADP-heptose:LPS heptosyltransferase [Gloeocapsa sp. PCC 73106]
Length = 320
Score = 98.2 bits (243), Expect = 7e-18, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 157/343 (45%), Gaps = 42/343 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP ++ LK ++P LIDVI R K + + +V +D D
Sbjct: 2 RILTLVPGGIGDQILFFPTLKDLKTQFPKALIDVIVEPRAKAAYRVCADVNEVLTFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A+Y ++LGV+++R YD L+ LG L++ RV Y PN
Sbjct: 62 R-NSMADYLNLLGVIRDREYDAALT-----LGQRWTVGLLLWLNGIPTRVGYRSPNA--- 112
Query: 240 GAGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSI-SRRLKE 296
F + + L Y +MY ++ LG + P PP+ + I ++ L
Sbjct: 113 --------WFISHPVPLKTEQYAAHMYHDLLQGLGI------KKPCPPVSIDIPAQDLDW 158
Query: 297 VVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP-LF 355
EK + G YI+IHG S Q +G D + P W + ++ +P L
Sbjct: 159 AQGEKQRLELQTDG-YILIHG---GSSQLAQVKG-IDKIYPAIKWQIVIQDIQRQKPELP 213
Query: 356 VIPHEKEREGV-EDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREK 411
++ + +GV + + S + +T+P G+LAA+I D+ ++ T++A + LA A
Sbjct: 214 IVLIKGPEDGVWSEQMLKTCSNLKVTSPPDLGKLAAMIKDARLMLCTDSAPMHLAIALLT 273
Query: 412 PSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
P+IALF +L P ++ I S T + D E +L
Sbjct: 274 PTIALFGPTSATKLIPPQNDQ--VIGIQSPTKAIADIVPETIL 314
>gi|428297573|ref|YP_007135879.1| glycosyl transferase family protein [Calothrix sp. PCC 6303]
gi|428234117|gb|AFY99906.1| glycosyl transferase family 9 [Calothrix sp. PCC 6303]
Length = 319
Score = 98.2 bits (243), Expect = 9e-18, Method: Compositional matrix adjust.
Identities = 86/339 (25%), Positives = 164/339 (48%), Gaps = 29/339 (8%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + LLFFP + LK + IDV+ + K + ++K+V +D D
Sbjct: 2 RVVALVPGGIDDQLLFFPTLDDLKRYKSDIQIDVVVEPKSKAAYRVSKSVNQVIGFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ +++G +++R YD+ ++ + L A L++T R+ Y G+ L
Sbjct: 62 R-NSLADWGNLIGTIRDREYDVAITVGQSWLMGFA-LWLTGIPIRIGY-----KGQGSFL 114
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
S + AE + NMY ++ L V P L +++ + + ++ +
Sbjct: 115 TNSVPYKAEQYKV-----NMYHDLLQGL------VIDSPAGELTINVPKTDIDWAEKEQR 163
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI--PHEK 361
G ++ YI+I+G S ++G + + P+ W +I ++ +P I E
Sbjct: 164 RLGVKETGYILIYGGSSQVAG---TKGVDEEVYPVTNWKQIVKDFQDKQPDLAILAIQEP 220
Query: 362 EREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ +G + + + IT+P G+LAA+I+ + ++ T++A++QLA A + +I LF
Sbjct: 221 DNDGFANSFREIFPNIKITSPEDVGKLAAMISGANLILCTSSASMQLAIAVQTYTIGLFG 280
Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
S +L +P +E K I S TGKL+D E V+ +
Sbjct: 281 SADPMKL-MPKSE--KFLPIQSSTGKLVDISPETVMQKV 316
>gi|428226657|ref|YP_007110754.1| glycosyl transferase family protein [Geitlerinema sp. PCC 7407]
gi|427986558|gb|AFY67702.1| glycosyl transferase family 9 [Geitlerinema sp. PCC 7407]
Length = 319
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 93/342 (27%), Positives = 152/342 (44%), Gaps = 39/342 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP +DV+ R K + + V V+D
Sbjct: 2 RILALVPGGIGDQILFFPTLDSLKQAYPKAKLDVVVEPRAKAAYRVCPFVNEEIVFDFKG 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ ++LGV+++R YD VLS + L++T RV Y +AGA
Sbjct: 62 S-NSLADWGNLLGVIRDREYDAVLSLGRS-WAVGVLLWLTGVPTRVGY----AGSAGAWC 115
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
L + + + + Y ++ LG P P R+S+ +R E + K
Sbjct: 116 LTNPV----PLKTEQYAASQYHDLLSGLGI------SGPCPDPRISVPKRDIEWAEAEQK 165
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSR-GDTDSLLPIQVWAEIANGLREFR------PLFV 356
G + Y++IH AS SR + P Q W I +R+FR P+ +
Sbjct: 166 RLGIHESGYVLIH-----PGASALSRLKGIEKTYPAQSWKSI---IRDFRQRQPGLPVVL 217
Query: 357 IPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPS 413
+ ++RE VE + D + V I TP G+LAALI + ++ T++ + LA A + +
Sbjct: 218 VQGPEDREIVEAIAAD--ADVKIATPDDIGKLAALIAGANLMLCTDSGPMHLAVAVQTYT 275
Query: 414 IALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
IALF +L K + S T ++ D + VL
Sbjct: 276 IALFGPTDSTKLL---PTSSKIIALQSTTNRVADITPDDVLK 314
>gi|186685109|ref|YP_001868305.1| glycosyl transferase family protein [Nostoc punctiforme PCC 73102]
gi|186467561|gb|ACC83362.1| glycosyl transferase, family 9 [Nostoc punctiforme PCC 73102]
Length = 319
Score = 97.8 bits (242), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 84/340 (24%), Positives = 156/340 (45%), Gaps = 31/340 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDV+ R K + ++K+V +D +D
Sbjct: 2 RVVALVPGGIGDQILFFPTLDDLKRNYPDAQIDVVVEPRSKAAYRVSKSVHEVLNFDFND 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ +++G +++R YD+V+ K + L++T R+ Y +
Sbjct: 62 R-NSLADWGNLVGTIRDREYDVVIVVKQI-WLLSLLLWLTGIPIRIGY------KGNGSV 113
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
L+ ++ Y+ Q ++ PVP L V++ + E ++ K
Sbjct: 114 FLTHAVPFKASQYVAAAYHDLLQPLEI---------NSPVPELAVNVPKPDIEWAQKEQK 164
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIPHE 360
G + YI+IHG + + D + P++ W +I G ++ + P+ V+
Sbjct: 165 RLGVHETGYILIHG----GSGQLSQAKELDKIYPVESWHQIIQGFQDKQPDLPVVVVKGI 220
Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
+ + V ++G I +T P G+L A+I + +++T++ A+QL+ A + +IALF
Sbjct: 221 GDEQFVRSLLGSSPDIK-VTAPDDIGKLTAMIAGANLMLSTDSPALQLSVAVQTYTIALF 279
Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
+L N K I+S TGK D AVL +
Sbjct: 280 GPTDPAKLLPKN---DKFLAIASPTGKTADVSPNAVLEKI 316
>gi|428211909|ref|YP_007085053.1| ADP-heptose--LPS heptosyltransferase [Oscillatoria acuminata PCC
6304]
gi|428000290|gb|AFY81133.1| ADP-heptose:LPS heptosyltransferase [Oscillatoria acuminata PCC
6304]
Length = 324
Score = 97.4 bits (241), Expect = 1e-17, Method: Compositional matrix adjust.
Identities = 92/345 (26%), Positives = 160/345 (46%), Gaps = 36/345 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDV+ R K + ++K ++ L
Sbjct: 2 RILALVPGGIGDQILFFPTLDSLKRVYPNAYIDVVVEPRSKGAYRVSKFFHDKSLQVLPF 61
Query: 184 DWPEP---AEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVN 237
D+ + A++ ++LG+M++R YD V+S + GL L++T R+SY +
Sbjct: 62 DFKDRNGLADWGNLLGIMRDREYDAVISLGQRWVVGL----LLWLTGIPIRISY-----S 112
Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
G LL+ ++ + MY +++ G P P L V++ +
Sbjct: 113 DLGNSWLLTNPVPLKT---EQYAAGMYHDLLEGFGVSM------PCPELSVTLLKEDLRW 163
Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF-- 355
E+ + G + YI+IHG S Q +G D + P++ W +I L+ +P
Sbjct: 164 SDEEQQRLGVKDSGYILIHGGASQLS---QIKG-LDKIYPVEKWQQIIQDLQTRQPQLPV 219
Query: 356 VIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKP 412
V+ E + + D + V +T+P G+LAA I + ++ T++A + LA A +
Sbjct: 220 VLIQGPEDKPLCDQLVRFCPGVKVTSPPDVGKLAATIAAANLMLCTDSAPMHLAVAVQTY 279
Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
+IALF G+L P + I S TG++ D E VL +
Sbjct: 280 TIALFGPTDPGKLLPPG---DRYIGIQSSTGRIADISPEKVLEKV 321
>gi|427717097|ref|YP_007065091.1| glycosyl transferase family protein [Calothrix sp. PCC 7507]
gi|427349533|gb|AFY32257.1| glycosyl transferase family 9 [Calothrix sp. PCC 7507]
Length = 319
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 85/343 (24%), Positives = 163/343 (47%), Gaps = 37/343 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + L+ YP IDV+ R K + ++K+V +D D
Sbjct: 2 RVVALVPGGIGDQILFFPTLDNLQRYYPNAQIDVVVEPRSKAAYRVSKSVHEVLTFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++++++G +++R YD ++ + + L L++T R+ Y G+
Sbjct: 62 R-NSLADWSNLVGSIRDREYDTAIALEQSWL-VGLLLWLTGIPTRIGY-----QGKGSVF 114
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
L + S+ Y ++ LG P P L V+I + E + K
Sbjct: 115 LTNPV----PFKTSQYVAAAYHDLLQPLGI------ESPYPELAVNIPKPDIEWAQNEQK 164
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR------PLFVI 357
G + Y++I+G SK S ++ D D++ P+ W +I ++EF+ P+ +I
Sbjct: 165 RLGVHETGYVLIYG---GSKQSSPTQ-DQDTIYPVDKWQQI---IQEFQHKQPDLPIVLI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
++ E V ++ D + V +T+P G+LAA++ ++ ++ T++ +QL+ A + +I
Sbjct: 218 KTPEDEEFVRSLL-DSSPNVKVTSPDDIGKLAAIVGGASLMLTTDSVPLQLSVAVQTYTI 276
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
ALF ++ + K I S TGK+ D + VL +
Sbjct: 277 ALFGPTDPAKVL---PKSDKFLAIKSPTGKVADISPQVVLQKI 316
>gi|428770822|ref|YP_007162612.1| glycosyl transferase family protein [Cyanobacterium aponinum PCC
10605]
gi|428685101|gb|AFZ54568.1| glycosyl transferase family 9 [Cyanobacterium aponinum PCC 10605]
Length = 320
Score = 97.1 bits (240), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 82/339 (24%), Positives = 157/339 (46%), Gaps = 28/339 (8%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R +I GG+ + +LFFP +Q LK++YP IDVI R K ++ + K+V+ V+D D
Sbjct: 2 RVLVLIPGGISDQILFFPTLQTLKNKYPQATIDVIVEPRSKNSYRICKHVQEVLVFDFQD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A+Y ++LG++++R Y++V++ + + + L++ RV Y +
Sbjct: 62 R-NGLADYLNLLGMIRDREYELVITLEQNWIVN-FLLWLDGIPTRVGY------QSSNSW 113
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
L+ T N + MY ++ L P P L V++ R E + K
Sbjct: 114 FLNCTIPK---NTDQYIPFMYHDLLKALKI------DDPCPDLSVNVPREDIEWAESEQK 164
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIPHE 360
+ YI+IHG + + + + + P+ W + +R + P+ ++
Sbjct: 165 RLKIKDTGYIIIHG----GASILTAYQGINKIYPVPKWQRVIEDIRMKQPDIPIVLLCGP 220
Query: 361 KEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ E ++V + ++ P G+LAA+I + ++ T++A +QLA A +IALF
Sbjct: 221 DDLEWTTEIVSLCPYVKVVSPPDIGKLAAIIAGANLMLCTDSAPMQLAIAVGVYTIALFG 280
Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
+L P + + I S + ++ D P + +L +
Sbjct: 281 PTKTTKLLPPKCD--RVFGIQSLSSQIADIPTDKILEQI 317
>gi|428305070|ref|YP_007141895.1| glycosyl transferase family protein [Crinalium epipsammum PCC 9333]
gi|428246605|gb|AFZ12385.1| glycosyl transferase family 9 [Crinalium epipsammum PCC 9333]
Length = 320
Score = 96.7 bits (239), Expect = 2e-17, Method: Compositional matrix adjust.
Identities = 92/341 (26%), Positives = 157/341 (46%), Gaps = 32/341 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP I LK YP IDVI R + + K+V +D +
Sbjct: 2 RILALVPGGIGDQILFFPTIDDLKQNYPDAQIDVIVEPRSTGAYRVCKSVNEVMRFDFRN 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ ++LG++++R YD +S G L++T RV Y ++G
Sbjct: 62 R-NGMADWGNLLGIIRDREYDFAVSLGRT-WGVNFLLWLTGIATRVGY------SSG--- 110
Query: 244 LLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
L E F + + L Y MY ++ G S P PPL V++ ++ E
Sbjct: 111 -LKEAFISNPVPLKTEQYTAEMYHDLIK--GIDINS----PCPPLAVNVPKQDIEWAELA 163
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
+ G + YIVIHG S QS+G D P++ W +I +++ + P+ VI
Sbjct: 164 QQRLGINETGYIVIHG---GSSQISQSQG-IDKTYPVKKWQQIVEDIQQRQPNLPVVVIN 219
Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
++ V ++G + I+ P G+LAALI + ++ T++A + +A A IAL
Sbjct: 220 GPDDQAFVSSLLGYCPGVKVISPPDIGKLAALIAGANLMLCTDSAPMHIAVAVGTYLIAL 279
Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
F + + +P ++ + + S T + D P VL +
Sbjct: 280 FGPT-EAKKLLPKSD--RTIGLQSSTRSVADIPPADVLEKI 317
>gi|440681641|ref|YP_007156436.1| glycosyl transferase family 9 [Anabaena cylindrica PCC 7122]
gi|428678760|gb|AFZ57526.1| glycosyl transferase family 9 [Anabaena cylindrica PCC 7122]
Length = 314
Score = 96.7 bits (239), Expect = 3e-17, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 154/343 (44%), Gaps = 42/343 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFF + LK YP IDVI R K + ++K+V +D D
Sbjct: 2 RVVALVPGGIGDQILFFATLDDLKRNYPHAQIDVIVEPRSKAAYRVSKSVHEVLSFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ +++G+M++R YD+ ++ + GL L++T R+ Y G
Sbjct: 62 R-NSLADWGNLVGMMRDREYDIAITVEPSWFVGL----LLWLTGIPTRIGY-----KGKG 111
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
AG L + +N S+ MY ++ LG P P L V++ + E +
Sbjct: 112 AGFLTNSV----PVNTSQYVAAMYHNLLQPLGI------NTPCPDLAVNVPKSDLEWAQQ 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ K G + YI+I G + +D+ P+ W +I + + P+ VI
Sbjct: 162 EQKRLGVHETGYILISGGSGNC---------SDTSYPVASWQQIIQDCQNKQPHLPVMVI 212
Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
+ + V + + + IT+P G+L A+I ++ +++ +A +QL+ A + +I
Sbjct: 213 KEANDEQFVRS-LQESCPDIKITSPDDIGKLTAIIGGASLMLSIESAPLQLSVAVQTYTI 271
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L S G+L + K I S TGK D P VL +
Sbjct: 272 TLLGSTDPGKLL---PKSDKFLAIKSPTGKTGDIPPAIVLEKI 311
>gi|428319576|ref|YP_007117458.1| glycosyl transferase family 9 [Oscillatoria nigro-viridis PCC 7112]
gi|428243256|gb|AFZ09042.1| glycosyl transferase family 9 [Oscillatoria nigro-viridis PCC 7112]
Length = 321
Score = 95.1 bits (235), Expect = 7e-17, Method: Compositional matrix adjust.
Identities = 89/345 (25%), Positives = 159/345 (46%), Gaps = 44/345 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDV+ R K + + K+V+ Y+ D
Sbjct: 3 RILALVPGGIGDQILFFPTLDDLKQSYPEAQIDVMVEPRAKGAYRVCKSVKEVLTYNFKD 62
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A++ ++LGV+++R Y+ V+S LG L++T RV Y N A
Sbjct: 63 R-NAMADWGNLLGVIRDREYEAVIS-----LGQRWTVGLLLWLTGIPQRVG--YSGTNGA 114
Query: 240 ----GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLK 295
A L +E + A + +G+N+ ++P PPL +++ ++
Sbjct: 115 RFLTAAVPLKTEQYAASMYHDLLQGFNI----------------KNPCPPLAINVPKQDI 158
Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR--- 352
+ + + G ++ YI+IHG S S D + P W +I ++ +
Sbjct: 159 QWAEAEQQRLGVKESGYILIHGGSSQLAVSK----GIDKIYPTDNWKQIIEDCQQRQPNL 214
Query: 353 PLFVIPHEKEREGVEDVVGDDASIVFIT--TPGQLAALINDSAGVIATNTAAIQLANARE 410
P+ V+ ++ + V +V ++ I+ G+LAA I + +I T++A + LA A +
Sbjct: 215 PVVVVKGPEDADFVTKLVELCPNVKVISPDDVGKLAATIAAANLMICTDSAPMHLAVAVQ 274
Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
+IALF +L P ++ + I S TGK+ D + VL
Sbjct: 275 TYTIALFGPTEPAKLL-PKSD--RFLGIKSPTGKMADISPQEVLK 316
>gi|334119028|ref|ZP_08493115.1| glycosyl transferase family 9 [Microcoleus vaginatus FGP-2]
gi|333458499|gb|EGK87116.1| glycosyl transferase family 9 [Microcoleus vaginatus FGP-2]
Length = 321
Score = 94.7 bits (234), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 90/343 (26%), Positives = 157/343 (45%), Gaps = 36/343 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDVI R K + + K+V+ Y+ D
Sbjct: 3 RILALVPGGIGDQILFFPTLDDLKQSYPEAQIDVIVEPRAKGAYRVCKSVKEVLTYNFKD 62
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A+ ++LGV+++R Y+ V+S LG L++T RV Y N
Sbjct: 63 R-NAMADLGNLLGVIRDREYEAVIS-----LGQRWTVGLLLWLTGIPQRVGYSGTN---- 112
Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
GA L + A + + +MY ++ +P PPL +++ ++ +
Sbjct: 113 GARFLTA----AIPLKTEQYAASMYHDLLQGFNI------TNPCPPLAINVPKQDIQWAE 162
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFV 356
+ + G ++ YI+IHG S S D + P W +I ++ + P+ V
Sbjct: 163 AEQQRLGVKESGYILIHGGSSQLAVSK----GIDKIYPTDNWKQIIEDCQQRQPNLPVVV 218
Query: 357 IPHEKEREGVEDVVGDDASIVFIT--TPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
+ ++ E V +V ++ I+ G+LAA I + +I T++A + LA A + +I
Sbjct: 219 VKGPEDAEFVTKLVELCPNVKVISPDDVGKLAATIAAANLMICTDSAPMHLAVAVQTYTI 278
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
ALF +L P ++ + I S TGK+ D + VL +
Sbjct: 279 ALFGPTEPAKLL-PKSD--RFLGIKSPTGKMADISPQEVLKKV 318
>gi|427730985|ref|YP_007077222.1| ADP-heptose--LPS heptosyltransferase [Nostoc sp. PCC 7524]
gi|427366904|gb|AFY49625.1| ADP-heptose:LPS heptosyltransferase [Nostoc sp. PCC 7524]
Length = 319
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 80/339 (23%), Positives = 156/339 (46%), Gaps = 29/339 (8%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP +DV+ R K + ++K+V +D D
Sbjct: 2 RVVALVPGGIGDQILFFPTLDDLKRYYPNAQLDVVTEPRSKAAYRVSKSVNEVLSFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ +++G +++R YD+ ++ + L FL++T R+ Y G+G
Sbjct: 62 R-NSLADWGNLVGTIRDREYDLAITVGQSWL-VGLFLWLTGIPTRIGY-----QGKGSGF 114
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
L S+ +Y ++ LG P P L V++ + E + K
Sbjct: 115 LTKSV----PFKPSQYAATVYHDLLQPLGI------DTPTPELGVNVPKPDIEWAQNEQK 164
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHEK 361
G + Y++I+G + S D++ P++ W EI + +P V+
Sbjct: 165 RLGVNETGYVLIYG----GSGWVSSTKGADAIYPLENWQEIIQDFQHKQPDLPIVVIQGA 220
Query: 362 EREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
EG + D ++ + +T P G+LAA+I ++ +++ +A +QL+ A + +IAL
Sbjct: 221 NDEGFVRSLRDLSANIKVTAPEDIGKLAAIIAGASLMLSNESAPLQLSVAVQTYTIALLG 280
Query: 419 SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
+ ++ P ++ K I S +G++ D + VL +
Sbjct: 281 ATEPAKVL-PKSD--KFLGIKSPSGRVADISPQQVLQKI 316
>gi|119510034|ref|ZP_01629175.1| Glycosyl transferase, family 9 [Nodularia spumigena CCY9414]
gi|119465358|gb|EAW46254.1| Glycosyl transferase, family 9 [Nodularia spumigena CCY9414]
Length = 318
Score = 94.4 bits (233), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 159/341 (46%), Gaps = 34/341 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP +DV+ R K + ++K+V +D +D
Sbjct: 2 RVVALVPGGIGDQILFFPTLDDLKRNYPNAQLDVVVEPRSKAAYRVSKSVNDVLSFDYND 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++++++G +++R YD V++ + L L++T RV Y GAG
Sbjct: 62 R-NSLADWSNLVGTIRDREYDAVITVGQSWL-MGLLLWLTGIPTRVGY-----QGKGAG- 113
Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
F +S+ Y +Y ++ +PF + P L V+I + E +
Sbjct: 114 -----FLTDSVPFQANKYVAAVYHDLL----QPFGI--KTPCRELAVNILKPDIEWSQNE 162
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPH 359
K G + YI+IH + +S + D D L P++ W +I ++ +P V+
Sbjct: 163 QKRLGVNETGYILIH-----AGSSQVTPTDVDKLYPVENWQQIIQECQQKQPDLPVVVIQ 217
Query: 360 EKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
E + + + + + +T P G++AA+I ++ +++T+ + L+ A + +IAL
Sbjct: 218 GSENDPIRRSLLERTPGIKVTYPTNIGKIAAMIGGASLMLSTDDPLLHLSVAVQTYTIAL 277
Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
F +L + K I S TG++ D + VL+ +
Sbjct: 278 FGPTDPAKLL---PKSDKFLAIKSPTGQMADISPQTVLDKI 315
>gi|428308848|ref|YP_007119825.1| ADP-heptose--LPS heptosyltransferase [Microcoleus sp. PCC 7113]
gi|428250460|gb|AFZ16419.1| ADP-heptose:LPS heptosyltransferase [Microcoleus sp. PCC 7113]
Length = 320
Score = 94.0 bits (232), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 94/343 (27%), Positives = 164/343 (47%), Gaps = 36/343 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + L+ YP IDVI R K + ++ +V +D D
Sbjct: 2 RILALVPGGIGDQVLFFPTLDDLRAYYPTAEIDVIVEPRAKGAYRVSGSVDEVLTFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
A++ ++LG++++R YD+ LS + A +G L+M+ R+ Y A AG
Sbjct: 62 R-NGLADFGNLLGIIRDREYDVALSLAQRASIG--LLLWMSGIPTRIGY------EANAG 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
+ F +++ L Y ++Y M+ LG P P L +S+ + E
Sbjct: 113 ----KWFLTDTVPLKTEQYTADIYHDMLQALGI------NTPCPDLTLSVPKPDIEWAER 162
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHE 360
+ + G + YI+ I++ S Q +G T+ + P++ W +I +++ +P I
Sbjct: 163 EQQRLGIAESGYIL---IDNSSTQLAQEKG-TEQVYPVEKWRQILADIQQKQPNLPIVML 218
Query: 361 KEREGVEDVVGDDASI--VFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
+ E + S + IT+P G+LAA+I + +I T++A + +A A +IA
Sbjct: 219 RGSEDAQMSATMLQSFPDLKITSPADIGKLAAMIAGANLIICTDSAPMHVAVAVGTYTIA 278
Query: 416 LFS-SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
LF +E K RL P +E C I S T ++ D E VL +
Sbjct: 279 LFGPTEAKKRL--PQSE--NCMSIQSPTKQIADIKPEDVLKQL 317
>gi|428778324|ref|YP_007170111.1| glycosyl transferase family protein [Halothece sp. PCC 7418]
gi|428692603|gb|AFZ45897.1| glycosyl transferase family 9 [Halothece sp. PCC 7418]
Length = 320
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 86/343 (25%), Positives = 156/343 (45%), Gaps = 36/343 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK+ YP IDV+ R K + + V V+D +
Sbjct: 2 RILTLVPGGIGDQILFFPTLADLKNAYPEAQIDVLVEPRAKSAYRVCSYVHDVLVFDYKN 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A+Y ++LGV+++R Y++ L+ LG L++ RV Y ++
Sbjct: 62 R-NSLADYLNLLGVIRDREYEVALT-----LGRRWTVGFLLWLNGIPVRVGY-----ESS 110
Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
G+ L S + + MY ++ L P PPL+V++ + +
Sbjct: 111 GSAFLSSTV----PLKTEQYAAQMYHDLLKGLNI------NTPCPPLQVNVPKSDIQWGK 160
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFV 356
++ + + G YIVIHG S Q +G D + P+ W +I L+ + P+ +
Sbjct: 161 QEQERLDIKDGNYIVIHG---GSSQLAQEKG-IDKIYPVTKWKDIIADLQAKQPNLPIVL 216
Query: 357 IPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
+ ++ VE ++ + + + P G+L+A+I + ++ T++A + LA A +I
Sbjct: 217 VQGPEDEMWVEQLLQVNPHLKVTSPPDIGKLSAMIAGGSLMLCTDSAPMHLAVAVGTYTI 276
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
ALF +L P + I S TG + D V +VL +
Sbjct: 277 ALFGPTEPKKLLPP--DNSSYVGIKSPTGAIADIEVNSVLEQI 317
>gi|425463419|ref|ZP_18842758.1| Glycosyl transferase [Microcystis aeruginosa PCC 9809]
gi|389833249|emb|CCI22392.1| Glycosyl transferase [Microcystis aeruginosa PCC 9809]
Length = 320
Score = 93.6 bits (231), Expect = 2e-16, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 159/342 (46%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + LLFFP ++ L+ YP IDV+ R K + L V+ ++D D
Sbjct: 2 RILTLVPGGISDQLLFFPTLETLRQTYPQGTIDVLVEPRAKAAYRLCPQVKEVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y+ +V + + P P PPL+++++R +
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYQALV----KGLKIAP--PCPPLKLTLNRDDIDWAEM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 EGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGERHR--YIEANTGKVADITPETVLKKI 317
>gi|307152521|ref|YP_003887905.1| glycosyl transferase family protein [Cyanothece sp. PCC 7822]
gi|306982749|gb|ADN14630.1| glycosyl transferase family 9 [Cyanothece sp. PCC 7822]
Length = 328
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 158/342 (46%), Gaps = 26/342 (7%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP ++ LK +YP +DVI R K + + V ++D D
Sbjct: 2 RILALVPGGIGDQILFFPTLEDLKQKYPNSTLDVIVEPRAKSAYRICPYVHEVLLFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A+Y ++LGV+++R YD+ +S L++ RV Y N +
Sbjct: 62 R-NGLADYLNLLGVIRDREYDVAISLP-QRWTIGLLLWLNGIPVRVGY---KTNTSVRKE 116
Query: 244 LLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
+ F ++ L Y MY ++ L + P PPL+V++ + + +
Sbjct: 117 FRTPIFLTNAVPLKTEQYAAYMYHDLLQGLDI------QTPTPPLKVTLPKDDIDWAEAE 170
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
K + YI+IHG S S A +G D P+ W +I + + + P+ ++
Sbjct: 171 QKRLEIKDTGYIIIHGGSSTSAA---IKG-VDKNYPLPQWQKIVDEILAKQPDLPIVLLQ 226
Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
++ + V+ ++ ++ +T P G+LAA+I + ++ T++ + L+ A +IA
Sbjct: 227 GPEDPQWVKQLLEGHPNLK-VTKPTDVGKLAAIIAGANLMLCTDSPPMHLSVAVGTYTIA 285
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
LF + +L P+ + I S++G L D E +L AM
Sbjct: 286 LFGPTVAEKLLPPDP--TRYIGIQSKSGNLADIKPETILEAM 325
>gi|354568658|ref|ZP_08987821.1| glycosyl transferase family 9 [Fischerella sp. JSC-11]
gi|353539912|gb|EHC09392.1| glycosyl transferase family 9 [Fischerella sp. JSC-11]
Length = 319
Score = 93.2 bits (230), Expect = 3e-16, Method: Compositional matrix adjust.
Identities = 88/341 (25%), Positives = 161/341 (47%), Gaps = 33/341 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDV+ R K + ++K+V ++D D
Sbjct: 2 RIVALVPGGISDQILFFPTLDDLKRNYPDAQIDVVVEPRSKAAYRVSKSVNDVLMFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ +++G++++R YD+ ++ L L++T R+ Y GA
Sbjct: 62 R-NSLADWANLVGILRDREYDVAIALT-QSLLMGLLLWLTGIPTRIGY-----KGKGAA- 113
Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
F ++ L + Y ++Y ++ LG + P P L V++ + + ++
Sbjct: 114 -----FLTNTVPLKQEQYAADVYHTLLQELGI------KSPCPQLAVNVPKLDIDWADKE 162
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEK 361
G ++ YI+IH + + + D P + W +I ++ +P + +
Sbjct: 163 QLRLGVKETGYILIH----NGVNELPANQGQDRTYPAENWLDIIQDCQQKQPDIPVMVVQ 218
Query: 362 EREGVEDVVGDDASI--VFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
E E V SI + IT+P G+LAA+I + ++ T++AA+QLA A + +IAL
Sbjct: 219 EPGQQEFVAQLKQSIPDIKITSPDNMGKLAAMIAGANLLLCTDSAAMQLAVAVQTYTIAL 278
Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
F S +L P +E K I S TGK+ D + +L +
Sbjct: 279 FGSTDPAKLL-PKSE--KIIAIKSSTGKIADISPKTILEKI 316
>gi|300866017|ref|ZP_07110751.1| glycosyl transferase family protein [Oscillatoria sp. PCC 6506]
gi|300336008|emb|CBN55909.1| glycosyl transferase family protein [Oscillatoria sp. PCC 6506]
Length = 319
Score = 92.8 bits (229), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 87/343 (25%), Positives = 157/343 (45%), Gaps = 37/343 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP +D I R K + + K+V+ YD D
Sbjct: 2 RIVALVPGGIGDQILFFPTLDGLKQLYPNTQVDAIVEPRAKGAYRVCKSVKEVIPYDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A++ ++LG++++R Y+ V+S LG L++T RV Y A
Sbjct: 62 R-NALADWGNLLGIIRDREYEAVIS-----LGQRWSVGLLLWLTGINKRVGY------AG 109
Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
G + L++ ++ + Y+ Q D P PPL +++ ++ +
Sbjct: 110 GGSIFLTDAIPLKTEQYAAEMYHDLLQGFD---------INTPCPPLAINVPKQDIQWAE 160
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VI 357
+ + G + YI+IHG S+ ++ S+G D + P+ W +I L++ +P V+
Sbjct: 161 AEQQRLGIKDSGYILIHG--GSSQLAL-SKG-IDKIYPVDKWQQIIQDLQQRQPNLPVVV 216
Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
E D + + V P G+LAA I + ++ T++A + LA A + +I
Sbjct: 217 VKGPEDAAFVDKLVESCPQVLAIAPDDIGKLAAAIAAANLMLCTDSAPMHLAVAVQTYTI 276
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
ALF +L P ++ + + S TGK+ D + VL +
Sbjct: 277 ALFGPTDPAKLL-PKSD--RFIGLKSPTGKMADISPQDVLKKV 316
>gi|422304228|ref|ZP_16391575.1| Glycosyl transferase [Microcystis aeruginosa PCC 9806]
gi|389790695|emb|CCI13446.1| Glycosyl transferase [Microcystis aeruginosa PCC 9806]
Length = 320
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 158/342 (46%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K + L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLRQTYPQSSIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRSAIN--LLLWLNGVPNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F +E + GY + Y +V + + P P PPL+++++R +
Sbjct: 113 -----WFLSEQVPRPTEGYLADNYHALV----KGLKIAP--PCPPLKLTLNRDDIDWAEM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S +++ P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETVYPLESWLEIIQGIRHQQPELPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317
>gi|166364044|ref|YP_001656317.1| glycosyl transferase family protein [Microcystis aeruginosa
NIES-843]
gi|166086417|dbj|BAG01125.1| glycosyl transferase [Microcystis aeruginosa NIES-843]
Length = 320
Score = 92.4 bits (228), Expect = 4e-16, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 158/342 (46%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K + L V+ ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVKEVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y+ +V + + P P PPL+++++R +
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYQALV----KGLKIAP--PCPPLKLTLNRDDIDWAEM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 EGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGERHR--YIEANTGKVADITPETVLKKI 317
>gi|425470313|ref|ZP_18849183.1| Glycosyl transferase [Microcystis aeruginosa PCC 9701]
gi|389884129|emb|CCI35564.1| Glycosyl transferase [Microcystis aeruginosa PCC 9701]
Length = 320
Score = 92.0 bits (227), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 156/342 (45%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K + L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y+ +V L P PPL+++++R +
Sbjct: 113 -----WFLSQQVPRPTEGYLADHYQALVKGLKIA------APCPPLKLTLNRDDIDWAEM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITAETVLKKI 317
>gi|75908631|ref|YP_322927.1| glycosyl transferase family protein [Anabaena variabilis ATCC
29413]
gi|75702356|gb|ABA22032.1| Glycosyl transferase, family 9 [Anabaena variabilis ATCC 29413]
Length = 319
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 157/340 (46%), Gaps = 31/340 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP +DV+ R K + ++K+V +D D
Sbjct: 2 RVVALVPGGIGDQILFFPTLDDLKRYYPNAQLDVVVEPRSKAAYRVSKSVNDILTFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ +++G +++R YD+ ++ + L FL++T R+ Y GA
Sbjct: 62 R-NSLADWGNLVGTIRDREYDVAIAVGQSWL-VGLFLWLTGIPTRIGY-----QGKGASF 114
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
L S+ +Y ++ +PF P P L V++ + + + K
Sbjct: 115 LTKTV----PFKPSQYAAAVYHDLL----QPFGIT--TPTPELAVNVPKPDIDWANSEQK 164
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIPHE 360
G + Y++I+G + + DS+ P++ W EI R+ + P+ VI
Sbjct: 165 RLGVSETGYVLIYG----GSSWVSQPQALDSIYPVENWQEIIQDFRQKQPDLPIVVIQGP 220
Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
+ + V + D + + +T+P G+L A+I + ++ T++A + L+ A + +IALF
Sbjct: 221 DDEQFVRS-LRDYSLDIKVTSPDNVGKLTAMIAGANLMLTTDSAPLHLSVAVQTYTIALF 279
Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
S +L + K I S TG++ D +AVL +
Sbjct: 280 GSTDPVKLL---PQSDKLLGIKSPTGRVADILPKAVLEKV 316
>gi|113474466|ref|YP_720527.1| glycosyl transferase family protein [Trichodesmium erythraeum
IMS101]
gi|110165514|gb|ABG50054.1| glycosyl transferase, family 9 [Trichodesmium erythraeum IMS101]
Length = 321
Score = 92.0 bits (227), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 92/343 (26%), Positives = 154/343 (44%), Gaps = 37/343 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK Y IDVI R K +++ K+VR YD D
Sbjct: 2 RILALVPGGIGDQILFFPTLDDLKQSYRESQIDVIVEPRSKGAYQVCKSVRDVLTYDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A++ ++LG+M++R Y+ V+S LG L++T RV Y A
Sbjct: 62 A-NSLADWGNLLGIMRDREYEAVIS-----LGQRWTVGLLLWLTGIPKRVGY------AG 109
Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
G+ LS+ ++ + Y+ Q ++ + F+ + + + + + A
Sbjct: 110 NGGIFLSDPIPLKTEQYAAHMYHDLLQGMN-INTTFKGISIN-------VLKKDIAWAEA 161
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP-----L 354
E+ + AE G YI+IHG S Q +G D + P W EI + L++ +P L
Sbjct: 162 EQQRLGVAESG-YILIHG---GSSKLAQIKG-IDKIYPTNYWLEIISQLQQKQPNLPVVL 216
Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
P + D IV G+LAA I + ++ T++A + LA A +I
Sbjct: 217 VKGPEDGAWSSEISQSSRDVKIVIPGDVGKLAAFIAAANLMLCTDSAPMHLAVAVGTYTI 276
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
ALF +L P ++ + + S TGK+ D + VL +
Sbjct: 277 ALFGPTDPKKLL-PKSD--RVIAVKSSTGKMADILPQQVLQKI 316
>gi|434406201|ref|YP_007149086.1| ADP-heptose:LPS heptosyltransferase [Cylindrospermum stagnale PCC
7417]
gi|428260456|gb|AFZ26406.1| ADP-heptose:LPS heptosyltransferase [Cylindrospermum stagnale PCC
7417]
Length = 320
Score = 91.7 bits (226), Expect = 6e-16, Method: Compositional matrix adjust.
Identities = 82/339 (24%), Positives = 157/339 (46%), Gaps = 34/339 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ G + + +LFF + LK YP IDV+ R K ++++K+V +D D
Sbjct: 2 RVVALVPGSIGDQILFFATLDDLKRNYPNAQIDVVVEPRSKAAYQVSKSVHEVLTFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ +++G +++R YD+ ++ + L L++T R+ Y G G
Sbjct: 62 R-NSLADWGNLVGTIRDREYDVAIAVGQSWL-IGLLLWLTGIPVRIGY-------TGKG- 111
Query: 244 LLSETFTAESM--NLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
F S+ S+ Y ++ LG P P L +++ + E +
Sbjct: 112 ---SVFLTRSVPPKPSQYAAAAYHDLLQPLGI------NSPCPELALNVPKPDIEWAQRE 162
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
K G + YI+I+G + + D++ PI+ W +I + + P+ VI
Sbjct: 163 QKRLGVHETGYILING----ASNHLAPADGLDTVYPIENWLQIIQDCQHKQPDLPVVVIK 218
Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
+ + V ++ + + + +T+P G+LAA+I ++ +++T++ + L+ A + +IA
Sbjct: 219 GSDDEQFVRSLLLESSPDIKVTSPDDIGKLAAIIGGASLMLSTDSVPLHLSVAVQTYTIA 278
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
LF S L PN++ K I S TGK+ D + VL
Sbjct: 279 LFGSTDPANLL-PNSD--KFLAIKSPTGKMADISPQTVL 314
>gi|427706482|ref|YP_007048859.1| glycosyl transferase family protein [Nostoc sp. PCC 7107]
gi|427358987|gb|AFY41709.1| glycosyl transferase family 9 [Nostoc sp. PCC 7107]
Length = 319
Score = 91.7 bits (226), Expect = 7e-16, Method: Compositional matrix adjust.
Identities = 90/337 (26%), Positives = 163/337 (48%), Gaps = 45/337 (13%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDV+ + K + ++K+V +D D
Sbjct: 2 RVVALVPGGIGDQILFFPTLDDLKRCYPNAKIDVVTEPQSKAAYRVSKSVHEVLTFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS---TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++++++G +++R YD+ ++ + L GL L++T RV + G
Sbjct: 62 R-NSLADWSNLVGTIRDREYDVAIAFGQSWLVGL----MLWLTGIPMRVGF-----KGKG 111
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
A L T T N S+ Y +++ LG P+P L V++ + E
Sbjct: 112 AAFL---THTV-PFNPSQYVAAAYHELLKPLGL------TTPLPELAVNVPKPDIEWSQI 161
Query: 301 KYKNAGAEQGKYIVIH-GIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR------P 353
+ K G + Y++I+ G++S +A D D + P++ W +I ++EF P
Sbjct: 162 EQKRLGVNETGYVLIYGGVDSALRAK-----DADKIYPVENWQQI---IQEFHLKQPDMP 213
Query: 354 LFVIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
+ VI + E ++ + +T+P G+L A+I + +++T +AA+QL+ A +
Sbjct: 214 VVVI-QGADDETFVRLLRASCPQIKVTSPDNAGKLTAMIAGANLMLSTESAALQLSIAAQ 272
Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLID 447
+IA+ S G+L P +E K I S TGK+ D
Sbjct: 273 TYTIAILGSSDPGKLL-PKSE--KFLAIKSPTGKVAD 306
>gi|425455179|ref|ZP_18834904.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
aeruginosa PCC 9807]
gi|389803960|emb|CCI17156.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
aeruginosa PCC 9807]
Length = 320
Score = 91.3 bits (225), Expect = 8e-16, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K + L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVLNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y +V L P PPL+++++R +
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYHALVKGLKIA------APCPPLKITLNRDDIDWAQM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDISISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317
>gi|428217367|ref|YP_007101832.1| glycosyl transferase family protein [Pseudanabaena sp. PCC 7367]
gi|427989149|gb|AFY69404.1| glycosyl transferase family 9 [Pseudanabaena sp. PCC 7367]
Length = 319
Score = 91.3 bits (225), Expect = 9e-16, Method: Compositional matrix adjust.
Identities = 81/320 (25%), Positives = 146/320 (45%), Gaps = 31/320 (9%)
Query: 122 VRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDL 181
+ R ++ GG+ + LLFFP ++ LK YP IDV+ R + + V N++
Sbjct: 1 MMRLLALVPGGIGDQLLFFPTLETLKQTYPQAAIDVVVEPRSVGAYRVCPYVN--NIWKF 58
Query: 182 DDDWPEP-AEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
D A++ ++LG +++ YD+VLS L++T +R+++ +G
Sbjct: 59 DFKGSNSFADWGNLLGTIRDCEYDVVLSLG-KSFSVGLLLWLTGIPERIAF-------SG 110
Query: 241 AG-LLLSETFTAESMNLSERGYNMYEQMVD-WLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
AG LL++ E ++ MY + LG P PP++++I + V
Sbjct: 111 AGSFLLTQPVPLEQ---NQYAAAMYHDLASKGLGI------DQPCPPIKLAIPQGDVAWV 161
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREF---RPLF 355
+ K G E Y++IHG S+ + R + + P++ W I NG++ PL
Sbjct: 162 KSELKRMGIEGKGYVLIHGGASE----LSQRKGINKIYPVESWKTIINGVQAKLYDAPLV 217
Query: 356 VIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPS 413
VI +R+ ++ + + I P G+LAALI + ++ T++A + + A
Sbjct: 218 VIGGPDDRKLIKALTELQPQLKVINPPDIGKLAALIMSAQLLLCTDSAPMHIGVATGIKL 277
Query: 414 IALFSSELKGRLFVPNAEEK 433
+ALF +L N + K
Sbjct: 278 VALFGPTEPAKLLPSNDQIK 297
>gi|425442475|ref|ZP_18822718.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
aeruginosa PCC 9717]
gi|389716501|emb|CCH99280.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
aeruginosa PCC 9717]
Length = 320
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K + L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QYG-LADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVLNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y +V L P PPL+++++R +
Sbjct: 113 -----WFLSQQVPQPTEGYLADNYHALVKGLKIA------APCPPLKITLNRDDIDWAQM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPELPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317
>gi|119483240|ref|ZP_01618654.1| Glycosyl transferase, family 9 [Lyngbya sp. PCC 8106]
gi|119458007|gb|EAW39129.1| Glycosyl transferase, family 9 [Lyngbya sp. PCC 8106]
Length = 320
Score = 90.9 bits (224), Expect = 1e-15, Method: Compositional matrix adjust.
Identities = 88/342 (25%), Positives = 157/342 (45%), Gaps = 38/342 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP I V+ R K + + K+V YD D
Sbjct: 2 RILALVPGGIGDQVLFFPTLDDLKQAYPQARISVVVEPRAKGAYRVCKSVSEVLAYDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A++ + LG+++ R +D+VLS LG L++T RV Y +
Sbjct: 62 R-NSLADWGNFLGIVREREFDLVLS-----LGKRWTVGLLLWLTGIPQRVGY-----AGS 110
Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
G+ L+++ ++ + Y+ Q +D L PF + + VP +S +
Sbjct: 111 AGGMFLTQSVPLKTEQYAAEMYHDLLQGLD-LSLPFSGLAIN-VPKSDISWAE------- 161
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFV 356
+ + G + YI+IHG S Q G D + P++ W ++ +++ + P+ V
Sbjct: 162 TQQQQLGIKDSGYILIHG---GSSLLAQQLG-IDKIYPVEKWQKVIQDMQQQQPNIPVVV 217
Query: 357 IPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPS 413
+ ++ + V + V +TTP G+LAA+I + ++ T++A + LA A +
Sbjct: 218 VKGPEDADWVNQLT-QSCRDVKVTTPGDIGKLAAMIAAANLMVCTDSAPMHLAVAVGTYT 276
Query: 414 IALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
IALF +L P + + + S TGK+ D E +L
Sbjct: 277 IALFGPTDPEKLL-PKTD--RVIGVKSPTGKIADLSPEDILK 315
>gi|440755455|ref|ZP_20934657.1| glycosyltransferase 9 family protein [Microcystis aeruginosa
TAIHU98]
gi|440175661|gb|ELP55030.1| glycosyltransferase 9 family protein [Microcystis aeruginosa
TAIHU98]
Length = 320
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K + L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLRQTYPQGTIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y +V L P PPL+++++R +
Sbjct: 113 -----WFLSQQVPQPTEGYLADNYHALVKGLKIA------APCPPLKLTLNRDDIDWAEM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317
>gi|390439562|ref|ZP_10227953.1| Glycosyl transferase [Microcystis sp. T1-4]
gi|389837051|emb|CCI32077.1| Glycosyl transferase [Microcystis sp. T1-4]
Length = 320
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 85/345 (24%), Positives = 156/345 (45%), Gaps = 40/345 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K + L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLQQTYPQGTIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPR-----HPVPPLRVSISRRLKEV 297
F ++ + GY LG + ++ + P PPL+++++R +
Sbjct: 113 -----WFLSQQVPQPTEGY---------LGDNYHALVKGLKIAAPCPPLKLTLNRDDIDW 158
Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PL 354
+ + + YI+++ SD S ++L P++ W EI G+R + P+
Sbjct: 159 AEMEQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPI 214
Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKP 412
I E + V + D ++ T G++AA+I + ++AT + +QLA A
Sbjct: 215 VAIQGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTY 274
Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
+++L L R+ E + I + TGK+ D E VL +
Sbjct: 275 TLSLLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317
>gi|425437611|ref|ZP_18818026.1| Glycosyl transferase [Microcystis aeruginosa PCC 9432]
gi|389677385|emb|CCH93667.1| Glycosyl transferase [Microcystis aeruginosa PCC 9432]
Length = 320
Score = 90.5 bits (223), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 155/342 (45%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K + L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLRQTYPQGTIDVLVEPRAKAAYRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y +V L P PPL+++++R +
Sbjct: 113 -----WFLSQQVPQPTEGYLADNYHALVKGLKIA------APCPPLKLTLNRDDIDWAEM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLKSWLEIIQGIRHQQPDLPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317
>gi|428200505|ref|YP_007079094.1| ADP-heptose--LPS heptosyltransferase [Pleurocapsa sp. PCC 7327]
gi|427977937|gb|AFY75537.1| ADP-heptose:LPS heptosyltransferase [Pleurocapsa sp. PCC 7327]
Length = 314
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 83/341 (24%), Positives = 157/341 (46%), Gaps = 38/341 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ E +LFFP ++ LK +YP IDV+ R K + + V V+D D
Sbjct: 2 RIIALVPGGISEQILFFPTLEDLKKQYPKAAIDVMIEPRAKAAYRVCPYVNEVLVFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
A+Y ++LG++++R YDM LS ++ +G L++ R+ Y A
Sbjct: 62 R-NGLADYLNLLGIIRDREYDMALSLSQRWTIG--LLLWLNGIPIRIGY------KNNAS 112
Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
LS++ +S + + Y ++ LG + P L+V++ + + +
Sbjct: 113 WFLSDSVPLKS---EQYAAHKYHDLLQALG------IQSSCPELKVTLPKEDIDWAEAEQ 163
Query: 303 KNAGAEQGKYIVIH-GIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
K E+ YI+++ G E D+ P+ W +I L++ + P+ ++
Sbjct: 164 KRLDIEESGYILLYDGFEGG-----------DNTYPVAKWRKIVEDLQQKQPNLPIVLLR 212
Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
+ V + D ++ ++ P G+LAA++ + ++ T++ + L+ A +IAL
Sbjct: 213 DSENERWVATTIEDRPNLKVVSPPDLGKLAAIVAGANLMLCTDSVPMHLSVAVGTYTIAL 272
Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
F S +L PN+E + I S T K+ D E +L +
Sbjct: 273 FGSTDATKLLPPNSE--RSIGIQSPTKKVADIEPETILQQI 311
>gi|209524261|ref|ZP_03272811.1| glycosyl transferase family 9 [Arthrospira maxima CS-328]
gi|376003521|ref|ZP_09781331.1| glycosyl transferase [Arthrospira sp. PCC 8005]
gi|423066757|ref|ZP_17055547.1| glycosyl transferase family 9 [Arthrospira platensis C1]
gi|209495352|gb|EDZ95657.1| glycosyl transferase family 9 [Arthrospira maxima CS-328]
gi|375328178|emb|CCE17084.1| glycosyl transferase [Arthrospira sp. PCC 8005]
gi|406711782|gb|EKD06981.1| glycosyl transferase family 9 [Arthrospira platensis C1]
Length = 320
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 158/345 (45%), Gaps = 44/345 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
+ ++ GG+ + +LFFP + LK YP I V+ R + + + K+V YD
Sbjct: 2 KILALVPGGIGDQILFFPTLDDLKQAYPHAQISVVTEPRAQTAYRVCKSVSKVIPYDFKG 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A++ +++G++++R YD+V+S LG L++T +RV Y A
Sbjct: 62 R-NSLADWGNLIGIVRDREYDVVIS-----LGQRWTVGLLLWLTGIPNRVGY------AG 109
Query: 240 GAGLLLSETFTAESMNLSERGY--NMYEQMVDWLG--RPFRSVPRHPVPPLRVSISRRLK 295
+G E F +++ L Y MY ++ L PF + + VP ++ + R +
Sbjct: 110 SSG----EIFLTDAVPLKTEQYAAQMYHDLLQGLDIITPFSGLQIN-VPKSDLAWADREQ 164
Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF 355
E + G ++ YI+IHG + M D + P++ W +I +++ +P
Sbjct: 165 E-------SLGVKESGYILIHG----GSSQMAINKGIDKIYPVESWQKIIEDIQQRQPNL 213
Query: 356 --VIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
V+ E + + + V +T P G+LAA+I + ++ T++A + LA A
Sbjct: 214 PVVLVEGPEDQALITQITQRCPQVKLTQPEDIGKLAAMIAAANLMVCTDSAPMHLAIAVG 273
Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
+IALF +L P+ ++ + S TGK+ D + VL
Sbjct: 274 TYTIALFGPTDPHKLLPPS---ERAIAVKSPTGKMADISPDEVLK 315
>gi|425460630|ref|ZP_18840111.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
aeruginosa PCC 9808]
gi|389826674|emb|CCI22674.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
aeruginosa PCC 9808]
Length = 320
Score = 89.7 bits (221), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 155/342 (45%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K + L V+ ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAYRLCPQVKEVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVLNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y +V L P PPL+++++R +
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYHALVKGLKIA------APCPPLKITLNRDDIDWAEM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317
>gi|409991714|ref|ZP_11274951.1| glycosyl transferase family protein [Arthrospira platensis str.
Paraca]
gi|291571644|dbj|BAI93916.1| hypothetical protein [Arthrospira platensis NIES-39]
gi|409937413|gb|EKN78840.1| glycosyl transferase family protein [Arthrospira platensis str.
Paraca]
Length = 320
Score = 89.4 bits (220), Expect = 3e-15, Method: Compositional matrix adjust.
Identities = 86/345 (24%), Positives = 158/345 (45%), Gaps = 44/345 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
+ ++ GG+ + +LFFP + LK YP I V+ R + + + K+V YD
Sbjct: 2 KILALVPGGIGDQILFFPTLDDLKQAYPHAQISVVTEPRAQTAYRVCKSVSKVIPYDFKG 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A++ +++G++++R YD+V+S LG L++T +RV Y A
Sbjct: 62 R-NSLADWGNLIGIVRDREYDVVIS-----LGQRWTVGLLLWLTGIPNRVGY------AG 109
Query: 240 GAGLLLSETFTAESMNLSERGY--NMYEQMVDWLG--RPFRSVPRHPVPPLRVSISRRLK 295
+G E F +++ L Y MY ++ L PF + + VP ++ + R +
Sbjct: 110 SSG----EIFLTDAVPLKTEQYAAQMYHDLLQGLDIITPFSGLQIN-VPKSDLAWADREQ 164
Query: 296 EVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF 355
E + G ++ YI+IHG + M D + P++ W +I +++ +P
Sbjct: 165 E-------SLGVKESGYILIHG----GSSQMAINKGIDKIYPVESWQKIIEDIQQRQPNL 213
Query: 356 --VIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
V+ E + + + V +T P G+LAA+I + ++ T++A + LA A
Sbjct: 214 PVVLVKGPEDQALITQITQRCPQVKLTQPEDIGKLAAMIAAANLMVCTDSAPMHLAIAVG 273
Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
+IALF +L P+ ++ + S TGK+ D + VL
Sbjct: 274 TYTIALFGPTDPHKLLPPS---ERAIAVKSPTGKMADISPDEVLK 315
>gi|425447077|ref|ZP_18827071.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
aeruginosa PCC 9443]
gi|389732456|emb|CCI03614.1| Similar to tr|Q4C0X9|Q4C0X9_CROWT Glycosyl transferase [Microcystis
aeruginosa PCC 9443]
Length = 320
Score = 89.4 bits (220), Expect = 4e-15, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 155/342 (45%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAHRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRSAIN--LLLWLNGVPNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y +V L P PPL+++++R +
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYHALVKGLKIA------APCPPLKITLNRDDIDWAQM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDISISQ----GLETLYPLESWLEIIQGIRHQQPDLPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ + + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGDLHR--YIEANTGKVADITPETVLKKI 317
>gi|425448716|ref|ZP_18828560.1| Glycosyl transferase [Microcystis aeruginosa PCC 7941]
gi|389763996|emb|CCI09602.1| Glycosyl transferase [Microcystis aeruginosa PCC 7941]
Length = 320
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 85/342 (24%), Positives = 155/342 (45%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ + K + L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLRQTYPQGTIDVLVEPKAKAAYRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QY-GLADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVPNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y +V L P PPL+++++R +
Sbjct: 113 -----WFLSQQVPQPTEGYLADNYHALVKGLKIA------APCPPLKLTLNRDDIDWAEM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S ++L P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETLYPLKSWLEIIQGIRHQQPDLPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A +++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTGDIGKMAAMIAGANLLLATESVPLQLAVAVGTYTLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317
>gi|17232660|ref|NP_489208.1| hypothetical protein alr5168 [Nostoc sp. PCC 7120]
gi|17134306|dbj|BAB76867.1| alr5168 [Nostoc sp. PCC 7120]
Length = 319
Score = 89.0 bits (219), Expect = 5e-15, Method: Compositional matrix adjust.
Identities = 83/342 (24%), Positives = 163/342 (47%), Gaps = 35/342 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP +DV+ R K + ++K+V +D D
Sbjct: 2 RVVALVPGGIGDQILFFPTLDDLKRYYPNAQLDVVVEPRSKAAYRVSKSVNDILTFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ +++G +++R YD+V++ + L FL++T R+ Y G G
Sbjct: 62 R-NSLADWGNLVGTIRDREYDVVITVGQSWL-VGLFLWLTGIPTRIGY-------QGKG- 111
Query: 244 LLSETFTAESMNL--SERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
+F +++ S+ +Y ++ +G P P L V++ + + +
Sbjct: 112 ---RSFLTKTVPFKPSQYAAAVYHDLLAPIGI------TTPTPELAVNVPKPDIDWANSE 162
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
K G + Y++I+G + + +++ P + W EI R+ + P+ VI
Sbjct: 163 QKRLGVNETGYVLIYG----GSSWVSQPQALETIYPAENWQEIIQDFRQKQPDLPVVVIQ 218
Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
+ + V + D + + +T+P G+L A+I + ++ T++A + L+ A + +IA
Sbjct: 219 GPDDEQFVRS-LRDYSLDIKVTSPDNVGKLTAMIAGANLMLTTDSAPLHLSVAVQTYTIA 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
LF S +L PN++ K I S TG++ D +AVL +
Sbjct: 278 LFGSTDPVKLL-PNSD--KLLGIKSSTGRVADILPKAVLEKV 316
>gi|443314576|ref|ZP_21044124.1| ADP-heptose:LPS heptosyltransferase [Leptolyngbya sp. PCC 6406]
gi|442785819|gb|ELR95611.1| ADP-heptose:LPS heptosyltransferase [Leptolyngbya sp. PCC 6406]
Length = 321
Score = 88.6 bits (218), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 160/342 (46%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ E LLFFP ++ +K +P I ++A + ++ V+ +V+
Sbjct: 2 RVLALVPGGISEQLLFFPTLEHIKQAFPKAEIAIVAEPVSGSAYRVSTLVK--DVFPYSF 59
Query: 184 DWP-EPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGA 241
P PA++ ++LG++++R ++ VL+ T LG L+++ RVSY +
Sbjct: 60 SKPNSPADWANLLGIIRDREFEAVLTVTPKTSLG--LMLWLSGIPTRVSY-----QTGIS 112
Query: 242 GLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
+L++ T A+ Y+ +D G P P +++ ++ + V +
Sbjct: 113 NVLMTATVPAKPKQYQAFQYHDLLAALDITG---------PCPAATINVPQKDIDWVGRQ 163
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
K G Y++ +G D ++GD L P++ WA IA + + PL V+
Sbjct: 164 TKEQGIGDQGYVLFYGGPVDG-----TQGD---LYPVESWAAIAQDFQARQPGLPLVVVQ 215
Query: 359 HEKEREGVEDV--VGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
+ + V+ + + + ++ GQ+A+LI + +IAT++ +QL A ++AL
Sbjct: 216 QPETADLVKALTQIQPNLKVIRPENIGQMASLIAGANLMIATDSYPLQLGIALNVFTVAL 275
Query: 417 FSSELKGRLFVP-NAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
F S + R P + E + +++ +GK+ D E VL +
Sbjct: 276 FGSNVPERRLPPVDGTETRFVGLTASSGKVADITPETVLKKI 317
>gi|332709136|ref|ZP_08429103.1| ADP-heptose/LPS heptosyltransferase [Moorea producens 3L]
gi|332352047|gb|EGJ31620.1| ADP-heptose/LPS heptosyltransferase [Moorea producens 3L]
Length = 320
Score = 88.2 bits (217), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 95/347 (27%), Positives = 153/347 (44%), Gaps = 44/347 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDVI R K + + ++V +D D
Sbjct: 2 RILALVPGGIGDQILFFPTLSQLKQYYPKAEIDVIVEPRAKGAYRVCQSVDEVLTFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYD--MVLSTKLA-GLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ ++LGV+++R YD M L + + GL L+MT R+ Y AG
Sbjct: 62 R-NGLADFGNLLGVIRDREYDAAMTLGRRWSVGL----LLWMTGIVRRIGY------QAG 110
Query: 241 AGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
+ + F + L Y +MY ++ G P L V + + E
Sbjct: 111 S----RDLFFTNPVPLKTEQYAAHMYHDLLLGFGINV------PCQELAVRVPKSDIEWA 160
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIP 358
+ + + YI+IHG S S D + P++ W +I +R+ +P I
Sbjct: 161 EAQQQQLEIPEIGYILIHGGSSQLALSQ----GIDKIYPVEQWQKIVEDIRQKQPDLPIV 216
Query: 359 HEKEREGVEDVVGDDASI-----VFITTP---GQLAALINDSAGVIATNTAAIQLANARE 410
K G ED A I + +T P G+LAA+I + ++ T++A + LA A
Sbjct: 217 LLK---GPEDAEWCQAMIEYHRDIIVTAPEDIGKLAAMIAGANLMLCTDSAPMHLAVAVG 273
Query: 411 KPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
P+IALF + + +P E I S TG++ D E VL +
Sbjct: 274 TPTIALFGPT-QAKKLLPAKE--GVYGIQSATGRIADIKPETVLEQI 317
>gi|81299493|ref|YP_399701.1| hypothetical protein Synpcc7942_0682 [Synechococcus elongatus PCC
7942]
gi|81168374|gb|ABB56714.1| conserved hypothetical protein [Synechococcus elongatus PCC 7942]
Length = 319
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 141/306 (46%), Gaps = 38/306 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDV+ R + + V +D D
Sbjct: 2 RILALVPGGIGDQILFFPTVADLKKYYPQAQIDVVVEPRAVGAYRVCAEVDQVFPFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLST---KLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ ++LG ++ R Y+ VLS L GL FL++T RV Y AG
Sbjct: 62 R-NSLADWGNLLGSIREREYEAVLSLGQRSLVGL----FLWLTGIPKRVGY-------AG 109
Query: 241 AGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRR-LKEV 297
G + F +++ L+ Y +Y ++ G P P +++++R+ L
Sbjct: 110 RG----KIFLTDAVPLNREQYAGALYHDLLQGFGI------NTPCPNPKLTLARQDLDWA 159
Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PL 354
AE+ + A QG Y+V+HG +++ + P++ WA + +RE R P
Sbjct: 160 TAEQQRLGLAGQG-YLVLHG----GSSTLAKLKGLQKVYPVEKWAIVLRSIREQRPSLPF 214
Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKP 412
V+ + E V ++ + + P G+LAA+I + ++ T++A + LA A
Sbjct: 215 VVVQGPDDAEFVAELRKSNLDFQVVQPPDIGKLAAIIAGADLMLCTDSAPMHLAVASGTR 274
Query: 413 SIALFS 418
+IALF
Sbjct: 275 TIALFG 280
>gi|56750856|ref|YP_171557.1| hypothetical protein syc0847_c [Synechococcus elongatus PCC 6301]
gi|56685815|dbj|BAD79037.1| hypothetical protein [Synechococcus elongatus PCC 6301]
Length = 319
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 81/306 (26%), Positives = 141/306 (46%), Gaps = 38/306 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDV+ R + + V +D D
Sbjct: 2 RILALVPGGIGDQILFFPTVADLKKYYPQAQIDVVVEPRAVGAYRVCAEVDQVFPFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLST---KLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ ++LG ++ R Y+ VLS L GL FL++T RV Y AG
Sbjct: 62 R-NSLADWGNLLGSIREREYEAVLSLGQRSLVGL----FLWLTGIPKRVGY-------AG 109
Query: 241 AGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRR-LKEV 297
G + F +++ L+ Y +Y ++ G P P +++++R+ L
Sbjct: 110 RG----KIFLTDAVPLNREQYAGALYHDLLQGFGI------NTPCPNPKLTLARQDLDWA 159
Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PL 354
AE+ + A QG Y+V+HG +++ + P++ WA + +RE R P
Sbjct: 160 TAEQQRLGLAGQG-YLVLHG----GSSTLAKLKGLQKVYPVEKWAIVLRSIREQRPSLPF 214
Query: 355 FVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKP 412
V+ + E V ++ + + P G+LAA+I + ++ T++A + LA A
Sbjct: 215 VVVQGPDDAEFVAELRKSNLDFQVVQPPDIGKLAAIIAGADLMLCTDSAPMHLAVASGTR 274
Query: 413 SIALFS 418
+IALF
Sbjct: 275 TIALFG 280
>gi|37522681|ref|NP_926058.1| heptosyltransferase [Gloeobacter violaceus PCC 7421]
gi|35213683|dbj|BAC91053.1| gll3112 [Gloeobacter violaceus PCC 7421]
Length = 328
Score = 87.4 bits (215), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 83/338 (24%), Positives = 147/338 (43%), Gaps = 34/338 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R + GG+ + +LFFP ++ L++R+ I+V+ R + + + +V +D
Sbjct: 3 RILALNPGGIGDQVLFFPTLRGLRERFAQSRIEVVVEPRSQGAYRVCPSVNETLTFDFKG 62
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
D P A++ +++G++++R YD VLS + L A L+MT RV Y
Sbjct: 63 D-PSLADWMNLIGILRDRRYDAVLSVGSSSL-VALLLWMTGIPKRVGY----------SA 110
Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
++E F +S+ L+ Y MY ++ G + R PP + ++ + +
Sbjct: 111 WITERFLTDSVPLNRDQYAAQMYHDLLQGFG-----IQRAFTPP-QAAVYSEDERWATQA 164
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGL---REFRPLFVIP 358
+ G K +++H + + L P WA++ L P FV+
Sbjct: 165 LASLGGR--KPLLLH----PGASKLAELKGIRKLYPAAQWAQVVQKLLVKEADLPFFVVQ 218
Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
++ E V + G+ A P G+L ALI S ++ ++A + LA A P +A
Sbjct: 219 GPEDSELVTAIKGELADQARFVQPPDVGKLTALIERSRLLLCVDSAPMHLAVATGTPLVA 278
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAV 453
LF L +L E K ++S K+ PVE V
Sbjct: 279 LFGPTLPTKLL--PEEPKYVALVSPERDKVERIPVETV 314
>gi|284929535|ref|YP_003422057.1| ADP-heptose:LPS heptosyltransferase [cyanobacterium UCYN-A]
gi|284809979|gb|ADB95676.1| ADP-heptose:LPS heptosyltransferase [cyanobacterium UCYN-A]
Length = 311
Score = 87.0 bits (214), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 77/340 (22%), Positives = 154/340 (45%), Gaps = 39/340 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R C++ GG+ E LLFFP ++ LK++YP VLIDV+ K + + V +D D
Sbjct: 2 RILCLVPGGINEQLLFFPTLESLKNKYPNVLIDVLVEPLAKSAYRICPYVNEILFFDYQD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A+Y +++GV+++R YD+ ++T L L+ + R+ Y
Sbjct: 62 R-NVSADYLNLVGVIRDRGYDIAITTG-NKLILELLLWSNSIPWRIGY------KTQTSW 113
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
LS + T + + Y ++ ++ +P P +++++ + + +
Sbjct: 114 FLSHSITQKEEQYAAETY--HDLLLKLNIQP-------SCPSIKIAVPKDDISWTETQLQ 164
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP---LFVIPHE 360
+ ++ YI I+G E++S P+ W EI N +++ P + ++
Sbjct: 165 SLSVKENGYIAIYGGENNS-------------YPVSSWIEIINSIQKKEPSLSIVLLESN 211
Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
E+ + ++ D+ + I P G+L+A+I + ++ ++ +QL A +IALF
Sbjct: 212 TEQTWTKSIL-DNCKNLKIINPDNLGKLSAVIAGANLLVCIDSVPLQLGIAVGTYTIALF 270
Query: 418 SSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
S+ K +P + +C I S + +L D +L +
Sbjct: 271 -SDTKANHKIPTNYD-RCIAIQSPSNQLADISSSTILEKI 308
>gi|218439612|ref|YP_002377941.1| glycosyl transferase family protein [Cyanothece sp. PCC 7424]
gi|218172340|gb|ACK71073.1| glycosyl transferase family 9 [Cyanothece sp. PCC 7424]
Length = 320
Score = 86.7 bits (213), Expect = 2e-14, Method: Compositional matrix adjust.
Identities = 82/344 (23%), Positives = 158/344 (45%), Gaps = 38/344 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP ++ LK +YP I VI R K + + V ++D D
Sbjct: 2 RILALVPGGIGDQILFFPTLEDLKQKYPTSTIHVIVEPRAKSAYRVCPYVNEVLLFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A+Y ++LGV+++R YD+ +S LG L++ RV Y
Sbjct: 62 R-NGLADYLNLLGVIRDREYDLAVS-----LGRRWTVGLLLWLNGIPVRVGY------KT 109
Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
L+ ++ + Y+ Q +D + P PPL+V++ + +
Sbjct: 110 NTSWFLTNPVPLKTEQYAAYMYHDLLQGLD---------IQTPCPPLKVTLPKEDIDWAE 160
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFV 356
+ K + YI+IHG SK ++ D + P+ W +I + +++ + P+ +
Sbjct: 161 AEQKRLDIKDTGYIMIHG--GSSKLAIVK--GIDKIYPVSQWQKILDEIQQKQPNIPIVL 216
Query: 357 IPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPS 413
+ ++ E V+ ++ + + +T P G+LAA+I + ++ T++A + L+ + +
Sbjct: 217 LQGPEDAEWVK-MMMEGYPNLKVTKPSDVGKLAAMIAGANLMLCTDSAPMHLSVSVGTYT 275
Query: 414 IALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
IALF +L P ++ + I S +G + D E +L AM
Sbjct: 276 IALFGPTQAEKLLPP--DQDRYIGIQSPSGNIADIKPETILQAM 317
>gi|427725266|ref|YP_007072543.1| glycosyl transferase family protein [Leptolyngbya sp. PCC 7376]
gi|427356986|gb|AFY39709.1| glycosyl transferase family 9 [Leptolyngbya sp. PCC 7376]
Length = 319
Score = 86.7 bits (213), Expect = 3e-14, Method: Compositional matrix adjust.
Identities = 91/336 (27%), Positives = 159/336 (47%), Gaps = 43/336 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GGV + +LFFP + LKD+YP ++DV+ R K + + V +D D
Sbjct: 2 RILALVPGGVGDQILFFPTLADLKDQYPEAMVDVLVEPRAKAAYRVCPTVHEVLTFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
PA+Y ++LG +++R Y++ LS + GL L++ R+ Y N
Sbjct: 62 R-NGPADYLNVLGTIRDREYEIALSLGRRWIVGL----LLWLNGIPTRIGY---KTN--- 110
Query: 241 AGLLLSETFTAESMNLSERGY--NMYEQMVDWL--GRPFRSVPRHPVPPLRVSISRRLKE 296
+ F + L+E Y Y ++ L +P ++P+ VP S ++
Sbjct: 111 -----TSWFISNPAPLNEEQYAAATYHDLLKALDINKPC-NLPKINVP------SSDIQW 158
Query: 297 VVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---P 353
E+ K G G YI+IH SK S Q++G + L PI+ W E+ N ++ + P
Sbjct: 159 AEGEQ-KRLGLNDG-YILIHA--GASKIS-QAKG-IEKLYPIEKWEEVINDIKAKQPNLP 212
Query: 354 LFVIPHEKEREGVEDVVG--DDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREK 411
+ ++ ++ + V +V D + G+ AA+I + ++ T++A + LA A
Sbjct: 213 IALLHGPEDYDWVAQLVEIFPDLRVTVPEDVGKSAAMIAGANLMVCTDSAPMHLAIAVGT 272
Query: 412 PSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLID 447
+IALF + RL P+++ K I S TG++ D
Sbjct: 273 YTIALFGPTKRERLLPPSSD--KFIGIQSPTGRIAD 306
>gi|427712861|ref|YP_007061485.1| ADP-heptose--LPS heptosyltransferase [Synechococcus sp. PCC 6312]
gi|427376990|gb|AFY60942.1| ADP-heptose:LPS heptosyltransferase [Synechococcus sp. PCC 6312]
Length = 325
Score = 83.6 bits (205), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 81/336 (24%), Positives = 143/336 (42%), Gaps = 41/336 (12%)
Query: 123 RRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDL- 181
+R +I GG+ + +L FP + L+ YP IDV+ R + +E+N V +
Sbjct: 4 QRIVMLIPGGIGDQILIFPTLADLRGHYPQAEIDVVVEPRSQAAYEVNAVVNQVLTFPFR 63
Query: 182 -DDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHA----AFLFMTTARDRVSYIYPNV 236
W ++ ++ +++R YD+++S LG + L++T RV Y
Sbjct: 64 AKKTW---RDWWGLIQQIRSRQYDIIVS-----LGESFAVRVLLWLTGVPKRVGY----A 111
Query: 237 NAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKE 296
N GLL +N ++ MY ++ LG + R PPL ++ ++ +
Sbjct: 112 NQKTWGLLTHPA----PLNKNQYAAAMYHDLLKGLG-----IDRE-YPPLVATVKQK-DQ 160
Query: 297 VVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP--- 353
E + Q YI+IHG + M + + PIQ W E+ + L P
Sbjct: 161 AWGESERLRLQVQSPYILIHG----GSSKMARLKGINKIYPIQAWLEVLHTLNAKYPGIS 216
Query: 354 LFVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREK 411
+ ++ ++E E + + P G+LAALI + G+I T++ + L A
Sbjct: 217 VLIVQGPDDQEWGEQLTAAIPELKRTQPPSFGKLAALIETATGMICTDSGPMHLGVALNT 276
Query: 412 PSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLID 447
P +ALF +L P + + S TG++ D
Sbjct: 277 PLVALFGPTDPQKLLPPQPHFRP---VKSPTGQIGD 309
>gi|434393455|ref|YP_007128402.1| glycosyl transferase family 9 [Gloeocapsa sp. PCC 7428]
gi|428265296|gb|AFZ31242.1| glycosyl transferase family 9 [Gloeocapsa sp. PCC 7428]
Length = 314
Score = 82.8 bits (203), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 82/340 (24%), Positives = 151/340 (44%), Gaps = 42/340 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + LK YP IDV+ R K+ + + K V +D D
Sbjct: 2 RVVALVPGGIGDQILFFPTLDDLKQNYPNAQIDVVVEPRAKEAYRICKAVNNVLAFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ +++G++++R YD+V+S GL L++T R+ Y G
Sbjct: 62 R-NSAADWVNLVGILRDREYDVVISVGQRWFVGL----LLWLTGIPTRIGY-------KG 109
Query: 241 AGLLLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
AG F ++ L + Y+ MY ++ LG P P L V++ +
Sbjct: 110 AG----NMFLTNAVPLKSQQYDAAMYHDLLQGLGI------NSPCPELTVNVPAADIDWA 159
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP-LFVI 357
+ K+ G + Y+++ SDS + D P++ W +I R+ +P L ++
Sbjct: 160 EAEKKHLGIQSSGYVLL----SDSF----EQSTPDKSYPMESWRQIIQDFRQKQPELPLV 211
Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
+ + + + +T+P G+LAA+I + ++ +QLA A + +I
Sbjct: 212 ALQSSDDMRWTQLAQANPELKLTSPPDVGKLAAMIAGADLLLCIEGVPLQLAVAVQTYAI 271
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
ALF +L + + I S +G++ D + VL
Sbjct: 272 ALFGLTDPAKLL---PKSDRYIPIKSPSGQVADITPQTVL 308
>gi|16332333|ref|NP_443061.1| hypothetical protein slr0606 [Synechocystis sp. PCC 6803]
gi|383324074|ref|YP_005384928.1| hypothetical protein SYNGTI_3166 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|383327243|ref|YP_005388097.1| hypothetical protein SYNPCCP_3165 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|383493127|ref|YP_005410804.1| hypothetical protein SYNPCCN_3165 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|384438395|ref|YP_005653120.1| hypothetical protein SYNGTS_3167 [Synechocystis sp. PCC 6803]
gi|451816484|ref|YP_007452936.1| hypothetical protein MYO_132030 [Synechocystis sp. PCC 6803]
gi|1653963|dbj|BAA18873.1| slr0606 [Synechocystis sp. PCC 6803]
gi|339275428|dbj|BAK51915.1| hypothetical protein SYNGTS_3167 [Synechocystis sp. PCC 6803]
gi|359273394|dbj|BAL30913.1| hypothetical protein SYNGTI_3166 [Synechocystis sp. PCC 6803
substr. GT-I]
gi|359276564|dbj|BAL34082.1| hypothetical protein SYNPCCN_3165 [Synechocystis sp. PCC 6803
substr. PCC-N]
gi|359279734|dbj|BAL37251.1| hypothetical protein SYNPCCP_3165 [Synechocystis sp. PCC 6803
substr. PCC-P]
gi|407957031|dbj|BAM50271.1| hypothetical protein BEST7613_1340 [Synechocystis sp. PCC 6803]
gi|451782453|gb|AGF53422.1| hypothetical protein MYO_132030 [Synechocystis sp. PCC 6803]
Length = 317
Score = 81.6 bits (200), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 76/314 (24%), Positives = 133/314 (42%), Gaps = 28/314 (8%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ E +LFFP I LK YP IDVI R K + + V +D D
Sbjct: 2 RILALVPGGISEQILFFPTIATLKAEYPQATIDVIVEPRAKSAYRVCAQVNEVLAFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ ++LGV+++R Y+ VL+ L++ RV Y + AG
Sbjct: 62 R-NGLADFLNLLGVIRDREYEAVLTVARQ-WTIELLLWLNGIPQRVGY------QSSAGF 113
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
LS T + + ++ + G P PPL++S+ + E + K
Sbjct: 114 FLSATVPFKP---DQYVPFLFHDLTQGFGI------NRPCPPLQISLPKADIEWMEATQK 164
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP---LFVIPHE 360
G Y+V++G +++ D PI W EI +++ +P + +IP
Sbjct: 165 KLDLGSGGYVVLNG------GAIRQPDTGDFPYPIAQWHEIIADIKQKQPGLKIVLIPPA 218
Query: 361 KEREGVEDVVGDDASIVFIT--TPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ V+ + +V I G++AAL+ + V+ T + LA A ++A+
Sbjct: 219 ENTGWVQAMQDQHPGVVAIRPGDVGKMAALLAGANLVVCTEGIPMHLAIAVGTYTVAILG 278
Query: 419 SELKGRLFVPNAEE 432
R+ P+ E+
Sbjct: 279 KTPAARIIPPDQEK 292
>gi|443647325|ref|ZP_21129681.1| glycosyltransferase 9 family protein [Microcystis aeruginosa
DIANCHI905]
gi|159027766|emb|CAO89636.1| unnamed protein product [Microcystis aeruginosa PCC 7806]
gi|443335502|gb|ELS49970.1| glycosyltransferase 9 family protein [Microcystis aeruginosa
DIANCHI905]
Length = 320
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 86/342 (25%), Positives = 154/342 (45%), Gaps = 34/342 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ LLFFP ++ L+ YP IDV+ R K L VR ++D D
Sbjct: 2 RILTLVPGGISNQLLFFPTLETLQQTYPQSSIDVLVEPRAKAAHRLCPQVREVLLFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-TKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
+ A+Y +ILGV+++R Y++ +S T + L++ +R+ Y N +
Sbjct: 62 QYG-LADYLNILGVIRDREYEIAISLTNRPAIN--LLLWLNGVLNRIGY---ENNGS--- 112
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F ++ + GY + Y +V L P PPL+++++R +
Sbjct: 113 -----WFLSQQVPRPTEGYLADNYHALVKGLKIA------APCPPLKLTLNRDDIDWAQM 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ + + YI+++ SD S +++ P++ W EI G+R + P+ I
Sbjct: 162 EQQRLNIKDSGYILLYAGGSDLSISQ----GLETVYPLESWLEIIQGIRHQQPELPIVAI 217
Query: 358 PHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E + V + D ++ T G++AA+I + ++AT + +QLA A S++
Sbjct: 218 QGELDEAWVLALKAMDNNLKVSRTADIGKMAAMIAGANLLLATESVPLQLAVAVGTYSLS 277
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L L R+ E + I + TGK+ D E VL +
Sbjct: 278 LLVPSLSKRVLPSGGELHR--YIEANTGKVADITPETVLKKI 317
>gi|170078509|ref|YP_001735147.1| heptosyltransferase family protein [Synechococcus sp. PCC 7002]
gi|169886178|gb|ACA99891.1| Heptosyltransferase family protein [Synechococcus sp. PCC 7002]
Length = 319
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 86/340 (25%), Positives = 152/340 (44%), Gaps = 37/340 (10%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GGV + +LFFP + LK+RYP +IDV+ R K + + V +D D
Sbjct: 2 RILALVPGGVGDQILFFPTLADLKERYPEAMIDVLVEPRAKAAYRVCPQVHEVLTFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLST-KLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG 242
PA+Y +ILG +++R Y++ LS K +G L++ RV Y
Sbjct: 62 R-NGPADYLNILGTIRDREYEIALSLGKNWAVGF--LLWLNGIPTRVGY----------- 107
Query: 243 LLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
+ F + L+E Y +Y ++ LG P +++ +
Sbjct: 108 KTATSWFISNPAPLNENQYAAALYHDLLKGLGI------DTDCPSPSITVPNGDIQWAEG 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ K +G YI+IHG S Q++G + L P+ W I ++ + P+ ++
Sbjct: 162 EQKRLDLTEG-YILIHGGASQLS---QAKG-INKLYPVANWQPIIEDIQSKQPNLPIVLL 216
Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
+ V +++ ++ +T P G+ AA+I + ++ T++A + LA A +I
Sbjct: 217 QGPDDAPWVTELIALYPALK-VTRPEDIGKAAAMIAGANLMVCTDSAPMHLAAAVGTYTI 275
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
ALF + +L P + K I S TG++ D + VL
Sbjct: 276 ALFGATHAEKLLPP--KNDKLIGIQSPTGEMADIQPKDVL 313
>gi|414079190|ref|YP_007000614.1| glycosyl transferase [Anabaena sp. 90]
gi|413972469|gb|AFW96557.1| glycosyl transferase family 9 [Anabaena sp. 90]
Length = 313
Score = 79.7 bits (195), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 81/342 (23%), Positives = 153/342 (44%), Gaps = 41/342 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ G + +LFF + LK YP IDVI + K + ++K+V +D D
Sbjct: 2 RVVALVPGSIDNQILFFATLDDLKRYYPDAQIDVIVEPQSKAAYRVSKSVHDVLTFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVL---STKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ +++G++++R YD+ + + GL +++T R+ Y G
Sbjct: 62 R-NSLADWGNLVGMIRDREYDVAIIVGQSWWVGL----LMWLTGIPTRIGY-----QGQG 111
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
A L + NLSE MY ++ +P + P P L V++ + E
Sbjct: 112 AVFLTNPI----PPNLSEYVAKMYHNLL----KPLKI--NTPCPALSVNVPKVDIEWAQA 161
Query: 301 KYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIP 358
+ K G + +I+I+ E G D+ P++ W +I ++ +P V+
Sbjct: 162 EQKRLGVNETGFILINAGE----------GSLDTTYPVENWQQIIAACQQKQPDLPVVVI 211
Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E E + + + + +T+P G+L A+I ++ +++ + +QL+ A E +I
Sbjct: 212 KEANNEPLVRSLLEHCHNIKVTSPDDIGKLTAIIAGASLMVSVENSFLQLSIAVETYTIT 271
Query: 416 LFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
L S + +P +E K I+S TGK+ D + VL +
Sbjct: 272 LLDS-IDSEKLLPISE--KVLAITSSTGKIADILPQIVLEKI 310
>gi|86606904|ref|YP_475667.1| heptosyltransferase family protein [Synechococcus sp. JA-3-3Ab]
gi|86555446|gb|ABD00404.1| heptosyltransferase family protein [Synechococcus sp. JA-3-3Ab]
Length = 324
Score = 79.3 bits (194), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 73/310 (23%), Positives = 133/310 (42%), Gaps = 27/310 (8%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + L+ R+P ++V+ R +E+ +V + +
Sbjct: 4 RILALVPGGIGDQILFFPTLASLRQRFPEAELEVLVEPRAAAAYEVCPSVNRVLTFPFKE 63
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
+ +D+LG ++ R YD VLS + LG L++T RV Y P G G
Sbjct: 64 QL-SLGDISDLLGRIRERQYDAVLSLGRS-LGVKLLLWLTGIPKRVGYAPP-----GPG- 115
Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
+ + + L Y +MY ++ G +P +R+ + L+ E+
Sbjct: 116 ---RPWLTDPVPLKPAQYAASMYHDLLQGFG----ITATAGLPQVRLKKT-DLEWAEQER 167
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
+ G Y+++H + + + P W ++ G RE R PL ++
Sbjct: 168 KRLLGDAAQDYVLLH----PGASQLSQEKGIQKIYPPASWVKVIKGFREKRPELPLVLLL 223
Query: 359 HEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
++ + E + + P GQLAALI S ++ T++A + LA A + P +A+
Sbjct: 224 GPEDEQLAEGFLSQLPDLPVSRPPHLGQLAALIAGSTLLLCTDSAPMHLAVATQTPLVAV 283
Query: 417 FSSELKGRLF 426
F RL
Sbjct: 284 FGPTEPERLL 293
>gi|126659100|ref|ZP_01730240.1| hypothetical protein CY0110_04803 [Cyanothece sp. CCY0110]
gi|126619628|gb|EAZ90357.1| hypothetical protein CY0110_04803 [Cyanothece sp. CCY0110]
Length = 312
Score = 79.0 bits (193), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 84/341 (24%), Positives = 147/341 (43%), Gaps = 40/341 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ E +LFFP ++ LK +YP +IDV+ R K + + V ++D D
Sbjct: 2 RILALVPGGINEQILFFPTLEDLKTQYPNAIIDVLVEPRAKSAYRVCPQVHEVLLFDYQD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY-IYPNVNAAGAG 242
A+Y ++LG++++R YD+ L+ + + L++ V Y P+ +
Sbjct: 62 R-NGLADYLNLLGIIRDREYDIALTLE-KRWSISLLLWLNGIPVTVGYKTQPSWFISNPV 119
Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
+E +TAE MY ++ G +S P L++++ + +
Sbjct: 120 PQKTEQYTAE----------MYHDLLQ--GIDIKST----CPSLKIALPKEDIAWAEAEQ 163
Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKE 362
K + YIVI+G ++S PI W I N ++E +P I +
Sbjct: 164 KRLLLNESGYIVIYGGANES-------------YPIPQWNNIINRIQEKQPSLSIVLLQG 210
Query: 363 REGVEDVVG---DDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
G E + S + +T P G+LAA+I + +I T++ +QL A ++AL
Sbjct: 211 SGGDEAWISPLLSRCSDLKVTKPGDIGKLAAMIAGANLMICTDSPPLQLGVAVGTYTVAL 270
Query: 417 FSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
F + PN + + I S T L D VL +
Sbjct: 271 FGKTDAKKRLPPN--DDRFIAIQSSTYNLGDIQASKVLEKI 309
>gi|67923629|ref|ZP_00517099.1| Glycosyl transferase, family 9 [Crocosphaera watsonii WH 8501]
gi|67854511|gb|EAM49800.1| Glycosyl transferase, family 9 [Crocosphaera watsonii WH 8501]
Length = 336
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 87/341 (25%), Positives = 151/341 (44%), Gaps = 41/341 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ E +LFFP ++ LK +YP +IDV+ R K + + V ++D +D
Sbjct: 27 RILALVPGGISEQILFFPTLEDLKIQYPNAIIDVLVEPRAKVAYRVCPQVNEVLLFDYED 86
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY-IYPNVNAAGAG 242
A+Y ++LG++++R YD+ LS + L++ V Y P + A
Sbjct: 87 R-SGLADYLNLLGIIRDREYDIALSLE-KSWSIRLLLWLNGIPLTVGYGNKPTWFISKAI 144
Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
+E + AE MY +++ G +S PPL++++ + +
Sbjct: 145 PQKTEQYVAE----------MYHDLME--GLDIQST----CPPLKIALPKDDISWGETEQ 188
Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHE 360
K E+ YI+I+G S+S P+ W+ I N ++E +P V+
Sbjct: 189 KRLLLEESGYILIYGGASES-------------YPVPQWSNIINRIQEKQPSLSIVLLQG 235
Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
E + + S + +T P G+LAA+I + +I T++ QL A ++ALF
Sbjct: 236 GGDEAWVNSLLSSCSNLKVTKPGDIGKLAAIIAGANLMICTDSTPFQLGVAVGTYTVALF 295
Query: 418 S-SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
+ K RL A + + I S T L D VL +
Sbjct: 296 GKTNPKKRL---PANDDRFIAIQSPTYNLGDIQASKVLEKI 333
>gi|416398771|ref|ZP_11686840.1| ADP-heptose--lipooligosaccharide heptosyltransferase II
[Crocosphaera watsonii WH 0003]
gi|357262510|gb|EHJ11630.1| ADP-heptose--lipooligosaccharide heptosyltransferase II
[Crocosphaera watsonii WH 0003]
Length = 311
Score = 79.0 bits (193), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 87/341 (25%), Positives = 152/341 (44%), Gaps = 41/341 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ E +LFFP ++ LK +YP +IDV+ R K + + V ++D +D
Sbjct: 2 RILALVPGGISEQILFFPTLEDLKIQYPNAIIDVLVEPRAKVAYRVCPQVNEVLLFDYED 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSY-IYPNVNAAGAG 242
A+Y ++LG++++R YD+ LS + L++ V Y P + A
Sbjct: 62 R-SGLADYLNLLGIIRDREYDIALSLE-KSWSIRLLLWLNGIPLTVGYGNKPTWFISKAI 119
Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
+E + AE MY +++ G +S PPL++++ + +
Sbjct: 120 PQKTEQYVAE----------MYHDLME--GLDIQST----CPPLKIALPKDDISWGEAEQ 163
Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHE 360
K E+ YI+I+G S+S P+ W+ I N ++E +P V+
Sbjct: 164 KRLLLEESGYILIYGGASES-------------YPVPQWSNIINRIQEKQPSLSIVLLQG 210
Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
E + + S + +T P G+LAA+I + +I T++ +QL A ++ALF
Sbjct: 211 GGDEAWVNSLLSSCSNLKVTKPGDIGKLAAIIAGANLMICTDSTPLQLGVAVGTYTVALF 270
Query: 418 S-SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
+ K RL A + + I S T L D VL +
Sbjct: 271 GKTNPKKRL---PANDDRFIAIQSPTYNLGDIQASKVLEKI 308
>gi|172035840|ref|YP_001802341.1| hypothetical protein cce_0924 [Cyanothece sp. ATCC 51142]
gi|354555073|ref|ZP_08974376.1| glycosyl transferase family 9 [Cyanothece sp. ATCC 51472]
gi|171697294|gb|ACB50275.1| unknown [Cyanothece sp. ATCC 51142]
gi|353553227|gb|EHC22620.1| glycosyl transferase family 9 [Cyanothece sp. ATCC 51472]
Length = 311
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 79/340 (23%), Positives = 152/340 (44%), Gaps = 39/340 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ E +LFFP ++ LK +YP +IDV+ R K + + V ++D D
Sbjct: 2 RILALVPGGISEQILFFPTLEDLKIQYPNAIIDVLVEPRAKAAYRVCPQVHEVLLFDYQD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A+Y ++LG++++R YD+ L+ + + L++ V Y + +
Sbjct: 62 R-NGLADYLNLLGIIRDREYDIALTLE-KRWSISLLLWLNGIPLTVGY------GSQSSW 113
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
+S ++ + + MY ++ LG + P P L++++ + + K
Sbjct: 114 FISNPVPQKTEQYTAQ---MYHDLMQGLGI------QSPCPSLKIALPKDDIPWAEAEQK 164
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHEK 361
++ YI+I+G S+S P+ W+ I N ++E +P V+
Sbjct: 165 RLLLDESGYILIYGGASES-------------YPVPQWSNIINRIQEKQPSLSIVLLQGS 211
Query: 362 EREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
E + + S + +T P G+LAA+I + +I+T++ +QL A ++ LF
Sbjct: 212 GDEAWINPLLSSCSDLKVTKPGDIGKLAAMIAGANLMISTDSPPLQLGVAVGTYTVGLFG 271
Query: 419 -SELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
++ K RL ++ + I S T L D VL +
Sbjct: 272 KTDAKKRL---PPDDDRFIGIQSSTYNLGDIQASKVLEKI 308
>gi|427735470|ref|YP_007055014.1| ADP-heptose--LPS heptosyltransferase [Rivularia sp. PCC 7116]
gi|427370511|gb|AFY54467.1| ADP-heptose:LPS heptosyltransferase [Rivularia sp. PCC 7116]
Length = 313
Score = 77.0 bits (188), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 70/302 (23%), Positives = 132/302 (43%), Gaps = 34/302 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ E +LFFP + LK +Y IDV+ K ++++++V +D D
Sbjct: 2 RIVALVPGGIGEQILFFPTLDNLKQKYDTSRIDVVCEPSSKAAYQVSQSVSEVLAFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ +++G++++R YD+ +++ + + L++T R+++ G
Sbjct: 62 R-NSMADFGNLIGMIRDREYDVAITSSQSWI-TGLLLWLTGINTRIAF-------KGNNS 112
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISR-RLKEVVAEKY 302
L N + MY ++ LG P L ++I + + AE+
Sbjct: 113 LFFNCLIPR--NEEQYAAYMYNDLLQGLGI------NSTYPELALNIPKPDIDWATAEQL 164
Query: 303 KNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIPHE 360
+N + G +V E D DS P W + RE +P V+ E
Sbjct: 165 RNGVNDTGYILVCSPFE-----------DADSAYPADKWLTVIKDFREKQPDMPVVVASE 213
Query: 361 KEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
+ + + + T+P G+LAA+I + ++ TN+A +QLA + +IALF
Sbjct: 214 PNNQNFVRTLSEKIPDIKETSPNNVGRLAAIIAGANLMLCTNSAPMQLAVFLQTYTIALF 273
Query: 418 SS 419
S
Sbjct: 274 GS 275
>gi|443313881|ref|ZP_21043491.1| ADP-heptose:LPS heptosyltransferase [Synechocystis sp. PCC 7509]
gi|442776294|gb|ELR86577.1| ADP-heptose:LPS heptosyltransferase [Synechocystis sp. PCC 7509]
Length = 309
Score = 76.3 bits (186), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 78/328 (23%), Positives = 142/328 (43%), Gaps = 38/328 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ G + E +LFFP + L+ YP IDV+ R K + + K+V +D D
Sbjct: 2 RIVALVPGKIGEQILFFPTLDDLQRVYPDAQIDVVVEPRAKAAYRVCKSVTDTIAFDFKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
A++ +++G++++R YD+ +S + L H L++T R+ Y G G
Sbjct: 62 R-NSLADWGNLIGLLRDREYDIAISPQQQSLPH-LLLWLTGIPTRIGY-------KGKGS 112
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYK 303
+ + + S Y ++ LG PVP L +++ ++ K
Sbjct: 113 IFLTDPVPQKLQQSPAA--KYHDLLQGLGI------NSPVPELNINVPSSDITWSEKEQK 164
Query: 304 NAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKER 363
G + Y+V + S D +S PI W +I L+E +P + K+
Sbjct: 165 RLGINESGYVVFYDRPS---------PDHNSTYPIDSWRQIVQSLQEKQPNLPVVAIKDP 215
Query: 364 EGVEDVVGDDASIVF-----ITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
E V + + F +T+P G+LAA I + ++ T++A + LA A + ++
Sbjct: 216 ENELFV---KSMLAFFPDLKVTSPEDIGKLAATIAAANLLVCTDSAPMYLALAVQTYTVV 272
Query: 416 LFSSELKG-RLFVPNAEEKKCTVISSRT 442
L + +K RL + K I+ +T
Sbjct: 273 LGETSIKSDRLMEVKSSTAKIADIAPQT 300
>gi|254423076|ref|ZP_05036794.1| Heptosyltransferase superfamily [Synechococcus sp. PCC 7335]
gi|196190565|gb|EDX85529.1| Heptosyltransferase superfamily [Synechococcus sp. PCC 7335]
Length = 323
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 79/344 (22%), Positives = 143/344 (41%), Gaps = 41/344 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R I+ GG+ + LLFFPA++ +K YP I V+A + + ++K V +
Sbjct: 2 RILAIVPGGISDQLLFFPALEDIKRVYPNAEIGVVAEPKASPAYRVSKIVDTIIPFSFSA 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVS-------YIYPNV 236
P+++ ++LG +++R Y++VL+T L+ L+++ RV+ Y Y V
Sbjct: 62 A-NSPSDWANLLGNVRDREYEVVLTTDLS-WSMGLLLWLSGVPTRVTLEGTSAPYFYTRV 119
Query: 237 NAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKE 296
+S S + Y ++ +G P P L V++
Sbjct: 120 -------------LPDSAQSSHYQADRYHNLLSAIGI------EGPTPALSVNVPEADLA 160
Query: 297 VVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV 356
E G + G ++ G ++ S + + P+ W + RE +P
Sbjct: 161 WAKELRDRLGLKDGYVLMYPGPDTGS-------AEAGARFPVPSWQAVIKDFREKQPQMP 213
Query: 357 IPHEKEREGVEDV----VGDDASIVF-ITTPGQLAALINDSAGVIATNTAAIQLANAREK 411
I + + + VGDD IV GQ AA+I + ++ ++ +QL+ A +
Sbjct: 214 IVLLQTEGSIPQINALRVGDDKLIVANAENLGQAAAVIAGADLLLTPDSYVMQLSTALKV 273
Query: 412 PSIALFSSELKGRLF-VPNAEEKKCTVISSRTGKLIDTPVEAVL 454
++ALF + EE + I+S+T + D P E VL
Sbjct: 274 FTLALFGKNKPAEMLPAAKGEETRFLGIASKTNLIADIPPETVL 317
>gi|254416721|ref|ZP_05030471.1| Heptosyltransferase superfamily [Coleofasciculus chthonoplastes PCC
7420]
gi|196176461|gb|EDX71475.1| Heptosyltransferase superfamily [Coleofasciculus chthonoplastes PCC
7420]
Length = 320
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 76/316 (24%), Positives = 136/316 (43%), Gaps = 35/316 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + LL FP + LK YP IDVI R K + ++ +V +D D
Sbjct: 2 RLLALVPGGISDQLLLFPTLDDLKANYPDAEIDVIVEPRAKGAYRVSGSVDEVLPFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKL---AGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ ++LG++++R YD ++ + GL L+M+ RV Y N G
Sbjct: 62 R-NGLADFGNLLGIIRDREYDAAIALERRWSVGL----LLWMSGIPIRVGY----ENNQG 112
Query: 241 AGLLLSETFTAESMNLSERGYN--MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
+ F ++ L Y MY ++ LG P P L + + + +
Sbjct: 113 ------KWFFTNTVPLKTEQYAAYMYHDLLTGLGM------NTPCPELAIRVPKSALDWA 160
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP--LFV 356
+ G + YI+I G + D D + P W ++ + + +P V
Sbjct: 161 EGEQARLGIPESGYIMIDG----GSNHLDQSQDIDKVYPAHKWQKVIEDIHQKQPDLTLV 216
Query: 357 IPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPS 413
+ H E + + + +T+P G+LAA+I + ++ T++A++ LA A +
Sbjct: 217 MLHGSEDTELVAALTQVCPNLQMTSPPDSGKLAAMIAGANLLLCTDSASMHLAVAVGTYT 276
Query: 414 IALFSSELKGRLFVPN 429
IALF +L P+
Sbjct: 277 IALFGPTEADKLLPPS 292
>gi|434398545|ref|YP_007132549.1| glycosyl transferase family 9 [Stanieria cyanosphaera PCC 7437]
gi|428269642|gb|AFZ35583.1| glycosyl transferase family 9 [Stanieria cyanosphaera PCC 7437]
Length = 309
Score = 75.9 bits (185), Expect = 4e-11, Method: Compositional matrix adjust.
Identities = 83/340 (24%), Positives = 148/340 (43%), Gaps = 47/340 (13%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP ++ LK +YP +IDV+ R K + + KNV ++D D
Sbjct: 2 RILALVPGGIGDQILFFPTLETLKSKYPNAVIDVLVEPRAKAAYRICKNVDDVLIFDYKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAF----LFMTTARDRVSYIYPNVNAA 239
A+Y ++LG+++++ YD+ L+T G F L++ RV Y N NA
Sbjct: 62 R-SSLADYLNLLGIIRDQEYDVALTT-----GSPWFVELLLWLNGIPVRVG--YKNNNA- 112
Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
LLS + +S + + Y ++ LG P PP+++++ +
Sbjct: 113 ---WLLSSSVPFKS---EQYKAHTYHDLLAGLGI------NVPCPPIQINVPKNDINWTE 160
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VI 357
+ + YI+++ + L P+ W +I + +P V+
Sbjct: 161 NETNRLDIKDSGYILLYS--------------SSELYPLASWQKIVTDICSKQPELPIVL 206
Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
+ V + I TP G+LAA + + ++ T + + LA A +I
Sbjct: 207 LQDANNGKWISAVQQTVPSLKIITPTDLGKLAATVAGANLLLCTVSIPMYLAIATGTYAI 266
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVL 454
AL SS + +L P ++ + S TGKL D + VL
Sbjct: 267 AL-SSSTEEKLLPPKSDSYLS--LQSSTGKLADLKPDQVL 303
>gi|428220786|ref|YP_007104956.1| ADP-heptose--LPS heptosyltransferase [Synechococcus sp. PCC 7502]
gi|427994126|gb|AFY72821.1| ADP-heptose:LPS heptosyltransferase [Synechococcus sp. PCC 7502]
Length = 320
Score = 74.7 bits (182), Expect = 8e-11, Method: Compositional matrix adjust.
Identities = 83/345 (24%), Positives = 159/345 (46%), Gaps = 52/345 (15%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD- 182
R ++ GG+ + LL FP + LK YP LIDV+ R + + ++++V V++ D
Sbjct: 2 RVLALVPGGIGDQLLMFPTLDSLKQIYPQGLIDVVVEPRSQGAYRISQSVN--KVWNFDF 59
Query: 183 DDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNA 238
A++ +++G ++++ YD V S LG L++T +R+SY
Sbjct: 60 KGINSLADWGNLIGTIRDQEYDAVFS-----LGQRWSVGFLLWLTGIPNRISY------- 107
Query: 239 AGAGLLLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHP-VPPLRVSISRR-L 294
AG G + F ++ L+ Y MY + + + +P ++VSI ++ L
Sbjct: 108 AGQG----DIFLTRAIPLNRDQYAAKMYHDL-------LQGLDLNPEFSGIKVSIPKKDL 156
Query: 295 KEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREF--- 351
AE+ + E G YI+IHG S A + +G D + P WA++ L+
Sbjct: 157 DWAEAEQIRLNIKESG-YILIHG---GSSALAKQKG-IDKIYPATSWAKVIESLQTRLVN 211
Query: 352 RPLFVIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANA 408
P+ +I ++ + V+++ + + +T+P G+LA +I + ++ T++ + +
Sbjct: 212 LPVVLIAGPEDGDLVKELQEQVSKPLLVTSPPDIGKLAGIIGAANLLLCTDSGPMHIGVG 271
Query: 409 REKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAV 453
+ALF +L +EK+ I S G+ P+ A+
Sbjct: 272 VGTNLVALFGPTDPEKLL---PQEKRIIAIKSPAGQ----PISAI 309
>gi|359460597|ref|ZP_09249160.1| heptosyltransferase family protein [Acaryochloris sp. CCMEE 5410]
Length = 344
Score = 74.3 bits (181), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 86/352 (24%), Positives = 138/352 (39%), Gaps = 32/352 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R +I G V LLFFP ++ L Y V IDVI R KQ F L + V + ++
Sbjct: 3 RVLALIPGDVGRQLLFFPTLETLHRHYSQVEIDVIVEPRAKQAFRLCRTVDNVLTFQFEN 62
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ +++G M++R YD L+ AGL ++ RV Y N+
Sbjct: 63 R-NGAADWGNLIGQMRDRRYDASLTISREWTAGL----LCWLVGIPQRVG--YENLALPA 115
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRH----------PVPPLRVSI 290
+ F +++ V RP+ + H P P + + I
Sbjct: 116 WVRFARKRFPTLGGTVADLSGKCLTAAVPPNARPYDAQRYHDLLLGLGIDTPCPDISIQI 175
Query: 291 SRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE 350
+ + + G YI++HG + S + PI+ W + G E
Sbjct: 176 PLDDQNWAKAEQERLGIAGQPYIILHG----GTGAENSDDEPSQFYPIRSWESVIEGYLE 231
Query: 351 FR---PLFVIPHEKEREGVEDVVG--DDASIVFITTPGQLAALINDSAGVIATNTAAIQL 405
+ PL V+ + +E V +V IV GQLAA I+ +I + A + L
Sbjct: 232 RQPDTPLVVVQTAENKEWVASLVAVCPQLQIVMPQHVGQLAAFIDAGHLMICADRAPLHL 291
Query: 406 ANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
A + +ALF + L+ +P A + S T K+ D P VL +
Sbjct: 292 GVAVKTRLVALFGA-LEPTRHLPTAPH--FAGLKSLTSKVADIPPIQVLEKV 340
>gi|443328193|ref|ZP_21056794.1| ADP-heptose:LPS heptosyltransferase [Xenococcus sp. PCC 7305]
gi|442792163|gb|ELS01649.1| ADP-heptose:LPS heptosyltransferase [Xenococcus sp. PCC 7305]
Length = 309
Score = 71.2 bits (173), Expect = 9e-10, Method: Compositional matrix adjust.
Identities = 78/345 (22%), Positives = 146/345 (42%), Gaps = 50/345 (14%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R +I GG+ + FFP ++ L+ YP IDV+ R K+ + + K V ++D D
Sbjct: 2 RILALIPGGIDAQINFFPTLETLQSTYPNAKIDVLVEPRAKKAYRVCKYVNDVLLFDYQD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLS-----TKLAGLGHAAFLFMTTARDRVSYIYPNVNA 238
A+Y ++LG++++R Y+ LS T + L R R +++
Sbjct: 62 R-NSMADYLNLLGIIRDREYEAALSFGTPWTVSSLLWLNGIPIRIGDRSRNPWLFSKTIP 120
Query: 239 AGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
L + + AE + S G ++ + P +++++ + V
Sbjct: 121 -----LQKDQYAAEMYHSSLAGLDI----------------QANCPQVKINVPQEDIAWV 159
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLF 355
+ K G G YI++H TD P + W I ++ + P+
Sbjct: 160 ESRQKGLGIRDG-YILLHS--------------TDESYPAKNWKSIITDIQAKQSEIPIV 204
Query: 356 VIPHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKP 412
++ ++ E V ++ + ++ P G+ AA+I + ++T++ +I LA A
Sbjct: 205 LLQDQQNGEWVTQMLESHPQLK-VSAPSDVGKSAAIIAGANLFLSTDSVSISLAVAVGTY 263
Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
+IALF+S + +P EK + SS TGK+ D VL +
Sbjct: 264 TIALFNSTSDSKKMLPPVSEKYVNIPSS-TGKIADIAPNTVLEKL 307
>gi|218245491|ref|YP_002370862.1| glycosyl transferase [Cyanothece sp. PCC 8801]
gi|257058526|ref|YP_003136414.1| glycosyl transferase family protein [Cyanothece sp. PCC 8802]
gi|218165969|gb|ACK64706.1| glycosyl transferase family 9 [Cyanothece sp. PCC 8801]
gi|256588692|gb|ACU99578.1| glycosyl transferase family 9 [Cyanothece sp. PCC 8802]
Length = 315
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 76/343 (22%), Positives = 151/343 (44%), Gaps = 41/343 (11%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP ++ LK +YP IDVI R K + + V +D D
Sbjct: 2 RILALVPGGIGDQILFFPTLETLKTQYPKATIDVIVEPRAKAAYRVCPIVHEVLAFDYRD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGH----AAFLFMTTARDRVSYIYPNVNAA 239
A+Y ++LG++++R YD+ LS LG L++ RV Y +
Sbjct: 62 R-NGLADYLNLLGIIRDREYDIALS-----LGQRWTIEMLLWLNGIPLRVGY------QS 109
Query: 240 GAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
LS +S S+ ++ ++ L ++P PP+++++ + +
Sbjct: 110 STAWFLSNPVPLKS---SQYTPEIFHDILTGLRI------QNPCPPVKIALPKEDIDWAE 160
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPH 359
+ K ++ YI+I+ +S + + P+ +W ++ ++ +P I
Sbjct: 161 AEQKRLDLKETGYILIYPFQSPNPKNS---------YPVGLWQKVIANIQAKQPNLPIVL 211
Query: 360 EKEREGVEDV--VGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
++ VE V + ++ + +T P G+ AA+I + ++ T+ A+ +A A +I
Sbjct: 212 LQDANNVEKVALMLENTPNLKVTEPGDVGKFAAMIAAANLMLCTDNEALPVAIAVGTYTI 271
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
ALF + P ++ + I S + + D E +L M
Sbjct: 272 ALFGPTDPTQSIPPGSD--RYLGIQSTSDNVADIDPETILQKM 312
>gi|158337113|ref|YP_001518288.1| heptosyltransferase family protein [Acaryochloris marina MBIC11017]
gi|158307354|gb|ABW28971.1| heptosyltransferase family protein [Acaryochloris marina MBIC11017]
Length = 344
Score = 70.9 bits (172), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 82/352 (23%), Positives = 139/352 (39%), Gaps = 32/352 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R +I G V LLFFP ++ L Y + IDVI R KQ F L + V + ++
Sbjct: 3 RVLALIPGDVGRQLLFFPTLETLHRHYSQIEIDVIVEPRAKQAFRLCRTVDNVLTFQFEN 62
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTK---LAGLGHAAFLFMTTARDRVSYIYPNVNAAG 240
A++ +++G M++R YD ++ AGL ++ RV Y N+
Sbjct: 63 R-NGAADWGNLIGQMRDRRYDASITISREWTAGL----LCWLVGIPQRVG--YENLALPA 115
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRH----------PVPPLRVSI 290
+ F +++ V RP+ + H P P + + I
Sbjct: 116 WVRFARKRFPTLGGTVADLSGKCLTAAVPPNARPYDAQRYHDLLLGLGIDTPCPDISIQI 175
Query: 291 SRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE 350
+ + + G YI++HG + S + PI+ W + G E
Sbjct: 176 PVDDQNWAKAEEERLGIAGKPYIILHG----GTGAENSDDEPSQFYPIRSWESVIEGYLE 231
Query: 351 FR---PLFVIPHEKEREGVEDVVGDDASIVFITTP--GQLAALINDSAGVIATNTAAIQL 405
+ PL V+ + ++ V +V + +T GQLAA I+ +I + A + L
Sbjct: 232 RQPDTPLVVVQTPENKDWVASLVAVCPQLQIVTPQHVGQLAAFIDAGHLMICADRAPLHL 291
Query: 406 ANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
A + +ALF + L+ +P A + S T K+ D P VL +
Sbjct: 292 GVAVKTRLVALFGA-LEPTRHLPTA--ANFAGLKSLTSKVADIPPIQVLEKV 340
>gi|220906649|ref|YP_002481960.1| glycosyl transferase protein [Cyanothece sp. PCC 7425]
gi|219863260|gb|ACL43599.1| glycosyl transferase family 9 [Cyanothece sp. PCC 7425]
Length = 322
Score = 70.1 bits (170), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 80/343 (23%), Positives = 154/343 (44%), Gaps = 34/343 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP ++ LK +YP +DV+ R + + + +V +D
Sbjct: 2 RVVALVPGGIGDQILFFPTLEDLKRQYPQAEVDVVVEPRARAAYRVCPHVDRVLAFDFKG 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAF-LFMTTARDRVSYIYPNVNAAGAG 242
PA++ ++LGV+++ Y++ LS L F L++ RVS+
Sbjct: 62 R-NGPADWLNLLGVVRDGEYNIGLS--LGSSWTVGFLLWLMGIPVRVSF-----TKGKKS 113
Query: 243 LLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKY 302
LL+ +N + +Y ++ LG P P L+VSI R E +
Sbjct: 114 TLLTNPV---QLNPDQYAAALYHDLLKGLGV------TTPAPDLKVSIPRSDLEWAEGER 164
Query: 303 KNAGAEQ--GKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVI 357
+ G + YI+IHG + M D + P++ W ++ ++ + P+ ++
Sbjct: 165 QRLGLDTAGAGYILIHG----GSSQMARTKGIDKIYPVEKWQQVIQNIQARQPDLPIAIV 220
Query: 358 PHEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSI 414
+++E ++ I +T+P G+LAALI ++ ++ T++A + L A + +
Sbjct: 221 QGPEDQEWAAAMLQSCPGIK-VTSPDDIGKLAALIAAASLMLCTDSAPMHLGVAVKTYLV 279
Query: 415 ALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAM 457
ALF +L + + I S TG++ D E +L +
Sbjct: 280 ALFGPTDPAKLL---PRDDRFIGIKSPTGQIADIAPEKILEKI 319
>gi|22299676|ref|NP_682923.1| glycosyl transferase family protein [Thermosynechococcus elongatus
BP-1]
gi|22295860|dbj|BAC09685.1| tlr2133 [Thermosynechococcus elongatus BP-1]
Length = 320
Score = 68.6 bits (166), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 73/343 (21%), Positives = 143/343 (41%), Gaps = 37/343 (10%)
Query: 122 VRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDL 181
++ ++ GG+ + +LFFP + LK +P +DV+ R ++L+ +V +D
Sbjct: 1 MKNIVALVPGGIGDQILFFPTLDDLKAHFPEAQLDVVVEPRAMAAYQLSGSVHQVIPFDF 60
Query: 182 DDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAA----FLFMTTARDRVSYIYPNVN 237
D A++ +++G ++ YD +LS LG + L++T RV + +N
Sbjct: 61 KDR-NALADWANLIGRLREGEYDAILS-----LGRSKAVRFLLWLTGIPKRVGFA--KLN 112
Query: 238 AAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEV 297
G L++ A +NL + Y ++ G P P + I+ ++
Sbjct: 113 PLG---FLTD---AIPVNLEQYSAATYHDLLKAFGI------TSPCPLPKAVITAEDRQW 160
Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE---FRPL 354
+ + G G Y ++HG + M + + P W E+ L + P
Sbjct: 161 ATAEQQRLGLSSG-YRLLHG----GSSQMALSKGINKIYPPTRWVEVIQTLAQEEPHLPF 215
Query: 355 FVIPHEKEREGVEDVVGD--DASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKP 412
V+ +++ V + D I + +LAAL+ + ++ T++ + + A P
Sbjct: 216 IVVCGPEDQAWVRALTQALPDIKISYPPDLRKLAALMQQAQQMLCTDSGPMHIGVAVGTP 275
Query: 413 SIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLN 455
+ALF +L + + + S TG + D PV AV+
Sbjct: 276 LVALFGPTDPAKLL---PADPRFRAVKSPTGNIGDIPVSAVIE 315
>gi|86609430|ref|YP_478192.1| heptosyltransferase family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86557972|gb|ABD02929.1| heptosyltransferase family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 335
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 70/303 (23%), Positives = 129/303 (42%), Gaps = 29/303 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFPA+ L+ R+P ++V+ R +E+ +V + +
Sbjct: 15 RILALVPGGIGDQILFFPALASLRQRFPEAELEVVVEPRAAGAYEVCPSVSRVLTFPFKE 74
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
+ + + R YD VLS + LG L++T RV Y P G G
Sbjct: 75 QLSLGDLSDLLGRI-RERQYDGVLSLGRS-LGVKLLLWLTGIPKRVGYAPP-----GPG- 126
Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
+ + + L Y MY +V G + P +P +R+ + L+ E+
Sbjct: 127 ---RPWLTDPVPLKPAQYAAQMYHDLVQGFG---IAAPAG-LPQIRLKKA-DLEWAEQER 178
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
+ G Y+++H + + + P W ++ G RE R PL ++
Sbjct: 179 KRLLGDAAQDYVLLH----PGASQLSQEKGIQKIYPSANWVKVIKGFREQRPELPLVLVV 234
Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
++ + E + + ++ P GQLAALI + ++ ++A + LA A + P +A
Sbjct: 235 GPEDEQLAEGFLAQLPELP-VSRPVRLGQLAALIGGATLLLCADSAPMHLAVAVQTPLVA 293
Query: 416 LFS 418
LF
Sbjct: 294 LFG 296
>gi|86607954|ref|YP_476716.1| heptosyltransferase family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
gi|86556496|gb|ABD01453.1| heptosyltransferase family protein [Synechococcus sp.
JA-2-3B'a(2-13)]
Length = 337
Score = 65.9 bits (159), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 69/303 (22%), Positives = 128/303 (42%), Gaps = 29/303 (9%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG+ + +LFFP + L+ R+P ++V+ R +E+ +V + +
Sbjct: 17 RLLALVPGGIGDQILFFPTLASLRQRFPEAELEVVVEPRAAAAYEVCPSVSRVLTFPFKE 76
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
+ + + R YD VLS + LG L++T RV Y P G G
Sbjct: 77 QLSLGDLSDLLGRI-RERQYDGVLSLGRS-LGVKLLLWLTGIPKRVGYAPP-----GPG- 128
Query: 244 LLSETFTAESMNLSERGY--NMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
+ + + L Y MY +V G + P +P +R+ + L+ E+
Sbjct: 129 ---RPWLTDPVPLKPAQYAAQMYHDLVQGFG---IAAPAG-LPQIRLKKA-DLEWAEQER 180
Query: 302 YKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFR---PLFVIP 358
+ G Y+++H + + + P W ++ G RE R PL ++
Sbjct: 181 KRLLGDAAQDYVLLH----PGASQLSQEKGIQKIYPSANWVKVIKGFREQRPELPLVLVV 236
Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
++ + E + + ++ P GQLAALI + ++ ++A + LA A + P +A
Sbjct: 237 GPEDEQLAEGFLAQLPELP-VSRPVRLGQLAALIGGATLLLCADSAPMHLAVAVQTPLVA 295
Query: 416 LFS 418
LF
Sbjct: 296 LFG 298
>gi|443474618|ref|ZP_21064590.1| glycosyl transferase family 9 [Pseudanabaena biceps PCC 7429]
gi|443020604|gb|ELS34544.1| glycosyl transferase family 9 [Pseudanabaena biceps PCC 7429]
Length = 314
Score = 60.8 bits (146), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 68/318 (21%), Positives = 133/318 (41%), Gaps = 41/318 (12%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
R ++ GG + LLFFP + LK +YP IDV+ R + + ++V +D D
Sbjct: 3 RILALVPGGTGDQLLFFPTLDTLKQQYPNAEIDVVVEPRAMAAYRVCQSVNRVLKFDFKD 62
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAG- 242
A++ ++LG +++R YD + T+ L L+++ R+S+ G G
Sbjct: 63 R-NSLADFGNLLGTIRDREYDAAIVTQ-PNLSMNVLLWLSGIPKRISF-------KGQGD 113
Query: 243 LLLSETFTAESMNLSE-RGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEK 301
LL++ + + + +N+ + ++ +H PP++V++ + + +
Sbjct: 114 FLLTDIIAMDPQEYTAVQNHNLLMAV---------NIQKH-CPPIKVNLPKNDLDWSTNE 163
Query: 302 YKNAGAEQGKYIVIH-GIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLF--VIP 358
K G +Q +I+++ G ++ P WA IA L+ +P V
Sbjct: 164 QKRLGIQQSGFILLNCGAYAN--------------YPAASWATIAKDLQAKQPNLPIVAI 209
Query: 359 HEKEREGVEDVVGDDASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
+ + + IT+P G+L A I + I A+QL A +A
Sbjct: 210 DSVNNAALLKQLTTLVPNLLITSPTDIGKLTAFIASANLFICAEGDAMQLGVAVGTALVA 269
Query: 416 LFSSELKGRLFVPNAEEK 433
+ + +P AE++
Sbjct: 270 ILGANTPAGKSLPIAEKR 287
>gi|427418291|ref|ZP_18908474.1| ADP-heptose:LPS heptosyltransferase [Leptolyngbya sp. PCC 7375]
gi|425761004|gb|EKV01857.1| ADP-heptose:LPS heptosyltransferase [Leptolyngbya sp. PCC 7375]
Length = 315
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 36/156 (23%), Positives = 74/156 (47%), Gaps = 10/156 (6%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
++ GG+ + L FFP + +++ YP I V+ K + ++K V ++
Sbjct: 2 HVLALVPGGISDQLQFFPTLATIQEIYPSAEISVVVEPTSKSAYRVSKAVSEVIPFNYRG 61
Query: 184 DWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGL 243
PA++ ++LG++++R +++VLS L+++ RV Y ++A A
Sbjct: 62 Q-NSPADWANLLGIIRDREFELVLSAS-DRWEEGVLLWLSGIPTRVGY-----SSAQAPW 114
Query: 244 LLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVP 279
L + T TA+ + Y+ +++ L P +S P
Sbjct: 115 LYTNTVTADKEPYQA---SQYQALLEGLAEPSKSAP 147
>gi|452852682|ref|YP_007494366.1| Glycosyl transferase family 9 [Desulfovibrio piezophilus]
gi|451896336|emb|CCH49215.1| Glycosyl transferase family 9 [Desulfovibrio piezophilus]
Length = 320
Score = 48.1 bits (113), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 72/313 (23%), Positives = 123/313 (39%), Gaps = 27/313 (8%)
Query: 128 IISGGVYENLLF-FPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWP 186
I + G+ N LF PA LK YPG + ++ + GK + NV +V++ D
Sbjct: 11 IFNNGMLGNTLFNMPAAAWLKQNYPGCHVGMVVDSVGKTLILHDPNVDTIHVFNKKKD-S 69
Query: 187 EPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLS 246
+ + +L + K RY K G+ + + DR + + + L L
Sbjct: 70 LRQQLSLLLELRKTRYNVSFHLRK--GVRNELLARLAGIPDRAGF---RLKGSPQHLTLK 124
Query: 247 ETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAG 306
L R M E M LGRP + + R++I + + AG
Sbjct: 125 LEEDTSVHRLESRALLM-EAM---LGRPVK------LDRPRLNICTEANAAMTRFIEQAG 174
Query: 307 AEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGV 366
A G Y V+H + S+ D LP V+A+ A L E P+F+I E E V
Sbjct: 175 AVPGNYFVLH-------PTGNSQNGKDWSLP--VYAQAAQRLAEDAPVFIICMPCEEEAV 225
Query: 367 EDVVGDDASIVFIT-TPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRL 425
+ + I + T + + LI +++ + ++ LA A E P + ++ +
Sbjct: 226 NKAIPSGSHIHYYTGSIANTSELIRNASVFLGNDSGPAHLACAWEVPRLVVYKDDEANYT 285
Query: 426 FVPNAEEKKCTVI 438
+ CT++
Sbjct: 286 KWKPIHKTGCTIL 298
>gi|428207215|ref|YP_007091568.1| glycosyl transferase family protein [Chroococcidiopsis thermalis
PCC 7203]
gi|428009136|gb|AFY87699.1| glycosyl transferase family 9 [Chroococcidiopsis thermalis PCC
7203]
Length = 803
Score = 48.1 bits (113), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 79/350 (22%), Positives = 142/350 (40%), Gaps = 76/350 (21%)
Query: 120 GDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFEL--------NK 171
GD+R + + + ++ P ++ +K+ P + ++AS G T L +
Sbjct: 447 GDLRNILVMRLDNIGDVIMTSPVLRTIKENLPTAKLTLMASPAGALTQPLLPWVDEVLSW 506
Query: 172 NVRWANVYDLDDDWPEPAEYTDILGVMKNRYYDM-VLSTKLAGLGHAAFLFMTTARDRVS 230
V W ++ LD D PA ++ + R +D ++ T + + A L V
Sbjct: 507 RVLWQDLGRLDFD---PAREWKLIETLNQRQFDAAIILTSFSQSPYPAAL--------VC 555
Query: 231 YIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSI 290
Y+ AG GL L E+ E G + VD L V R+
Sbjct: 556 YL------AGIGLRLGES--------KESGQGILTHWVDPLPDEIHQVERN--------- 592
Query: 291 SRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKA------------SMQSRGDTDSLLPI 338
RL E V K ++ + + +H ++ + A + QSR D
Sbjct: 593 -LRLIEAVGFKVRD------RRLCLHVPQTITAAIPNSYILLNPWTTCQSRNYDDCR--- 642
Query: 339 QVWAEIANGLREFR--PLFVIPHEKEREGVE---DVVGDDASIVFITTPGQLAALINDSA 393
+A A L + P+ V K+R+ D++GD ++ T+ + AALI ++
Sbjct: 643 --FATAARQLSQMTSLPIVVTGTNKDRDRARPLLDILGDAIDLIGATSLAEFAALIANAR 700
Query: 394 GVIATNTAAIQLANAREKPSIALFS-SELKGR---LFVPNAEEKKCTVIS 439
V+ NT+ + +A+A PS+ LF+ +EL+ + + P ++ TV S
Sbjct: 701 LVLTNNTSTMHIADATNTPSVILFAGTELESQWQPCYAPVKLLRRPTVCS 750
>gi|429106789|ref|ZP_19168658.1| Lipopolysaccharide heptosyltransferase III [Cronobacter malonaticus
681]
gi|426293512|emb|CCJ94771.1| Lipopolysaccharide heptosyltransferase III [Cronobacter malonaticus
681]
Length = 359
Score = 46.6 bits (109), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 69/301 (22%), Positives = 117/301 (38%), Gaps = 47/301 (15%)
Query: 137 LLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAEYTD--- 193
LL P IQ L+ R+PG IDV+ + N ++ +Y +D W + Y
Sbjct: 28 LLTTPLIQALRQRFPGAQIDVLLYEETRDMLSANPDIH--RIYGIDRQWKKQGVYHQLKM 85
Query: 194 ---ILGVMKNRYYDMVLSTKLAGLGHAAFLF-MTTARDRVSYIYPNVNAAGAGLLLSETF 249
+L ++ + YD+VL+ LA +A T AR RV + +P ++
Sbjct: 86 ELSLLFSLRKQKYDLVLN--LADQWRSAICTRFTGARMRVGFAFPKRRHPFWAFCHTQLV 143
Query: 250 TAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQ 309
+ + + + EQ + L PR+ P R+S + ++ A + Q
Sbjct: 144 STQDHHTQ----HTVEQNLSILSAIGEDYPRN--MPARMSYTG--QDWQACQSLLPSDFQ 195
Query: 310 GKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAE-----IANGLREFRPLFVI---PHEK 361
YIVI S + W E + N L V+ P K
Sbjct: 196 DDYIVIQ---------------PTSRWFFKCWREGNISAVINALSAAGHNIVLTSGPDAK 240
Query: 362 EREGVEDVVGDDASIVFITTPG-----QLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
E++ ++ ++ A + G QLAALI+ + I ++ + +A A P +AL
Sbjct: 241 EKQMIDTIIAGCADARLHSLAGQLTLRQLAALIDHAKLFIGVDSVPMHMAAALGTPLVAL 300
Query: 417 F 417
F
Sbjct: 301 F 301
>gi|373487956|ref|ZP_09578622.1| glycosyl transferase family 9 [Holophaga foetida DSM 6591]
gi|372007730|gb|EHP08359.1| glycosyl transferase family 9 [Holophaga foetida DSM 6591]
Length = 317
Score = 45.4 bits (106), Expect = 0.058, Method: Compositional matrix adjust.
Identities = 39/139 (28%), Positives = 69/139 (49%), Gaps = 9/139 (6%)
Query: 330 GDTDSLLPIQVWAEIANGLREF--RPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAA 387
GD+ P + + +A LRE + V+ ERE E +VG D+ + TTP +LA
Sbjct: 183 GDSKKRWPAERFRAVAVHLRECYDAEIIVLWGPGEREIAEGIVGKDSELAPPTTPLELAG 242
Query: 388 LINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGK--L 445
L++ +I+ ++ + LA P +ALFS+ +++P T++ G+ L
Sbjct: 243 LLSSVDLLISGDSGPMHLAAGLGCPLVALFSTADPA-VYLPPGN----TLLVRHKGRIDL 297
Query: 446 IDTPVEAVLNAMQIFNESL 464
PVE+V+ A++ +L
Sbjct: 298 GGMPVESVIEAVEAMRGAL 316
>gi|89074603|ref|ZP_01161068.1| RfaF protein [Photobacterium sp. SKA34]
gi|89049541|gb|EAR55101.1| RfaF protein [Photobacterium sp. SKA34]
Length = 333
Score = 45.1 bits (105), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 27/102 (26%), Positives = 55/102 (53%), Gaps = 2/102 (1%)
Query: 351 FRPLFVIP-HEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAR 409
F +F++P + K ++ E + +S+ + + LI S GVI+ +TA I +A+A
Sbjct: 227 FNCVFILPSYIKNKDVWESQLAPVSSVCCVDSIQDAIYLIKKSKGVISVDTAIIHIASAY 286
Query: 410 EKPSIALFS-SELKGRLFVPNAEEKKCTVISSRTGKLIDTPV 450
P+I LF S+L+ +L+ P + +K ++ +I++ +
Sbjct: 287 NIPTIGLFKESQLECKLWHPQSSKKHVGALTGNVEDIINSRI 328
>gi|404492876|ref|YP_006716982.1| ADP-heptose--lipopolysaccharide heptosyltransferase [Pelobacter
carbinolicus DSM 2380]
gi|77544951|gb|ABA88513.1| ADP-heptose--lipopolysaccharide heptosyltransferase [Pelobacter
carbinolicus DSM 2380]
Length = 350
Score = 44.3 bits (103), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 66/299 (22%), Positives = 119/299 (39%), Gaps = 48/299 (16%)
Query: 140 FPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD-DDWPEPAEYTDILGV- 197
P + LK+ +P + ID + E + +R VY L W + + GV
Sbjct: 18 LPVLAYLKEAHPEIKIDWLVEEGFAPLIEGHPMLR--RVYRLGLKRWRREGWRSVLAGVK 75
Query: 198 -----MKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYI------YPNVNAAGAGLLLS 246
++N YDMVL + F + A R + +PN+ A ++L
Sbjct: 76 STVHELRNERYDMVLDLQ-GNCKSGLFTLLCGAPRRYGFSFSGVREWPNLLATNRRVVL- 133
Query: 247 ETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAG 306
TA ++S+R + + + P H P R ++ EK +
Sbjct: 134 ---TAADHHISDRSLAVAREAFPVGNARSAAGPLHVTPQARTAV---------EKQLGSF 181
Query: 307 AEQGKYIVI--HGIESDSKASMQSRGDTDSLLPIQVWAEIANGL---REFRPLFVIPHEK 361
G +V+ +G ++K L P+ W +A L RP+ + +E
Sbjct: 182 NMNGPSLVVLQYGTTWETK-----------LWPLDSWQRLARTLCTEDNLRPVLIWGNEA 230
Query: 362 EREGVEDV--VGDDASIVF-ITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
ER+ E + D ++++ T +LAAL+ + VI +T I +A A + P++++F
Sbjct: 231 ERDAAEAIYRATDGQAVIWPRGTLQELAALLERADLVIGGDTGPIHIAAALDTPTVSIF 289
>gi|253700849|ref|YP_003022038.1| glycosyl transferase family protein [Geobacter sp. M21]
gi|251775699|gb|ACT18280.1| glycosyl transferase family 9 [Geobacter sp. M21]
Length = 371
Score = 43.9 bits (102), Expect = 0.17, Method: Compositional matrix adjust.
Identities = 21/62 (33%), Positives = 31/62 (50%)
Query: 123 RRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLD 182
RR I GG+ + +L PA+ L+ +PG IDV+A +R F + + W YD
Sbjct: 50 RRFLVIRPGGIGDAVLLVPALTALQKAFPGCRIDVLAESRNAAAFLMCPGLNWVYRYDCL 109
Query: 183 DD 184
D
Sbjct: 110 SD 111
>gi|95928765|ref|ZP_01311511.1| glycosyl transferase, family 9 [Desulfuromonas acetoxidans DSM 684]
gi|95135110|gb|EAT16763.1| glycosyl transferase, family 9 [Desulfuromonas acetoxidans DSM 684]
Length = 372
Score = 43.5 bits (101), Expect = 0.23, Method: Compositional matrix adjust.
Identities = 24/83 (28%), Positives = 40/83 (48%), Gaps = 2/83 (2%)
Query: 131 GGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPE--P 188
GG+ + +L P +Q +D + G I ++A R Q F+L + V YD DW +
Sbjct: 48 GGIGDAVLLLPTLQACRDFFEGASIYILAEKRNAQVFDLCEGVAEVWCYDKLGDWKDFFF 107
Query: 189 AEYTDILGVMKNRYYDMVLSTKL 211
+Y I+ ++ Y V+S L
Sbjct: 108 KDYDLIIDTEQSHYLSAVISKLL 130
>gi|319638761|ref|ZP_07993520.1| 1,5 heptosyltransferase I [Neisseria mucosa C102]
gi|317400002|gb|EFV80664.1| 1,5 heptosyltransferase I [Neisseria mucosa C102]
Length = 322
Score = 43.5 bits (101), Expect = 0.25, Method: Compositional matrix adjust.
Identities = 30/112 (26%), Positives = 50/112 (44%), Gaps = 16/112 (14%)
Query: 312 YIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DV 369
Y+ +H DSK L P++ W + L E R V +P E E + +
Sbjct: 179 YVALHATSRDSK-----------LWPVEHWRALLKKLNEKRRYTVYLPWGNETEKMRAEQ 227
Query: 370 VGDDASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ D F+ Q A L+ ++ G+I +T + LANA EKP + +++
Sbjct: 228 IASDLPFAFVCDKMNLLQAAFLLKNAEGIIGVDTGLLHLANALEKPVVGIYT 279
>gi|375145548|ref|YP_005007989.1| glycosyl transferase family protein [Niastella koreensis GR20-10]
gi|361059594|gb|AEV98585.1| glycosyl transferase family 9 [Niastella koreensis GR20-10]
Length = 372
Score = 43.5 bits (101), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 35/130 (26%), Positives = 64/130 (49%), Gaps = 17/130 (13%)
Query: 337 PIQVWAEIANGLREF-RPLFVIPHEKEREGVEDV---VGDDAS-IVFITTPGQLAALIND 391
P+Q W + + + P+ + KE+E + +GD+A+ + T G+ AL
Sbjct: 211 PLQHWITLGQLVHDTGLPILLTGTTKEKELTSAIAAGIGDNATDVAGQFTTGEFIALTAG 270
Query: 392 SAGVIATNTAAIQLANAREKPSIALFSS--------ELKGRLF---VPNAEEKKCTVIS- 439
+AGV++ NTA I +A A E P + L++ ++ G++F VP + + K VI
Sbjct: 271 AAGVVSVNTATIHIAAACETPVVVLYALTNPQHTPWKVSGQVFPYDVPESLQSKNEVIGY 330
Query: 440 SRTGKLIDTP 449
R +++ P
Sbjct: 331 VRDNRMMKQP 340
>gi|298242804|ref|ZP_06966611.1| glycosyl transferase family 9 [Ktedonobacter racemifer DSM 44963]
gi|297555858|gb|EFH89722.1| glycosyl transferase family 9 [Ktedonobacter racemifer DSM 44963]
Length = 406
Score = 43.1 bits (100), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 59/253 (23%), Positives = 102/253 (40%), Gaps = 37/253 (14%)
Query: 203 YDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYN 262
YD+ ++ + A +++ RV Y P + F + ++ E ++
Sbjct: 119 YDVAINLRPDFWWGAVLIYLAGIPRRVGYALPP----------ATPFLSHALPFPEHQHS 168
Query: 263 MYEQM------VDWLGRPFRSVPRHPVP-PLRVSISRRLKEVVAEKYKNAG-AEQGKYIV 314
+ + + LG+P P P PLR + + + VA+ +AG A + +V
Sbjct: 169 TFYNLELASAGLQALGQPPLEAPFTPERYPLRFEPTEQERRWVAQTLHDAGIAPETPVVV 228
Query: 315 IHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPL--FVI---PHEKE--REGVE 367
IH G L + WA AN L+ P FV+ P E+ E E
Sbjct: 229 IH----------PGTGGAVKLWRSEAWANCANQLQASYPTLRFVLTGTPKERPLLEEVAE 278
Query: 368 DVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSSELKGRLFV 427
+V SI + GQL AL+ ++ V+ + + LA ++ P+I +F R+F
Sbjct: 279 GMVTQPLSISDSSV-GQLTALLQRASLVMGVDNGPLHLAVTQDVPTIRIFGPT-DSRIFG 336
Query: 428 PNAEEKKCTVISS 440
P + VI+S
Sbjct: 337 PWGAPARHRVITS 349
>gi|357032050|ref|ZP_09093990.1| lipopolysaccharide core biosynthesis protein [Gluconobacter
morbifer G707]
gi|356414277|gb|EHH67924.1| lipopolysaccharide core biosynthesis protein [Gluconobacter
morbifer G707]
Length = 302
Score = 42.7 bits (99), Expect = 0.40, Method: Compositional matrix adjust.
Identities = 51/179 (28%), Positives = 82/179 (45%), Gaps = 20/179 (11%)
Query: 286 LRVSISRRLKEVVAEKYKNAGAE-QGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEI 344
LR + L V + K AG E G+Y V+ + + Q R P+ +A++
Sbjct: 133 LRAAGVPSLPRTVPDWLKGAGPEIAGRYAVL--VPGAAPHRPQKR------WPVSRFADV 184
Query: 345 ANGL--REFRPLFVIPHEKEREGVEDV--VGDDASIVFITTPGQLAALINDSAGVIATNT 400
A L R RP+ + E++ + + D + TT +LAA+++ + VI +T
Sbjct: 185 ARSLEARGIRPVIIGGAEEKPLALAIIRTCPDALDLTGKTTLPELAAVLDGAETVIGNDT 244
Query: 401 AAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQI 459
+ LA A + P IALFSS RL P T+ + RT L+ P A+L A ++
Sbjct: 245 GPMHLAAAMDTPIIALFSSASDPRLTAP------LTLSADRTEILV-VPDLALLPASRV 296
>gi|241760711|ref|ZP_04758803.1| lipopolysaccharide heptosyltransferase I [Neisseria flavescens
SK114]
gi|241318892|gb|EER55418.1| lipopolysaccharide heptosyltransferase I [Neisseria flavescens
SK114]
Length = 322
Score = 42.4 bits (98), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 46/184 (25%), Positives = 77/184 (41%), Gaps = 33/184 (17%)
Query: 241 AGLLLSETFTA-ESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
A L S+ F + N R ++ Q+ + ++P V L V S RLK +
Sbjct: 123 AALAYSKMFAVPKGKNAVWRNRELFAQVFGY------TMPETQVSGLSVPESGRLKNL-- 174
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IP 358
+ Y+ +H DSK L P++ W + L E + V +P
Sbjct: 175 --------KSPYYVALHATSRDSK-----------LWPVEHWRALLKKLNEEQQCNVYLP 215
Query: 359 --HEKEREGVEDVVGD--DASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
+E E+ E + D AS+ Q A L+ ++ G+I +T + LANA EKP +
Sbjct: 216 WGNETEKMRAEQIASDLPFASVCDKMNLLQAAFLLKNAEGIIGVDTGLLHLANALEKPVV 275
Query: 415 ALFS 418
+++
Sbjct: 276 GIYT 279
>gi|261379628|ref|ZP_05984201.1| lipopolysaccharide heptosyltransferase I [Neisseria subflava
NJ9703]
gi|284798112|gb|EFC53459.1| lipopolysaccharide heptosyltransferase I [Neisseria subflava
NJ9703]
Length = 322
Score = 42.4 bits (98), Expect = 0.57, Method: Compositional matrix adjust.
Identities = 31/112 (27%), Positives = 52/112 (46%), Gaps = 16/112 (14%)
Query: 312 YIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IP--HEKEREGVED 368
Y+ +H DSK L P++ W + L E + V +P +E E+ E
Sbjct: 179 YVALHATSRDSK-----------LWPVEHWRALLKKLNEEQQCNVYLPWGNETEKMRAEQ 227
Query: 369 VVGD--DASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ D AS+ Q A L+ ++ G+I +T + LANA EKP + +++
Sbjct: 228 IASDLPFASVCDKMNLLQAAFLLKNAEGIIGVDTGLLHLANALEKPVVGIYT 279
>gi|349610547|ref|ZP_08889889.1| lipopolysaccharide heptosyltransferase I [Neisseria sp. GT4A_CT1]
gi|348609763|gb|EGY59489.1| lipopolysaccharide heptosyltransferase I [Neisseria sp. GT4A_CT1]
Length = 326
Score = 42.0 bits (97), Expect = 0.69, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 93/229 (40%), Gaps = 37/229 (16%)
Query: 197 VMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESM 254
V++ +D VL ++ GL +A F A+ + + N G A L ++T+ +
Sbjct: 81 VLRQEAFDFVLDSQ--GLIKSA-CFAKMAKSPIYGLNKNSAREGLAALAYAKTYAVPKGK 137
Query: 255 NLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIV 314
N R ++ Q+ + ++P V L V + RLK + EQ Y
Sbjct: 138 NAVWRNRELFAQVFGY------AMPETQVFGLTVPEAGRLKNL----------EQPYYAA 181
Query: 315 IHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DVVGD 372
+H DSK L P++ W + L E + V +P E E + + D
Sbjct: 182 LHATSRDSK-----------LWPVESWRALLQKLNEEQQCNVYLPWGNETEKTRAEQIAD 230
Query: 373 DASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ Q A L+ + G++ +T + LANA EKP I +++
Sbjct: 231 GLPFAIVCDKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPVIGIYT 279
>gi|419797579|ref|ZP_14323048.1| lipopolysaccharide heptosyltransferase I [Neisseria sicca VK64]
gi|385697661|gb|EIG28072.1| lipopolysaccharide heptosyltransferase I [Neisseria sicca VK64]
Length = 326
Score = 41.6 bits (96), Expect = 0.83, Method: Compositional matrix adjust.
Identities = 58/234 (24%), Positives = 99/234 (42%), Gaps = 49/234 (20%)
Query: 198 MKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESMN 255
++ +D VL ++ GL +A F A+ + + N G A L ++T+T + N
Sbjct: 82 LRQEAFDFVLDSQ--GLIKSA-CFAKIAKSPIYGLDKNSAREGLAALAYAKTYTVPKGKN 138
Query: 256 LSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVI 315
R ++ Q+ + ++P V L V + RLK + EQ Y +
Sbjct: 139 AVWRNRELFAQVFGY------AMPETQVFGLTVPEAGRLKNL----------EQPYYAAL 182
Query: 316 HGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPH----EKEREG-VED- 368
H DSK L P++ W + L E + V +P EK R G + D
Sbjct: 183 HATSRDSK-----------LWPVENWRALLQKLNEEQQCNVYLPWGNETEKTRAGQIADG 231
Query: 369 ----VVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+V D +++ Q A L+ + G+I +T + LANA EKP + +++
Sbjct: 232 LPFAIVCDKMNLL------QAAYLLKHAVGIIGVDTGLLHLANALEKPVVGIYT 279
>gi|225077329|ref|ZP_03720528.1| hypothetical protein NEIFLAOT_02388 [Neisseria flavescens
NRL30031/H210]
gi|224951321|gb|EEG32530.1| hypothetical protein NEIFLAOT_02388 [Neisseria flavescens
NRL30031/H210]
Length = 322
Score = 41.6 bits (96), Expect = 0.90, Method: Compositional matrix adjust.
Identities = 47/184 (25%), Positives = 76/184 (41%), Gaps = 33/184 (17%)
Query: 241 AGLLLSETFTA-ESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVA 299
A L S+TF + N R ++ Q+ + ++P V L V S RLK +
Sbjct: 123 AALAYSKTFAVPKGKNAVWRNRELFAQVFGY------TMPETQVFGLSVPESGRLKNL-- 174
Query: 300 EKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IP 358
+ Y +H DSK L P++ W + L E + V +P
Sbjct: 175 --------QSPYYAALHATSRDSK-----------LWPVEHWRALLKKLNEEQQCNVYLP 215
Query: 359 --HEKEREGVEDVVGD--DASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSI 414
+E E+ E + D AS+ Q A L+ + G+I +T + LANA EKP +
Sbjct: 216 WGNETEKMRAEQIASDLPFASVCDKMNLLQAAFLLKKAEGIIGVDTGLLHLANALEKPVV 275
Query: 415 ALFS 418
+++
Sbjct: 276 GIYT 279
>gi|402835189|ref|ZP_10883768.1| heptosyltransferase [Selenomonas sp. CM52]
gi|402275980|gb|EJU25110.1| heptosyltransferase [Selenomonas sp. CM52]
Length = 353
Score = 41.2 bits (95), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 65/319 (20%), Positives = 128/319 (40%), Gaps = 35/319 (10%)
Query: 112 ASLPLKIRGDVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNK 171
A+ P++ +++R I + + +L PAI+ +K+ PG I ++ + E+N
Sbjct: 3 AAKPIRF-AELKRILVINLAFIGDVILTTPAIRAVKEACPGAAITMLTVPLTAEVAEMNP 61
Query: 172 NVRWANVYDLDDDWPEPAEYTDILGV------MKNRYYDMVLSTKLAGLGHAAFLFMTTA 225
V +YD + + +LG+ ++ R +DM + A G A F+
Sbjct: 62 FVDDVMMYD------KKGKDKGLLGMFSVGRRLRRRNFDMAICMNFATRG-AIVAFLAGI 114
Query: 226 RDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPP 285
+ R+ Y +A AG L+ +A + N + +V LG R++ PP
Sbjct: 115 KIRLGY-----DAQHAGFFLTHKASAIRTTVQHETINHFGVLVP-LGFATRNLSLVCEPP 168
Query: 286 LRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIA 345
R K A+K + +YI I + ++ + L A+ A
Sbjct: 169 ------ERAKRSFAQKAEKLALLACEYIAICPFGRHERRNLSEETVSGFLREFFADADAA 222
Query: 346 NGLREFRPLFVIPHEKEREGVEDVVGDDAS-----IVFITTPGQLAALINDSAGVIATNT 400
R +++I + + +G+E + S I T +LA + + ++ +T
Sbjct: 223 KN----RKVYLIGGKNDAQGLEVIARSTGSNRVQVIAGTLTLQELALFLEKADLLVTVDT 278
Query: 401 AAIQLANAREKPSIALFSS 419
+A A P++ +FS+
Sbjct: 279 GPAHIAQAVHCPTLDIFST 297
>gi|410944368|ref|ZP_11376109.1| lipopolysaccharide core biosynthesis protein [Gluconobacter
frateurii NBRC 101659]
Length = 302
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 44/171 (25%), Positives = 76/171 (44%), Gaps = 20/171 (11%)
Query: 294 LKEVVAEKYKNAGAE-QGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGL--RE 350
L +V E + G + QG+Y+V+ + + Q R P+ +AE+A L R
Sbjct: 141 LPRIVPEWLQGHGPKIQGRYVVL--VPGAAPHRPQKR------WPVTRFAEVAQALFQRG 192
Query: 351 FRPLFVIPHEKE--REGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANA 408
P+ + E + V + T+ +LA +++ + VI +T + LA A
Sbjct: 193 ITPIIAGSAAETLLAEQIRTVCPQAIDLTGKTSLIELAGVLDRAEQVIGNDTGPMHLAAA 252
Query: 409 REKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQI 459
++P IALFS + RL P ++S ++D P A+L A +I
Sbjct: 253 MDRPVIALFSQDSDPRLTAP-------LTLTSGLTTILDVPDLALLPASRI 296
>gi|414342353|ref|YP_006983874.1| lipopolysaccharide core biosynthesis protein [Gluconobacter oxydans
H24]
gi|411027688|gb|AFW00943.1| lipopolysaccharide core biosynthesis protein [Gluconobacter oxydans
H24]
gi|453331619|dbj|GAC86533.1| lipopolysaccharide core biosynthesis protein [Gluconobacter
thailandicus NBRC 3255]
Length = 314
Score = 40.8 bits (94), Expect = 1.4, Method: Compositional matrix adjust.
Identities = 47/179 (26%), Positives = 78/179 (43%), Gaps = 20/179 (11%)
Query: 286 LRVSISRRLKEVVAEKYKNAGAE-QGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEI 344
LR + + L V E + G + QG+Y+V+ + + Q R P+ +AE+
Sbjct: 145 LRAAGVKPLDRTVPEWLRGHGPKIQGRYVVL--VPGAAPHRPQKR------WPVTRFAEV 196
Query: 345 ANGL--REFRPLFVIPHEKE--REGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNT 400
A L R P+ V + E + + + T+ +L +++ + VI +T
Sbjct: 197 AQTLFRRGITPIIVGSATETPLAEQIRSLCPQAVDLTGKTSLIELGGVLDRAEQVIGNDT 256
Query: 401 AAIQLANAREKPSIALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNAMQI 459
+ LA A ++P IALFS + RL P A T I +D P A+L A +I
Sbjct: 257 GPMHLAAAMDRPIIALFSEDSDPRLTAPLALTPGRTTI-------LDVPDLALLPASRI 308
>gi|340363420|ref|ZP_08685755.1| lipopolysaccharide heptosyltransferase I [Neisseria macacae ATCC
33926]
gi|339885939|gb|EGQ75628.1| lipopolysaccharide heptosyltransferase I [Neisseria macacae ATCC
33926]
Length = 326
Score = 40.8 bits (94), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 54/229 (23%), Positives = 93/229 (40%), Gaps = 37/229 (16%)
Query: 197 VMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESM 254
V++ +D VL ++ GL +A F A+ + + N G A L ++T+ +
Sbjct: 81 VLRQEAFDFVLDSQ--GLIKSA-CFAKMAKSPIYGLDKNSAREGLAALAYAKTYAVPKGK 137
Query: 255 NLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIV 314
N R ++ Q+ + ++P V L V + RLK + EQ Y
Sbjct: 138 NAVWRNRELFAQVFGY------AMPETQVFGLTVPEAGRLKNL----------EQPYYAA 181
Query: 315 IHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DVVGD 372
+H DSK L P++ W + L E + V +P E E + + D
Sbjct: 182 LHATSRDSK-----------LWPVENWRALLQKLNEEQQCNVYLPWGNETEKTRAEQIAD 230
Query: 373 DASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ Q A L+ + G+I +T + LANA EKP + +++
Sbjct: 231 GLPFAIVCDKMNLLQAAYLLKHAVGIIGVDTGLLHLANALEKPVVGIYT 279
>gi|300721235|ref|YP_003710505.1| lipopolysaccharide core biosynthesis protein [Xenorhabdus
nematophila ATCC 19061]
gi|297627722|emb|CBJ88248.1| lipopolysaccharide core biosynthesis; modification of heptose
region of core [Xenorhabdus nematophila ATCC 19061]
Length = 354
Score = 40.4 bits (93), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 69/297 (23%), Positives = 130/297 (43%), Gaps = 35/297 (11%)
Query: 135 ENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRW-ANVYDLDDDWPEPA---- 189
+ LL P + LK+ +P IDV+ K+T + N ++++ +D W +
Sbjct: 21 DTLLITPVVNTLKENFPDSEIDVLLY---KETEPMLVNFGLVSHIFSIDRQWKKAGTKAY 77
Query: 190 --EYTDILGVMKNRYYDMVLSTKLAGLGHAAFLF-MTTARDRVSYIYPNVNAAGAGLLLS 246
+ ++ +KNR Y+++++ LA ++A + ++ A R+++ +P LL
Sbjct: 78 IIHHWRLINQLKNRRYNLIIN--LADQWYSALVSKISGAPTRIAFDFPK----RQNLLWK 131
Query: 247 ETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAG 306
F+ E ++ EQ + L +P + + +S S + V+E G
Sbjct: 132 SCFSHLVSTEGEESLHVVEQNLSAL-KPLN--LSNTNSDVTMSYSEGDWQKVSELLHTNG 188
Query: 307 AEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGV 366
+YIVIH R ++++ +Q ++G P EKE +
Sbjct: 189 I-NNQYIVIHPASRWFFKCWNERKISETINALQ-----SDGHSIVVTSGTEPREKEM--I 240
Query: 367 EDVVG--DDASIVF----ITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
E ++ D +IV IT P QL ALI ++ I ++ + +A+A + P IALF
Sbjct: 241 EQILSSCDKENIVSLAGQITLP-QLGALIANARLFIGVDSVPMHMASALKTPCIALF 296
>gi|256828044|ref|YP_003156772.1| glycosyl transferase family protein [Desulfomicrobium baculatum DSM
4028]
gi|256577220|gb|ACU88356.1| glycosyl transferase family 9 [Desulfomicrobium baculatum DSM 4028]
Length = 348
Score = 40.4 bits (93), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 35/105 (33%), Positives = 49/105 (46%), Gaps = 10/105 (9%)
Query: 137 LLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVR--WA-NVYDLDDDWPEPAEYTD 193
LL P+I+LL +RYP IDV + E N +VR WA N +L + W E Y
Sbjct: 24 LLTTPSIRLLHERYPDAAIDVFTEKKCTPVLENNPHVRKIWALNKKELPNFWAELKFYAR 83
Query: 194 ILGVMKNRYYDMVLSTKLAGLGHAAFL-FMTTARDRVSYIYPNVN 237
I YD+++ L F+ M+ A+ R+SY P N
Sbjct: 84 I----ARENYDLIVD--FQQLPRCRFVTLMSRAQVRLSYPPPWYN 122
>gi|255039329|ref|YP_003089950.1| glycosyl transferase family protein [Dyadobacter fermentans DSM
18053]
gi|254952085|gb|ACT96785.1| glycosyl transferase family 9 [Dyadobacter fermentans DSM 18053]
Length = 356
Score = 40.0 bits (92), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 29/127 (22%), Positives = 55/127 (43%), Gaps = 16/127 (12%)
Query: 298 VAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLRE--FRPLF 355
+A K AG ++++H S++K P++ W E+ LR PL
Sbjct: 177 LAVKLGEAGVGGRPFMMLHPGVSEAKRE----------YPVEKWIEVGKLLRHQFAMPLL 226
Query: 356 VIPHEKEREGVEDVVGDDASIVFITTP----GQLAALINDSAGVIATNTAAIQLANAREK 411
V E + ++ S+ G+ AL++ + GV++ NT I +A A +
Sbjct: 227 VTGSASESQLATEIAEGIGSMAIAAAGMFGVGEFIALVDRAVGVVSVNTGTIHIAAAMQT 286
Query: 412 PSIALFS 418
P++ L++
Sbjct: 287 PAVVLYA 293
>gi|385811238|ref|YP_005847634.1| ADP-heptose--LPS heptosyltransferase [Ignavibacterium album JCM
16511]
gi|383803286|gb|AFH50366.1| ADP-heptose:LPS heptosyltransferase [Ignavibacterium album JCM
16511]
Length = 353
Score = 40.0 bits (92), Expect = 2.9, Method: Compositional matrix adjust.
Identities = 61/296 (20%), Positives = 133/296 (44%), Gaps = 27/296 (9%)
Query: 131 GGVYENLLFFPAIQLLKDRYPGVLIDVIASARG-KQTFELNKNVRWANVYDLDDDWPEPA 189
G+ + L+F PA+++L+ P IDV+ G K F+ N NV ++ ++ +
Sbjct: 4 SGIGDALMFTPALKILRQELPNSEIDVLVMFGGAKDIFKNNPNVNNLIHFNFINEGTIRS 63
Query: 190 EYTDILGVMKNRYYDMVLSTKLAGLGHAAFL-FMTTARDRVSYIYPNVNAAGAGLLLSET 248
+L + K YD++++ + + F+ A+ RV+ Y +N G L + T
Sbjct: 64 -LRFVLSLRKK--YDVIINVYPSNRKEYNLISFLIGAKKRVAVDYLRMNKQNFGWLNNIT 120
Query: 249 FTA-ESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGA 307
+S++ + M E+++D + +P P L + ++ ++
Sbjct: 121 IKENDSLHNVQTNIRMIEKLID------KKIPEE--PKLELFLTEEDLNYAESFFQEKKI 172
Query: 308 EQGKYIV-IHGIESDSKASMQSRGDTDSLLPIQVWAEIANGL-REFRPLFVI---PHEKE 362
+ +I+ IH + K ++ R + + ++++AN L + + +I P E E
Sbjct: 173 SENDFIIGIHPGCATLKNHIKRRWEPEK------FSQLANLLIKNYSAKILIFGGPEESE 226
Query: 363 -REGVEDVVGDDASIVFIT-TPGQLAALINDSAGVIATNTAAIQLANAREKPSIAL 416
+ V++ + D +I+ T T Q +A++ I ++A + +A+A P +A+
Sbjct: 227 LKRQVKNNIHSDYTIIIETDTLNQSSAIMKRCDLFITNDSALMHIASALSLPIVAI 282
>gi|220908592|ref|YP_002483903.1| glycosyl transferase protein [Cyanothece sp. PCC 7425]
gi|219865203|gb|ACL45542.1| glycosyl transferase family 9 [Cyanothece sp. PCC 7425]
Length = 367
Score = 39.7 bits (91), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 24/77 (31%), Positives = 48/77 (62%), Gaps = 5/77 (6%)
Query: 353 PLFVIPHEKEREGVE---DVVGDDA-SIVFITTPGQLAALINDSAGVIATNTAAIQLANA 408
P+ V E +R+ E +++GD ++ T +LAAL++ ++ +++ NT+ + LA+A
Sbjct: 241 PVLVTGVENDRKSCEQLLEILGDRGIDLIGKTNLSELAALVSRASLLLSNNTSTMHLADA 300
Query: 409 REKPSIALFS-SELKGR 424
PS+ LF+ +EL+G+
Sbjct: 301 TGTPSVILFAGTELEGQ 317
>gi|262037154|ref|ZP_06010646.1| ADP-heptose:LPS heptosyltransferase II [Leptotrichia goodfellowii
F0264]
gi|261748842|gb|EEY36189.1| ADP-heptose:LPS heptosyltransferase II [Leptotrichia goodfellowii
F0264]
Length = 338
Score = 39.3 bits (90), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 77/341 (22%), Positives = 129/341 (37%), Gaps = 71/341 (20%)
Query: 124 RCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDD 183
+ I + + + +L P IQ +KD+YP ID + + K N N+ +YD D
Sbjct: 2 KILIIHTAFIGDIVLSTPLIQKIKDKYPDSQIDYMTLSGNKSIISNNPNLHDIIIYDKKD 61
Query: 184 DWPEPAEYTDILGVMKNRYYDM----------VLSTKLAGLGHAAFLFMTTARDRVSYIY 233
+ IL +KN YD+ +L K+AG+
Sbjct: 62 KNKGIKGFFRILKAVKNNRYDLAIIPHRFIRSILLAKMAGIKKTV--------------- 106
Query: 234 PNVNAAGAGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRR 293
+ A LL E + ++S+ ++D+ G PL + S+
Sbjct: 107 -GFDVATGSFLLHE---KKHYDMSKHEVERLLDLIDYKGERV---------PLGIYPSKE 153
Query: 294 LKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRP 353
+ K G +GKY + + + S + + + PI + E+ L E +
Sbjct: 154 ------DMDKIDGLLKGKY--YKNLITIAPGSQRP----EKIWPINKYDELIKRLSENKE 201
Query: 354 -LFVIPHEKER----------EGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAA 402
L V+ K E V D+ G+ + + F AAL++ S +I+ ++A
Sbjct: 202 NLIVVTGGKAEKTLDLKSVSTENVTDLRGEISLLEF-------AALLSKSDIIISNDSAP 254
Query: 403 IQLANAREKP-SIALFSSELKGRLFVPNAEEKKCTVISSRT 442
I + +A EKP I +F K F P E K VI T
Sbjct: 255 IHIGSAFEKPFVIGIFGPGKKSLGFFPWTE--KSNVIEDNT 293
>gi|269925269|ref|YP_003321892.1| glycosyl transferase family protein [Thermobaculum terrenum ATCC
BAA-798]
gi|269788929|gb|ACZ41070.1| glycosyl transferase family 9 [Thermobaculum terrenum ATCC BAA-798]
Length = 348
Score = 39.3 bits (90), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 67/299 (22%), Positives = 129/299 (43%), Gaps = 52/299 (17%)
Query: 141 PAIQLLKDRYPGVLIDVIASARGKQTFEL----NKNVRWANVY-DLDDDWPEPAEYTDIL 195
PA++ +K+ P I ++ S G L ++ + W +++ DL D +PA +++
Sbjct: 26 PALRAIKENLPETKITMLLSPGGASAAPLLPWIDEVIAWKSLWQDLGDLPFDPARELELV 85
Query: 196 GVMKNRYYDMVLSTKLAGLGHAAFLFMTTARD-----RVSYIYPNVNAAG-----AGLLL 245
++ +D A +F + ++D V Y+ AG G +L
Sbjct: 86 EKLREHAFD------------GAIIFTSFSQDPHVPGYVCYLAGIPLRAGESKEFGGSVL 133
Query: 246 SETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNA 305
+ + N+ + N+ ++++ LG R L++ I EK +
Sbjct: 134 THELKSVPDNVHQVERNL--RLIESLGFKVRD------RSLKIFIDEEASLSAHEKLGSM 185
Query: 306 G--AEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGL-REFRPLFVIPHEKE 362
G ++G Y+VIH AS +R P+ +A IA L R+ + V ++E
Sbjct: 186 GIRVDEG-YVVIH-----PGASASARR-----YPVDRFAVIAQELTRKGIHVVVTGSDRE 234
Query: 363 REGVEDVVG---DDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
R VE++ + ++V +T+ + AA++ + VI NT + L++A P + LFS
Sbjct: 235 RLLVEEIANAAPEVKTLVGVTSLSEYAAVLKMAKAVICNNTLPMHLSDALRVPVVVLFS 293
>gi|401402142|ref|XP_003881177.1| conserved hypothetical protein [Neospora caninum Liverpool]
gi|325115589|emb|CBZ51144.1| conserved hypothetical protein [Neospora caninum Liverpool]
Length = 727
Score = 38.9 bits (89), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 37/138 (26%), Positives = 59/138 (42%), Gaps = 18/138 (13%)
Query: 292 RRLKEVVAEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREF 351
R+L+E E+ + A Q + V+H S+ASM W E A +
Sbjct: 265 RKLRENEQEETLSGEAFQKRLAVLHRRLVQSRASMN-------------WVEKARQRK-- 309
Query: 352 RPLFVIPHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREK 411
R ++RE + V DD S + +T GQ A I+ ++ V+++ +LA R
Sbjct: 310 RDAQAAREAQQREAGDSDVSDDPSTLQLTRTGQTVAGIHSASSVLSSKGLLRRLARERTG 369
Query: 412 PSIALFSSELKGRLFVPN 429
P L +E GR +P
Sbjct: 370 P---LVFAETAGRARIPQ 384
>gi|255065422|ref|ZP_05317277.1| lipopolysaccharide heptosyltransferase I [Neisseria sicca ATCC
29256]
gi|255050247|gb|EET45711.1| lipopolysaccharide heptosyltransferase I [Neisseria sicca ATCC
29256]
Length = 326
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 52/228 (22%), Positives = 92/228 (40%), Gaps = 37/228 (16%)
Query: 198 MKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESMN 255
++ +D VL ++ GL +A F A+ + + N G A L ++T+ + N
Sbjct: 82 LRQEVFDFVLDSQ--GLIKSA-CFAKMAKSPIYGLDKNSAREGLAALAYAKTYAVPKGKN 138
Query: 256 LSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVI 315
R ++ Q+ ++ +P V L V + RLK + EQ Y +
Sbjct: 139 AVWRNRELFAQVFGYV------MPETQVFGLTVPEAGRLKNL----------EQPYYAAL 182
Query: 316 HGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DVVGDD 373
H DSK L P++ W + L E + V +P E E + + D
Sbjct: 183 HATSRDSK-----------LWPVENWRALLQKLNEEQQCNVYLPWGNETEKTRAEQIADG 231
Query: 374 ASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ Q A L+ + G++ +T + LANA EKP + +++
Sbjct: 232 LPFAIVCDKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPVVGIYT 279
>gi|390562754|ref|ZP_10244927.1| Glycosyl transferase family 9 [Nitrolancetus hollandicus Lb]
gi|390172674|emb|CCF84240.1| Glycosyl transferase family 9 [Nitrolancetus hollandicus Lb]
Length = 349
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 72/303 (23%), Positives = 126/303 (41%), Gaps = 58/303 (19%)
Query: 141 PAIQLLKDRYPGVLIDVIASARGKQTFEL--------NKNVRWANVYDLDDDWPEPAEYT 192
PA++ +K P + ++AS G Q EL V W ++ LD PA
Sbjct: 24 PALRAIKANLPDARLTLMASPAGAQAAELLPWIDDILAWRVLWQDLGHLDF---SPAREW 80
Query: 193 DILGVMKNRYYDM-VLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTA 251
D++ + R +D ++ T + H A A + G G+L A
Sbjct: 81 DLVAALHERRFDAAIIFTSFSQTPHPAGFICYLAG--IPLRLGESKETGEGVLTGMLAAA 138
Query: 252 -ESMNLSERGYNMYEQM---VDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGA 307
+ ++ +ER + E VD R R P P + +++ ++AE+ AGA
Sbjct: 139 PDEIHQAERNLRLIESAGFRVD----DRRLAIRIPEPARQAAMA-----LLAERGLPAGA 189
Query: 308 EQGKYIVI----------HGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI 357
Y+++ + ++ ++AS Q G T W + G+ + RP
Sbjct: 190 ---PYLLLNPWASCPARTYHLDRFAEASRQLSGLTG-------WPVVVTGVAKDRP---- 235
Query: 358 PHEKEREG-VEDVVGDDA-SIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIA 415
R G + DV+G A ++ T+ +LAAL+ + V+ +T+A+ LA+A S+
Sbjct: 236 -----RSGPLLDVLGSRAVDLIGATSVPELAALVAGARLVLTNDTSAMHLADATGTLSVV 290
Query: 416 LFS 418
LFS
Sbjct: 291 LFS 293
>gi|118580516|ref|YP_901766.1| glycosyl transferase family protein [Pelobacter propionicus DSM
2379]
gi|118503226|gb|ABK99708.1| glycosyl transferase, family 9 [Pelobacter propionicus DSM 2379]
Length = 361
Score = 38.9 bits (89), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 24/80 (30%), Positives = 40/80 (50%), Gaps = 10/80 (12%)
Query: 131 GGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEPAE 190
GG+ + +L PAI LK ++P I V+A R TF L V YD+
Sbjct: 31 GGIGDAVLLMPAIIALKQKFPQASITVLAERRNAATFALCPEVDRVLHYDMP-------- 82
Query: 191 YTDILGVMKNRYYDMVLSTK 210
+++L ++ R YD+V+ ++
Sbjct: 83 -SELLSAIRGR-YDLVIDSE 100
>gi|386816616|ref|ZP_10103834.1| lipopolysaccharide heptosyltransferase I [Thiothrix nivea DSM 5205]
gi|386421192|gb|EIJ35027.1| lipopolysaccharide heptosyltransferase I [Thiothrix nivea DSM 5205]
Length = 327
Score = 38.9 bits (89), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 19/60 (31%), Positives = 35/60 (58%)
Query: 358 PHEKEREGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPSIALF 417
P E+ER D ++ + G+LAA++ ++GVI +T + +A A +KP++AL+
Sbjct: 224 PRERERAERLAQANDSVQVLPRSHLGELAAIMQGASGVIGMDTGLMHIAAALDKPALALY 283
>gi|59802247|ref|YP_208959.1| RfaC [Neisseria gonorrhoeae FA 1090]
gi|194099807|ref|YP_002002942.1| RfaC [Neisseria gonorrhoeae NCCP11945]
gi|240013072|ref|ZP_04719985.1| RfaC [Neisseria gonorrhoeae DGI18]
gi|240017641|ref|ZP_04724181.1| RfaC [Neisseria gonorrhoeae FA6140]
gi|240122262|ref|ZP_04735224.1| RfaC [Neisseria gonorrhoeae PID24-1]
gi|254493450|ref|ZP_05106621.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
1291]
gi|268595787|ref|ZP_06129954.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
35/02]
gi|268597977|ref|ZP_06132144.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
FA19]
gi|268600232|ref|ZP_06134399.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
MS11]
gi|268602472|ref|ZP_06136639.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
PID18]
gi|268681026|ref|ZP_06147888.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
PID332]
gi|268685293|ref|ZP_06152155.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
SK-92-679]
gi|291044959|ref|ZP_06570668.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
DGI2]
gi|293398030|ref|ZP_06642236.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
F62]
gi|385336733|ref|YP_005890680.1| RfaC [Neisseria gonorrhoeae TCDC-NG08107]
gi|606855|gb|AAA93059.1| lipooligosaccharide heptosyltransferase-1 [Neisseria gonorrhoeae]
gi|59719142|gb|AAW90547.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae FA
1090]
gi|193935097|gb|ACF30921.1| RfaC [Neisseria gonorrhoeae NCCP11945]
gi|226512490|gb|EEH61835.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
1291]
gi|268549176|gb|EEZ44594.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
35/02]
gi|268551765|gb|EEZ46784.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
FA19]
gi|268584363|gb|EEZ49039.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
MS11]
gi|268586603|gb|EEZ51279.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
PID18]
gi|268621310|gb|EEZ53710.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
PID332]
gi|268625577|gb|EEZ57977.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
SK-92-679]
gi|291011853|gb|EFE03849.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
DGI2]
gi|291611976|gb|EFF41045.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
F62]
gi|317165276|gb|ADV08817.1| RfaC [Neisseria gonorrhoeae TCDC-NG08107]
Length = 322
Score = 38.9 bits (89), Expect = 5.6, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 18/125 (14%)
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-I 357
A + KN Q Y +H DSK L P++ W E+ L + + V +
Sbjct: 168 ASRLKNLA--QPYYAALHATSRDSK-----------LWPMENWRELLQKLNQKQQCNVYL 214
Query: 358 PHEKEREGVE-DVVGDDASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPS 413
P E E V + + D + Q A L+ + G++ +T + LANA EKP
Sbjct: 215 PWGNEAEKVRAEQIADGLPFTIVCAKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPV 274
Query: 414 IALFS 418
+ +++
Sbjct: 275 VGIYT 279
>gi|268604739|ref|ZP_06138906.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
PID1]
gi|268588870|gb|EEZ53546.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
PID1]
Length = 322
Score = 38.9 bits (89), Expect = 6.0, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 18/125 (14%)
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-I 357
A + KN Q Y +H DSK L P++ W E+ L + + V +
Sbjct: 168 ASRLKNLA--QPYYAALHATSRDSK-----------LWPMENWRELLQKLNQKQQCNVYL 214
Query: 358 PHEKEREGVE-DVVGDDASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPS 413
P E E V + + D + Q A L+ + G++ +T + LANA EKP
Sbjct: 215 PWGNEAEKVRAEQIADGLPFTIVCAKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPV 274
Query: 414 IALFS 418
+ +++
Sbjct: 275 VGIYT 279
>gi|398795642|ref|ZP_10555471.1| ADP-heptose:LPS heptosyltransferase [Pantoea sp. YR343]
gi|398205821|gb|EJM92600.1| ADP-heptose:LPS heptosyltransferase [Pantoea sp. YR343]
Length = 368
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 73/321 (22%), Positives = 136/321 (42%), Gaps = 41/321 (12%)
Query: 152 GVLIDVIASARGKQTFELNKNVRWANVYDLDD----DWPEPAEYTDILGVMKNRYYDMVL 207
G+ IDV+A + F N +R N+Y D+ D A D++ +KN YD+++
Sbjct: 76 GINIDVLAIKANEMIFSGNPFIR--NLYLTDEMDPRDMYNHAIPDDLIITLKNNNYDLMI 133
Query: 208 STKLAGLGHAAFLFMTTARDRVSYIYPNVNAAGAGLLLSETFTAESMNLSERGYN----- 262
T + G L+M +S P + G + + +N++ Y
Sbjct: 134 DTGIWDTG----LYMPKMIQAIS---PKI---SLGFNKQKWLSHYDVNIAFNHYQSHVKE 183
Query: 263 MYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVIHGIESDS 322
+Y ++++LG P S R+ V + SIS +K + + K ++I
Sbjct: 184 IYAAIMNYLGFPDASNQRYEV-YYKQSISDEVKAI------DFFTGDNKKVII------- 229
Query: 323 KASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPHEKEREGVED-VVGDDASIVFITT 381
+M + D +L P Q I+ L + + ++ + ++ V+D +G A + + +
Sbjct: 230 --NMYASHDDRNLTPQQTKEIISGILLKNNNVDIVVLDYKKSFVKDFFIG--AKVYHVPS 285
Query: 382 PGQLAALINDSAGVIATNTAAIQLANAREKPSIALFSS-ELKGRLFVPNAEEKKCTVISS 440
A IN S VI+ +TA + +A +KP I ++ +L+ P + S+
Sbjct: 286 LYHAMAAINLSDLVISPDTAVVHVAALYDKPLIGIYRDIPDNNKLWSPGYKNGHQIFTST 345
Query: 441 RTGKLIDTPVEAVLNAMQIFN 461
+ V+ VLNA+ FN
Sbjct: 346 FVVGEDASLVDNVLNAIDKFN 366
>gi|298369675|ref|ZP_06980992.1| lipopolysaccharide heptosyltransferase III [Neisseria sp. oral
taxon 014 str. F0314]
gi|298282232|gb|EFI23720.1| lipopolysaccharide heptosyltransferase III [Neisseria sp. oral
taxon 014 str. F0314]
Length = 368
Score = 38.5 bits (88), Expect = 6.7, Method: Compositional matrix adjust.
Identities = 68/313 (21%), Positives = 118/313 (37%), Gaps = 54/313 (17%)
Query: 137 LLFFPAIQLLKDRYPGVLIDVIASARGKQTFELNKNVRWANVYDLDDDWPEP------AE 190
LL P + LK ++P +D++ A N+ + +N++ +D +W + A
Sbjct: 33 LLSTPVVDALKHKFPDCEVDMLVYAATTDVILDNRQI--SNIFTIDREWKKQGIRAQLAH 90
Query: 191 YTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTA----------RDRVSYIYPNVNAAG 240
+ +K R YD + ++ AAFL A RD + Y
Sbjct: 91 EKALFKRLKARNYDWAFN--ISDQWRAAFLAKLCAHYSAGIACCKRDNFLWRY------- 141
Query: 241 AGLLLSETFTAESMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAE 300
F E ++ S ++M E ++ L P +P +R+ IS +E V E
Sbjct: 142 -----CHDFLNEELDTS---HHMVESNLNVL--PPLVLPEEYPAKVRMEISPANRESVLE 191
Query: 301 KYKNAGAEQGKYIVIH-GIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVIPH 359
K G Y++ H G K + + NG L P
Sbjct: 192 KLSAQGWNGEPYVLAHPGARWFFKCWEDGKNAALLQ------LLLLNGKNVV--LTASPD 243
Query: 360 EKEREGVEDVVGD----DASIVFITTPG----QLAALINDSAGVIATNTAAIQLANAREK 411
E+E V+ ++G D V++ + +LAA I+ + ++ + +A A +K
Sbjct: 244 AAEQEMVQSILGRLKIPDGVHVWVLSGCLNLRELAAAIDGAELFFGVDSVPMHIAAALDK 303
Query: 412 PSIALFSSELKGR 424
P +ALF GR
Sbjct: 304 PQVALFGPSWVGR 316
>gi|296313781|ref|ZP_06863722.1| lipopolysaccharide heptosyltransferase I [Neisseria polysaccharea
ATCC 43768]
gi|296839710|gb|EFH23648.1| lipopolysaccharide heptosyltransferase I [Neisseria polysaccharea
ATCC 43768]
Length = 322
Score = 38.5 bits (88), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 51/228 (22%), Positives = 93/228 (40%), Gaps = 37/228 (16%)
Query: 198 MKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAAG-AGLLLSETFTA-ESMN 255
++ +D VL ++ GL +A F A+ + + N G A ++ +T+ + N
Sbjct: 82 LRQEVFDFVLDSQ--GLIKSA-CFAKMAKSPIYGLDKNSAREGLAAVVYEKTYAVPKGKN 138
Query: 256 LSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVVAEKYKNAGAEQGKYIVI 315
R ++ Q+ ++ +P V L V + RLK + EQ Y V+
Sbjct: 139 AVWRNRELFAQVFGYV------MPETQVFGLTVPEAGRLKNL----------EQPYYAVL 182
Query: 316 HGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-IPHEKEREGVE-DVVGDD 373
H DSK L P++ W + L + + V +P E E + + D
Sbjct: 183 HATSRDSK-----------LWPVENWRALLQKLNQKQQCNVYLPWGNEAEKARAEQIADG 231
Query: 374 ASIVFITTP---GQLAALINDSAGVIATNTAAIQLANAREKPSIALFS 418
+ Q A L+ + G++ +T + LANA EKP + +++
Sbjct: 232 LPFAIVCAKINLLQAAYLLKHAVGIVGVDTGLLHLANALEKPVVGIYT 279
>gi|399077997|ref|ZP_10752654.1| ADP-heptose:LPS heptosyltransferase [Caulobacter sp. AP07]
gi|398034351|gb|EJL27622.1| ADP-heptose:LPS heptosyltransferase [Caulobacter sp. AP07]
Length = 323
Score = 38.1 bits (87), Expect = 8.6, Method: Compositional matrix adjust.
Identities = 78/343 (22%), Positives = 132/343 (38%), Gaps = 37/343 (10%)
Query: 121 DVRRCCCIISGGVYENLLFFPAIQLLKDRYPGVLIDVIASARGKQTFE-LNKNVRWANVY 179
++++ I G + + +L A++ +++ +P I ++ + FE L K + N
Sbjct: 4 EIKKVLVIKLGALGDFVLALAAMKKIREAHPRAKITLLTT----PPFEALAKLSPYFNTV 59
Query: 180 DLDDDWPEPAEYTDILGVMKNRYYDMVLSTKLAGLGHAAFLFMTTARDRVSYIYPNVNAA 239
+ D + + T +LG ++ YD V + + F + P +
Sbjct: 60 ETDGRPGDFGDLTAMLGRLRKARYDRVYDLQTNSRTNWYFQALRP-------FPPQWSGI 112
Query: 240 GAGLLLSETFTAE-SMNLSERGYNMYEQMVDWLGRPFRSVPRHPVPPLRVSISRRLKEVV 298
AG L + A M+ ER + +Q W P + P PP I RR KE
Sbjct: 113 AAGCALPQRGKARLHMHTLERQADQLKQAGIWPDAP--TEPGSAPPPDLSWILRRTKE-- 168
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFVI- 357
A A A + +++ G S + P++ +A++A L + R L ++
Sbjct: 169 ARPVAGATASKPYVLLVPG---------GSAHRPEKRWPVESYAQLA-ALLKARGLDIVV 218
Query: 358 ---PHEKE-REGVEDVVGDDASIVFITTPGQLAALINDSAGVIATNTAAIQLANAREKPS 413
P E ++ VG + T QLA L +A V+ +T L A P+
Sbjct: 219 IGGPQESAMARQIQKAVGQARDLTGRTDFAQLAVLGAKAALVVGNDTGPTHLLAAAGAPT 278
Query: 414 IALFSSELKGRLFVPNAEEKKCTVISSRTGKLIDTPVEAVLNA 456
IALFS L P V R+ L PV V +A
Sbjct: 279 IALFSDASDPELCGPRGH-----VAVIRSPDLKALPVSTVASA 316
>gi|397691483|ref|YP_006528737.1| glycosyl transferase, family 9 [Melioribacter roseus P3M]
gi|395812975|gb|AFN75724.1| glycosyl transferase, family 9 [Melioribacter roseus P3M]
Length = 342
Score = 38.1 bits (87), Expect = 9.2, Method: Compositional matrix adjust.
Identities = 25/85 (29%), Positives = 48/85 (56%), Gaps = 5/85 (5%)
Query: 337 PIQVWAEIANGLREFR--PLFVIPHEKEREG---VEDVVGDDASIVFITTPGQLAALIND 391
P + AE+A L + PL ++ E E +++ VGDDA + TT Q+AAL++
Sbjct: 194 PPEKLAELAKALNKKYDIPLLILWGPSEYEDAIRIKETVGDDAYLAPPTTIEQMAALMSR 253
Query: 392 SAGVIATNTAAIQLANAREKPSIAL 416
A ++A ++ + ++ A + P+++L
Sbjct: 254 CAFIVANDSGPMHISTAIDTPALSL 278
>gi|268685507|ref|ZP_06152369.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
SK-93-1035]
gi|268625791|gb|EEZ58191.1| lipopolysaccharide heptosyltransferase I [Neisseria gonorrhoeae
SK-93-1035]
Length = 322
Score = 38.1 bits (87), Expect = 9.3, Method: Compositional matrix adjust.
Identities = 32/125 (25%), Positives = 53/125 (42%), Gaps = 18/125 (14%)
Query: 299 AEKYKNAGAEQGKYIVIHGIESDSKASMQSRGDTDSLLPIQVWAEIANGLREFRPLFV-I 357
A + KN Q Y +H DSK L P++ W E+ L + + V +
Sbjct: 168 ASRLKNLA--QPYYAALHASSRDSK-----------LWPMENWRELLQKLNQKQQCNVYL 214
Query: 358 PHEKEREGVE-DVVGDDASIVFITTPG---QLAALINDSAGVIATNTAAIQLANAREKPS 413
P E E V + + D + Q A L+ + G++ +T + LANA EKP
Sbjct: 215 PWGNEAEKVRAEQIADGLPFTIVCAKMNLLQAAYLLKHAVGIVGVDTGLLHLANALEKPV 274
Query: 414 IALFS 418
+ +++
Sbjct: 275 VGIYT 279
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.318 0.136 0.403
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 7,751,561,257
Number of Sequences: 23463169
Number of extensions: 350606597
Number of successful extensions: 810818
Number of sequences better than 100.0: 160
Number of HSP's better than 100.0 without gapping: 111
Number of HSP's successfully gapped in prelim test: 49
Number of HSP's that attempted gapping in prelim test: 810446
Number of HSP's gapped (non-prelim): 192
length of query: 467
length of database: 8,064,228,071
effective HSP length: 146
effective length of query: 321
effective length of database: 8,933,572,693
effective search space: 2867676834453
effective search space used: 2867676834453
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 79 (35.0 bits)